Information for So.us.I3-1 intron
Note: Redundant intron copies found in
CP000473 Solibacter usitatus Ellin6076 (4192502-4194442)
Sequence from GenBank entry (the intron is on the antisense strand).
The boundaries of the intron are marked in red and the ORF is marked
in blue, with start and stop codons underlined.
Intron on antisense strand
3' end
atc ggcatagggg
3661 cggcggtcac ccgccgaccc ctgccacacc accgtgcgta cgggtccgta cacggcggtt
3721 cgagatggtt acgctaacag tcctcgaaca atgacggaag tcccagcgat ttgaagtaag
3781 catttgaaag cccaacagcc agggccttgg ccttcgcgag gtaccaagga ccgcgaccgc
3841 tgccagcggt attagtcgcc agccgctctc gcacccccag ttccaacaga gctgcccggc
3901 gacggcgcgg tgttttccac tgccgccaca tggccgcccg gagtcgcaac cggacccagc
3961 gcgttaaacc gatcagcacc tcgggcgttt cgcagaatcc gaaatagctg cgccaacccc
4021 gcatatacgg agccagttcc tccatcgtcg tcttcatgct aacgcccttt gcccgtcccg
4081 tgatctcccg gattcgtcgc ttaaaccgat ccaaggcctt cggtgcaatc acgcgcttgg
4141 cctccggacc agccgtaaag ctgaacccga gaaacttccg ttcctgcggt cgtgccaccg
4201 cactcttcgt ctcatttacc ttgagcttga gcttttgcgt gatgaattgc gtaatgctct
4261 ccatcactcg ttgccccgcc cgctcgctgc gaacgtagat attacagtcg tccgcatatc
4321 gaacgaagcg atgaccccgg cgctccaact cccggtcgaa ttcgtcgagc acgaggttac
4381 tgagcagtgg cgaaagagga cctccttgcg gagttccttc cacgctcggg ctgaccaacc
4441 cgttctccat caccccggca ttcaagaatg cccggatgag tttcagcagc cgcttgtccg
4501 cgatccgttt ggcgatctga cccatcaatt tgtcgtgatt gactcgatcg aagaatttct
4561 ccaaatcaag atcaacgcac cagccgtgac cttcggcaat atactgctgc gcctgtgcca
4621 ccgcttgctg agccgaccgt cccggccgga acccgtagct gtaatcagaa aacgtccggt
4681 cccaccgcct ctgcagaacc tgcatcaccg cctgctggat aaatcgatcc aacaccgttg
4741 ggataccaag ctttcgcacc cctccgtccg gtttggcaat ctctacccgc ctcaccggtt
4801 tcggctcgta agtcccactc aacagttgtc cccggatggc tggccagtgc tgcttcaggt
4861 aatccttgat gccgatgacg gtcatcccgt caacgcccgg actacctttg ttggccttca
4921 cccgctgcaa tgctgccttc aggttttctc gctcacatac ttcctccatc aatcgattcg
4981 tgctggctgg gttttcggtc ccattcatcg ccccggccga ttcagtctct tcccttcccg
5041 cctctcgggc ttcacccgtt agcacagaag agaagtccag ttgcatctgg atgttctgct
5101 gcttgtcgtc cttgagactc atggcctact ggccgctcct tctcgttcgg gccttcagcc
5161 atcgttcccg gcttggccta tccgttgctc cgcctttcgg catgcggagt gcctcactag
5221 ccttgccgac gtcatgacct aatatgccct ctgctgactt ctgccccgcg gtcaagccgc
5281 ctttcgacag cctcagtcgc cgaagcgaca cggagcagat ctcctggggt aagttcagcc
5341 gtcttctgtg cacagccgcc gaatctacgc ttcgcaccct tgatggatat ggacttcgcg
5401 gtaagttgcc cgctcgtccg gcgctaacgc cttgtatcag gtttttgtcc atcgactcgc
5461 acatttgcta tacgcttctt tcagacccca cctcgcggcg gtagcccttg cgttctcgct
5521 agcccttcac ctccatcagg ttgggcaggg gacttgcacc cccaaactgc tgaacatgcc
5581 cagcacac
5' end
Intron on sense strand
5' end
gtgtgct gggcatgttc agcagtttgg gggtgcaagt cccctgccca acctgatgga
3421 ggtgaagggc tagcgagaac gcaagggcta ccgccgcgag gtggggtctg aaagaagcgt
3481 atagcaaatg tgcgagtcga tggacaaaaa cctgatacaa ggcgttagcg ccggacgagc
3541 gggcaactta ccgcgaagtc catatccatc aagggtgcga agcgtagatt cggcggctgt
3601 gcacagaaga cggctgaact taccccagga gatctgctcc gtgtcgcttc ggcgactgag
3661 gctgtcgaaa ggcggcttga ccgcggggca gaagtcagca gagggcatat taggtcatga
3721 cgtcggcaag gctagtgagg cactccgcat gccgaaaggc ggagcaacgg ataggccaag
3781 ccgggaacga tggctgaagg cccgaacgag aaggagcggc cagtaggcca tgagtctcaa
3841 ggacgacaag cagcagaaca tccagatgca actggacttc tcttctgtgc taacgggtga
3901 agcccgagag gcgggaaggg aagagactga atcggccggg gcgatgaatg ggaccgaaaa
3961 cccagccagc acgaatcgat tgatggagga agtatgtgag cgagaaaacc tgaaggcagc
4021 attgcagcgg gtgaaggcca acaaaggtag tccgggcgtt gacgggatga ccgtcatcgg
4081 catcaaggat tacctgaagc agcactggcc agccatccgg ggacaactgt tgagtgggac
4141 ttacgagccg aaaccggtga ggcgggtaga gattgccaaa ccggacggag gggtgcgaaa
4201 gcttggtatc ccaacggtgt tggatcgatt tatccagcag gcggtgatgc aggttctgca
4261 gaggcggtgg gaccggacgt tttctgatta cagctacggg ttccggccgg gacggtcggc
4321 tcagcaagcg gtggcacagg cgcagcagta tattgccgaa ggtcacggct ggtgcgttga
4381 tcttgatttg gagaaattct tcgatcgagt caatcacgac aaattgatgg gtcagatcgc
4441 caaacggatc gcggacaagc ggctgctgaa actcatccgg gcattcttga atgccggggt
4501 gatggagaac gggttggtca gcccgagcgt ggaaggaact ccgcaaggag gtcctctttc
4561 gccactgctc agtaacctcg tgctcgacga attcgaccgg gagttggagc gccggggtca
4621 tcgcttcgtt cgatatgcgg acgactgtaa tatctacgtt cgcagcgagc gggcggggca
4681 acgagtgatg gagagcatta cgcaattcat cacgcaaaag ctcaagctca aggtaaatga
4741 gacgaagagt gcggtggcac gaccgcagga acggaagttt ctcgggttca gctttacggc
4801 tggtccggag gccaagcgcg tgattgcacc gaaggccttg gatcggttta agcgacgaat
4861 ccgggagatc acgggacggg caaagggcgt tagcatgaag acgacgatgg aggaactggc
4921 tccgtatatg cggggttggc gcagctattt cggattctgc gaaacgcccg aggtgctgat
4981 cggtttaacg cgctgggtcc ggttgcgact ccgggcggcc atgtggcggc agtggaaaac
5041 accgcgccgt cgccgggcag ctctgttgga actgggggtg cgagagcggc tggcgactaa
5101 taccgctggc agcggtcgcg gtccttggta cctcgcgaag gccaaggccc tggctgttgg
5161 gctttcaaat gcttacttca aatcgctggg acttccgtca ttgttcgagg actgttagcg
5221 taaccatctc gaaccgccgt gtacggaccc gtacgcacgg tggtgtggca ggggtcggcg
5281 ggtgaccgcc gcccctatgc cgat
3' end
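The two listings above show the same intron read from opposite strands, so each is the reverse complement of the other. The Python sketch below illustrates that relationship. The helper names `clean_rows` and `reverse_complement` are illustrative assumptions and not part of the database; the sample strings are copied from the rows above.

```python
import re

# Complement table for lowercase DNA; 'n' (unknown base) maps to itself.
_COMPLEMENT = str.maketrans("acgtn", "tgcan")


def clean_rows(text: str) -> str:
    """Drop position numbers, spaces and line breaks from sequence rows.

    Pass sequence rows only: labels such as "3' end" contain letters
    (for example 'n') that would be mistaken for bases.
    """
    return re.sub(r"[^acgtn]", "", text.lower())


def reverse_complement(rows: str) -> str:
    """Return the opposite strand of the given rows, read 5' to 3'."""
    return clean_rows(rows).translate(_COMPLEMENT)[::-1]


# Last row of the antisense listing (the 5' end of the intron).
print(reverse_complement("5581 cagcacac"))  # gtgtgctg
```

The printed bases match the start of the sense-strand listing (`gtgtgct g...`). Applying the same function to the full antisense listing should reproduce the sense listing once both are cleaned with `clean_rows`.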
[Intron and flanking sequence]
3301 ggaactcgga ccaccaccga tgctgcaggc tgctgtccgc ccagcagcgc taccccgacg
3361 tgggccaacc tgagaacccg gagcacactc gcaaccatct tcaagcatga tcctacactt
3421 tttgctggat ttcggtagct cgtgccgccc cactgtgagc tgccggctgg cggtgtcgcc
3481 gatcgttccg gcccggacct cgaatcctgc gagaagagac atcagcggca attcgtagct
3541 gccgataatg gatattaacg ctcgggcttg cgccttcgcg gcgccccatc ttccgagcgc
3601 tggcgatgcg accaacgcaa acgggatgct tcccagaacg ccgtttgatc ggcatagggg
3661 cggcggtcac ccgccgaccc ctgccacacc accgtgcgta cgggtccgta cacggcggtt
3721 cgagatggtt acgctaacag tcctcgaaca atgacggaag tcccagcgat ttgaagtaag
3781 catttgaaag cccaacagcc agggccttgg ccttcgcgag gtaccaagga ccgcgaccgc
3841 tgccagcggt attagtcgcc agccgctctc gcacccccag ttccaacaga gctgcccggc
3901 gacggcgcgg tgttttccac tgccgccaca tggccgcccg gagtcgcaac cggacccagc
3961 gcgttaaacc gatcagcacc tcgggcgttt cgcagaatcc gaaatagctg cgccaacccc
4021 gcatatacgg agccagttcc tccatcgtcg tcttcatgct aacgcccttt gcccgtcccg
4081 tgatctcccg gattcgtcgc ttaaaccgat ccaaggcctt cggtgcaatc acgcgcttgg
4141 cctccggacc agccgtaaag ctgaacccga gaaacttccg ttcctgcggt cgtgccaccg
4201 cactcttcgt ctcatttacc ttgagcttga gcttttgcgt gatgaattgc gtaatgctct
4261 ccatcactcg ttgccccgcc cgctcgctgc gaacgtagat attacagtcg tccgcatatc
4321 gaacgaagcg atgaccccgg cgctccaact cccggtcgaa ttcgtcgagc acgaggttac
4381 tgagcagtgg cgaaagagga cctccttgcg gagttccttc cacgctcggg ctgaccaacc
4441 cgttctccat caccccggca ttcaagaatg cccggatgag tttcagcagc cgcttgtccg
4501 cgatccgttt ggcgatctga cccatcaatt tgtcgtgatt gactcgatcg aagaatttct
4561 ccaaatcaag atcaacgcac cagccgtgac cttcggcaat atactgctgc gcctgtgcca
4621 ccgcttgctg agccgaccgt cccggccgga acccgtagct gtaatcagaa aacgtccggt
4681 cccaccgcct ctgcagaacc tgcatcaccg cctgctggat aaatcgatcc aacaccgttg
4741 ggataccaag ctttcgcacc cctccgtccg gtttggcaat ctctacccgc ctcaccggtt
4801 tcggctcgta agtcccactc aacagttgtc cccggatggc tggccagtgc tgcttcaggt
4861 aatccttgat gccgatgacg gtcatcccgt caacgcccgg actacctttg ttggccttca
4921 cccgctgcaa tgctgccttc aggttttctc gctcacatac ttcctccatc aatcgattcg
4981 tgctggctgg gttttcggtc ccattcatcg ccccggccga ttcagtctct tcccttcccg
5041 cctctcgggc ttcacccgtt agcacagaag agaagtccag ttgcatctgg atgttctgct
5101 gcttgtcgtc cttgagactc atggcctact ggccgctcct tctcgttcgg gccttcagcc
5161 atcgttcccg gcttggccta tccgttgctc cgcctttcgg catgcggagt gcctcactag
5221 ccttgccgac gtcatgacct aatatgccct ctgctgactt ctgccccgcg gtcaagccgc
5281 ctttcgacag cctcagtcgc cgaagcgaca cggagcagat ctcctggggt aagttcagcc
5341 gtcttctgtg cacagccgcc gaatctacgc ttcgcaccct tgatggatat ggacttcgcg
5401 gtaagttgcc cgctcgtccg gcgctaacgc cttgtatcag gtttttgtcc atcgactcgc
5461 acatttgcta tacgcttctt tcagacccca cctcgcggcg gtagcccttg cgttctcgct
5521 agcccttcac ctccatcagg ttgggcaggg gacttgcacc cccaaactgc tgaacatgcc
5581 cagcacacaa ctaagccgct cgcgcggcgg acgctacgcg tcaggccggg gctagcgccc
5641 attctttggt gggagtattt tgtgggtcac aaggaggtcc ctgtgaccca acttcgcaaa
5701 atggtgctcg aggaactcga gcgccgtaat tactctcaag ctaccgcacg tgcctacgtc
5761 ggcgccatcc agcggttcgc cgaacatttc catcgctcgc ccgatcaact tggccccgag
5821 cacatccgcg aatatcaact gcacctcgtg caggaccgca aactgcatcc caggaccgtc
5881 atgatccaga tgtccgcgct ccgcttcttc ttccgcaagg tcctgaagcg gcgctttgat
MSLKDDKQQNIQMQLDFSSVLTGEAREAGREETESAGAMNGTENPASTNRLMEEVCER
ENLKAALQRVKANKGSPGVDGMTVIGIKDYLKQHWPAIRGQLLSGTYEPKPVRRVEIA
KPDGGVRKLGIPTVLDRFIQQAVMQVLQRRWDRTFSDYSYGFRPGRSAQQAVAQAQQY
IAEGHGWCVDLDLEKFFDRVNHDKLMGQIAKRIADKRLLKLIRAFLNAGVMENGLVSP
SVEGTPQGGPLSPLLSNLVLDEFDRELERRGHRFVRYADDCNIYVRSERAGQRVMESI
TQFITQKLKLKVNETKSAVARPQERKFLGFSFTAGPEAKRVIAPKALDRFKRRIREIT
GRAKGVSMKTTMEELAPYMRGWRSYFGFCETPEVLIGLTRWVRLRLRAAMWRQWKTPR
RRRAALLELGVRERLATNTAGSGRGPWYLAKAKALAVGLSNAYFKSLGLPSLFEDC
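The amino acid sequence above can be checked against the sense-strand listing, where the ORF begins at `atg agt ctc ...`. The Python sketch below is one way to do this. It assumes the standard codon assignments, which the bacterial genetic code shares, and the function name `translate` is a hypothetical helper, not something provided by the database.

```python
import re

# Standard genetic code in TCAG order; '*' marks a stop codon.
_BASES = "tcag"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def translate(rows: str) -> str:
    """Translate sense-strand rows from their first base.

    Position numbers and spaces are ignored; translation stops at the
    first stop codon or when fewer than three bases remain.
    """
    seq = re.sub(r"[^acgt]", "", rows.lower())
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 27 bases of the ORF, copied from the sense-strand listing.
print(translate("atgagtctca aggacgacaa gcagcag"))  # MSLKDDKQQ
```

The printed residues match the start of the protein sequence above. The input must begin at the start codon, because the sketch does not search for reading frames.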