[Back to introns by organism]   [Back to home page]

Information of S.t.I1 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

[intron sequence]

 

Sequence from Genbank entry.  Intron is identified in red.  The ORF
is identified in blue with start and stop codons underlined.  

               

5' end                                                              

                                               g tacgcccagc atgggcatga
 121 actaatgagg tgaaagtcct ctgtaggaag atcaccgtta ctctttctaa aagagaataa
 181 catactgtta ttaaacgtta actactagcg aatggcaagg gcttaatcgc gaggtttggt
 241 ctggaggaag ccgctagcaa atttgcgagc tgatgaacaa gaacatcata tgaggcgtag
 301 gctagaggtg agttggcaca agacgacgaa accacgtgat ctaatggcca ccgtaaatga
 361 tgcagcagtg caaagaaagt tcatgttctt atctggggag atctgcttaa cacgcgatcg
 421 gtcattaacc agtaaggctc tggtttataa gtgccttggc ttttcaactt catcgactga
 481 aaagagcgaa ccagatggaa accgatagcg catgacgggg taacttggca tgtgattaag
 541 cagaagtcag cagacggcat agtagcccaa cgcccatcgt aatggttggg atatggtgaa
 601 ggcctgaaca tttagaagaa ggaggagtct tgtcaaactt gtcgataccc actacgtcgc
 661 cagacgatta ctatcagcag cgtattgata tgcaaccggc cttcaacaac gatttattcc
 721 aacaactgct cgaacctgaa aatctacacc gagcttggcg tcaagttaaa gccaataaag
 781 gcgctgcggg catcgatggc atgactatcg aagccttccc gctctggatg caacaaggcg
 841 gctggcaact gtgtaaatct caattagagc gaggtgaata ccaaccctca gcggtcaggc
 901 gcgtagagat cgacaaaccc gatggcggta aacgcaaatt ggggatccct aacgtcattg
 961 accgtgtgat acagcaagcc attgctcaaa tactcacgcc actgtttgac ccattcttct
1021 cgaccaacag ctttggcttt aggccgaaca gaaatgccaa acaagcggta ctgcaagtaa
1081 gggatatcat caaacaaaaa cgcaaatttg ccgtcgatgt tgatctgtct aagttctttg
1141 accgagttaa tcatgatctc ttgatgactc aacttagaag caaggtgcag gataagcgtc
1201 tgttggcgct tattggtaaa taccttcgag ctggtgtgat ggtcaatggc caatttgaag
1261 caagctttga aggcgtgcct caaggcggcc cactctcgcc attactctct aacattatgc
1321 tggatagtct ggataaagag ctggaaagcc gagggcataa attcgcccgc tacgcggacg
1381 acttcatcat tttggtaaag tctattcgag caggagagcg ggtactcaag agcattactc
1441 aatatctggc cactaagcta aaactggccg ttaatgaaca gaaaagccaa gtggttaagg
1501 tcggtcaaag caagtttctt gggtttacat ttaaccgagg aaagatccag tggcatgcta
1561 aaacattacg cattttcaaa caaaagatga gacgactgac gaaccgaaat tggggagtta
1621 gtatgggata tcaattgttt aaagtgcggc agtacatgca aggctggatc aactactttg
1681 gcatagccaa tgcttaccaa ggatgtgtcg atttagacca ttggatccgt cgtcgcatac
1741 cgtatgtgct actggcgtca gtggcgaaaa ccacggacta aggtgcaaaa tttacttaag
1801 cgtggcgtca ggatcaagcc ggtgttgstt gtggtatcac aagtaaaggg ccttggcgta
1861 gctcttaaac accggggata cagcaagccc tgagtaatgg gtatctgaag aagcagccct
1921 tatccttaga gatggatggg
tcgcagttta ttatcctacc acaaagttta ttaagggttc
1981 taaatgaccc gccttgtgcg gaaccggcat gcagggtggt gtgggggctg aaggttagat
2041 acctccgggc tacccgat

3' end  

[top]


[Intron and flanking sequence]

 

   1 ttggaaaccn tttgagcaat tmtgtgctta gtgcatctaa cgcctgagct cagccgaccg
  61 aaaccgcgta gcggttttgg gtcggctgca gcgatttgtg tacgcccagc atgggcatga
 121 actaatgagg tgaaagtcct ctgtaggaag atcaccgtta ctctttctaa aagagaataa
 181 catactgtta ttaaacgtta actactagcg aatggcaagg gcttaatcgc gaggtttggt
 241 ctggaggaag ccgctagcaa atttgcgagc tgatgaacaa gaacatcata tgaggcgtag
 301 gctagaggtg agttggcaca agacgacgaa accacgtgat ctaatggcca ccgtaaatga
 361 tgcagcagtg caaagaaagt tcatgttctt atctggggag atctgcttaa cacgcgatcg
 421 gtcattaacc agtaaggctc tggtttataa gtgccttggc ttttcaactt catcgactga
 481 aaagagcgaa ccagatggaa accgatagcg catgacgggg taacttggca tgtgattaag
 541 cagaagtcag cagacggcat agtagcccaa cgcccatcgt aatggttggg atatggtgaa
 601 ggcctgaaca tttagaagaa ggaggagtct tgtcaaactt gtcgataccc actacgtcgc
 661 cagacgatta ctatcagcag cgtattgata tgcaaccggc cttcaacaac gatttattcc
 721 aacaactgct cgaacctgaa aatctacacc gagcttggcg tcaagttaaa gccaataaag
 781 gcgctgcggg catcgatggc atgactatcg aagccttccc gctctggatg caacaaggcg
 841 gctggcaact gtgtaaatct caattagagc gaggtgaata ccaaccctca gcggtcaggc
 901 gcgtagagat cgacaaaccc gatggcggta aacgcaaatt ggggatccct aacgtcattg
 961 accgtgtgat acagcaagcc attgctcaaa tactcacgcc actgtttgac ccattcttct
1021 cgaccaacag ctttggcttt aggccgaaca gaaatgccaa acaagcggta ctgcaagtaa
1081 gggatatcat caaacaaaaa cgcaaatttg ccgtcgatgt tgatctgtct aagttctttg
1141 accgagttaa tcatgatctc ttgatgactc aacttagaag caaggtgcag gataagcgtc
1201 tgttggcgct tattggtaaa taccttcgag ctggtgtgat ggtcaatggc caatttgaag
1261 caagctttga aggcgtgcct caaggcggcc cactctcgcc attactctct aacattatgc
1321 tggatagtct ggataaagag ctggaaagcc gagggcataa attcgcccgc tacgcggacg
1381 acttcatcat tttggtaaag tctattcgag caggagagcg ggtactcaag agcattactc
1441 aatatctggc cactaagcta aaactggccg ttaatgaaca gaaaagccaa gtggttaagg
1501 tcggtcaaag caagtttctt gggtttacat ttaaccgagg aaagatccag tggcatgcta
1561 aaacattacg cattttcaaa caaaagatga gacgactgac gaaccgaaat tggggagtta
1621 gtatgggata tcaattgttt aaagtgcggc agtacatgca aggctggatc aactactttg
1681 gcatagccaa tgcttaccaa ggatgtgtcg atttagacca ttggatccgt cgtcgcatac
1741 cgtatgtgct actggcgtca gtggcgaaaa ccacggacta aggtgcaaaa tttacttaag
1801 cgtggcgtca ggatcaagcc ggtgttgstt gtggtatcac aagtaaaggg ccttggcgta
1861 gctcttaaac accggggata cagcaagccc tgagtaatgg gtatctgaag aagcagccct
1921 tatccttaga gatggatggg tcgcagttta ttatcctacc acaaagttta ttaagggttc
1981 taaatgaccc gccttgtgcg gaaccggcat gcagggtggt gtgggggctg aaggttagat
2041 acctccgggc tacccgat
ta ggccgcatat cgcgacctga aagcggcacg caagacctca
2101 accttttccg ccccgagtga ggtgcatgcg agcctgtagg actctatgtg ctttgtaggc
2161 cagtccactg gtggtacttc atcggcatag taaaagtaat cccagatgat cgcctcccag
2221 ctgttacaac ggactggccg cccggcgatg acgccctcag ccgcctctgg gcacgagccc
2281 tgcggagcct ccgcgatttc atacgcttcg tctgcccacc aagcaggttc gcagtcaagt
2341 aactcatccc cgatctccgc taagaatcca tagtccaact cctccatgac gcgcccgccg
2401 agcatttcaa ctattgcctc gagctcgccg cgcctctcgc cgggaaacgt cagatcaata
2461 tcatcgtgct tgcgtgttac acgccctagc cgtgcatcga tcgcccagcc cccaccgatc
2521 cagagcggca gatttcgctc atctgccgca gctagaattt tgtgtatcaa tgtgacctgc
2581 gttgtgtcca tgcggcctaa ccttgtttta gggcgactgc cctgctgcgt aacatcgttg
2641 ctgctccata acatcaaaca tcgacccagg gnktnaa

[top]


[ORF sequence]

 

MQPAFNNDLFQQLLEPENLHRAWRQVKANKGAAGIDGMTIEAFPLWMQQGGWQLCKSQ

LERGEYQPSAVRRVEIDKPDGGKRKLGIPNVIDRVIQQAIAQILTPLFDPFFSTNSFG

FRPNRNAKQAVLQVRDIIKQKRKFAVDVDLSKFFDRVNHDLLMTQLRSKVQDKRLLAL

IGKYLRAGVMVNGQFEASFEGVPQGGPLSPLLSNIMLDSLDKELESRGHKFARYADDF

IILVKSIRAGERVLKSITQYLATKLKLAVNEQKSQVVKVGQSKFLGFTFNRGKIQWHA

KTLRIFKQKMRRLTNRNWGVSMGYQLFKVRQYMQGWINYFGIANAYQGCVDLDHWIRR

RIPYVLLASVAKTTD*GAKFT*AWRQDQAGVXCGITSKGPWRSS*TPGIQQALSNGYL

KKQPLSLEMDG

top]


[Secondary structure]

 

[top]


 

| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |