[Back to introns by organism]  [Back to home page]

Information for O.i.I2 intron  (Format of information for each intron

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence]

 

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

Intron on antisense strand

3' end

3001 gtagggtaga aaaaatacgc ctttctttaa atgatagaag gttttatttc tcctccctca

3061 gaaccgtacg tacacctttc agcgtatacg gctcgctagt tgaatatagc aggtcaacag

3121 gaaaagtttt atgttacact tcagaaaatt tttgaagaac tttctttcga tgtagtttta

3181 aatcatacat ttttataaac ataataaatg tttgattctt acacttatca ttacacacaa

3241 gagcgattct tttatttcga ggatgtttca tagacatagc aagctgcttc gctgttgtct

3301 tatatttaaa cgcaagcgtt tttaaaaagc tactctcagc taaatgaacc attttttcaa

3361 tatcattaaa atttacagca aatagacaat gattggcaaa tgaaataagt tcttcattgt

3421 atatctttaa aatttcttga gcagaaaaat gtaagagcct ttttctagcc aaaggtatgt

3481 gctcttgaaa aacaccataa cctttttgag aagcccattg gttcatttgt tgcttaggaa

3541 taatcaaatt aagttgctga cttgtaccag aatttgatag ataatttctt ttagaggaga

3601 gttgataatt ttgaaaataa attctgtcac ctcgatgatt taggtttgat tgtatatcat

3661 aaaaagaatg aagtccgctc cgatcaagct cattattaat ctccccaagt aaacggttcg

3721 catgaacttt tgtgctgttt aaaccaacaa gatactcttc cttattgaaa acgaatttga

3781 tagaagtacc attatttctt tcaaatgcat cgattatccc ttttaagttt tgtacgacag

3841 attcctggaa aatatgttct aggagattgg ataatttaat cattgtatag ctatatgggt

3901 gatgctgctt ttgcagactc agcattgatg cataaaacac aagttgaata aatcgttgat

3961 catttacttt tcttttaatc ttattaatta gaaattcaag atgttgttgt gaaaagggga

4021 ttttttttaa ctttattttt ataaaccaat taatggtacc ccagcttttt aatttttgaa

4081 gtgaagaaaa tgaaggatta tttttaatca gtagtagttt ttcaaagttc cctaaatgat

4141 atatatgttc tagtataaca actacggatt ctattattaa tatattcttc atatggaggt

4201 tctcatttaa ttttgatttt gagggctgtg agaagcggaa ttttcctgat tttaactcat

4261 aaaccaattc ttttatcatt ttatctttaa tttgtctgtg tagatatctt agacgaattg

4321 tacgcataat aacgatttgc cagagatcgt cattatataa taaccggtag caatcgaata

4381 ctacttttcc ttgttgagct gctctttgta ttacgtccaa ttgaggaaat ggatgaattg

4441 tcatattcat gactcctttc gtctagacgt atttcaacct atcttccttc cccttgccaa

4501 acctattaca ggctgacgtt gagtactatg aagactccgt taccttatgc cttatttttg

4561 agacatagca cttaggcaat ccctgcatta tgtcaatata tagatacgtg atgtttaggt

4621 acccatttcg atggtttctt cttattgaag atgggaggat aacccttacg atgagataga

4681 aagcggtgaa ttccaatctc atgataccct aactatttaa tagttacaca cccaatatcc

4741 cttcaccttg ggttcaagca gtatagcttt taccatgcat cacttggtct gaaagacaaa

4801 agtataggat gtactatttc cttcttttca catcatgcta ttgtccccta attggtttcc

4861 caattaaagt aagatgctga ccatagaaca ttgtaaggag ttctagatac tttgtgtatc

4921 actatataat tacaccttgc aggcgcac

5' end

Intron on sense strand

5' end

   1 gtgcgcctgc aaggtgtaat tatatagtga tacacaaagt atctagaact ccttacaatg 

   61 ttctatggtc agcatcttac tttaattggg aaaccaatta ggggacaata gcatgatgtg 

 121 aaaagaagga aatagtacat cctatacttt tgtctttcag accaagtgat gcatggtaaa 

 181 agctatactg cttgaaccca aggtgaaggg atattgggtg tgtaactatt aaatagttag 

 241 ggtatcatga gattggaatt caccgctttc tatctcatcg taagggttat cctcccatct 

 301 tcaataagaa gaaaccatcg aaatgggtac ctaaacatca cgtatctata tattgacata 

 361 atgcagggat tgcctaagtg ctatgtctca aaaataaggc ataaggtaac ggagtcttca 

 421 tagtactcaa cgtcagcctg taataggttt ggcaagggga aggaagatag gttgaaatac 

 481 gtctagacga aaggagtcat gaatatgaca attcatccat ttcctcaatt ggacgtaata 

 541 caaagagcag ctcaacaagg aaaagtagta ttcgattgct accggttatt atataatgac 

 601 gatctctggc aaatcgttat tatgcgtaca attcgtctaa gatatctaca cagacaaatt 

 661 aaagataaaa tgataaaaga attggtttat gagttaaaat caggaaaatt ccgcttctca 

 721 cagccctcaa aatcaaaatt aaatgagaac ctccatatga agaatatatt aataatagaa 

 781 tccgtagttg ttatactaga acatatatat catttaggga actttgaaaa actactactg 

 841 attaaaaata atccttcatt ttcttcactt caaaaattaa aaagctgggg taccattaat 

 901 tggtttataa aaataaagtt aaaaaaaatc cccttttcac aacaacatct tgaatttcta 

 961 attaataaga ttaaaagaaa agtaaatgat caacgattta ttcaacttgt gttttatgca 

1021 tcaatgctga gtctgcaaaa gcagcatcac ccatatagct atacaatgat taaattatcc 

1081 aatctcctag aacatatttt ccaggaatct gtcgtacaaa acttaaaagg gataatcgat 

1141 gcatttgaaa gaaataatgg tacttctatc aaattcgttt tcaataagga agagtatctt 

1201 gttggtttaa acagcacaaa agttcatgcg aaccgtttac ttggggagat taataatgag 

1261 cttgatcgga gcggacttca ttctttttat gatatacaat caaacctaaa tcatcgaggt 

1321 gacagaattt attttcaaaa ttatcaactc tcctctaaaa gaaattatct atcaaattct 

1381 ggtacaagtc agcaacttaa tttgattatt cctaagcaac aaatgaacca atgggcttct 

1441 caaaaaggtt atggtgtttt tcaagagcac atacctttgg ctagaaaaag gctcttacat 

1501 ttttctgctc aagaaatttt aaagatatac aatgaagaac ttatttcatt tgccaatcat 

1561 tgtctatttg ctgtaaattt taatgatatt gaaaaaatgg ttcatttagc tgagagtagc 

1621 tttttaaaaa cgcttgcgtt taaatataag acaacagcga agcagcttgc tatgtctatg 

1681 aaacatcctc gaaataaaag aatcgctctt gtgtgtaatg ataagtgtaa gaatcaaaca 

1741 tttattatgt ttataaaaat gtatgattta aaactacatc gaaagaaagt tcttcaaaaa 

1801 ttttctgaag tgtaacataa aacttttcct gttgacctgc tatattcaac tagcgagccg 

1861 tatacgctga aaggtgtacg tacggttctg agggaggaga aataaaacct tctatcattt 

1921 aaagaaaggc gtattttttc taccctac

3' end

[top]


[Intron and flanking sequence]

 

2701 tccttcgatt ctaattttct ttatgtatct cgcttttata acatttttca ttcattataa

2761 atgataagaa gcgcctctcc ctttcttgca atgaaagatg gcgctaaatt taccaattct

2821 tatttaatta tacgaaaaaa cttataaaat agatagtatt tataaataac ctttttcttt

2881 tagggatacg aattttttgt caccgataat cagatggtct aatacttcaa tgccaattaa

2941 tttaccagag tccactaaac gtttcgtcac atggatatct tcttgtgacg gtgttgggtc

3001 gtagggtaga aaaaatacgc ctttctttaa atgatagaag gttttatttc tcctccctca

3061 gaaccgtacg tacacctttc agcgtatacg gctcgctagt tgaatatagc aggtcaacag

3121 gaaaagtttt atgttacact tcagaaaatt tttgaagaac tttctttcga tgtagtttta

3181 aatcatacat ttttataaac ataataaatg tttgattctt acacttatca ttacacacaa

3241 gagcgattct tttatttcga ggatgtttca tagacatagc aagctgcttc gctgttgtct

3301 tatatttaaa cgcaagcgtt tttaaaaagc tactctcagc taaatgaacc attttttcaa

3361 tatcattaaa atttacagca aatagacaat gattggcaaa tgaaataagt tcttcattgt

3421 atatctttaa aatttcttga gcagaaaaat gtaagagcct ttttctagcc aaaggtatgt

3481 gctcttgaaa aacaccataa cctttttgag aagcccattg gttcatttgt tgcttaggaa

3541 taatcaaatt aagttgctga cttgtaccag aatttgatag ataatttctt ttagaggaga

3601 gttgataatt ttgaaaataa attctgtcac ctcgatgatt taggtttgat tgtatatcat

3661 aaaaagaatg aagtccgctc cgatcaagct cattattaat ctccccaagt aaacggttcg

3721 catgaacttt tgtgctgttt aaaccaacaa gatactcttc cttattgaaa acgaatttga

3781 tagaagtacc attatttctt tcaaatgcat cgattatccc ttttaagttt tgtacgacag

3841 attcctggaa aatatgttct aggagattgg ataatttaat cattgtatag ctatatgggt

3901 gatgctgctt ttgcagactc agcattgatg cataaaacac aagttgaata aatcgttgat

3961 catttacttt tcttttaatc ttattaatta gaaattcaag atgttgttgt gaaaagggga

4021 ttttttttaa ctttattttt ataaaccaat taatggtacc ccagcttttt aatttttgaa

4081 gtgaagaaaa tgaaggatta tttttaatca gtagtagttt ttcaaagttc cctaaatgat

4141 atatatgttc tagtataaca actacggatt ctattattaa tatattcttc atatggaggt

4201 tctcatttaa ttttgatttt gagggctgtg agaagcggaa ttttcctgat tttaactcat

4261 aaaccaattc ttttatcatt ttatctttaa tttgtctgtg tagatatctt agacgaattg

4321 tacgcataat aacgatttgc cagagatcgt cattatataa taaccggtag caatcgaata

4381 ctacttttcc ttgttgagct gctctttgta ttacgtccaa ttgaggaaat ggatgaattg

4441 tcatattcat gactcctttc gtctagacgt atttcaacct atcttccttc cccttgccaa

4501 acctattaca ggctgacgtt gagtactatg aagactccgt taccttatgc cttatttttg

4561 agacatagca cttaggcaat ccctgcatta tgtcaatata tagatacgtg atgtttaggt

4621 acccatttcg atggtttctt cttattgaag atgggaggat aacccttacg atgagataga

 4681 aagcggtgaa ttccaatctc atgataccct aactatttaa tagttacaca cccaatatcc

4741 cttcaccttg ggttcaagca gtatagcttt taccatgcat cacttggtct gaaagacaaa

4801 agtataggat gtactatttc cttcttttca catcatgcta ttgtccccta attggtttcc

4861 caattaaagt aagatgctga ccatagaaca ttgtaaggag ttctagatac tttgtgtatc

4921 actatataat tacaccttgc aggcgcacac cactcgggtg attgtgggct acaattaatg

4981 atgctgcaga acgtttaact gcttctttaa aaacctctct agggtgcact atcgaggcat

5041 ttaaactacc tataaaaact gtttgataat gaataatttg attttttgta tttagaaata

5101 aaactacaaa atgttcctga cttaaataac gcatctcttc catgatgtaa ctagcaccgt

5161 cttcaggaga ccggataaca taaggaccat cgggcttata ggtattcaat cttttcccta

5221 actcgatagc agcaagaatg ataagtcctt tcgctgtgcc aataccttta attgctataa

[top]


[ORF sequence]

 

MNMTIHPFPQLDVIQRAAQQGKVVFDCYRLLYNDDLWQIVIMRTIRLRYLHRQIKDKM

IKELVYELKSGKFRFSQPSKSKLNENLHMKNILIIESVVVILEHIYHLGNFEKLLLIK

NNPSFSSLQKLKSWGTINWFIKIKLKKIPFSQQHLEFLINKIKRKVNDQRFIQLVFYA

SMLSLQKQHHPYSYTMIKLSNLLEHIFQESVVQNLKGIIDAFERNNGTSIKFVFNKEE

YLVGLNSTKVHANRLLGEINNELDRSGLHSFYDIQSNLNHRGDRIYFQNYQLSSKRNY

LSNSGTSQQLNLIIPKQQMNQWASQKGYGVFQEHIPLARKRLLHFSAQEILKIYNEEL

ISFANHCLFAVNFNDIEKMVHLAESSFLKTLAFKYKTTAKQLAMSMKHPRNKRIALVC

NDKCKNQTFIMFIKMYDLKLHRKKVLQKFSEV

[top]


[Secondary structure]

                                                    

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |