[Back to introns by organism] [Back to home page]

Information of E.f.I4 intron   (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

[Intron sequence]

 

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

 

Intron on antisense strand

 

3' end

   1 ataacgatag gtaagaattt gatataatct ccatcttttc ccccgctcca caccgtacgt 

  61 gagcctttca cctcatacgg cgttccatct atttattttc tattgaatgt tttgcagatt 

 121 acacattttt cgataaaggt tgatttttgc tatttgctct ggcagcagtt tcagttcatt 

 181 gacgtagtga tcgattatca ttttatttgt ggtatggatc aaaagatgga tatccttatg 

 241 aataattctc aagttatcaa atttgtcgtc tccaccaaga ctttttggta aatagtgatg 

 301 acagtgaacg gcttttgctg gcaaaaactg ttttgtaatc tcacacttgc ctgatttcat 

 361 tgagtaacga gatattctat tatctaaata ctcaaccatt ctattcagta ttttcgattc 

 421 catcaatttt cttaattctt ggtatacaag agactgtaag cttttgtgct ccaacctttg 

 481 acgaccaaac ggggtaaacg gtgtatcttc cgtattaaat ccgtaaattg tttttgtaga 

 541 tacgtcgcaa agagggtaca aatagactcc ggctacttta tacgttttca tcgttgtcga 

 601 gtaaaacttt ttgtaagtag gggaagcttt gttgggataa catcttgtac tacaatttga 

 661 taatcggtta tacgtagtga atgaaagaac gtagttcatc ctattcaaat ctaagttaat 

 721 atgagtggca tacctgaaat aattatgaac accaagcact agagcattaa ataggagagc 

 781 attttgtgct gtagggctct tttgaatatc tttaattctt tgtttaatct cttttttaat 

 841 ttgatttttc tttttgtcag aaatatgcga attacaaacc catttttccc cttttgtttc 

 901 acgcaaatgg tgaatcctaa aaattccgat ttgcgtttac gcaagttgac gattttagac 

 961 ttctcattgg agatatctaa tttaagcctg tcttttaggt agagcttgac agcgtggaac 

1021 cattttaatg cggacggata atcattagtc atgattttaa aatcatctgc gtacctgaca 

1081 atatagcctt gctttagatt tgtatctctt aaagcacgaa atttattgta acctttcgta 

1141 tagggatact ttgtttcaaa ggtatgccac tgctttgaga cccaatggtc taagtcgttt 

1201 agaacaacat tagataatag tggagaaatg atacctcctt ggatcgttcc tttgctcgaa 

1261 ataccttctc cctgaatggg tgatttcaga gattttgaaa gaatagctaa gactcgtttg 

1321 tcacaaattc ctatattcca taattgtttt atcagtaaac ggtgatttac attatcaaag 

1381 aaacccttga tatcaatgtc tactgcatag tgcatcttac tgatgtttat gagatacata 

1441 atgcgaccta gagcatgttt tgcacttctc aatggtctga acccgtaact atgttcataa 

1501 aactttgctt cacatattgg ctccagtacc tgcttgaaca tctgttggat aattcgatca 

1561 atcatgcatg gaattcctaa tgggcgtttc tcgccgttgg gtttaggtat cattactcgc 

1621 tttatggatt ttggtttgta attttctaac tgactgagaa taagatgtat aaactctgct 

1681 tggttcattt ccttatagtt atcaattgtg aaggagtccg ttcctggtgt cgaagacccc 

1741 ttgtttgcct tgatagtgcg gtatgctaaa agaatattat tttcagaaat gattaactcg 

1801 tataattgat aaaacttttt cccgttttta ctttgagtaa atagctgatc gaatgtttct 

1861 tgtagatcgt aatattccca gtaacgtatt tttgtgttca agcgtggcac ctccataacg 

1921 gatttcccca cgttcttacc agatccttga acttcatctg tttcaacatt cattgttatt 

1981 aaatagactt agggctatcc ctccacgttt gttagacgct tcatcggtac tgtgccctta 

2041 ctttcacaga aacaaagcaa tttcgttatt acttatatgc acctatctag aagtagtcat 

2101 ttatttgata gttttctgct tacatcgttc caatgttttt agctatcatg tataacctta 

2161 ggtatctact atgagcctgt gaaattatag cttcttcgtt agctatcccg attttcatat 

2221 catggaacct tactcgtcgg tatttcacca tacgtccgta tgttcctatc tttcgacaag 

2281 atcagccttt tagacccgta cattcgcaga ttcgtcagtc attttcagac attcttacca 

2341 tagttatctt tttcagccgc ccgccctata gtcagcctta tgctcaaaca cttaggcaga 

2401 tttcagacga cttcacctag cttcatacca aaaccaatct acttcattga ttagagcatg 

2461 taggagtatt agcggagatg tttcagctca acttgaaggc tttcattcca tctctcaaaa 

2521 cattagttgt cactattgat aatctcaata ggcttccatt tcacatctat tcgatgcttg 

2581 taggaaaacg attcgcac

5' end  

 

Intron on sense strand

 

5' end  

   1 gtgcgaatcg ttttcctaca agcatcgaat agatgtgaaa tggaagccta ttgagattat 

  61 caatagtgac aactaatgtt ttgagagatg gaatgaaagc cttcaagttg agctgaaaca 

 121 tctccgctaa tactcctaca tgctctaatc aatgaagtag attggttttg gtatgaagct 

 181 aggtgaagtc gtctgaaatc tgcctaagtg tttgagcata aggctgacta tagggcgggc  

 241 ggctgaaaaa gataactatg gtaagaatgt ctgaaaatga ctgacgaatc tgcgaatgta 

 301 cgggtctaaa aggctgatct tgtcgaaaga taggaacata cggacgtatg gtgaaatacc 

 361 gacgagtaag gttccatgat atgaaaatcg ggatagctaa cgaagaagct ataatttcac 

 421 aggctcatag tagataccta aggttataca tgatagctaa aaacattgga acgatgtaag 

 481 cagaaaacta tcaaataaat gactacttct agataggtgc atataagtaa taacgaaatt 

 541 gctttgtttc tgtgaaagta agggcacagt accgatgaag cgtctaacaa acgtggaggg 

 601 atagccctaa gtctatttaa taacaatgaa tgttgaaaca gatgaagttc aaggatctgg 

 661 taagaacgtg gggaaatccg ttatggaggt gccacgcttg aacacaaaaa tacgttactg 

 721 ggaatattac gatctacaag aaacattcga tcagctattt actcaaagta aaaacgggaa 

 781 aaagttttat caattatacg agttaatcat ttctgaaaat aatattcttt tagcataccg 

 841 cactatcaag gcaaacaagg ggtcttcgac accaggaacg gactccttca caattgataa 

 901 ctataaggaa atgaaccaag cagagtttat acatcttatt ctcagtcagt tagaaaatta 

 961 caaaccaaaa tccataaagc gagtaatgat acctaaaccc aacggcgaga aacgcccatt 

1021 aggaattcca tgcatgattg atcgaattat ccaacagatg ttcaagcagg tactggagcc 

1081 aatatgtgaa gcaaagtttt atgaacatag ttacgggttc agaccattga gaagtgcaaa 

1141 acatgctcta ggtcgcatta tgtatctcat aaacatcagt aagatgcact atgcagtaga 

1201 cattgatatc aagggtttct ttgataatgt aaatcaccgt ttactgataa aacaattatg 

1261 gaatatagga atttgtgaca aacgagtctt agctattctt tcaaaatctc tgaaatcacc 

1321 cattcaggga gaaggtattt cgagcaaagg aacgatccaa ggaggtatca tttctccact 

1381 attatctaat gttgttctaa acgacttaga ccattgggtc tcaaagcagt ggcatacctt 

1441 tgaaacaaag tatccctata cgaaaggtta caataaattt cgtgctttaa gagatacaaa 

1501 tctaaagcaa ggctatattg tcaggtacgc agatgatttt aaaatcatga ctaatgatta 

1561 tccgtccgca ttaaaatggt tccacgctgt caagctctac ctaaaagaca ggcttaaatt 

1621 agatatctcc aatgagaagt ctaaaatcgt caacttgcgt aaacgcaaat cggaattttt 

1681 aggattcacc atttgcgtga aacaaaaggg gaaaaatggg tttgtaattc gcatatttct 

1741 gacaaaaaga aaaatcaaat taaaaaagag attaaacaaa gaattaaaga tattcaaaag 

1801 agccctacag cacaaaatgc tctcctattt aatgctctag tgcttggtgt tcataattat 

1861 ttcaggtatg ccactcatat taacttagat ttgaatagga tgaactacgt tctttcattc 

1921 actacgtata accgattatc aaattgtagt acaagatgtt atcccaacaa agcttcccct 

1981 acttacaaaa agttttactc gacaacgatg aaaacgtata aagtagccgg agtctatttg 

2041 taccctcttt gcgacgtatc tacaaaaaca atttacggat ttaatacgga agatacaccg 

2101 tttaccccgt ttggtcgtca aaggttggag cacaaaagct tacagtctct tgtataccaa 

2161 gaattaagaa aattgatgga atcgaaaata ctgaatagaa tggttgagta tttagataat 

2221 agaatatctc gttactcaat gaaatcaggc aagtgtgaga ttacaaaaca gtttttgcca 

2281 gcaaaagccg ttcactgtca tcactattta ccaaaaagtc ttggtggaga cgacaaattt 

2341 gataacttga gaattattca taaggatatc catcttttga tccataccac aaataaaatg 

2401 ataatcgatc actacgtcaa tgaactgaaa ctgctgccag agcaaatagc aaaaatcaac 

2461 ctttatcgaa aaatgtgtaa tctgcaaaac attcaataga aaataaatag atggaacgcc 

2521 gtatgaggtg aaaggctcac gtacggtgtg gagcggggga aaagatggag attatatcaa 

2581 attcttacct atcgttat

3' end  

[top]


[Intron and flanking sequence]

 

1021 ttttccatca ataacagaag aaataatttc cttcttgttc tccttggcat attgtaaaga
1081 atcatcctct gtcattttta ttgcacctcc ctgacacctt caaagctagt ttatcatact
1141 tttaacccga tgaactagta gattaaattt ttctgtatct gtcatttttc ttttgattgt
1201 gctacttgtt tgatggattc taagaaatca tacccttttg gcacaagtgg cgtataaaac
1261 tcactaatca cactccctcc agtatccaca tagcctcgac ctttgatgcg tttcataaag
1321 aaatttttat caacttctcc aaacatcatg gaatacccga gttcactcat acgacccaaa
1381 gccactcgga agttaaattg atcgcgaatc ccgtctccta aatattttgc atccggtctt
1441 tgacaagcca gaattagaaa gaaaccagac tgacgaccca acatcacgat ttgtttgagc
1501 ttatttaaaa tcaccgcact ttctttcgtc gttaacattt ataacgatag gtaagaattt
1561 gatataatct ccatcttttc ccccgctcca caccgtacgt gagcctttca cctcatacgg
1621 cgttccatct atttattttc tattgaatgt tttgcagatt acacattttt cgataaaggt
1681 tgatttttgc tatttgctct ggcagcagtt tcagttcatt gacgtagtga tcgattatca
1741 ttttatttgt ggtatggatc aaaagatgga tatccttatg aataattctc aagttatcaa
1801 atttgtcgtc tccaccaaga ctttttggta aatagtgatg acagtgaacg gcttttgctg
1861 gcaaaaactg ttttgtaatc tcacacttgc ctgatttcat tgagtaacga gatattctat
1921 tatctaaata ctcaaccatt ctattcagta ttttcgattc catcaatttt cttaattctt
1981 ggtatacaag agactgtaag cttttgtgct ccaacctttg acgaccaaac ggggtaaacg
2041 gtgtatcttc cgtattaaat ccgtaaattg tttttgtaga tacgtcgcaa agagggtaca
2101 aatagactcc ggctacttta tacgttttca tcgttgtcga gtaaaacttt ttgtaagtag
2161 gggaagcttt gttgggataa catcttgtac tacaatttga taatcggtta tacgtagtga
2221 atgaaagaac gtagttcatc ctattcaaat ctaagttaat atgagtggca tacctgaaat
2281 aattatgaac accaagcact agagcattaa ataggagagc attttgtgct gtagggctct
2341 tttgaatatc tttaattctt tgtttaatct cttttttaat ttgatttttc tttttgtcag
2401 aaatatgcga attacaaacc catttttccc cttttgtttc acgcaaatgg tgaatcctaa
2461 aaattccgat ttgcgtttac gcaagttgac gattttagac ttctcattgg agatatctaa
2521 tttaagcctg tcttttaggt agagcttgac agcgtggaac cattttaatg cggacggata
2581 atcattagtc atgattttaa aatcatctgc gtacctgaca atatagcctt gctttagatt
2641 tgtatctctt aaagcacgaa atttattgta acctttcgta tagggatact ttgtttcaaa
2701 ggtatgccac tgctttgaga cccaatggtc taagtcgttt agaacaacat tagataatag
2761 tggagaaatg atacctcctt ggatcgttcc tttgctcgaa ataccttctc cctgaatggg
2821 tgatttcaga gattttgaaa gaatagctaa gactcgtttg tcacaaattc ctatattcca
2881 taattgtttt atcagtaaac ggtgatttac attatcaaag aaacccttga tatcaatgtc
2941 tactgcatag tgcatcttac tgatgtttat gagatacata atgcgaccta gagcatgttt
3001 tgcacttctc aatggtctga acccgtaact atgttcataa aactttgctt cacatattgg
3061 ctccagtacc tgcttgaaca tctgttggat aattcgatca atcatgcatg gaattcctaa
3121 tgggcgtttc tcgccgttgg gtttaggtat cattactcgc tttatggatt ttggtttgta
3181 attttctaac tgactgagaa taagatgtat aaactctgct tggttcattt ccttatagtt
3241 atcaattgtg aaggagtccg ttcctggtgt cgaagacccc ttgtttgcct tgatagtgcg
3301 gtatgctaaa agaatattat tttcagaaat gattaactcg tataattgat aaaacttttt
3361 cccgttttta ctttgagtaa atagctgatc gaatgtttct tgtagatcgt aatattccca
3421 gtaacgtatt tttgtgttca agcgtggcac ctccataacg gatttcccca cgttcttacc
3481 agatccttga acttcatctg tttcaacatt cattgttatt aaatagactt agggctatcc
3541 ctccacgttt gttagacgct tcatcggtac tgtgccctta ctttcacaga aacaaagcaa
3601 tttcgttatt acttatatgc acctatctag aagtagtcat ttatttgata gttttctgct
3661 tacatcgttc caatgttttt agctatcatg tataacctta ggtatctact atgagcctgt
3721 gaaattatag cttcttcgtt agctatcccg attttcatat catggaacct tactcgtcgg
3781 tatttcacca tacgtccgta tgttcctatc tttcgacaag atcagccttt tagacccgta
3841 cattcgcaga ttcgtcagtc attttcagac attcttacca tagttatctt tttcagccgc
3901 ccgccctata gtcagcctta tgctcaaaca cttaggcaga tttcagacga cttcacctag
3961 cttcatacca aaaccaatct acttcattga ttagagcatg taggagtatt agcggagatg
4021 tttcagctca acttgaaggc tttcattcca tctctcaaaa cattagttgt cactattgat
4081 aatctcaata ggcttccatt tcacatctat tcgatgcttg taggaaaacg attcgcac
cc
4141 atataagcca cgtattcatc aaagattaaa aagtttggtg gaagtccaag ataggcataa
4201 ttttctcctg gtttgtagtt gggcatttct ttcattgcct tactacgagc tatcatgcgt
4261 tcataaaaat cttccacaca agcagaaatt tcttcctttt gagaatatac gtgaggcatc
4321 accgtaccta aatcagctaa atctgcattt ttgggatcga ggataaacaa ttctgcatct
4381 gacttcaata gtgcttcgat aatggtgagg aggaaatagg tcttcccccc acctgtgcca
4441 ccagcgatta acatatgggg taaagaatca taagcccata cttgattttt catcaatcgc
4501 agagtcccat tttctgccac tacttcgtct atcccaattc gattggcgat catatcatac

[top]


[ORF sequence]

 

MEVPRLNTKIRYWEYYDLQETFDQLFTQSKNGKKFYQLYELIISENNILLAYRTIKAN

KGSSTPGTDSFTIDNYKEMNQAEFIHLILSQLENYKPKSIKRVMIPKPNGEKRPLGIP

CMIDRIIQQMFKQVLEPICEAKFYEHSYGFRPLRSAKHALGRIMYLINISKMHYAVDI

DIKGFFDNVNHRLLIKQLWNIGICDKRVLAILSKSLKSPIQGEGISSKGTIQGGIISP

LLSNVVLNDLDHWVSKQWHTFETKYPYTKGYNKFRALRDTNLKQGYIVRYADDFKIMT

NDYPSALKWFHAVKLYLKDRLKLDISNEKSKIVNLRKRKSEFLGFTICVKQKGKNGFV

IRIFLTKRKIKLKKRLNKELKIFKRALQHKMLSYLMLXMNYVLSFTTYNRLSNCSTRC

YPNKASPTYKKFYSTTMKTYKVAGVYLYPLCDVSTKTIYGFNTEDTPFTPFGRQRLEH

KSLQSLVYQELRKLMESKILNRMVEYLDNRISRYSMKSGKCEITKQFLPAKAVHCHHY

LPKSLGGDDKFDNLRIIHKDIHLLIHTTNKMIIDHYVNELKLLPEQIAKINLYRKMCN

LQNIQ

[top]


[Secondary structure]

                                       

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |