Information for W.e.I2-1 intron

Note: Redundant Copy
 AE017258 (60501-62416)

Note: Multiple insertions
W.e.I2-1    AE017196 (245737-243822)
W.e.I2-2    AE017196 (488447-486532)
W.e.I2-3    AE017196 (582567-584482)
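
The coordinate pairs above can be sanity-checked with a few lines of code. This is a minimal sketch, assuming the ranges are inclusive and that a start greater than the end denotes a copy on the antisense strand (consistent with W.e.I2-1, which this page describes as antisense). The helper names `span_length` and `strand` are hypothetical.

```python
# Coordinates copied from the notes above: (name, accession, start, end).
# Assumption: a start greater than the end marks an antisense (minus-strand) copy.
COPIES = [
    ("W.e.I2-1", "AE017196", 245737, 243822),
    ("W.e.I2-2", "AE017196", 488447, 486532),
    ("W.e.I2-3", "AE017196", 582567, 584482),
    ("redundant copy", "AE017258", 60501, 62416),
]


def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range, in either orientation."""
    return abs(end - start) + 1


def strand(start: int, end: int) -> str:
    """Return '-' when the range is written high-to-low, '+' otherwise."""
    return "-" if start > end else "+"


for name, accession, start, end in COPIES:
    print(name, accession, strand(start, end), span_length(start, end))
```

Under those assumptions all four ranges span 1916 bp, the same length as the intron sequence listed on this page.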

[Intron sequence]

Sequence from GenBank entry (intron is on the antisense strand). The boundaries of the intron are marked in red and the ORF is marked in blue, with start and stop codons underlined.

Intron on antisense strand

3' end

   1 gtctggtaga gactcatgac cttgaaacca gcgatttggt ctcataacca tagtctcccc
  61 aaaagaaccg tgcttgcgga tttcccgcac acggctccac aatacagcgt tcacactagt
 121 gctcctgtgt ataaagaaac atataccttt ggttttggta atggatttac ttttaaatag
 181 tgtataaatt gttcccaatc catacttttc ttttgactac gtcggtttat ccacttaaat
 241 gctaactttg ttaccggtcg atagaattga atcaaacacc gataatttcc gctaactccg
 301 aagtagctat agtgtcctgt tagtttggct ttaagtttct gccaccaatc tttgagacag
 361 atacgacttc gtaccatctt caaccattct ttgatttctt taatctttct ggctaggctt
 421 atttttgaag ttttctgctt catcataagt ttaccattac gactttttcc acaataatgt
 481 gtaaatccta gaaagttgaa gctagccgtc ctacgtttct ctctttctgc ttgataccat
 541 tctttcttac caaactttac tatttttgtt ttattttcag ctatttccaa cccaaattta
 601 cttagtcttt gtttcagtaa ttctagaaat tcttttgcgt cttcttccct ctcgcagcca
 661 actacaaaat cgtcgcaaaa ccttattagc tgtaaatatc ctctggcttt tggcttaaat
 721 ttcttttcaa accataagtc cagcacatag tgtaagtata tattagctaa gacagggctt
 781 actataccac cttgtggtgt gccttgatcg gttgctttat aacatccaac ttcgactatt
 841 cctgccttta gaaatcgttt tattaaccac aataaatttg ggtcagctat tcgttccctt
 901 agacaattca ttagccattt atgttgaaca ttatcaaaga acttcttgat atccacttct
 961 acaatatagt taattggttt gtgcataact gctttatcta gagcgtttat cgcctgatga
1021 caatttcttc ctggtcgaaa tccatacgag ctgtccataa agtttgcttc ataaatattt
1081 tctaatatct tctttagcat tacctgtacc aacttatctt ccgttgatgg tattccgaga
1141 ccgcgcttct ctttgctccc ggctttgggt atgtataccc ttttcactgg tagcggtgat
1201 attgttttct cttcatacta tccactagtg ttttaagttt ctcttctaga ttttctccat
1261 aagcttccac tgtcacacga tctataccac aagccttgtt gcgttttagt tccttataac
1321 actctgcaag attctcttca ttaatcaaat gaactaatga tgtaaatttt actcgcttat
1381 cttgcttagc ccttactgct atctggttta gttttccttg catactcctc taatctctgt
1441 tgtattggta tagcaatccc tataccagtc actatattgc taaccgcttc cccttgtatg
1501 tggctttccc acactccgag tactatcagt tagtccgact tcctttgcat cttcctccag
1561 tcctcgcctt tattgacttg ttccagagta cccttacagg aatacaaagg atctcccaag
1621 ttcacgtaag gtccatctca acatatgacg cggccttcga tcccgatgag gtgagtaata
1681 tcttgcctta gcgttattac ccatgttgcc ttccaccaaa tgtaaagtgt cagcccccat
1741 gacgcgtgaa ttacgggact caataccttc actttcgttg caccctatgt tgtccattcc
1801 attagcttta ctacgctcgt taccttacgc agcataatgg tttgttccag gttgccggct
1861 aagctttacc cgtgttggat tttcaccaac tgttccttac gcacttcttg gcgcac
5' end
 
Intron on sense strand
 
5' end
   1 gtgcgccaag aagtgcgtaa ggaacagttg gtgaaaatcc aacacgggta aagcttagcc
  61 ggcaacctgg aacaaaccat tatgctgcgt aaggtaacga gcgtagtaaa gctaatggaa
 121 tggacaacat agggtgcaac gaaagtgaag gtattgagtc ccgtaattca cgcgtcatgg
 181 gggctgacac tttacatttg gtggaaggca acatgggtaa taacgctaag gcaagatatt
 241 actcacctca tcgggatcga aggccgcgtc atatgttgag atggacctta cgtgaacttg
 301 ggagatcctt tgtattcctg taagggtact ctggaacaag tcaataaagg cgaggactgg
 361 aggaagatgc aaaggaagtc ggactaactg atagtactcg gagtgtggga aagccacata
 421 caaggggaag cggttagcaa tatagtgact ggtataggga ttgctatacc aatacaacag
 481 agattagagg agtatgcaag gaaaactaaa ccagatagca gtaagggcta agcaagataa
 541 gcgagtaaaa tttacatcat tagttcattt gattaatgaa gagaatcttg cagagtgtta
 601 taaggaacta aaacgcaaca aggcttgtgg tatagatcgt gtgacagtgg aagcttatgg
 661 agaaaatcta gaagagaaac ttaaaacact agtggatagt atgaagagaa aacaatatca
 721 ccgctaccag tgaaaagggt atacataccc aaagccggga gcaaagagaa gcgcggtctc
 781 ggaataccat caacggaaga taagttggta caggtaatgc taaagaagat attagaaaat
 841 atttatgaag caaactttat ggacagctcg tatggatttc gaccaggaag aaattgtcat
 901 caggcgataa acgctctaga taaagcagtt atgcacaaac caattaacta tattgtagaa
 961 gtggatatca agaagttctt tgataatgtt caacataaat ggctaatgaa ttgtctaagg
1021 gaacgaatag ctgacccaaa tttattgtgg ttaataaaac gatttctaaa ggcaggaata
1081 gtcgaagttg gatgttataa agcaaccgat caaggcacac cacaaggtgg tatagtaagc
1141 cctgtcttag ctaatatata cttacactat gtgctggact tatggtttga aaagaaattt
1201 aagccaaaag ccagaggata tttacagcta ataaggtttt gcgacgattt tgtagttggc
1261 tgcgagaggg aagaagacgc aaaagaattt ctagaattac tgaaacaaag actaagtaaa
1321 tttgggttgg aaatagctga aaataaaaca aaaatagtaa agtttggtaa gaaagaatgg
1381 tatcaagcag aaagagagaa acgtaggacg gctagcttca actttctagg atttacacat
1441 tattgtggaa aaagtcgtaa tggtaaactt atgatgaagc agaaaacttc aaaaataagc
1501 ctagccagaa agattaaaga aatcaaagaa tggttgaaga tggtacgaag tcgtatctgt
1561 ctcaaagatt ggtggcagaa acttaaagcc aaactaacag gacactatag ctacttcgga
1621 gttagcggaa attatcggtg tttgattcaa ttctatcgac cggtaacaaa gttagcattt
1681 aagtggataa accgacgtag tcaaaagaaa agtatggatt gggaacaatt tatacactat
1741 ttaaaagtaa atccattacc aaaaccaaag gtatatgttt ctttatacac aggagcacta
1801 gtgtgaacgc tgtattgtgg agccgtgtgc gggaaatccg caagcacggt tcttttgggg
1861 agactatggt tatgagacca aatcgctggt ttcaaggtca tgagtctcta ccagac
3' end
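
The two listings above are related by reverse complementation. As an illustrative sketch (not the database's own tooling), the check below takes the first 20 bases of the antisense listing and confirms that their reverse complement equals the last 20 bases of the sense listing. The function name `reverse_complement` is hypothetical.

```python
# Complement table for lowercase and uppercase DNA letters.
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# First 20 bases of the "Intron on antisense strand" listing (labelled 3' end).
antisense_start = "gtctggtagagactcatgac"
# Last 20 bases of the "Intron on sense strand" listing (labelled 3' end).
sense_end = "gtcatgagtctctaccagac"

print(reverse_complement(antisense_start) == sense_end)
```

The same function applied to the full antisense listing, with position numbers and spaces removed, should reproduce the full sense listing.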

[Intron and flanking sequences]

 

   1 tttaaagcta aaatacccaa cacttgtgat gataatgtga tccagcaacc taatccctat
  61 agtctgacaa gctttagcca agtctttagt tacagcttca tcctcatcag atggctttaa
 121 ccttccttct gggtggttat gcgatattat tattgacgtc gcttttctta ttaatgcttt
 181 tcttgttact tctcttatgt aaaaaggtac cttatccact gtaccagtaa acacttcttc
 241 cccctttaaa cggcactttt gatcaaaata tattattttc atcttttcct cctctgaata
 301 gtctggtaga gactcatgac cttgaaacca gcgatttggt ctcataacca tagtctcccc
 361 aaaagaaccg tgcttgcgga tttcccgcac acggctccac aatacagcgt tcacactagt
 421 gctcctgtgt ataaagaaac atataccttt ggttttggta atggatttac ttttaaatag
 481 tgtataaatt gttcccaatc catacttttc ttttgactac gtcggtttat ccacttaaat
 541 gctaactttg ttaccggtcg atagaattga atcaaacacc gataatttcc gctaactccg
 601 aagtagctat agtgtcctgt tagtttggct ttaagtttct gccaccaatc tttgagacag
 661 atacgacttc gtaccatctt caaccattct ttgatttctt taatctttct ggctaggctt
 721 atttttgaag ttttctgctt catcataagt ttaccattac gactttttcc acaataatgt
 781 gtaaatccta gaaagttgaa gctagccgtc ctacgtttct ctctttctgc ttgataccat
 841 tctttcttac caaactttac tatttttgtt ttattttcag ctatttccaa cccaaattta
 901 cttagtcttt gtttcagtaa ttctagaaat tcttttgcgt cttcttccct ctcgcagcca
 961 actacaaaat cgtcgcaaaa ccttattagc tgtaaatatc ctctggcttt tggcttaaat
1021 ttcttttcaa accataagtc cagcacatag tgtaagtata tattagctaa gacagggctt
1081 actataccac cttgtggtgt gccttgatcg gttgctttat aacatccaac ttcgactatt
1141 cctgccttta gaaatcgttt tattaaccac aataaatttg ggtcagctat tcgttccctt
1201 agacaattca ttagccattt atgttgaaca ttatcaaaga acttcttgat atccacttct
1261 acaatatagt taattggttt gtgcataact gctttatcta gagcgtttat cgcctgatga
1321 caatttcttc ctggtcgaaa tccatacgag ctgtccataa agtttgcttc ataaatattt
1381 tctaatatct tctttagcat tacctgtacc aacttatctt ccgttgatgg tattccgaga
1441 ccgcgcttct ctttgctccc ggctttgggt atgtataccc ttttcactgg tagcggtgat
1501 attgttttct cttcatacta tccactagtg ttttaagttt ctcttctaga ttttctccat
1561 aagcttccac tgtcacacga tctataccac aagccttgtt gcgttttagt tccttataac
1621 actctgcaag attctcttca ttaatcaaat gaactaatga tgtaaatttt actcgcttat
1681 cttgcttagc ccttactgct atctggttta gttttccttg catactcctc taatctctgt
1741 tgtattggta tagcaatccc tataccagtc actatattgc taaccgcttc cccttgtatg
1801 tggctttccc acactccgag tactatcagt tagtccgact tcctttgcat cttcctccag
1861 tcctcgcctt tattgacttg ttccagagta cccttacagg aatacaaagg atctcccaag
1921 ttcacgtaag gtccatctca acatatgacg cggccttcga tcccgatgag gtgagtaata
1981 tcttgcctta gcgttattac ccatgttgcc ttccaccaaa tgtaaagtgt cagcccccat
2041 gacgcgtgaa ttacgggact caataccttc actttcgttg caccctatgt tgtccattcc
2101 attagcttta ctacgctcgt taccttacgc agcataatgg tttgttccag gttgccggct
2161 aagctttacc cgtgttggat tttcaccaac tgttccttac gcacttcttg gcgcacacat
2221 tttacacccc ccttaaaatt aaaacaaaag gtggtaaaca agatgtcaaa cacagataac
2281 tcaactacaa gctggttttt gttgtagtta gatagttatt ttctcacttt ctttacttgc
2341 tttaattaat tcatcaaaat cagattcatg taatttgtcc tgtataactc tgtacttttc
2401 gtattctcct tcagccaggg cttttgcaac ttcagcacta actttaccgg catctttaag
2461 tacttcataa tcattgaata atagaaatgc atccaatttt tcagcccagt cttgca
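
To see where the intron sits inside this flanking listing, the numbered lines can be stripped to plain sequence and searched. The sketch below uses only a two-line excerpt copied from the listing above. The helper `clean` is hypothetical, and the excerpt offsets are assumptions tied to that excerpt.

```python
import re


def clean(block: str) -> str:
    """Strip position numbers and whitespace from GenBank-style sequence lines."""
    return re.sub(r"[^a-z]", "", block.lower())


# Two consecutive lines copied from the flanking listing (positions 241 and 301).
flank_excerpt = clean("""
  241 cccctttaaa cggcactttt gatcaaaata tattattttc atcttttcct cctctgaata
  301 gtctggtaga gactcatgac cttgaaacca gcgatttggt ctcataacca tagtctcccc
""")

# First 20 bases of the antisense intron listing.
intron_start = clean("    1 gtctggtaga gactcatgac")

offset = flank_excerpt.find(intron_start)  # 0-based offset within the excerpt
position = 241 + offset                    # 1-based position in the full listing
print(position)
```

For this excerpt the intron begins at position 301 of the listing, which corresponds to 300 bases of flanking sequence ahead of it as displayed.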

[ORF sequence]

Frameshift in ORF (indicated by ///)

MQGKLNQIAVRAKQDKRVKFTSLVHLINEENLAECYKELKRNKACGIDRVTVEAYGENLEEKLKTL

VDSMKRKQYHRYQ*///VKRVYIPKAGSKEKRGLGIPSTEDKLVQVMLKKILENIYEANFMDSSYG

FRPGRNCHQAINALDKAVMHKPINYIVEVDIKKFFDNVQHKWLMNCLRERIADPNLLWLIKRFLKA

GIVEVGCYKATDQGTPQGGIVSPVLANIYLHYVLDLWFEKKFKPKARGYLQLIRFCDDFVVGCERE

EDAKEFLELLKQRLSKFGLEIAENKTKIVKFGKKEWYQAEREKRRTASFNFLGFTHYCGKSRNGKL

MMKQKTSKISLARKIKEIKEWLKMVRSRICLKDWWQKLKAKLTGHYSYFGVSGNYRCLIQFYRPVT

KLAFKWINRRSQKKSMDWEQFIHYLKVNPLPKPKVYVSLYTGALV
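
The ORF translation and its frameshift can be reproduced approximately from the sense-strand sequence. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies. The function `translate` and the two short excerpts are hypothetical illustrations, not the procedure used to build this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, frame: int = 0) -> str:
    """Translate DNA from a 0-based frame; unknown codons become 'X'."""
    dna = dna.lower()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(c, "X") for c in codons)


# Start of the ORF, copied from the sense-strand listing.
orf_start = "atgcaaggaaaactaaaccagatagca"
# Region around the frameshift, copied from the sense-strand listing.
shift_region = "caccgctaccagtgaaaagggtatacataccc"

print(translate(orf_start))        # first residues of the ORF
print(translate(shift_region, 0))  # original frame, which reaches a stop codon
print(translate(shift_region, 2))  # frame offset by two bases
```

In the original frame the excerpt translates to a run beginning HRYQ*, and in the frame offset by two bases it ends with VKRVYIP, matching the residues on either side of the /// marker above.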
