[Back to introns by organism] [Back to home page]
Information of Th.e.I1-1 intron (Format of information for each intron)
[Intron and flanking sequence]
Note: Detailed information for introns in Thermosynechococcus elongatus please see the original paper Nakamura et al 2002
Note:
Multiple Insertions
Th.e.I1-1 BA000039
(27344-30566)
Th.e.I1-2 BA000039 (1148139-1151363)
Th.e.I1-3 BA000039 (1196954-1200176)
Th.e.I1-4 BA000039 (293406-296628)
Sequence
from Genbank entry. Intron is identified in red.
The ORF
is identified in blue with start and stop codons
underlined.
Th.e.I2 is in green.
5' end
gtgcgac gcgaaagcta
27361 gccagatgat tgtcccacta gcccaacaag ctagaacggg accggttgtt cccccaaccg
27421 tagcctaggg aggcatgcgc gactggtaac ggtcaggtat gaagccctcc cgacaacgga
27481 gcccgaaccg aaaggttgaa gccgaatccg tgaggaggaa gcaactttgc cagcgtcagg
27541 cgatagggag ctaggcttga gggtatggtg aacgcaagtg aagtgacgcc agaagcctcg
27601 ttactataaa caggccaaag acgccgatag gcctgagcca aaatggcaaa cggactggtt
27661 tcactgctgt catctcagtg gacggggaca atagttccgt cggggtaaag tcaccaccta
27721 acccctcgcg tcatctggtt ggaacgcggt aagcccgtat cttcgccttg aacattcaag
27781 gcaggcaaac cgtaaggaat gctgatgagg gtgcgggtat gggatgcagg agaaagcgaa
27841 tgctggtctg taacggaccg gataggggtt gaggaaacaa tccaacatca ccccgcccga
27901 aagggagcag acttcctgct ggtctccctt tgcgagataa cctgtagaac ctcttgaatg
27961 gagacaaggc agtgcgacgc gaaagctagc cagatgattg tcccactagc ccaacaagct
28021 agaacgggac cggttgttcc cccaaccgta gcctaaggag gcatgcgtga ctggtaacgg
28081 tcaggtatga agccctcccg acaatgaagc ccgaaccgga aggttggagc cgaatccgtg
28141 aggaggaagc aacttcacca gcgtcaggtg atagggagct aggcttgagg gtatggtgaa
28201 cgcaagtgaa gtgacgctag aagcctcgtt actccaagca ggccaaagat gctgataggc
28261 ctgagccaaa atggcaaagc ggattggatt cactgcttgc ttttcagtga acggggacag
28321 caactccgcc ggggtatagt caccacctaa cccctcgtgt catctggttg gaacgcggta
28381 agcccgtatc ttcgccttga acattcaagg caggcaaacc gtaaggaatg ctgatggggg
28441 tgcgggtaga ggaggtggga aaaagcgaat gctagcctgt aatgggctag atagggattg
28501 agaatgctgg caggacattc aacatcatcc cactcgaaag agggcagact tcccgccggt
28561 ccccccttac gagaaagtct atagaacctt cctaatggtg acaatgcaaa tggcggtgat
28621 cttgagtcac tggtgcggtc accaacctgc tcaaaaggta gagaggctgt agatgagtat
28681 cgtaaaggtt gcatgcacaa ccggttcctc ttaaatgagg ggtcctgggg ggctcgagcc
28741 ggatgcgggg aaacttgcac gtccggttcc tagggggcta gggggcagcg atgcccccct
28801 gctacccgac aatgacggtg gaccaaacca ctggtgcagt caccaaccaa acggaaataa
28861 gctggcacag cataaactgg gccaaagcca accgtgaggt aaagaggctg caagtgcgta
28921 tcgcaaaggc tgtgaaggaa ggacgctggg gcaaagtgaa agctttgcaa tggctcctga
28981 cccactcgtt ctacggcaaa gcccttgccg tgaaacgggt aactgacaac tcaggcagta
29041 aaacacctgg tgtggatggg ataacctggt ccacacaaga gcagaaaacc caagccataa
29101 agtccctcag gagaagaggc tataagcccc aacccctgag gcgggtatat atcccgaaag
29161 caaacggcaa acagcgcccg ctaggaatcc cgacaatgaa ggacagggca atgcaggcac
29221 tatatgcact agccctagaa ccagtcgcgg aaaccacagc agaccggaac tcctatgggt
29281 tccgccgagg gcgatgtacg gcagatgcgg caggacaatg cttccttgct ctggcaagag
29341 ccaagtcggc tgaacacgtc cttgacgctg acatatccgg atgctttgat aacatcagcc
29401 atgagtggct actagccaac actccactgg acaaagggat cttacggaaa tggcttaaat
29461 ctgggttcgt ctggaaacag caactcttcc ccacccatgc tgggacacct cagggagggg
29521 taatctcccc agttcttgcc aatataaccc tagatgggat ggaagaactg ttggccaaac
29581 acctcagagg tcaaaaagtc aacctcatcc gatatgctga cgattttgtc gtgacgggaa
29641 aagatgagga aaccctggag aaagccagaa acctaatcca ggagttccta aaagaacggg
29701 gcttgaccct gtcccccgag aagacaaaaa tcgtccatat tgaggaaggc ttcgactttc
29761 tcggatggaa cattcgcaag tacaacgggg ttcttctcat caaacccgcg aagaagaacg
29821 tgaaagcgtt cctcaagaaa atccgagaca ctctaaggga acttaggaca gcaacccagg
29881 aaatcgtgat agacacactc aacccaatca ttagaggttg ggccaactat cacaaaggac
29941 aagtctctaa ggaaaccttc aaccgagtgg acttcgccac ctggcacaaa ttgtggcgat
30001 gggcaaggcg ccggcaccca aacaaacctg cccaatgggt gaaggacaaa tacttcatca
30061 aaaacggaag cagagactgg gtgttcggta tggtgatgaa agacaagaac ggggaactga
30121 ggaccaaacg cctaatcaaa acctctgaca cccgaatcca acgccacgtc aaaatcaagg
30181 cagacgccaa tccgtttctc ccagagtggg cagaatactt tgagaaacgc aagaaactca
30241 aaaaagcccc tgctcaatat cggcgcatcc gccgagaact atggaagaaa cagggtggta
30301 tctgtccagt atgcgggggt gaaattgagc aagacatgct cactgacatc caccacatat
30361 tgcccaaaca caagggtggt tctgacgacc tggataatct tgtcttaatc cacgccaact
30421 gccacaaaca ggtgcacagc cgagatggtc agcacagccg gtccctcttg aaagaggggc
30481 tttgagaggc ctgagccgga tgctgggaaa ctagcacgtc cggttcttag ggggctaggg
30541 ggcagtaatg ccccccgcta cccgac
3' end
[Intron and flanking sequence]
26821 tcgtaattaa ggtggcgagg tctaagtcat caagacgaat ggccttaagg gcagcatcgg
26881 tttggcctaa aacaatggct gcatcccccg gacttaagtg ttgcaggctg gtgtagagca
26941 ggcccagatc aatcgtttct tggcagcctt gggttgaaat gcggtaggtg atgtcgtaga
27001 gccaatctag ggttaccaag tcatggttga gcatccagag ggcaacatta aaggcggttt
27061 ggggaaaggc agcttgggca agacttttga cgatgacttg ggcgcgatcg cgctggagca
27121 aggagatcaa ggccgcctgt ttttctgcct tggttgtgag ttcatgatag ccaaggagat
27181 cttggaagag tttttgatcg ctgggcagta atcgaattgc cgctggagga gtcataggct
27241 tagtgccgca gattgatgtg actcatttcc cctcacccga cgatgaaggc gatgaaggct
27301 agacaaggca acgctagctc ttaattccac tatagagggt ggcgtgcgac gcgaaagcta
27361 gccagatgat tgtcccacta gcccaacaag ctagaacggg accggttgtt cccccaaccg
27421 tagcctaggg aggcatgcgc gactggtaac ggtcaggtat gaagccctcc cgacaacgga
27481 gcccgaaccg aaaggttgaa gccgaatccg tgaggaggaa gcaactttgc cagcgtcagg
27541 cgatagggag ctaggcttga gggtatggtg aacgcaagtg aagtgacgcc agaagcctcg
27601 ttactataaa caggccaaag acgccgatag gcctgagcca aaatggcaaa cggactggtt
27661 tcactgctgt catctcagtg gacggggaca atagttccgt cggggtaaag tcaccaccta
27721 acccctcgcg tcatctggtt ggaacgcggt aagcccgtat cttcgccttg aacattcaag
27781 gcaggcaaac cgtaaggaat gctgatgagg gtgcgggtat gggatgcagg agaaagcgaa
27841 tgctggtctg taacggaccg gataggggtt gaggaaacaa tccaacatca ccccgcccga
27901 aagggagcag acttcctgct ggtctccctt tgcgagataa cctgtagaac ctcttgaatg
27961 gagacaaggc agtgcgacgc gaaagctagc cagatgattg tcccactagc ccaacaagct
28021 agaacgggac cggttgttcc cccaaccgta gcctaaggag gcatgcgtga ctggtaacgg
28081 tcaggtatga agccctcccg acaatgaagc ccgaaccgga aggttggagc cgaatccgtg
28141 aggaggaagc aacttcacca gcgtcaggtg atagggagct aggcttgagg gtatggtgaa
28201 cgcaagtgaa gtgacgctag aagcctcgtt actccaagca ggccaaagat gctgataggc
28261 ctgagccaaa atggcaaagc ggattggatt cactgcttgc ttttcagtga acggggacag
28321 caactccgcc ggggtatagt caccacctaa cccctcgtgt catctggttg gaacgcggta
28381 agcccgtatc ttcgccttga acattcaagg caggcaaacc gtaaggaatg ctgatggggg
28441 tgcgggtaga ggaggtggga aaaagcgaat gctagcctgt aatgggctag atagggattg
28501 agaatgctgg caggacattc aacatcatcc cactcgaaag agggcagact tcccgccggt
28561 ccccccttac gagaaagtct atagaacctt cctaatggtg acaatgcaaa tggcggtgat
28621 cttgagtcac tggtgcggtc accaacctgc tcaaaaggta gagaggctgt agatgagtat
28681 cgtaaaggtt gcatgcacaa ccggttcctc ttaaatgagg ggtcctgggg ggctcgagcc
28741 ggatgcgggg aaacttgcac gtccggttcc tagggggcta gggggcagcg atgcccccct
28801 gctacccgac aatgacggtg gaccaaacca ctggtgcagt caccaaccaa acggaaataa
28861 gctggcacag cataaactgg gccaaagcca accgtgaggt aaagaggctg caagtgcgta
28921 tcgcaaaggc tgtgaaggaa ggacgctggg gcaaagtgaa agctttgcaa tggctcctga
28981 cccactcgtt ctacggcaaa gcccttgccg tgaaacgggt aactgacaac tcaggcagta
29041 aaacacctgg tgtggatggg ataacctggt ccacacaaga gcagaaaacc caagccataa
29101 agtccctcag gagaagaggc tataagcccc aacccctgag gcgggtatat atcccgaaag
29161 caaacggcaa acagcgcccg ctaggaatcc cgacaatgaa ggacagggca atgcaggcac
29221 tatatgcact agccctagaa ccagtcgcgg aaaccacagc agaccggaac tcctatgggt
29281 tccgccgagg gcgatgtacg gcagatgcgg caggacaatg cttccttgct ctggcaagag
29341 ccaagtcggc tgaacacgtc cttgacgctg acatatccgg atgctttgat aacatcagcc
29401 atgagtggct actagccaac actccactgg acaaagggat cttacggaaa tggcttaaat
29461 ctgggttcgt ctggaaacag caactcttcc ccacccatgc tgggacacct cagggagggg
29521 taatctcccc agttcttgcc aatataaccc tagatgggat ggaagaactg ttggccaaac
29581 acctcagagg tcaaaaagtc aacctcatcc gatatgctga cgattttgtc gtgacgggaa
29641 aagatgagga aaccctggag aaagccagaa acctaatcca ggagttccta aaagaacggg
29701 gcttgaccct gtcccccgag aagacaaaaa tcgtccatat tgaggaaggc ttcgactttc
29761 tcggatggaa cattcgcaag tacaacgggg ttcttctcat caaacccgcg aagaagaacg
29821 tgaaagcgtt cctcaagaaa atccgagaca ctctaaggga acttaggaca gcaacccagg
29881 aaatcgtgat agacacactc aacccaatca ttagaggttg ggccaactat cacaaaggac
29941 aagtctctaa ggaaaccttc aaccgagtgg acttcgccac ctggcacaaa ttgtggcgat
30001 gggcaaggcg ccggcaccca aacaaacctg cccaatgggt gaaggacaaa tacttcatca
30061 aaaacggaag cagagactgg gtgttcggta tggtgatgaa agacaagaac ggggaactga
30121 ggaccaaacg cctaatcaaa acctctgaca cccgaatcca acgccacgtc aaaatcaagg
30181 cagacgccaa tccgtttctc ccagagtggg cagaatactt tgagaaacgc aagaaactca
30241 aaaaagcccc tgctcaatat cggcgcatcc gccgagaact atggaagaaa cagggtggta
30301 tctgtccagt atgcgggggt gaaattgagc aagacatgct cactgacatc caccacatat
30361 tgcccaaaca caagggtggt tctgacgacc tggataatct tgtcttaatc cacgccaact
30421 gccacaaaca ggtgcacagc cgagatggtc agcacagccg gtccctcttg aaagaggggc
30481 tttgagaggc ctgagccgga tgctgggaaa ctagcacgtc cggttcttag ggggctaggg
30541 ggcagtaatg ccccccgcta cccgactaga gtatcgaggc tattccccca ccttagcttc
30601 taatttgcgc actcgcttga ggagatctgg caattgcttc actgcggctg agaccttgag
30661 gtagagggtt tgatccatcg ccgggatccc ccccatgcgg ctatctgggg gtaccgagct
30721 actaatgccg gatttggcgg agacaacagt gcgatcgcca atggtcaaat gacccgctgc
30781 ccccacctgt ccagccaaga ccacgtgatt accaatgtgg gttgaacctg ccaaccccac
30841 ttgggcacag agaatggcgt tttcaccaat ggtgcagttg tgggccacca tcgttaagtt
30901 gtcaattttt gtgccattgg ccacagtggt ctctccaagg gtggcgcgat caatcgttgt
30961 gccagcacca atttctacat cgttgccaat gaccacggtg cccacttggg gaattttgta
31021 gtgacgacca tcgggtaggg ggacatagcc aaagccatca ctccctaaaa ccacgctgtt
31081 ttgcaagatg acgcgatcgc ccaattgcac ccgctcccgc agggcacagt ggctgtagat
METRQ
Th.e.I2
MTVDQTTGAVTNQTEISWHSINWAKANREVKRLQVRIAKAVKEGRWGKVKALQWLLTH
SFYGKALAVKRVTDNSGSKTPGVDGITWSTQEQKTQAIKSLRRRGYKPQPLRRVYIPK
ANGKQRPLGIPTMKDRAMQALYALALEPVAETTADRNSYGFRRGRCTADAAGQCFLAL
ARAKSAEHVLDADISGCFDNISHEWLLANTPLDKGILRKWLKSGFVWKQQLFPTHAGT
PQGGVISPVLANITLDGMEELLAKHLRGQKVNLIRYADDFVVTGKDEETLEKARNLIQ
EFLKERGLTLSPEKTKIVHIEEGFDFLGWNIRKYNGVLLIKPAKKNVKAFLKKIRDTL
RELRTATQEIVIDTLNPIIRGWANYHKGQVSKETFNRVDFATWHKLWRWARRRHPNKP
AQWVKDKYFIKNGSRDWVFGMVMKDKNGELRTKRLIKTSDTRIQRHVKIKADANPFLP
EWAEYFEKRKKLKKAPAQYRRIRRELWKKQGGICPVCGGEIEQDMLTDIHHILPKHKG
GSDDLDNLVLIHANCHKQVHSRDGQHSRSLLKEGL

| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragments | Alignment of insertion sequence |
| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |