[Back to introns by organism]   [Back to home page]

Information of Th.e.I3 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

[Intron sequence]

 

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

  

Intron on antisense strand

             

3' end

                                                    gccgggta gcagggaggc
91381 attactgcca tccccgcccc ctaagaaccg gacttgtgag tttccccaca tccggctcag
91441 gcctctcaaa gcccctcttt caagaggaac cggctgtgct gaccgtctcg gctgtgcacc
91501 tgtttgtgac agttggcgtg gattaagaca agattatcca ggtcgtcaga accacccttg
91561 tgtttgggca atatgtggtg gatttcagtg agcatgtctt gctcaatttc acccccgcat
91621 actggacaga taccaccctg tttcttccat agttctcggc ggatgcgccg atattgagca
91681 ggggcttctt tgagtttctt gcgttcctca aagtattctg cccactctgg gagaaacgga
91741 ttggcgtctg ccttgatttt gacatgacgt tggattcgag tatctccggc tttaattagg
91801 tatctggacc gtaaccttcc ctctttgtcc tttttccatg tgccaaacac ccagtgccta
91861 ttcccgattt tgatgaagta tttgttcttt gtccatcggg ctggtttgtt tgggtgccga
91921 cgttttgccc atcgccataa tttatgccag atattgtcat ccgctctgtt gaagattctt
91981 ttggaaacct ggtttctgtg atagttagcc cagcctttga taattgggtt gagtgtgtct
92041 atcacagctt cctgggtggc tgttctgagt tccttcagtg cgtctcggat tttcttgtgg
92101 aacgccttga tgttcttctt cgcaggtttg atgagaagct tctcaccata tttgcgaatg
92161 ttccatccaa gaaagtcaaa tccttcttcg atatgaacga cctttgtctt ttcttctgat
92221 agggtaaggc ccctttcctt gaggaattct tggattacag ttgtaacctt ttccaaggtt
92281 tcctttgatt caccagttac gacaaagtcg cctgcatatc gtatgaggtt gactttttgt
92341 tttctgaggt gtttcttcag caattcttcc atcccatcta gggtcatatt ggccagcatt
92401 ggggagatta cccctccctg aggtgtccca gcatgggttg ggaagagttg ctgtttccag
92461 acgaacccag atttaagcca cttccgcaga acctctttgt ccagcgggat gttgtctagt
92521 agccattcgt ggctaatgtt gtcaaagcat ccggtgatgt cagcatcaag gatatatttt
92581 gcacagtcag atcggcctag cacagtgaag cactgcccgg cagcatctgc cgtgcaccgt
92641 ccttgacgga acccgtatga gttccgatcc gctgtggttt ccgcgacagg ttctagagct
92701 agggcatata atgcctgcat tgccctgtcc ttcgttgtcg ggattcctag cgggcgctgc
92761 ttgccgcttg ctttcgggat gtatacccgc ctcaggggtt ggggtttgta gcctcttctc
92821 ctgagggact ttatggcttg ggctttctgc tcttgtgtgg accaggttat cccgtccaca
92881 ccaggtgttt tgctgcccga gttgtcagtt acccgtttca cggcgagggc tttgccgtag
92941 aacgagtggg tcaggagcca ttgcaaagct ttcactttgc cccagcgtcc ttccttcaca
93001 gcctttgcga tacgcacttg cagcctcttt acctcacggt tggctttggc ccagtctatg
93061 ctgtgccagc ttgtttccgt ttggttggtg accgcaccag tggtttgttc cactgccatt
93121 tgccttgtct ccat
tcaaga ggttctacag gctatctcgc aaagagagac cagcaggaag
93181 tctgctccct ttcgggcggg gtgatgttgg attgtctcct caacccctat ccggtccatt
93241 acagaccggc attcgctttc tcctgcatcc catacccgca cccccatcag cattccttac
93301 ggtttgcctg ccttgagtgt tcaaggcgag gatacgggct taccgcgttc caaccagatg
93361 acacgagggg ttaggtggtg actgtgcccc gatggattga cgtccccgta caccgacgag
93421 cgggcgatgt agcctatcca tttgccgttt tggctcaggc ctatcagtgt ctttggcctg
93481 ttggtctgtg acgaggcttc tgacatcact tcgcatgtgc tcaccatacc ctcaagccta
93541 gttccctatc acctgacact ggtgaagttg cttcctcctc acggattcag cttcaacctt
93601 gcggttcggg ctacattgtc gggagggctt cacacctgac cgttaccagt cacgcatgcc
93661 tccctaggct acggttgggg gaacaaccgg tcccgttcta gcttgttggg ctagtgggac
93721 aatcatctgg ctagctttcg cgtcgcaa

5' end  

 

Intron on sense strand

 

5' end 

   1 ttgcgacgcg aaagctagcc agatgattgt cccactagcc caacaagcta gaacgggacc 

  61 ggttgttccc ccaaccgtag cctagggagg catgcgtgac tggtaacggt caggtgtgaa 

 121 gccctcccga caatgtagcc cgaaccgcaa ggttgaagct gaatccgtga ggaggaagca 

 181 acttcaccag tgtcaggtga tagggaacta ggcttgaggg tatggtgagc acatgcgaag 

 241 tgatgtcaga agcctcgtca cagaccaaca ggccaaagac actgataggc ctgagccaaa 

 301 acggcaaatg gataggctac atcgcccgct cgtcggtgta cggggacgtc aatccatcgg 

 361 ggcacagtca ccacctaacc cctcgtgtca tctggttgga acgcggtaag cccgtatcct 

 421 cgccttgaac actcaaggca ggcaaaccgt aaggaatgct gatgggggtg cgggtatggg 

 481 atgcaggaga aagcgaatgc cggtctgtaa tggaccggat aggggttgag gagacaatcc 

 541 aacatcaccc cgcccgaaag ggagcagact tcctgctggt ctctctttgc gagatagcct 

 601 gtagaacctc ttgaatggag acaaggcaaa tggcagtgga acaaaccact ggtgcggtca 

 661 ccaaccaaac ggaaacaagc tggcacagca tagactgggc caaagccaac cgtgaggtaa 

 721 agaggctgca agtgcgtatc gcaaaggctg tgaaggaagg acgctggggc aaagtgaaag 

 781 ctttgcaatg gctcctgacc cactcgttct acggcaaagc cctcgccgtg aaacgggtaa 

 841 ctgacaactc gggcagcaaa acacctggtg tggacgggat aacctggtcc acacaagagc 

 901 agaaagccca agccataaag tccctcagga gaagaggcta caaaccccaa cccctgaggc 

 961 gggtatacat cccgaaagca agcggcaagc agcgcccgct aggaatcccg acaacgaagg 

1021 acagggcaat gcaggcatta tatgccctag ctctagaacc tgtcgcggaa accacagcgg 

1081 atcggaactc atacgggttc cgtcaaggac ggtgcacggc agatgctgcc gggcagtgct 

1141 tcactgtgct aggccgatct gactgtgcaa aatatatcct tgatgctgac atcaccggat 

1201 gctttgacaa cattagccac gaatggctac tagacaacat cccgctggac aaagaggttc 

1261 tgcggaagtg gcttaaatct gggttcgtct ggaaacagca actcttccca acccatgctg 

1321 ggacacctca gggaggggta atctccccaa tgctggccaa tatgacccta gatgggatgg 

1381 aagaattgct gaagaaacac ctcagaaaac aaaaagtcaa cctcatacga tatgcaggcg 

1441 actttgtcgt aactggtgaa tcaaaggaaa ccttggaaaa ggttacaact gtaatccaag 

1501 aattcctcaa ggaaaggggc cttaccctat cagaagaaaa gacaaaggtc gttcatatcg 

1561 aagaaggatt tgactttctt ggatggaaca ttcgcaaata tggtgagaag cttctcatca 

1621 aacctgcgaa gaagaacatc aaggcgttcc acaagaaaat ccgagacgca ctgaaggaac 

1681 tcagaacagc cacccaggaa gctgtgatag acacactcaa cccaattatc aaaggctggg 

1741 ctaactatca cagaaaccag gtttccaaaa gaatcttcaa cagagcggat gacaatatct 

1801 ggcataaatt atggcgatgg gcaaaacgtc ggcacccaaa caaaccagcc cgatggacaa 

1861 agaacaaata cttcatcaaa atcgggaata ggcactgggt gtttggcaca tggaaaaagg 

1921 acaaagaggg aaggttacgg tccagatacc taattaaagc cggagatact cgaatccaac 

1981 gtcatgtcaa aatcaaggca gacgccaatc cgtttctccc agagtgggca gaatactttg 

2041 aggaacgcaa gaaactcaaa gaagcccctg ctcaatatcg gcgcatccgc cgagaactat 

2101 ggaagaaaca gggtggtatc tgtccagtat gcgggggtga aattgagcaa gacatgctca 

2161 ctgaaatcca ccacatattg cccaaacaca agggtggttc tgacgacctg gataatcttg 

2221 tcttaatcca cgccaactgt cacaaacagg tgcacagccg agacggtcag cacagccggt 

2281 tcctcttgaa agaggggctt tgagaggcct gagccggatg tggggaaact cacaagtccg 

2341 gttcttaggg ggcggggatg gcagtaatgc ctccctgcta cccggc

3' end  

[top]


[Intron and flanking sequence]

 

90901 aaggtaatct cttgagccgt acccgcaaag ggatgatccg gggttagaaa cagaagacag
90961 tgacactccc gccgttcgcg catgggtaca caggggcagt tccagtaggc ggctgccacc
91021 tctgcttctt tgtcctcata atggcggcag ggacacaatg gggagccgta gtcgtcctta
91081 tgtttggcta atccttcaag aacaactgcc gttgtcccta aatctgagca aaagtaggtt
91141 cctgtgcgct tggcataggt ttccgcaaac ttgcgcatgg cttcaaggtt tttgtcggag
91201 gcctgttggg gtttgtagct gctactcatg gcaaaatttg cgacacgaaa cttctttcag
91261 tgtagcggaa ggcgatcgct caattggcct cccgtggctt aaaagatcca gggtgctgga
91321 gtcgaaccag cctatggcga attatgagtt cgctgcctca tcgccgggta gcagggaggc
91381 attactgcca tccccgcccc ctaagaaccg gacttgtgag tttccccaca tccggctcag
91441 gcctctcaaa gcccctcttt caagaggaac cggctgtgct gaccgtctcg gctgtgcacc
91501 tgtttgtgac agttggcgtg gattaagaca agattatcca ggtcgtcaga accacccttg
91561 tgtttgggca atatgtggtg gatttcagtg agcatgtctt gctcaatttc acccccgcat
91621 actggacaga taccaccctg tttcttccat agttctcggc ggatgcgccg atattgagca
91681 ggggcttctt tgagtttctt gcgttcctca aagtattctg cccactctgg gagaaacgga
91741 ttggcgtctg ccttgatttt gacatgacgt tggattcgag tatctccggc tttaattagg
91801 tatctggacc gtaaccttcc ctctttgtcc tttttccatg tgccaaacac ccagtgccta
91861 ttcccgattt tgatgaagta tttgttcttt gtccatcggg ctggtttgtt tgggtgccga
91921 cgttttgccc atcgccataa tttatgccag atattgtcat ccgctctgtt gaagattctt
91981 ttggaaacct ggtttctgtg atagttagcc cagcctttga taattgggtt gagtgtgtct
92041 atcacagctt cctgggtggc tgttctgagt tccttcagtg cgtctcggat tttcttgtgg
92101 aacgccttga tgttcttctt cgcaggtttg atgagaagct tctcaccata tttgcgaatg
92161 ttccatccaa gaaagtcaaa tccttcttcg atatgaacga cctttgtctt ttcttctgat
92221 agggtaaggc ccctttcctt gaggaattct tggattacag ttgtaacctt ttccaaggtt
92281 tcctttgatt caccagttac gacaaagtcg cctgcatatc gtatgaggtt gactttttgt
92341 tttctgaggt gtttcttcag caattcttcc atcccatcta gggtcatatt ggccagcatt
92401 ggggagatta cccctccctg aggtgtccca gcatgggttg ggaagagttg ctgtttccag
92461 acgaacccag atttaagcca cttccgcaga acctctttgt ccagcgggat gttgtctagt
92521 agccattcgt ggctaatgtt gtcaaagcat ccggtgatgt cagcatcaag gatatatttt
92581 gcacagtcag atcggcctag cacagtgaag cactgcccgg cagcatctgc cgtgcaccgt
92641 ccttgacgga acccgtatga gttccgatcc gctgtggttt ccgcgacagg ttctagagct
92701 agggcatata atgcctgcat tgccctgtcc ttcgttgtcg ggattcctag cgggcgctgc
92761 ttgccgcttg ctttcgggat gtatacccgc ctcaggggtt ggggtttgta gcctcttctc
92821 ctgagggact ttatggcttg ggctttctgc tcttgtgtgg accaggttat cccgtccaca
92881 ccaggtgttt tgctgcccga gttgtcagtt acccgtttca cggcgagggc tttgccgtag
92941 aacgagtggg tcaggagcca ttgcaaagct ttcactttgc cccagcgtcc ttccttcaca
93001 gcctttgcga tacgcacttg cagcctcttt acctcacggt tggctttggc ccagtctatg
93061 ctgtgccagc ttgtttccgt ttggttggtg accgcaccag tggtttgttc cactgccatt
93121 tgccttgtct ccattcaaga ggttctacag gctatctcgc aaagagagac cagcaggaag
93181 tctgctccct ttcgggcggg gtgatgttgg attgtctcct caacccctat ccggtccatt
93241 acagaccggc attcgctttc tcctgcatcc catacccgca cccccatcag cattccttac
93301 ggtttgcctg ccttgagtgt tcaaggcgag gatacgggct taccgcgttc caaccagatg
93361 acacgagggg ttaggtggtg actgtgcccc gatggattga cgtccccgta caccgacgag
93421 cgggcgatgt agcctatcca tttgccgttt tggctcaggc ctatcagtgt ctttggcctg
93481 ttggtctgtg acgaggcttc tgacatcact tcgcatgtgc tcaccatacc ctcaagccta
93541 gttccctatc acctgacact ggtgaagttg cttcctcctc acggattcag cttcaacctt
93601 gcggttcggg ctacattgtc gggagggctt cacacctgac cgttaccagt cacgcatgcc
93661 tccctaggct acggttgggg gaacaaccgg tcccgttcta gcttgttggg ctagtgggac
93721 aatcatctgg ctagctttcg cgtcgcaa
cg ctcggccaac cctgggaaaa tctctatcat
93781 ttccaatata gccccccttg gcgccttcta cccctaggtt gcctgactgg ccagtgaggg
93841 gggattgctt gcgagggttt ttgggaagat tgggaatgag gccaatttac ttagatggac
93901 tagccactac accggtggac ccccaagtgt tagcagcaat gttgccctat tttagcgatc
93961 gccctggtaa tcccagtaat cgtggccatg cctacggttg ggaggccgct gccgccattg
94021 atgttgcccg cgaaaccatt gccgcagcca tccatgccca tccagaggag ataattttta
94081 ccagtggtgc gaccgaggcc aacaatttgg ccatcaaggg ggtggcggaa gcctaccata
94141 gtcgcggtcg ccatatcatt accgtacaga cggaacacaa tgcggtgctt gccccctgcc
94201 gctatttgga gtccctgggc tttcgggtga cctatctggg ggtgaatgag aaagggttta
94261 ttgatctcgc tgagttagag caagccttta cgcccgagac cctcttggtg tctgtgatgg
94321 ctgccaacaa tgaaattggc gtgctgcaac ccttagctga gattggtcgc cgctgtcgcg

[top]


[ORF sequence]

 

METRQMAVEQTTGAVTNQTETSWHSIDWAKANREVKRLQVRIAKAVKEGRWGKVKALQ

WLLTHSFYGKALAVKRVTDNSGSKTPGVDGITWSTQEQKAQAIKSLRRRGYKPQPLRR

VYIPKASGKQRPLGIPTTKDRAMQALYALALEPVAETTADRNSYGFRQGRCTADAAGQ

CFTVLGRSDCAKYILDADITGCFDNISHEWLLDNIPLDKEVLRKWLKSGFVWKQQLFP

THAGTPQGGVISPMLANMTLDGMEELLKKHLRKQKVNLIRYAGDFVVTGESKETLEKV

TTVIQEFLKERGLTLSEEKTKVVHIEEGFDFLGWNIRKYGEKLLIKPAKKNIKAFHKK

IRDALKELRTATQEAVIDTLNPIIKGWANYHRNQVSKRIFNRADDNIWHKLWRWAKRR

HPNKPARWTKNKYFIKIGNRHWVFGTWKKDKEGRLRSRYLIKAGDTRIQRHVKIKADA

NPFLPEWAEYFEERKKLKEAPAQYRRIRRELWKKQGGICPVCGGEIEQDMLTEIHHIL

PKHKGGSDDLDNLVLIHANCHKQVHSRDGQHSRFLLKEGL

top]


[Secondary structure]

 

Essentially the same as Th.e.I1, please see the structure of Th.e.I1.

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |