
Information for B.th.I3 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence]

 

Sequence from GenBank entry.  The intron boundaries are identified in red and the ORF in blue, with start and stop codons underlined.

 

5' end

   1 gcgtgtccag ataaggacac attatctagt tggtgaaagt ccaatcggga gaatagactc
  61 cactcccgta gcctgagagg cgccatgaag tgaaagcgac atggcgaagc cctctgacat
 121 cacatgcaaa aggcatgtgg gcaaatctcc aggttgtaac gtacagtgaa cgtagatgta
 181 gcctcgttac acgcaattcc ggtggctgag acggtcgata acgtgcgaaa gcaacaagaa
 241 ctgaccgaaa cgagtctgga acggcagttc tcaccggcgg ggtatcagcg gcgacatgga
 301 gagaaagata atgtggaaac tggagaagcc ctcgacaccc gatgaagaaa tctcatcgag
 361 gagaccgtct ctataaccgc gatcgggaag tgagaacggt aggtgtcagg gtggcggaac
 421 ggatcgtagt accagagatc tgcgtgcagc aaaacacgcg gggaggaaag gatccgaacc
 481 tgtaaacatt tcgtatgact aaaggaggca agggtgctat gacaaaaaca cccatcactt
 541 tgcaggaact aaggcaacga atctatcgaa aggcgaagtc tgaacctacg caccggtttt
 601 gggggatatt tacccacatt accaaaatta caacccttca cgaagcatat caacaggcga
 661 ggaaaaacaa tggtgcccca ggcattgatg gaaaaagttt tgctgatata gaactagaag
 721 gagttatccc attcttaacg ggtattcaag aagaattaca agctggaata taccaaccac
 781 aagccaatcg aaaagtagaa atcccaaaga caaacggcaa aatgcgaact ctgcaaattc
 841 cgtgtatacg agatcgtgta gtacaaggag cgctaaaact catattagaa gcaatttttg
 901 aagctgattt ctgtccaaac tcgtatgggt ttcggccgaa acgctctcca catcaggcat
 961 tggcagaagt acgacgcagt atattgcgcc gtatgaccat aataattgat gttgatttgt
1021 cacgctactt tgatacaatc cgacacaaca tattattgga gaaaatcgcg aaacgtgtcc
1081 aagacccaca ggttatgcat cttgtaaaac aggtaattaa ggcaacagga aagatcggtg
1141 tcccgcaagg gggaccattt tctccactag ccgcgaacat ttacctcaat gaagtggatt
1201 ggacatttga taccattcga cgtaaaacag ctgatggcaa ttacgaagct gtaaactatc
1261 accgttttgc cgacgatata gttatcgctg taagcggaca ctccagtaaa agtggatggg
1321 ctgaactggc attacgacga ctgtgggaac agttaaagcc tttgggagtt gaactgaatc
1381 tggagaagac tcagatggtc aatgtcctaa aaggagaatc cttcggattt ttaggatttg
1441 acctgagacg aataccgaac cgaaataaga atggattctt cgttttcatg atcccgaaga
1501 agaaagctcg cacgacagtg aaagcaaaaa ttcgcgaact cattcaaaac gcaggagcga
1561 agccagcaca agacttaata aaacaaatca atgccgtact gactggatgg gtgaactact
1621 tccgggttgg gaactctagt caagctttca gtgaagtacg tgattatacg gagatgaaaa
1681 ttcgcacact gttaacgaga aggaaacgac gacgaaagcg tagcatcgga tggcagagat
1741 ggagtaacga atatctctat ggtgtactgg ggctttactg ggactggaaa gtccttcccc
1801 tgaaaagtgc agagagtttc cgatgaaagt gtctgccatt tgataggtct cataaccctt
1861 ttgatgaagt ttatggggtg agctgcttga gggaaaacct cacgagcagt tcttatgggg
1921 agaggctgga aacggatcac aactgatacc gcgccagtct tttacccgac
                                       

3' end
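
The numbered listing above can be reduced to a contiguous nucleotide string before further analysis. The following is a minimal sketch, not part of the original entry: the function name `parse_numbered_sequence` is hypothetical, and it assumes the input contains only position numbers, whitespace, and sequence letters (labels such as "5' end" should be excluded first).

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Return the bare nucleotide string from a GenBank-style numbered block.

    Hypothetical helper: it drops position numbers and whitespace and keeps
    only letters, so wrapped lines that lack a leading number are joined too.
    """
    return re.sub(r"[^A-Za-z]", "", block).lower()


# First two lines of the intron listing above.
sample = """
   1 gcgtgtccag ataaggacac attatctagt tggtgaaagt ccaatcggga gaatagactc
  61 cactcccgta gcctgagagg cgccatgaag tgaaagcgac atggcgaagc cctctgacat
"""

seq = parse_numbered_sequence(sample)
print(len(seq))   # number of bases recovered from the two sample lines
print(seq[:10])   # first ten bases at the 5' end
```

Because every non-letter character is discarded, the same call also works on the full intron or flanking listing pasted as one string.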



[Intron and flanking sequence]

 

   1 tccaagttga tttagagaac attatgcaca cgttaaaacc aggacaaacg tatgaaataa
  61 aagagtcgta tattggcaag aatcaaagat tatttacaag agtaattatc taccgattaa
 121 cagaggaaca aatactggaa cgtagaaaaa aacaaagcta taccgaaagt aaaaagggga
 181 ttacattttc agaaaagagt aaacgattaa cgggtatcaa catatatgtt acgaatacgc
 241 cttgggaagt ggttccgatg gaacaaatcc atgattttta ctccctccgc tggcagatcg
 301 gcgtgtccag ataaggacac attatctagt tggtgaaagt ccaatcggga gaatagactc
 361 cactcccgta gcctgagagg cgccatgaag tgaaagcgac atggcgaagc cctctgacat
 421 cacatgcaaa aggcatgtgg gcaaatctcc aggttgtaac gtacagtgaa cgtagatgta
 481 gcctcgttac acgcaattcc ggtggctgag acggtcgata acgtgcgaaa gcaacaagaa
 541 ctgaccgaaa cgagtctgga acggcagttc tcaccggcgg ggtatcagcg gcgacatgga
 601 gagaaagata atgtggaaac tggagaagcc ctcgacaccc gatgaagaaa tctcatcgag
 661 gagaccgtct ctataaccgc gatcgggaag tgagaacggt aggtgtcagg gtggcggaac
 721 ggatcgtagt accagagatc tgcgtgcagc aaaacacgcg gggaggaaag gatccgaacc
 781 tgtaaacatt tcgtatgact aaaggaggca agggtgctat gacaaaaaca cccatcactt
 841 tgcaggaact aaggcaacga atctatcgaa aggcgaagtc tgaacctacg caccggtttt
 901 gggggatatt tacccacatt accaaaatta caacccttca cgaagcatat caacaggcga
 961 ggaaaaacaa tggtgcccca ggcattgatg gaaaaagttt tgctgatata gaactagaag
1021 gagttatccc attcttaacg ggtattcaag aagaattaca agctggaata taccaaccac
1081 aagccaatcg aaaagtagaa atcccaaaga caaacggcaa aatgcgaact ctgcaaattc
1141 cgtgtatacg agatcgtgta gtacaaggag cgctaaaact catattagaa gcaatttttg
1201 aagctgattt ctgtccaaac tcgtatgggt ttcggccgaa acgctctcca catcaggcat
1261 tggcagaagt acgacgcagt atattgcgcc gtatgaccat aataattgat gttgatttgt
1321 cacgctactt tgatacaatc cgacacaaca tattattgga gaaaatcgcg aaacgtgtcc
1381 aagacccaca ggttatgcat cttgtaaaac aggtaattaa ggcaacagga aagatcggtg
1441 tcccgcaagg gggaccattt tctccactag ccgcgaacat ttacctcaat gaagtggatt
1501 ggacatttga taccattcga cgtaaaacag ctgatggcaa ttacgaagct gtaaactatc
1561 accgttttgc cgacgatata gttatcgctg taagcggaca ctccagtaaa agtggatggg
1621 ctgaactggc attacgacga ctgtgggaac agttaaagcc tttgggagtt gaactgaatc
1681 tggagaagac tcagatggtc aatgtcctaa aaggagaatc cttcggattt ttaggatttg
1741 acctgagacg aataccgaac cgaaataaga atggattctt cgttttcatg atcccgaaga
1801 agaaagctcg cacgacagtg aaagcaaaaa ttcgcgaact cattcaaaac gcaggagcga
1861 agccagcaca agacttaata aaacaaatca atgccgtact gactggatgg gtgaactact
1921 tccgggttgg gaactctagt caagctttca gtgaagtacg tgattatacg gagatgaaaa
1981 ttcgcacact gttaacgaga aggaaacgac gacgaaagcg tagcatcgga tggcagagat
2041 ggagtaacga atatctctat ggtgtactgg ggctttactg ggactggaaa gtccttcccc
2101 tgaaaagtgc agagagtttc cgatgaaagt gtctgccatt tgataggtct cataaccctt
2161 ttgatgaagt ttatggggtg agctgcttga gggaaaacct cacgagcagt tcttatgggg
2221 agaggctgga aacggatcac aactgatacc gcgccagtct tttacccgac aaatcatatt
2281 taaaacgtgg aaatctctat ttcaaattca tcattggcaa actatcaaac aagagcgatt
2341 agaatgccat gtgtatggaa aactcattgc catttttata tgttcttcca cgatgtttaa
2401 gatgcgccaa cttctgttgc aaaagcacaa aagagaacta agcgaatata aagcaattgg
2461 gatgattcaa gatcatctat ccctgttata tcaagcgata cagagaaaca cccgtataat
2521 aacaaaggtt ttaatccgcc tgtttaccct actaaagaaa aatggccgga



[ORF sequence]

 

MTKGGKGAMTKTPITLQELRQRIYRKAKSEPTHRFWGIFTHITKITTLHEAYQQARKN

NGAPGIDGKSFADIELEGVIPFLTGIQEELQAGIYQPQANRKVEIPKTNGKMRTLQIP

CIRDRVVQGALKLILEAIFEADFCPNSYGFRPKRSPHQALAEVRRSILRRMTIIIDVD

LSRYFDTIRHNILLEKIAKRVQDPQVMHLVKQVIKATGKIGVPQGGPFSPLAANIYLN

EVDWTFDTIRRKTADGNYEAVNYHRFADDIVIAVSGHSSKSGWAELALRRLWEQLKPL

GVELNLEKTQMVNVLKGESFGFLGFDLRRIPNRNKNGFFVFMIPKKKARTTVKAKIRE

LIQNAGAKPAQDLIKQINAVLTGWVNYFRVGNSSQAFSEVRDYTEMKIRTLLTRRKRR

RKRSIGWQRWSNEYLYGVLGLYWDWKVLPLKSAESFR
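
The relationship between the ORF protein sequence and the intron nucleotide sequence can be checked by translating the coding region. The sketch below is illustrative only: the `translate` function and variable names are hypothetical, and it assumes the standard codon assignments. The two fragments are copied from the intron listing (the stretch beginning at the `atg` on the line numbered 481, and the last five codons ending in `tga` on the line numbered 1801).

```python
from itertools import product

# Standard codon assignments, listed in the conventional t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as 'n' (not handled in this sketch).
    """
    dna = dna.lower()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the ORF, copied from the intron listing.
orf_start = "atgactaaaggaggcaagggtgctatgacaaaaacacccatcactttgcag"
# End of the ORF, including the tga stop codon.
orf_end = "gagagtttccgatga"

print(translate(orf_start))  # should match the first residues listed above
print(translate(orf_end))    # should match the last residues listed above
```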



[Secondary structure]

 

                                                    
