
Information for B.c.I6 intron

[Intron sequence]

 

Sequence from the GenBank entry.  The intron is shown in red; the ORF
is shown in blue, with its start and stop codons underlined.

 

5' end

   1 gtgcgagacg ttctctcata agtaatgctc aggcataagc gacagagagg actactctaa
  61 ggtttaataa ccaaaatgag taggataact attcagacga gataaggaat gatggagtaa
 121 cgccctgaaa cttatccagt gagatttcca cccgcattat acaggcatgt atagatgtgg
 181 aaagccggat aaaatatgca atcacgaacc gaagtgtccg ataaggataa ccgaaagccc
 241 aagggaataa taaacccgtg ggaattgttc aagtattgta tgctgtcaac cgacatggtg
 301 agaatgtggt aaaccactga cgaacttgcg aatgtacgct tcgataataa tacggtacag
 361 aaatgtatcg ggtctgaaaa tatgatttgt ctaatggaaa gaaaggtatt aaacattgcc
 421 ctaatcctaa cgttgttata ggcaacggag ctagacacgg agtaaagcaa gcacctaaat
 481 cggcaaatga tagtctgaat ggaacgtcga aagctttcgg gcgattaaac aaaagggggc
 541 accctgcctg tagacgaaac ccaacaggga aataagctac ttaaccgaaa gtgagagtgc
 601 tggcacgact gacgatatcc cttgtaatga gggaaggagg aacagccagt agacattata
 661 taagggaaaa tgtggtttca ctaaatccta gcatgggttc tggtatgacg aaaaaggtca
 721 cggaactatg aagggaagtg atagaccatg aataaacaat ctgtggaata tatgccaaaa
 781 gacaaaaaga tcagacattc ggaatactat ggtatgatgg aaaggtttga tgaactttat
 841 cagaaggcaa aaaacaaaca aaacttccga aatctaatgc gttacattac cgcagatgaa
 901 aatattctct tggcttaccg taatatcaag cgaaataagg gaagccaaac accatctatg
 961 gataaagtaa ccatacgaga agtagcgaaa atgacacaag aaatgctcat taattttgta
1021 aaacagcgct ttgacaatta tcaaccacga gtggtacgta gaaaggaaat cccaaaaccg
1081 aatggagaaa cgagacctct gggcatacca tccttttggg acagactaat acaacaatgt
1141 atccttcaag ttcttgagcc aatatgtgag gcgaattttc acactagaag ttatggcttt
1201 cgccctaacc gaagtgccga aaatgcaata gccgatgcaa caaaaaaaat caacatacaa
1261 ggactaacgc aagtggtaga cgttgacatc aaagggtttt tcgatgaagt aaatcatgcg
1321 aaacttatac gccagttatg gacactaggt attcgtgata aacaacttct cgtaattatc
1381 cggaagatat taaaagcgcc agtgctaatg ccaaatggaa aagtaatgta tccactgaaa
1441 ggaacccctc aaggaggtat tctcagccct ctactagcaa atattaattt aaatgaattt
1501 gactggtgga taagcaacca gtgggagact ttttcagcga agaaagtaaa accgagaata
1561 aaagatggaa tttggagcaa tgacaatgta accaactgtc tagcgagaaa ttcaaattta
1621 aagcccatgt atattgttcg ctatgctgac gattttaaaa tcttcacaaa cacaaaaagc
1681 aatgccgata aaatcttcat ggcaagcaaa ttgtggttgg aaagacgcct aaaattacct
1741 atctcagaag ggaaatcgag ggtaacaaac ttgaaaaagc aatcaagtga gtttttagga
1801 ttcacgctga aagctgtcaa gaaaggtaaa cggcaaagcg ttacaaaata tgtggtaaaa
1861 actcatatcg caccaaaggc attaaaagca ctaagagtaa aattaagtgg gcaaataagg
1921 agaatgcaga aatcaccgaa tagcatgaac tgtataaagg aaatcggaaa gtacaacagt
1981 atggttatcg gaatgcataa ctattatagc attgcgacac atatcagcct tgaccttaaa
2041 agaatgggct ttgaattaac tgaacaaatg tataatcgtt ttcctaaagc aaaaatgcaa
2101 gataagaaga actccaacgg tttcacaaat atgggcgaat ataatgggaa agacaaggga
2161 ttgaagccat acctaaaatc aaaagcatta cgctacttaa tgaaggtacc aattgtccct
2221 gtgtcagcta ttaaacaccg aaatcctatg atgaaacgac aagctgtcaa taagtatacg
2281 gaagaaggtc gcaagcttat tcatgccagc ttaaaaattg tctcagagga agagcttaaa
2341 tggctaaggg aacatcctat cctctcgaac agggcaacga ttgagcttaa tgataatcgt
2401 atttcccttt ttgtcgctca aaacgggaaa tgtggagtaa cgggtgaaaa actggaccta
2461 actgatatgc attgtcacca taagaaacta tggagtaaaa cacatgatga tagttatcaa
2521 aatttaattc ttatcaaatc ggacgttcat aggctaatac atgcgacaaa acaagaaaca
2581 atagataaac tccttcaaac attaaactta aatgaaaaac aactacttaa actaaataaa
2641 ttgcgcaagc tagccgaaaa tgaagaaatc tgtatctaaa cctaaaattg tgatgaaatt
2701 ttcgttgttt gatttaacaa ggcacataac aacagaatgt ttatgattta ggagaaacca
2761 gacttatata atgttggaac gccgtatgcg gtgaaagtcg cacgtgcggt gtgaagcggg
2821 ggaaaaaatg gagataactt caaagtttta cctatcgcta tt

3' end
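
The listing above uses the GenBank flat-file layout: a position number followed by blocks of ten bases. As a minimal sketch, such lines could be joined into one contiguous string as shown below. The function name `parse_numbered_sequence` is hypothetical and is not part of the database; the two sample lines are copied from the listing.

```python
import re

def parse_numbered_sequence(text: str) -> str:
    """Join GenBank-style numbered lines into one lowercase base string."""
    bases = []
    for line in text.splitlines():
        # Drop the leading position number, if present, then all whitespace.
        body = re.sub(r"^\s*\d+\s+", "", line)
        body = re.sub(r"\s+", "", body).lower()
        # Keep only lines made purely of nucleotide letters,
        # which skips labels such as "5' end" and "3' end".
        if body and re.fullmatch(r"[acgtn]+", body):
            bases.append(body)
    return "".join(bases)

listing = """
5' end
   1 gtgcgagacg ttctctcata agtaatgctc aggcataagc gacagagagg actactctaa
  61 ggtttaataa ccaaaatgag taggataact attcagacga gataaggaat gatggagtaa
3' end
"""
seq = parse_numbered_sequence(listing)
print(len(seq))   # 120
print(seq[:10])   # gtgcgagacg
```

Because the position number is optional in this sketch, a line that continues a row without its own number is still picked up.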



[Intron and flanking sequence]

 

2701 gaacaaaaaa gaactagaaa tgtttttagg aaatcatgta acaactgcag aattagctga

2761 tgaatttaca tcagactatt ttagtacaaa taagaactac aaagttactt ataaaagtaa

2821 accaagtcta tttaattatg aaaaaccaag aactgttaca aaagtaaaaa aaggattgtt

2881 tttggtaaaa caaaacgact taattctgga atttaaatat gtaccagaaa tagatggatt

2941 tcgaatttct gagattacat atttaaaata aaaaaatatt aggaggaatt tattatgttt

3001 gtgcgagacg ttctctcata agtaatgctc aggcataagc gacagagagg actactctaa

3061 ggtttaataa ccaaaatgag taggataact attcagacga gataaggaat gatggagtaa

3121 cgccctgaaa cttatccagt gagatttcca cccgcattat acaggcatgt atagatgtgg

3181 aaagccggat aaaatatgca atcacgaacc gaagtgtccg ataaggataa ccgaaagccc

3241 aagggaataa taaacccgtg ggaattgttc aagtattgta tgctgtcaac cgacatggtg

3301 agaatgtggt aaaccactga cgaacttgcg aatgtacgct tcgataataa tacggtacag

3361 aaatgtatcg ggtctgaaaa tatgatttgt ctaatggaaa gaaaggtatt aaacattgcc

3421 ctaatcctaa cgttgttata ggcaacggag ctagacacgg agtaaagcaa gcacctaaat

3481 cggcaaatga tagtctgaat ggaacgtcga aagctttcgg gcgattaaac aaaagggggc

3541 accctgcctg tagacgaaac ccaacaggga aataagctac ttaaccgaaa gtgagagtgc

3601 tggcacgact gacgatatcc cttgtaatga gggaaggagg aacagccagt agacattata

3661 taagggaaaa tgtggtttca ctaaatccta gcatgggttc tggtatgacg aaaaaggtca

3721 cggaactatg aagggaagtg atagaccatg aataaacaat ctgtggaata tatgccaaaa

3781 gacaaaaaga tcagacattc ggaatactat ggtatgatgg aaaggtttga tgaactttat

3841 cagaaggcaa aaaacaaaca aaacttccga aatctaatgc gttacattac cgcagatgaa

3901 aatattctct tggcttaccg taatatcaag cgaaataagg gaagccaaac accatctatg

3961 gataaagtaa ccatacgaga agtagcgaaa atgacacaag aaatgctcat taattttgta

4021 aaacagcgct ttgacaatta tcaaccacga gtggtacgta gaaaggaaat cccaaaaccg

4081 aatggagaaa cgagacctct gggcatacca tccttttggg acagactaat acaacaatgt

4141 atccttcaag ttcttgagcc aatatgtgag gcgaattttc acactagaag ttatggcttt

4201 cgccctaacc gaagtgccga aaatgcaata gccgatgcaa caaaaaaaat caacatacaa

4261 ggactaacgc aagtggtaga cgttgacatc aaagggtttt tcgatgaagt aaatcatgcg

4321 aaacttatac gccagttatg gacactaggt attcgtgata aacaacttct cgtaattatc

4381 cggaagatat taaaagcgcc agtgctaatg ccaaatggaa aagtaatgta tccactgaaa

4441 ggaacccctc aaggaggtat tctcagccct ctactagcaa atattaattt aaatgaattt

4501 gactggtgga taagcaacca gtgggagact ttttcagcga agaaagtaaa accgagaata

4561 aaagatggaa tttggagcaa tgacaatgta accaactgtc tagcgagaaa ttcaaattta

4621 aagcccatgt atattgttcg ctatgctgac gattttaaaa tcttcacaaa cacaaaaagc

4681 aatgccgata aaatcttcat ggcaagcaaa ttgtggttgg aaagacgcct aaaattacct

4741 atctcagaag ggaaatcgag ggtaacaaac ttgaaaaagc aatcaagtga gtttttagga

4801 ttcacgctga aagctgtcaa gaaaggtaaa cggcaaagcg ttacaaaata tgtggtaaaa

4861 actcatatcg caccaaaggc attaaaagca ctaagagtaa aattaagtgg gcaaataagg

4921 agaatgcaga aatcaccgaa tagcatgaac tgtataaagg aaatcggaaa gtacaacagt

4981 atggttatcg gaatgcataa ctattatagc attgcgacac atatcagcct tgaccttaaa

5041 agaatgggct ttgaattaac tgaacaaatg tataatcgtt ttcctaaagc aaaaatgcaa

5101 gataagaaga actccaacgg tttcacaaat atgggcgaat ataatgggaa agacaaggga

5161 ttgaagccat acctaaaatc aaaagcatta cgctacttaa tgaaggtacc aattgtccct

5221 gtgtcagcta ttaaacaccg aaatcctatg atgaaacgac aagctgtcaa taagtatacg

5281 gaagaaggtc gcaagcttat tcatgccagc ttaaaaattg tctcagagga agagcttaaa

5341 tggctaaggg aacatcctat cctctcgaac agggcaacga ttgagcttaa tgataatcgt

5401 atttcccttt ttgtcgctca aaacgggaaa tgtggagtaa cgggtgaaaa actggaccta

5461 actgatatgc attgtcacca taagaaacta tggagtaaaa cacatgatga tagttatcaa

5521 aatttaattc ttatcaaatc ggacgttcat aggctaatac atgcgacaaa acaagaaaca

5581 atagataaac tccttcaaac attaaactta aatgaaaaac aactacttaa actaaataaa

5641 ttgcgcaagc tagccgaaaa tgaagaaatc tgtatctaaa cctaaaattg tgatgaaatt

5701 ttcgttgttt gatttaacaa ggcacataac aacagaatgt ttatgattta ggagaaacca

5761 gacttatata atgttggaac gccgtatgcg gtgaaagtcg cacgtgcggt gtgaagcggg

5821 ggaaaaaatg gagataactt caaagtttta cctatcgcta tttacaaatt aggactgtta

5881 ttgctgatgc attaagaatt gatgaagaag tgaatggttt tttgaagtat tgtgctaatc

5941 acgggaaaat agttaaagaa ataaaaccag gtgggatcat taatcgtgga aatgatcaag

6001 gccaaccact tgtaaccgtt atagttgttt atgaagagaa aaattaactg cagttatgaa

6061 aaagatgagg attaattcct tgtctttttt attttctaag aaaggtggat atatcgacta

6121 tgaaaataat aagcaaagtg gccgtcttta attacaaaga attcgagagt tctttgttag



[ORF sequence]

 

MNKQSVEYMPKDKKIRHSEYYGMMERFDELYQKAKNKQNFRNLMRYITADENILLAYR

NIKRNKGSQTPSMDKVTIREVAKMTQEMLINFVKQRFDNYQPRVVRRKEIPKPNGETR

PLGIPSFWDRLIQQCILQVLEPICEANFHTRSYGFRPNRSAENAIADATKKINIQGLT

QVVDVDIKGFFDEVNHAKLIRQLWTLGIRDKQLLVIIRKILKAPVLMPNGKVMYPLKG

TPQGGILSPLLANINLNEFDWWISNQWETFSAKKVKPRIKDGIWSNDNVTNCLARNSN

LKPMYIVRYADDFKIFTNTKSNADKIFMASKLWLERRLKLPISEGKSRVTNLKKQSSE

FLGFTLKAVKKGKRQSVTKYVVKTHIAPKALKALRVKLSGQIRRMQKSPNSMNCIKEI

GKYNSMVIGMHNYYSIATHISLDLKRMGFELTEQMYNRFPKAKMQDKKNSNGFTNMGE

YNGKDKGLKPYLKSKALRYLMKVPIVPVSAIKHRNPMMKRQAVNKYTEEGRKLIHASL

KIVSEEELKWLREHPILSNRATIELNDNRISLFVAQNGKCGVTGEKLDLTDMHCHHKK

LWSKTHDDSYQNLILIKSDVHRLIHATKQETIDKLLQTLNLNEKQLLKLNKLRKLAEN

EEICI
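
The ORF sequence above can be cross-checked against the intron nucleotide listing. The sketch below assumes the standard genetic code and translates the first bases of the ORF, copied from the intron listing starting at the `atg` on the line numbered 721. That start position comes from reading the listing rather than from an annotation on this page. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code in t, c, a, g order (assumed to apply to this ORF).
BASES = "tcag"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate in frame 0 until a stop codon or the end of the sequence."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        # Unknown codons (for example, containing n) become X.
        aa = CODON_TABLE.get(dna[i:i + 3].lower(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 33 bases of the ORF, copied from the intron listing.
orf_start = "atgaataaacaatctgtggaatatatgccaaaa"
print(translate(orf_start))  # MNKQSVEYMPK
```

The output matches the first eleven residues of the ORF sequence listed above.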



[Secondary structure]

 

 

                                                    


