
Information for B.sp.I1 intron


[Intron sequence]

 

Sequence from the GenBank entry (the intron is on the antisense strand). The boundaries of the intron are marked in red and the ORF in blue, with the start and stop codons underlined.

Intron on antisense strand

3' end

                                 atcga gtaggaggca cgtcaccgtg ccgacctctc
96421 acagcaccgt acgtaccgtt cggtatacgg cgcctcgaaa gttttatgaa atgattactt
96481 gtaacgattt taaacctagc tgtcgccaat acttattgct caaggcggca ttaattacag
96541 ggtttcggga gttgcgccaa taaccctttc gggtgttggc gcttttccgt gcattttctc
96601 ttgagatacc taatttccta aggtttttgt acttggtctt tatctttttc cattgtttcc
96661 acaggcacgc tctcagtcta cgacggatat gtttatcgat atcctctgcg tatgatttta
96721 tatccccaac tttaaaatag ttcccccacc cctggatgag ttgattaatt ttaaggattc
96781 tgtaagtcat acttactccc cagttacggt ttgtcagctt cagtagtttg gtttcaagcc
96841 tttgttttga tttctttggc acgtacaccc tggttttcgt tctatagaaa ctcactccta
96901 agaatgtccg tgcactcggc tttcccaccg cacttttctc ctcgttgacc tttagcttta
96961 attttctctc tagaaattct gtaactcctt gtttaactct ctctccagct ttaatgctct
97021 taacatagat attgcagtca tctgcgtagc gtacaaattt atgtcctctt ttctccagct
97081 ccttatctaa ttcattaagg ataatattgc tgagaagagg gcttaaggga cctccttgcg
97141 gagtcccctc tctgttggta ataaccacac cgttcaccat aaccccggct tgaagaaatc
97201 ttcggataag cttaagtgtc ggtttatcag agatggtctt ggctatcaga gacatgagct
97261 tatcatgctg gactcggtca aagaatttct ctagatccat atctacaacg aagttgtagc
97321 cttcttcgat gtattgtctt gcctgttcga tggcttgatg tgcactctta ttcggtcgaa
97381 aaccataact aaagtggctg aactgccttt caaatatagg tgttaacctt tgaaccacgg
97441 cttgctgaat cactcggtcc gtcacagtag gaatcccaag ttgtcgtttt ccaccattag
97501 cttttggtat ctcaaccctt ctaactggct gaggtttgta ttttccttct cgtatcagtt
97561 gaataatctc ttgtccattt tccctcaaat ataggcgagt agcttctata tctttctcat
97621 cgacacctgc tgctcctttg tttcgtttaa cctttttaaa agcaaggtca aggttttcag
97681 tatcaacgat acggtcaatt aatgtttggt tacattccat attgtttcca tgctcttgag
97741 attgcacaaa agtaactcca cgctagttta aggtttccag ctttccgagc tttctgcctt
97801 aaaccagatt tgttggctgt gtcatcgttc ttttgtaaag ttgctctaac atggtagctc
97861 ctttcgttcc gcccttctca acaaatcgtt gatactatgg cttctgctga cttctcttag
97921 ttcagctgct tatcactaag caggttctcc ttccggaggt ttctaagaga cctcccgggg
97981 taagttcatc cactttctcc tcatatatct gccacattta tgtctgttcc atccggggat
98041 attttggact ttgttttgtt ttgcaaactc gtcctgaaac agtccacctc aaatgtgatt
98101 cgtgtacctc agaccgagga tttgcctagg gcttccttca gattccacgt cgccatggac
98161 acccttgcct tcagctaatg gttcgcatat cccaacgccc atagcggact tgcaccgcct
98221 agttgatgaa catgcccggc acac

5' end

 

Intron on sense strand

 

5' end

   1 gtgtgccggg catgttcatc aactaggcgg tgcaagtccg ctatgggcgt tgggatatgc 

  61 gaaccattag ctgaaggcaa gggtgtccat ggcgacgtgg aatctgaagg aagccctagg 

 121 caaatcctcg gtctgaggta cacgaatcac atttgaggtg gactgtttca ggacgagttt 

 181 gcaaaacaaa acaaagtcca aaatatcccc ggatggaaca gacataaatg tggcagatat 

 241 atgaggagaa agtggatgaa cttaccccgg gaggtctctt agaaacctcc ggaaggagaa

 301 cctgcttagt gataagcagc tgaactaaga gaagtcagca gaagccatag tatcaacgat 

 361 ttgttgagaa gggcggaacg aaaggagcta ccatgttaga gcaactttac aaaagaacga 

 421 tgacacagcc aacaaatctg gtttaaggca gaaagctcgg aaagctggaa accttaaact 

 481 agcgtggagt tacttttgtg caatctcaag agcatggaaa caatatggaa tgtaaccaaa 

 541 cattaattga ccgtatcgtt gatactgaaa accttgacct tgcttttaaa aaggttaaac 

 601 gaaacaaagg agcagcaggt gtcgatgaga aagatataga agctactcgc ctatatttga 

 661 gggaaaatgg acaagagatt attcaactga tacgagaagg aaaatacaaa cctcagccag 

 721 ttagaagggt tgagatacca aaagctaatg gtggaaaacg acaacttggg attcctactg 

 781 tgacggaccg agtgattcag caagccgtgg ttcaaaggtt aacacctata tttgaaaggc 

 841 agttcagcca ctttagttat ggttttcgac cgaataagag tgcacatcaa gccatcgaac 

 901 aggcaagaca atacatcgaa gaaggctaca acttcgttgt agatatggat ctagagaaat 

 961 tctttgaccg agtccagcat gataagctca tgtctctgat agccaagacc atctctgata 

1021 aaccgacact taagcttatc cgaagatttc ttcaagccgg ggttatggtg aacggtgtgg 

1081 ttattaccaa cagagagggg actccgcaag gaggtccctt aagccctctt ctcagcaata 

1141 ttatccttaa tgaattagat aaggagctgg agaaaagagg acataaattt gtacgctacg 

1201 cagatgactg caatatctat gttaagagca ttaaagctgg agagagagtt aaacaaggag 

1261 ttacagaatt tctagagaga aaattaaagc taaaggtcaa cgaggagaaa agtgcggtgg 

1321 gaaagccgag tgcacggaca ttcttaggag tgagtttcta tagaacgaaa accagggtgt 

1381 acgtgccaaa gaaatcaaaa caaaggcttg aaaccaaact actgaagctg acaaaccgta 

1441 actggggagt aagtatgact tacagaatcc ttaaaattaa tcaactcatc caggggtggg 

1501 ggaactattt taaagttggg gatataaaat catacgcaga ggatatcgat aaacatatcc 

1561 gtcgtagact gagagcgtgc ctgtggaaac aatggaaaaa gataaagacc aagtacaaaa 

1621 accttaggaa attaggtatc tcaagagaaa atgcacggaa aagcgccaac acccgaaagg 

1681 gttattggcg caactcccga aaccctgtaa ttaatgccgc cttgagcaat aagtattggc 

1741 gacagctagg tttaaaatcg ttacaagtaa tcatttcata aaactttcga ggcgccgtat 

1801 accgaacggt acgtacggtg ctgtgagagg tcggcacggt gacgtgcctc ctactcgat

3' end
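
The two listings above show the same intron read from opposite strands, so each should be the reverse complement of the other. The following is a minimal Python sketch of that conversion. The function name `reverse_complement` is an illustrative choice, not part of the database, and the code assumes lowercase a/c/g/t input laid out as in the listings (spaces and coordinate digits are ignored, and any other symbol, such as an ambiguity code, would be silently dropped).

```python
# Complement table for lowercase DNA, matching the case used in the listings.
_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string.

    Only a/c/g/t characters are kept, so spaces and coordinate digits
    copied from a listing row are discarded (assumption: no ambiguity codes).
    """
    bases = "".join(ch for ch in seq.lower() if ch in "acgt")
    return bases.translate(_COMPLEMENT)[::-1]


# Last row of the antisense listing (GenBank position 98221 onward).
antisense_tail = "agttgatgaa catgcccggc acac"
print(reverse_complement(antisense_tail))  # gtgtgccgggcatgttcatcaact
```

The printed string matches the first 24 bases of the sense-strand listing (row 1), which is a quick way to check that the two listings agree.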



[Intron and flanking sequence]

 

96001 tccttgcgcc tgccagcctc gccctggcat cagagatgga ttggatgact tccttctcat
96061 gtgctgcata tcctttgact gtattaacga ggtttgggat cagatccagc cttcgctgaa
96121 gctggttttc gatctgcgca tacgctttgt ccacatcttc ctcagcattt acaaatccat
96181 tatagctgga tcccagcatc ataaaaatga tgactaaggc tacaacggca atcgctatcc
96241 ctgcaaaacc ctttttcaag ttgtccactt ccaattcata gatgatttat ctgcttgaac
96301 attatttttc cttcttttcc ggaaagctaa actttcaaga agggaatgaa agatggcatt
96361 ttcaggatat aactttccag cttatatcga gtaggaggca cgtcaccgtg ccgacctctc
96421 acagcaccgt acgtaccgtt cggtatacgg cgcctcgaaa gttttatgaa atgattactt
96481 gtaacgattt taaacctagc tgtcgccaat acttattgct caaggcggca ttaattacag
96541 ggtttcggga gttgcgccaa taaccctttc gggtgttggc gcttttccgt gcattttctc
96601 ttgagatacc taatttccta aggtttttgt acttggtctt tatctttttc cattgtttcc
96661 acaggcacgc tctcagtcta cgacggatat gtttatcgat atcctctgcg tatgatttta
96721 tatccccaac tttaaaatag ttcccccacc cctggatgag ttgattaatt ttaaggattc
96781 tgtaagtcat acttactccc cagttacggt ttgtcagctt cagtagtttg gtttcaagcc
96841 tttgttttga tttctttggc acgtacaccc tggttttcgt tctatagaaa ctcactccta
96901 agaatgtccg tgcactcggc tttcccaccg cacttttctc ctcgttgacc tttagcttta
96961 attttctctc tagaaattct gtaactcctt gtttaactct ctctccagct ttaatgctct
97021 taacatagat attgcagtca tctgcgtagc gtacaaattt atgtcctctt ttctccagct
97081 ccttatctaa ttcattaagg ataatattgc tgagaagagg gcttaaggga cctccttgcg
97141 gagtcccctc tctgttggta ataaccacac cgttcaccat aaccccggct tgaagaaatc
97201 ttcggataag cttaagtgtc ggtttatcag agatggtctt ggctatcaga gacatgagct
97261 tatcatgctg gactcggtca aagaatttct ctagatccat atctacaacg aagttgtagc
97321 cttcttcgat gtattgtctt gcctgttcga tggcttgatg tgcactctta ttcggtcgaa
97381 aaccataact aaagtggctg aactgccttt caaatatagg tgttaacctt tgaaccacgg
97441 cttgctgaat cactcggtcc gtcacagtag gaatcccaag ttgtcgtttt ccaccattag
97501 cttttggtat ctcaaccctt ctaactggct gaggtttgta ttttccttct cgtatcagtt
97561 gaataatctc ttgtccattt tccctcaaat ataggcgagt agcttctata tctttctcat
97621 cgacacctgc tgctcctttg tttcgtttaa cctttttaaa agcaaggtca aggttttcag
97681 tatcaacgat acggtcaatt aatgtttggt tacattccat attgtttcca tgctcttgag
97741 attgcacaaa agtaactcca cgctagttta aggtttccag ctttccgagc tttctgcctt
97801 aaaccagatt tgttggctgt gtcatcgttc ttttgtaaag ttgctctaac atggtagctc
97861 ctttcgttcc gcccttctca acaaatcgtt gatactatgg cttctgctga cttctcttag
97921 ttcagctgct tatcactaag caggttctcc ttccggaggt ttctaagaga cctcccgggg
97981 taagttcatc cactttctcc tcatatatct gccacattta tgtctgttcc atccggggat
98041 attttggact ttgttttgtt ttgcaaactc gtcctgaaac agtccacctc aaatgtgatt
98101 cgtgtacctc agaccgagga tttgcctagg gcttccttca gattccacgt cgccatggac
98161 acccttgcct tcagctaatg gttcgcatat cccaacgccc atagcggact tgcaccgcct
98221 agttgatgaa catgcccggc acacaaaaag aagcagcccc cttcaggact gcttcttcat
98281 atctcttttg atgaaaatgg cagcaataat ttaatgccgc gcctgttaaa aggtaaaaac
98341 ttcatatcgg tgataagatg gctcgcatag gcagccatgc aggtcttaaa gactccttct
98401 atcccaaggg agctatcgag ctgataggca atcaccccaa agaaaatcag gccagccgct
98461 gaatgcgtgt aggtcctatg ggagacaaat gatgctgcaa tgatgtatat gcccaaaagc
98521 aaaagccagc tttcctgaag agacaagccg ccggccgcta tcccgattcc tgtaaccgtc
98581 agcatcctcc tcttcgtgat gaaggatgcc aggattacaa tggctgcacc agcccccatt
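
The listings use two numbering schemes: GenBank coordinates for the antisense and flanking sequences, and 1-based positions for the sense strand. The sketch below converts between them. The boundary values 96386 and 98244 are assumptions read off the listings (the unnumbered first antisense row holds the last 35 bases of the 96361 row, and the 98221 row holds 24 intron bases); they are not stated explicitly in the entry, and the function names are hypothetical.

```python
# Intron boundaries in GenBank coordinates, inferred from the listings
# above (assumption: not given explicitly in the entry).
INTRON_GENBANK_START = 96386  # 3' end of the intron on the antisense strand
INTRON_GENBANK_END = 98244    # 5' end of the intron on the antisense strand

INTRON_LENGTH = INTRON_GENBANK_END - INTRON_GENBANK_START + 1


def genbank_to_sense(pos: int) -> int:
    """Map a GenBank coordinate inside the intron to its 1-based sense-strand position."""
    if not INTRON_GENBANK_START <= pos <= INTRON_GENBANK_END:
        raise ValueError(f"{pos} lies outside the intron")
    # The intron is on the antisense strand, so the numbering runs backwards.
    return INTRON_GENBANK_END - pos + 1


def sense_to_genbank(pos: int) -> int:
    """Map a 1-based sense-strand position back to its GenBank coordinate."""
    if not 1 <= pos <= INTRON_LENGTH:
        raise ValueError(f"{pos} lies outside the intron")
    return INTRON_GENBANK_END - pos + 1


print(INTRON_LENGTH)            # 1859
print(genbank_to_sense(98244))  # 1
print(sense_to_genbank(525))    # 97720
```

Under these assumptions the intron is 1859 bases long, which agrees with the last row of the sense listing (position 1801 plus 59 bases). Sense position 525, where the ORF start codon begins, maps to GenBank position 97720.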



[ORF sequence]

 

MECNQTLIDRIVDTENLDLAFKKVKRNKGAAGVDEKDIEATRLYLRENGQEIIQLIRE

GKYKPQPVRRVEIPKANGGKRQLGIPTVTDRVIQQAVVQRLTPIFERQFSHFSYGFRP

NKSAHQAIEQARQYIEEGYNFVVDMDLEKFFDRVQHDKLMSLIAKTISDKPTLKLIRR

FLQAGVMVNGVVITNREGTPQGGPLSPLLSNIILNELDKELEKRGHKFVRYADDCNIY

VKSIKAGERVKQGVTEFLERKLKLKVNEEKSAVGKPSARTFLGVSFYRTKTRVYVPKK

SKQRLETKLLKLTNRNWGVSMTYRILKINQLIQGWGNYFKVGDIKSYAEDIDKHIRRR

LRACLWKQWKKIKTKYKNLRKLGISRENARKSANTRKGYWRNSRNPVINAALSNKYWR

QLGLKSLQVIIS
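
The ORF sequence above can be checked against the sense-strand listing by translating from the start codon. The sketch below is a minimal translator, assuming the standard genetic code and lowercase a/c/g/t input; the names `CODON_TABLE` and `translate` are illustrative and not taken from the database.

```python
from itertools import product

# Standard genetic code in the conventional t/c/a/g ordering
# (first base varies slowest, third base fastest).
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame from its first base, stopping at the first stop codon.

    Spaces and digits from the listing layout are ignored; a trailing
    partial codon is dropped.
    """
    bases = "".join(ch for ch in seq.lower() if ch in BASES)
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the ORF
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the ORF, starting at the atg in sense row 481.
print(translate("atggaa tgtaaccaaa cattaattga ccgt"))  # MECNQTLIDR
```

The output reproduces the first ten residues of the ORF sequence. Passing the sense strand from the start codon onward should yield the full protein, ending at the `taa` stop codon in row 1741.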



[Secondary structure]

                                       


