[Back to introns by organism] [Back to home page]

Information of B.t.I4 intron   (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

[Intron sequence]

 

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

 

Intron on antisense strand

 

3' end

                    gccgagta agcaagggga gtttcacccc gagcttctca ataaaccgtg
239641 cgtgaaactc tcgcttcaca cggctttact tatttaacca ttatcaattg ggaactcacc
239701 ttttctttat tggtaaatct tggcagagcg ccacatttcc aatgataaaa gataagcccc
239761 tctttctctg caatacgggc aagccattct tgagcaggat agaaacgatg tctataacgc
239821 tcgaacttac acattaccca acgggctaga cgtccgttca acgtctgcat aaaccacttc
239881 aaacgggtag gataaaactt accataataa gtcatccaac ctcgtacgat aggatttata
239941 tcagacgcaa ggtgttccaa tgtgagattt gagtgcctgt ttaggtgcca actgcgcatc
240001 gtctcattga ttcgtttcat tgattttcgg ctgatagccg gcaggaaacc cgtaaaacgg
240061 ttgccatact tatctaccga ttcccggggt tgaaacgtaa atccaagaaa gtcaaaagtc
240121 acattgggat agcactcctt ttgacggcta cttttgcaat agacgatttt cgttttctct
240181 tcattgagcc gcaacctgca gcactcaaac cgttgttgaa tcatcgcttg catatactca
240241 gcctgtttaa gactatggca gtgacaaatt gtatcatcag catatctctc aaatggaaca
240301 cggggaaagt tcttttccat ccacttgtca aaggtataat gaagaaacag attcgcaaga
240361 accggaccta tgacactgcc ttggggtaca ccgagagccc tgtcaacctg acttccatcc
240421 gatttctcat aaggaacttt cagccaacgt tctatataca tcaataccca tttttcttgc
240481 gtatgccgtt tcaaggcttt cagcagtagt tcatggtcta tggtgtcaaa gaatttactg
240541 atgtccatgt ccagaaccca cgcatacttc caacaacgct cacgagcttt gccaacagca
240601 tcatgcgccg agcgatgggg acggtaggca taagaatcct catgaaaaca aggttcgata
240661 gaaggggtta tcagcattac tacagccatt tgagctacac gatcagatac cgtcgggata
240721 cccaatggac gtttaccacc agtactcttc ggtatttcta ccagcttcac cgatggagga
240781 aagtaactgc cggaactcat gcgattccat agtttataaa ggttacctcg cagattcttt
240841 tcataatctt caagagtgac cttatcaatc cctgcacttc cacggtttgc tttcacacgc
240901 agaaatgcat cataaaccaa ttgcttcgat attgatattg gttttgcgtt gtcgttttgc
240961 at
cttcctaa ttgtttatca gttgtactaa cttttacgct aaataagctg accccttcgc
241021 tccataccca ttacaggcac ttctacacta ctacgagtca gtccgcccct actataagca
241081 tccgtacttt cttccttgaa ggttctgccc cttggaattt tccgttaaca tcttatagca
241141 ggttcccacg ttccacataa gagcctctta acaggtcatg ccatctgtac accggatgcc
241201 ataacgacaa taaacaggta atctcgttac ttatcccaaa agagtgagaa actcttggtt
241261 ttgacatcat ctacggttat tttcgatgct tcatcaatgg ttcccttgcg gtcatctcct
241321 attaagctac ctgacaattt atacattgcc ttttcatata tcgttcacta ccctgaccat
241381 ggagaaagag cagcatatac gggtttgcaa cccactcctg caagttgatt gcgagggacc
241441 taccctcatc tcctacatag ttgcgaagca tttagcacat gcttctcgtg gcacaa

5' end  

 

Intron on sense strand

 

5' end  

   1 ttgtgccacg agaagcatgt gctaaatgct tcgcaactat gtaggagatg agggtaggtc 

  61 cctcgcaatc aacttgcagg agtgggttgc aaacccgtat atgctgctct ttctccatgg 

 121 tcagggtagt gaacgatata tgaaaaggca atgtataaat tgtcaggtag cttaatagga 

 181 gatgaccgca agggaaccat tgatgaagca tcgaaaataa ccgtagatga tgtcaaaacc 

 241 aagagtttct cactcttttg ggataagtaa cgagattacc tgtttattgt cgttatggca 

 301 tccggtgtac agatggcatg acctgttaag aggctcttat gtggaacgtg ggaacctgct 

 361 ataagatgtt aacggaaaat tccaaggggc agaaccttca aggaagaaag tacggatgct 

 421 tatagtaggg gcggactgac tcgtagtagt gtagaagtgc ctgtaatggg tatggagcga 

 481 aggggtcagc ttatttagcg taaaagttag tacaactgat aaacaattag gaagatgcaa 

 541 aacgacaacg caaaaccaat atcaatatcg aagcaattgg tttatgatgc atttctgcgt 

 601 gtgaaagcaa accgtggaag tgcagggatt gataaggtca ctcttgaaga ttatgaaaag 

 661 aatctgcgag gtaaccttta taaactatgg aatcgcatga gttccggcag ttactttcct 

 721 ccatcggtga agctggtaga aataccgaag agtactggtg gtaaacgtcc attgggtatc 

 781 ccgacggtat ctgatcgtgt agctcaaatg gctgtagtaa tgctgataac cccttctatc 

 841 gaaccttgtt ttcatgagga ttcttatgcc taccgtcccc atcgctcggc gcatgatgct 

 901 gttggcaaag ctcgtgagcg ttgttggaag tatgcgtggg ttctggacat ggacatcagt

 961 aaattctttg acaccataga ccatgaacta ctgctgaaag ccttgaaacg gcatacgcaa 

1021 gaaaaatggg tattgatgta tatagaacgt tggctgaaag ttccttatga gaaatcggat 

1081 ggaagtcagg ttgacagggc tctcggtgta ccccaaggca gtgtcatagg tccggttctt

1141 gcgaatctgt ttcttcatta tacctttgac aagtggatgg aaaagaactt tccccgtgtt 

1201 ccatttgaga gatatgctga tgatacaatt tgtcactgcc atagtcttaa acaggctgag 

1261 tatatgcaag cgatgattca acaacggttt gagtgctgca ggttgcggct caatgaagag 

1321 aaaacgaaaa tcgtctattg caaaagtagc cgtcaaaagg agtgctatcc caatgtgact 

1381 tttgactttc ttggatttac gtttcaaccc cgggaatcgg tagataagta tggcaaccgt 

1441 tttacgggtt tcctgccggc tatcagccga aaatcaatga aacgaatcaa tgagacgatg 

1501 cgcagttggc acctaaacag gcactcaaat ctcacattgg aacaccttgc gtctgatata 

1561 aatcctatcg tacgaggttg gatgacttat tatggtaagt tttatcctac ccgtttgaag 

1621 tggtttatgc agacgttgaa cggacgtcta gcccgttggg taatgtgtaa gttcgagcgt 

1681 tatagacatc gtttctatcc tgctcaagaa tggcttgccc gtattgcaga gaaagagggg 

1741 cttatctttt atcattggaa atgtggcgct ctgccaagat ttaccaataa agaaaaggtg 

1801 agttcccaat tgataatggt taaataagta aagccgtgtg aagcgagagt ttcacgcacg 

1861 gtttattgag aagctcgggg tgaaactccc cttgcttact cggcgg          

3' end  

[top]


[Intron and flanking sequence]

 

239101 ccggcaggat gtgtccacct caatcaacac gcaactcgac tctcttattc ctgcctccaa
239161 gatagcaaat ctttcacaag gaacattcgt tggtgccgta tcggacaact tcggagagaa
239221 aatagaacag aagatattcc acgccgaaat catcgtagac cacgaaaaag tcagcgggga
239281 agagaagaat tacaaaaaaa tacccgtcat caatgaattc agggacaggg agggaaacga
239341 catcatgacg cagcagatcg gacggaacta tgacaggatc aaggcggatg cgcaggcaat
239401 cattaacatg gaaatggaac gcattaaaaa tgacccggaa ctctgcaggc ggctgggact
239461 ggagaatgaa gatgacaaaa agaagagaaa cgggaaacag gaataaatgc gcattttccc
239521 ctccataatt ttgaaagaat cgagtaggag gaaaacgaat tggtattctt cctacttttt
239581 cgttttccga ccgccgagta agcaagggga gtttcacccc gagcttctca ataaaccgtg
239641 cgtgaaactc tcgcttcaca cggctttact tatttaacca ttatcaattg ggaactcacc
239701 ttttctttat tggtaaatct tggcagagcg ccacatttcc aatgataaaa gataagcccc
239761 tctttctctg caatacgggc aagccattct tgagcaggat agaaacgatg tctataacgc
239821 tcgaacttac acattaccca acgggctaga cgtccgttca acgtctgcat aaaccacttc
239881 aaacgggtag gataaaactt accataataa gtcatccaac ctcgtacgat aggatttata
239941 tcagacgcaa ggtgttccaa tgtgagattt gagtgcctgt ttaggtgcca actgcgcatc
240001 gtctcattga ttcgtttcat tgattttcgg ctgatagccg gcaggaaacc cgtaaaacgg
240061 ttgccatact tatctaccga ttcccggggt tgaaacgtaa atccaagaaa gtcaaaagtc
240121 acattgggat agcactcctt ttgacggcta cttttgcaat agacgatttt cgttttctct
240181 tcattgagcc gcaacctgca gcactcaaac cgttgttgaa tcatcgcttg catatactca
240241 gcctgtttaa gactatggca gtgacaaatt gtatcatcag catatctctc aaatggaaca
240301 cggggaaagt tcttttccat ccacttgtca aaggtataat gaagaaacag attcgcaaga
240361 accggaccta tgacactgcc ttggggtaca ccgagagccc tgtcaacctg acttccatcc
240421 gatttctcat aaggaacttt cagccaacgt tctatataca tcaataccca tttttcttgc
240481 gtatgccgtt tcaaggcttt cagcagtagt tcatggtcta tggtgtcaaa gaatttactg
240541 atgtccatgt ccagaaccca cgcatacttc caacaacgct cacgagcttt gccaacagca
240601 tcatgcgccg agcgatgggg acggtaggca taagaatcct catgaaaaca aggttcgata
240661 gaaggggtta tcagcattac tacagccatt tgagctacac gatcagatac cgtcgggata
240721 cccaatggac gtttaccacc agtactcttc ggtatttcta ccagcttcac cgatggagga
240781 aagtaactgc cggaactcat gcgattccat agtttataaa ggttacctcg cagattcttt
240841 tcataatctt caagagtgac cttatcaatc cctgcacttc cacggtttgc tttcacacgc
240901 agaaatgcat cataaaccaa ttgcttcgat attgatattg gttttgcgtt gtcgttttgc
240961 atcttcctaa ttgtttatca gttgtactaa cttttacgct aaataagctg accccttcgc
241021 tccataccca ttacaggcac ttctacacta ctacgagtca gtccgcccct actataagca
241081 tccgtacttt cttccttgaa ggttctgccc cttggaattt tccgttaaca tcttatagca
241141 ggttcccacg ttccacataa gagcctctta acaggtcatg ccatctgtac accggatgcc
241201 ataacgacaa taaacaggta atctcgttac ttatcccaaa agagtgagaa actcttggtt
241261 ttgacatcat ctacggttat tttcgatgct tcatcaatgg ttcccttgcg gtcatctcct
241321 attaagctac ctgacaattt atacattgcc ttttcatata tcgttcacta ccctgaccat
241381 ggagaaagag cagcatatac gggtttgcaa cccactcctg caagttgatt gcgagggacc
241441 taccctcatc tcctacatag ttgcgaagca tttagcacat gcttctcgtg gcacaa
tctc
241501 acaccaccat acgtaccgtt cggcatacgg cggttcctat tttgggcacc atttgagata
241561 gcagtccatg aactccatat agcctgcctt acgcaattta tcattggaca ttacaaaggt
241621 tactattggg ctgttcgaga cacgccaata accaagacga gtgtttcccc attgatatgc
241681 ctgccactca gagacaccgc atttctttaa gtttgctact cgggttttgg cattcttcca
241741 tgatttccag atacacatgc gaatacgtcg tcttagccat tcgtcggttt caatacaaag
241801 ccttttcatt tgtgcaaggc agaagtaggc aatccatcct cttatgtaat ctttgagttt
241861 ctgtttcctt ttctcatatc cccatccatt actccgactt gtcaattctt tgagttttga
241921 tttcatcttg tttttggatt ttgagtgtac aagtagaata cagtcccctt tcttcacata
241981 aaaggagtac cccaagaact ttactccttt cacataagat acgaccgtct tttcatggtt
242041 cactttaagg aagagtttcc cttctatgaa cagagttatc gattccctta cacgtttggc

[top]


[ORF sequence]

 

MQNDNAKPISISKQLVYDAFLRVKANRGSAGIDKVTLEDYEKNLRGNLYKLWNRMSSG

SYFPPSVKLVEIPKSTGGKRPLGIPTVSDRVAQMAVVMLITPSIEPCFHEDSYAYRPH

RSAHDAVGKARERCWKYAWVLDMDISKFFDTIDHELLLKALKRHTQEKWVLMYIERWL

KVPYEKSDGSQVDRALGVPQGSVIGPVLANLFLHYTFDKWMEKNFPRVPFERYADDTI

CHCHSLKQAEYMQAMIQQRFECCRLRLNEEKTKIVYCKSSRQKECYPNVTFDFLGFTF

QPRESVDKYGNRFTGFLPAISRKSMKRINETMRSWHLNRHSNLTLEHLASDINPIVRG

WMTYYGKFYPTRLKWFMQTLNGRLARWVMCKFERYRHRFYPAQEWLARIAEKEGLIFY

HWKCGALPRFTNKEKVSSQLIMVK

[top]


[Secondary structure]

 

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |