
Information on the A.v.I5 intron

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

[Intron sequence]

 

Sequence from the GenBank entry. The intron boundaries are identified in red and the ORF in blue, with start and stop codons underlined.

               

5' end

                                                                     gt
26221 gcgccagtta aagctggcgg aggatagtcg atgaaagttc gatactggta agaaatggcg
26281 aatcaaccag cccccgagtc atgcacgttg caccgtgagg tgcagggtga agcgttgaca
26341 ggggaaaccg atgggccagc cattgagccg cgtaatcaga aatccgggac gccgatgctg
26401 ttaagcgaag cagaaggcaa tacggggcat ggcgttatac gccagtcatg ccctggtcct
26461 gcgcggtcgg agaccctgcg cacgtcggga agtccttcgc acagaaactg ggagatctca
26521 gtgatgcccg gtgcgcatgc actgggcgga gcaggcaagg ccaaaagctg taaccctgct
26581 gtctacgtcg ctgagaagtc ggatacgtcc gtagtacctg agaaaccgtc gaacaaaggg
26641 gctagccctg cggagagggt ggaggaaagg ggcgtagcca aggggaacac cgacaagaac
26701 cccacgcccc ggacgctgag ccggagcagt tgcgtgtcga tgggacttga aggtgtacgt
26761 gaagcagccc gtagggacaa gggtatgcag ttcacggcat tgttgcacca catcacaccg
26821 caactgttgg cgcagagctt ctacgcactt cgccgcgacg cggcggtggg agtggacggt
26881 atgtcgtggc gagaatacga agaagacctc caccagaggg tgggcaagtt gcacgcacgg
26941 ctccacagag gagcctatcg ggcaacgccg tcacggcggg tatatatccc caaagctgat
27001 ggcaggcagc ggccactggg tattgcctct ttggaggaca agatcgtaca gcaggcagtc
27061 gtcaccgttc tgaatgcgat ctatgaggag gatttccagg ggttctcgta tgggtttcgg
27121 ccgggacgaa gccagcacga tgcgctggat gcgttgacgg tcgcgctcaa gagccagaag
27181 gtgaactgga tactggacgc ggacatcacg tcgttctttg acgagatcga ccatgaatgg
27241 atgctgatgt ttctggggca ccgaattgca gaccgacgca tgctcggact catctgcaaa
27301 tggcttcagg cgggagtaat ggaggatggc cgtaggttgg ctgcgaccaa ggggactccc
27361 caaggcgcag tgatatcgcc gttgctggcg aatatctatc ttcactacgt gctggatctg
27421 tgggcaaggc agtggcgcca acggcatgcc cgtggcgaga tgattgttgt gcgctacgcg
27481 gacgacagcg tggtgggttt caggacgcaa tggcaggcgc agcgtttcct ggtgcagttg
27541 caggaacgca tggccaggtt cggtctatct cttaatgcct cgaaaacacg gctgatcgag
27601 ttcggtcgct ttgctgtaca gaatcgcagg cggcaagggc tgggcaaacc ggagacgttt
27661 gatttcctgg gcttcactca ttgctgtagt accaacagaa gcgggggatt tcaaatcctg
27721 cggctgacgg tcaagaagcg gatgcgtgcg acactgcaag ccatccggat agcactgaat
27781 cgccggcgac atgagccgat ccgagtcgta ggtcaatggc tcggcagtgt ggttggagga
27841 tatttcaact accacgccgt gccgggaaac ctgatccgtc ttgacggttt tcgggtggcg
27901 gtttgccgtc tttggcggca agccctcaaa cggcgcagcc agcgtaaccg gctccagtgg
27961 tcgcgctatg gacgccttgc agacctctat ataccaagac ccagaactgc acatccttac
28021 cctgaggagc gcttcgcgtc acgtacctga ggcaggagcc gtatgcggta gttccgcacg
28081 tacggatctg tgcggggggt ggcaggtaac tgccattcct accgcgac

3' end  
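The coordinate-numbered lines above can be joined back into one contiguous string, so that a region can be pulled out by its GenBank position. The following is a minimal sketch, not part of the original entry: the function names parse_numbered_sequence and subsequence are hypothetical, and it assumes every sequence line starts with its coordinate, or is a continuation of the previous line. Tokens that appear before the first coordinate are skipped.

```python
def parse_numbered_sequence(text):
    """Join GenBank-style lines ('26221 gcgccagtta ...') into (start, sequence)."""
    start = None
    chunks = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines
        if tokens[0].isdigit():
            if start is None:
                start = int(tokens[0])  # coordinate of the first base
            tokens = tokens[1:]
        elif start is None:
            continue  # ignore anything before the first numbered line
        chunks.extend(tokens)
    return start, "".join(chunks)


def subsequence(start, seq, first, last):
    """Return bases first..last (1-based, inclusive) in entry coordinates."""
    return seq[first - start:last - start + 1]


# The first two lines of the intron sequence, copied from this entry.
example = """
26221 gcgccagtta aagctggcgg aggatagtcg atgaaagttc gatactggta agaaatggcg
26281 aatcaaccag cccccgagtc atgcacgttg caccgtgagg tgcagggtga agcgttgaca
"""

start, seq = parse_numbered_sequence(example)
print(start, len(seq))                         # 26221 120
print(subsequence(start, seq, 26221, 26223))   # gcg
```

Slicing by entry coordinates rather than by string index keeps the positions comparable with the flanking-sequence listing, which uses the same numbering.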



[Intron and flanking sequence]

 

25621 tcgccggcga aattgaattc ttcgtcgccg acacccgccg cttcgaggcc gaccccgact
25681 accgggtggt acgcctgcaa ccgcagcgct ggacgttctg ctgccgggag gatcacccac
25741 tggcgcaaac cggggcgccg acctgccggg agctgttcgc cttcccgctg gccaccacct
25801 tccgtccgcc gaacattcgc aaggtattgg tggattacag cggtcgacgt gacttccagc
25861 ccgccgtgga gtgcgagcac tcctacgcgc tgctcaacgt ggtgctgcat tcggacgcca
25921 tcggcatcgg cagtgcgctc aacctggagc cctacatcct gcgcggcaac ctcaggctcc
25981 tgaccccgcg ggatctgccc gagcatctgg aggagctgca cacccgctac ggaatcgtca
26041 gccgcaacgg gcgcacgctc tcgccgctgg cccaggcgat gatcgcccgc atcgaataca
26101 ccgaccgcgc tcttacggaa cggttgtccg gcctgccgtc gacctgcgga tgctgttgat
26161 gcacgaggct cgctcgcttg tcacctgggg ccaaaggacc gagtgtcttg aacgcttagt
26221 gcgccagtta aagctggcgg aggatagtcg atgaaagttc gatactggta agaaatggcg
26281 aatcaaccag cccccgagtc atgcacgttg caccgtgagg tgcagggtga agcgttgaca
26341 ggggaaaccg atgggccagc cattgagccg cgtaatcaga aatccgggac gccgatgctg
26401 ttaagcgaag cagaaggcaa tacggggcat ggcgttatac gccagtcatg ccctggtcct
26461 gcgcggtcgg agaccctgcg cacgtcggga agtccttcgc acagaaactg ggagatctca
26521 gtgatgcccg gtgcgcatgc actgggcgga gcaggcaagg ccaaaagctg taaccctgct
26581 gtctacgtcg ctgagaagtc ggatacgtcc gtagtacctg agaaaccgtc gaacaaaggg
26641 gctagccctg cggagagggt ggaggaaagg ggcgtagcca aggggaacac cgacaagaac
26701 cccacgcccc ggacgctgag ccggagcagt tgcgtgtcga tgggacttga aggtgtacgt
26761 gaagcagccc gtagggacaa gggtatgcag ttcacggcat tgttgcacca catcacaccg
26821 caactgttgg cgcagagctt ctacgcactt cgccgcgacg cggcggtggg agtggacggt
26881 atgtcgtggc gagaatacga agaagacctc caccagaggg tgggcaagtt gcacgcacgg
26941 ctccacagag gagcctatcg ggcaacgccg tcacggcggg tatatatccc caaagctgat
27001 ggcaggcagc ggccactggg tattgcctct ttggaggaca agatcgtaca gcaggcagtc
27061 gtcaccgttc tgaatgcgat ctatgaggag gatttccagg ggttctcgta tgggtttcgg
27121 ccgggacgaa gccagcacga tgcgctggat gcgttgacgg tcgcgctcaa gagccagaag
27181 gtgaactgga tactggacgc ggacatcacg tcgttctttg acgagatcga ccatgaatgg
27241 atgctgatgt ttctggggca ccgaattgca gaccgacgca tgctcggact catctgcaaa
27301 tggcttcagg cgggagtaat ggaggatggc cgtaggttgg ctgcgaccaa ggggactccc
27361 caaggcgcag tgatatcgcc gttgctggcg aatatctatc ttcactacgt gctggatctg
27421 tgggcaaggc agtggcgcca acggcatgcc cgtggcgaga tgattgttgt gcgctacgcg
27481 gacgacagcg tggtgggttt caggacgcaa tggcaggcgc agcgtttcct ggtgcagttg
27541 caggaacgca tggccaggtt cggtctatct cttaatgcct cgaaaacacg gctgatcgag
27601 ttcggtcgct ttgctgtaca gaatcgcagg cggcaagggc tgggcaaacc ggagacgttt
27661 gatttcctgg gcttcactca ttgctgtagt accaacagaa gcgggggatt tcaaatcctg
27721 cggctgacgg tcaagaagcg gatgcgtgcg acactgcaag ccatccggat agcactgaat
27781 cgccggcgac atgagccgat ccgagtcgta ggtcaatggc tcggcagtgt ggttggagga
27841 tatttcaact accacgccgt gccgggaaac ctgatccgtc ttgacggttt tcgggtggcg
27901 gtttgccgtc tttggcggca agccctcaaa cggcgcagcc agcgtaaccg gctccagtgg
27961 tcgcgctatg gacgccttgc agacctctat ataccaagac ccagaactgc acatccttac
28021 cctgaggagc gcttcgcgtc acgtacctga ggcaggagcc gtatgcggta gttccgcacg
28081 tacggatctg tgcggggggt ggcaggtaac tgccattcct accgcgacct ataggaacct
28141 agctgtggac aaccatatcg ctcggaatcg tgcgttgaca tcctccccgt cctttaggac
28201 ggggaggatg tgcgcgccca gcatgggcgc gatcttggag gtgaaagttc tcccacgagc
28261 tggctacagc gagtgacggc tgccccctat gccgatcaaa ggaaggtgaa gatgtcttcg
28321 gtcggcaggc gcgacttcgg cagcttggcg ttgaagtcgt cttcgctgcg atagcccagg
28381 ctgacgatca ccacgctggt gaagccgcgc tcgcgcaggc ccagttcttc gtcgagcctg
28441 tggaagtcga agccctcgat gggtgtggcg tccacgcccg aggcggcggc gccgagcagc



[ORF sequence]

 

MSWREYEEDLHQRVGKLHARLHRGAYRATPSRRVYIPKADGRQRPLGIASLEDKIVQQ

AVVTVLNAIYEEDFQGFSYGFRPGRSQHDALDALTVALKSQKVNWILDADITSFFDEI

DHEWMLMFLGHRIADRRMLGLICKWLQAGVMEDGRRLAATKGTPQGAVISPLLANIYL

HYVLDLWARQWRQRHARGEMIVVRYADDSVVGFRTQWQAQRFLVQLQERMARFGLSLN

ASKTRLIEFGRFAVQNRRRQGLGKPETFDFLGFTHCCSTNRSGGFQILRLTVKKRMRA

TLQAIRIALNRRRHEPIRVVGQWLGSVVGGYFNYHAVPGNLIRLDGFRVAVCRLWRQA

LKRRSQRNRLQWSRYGRLADLYIPRPRTAHPYPEERFASRT
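The protein above can be checked against the nucleotide sequence by translating the ORF, which starts at the atg at position 26881. The following is a minimal sketch under one assumption not stated in this entry: that the standard genetic code applies. The names CODON_TABLE and translate are illustrative only.

```python
# Standard genetic code, built in the conventional t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = dna.lower().replace(" ", "")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 60 bases of the ORF (line 26881) and its last 27 bases (line 28021).
orf_start = "atgtcgtggc gagaatacga agaagacctc caccagaggg tgggcaagtt gcacgcacgg"
orf_end = "gaggagc gcttcgcgtc acgtacctga"

print(translate(orf_start))  # MSWREYEEDLHQRVGKLHAR
print(translate(orf_end))    # EERFASRT
```

Both fragments reproduce the corresponding ends of the listed protein, and the final tga codon sits at positions 28048 to 28050.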



[Secondary structure]

 


