[Back to introns by organism]  [Back to home page]

Information for Pe.th.I2 intron  (Format of information for each intron

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence]

 

Sequence from Genbank entry.  Intron is identified in red.  The ORF
is identified in blue with start and stop codons underlined.  

 

5' end

                                         gagcgtc gtgaaacttc cgggtaacca

2521 gcgggttgaa atgtacctgt cacggtaatt tctcgtacac cgtgaagtcg tcaggcaggc

2581 gtcttcgggg tgacctaaga cgtcaagctg cctgaaaagg ccaaaaggcc gagacaggat

2641 gcaggccgta gcgaaagcga accagtggtg gcctcgaaaa cgcagttgct gttggcgacc

2701 ctgcctcaac aaggggaagc ctgccacttg ccgcgaagaa acgaggaaat gcagcggcag

2761 ggacagccgt ggtgattagg ggcggcatgt atccaaggtt agccggaaag acacgagagg

2821 cctctatggg cggaagtggc gcttccaaag gcaggatata aggtccaagc ggccgaaatt

2881 cctgacctgt ccatagagga gtcggaagtg cccatactat ctatgacgac tggacagcat

2941 aaccagtctg agaaaagggg cactgcttca ccaatgtttg catgatagga ggacggctga

3001 ttgcaaccgc tactacagtt cccggaaggg aaagtccgca actttcagcg caaactctac

3061 gtcaaggcca aacaggaaaa gacatttcga ttttacagcc tctacgacaa actttaccgg

3121 gaagatgttc tccagtatgc atggcagcag tgccgggcaa acaaaggtgc tcccggagca

3181 gacgggcaga gttttaaaga catcgaagaa aaagtcggag tcgaaagatt tctaaaagaa

3241 atcgccgaag aacttcgcaa cgggacatat cgcccaatgc ctgtcaggcg ggtatacatc

3301 ctaaagccgg atggcagcca gcgtccacta ggtataccca ccatcaaaga cagaatcgca

3361 caaatggcat gccttaccgt aatccaacca atattcgaag cagactttct agactgctcc

3421 tatggatttc ggcccaaacg taatgcccac caggccatag gggctattac agaaaatatc

3481 aaacaagggt ttaccgccgt atacgatgct gacctgacaa aatgctttga tagcatacag

3541 cacaggttga tcatggactc tctggcggag cgtataacag acgggaaagt actgcgcctg

3601 attaagggat ggctcgaagc acctatagtg gaacccggtg gcccgaagca gggaaggaag

3661 aattaccagg gcacgcccca gggaggggta atatcccctc tactcgccaa tatcgtcctg

3721 aacaggctgg ataggctctg gcataggccg ggagggccgc gcgagaggta taacgcaagg

3781 ttagtgcgct atgctgacga ctttgtggtt ttggccaggt ttatcggcga acccattaag

3841 aacgaattag agtctatcat cacgtcaatg gggctcaacc tcaacgagaa aaagacacgc

3901 atacttgacc ttaacaaagg ggacatctta aactttctcg gatacagcat ccgtatcagc

3961 cgggataaga atcggcgcat aacgataaag ccgagcgata aagcaattgc acggttgcgc

4021 gataaaatac gtgaaatcat ctcccgggag agactatatc acggattaaa gggaataatc

4081 gcagaaataa atcctgtgct aagaggctgg aagcagtatt ttaagctaac taatgtcagt

4141 aggatattct caggcttaaa cttttatatt actgctcgat tttaccgtgt aggaaggaaa

4201 accagtcagc ggtatagcaa gatcttcaag ccaggggtct acgtaacctt acgtaaaatg

4261 gggttatact gccttgccac tgattgacct gtgaatgcct tacgtgaatg gtgtcggata

4321 gccgtatgag ggaaaacctc acgtacggtt agatgagggt ctgctggcat attggccatg

4381 gtcaggctag tgaggcactc ccagaggaaa cggggagaaa ctgataggca ttgacctaaa

4441 tcttttatgc cagggtctac tctac                                   

3' end

[top]


[Intron and flanking sequence]

 

2161 tgaacctggc cagggaattc cagggaagaa catatgattc tatggtggcc cataccacga

2221 ttgtcttctg ccgttacatc atgctcgctc tagaaaaccg tgaaagcaag gacccgagga

2281 ctctcggtga tctattttat gtttgctgtg acgaactgca ggacataagt tttgcagagg

2341 catttcaatt gcttttggca atgctcaaaa atacgttgag aaaatttctt gcaataacag

2401 atggtgcttt gcaggagcta gtcaataatt ttatttcctg tttgccttcg ttcttaaagg

2461 gtcgcctcaa actttcgccc tgcgaaagtt gaggagcgtc gtgaaacttc cgggtaacca

2521 gcgggttgaa atgtacctgt cacggtaatt tctcgtacac cgtgaagtcg tcaggcaggc

2581 gtcttcgggg tgacctaaga cgtcaagctg cctgaaaagg ccaaaaggcc gagacaggat

2641 gcaggccgta gcgaaagcga accagtggtg gcctcgaaaa cgcagttgct gttggcgacc

2701 ctgcctcaac aaggggaagc ctgccacttg ccgcgaagaa acgaggaaat gcagcggcag

2761 ggacagccgt ggtgattagg ggcggcatgt atccaaggtt agccggaaag acacgagagg

2821 cctctatggg cggaagtggc gcttccaaag gcaggatata aggtccaagc ggccgaaatt

2881 cctgacctgt ccatagagga gtcggaagtg cccatactat ctatgacgac tggacagcat

2941 aaccagtctg agaaaagggg cactgcttca ccaatgtttg catgatagga ggacggctga

3001 ttgcaaccgc tactacagtt cccggaaggg aaagtccgca actttcagcg caaactctac

3061 gtcaaggcca aacaggaaaa gacatttcga ttttacagcc tctacgacaa actttaccgg

3121 gaagatgttc tccagtatgc atggcagcag tgccgggcaa acaaaggtgc tcccggagca

3181 gacgggcaga gttttaaaga catcgaagaa aaagtcggag tcgaaagatt tctaaaagaa

3241 atcgccgaag aacttcgcaa cgggacatat cgcccaatgc ctgtcaggcg ggtatacatc

3301 ctaaagccgg atggcagcca gcgtccacta ggtataccca ccatcaaaga cagaatcgca

3361 caaatggcat gccttaccgt aatccaacca atattcgaag cagactttct agactgctcc

3421 tatggatttc ggcccaaacg taatgcccac caggccatag gggctattac agaaaatatc

3481 aaacaagggt ttaccgccgt atacgatgct gacctgacaa aatgctttga tagcatacag

3541 cacaggttga tcatggactc tctggcggag cgtataacag acgggaaagt actgcgcctg

3601 attaagggat ggctcgaagc acctatagtg gaacccggtg gcccgaagca gggaaggaag

3661 aattaccagg gcacgcccca gggaggggta atatcccctc tactcgccaa tatcgtcctg

3721 aacaggctgg ataggctctg gcataggccg ggagggccgc gcgagaggta taacgcaagg

3781 ttagtgcgct atgctgacga ctttgtggtt ttggccaggt ttatcggcga acccattaag

3841 aacgaattag agtctatcat cacgtcaatg gggctcaacc tcaacgagaa aaagacacgc

3901 atacttgacc ttaacaaagg ggacatctta aactttctcg gatacagcat ccgtatcagc

3961 cgggataaga atcggcgcat aacgataaag ccgagcgata aagcaattgc acggttgcgc

4021 gataaaatac gtgaaatcat ctcccgggag agactatatc acggattaaa gggaataatc

4081 gcagaaataa atcctgtgct aagaggctgg aagcagtatt ttaagctaac taatgtcagt

4141 aggatattct caggcttaaa cttttatatt actgctcgat tttaccgtgt aggaaggaaa

4201 accagtcagc ggtatagcaa gatcttcaag ccaggggtct acgtaacctt acgtaaaatg

4261 gggttatact gccttgccac tgattgacct gtgaatgcct tacgtgaatg gtgtcggata

4321 gccgtatgag ggaaaacctc acgtacggtt agatgagggt ctgctggcat attggccatg

4381 gtcaggctag tgaggcactc ccagaggaaa cggggagaaa ctgataggca ttgacctaaa

4441 tcttttatgc cagggtctac tctacttttt aaaacctttc aagatgaata caaccttagg

4501 ccacacaaaa attaaaagaa tctcccgagg aaaaatttaa gcatgaaaaa gattatttgc

4561 agccagcagt tcttgtaaag ccaagtgtct tgtatttccg ggaaccaaaa agagtcagta

4621 acgacggcta catctcctgc aaagggaatc tttatcccgt acccatgcac ctgtgtttga

4681 agatagtctg ggtagaatcc atctacggcc ggaaatttaa aagtttacga tgaaaaagga

4741 atactggcca acgaacagga gttttgtttg taaaagcaag ccgagcggac tacccaccca

[top]


[ORF sequence]

 

MQPLLQFPEGKVRNFQRKLYVKAKQEKTFRFYSLYDKLYREDVLQYAWQQCRANKGAP

GADGQSFKDIEEKVGVERFLKEIAEELRNGTYRPMPVRRVYILKPDGSQRPLGIPTIK

DRIAQMACLTVIQPIFEADFLDCSYGFRPKRNAHQAIGAITENIKQGFTAVYDADLTK

CFDSIQHRLIMDSLAERITDGKVLRLIKGWLEAPIVEPGGPKQGRKNYQGTPQGGVIS

PLLANIVLNRLDRLWHRPGGPRERYNARLVRYADDFVVLARFIGEPIKNELESIITSM

GLNLNEKKTRILDLNKGDILNFLGYSIRISRDKNRRITIKPSDKAIARLRDKIREIIS

RERLYHGLKGIIAEINPVLRGWKQYFKLTNVSRIFSGLNFYITARFYRVGRKTSQRYS

KIFKPGVYVTLRKMGLYCLATD

[top]


[Secondary structure]

 

                                                       

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |