[Back to introns by organism] [Back to home page]
Information for Pe.th.I2 intron (Format of information for each intron)
[Intron and flanking sequence]
Sequence
from Genbank entry. Intron is identified in red.
The ORF
is identified in blue with start and stop codons
underlined.
5' end
gagcgtc gtgaaacttc cgggtaacca
2521
gcgggttgaa atgtacctgt cacggtaatt tctcgtacac cgtgaagtcg tcaggcaggc
2581
gtcttcgggg tgacctaaga cgtcaagctg cctgaaaagg ccaaaaggcc gagacaggat
2641
gcaggccgta gcgaaagcga accagtggtg gcctcgaaaa cgcagttgct gttggcgacc
2701
ctgcctcaac aaggggaagc ctgccacttg ccgcgaagaa acgaggaaat gcagcggcag
2761
ggacagccgt ggtgattagg ggcggcatgt atccaaggtt agccggaaag acacgagagg
2821
cctctatggg cggaagtggc gcttccaaag gcaggatata aggtccaagc ggccgaaatt
2881
cctgacctgt ccatagagga gtcggaagtg cccatactat ctatgacgac tggacagcat
2941
aaccagtctg agaaaagggg cactgcttca ccaatgtttg catgatagga ggacggctga
3001
ttgcaaccgc tactacagtt cccggaaggg
aaagtccgca actttcagcg caaactctac
3061
gtcaaggcca aacaggaaaa gacatttcga ttttacagcc tctacgacaa actttaccgg
3121
gaagatgttc tccagtatgc atggcagcag tgccgggcaa acaaaggtgc tcccggagca
3181
gacgggcaga gttttaaaga catcgaagaa aaagtcggag tcgaaagatt tctaaaagaa
3241
atcgccgaag aacttcgcaa cgggacatat cgcccaatgc ctgtcaggcg ggtatacatc
3301
ctaaagccgg atggcagcca gcgtccacta ggtataccca ccatcaaaga cagaatcgca
3361
caaatggcat gccttaccgt aatccaacca atattcgaag cagactttct agactgctcc
3421
tatggatttc ggcccaaacg taatgcccac caggccatag gggctattac agaaaatatc
3481
aaacaagggt ttaccgccgt atacgatgct gacctgacaa aatgctttga tagcatacag
3541
cacaggttga tcatggactc tctggcggag cgtataacag acgggaaagt actgcgcctg
3601
attaagggat ggctcgaagc acctatagtg gaacccggtg gcccgaagca gggaaggaag
3661
aattaccagg gcacgcccca gggaggggta atatcccctc tactcgccaa tatcgtcctg
3721
aacaggctgg ataggctctg gcataggccg ggagggccgc gcgagaggta taacgcaagg
3781
ttagtgcgct atgctgacga ctttgtggtt ttggccaggt ttatcggcga acccattaag
3841
aacgaattag agtctatcat cacgtcaatg gggctcaacc tcaacgagaa aaagacacgc
3901
atacttgacc ttaacaaagg ggacatctta aactttctcg gatacagcat ccgtatcagc
3961
cgggataaga atcggcgcat aacgataaag ccgagcgata aagcaattgc acggttgcgc
4021
gataaaatac gtgaaatcat ctcccgggag agactatatc acggattaaa gggaataatc
4081
gcagaaataa atcctgtgct aagaggctgg aagcagtatt ttaagctaac taatgtcagt
4141
aggatattct caggcttaaa cttttatatt actgctcgat tttaccgtgt aggaaggaaa
4201
accagtcagc ggtatagcaa gatcttcaag ccaggggtct acgtaacctt acgtaaaatg
4261
gggttatact gccttgccac tgattgacct
gtgaatgcct tacgtgaatg gtgtcggata
4321
gccgtatgag ggaaaacctc acgtacggtt agatgagggt ctgctggcat attggccatg
4381
gtcaggctag tgaggcactc ccagaggaaa cggggagaaa ctgataggca ttgacctaaa
4441 tcttttatgc cagggtctac tctac
3' end
[top]
[Intron and flanking sequence]
2161
tgaacctggc cagggaattc cagggaagaa catatgattc tatggtggcc cataccacga
2221
ttgtcttctg ccgttacatc atgctcgctc tagaaaaccg tgaaagcaag gacccgagga
2281
ctctcggtga tctattttat gtttgctgtg acgaactgca ggacataagt tttgcagagg
2341
catttcaatt gcttttggca atgctcaaaa atacgttgag aaaatttctt gcaataacag
2401
atggtgcttt gcaggagcta gtcaataatt ttatttcctg tttgccttcg ttcttaaagg
2461
gtcgcctcaa actttcgccc tgcgaaagtt gaggagcgtc gtgaaacttc
cgggtaacca
2521
gcgggttgaa atgtacctgt cacggtaatt tctcgtacac cgtgaagtcg tcaggcaggc
2581
gtcttcgggg tgacctaaga cgtcaagctg cctgaaaagg ccaaaaggcc gagacaggat
2641
gcaggccgta gcgaaagcga accagtggtg gcctcgaaaa cgcagttgct gttggcgacc
2701
ctgcctcaac aaggggaagc ctgccacttg ccgcgaagaa acgaggaaat gcagcggcag
2761
ggacagccgt ggtgattagg ggcggcatgt atccaaggtt agccggaaag acacgagagg
2821
cctctatggg cggaagtggc gcttccaaag gcaggatata aggtccaagc ggccgaaatt
2881
cctgacctgt ccatagagga gtcggaagtg cccatactat ctatgacgac tggacagcat
2941
aaccagtctg agaaaagggg cactgcttca ccaatgtttg catgatagga ggacggctga
3001
ttgcaaccgc tactacagtt cccggaaggg aaagtccgca actttcagcg caaactctac
3061
gtcaaggcca aacaggaaaa gacatttcga ttttacagcc tctacgacaa actttaccgg
3121
gaagatgttc tccagtatgc atggcagcag tgccgggcaa acaaaggtgc tcccggagca
3181
gacgggcaga gttttaaaga catcgaagaa aaagtcggag tcgaaagatt tctaaaagaa
3241
atcgccgaag aacttcgcaa cgggacatat cgcccaatgc ctgtcaggcg ggtatacatc
3301
ctaaagccgg atggcagcca gcgtccacta ggtataccca ccatcaaaga cagaatcgca
3361
caaatggcat gccttaccgt aatccaacca atattcgaag cagactttct agactgctcc
3421
tatggatttc ggcccaaacg taatgcccac caggccatag gggctattac agaaaatatc
3481
aaacaagggt ttaccgccgt atacgatgct gacctgacaa aatgctttga tagcatacag
3541
cacaggttga tcatggactc tctggcggag cgtataacag acgggaaagt actgcgcctg
3601
attaagggat ggctcgaagc acctatagtg gaacccggtg gcccgaagca gggaaggaag
3661
aattaccagg gcacgcccca gggaggggta atatcccctc tactcgccaa tatcgtcctg
3721
aacaggctgg ataggctctg gcataggccg ggagggccgc gcgagaggta taacgcaagg
3781
ttagtgcgct atgctgacga ctttgtggtt ttggccaggt ttatcggcga acccattaag
3841
aacgaattag agtctatcat cacgtcaatg gggctcaacc tcaacgagaa aaagacacgc
3901
atacttgacc ttaacaaagg ggacatctta aactttctcg gatacagcat ccgtatcagc
3961
cgggataaga atcggcgcat aacgataaag ccgagcgata aagcaattgc acggttgcgc
4021
gataaaatac gtgaaatcat ctcccgggag agactatatc acggattaaa gggaataatc
4081
gcagaaataa atcctgtgct aagaggctgg aagcagtatt ttaagctaac taatgtcagt
4141
aggatattct caggcttaaa cttttatatt actgctcgat tttaccgtgt aggaaggaaa
4201
accagtcagc ggtatagcaa gatcttcaag ccaggggtct acgtaacctt acgtaaaatg
4261
gggttatact gccttgccac tgattgacct gtgaatgcct tacgtgaatg gtgtcggata
4321
gccgtatgag ggaaaacctc acgtacggtt agatgagggt ctgctggcat attggccatg
4381
gtcaggctag tgaggcactc ccagaggaaa cggggagaaa ctgataggca ttgacctaaa
4441
tcttttatgc cagggtctac tctacttttt aaaacctttc aagatgaata caaccttagg
4501
ccacacaaaa attaaaagaa tctcccgagg aaaaatttaa gcatgaaaaa gattatttgc
4561
agccagcagt tcttgtaaag ccaagtgtct tgtatttccg ggaaccaaaa agagtcagta
4621
acgacggcta catctcctgc aaagggaatc tttatcccgt acccatgcac ctgtgtttga
4681
agatagtctg ggtagaatcc atctacggcc ggaaatttaa aagtttacga tgaaaaagga
4741
atactggcca acgaacagga gttttgtttg taaaagcaag ccgagcggac tacccaccca
[top]
MQPLLQFPEGKVRNFQRKLYVKAKQEKTFRFYSLYDKLYREDVLQYAWQQCRANKGAP
GADGQSFKDIEEKVGVERFLKEIAEELRNGTYRPMPVRRVYILKPDGSQRPLGIPTIK
DRIAQMACLTVIQPIFEADFLDCSYGFRPKRNAHQAIGAITENIKQGFTAVYDADLTK
CFDSIQHRLIMDSLAERITDGKVLRLIKGWLEAPIVEPGGPKQGRKNYQGTPQGGVIS
PLLANIVLNRLDRLWHRPGGPRERYNARLVRYADDFVVLARFIGEPIKNELESIITSM
GLNLNEKKTRILDLNKGDILNFLGYSIRISRDKNRRITIKPSDKAIARLRDKIREIIS
RERLYHGLKGIIAEINPVLRGWKQYFKLTNVSRIFSGLNFYITARFYRVGRKTSQRYS
KIFKPGVYVTLRKMGLYCLATD
[top]
[top]
| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragments | Alignment of insertion sequence |
| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |