Information for intron My.va.I1
Sequence from GenBank entry. Intron is identified in red. The ORF is identified in blue with start and stop codons underlined.
5' end
t tgcgtcctga aagttggact
3001 ggactagcgg gtggaagtcc cgtcccggta ggtaccggag cgccgggtag caggccccgg
3061 ttcgtcgtgg agacgcggcg ggctgagcgg ggtgtcgaga gcctcttcgg agggagcaag
3121 tgtgcgggcc gtagcgttaa gcgaactctg cagcctcgtc aaactcacaa cgggagagct
3181 gagccgctca tgtcacggcg aaggccatgt ccggcgagcc ctggtccggg gttagctcgt
3241 cgggtctctc cggggtacgg gagatggcac gtgcacacag tctggttcgg aacaggagag
3301 acccgtctgc cccgccttcg tcgggcaaag accaagggta taagccgatg gtgaaatccc
3361 ttggagggca gcgggagtcc gatggggtcg tagtaccgct gatcggcgtg caacataacg
3421 cgccgggagg gaagggccct gactttgatc acgcgggtga agcgggtaag tgcgagggca
3481 tgaccgggat gtctcggtcc aaccaccccg gcaggcgatt gcctgtcgta gccgacgagg
3541 agccgcttgc ggtgtcgccg gtgaaagtgc gacaactgca acgcacgcta tgggctgcgg
3601 ccaagcagtc gcagggtcgg cgtttccacg ccctgtatga ccgtgtctac aggggtgacg
3661 tcctgtggga ggcgtgggag cgggtgcgta agaacagggg tgcggccggg gtggatcgtg
3721 tcaccttggt cgcggtggag gagtacggcg tggaccgcat gttgcgtgag ttgcgccatg
3781 acctccgcga gggtgtgtac tgtccggcgc cggcgcgtcg ggtggagatc ccgaaaccac
3841 ggggcggtac gcggccgttg gggattccga cggtgcggga ccgggtggcc caggcggcgg
3901 ccaagatcgt gctggaaccg atcttcgagg cggacttcat gtcgtgctcg tatgggtttc
3961 ggccgaaacg gtcggccacg caggcgatgg aacggcttcg tgtcggcttc atcgagggct
4021 cccagtttgt ggtcgagttc gacatcgcca atttcttcgg cgagatcgac cacgaccggc
4081 tactggctga ggtcagcaga cgggtctcgg accggcgggt gctcaaactg ctgcggttgt
4141 ggctgcaggc gggagtgatg gtggacgggg tggtgtcgcg gacggtcgcg ggcactccgc
4201 agggcggggt gatctcgccg ttgttggcca acatctatct gcatgtgctc gacaccgaac
4261 tcgctcgacg caatgtgggt gagttggtgc gctacgccga tgacggtgtg gtgttgtgcc
4321 gcagcgcggc ccaagctgag cacgctctgg cggcggtggg ggagatcctg gcgtcgttgg
4381 ggttgcggct acatccggac aagacgaagg tggtcgacct gcgtgagggc ggtgagggcc
4441 tggactttct gggttgtcac ttccgggcac gtatgtcggg gcggttgtgg gaacagaggc
4501 gcatcgtgcg ctactacctg caccgctggc ccagccaaac ggcgatggtt cgcttgcggg
4561 agaaggttcg tgagcgcacc ggccgtaacc gggtcggatt cgacatccgt gatgtgatcg
4621 cggtgttgaa tccgatcttg cgtggctggg gcaactactt tcgcaccggc aacgccgccg
4681 acaagttccg ccagatcgac cactacgtca cgcggcgcct gaaggagctt ctgatcaaga
4741 agcgcggtcg gaacctgcgt gctggacaag ccgatcagtg gactgaagag tggttcaacg
4801 ggcacggcct gcaccgcctg cgcggcacta tccgctaccc gaaggcagcg taaccatgca
4861 cagaagatca tcggtaagcc gtgtgcggga aaaccgcacg cacggattga aagggggatg
4921 gggaaacgga tccgctctgc ggacaccgcg cccctgacta ccaatg
3' end
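The listing above uses GenBank-style numbering: each full line starts with the position of its first base, followed by blocks of ten bases. As a minimal sketch of how such a listing could be turned into a plain string with usable coordinates, the following Python helpers may be useful. The function names `parse_numbered_sequence` and `subsequence` are hypothetical and not part of this database. The parser assumes its input contains only sequence lines and position numbers, and it tolerates a position number sitting on its own line or a partial line of bases before the first number.

```python
import re

BASES_ONLY = re.compile(r"[acgtn]+", re.IGNORECASE)

def parse_numbered_sequence(text):
    """Return (start, seq) for a GenBank-style numbered listing.

    start is the 1-based position of the first base in seq.
    Assumption: the text holds only position numbers and base blocks.
    """
    start = None
    chunks = []
    for line in text.splitlines():
        tokens = line.split()
        if tokens and tokens[0].isdigit():
            if start is None:
                # Bases seen before the first number shift the start back.
                start = int(tokens[0]) - sum(len(c) for c in chunks)
            tokens = tokens[1:]
        # Keep only tokens made of nucleotide letters; skip labels.
        chunks.extend(t.lower() for t in tokens if BASES_ONLY.fullmatch(t))
    if start is None:
        raise ValueError("no position number found in listing")
    return start, "".join(chunks)

def subsequence(start, seq, first, last):
    """Slice seq using 1-based, inclusive coordinates first..last."""
    return seq[first - start : last - start + 1]
```

For example, parsing the two lines numbered 3301 and 3361 and calling `subsequence(start, seq, 3348, 3350)` should return the bases at those positions. The coordinates here are obtained by counting along the listing on this page, not from an annotation in the record.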
[Intron and flanking sequence]
2641 aatgtcatca ggggcgagat gagtctggtc ggtccgcgtc cggagcgccc ggagtacgtc
2701 gacctgttca acgtccagat cgcccgctac ggtgaccggc accgggtcaa ggccgggatc
2761 accggttggg cgcaggtgca cggcctgcgc gggcagacgt cgatcgccga ccgcgccgag
2821 tgggacaact tctacatcga gaactggtcg gtgttgctgg actggaagat cctggcgatg
2881 accgtcgggg cggtgctgcg ccgggccgag tgagcccggg gcgcgacctt ctcgacattg
2941 cggaccccgc cggtacgctt tggtaccgtt cgggccgtgt tgcgtcctga aagttggact
3001 ggactagcgg gtggaagtcc cgtcccggta ggtaccggag cgccgggtag caggccccgg
3061 ttcgtcgtgg agacgcggcg ggctgagcgg ggtgtcgaga gcctcttcgg agggagcaag
3121 tgtgcgggcc gtagcgttaa gcgaactctg cagcctcgtc aaactcacaa cgggagagct
3181 gagccgctca tgtcacggcg aaggccatgt ccggcgagcc ctggtccggg gttagctcgt
3241 cgggtctctc cggggtacgg gagatggcac gtgcacacag tctggttcgg aacaggagag
3301 acccgtctgc cccgccttcg tcgggcaaag accaagggta taagccgatg gtgaaatccc
3361 ttggagggca gcgggagtcc gatggggtcg tagtaccgct gatcggcgtg caacataacg
3421 cgccgggagg gaagggccct gactttgatc acgcgggtga agcgggtaag tgcgagggca
3481 tgaccgggat gtctcggtcc aaccaccccg gcaggcgatt gcctgtcgta gccgacgagg
3541 agccgcttgc ggtgtcgccg gtgaaagtgc gacaactgca acgcacgcta tgggctgcgg
3601 ccaagcagtc gcagggtcgg cgtttccacg ccctgtatga ccgtgtctac aggggtgacg
3661 tcctgtggga ggcgtgggag cgggtgcgta agaacagggg tgcggccggg gtggatcgtg
3721 tcaccttggt cgcggtggag gagtacggcg tggaccgcat gttgcgtgag ttgcgccatg
3781 acctccgcga gggtgtgtac tgtccggcgc cggcgcgtcg ggtggagatc ccgaaaccac
3841 ggggcggtac gcggccgttg gggattccga cggtgcggga ccgggtggcc caggcggcgg
3901 ccaagatcgt gctggaaccg atcttcgagg cggacttcat gtcgtgctcg tatgggtttc
3961 ggccgaaacg gtcggccacg caggcgatgg aacggcttcg tgtcggcttc atcgagggct
4021 cccagtttgt ggtcgagttc gacatcgcca atttcttcgg cgagatcgac cacgaccggc
4081 tactggctga ggtcagcaga cgggtctcgg accggcgggt gctcaaactg ctgcggttgt
4141 ggctgcaggc gggagtgatg gtggacgggg tggtgtcgcg gacggtcgcg ggcactccgc
4201 agggcggggt gatctcgccg ttgttggcca acatctatct gcatgtgctc gacaccgaac
4261 tcgctcgacg caatgtgggt gagttggtgc gctacgccga tgacggtgtg gtgttgtgcc
4321 gcagcgcggc ccaagctgag cacgctctgg cggcggtggg ggagatcctg gcgtcgttgg
4381 ggttgcggct acatccggac aagacgaagg tggtcgacct gcgtgagggc ggtgagggcc
4441 tggactttct gggttgtcac ttccgggcac gtatgtcggg gcggttgtgg gaacagaggc
4501 gcatcgtgcg ctactacctg caccgctggc ccagccaaac ggcgatggtt cgcttgcggg
4561 agaaggttcg tgagcgcacc ggccgtaacc gggtcggatt cgacatccgt gatgtgatcg
4621 cggtgttgaa tccgatcttg cgtggctggg gcaactactt tcgcaccggc aacgccgccg
4681 acaagttccg ccagatcgac cactacgtca cgcggcgcct gaaggagctt ctgatcaaga
4741 agcgcggtcg gaacctgcgt gctggacaag ccgatcagtg gactgaagag tggttcaacg
4801 ggcacggcct gcaccgcctg cgcggcacta tccgctaccc gaaggcagcg taaccatgca
4861 cagaagatca tcggtaagcc gtgtgcggga aaaccgcacg cacggattga aagggggatg
4921 gggaaacgga tccgctctgc ggacaccgcg cccctgacta ccaatggatc gtgtcaacgc
4981 gttcggcgac gacgccctgg gtgacctcga cgccgtcggt ctggtggacg aactgcgccg
5041 cggggcggtg tcggcgagcg aactcgtcga ggccgcgatc gcgcgcaccg agaaggcgaa
5101 cccggcgctg aacggactca ccttcgaggc atacgagcgg gccaggacgc gggcccgact
5161 gcaccgcccc tacggtggct tcttcgacgg ggtgccgtcg ttcgtcaagg acaacgtggc
5221 cgtcgccggg atgccggcga tgaacggcac cgacgcctgg gagcccgtcc ccgccgccca
MVKSLGGQRESDGVVVPLIGVQHNAPGGKGPDFDHAGEAGKCEGMTGMSRSNHPGRRL
PVVADEEPLAVSPVKVRQLQRTLWAAAKQSQGRRFHALYDRVYRGDVLWEAWERVRKN
RGAAGVDRVTLVAVEEYGVDRMLRELRHDLREGVYCPAPARRVEIPKPRGGTRPLGIP
TVRDRVAQAAAKIVLEPIFEADFMSCSYGFRPKRSATQAMERLRVGFIEGSQFVVEFD
IANFFGEIDHDRLLAEVSRRVSDRRVLKLLRLWLQAGVMVDGVVSRTVAGTPQGGVIS
PLLANIYLHVLDTELARRNVGELVRYADDGVVLCRSAAQAEHALAAVGEILASLGLRL
HPDKTKVVDLREGGEGLDFLGCHFRARMSGRLWEQRRIVRYYLHRWPSQTAMVRLREK
VRERTGRNRVGFDIRDVIAVLNPILRGWGNYFRTGNAADKFRQIDHYVTRRLKELLIK
KRGRNLRAGQADQWTEEWFNGHGLHRLRGTIRYPKAA
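
The protein sequence above can be checked against the nucleotide listing by translating the ORF codon by codon. The following is a minimal sketch, assuming the standard genetic code and a reading frame that begins at the first base supplied. The names `CODON_TABLE` and `translate` are hypothetical helpers, not tools provided by this database.

```python
from itertools import product

# Standard genetic code, codons ordered with t, c, a, g at each position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(orf):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so blocks of ten bases can be pasted directly.
    Codons containing letters other than a, c, g, t become 'X'.
    """
    orf = "".join(orf.lower().split())
    protein = []
    for i in range(0, len(orf) - 2, 3):
        aa = CODON_TABLE.get(orf[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)
```

Applied to the bases beginning at the ORF start codon (`atggtgaaat cccttggagg gcag ...`), the output should begin `MVKSLGGQ`, which matches the first residues of the translation shown above.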