[Back to introns by organism]   [Back to home page]

Information for R.m.I1-1 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

Note: Multiple Insertions

R.m.I1-2        CP000352 (3134534-3137479)

 

[Intron sequence]

 

Sequence from Genbank entry.  Intron is identified in red.  The ORF
is identified in blue with start and stop codons underlined.  

               

5' end

                             gtgcgaca cgcactttcg tgtccggcag ctctacgtta
4561 agagcatcga cctttgttga gacaagggaa aggcttcggc cagaaccgtc caatctcaac
4621 catttcttgc tttgacaagc ccagccgtaa ggttggagcg agtccatgtg ttgtgggaag
4681 gccagaccat cggctgactc gcagcttgct atttgctgga tatatgtcca gccattgccc
4741 aaaggcgggg acgcttgcct ttgggcaagt tctctttaaa cgccctactt cggtaggtgc
4801 gggtgggttt gcgactgtag ttttcgttgc atctccgcta catagcattc aaaccggccc
4861 ggttgggtcg gcgctacgtt ctttgggacg ggtctgcgct cgaaagagac cgccgccgtc
4921 aagcgcctgc cactatggag gcactgaagt aatgaagtga ggtttcccgg agcgtatagc
4981 tctgcgcgtt atggttcggt cgaatacacc cgcgagggtg cctcgataga gaacctgtca
5041 tgcgactgac agtagcctag ggagacatgc gtcactggca acggtggggt gtgaagcttt
5101 cctgacaacg tagcccccgc aagggtgcgt cattatcgtg agataaggac gcagagactc
5161 cgagcagcaa tggggtcaag gagccaggct ggctggtatg ggtaatgtaa gtgaactgct
5221 gataaacgcc gtgaatatga cggtgcctaa gctgctgcgt ggcactaacc aaaaggtacg
5281 tggtcggtta ttccgctgga agctcggaat acgtcgctgt aagaccaccg gcgaagaggc
5341 aggccctaac ccggtcttgt aacgcgcagg gaacgtggta agcccgtatg gctgccgggg
5401 cgctgtatgg ctctggcaag cgaaccgcaa ggaacgctgt tggctgtgcg ggtatgggag
5461 gttggagaaa gcgaacgtcg ggctgtaatg gtccggatag gggttgaaac atcatcccac
5521 gcgaaagcgg gcagacttcc aactggtctt tcgtcgcaag acggctgact gaactcgctt
5581 gtttaatttt cttcccttgg ggggtgacat accccgcaag gggcatgttg tctctgctag
5641 ttggggaagt cctcaagata ggagagcagc atggaagcat tgatccgcga ggatgaatct
5701 gcgccctccg gcattccgca gcgatggggc gacatcaatt ggcgccgcgt cgaacagaat
5761 gttcgagcaa tgcagattcg aattgcgaag gcgacacagg aacgggactg gcgcagggtg
5821 aaagctctgc agcggtctct gacccgctcg ttttccgcca aggcatcggc ggtgcgacga
5881 gtgacggaga accaaggtaa acggacggca ggtgtcgacc gcgaactctg ggactcgcct
5941 gaagttcgat gggtagccat tggacggttg agacgccggg ggtatcggcc catgccatta
6001 cggagggtct tcatccccaa agccaacgga aaggaacgcc ctctgggtat cccgaccatg
6061 ttggaccggg ccatgcaggc gctatatctg ctggcgctgg aaccggtgtc cgaggggacg
6121 agcgatccga actcttatgg gttcaggtcc aatcgttcta ctgcggatgc tatgtcccag
6181 ctattcgtca acttatccag aaaggtctcg gcctcatgga tattggaggc ggacatccgc
6241 gggtgttttg accatatcag tcacgactgg ctggagcgca atgtccctat ggacaaagcg
6301 atccttcgga agtggttgaa agctggcgtg gtatttcagg gccagtttca ggcgaccgaa
6361 gccggtacgc cacagggggg catcatctcc ccgaccttag ccaatgtggc gctgaacgga
6421 ctggaacaac agctagctcg gtttctcgaa accacgctgg gtgtcaacca gacccggaag
6481 ctcaaagtaa atgttgtgcg ctatgcggac gatttcgtga tcaccggcag cacgccggaa
6541 gttttggaac acgaggtgaa gccgtgggtg gaacagttcc tcgcaattcg agggttgtcc
6601 ctgtccacgg agaaaacgcg gatcgtgagc attgacgagg gttttgactt ccttgggtgg
6661 aatttccgga agtattccgg aacgctgctc attaagccaa gcaggaaaaa cgcgcaggcg
6721 ttctatcgca aggttaagga ggttatcagt gctaacaaga cggtaaagca ggaggtcatc
6781 attcgcctgc tgaacccgat gctgagaggc tgggcgcagt atcacagccc ggtagtggca
6841 aaagaagcgt tcagcaagat gcagagccga gttttccggg cgctgctttg gtggacaaaa
6901 cggcgacaca gagggaagaa cgccgaatgg gtgcggaaga aatacttcgc ttcgtttggt
6961 gatcgaaact gggtgttcgg gactgagttt gtggaagacg atggggaacg gcgctggcag
7021 gagttgtatt cgcttgctag cacgcccatc agacggcaca agaagattcg gggggacttc
7081 aatcctttcg atccagcgca agagatgtat ggcgaaaatc tacgccggga ccgcatgctg
7141 gaaagcatgt ctcaccggaa acagtggatc aggctatatg tcgatcagcg gggcttatgc
7201 gcggtgtgtc agtgcaagat aaccaaagag accgggtggg atgaccatca catcgtctac
7261 agaacccacg gaggttcaga cgctctagga aatcgcgtat tgctgcatcc caactgccac
7321 gtccaagtac accatcatgg tcgaaccgtt gtgaaaccgg tgctggagat gtccagcatt
7381 ttgtaa
gggc ttgagccgta tgcggggaaa ctcgcacgta cggttcttag gggggagcaa
7441 accagcaatg gtatgctcct accctcc

3' end  

[top]


[Intron and flanking sequence]

 

4081 taagcccatt tatcgaagta ttcgcggcgc ggaccggccc gggcggtccg cgccgatgtt
4141 aagaatacaa agagcataac ggataggacc gagtagtgaa tgtacagcac ggtggtgtgc
4201 ccctactgcc agatggcaga gcgtctgctc aagcagcggg gcgtcgaggc gatcgagaaa
4261 atcctgatcg accgcgaacc cggcaagcgc gaggaaatga tgactcgcac cggccgccgg
4321 accgtgccgc agatctatat cgacgagaca cacgtgggtg gattcgacga cttgtccgcg
4381 ctggaccgtc agggcggcct ggtgccgctg ctggcggcct gagccccggc cccggcggtt
4441 tcccagcgag atcaacgacc tatccgattg acgccggccc cgggtgaatt cgcccacggc
4501 gtcatgtacc atatgcgctt tcgtgcgaca cgcactttcg tgtccggcag ctctacgtta
4561 agagcatcga cctttgttga gacaagggaa aggcttcggc cagaaccgtc caatctcaac
4621 catttcttgc tttgacaagc ccagccgtaa ggttggagcg agtccatgtg ttgtgggaag
4681 gccagaccat cggctgactc gcagcttgct atttgctgga tatatgtcca gccattgccc
4741 aaaggcgggg acgcttgcct ttgggcaagt tctctttaaa cgccctactt cggtaggtgc
4801 gggtgggttt gcgactgtag ttttcgttgc atctccgcta catagcattc aaaccggccc
4861 ggttgggtcg gcgctacgtt ctttgggacg ggtctgcgct cgaaagagac cgccgccgtc
4921 aagcgcctgc cactatggag gcactgaagt aatgaagtga ggtttcccgg agcgtatagc
4981 tctgcgcgtt atggttcggt cgaatacacc cgcgagggtg cctcgataga gaacctgtca
5041 tgcgactgac agtagcctag ggagacatgc gtcactggca acggtggggt gtgaagcttt
5101 cctgacaacg tagcccccgc aagggtgcgt cattatcgtg agataaggac gcagagactc
5161 cgagcagcaa tggggtcaag gagccaggct ggctggtatg ggtaatgtaa gtgaactgct
5221 gataaacgcc gtgaatatga cggtgcctaa gctgctgcgt ggcactaacc aaaaggtacg
5281 tggtcggtta ttccgctgga agctcggaat acgtcgctgt aagaccaccg gcgaagaggc
5341 aggccctaac ccggtcttgt aacgcgcagg gaacgtggta agcccgtatg gctgccgggg
5401 cgctgtatgg ctctggcaag cgaaccgcaa ggaacgctgt tggctgtgcg ggtatgggag
5461 gttggagaaa gcgaacgtcg ggctgtaatg gtccggatag gggttgaaac atcatcccac
5521 gcgaaagcgg gcagacttcc aactggtctt tcgtcgcaag acggctgact gaactcgctt
5581 gtttaatttt cttcccttgg ggggtgacat accccgcaag gggcatgttg tctctgctag
5641 ttggggaagt cctcaagata ggagagcagc atggaagcat tgatccgcga ggatgaatct
5701 gcgccctccg gcattccgca gcgatggggc gacatcaatt ggcgccgcgt cgaacagaat
5761 gttcgagcaa tgcagattcg aattgcgaag gcgacacagg aacgggactg gcgcagggtg
5821 aaagctctgc agcggtctct gacccgctcg ttttccgcca aggcatcggc ggtgcgacga
5881 gtgacggaga accaaggtaa acggacggca ggtgtcgacc gcgaactctg ggactcgcct
5941 gaagttcgat gggtagccat tggacggttg agacgccggg ggtatcggcc catgccatta
6001 cggagggtct tcatccccaa agccaacgga aaggaacgcc ctctgggtat cccgaccatg
6061 ttggaccggg ccatgcaggc gctatatctg ctggcgctgg aaccggtgtc cgaggggacg
6121 agcgatccga actcttatgg gttcaggtcc aatcgttcta ctgcggatgc tatgtcccag
6181 ctattcgtca acttatccag aaaggtctcg gcctcatgga tattggaggc ggacatccgc
6241 gggtgttttg accatatcag tcacgactgg ctggagcgca atgtccctat ggacaaagcg
6301 atccttcgga agtggttgaa agctggcgtg gtatttcagg gccagtttca ggcgaccgaa
6361 gccggtacgc cacagggggg catcatctcc ccgaccttag ccaatgtggc gctgaacgga
6421 ctggaacaac agctagctcg gtttctcgaa accacgctgg gtgtcaacca gacccggaag
6481 ctcaaagtaa atgttgtgcg ctatgcggac gatttcgtga tcaccggcag cacgccggaa
6541 gttttggaac acgaggtgaa gccgtgggtg gaacagttcc tcgcaattcg agggttgtcc
6601 ctgtccacgg agaaaacgcg gatcgtgagc attgacgagg gttttgactt ccttgggtgg
6661 aatttccgga agtattccgg aacgctgctc attaagccaa gcaggaaaaa cgcgcaggcg
6721 ttctatcgca aggttaagga ggttatcagt gctaacaaga cggtaaagca ggaggtcatc
6781 attcgcctgc tgaacccgat gctgagaggc tgggcgcagt atcacagccc ggtagtggca
6841 aaagaagcgt tcagcaagat gcagagccga gttttccggg cgctgctttg gtggacaaaa
6901 cggcgacaca gagggaagaa cgccgaatgg gtgcggaaga aatacttcgc ttcgtttggt
6961 gatcgaaact gggtgttcgg gactgagttt gtggaagacg atggggaacg gcgctggcag
7021 gagttgtatt cgcttgctag cacgcccatc agacggcaca agaagattcg gggggacttc
7081 aatcctttcg atccagcgca agagatgtat ggcgaaaatc tacgccggga ccgcatgctg
7141 gaaagcatgt ctcaccggaa acagtggatc aggctatatg tcgatcagcg gggcttatgc
7201 gcggtgtgtc agtgcaagat aaccaaagag accgggtggg atgaccatca catcgtctac
7261 agaacccacg gaggttcaga cgctctagga aatcgcgtat tgctgcatcc caactgccac
7321 gtccaagtac accatcatgg tcgaaccgtt gtgaaaccgg tgctggagat gtccagcatt
7381 ttgtaagggc ttgagccgta tgcggggaaa ctcgcacgta cggttcttag gggggagcaa
7441 accagcaatg gtatgctcct accctcc
caa ctggccgccg gtgatcgaca ccaccggcgc
7501 cgcatggcgc agcgcgccca aaccgatcca tccggaagcc tttcatgagc gaccagcaac
7561 aagccaacca gcaggacgac cagcccttct tcaacattca gcgcgtgtac ctgaaggaca
7621 tgtcgctgga gcagccgaat tcgccgggca tcttcctcga atcggaagcc ccctcggtgg
7681 aagtccaggt caacgtgggc gcctcgcaac tgcaggaagg catcttcgaa gtggtcgtga
7741 ccggtaccgt gacgaccaag gtgcaagaca aggtggcctt cctggtggaa gcgcaccagg
7801 ccggcatctt cgacatccgc aatgtgccgg tggaacaact ggacccgctg ctgggcatcg

[top]


[ORF sequence]

 

MEALIREDESAPSGIPQRWGDINWRRVEQNVRAMQIRIAKATQERDWRRVKALQRSLT

RSFSAKASAVRRVTENQGKRTAGVDRELWDSPEVRWVAIGRLRRRGYRPMPLRRVFIP

KANGKERPLGIPTMLDRAMQALYLLALEPVSEGTSDPNSYGFRSNRSTADAMSQLFVN

LSRKVSASWILEADIRGCFDHISHDWLERNVPMDKAILRKWLKAGVVFQGQFQATEAG

TPQGGIISPTLANVALNGLEQQLARFLETTLGVNQTRKLKVNVVRYADDFVITGSTPE

VLEHEVKPWVEQFLAIRGLSLSTEKTRIVSIDEGFDFLGWNFRKYSGTLLIKPSRKNA

QAFYRKVKEVISANKTVKQEVIIRLLNPMLRGWAQYHSPVVAKEAFSKMQSRVFRALL

WWTKRRHRGKNAEWVRKKYFASFGDRNWVFGTEFVEDDGERRWQELYSLASTPIRRHK

KIRGDFNPFDPAQEMYGENLRRDRMLESMSHRKQWIRLYVDQRGLCAVCQCKITKETG

WDDHHIVYRTHGGSDALGNRVLLHPNCHVQVHHHGRTVVKPVLEMSSIL

top]


[Secondary structure]

 

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |