[Back to introns by organism]   [Back to home page]

Information of R.r.I1 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

[Intron sequence]

               

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

 

Intron on the antisense strand

 

3' end                                          

                                           gtcgggt agcagcgggg cattgctgcc

134041 ccgccgcccc ctaagaaccg ggcgcgcgga tcactccgca cccggctcaa gccattctcc

134101 agcgccacgt tgtgacaccg ggctccccgg cggtatgttt cgcgtcctga cggcgggcga

134161 ggtagtcatc ccatgtggga tcgaacgggt tagccagcgc cggaattttg gcatgtctct

134221 tgattgggac agccgctgca cggacaagcc catacttggc gtcctccgtg ctgaaatccc

134281 atgactggct accgttgacc ctgaaatacc ttctcctgat ccaccgtgct cctttgttag

134341 ggtgtcgtcg gcacgcccat ctccagagca tgcgccagat gtagaagtct atcgaagaga

134401 aggtcgcctt cgccacgaca tggcggtgat acatcgccca tccccgaatg atcgggttca

134461 gcatttgtat caatccttcc tgcgcgatgg cggcgttgcc tttgatgatt ctccggacct

134521 tatccagcag cgccttgacg ctgtgtttgg acggcgtgat gagaagtttg ccgttgtact

134581 ttcgcacatt ctggcccaga aaatcgaaac catccgcaat gttcgtgatt ctgcttttct

134641 cgtccgagag ttccagtcca cgaacagcca ggtatttcct gatcgccggc aagaccttgt

134701 gttccagcac ctctttcgat gcactggtta cgataaagtc gtcggcataa cggatgacgt

134761 ggggcttggc ttttcggtac gtggatttcg acgatcccac gcttgcatcg actgcggcct

134821 caagcccatc caacactctg ttagcgattg tgggcgaaat cacgccccct tgcggggttc

134881 ccgcccgcgt ctcgaacaga gttccctctt caacatatcc ggcttgaagc cacttccgca

134941 ggatcgcctt gtccataggc atatgctcga ggatgtaatc atggctgatt tcatcgaaac

135001 agcttttgat atcgccctct agaacccacg tggcggactt ccgtcttgcc aacgtgatga

135061 agcactgctc aatggcgtcg gctgtcgagc gtttgggccg aaatccatag gaattcgggt

135121 ccgccaaggt ctccgtcact ggttccagcg caagcttcca taaggcttgc attgcgcggc

135181 acagcatacg gggtatcccg agcgggcgct tcttgccgtt actcttggga atataaacgc

135241 gtcgcaacgg catggtacgg tagccgtgat gccgcagcga cattactccc ttccatctgg

135301 ccaccggggt tgtccagatt tctccgtcca ctcccggcgt tttcttcccg cgattctccg

135361 tcacccgctt cacagccagc atcttgccgc tgtgcgagcg ggtcagcaga cgttgcaagg

135421 cttttacctt gccccatcgg cattcgcgag tggccttggc gatacgcact tgaagtcgct

135481 tcacggttgc ttcaatctgc gaccagtctg tctgatcgca aatctgccct ccgtgcgagg

135541 acgcaccagc attcgctgct ttcgcaccga ttgccgtcat ctgcttttcc tcgtccagca

135601 attagtcatg ttctctcgcc atgaatgacc gagcggaagt ctgcccactt tcgtgtcggg

135661 cgatgttgca gcccgtatcc cgaccattac agccgggcct tcgctttctc cgccttcctt

135721 tacccacaac accaacagcg ttccttacgg ttcgcctgcc gttaccggca gcgctatggg

135781 cttaccctgt tccgcatgaa tctcagagcc agtcggaccc ttccttttcg ccggcagctt

135841 tccgtccatg gaggtccaac cggaaaggac cacaccttgc tgcacacctt ttggttcaag

135901 cctatcagca catttggctt gttccttgtt acggcgttta tcggaagttc acttgtgttg

135961 gtcgtcattg ctcatccctt gcacctcacc gcctttgtgc tggcagtttc cacttcgcct

136021 cacggttcgg tggaccggtc acccggtggt tacgttgtcc ccgaagctcc acacggagtc

136081 gttgccaact tcgcatgttc gggtagggaa cgaccgatgg aacggtcggt ttcgtaaagt

136141 tcacatcacg tgttctcctc tcgaaacaga gattcatgcg actttcaggt cgcac

5' end  

 

Intron on the sense strand

 

5' end

   1 gtgcgacctg aaagtcgcat gaatctctgt ttcgagagga gaacacgtga tgtgaacttt 

  61 acgaaaccga ccgttccatc ggtcgttccc tacccgaaca tgcgaagttg gcaacgactc 

 121 cgtgtggagc ttcggggaca acgtaaccac cgggtgaccg gtccaccgaa ccgtgaggcg 

 181 aagtggaaac tgccagcaca aaggcggtga ggtgcaaggg atgagcaatg acgaccaaca 

 241 caagtgaact tccgataaac gccgtaacaa ggaacaagcc aaatgtgctg ataggcttga 

 301 accaaaaggt gtgcagcaag gtgtggtcct ttccggttgg acctccatgg acggaaagct 

 361 gccggcgaaa aggaagggtc cgactggctc tgagattcat gcggaacagg gtaagcccat 

 421 agcgctgccg gtaacggcag gcgaaccgta aggaacgctg ttggtgttgt gggtaaagga 

 481 aggcggagaa agcgaaggcc cggctgtaat ggtcgggata cgggctgcaa catcgcccga 

 541 cacgaaagtg ggcagacttc cgctcggtca ttcatggcga gagaacatga ctaattgctg 

 601 gacgaggaaa agcagatgac ggcaatcggt gcgaaagcag cgaatgctgg tgcgtcctcg 

 661 cacggagggc agatttgcga tcagacagac tggtcgcaga ttgaagcaac cgtgaagcga 

 721 cttcaagtgc gtatcgccaa ggccactcgc gaatgccgat ggggcaaggt aaaagccttg 

 781 caacgtctgc tgacccgctc gcacagcggc aagatgctgg ctgtgaagcg ggtgacggag 

 841 aatcgcggga agaaaacgcc gggagtggac ggagaaatct ggacaacccc ggtggccaga 

 901 tggaagggag taatgtcgct gcggcatcac ggctaccgta ccatgccgtt gcgacgcgtt 

 961 tatattccca agagtaacgg caagaagcgc ccgctcggga taccccgtat gctgtgccgc 

1021 gcaatgcaag ccttatggaa gcttgcgctg gaaccagtga cggagacctt ggcggacccg 

1081 aattcctatg gatttcggcc caaacgctcg acagccgacg ccattgagca gtgcttcatc 

1141 acgttggcaa gacggaagtc cgccacgtgg gttctagagg gcgatatcaa aagctgtttc 

1201 gatgaaatca gccatgatta catcctcgag catatgccta tggacaaggc gatcctgcgg 

1261 aagtggcttc aagccggata tgttgaagag ggaactctgt tcgagacgcg ggcgggaacc 

1321 ccgcaagggg gcgtgatttc gcccacaatc gctaacagag tgttggatgg gcttgaggcc 

1381 gcagtcgatg caagcgtggg atcgtcgaaa tccacgtacc gaaaagccaa gccccacgtc 

1441 atccgttatg ccgacgactt tatcgtaacc agtgcatcga aagaggtgct ggaacacaag 

1501 gtcttgccgg cgatcaggaa atacctggct gttcgtggac tggaactctc ggacgagaaa 

1561 agcagaatca cgaacattgc ggatggtttc gattttctgg gccagaatgt gcgaaagtac 

1621 aacggcaaac ttctcatcac gccgtccaaa cacagcgtca aggcgctgct ggataaggtc 

1681 cggagaatca tcaaaggcaa cgccgccatc gcgcaggaag gattgataca aatgctgaac 

1741 ccgatcattc ggggatgggc gatgtatcac cgccatgtcg tggcgaaggc gaccttctct 

1801 tcgatagact tctacatctg gcgcatgctc tggagatggg cgtgccgacg acaccctaac 

1861 aaaggagcac ggtggatcag gagaaggtat ttcagggtca acggtagcca gtcatgggat 

1921 ttcagcacgg aggacgccaa gtatgggctt gtccgtgcag cggctgtccc aatcaagaga 

1981 catgccaaaa ttccggcgct ggctaacccg ttcgatccca catgggatga ctacctcgcc 

2041 cgccgtcagg acgcgaaaca taccgccggg gagcccggtg tcacaacgtg gcgctggaga 

2101 atggcttgag ccgggtgcgg agtgatccgc gcgcccggtt cttagggggc ggcggggcag 

2161 caatgccccg ctgctacccg ac

3' end

[top]


[Intron and flanking sequence]

 

133501 tcagcgagac cgggactgcc ctgaaaacca tcgagggcta cattgtcacc gtcaaccatc

133561 atatggactc gatcgcgaca tcggcccggg aacagtcggt tggtctggcc gaggtgaata

133621 ccgccgtcaa tcagatggat caggtcaccc agcagaatgc cgccatggtc gaggaaagca

133681 atgcagcgag cgcgaccctt gccggtgaag caggacggct gcgcgacctg atcagccagt

133741 tccaatttgg cgaagccatg cgccggccgg cgcccgtcac cgaggccagc ccttccaacc

133801 ggccggtcgc atcgccggcg cgccggattg tcggtgcggt tgccaaggcc ttttccggaa

133861 acgctgccgt caagcagacc tgggaagagt tctgagcgga gctcacggcg cattgacgga

133921 aggtcttttg gccatcggag ctggcaagcg gtcgcgtcct attcaggacg cggtcctatc

133981 tctctctctg acgtcccatc ttgcacccgc cgtgtcgggt agcagcgggg cattgctgcc

134041 ccgccgcccc ctaagaaccg ggcgcgcgga tcactccgca cccggctcaa gccattctcc

134101 agcgccacgt tgtgacaccg ggctccccgg cggtatgttt cgcgtcctga cggcgggcga

134161 ggtagtcatc ccatgtggga tcgaacgggt tagccagcgc cggaattttg gcatgtctct

134221 tgattgggac agccgctgca cggacaagcc catacttggc gtcctccgtg ctgaaatccc

134281 atgactggct accgttgacc ctgaaatacc ttctcctgat ccaccgtgct cctttgttag

134341 ggtgtcgtcg gcacgcccat ctccagagca tgcgccagat gtagaagtct atcgaagaga

134401 aggtcgcctt cgccacgaca tggcggtgat acatcgccca tccccgaatg atcgggttca

134461 gcatttgtat caatccttcc tgcgcgatgg cggcgttgcc tttgatgatt ctccggacct

134521 tatccagcag cgccttgacg ctgtgtttgg acggcgtgat gagaagtttg ccgttgtact

134581 ttcgcacatt ctggcccaga aaatcgaaac catccgcaat gttcgtgatt ctgcttttct

134641 cgtccgagag ttccagtcca cgaacagcca ggtatttcct gatcgccggc aagaccttgt

134701 gttccagcac ctctttcgat gcactggtta cgataaagtc gtcggcataa cggatgacgt

134761 ggggcttggc ttttcggtac gtggatttcg acgatcccac gcttgcatcg actgcggcct

134821 caagcccatc caacactctg ttagcgattg tgggcgaaat cacgccccct tgcggggttc

134881 ccgcccgcgt ctcgaacaga gttccctctt caacatatcc ggcttgaagc cacttccgca

134941 ggatcgcctt gtccataggc atatgctcga ggatgtaatc atggctgatt tcatcgaaac

135001 agcttttgat atcgccctct agaacccacg tggcggactt ccgtcttgcc aacgtgatga

135061 agcactgctc aatggcgtcg gctgtcgagc gtttgggccg aaatccatag gaattcgggt

135121 ccgccaaggt ctccgtcact ggttccagcg caagcttcca taaggcttgc attgcgcggc

135181 acagcatacg gggtatcccg agcgggcgct tcttgccgtt actcttggga atataaacgc

135241 gtcgcaacgg catggtacgg tagccgtgat gccgcagcga cattactccc ttccatctgg

135301 ccaccggggt tgtccagatt tctccgtcca ctcccggcgt tttcttcccg cgattctccg

135361 tcacccgctt cacagccagc atcttgccgc tgtgcgagcg ggtcagcaga cgttgcaagg

135421 cttttacctt gccccatcgg cattcgcgag tggccttggc gatacgcact tgaagtcgct

135481 tcacggttgc ttcaatctgc gaccagtctg tctgatcgca aatctgccct ccgtgcgagg

135541 acgcaccagc attcgctgct ttcgcaccga ttgccgtcat ctgcttttcc tcgtccagca

135601 attagtcatg ttctctcgcc atgaatgacc gagcggaagt ctgcccactt tcgtgtcggg

135661 cgatgttgca gcccgtatcc cgaccattac agccgggcct tcgctttctc cgccttcctt

135721 tacccacaac accaacagcg ttccttacgg ttcgcctgcc gttaccggca gcgctatggg

135781 cttaccctgt tccgcatgaa tctcagagcc agtcggaccc ttccttttcg ccggcagctt

135841 tccgtccatg gaggtccaac cggaaaggac cacaccttgc tgcacacctt ttggttcaag

135901 cctatcagca catttggctt gttccttgtt acggcgttta tcggaagttc acttgtgttg

135961 gtcgtcattg ctcatccctt gcacctcacc gcctttgtgc tggcagtttc cacttcgcct

136021 cacggttcgg tggaccggtc acccggtggt tacgttgtcc ccgaagctcc acacggagtc

136081 gttgccaact tcgcatgttc gggtagggaa cgaccgatgg aacggtcggt ttcgtaaagt

136141 tcacatcacg tgttctcctc tcgaaacaga gattcatgcg actttcaggt cgcactccgg

136201 cgtcaaggac tgcgcggccc ctccaatttt ccccgcaccg caagcggcac agagaaagtc

136261 ggttcctccg ctcgccgcac ggcggccgcg ttccgcggtc cctgacccct cacggacttc

136321 ggtgcggaca cttttcgtca gaaaacgaaa aggagaccct gatgacacag catcacatca

136381 cacagccggt tagcgaagag cggaaacttg cggatcagca gccgaccggg cttgaacatc

136441 tgcgcagcag tttcgatgct gaagtgcacc ttcccgccga tatctcgcgc gagtttctgt

136501 ctgcggcgct gctttgggcc atcgacaaca aggtcgattt cggtttgttc cacgagcccg

136561 gaaagatcat catcgcgcac ttcggtggtg atgagatcta cctcccctcc cggtggtccg

[top]


[ORF sequence]

 

MTAIGAKAANAGASSHGGQICDQTDWSQIEATVKRLQVRIAKATRECRWGKVKALQRL

LTRSHSGKMLAVKRVTENRGKKTPGVDGEIWTTPVARWKGVMSLRHHGYRTMPLRRVY

IPKSNGKKRPLGIPRMLCRAMQALWKLALEPVTETLADPNSYGFRPKRSTADAIEQCF

ITLARRKSATWVLEGDIKSCFDEISHDYILEHMPMDKAILRKWLQAGYVEEGTLFETR

AGTPQGGVISPTIANRVLDGLEAAVDASVGSSKSTYRKAKPHVIRYADDFIVTSASKE

VLEHKVLPAIRKYLAVRGLELSDEKSRITNIADGFDFLGQNVRKYNGKLLITPSKHSV

KALLDKVRRIIKGNAAIAQEGLIQMLNPIIRGWAMYHRHVVAKATFSSIDFYIWRMLW

RWACRRHPNKGARWIRRRYFRVNGSQSWDFSTEDAKYGLVRAAAVPIKRHAKIPALAN

PFDPTWDDYLARRQDAKHTAGEPGVTTWRWRMA

top]


[Secondary structure]

 

[top]


 

| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |