[Back to introns by organism]  [Back to home page]

Information for Rh.sp.I1 intron  (Format of information for each intron

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence]

 

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

Intron on antisense strand

3' end

                                             aat agggtcggga actggcgcgg

3841 tgacgcagta cgtgcgttcc gtttccagcc cccgccacct cgattccgtg catgcggttc

3901 tcccgcacac ggctcaccga cgtcgttcac caccggcatt cggcttcccc cgccagtccc

3961 ggaagggcct gggggcgacg acgttcccag aaaggctgac caggcccaga tggtccgggg

4021 aggcatacgc caccacgtgc catccgaagc tgcgggagcg cttgtgacgt ttgccgatga

4081 acagcgccat ccgcatccgc acgtactccc tgatcacccc gaagcggtgt gctgagttgc

4141 cgtacttgaa gtacgccgcc cagccgttca ggaacaagtt gaggtcctga accaccacct

4201 cggtcggcag cagcaatcgg gagcggtccg tcaattcccg gatccgttcc cgtgcgtgcc

4261 gcaccgcctt gtccgcgggc cagcgggcga ggaaggggaa cggccgccgc ccggggcgag

4321 cgggcgcgtt caccaaccga tgatggaaac cgaggaagtc cactccctcg cctcccaccc

4381 gaaggtgcac aatcctggtc ttggcctcct tcggctccaa gccgagatcg gccagcagtt

4441 cccgaagccg cgtcaacgcg gcctcggcct gccgccggga cctgcacatc accagcgcgt

4501 cgtcggcata acgcaccagg accccgtgct cgtccacgtc ccatgcccgg tcgagccggt

4561 gcaggtagac gttgcagagc aaggcggatg caaccccgcc ctgtggggtt ccggtgaccg

4621 gccgacggac ctgcccctcc tccatcactc ccgcgcgcaa catcacgcgc agcagcttca

4681 ggaacggctg gtcgcagaca cgttcctcga ccgcctgcat caacttctcg atcggaatcg

4741 cctcgaaaca gttggcgatg tccgtctcca ccacccaccg acacccccgc caggactcat

4801 cgatgagcac ctgcagcgcg tcatgggcac cacgccgggg acggaacccg aacgagcacg

4861 gaaggaaatc cgcctcgaac accggctccg ccaccagttt ccacgcggcc tggacgatcc

4921 ggtcgcgcac cgaaggaatc gacagcggcc tttgctccac cgtgccgggc ttggggatga

4981 acacccgacg cgccgggagc ggacgatagc tgccttcctt cagttccacg gccagttcgt

5041 cgaggaggcg ggcaaccccg tactcctcga cctcctccag ggtgattcgg tcgatacccg

5101 gcgcgccgtt gttgcggcgc accgcgaccc acccacgcca caggacgtcc ctgcgcgaga

5161 ccttgtccat cagcgcgtgg aaccgtcgtc cgggatcgac cttggccgcc cggtagagcg

5221 catgctgcaa ggcacggacc ggatccgagg gagaacgtcc cccgtcggcg ggactagcca

5281 cctgggcact caccgggccc ctcctccaca tctctacgca tcgacgaagt agcggccctt

5341 ccctcaccgg cggttatgtt gtccgctcgg ctccagcagt actacggccg cctctgacgc

5401 ccacccggcc ggcacccact tcccgaggtc atcggttata gggcacccag cttccagcgg

5461 cacctgcgcg caggccgccg ggccggggag ggcctctcca gttcccgccg tcactatccg

5521 aacgttccgc gccccttacg ccggggagtt catcacgact gcgattccag gctcttcgcc

5581 gcttccatgg ccttcgccct gacactccgg gctcggctct cccttgtccc acctcacgat

5641 ggggattttc acgacgcggc aggcttcgct tgatgctacg gaccactcgg ttgctccccc

5701 ttgaagggct tttgacgctg ggcttcgaca ccgggcgttt ccccccgatg ccgccagcct

5761 gctaccgggc ctcctggcag ttacccggac cggacttgca ccggcaagcg acgacgagct

5821 tacgactgac gatcaatcac ctacatgacc aacctccgtc ttctggacgc ac

5' end

Intron on sense strand

5' end

            gtg cgtccagaag acggaggttg gtcatgtagg tgattgatcg tcagtcgtaa

3421 gctcgtcgtc gcttgccggt gcaagtccgg tccgggtaac tgccaggagg cccggtagca

3481 ggctggcggc atcgggggga aacgcccggt gtcgaagccc agcgtcaaaa gcccttcaag

3541 ggggagcaac cgagtggtcc gtagcatcaa gcgaagcctg ccgcgtcgtg aaaatcccca

3601 tcgtgaggtg ggacaaggga gagccgagcc cggagtgtca gggcgaaggc catggaagcg

3661 gcgaagagcc tggaatcgca gtcgtgatga actccccggc gtaaggggcg cggaacgttc

3721 ggatagtgac ggcgggaact ggagaggccc tccccggccc ggcggcctgc gcgcaggtgc

3781 cgctggaagc tgggtgccct ataaccgatg acctcgggaa gtgggtgccg gccgggtggg

3841 cgtcagaggc ggccgtagta ctgctggagc cgagcggaca acataaccgc cggtgaggga

3901 agggccgcta cttcgtcgat gcgtagagat gtggaggagg ggcccggtga gtgcccaggt

3961 ggctagtccc gccgacgggg gacgttctcc ctcggatccg gtccgtgcct tgcagcatgc

4021 gctctaccgg gcggccaagg tcgatcccgg acgacggttc cacgcgctga tggacaaggt

4081 ctcgcgcagg gacgtcctgt ggcgtgggtg ggtcgcggtg cgccgcaaca acggcgcgcc

4141 gggtatcgac cgaatcaccc tggaggaggt cgaggagtac ggggttgccc gcctcctcga

4201 cgaactggcc gtggaactga aggaaggcag ctatcgtccg ctcccggcgc gtcgggtgtt

4261 catccccaag cccggcacgg tggagcaaag gccgctgtcg attccttcgg tgcgcgaccg

4321 gatcgtccag gccgcgtgga aactggtggc ggagccggtg ttcgaggcgg atttccttcc

4381 gtgctcgttc gggttccgtc cccggcgtgg tgcccatgac gcgctgcagg tgctcatcga

4441 tgagtcctgg cgggggtgtc ggtgggtggt ggagacggac atcgccaact gtttcgaggc

4501 gattccgatc gagaagttga tgcaggcggt cgaggaacgt gtctgcgacc agccgttcct

4561 gaagctgctg cgcgtgatgt tgcgcgcggg agtgatggag gaggggcagg tccgtcggcc

4621 ggtcaccgga accccacagg gcggggttgc atccgccttg ctctgcaacg tctacctgca

4681 ccggctcgac cgggcatggg acgtggacga gcacggggtc ctggtgcgtt atgccgacga

4741 cgcgctggtg atgtgcaggt cccggcggca ggccgaggcc gcgttgacgc ggcttcggga

4801 actgctggcc gatctcggct tggagccgaa ggaggccaag accaggattg tgcaccttcg

4861 ggtgggaggc gagggagtgg acttcctcgg tttccatcat cggttggtga acgcgcccgc

4921 tcgccccggg cggcggccgt tccccttcct cgcccgctgg cccgcggaca aggcggtgcg

4981 gcacgcacgg gaacggatcc gggaattgac ggaccgctcc cgattgctgc tgccgaccga

5041 ggtggtggtt caggacctca acttgttcct gaacggctgg gcggcgtact tcaagtacgg

5101 caactcagca caccgcttcg gggtgatcag ggagtacgtg cggatgcgga tggcgctgtt

5161 catcggcaaa cgtcacaagc gctcccgcag cttcggatgg cacgtggtgg cgtatgcctc

5221 cccggaccat ctgggcctgg tcagcctttc tgggaacgtc gtcgccccca ggcccttccg

5281 ggactggcgg gggaagccga atgccggtgg tgaacgacgt cggtgagccg tgtgcgggag

5341 aaccgcatgc acggaatcga ggtggcgggg gctggaaacg gaacgcacgt actgcgtcac

5401 cgcgccagtt cccgacccta t

3' end

[top]


[Intron and flanking sequence]

 

3481 gcagggcgat ccggtgcggg tggtcgatca cagtttcttc accggaccga atgcggaacg

3541 ggccatcccg tacggcgtcc acgacttgac caccgacgca ggttgggtca atgtcggcgt

3601 cgaccacgac accgccgcgt tcgcggtcgc ctccatccgc cgctggtggc aggcccgcgg

3661 cgccgccgac tacccgcacg cgacccggct gctgatcacc gccgacgccg gcggctccaa

3721 cagctaccgg taccggttgt ggaaggcgga attggccgca ctggcaaccg agaccgggtt

3781 ggcgatcacc gtctgccatt tcccgcccgg cacctcgaat agggtcggga actggcgcgg

3841 tgacgcagta cgtgcgttcc gtttccagcc cccgccacct cgattccgtg catgcggttc

3901 tcccgcacac ggctcaccga cgtcgttcac caccggcatt cggcttcccc cgccagtccc

3961 ggaagggcct gggggcgacg acgttcccag aaaggctgac caggcccaga tggtccgggg

4021 aggcatacgc caccacgtgc catccgaagc tgcgggagcg cttgtgacgt ttgccgatga

4081 acagcgccat ccgcatccgc acgtactccc tgatcacccc gaagcggtgt gctgagttgc

4141 cgtacttgaa gtacgccgcc cagccgttca ggaacaagtt gaggtcctga accaccacct

4201 cggtcggcag cagcaatcgg gagcggtccg tcaattcccg gatccgttcc cgtgcgtgcc

4261 gcaccgcctt gtccgcgggc cagcgggcga ggaaggggaa cggccgccgc ccggggcgag

4321 cgggcgcgtt caccaaccga tgatggaaac cgaggaagtc cactccctcg cctcccaccc

4381 gaaggtgcac aatcctggtc ttggcctcct tcggctccaa gccgagatcg gccagcagtt

4441 cccgaagccg cgtcaacgcg gcctcggcct gccgccggga cctgcacatc accagcgcgt

4501 cgtcggcata acgcaccagg accccgtgct cgtccacgtc ccatgcccgg tcgagccggt

4561 gcaggtagac gttgcagagc aaggcggatg caaccccgcc ctgtggggtt ccggtgaccg

4621 gccgacggac ctgcccctcc tccatcactc ccgcgcgcaa catcacgcgc agcagcttca

4681 ggaacggctg gtcgcagaca cgttcctcga ccgcctgcat caacttctcg atcggaatcg

4741 cctcgaaaca gttggcgatg tccgtctcca ccacccaccg acacccccgc caggactcat

4801 cgatgagcac ctgcagcgcg tcatgggcac cacgccgggg acggaacccg aacgagcacg

4861 gaaggaaatc cgcctcgaac accggctccg ccaccagttt ccacgcggcc tggacgatcc

4921 ggtcgcgcac cgaaggaatc gacagcggcc tttgctccac cgtgccgggc ttggggatga

4981 acacccgacg cgccgggagc ggacgatagc tgccttcctt cagttccacg gccagttcgt

5041 cgaggaggcg ggcaaccccg tactcctcga cctcctccag ggtgattcgg tcgatacccg

5101 gcgcgccgtt gttgcggcgc accgcgaccc acccacgcca caggacgtcc ctgcgcgaga

5161 ccttgtccat cagcgcgtgg aaccgtcgtc cgggatcgac cttggccgcc cggtagagcg

5221 catgctgcaa ggcacggacc ggatccgagg gagaacgtcc cccgtcggcg ggactagcca

5281 cctgggcact caccgggccc ctcctccaca tctctacgca tcgacgaagt agcggccctt

5341 ccctcaccgg cggttatgtt gtccgctcgg ctccagcagt actacggccg cctctgacgc

5401 ccacccggcc ggcacccact tcccgaggtc atcggttata gggcacccag cttccagcgg

5461 cacctgcgcg caggccgccg ggccggggag ggcctctcca gttcccgccg tcactatccg

5521 aacgttccgc gccccttacg ccggggagtt catcacgact gcgattccag gctcttcgcc

5581 gcttccatgg ccttcgccct gacactccgg gctcggctct cccttgtccc acctcacgat

5641 ggggattttc acgacgcggc aggcttcgct tgatgctacg gaccactcgg ttgctccccc

5701 ttgaagggct tttgacgctg ggcttcgaca ccgggcgttt ccccccgatg ccgccagcct

5761 gctaccgggc ctcctggcag ttacccggac cggacttgca ccggcaagcg acgacgagct

5821 tacgactgac gatcaatcac ctacatgacc aacctccgtc ttctggacgc acagtggaac

5881 aaaatcgaac accggttgtt ctcccagatc accgtgaact ggcggggacg gccgctgacc

5941 agccacgagg tcgtggtcaa gaccatcgcc tccacccgca cccgcaccgg gctgcgcgtg

6001 gacgccgagc tggacaccgg cgactacccg atcgggatct cgattggtcg agacgagttg

6061 cgcgcgttac ccatccaccc gcacgcccag tgcggaacat ggaactacac catcgaaccc

6121 acccacgccg acgccgctcc ggttcccggc cgcgaccggg aacgcgagcg tgccacggcc

[top]


[ORF sequence]

 

MSAQVASPADGGRSPSDPVRALQHALYRAAKVDPGRRFHALMDKVSRRDVLWRGWVAV

RRNNGAPGIDRITLEEVEEYGVARLLDELAVELKEGSYRPLPARRVFIPKPGTVEQRP

LSIPSVRDRIVQAAWKLVAEPVFEADFLPCSFGFRPRRGAHDALQVLIDESWRGCRWV

VETDIANCFEAIPIEKLMQAVEERVCDQPFLKLLRVMLRAGVMEEGQVRRPVTGTPQG

GVASALLCNVYLHRLDRAWDVDEHGVLVRYADDALVMCRSRRQAEAALTRLRELLADL

GLEPKEAKTRIVHLRVGGEGVDFLGFHHRLVNAPARPGRRPFPFLARWPADKAVRHAR

ERIRELTDRSRLLLPTEVVVQDLNLFLNGWAAYFKYGNSAHRFGVIREYVRMRMALFI

GKRHKRSRSFGWHVVAYASPDHLGLVSLSGNVVAPRPFRDWRGKPNAGGERRR

[top]


[Secondary structure]

 

                                                            

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |