[Back to introns by organism] [Back to home page]

Information of C.sp.I1 intron  (Format of information for each intron)

[Intron sequence]

[Intron with flanking sequence]

[ORF sequence]

[Secondary structure]

 

[Intron sequence]

Sequence from Genbank entry.  Intron is identified in red.  The ORF
is identified in blue with start and stop codons underlined.  
               

5' end                 

                                gtgcg attcgttggc tagaaaccaa tcccaatagg

 481 gcttaaggag gattagccgc ttaagattca tgagaatctt ttgataactg aataacaatg

 541 taggtgcaaa acctaccctc caagtgctgg agtgaaagca tatcccctac tgtagagact

 601 agccatcaat cggtaaagcg gggataaaag gtggaaatac tgcaagccta atgtggtaat

 661 caccaagcaa agtatccatg agtaaagcca aggcttaaag acacgtctga atcatcgtga

 721 taaaacagta gcgcaatcag gctgaatcgc tacaaccaaa aggtaaagga taaaggactg

 781 gcaaatttta ccttgaccag tatgcactcc caataaaccc ggtgaatagt gtcactctaa

 841 agatacatag cattgttatt cggaacgaag taacccgtta tttactctca gatggtcgag

 901 tatctgagta ggtgctaacc ggatgtaata acgtggagag attgagtaag aagcaaatgt

 961 ttaattgtaa tgattgaaat atgctgacgt actcacaatg atttgaaagg attggtctga

1021 ttaggaagta tatatgtcta atacgagttt aaagactacg gcggaatgga atacgatacc

1081 ctggcgaaag ttagaacgta gcgtatacaa gcttcagaaa agaatatacc aagcttctca

1141 acgtggtgat actaaggcag tccgcaaact ccaaaaaacc ctgatgaggt cttggtctgg

1201 aaaagctctt gcagtaagaa aagtcaccca agataaccaa ggaaagaagg cagcaggtat

1261 tgacggtgta aaatccctta aaccatcagc cagactcact ttggtaatga atatgaagct

1321 taaccataag gtaaaagcaa ctcgtagagt atggattcca aaacccggaa acgttgaaaa

1381 acgaccacta ggaataccca caatgcaaga tagagcaact caatcgcttg tcaaactggc

1441 attagaacca gaatgggagg caaaatttga gcccaatagt tacggtttca gacccggacg

1501 caatgcccat gatgcaagag aagcaatatt taatagtatc agatactcaa acaaatgggt

1561 attagatgct gatatttcaa aatgcttcga taaaataaac cacgaaaaac tattgaccaa

1621 aattaacaca ttcccgacca tgaggaggca aataaaagca tggttgaaag caggagtact

1681 agacaatggt catttctcag aaactactga gggaacgcca caaggcggtg taatatctcc

1741 actattagcc aatattgcct tacatgggct agaaaagcta gtaaaggagt ttgcagccag

1801 ccagagagga ggaaaggtga agaatcagaa cagtatatcc ctaattagat atgcagacga

1861 ttttgtgatt cttgccccca ataaaactca aataatagta ctcaaagaaa tagtaaaaac

1921 gtggttagca gaaatgggac tggaattaaa ccctaacaaa acccgtatag tttcgacttt

1981 caaaagctca gagatattcg cctcgcaaga agtaggattt aatttcctcg gtttcaatgt

2041 gaaacaatac aaagtgggaa agaatgactc tggaaaatta tcgaacggta aaaagttagg

2101 gtataaaaca ctcatcaagc ctagtgtaaa atcagtaaag aaacactacg atgacatagc

2161 aagaataatc gataatcaca aaaatgctgc acaagaaaca ctaataagta aacttaaccc

2221 tgtaatcagg ggatgggtta actattactc aacatcagtg agtaaagaga tattctcaaa

2281 gctaagtcat ctaatatatc agaagctgaa acgctgggga aaacgtcgtc accctgacaa

2341 gtctaatgta tgggtaacca agaaatattg gcatacagta ggtggcgata actgggtatt

2401 cgcagcaaca aagaacggag aaatcacaat gaggctattc aaacactcac aaaaagaaat

2461 tgtgagacac gtaaaagtaa aaggtgatgc ctcaccgttc gatgggaact taaaatattg

2521 gagttcaaga aagggcgaaa atcccttagt acctaaaaga gtagcaatac tacttaaaaa

2581 gnntttgggg aaatgctctc attgcggatt gtactttaga gaagatgacc taatcgagat

2641 tgaccatatc attcctaaat cgcaaggtgg aaaggatgta tacgacaatc tgcaagcatt

2701 acataggcat tgtcacgacg ttaaaactgc cactgacaac tcttataatc aacctaagag

2761 cgatacagaa ataaatgtga tgtggtagtg agaagtaccc acgacttggg tcaagtcatt

2821 gaggagccgt atgaggtgaa agtctcaagt acggttttga agaccagcag gattggtgac

2881 agtcttgctg agtttaat

3' end  

[top]


[Intron and flanking sequence]

 

   1 gcaacggatc acctcaaatc atcaaatacg cctacagcaa taagaaaaat ggggaatcag

  61 cacattcccc taaaacaaaa ttctcccttg caaactacca aaaaaattca aaactatgaa

 121 cctatccaat attttcagca ctctggactc gcaaattccg attgtggcat tggaagtgct

 181 atccccagag gaagccacaa tcattcaatg gctaacaaca actgctcaag aaaagctcac

 241 cctaccagtg tacttctgga acttaggagt ttctacctta gagcaatgcc tgattgctgt

 301 tgatggggga ctggtattca agccagtgcc agattacaaa aaaccgcctc ttgctgaccc

 361 attgatattt atctttgatt gcatcaacag ctttgatggt gctggtgtgt ttatcttagg

 421 tgatgttcac ccatttatcg gcaaagtgcg attcgttggc tagaaaccaa tcccaatagg

 481 gcttaaggag gattagccgc ttaagattca tgagaatctt ttgataactg aataacaatg

 541 taggtgcaaa acctaccctc caagtgctgg agtgaaagca tatcccctac tgtagagact

 601 agccatcaat cggtaaagcg gggataaaag gtggaaatac tgcaagccta atgtggtaat

 661 caccaagcaa agtatccatg agtaaagcca aggcttaaag acacgtctga atcatcgtga

 721 taaaacagta gcgcaatcag gctgaatcgc tacaaccaaa aggtaaagga taaaggactg

 781 gcaaatttta ccttgaccag tatgcactcc caataaaccc ggtgaatagt gtcactctaa

 841 agatacatag cattgttatt cggaacgaag taacccgtta tttactctca gatggtcgag

 901 tatctgagta ggtgctaacc ggatgtaata acgtggagag attgagtaag aagcaaatgt

 961 ttaattgtaa tgattgaaat atgctgacgt actcacaatg atttgaaagg attggtctga

1021 ttaggaagta tatatgtcta atacgagttt aaagactacg gcggaatgga atacgatacc

1081 ctggcgaaag ttagaacgta gcgtatacaa gcttcagaaa agaatatacc aagcttctca

1141 acgtggtgat actaaggcag tccgcaaact ccaaaaaacc ctgatgaggt cttggtctgg

1201 aaaagctctt gcagtaagaa aagtcaccca agataaccaa ggaaagaagg cagcaggtat

1261 tgacggtgta aaatccctta aaccatcagc cagactcact ttggtaatga atatgaagct

1321 taaccataag gtaaaagcaa ctcgtagagt atggattcca aaacccggaa acgttgaaaa

1381 acgaccacta ggaataccca caatgcaaga tagagcaact caatcgcttg tcaaactggc

1441 attagaacca gaatgggagg caaaatttga gcccaatagt tacggtttca gacccggacg

1501 caatgcccat gatgcaagag aagcaatatt taatagtatc agatactcaa acaaatgggt

1561 attagatgct gatatttcaa aatgcttcga taaaataaac cacgaaaaac tattgaccaa

1621 aattaacaca ttcccgacca tgaggaggca aataaaagca tggttgaaag caggagtact

1681 agacaatggt catttctcag aaactactga gggaacgcca caaggcggtg taatatctcc

1741 actattagcc aatattgcct tacatgggct agaaaagcta gtaaaggagt ttgcagccag

1801 ccagagagga ggaaaggtga agaatcagaa cagtatatcc ctaattagat atgcagacga

1861 ttttgtgatt cttgccccca ataaaactca aataatagta ctcaaagaaa tagtaaaaac

1921 gtggttagca gaaatgggac tggaattaaa ccctaacaaa acccgtatag tttcgacttt

1981 caaaagctca gagatattcg cctcgcaaga agtaggattt aatttcctcg gtttcaatgt

2041 gaaacaatac aaagtgggaa agaatgactc tggaaaatta tcgaacggta aaaagttagg

2101 gtataaaaca ctcatcaagc ctagtgtaaa atcagtaaag aaacactacg atgacatagc

2161 aagaataatc gataatcaca aaaatgctgc acaagaaaca ctaataagta aacttaaccc

2221 tgtaatcagg ggatgggtta actattactc aacatcagtg agtaaagaga tattctcaaa

2281 gctaagtcat ctaatatatc agaagctgaa acgctgggga aaacgtcgtc accctgacaa

2341 gtctaatgta tgggtaacca agaaatattg gcatacagta ggtggcgata actgggtatt

2401 cgcagcaaca aagaacggag aaatcacaat gaggctattc aaacactcac aaaaagaaat

2461 tgtgagacac gtaaaagtaa aaggtgatgc ctcaccgttc gatgggaact taaaatattg

2521 gagttcaaga aagggcgaaa atcccttagt acctaaaaga gtagcaatac tacttaaaaa

2581 gnntttgggg aaatgctctc attgcggatt gtactttaga gaagatgacc taatcgagat

2641 tgaccatatc attcctaaat cgcaaggtgg aaaggatgta tacgacaatc tgcaagcatt

2701 acataggcat tgtcacgacg ttaaaactgc cactgacaac tcttataatc aacctaagag

2761 cgatacagaa ataaatgtga tgtggtagtg agaagtaccc acgacttggg tcaagtcatt

2821 gaggagccgt atgaggtgaa agtctcaagt acggttttga agaccagcag gattggtgac

2881 agtcttgctg agtttaataa ctctcctcaa ctatcgtggg atattttgac cagagtgaaa

2941 aatctttacc accgactcaa acccactgag aaaagaatta tctttctggg acaaaacatc

3001 gaactacatg aatctttggt acgcctcatt ccctactgtg aagtgccact acccttggtc

3061 gagcagattg aggaacattt acagtcttac ctgcaatatt tgttagagtc agcacaagag

3121 caagaagtac agttcacagt ttccttggct gcggaagaaa gagaaacttt ggcaagggct

3181 gcactaggct tgacgcttga ggaaattagc gacttcctcc gcttaacagt caaagagcgt

3241 ttgagccgca atggtatcgt catagatgct actgtcacgc cattagttgt taaatacaaa

3301 actcggctac ttgcacaaat gggtattgag ttgg

[top]


[ORF sequence]

 

MSNTSLKTTAEWNTIPWRKLERSVYKLQKRIYQASQRGDTKAVRKLQKTLMRSWSGKA

LAVRKVTQDNQGKKAAGIDGVKSLKPSARLTLVMNMKLNHKVKATRRVWIPKPGNVEK

RPLGIPTMQDRATQSLVKLALEPEWEAKFEPNSYGFRPGRNAHDAREAIFNSIRYSNK

WVLDADISKCFDKINHEKLLTKINTFPTMRRQIKAWLKAGVLDNGHFSETTEGTPQGG

VISPLLANIALHGLEKLVKEFAASQRGGKVKNQNSISLIRYADDFVILAPNKTQIIVL

KEIVKTWLAEMGLELNPNKTRIVSTFKSSEIFASQEVGFNFLGFNVKQYKVGKNDSGK

LSNGKKLGYKTLIKPSVKSVKKHYDDIARIIDNHKNAAQETLISKLNPVIRGWVNYYS

TSVSKEIFSKLSHLIYQKLKRWGKRRHPDKSNVWVTKKYWHTVGGDNWVFAATKNGEI

TMRLFKHSQKEIVRHVKVKGDASPFDGNLKYWSSRKGENPLVPKRVAILLKKXLGKCS

HCGLYFREDDLIEIDHIIPKSQGGKDVYDNLQALHRHCHDVKTATDNSYNQPKSDTEI

NVMW

[top]


[Secondary structure]

 

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |