[Back to introns by organism] [Back to home page]
Information of C.sp.I1 intron (Format of information for each intron)
[Intron with flanking sequence]
Sequence
from Genbank entry. Intron is identified in red.
The ORF
is identified in blue with start and stop codons
underlined.
5' end
gtgcg attcgttggc tagaaaccaa tcccaatagg
481 gcttaaggag gattagccgc ttaagattca tgagaatctt ttgataactg aataacaatg
541 taggtgcaaa acctaccctc caagtgctgg agtgaaagca tatcccctac tgtagagact
601 agccatcaat cggtaaagcg gggataaaag gtggaaatac tgcaagccta atgtggtaat
661 caccaagcaa agtatccatg agtaaagcca aggcttaaag acacgtctga atcatcgtga
721 taaaacagta gcgcaatcag gctgaatcgc tacaaccaaa aggtaaagga taaaggactg
781 gcaaatttta ccttgaccag tatgcactcc caataaaccc ggtgaatagt gtcactctaa
841 agatacatag cattgttatt cggaacgaag taacccgtta tttactctca gatggtcgag
901 tatctgagta ggtgctaacc ggatgtaata acgtggagag attgagtaag aagcaaatgt
961 ttaattgtaa tgattgaaat atgctgacgt actcacaatg atttgaaagg attggtctga
1021 ttaggaagta tatatgtcta atacgagttt aaagactacg gcggaatgga atacgatacc
1081 ctggcgaaag ttagaacgta gcgtatacaa gcttcagaaa agaatatacc aagcttctca
1141 acgtggtgat actaaggcag tccgcaaact ccaaaaaacc ctgatgaggt cttggtctgg
1201 aaaagctctt gcagtaagaa aagtcaccca agataaccaa ggaaagaagg cagcaggtat
1261 tgacggtgta aaatccctta aaccatcagc cagactcact ttggtaatga atatgaagct
1321 taaccataag gtaaaagcaa ctcgtagagt atggattcca aaacccggaa acgttgaaaa
1381 acgaccacta ggaataccca caatgcaaga tagagcaact caatcgcttg tcaaactggc
1441 attagaacca gaatgggagg caaaatttga gcccaatagt tacggtttca gacccggacg
1501 caatgcccat gatgcaagag aagcaatatt taatagtatc agatactcaa acaaatgggt
1561 attagatgct gatatttcaa aatgcttcga taaaataaac cacgaaaaac tattgaccaa
1621 aattaacaca ttcccgacca tgaggaggca aataaaagca tggttgaaag caggagtact
1681 agacaatggt catttctcag aaactactga gggaacgcca caaggcggtg taatatctcc
1741 actattagcc aatattgcct tacatgggct agaaaagcta gtaaaggagt ttgcagccag
1801 ccagagagga ggaaaggtga agaatcagaa cagtatatcc ctaattagat atgcagacga
1861 ttttgtgatt cttgccccca ataaaactca aataatagta ctcaaagaaa tagtaaaaac
1921 gtggttagca gaaatgggac tggaattaaa ccctaacaaa acccgtatag tttcgacttt
1981 caaaagctca gagatattcg cctcgcaaga agtaggattt aatttcctcg gtttcaatgt
2041 gaaacaatac aaagtgggaa agaatgactc tggaaaatta tcgaacggta aaaagttagg
2101 gtataaaaca ctcatcaagc ctagtgtaaa atcagtaaag aaacactacg atgacatagc
2161 aagaataatc gataatcaca aaaatgctgc acaagaaaca ctaataagta aacttaaccc
2221 tgtaatcagg ggatgggtta actattactc aacatcagtg agtaaagaga tattctcaaa
2281 gctaagtcat ctaatatatc agaagctgaa acgctgggga aaacgtcgtc accctgacaa
2341 gtctaatgta tgggtaacca agaaatattg gcatacagta ggtggcgata actgggtatt
2401 cgcagcaaca aagaacggag aaatcacaat gaggctattc aaacactcac aaaaagaaat
2461 tgtgagacac gtaaaagtaa aaggtgatgc ctcaccgttc gatgggaact taaaatattg
2521 gagttcaaga aagggcgaaa atcccttagt acctaaaaga gtagcaatac tacttaaaaa
2581 gnntttgggg aaatgctctc attgcggatt gtactttaga gaagatgacc taatcgagat
2641 tgaccatatc attcctaaat cgcaaggtgg aaaggatgta tacgacaatc tgcaagcatt
2701 acataggcat tgtcacgacg ttaaaactgc cactgacaac tcttataatc aacctaagag
2761 cgatacagaa ataaatgtga tgtggtagtg agaagtaccc acgacttggg tcaagtcatt
2821 gaggagccgt atgaggtgaa agtctcaagt acggttttga agaccagcag gattggtgac
2881 agtcttgctg agtttaat
3' end
[Intron and flanking sequence]
1 gcaacggatc acctcaaatc atcaaatacg cctacagcaa taagaaaaat ggggaatcag
61 cacattcccc taaaacaaaa ttctcccttg caaactacca aaaaaattca aaactatgaa
121 cctatccaat attttcagca ctctggactc gcaaattccg attgtggcat tggaagtgct
181 atccccagag gaagccacaa tcattcaatg gctaacaaca actgctcaag aaaagctcac
241 cctaccagtg tacttctgga acttaggagt ttctacctta gagcaatgcc tgattgctgt
301 tgatggggga ctggtattca agccagtgcc agattacaaa aaaccgcctc ttgctgaccc
361 attgatattt atctttgatt gcatcaacag ctttgatggt gctggtgtgt ttatcttagg
421 tgatgttcac ccatttatcg gcaaagtgcg attcgttggc tagaaaccaa tcccaatagg
481 gcttaaggag gattagccgc ttaagattca tgagaatctt ttgataactg aataacaatg
541 taggtgcaaa acctaccctc caagtgctgg agtgaaagca tatcccctac tgtagagact
601 agccatcaat cggtaaagcg gggataaaag gtggaaatac tgcaagccta atgtggtaat
661 caccaagcaa agtatccatg agtaaagcca aggcttaaag acacgtctga atcatcgtga
721 taaaacagta gcgcaatcag gctgaatcgc tacaaccaaa aggtaaagga taaaggactg
781 gcaaatttta ccttgaccag tatgcactcc caataaaccc ggtgaatagt gtcactctaa
841 agatacatag cattgttatt cggaacgaag taacccgtta tttactctca gatggtcgag
901 tatctgagta ggtgctaacc ggatgtaata acgtggagag attgagtaag aagcaaatgt
961 ttaattgtaa tgattgaaat atgctgacgt actcacaatg atttgaaagg attggtctga
1021 ttaggaagta tatatgtcta atacgagttt aaagactacg gcggaatgga atacgatacc
1081 ctggcgaaag ttagaacgta gcgtatacaa gcttcagaaa agaatatacc aagcttctca
1141 acgtggtgat actaaggcag tccgcaaact ccaaaaaacc ctgatgaggt cttggtctgg
1201 aaaagctctt gcagtaagaa aagtcaccca agataaccaa ggaaagaagg cagcaggtat
1261 tgacggtgta aaatccctta aaccatcagc cagactcact ttggtaatga atatgaagct
1321 taaccataag gtaaaagcaa ctcgtagagt atggattcca aaacccggaa acgttgaaaa
1381 acgaccacta ggaataccca caatgcaaga tagagcaact caatcgcttg tcaaactggc
1441 attagaacca gaatgggagg caaaatttga gcccaatagt tacggtttca gacccggacg
1501 caatgcccat gatgcaagag aagcaatatt taatagtatc agatactcaa acaaatgggt
1561 attagatgct gatatttcaa aatgcttcga taaaataaac cacgaaaaac tattgaccaa
1621 aattaacaca ttcccgacca tgaggaggca aataaaagca tggttgaaag caggagtact
1681 agacaatggt catttctcag aaactactga gggaacgcca caaggcggtg taatatctcc
1741 actattagcc aatattgcct tacatgggct agaaaagcta gtaaaggagt ttgcagccag
1801 ccagagagga ggaaaggtga agaatcagaa cagtatatcc ctaattagat atgcagacga
1861 ttttgtgatt cttgccccca ataaaactca aataatagta ctcaaagaaa tagtaaaaac
1921 gtggttagca gaaatgggac tggaattaaa ccctaacaaa acccgtatag tttcgacttt
1981 caaaagctca gagatattcg cctcgcaaga agtaggattt aatttcctcg gtttcaatgt
2041 gaaacaatac aaagtgggaa agaatgactc tggaaaatta tcgaacggta aaaagttagg
2101 gtataaaaca ctcatcaagc ctagtgtaaa atcagtaaag aaacactacg atgacatagc
2161 aagaataatc gataatcaca aaaatgctgc acaagaaaca ctaataagta aacttaaccc
2221 tgtaatcagg ggatgggtta actattactc aacatcagtg agtaaagaga tattctcaaa
2281 gctaagtcat ctaatatatc agaagctgaa acgctgggga aaacgtcgtc accctgacaa
2341 gtctaatgta tgggtaacca agaaatattg gcatacagta ggtggcgata actgggtatt
2401 cgcagcaaca aagaacggag aaatcacaat gaggctattc aaacactcac aaaaagaaat
2461 tgtgagacac gtaaaagtaa aaggtgatgc ctcaccgttc gatgggaact taaaatattg
2521 gagttcaaga aagggcgaaa atcccttagt acctaaaaga gtagcaatac tacttaaaaa
2581 gnntttgggg aaatgctctc attgcggatt gtactttaga gaagatgacc taatcgagat
2641 tgaccatatc attcctaaat cgcaaggtgg aaaggatgta tacgacaatc tgcaagcatt
2701 acataggcat tgtcacgacg ttaaaactgc cactgacaac tcttataatc aacctaagag
2761 cgatacagaa ataaatgtga tgtggtagtg agaagtaccc acgacttggg tcaagtcatt
2821 gaggagccgt atgaggtgaa agtctcaagt acggttttga agaccagcag gattggtgac
2881 agtcttgctg agtttaataa ctctcctcaa ctatcgtggg atattttgac cagagtgaaa
2941 aatctttacc accgactcaa acccactgag aaaagaatta tctttctggg acaaaacatc
3001 gaactacatg aatctttggt acgcctcatt ccctactgtg aagtgccact acccttggtc
3061 gagcagattg aggaacattt acagtcttac ctgcaatatt tgttagagtc agcacaagag
3121 caagaagtac agttcacagt ttccttggct gcggaagaaa gagaaacttt ggcaagggct
3181 gcactaggct tgacgcttga ggaaattagc gacttcctcc gcttaacagt caaagagcgt
3241 ttgagccgca atggtatcgt catagatgct actgtcacgc cattagttgt taaatacaaa
3301 actcggctac ttgcacaaat gggtattgag ttgg
MSNTSLKTTAEWNTIPWRKLERSVYKLQKRIYQASQRGDTKAVRKLQKTLMRSWSGKA
LAVRKVTQDNQGKKAAGIDGVKSLKPSARLTLVMNMKLNHKVKATRRVWIPKPGNVEK
RPLGIPTMQDRATQSLVKLALEPEWEAKFEPNSYGFRPGRNAHDAREAIFNSIRYSNK
WVLDADISKCFDKINHEKLLTKINTFPTMRRQIKAWLKAGVLDNGHFSETTEGTPQGG
VISPLLANIALHGLEKLVKEFAASQRGGKVKNQNSISLIRYADDFVILAPNKTQIIVL
KEIVKTWLAEMGLELNPNKTRIVSTFKSSEIFASQEVGFNFLGFNVKQYKVGKNDSGK
LSNGKKLGYKTLIKPSVKSVKKHYDDIARIIDNHKNAAQETLISKLNPVIRGWVNYYS
TSVSKEIFSKLSHLIYQKLKRWGKRRHPDKSNVWVTKKYWHTVGGDNWVFAATKNGEI
TMRLFKHSQKEIVRHVKVKGDASPFDGNLKYWSSRKGENPLVPKRVAILLKKXLGKCS
HCGLYFREDDLIEIDHIIPKSQGGKDVYDNLQALHRHCHDVKTATDNSYNQPKSDTEI
NVMW

| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragments | Alignment of insertion sequence |
| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |