[Back to introns by organism] [Back to home page]
Information for C.w.I7 intron (Format of information for each intron)
[Intron and flanking sequence]
Sequence from Genbank entry (intron is on the antisense strand).
The boundaries of the intron are marked as red and ORF is marked
as blue, with start and stop codons underlined.
Intron on antisense strand
3' end
gt ttgagtcgag gggagctatt aaactccgcc cctctcagtt agatctggac
9541 gtgcgacttt caccgcatcc ggctcccgat aatctagact
tttccagtct tacccatgtg
9601 gatgtaatcg tgacaacttt gatggactgc caaaaggttt tgggttctcc agttatcatg
9661 attcccatca atgtgatgga attggacttt ttcatcagat atgaatttaa gtccacagtg
9721 tccgcatgat tggttctgtc ttcttagaca tctagaggtg aaatcagtgt agagtttgct
9781 gttacgcttg ctccaatata ctgtatctcc gtcgtagggg gatttatttc ctttgacgct
9841 gacgaatcga ttttctgcgt atggaactga tgggaatgct tttaaaagta attccctagt
9901 tttgtagcgg tcaagtttct tttccttatt gaatttcttg aaggctctat aagccgttct
9961 ccataaggaa aatctggagc cactcatttt gcagtatcta tgataatttc gccatcctct
10021 gactagaggt gcaatttttt ctgcttttac tctagcccca atattcgagt tattaacgat
10081 ttttttgact ttattgataa aagttttata gttatcctct gatggaacac atctaaactt
10141 tccgttttgt tggactttaa aatgccagcc caggaagtca aatccatctg tcgagtgtac
10201 tagctttgtt ttgcttgact ttatttttag tcctcgttcg gctagaaact gttcaaccct
10261 taaaagtatc aactcagcgt tatcttttgg ctttaaaatg aaaaccatat catccgcata
10321 acggatgcta ttgtggaggt cttctattcc atctaacgct atgtttgcta atagagggct
10381 gaccactccc ccttgggggg ttcggagttc tggaaattca gggccgactc ctgcctttag
10441 acatttccaa aggccaagtt taatagcttg tggtgctatg acccgttcca ttattgatgt
10501 gtggtcaatc ctgtcgaaac acttttctat atctagctct agtattctct tattaatccc
10561 attagcgtgt gagcttaggt tattgaagag aattttttgc gcatcgtgcg ttgagcgtcc
10621 gggtctgaac ccatagcttt tggcgtggaa aagtgcctca tgtgctggtt ctatggcgta
10681 tttaacaagg cattgccatg ctctgtctgc catagtggga accttcagca ttctggttgt
10741 tccatccttt ttagggatgg gaactgctct tagctttgaa tgattccatt tgttagctgt
10801 gagtaacctt tttgctaagg cgaacctttg tttgaagttc agggattttt gcccatccac
10861 tcctgcggtt ctttttccag cgtttagctg tgtaacttgt cggactgcca gtaatcttgc
10921 tgctgaggat ttcagaatta gcttttgcag gttctttgct cgtaccctgt cattctctcg
10981 aatggctttg aacagtcttg tttgtaatcg gaaaagattc ctttggagtt tcttccaagg
11041 gagcttttgc caaagttcac tgaagttgat ttcgtgtcta accatgattt
acgtacttga
11101 ccattttctg attacctcaa ggcagttacg ccttgtccta cccgaaataa aggggttcgg
11161 tgtctcgtct cacttacctg ggttttcgac ctttcccaag acctttattc cgtttttatt
11221 cgttccgacg gtgcgatttc ttgccatctt tagagcgaac cactttgccc atttgcctct
11281 cgagggttac agttatcatt cgaataatac tgtgagagta gcaaatcaac tcggtcattt
11341 ggattcagat tgaccggtca tacgtaggac acttttatag gttctgttta ctcatggatg
11401 actaccataa cgccagtgtg gtctatctgc tttgacctct gattgcgtcc ttttgccagc
11461 ttcagcctcc agaattccga gtctggtcac tgtggcaagg tttagagtca cctgtcctct
11521 caggggtgag aatttaactc acatcggctc cgtcagttat gtagttaaga attgttacga
11581 agaacctctt aactatcacc tcagcttatg aattaggctg agaacgaacc
gcac
5' end
Intron on sense strand
5' end
1 gtgcggttcg ttctcagcct aattcataag ctgaggtgat agttaagagg ttcttcgtaa
61 caattcttaa ctacataact gacggagccg atgtgagtta aattctcacc cctgagagga
121 caggtgactc taaaccttgc cacagtgacc agactcggaa ttctggaggc tgaagctggc
181 aaaaggacgc aatcagaggt caaagcagat agaccacact ggcgttatgg tagtcatcca
241 tgagtaaaca gaacctataa aagtgtccta cgtatgaccg gtcaatctga atccaaatga
301 ccgagttgat ttgctactct cacagtatta ttcgaatgat aactgtaacc ctcgagaggc
361 aaatgggcaa agtggttcgc tctaaagatg gcaagaaatc gcaccgtcgg aacgaataaa
421 aacggaataa aggtcttggg aaaggtcgaa aacccaggta agtgagacga gacaccgaac
481 ccctttattt cgggtaggac aaggcgtaac tgccttgagg taatcagaaa atggtcaagt
541 acgtaaatca tggttagaca cgaaatcaac ttcagtgaac tttggcaaaa gctcccttgg
601 aagaaactcc aaaggaatct tttccgatta caaacaagac tgttcaaagc cattcgagag
661 aatgacaggg tacgagcaaa gaacctgcaa aagctaattc tgaaatcctc agcagcaaga
721 ttactggcag tccgacaagt tacacagcta aacgctggaa aaagaaccgc aggagtggat
781 gggcaaaaat ccctgaactt caaacaaagg ttcgccttag caaaaaggtt actcacagct
841 aacaaatgga atcattcaaa gctaagagca gttcccatcc ctaaaaagga tggaacaacc
901 agaatgctga aggttcccac tatggcagac agagcatggc aatgccttgt taaatacgcc
961 atagaaccag cacatgaggc acttttccac gccaaaagct atgggttcag acccggacgc
1021 tcaacgcacg atgcgcaaaa aattctcttc aataacctaa gctcacacgc taatgggatt
1081 aataagagaa tactagagct agatatagaa aagtgtttcg acaggattga ccacacatca
1141 ataatggaac gggtcatagc accacaagct attaaacttg gcctttggaa atgtctaaag
1201 gcaggagtcg gccctgaatt tccagaactc cgaacccccc aagggggagt ggtcagccct
1261 ctattagcaa acatagcgtt agatggaata gaagacctcc acaatagcat ccgttatgcg
1321 gatgatatgg ttttcatttt aaagccaaaa gataacgctg agttgatact tttaagggtt
1381 gaacagtttc tagccgaacg aggactaaaa ataaagtcaa gcaaaacaaa gctagtacac
1441 tcgacagatg gatttgactt cctgggctgg cattttaaag tccaacaaaa cggaaagttt
1501 agatgtgttc catcagagga taactataaa acttttatca ataaagtcaa aaaaatcgtt
1561 aataactcga atattggggc tagagtaaaa gcagaaaaaa ttgcacctct agtcagagga
1621 tggcgaaatt atcatagata ctgcaaaatg agtggctcca gattttcctt atggagaacg
1681 gcttatagag ccttcaagaa attcaataag gaaaagaaac ttgaccgcta caaaactagg
1741 gaattacttt taaaagcatt cccatcagtt ccatacgcag aaaatcgatt cgtcagcgtc
1801 aaaggaaata aatcccccta cgacggagat acagtatatt ggagcaagcg taacagcaaa
1861 ctctacactg atttcacctc tagatgtcta agaagacaga accaatcatg cggacactgt
1921 ggacttaaat tcatatctga tgaaaaagtc caattccatc acattgatgg gaatcatgat
1981 aactggagaa cccaaaacct tttggcagtc catcaaagtt gtcacgatta catccacatg
2041 ggtaagactg gaaaagtcta gattatcggg agccggatgc ggtgaaagtc gcacgtccag
2101 atctaactga gaggggcgga gtttaatagc tcccctcgac tcaaac
3' end
[top]
[Intron and flanking sequence]
9121
caacatcttt ccagcctaaa atttctgtga cttttttctc cagatttttg taggtacgga
9181 aaaattccgc aatttcttct aaacggtgag gagcaatgtc tttgatagat tttacttcgc
9241 taaagcgtgg atcttcatca gggacacaaa gaactttttc atcgcgatct ccaccatcga
9301 tcatttctaa cataccaata ggacgggctg caatgacaca acccgggaaa gtaggttgat
9361 caatgatcac catgccatcc aaaggatcac cgtcatccgc taaggtatta ggaacaaacc
9421 catagtcgta aggatattgt accgaggcaa acaaaacgcg atcgagagca aaggctttca
9481 tatccttggt ttgagtcgag gggagctatt aaactccgcc
cctctcagtt agatctggac
9541 gtgcgacttt caccgcatcc ggctcccgat aatctagact tttccagtct tacccatgtg
9601 gatgtaatcg tgacaacttt gatggactgc caaaaggttt tgggttctcc agttatcatg
9661 attcccatca atgtgatgga attggacttt ttcatcagat atgaatttaa gtccacagtg
9721 tccgcatgat tggttctgtc ttcttagaca tctagaggtg aaatcagtgt agagtttgct
9781 gttacgcttg ctccaatata ctgtatctcc gtcgtagggg gatttatttc ctttgacgct
9841 gacgaatcga ttttctgcgt atggaactga tgggaatgct tttaaaagta attccctagt
9901 tttgtagcgg tcaagtttct tttccttatt gaatttcttg aaggctctat aagccgttct
9961 ccataaggaa aatctggagc cactcatttt gcagtatcta tgataatttc gccatcctct
10021 gactagaggt gcaatttttt ctgcttttac tctagcccca atattcgagt tattaacgat
10081 ttttttgact ttattgataa aagttttata gttatcctct gatggaacac atctaaactt
10141 tccgttttgt tggactttaa aatgccagcc caggaagtca aatccatctg tcgagtgtac
10201 tagctttgtt ttgcttgact ttatttttag tcctcgttcg gctagaaact gttcaaccct
10261 taaaagtatc aactcagcgt tatcttttgg ctttaaaatg aaaaccatat catccgcata
10321 acggatgcta ttgtggaggt cttctattcc atctaacgct atgtttgcta atagagggct
10381 gaccactccc ccttgggggg ttcggagttc tggaaattca gggccgactc ctgcctttag
10441 acatttccaa aggccaagtt taatagcttg tggtgctatg acccgttcca ttattgatgt
10501 gtggtcaatc ctgtcgaaac acttttctat atctagctct agtattctct tattaatccc
10561 attagcgtgt gagcttaggt tattgaagag aattttttgc gcatcgtgcg ttgagcgtcc
10621 gggtctgaac ccatagcttt tggcgtggaa aagtgcctca tgtgctggtt ctatggcgta
10681 tttaacaagg cattgccatg ctctgtctgc catagtggga accttcagca ttctggttgt
10741 tccatccttt ttagggatgg gaactgctct tagctttgaa tgattccatt tgttagctgt
10801 gagtaacctt tttgctaagg cgaacctttg tttgaagttc agggattttt gcccatccac
10861 tcctgcggtt ctttttccag cgtttagctg tgtaacttgt cggactgcca gtaatcttgc
10921 tgctgaggat ttcagaatta gcttttgcag gttctttgct cgtaccctgt cattctctcg
10981 aatggctttg aacagtcttg tttgtaatcg gaaaagattc ctttggagtt tcttccaagg
11041 gagcttttgc caaagttcac tgaagttgat ttcgtgtcta accatgattt acgtacttga
11101 ccattttctg attacctcaa ggcagttacg ccttgtccta cccgaaataa aggggttcgg
11161 tgtctcgtct cacttacctg ggttttcgac ctttcccaag acctttattc cgtttttatt
11221 cgttccgacg gtgcgatttc ttgccatctt tagagcgaac cactttgccc atttgcctct
11281 cgagggttac agttatcatt cgaataatac tgtgagagta gcaaatcaac tcggtcattt
11341 ggattcagat tgaccggtca tacgtaggac acttttatag gttctgttta ctcatggatg
11401 actaccataa cgccagtgtg gtctatctgc tttgacctct gattgcgtcc ttttgccagc
11461 ttcagcctcc agaattccga gtctggtcac tgtggcaagg tttagagtca cctgtcctct
11521 caggggtgag aatttaactc acatcggctc cgtcagttat gtagttaaga attgttacga
11581 agaacctctt aactatcacc tcagcttatg aattaggctg agaacgaacc gcactcatac
11641 tcgtatttat tcttgcttcc tgccggaatt tcgattaaaa cattgattag accaggtttt
11701 ggttgagccg gaatgagtga taagtccacg attttctccg aatgcttact aattactttg
11761 attgaggaag atagtgagat tatggcttga gaacgttaat attattcttt cttgagtcca
11821 taattttccc ttcggtgctt attttacaac gaatttaccc caagattgat atgaatcaca
11881 tgatcactgt gatcaaaacc attgagggta aaggagaaac cgaagataat aaatacagga
11941 ttcaaaaagc gggtgcatac aacagcacag ggttatttat tgtttttaac agtgaatggc
[top]
MVRHEINFSELWQKLPWKKLQRNLFRLQTRLFKAIRENDRVRAKNLQKLILKSSAARL
LAVRQVTQLNAGKRTAGVDGQKSLNFKQRFALAKRLLTANKWNHSKLRAVPIPKKDGT
TRMLKVPTMADRAWQCLVKYAIEPAHEALFHAKSYGFRPGRSTHDAQKILFNNLSSHA
NGINKRILELDIEKCFDRIDHTSIMERVIAPQAIKLGLWKCLKAGVGPEFPELRTPQG
GVVSPLLANIALDGIEDLHNSIRYADDMVFILKPKDNAELILLRVEQFLAERGLKIKS
SKTKLVHSTDGFDFLGWHFKVQQNGKFRCVPSEDNYKTFINKVKKIVNNSNIGARVKA
EKIAPLVRGWRNYHRYCKMSGSRFSLWRTAYRAFKKFNKEKKLDRYKTRELLLKAFPS
VPYAENRFVSVKGNKSPYDGDTVYWSKRNSKLYTDFTSRCLRRQNQSCGHCGLKFISD
EKVQFHHIDGNHDNWRTQNLLAVHQSCHDYIHMGKTGKV
[top]

[top]
| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragments | Alignment of insertion sequence |
| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |