
Information for C.w.I7 intron (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence]

 

Sequence from the GenBank entry (the intron is on the antisense strand).

The intron boundaries are marked in red and the ORF in blue, with the
start and stop codons underlined.

 

Intron on antisense strand

 

3' end

             gt ttgagtcgag gggagctatt aaactccgcc cctctcagtt agatctggac
 9541 gtgcgacttt caccgcatcc ggctcccgat aatctagact tttccagtct tacccatgtg
 9601 gatgtaatcg tgacaacttt gatggactgc caaaaggttt tgggttctcc agttatcatg
 9661 attcccatca atgtgatgga attggacttt ttcatcagat atgaatttaa gtccacagtg
 9721 tccgcatgat tggttctgtc ttcttagaca tctagaggtg aaatcagtgt agagtttgct
 9781 gttacgcttg ctccaatata ctgtatctcc gtcgtagggg gatttatttc ctttgacgct
 9841 gacgaatcga ttttctgcgt atggaactga tgggaatgct tttaaaagta attccctagt
 9901 tttgtagcgg tcaagtttct tttccttatt gaatttcttg aaggctctat aagccgttct
 9961 ccataaggaa aatctggagc cactcatttt gcagtatcta tgataatttc gccatcctct
10021 gactagaggt gcaatttttt ctgcttttac tctagcccca atattcgagt tattaacgat
10081 ttttttgact ttattgataa aagttttata gttatcctct gatggaacac atctaaactt
10141 tccgttttgt tggactttaa aatgccagcc caggaagtca aatccatctg tcgagtgtac
10201 tagctttgtt ttgcttgact ttatttttag tcctcgttcg gctagaaact gttcaaccct
10261 taaaagtatc aactcagcgt tatcttttgg ctttaaaatg aaaaccatat catccgcata
10321 acggatgcta ttgtggaggt cttctattcc atctaacgct atgtttgcta atagagggct
10381 gaccactccc ccttgggggg ttcggagttc tggaaattca gggccgactc ctgcctttag
10441 acatttccaa aggccaagtt taatagcttg tggtgctatg acccgttcca ttattgatgt
10501 gtggtcaatc ctgtcgaaac acttttctat atctagctct agtattctct tattaatccc
10561 attagcgtgt gagcttaggt tattgaagag aattttttgc gcatcgtgcg ttgagcgtcc
10621 gggtctgaac ccatagcttt tggcgtggaa aagtgcctca tgtgctggtt ctatggcgta
10681 tttaacaagg cattgccatg ctctgtctgc catagtggga accttcagca ttctggttgt
10741 tccatccttt ttagggatgg gaactgctct tagctttgaa tgattccatt tgttagctgt
10801 gagtaacctt tttgctaagg cgaacctttg tttgaagttc agggattttt gcccatccac
10861 tcctgcggtt ctttttccag cgtttagctg tgtaacttgt cggactgcca gtaatcttgc
10921 tgctgaggat ttcagaatta gcttttgcag gttctttgct cgtaccctgt cattctctcg
10981 aatggctttg aacagtcttg tttgtaatcg gaaaagattc ctttggagtt tcttccaagg
11041 gagcttttgc caaagttcac tgaagttgat ttcgtgtcta accatgattt acgtacttga
11101 ccattttctg attacctcaa ggcagttacg ccttgtccta cccgaaataa aggggttcgg
11161 tgtctcgtct cacttacctg ggttttcgac ctttcccaag acctttattc cgtttttatt
11221 cgttccgacg gtgcgatttc ttgccatctt tagagcgaac cactttgccc atttgcctct
11281 cgagggttac agttatcatt cgaataatac tgtgagagta gcaaatcaac tcggtcattt
11341 ggattcagat tgaccggtca tacgtaggac acttttatag gttctgttta ctcatggatg
11401 actaccataa cgccagtgtg gtctatctgc tttgacctct gattgcgtcc ttttgccagc
11461 ttcagcctcc agaattccga gtctggtcac tgtggcaagg tttagagtca cctgtcctct
11521 caggggtgag aatttaactc acatcggctc cgtcagttat gtagttaaga attgttacga
11581 agaacctctt aactatcacc tcagcttatg aattaggctg agaacgaacc gcac

5' end

Intron on sense strand

5' end

   1 gtgcggttcg ttctcagcct aattcataag ctgaggtgat agttaagagg ttcttcgtaa 

  61 caattcttaa ctacataact gacggagccg atgtgagtta aattctcacc cctgagagga 

 121 caggtgactc taaaccttgc cacagtgacc agactcggaa ttctggaggc tgaagctggc 

 181 aaaaggacgc aatcagaggt caaagcagat agaccacact ggcgttatgg tagtcatcca 

 241 tgagtaaaca gaacctataa aagtgtccta cgtatgaccg gtcaatctga atccaaatga 

 301 ccgagttgat ttgctactct cacagtatta ttcgaatgat aactgtaacc ctcgagaggc 

 361 aaatgggcaa agtggttcgc tctaaagatg gcaagaaatc gcaccgtcgg aacgaataaa 

 421 aacggaataa aggtcttggg aaaggtcgaa aacccaggta agtgagacga gacaccgaac 

 481 ccctttattt cgggtaggac aaggcgtaac tgccttgagg taatcagaaa atggtcaagt 

 541 acgtaaatca tggttagaca cgaaatcaac ttcagtgaac tttggcaaaa gctcccttgg 

 601 aagaaactcc aaaggaatct tttccgatta caaacaagac tgttcaaagc cattcgagag 

 661 aatgacaggg tacgagcaaa gaacctgcaa aagctaattc tgaaatcctc agcagcaaga 

 721 ttactggcag tccgacaagt tacacagcta aacgctggaa aaagaaccgc aggagtggat 

 781 gggcaaaaat ccctgaactt caaacaaagg ttcgccttag caaaaaggtt actcacagct 

 841 aacaaatgga atcattcaaa gctaagagca gttcccatcc ctaaaaagga tggaacaacc 

 901 agaatgctga aggttcccac tatggcagac agagcatggc aatgccttgt taaatacgcc 

 961 atagaaccag cacatgaggc acttttccac gccaaaagct atgggttcag acccggacgc 

1021 tcaacgcacg atgcgcaaaa aattctcttc aataacctaa gctcacacgc taatgggatt 

1081 aataagagaa tactagagct agatatagaa aagtgtttcg acaggattga ccacacatca 

1141 ataatggaac gggtcatagc accacaagct attaaacttg gcctttggaa atgtctaaag 

1201 gcaggagtcg gccctgaatt tccagaactc cgaacccccc aagggggagt ggtcagccct 

1261 ctattagcaa acatagcgtt agatggaata gaagacctcc acaatagcat ccgttatgcg 

1321 gatgatatgg ttttcatttt aaagccaaaa gataacgctg agttgatact tttaagggtt 

1381 gaacagtttc tagccgaacg aggactaaaa ataaagtcaa gcaaaacaaa gctagtacac 

1441 tcgacagatg gatttgactt cctgggctgg cattttaaag tccaacaaaa cggaaagttt 

1501 agatgtgttc catcagagga taactataaa acttttatca ataaagtcaa aaaaatcgtt 

1561 aataactcga atattggggc tagagtaaaa gcagaaaaaa ttgcacctct agtcagagga 

1621 tggcgaaatt atcatagata ctgcaaaatg agtggctcca gattttcctt atggagaacg 

1681 gcttatagag ccttcaagaa attcaataag gaaaagaaac ttgaccgcta caaaactagg 

1741 gaattacttt taaaagcatt cccatcagtt ccatacgcag aaaatcgatt cgtcagcgtc 

1801 aaaggaaata aatcccccta cgacggagat acagtatatt ggagcaagcg taacagcaaa 

1861 ctctacactg atttcacctc tagatgtcta agaagacaga accaatcatg cggacactgt 

1921 ggacttaaat tcatatctga tgaaaaagtc caattccatc acattgatgg gaatcatgat 

1981 aactggagaa cccaaaacct tttggcagtc catcaaagtt gtcacgatta catccacatg 

2041 ggtaagactg gaaaagtcta gattatcggg agccggatgc ggtgaaagtc gcacgtccag 

2101 atctaactga gaggggcgga gtttaatagc tcccctcgac tcaaac

3' end
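
The two listings above are related by reverse complementation: the sense-strand sequence is the antisense-strand sequence read backwards with each base swapped for its partner. A minimal Python sketch of that conversion follows. The function name `reverse_complement` is illustrative and not part of the database, and the sketch assumes the sequence contains only a, c, g and t.

```python
# Map each base to its Watson-Crick partner (lower and upper case).
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string.

    Whitespace (the 10-base grouping used in the listings) is removed first.
    """
    cleaned = "".join(seq.split())
    return cleaned.translate(COMPLEMENT)[::-1]


# Last 14 bases of the antisense listing (end of the line numbered 11581).
antisense_end = "agaacgaacc gcac"
print(reverse_complement(antisense_end))  # gtgcggttcgttct
```

The printed string matches the first 14 bases of the sense-strand listing (gtgcggttcg ttct), which is a quick way to confirm that the two listings describe the same intron.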



[Intron and flanking sequence]

 

 9121 caacatcttt ccagcctaaa atttctgtga cttttttctc cagatttttg taggtacgga
 9181 aaaattccgc aatttcttct aaacggtgag gagcaatgtc tttgatagat tttacttcgc
 9241 taaagcgtgg atcttcatca gggacacaaa gaactttttc atcgcgatct ccaccatcga
 9301 tcatttctaa cataccaata ggacgggctg caatgacaca acccgggaaa gtaggttgat
 9361 caatgatcac catgccatcc aaaggatcac cgtcatccgc taaggtatta ggaacaaacc
 9421 catagtcgta aggatattgt accgaggcaa acaaaacgcg atcgagagca aaggctttca
 9481 tatccttggt ttgagtcgag gggagctatt aaactccgcc cctctcagtt agatctggac
 9541 gtgcgacttt caccgcatcc ggctcccgat aatctagact tttccagtct tacccatgtg
 9601 gatgtaatcg tgacaacttt gatggactgc caaaaggttt tgggttctcc agttatcatg
 9661 attcccatca atgtgatgga attggacttt ttcatcagat atgaatttaa gtccacagtg
 9721 tccgcatgat tggttctgtc ttcttagaca tctagaggtg aaatcagtgt agagtttgct
 9781 gttacgcttg ctccaatata ctgtatctcc gtcgtagggg gatttatttc ctttgacgct
 9841 gacgaatcga ttttctgcgt atggaactga tgggaatgct tttaaaagta attccctagt
 9901 tttgtagcgg tcaagtttct tttccttatt gaatttcttg aaggctctat aagccgttct
 9961 ccataaggaa aatctggagc cactcatttt gcagtatcta tgataatttc gccatcctct
10021 gactagaggt gcaatttttt ctgcttttac tctagcccca atattcgagt tattaacgat
10081 ttttttgact ttattgataa aagttttata gttatcctct gatggaacac atctaaactt
10141 tccgttttgt tggactttaa aatgccagcc caggaagtca aatccatctg tcgagtgtac
10201 tagctttgtt ttgcttgact ttatttttag tcctcgttcg gctagaaact gttcaaccct
10261 taaaagtatc aactcagcgt tatcttttgg ctttaaaatg aaaaccatat catccgcata
10321 acggatgcta ttgtggaggt cttctattcc atctaacgct atgtttgcta atagagggct
10381 gaccactccc ccttgggggg ttcggagttc tggaaattca gggccgactc ctgcctttag
10441 acatttccaa aggccaagtt taatagcttg tggtgctatg acccgttcca ttattgatgt
10501 gtggtcaatc ctgtcgaaac acttttctat atctagctct agtattctct tattaatccc
10561 attagcgtgt gagcttaggt tattgaagag aattttttgc gcatcgtgcg ttgagcgtcc
10621 gggtctgaac ccatagcttt tggcgtggaa aagtgcctca tgtgctggtt ctatggcgta
10681 tttaacaagg cattgccatg ctctgtctgc catagtggga accttcagca ttctggttgt
10741 tccatccttt ttagggatgg gaactgctct tagctttgaa tgattccatt tgttagctgt
10801 gagtaacctt tttgctaagg cgaacctttg tttgaagttc agggattttt gcccatccac
10861 tcctgcggtt ctttttccag cgtttagctg tgtaacttgt cggactgcca gtaatcttgc
10921 tgctgaggat ttcagaatta gcttttgcag gttctttgct cgtaccctgt cattctctcg
10981 aatggctttg aacagtcttg tttgtaatcg gaaaagattc ctttggagtt tcttccaagg
11041 gagcttttgc caaagttcac tgaagttgat ttcgtgtcta accatgattt acgtacttga
11101 ccattttctg attacctcaa ggcagttacg ccttgtccta cccgaaataa aggggttcgg
11161 tgtctcgtct cacttacctg ggttttcgac ctttcccaag acctttattc cgtttttatt
11221 cgttccgacg gtgcgatttc ttgccatctt tagagcgaac cactttgccc atttgcctct
11281 cgagggttac agttatcatt cgaataatac tgtgagagta gcaaatcaac tcggtcattt
11341 ggattcagat tgaccggtca tacgtaggac acttttatag gttctgttta ctcatggatg
11401 actaccataa cgccagtgtg gtctatctgc tttgacctct gattgcgtcc ttttgccagc
11461 ttcagcctcc agaattccga gtctggtcac tgtggcaagg tttagagtca cctgtcctct
11521 caggggtgag aatttaactc acatcggctc cgtcagttat gtagttaaga attgttacga
11581 agaacctctt aactatcacc tcagcttatg aattaggctg agaacgaacc gcactcatac
11641 tcgtatttat tcttgcttcc tgccggaatt tcgattaaaa cattgattag accaggtttt
11701 ggttgagccg gaatgagtga taagtccacg attttctccg aatgcttact aattactttg
11761 attgaggaag atagtgagat tatggcttga gaacgttaat attattcttt cttgagtcca
11821 taattttccc ttcggtgctt attttacaac gaatttaccc caagattgat atgaatcaca
11881 tgatcactgt gatcaaaacc attgagggta aaggagaaac cgaagataat aaatacagga
11941 ttcaaaaagc gggtgcatac aacagcacag ggttatttat tgtttttaac agtgaatggc
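
The numbered listing above can be turned into a plain string and sliced by entry coordinates. The sketch below is one way to do that in Python. The helper names `parse_numbered_sequence` and `slice_by_coordinates` are hypothetical, and the coordinate 9489 used in the example is inferred by comparing the intron listing with the flanking listing, not stated by the database.

```python
def parse_numbered_sequence(text):
    """Return (start_coordinate, sequence) from GenBank-style numbered lines.

    Each line is expected to look like '9481 tatccttggt ttgagtcgag ...'.
    Lines without a leading number are treated as continuations.
    """
    start = None
    parts = []
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue
        if tokens[0].isdigit():
            if start is None:
                start = int(tokens[0])
            tokens = tokens[1:]
        parts.extend(tokens)
    return start, "".join(parts).lower()


def slice_by_coordinates(start, sequence, first, last):
    """Extract residues first..last (1-based, inclusive, entry coordinates)."""
    return sequence[first - start : last - start + 1]


# Two lines copied from the flanking-sequence listing.
SAMPLE = """
 9481 tatccttggt ttgagtcgag gggagctatt aaactccgcc cctctcagtt agatctggac
 9541 gtgcgacttt caccgcatcc ggctcccgat aatctagact tttccagtct tacccatgtg
"""

start, seq = parse_numbered_sequence(SAMPLE)
# Assumed first intron base on this strand: 9489 (the 'gt' that opens
# the antisense intron listing).
print(slice_by_coordinates(start, seq, 9489, 9500))  # gtttgagtcgag
```

Keeping the start coordinate alongside the string avoids off-by-one mistakes when a listing does not begin at position 1.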



[ORF sequence]

 

MVRHEINFSELWQKLPWKKLQRNLFRLQTRLFKAIRENDRVRAKNLQKLILKSSAARL

LAVRQVTQLNAGKRTAGVDGQKSLNFKQRFALAKRLLTANKWNHSKLRAVPIPKKDGT

TRMLKVPTMADRAWQCLVKYAIEPAHEALFHAKSYGFRPGRSTHDAQKILFNNLSSHA

NGINKRILELDIEKCFDRIDHTSIMERVIAPQAIKLGLWKCLKAGVGPEFPELRTPQG

GVVSPLLANIALDGIEDLHNSIRYADDMVFILKPKDNAELILLRVEQFLAERGLKIKS

SKTKLVHSTDGFDFLGWHFKVQQNGKFRCVPSEDNYKTFINKVKKIVNNSNIGARVKA

EKIAPLVRGWRNYHRYCKMSGSRFSLWRTAYRAFKKFNKEKKLDRYKTRELLLKAFPS

VPYAENRFVSVKGNKSPYDGDTVYWSKRNSKLYTDFTSRCLRRQNQSCGHCGLKFISD

EKVQFHHIDGNHDNWRTQNLLAVHQSCHDYIHMGKTGKV
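
The protein above can be reproduced by translating the sense-strand intron sequence from the atg in the line numbered 541 up to the first in-frame stop codon. The sketch below assumes the standard genetic code, which is a choice made for this example and not something the page specifies. The name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1 and stop at the first stop codon.

    Whitespace is ignored. A trailing partial codon is dropped.
    """
    cleaned = "".join(dna.split()).lower()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i : i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the ORF, copied from the sense-strand listing.
print(translate("atggttagac acgaaatcaa c"))  # MVRHEIN
```

The output matches the first seven residues of the ORF sequence shown above. Feeding the function the full sense-strand sequence from that atg onward should yield the whole protein under the same assumption.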



[Secondary structure]

                                                   


