[Back to introns by organism]  [Back to home page]

Information for Ch.ph.I2-1 intron  (Format of information for each intron

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

Note: Redundant intron copies found in

CP000492 Chlorobium phaeobacteroides DSM 266 (2463357-2465245)

[Intron sequence]

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

Intron on antisense strand

3' end

                                                             atcccggta
3841 gggacaagag ttacccctca ccccccggac agatccggac aagcggaatt cccgcatccg
3901 gctcctaact tgagtgtttg acgcaaaacc gttgttccgg ccatgggtga aggatctttg
3961 ggtttgggag gtactgatca atatatatca tcattcgctc ccatgtcatc acatgccgct
4021 ggcttcggcg cctgaggctc ttcatccaac gtctggcaac atggtagcgg aatgccgata
4081 acgccctcgt gtttgtgggc actgcgaaat aggcatagtg acctctcaac acgctgttca
4141 gccactttcc ctgttccgat actgagacat gcattcgccg tctcagctct ttctttacct
4201 ctccgagctt ggttctcact gatccgcgct tcgtcttgcg caagagctga aagcctttcc
4261 ccactacttt ctctccacag atgtgggtaa atccgagaaa gtcaaaggtt tccggccgcc
4321 cttgtccccg tctttgtcgg ttcttggcag catacctgcc gaactcaatc agtcgggttt
4381 tctctggatg caacgttagc gcgaactttg ccagtcgctc tttcagatca gctaggaatc
4441 tttcgccatc ttccttgttc tggaagcctg cgacactatc gtcagcgaaa cgcacgacga
4501 tcatatcacc tttgcagtgc ttttcccgcc actgcttagc ccacaggtca taagcatagt
4561 gaaggtagat gttcgcaagg agcggtgata tcacggctcc ctgcggtgtt ccttcttctg
4621 cctcgatccg tacgctatct tccagtaccc cagctttcag ccacttgata atcagtcgga
4681 caatccgctt gtcaccgata cggtgttcca aaaagcgtat catccactca tggctgatcg
4741 tatcaaaaaa ccgggaaata tcggcgtcca atacccagcc gattttcttc accttgatcc
4801 cgtatgcgag tgcatccagc gcatcatgct gactgcgtcc cggacggaat ccatagctga
4861 accccagaaa ctccgcttcg tagattggcg tgagaatcgc caccactgcc cgctgtacaa
4921 ttttgtcttc caatgcggcg atgccgagcg gtcgttgttg gccatttgcc ttcggtatgt
4981 acttccggcg cgatggctgt gcccggtacg ctcccgtgtg gattcgccgg tgcaggtctg
5041 caaggttctc ttccagtcct tccccgtaat ctttccacgt tatcccatcg actcccgttg
5101 ctgcggtctt cttcagttcg aagtacgacc agcgcaggca gtctacggag atgtgatgca
5161 gaagcgctgt cagcttttct cctcgatttc tggttactgc ttcacgtatg cgaccctgcg
5221 cttggaacac ggcttcccga ccctgtgtcc gggccatgct ttgcagttcc gcatttctct
5281 tgttcccacc ccttccctcc acgaactccg ctacgggctt acccgctttg ttcgctcgct
5341 tctccggtag tatgggcagg tctgacttct ccgtaccgta catcat
cggc ttcaactatt
5401 acgttttccc gatgcgaccc gccaacgacg atgggtggta tcggagatct cccggttccc
5461 gtgtaagagg cttacataca tgccaggttc tctgacgtcg tggggacagt cttgtgctcg
5521 cgttatcgca cttcgtctgt attgccttct gcttttagga gggcatgggc tccccaaatt
5581 tgctcatttt cgccgctcaa tggctggcct atatgcaccc ctgtcaatgc ttcgcgcata
5641 tcctcgcggt tatgcgcgca tgactcgggg ccagtgcgga tcgctaatcc tttcactgca
5701 gggacttgca ccctttacct cttaccggtc tcccggcgca c

5' end

Intron on sense strand

5' end

                  gtgcgccg ggagaccggt aagaggtaaa gggtgcaagt ccctgcagtg

3421 aaaggattag cgatccgcac tggccccgag tcatgcgcgc ataaccgcga ggatatgcgc

3481 gaagcattga caggggtgca tataggccag ccattgagcg gcgaaaatga gcaaatttgg

3541 ggagcccatg ccctcctaaa agcagaaggc aatacagacg aagtgcgata acgcgagcac

3601 aagactgtcc ccacgacgtc agagaacctg gcatgtatgt aagcctctta cacgggaacc

3661 gggagatctc cgataccacc catcgtcgtt ggcgggtcgc atcgggaaaa cgtaatagtt

3721 gaagccgatg atgtacggta cggagaagtc agacctgccc atactaccgg agaagcgagc

3781 gaacaaagcg ggtaagcccg tagcggagtt cgtggaggga aggggtggga acaagagaaa

3841 tgcggaactg caaagcatgg cccggacaca gggtcgggaa gccgtgttcc aagcgcaggg

3901 tcgcatacgt gaagcagtaa ccagaaatcg aggagaaaag ctgacagcgc ttctgcatca

3961 catctccgta gactgcctgc gctggtcgta cttcgaactg aagaagaccg cagcaacggg

4021 agtcgatggg ataacgtgga aagattacgg ggaaggactg gaagagaacc ttgcagacct

4081 gcaccggcga atccacacgg gagcgtaccg ggcacagcca tcgcgccgga agtacatacc

4141 gaaggcaaat ggccaacaac gaccgctcgg catcgccgca ttggaagaca aaattgtaca

4201 gcgggcagtg gtggcgattc tcacgccaat ctacgaagcg gagtttctgg ggttcagcta

4261 tggattccgt ccgggacgca gtcagcatga tgcgctggat gcactcgcat acgggatcaa

4321 ggtgaagaaa atcggctggg tattggacgc cgatatttcc cggttttttg atacgatcag

4381 ccatgagtgg atgatacgct ttttggaaca ccgtatcggt gacaagcgga ttgtccgact

4441 gattatcaag tggctgaaag ctggggtact ggaagatagc gtacggatcg aggcagaaga

4501 aggaacaccg cagggagccg tgatatcacc gctccttgcg aacatctacc ttcactatgc

4561 ttatgacctg tgggctaagc agtggcggga aaagcactgc aaaggtgata tgatcgtcgt

4621 gcgtttcgct gacgatagtg tcgcaggctt ccagaacaag gaagatggcg aaagattcct

4681 agctgatctg aaagagcgac tggcaaagtt cgcgctaacg ttgcatccag agaaaacccg

4741 actgattgag ttcggcaggt atgctgccaa gaaccgacaa agacggggac aagggcggcc

4801 ggaaaccttt gactttctcg gatttaccca catctgtgga gagaaagtag tggggaaagg

4861 ctttcagctc ttgcgcaaga cgaagcgcgg atcagtgaga accaagctcg gagaggtaaa

4921 gaaagagctg agacggcgaa tgcatgtctc agtatcggaa cagggaaagt ggctgaacag

4981 cgtgttgaga ggtcactatg cctatttcgc agtgcccaca aacacgaggg cgttatcggc

5041 attccgctac catgttgcca gacgttggat gaagagcctc aggcgccgaa gccagcggca

5101 tgtgatgaca tgggagcgaa tgatgatata tattgatcag tacctcccaa acccaaagat

5161 ccttcaccca tggccggaac aacggttttg cgtcaaacac tcaagttagg agccggatgc

5221 gggaattccg cttgtccgga tctgtccggg gggtgagggg taactcttgt ccctaccggg

5281 at 

3' end

[top]


[Intron and flanking sequence]

 

3481 gattgagtta agggtgctgt caaatcaatg cctgaatcgt cgaatcgata ccataactga

3541 ggtgcgtcgt caggttagtg cttgggagaa agaacgcaat aacaaggaag ccgttatcaa

3601 ctggcgcttt acaacagaag atgcacgaat aaaattgaag aggctttatc cgtcagttta

3661 atcttgacat gacactagca gtctgtcgga attgatgttt cagcaaaaaa gcaatgacat

3721 cgcccataag ctgcaaatca ccttattttt taattccggt catttgtaac ggtacttttt

3781 ttaagagacc aaaacagggt caaccaggcg cttatcccgt ccatatttaa aatcccggta

3841 gggacaagag ttacccctca ccccccggac agatccggac aagcggaatt cccgcatccg

3901 gctcctaact tgagtgtttg acgcaaaacc gttgttccgg ccatgggtga aggatctttg

3961 ggtttgggag gtactgatca atatatatca tcattcgctc ccatgtcatc acatgccgct

4021 ggcttcggcg cctgaggctc ttcatccaac gtctggcaac atggtagcgg aatgccgata

4081 acgccctcgt gtttgtgggc actgcgaaat aggcatagtg acctctcaac acgctgttca

4141 gccactttcc ctgttccgat actgagacat gcattcgccg tctcagctct ttctttacct

4201 ctccgagctt ggttctcact gatccgcgct tcgtcttgcg caagagctga aagcctttcc

4261 ccactacttt ctctccacag atgtgggtaa atccgagaaa gtcaaaggtt tccggccgcc

4321 cttgtccccg tctttgtcgg ttcttggcag catacctgcc gaactcaatc agtcgggttt

4381 tctctggatg caacgttagc gcgaactttg ccagtcgctc tttcagatca gctaggaatc

4441 tttcgccatc ttccttgttc tggaagcctg cgacactatc gtcagcgaaa cgcacgacga

4501 tcatatcacc tttgcagtgc ttttcccgcc actgcttagc ccacaggtca taagcatagt

4561 gaaggtagat gttcgcaagg agcggtgata tcacggctcc ctgcggtgtt ccttcttctg

4621 cctcgatccg tacgctatct tccagtaccc cagctttcag ccacttgata atcagtcgga

4681 caatccgctt gtcaccgata cggtgttcca aaaagcgtat catccactca tggctgatcg

4741 tatcaaaaaa ccgggaaata tcggcgtcca atacccagcc gattttcttc accttgatcc

4801 cgtatgcgag tgcatccagc gcatcatgct gactgcgtcc cggacggaat ccatagctga

4861 accccagaaa ctccgcttcg tagattggcg tgagaatcgc caccactgcc cgctgtacaa

4921 ttttgtcttc caatgcggcg atgccgagcg gtcgttgttg gccatttgcc ttcggtatgt

4981 acttccggcg cgatggctgt gcccggtacg ctcccgtgtg gattcgccgg tgcaggtctg

5041 caaggttctc ttccagtcct tccccgtaat ctttccacgt tatcccatcg actcccgttg

5101 ctgcggtctt cttcagttcg aagtacgacc agcgcaggca gtctacggag atgtgatgca

5161 gaagcgctgt cagcttttct cctcgatttc tggttactgc ttcacgtatg cgaccctgcg

5221 cttggaacac ggcttcccga ccctgtgtcc gggccatgct ttgcagttcc gcatttctct

5281 tgttcccacc ccttccctcc acgaactccg ctacgggctt acccgctttg ttcgctcgct

5341 tctccggtag tatgggcagg tctgacttct ccgtaccgta catcatcggc ttcaactatt

5401 acgttttccc gatgcgaccc gccaacgacg atgggtggta tcggagatct cccggttccc

5461 gtgtaagagg cttacataca tgccaggttc tctgacgtcg tggggacagt cttgtgctcg

5521 cgttatcgca cttcgtctgt attgccttct gcttttagga gggcatgggc tccccaaatt

5581 tgctcatttt cgccgctcaa tggctggcct atatgcaccc ctgtcaatgc ttcgcgcata

5641 tcctcgcggt tatgcgcgca tgactcgggg ccagtgcgga tcgctaatcc tttcactgca

5701 gggacttgca ccctttacct cttaccggtc tcccggcgca ccctaaattg agcgatagcc

5761 atctggcagg cgacctcagc cgataaaaaa ctgccgacag ctcctgctga gttaccgttt

5821 taaccttttg accatgtctt tgcacgcaat accgctcacc taacgccttc cgtcagacaa

5881 tagtacacca taaccactga cagcgcaatc gattggacgc actgatcccg tcttttttgt

5941 tcagttcatg ctcaatcgaa ctggctaaca gagttgtggc aagcgattcc ggcatatctc

6001 aaagagctta ccgaattgga gccgctcata agctgctctc gggactttta tctgcacctt

[top]


[ORF sequence]

 

MMYGTEKSDLPILPEKRANKAGKPVAEFVEGRGGNKRNAELQSMARTQGREAVFQAQG

RIREAVTRNRGEKLTALLHHISVDCLRWSYFELKKTAATGVDGITWKDYGEGLEENLA

DLHRRIHTGAYRAQPSRRKYIPKANGQQRPLGIAALEDKIVQRAVVAILTPIYEAEFL

GFSYGFRPGRSQHDALDALAYGIKVKKIGWVLDADISRFFDTISHEWMIRFLEHRIGD

KRIVRLIIKWLKAGVLEDSVRIEAEEGTPQGAVISPLLANIYLHYAYDLWAKQWREKH

CKGDMIVVRFADDSVAGFQNKEDGERFLADLKERLAKFALTLHPEKTRLIEFGRYAAK

NRQRRGQGRPETFDFLGFTHICGEKVVGKGFQLLRKTKRGSVRTKLGEVKKELRRRMH

VSVSEQGKWLNSVLRGHYAYFAVPTNTRALSAFRYHVARRWMKSLRRRSQRHVMTWER

MMIYIDQYLPNPKILHPWPEQRFCVKHSS  

[top]


[Secondary structure]

 

                                                                

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |