[Back to introns by organism]  [Back to home page]

Information for So.us.I3-1 intron  (Format of information for each intron

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

Note: Redundant intron copies found in

CP000473 Solibacter usitatus Ellin6076 (4192502-4194442)

[Intron sequence]

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

Intron on antisense strand

3' end

                                                        atc ggcatagggg

3661 cggcggtcac ccgccgaccc ctgccacacc accgtgcgta cgggtccgta cacggcggtt

3721 cgagatggtt acgctaacag tcctcgaaca atgacggaag tcccagcgat ttgaagtaag

3781 catttgaaag cccaacagcc agggccttgg ccttcgcgag gtaccaagga ccgcgaccgc

3841 tgccagcggt attagtcgcc agccgctctc gcacccccag ttccaacaga gctgcccggc

3901 gacggcgcgg tgttttccac tgccgccaca tggccgcccg gagtcgcaac cggacccagc

3961 gcgttaaacc gatcagcacc tcgggcgttt cgcagaatcc gaaatagctg cgccaacccc

4021 gcatatacgg agccagttcc tccatcgtcg tcttcatgct aacgcccttt gcccgtcccg

4081 tgatctcccg gattcgtcgc ttaaaccgat ccaaggcctt cggtgcaatc acgcgcttgg

4141 cctccggacc agccgtaaag ctgaacccga gaaacttccg ttcctgcggt cgtgccaccg

4201 cactcttcgt ctcatttacc ttgagcttga gcttttgcgt gatgaattgc gtaatgctct

4261 ccatcactcg ttgccccgcc cgctcgctgc gaacgtagat attacagtcg tccgcatatc

4321 gaacgaagcg atgaccccgg cgctccaact cccggtcgaa ttcgtcgagc acgaggttac

4381 tgagcagtgg cgaaagagga cctccttgcg gagttccttc cacgctcggg ctgaccaacc

4441 cgttctccat caccccggca ttcaagaatg cccggatgag tttcagcagc cgcttgtccg

4501 cgatccgttt ggcgatctga cccatcaatt tgtcgtgatt gactcgatcg aagaatttct

4561 ccaaatcaag atcaacgcac cagccgtgac cttcggcaat atactgctgc gcctgtgcca

4621 ccgcttgctg agccgaccgt cccggccgga acccgtagct gtaatcagaa aacgtccggt

4681 cccaccgcct ctgcagaacc tgcatcaccg cctgctggat aaatcgatcc aacaccgttg

4741 ggataccaag ctttcgcacc cctccgtccg gtttggcaat ctctacccgc ctcaccggtt

4801 tcggctcgta agtcccactc aacagttgtc cccggatggc tggccagtgc tgcttcaggt

4861 aatccttgat gccgatgacg gtcatcccgt caacgcccgg actacctttg ttggccttca

4921 cccgctgcaa tgctgccttc aggttttctc gctcacatac ttcctccatc aatcgattcg

4981 tgctggctgg gttttcggtc ccattcatcg ccccggccga ttcagtctct tcccttcccg

5041 cctctcgggc ttcacccgtt agcacagaag agaagtccag ttgcatctgg atgttctgct

5101 gcttgtcgtc cttgagactc atggcctact ggccgctcct tctcgttcgg gccttcagcc

5161 atcgttcccg gcttggccta tccgttgctc cgcctttcgg catgcggagt gcctcactag

5221 ccttgccgac gtcatgacct aatatgccct ctgctgactt ctgccccgcg gtcaagccgc

5281 ctttcgacag cctcagtcgc cgaagcgaca cggagcagat ctcctggggt aagttcagcc

5341 gtcttctgtg cacagccgcc gaatctacgc ttcgcaccct tgatggatat ggacttcgcg

5401 gtaagttgcc cgctcgtccg gcgctaacgc cttgtatcag gtttttgtcc atcgactcgc

5461 acatttgcta tacgcttctt tcagacccca cctcgcggcg gtagcccttg cgttctcgct

5521 agcccttcac ctccatcagg ttgggcaggg gacttgcacc cccaaactgc tgaacatgcc

5581 cagcacac

5' end

Intron on sense strand

5' end

        gtgtgct gggcatgttc agcagtttgg gggtgcaagt cccctgccca acctgatgga

3421 ggtgaagggc tagcgagaac gcaagggcta ccgccgcgag gtggggtctg aaagaagcgt

3481 atagcaaatg tgcgagtcga tggacaaaaa cctgatacaa ggcgttagcg ccggacgagc

3541 gggcaactta ccgcgaagtc catatccatc aagggtgcga agcgtagatt cggcggctgt

3601 gcacagaaga cggctgaact taccccagga gatctgctcc gtgtcgcttc ggcgactgag

3661 gctgtcgaaa ggcggcttga ccgcggggca gaagtcagca gagggcatat taggtcatga

3721 cgtcggcaag gctagtgagg cactccgcat gccgaaaggc ggagcaacgg ataggccaag

3781 ccgggaacga tggctgaagg cccgaacgag aaggagcggc cagtaggcca tgagtctcaa

3841 ggacgacaag cagcagaaca tccagatgca actggacttc tcttctgtgc taacgggtga

3901 agcccgagag gcgggaaggg aagagactga atcggccggg gcgatgaatg ggaccgaaaa

3961 cccagccagc acgaatcgat tgatggagga agtatgtgag cgagaaaacc tgaaggcagc

4021 attgcagcgg gtgaaggcca acaaaggtag tccgggcgtt gacgggatga ccgtcatcgg

4081 catcaaggat tacctgaagc agcactggcc agccatccgg ggacaactgt tgagtgggac

4141 ttacgagccg aaaccggtga ggcgggtaga gattgccaaa ccggacggag gggtgcgaaa

4201 gcttggtatc ccaacggtgt tggatcgatt tatccagcag gcggtgatgc aggttctgca

4261 gaggcggtgg gaccggacgt tttctgatta cagctacggg ttccggccgg gacggtcggc

4321 tcagcaagcg gtggcacagg cgcagcagta tattgccgaa ggtcacggct ggtgcgttga

4381 tcttgatttg gagaaattct tcgatcgagt caatcacgac aaattgatgg gtcagatcgc

4441 caaacggatc gcggacaagc ggctgctgaa actcatccgg gcattcttga atgccggggt

4501 gatggagaac gggttggtca gcccgagcgt ggaaggaact ccgcaaggag gtcctctttc

4561 gccactgctc agtaacctcg tgctcgacga attcgaccgg gagttggagc gccggggtca

4621 tcgcttcgtt cgatatgcgg acgactgtaa tatctacgtt cgcagcgagc gggcggggca

4681 acgagtgatg gagagcatta cgcaattcat cacgcaaaag ctcaagctca aggtaaatga

4741 gacgaagagt gcggtggcac gaccgcagga acggaagttt ctcgggttca gctttacggc

4801 tggtccggag gccaagcgcg tgattgcacc gaaggccttg gatcggttta agcgacgaat

4861 ccgggagatc acgggacggg caaagggcgt tagcatgaag acgacgatgg aggaactggc

4921 tccgtatatg cggggttggc gcagctattt cggattctgc gaaacgcccg aggtgctgat

4981 cggtttaacg cgctgggtcc ggttgcgact ccgggcggcc atgtggcggc agtggaaaac

5041 accgcgccgt cgccgggcag ctctgttgga actgggggtg cgagagcggc tggcgactaa

5101 taccgctggc agcggtcgcg gtccttggta cctcgcgaag gccaaggccc tggctgttgg

5161 gctttcaaat gcttacttca aatcgctggg acttccgtca ttgttcgagg actgttagcg

5221 taaccatctc gaaccgccgt gtacggaccc gtacgcacgg tggtgtggca ggggtcggcg

5281 ggtgaccgcc gcccctatgc cgat

3' end

[top]


[Intron and flanking sequence]

 

3301 ggaactcgga ccaccaccga tgctgcaggc tgctgtccgc ccagcagcgc taccccgacg

3361 tgggccaacc tgagaacccg gagcacactc gcaaccatct tcaagcatga tcctacactt

3421 tttgctggat ttcggtagct cgtgccgccc cactgtgagc tgccggctgg cggtgtcgcc

3481 gatcgttccg gcccggacct cgaatcctgc gagaagagac atcagcggca attcgtagct

3541 gccgataatg gatattaacg ctcgggcttg cgccttcgcg gcgccccatc ttccgagcgc

3601 tggcgatgcg accaacgcaa acgggatgct tcccagaacg ccgtttgatc ggcatagggg

3661 cggcggtcac ccgccgaccc ctgccacacc accgtgcgta cgggtccgta cacggcggtt

3721 cgagatggtt acgctaacag tcctcgaaca atgacggaag tcccagcgat ttgaagtaag

3781 catttgaaag cccaacagcc agggccttgg ccttcgcgag gtaccaagga ccgcgaccgc

3841 tgccagcggt attagtcgcc agccgctctc gcacccccag ttccaacaga gctgcccggc

3901 gacggcgcgg tgttttccac tgccgccaca tggccgcccg gagtcgcaac cggacccagc

3961 gcgttaaacc gatcagcacc tcgggcgttt cgcagaatcc gaaatagctg cgccaacccc

4021 gcatatacgg agccagttcc tccatcgtcg tcttcatgct aacgcccttt gcccgtcccg

4081 tgatctcccg gattcgtcgc ttaaaccgat ccaaggcctt cggtgcaatc acgcgcttgg

4141 cctccggacc agccgtaaag ctgaacccga gaaacttccg ttcctgcggt cgtgccaccg

4201 cactcttcgt ctcatttacc ttgagcttga gcttttgcgt gatgaattgc gtaatgctct

4261 ccatcactcg ttgccccgcc cgctcgctgc gaacgtagat attacagtcg tccgcatatc

4321 gaacgaagcg atgaccccgg cgctccaact cccggtcgaa ttcgtcgagc acgaggttac

4381 tgagcagtgg cgaaagagga cctccttgcg gagttccttc cacgctcggg ctgaccaacc

4441 cgttctccat caccccggca ttcaagaatg cccggatgag tttcagcagc cgcttgtccg

4501 cgatccgttt ggcgatctga cccatcaatt tgtcgtgatt gactcgatcg aagaatttct

4561 ccaaatcaag atcaacgcac cagccgtgac cttcggcaat atactgctgc gcctgtgcca

4621 ccgcttgctg agccgaccgt cccggccgga acccgtagct gtaatcagaa aacgtccggt

4681 cccaccgcct ctgcagaacc tgcatcaccg cctgctggat aaatcgatcc aacaccgttg

4741 ggataccaag ctttcgcacc cctccgtccg gtttggcaat ctctacccgc ctcaccggtt

4801 tcggctcgta agtcccactc aacagttgtc cccggatggc tggccagtgc tgcttcaggt

4861 aatccttgat gccgatgacg gtcatcccgt caacgcccgg actacctttg ttggccttca

4921 cccgctgcaa tgctgccttc aggttttctc gctcacatac ttcctccatc aatcgattcg

4981 tgctggctgg gttttcggtc ccattcatcg ccccggccga ttcagtctct tcccttcccg

5041 cctctcgggc ttcacccgtt agcacagaag agaagtccag ttgcatctgg atgttctgct

5101 gcttgtcgtc cttgagactc atggcctact ggccgctcct tctcgttcgg gccttcagcc

5161 atcgttcccg gcttggccta tccgttgctc cgcctttcgg catgcggagt gcctcactag

5221 ccttgccgac gtcatgacct aatatgccct ctgctgactt ctgccccgcg gtcaagccgc

5281 ctttcgacag cctcagtcgc cgaagcgaca cggagcagat ctcctggggt aagttcagcc

5341 gtcttctgtg cacagccgcc gaatctacgc ttcgcaccct tgatggatat ggacttcgcg

5401 gtaagttgcc cgctcgtccg gcgctaacgc cttgtatcag gtttttgtcc atcgactcgc

5461 acatttgcta tacgcttctt tcagacccca cctcgcggcg gtagcccttg cgttctcgct

5521 agcccttcac ctccatcagg ttgggcaggg gacttgcacc cccaaactgc tgaacatgcc

5581 cagcacacaa ctaagccgct cgcgcggcgg acgctacgcg tcaggccggg gctagcgccc

5641 attctttggt gggagtattt tgtgggtcac aaggaggtcc ctgtgaccca acttcgcaaa

5701 atggtgctcg aggaactcga gcgccgtaat tactctcaag ctaccgcacg tgcctacgtc

5761 ggcgccatcc agcggttcgc cgaacatttc catcgctcgc ccgatcaact tggccccgag

5821 cacatccgcg aatatcaact gcacctcgtg caggaccgca aactgcatcc caggaccgtc

5881 atgatccaga tgtccgcgct ccgcttcttc ttccgcaagg tcctgaagcg gcgctttgat

[top]


[ORF sequence]

 

MSLKDDKQQNIQMQLDFSSVLTGEAREAGREETESAGAMNGTENPASTNRLMEEVCER

ENLKAALQRVKANKGSPGVDGMTVIGIKDYLKQHWPAIRGQLLSGTYEPKPVRRVEIA

KPDGGVRKLGIPTVLDRFIQQAVMQVLQRRWDRTFSDYSYGFRPGRSAQQAVAQAQQY

IAEGHGWCVDLDLEKFFDRVNHDKLMGQIAKRIADKRLLKLIRAFLNAGVMENGLVSP

SVEGTPQGGPLSPLLSNLVLDEFDRELERRGHRFVRYADDCNIYVRSERAGQRVMESI

TQFITQKLKLKVNETKSAVARPQERKFLGFSFTAGPEAKRVIAPKALDRFKRRIREIT

GRAKGVSMKTTMEELAPYMRGWRSYFGFCETPEVLIGLTRWVRLRLRAAMWRQWKTPR

RRRAALLELGVRERLATNTAGSGRGPWYLAKAKALAVGLSNAYFKSLGLPSLFEDC

[top]


[Secondary structure]

 

                                                              

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |