
Information for Ge.sp.I1 intron (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence]

 

Sequence from GenBank entry. The intron boundaries are identified in red and the ORF in blue, with start and stop codons underlined.

 

5' end

   1 gtacgcccgg catgggcgtg aacttatggg gtgcaagtcc cctgtacggt gaatccagca

  61 ctgtcgagac gacagcgtaa ccaaagtact agccgaaggc aagggcgtct ccgcgaggag

 121 gggtctggag gaagccggag cgcaaagcta tgagccgacg aacagaaaca gcatataagg

 181 ccgtcatccg ggcaagtcag cacaacatga cgaagtccca tggattcagg agaaatggta

 241 aatgctgcgg ttgtgtagcg acagttcacg ttcttatccg gggagacctg tctcacatgc

 301 cattgaagtc taatggcgcg gtatctggta acagctgccg tgatggggca ggagtcagca

 361 gaagccatag taggcgatag gacgttgccg caagcgatcc ggcaactgga aacaagccgg

 421 ggaagaaccc cggaagactt accccgccga agggctaaac gatgaagagg aggtaagccc

 481 caatgagctt tcacgacaca cagaacctgt ccgggggagt gccgatgaag cgcgtcgttg

 541 atccgcaaac tccggagaac cacctactgg agcggattct gtccccagag aatatggaac

 601 tggcgtggaa acgggtacgt gctaacaaag gcgcacccgg tgtcgatggg gtgaacatcg

 661 acgactttcc cgacattacc agacctctct ggggcgacat ccgcgcatcg cttgcgacag

 721 gtagttatct tccaaagccg gttcttcggg tggagatacc gaaaccaacg ggcggcaacc

 781 gcccgttggg tatccccact gtgctggacc ggctgatcca acagtccatt gcccaggtgc

 841 tcacgccgat cttcgatccc ggattctcgg aatcaagctt cggtttccgc cccggccgtt

 901 cggcgcatga tgccgtgcgg cagctgcggg agtatctccg gcagggctac cgcattgccg

 961 tggacatcga ccttgccaag tttttcgaca cggtcaatca cgatctcctc atgacgtttg

1021 tggggaggaa ggtccgcgac aagcgcgtgc tcgccctgat cggcagatac ttgagagccg

1081 gggtggaggt tgacggaaga ctggaaaaga cccgcatggg cgttccccaa ggcggtccgc

1141 tttctccgct tcttgccaac atcctcctcg atcacctgga caaagagctt gagaaacgtg

1201 gccacaagtt cgtccgttac gccgatgact tcgttattct cgtcaaaagc gagcgggctg

1261 gcgaacgggt catgggaagc gtgaggaagc atctcacgac aaagctcaag ctcacggtca

1321 acgaagacaa aagcaaggtt gccaaaagcg accaaatcag cttccttggc ttcgtcttca

1381 agggcaccaa aatcctctgg tctgacaagg cgtacaagga gtttcgccgc cgggtcagga

1441 agtacaccgg aagaagctgg ttcgtctcca tggagtaccg gctgaacaag ctgtccacct

1501 acatccgtgg ctggatggga tacttcggga tttccgaggc ctaccacgac atcccggaga

1561 tagacggctg gatcaggcgc agggtgcggc tctgttactg gaaacagtgg cggtggtgcc

1621 gcaccaagat tcggaatctg ctgaaactgg gagttcaact aggaacttcc atcagagcag

1681 gactgaatcg tggcggtccg tgggccatgg ctcgccgact ggccgctcag cacggtatga

1741 ccaatcaatg gctgaaagat caaggtctca tatctgtcaa agaactgtgg gtgaagactc

1801 attacccggc tacggctcgg taacttcagc gaaccgcccg gtgcggaccc gcatgccggg

1861 tggtgtgggg agggggagag aaaagctccc ccttacccga t

3' end
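
The listing above uses the GenBank-style layout: a 1-based position index followed by up to six blocks of ten bases. As a minimal sketch, assuming the listing has been copied as plain text, the blocks can be joined into one contiguous string. The function name `flatten_numbered_sequence` is hypothetical and is not part of any database tool.

```python
def flatten_numbered_sequence(text: str) -> str:
    """Join GenBank-style numbered sequence lines into one lowercase string."""
    allowed = set("acgtn")
    blocks = []
    for line in text.splitlines():
        for token in line.split():
            # Skip position indices such as "61".
            if token.isdigit():
                continue
            token = token.lower()
            # Keep only tokens made up entirely of nucleotide letters;
            # labels such as "5'" or "end" are dropped.
            if set(token) <= allowed:
                blocks.append(token)
    return "".join(blocks)


# First two lines of the intron listing, copied verbatim.
example = """
   1 gtacgcccgg catgggcgtg aacttatggg gtgcaagtcc cctgtacggt gaatccagca

  61 ctgtcgagac gacagcgtaa ccaaagtact agccgaaggc aagggcgtct ccgcgaggag
"""
sequence = flatten_numbered_sequence(example)
print(len(sequence))  # 120
```

Filtering token by token, rather than stripping digits from the whole text, keeps labels such as `5' end` from leaking stray letters into the sequence.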



[Intron and flanking sequence]

 

   1 cagttcgcct cgcggcacca caaaggctca ggtctttttt gtattccgtc aggtttaccc

  61 atctttttta tgacagatgc ttccgtatct cctatccgta cagcctcaag ccatttcggg

 121 tagtgagaaa tcttatacca tctcgtagca ccgtaactcc cacctgacac cagtagcaat

 181 ccgagcatga tgaaaattac ttttcttctt tcgttttgtt tgggttctgt cttcatattc

 241 tttcgtttgg ccaacggctc aaggctcagc ggccgcttgc ggtccgctgc agccgctggt

 301 gtacgcccgg catgggcgtg aacttatggg gtgcaagtcc cctgtacggt gaatccagca

 361 ctgtcgagac gacagcgtaa ccaaagtact agccgaaggc aagggcgtct ccgcgaggag

 421 gggtctggag gaagccggag cgcaaagcta tgagccgacg aacagaaaca gcatataagg

 481 ccgtcatccg ggcaagtcag cacaacatga cgaagtccca tggattcagg agaaatggta

 541 aatgctgcgg ttgtgtagcg acagttcacg ttcttatccg gggagacctg tctcacatgc

 601 cattgaagtc taatggcgcg gtatctggta acagctgccg tgatggggca ggagtcagca

 661 gaagccatag taggcgatag gacgttgccg caagcgatcc ggcaactgga aacaagccgg

 721 ggaagaaccc cggaagactt accccgccga agggctaaac gatgaagagg aggtaagccc

 781 caatgagctt tcacgacaca cagaacctgt ccgggggagt gccgatgaag cgcgtcgttg

 841 atccgcaaac tccggagaac cacctactgg agcggattct gtccccagag aatatggaac

 901 tggcgtggaa acgggtacgt gctaacaaag gcgcacccgg tgtcgatggg gtgaacatcg

 961 acgactttcc cgacattacc agacctctct ggggcgacat ccgcgcatcg cttgcgacag

1021 gtagttatct tccaaagccg gttcttcggg tggagatacc gaaaccaacg ggcggcaacc

1081 gcccgttggg tatccccact gtgctggacc ggctgatcca acagtccatt gcccaggtgc

1141 tcacgccgat cttcgatccc ggattctcgg aatcaagctt cggtttccgc cccggccgtt

1201 cggcgcatga tgccgtgcgg cagctgcggg agtatctccg gcagggctac cgcattgccg

1261 tggacatcga ccttgccaag tttttcgaca cggtcaatca cgatctcctc atgacgtttg

1321 tggggaggaa ggtccgcgac aagcgcgtgc tcgccctgat cggcagatac ttgagagccg

1381 gggtggaggt tgacggaaga ctggaaaaga cccgcatggg cgttccccaa ggcggtccgc

1441 tttctccgct tcttgccaac atcctcctcg atcacctgga caaagagctt gagaaacgtg

1501 gccacaagtt cgtccgttac gccgatgact tcgttattct cgtcaaaagc gagcgggctg

1561 gcgaacgggt catgggaagc gtgaggaagc atctcacgac aaagctcaag ctcacggtca

1621 acgaagacaa aagcaaggtt gccaaaagcg accaaatcag cttccttggc ttcgtcttca

1681 agggcaccaa aatcctctgg tctgacaagg cgtacaagga gtttcgccgc cgggtcagga

1741 agtacaccgg aagaagctgg ttcgtctcca tggagtaccg gctgaacaag ctgtccacct

1801 acatccgtgg ctggatggga tacttcggga tttccgaggc ctaccacgac atcccggaga

1861 tagacggctg gatcaggcgc agggtgcggc tctgttactg gaaacagtgg cggtggtgcc

1921 gcaccaagat tcggaatctg ctgaaactgg gagttcaact aggaacttcc atcagagcag

1981 gactgaatcg tggcggtccg tgggccatgg ctcgccgact ggccgctcag cacggtatga

2041 ccaatcaatg gctgaaagat caaggtctca tatctgtcaa agaactgtgg gtgaagactc

2101 attacccggc tacggctcgg taacttcagc gaaccgcccg gtgcggaccc gcatgccggg

2161 tggtgtgggg agggggagag aaaagctccc ccttacccga ttaggcacag attccgcctt

2221 tcagaatgtt ttgtgtacat ggctttatac cgcttagtag agctttacca gccttattcc

2281 ttataaggct actgatttgt ttctgaaaac tgtcaattgc ttgtgcttcg aattgcacct

2341 ccattgttac cttgtgcttt tcgtttttgc gctgattttg ggcaatattg gcttctaagt

2401 ccacaagtgc agcggtatga ccagaatttg agaaacaata gaagcttatt gaaacaaaag

2461 aatagccgtc ctttgcacct gcattaaact cgacacgttc a
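
Comparing the two listings, the intron appears to begin at position 301 of the flanking listing, since the line numbered 301 here matches line 1 of the intron listing. A minimal sketch of how such an offset could be found programmatically uses only Python's built-in `str.find`. The helper name `locate` and the short excerpts are illustrative.

```python
def locate(query: str, target: str):
    """Return the 1-based (start, end) of the first exact match, or None."""
    index = target.find(query)
    if index == -1:
        return None
    return index + 1, index + len(query)


# Excerpts copied from the listing: the last 20 bases of the line numbered 241
# followed by the first 20 bases of the line numbered 301.
flank_excerpt = "ggtccgctgcagccgctggt" + "gtacgcccggcatgggcgtg"
intron_start = "gtacgcccggcatgggcgtg"
print(locate(intron_start, flank_excerpt))  # (21, 40)
```

Applied to the full flattened listings, the same call would report where the intron sits within the flanking sequence. Exact matching is enough here because both listings come from the same entry.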



[ORF sequence]

 

MSFHDTQNLSGGVPMKRVVDPQTPENHLLERILSPENMELAWKRVRANKGAPGVDGVN

IDDFPDITRPLWGDIRASLATGSYLPKPVLRVEIPKPTGGNRPLGIPTVLDRLIQQSI

AQVLTPIFDPGFSESSFGFRPGRSAHDAVRQLREYLRQGYRIAVDIDLAKFFDTVNHD

LLMTFVGRKVRDKRVLALIGRYLRAGVEVDGRLEKTRMGVPQGGPLSPLLANILLDHL

DKELEKRGHKFVRYADDFVILVKSERAGERVMGSVRKHLTTKLKLTVNEDKSKVAKSD

QISFLGFVFKGTKILWSDKAYKEFRRRVRKYTGRSWFVSMEYRLNKLSTYIRGWMGYF

GISEAYHDIPEIDGWIRRRVRLCYWKQWRWCRTKIRNLLKLGVQLGTSIRAGLNRGGP

WAMARRLAAQHGMTNQWLKDQGLISVKELWVKTHYPATAR
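
The protein above can be checked against the nucleotide listing by translating the ORF. The sketch below assumes the standard genetic code assignments (the bacterial table shares them and differs only in its alternative start codons). The function name `translate` and the excerpts are illustrative. The first excerpt is taken from the intron line numbered 481, beginning at its `atg`.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the ORF, from the intron line numbered 481.
orf_start = "atgagctttcacgacacacagaacctgtccgggggagtgccgatgaagcgcgtcgtt"
print(translate(orf_start))  # MSFHDTQNLSGGVPMKRVV

# End of the ORF, from the intron line numbered 1801, including the stop codon.
orf_end = "ccggctacggctcggtaacttcagc"
print(translate(orf_end))  # PATAR
```

Both results agree with the first and last residues of the ORF sequence listed above.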



[Secondary structure]

 

                                                    


