[Back to introns by organism]  [Back to home page]

Information for Ge.ur.I2 intron  (Format of information for each intron

[Intron sequence]

[Intron and flanking sequences]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence]

 

Sequence from Genbank entry.  Intron is identified in red.  The ORF
is identified in blue with start and stop codons underlined.  

 

5' end

   1 gtgcgccagg catggcgcgt ggcgcgtaag cgccatccaa ttgggtatgg tgcctgattg 

  61 gtgacttgga ggtgaaagtc ctcttcggac cctgatggtg ggaaccgata gctgaacagc 

 121 aagggtgtcc ggggcgaccc ggaatctgaa ggaagctgta ggcaaagtcc tggcccgacg  

 181 aacaggaatc gcatatgagg cagccacatc tgggcgagaa ggcaaacaga ttcaaagccc 

 241 gataaccatc cagaaagatg tggaagtaga tgcggcgggt atatgggatg aaggtcacgc 

 301 gcattaccct gggaggtctg tcaacctgcc ttgtgctacc ggcatcgaga ggtgccggga 

 361 tgggttggca gaagtcagca gaggccatac gagtcggaat aaccgaccga caaagggccg 

 421 aacttgaagt aagcggacag gagccttgag tttcgatgac gacaggagca gcagaaggcc 

 481 gggttgggat acccggcgcc agcccggagg gtagcggttg gaaaccgcga gagaaaggga

 541 gaggtgcggc aaacgtcacg ggaaggaaag cagctcactg gccggaagcg gcaacgggac 

 601 tgatggagaa gatcgtcagt cgcggcaaca tgatggcggc atactcacgg gtaatgcgca 

 661 acaagggagc gcccggcgtc gataacatgc cggtaacggc cctgaagggc tacctacagg 

 721 aggaatggcc gcgcatcaga gaagaactgc tgacgggaac gtatcacccg caaccggtgc 

 781 ggaaagtaga aattcccaaa ccgggaggcg gcacacggat gctgggaatt cccaccgtcc 

 841 ttgaccgact cattcagcag gcggtgcatc aagttttgag cccgctgttc gaccctggtt 

 901 tctccataag ttcccacggg tttcgccctg gacggagcgc ccatcaagcg atcaaggcag 

 961 ctcggaagta tgtagagagc ggccttaggt gggtggttga cattgacctt gagaaatttt 

1021 tcgaccgagt ccaccacgac acccttatgt cgctggtgaa acgcaaggtt ggagaccgtc 

1081 tggtgctgtc ccttatcgac agctacctta aggcagggat acttgaagga ggagtgacgt 

1141 caccccggtt ggaaggcacg ccgcaaggcg ggcctctctc gccgctgctg tccaacatcc 

1201 tcctcgacga actggacaag aaactggaaa gaaggggcca taagttctgc cgctatgccg 

1261 acgacgcaaa tatctatgtt gcaacgagga gaagcggcga gcgtgtcatg gcctcaatta 

1321 ccggctacct gtccgagcgg ctcaagctca cggtcaacca gggcaaaagc gcggttgacc 

1381 gtccgtggaa aaggtcgttt ctgagttaca gcatgacccg gcaccgcaaa ccgcggctta 

1441 ccgtggccaa gaaagcggct gccaggctca aggccaacct caagacgatc tttaggcggg 

1501 gaaggggtca gaacatccaa accaccgttg aagagacaac cccgaaactt aggggttggt 

1561 taaactactt tcggtacgcg gaagtaaaag gcatcttcga ggaactggac ggatggctga 

1621 gacgtaaact gcgtcgcatt ctctggaagc agtggaagcg ccccaagacg cgagcaaaga 

1681 aactgatgcg gcggggatta tcggaggcaa ctgcatggcg gtcggccaca aacggccgcg 

1741 gcccctggtg gaatgcaggg gccgcgcata tgaacaaagc tgttccgaaa ttctacttcg 

1801 acaaactggg gctggtatca ctcatagacc agcttcaccg acttcaacgt acttcatga

1861 ccgccgtgta cggaaccgta cgcacggtgg tgtgggagga cggcgggagc aatcccgcct 

1921 cctacccgat

3' end

[top]


[Intron and flanking sequence]

 

   1 aattgtcacc accggcaagg gggggacgga cgatggcgaa ggccgggtcg aattcatcgc 

  61 ccgtttccgg gaaaaggggg tgaaaaaggc ccatcacgag ttggcggagt ttaagaaaga 

 121 cgacggcaaa tggttcttta ccgacggttc ggccgttccc cggaagcccg ctacaagcgt 

 181 caaggtcggc cgcaacgacc cctgcacatg cggcagcggt ttgaaataca agaaatgctg 

 241 cgggaaatga cccgcgacga tgcctatgaa aaaagcccgc gaaagcgggc ttttttgttt 

 301 gtgcgccagg catggcgcgt ggcgcgtaag cgccatccaa ttgggtatgg tgcctgattg 

 361 gtgacttgga ggtgaaagtc ctcttcggac cctgatggtg ggaaccgata gctgaacagc 

 421 aagggtgtcc ggggcgaccc ggaatctgaa ggaagctgta ggcaaagtcc tggcccgacg 

 481 aacaggaatc gcatatgagg cagccacatc tgggcgagaa ggcaaacaga ttcaaagccc 

 541 gataaccatc cagaaagatg tggaagtaga tgcggcgggt atatgggatg aaggtcacgc 

 601 gcattaccct gggaggtctg tcaacctgcc ttgtgctacc ggcatcgaga ggtgccggga 

 661 tgggttggca gaagtcagca gaggccatac gagtcggaat aaccgaccga caaagggccg 

 721 aacttgaagt aagcggacag gagccttgag tttcgatgac gacaggagca gcagaaggcc 

 781 gggttgggat acccggcgcc agcccggagg gtagcggttg gaaaccgcga gagaaaggga 

 841 gaggtgcggc aaacgtcacg ggaaggaaag cagctcactg gccggaagcg gcaacgggac 

 901 tgatggagaa gatcgtcagt cgcggcaaca tgatggcggc atactcacgg gtaatgcgca 

 961 acaagggagc gcccggcgtc gataacatgc cggtaacggc cctgaagggc tacctacagg 

1021 aggaatggcc gcgcatcaga gaagaactgc tgacgggaac gtatcacccg caaccggtgc 

1081 ggaaagtaga aattcccaaa ccgggaggcg gcacacggat gctgggaatt cccaccgtcc 

1141 ttgaccgact cattcagcag gcggtgcatc aagttttgag cccgctgttc gaccctggtt 

1201 tctccataag ttcccacggg tttcgccctg gacggagcgc ccatcaagcg atcaaggcag 

1261 ctcggaagta tgtagagagc ggccttaggt gggtggttga cattgacctt gagaaatttt 

1321 tcgaccgagt ccaccacgac acccttatgt cgctggtgaa acgcaaggtt ggagaccgtc 

1381 tggtgctgtc ccttatcgac agctacctta aggcagggat acttgaagga ggagtgacgt 

1441 caccccggtt ggaaggcacg ccgcaaggcg ggcctctctc gccgctgctg tccaacatcc 

1501 tcctcgacga actggacaag aaactggaaa gaaggggcca taagttctgc cgctatgccg 

1561 acgacgcaaa tatctatgtt gcaacgagga gaagcggcga gcgtgtcatg gcctcaatta 

1621 ccggctacct gtccgagcgg ctcaagctca cggtcaacca gggcaaaagc gcggttgacc 

1681 gtccgtggaa aaggtcgttt ctgagttaca gcatgacccg gcaccgcaaa ccgcggctta 

1741 ccgtggccaa gaaagcggct gccaggctca aggccaacct caagacgatc tttaggcggg 

1801 gaaggggtca gaacatccaa accaccgttg aagagacaac cccgaaactt aggggttggt 

1861 taaactactt tcggtacgcg gaagtaaaag gcatcttcga ggaactggac ggatggctga 

1921 gacgtaaact gcgtcgcatt ctctggaagc agtggaagcg ccccaagacg cgagcaaaga 

1981 aactgatgcg gcggggatta tcggaggcaa ctgcatggcg gtcggccaca aacggccgcg 

2041 gcccctggtg gaatgcaggg gccgcgcata tgaacaaagc tgttccgaaa ttctacttcg 

2101 acaaactggg gctggtatca ctcatagacc agcttcaccg acttcaacgt acttcatgaa 

2161 ccgccgtgta cggaaccgta cgcacggtgg tgtgggagga cggcgggagc aatcccgcct 

2221 cctacccgat ttaaccgcaa ggggaatact ggcaataccc ggcttgtgcg gtacaatttt 

2281 tctacggaga agtcaggcgg ttctcaagga ggacgaggat gaacatattt ctcagcggcg

2341 gcactggctt tgtcggcggc catcttagga gggccttgct ggagaaaggg caccggatca 

2401 ggctccttgc ccacaagagg ggggatggtt tcgaagatgg gattgaggtg gtggaagggg 

2461 acgtgacccg ccctgacacc tttgccgggc agctcgcggg gtgcgaagcc gcaatcaacc 

2521 tggtggggat

[top]


[ORF sequence]

 

MTTGAAEGRVGIPGASPEGSGWKPREKGRGAANVTGRKAAHWPEAATGLMEKIVSRGN

MMAAYSRVMRNKGAPGVDNMPVTALKGYLQEEWPRIREELLTGTYHPQPVRKVEIPKP

GGGTRMLGIPTVLDRLIQQAVHQVLSPLFDPGFSISSHGFRPGRSAHQAIKAARKYVE

SGLRWVVDIDLEKFFDRVHHDTLMSLVKRKVGDRLVLSLIDSYLKAGILEGGVTSPRL

EGTPQGGPLSPLLSNILLDELDKKLERRGHKFCRYADDANIYVATRRSGERVMASITG

YLSERLKLTVNQGKSAVDRPWKRSFLSYSMTRHRKPRLTVAKKAAARLKANLKTIFRR

GRGQNIQTTVEETTPKLRGWLNYFRYAEVKGIFEELDGWLRRKLRRILWKQWKRPKTR

AKKLMRRGLSEATAWRSATNGRGPWWNAGAAHMNKAVPKFYFDKLGLVSLIDQLHRLQ

RTS

[top]


[Secondary structure]

                                       

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |