[Back to introns by organism]  [Back to home page]

Information for G.k.I2-1 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

Note: Multiple insertions
G.k.I2-1    BA000043 (1374694-1376580)
G.k.I2-2    BA000043 (1495456-1497341)

[Intron sequence]

Sequence from Genbank entry.  Intron is identified in red.  The ORF
is identified in blue with start and stop codons underlined.  

 

5' end

                                            gtgcgcc cggcatgggt gcagtctata

1374721 gggtgaaagt cccgaactgc gaaggcagaa gtagcagtta gcttaacgca agggtgtctg

1374781 cggcgacgca gaatctgaag gaagcgggcg gcaaacttcc ggtctgagga acacgaactt

1374841 cataggaggc tgggtatcat tgggtgagtt tgcacgacaa aacgaagccc tttctgccga

1374901 aggtgatacc gagtaaatga agcagatgga tggaaggaaa gactgcactc ttacccgggg

1374961 aggtctgtcc gggaagccaa gtgcgcttgg caaccgttgg agcgatccaa cgctgaacgg

1375021 acagaagtca gcagaggtca tagtacccgt ctagcttaga tagaagggga aggaccgaac

1375081 catgaaggag aacggccact aggcgttcat cttctttgat gaagcagata acccgaaagg

1375141 gcctgcttga ggggggaaat ggtgaagtcc atggggaacc tcaagagggt ggagaagaag

1375201 atggcacaaa tagaacggac cgttcacgta gagaggaaga atcggaatgt ggatgaaaca

1375261 ggtaatgtca cgagagaatc tcctgcgagc actcaaacaa gtggaaaaga ataaagggtc

1375321 ccatggaacc gatggaatgt ccgtcaaaga cctgcgaaga cacctcgtgg aacattggga

1375381 cgcgatacgg cacgctttag aagaagggac ctacgaacct tgcccggtcc gacgggtcga

1375441 aatcccgaaa ccgaacggag gagtcaggtt actaggaatc ccgaccgtga cagaccggtt

1375501 catccaacag gccatcgccc aagtgctcac gccgatcttt gacccatcct tttcggaaca

1375561 cagctacggg tttcgtcccg gtcgaagagg acacgacgcg gtgaaaaagg cgaagcagta

1375621 tattcaggaa ggatatacat gggtggtaga tatcgacttg gaaaagttct ttgatcgagt

1375681 caaccatgac aaactgatgg ggatattagc gaaacgaatt ccagacaaaa tcctcctaaa

1375741 gttgatacgg aagtatctac aggcaggggt catgatcaac ggggtggtca tggaaacaca

1375801 agaggggact ccacaaggag ggccgctcag tccactcctg tccaacattc tcctggatga

1375861 gctggacaaa gaattggaaa aacgagggca caagtttgta cggtatgcgg atgactgcaa

1375921 tatctacgta aggacgaaga aggccgggga acgggtgatg aaatcgatca cggcattcat

1375981 cgaaaagaaa ctccggctga aagtcaacga aaccaaatcg gcagtggatc gtccgtggag

1376041 gagaaaattc ctcggtttta gcttcacccc aagtaaggag ccaaaaatcc gaattgcaaa

1376101 ggaaagcatt cggcgcatga agcaaaggat acgcaccatg acgagccgat cgaaaccgat

1376161 tcccatgccc gaacgaatcg aacagctcaa tcaatacatt ctgggatggt gtggatactt

1376221 ttcgctagca gagaccccaa gtgtgttcaa agaactagat ggatggattc gacgaaggct

1376281 gcgcatgtgc caatggaaag agtggaaact tccgagaacc agagtccgaa aactgcaaag

1376341 tttaggagtg cccaagcaga aagcatatga atggggaaac actcggaaga aatattggag

1376401 agtggccgcc agtcccatcc tgcataaagc ccttggcaac tcctattggg agagccaagg

1376461 gctgaagagt ctttatcaac gatatgaatc tctgcgtcag acttaatgga accgccgtat

1376521 accgaacggt acgtacggtg gtgtgagagg acgagggtta gtcaccctct cctactcgat

3' end

[top]


[Intron and flanking sequence]

 

1374241 ttgccttcga cagtcaaaaa tggatgagcc acttttccgt caaggatatg ttaaacgatt

1374301 tcgttgcatc tatatatatc gttttgctgt tattgccgcg cctcgcgaag ctggtgttcg

1374361 gcgcgggcca atgtttgttc gcaacgggcg agaaaatcac cgtcgacgcc cgttgcttct

1374421 tggcgggcgc gcgcaagttg gtcgcgggcg tttttcaccg cttgttcggc atggcgaagc

1374481 atctcggggt cgagctgcat cgtcgcctgg ccgaccattt tctccgccgt ctcgacgaac

1374541 atttccactt gctttaagtc gttgtatccc gtatgtacgt cggcatcgcg tctggccata

1374601 tgaacacctc ccttttttta ttgttcgccg cttggttgaa tctatgcaaa tcgcaatgaa

1374661 aaaggcaccc tgcttaagca gggtgcctct cgtgtgcgcc cggcatgggt gcagtctata

1374721 gggtgaaagt cccgaactgc gaaggcagaa gtagcagtta gcttaacgca agggtgtctg

1374781 cggcgacgca gaatctgaag gaagcgggcg gcaaacttcc ggtctgagga acacgaactt

1374841 cataggaggc tgggtatcat tgggtgagtt tgcacgacaa aacgaagccc tttctgccga

1374901 aggtgatacc gagtaaatga agcagatgga tggaaggaaa gactgcactc ttacccgggg

1374961 aggtctgtcc gggaagccaa gtgcgcttgg caaccgttgg agcgatccaa cgctgaacgg

1375021 acagaagtca gcagaggtca tagtacccgt ctagcttaga tagaagggga aggaccgaac

1375081 catgaaggag aacggccact aggcgttcat cttctttgat gaagcagata acccgaaagg

1375141 gcctgcttga ggggggaaat ggtgaagtcc atggggaacc tcaagagggt ggagaagaag

1375201 atggcacaaa tagaacggac cgttcacgta gagaggaaga atcggaatgt ggatgaaaca

1375261 ggtaatgtca cgagagaatc tcctgcgagc actcaaacaa gtggaaaaga ataaagggtc

1375321 ccatggaacc gatggaatgt ccgtcaaaga cctgcgaaga cacctcgtgg aacattggga

1375381 cgcgatacgg cacgctttag aagaagggac ctacgaacct tgcccggtcc gacgggtcga

1375441 aatcccgaaa ccgaacggag gagtcaggtt actaggaatc ccgaccgtga cagaccggtt

1375501 catccaacag gccatcgccc aagtgctcac gccgatcttt gacccatcct tttcggaaca

1375561 cagctacggg tttcgtcccg gtcgaagagg acacgacgcg gtgaaaaagg cgaagcagta

1375621 tattcaggaa ggatatacat gggtggtaga tatcgacttg gaaaagttct ttgatcgagt

1375681 caaccatgac aaactgatgg ggatattagc gaaacgaatt ccagacaaaa tcctcctaaa

1375741 gttgatacgg aagtatctac aggcaggggt catgatcaac ggggtggtca tggaaacaca

1375801 agaggggact ccacaaggag ggccgctcag tccactcctg tccaacattc tcctggatga

1375861 gctggacaaa gaattggaaa aacgagggca caagtttgta cggtatgcgg atgactgcaa

1375921 tatctacgta aggacgaaga aggccgggga acgggtgatg aaatcgatca cggcattcat

1375981 cgaaaagaaa ctccggctga aagtcaacga aaccaaatcg gcagtggatc gtccgtggag

1376041 gagaaaattc ctcggtttta gcttcacccc aagtaaggag ccaaaaatcc gaattgcaaa

1376101 ggaaagcatt cggcgcatga agcaaaggat acgcaccatg acgagccgat cgaaaccgat

1376161 tcccatgccc gaacgaatcg aacagctcaa tcaatacatt ctgggatggt gtggatactt

1376221 ttcgctagca gagaccccaa gtgtgttcaa agaactagat ggatggattc gacgaaggct

1376281 gcgcatgtgc caatggaaag agtggaaact tccgagaacc agagtccgaa aactgcaaag

1376341 tttaggagtg cccaagcaga aagcatatga atggggaaac actcggaaga aatattggag

1376401 agtggccgcc agtcccatcc tgcataaagc ccttggcaac tcctattggg agagccaagg

1376461 gctgaagagt ctttatcaac gatatgaatc tctgcgtcag acttaatgga accgccgtat

1376521 accgaacggt acgtacggtg gtgtgagagg acgagggtta gtcaccctct cctactcgat

1376581 tccgctcccg ttctctcgtc tatgtcaacg aagtcttata atttgacaac gttcgctgct

1376641 tgcggtccgc ggtttccttg gacgatttca aacgaaactt cttggccttc ttctaacgtt

1376701 ttgaaccctt caccttggat cgccgtgaag tggacgaata cgtcggaacc gccttccact

1376761 tcgatgaaac cgtagccttt ttcgttgtta aaccatttta ctttaccacg ttgcataata

1376821 ctgaattcct cctaatacct ttagcctgtt cggctaacct caagattttt atcaaataac

1376881 aaaatatact tcacagccat ccaagatgca ccttcatgtt gaatgcctga aatatattct

1376941 gttatatgat gactttacta tacactaccg cgggtggccg cgtcaagcaa aagaaaggaa

[top]


[ORF sequence]

 

MWMKQVMSRENLLRALKQVEKNKGSHGTDGMSVKDLRRHLVEHWDAIRHALEEGTYEP

CPVRRVEIPKPNGGVRLLGIPTVTDRFIQQAIAQVLTPIFDPSFSEHSYGFRPGRRGH

DAVKKAKQYIQEGYTWVVDIDLEKFFDRVNHDKLMGILAKRIPDKILLKLIRKYLQAG

VMINGVVMETQEGTPQGGPLSPLLSNILLDELDKELEKRGHKFVRYADDCNIYVRTKK

AGERVMKSITAFIEKKLRLKVNETKSAVDRPWRRKFLGFSFTPSKEPKIRIAKESIRR

MKQRIRTMTSRSKPIPMPERIEQLNQYILGWCGYFSLAETPSVFKELDGWIRRRLRMC

QWKEWKLPRTRVRKLQSLGVPKQKAYEWGNTRKKYWRVAASPILHKALGNSYWESQGL

KSLYQRYESLRQT  

[top]

[Secondary structure]

 

                                           

 

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |