
Information on the Ha.ch.I2-1 intron

Note: Multiple insertions

Hahella chejuensis KCTC 2396 (1913689-1915613)

Hahella chejuensis KCTC 2396 (2177921-2179845)

Hahella chejuensis KCTC 2396 (4684165-4686089)

Hahella chejuensis KCTC 2396 (5343057-5344981)

Hahella chejuensis KCTC 2396 (6844512-6846436)
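
A quick consistency check on the insertions listed above: the sketch below computes the length of each span. It assumes the coordinate pairs are 1-based and inclusive (the usual GenBank convention, which this page does not state), and the variable names are illustrative only.

```python
# Coordinate pairs copied from the "Multiple insertions" note.
# Assumption: coordinates are 1-based and inclusive.
insertions = [
    (1913689, 1915613),
    (2177921, 2179845),
    (4684165, 4686089),
    (5343057, 5344981),
    (6844512, 6846436),
]

# Length of an inclusive interval is end - start + 1.
lengths = [end - start + 1 for start, end in insertions]

for (start, end), length in zip(insertions, lengths):
    print(f"{start}-{end}: {length} bp")
```

Under that assumption every insertion has the same length, as expected for copies of one intron.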

 

[Intron sequence]

 

Sequence from the GenBank entry. The intron is shown in red. The ORF
is shown in blue, with start and stop codons underlined.

 

5' end

                               gtgcgcca ggcatggcgc gtagtgcctt gacgggcgct
 98761 gttccggttc aggtggtgct ggccggatga cttgggggtg caagtcccct gtgggcccgg
 98821 caaggggaac cactagccga acggcaaggg tgtccatcgt gaggtggaat ctgaaggaag
 98881 ccgtaggcaa agcactggcc tgacgaacag gaatcgcata cgaggcggcc acactgggta
 98941 aggcggcaaa taccgtcaaa gcccgtactt gcacggaagg tgtggacgta gatgcggcgg
 99001 gcataagtgt gaaggtcacg cgtcttaccc tgggaagtct ggttctctgc cgttgtgcta
 99061 ctcacctcgt gaggggtggg gatggagcgc cagaactcag ccgaggtcat agtaggtgcg
 99121 ttaccgcgta ctgaaggact gaacatggtt gaccgccagt aggcggtgag ttctcgtgat
 99181 gtgcttatga agacagaagc gatccggatg ggtcagggta ttcagggaga cgtcggacgg
 99241 tatccggcag agactgaagc ccgtgttgag gcagacacgg gggctcactc gtggacgaac
 99301 gcggagccga atacgctgat ggaacaggtg cttgagcgcc cgaacctgat gcatgcgtat
 99361 cagcgggtga tgtccaacaa gggcgccgcc ggtgtcgatc agatgccggt ggcggccctg
 99421 aaaggccacc tgcaacagca ttggccaacg ctgcgagagc ggctgttggc cggagactac
 99481 catcctcaac cagtacgccg ggtcagtatc cccaaaccgc aaggcggaga acggatactg
 99541 ggcatcccaa cggtacagga tcgcctgatc caacaagccc tgcaccaagt gctgagcccg
 99601 atgctggagc cgatcttctc ggaccacagt tacgggttcc gccccgggcg aagcgcccat
 99661 caggcggtta gggcgatgca acggcacatc aacgacggtc accgctgggt ggtcgatctg
 99721 gatctggagc agttctttga ccgggtcaac cacgatgtgc tgatgggcct gctggcccgc
 99781 cgcatcgccg accgacggat gctcaccctg atccgccgat acctgcaagc cggtatgctg
 99841 gacggtgggc tggtcagccc aaggcgggaa ggcgcgccgc aaggcggccc cctgtcacct
 99901 ctgctctcta atgtgctcct gaccgaactg gaccgggagc tggagcgccg gggccaccgg
 99961 ttctgccgct acgcggacga ctgcaatatc tacgtccgaa gcgagcgggc cggtcaccgg
100021 gtcatgacta gcattaccca ttacctgaaa atgcaccttc gcctgaaagt gaacgcggag
100081 aaaagtgttg tggatcgacc atggcgccgg agctatctgg gttacagcgt gagctggcgc
100141 aagcaggtac ggctgaggat cgcgccgaag agcctgaagc gctaccaggc caaactgcgt
100201 cagttgctca gacaatcacg agggcgaccg ctgcagacca ccattgcgcg actgaatccg
100261 gcactccggg gctgggcgaa ctactatcgc ctgacgacct cgaagcgccc agtggaagga
100321 ctcgatggct gggtgcgacg gcggttgcgc ctgttgctct ggcgacagtg gaaacggact
100381 tacacccgag cccgtaatct aatgcgacta ggactgtccg agcagagggc ttggagtagc
100441 gcaagcaatg gtcggggacc ttggtggaat agtggcgcgt cccatatgaa cgcggcgctt
100501 cccaagagga tgttcgaccg cctgtctctg gtaagcttgc tggatacgat gaaccggctt
100561 cagcgccaat catgaaccgc cgtggtacgg aaccgtatgc ccggtggtgt gagaggacgg
100621 gggaggtaac tccccctcct actcgat

3' end



[Intron and flanking sequence]

 

 98401 tgaggaagct gttctatttg gagccctccc ctcagactgt tgcgttataa atgaaaaaac
 98461 aggggcttac ggtcgcttcg taagagatgg cgctggcgat ggtgagacat gtctcaaggc
 98521 gatgagagaa tgtcttacaa cagatagcgg atatgtagcc tgttgcgtcc cagggctgct
 98581 gactaatata gaaagcgtag ggacacacat cagacgtaag gacacgatgc agggaagcaa
 98641 caaggatgaa taacgagcca taaggaacgg caagggaagc atctgatacg aaagcgcggc
 98701 acggaagccg cgttttttat ttgtgcgcca ggcatggcgc gtagtgcctt gacgggcgct
 98761 gttccggttc aggtggtgct ggccggatga cttgggggtg caagtcccct gtgggcccgg
 98821 caaggggaac cactagccga acggcaaggg tgtccatcgt gaggtggaat ctgaaggaag
 98881 ccgtaggcaa agcactggcc tgacgaacag gaatcgcata cgaggcggcc acactgggta
 98941 aggcggcaaa taccgtcaaa gcccgtactt gcacggaagg tgtggacgta gatgcggcgg
 99001 gcataagtgt gaaggtcacg cgtcttaccc tgggaagtct ggttctctgc cgttgtgcta
 99061 ctcacctcgt gaggggtggg gatggagcgc cagaactcag ccgaggtcat agtaggtgcg
 99121 ttaccgcgta ctgaaggact gaacatggtt gaccgccagt aggcggtgag ttctcgtgat
 99181 gtgcttatga agacagaagc gatccggatg ggtcagggta ttcagggaga cgtcggacgg
 99241 tatccggcag agactgaagc ccgtgttgag gcagacacgg gggctcactc gtggacgaac
 99301 gcggagccga atacgctgat ggaacaggtg cttgagcgcc cgaacctgat gcatgcgtat
 99361 cagcgggtga tgtccaacaa gggcgccgcc ggtgtcgatc agatgccggt ggcggccctg
 99421 aaaggccacc tgcaacagca ttggccaacg ctgcgagagc ggctgttggc cggagactac
 99481 catcctcaac cagtacgccg ggtcagtatc cccaaaccgc aaggcggaga acggatactg
 99541 ggcatcccaa cggtacagga tcgcctgatc caacaagccc tgcaccaagt gctgagcccg
 99601 atgctggagc cgatcttctc ggaccacagt tacgggttcc gccccgggcg aagcgcccat
 99661 caggcggtta gggcgatgca acggcacatc aacgacggtc accgctgggt ggtcgatctg
 99721 gatctggagc agttctttga ccgggtcaac cacgatgtgc tgatgggcct gctggcccgc
 99781 cgcatcgccg accgacggat gctcaccctg atccgccgat acctgcaagc cggtatgctg
 99841 gacggtgggc tggtcagccc aaggcgggaa ggcgcgccgc aaggcggccc cctgtcacct
 99901 ctgctctcta atgtgctcct gaccgaactg gaccgggagc tggagcgccg gggccaccgg
 99961 ttctgccgct acgcggacga ctgcaatatc tacgtccgaa gcgagcgggc cggtcaccgg
100021 gtcatgacta gcattaccca ttacctgaaa atgcaccttc gcctgaaagt gaacgcggag
100081 aaaagtgttg tggatcgacc atggcgccgg agctatctgg gttacagcgt gagctggcgc
100141 aagcaggtac ggctgaggat cgcgccgaag agcctgaagc gctaccaggc caaactgcgt
100201 cagttgctca gacaatcacg agggcgaccg ctgcagacca ccattgcgcg actgaatccg
100261 gcactccggg gctgggcgaa ctactatcgc ctgacgacct cgaagcgccc agtggaagga
100321 ctcgatggct gggtgcgacg gcggttgcgc ctgttgctct ggcgacagtg gaaacggact
100381 tacacccgag cccgtaatct aatgcgacta ggactgtccg agcagagggc ttggagtagc
100441 gcaagcaatg gtcggggacc ttggtggaat agtggcgcgt cccatatgaa cgcggcgctt
100501 cccaagagga tgttcgaccg cctgtctctg gtaagcttgc tggatacgat gaaccggctt
100561 cagcgccaat catgaaccgc cgtggtacgg aaccgtatgc ccggtggtgt gagaggacgg
100621 gggaggtaac tccccctcct actcgatcag cctgatacgg gcggctttta ttgtcacgtt
100681 ttagcttttc gcaaagctct tatttatatg ccctgagaga accgcttccc attcctgtca
100741 tttttaatct tccttctgta gagttaatgt aagttttcgt taactttttt gacgcctttg
100801 catacaatcc ctcgctatag gtgattgata aagccactaa tttaaaagtg gcggttgcta
100861 tagccggttt aatcaatcgg ttacggagtg tggtgacaat acggaagttg aagggttaat
100921 cactttatcg ccctcaaaat aatcacaatt tcagttcaca ctcattttcc accctgcaag



[ORF sequence]

 

MKTEAIRMGQGIQGDVGRYPAETEARVEADTGAHSWTNAEPNTLMEQVLERPNLMHAY

QRVMSNKGAAGVDQMPVAALKGHLQQHWPTLRERLLAGDYHPQPVRRVSIPKPQGGER

ILGIPTVQDRLIQQALHQVLSPMLEPIFSDHSYGFRPGRSAHQAVRAMQRHINDGHRW

VVDLDLEQFFDRVNHDVLMGLLARRIADRRMLTLIRRYLQAGMLDGGLVSPRREGAPQ

GGPLSPLLSNVLLTELDRELERRGHRFCRYADDCNIYVRSERAGHRVMTSITHYLKMH

LRLKVNAEKSVVDRPWRRSYLGYSVSWRKQVRLRIAPKSLKRYQAKLRQLLRQSRGRP

LQTTIARLNPALRGWANYYRLTTSKRPVEGLDGWVRRRLRLLLWRQWKRTYTRARNLM

RLGLSEQRAWSSASNGRGPWWNSGASHMNAALPKRMFDRLSLVSLLDTMNRLQRQS
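
The ORF listing can be cross-checked against the numbered nucleotide lines. The sketch below parses one GenBank-style line (coordinate followed by ten-base blocks) and translates it with the standard genetic code. It assumes the ORF begins at the first atg on the 99181 line, which is consistent with the protein above starting MKTEAIRMGQ. The function names `parse_line` and `translate` are hypothetical helpers, not part of any database tooling.

```python
# Standard genetic code (NCBI translation table 1), built compactly.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def parse_line(line):
    """Split a numbered sequence line into (start coordinate, bases)."""
    tokens = line.split()
    return int(tokens[0]), "".join(tokens[1:]).upper()

def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Line copied from the intron sequence listing.
line = "99181 gtgcttatga agacagaagc gatccggatg ggtcagggta ttcagggaga cgtcggacgg"
start, bases = parse_line(line)

# Assumption: the ORF starts at the first ATG on this line.
orf_offset = bases.find("ATG")
orf_start = start + orf_offset
peptide = translate(bases[orf_offset:])

print(orf_start, peptide)
```

The translated fragment can then be compared with the first residues of the ORF listing. Concatenating all numbered lines before calling `translate` would extend the same check to the full ORF.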



[Secondary structure]

                                                   


