[Back to introns by organism] [Back to home page]
Information for Ha.ch.I1 intron (Format of information for each intron)
[Intron and flanking sequence]
Sequence
from Genbank entry. Intron is identified in red.
The ORF
is identified in blue with start and stop codons
underlined.
5' end
gtgc
4095001
gacgagagtc gcgcttgcgc gctgtaacat gctgatttat
atagcatttt acggcattga
4095061 agtgcagtct attctgtact tcaacccctg tcaacaaggt ggctttaagc tggtacttgg
4095121 cgacaggttc aacggcggtg ctgttggttt ttaacaacag gtgccgttga aaatcctgca
4095181 accctggctt ttgctggggt tgcatacgca cctcagtaat tccaggaatc ggggttggac
4095241 gacaggaagt aatttggcga cactcttggt gacgttgaac tgctttctaa cccttggaga
4095301 aatgaacgat gaaagatcgg aatcacgatg acagtgatct ggtgaacttt gtgaagtctt
4095361 tgccttcggt aaaacacctc aaagtttatg cacccggcga agctgaaaag ctgccaccgg
4095421 aacagctatc acctgccgtc aggcggttac tgatacgcga gcgcaatacc agcaagcgtt
4095481 aatcgtgatc gcgagaaatc atctgttacg acgacacccg ttgtcaaagt agctctcttc
4095541 aaaaaggggc cgtatggaga gtcaaacagg taacggtttg agcttcagcg agtcgatgca
4095601 agcgtgtaac gctgcgttga tgattgtcgg cctgacctgt gatcgccttt cttttctgaa
4095661 tagctaatca catctcagta aacgaccggt tgtgtgacca accgtagcct aggggcagcg
4095721 catttctggc aacggatttg tgtgaagcgc cctgacaatg tacccccgct ttagcggtgc
4095781 cgttttatcg tgaggtgaac ggtagatact gccagtagca aaggcggtta aggggctagg
4095841 ctgggaggta aggtgaacgc aagtgaactg ttgataaacg tcgttatcgg gaaaaagcca
4095901 aagctactga caggttttaa ccaaaaggtg tgtggttgga tactcctttt tacattggga
4095961 atacgtcacc agtaaaacca ccggcgaaga gacagcacct aacccttccg tgtcatcagc
4096021 gcggaactcg gtaagcccgt agttgtgcct tttccttaac cggacaaggc aggagcgccg
4096081 taaggcaatc tgttgcggct gcgggtagag gaggatggaa aaagcgaatg tccggctgta
4096141 atggccgaaa taccggggtg aaacataccc ggcagcgaaa gctgtgccca cttccatctg
4096201 gtctctcatg gcgagagatc tggataaccc ttgaagcacc cgaattgtag ataaagaaaa
4096261 ggcagaccca agggcctgtc gattcagagt ttacatatca gggtaaaaat cggtggggag
4096321 ttcgaacatt ttgttcggct cctgaccgag ccgagttcga actccttcca attacaagaa
4096381 ggagactagc atgacaagcg tttcgctcac
agcggaatct gcattctccg gcagaccgga
4096441 acactggcac cagattgact ggaagcatgt gaaccagact gtccgaggga tacagattcg
4096501 tatcgctaag gcaacgaagg aagaggattg gcgcaaggtg aaagccctgc aacgattctt
4096561 gacccgttcg ttctgtggcc gtgttttagc cataagacga gtgactgaga accaagggcg
4096621 caggactccg ggagtagacg gggttttgtg gtcaacgccc gaagctaaat gggcagcgat
4096681 tggtcaattg aagcgccggg gctatcgcgc cctgcccttg aggcgggtgc gtatccccaa
4096741 ggctaatggg aaagagcgcc ttctgggcat tccgaccatg caagacaggg ccatgcaagc
4096801 attgtatttg cttgcgttac agccggtatc agaaacgcga gcagatcggg attcttatgg
4096861 attccggcct gatcgctcca cagcggatgc cattatgcag tgctatatgc tgctgcgtaa
4096921 aaaaggttcg gctcagtggg ttctggaggc tgatattaaa ggctgctttg atcatatcga
4096981 ccaccaatgg ctgattgata atgtcccgat ggacaaactg atgttgcgta aatggctgaa
4097041 agccggagtc gtcgatatgg ggcgagtatg gaaaacggag gaaggaacac cgcaaggtgg
4097101 aattatctcc cctactctcg ccaatatggc gcttgacggc atagaggcat tactggctca
4097161 gcacttcggt gccaaaggca gtaagaaatt acgtcaatat aaggttggtc ttgtgcgtta
4097221 tgcggacgac tttgtgatta ccggcagctc gaaagaactg ttagaaaatg aagttaaacc
4097281 gctgattgag aaatttctgg ccgttcgcgg tctgaaacta tcggttgaaa aaacgcaggt
4097341 gacacacatc aatcacggct ttgattttct ggggtggacg gtgcgtaagt accagagcaa
4097401 attgctcatc aagccatcca gaaagaacac caaagcattt ctaacgaagt gccgcgatgt
4097461 aatcaatgcc aataaaagtg ctaaacagga aaacctgata cacaggttaa accccatcat
4097521 tcgtggttgg gtgaattacc ataaacatca ggtcgccagt gacgcattcg ctcgcgtgga
4097581 tgctcaactc tggcacgcct tatggcgctg ggcgcggagg cgacattcga agaaggggaa
4097641 gcggtggatt gccagtcgct actggcagca catcgacaac cggctctgga cgtttgccga
4097701 cacaacaatt gatgatttag gcgcagagaa aacggtgaaa ctggtttacg ccacagacac
4097761 caaaatcaag cggcatacca aggtcaaatc ggaggctaat ccttttgatc ctgagtggga
4097821 gttgtacttt gaggagttgc gaggcaagcg tatgcgggat tccctgcaat atcgccgccg
4097881 cgttaatagc ctttatgtac agcaatttgg taagtgcgca ctatgcgaac aagccatcac
4097941 ccacgaaacc ggctggcacg agcatcattt gatccatcgg gtcaatggtg gcgatgacag
4098001 tcttgccaac ttggtgcttt tgcatcctgc ctgtcatatg caggttcatc atcagcacat
4098061 ttcagtaacg aagccggctc tttcgggagt ttcgtagagg cttgagccgt
atgcggggaa
4098121 actcgcacgt acggttctta gggggctggg gcgcagtaat gcgtcctggc tacccgac
3' end
[top]
[Intron and flanking sequence]
4094641
ccagcgagtg tccgaaaatc agctatccga agccggataa caaactgact ttcgatcgtc
4094701 tgacctcagt gttcctgtca aacaccaacc atgaagaaga tcagccctgt catttgaccc
4094761 tcaaggacaa agacgtgccg atcaactaca atctgcccaa gtacgacgaa ccggcgcagc
4094821 gctactgccc tgccggcgtt tacgaagtgg ttgagaatga cggcggcggc aaacgcttcc
4094881 agatcaacgc gcaaaactgc atacactgca aaacctgcga tatcaaagac cccacccaga
4094941 acatcatctg ggtcaccccg gaaggcggcg gcggtcccaa ctatcccaac atgtaggtgc
4095001 gacgagagtc gcgcttgcgc gctgtaacat gctgatttat atagcatttt acggcattga
4095061 agtgcagtct attctgtact tcaacccctg tcaacaaggt ggctttaagc tggtacttgg
4095121 cgacaggttc aacggcggtg ctgttggttt ttaacaacag gtgccgttga aaatcctgca
4095181 accctggctt ttgctggggt tgcatacgca cctcagtaat tccaggaatc ggggttggac
4095241 gacaggaagt aatttggcga cactcttggt gacgttgaac tgctttctaa cccttggaga
4095301 aatgaacgat gaaagatcgg aatcacgatg acagtgatct ggtgaacttt gtgaagtctt
4095361 tgccttcggt aaaacacctc aaagtttatg cacccggcga agctgaaaag ctgccaccgg
4095421 aacagctatc acctgccgtc aggcggttac tgatacgcga gcgcaatacc agcaagcgtt
4095481 aatcgtgatc gcgagaaatc atctgttacg acgacacccg ttgtcaaagt agctctcttc
4095541 aaaaaggggc cgtatggaga gtcaaacagg taacggtttg agcttcagcg agtcgatgca
4095601 agcgtgtaac gctgcgttga tgattgtcgg cctgacctgt gatcgccttt cttttctgaa
4095661 tagctaatca catctcagta aacgaccggt tgtgtgacca accgtagcct aggggcagcg
4095721 catttctggc aacggatttg tgtgaagcgc cctgacaatg tacccccgct ttagcggtgc
4095781 cgttttatcg tgaggtgaac ggtagatact gccagtagca aaggcggtta aggggctagg
4095841 ctgggaggta aggtgaacgc aagtgaactg ttgataaacg tcgttatcgg gaaaaagcca
4095901 aagctactga caggttttaa ccaaaaggtg tgtggttgga tactcctttt tacattggga
4095961 atacgtcacc agtaaaacca ccggcgaaga gacagcacct aacccttccg tgtcatcagc
4096021 gcggaactcg gtaagcccgt agttgtgcct tttccttaac cggacaaggc aggagcgccg
4096081 taaggcaatc tgttgcggct gcgggtagag gaggatggaa aaagcgaatg tccggctgta
4096141 atggccgaaa taccggggtg aaacataccc ggcagcgaaa gctgtgccca cttccatctg
4096201 gtctctcatg gcgagagatc tggataaccc ttgaagcacc cgaattgtag ataaagaaaa
4096261 ggcagaccca agggcctgtc gattcagagt ttacatatca gggtaaaaat cggtggggag
4096321 ttcgaacatt ttgttcggct cctgaccgag ccgagttcga actccttcca attacaagaa
4096381 ggagactagc atgacaagcg tttcgctcac agcggaatct gcattctccg gcagaccgga
4096441 acactggcac cagattgact ggaagcatgt gaaccagact gtccgaggga tacagattcg
4096501 tatcgctaag gcaacgaagg aagaggattg gcgcaaggtg aaagccctgc aacgattctt
4096561 gacccgttcg ttctgtggcc gtgttttagc cataagacga gtgactgaga accaagggcg
4096621 caggactccg ggagtagacg gggttttgtg gtcaacgccc gaagctaaat gggcagcgat
4096681 tggtcaattg aagcgccggg gctatcgcgc cctgcccttg aggcgggtgc gtatccccaa
4096741 ggctaatggg aaagagcgcc ttctgggcat tccgaccatg caagacaggg ccatgcaagc
4096801 attgtatttg cttgcgttac agccggtatc agaaacgcga gcagatcggg attcttatgg
4096861 attccggcct gatcgctcca cagcggatgc cattatgcag tgctatatgc tgctgcgtaa
4096921 aaaaggttcg gctcagtggg ttctggaggc tgatattaaa ggctgctttg atcatatcga
4096981 ccaccaatgg ctgattgata atgtcccgat ggacaaactg atgttgcgta aatggctgaa
4097041 agccggagtc gtcgatatgg ggcgagtatg gaaaacggag gaaggaacac cgcaaggtgg
4097101 aattatctcc cctactctcg ccaatatggc gcttgacggc atagaggcat tactggctca
4097161 gcacttcggt gccaaaggca gtaagaaatt acgtcaatat aaggttggtc ttgtgcgtta
4097221 tgcggacgac tttgtgatta ccggcagctc gaaagaactg ttagaaaatg aagttaaacc
4097281 gctgattgag aaatttctgg ccgttcgcgg tctgaaacta tcggttgaaa aaacgcaggt
4097341 gacacacatc aatcacggct ttgattttct ggggtggacg gtgcgtaagt accagagcaa
4097401 attgctcatc aagccatcca gaaagaacac caaagcattt ctaacgaagt gccgcgatgt
4097461 aatcaatgcc aataaaagtg ctaaacagga aaacctgata cacaggttaa accccatcat
4097521 tcgtggttgg gtgaattacc ataaacatca ggtcgccagt gacgcattcg ctcgcgtgga
4097581 tgctcaactc tggcacgcct tatggcgctg ggcgcggagg cgacattcga agaaggggaa
4097641 gcggtggatt gccagtcgct actggcagca catcgacaac cggctctgga cgtttgccga
4097701 cacaacaatt gatgatttag gcgcagagaa aacggtgaaa ctggtttacg ccacagacac
4097761 caaaatcaag cggcatacca aggtcaaatc ggaggctaat ccttttgatc ctgagtggga
4097821 gttgtacttt gaggagttgc gaggcaagcg tatgcgggat tccctgcaat atcgccgccg
4097881 cgttaatagc ctttatgtac agcaatttgg taagtgcgca ctatgcgaac aagccatcac
4097941 ccacgaaacc ggctggcacg agcatcattt gatccatcgg gtcaatggtg gcgatgacag
4098001 tcttgccaac ttggtgcttt tgcatcctgc ctgtcatatg caggttcatc atcagcacat
4098061 ttcagtaacg aagccggctc tttcgggagt ttcgtagagg cttgagccgt atgcggggaa
4098121 actcgcacgt acggttctta gggggctggg gcgcagtaat gcgtcctggc tacccgactg
4098181 caaaaaagcc ggctcacgcc ggctttttta tttctgttgt ttaccctact ccgccagcca
4098241 gatactccaa cggtcagtcc acctgcaatt gcactctgac caacatggtg tttttcttat
4098301 cgggtttggc ggccagggtt tccaggctgg gccagtatgg catttgtttg tcgccgtcgt
4098361 attcgtaagt cagggtcagg ctgtctttgc tttgctcggt aatcaggcca gtgcgaccct
4098421 cgcccagtat gggagcgtcg ctcaattgct gtagcgactg ccagatttcc tgttcgaatt
4098481 tcttcattcc cgccttgatc tcctccggat catcgctgct cgccaaagga taatcattaa
[top]
MTSVSLTAESAFSGRPEHWHQIDWKHVNQTVRGIQIRIAKATKEEDWRKVKALQRFLT
RSFCGRVLAIRRVTENQGRRTPGVDGVLWSTPEAKWAAIGQLKRRGYRALPLRRVRIP
KANGKERLLGIPTMQDRAMQALYLLALQPVSETRADRDSYGFRPDRSTADAIMQCYML
LRKKGSAQWVLEADIKGCFDHIDHQWLIDNVPMDKLMLRKWLKAGVVDMGRVWKTEEG
TPQGGIISPTLANMALDGIEALLAQHFGAKGSKKLRQYKVGLVRYADDFVITGSSKEL
LENEVKPLIEKFLAVRGLKLSVEKTQVTHINHGFDFLGWTVRKYQSKLLIKPSRKNTK
AFLTKCRDVINANKSAKQENLIHRLNPIIRGWVNYHKHQVASDAFARVDAQLWHALWR
WARRRHSKKGKRWIASRYWQHIDNRLWTFADTTIDDLGAEKTVKLVYATDTKIKRHTK
VKSEANPFDPEWELYFEELRGKRMRDSLQYRRRVNSLYVQQFGKCALCEQAITHETGW
HEHHLIHRVNGGDDSLANLVLLHPACHMQVHHQHISVTKPALSGVS
[top]

[top]
| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragments | Alignment of insertion sequence |
| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |