Information for H.s.I1 intron (Format of information for each intron)
Sequence from GenBank entry. Intron is identified in red. The ORF is identified in blue with start and stop codons underlined.
5' end
gtgcgac aagtgttgtg cttactgggg ttaagtcctt atatatcaag gatttaactc
10981 aattaagaga aagctatgta atgtagcttt ctcttgacta tccagatctt ctcgggaaga
11041 gtttgggtag cagactagaa aaggagcgtt gctcaagatc tagttctaag ttggtaagaa
11101 tatctccaac tttatgcggt aataaggtat ggtcttcact tttatgtact taatatttag
11161 ataaatatta agaggagtaa gcaagcaaga tccgttatcg aaagatagga ctaccgttag
11221 tcgcctattt atagagtgac ttgggtcaaa tccaaaacgt catatcgtcc ataaagaatg
11281 attattcaac cctactataa ataaagtccg aaagataccg tagcaatctg aaacggttcg
11341 agtataagaa agtggaattt cttctatttg gttgccacct ttacacttat ctttatttac
11401 tgtacttcgg gatattcttt tattaagaaa caccctcagc aacttgaaag taatagttgc
11461 ccgtgaatgg cttgcggaac agcactgcta cttctcggtc taaaatttac ccaaatcccc
11521 tttcattgaa ataatcatta ctaacaaatg agtttttatt tgttaaaggt tagttatttc
11581 ctattagagg attaaacgta aacaaagtac tatcaaagta ctattaaaca tttgtttaat
11641 ggataaagta agttttaaag caaaccagca agggcttatt tattgctgta ttaggtaaca
11701 acttagttaa gtatgagtaa gctgttatta ctattgttac aatttgtttc aacgttgata
11761 tatgcagttg cgaagaaata aatgagtgta acctgttata tgactaacag taacctaggg
11821 atagtgcata attggcaacg attttgtgcg gagcttcctg acaatgtacc cccgataagg
11881 ttattaatac tgtgaagtat taataagaga ctcgaaacga ccattgggtt gagggactag
11941 gttggataga aagataaacg taagtgaata gtcgtaattc tcgttatgta gaacaagcct
12001 acggtcgcta tcggcttgaa ccaaaaggta tgcagtgggt tatctcattt tagccttggg
12061 atacgtcatc ataaacactg tcgggaaaaa agactatatc ttccctatcc atgtagtaat
12121 aatggaactt ggtaagccta tatctccgcc tgaaaaatgg caggcaacgc gtaagcaatg
12181 ctgttggtgg tgtaggtaaa agaagttaga gaaagcgaag gttgtactgt aatggtgcaa
12241 atagggtatg aaatatgacc taacccgaaa gggtgcagac ttctaaccgg tctttcatcg
12301 caagatgtct gacaaattcc tttaaaattt gacgggataa aagcgagcgt tggcttgcag
12361 attgaaaacc aatcaaataa aaccaacgca cgcactcttt atccttagtg attcaagagt
12421 gtaagtttat tggtaaccat agcaactctt gaagtgtcca aacagaaaga ggaacgatta
12481 atttaccaag aaaacttatc tctcaacgga ggtaagtatg aaagagcaaa ctcatcatac
12541 taagaaatta gtaagcagtc tagctccagc gacctcacat atatcgcaat ggcattatat
12601 caattggtat aaagcaaacc gatatgtgaa aggaatgcag gtgcggattg cgaaggcaac
12661 acaggaaagc aattggcgta aggttaaaaa cttacaacga atgcttaccc actcgtttta
12721 tgcaaaagca ttagcagtac gacgggtgac ggaaaataca ggcaaacgaa ccgctggtat
12781 tgataaacgg atttgggata cacctgaatc caaatggatt gctatacaag atctatcaag
12841 taaagggtat caacctaaac cattaagacg ggttttcatt ccaaagtcaa atgggaagaa
12901 acgtccatta ggcataccca cgatgaaaga tagagcaatg caaatgttgt atttactagc
12961 attgcaaccc atagcagaaa caacagcaga caataattcg tatggattta gattaaaccg
13021 ttcaacggca gatgctatat cacatataca tagtatattt tcaacaaaag gaaaccaatc
13081 tcgtcaaatg gcagaatggg tactggatgc agatatacat ggttgttttg attttattaa
13141 ccacgactgg ctactcaagc atattccaat gaataagcga atactcaaga aatggttaaa
13201 atcaggtgtt gttgagttcg gtcaactaaa accaacaacg gaggggacac cgcaaggtga
13261 tattatctct ccaacgttgg caaatatggc gttagatggg ttagaaaaag agctaatcaa
13321 gcactttggt gcaaagaata gtcttaaaat agcaaaacat cggacatacc tcgttaggta
13381 tgcagatgac tttattattt caggtatatc aaaagaatta ctggaagaac aagttatccc
13441 tatggtgaaa aactttcttg cagaaagagg gctttcctta tcggaaagta aaaccaaggt
13501 agtgcatatc gaacacggat ttgacttttt aggttggact gttaaacgtt ttgataaaaa
13561 attgattatt aaaccaagta agaaaaatgc aaaagcattt tacgacaagg taaaacaatc
13621 aatttcaaaa atgaaaatgg cgaaacaaga tgacctaata aaagtattaa atccaatgat
13681 aagaggttgg acaaactatc ataaacacgt tgtggctaaa gtgatattta accgaatgga
13741 tagtttaatt tggaaagcct tatggcgttg gtgcagacga agacatccaa acaaaggaaa
13801 aatttggatt aaagagaaat acttttattc aaatgcaact cggaattgga ttttcggcac
13861 agtcaccaac agcaataaag aagaacaaat cccaataaat ttactctatt gtggatatgt
13921 gaaaattaag cgacacagaa agataaaatc acaatacaaa ccatttttac ctgaatggga
13981 aatgtatgga gaaaaccttg ctcaagccag aatgtatgat gaacaatcac acagacagca
14041 atggcaagct ctttacaaag agcagaaagg aaaatgtgca ttgtgtaata cgtcaataac
14101 gaaagaaagc ggctggcacg atcatcatat aatttacaaa atgtatggtg gaacagattc
14161 actaaataac agatgtttag tacatcctga gtgtcaccaa caaattcatc gcctgaattt
14221 gaacgttgcg aaaccgactg cgtagcagtt aatagaaagg cttgagctgt atgcagagaa
14281 atttgcttgt acagttctta gggggctaag ttgtagtaat acaacttggc tacccgat
3' end
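The numbered lines above follow the GenBank flat-file layout: a position, then blocks of ten bases. As a minimal sketch, assuming Python and a hypothetical helper name `join_genbank_lines` (not part of this database), such lines could be collapsed into one continuous string like this. The two example lines are copied from positions 12481 and 12541 of the listing.

```python
import re

def join_genbank_lines(text):
    """Collapse GenBank-style sequence lines into one lowercase string."""
    bases = []
    for line in text.splitlines():
        # Drop an optional leading coordinate such as "12481 ".
        line = re.sub(r"^\s*\d+\s+", "", line)
        # Remove the spaces that separate the ten-base blocks.
        bases.append(re.sub(r"\s+", "", line))
    return "".join(bases).lower()

example = """12481 atttaccaag aaaacttatc tctcaacgga ggtaagtatg aaagagcaaa ctcatcatac
12541 taagaaatta gtaagcagtc tagctccagc gacctcacat atatcgcaat ggcattatat"""

seq = join_genbank_lines(example)
print(len(seq), seq[:10])
```

Lines without a leading coordinate, such as the first line of the intron, pass through the same function unchanged apart from the removal of spaces.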
[Intron and flanking sequence]
10561 gttggtaaag ttatccctgc attgaatggt aaattaacag gtatggcatt ccgtgttcct
10621 acaccaaacg tttccgtcgt tgatttaact gtaaacttgg aaaaaccggc aacttatgca
10681 gaaatctgtg cagaaatcaa acgtgcttct gaaaatgaaa tgaaaggtgt attaggttat
10741 actgaagacg ctgtagtttc tactgatttc aacggttgta gcttgacttc tgtattcgat
10801 gcggcagcag gtattgcatt aactgatact ttcgttaaat tagtttcttg gtatgataat
10861 gaaactggct actcaaataa agtattagat ttagttgctc atgtacataa ctacaaaggc
10921 taagtgcgac aagtgttgtg cttactgggg ttaagtcctt atatatcaag gatttaactc
10981 aattaagaga aagctatgta atgtagcttt ctcttgacta tccagatctt ctcgggaaga
11041 gtttgggtag cagactagaa aaggagcgtt gctcaagatc tagttctaag ttggtaagaa
11101 tatctccaac tttatgcggt aataaggtat ggtcttcact tttatgtact taatatttag
11161 ataaatatta agaggagtaa gcaagcaaga tccgttatcg aaagatagga ctaccgttag
11221 tcgcctattt atagagtgac ttgggtcaaa tccaaaacgt catatcgtcc ataaagaatg
11281 attattcaac cctactataa ataaagtccg aaagataccg tagcaatctg aaacggttcg
11341 agtataagaa agtggaattt cttctatttg gttgccacct ttacacttat ctttatttac
11401 tgtacttcgg gatattcttt tattaagaaa caccctcagc aacttgaaag taatagttgc
11461 ccgtgaatgg cttgcggaac agcactgcta cttctcggtc taaaatttac ccaaatcccc
11521 tttcattgaa ataatcatta ctaacaaatg agtttttatt tgttaaaggt tagttatttc
11581 ctattagagg attaaacgta aacaaagtac tatcaaagta ctattaaaca tttgtttaat
11641 ggataaagta agttttaaag caaaccagca agggcttatt tattgctgta ttaggtaaca
11701 acttagttaa gtatgagtaa gctgttatta ctattgttac aatttgtttc aacgttgata
11761 tatgcagttg cgaagaaata aatgagtgta acctgttata tgactaacag taacctaggg
11821 atagtgcata attggcaacg attttgtgcg gagcttcctg acaatgtacc cccgataagg
11881 ttattaatac tgtgaagtat taataagaga ctcgaaacga ccattgggtt gagggactag
11941 gttggataga aagataaacg taagtgaata gtcgtaattc tcgttatgta gaacaagcct
12001 acggtcgcta tcggcttgaa ccaaaaggta tgcagtgggt tatctcattt tagccttggg
12061 atacgtcatc ataaacactg tcgggaaaaa agactatatc ttccctatcc atgtagtaat
12121 aatggaactt ggtaagccta tatctccgcc tgaaaaatgg caggcaacgc gtaagcaatg
12181 ctgttggtgg tgtaggtaaa agaagttaga gaaagcgaag gttgtactgt aatggtgcaa
12241 atagggtatg aaatatgacc taacccgaaa gggtgcagac ttctaaccgg tctttcatcg
12301 caagatgtct gacaaattcc tttaaaattt gacgggataa aagcgagcgt tggcttgcag
12361 attgaaaacc aatcaaataa aaccaacgca cgcactcttt atccttagtg attcaagagt
12421 gtaagtttat tggtaaccat agcaactctt gaagtgtcca aacagaaaga ggaacgatta
12481 atttaccaag aaaacttatc tctcaacgga ggtaagtatg aaagagcaaa ctcatcatac
12541 taagaaatta gtaagcagtc tagctccagc gacctcacat atatcgcaat ggcattatat
12601 caattggtat aaagcaaacc gatatgtgaa aggaatgcag gtgcggattg cgaaggcaac
12661 acaggaaagc aattggcgta aggttaaaaa cttacaacga atgcttaccc actcgtttta
12721 tgcaaaagca ttagcagtac gacgggtgac ggaaaataca ggcaaacgaa ccgctggtat
12781 tgataaacgg atttgggata cacctgaatc caaatggatt gctatacaag atctatcaag
12841 taaagggtat caacctaaac cattaagacg ggttttcatt ccaaagtcaa atgggaagaa
12901 acgtccatta ggcataccca cgatgaaaga tagagcaatg caaatgttgt atttactagc
12961 attgcaaccc atagcagaaa caacagcaga caataattcg tatggattta gattaaaccg
13021 ttcaacggca gatgctatat cacatataca tagtatattt tcaacaaaag gaaaccaatc
13081 tcgtcaaatg gcagaatggg tactggatgc agatatacat ggttgttttg attttattaa
13141 ccacgactgg ctactcaagc atattccaat gaataagcga atactcaaga aatggttaaa
13201 atcaggtgtt gttgagttcg gtcaactaaa accaacaacg gaggggacac cgcaaggtga
13261 tattatctct ccaacgttgg caaatatggc gttagatggg ttagaaaaag agctaatcaa
13321 gcactttggt gcaaagaata gtcttaaaat agcaaaacat cggacatacc tcgttaggta
13381 tgcagatgac tttattattt caggtatatc aaaagaatta ctggaagaac aagttatccc
13441 tatggtgaaa aactttcttg cagaaagagg gctttcctta tcggaaagta aaaccaaggt
13501 agtgcatatc gaacacggat ttgacttttt aggttggact gttaaacgtt ttgataaaaa
13561 attgattatt aaaccaagta agaaaaatgc aaaagcattt tacgacaagg taaaacaatc
13621 aatttcaaaa atgaaaatgg cgaaacaaga tgacctaata aaagtattaa atccaatgat
13681 aagaggttgg acaaactatc ataaacacgt tgtggctaaa gtgatattta accgaatgga
13741 tagtttaatt tggaaagcct tatggcgttg gtgcagacga agacatccaa acaaaggaaa
13801 aatttggatt aaagagaaat acttttattc aaatgcaact cggaattgga ttttcggcac
13861 agtcaccaac agcaataaag aagaacaaat cccaataaat ttactctatt gtggatatgt
13921 gaaaattaag cgacacagaa agataaaatc acaatacaaa ccatttttac ctgaatggga
13981 aatgtatgga gaaaaccttg ctcaagccag aatgtatgat gaacaatcac acagacagca
14041 atggcaagct ctttacaaag agcagaaagg aaaatgtgca ttgtgtaata cgtcaataac
14101 gaaagaaagc ggctggcacg atcatcatat aatttacaaa atgtatggtg gaacagattc
14161 actaaataac agatgtttag tacatcctga gtgtcaccaa caaattcatc gcctgaattt
14221 gaacgttgcg aaaccgactg cgtagcagtt aatagaaagg cttgagctgt atgcagagaa
14281 atttgcttgt acagttctta gggggctaag ttgtagtaat acaacttggc tacccgattt
14341 aaaaccttcg caaaaaaacc gctctttagg agcggttttt ttataatatt cgatttgctg
14401 atatattcta tatcgtaata acgatctaaa ttgacgttgg ttatacaaat tcaatataga
14461 gtttaaatat accgcaccat tcgtatgagt atgcttatgc aaggacttgt gtaaaagcta
14521 aagcttaaaa cgataaatta cttgattggc ggatcctcgc caaatcaaac ttggatcggc
14581 tttttcttgc tcaaacttac catcaattaa cacatcaatg taaggcaaca ttttccgttg
14641 caattcatca agctccgcta ataaataacc tgtccatagc caaatatctt tatcaggaca
14701 ttcctttttt actctctgca caaaaggcaa aagtgcggta atattttgag gatgtaaagg
MKEQTHHTKKLVSSLAPATSHISQWHYINWYKANRYVKGMQVRIAKATQESNWRKVKN
LQRMLTHSFYAKALAVRRVTENTGKRTAGIDKRIWDTPESKWIAIQDLSSKGYQPKPL
RRVFIPKSNGKKRPLGIPTMKDRAMQMLYLLALQPIAETTADNNSYGFRLNRSTADAI
SHIHSIFSTKGNQSRQMAEWVLDADIHGCFDFINHDWLLKHIPMNKRILKKWLKSGVV
EFGQLKPTTEGTPQGDIISPTLANMALDGLEKELIKHFGAKNSLKIAKHRTYLVRYAD
DFIISGISKELLEEQVIPMVKNFLAERGLSLSESKTKVVHIEHGFDFLGWTVKRFDKK
LIIKPSKKNAKAFYDKVKQSISKMKMAKQDDLIKVLNPMIRGWTNYHKHVVAKVIFNR
MDSLIWKALWRWCRRRHPNKGKIWIKEKYFYSNATRNWIFGTVTNSNKEEQIPINLLY
CGYVKIKRHRKIKSQYKPFLPEWEMYGENLAQARMYDEQSHRQQWQALYKEQKGKCAL
CNTSITKESGWHDHHIIYKMYGGTDSLNNRCLVHPECHQQIHRLNLNVAKPTA
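The protein above can be checked against the nucleotide listing by translating the ORF from its start codon (the atg at the end of the line numbered 12481). The following is a minimal sketch, assuming Python and assuming that the standard genetic code applies to this ORF; the names `CODON_TABLE` and `translate` are illustrative and not part of this database.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 14 codons of the ORF, copied from the listing (positions 12518 onward).
orf_start = "atgaaagagcaaactcatcatactaagaaattagtaagcagt"
print(translate(orf_start))  # MKEQTHHTKKLVSS

# Last codons of the ORF, ending at the tag stop codon on line 14221.
orf_end = "gcgaaaccgactgcgtagcagtt"
print(translate(orf_end))  # AKPTA
```

Both fragments agree with the first and last residues of the protein sequence listed above.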