Information for D.a.I1-1 intron
Note: Multiple insertions
D.a.I1-1    CP000089    (759875-761862)
D.a.I1-2    CP000089    (2435455-2433475)
D.a.I1-3    CP000089    (4109792-4107812)
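
The coordinate pairs above can be turned into lengths and orientations with a short script. This is only a sketch: it assumes the coordinates are 1-based and inclusive, and that a start greater than the end means the copy lies on the complementary strand. The names INSERTIONS and describe are illustrative, not part of the database.

```python
# Coordinates copied from the table above (accession CP000089).
INSERTIONS = {
    "D.a.I1-1": (759875, 761862),
    "D.a.I1-2": (2435455, 2433475),
    "D.a.I1-3": (4109792, 4107812),
}


def describe(start, end):
    """Return (length_in_bp, strand) for 1-based inclusive coordinates.

    Assumption: start > end is read as "complementary strand".
    """
    length = abs(end - start) + 1
    strand = "+" if start <= end else "-"
    return length, strand


for name, (start, end) in INSERTIONS.items():
    length, strand = describe(start, end)
    print(f"{name}\t{strand}\t{length} bp")
```

Under these assumptions, D.a.I1-1 is read in the forward direction and the other two copies in the reverse direction.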
Sequence from the GenBank entry. The intron boundaries are identified in red
and the ORF in blue, with start and stop codons underlined.
5' end
gtgcgc cgtgcaaagg cggcgctttt
759901 ccagtgggtg aaagccccac ccggcgaaat gctccagccg gaagcaaccg gagcagtcat
759961 ggaggtaacg aaatggctga agccttcgga tagcgggcca caaattctgg tggcagcgcg
760021 agtgtgcagg ccgtaacgcg agtgaacgct gaataagcct cgaaaaggac gatgcacagg
760081 ccgacccagc agcccttctg gggaaggctg atacgactga agggatgagc aaggatgtac
760141 cgtcagtcgc tgtgccgggg tattggcgac agcatgtaca caaggaaagc gcacgcaaca
760201 cgggaagccc tatggcgtgg tcagggatgg ccaaccgaac acccgtgagg gatggttcgg
760261 gcgtcatagg gtggcggaga ggtccgtagt accggggaag ccgggtaatg ccggtggagg
760321 gaaggggcct tggttcatga cagacgcact acgtagtgag gacgtggaga ttgggcaacc
760381 tatcaactcc gttgcatgtt cagaaactgc agatggcgtt acacgcgaaa gcgaagtcgg
760441 aatctggata tcgtttctac gcgctgtacg acaagattta tcgcacggat attttggcgc
760501 atgcctatgc ccagtgccgc tccaataagg gcgcgccggg tgtggatcgt caggatttcg
760561 aggatgtcga ggcgtatggt gtgcggcgat ggctggagga actggcgctt gcgctcaaag
760621 aggagagcta ccgaccggat ccaattcgga gagtgtttat cccgaaagcc aatggcaagt
760681 taaggcctct gggcatttca acgctgcatg atcgagtgtg tatgacagca gccatgctgg
760741 tactcgaacc tatctttgaa gctgatcttc ctgatgaaca gtatgcctac cggccgggcc
760801 gcaatgccca gcaggcggca gaagaagtga agaaccggct ctaccttgga caaacggacg
760861 ttgtcgatgc cgacctgtcg gactacttcg gcagcattcc acattctgaa ctgatgaagt
760921 cgctggcgcg acgcatcgtg gatcggcgtg tgctacatct tatcaagatg tggctggagt
760981 gcgcggttga ggaaaccgat cagcgaggac ggaagaaacg gacgaccgag gccaaagatc
761041 aggggcgagg tatcccgcaa ggctcaccga tctcgccgct gctatcgaat ttgtatatgc
761101 ggcggtttgt gctggcgtgg aagaaactgg ggttggagcg aagccttggc agtcgcatcg
761161 tcacctatgc cgacgacctc gtgatcctgt gcaagtgtgg caaggcggaa gaagccttgc
761221 aatggatgcg cacgatcatg gggaaactga agctcacggt gaacgaggaa aagacacgaa
761281 tctgtcaggt accggcaggg acgttcgact ttctgggtta ctcgtttgga cggcgatatg
761341 tgccgcgcac agggaagccg cagatcgctc tgtggccgtc gaagaaaagc attcgacgca
761401 tggtggagaa aatccatgac atgactgagc ggcaaacggg ttggcaagag accacggagc
761461 tggtgggcaa gttgaatcgg acgctacgcg gctgggcgaa ctacttcagt gtagggaacg
761521 tcagtcgcgc ctatcgtgcg ctcgacagtt acacggcaac gcggttgcgt cggtggttgc
761581 gctacaagta caagctcaga cattgcaagg gcgggagcta tccactctcg cacctctacg
761641 ggtactttgg tctcgtacgt ctgggcgcac gtgggcgcag cgaggcgtgg gtgaaggcgt
761701 gatgtcgtgt ccgagagccg gatgcgggaa atctgcatgt ccggttcgat gagggggatg
761761 tgaaaacggg gttacggcag agttacttgg gcaccgccag acgaaagggg cggacaacag
761821 acataccaag cctaatgcta ccgcgtcaca tctctactct ac
3' end
[Intron and flanking sequence]
759541 taccgtggcg atcggggttt gtcggcttcc ggacgatctg aagtggactc acatcgttgg
759601 cgccggcatc ttgggaggta ttggttttac gatgtccatt ttcatcacga atcttgcatt
759661 taccaacaat gcgagcatta tcaacgcgtc aaaaatggct atcttgatgg catctgttgc
759721 tgcggggggg ctcggctttc tttggctgag tttttttcag caaaacgata accaatgagc
759781 ctgtaaggag ttgagggcga ccattgactg caatggcggc aatactttgt taacagccga
759841 ctagcagatt gcggctagaa ggtccgcaac gggcgtgcgc cgtgcaaagg cggcgctttt
759901 ccagtgggtg aaagccccac ccggcgaaat gctccagccg gaagcaaccg gagcagtcat
759961 ggaggtaacg aaatggctga agccttcgga tagcgggcca caaattctgg tggcagcgcg
760021 agtgtgcagg ccgtaacgcg agtgaacgct gaataagcct cgaaaaggac gatgcacagg
760081 ccgacccagc agcccttctg gggaaggctg atacgactga agggatgagc aaggatgtac
760141 cgtcagtcgc tgtgccgggg tattggcgac agcatgtaca caaggaaagc gcacgcaaca
760201 cgggaagccc tatggcgtgg tcagggatgg ccaaccgaac acccgtgagg gatggttcgg
760261 gcgtcatagg gtggcggaga ggtccgtagt accggggaag ccgggtaatg ccggtggagg
760321 gaaggggcct tggttcatga cagacgcact acgtagtgag gacgtggaga ttgggcaacc
760381 tatcaactcc gttgcatgtt cagaaactgc agatggcgtt acacgcgaaa gcgaagtcgg
760441 aatctggata tcgtttctac gcgctgtacg acaagattta tcgcacggat attttggcgc
760501 atgcctatgc ccagtgccgc tccaataagg gcgcgccggg tgtggatcgt caggatttcg
760561 aggatgtcga ggcgtatggt gtgcggcgat ggctggagga actggcgctt gcgctcaaag
760621 aggagagcta ccgaccggat ccaattcgga gagtgtttat cccgaaagcc aatggcaagt
760681 taaggcctct gggcatttca acgctgcatg atcgagtgtg tatgacagca gccatgctgg
760741 tactcgaacc tatctttgaa gctgatcttc ctgatgaaca gtatgcctac cggccgggcc
760801 gcaatgccca gcaggcggca gaagaagtga agaaccggct ctaccttgga caaacggacg
760861 ttgtcgatgc cgacctgtcg gactacttcg gcagcattcc acattctgaa ctgatgaagt
760921 cgctggcgcg acgcatcgtg gatcggcgtg tgctacatct tatcaagatg tggctggagt
760981 gcgcggttga ggaaaccgat cagcgaggac ggaagaaacg gacgaccgag gccaaagatc
761041 aggggcgagg tatcccgcaa ggctcaccga tctcgccgct gctatcgaat ttgtatatgc
761101 ggcggtttgt gctggcgtgg aagaaactgg ggttggagcg aagccttggc agtcgcatcg
761161 tcacctatgc cgacgacctc gtgatcctgt gcaagtgtgg caaggcggaa gaagccttgc
761221 aatggatgcg cacgatcatg gggaaactga agctcacggt gaacgaggaa aagacacgaa
761281 tctgtcaggt accggcaggg acgttcgact ttctgggtta ctcgtttgga cggcgatatg
761341 tgccgcgcac agggaagccg cagatcgctc tgtggccgtc gaagaaaagc attcgacgca
761401 tggtggagaa aatccatgac atgactgagc ggcaaacggg ttggcaagag accacggagc
761461 tggtgggcaa gttgaatcgg acgctacgcg gctgggcgaa ctacttcagt gtagggaacg
761521 tcagtcgcgc ctatcgtgcg ctcgacagtt acacggcaac gcggttgcgt cggtggttgc
761581 gctacaagta caagctcaga cattgcaagg gcgggagcta tccactctcg cacctctacg
761641 ggtactttgg tctcgtacgt ctgggcgcac gtgggcgcag cgaggcgtgg gtgaaggcgt
761701 gatgtcgtgt ccgagagccg gatgcgggaa atctgcatgt ccggttcgat gagggggatg
761761 tgaaaacggg gttacggcag agttacttgg gcaccgccag acgaaagggg cggacaacag
761821 acataccaag cctaatgcta ccgcgtcaca tctctactct acacgctccg gccgtgtcaa
761881 tattcagaaa gcagccgtag ccacgaaatg gctcgtatca ggcagcacga ctgcctcaag
761941 acggctacaa ccggccgtag gagctttgaa atattgtgct ttgcgaacgg caactcacta
762001 atcagactcc cgccattcag cacagccaga tgagagacaa ttgctcagag gtgagcgcga
762061 gttttgccga tatgagacct tcagtcagca atgcaacggt ttgcggatga gcgctagcgc
762121 acatagcgca gttgatgaat gctctaacct ctcgcggcat ctccagtttt ccaactgcct
762181 ggcgagatta ggcaatcacc aaaggtcaaa gattatggaa cttaatcctg cgacatctgc
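
The intron can be located inside the flanking sequence from the coordinates listed for D.a.I1-1 (759875-761862). The sketch below assumes every row starts with the coordinate of its first base followed by blocks of bases, and that rows are contiguous and in order. The helper names parse_rows and extract are hypothetical, and only two rows are included to keep the example short.

```python
def parse_rows(rows):
    """Join numbered sequence rows into (start_position, sequence).

    Assumes each row looks like "<position> <block> <block> ..." and that
    the rows are contiguous and given in ascending order.
    """
    start = None
    parts = []
    for row in rows:
        fields = row.split()
        if not fields:
            continue  # skip blank lines
        if start is None:
            start = int(fields[0])
        parts.append("".join(fields[1:]))
    return start, "".join(parts)


def extract(rows, first, last):
    """Return the bases from position first to last (1-based, inclusive)."""
    start, seq = parse_rows(rows)
    return seq[first - start : last - start + 1]


# Two rows copied from the listing above.
ROWS = [
    "759841 ctagcagatt gcggctagaa ggtccgcaac gggcgtgcgc cgtgcaaagg cggcgctttt",
    "759901 ccagtgggtg aaagccccac ccggcgaaat gctccagccg gaagcaaccg gagcagtcat",
]

# First bases of the intron, starting at its reported 5' boundary.
print(extract(ROWS, 759875, 759880))
```

Passing all rows of the listing and the full coordinate range would return the complete intron under the same assumptions.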
MALHAKAKSESGYRFYALYDKIYRTDILAHAYAQCRSNKGAPGVDRQDFEDVEAYGVR
RWLEELALALKEESYRPDPIRRVFIPKANGKLRPLGISTLHDRVCMTAAMLVLEPIFE
ADLPDEQYAYRPGRNAQQAAEEVKNRLYLGQTDVVDADLSDYFGSIPHSELMKSLARR
IVDRRVLHLIKMWLECAVEETDQRGRKKRTTEAKDQGRGIPQGSPISPLLSNLYMRRF
VLAWKKLGLERSLGSRIVTYADDLVILCKCGKAEEALQWMRTIMGKLKLTVNEEKTRI
CQVPAGTFDFLGYSFGRRYVPRTGKPQIALWPSKKSIRRMVEKIHDMTERQTGWQETT
ELVGKLNRTLRGWANYFSVGNVSRAYRALDSYTATRLRRWLRYKYKLRHCKGGSYPLS
HLYGYFGLVRLGARGRSEAWVKA
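
The amino acid sequence above appears to be the translation of the intron ORF. As a rough check, the sketch below translates the first codons of the ORF, assuming the standard codon assignments (which the bacterial translation table shares) and a reading frame that begins at the atg in row 760381. The names CODON_TABLE and translate are illustrative only.

```python
from itertools import product

# Standard codon assignments in the conventional t, c, a, g ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate lowercase or uppercase DNA, stopping at the first stop codon."""
    dna = "".join(dna.lower().split())  # drop spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i : i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the ORF, copied from rows 760381 and 760441 of the listing.
orf_start = "atggcgtt acacgcgaaa gcgaagtcgg aatctggata t"
print(translate(orf_start))
```

Comparing the printed residues with the beginning of the protein sequence is a quick way to confirm the reading frame before translating the whole ORF.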