[Back to introns by organism]   [Back to home page]

Information of M.m.I1-2 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

Note: Multiple insertions

M.m.I1-1        AE013515 (3337-5483)

M.m.I1-2        AE013516 (7432-10327)

 

 

[Intron sequence]

 

Sequence from Genbank entry.  Intron is identified in red.  The ORF
is identified in blue with start and stop codons underlined.  

A putative transposase (in green) inserted into the intron ORF.

               

5' end

                                                              gtgcgacga
7441  gaagttatgc tatggattgc ggaacaaccc gccattctat ccattcagaa tgtccctatt
7501  ctgatgtgca tttctggcaa cggatctgtg caaagctcag aggacaatga ggtgtaaaac
7561  tgaacctgac agttccaaac cgctagggaa aaggatatat ggagaaccga aaggtgaaga
7621  tctgtttgag agatcgtaac gtataacatg cgaaagcagg ttaatacgca tgagccaaaa
7681  tggcatagga gcctggttat gacattccat gttgtgtcat gacgaatgga catatggctt
7741  ccggtctata ggatcctcct aagtatccaa caactatcta tgacatagaa ctcggtaaac
7801  catttgtggt ccttaataag gtaggaattc cgcaaggaaa accgacgcca caggtggcag
7861  aggaattggc aaaaagcaaa cggtttcctg taatggggaa tatagaggaa aaaaaatgcc
7921  tcaatgcgaa agcatgccca cttgccatta gtcacgaaag caacagggat tgtttaatta
7981  tgaatgtaag agaaccagag ataacttcgg ttacggacct tacagacaaa gaactcactc
8041  tacgatggaa aagtattgat tggaaaagag tgaaagaagc cgttaataac ctacagtctc
8101  gaattgcaag tgcagctaag aacggaaatt ggaaaactgt gaacaaactc tcccgtcttc
8161  tgacccggtc cttttatgcc aaacttcttt cggtacgtaa agtaaccact aacaagggaa
8221  gtcgaactcc cggaatagat ggcgtcattt ggtcatcatc ggcagataag atgcgtgccg
8281  ctctacaact aacgaacaaa ggctatcgtg caaaaccatt aacacggaag tacattcgaa
8341  agaagagcgg taaactacga cctcttagca taccaactat gtttgacaga gcaatgcaaa
8401  ctttgcactc tctggtgttg ggcgcaatcg aatctgcagc aggtgacaaa acttcgtttg
8461  ggtttaaacc ttatcgttct acaaaagatg cttatgccta ccttcacctc tgtttaagca
8521  agaaagtttc tcctgaatgg atcgttgaag gtgacattaa agcttgcttt gatgaaatca
8581  gccataattg atacttgata acatccctct ggataaacga atccttaaag agttcctaaa
8641  agccggatat atcgaga
att aggcattgtt aatcaaatat agctaattct ttatttctgt
8701  gtttcatcaa tagaagaaca gagtacttta gcatttcaag actcttcgta taacactttg
8761  tctttcgtct caaccttgct agaaagtgcc tgaatatgct gttatatcct tcaactgtat
8821  acgtttctgc ttttgattga gtatgaatag tttctggaat aaactctgca tatggtttcc
8881  agtgatcagt cattacttct tcaatctttt tcgtctttaa tttttcccag atcagttgtc
8941  cagtttctgt tcctctgctg ccaaaagagc agttgatgaa ttttttccca actctatcaa
9001  cagcaatcca tatccagcaa tatttttttt gttaccgatg taagtgtgca tctcatccat
9061  ttcaacaata gatatctcat tttcgctttt caggtcctct agctcccgac caaatttctt
9121  tatccatttt tgaacagaaa catgacttac acctaaaaat cgccctattg agcgaaatcc
9181  taatccttca agatagagtt gcaaagcttg tctcttaaca gaagtggggc tagcagttga
9241  tttcagctct acggaatagt tatatccaca atcatggcat ttgtaccgtt gacgtccatc
9301  aactataccg ttctttttgt gagtggaact attacatctt gggcagtcca tacagaacta
9361  taggctctca taatagataa ctatgtttaa ttaccaaagc cc
gagaatta ttatctgttt
9421  cctaccgaaa aaggtacacc tcaaggaggg cctatatctc caataattgg aaatatgtcc
9481  ttaaacggct tagaaaacgc cttagcgatg agattttact ccagatcaga tggaacaatt
9541  gacaaatctc atcaaaacag acacaaggtt aattatgttc gttttgctga tgaccttgtg
9601  gtaactgcca attccccgga aacggctctt gaaatagtcg atgtcatcca agcattttta
9661  gatcctcgtg gacttaagct cagcgaagaa aagactcttg tgaccaatat tagtgaaggg
9721  ttcaatttcc taggatggaa cttcaggaaa tataaaggaa aacttcttcc gaagccatct
9781  aaagattctc aaaaggaagt tattaagaaa atccgtgacg tacttcacaa agcaaaagca
9841  tgggatcaag accgattgat acaaacactc aacccaatca ttaggggatg ggcacagtat
9901  cataatcacg cagtttcttc tgctatcttc aacaaacttg atgaaatagt ctataacatg
9961  cttatctcct gggcgaaaag aagacactca aataaaggtt tcacctggat aacgaccaaa
10021 tactggcata aatccggtaa aagaaaatac gtattctgca cagaactaca tacgttggag
10081 agattctcca atgccaaaat tgttaggcaa agactagcaa gccttaataa gaatccattt
10141 atcgacaaag aatatttcga acaatggaaa ttcatggaat accaccggaa gaaacgcatc
10201 actaacccca attttgttct aaactga
cac ccgaaagggt agtagtggct cgagccggat
10261 gaatcgaaag gttcaagtcc ggttcctaga agacggggta gggagaaatc ccaccctgtt
10321 attcgac

3' end  

[top]


[Intron and flanking sequence]

 

7021  agaaagagca agcttgatcc tttcaaacct tacatacaag aaaaactcaa agaaggtccc
7081  tatactgctg ctcgcctcta tcgggaaatc aaagaaatgg gttttgatgg aggaaaaacc
7141  atagtcaagg acttcgtaca aaaaatccga cccgagcagg gaatccctgc cgtactccgc
7201  tatgaaacaa aaccaggtgt ccaggctcag gttgactggg gagagttagg aacagttgag
7261  gttgatggaa aggtaaagaa actcttttgc ttcaacatga ttcttggata ttccagaatg
7321  agatatgttg aatttacact gagtatagac actccaactc ttatccagtg tcatctgaat
7381  gcttttgagt actttggagg atttacacag gagattctat acgataacat ggtgcgacga
7441  gaagttatgc tatggattgc ggaacaaccc gccattctat ccattcagaa tgtccctatt
7501  ctgatgtgca tttctggcaa cggatctgtg caaagctcag aggacaatga ggtgtaaaac
7561  tgaacctgac agttccaaac cgctagggaa aaggatatat ggagaaccga aaggtgaaga
7621  tctgtttgag agatcgtaac gtataacatg cgaaagcagg ttaatacgca tgagccaaaa
7681  tggcatagga gcctggttat gacattccat gttgtgtcat gacgaatgga catatggctt
7741  ccggtctata ggatcctcct aagtatccaa caactatcta tgacatagaa ctcggtaaac
7801  catttgtggt ccttaataag gtaggaattc cgcaaggaaa accgacgcca caggtggcag
7861  aggaattggc aaaaagcaaa cggtttcctg taatggggaa tatagaggaa aaaaaatgcc
7921  tcaatgcgaa agcatgccca cttgccatta gtcacgaaag caacagggat tgtttaatta
7981  tgaatgtaag agaaccagag ataacttcgg ttacggacct tacagacaaa gaactcactc
8041  tacgatggaa aagtattgat tggaaaagag tgaaagaagc cgttaataac ctacagtctc
8101  gaattgcaag tgcagctaag aacggaaatt ggaaaactgt gaacaaactc tcccgtcttc
8161  tgacccggtc cttttatgcc aaacttcttt cggtacgtaa agtaaccact aacaagggaa
8221  gtcgaactcc cggaatagat ggcgtcattt ggtcatcatc ggcagataag atgcgtgccg
8281  ctctacaact aacgaacaaa ggctatcgtg caaaaccatt aacacggaag tacattcgaa
8341  agaagagcgg taaactacga cctcttagca taccaactat gtttgacaga gcaatgcaaa
8401  ctttgcactc tctggtgttg ggcgcaatcg aatctgcagc aggtgacaaa acttcgtttg
8461  ggtttaaacc ttatcgttct acaaaagatg cttatgccta ccttcacctc tgtttaagca
8521  agaaagtttc tcctgaatgg atcgttgaag gtgacattaa agcttgcttt gatgaaatca
8581  gccataattg atacttgata acatccctct ggataaacga atccttaaag agttcctaaa
8641  agccggatat atcgagaatt aggcattgtt aatcaaatat agctaattct ttatttctgt
8701  gtttcatcaa tagaagaaca gagtacttta gcatttcaag actcttcgta taacactttg
8761  tctttcgtct caaccttgct agaaagtgcc tgaatatgct gttatatcct tcaactgtat
8821  acgtttctgc ttttgattga gtatgaatag tttctggaat aaactctgca tatggtttcc
8881  agtgatcagt cattacttct tcaatctttt tcgtctttaa tttttcccag atcagttgtc
8941  cagtttctgt tcctctgctg ccaaaagagc agttgatgaa ttttttccca actctatcaa
9001  cagcaatcca tatccagcaa tatttttttt gttaccgatg taagtgtgca tctcatccat
9061  ttcaacaata gatatctcat tttcgctttt caggtcctct agctcccgac caaatttctt
9121  tatccatttt tgaacagaaa catgacttac acctaaaaat cgccctattg agcgaaatcc
9181  taatccttca agatagagtt gcaaagcttg tctcttaaca gaagtggggc tagcagttga
9241  tttcagctct acggaatagt tatatccaca atcatggcat ttgtaccgtt gacgtccatc
9301  aactataccg ttctttttgt gagtggaact attacatctt gggcagtcca tacagaacta
9361  taggctctca taatagataa ctatgtttaa ttaccaaagc ccgagaatta ttatctgttt
9421  cctaccgaaa aaggtacacc tcaaggaggg cctatatctc caataattgg aaatatgtcc
9481  ttaaacggct tagaaaacgc cttagcgatg agattttact ccagatcaga tggaacaatt
9541  gacaaatctc atcaaaacag acacaaggtt aattatgttc gttttgctga tgaccttgtg
9601  gtaactgcca attccccgga aacggctctt gaaatagtcg atgtcatcca agcattttta
9661  gatcctcgtg gacttaagct cagcgaagaa aagactcttg tgaccaatat tagtgaaggg
9721  ttcaatttcc taggatggaa cttcaggaaa tataaaggaa aacttcttcc gaagccatct
9781  aaagattctc aaaaggaagt tattaagaaa atccgtgacg tacttcacaa agcaaaagca
9841  tgggatcaag accgattgat acaaacactc aacccaatca ttaggggatg ggcacagtat
9901  cataatcacg cagtttcttc tgctatcttc aacaaacttg atgaaatagt ctataacatg
9961  cttatctcct gggcgaaaag aagacactca aataaaggtt tcacctggat aacgaccaaa
10021 tactggcata aatccggtaa aagaaaatac gtattctgca cagaactaca tacgttggag
10081 agattctcca atgccaaaat tgttaggcaa agactagcaa gccttaataa gaatccattt
10141 atcgacaaag aatatttcga acaatggaaa ttcatggaat accaccggaa gaaacgcatc
10201 actaacccca attttgttct aaactgacac ccgaaagggt agtagtggct cgagccggat
10261 gaatcgaaag gttcaagtcc ggttcctaga agacggggta gggagaaatc ccaccctgtt
10321 attcgac
aaa caggttgtta tcaaaagagc cttaaaatca tcagattctg aatggaactc
10381 acagtttgag gatttcttca aatgctttgg ttttactccc cggttatgca ggccttacag
10441 gcctcagaca aaaggtaaaa ttgaaaatac tgtcggctat gtcaagaggg atttcgtcct
10501 tggaagacag tttacctctc tcgaagacct gaacggccaa gttcacaggt ggttggaaag
10561 ggtaaactca actgtccatg gaacaaccta tcaaatccct cttgaacgct ttaaggagga
10621 gaacctgagc cctctggatc aggttcctcc ttacaaagtt gtccataagg agaccagaaa

[top]


[ORF sequence]

 

MNVREPEITSVTDLTDKELTLRWKSIDWKRVKEAVNNLQSRIASAAKNGNWKTVNKLS

RLLTRSFYAKLLSVRKVTTNKGSRTPGIDGVIWSSSADKMRAALQLTNKGYRAKPLTR

KYIRKKSGKLRPLSIPTMFDRAMQTLHSLVLGAIESAAGDKTSFGFKPYRSTKDAYAY

LHLCLSKKVSPEWIVEGDIKACFDEISHNXILDNIPLDKRILKEFLKAGYIE

transposase

NYYLFPTEKGTPQGGPISPIIGNMSLNGLENALAMRFYSRSDGTIDKSHQNRHKVNYV

RFADDLVVTANSPETALEIIDVIQAFLDPRGLKLSEEKTLVTNISEGFNFLGWNFRKY

KGKLLPKPSKDSQKEVIKKIRDVLHKAKAWDQDRLIQTLNPIIRGWAQYHNHAVSSAI

FNKLDEIVYNMLISWAKRRHSNKGFTWITTKYWHKSGKRKYVFCTELHTLERFSNAKI

VRQRLASLNKNPFIDKEYFEQWKFMEYHRKKRITNPNFVLN

top]


[Secondary structure]

 

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |