[Back to introns by organism]   [Back to home page]

Information of M.m.I1-1 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

Note: Multiple insertions

M.m.I1-1        AE013515 (3337-5483)

M.m.I1-2        AE013516 (7432-10327)

 

[Intron sequence]

 

Sequence from Genbank entry.  Intron is identified in red.  The ORF
is identified in blue with start and stop codons underlined.  

               

5' end

                                            gtgc gacgagaagt tatgctatgg
3361 attgcggaac aacccgccat tctatccatt cagaatgtcc ctattctgat gtgcatttct
3421 ggcaacggat ctgtgcaaag ctcagaggac aatgaggtgt aaaactgaac ctgacagttc
3481 caaaccgcta gggaaaagga tatatggaga accgaaaggt gaagatctgt ttgagagatc
3541 gtaacgtata acatgcgaaa gcaggttaat acgcatgagc caaaatggca taggagcctg
3601 gttatgacat tccatgttgt gtcatgacga atggacatat ggcttccggt ctataggatc
3661 ctcctaagta tccaacaact atctatgaca tagaactcgg taaaccattt gtggtcctta
3721 ataaggtagg aattccgcaa ggaaaaccga cgccacaggt ggcagaggaa ttggcaaaaa
3781 gcaaacggtt tcctgtaatg gggaatatag aggaaaaaaa tgcctcaatg cgaaagcatg
3841 cccacttgcc attagtcacg aaagcaacag ggattgttta attatgaatg taagagaacc
3901 agagataact tcggttacgg accttacaga caaagaactc actcaacaat ggaaaattat
3961 tgattggaaa agagtgaaag aagtcgttaa taacctacag tctcgaattg caagtgcagc
4021 taagagcgga aattggaaaa ccgtgaacaa actctcccgt cttctgaccc ggtcctttta
4081 tgccaaactt ctttcgatac gtaaagtaac cactaacaag ggaagtcgaa ctcctggaat
4141 tgatggcatc atttggtcat cgtcggcaga taagatgcgt gccgctctac aactaacgaa
4201 caaaggctat cgtgcaaaac cattaacacg gaagtacatt cgaaagaaga gcggtaaact
4261 acgacctctt agcataccaa ctatgtatga cagagcaatg caaactttgc actctctggt
4321 gttgggcgca atcgaatctg cagcaggtga caaaacttcg tttgggttta aaccttatcg
4381 ttctacaaaa gatgcttatg cctaccttca cctctgttta agcaagaaag tttctcctga
4441 atggatcgtt gaaggtgaca ttaaagcttg ctttgatgaa atcagccata actggatact
4501 tgataacatc cctattgata aacgaatcct taaagagttc ctaaaagccg gatatatcga
4561 gaattatcat ctgtttccta ccgaaaaagg cacacctcaa ggagggccta tatctccaat
4621 aattggaaat atgtccttaa acggcttaga aaacgcctta gcgatgagat tttactccag
4681 atcagatgga acaattgaca aatctcatca aaacaggcgc aaggttaatt atgttcgttt
4741 tgctgatgac cttgtggtaa ctgctgattc cccggaaacg gctcttgaaa taatcgacgt
4801 catccaagca tttttagatc ctcgtggact taagctcagc gaagaaaaga ctcttgtgac
4861 caatattagt gaagggttca atttcctcgg atggaacttc aggaagtata aaggaaaact
4921 ccttccgaag ccatccaaag actctcaaaa ggaaattatc aagaaaatcc gtgacgtact
4981 tcacaaagca aaagcatggg atcaagaccg attgatacaa accctcaacc caatcattag
5041 gggatgggca gagtatcata atcacgcagt ttcttctgct atcttcaaca aacttgatga
5101 aatagtctat aacatgctta tctcctgggc taaaagaaga cactcaaata aaggcttcac
5161 ctggataacg accaaatact ggcataaatc cggtaaaaga aaatacgtat tctgcacaga
5221 actacagacg ttggaaagat tctccaatgc caaaattgtt aggcaaagat tagcaagcct
5281 taataagaat ccatttatcg acaaagaata tttcgaacaa tggaaattca tggagtacca
5341 ccggaagaaa cgcatcacta accccaattc tgttctaaac tga
cacccga aagggtagta
5401 gtggctcgag ccggatgaat tgaaaaattc ttgtccggtt cctagaagac ggggcaggga
5461 gaaatcccac cctgttattc gac

3' end  

[top]


[Intron and flanking sequence]

 

2761 ggattgtaga taatgctgaa aatggaggaa tggctattga tacgagattt gtattcacaa
2821 ggctttagca tcagtaaaat ctccagacaa acaggttatg ccagggcaac tgtgaggaaa
2881 tatcttaaca agaaaaccgt cccagaaccc cagaaacgtc ccggaagaaa aagcaagctt
2941 gatccttaca aaccttacat tcttgagaaa ctcaatgaag gtccctatac tgcttctcgc
3001 ctctatcgag aaatcaaaga aatgggtttt gatggaggaa aaaccatcgt caaagacttc
3061 gtaagagaag taagacccaa gcagggagtc cccgccatac ttcgctacga aacaaaacct
3121 ggagttcagg ctcaggttga ctggggagag ttaggaacag ttgaggttga tggaaaggta
3181 aagaaactct tttgcttcaa catgattctt ggatattcca gaatgagata tgttgaattt
3241 acactgagta tagacactcc aactcttatc cagtgtcatc tgaatgcttt tgagtacttt
3301 ggaggattta cacaggagat tctatacgat aatatggtgc gacgagaagt tatgctatgg
3361 attgcggaac aacccgccat tctatccatt cagaatgtcc ctattctgat gtgcatttct
3421 ggcaacggat ctgtgcaaag ctcagaggac aatgaggtgt aaaactgaac ctgacagttc
3481 caaaccgcta gggaaaagga tatatggaga accgaaaggt gaagatctgt ttgagagatc
3541 gtaacgtata acatgcgaaa gcaggttaat acgcatgagc caaaatggca taggagcctg
3601 gttatgacat tccatgttgt gtcatgacga atggacatat ggcttccggt ctataggatc
3661 ctcctaagta tccaacaact atctatgaca tagaactcgg taaaccattt gtggtcctta
3721 ataaggtagg aattccgcaa ggaaaaccga cgccacaggt ggcagaggaa ttggcaaaaa
3781 gcaaacggtt tcctgtaatg gggaatatag aggaaaaaaa tgcctcaatg cgaaagcatg
3841 cccacttgcc attagtcacg aaagcaacag ggattgttta attatgaatg taagagaacc
3901 agagataact tcggttacgg accttacaga caaagaactc actcaacaat ggaaaattat
3961 tgattggaaa agagtgaaag aagtcgttaa taacctacag tctcgaattg caagtgcagc
4021 taagagcgga aattggaaaa ccgtgaacaa actctcccgt cttctgaccc ggtcctttta
4081 tgccaaactt ctttcgatac gtaaagtaac cactaacaag ggaagtcgaa ctcctggaat
4141 tgatggcatc atttggtcat cgtcggcaga taagatgcgt gccgctctac aactaacgaa
4201 caaaggctat cgtgcaaaac cattaacacg gaagtacatt cgaaagaaga gcggtaaact
4261 acgacctctt agcataccaa ctatgtatga cagagcaatg caaactttgc actctctggt
4321 gttgggcgca atcgaatctg cagcaggtga caaaacttcg tttgggttta aaccttatcg
4381 ttctacaaaa gatgcttatg cctaccttca cctctgttta agcaagaaag tttctcctga
4441 atggatcgtt gaaggtgaca ttaaagcttg ctttgatgaa atcagccata actggatact
4501 tgataacatc cctattgata aacgaatcct taaagagttc ctaaaagccg gatatatcga
4561 gaattatcat ctgtttccta ccgaaaaagg cacacctcaa ggagggccta tatctccaat
4621 aattggaaat atgtccttaa acggcttaga aaacgcctta gcgatgagat tttactccag
4681 atcagatgga acaattgaca aatctcatca aaacaggcgc aaggttaatt atgttcgttt
4741 tgctgatgac cttgtggtaa ctgctgattc cccggaaacg gctcttgaaa taatcgacgt
4801 catccaagca tttttagatc ctcgtggact taagctcagc gaagaaaaga ctcttgtgac
4861 caatattagt gaagggttca atttcctcgg atggaacttc aggaagtata aaggaaaact
4921 ccttccgaag ccatccaaag actctcaaaa ggaaattatc aagaaaatcc gtgacgtact
4981 tcacaaagca aaagcatggg atcaagaccg attgatacaa accctcaacc caatcattag
5041 gggatgggca gagtatcata atcacgcagt ttcttctgct atcttcaaca aacttgatga
5101 aatagtctat aacatgctta tctcctgggc taaaagaaga cactcaaata aaggcttcac
5161 ctggataacg accaaatact ggcataaatc cggtaaaaga aaatacgtat tctgcacaga
5221 actacagacg ttggaaagat tctccaatgc caaaattgtt aggcaaagat tagcaagcct
5281 taataagaat ccatttatcg acaaagaata tttcgaacaa tggaaattca tggagtacca
5341 ccggaagaaa cgcatcacta accccaattc tgttctaaac tgacacccga aagggtagta
5401 gtggctcgag ccggatgaat tgaaaaattc ttgtccggtt cctagaagac ggggcaggga
5461 gaaatcccac cctgttattc gac
aaacagg ttgttatcaa aagagcctta aaatcatctg
5521 attctgaatg gaactcacag tttgaggatt ttttcaaatg ctttggtttt attcccaggt
5581 tatgcaggcc ttacaggcct cagacaaaag gcaagattga aaatacggta gggtatgtca
5641 agagggactt cttccttgga agacaattta cctctctcga agatctgaac ggccaagtta
5701 acaggtggtt agaaagggta aattcaactg tccacggaac aacctatcaa atccctcttg
5761 aacgctttaa ggaggagaac ctgagccctc tgggccaggt tcctccttac aaagttgccc
5821 ataaggaggc cagaaaggtc tccagagact gttatatttc atttcttgga aataagtatt
5881 ctgttcctta caggtttgca ggaagaacta cagaacttcg aatctttgaa ggaaaattcg

[top]


[ORF sequence]

 

MNVREPEITSVTDLTDKELTQQWKIIDWKRVKEVVNNLQSRIASAAKSGNWKTVNKLS

RLLTRSFYAKLLSIRKVTTNKGSRTPGIDGIIWSSSADKMRAALQLTNKGYRAKPLTR

KYIRKKSGKLRPLSIPTMYDRAMQTLHSLVLGAIESAAGDKTSFGFKPYRSTKDAYAY

LHLCLSKKVSPEWIVEGDIKACFDEISHNWILDNIPIDKRILKEFLKAGYIENYHLFP

TEKGTPQGGPISPIIGNMSLNGLENALAMRFYSRSDGTIDKSHQNRRKVNYVRFADDL

VVTADSPETALEIIDVIQAFLDPRGLKLSEEKTLVTNISEGFNFLGWNFRKYKGKLLP

KPSKDSQKEIIKKIRDVLHKAKAWDQDRLIQTLNPIIRGWAEYHNHAVSSAIFNKLDE

IVYNMLISWAKRRHSNKGFTWITTKYWHKSGKRKYVFCTELQTLERFSNAKIVRQRLA

SLNKNPFIDKEYFEQWKFMEYHRKKRITNPNSVLN

top]


[Secondary structure]

 

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |