[Back to introns by organism]   [Back to home page]

Information of M.sp.I1 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

[Intron sequence]

 

Sequence from Genbank entry.  Intron is identified in red.  The ORF
is identified in blue with start and stop codons underlined.  

               

5' end                                          

                                                         gtg tgccatgcac

29401 actttcctta cctctatggt gaggtttttt ttagaccaag tcccgagtga gcccagatgg

29461 agaggaatcg ttagccgaag gcaagggtgt cctgagcaat tgggaatctg aaggcaaaat

29521 gtggatttga cgtacaggaa gtagatatta ggcctaagcc tgggggtaag cttgcattgg

29581 aagatgacgc ccaaaatcca tccgtaaccg agcaagggta aatctgacga atccacagga

29641 aagaaggaaa gtttaccatg ggagatcctg ccaacgagag ttggtcagga agtcagcagc

29701 ggccatagta ctgcttaagg gtggttaaca gccaccttag ggaagggcca aatctttgac

29761 aaggagcagt agcttcataa cttgtaatag tcatgccaaa gacatgaaaa tatcaggtat

29821 tccagatgag tctgttagac gaatgggaaa gagatagcct tccggtgaag ggcacagacg

29881 tattctgtaa acgaaccggc aaggtaggtc atcagtggct atcagcgtgt gaagaagaac

29941 gagccttgac gagaggtttg atgtataagg tatgcgatct atcaaatctt actgcatcat

30001 tgaagcaggt agtaaagaac ggaggatcac cgggagtaga cggaatgcag gtcaaagaac

30061 tgcgatattg gttttccaat aaccaccaga agttaattga acaactgaag gaagggaact

30121 acagaccaat gacaatcaaa ggacaggaaa ttcccaaacc tggcggtggc gttcgtcaac

30181 taggcatacc cactgtccaa gaccggttgg ttcaacaagc cattgctcag caattaagta

30241 aaagatacga tcccaccttt tcacagtaca gttatggctt ccgcaagggc agaaatgccc

30301 accaagcttt aaggcaagct ggagcgtatg taaaggaagg gttcaattat gtagtagacc

30361 ttgacttgga gaaattcttt gacaaggtga accatgatcg tttaatgtgg ctgctgggga

30421 gacgtatatc agacaagcgg gtactaaagc tgataggtaa gtttctgcga tctggcatat

30481 taataggagg cttagagaac caacggataa gcggtactcc tcagggaagc cccttgtccc

30541 cactgctatc caacatcgta ctggatgaac tagacaaaga gttagagcga agaggccacc

30601 gttttgtacg ctatgcagat gatatgatcc ttctagtaag gagccaagag gcagcagagc

30661 gtgcatattc atcgattacc agctttatcg aaaatcgcct tctattgaag gtgaacaaag

30721 ataagagccg gatatgtagg ccttaccagc ttaactttct gggacactcg ataatgtggg

30781 atggtaagtt gggtcttagt cggcaaagcg aacaacgatt taaggaaaag gtaaagaaag

30841 tcacccgcag aaaccgtggt atcagccttg agcagatggt caaggagctt aacagggtac

30901 ttcggggatg gcttaattac ttcaggagcg caaagatgct aagcaagcta cagcggctaa

30961 gcagttggat acatcgtagg ataagatgtt ttcggctcaa acaatgtaaa cgtgcaatag

31021 gcataacgcg atttctggta agccttggac taccgaaatg gcgcagttta cttttggcaa

31081 cctcccacaa gggatggttc agaaaagctg gaagtccgca agcccatgaa ggtatgaata

31141 aggagtggtt ccgacagatc ggattgttca atctagttga atattatagt ttaaacttca

31201 aagaaaccgc ctagtacgag agtacgctgg gtggtgtgag aggacaatga ggctggattg

31261 atcatccatc ctcatttcct actcgat

3' end  

[top]


[Intron and flanking sequence]

 

29041 gagtaggtaa tatgaatagc cggtacctta ttgtgcccac gtctgatgtg gaaggtgttg

29101 atattaaggt ttttcccaat ccaactacag atgatttgag aattcaaggt ttagacgata

29161 aaatgtacca ggtctattta tacgatttgg gaggcacgaa tgtgtattcc aggcaagtaa

29221 aaggggcaga agcgcggctg gatgtaagtc agttgagcga tggtatatat cttcttaagt

29281 tggagggtga aagtctacag caacagatga aattacacat tcgcaaataa gttgttgtag

29341 gacgagagtt ttccggagca gtactttaat actgctccgg tttctttgtg tgccatgcac

29401 actttcctta cctctatggt gaggtttttt ttagaccaag tcccgagtga gcccagatgg

29461 agaggaatcg ttagccgaag gcaagggtgt cctgagcaat tgggaatctg aaggcaaaat

29521 gtggatttga cgtacaggaa gtagatatta ggcctaagcc tgggggtaag cttgcattgg

29581 aagatgacgc ccaaaatcca tccgtaaccg agcaagggta aatctgacga atccacagga

29641 aagaaggaaa gtttaccatg ggagatcctg ccaacgagag ttggtcagga agtcagcagc

29701 ggccatagta ctgcttaagg gtggttaaca gccaccttag ggaagggcca aatctttgac

29761 aaggagcagt agcttcataa cttgtaatag tcatgccaaa gacatgaaaa tatcaggtat

29821 tccagatgag tctgttagac gaatgggaaa gagatagcct tccggtgaag ggcacagacg

29881 tattctgtaa acgaaccggc aaggtaggtc atcagtggct atcagcgtgt gaagaagaac

29941 gagccttgac gagaggtttg atgtataagg tatgcgatct atcaaatctt actgcatcat

30001 tgaagcaggt agtaaagaac ggaggatcac cgggagtaga cggaatgcag gtcaaagaac

30061 tgcgatattg gttttccaat aaccaccaga agttaattga acaactgaag gaagggaact

30121 acagaccaat gacaatcaaa ggacaggaaa ttcccaaacc tggcggtggc gttcgtcaac

30181 taggcatacc cactgtccaa gaccggttgg ttcaacaagc cattgctcag caattaagta

30241 aaagatacga tcccaccttt tcacagtaca gttatggctt ccgcaagggc agaaatgccc

30301 accaagcttt aaggcaagct ggagcgtatg taaaggaagg gttcaattat gtagtagacc

30361 ttgacttgga gaaattcttt gacaaggtga accatgatcg tttaatgtgg ctgctgggga

30421 gacgtatatc agacaagcgg gtactaaagc tgataggtaa gtttctgcga tctggcatat

30481 taataggagg cttagagaac caacggataa gcggtactcc tcagggaagc cccttgtccc

30541 cactgctatc caacatcgta ctggatgaac tagacaaaga gttagagcga agaggccacc

30601 gttttgtacg ctatgcagat gatatgatcc ttctagtaag gagccaagag gcagcagagc

30661 gtgcatattc atcgattacc agctttatcg aaaatcgcct tctattgaag gtgaacaaag

30721 ataagagccg gatatgtagg ccttaccagc ttaactttct gggacactcg ataatgtggg

30781 atggtaagtt gggtcttagt cggcaaagcg aacaacgatt taaggaaaag gtaaagaaag

30841 tcacccgcag aaaccgtggt atcagccttg agcagatggt caaggagctt aacagggtac

30901 ttcggggatg gcttaattac ttcaggagcg caaagatgct aagcaagcta cagcggctaa

30961 gcagttggat acatcgtagg ataagatgtt ttcggctcaa acaatgtaaa cgtgcaatag

31141 aggagtggtt ccgacagatc ggattgttca atctagttga atattatagt ttaaacttca

31201 aagaaaccgc ctagtacgag agtacgctgg gtggtgtgag aggacaatga ggctggattg

31261 atcatccatc ctcatttcct actcgattag cttatcggcg cccaaatcga gaataaaatc

31321 aaggtgtatt ttgttgcatt gttgctcatt ccggatattt atccataaag atttagaaat

31381 attgattgca gtacttttta acagcagtaa gaatgagtcg catttatcta ataacctgat

31441 ttcgatataa aacacccaat aaaatgtctg cgtgacccgg tcatatcgta atgaaatgga

31501 gatatcttcc acagcttttt gtataatctt gattttaaag aggtctttat gggtaaaaaa

31561 gattcactgg ccgtgcgcct cctgcgtcga aaagacattc tattgggtgt ttttatgctt

31621 ttttaatgtc actgtcgggg aatgggagaa aatccctcgc ctttaggcga agactttagt

31681 atttgcgata tttccttatg caaaactatc gaaagggttc tcatacactg tttgacctaa

[top]


[ORF sequence]

 

MSLLDEWERDSLPVKGTDVFCKRTGKVGHQWLSACEEERALTRGLMYKVCDLSNLTAS

LKQVVKNGGSPGVDGMQVKELRYWFSNNHQKLIEQLKEGNYRPMTIKGQEIPKPGGGV

RQLGIPTVQDRLVQQAIAQQLSKRYDPTFSQYSYGFRKGRNAHQALRQAGAYVKEGFN

YVVDLDLEKFFDKVNHDRLMWLLGRRISDKRVLKLIGKFLRSGILIGGLENQRISGTP

QGSPLSPLLSNIVLDELDKELERRGHRFVRYADDMILLVRSQEAAERAYSSITSFIEN

RLLLKVNKDKSRICRPYQLNFLGHSIMWDGKLGLSRQSEQRFKEKVKKVTRRNRGISL

EQMVKELNRVLRGWLNYFRSAKMLSKLQRLSSWIHRRIRCFRLKQCKRAIGITRFLVS

LGLPKWRSLLLATSHKGWFRKAGSPQAHEGMNKEWFRQIGLFNLVEYYSLNFKETA

top]


[Secondary structure]

 

[top]


 

| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |