
Information for La.re.I1 intron

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence]

 

Sequence from the GenBank entry. The intron is shown in red. The ORF
is shown in blue, with the start and stop codons underlined.

 

5' end

       gtgcgccc ggcatgggta ttagctaggc ggtgagagtc cgctatgggc cgtagtagtc

 661 ggaaccatga gctgaggaca agggtgtcca ctgcgaggtg gaatctgaag gaagtctaag
 721 gcaaagtact gcaccgatga acaagaagta gctataaggc tggaactaac tggataagac
 781 tgcatgacaa gttaaagtcc aacactactc gaagttactt tcagtaaagc taccggtgac
 841 atggtacgaa agttaatatc cttacccggg gagatctggc ctacacgttt ccgacaagag
 901 gaataagttt aatttccaca gaaacaagcg gtgcagtgat gtaacgttga gtaagccaga
 961 agtcagccga ggtcatagta gtctgagtaa tcagatgaag gactgaacga caataacttg
1021 taacttatat cggaggtgta atcaggtgcg acaatcgcag aaaacagaac aacaagctga
1081 ccgcttgtcg aggataggtt tggaaaaccg aaagtacaca agggcgcgta gtaccgatta
1141 tggtgaaggt aaaggtatga gtgtcactat ccaagactta gtcttggacc gcaataacct
1201 taatcaggct tatttgcgag ttaagagaaa taaaggagca gcaggcattg acgatatgac
1261 agtcaatgac cttctgccat atctcagaga aaataagacg gaattgatcg ctagtttgcg
1321 tgagggcaag tataaaccag caccagtcaa acgggtagaa attccgaagc ctaatggtgg
1381 agtaagaaaa ctcggaatac caacagtggt ggaccgaatg gttcaacaag ctgtggccca
1441 aattcttaca cctatctttg agcgtgtttt ctctgataat agttttggct tccgccctca
1501 ccgtggggct cacgacgcta ttgcaaaagt agtagatctt tataatcaag gttatcgaag
1561 agttgtcgac ttagacctaa aagcctattt tgataatgtt aatcatgact tgatgattaa
1621 gtatcttcaa caatatattg atgacccatg gacactaagg ctcattcgta agtttctaac
1681 tagcggagtc ttagaccatg ggcttttcgc taagagtgaa aaaggaaccc cacaaggagg
1741 gccattgtca ccaatactgg cgaatatcta tctaaatgag ttggataaag agttgactag
1801 acgtggtcac cactttgtgc gctatgcgga tgattgtaac atttatgtta aaagtcaacg
1861 agccggagaa cgagtaatgc gaagcattac ccagtttctt gaaaagcgat tgaaagttaa
1921 agtgaaccca gataaaacca aagtcggtag cccgctacgg ttaaagtttc ttggcttttc
1981 gttgggtgta gaccacaatg gagcctacgc ccgtccagca aaacaatcgc aacaacgagt
2041 aaagaaagca ttgaggttat taactaaacg taatcgtgga atatccctga caagaatgtt
2101 tgaagaaatt catcgaaaaa tgcgtggatg gcttcagtac tactcaattg ggaaactaac
2161 tgactttatt caacgccttg acaagtggtt gagggcccga ataagacagt atatctggaa
2221 gcaatggaag aagcttaaaa ctaaggtaac taacttacag aagctggggc tgtcccagcg
2281 tgatgcatat gtcttcgcta gtacccgcaa gggctactgg cgaactgcac acagtaagac
2341 cttgagctat tctctaacta atagaaaact ggaacaactc ggacttatga atatgtccaa
2401 gacgctccag tcaattcaat gtgattaagt tgtcgaaccg ccgtatacgg aaccgtacgt
2461 acggtggtgt gagaggtcga taattgaact aatcaattat ctcctactcg at

3' end



[Intron and flanking sequence]

 

  301 ctgatatatc aaggccctct acaaaagaat acgaatgatg ttgcttatta acttgatagc
 361 ggaatcctaa tttaacggca tttgaactgt gcaaaatcaa ttcaaatcta taaaaagtaa
 421 cattgttccc taccaaaacc ctaccaaaaa cttataatat tgagcgtaaa aaaagcaatc
 481 agcatttaca attgatagta agtgtttgat tgctttttca ttttgaactt catcttagtt
 541 aatatataga cgaaaataac aggtactaca taattgctaa atgtgtagta cctgttgttt
 601 atgtgcgccc ggcatgggta ttagctaggc ggtgagagtc cgctatgggc cgtagtagtc
 661 ggaaccatga gctgaggaca agggtgtcca ctgcgaggtg gaatctgaag gaagtctaag
 721 gcaaagtact gcaccgatga acaagaagta gctataaggc tggaactaac tggataagac
 781 tgcatgacaa gttaaagtcc aacactactc gaagttactt tcagtaaagc taccggtgac
 841 atggtacgaa agttaatatc cttacccggg gagatctggc ctacacgttt ccgacaagag
 901 gaataagttt aatttccaca gaaacaagcg gtgcagtgat gtaacgttga gtaagccaga
 961 agtcagccga ggtcatagta gtctgagtaa tcagatgaag gactgaacga caataacttg
1021 taacttatat cggaggtgta atcaggtgcg acaatcgcag aaaacagaac aacaagctga
1081 ccgcttgtcg aggataggtt tggaaaaccg aaagtacaca agggcgcgta gtaccgatta
1141 tggtgaaggt aaaggtatga gtgtcactat ccaagactta gtcttggacc gcaataacct
1201 taatcaggct tatttgcgag ttaagagaaa taaaggagca gcaggcattg acgatatgac
1261 agtcaatgac cttctgccat atctcagaga aaataagacg gaattgatcg ctagtttgcg
1321 tgagggcaag tataaaccag caccagtcaa acgggtagaa attccgaagc ctaatggtgg
1381 agtaagaaaa ctcggaatac caacagtggt ggaccgaatg gttcaacaag ctgtggccca
1441 aattcttaca cctatctttg agcgtgtttt ctctgataat agttttggct tccgccctca
1501 ccgtggggct cacgacgcta ttgcaaaagt agtagatctt tataatcaag gttatcgaag
1561 agttgtcgac ttagacctaa aagcctattt tgataatgtt aatcatgact tgatgattaa
1621 gtatcttcaa caatatattg atgacccatg gacactaagg ctcattcgta agtttctaac
1681 tagcggagtc ttagaccatg ggcttttcgc taagagtgaa aaaggaaccc cacaaggagg
1741 gccattgtca ccaatactgg cgaatatcta tctaaatgag ttggataaag agttgactag
1801 acgtggtcac cactttgtgc gctatgcgga tgattgtaac atttatgtta aaagtcaacg
1861 agccggagaa cgagtaatgc gaagcattac ccagtttctt gaaaagcgat tgaaagttaa
1921 agtgaaccca gataaaacca aagtcggtag cccgctacgg ttaaagtttc ttggcttttc
1981 gttgggtgta gaccacaatg gagcctacgc ccgtccagca aaacaatcgc aacaacgagt
2041 aaagaaagca ttgaggttat taactaaacg taatcgtgga atatccctga caagaatgtt
2101 tgaagaaatt catcgaaaaa tgcgtggatg gcttcagtac tactcaattg ggaaactaac
2161 tgactttatt caacgccttg acaagtggtt gagggcccga ataagacagt atatctggaa
2221 gcaatggaag aagcttaaaa ctaaggtaac taacttacag aagctggggc tgtcccagcg
2281 tgatgcatat gtcttcgcta gtacccgcaa gggctactgg cgaactgcac acagtaagac
2341 cttgagctat tctctaacta atagaaaact ggaacaactc ggacttatga atatgtccaa
2401 gacgctccag tcaattcaat gtgattaagt tgtcgaaccg ccgtatacgg aaccgtacgt
2461 acggtggtgt gagaggtcga taattgaact aatcaattat ctcctactcg atttttgctg
2521 attatttttt gtggaagaat tatgaacaac gatatttgaa ctaaccaagt ctaagacgtt
2581 tttagaaatt tcttccgctt tctgttcatc aatctgatca agtagattga tgagggtact
2641 ctgataatgt cctcttatat gttgctggtt agtattacca tcaattaatt tttccattgt
2701 agttttcagc ccttttgcta ttttcgaaag cgttaaagct cttatgtact caacctcacc
2761 acgttctagt cttgatacgt aattggtgga aagatcagta gcattagcta gttcttcttg
2821 agtcatttta agctcgtggc gcctgtgact gatgttaact ccgattttac tcatctgtga



[ORF sequence]

 

MSVTIQDLVLDRNNLNQAYLRVKRNKGAAGIDDMTVNDLLPYLRENKTELIASLREGK

YKPAPVKRVEIPKPNGGVRKLGIPTVVDRMVQQAVAQILTPIFERVFSDNSFGFRPHR

GAHDAIAKVVDLYNQGYRRVVDLDLKAYFDNVNHDLMIKYLQQYIDDPWTLRLIRKFL

TSGVLDHGLFAKSEKGTPQGGPLSPILANIYLNELDKELTRRGHHFVRYADDCNIYVK

SQRAGERVMRSITQFLEKRLKVKVNPDKTKVGSPLRLKFLGFSLGVDHNGAYARPAKQ

SQQRVKKALRLLTKRNRGISLTRMFEEIHRKMRGWLQYYSIGKLTDFIQRLDKWLRAR

IRQYIWKQWKKLKTKVTNLQKLGLSQRDAYVFASTRKGYWRTAHSKTLSYSLTNRKLE

QLGLMNMSKTLQSIQCD
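
The protein above can be cross-checked against the nucleotide listing. The sketch below is illustrative only: it assumes the standard genetic code, and the helper names clean_sequence and translate are hypothetical, not part of any database tool. It strips the position numbers from the line numbered 1141 and translates from the first atg on that line. That atg was chosen by comparing the result with the first residues of the protein; earlier atg triplets occur in the full intron, so a plain first-match search over the whole sequence would not find the ORF start.

```python
import re

# Standard genetic code, codons ordered with bases t, c, a, g.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(block):
    """Drop position numbers, spaces and line breaks from a numbered block."""
    return re.sub(r"[^acgt]", "", block.lower())


def translate(dna):
    """Translate codon by codon, stopping at a stop codon or a partial codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The line numbered 1141 in the intron listing, copied as-is.
line_1141 = "1141 tggtgaaggt aaaggtatga gtgtcactat ccaagactta gtcttggacc gcaataacct"
seq = clean_sequence(line_1141)
start = seq.find("atg")           # first atg on this line only

print(1141 + start)               # 1157, position of the putative start codon
print(translate(seq[start:]))     # MSVTIQDLVLDRNN
```

The printed residues match the beginning of the ORF sequence listed above. Running the same functions over the concatenated intron lines, starting at position 1157, should reproduce the full protein up to the taa stop codon.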



[Secondary structure]

 

                                           


