Information for the Tr.e.I9 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

[Intron sequence]

 

Sequence from the GenBank entry.  The intron is shown in red.  The ORF
is shown in blue, with start and stop codons underlined.

 

5' end

   1 gtgcgactcg ttggtcacta aaccaggaaa ctgggtataa atgaccgtcc taaatcaata 

  61 catgggatac cccgatttag aacaactctt attaccttga tggtgaaatt ccatccacct 

 121 tgttgttagg ttgacagcat aaaccacagg gtagcgactg aacatcaatc cctgaagctg 

 181 gtttaaaagc gggaaaatac cagttaccga gaaacgatac cgcgagcaaa ggttgacaag 

 241 tggacgaaca ggaaggtaat caaactcccg tcatacagga ccattccaaa ggtagctgtg 

 301 gaatgtagga cgaaagtcag aggaataatg ggcataaata tccattggtt tatgtcaaca 

 361 accattcaag gcctccggga gagtttaccc cactaaatca atcgtaataa aggaataagg 

 421 gaacgaggaa acctctttaa gacacctaag taaataggtc gaaaacttag gaagccattc

 481 aaggtcaatc tacacaaaag ttgtaggtct taaagggaat aggattgagt aagaagcatt 

 541 gcagatgcag acgtactcac tgttgaacgt aatatagagt aatgattagt catgaacaaa 

 601 gctgaaatac taaaaagaaa attggacaac ccaaggccct tttcaagtgt ggcatgggac 

 661 gcttacgata tacctagcca agcgtgtgtt aatccaaacc taaaatggaa agacattaac 

 721 tggaaaaagg tagaaaagta tgtgtttaag ttacaaaagt taatctatag agcatccagc 

 781 cgtggcgaaa tccgcaaaat gcgtaaatat caaaaacttc tgaccaaaag ttattatgca

 841 aggttgctag ctgtcaggcg cgttacccag gacaaccagg gaaagaaaac tgctggtata 

 901 gatggcataa aaagccttcc cccaatgcaa agatttaata ccaatttcct aaagtccgac 

 961 tacacataat gcttcaagaa taaaatcata tcctgtacaa gaagcaatat ctttttgact 

1021 aaatccagtt aatatatttc tcaggattct gcgtcagaaa tctaaggaat cgaaaactcg 

1081 atttttcaag aattttttca cctctttcca cagtctttca atgggattaa cttcgggaca 

1141 ataaggaggt tgaaataata aaatgatgtg atcagggata gataaatcat tccctagatg 

1201 ccctcgcccg ttatctagtt gaataatatg aatctcttca ggatatgttt gagaaaaaag 

1261 ctctaaaaat ctttcaaagc aaggagtatc tagatgacta aattcataaa aaaaggaatc 

1321 tcctgttttt ggttctacta aaccataaag ccatagatat ttaaatagcc attgatataa 

1381 accaatgggc ttaacaccgc aagcagtaat gactttacct gttatagttt ttaatccaat

1441 tctagtctcg tcttgacacc aatagcgaat tggcagttga ttttccgaat atttattaat 

1501 taacttaatt ttctcttcta gttgagatga gagttttttt taaagttttc ttgagcttct 

1561 ttattttgtt ttatgtgatt tggacgtgga acttttaatt tactttttaa tttaaatctg 

1621 actaattgat gaactgttcg ataagacagc tcgatgtcaa aacaagtcgt taaccactgt 

1681 tgaatttcac catagctttg gaaaccttcc gggttttgca attctgcttc taatttctcg 

1741 attacatttg gtggaattga acttggcctc cctcctagat ttttctttgg ttctagtaaa 

1801 agattcagtc cacctgtacg atatttccct aaccattttt gtactgtgac tcgattgcgt 

1861 cctaacatta tcgcaatttc ttttactgtt tctactgttt gagttttcaa ccagtaaata 

1921 gtttgaagtc tttccttttc ttgagctgta ctcgcacttt ttaaaagttt tttgagagtt 

1981 tctaccgatt catttatctc aatttttact actcctgaca tggggacatt ttctataata 

2041 ggttacttct aaaagtgtag cttaagttta tgaaattagt ataacttggt agaaatgtta 

2101 aggtcacgac atcttaaagc aagcccaacc cgtagagtct ggataccaaa accagggaga 

2161 gaagaaaaac gcccattagg cataccaact atgtacgaca gggcacttca agcactggta 

2221 aagctaggta ggtcaccaga atgggaggcg cttttcgaac ctaatagtta tggtcttttc 

2281 cggaggtcaa cacatgatgc tatcgcagca atctatgtca gtattaacca caaaccaaaa 

2341 tatgttttag atgctgacat atccaagtgt tttgaccgaa ttaatcatga tgcactgttg 

2401 agaaaaatag gtcgaacacc ttacagacga ttaatcaagc aatggttaaa atctggagta 

2461 ttcgacaaca aacaattctc aaacactgtg gaaggtacac cacagggagg ggtaatatca

2521 cccttaaaga gcaaacatcg ccctacacgg ggaaagaaaa atgcctaaaa aattacgcag 

2581 aaactcttcg agggaacaaa cgtaataata aacatgcatt atccttaata cgatatgctg 

2641 atgactttgt aatcctacac aaagacatca aagtattgtt acaagcaaaa accgtaatac 

2701 aggaatggtt aaatcatgta ggattagaac taaaaccaga aaaaactaaa attgcccaca 

2761 ccttggaaga atatgacggc aacaaacccg gatttgactt cctaggattc acaataaggc 

2821 aatggaaagt taagacaacc aaacaaggat tcaaaacact gattaaaccg tcttccaaga 

2881 gcataaaaac tcattatcgg aagctggcgg atatatgtga taaacacaaa aacgccccta 

2941 caaaagcttt aatagctaaa cttaatccgg tgattagagg atgggccaac tacttctcca 

3001 ctataatcag taaagaaacc ttttcaaagc tagattacct actctggaga aggttaggtc 

3061 gatggacaag taggaggcat ccaaaataag tcagccaaat gggtcaagaa gaagtacttc 

3121 cctcgctgca aagtcaccag aaactggata cttaacgacg gcgagtatat acttaaccaa 

3181 cactcagacg ttgctatagt aagacacgtc aaggttaaag gtaataaatc cccattagac 

3241 ggtgattgga cttattggag cagtagaata ggaaaacatc caggcataag gaaagaagtt 

3301 acaacgctgt taaaacggca aaagaataaa tgcgcatttt gtggactaac ctttagatta 

3361 accgacctta tggaaatcga ccatataaaa ccaaggtctg aaggcggtga taacacaact 

3421 aaaaacaaac aactgttaca ccgacattgt cacgatacta aaactgcttg attataataa 

3481 aacatacaca aaacctaagt tacaggactt acctgatgaa tacctatggg tagatgatat 

3541 gttaattcta acacagggat gtacctatga aaaaggacgt ttaggagagg agccggatga 

3601 ggtgaaagtc tcacgtccgg ttttgaagac gagtcgggta aggtaactta cctggcgaat 

3661 gtttaac                

3' end 
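
The listing above follows the GenBank flat-file layout: a position number, then blocks of ten bases. Below is a minimal Python sketch for joining such lines into one continuous string. The function name `parse_numbered_sequence` and the choice to skip lines that do not match the layout (such as the "5' end" and "3' end" markers) are assumptions made for illustration. They are not part of the database.

```python
import re

# A sequence line: optional indent, a position number, then blocks of bases.
SEQ_LINE = re.compile(r"^\s*\d+((?:\s+[acgtn]+)+)\s*$", re.IGNORECASE)

def parse_numbered_sequence(text: str) -> str:
    """Join numbered sequence lines into one lowercase string."""
    parts = []
    for line in text.splitlines():
        match = SEQ_LINE.match(line)
        if match is None:
            continue  # skip blank lines and markers such as "5' end"
        # Remove the spaces between the ten-base blocks.
        parts.append("".join(match.group(1).split()))
    return "".join(parts).lower()

# First two lines of the intron listing, copied from the page.
example = """
5' end
   1 gtgcgactcg ttggtcacta aaccaggaaa ctgggtataa atgaccgtcc taaatcaata
  61 catgggatac cccgatttag aacaactctt attaccttga tggtgaaatt ccatccacct
"""
intron_start = parse_numbered_sequence(example)
print(len(intron_start))  # 120
```

Applied to the full listing, the same function should return a 3667-base string, since the last line starts at position 3661 and holds seven bases.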



[Intron and flanking sequence]

 

   1 gcctataaga cggtagtcag tttggtgagg acgacctttt aaaagggaga aatgattcat 

  61 aatggccatg tagaaaatct gaaaacctca caggttagca gtgataaatt ttggtcattt 

 121 gttcaaaaaa acaaaaacat ggtcacatga ataaattgac agaaggtgac tgtgggatag 

 181 gcatcactca agcttcatcc agcctattaa atttaagtga agcgagtaaa taaacatact 

 241 gaccagttta ttgaacaatt aattgtcaac acagaaggga aaactaactg taaacaatgg 

 301 gtgcgactcg ttggtcacta aaccaggaaa ctgggtataa atgaccgtcc taaatcaata 

 361 catgggatac cccgatttag aacaactctt attaccttga tggtgaaatt ccatccacct 

 421 tgttgttagg ttgacagcat aaaccacagg gtagcgactg aacatcaatc cctgaagctg 

 481 gtttaaaagc gggaaaatac cagttaccga gaaacgatac cgcgagcaaa ggttgacaag 

 541 tggacgaaca ggaaggtaat caaactcccg tcatacagga ccattccaaa ggtagctgtg 

 601 gaatgtagga cgaaagtcag aggaataatg ggcataaata tccattggtt tatgtcaaca 

 661 accattcaag gcctccggga gagtttaccc cactaaatca atcgtaataa aggaataagg 

 721 gaacgaggaa acctctttaa gacacctaag taaataggtc gaaaacttag gaagccattc 

 781 aaggtcaatc tacacaaaag ttgtaggtct taaagggaat aggattgagt aagaagcatt 

 841 gcagatgcag acgtactcac tgttgaacgt aatatagagt aatgattagt catgaacaaa 

 901 gctgaaatac taaaaagaaa attggacaac ccaaggccct tttcaagtgt ggcatgggac 

 961 gcttacgata tacctagcca agcgtgtgtt aatccaaacc taaaatggaa agacattaac 

1021 tggaaaaagg tagaaaagta tgtgtttaag ttacaaaagt taatctatag agcatccagc 

1081 cgtggcgaaa tccgcaaaat gcgtaaatat caaaaacttc tgaccaaaag ttattatgca 

1141 aggttgctag ctgtcaggcg cgttacccag gacaaccagg gaaagaaaac tgctggtata 

1201 gatggcataa aaagccttcc cccaatgcaa agatttaata ccaatttcct aaagtccgac 

1261 tacacataat gcttcaagaa taaaatcata tcctgtacaa gaagcaatat ctttttgact 

1321 aaatccagtt aatatatttc tcaggattct gcgtcagaaa tctaaggaat cgaaaactcg 

1381 atttttcaag aattttttca cctctttcca cagtctttca atgggattaa cttcgggaca 

1441 ataaggaggt tgaaataata aaatgatgtg atcagggata gataaatcat tccctagatg 

1501 ccctcgcccg ttatctagtt gaataatatg aatctcttca ggatatgttt gagaaaaaag 

1561 ctctaaaaat ctttcaaagc aaggagtatc tagatgacta aattcataaa aaaaggaatc 

1621 tcctgttttt ggttctacta aaccataaag ccatagatat ttaaatagcc attgatataa 

1681 accaatgggc ttaacaccgc aagcagtaat gactttacct gttatagttt ttaatccaat 

1741 tctagtctcg tcttgacacc aatagcgaat tggcagttga ttttccgaat atttattaat 

1801 taacttaatt ttctcttcta gttgagatga gagttttttt taaagttttc ttgagcttct 

1861 ttattttgtt ttatgtgatt tggacgtgga acttttaatt tactttttaa tttaaatctg 

1921 actaattgat gaactgttcg ataagacagc tcgatgtcaa aacaagtcgt taaccactgt 

1981 tgaatttcac catagctttg gaaaccttcc gggttttgca attctgcttc taatttctcg 

2041 attacatttg gtggaattga acttggcctc cctcctagat ttttctttgg ttctagtaaa 

2101 agattcagtc cacctgtacg atatttccct aaccattttt gtactgtgac tcgattgcgt 

2161 cctaacatta tcgcaatttc ttttactgtt tctactgttt gagttttcaa ccagtaaata 

2221 gtttgaagtc tttccttttc ttgagctgta ctcgcacttt ttaaaagttt tttgagagtt 

2281 tctaccgatt catttatctc aatttttact actcctgaca tggggacatt ttctataata 

2341 ggttacttct aaaagtgtag cttaagttta tgaaattagt ataacttggt agaaatgtta

2401 aggtcacgac atcttaaagc aagcccaacc cgtagagtct ggataccaaa accagggaga 

2461 gaagaaaaac gcccattagg cataccaact atgtacgaca gggcacttca agcactggta 

2521 aagctaggta ggtcaccaga atgggaggcg cttttcgaac ctaatagtta tggtcttttc 

2581 cggaggtcaa cacatgatgc tatcgcagca atctatgtca gtattaacca caaaccaaaa 

2641 tatgttttag atgctgacat atccaagtgt tttgaccgaa ttaatcatga tgcactgttg

2701 agaaaaatag gtcgaacacc ttacagacga ttaatcaagc aatggttaaa atctggagta 

2761 ttcgacaaca aacaattctc aaacactgtg gaaggtacac cacagggagg ggtaatatca 

2821 cccttaaaga gcaaacatcg ccctacacgg ggaaagaaaa atgcctaaaa aattacgcag 

2881 aaactcttcg agggaacaaa cgtaataata aacatgcatt atccttaata cgatatgctg

2941 atgactttgt aatcctacac aaagacatca aagtattgtt acaagcaaaa accgtaatac

3001 aggaatggtt aaatcatgta ggattagaac taaaaccaga aaaaactaaa attgcccaca 

3061 ccttggaaga atatgacggc aacaaacccg gatttgactt cctaggattc acaataaggc 

3121 aatggaaagt taagacaacc aaacaaggat tcaaaacact gattaaaccg tcttccaaga 

3181 gcataaaaac tcattatcgg aagctggcgg atatatgtga taaacacaaa aacgccccta 

3241 caaaagcttt aatagctaaa cttaatccgg tgattagagg atgggccaac tacttctcca 

3301 ctataatcag taaagaaacc ttttcaaagc tagattacct actctggaga aggttaggtc 

3361 gatggacaag taggaggcat ccaaaataag tcagccaaat gggtcaagaa gaagtacttc 

3421 cctcgctgca aagtcaccag aaactggata cttaacgacg gcgagtatat acttaaccaa 

3481 cactcagacg ttgctatagt aagacacgtc aaggttaaag gtaataaatc cccattagac 

3541 ggtgattgga cttattggag cagtagaata ggaaaacatc caggcataag gaaagaagtt 

3601 acaacgctgt taaaacggca aaagaataaa tgcgcatttt gtggactaac ctttagatta 

3661 accgacctta tggaaatcga ccatataaaa ccaaggtctg aaggcggtga taacacaact 

3721 aaaaacaaac aactgttaca ccgacattgt cacgatacta aaactgcttg attataataa 

3781 aacatacaca aaacctaagt tacaggactt acctgatgaa tacctatggg tagatgatat 

3841 gttaattcta acacagggat gtacctatga aaaaggacgt ttaggagagg agccggatga 

3901 ggtgaaagtc tcacgtccgg ttttgaagac gagtcgggta aggtaactta cctggcgaat 

3961 gtttaactat acagatgatt ggggagggcc taaacgagcc ctacccccaa cagtagagga 

4021 tattattggt gaaaaccaaa cccaccataa ggatggaact aacggaattg ttgggtaact 

4081 tctaggtctc aagaaaagga aatacttcaa gttatttact ggttaaaaac taaagtagta 

4141 gaaacagtca aagaaatttt gattatgtta tactaatttc ataaacttaa gctacacttt 

4201 tagaagtaac ctattataga aaatgtcccc atgtcaggag tagtaaaaat tgagataaat 

4261 gaatcgg
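
In this listing the intron's first bases (gtgcgactcg) appear at position 301 and its last bases (gtttaac) end at position 3967. That would correspond to 300 bases of 5' flank followed by the 3667-base intron. The sketch below shows one way to check such coordinates by plain substring search. The helper name `locate_intron` is hypothetical, and the short strings are excerpts from the page, not the full sequences.

```python
from typing import Optional, Tuple

def locate_intron(intron: str, flanked: str) -> Optional[Tuple[int, int]]:
    """Return 1-based inclusive (start, end) of intron inside flanked, or None."""
    start = flanked.lower().find(intron.lower())
    if start < 0:
        return None
    return start + 1, start + len(intron)

# Excerpt: last block of line 241 plus the first two blocks of line 301.
flank_excerpt = "taaacaatgg" + "gtgcgactcg" + "ttggtcacta"
intron_excerpt = "gtgcgactcgttggtcacta"

print(locate_intron(intron_excerpt, flank_excerpt))  # (11, 30)
```

With the complete sequences, the expected result under the reading above would be (301, 3967).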



[ORF sequence]

 

MNKAEILKRKLDNPRPFSSVAWDAYDIPSQACVNPNLKWKDINWKKVEKYVFKLQKLI

YRASSRGEIRKMRKYQKLLTKSYYAEMLRSRHLKASPTRRVWIPKPGREEKRPLGIPT

MYDRALQALVKLGRSPEWEALFEPNSYGLFRRSTHDAIAAIYVSINHKPKYVLDADIS

KCFDRINHDALLRKIGRTPYRRLIKQWLKSGVFDNKQFSNTVEGTPQGGVISPLXKSK

HRPTGKEKCLKNYAETLRGNKRNNKHALSLIRYADDFVILHKDIKVLLQAKTVIQEWL

NHVGLELKPEKTKIAHTLEEYDGNKPGFDFLGFTIRQWKVKTTKQGFKTLIKPSSKSI

KTHYRKLADICDKHKNAPTKALIAKLNPVIRGWANYFSTIISKETFSKLDYLLWRRLG

RWTSRRHPK*VSQMGQEEVLPNKSAKWVKKKYFPRCKVTRNWILNDGEYILNQHSDVA

IVRHVKVKGNKSPLDGDWTYWSSRIGKHPGIRKEVTTLLKRQKNKCAFCGLTFRLTDL

MEIDHIKPRSEGGDNTTKNKQLLHRHCHDTKTA
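
The first ten residues of the ORF (MNKAEILKRK) appear to correspond to positions 592 to 621 of the intron listing, beginning at the atg in "catgaacaaa". The sketch below translates that stretch as a check. It assumes the standard genetic code, and the names `CODON_TABLE` and `translate` are illustrative. Writing `X` for any codon that contains an ambiguous base is a convention of this sketch only.

```python
from itertools import product

BASES = "tcag"
# Standard genetic code, codons ordered ttt, ttc, tta, ttg, tct, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1; unknown codons become 'X', stops become '*'."""
    dna = dna.lower()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )

# Intron positions 592-621, copied from the listing above.
orf_start = "atgaacaaagctgaaatactaaaaagaaaa"
print(translate(orf_start))  # MNKAEILKRK
```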



[Secondary structure]

 


