[Back to introns by organism]  [Back to home page]

Information for UA.I1 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence]

 

Sequence from Genbank entry.  Intron is identified in red.  The ORF
is identified in blue with start and stop codons underlined.  

 

5' end

                                ttgttg acaagaagtt gcatcacaga ttgcccaact
21661 ggctacccct tccattcggg gtaatcctac cgcggaatgc atggatggca acgcccacgc
21721 agtaagcctg cggaacaacg tggaaaatca gaaaatccga ttcggatcaa ctgctaggat
21781 acgaatgcta tgaagagtgt aatgcgaacc atcgaaagag tcgtaacaca ggataagcga
21841 aagcggactg ttacgcttag accaaaaggt acgaaatcag actataactg ctgcagagac
21901 gttatagagg gtggacaaac ggtttccggc ttacagatgg cttctaaggc gttcataatt
21961 atctgtgacg tagaacttgg taagccccct aagctcccgg aatatctggg tagagacccc
22021 gtgagggcga acgatggcat agggggtaga ggaatcgata aaaagcgaat gacatcttgt
22081 aatgaggtgc atagaggttc aaaatttgcc ttaacccaaa agggtgctga cttatcgagt
22141 ggtgtctcat tgcatgaacg gaggcaaacc tgtgaatgta agtaattcaa ttacgccctt
22201 aaaaggcgag aagcttacag acaagcgacc aaaaccccgg tgggacgata ccgactggaa
22261 gagggtcgaa gaacatgtta acaggctgca aaccagaatt gcaaaagcag ttaagcaaga
22321 caagtggaat ttggtcaaaa ggttaagatt tctgctaact aactcgtttt atgcgaaatt
22381 gttggcagtn aagcgtgtaa cccagaatag agggaagaga acagctggaa tcgatggcgt
22441 gaagtgggca acactgaact ccaaaatgaa cgctgctctt atattatccg atgtgaagta
22501 taaagcaaag ccgttaagaa gagtttacat ctcaaagcag ggaacgacga agaagaggcc
22561 tttaggaatc ccaaccatgt acgatagggc gatgcaagcg ctgtacgccc ttgcgttact
22621 accgattgca gagacgacat ctgatccacg ttcgttcggc ttcaggatac atagaagcac
22681 acaagatgtg cgccaatatg catattgttg ccttggtggg aaatactctg caaaatgggt
22741 tttggaaggg gatatcaagg gatgtttcga taatatcgac catgactggc ttctgaacaa
22801 catcccgatg gataaatcga ttctaaagca attccttaag gctggatttg tgtataatcg
22861 acatttgaat cctacccccg caggaacacc tcagggagga attatctccc cgatactggc
22921 gaacatgaca ctggatggga tggaaaaagc tatctcatct gtgtattacg tcggtaagaa
22981 tgggaaaatc gataaacatc gatataatct ccacaaggtg aatttcgtga gatatgcgga
23041 cgatttcatc gttaccgcaa actcagaaga gacggcaaag gagattgcgg agttgattaa
23101 agagttcctg aaggcacgag gtctggaatt gtcagaggaa aagactcata tcacccatat
23161 cgattgtggc tttgactttt tgggctggaa cttccgcaaa tacggaggga agcttctgat
23221 aaagccatcc aagaactcga tggggaatct catccgtaaa atcggtgatg tgatcaagcg
23281 agcgaaggca tggaaacagg aagacctcat caacgtattg aaccccctca ttactggctg
23341 gtcgaattat catcgatcgg ctgtagctaa ggagatattc agcaaattag atcatattgt
23401 ctgggatatg ctctggaggt gggctaagag gagacacccg gacaaacgta atacgtgggt
23461 tgctaataga tattggcatt ctgtgggaac tcggaacagg gtgttttcta ccggaaggaa
23521 taggttgaaa ctattctcgg atacgaagat tgtccggtgt gctggcttga aattggataa
23581 gaaccccttt attgaccaag actactttaa cttacggaac tgctgcccga tactgaaagg
23641 gttatga
gat gcttgagcgg tatgaggtga aagtctcacg taccgttctg agatgaggag
23701 gggcgggtaa ccgccctatt cttaa

3' end

[top]


[Intron and flanking sequence]

 

21301 tttgatgtct tttatgggtt ccggtatgtt cctatggatt ctatacggga ttcatattgg
21361 ctctatcccg cttatagtta cgaatgttat gggcgtaagc tgtaacagcc tattactttt
21421 tatgaaatat agctacgcta gaaataaccg gatggatatg tgatcatttt ggtcaatagt
21481 cattcatggt agctacgccc gcgttatgaa agattttgta ttgtgccttt ttcgttgtag
21541 ataaactata tcttttgcca ccgagttgtt acaccgagag caccgagtat tttccctctg
21601 ctcggtgtgc tctgtggtct ctgtttgttg acaagaagtt gcatcacaga ttgcccaact
21661 ggctacccct tccattcggg gtaatcctac cgcggaatgc atggatggca acgcccacgc
21721 agtaagcctg cggaacaacg tggaaaatca gaaaatccga ttcggatcaa ctgctaggat
21781 acgaatgcta tgaagagtgt aatgcgaacc atcgaaagag tcgtaacaca ggataagcga
21841 aagcggactg ttacgcttag accaaaaggt acgaaatcag actataactg ctgcagagac
21901 gttatagagg gtggacaaac ggtttccggc ttacagatgg cttctaaggc gttcataatt
21961 atctgtgacg tagaacttgg taagccccct aagctcccgg aatatctggg tagagacccc
22021 gtgagggcga acgatggcat agggggtaga ggaatcgata aaaagcgaat gacatcttgt
22081 aatgaggtgc atagaggttc aaaatttgcc ttaacccaaa agggtgctga cttatcgagt
22141 ggtgtctcat tgcatgaacg gaggcaaacc tgtgaatgta agtaattcaa ttacgccctt
22201 aaaaggcgag aagcttacag acaagcgacc aaaaccccgg tgggacgata ccgactggaa
22261 gagggtcgaa gaacatgtta acaggctgca aaccagaatt gcaaaagcag ttaagcaaga
22321 caagtggaat ttggtcaaaa ggttaagatt tctgctaact aactcgtttt atgcgaaatt
22381 gttggcagtn aagcgtgtaa cccagaatag agggaagaga acagctggaa tcgatggcgt
22441 gaagtgggca acactgaact ccaaaatgaa cgctgctctt atattatccg atgtgaagta
22501 taaagcaaag ccgttaagaa gagtttacat ctcaaagcag ggaacgacga agaagaggcc
22561 tttaggaatc ccaaccatgt acgatagggc gatgcaagcg ctgtacgccc ttgcgttact
22621 accgattgca gagacgacat ctgatccacg ttcgttcggc ttcaggatac atagaagcac
22681 acaagatgtg cgccaatatg catattgttg ccttggtggg aaatactctg caaaatgggt
22741 tttggaaggg gatatcaagg gatgtttcga taatatcgac catgactggc ttctgaacaa
22801 catcccgatg gataaatcga ttctaaagca attccttaag gctggatttg tgtataatcg
22861 acatttgaat cctacccccg caggaacacc tcagggagga attatctccc cgatactggc
22921 gaacatgaca ctggatggga tggaaaaagc tatctcatct gtgtattacg tcggtaagaa
22981 tgggaaaatc gataaacatc gatataatct ccacaaggtg aatttcgtga gatatgcgga
23041 cgatttcatc gttaccgcaa actcagaaga gacggcaaag gagattgcgg agttgattaa
23101 agagttcctg aaggcacgag gtctggaatt gtcagaggaa aagactcata tcacccatat
23161 cgattgtggc tttgactttt tgggctggaa cttccgcaaa tacggaggga agcttctgat
23221 aaagccatcc aagaactcga tggggaatct catccgtaaa atcggtgatg tgatcaagcg
23281 agcgaaggca tggaaacagg aagacctcat caacgtattg aaccccctca ttactggctg
23341 gtcgaattat catcgatcgg ctgtagctaa ggagatattc agcaaattag atcatattgt
23401 ctgggatatg ctctggaggt gggctaagag gagacacccg gacaaacgta atacgtgggt
23461 tgctaataga tattggcatt ctgtgggaac tcggaacagg gtgttttcta ccggaaggaa
23521 taggttgaaa ctattctcgg atacgaagat tgtccggtgt gctggcttga aattggataa
23581 gaaccccttt attgaccaag actactttaa cttacggaac tgctgcccga tactgaaagg
23641 gttatgagat gcttgagcgg tatgaggtga aagtctcacg taccgttctg agatgaggag
23701 gggcgggtaa ccgccctatt cttaa
tctgc ggctatcctt ttttgagttt caagtgcggg
23761 gcagtctagc tgcgcacacg ataaactgtg tcgatactta ttccacaaac atccgcgtca
23821 tatcgactaa cggcttaact aaaccatctc gtaccgtagt tcgcagccct tcattcgccc
23881 ttatcaagtc tgcaaccggc ggactagtag tataatataa tttgaccatt gcttgtccgg
23941 ctgaattggg catcatatat tcgtctcgga acttgcgtaa tacatttatg tcttcatgca
24001 atggcgaccc atatgctgct gttgctatga aacaaggtga tggttccccg ctaaagttca

[top]


[ORF sequence]

 

MVKRLRFLLTNSFYAKLLAVKRVTQNRGKRTAGIDGVKWATLNSKMNAALILSDVKYK

AKPLRRVYISKQGTTKKRPLGIPTMYDRAMQALYALALLPIAETTSDPRSFGFRIHRS

TQDVRQYAYCCLGGKYSAKWVLEGDIKGCFDNIDHDWLLNNIPMDKSILKQFLKAGFV

YNRHLNPTPAGTPQGGIISPILANMTLDGMEKAISSVYYVGKNGKIDKHRYNLHKVNF

VRYADDFIVTANSEETAKEIAELIKEFLKARGLELSEEKTHITHIDCGFDFLGWNFRK

YGGKLLIKPSKNSMGNLIRKIGDVIKRAKAWKQEDLINVLNPLITGWSNYHRSAVAKE

IFSKLDHIVWDMLWRWAKRRHPDKRNTWVANRYWHSVGTRNRVFSTGRNRLKLFSDTK

IVRCAGLKLDKNPFIDQDYFNLRNCCPILKGL

[top]


[Secondary structure]

                                           

 [top]

| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |