Information for UB.I1 intron

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence] 

 

Sequence from the GenBank entry (the intron is on the antisense strand).

The intron boundaries are marked in red and the ORF in blue, with start and stop codons underlined.

 

Intron on antisense strand

 

3' end

   1 gtcacggtag gaatgccggt tacccagcac cccccgttca gatcccagcg tgcggaacta 

  61 ccgcactggg ctcctacctc aggtaataac gctgaaatct ctggtcagga aaaggatgtg 

 121 caattcgaac tctcggtatc aaccacctga ttaattttgt aaagcgcaac caactgaaac 

 181 tttgcccttt gtggcttctg cgtcttagat tatgtagcca tgcccgcgtt gtttggtcat 

 241 agaaactctt taccaaattc atattaccag ggactgcata gtaattttga aatcccctga 

 301 tgactgaatt cagccatttt ccagtttcct tcacatcatc atgcatacgt tccttcagtt 

 361 cttgcttcac ttccttatac ttctggacaa accgtttctt aatcgtttta cggcgtatga 

 421 agaatcgtcc agtgcgtgtt ttggagcagg agtgtgtgaa gcccaagaaa tcaaacgttt 

 481 caggttttct ttcgcctctt ttccggcgat taaccactgc aaagcgtcca aattctatta 

 541 atcgagtttt gactgggtgt agttcgagac cgaactgacc cattctctca atcagatcct 

 601 tcaagaacct gtctgcatct tttctgtatt gaaaccctac aacactatca tcggcgtaac 

 661 gcacaataat cacgtctccg tctgcatgcc tttttctcca ttggtgtgcc caaagatctt 

 721 gtgcgtagtg caggtaaaca tttgccaaca cgggtgaaat cacagaccct tgtggggtac 

 781 caacttcaag tgatgttcgt ttaccatctt caatcacacc gacttttagc cattttttaa 

 841 tcagccgcaa tattttgcga tccgctactc gatgctccag gaatttcaat aaccagtcat 

 901 gattgatttt atcaaaaaat ccagagatat ctgcatcaag tatgtaattt atcttacgcc 

 961 tcgatattcc aatatataaa gcatcgaggg cattgtgttg actacgtttt tctctgaaac 

1021 catagctgaa acccataaaa tcggtctcat agatttgatt taatatcgtt gagactgcct 

1081 gctggactat cttatcctcc agagcagtaa caccaagtgg ccttttccgg ccgtcagatt 

1141 tattaatgta gcttcttcta acgggtttag ctctatagct tccggtatgc acacgttcat 

1201 ttaaatctgc tattcgtcca ggcgagcctt ccttgtactt gcgccatgtc atttcatcta 

1261 ctccagcagc tgcttttcta ttcaggttgt agaaacttga agtcaacagc tctggcgtaa 

1321 tatggtgaaa aagattgtta aatctggcgc tcttgtcctg ctgagctttt ctacacacac 

1381 cacaaaggcc ggtcgacgca gctacctgtc cctgtgtaca gactgcggca gcgtttatga 

1441 tgttcccctt ggtcaacgcc cttccctcca cctgctccgc ggatgtttaa aatccttgtt 

1501 cgcaagcttc tcaggtacta tgttgttgtc cgacttccgt aagtcgttca tgccaggcgt 

1561 acagaatttc tctttcactg gccgttcctt tcattttgag gaagactcac ggatctcccg 

1621 attctcgcgt aaaggatgtc cacacatgca aggttctagg actccgccgg gtcgtcttac 

1681 agctcgcata ataacgcggc aaaacgtgtt gccttcccca tcggaccacg tggtcggcac 

1741 ccgagaatga gtgatttcgg agctcaatag cccgcctgtg cttccctctg tcaacgcttc 

1801 agccaatgca ttactgcaat tgccgcatga ctcgaggctg gtgtggtttg ctaaaccttc 

1861 caccataaga ctctttcatt ctctatccaa taccgatttt aatcggcgct ttc

5' end

Intron on sense strand

5' end

   1 gaaagcgccg attaaaatcg gtattggata gagaatgaaa gagtcttatg gtggaaggtt 

  61 tagcaaacca caccagcctc gagtcatgcg gcaattgcag taatgcattg gctgaagcgt 

 121 tgacagaggg aagcacaggc gggctattga gctccgaaat cactcattct cgggtgccga 

 181 ccacgtggtc cgatggggaa ggcaacacgt tttgccgcgt tattatgcga gctgtaagac 

 241 gacccggcgg agtcctagaa ccttgcatgt gtggacatcc tttacgcgag aatcgggaga 

 301 tccgtgagtc ttcctcaaaa tgaaaggaac ggccagtgaa agagaaattc tgtacgcctg 

 361 gcatgaacga cttacggaag tcggacaaca acatagtacc tgagaagctt gcgaacaagg 

 421 attttaaaca tccgcggagc aggtggaggg aagggcgttg accaagggga acatcataaa 

 481 cgctgccgca gtctgtacac agggacaggt agctgcgtcg accggccttt gtggtgtgtg  

 541 tagaaaagct cagcaggaca agagcgccag atttaacaat ctttttcacc atattacgcc 

 601 agagctgttg acttcaagtt tctacaacct gaatagaaaa gcagctgctg gagtagatga 

 661 aatgacatgg cgcaagtaca aggaaggctc gcctggacga atagcagatt taaatgaacg 

 721 tgtgcatacc ggaagctata gagctaaacc cgttagaaga agctacatta ataaatctga 

 781 cggccggaaa aggccacttg gtgttactgc tctggaggat aagatagtcc agcaggcagt 

 841 ctcaacgata ttaaatcaaa tctatgagac cgattttatg ggtttcagct atggtttcag 

 901 agaaaaacgt agtcaacaca atgccctcga tgctttatat attggaatat cgaggcgtaa 

 961 gataaattac atacttgatg cagatatctc tggatttttt gataaaatca atcatgactg 

1021 gttattgaaa ttcctggagc atcgagtagc ggatcgcaaa atattgcggc tgattaaaaa 

1081 atggctaaaa gtcggtgtga ttgaagatgg taaacgaaca tcacttgaag ttggtacccc 

1141 acaagggtct gtgatttcac ccgtgttggc aaatgtttac ctgcactacg cacaagatct 

1201 ttgggcacac caatggagaa aaaggcatgc agacggagac gtgattattg tgcgttacgc 

1261 cgatgatagt gttgtagggt ttcaatacag aaaagatgca gacaggttct tgaaggatct 

1321 gattgagaga atgggtcagt tcggtctcga actacaccca gtcaaaactc gattaataga 

1381 atttggacgc tttgcagtgg ttaatcgccg gaaaagaggc gaaagaaaac ctgaaacgtt 

1441 tgatttcttg ggcttcacac actcctgctc caaaacacgc actggacgat tcttcatacg 

1501 ccgtaaaacg attaagaaac ggtttgtcca gaagtataag gaagtgaagc aagaactgaa 

1561 ggaacgtatg catgatgatg tgaaggaaac tggaaaatgg ctgaattcag tcatcagggg 

1621 atttcaaaat tactatgcag tccctggtaa tatgaatttg gtaaagagtt tctatgacca 

1681 aacaacgcgg gcatggctac ataatctaag acgcagaagc cacaaagggc aaagtttcag 

1741 ttggttgcgc tttacaaaat taatcaggtg gttgataccg agagttcgaa ttgcacatcc 

1801 ttttcctgac cagagatttc agcgttatta cctgaggtag gagcccagtg cggtagttcc 

1861 gcacgctggg atctgaacgg ggggtgctgg gtaaccggca ttcctaccgt gac

3' end
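
The two listings above show the same intron read from opposite strands, so each should be the reverse complement of the other. The following is a minimal Python sketch of that relationship. The helper names `parse_listing` and `reverse_complement` are illustrative and not part of the database. Only the first antisense line and the last sense line are used here, to keep the example short.

```python
import re

# Lower- and upper-case complement map for the four DNA bases.
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def parse_listing(text):
    """Strip position numbers and whitespace from a GenBank-style listing."""
    return re.sub(r"[^A-Za-z]", "", text)


def reverse_complement(seq):
    """Complement every base, then reverse the result."""
    return seq.translate(COMPLEMENT)[::-1]


# First line of the antisense listing (start of its first numbered line).
antisense_first = parse_listing("   1 gtcacggtag gaatgccggt tacccagcac")

# Last line of the sense listing.
sense_last = parse_listing(
    "1861 gcacgctggg atctgaacgg ggggtgctgg gtaaccggca ttcctaccgt gac"
)

# The start of the antisense strand should match the end of the sense strand.
print(sense_last.endswith(reverse_complement(antisense_first)))
```

Applying `parse_listing` to a complete listing and then `reverse_complement` to the result would convert one full strand into the other in the same way.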


[Intron and flanking sequence]

 

2041 tatacccgcc agtatctgct aattgactcc caaccctatc caaaccatag ggcaggtaat
2101 cacccgattc aatagaaacc tctccatttt cagataaggt tgttaccact gcagtgcttc
2161 ccagatgtat tcacacgatc aggccgccaa gcatgttcta tatggaattc aaacctgctt
2221 cttaaaagca ctttttcggg ggtgaaatgg gcaacctaag accgatttaa ggcatctttt
2281 tgatggcatt aactatgatt ttagtcactt gatacggccc gaccccaaat tatatgtaaa
2341 agcacttttt cgggggtgaa atgggcaacc taagaccgat ttaaggcatc tttttgatgg
2401 cattaactat gattttagtc acttgatacg tcacggtagg aatgccggtt acccagcacc
2461 ccccgttcag atcccagcgt gcggaactac cgcactgggc tcctacctca ggtaataacg
2521 ctgaaatctc tggtcaggaa aaggatgtgc aattcgaact ctcggtatca accacctgat
2581 taattttgta aagcgcaacc aactgaaact ttgccctttg tggcttctgc gtcttagatt
2641 atgtagccat gcccgcgttg tttggtcata gaaactcttt accaaattca tattaccagg
2701 gactgcatag taattttgaa atcccctgat gactgaattc agccattttc cagtttcctt
2761 cacatcatca tgcatacgtt ccttcagttc ttgcttcact tccttatact tctggacaaa
2821 ccgtttctta atcgttttac ggcgtatgaa gaatcgtcca gtgcgtgttt tggagcagga
2881 gtgtgtgaag cccaagaaat caaacgtttc aggttttctt tcgcctcttt tccggcgatt
2941 aaccactgca aagcgtccaa attctattaa tcgagttttg actgggtgta gttcgagacc
3001 gaactgaccc attctctcaa tcagatcctt caagaacctg tctgcatctt ttctgtattg
3061 aaaccctaca acactatcat cggcgtaacg cacaataatc acgtctccgt ctgcatgcct
3121 ttttctccat tggtgtgccc aaagatcttg tgcgtagtgc aggtaaacat ttgccaacac
3181 gggtgaaatc acagaccctt gtggggtacc aacttcaagt gatgttcgtt taccatcttc
3241 aatcacaccg acttttagcc attttttaat cagccgcaat attttgcgat ccgctactcg
3301 atgctccagg aatttcaata accagtcatg attgatttta tcaaaaaatc cagagatatc
3361 tgcatcaagt atgtaattta tcttacgcct cgatattcca atatataaag catcgagggc
3421 attgtgttga ctacgttttt ctctgaaacc atagctgaaa cccataaaat cggtctcata
3481 gatttgattt aatatcgttg agactgcctg ctggactatc ttatcctcca gagcagtaac
3541 accaagtggc cttttccggc cgtcagattt attaatgtag cttcttctaa cgggtttagc
3601 tctatagctt ccggtatgca cacgttcatt taaatctgct attcgtccag gcgagccttc
3661 cttgtacttg cgccatgtca tttcatctac tccagcagct gcttttctat tcaggttgta
3721 gaaacttgaa gtcaacagct ctggcgtaat atggtgaaaa agattgttaa atctggcgct
3781 cttgtcctgc tgagcttttc tacacacacc acaaaggccg gtcgacgcag ctacctgtcc
3841 ctgtgtacag actgcggcag cgtttatgat gttccccttg gtcaacgccc ttccctccac
3901 ctgctccgcg gatgtttaaa atccttgttc gcaagcttct caggtactat gttgttgtcc
3961 gacttccgta agtcgttcat gccaggcgta cagaatttct ctttcactgg ccgttccttt
4021 cattttgagg aagactcacg gatctcccga ttctcgcgta aaggatgtcc acacatgcaa
4081 ggttctagga ctccgccggg tcgtcttaca gctcgcataa taacgcggca aaacgtgttg
4141 ccttccccat cggaccacgt ggtcggcacc cgagaatgag tgatttcgga gctcaatagc
4201 ccgcctgtgc ttccctctgt caacgcttca gccaatgcat tactgcaatt gccgcatgac
4261 tcgaggctgg tgtggtttgc taaaccttcc accataagac tctttcattc tctatccaat
4321 accgatttta atcggcgctt tcggcccgac cccaaattac aacaaattat caacaaatta
4381 tcagattcaa attatcaaat tatataagcc aaattatatt tggtacagct gttacagaat
4441 gggtcgcaga tcaaataggg ttagaagctc tatcgttatt tgctcgaaat agcgggaaag
4501 gaatggattc ggtcaaacgg gaattc
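
The flanking listing keeps the coordinates of the GenBank entry, so the position of the intron within it can be recovered by searching for the first bases of the antisense intron sequence. The sketch below does this for a single numbered line. The function names `parse_numbered_line` and `locate` are hypothetical helpers, and the approach assumes that the motif does not span a line break.

```python
import re


def parse_numbered_line(line):
    """Return (start_position, bases) for one 'NNNN bases ...' listing line."""
    number, bases = line.split(maxsplit=1)
    return int(number), re.sub(r"\s+", "", bases)


def locate(line, motif):
    """Return the 1-based entry coordinate where motif starts, or None."""
    start, bases = parse_numbered_line(line)
    offset = bases.find(motif)
    return None if offset < 0 else start + offset


# Line 2401 of the flanking listing, and the first ten bases of the intron
# as given in the antisense listing.
flank_line = (
    "2401 cattaactat gattttagtc acttgatacg tcacggtagg aatgccggtt acccagcacc"
)
print(locate(flank_line, "gtcacggtag"))
```

For this line the search reports coordinate 2430 as the first base of the intron on the antisense strand. A search across the whole listing would first join all lines with the same parsing step.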



[ORF sequence]

 

VEGRALTKGNIINAAAVCTQGQVAASTGLCGVCRKAQQDKSARFNNLFHHITPELLTS

SFYNLNRKAAAGVDEMTWRKYKEGSPGRIADLNERVHTGSYRAKPVRRSYINKSDGRK

RPLGVTALEDKIVQQAVSTILNQIYETDFMGFSYGFREKRSQHNALDALYIGISRRKI

NYILDADISGFFDKINHDWLLKFLEHRVADRKILRLIKKWLKVGVIEDGKRTSLEVGT

PQGSVISPVLANVYLHYAQDLWAHQWRKRHADGDVIIVRYADDSVVGFQYRKDADRFL

KDLIERMGQFGLELHPVKTRLIEFGRFAVVNRRKRGERKPETFDFLGFTHSCSKTRTG

RFFIRRKTIKKRFVQKYKEVKQELKERMHDDVKETGKWLNSVIRGFQNYYAVPGNMNL

VKSFYDQTTRAWLHNLRRRSHKGQSFSWLRFTKLIRWLIPRVRIAHPFPDQRFQRYYL

R
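
The protein above appears to be a codon-by-codon reading of the sense-strand ORF, whose first codons (gtg gag gga agg gcg ttg) lie on the line numbered 421 of the sense listing. The sketch below shows such a translation with the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative. It reads a GTG start codon literally as V, as the listing does, although an initiator GTG is normally decoded as methionine in bacteria.

```python
# Standard genetic code, with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = dna.lower()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        # Unknown codons (for example ones containing 'n') become 'X'.
        amino = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First six codons of the ORF as they appear in the sense-strand listing.
print(translate("gtggagggaagggcgttg"))
```

Passing the full ORF region of the sense strand to `translate` would be one way to check the listed protein against the nucleotide sequence.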


[Secondary structure]

                                                       


