[Back to introns by organism]  [Back to home page]

Information for Sc.s.I1 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence]

 

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

Intron on antisense strand

3' end
                                                                gtaggc
 841 taagtaccca gcgccttacc gtattactac ggcaggtttc caaatactcg cccccgaacc
 901 gtacgtacac ctttcaatgt atacggctct ccatttataa actcaatcta acttgccata
 961 atgtaatttc ttatggcaat ctttacacaa tgctattgtt ttccgttgtc tagaaatcat
1021 caaacgttcc caaaaagtct ttttctttaa atccttgagt tttctcacat gatgaatttc
1081 aatgagacta ttggtagctt gacagtactc acaacggttt gcttttaatc tatcaattaa
1141 attggttcgg ctaaaatatt ttgccgtatt tggtaaatta tcattttcta ggaaagattt
1201 cttttgacgt ttaaagcctc cattatagaa ataacgggtc attgtatctc ctttacgtcc
1261 aatatactgt atcctgaatt gaccatcctt tttatattta cggatgatgt gagatttagt
1321 cgtacgatac ttggtcgcat aggttttata catgctatat tccataatat acttaaagcg
1381 atggaggata gaactgttgt tagcgataca ataatagtta taaaaacctc ttatttctgc
1441 attgtagcgt tctaaaatct ctaaatcatc acagtctttc ataaaataac gagctgttgg
1501 tttccaaact tcgtgacctc tatgataagt cattttcata gccccatagg acagcaatct
1561 atctcgaatc gtttcaatag aaacctctaa aaccaatcta cctgtataat ttctgaccaa
1621 tcgtcctgac gaatcacgtt tggctaaatt ggattggcga atgtacagat gataacccag
1681 aaacctcgct ttatctcttg cattggtaat cagggttttc tcttcagaga gttccaattt
1741 gagaaccgtt tccaaatagt ccttaatgtc tgctttaata cgacgagcgt cctctttact
1801 tccaatcaca ccacagataa aatcatctgc atatcgggta taggttagac gcttaaaact
1861 actatccata gggtcgctat gaggaatcaa tactctttct ttttctaatt gacgaatacg
1921 ttgaatggct tcttggcgtt ggttttccgt cgaagtacac tctagtgttc ttcttgcttt
1981 tcccagtgca atctcatttt ggcgatattc tggggtacgt ttacggtatt tcccttcaca
2041 gaagttcttc acataatctg tcatatattt atcaaactta tccaaataga tattggctag
2101 aattggactg attattcctc cttgtggagt tcctgagtag gttttgtaga atttccaatc
2161 ttccacatat cctgcattga gaaatttacg aatcaggcgt aagaaacgct catccgtaat
2221 tctttctcgt agaatttgaa ttatcacatc gtggttgata ttatcaaaaa acgatttgat
2281 atctccttca ataaaccact ttactcctgt ataggttttc tgaatttgag taagtgctga
2341 atgacagctt ttattcggac gaaaaccatg ggaagaggat tcaaattgtc cctcgtagat
2401 ggattccaaa atcattttga taacctgttg taagagctta tcatcaaatg atggaatacc
2461 tagtggacgt ggcttcccat tctttttagg aatataggtt cgtcttgatg gatgtggctg
2521 ataagattca tctttcaggg aatcaatcaa ttggtcaatt ctagccatac tcataccatc
2581 aatcgtcaac tcatctactc caggtgtcat gtgtcctggg ttagcgtaaa tcgtttgata
2641 ggcaactaag tacat
ttctt tgttgtaaag taagcgataa agtcgctcaa attgatagtt
2701 tttatcttta ctgtgttttg ttagattgtt taacacattt tgaggatttc tcatggtgtc
2761 tcacacacct tcctttccat aatgttaatg ttataaactg ttccccttcg ccatgtacgt
2821 gccattaaca cgctcgaact actacgggaa ctccgttacc ttatcaaata ttcataaacc
2881 tgatgtttaa tagcttttac aagcattttg acttaggtaa tctccgttta gctggtatag
2941 taacatagct tgataatttc atcggatagg acgttcatct gttttcattt actatgaata
3001 cgctatggaa tccctttgct tcactacctc gaacagtgat aatgaagttc catagtatga
3061 gggttttagt tatcttacct cacagtcccg acacagaccg tttagactat ccttcaatca
3121 gtttagattt tatccttata attatctcag taacaactgt tacattgtca tctcctcatt
3181 cagtcgtgat gtgataactc atcaacttat gagtttccca acgtgctttg ttccctgata
3241 tttggtttcc ctcacaggtc agttgggtga tgataggcat taaggtcacg cctactactt
3301 tgccaaaaag tagatttact ataccgcctt tacggacgca c

5' end

 

Intron on sense strand

 

5' end

   1 gtgcgtccgt aaaggcggta tagtaaatct actttttggc aaagtagtag gcgtgacctt

  61 aatgcctatc atcacccaac tgacctgtga gggaaaccaa atatcaggga acaaagcacg 

 121 ttgggaaact cataagttga tgagttatca catcacgact gaatgaggag atgacaatgt 

 181 aacagttgtt actgagataa ttataaggat aaaatctaaa ctgattgaag gatagtctaa 

 241 acggtctgtg tcgggactgt gaggtaagat aactaaaacc ctcatactat ggaacttcat 

 301 tatcactgtt cgaggtagtg aagcaaaggg attccatagc gtattcatag taaatgaaaa 

 361 cagatgaacg tcctatccga tgaaattatc aagctatgtt actataccag ctaaacggag 

 421 attacctaag tcaaaatgct tgtaaaagct attaaacatc aggtttatga atatttgata 

 481 aggtaacgga gttcccgtag tagttcgagc gtgttaatgg cacgtacatg gcgaagggga 

 541 acagtttata acattaacat tatggaaagg aaggtgtgtg agacaccatg agaaatcctc 

 601 aaaatgtgtt aaacaatcta acaaaacaca gtaaagataa aaactatcaa tttgagcgac 

 661 tttatcgctt actttacaac aaagaaatgt acttagttgc ctatcaaacg atttacgcta 

 721 acccaggaca catgacacct ggagtagatg agttgacgat tgatggtatg agtatggcta 

 781 gaattgacca attgattgat tccctgaaag atgaatctta tcagccacat ccatcaagac 

 841 gaacctatat tcctaaaaag aatgggaagc cacgtccact aggtattcca tcatttgatg 

 901 ataagctctt acaacaggtt atcaaaatga ttttggaatc catctacgag ggacaatttg 

 961 aatcctcttc ccatggtttt cgtccgaata aaagctgtca ttcagcactt actcaaattc 

1021 agaaaaccta tacaggagta aagtggttta ttgaaggaga tatcaaatcg ttttttgata 

1081 atatcaacca cgatgtgata attcaaattc tacgagaaag aattacggat gagcgtttct 

1141 tacgcctgat tcgtaaattt ctcaatgcag gatatgtgga agattggaaa ttctacaaaa 

1201 cctactcagg aactccacaa ggaggaataa tcagtccaat tctagccaat atctatttgg 

1261 ataagtttga taaatatatg acagattatg tgaagaactt ctgtgaaggg aaataccgta 

1321 aacgtacccc agaatatcgc caaaatgaga ttgcactggg aaaagcaaga agaacactag 

1381 agtgtacttc gacggaaaac caacgccaag aagccattca acgtattcgt caattagaaa 

1441 aagaaagagt attgattcct catagcgacc ctatggatag tagttttaag cgtctaacct 

1501 atacccgata tgcagatgat tttatctgtg gtgtgattgg aagtaaagag gacgctcgtc 

1561 gtattaaagc agacattaag gactatttgg aaacggttct caaattggaa ctctctgaag 

1621 agaaaaccct gattaccaat gcaagagata aagcgaggtt tctgggttat catctgtaca 

1681 ttcgccaatc caatttagcc aaacgtgatt cgtcaggacg attggtcaga aattatacag 

1741 gtagattggt tttagaggtt tctattgaaa cgattcgaga tagattgctg tcctatgggg 

1801 ctatgaaaat gacttatcat agaggtcacg aagtttggaa accaacagct cgttatttta 

1861 tgaaagactg tgatgattta gagattttag aacgctacaa tgcagaaata agaggttttt 

1921 ataactatta ttgtatcgct aacaacagtt ctatcctcca tcgctttaag tatattatgg 

1981 aatatagcat gtataaaacc tatgcgacca agtatcgtac gactaaatct cacatcatcc 

2041 gtaaatataa aaaggatggt caattcagga tacagtatat tggacgtaaa ggagatacaa 

2101 tgacccgtta tttctataat ggaggcttta aacgtcaaaa gaaatctttc ctagaaaatg 

2161 ataatttacc aaatacggca aaatatttta gccgaaccaa tttaattgat agattaaaag 

2221 caaaccgttg tgagtactgt caagctacca atagtctcat tgaaattcat catgtgagaa 

2281 aactcaagga tttaaagaaa aagacttttt gggaacgttt gatgatttct agacaacgga 

2341 aaacaatagc attgtgtaaa gattgccata agaaattaca ttatggcaag ttagattga

2401 tttataaatg gagagccgta tacattgaaa ggtgtacgta cggttcgggg gcgagtattt 

2461 ggaaacctgc cgtagtaata cggtaaggcg ctgggtactt agcctac

3' end

 [top]

[Intron and flanking sequence]

 

 481 taatagggag gtgcttctcg ctataggaaa tggtatggcg atactcgtct ttggagcgat
 541 taaaggcgcg tttttgattt tctaaaactg tcagttcatt ttccaactcc attttgagct
 601 taaggtaggg attaccagtg gctaaagcct taaagtcaga agcagtcatg gtttgttcat
 661 caatgtcttc agctgatctc acaggatctt ttgaggtcat tatctgcgta atgtatttga
 721 gcttattctc ctgtgtctgc cagaggtaat tgtcgaagct ccctttagta atatagtgat
 781 aaatatctac ctcctggtgc atgtttccct gacgaatcaa tctaccattt cgctgtaggc
 841 taagtaccca gcgccttacc gtattactac ggcaggtttc caaatactcg cccccgaacc
 901 gtacgtacac ctttcaatgt atacggctct ccatttataa actcaatcta acttgccata
 961 atgtaatttc ttatggcaat ctttacacaa tgctattgtt ttccgttgtc tagaaatcat
1021 caaacgttcc caaaaagtct ttttctttaa atccttgagt tttctcacat gatgaatttc
1081 aatgagacta ttggtagctt gacagtactc acaacggttt gcttttaatc tatcaattaa
1141 attggttcgg ctaaaatatt ttgccgtatt tggtaaatta tcattttcta ggaaagattt
1201 cttttgacgt ttaaagcctc cattatagaa ataacgggtc attgtatctc ctttacgtcc
1261 aatatactgt atcctgaatt gaccatcctt tttatattta cggatgatgt gagatttagt
1321 cgtacgatac ttggtcgcat aggttttata catgctatat tccataatat acttaaagcg
1381 atggaggata gaactgttgt tagcgataca ataatagtta taaaaacctc ttatttctgc
1441 attgtagcgt tctaaaatct ctaaatcatc acagtctttc ataaaataac gagctgttgg
1501 tttccaaact tcgtgacctc tatgataagt cattttcata gccccatagg acagcaatct
1561 atctcgaatc gtttcaatag aaacctctaa aaccaatcta cctgtataat ttctgaccaa
1621 tcgtcctgac gaatcacgtt tggctaaatt ggattggcga atgtacagat gataacccag
1681 aaacctcgct ttatctcttg cattggtaat cagggttttc tcttcagaga gttccaattt
1741 gagaaccgtt tccaaatagt ccttaatgtc tgctttaata cgacgagcgt cctctttact
1801 tccaatcaca ccacagataa aatcatctgc atatcgggta taggttagac gcttaaaact
1861 actatccata gggtcgctat gaggaatcaa tactctttct ttttctaatt gacgaatacg
1921 ttgaatggct tcttggcgtt ggttttccgt cgaagtacac tctagtgttc ttcttgcttt
1981 tcccagtgca atctcatttt ggcgatattc tggggtacgt ttacggtatt tcccttcaca
2041 gaagttcttc acataatctg tcatatattt atcaaactta tccaaataga tattggctag
2101 aattggactg attattcctc cttgtggagt tcctgagtag gttttgtaga atttccaatc
2161 ttccacatat cctgcattga gaaatttacg aatcaggcgt aagaaacgct catccgtaat
2221 tctttctcgt agaatttgaa ttatcacatc gtggttgata ttatcaaaaa acgatttgat
2281 atctccttca ataaaccact ttactcctgt ataggttttc tgaatttgag taagtgctga
2341 atgacagctt ttattcggac gaaaaccatg ggaagaggat tcaaattgtc cctcgtagat
2401 ggattccaaa atcattttga taacctgttg taagagctta tcatcaaatg atggaatacc
2461 tagtggacgt ggcttcccat tctttttagg aatataggtt cgtcttgatg gatgtggctg
2521 ataagattca tctttcaggg aatcaatcaa ttggtcaatt ctagccatac tcataccatc
2581 aatcgtcaac tcatctactc caggtgtcat gtgtcctggg ttagcgtaaa tcgtttgata
2641 ggcaactaag tacatttctt tgttgtaaag taagcgataa agtcgctcaa attgatagtt
2701 tttatcttta ctgtgttttg ttagattgtt taacacattt tgaggatttc tcatggtgtc
2761 tcacacacct tcctttccat aatgttaatg ttataaactg ttccccttcg ccatgtacgt
2821 gccattaaca cgctcgaact actacgggaa ctccgttacc ttatcaaata ttcataaacc
2881 tgatgtttaa tagcttttac aagcattttg acttaggtaa tctccgttta gctggtatag
2941 taacatagct tgataatttc atcggatagg acgttcatct gttttcattt actatgaata
3001 cgctatggaa tccctttgct tcactacctc gaacagtgat aatgaagttc catagtatga
3061 gggttttagt tatcttacct cacagtcccg acacagaccg tttagactat ccttcaatca
3121 gtttagattt tatccttata attatctcag taacaactgt tacattgtca tctcctcatt
3181 cagtcgtgat gtgataactc atcaacttat gagtttccca acgtgctttg ttccctgata
3241 tttggtttcc ctcacaggtc agttgggtga tgataggcat taaggtcacg cctactactt
3301 tgccaaaaag tagatttact ataccgcctt tacggacgca c
ggacaatgt ctgagggacg
3361 ccatggaaca tctaaatggt gaacagcttt catgcgtgat tgtacgttta gacccgttcc
3421 acctttttca gtcgaagcca tgagaatccg tacttcccca ctattgacct ttcgtgagag
3481 agagtttttc ttctcatcgg tattggcatc atggacaaag gcaatttctt cttttggaat
3541 ccctcgatca accagcaaag ctttcagttc gttgtagaca tcaaagcctt cttccttact
3601 tttaggtgtt ccaatatcag agaaaatcat ctgagtggct ttgtattcag ctccttcacg

[top]


[ORF sequence]

 

MYLVAYQTIYANPGHMTPGVDELTIDGMSMARIDQLIDSLKDESYQPHPSRRTYIPKK

NGKPRPLGIPSFDDKLLQQVIKMILESIYEGQFESSSHGFRPNKSCHSALTQIQKTYT

GVKWFIEGDIKSFFDNINHDVIIQILRERITDERFLRLIRKFLNAGYVEDWKFYKTYS

GTPQGGIISPILANIYLDKFDKYMTDYVKNFCEGKYRKRTPEYRQNEIALGKARRTLE

CTSTENQRQEAIQRIRQLEKERVLIPHSDPMDSSFKRLTYTRYADDFICGVIGSKEDA

RRIKADIKDYLETVLKLELSEEKTLITNARDKARFLGYHLYIRQSNLAKRDSSGRLVR

NYTGRLVLEVSIETIRDRLLSYGAMKMTYHRGHEVWKPTARYFMKDCDDLEILERYNA

EIRGFYNYYCIANNSSILHRFKYIMEYSMYKTYATKYRTTKSHIIRKYKKDGQFRIQY

IGRKGDTMTRYFYNGGFKRQKKSFLENDNLPNTAKYFSRTNLIDRLKANRCEYCQATN

SLIEIHHVRKLKDLKKKTFWERLMISRQRKTIALCKDCHKKLHYGKLD

[top]


[Secondary structure]

 

                                                                                  [top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |