[Back to introns by organism] [Back to home page]
Information of S.ag.I2 intron (Format of information for each intron)
[Intron and flanking sequence]
Sequence
from Genbank entry. Intron is identified in red.
The ORF
is identified in blue with start and stop codons
underlined.
5' end
gag
tgtttcgcta
10201 agcttctctc aatctaacag gtgaaagtcc tgtcctaaca attaaccagt tcggttaggt
10261 agctatggca tacgtatgcg gagtaatcca aatgcggaag caatgccgac taacgttatc
10321 cccagtaggc tgatattcat tggtcttatc tcagtccgtg ggagcgagca aacctacagt
10381 ccgtaacgaa agtgaacttc cgctaagtgt cgttaaggtc attccaatga gtataactga
10441 gagagttgga aatagagttg tcgagccttt agaatggggg cgaagactgg aattgttgtc
10501 agttctaact ggtatatact ggcaataaag ctctcggcat atagaagatg gaatgtaggt
10561 aaagatgagc gaggaacgaa agagaccctt ttctatcata catatcccat gtatgtaagg
10621 ttagatataa ggaaaacacc gaagttctaa ttgacgtagg aatagggagt cggagaggtt
10681 catagtaccg ataatcatac gagacaacaa aactcgatgt agagaaggag ccttgccttt
10741 atcggtctta ccgatgaaag aggaactaga tgattgctca gaaagctata aacatctttg
10801 aaaaggttca agtatttcaa cggaagatat atctatcgac caaggcagat aataagcgaa
10861 agtttggtgt tttatacgac aaagtatatc gtaaagatat cctgaaagtt gcgtggttct
10921 atgtaaaaag gaataaaggt tcggcaggca ttgatgactt cacgattgaa gaaattgaag
10981 cctacggtgt acagaaattt cttgatgaaa tagaagacca gttgagaaac aagaaatatc
11041 aacctaaggc agtaaagcga gtttatattc caaaagcgaa tggtaaaaag agaccgttgg
11101 gaattccgac agtccgagat agagttgttc aaacagctgt gaaaatagtt atagaaccga
11161 tattcgaagc ggactttcaa gaattttctt atggttttag acctaagcga agtgctaacc
11221 aagctataag agagatatat aaatatctca attatggttg tgagtgggtt attgatgccg
11281 acttaaaagg ttactttgac acgatacctc atgataagtt actactttta gttaaggaac
11341 gagtaactga taagtctatt atcaagttac ttagtttatg gctagaagca ggtatcatgg
11401 aagacaatca agtgagaagt aatattttag gtactccaca aggaggtgtc atatcaccgt
11461 tgttagcgaa tatctaccta aatgctttag acagatattg gaagaacaat cggttagaag
11521 gaagagggca tgatgcccac ttgatacgtt atgcggatga ctttgttatt ctatgctcaa
11581 ataatccgaa gaaatattat cagtatgcga aacagcgtat tgataagcta ggattaacat
11641 tgaacgaaga aaagacaaga attgtgcatg cgacagaggg atttgatttt ctaggctata
11701 cgcttaggaa atcaaaatct cacaaaagtg gtaagtataa aacctactac tatccttcaa
11761 gaaaatctat gaaatcaata aaaggcaaag taaaagatgt tatccaaaca ggacaacacc
11821 tgaaccttcc tgatgtcatg gaaagattga atccaatgtt gagaggatgg gctaattatt
11881 tcaaagctgg gaactctaag caacacttca agagtataga taactatgtc atatataatc
11941 taacaattat gcttagaaag aagcacaaga agtctggaaa gggatggagg gaacatccgc
12001 cgtcatggta ctataactac tttggactgg tttgtttaag gaagttgagt accaatatca
12061 atgatgatag tcagagatat ggtagataac ttgtgaatgg ctataacgtc gaagaaagaa
12121 gatattggga aagccgtgta agggaaaacc ttacgcacgg tttgacgagg ggtttctgag
12181 gagatggctc aaatcagcga cctactctac
3' end
[Intron and flanking sequence]
9481 gatgacggca acgaagatag taaaccaaac aattccaggc aatcgccaaa agctacaact
9541 aaaaaaacac aaaagacagg atatcaaaca ccaaaaatca gcaatatcca aatcgagact
9601 tacaagtctg atttaaatga tattgcgaaa gccacaaacc aaaacgttga agagttaaca
9661 aaatggctaa ccgatacttt aaaagtgggg gaactagaaa atttgcatac ggaacacatt
9721 gtttcagcag acgaattaat caataaactc aaaaagaaag caggactaaa aaatgattaa
9781 caacattgta ctagtaggtc gcatgaccaa ggatgccgaa cttcgttata caccaagtaa
9841 tcaagcggta gctacttttt cacttgcagt taatcgtaat tttaaaaatc aatctggcga
9901 acgtgaggct gattttatta actgtgttat ttggcgccaa caagctgaaa acttggctaa
9961 ctgggcaaaa aaaggtgctt tggttggaat tacaggtcgt atccaaacgc gtaattatga
10021 aaaccaacaa ggtcaacgta tctatgtaac agaagttgtt gcggaaaatt tccaattatt
10081 agaaagtcgc aatagccaac aacagactaa tcaaagcggc aatagttcta attcttattt
10141 tggcaatgcc aacaaaatgg atatttcaga tgatgactta ccattctgag tgtttcgcta
10201 agcttctctc aatctaacag gtgaaagtcc tgtcctaaca attaaccagt tcggttaggt
10261 agctatggca tacgtatgcg gagtaatcca aatgcggaag caatgccgac taacgttatc
10321 cccagtaggc tgatattcat tggtcttatc tcagtccgtg ggagcgagca aacctacagt
10381 ccgtaacgaa agtgaacttc cgctaagtgt cgttaaggtc attccaatga gtataactga
10441 gagagttgga aatagagttg tcgagccttt agaatggggg cgaagactgg aattgttgtc
10501 agttctaact ggtatatact ggcaataaag ctctcggcat atagaagatg gaatgtaggt
10561 aaagatgagc gaggaacgaa agagaccctt ttctatcata catatcccat gtatgtaagg
10621 ttagatataa ggaaaacacc gaagttctaa ttgacgtagg aatagggagt cggagaggtt
10681 catagtaccg ataatcatac gagacaacaa aactcgatgt agagaaggag ccttgccttt
10741 atcggtctta ccgatgaaag aggaactaga tgattgctca gaaagctata aacatctttg
10801 aaaaggttca agtatttcaa cggaagatat atctatcgac caaggcagat aataagcgaa
10861 agtttggtgt tttatacgac aaagtatatc gtaaagatat cctgaaagtt gcgtggttct
10921 atgtaaaaag gaataaaggt tcggcaggca ttgatgactt cacgattgaa gaaattgaag
10981 cctacggtgt acagaaattt cttgatgaaa tagaagacca gttgagaaac aagaaatatc
11041 aacctaaggc agtaaagcga gtttatattc caaaagcgaa tggtaaaaag agaccgttgg
11101 gaattccgac agtccgagat agagttgttc aaacagctgt gaaaatagtt atagaaccga
11161 tattcgaagc ggactttcaa gaattttctt atggttttag acctaagcga agtgctaacc
11221 aagctataag agagatatat aaatatctca attatggttg tgagtgggtt attgatgccg
11281 acttaaaagg ttactttgac acgatacctc atgataagtt actactttta gttaaggaac
11341 gagtaactga taagtctatt atcaagttac ttagtttatg gctagaagca ggtatcatgg
11401 aagacaatca agtgagaagt aatattttag gtactccaca aggaggtgtc atatcaccgt
11461 tgttagcgaa tatctaccta aatgctttag acagatattg gaagaacaat cggttagaag
11521 gaagagggca tgatgcccac ttgatacgtt atgcggatga ctttgttatt ctatgctcaa
11581 ataatccgaa gaaatattat cagtatgcga aacagcgtat tgataagcta ggattaacat
11641 tgaacgaaga aaagacaaga attgtgcatg cgacagaggg atttgatttt ctaggctata
11701 cgcttaggaa atcaaaatct cacaaaagtg gtaagtataa aacctactac tatccttcaa
11761 gaaaatctat gaaatcaata aaaggcaaag taaaagatgt tatccaaaca ggacaacacc
11821 tgaaccttcc tgatgtcatg gaaagattga atccaatgtt gagaggatgg gctaattatt
11881 tcaaagctgg gaactctaag caacacttca agagtataga taactatgtc atatataatc
11941 taacaattat gcttagaaag aagcacaaga agtctggaaa gggatggagg gaacatccgc
12001 cgtcatggta ctataactac tttggactgg tttgtttaag gaagttgagt accaatatca
12061 atgatgatag tcagagatat ggtagataac ttgtgaatgg ctataacgtc gaagaaagaa
12121 gatattggga aagccgtgta agggaaaacc ttacgcacgg tttgacgagg ggtttctgag
12181 gagatggctc aaatcagcga cctactctac aaaaatgacg aagccgagga aacaaaggat
12241 atatgcaata tatgatgacg acaaatttgt cgacgttggc acaaaagaag agttatcggc
12301 acggcttgga attaaaaaag caacaataga acagtacatg actaaatcat atcaagcgtt
12361 agctagctca aaacgaattg cattgttggt agggattgaa gaggaatatg acttttaaga
12421 cagaatttga aataccaatc gaaccaaaac ctcaaactag acctaagttc agcaaatttg
12481 gtacgtacga agatccaaag atgaagagat ggcgaaaaga ggtttctgga tggatagaaa
12541 aaaattatga tggaccgttt ttcgatgatt gcataaaggt agaggtaacc ttttacatga
12601 aagcccccaa aacgctatca aaagagccta cacaacgttc taaaggtaaa acaatacaaa
12661 tatatcagaa cttcgtgcgt gagcttatat ggcacgctaa gaagcctgat attgataatc
MIAQKAINIFEKVQVFQRKIYLSTKADNKRKFGVLYDKVYRKDILKVAWFYVKRNKGS
AGIDDFTIEEIEAYGVQKFLDEIEDQLRNKKYQPKAVKRVYIPKANGKKRPLGIPTVR
DRVVQTAVKIVIEPIFEADFQEFSYGFRPKRSANQAIREIYKYLNYGCEWVIDADLKG
YFDTIPHDKLLLLVKERVTDKSIIKLLSLWLEAGIMEDNQVRSNILGTPQGGVISPLL
ANIYLNALDRYWKNNRLEGRGHDAHLIRYADDFVILCSNNPKKYYQYAKQRIDKLGLT
LNEEKTRIVHATEGFDFLGYTLRKSKSHKSGKYKTYYYPSRKSMKSIKGKVKDVIQTG
QHLNLPDVMERLNPMLRGWANYFKAGNSKQHFKSIDNYVIYNLTIMLRKKHKKSGKGW
REHPPSWYYNYFGLVCLRKLSTNINDDSQRYGR

| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragments | Alignment of insertion sequence |
| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |