[Back to introns by organism] [Back to home page]
Information of S.f.I1 intron (Format of information for each intron)
[Intron and flanking sequence]
Note: Redundant intron copies found in
AF200692 Shigella flexneri (21025-23296)
AE015311 Shigella flexneri (585-2272)
AE016988 Shigella flexneri (150712-152981)
Sequence from Genbank entry (intron is on the antisense strand).
The boundaries of the intron are marked as red and ORF is marked
as blue, with start and stop codons underlined.
Intron on the antisense strand
3' end
gtaag agtaagaggc agggcgtagt
541 cggactattt ccctgcctct cctccccgaa ccggacgtgc acctttcagc gcatccggct
601 ctccatttaa atgctggcga acgccattgc cacttctgta aagcgcgatg tatacgtgtt
661 tctggtctcc gtcctcagat agggattacc ttcgggtagc cgccagcgga acagcttctt
721 gccttgcccc accaaccggt acagtatttc gccgctgagc ttgccgtgat tggttttacc
781 aaataaaacc cacgttttgc tctgacccgg tttcggtgat ttacaccacc acctcatcag
841 ggaagcgata cctgttacgg tatttgcggg ccagccagtg agccagcttc cagaacacga
901 cacggtcgat ataactgaag actttggcct taaaatcaac gaactgatag aacatggccc
961 agcctttcag ttttcggttg agttgttcag ccatatcgac tttgctttca ctgtagttgc
1021 ctgataacag tgctgtcagc gatgcggcga agtttctggc tttctcctgc gggatcgttg
1081 agaccactcg catctcgcca taacgactgc gtttgcgaat gatcctgtgc cccagaaaga
1141 taaagccgtc attaacatgg gtgattttag tcttatccat gttcagcctg agtttcagac
1201 tgccttcgag cacaccccga cactcctccc tgatggcttc cgcctgtgct ttggtgcctt
1261 tgacgatgag gacaaaatca tcggcatagc ggcagtacgc caccgcgggt ttccactgcc
1321 agttttctct gaccgccgta cttcggcccc gttggatact gttattccag taccaccgat
1381 cttttctggc tttcccgctc aggtagcgct catgcaggta ttgatcgaac tcattcagca
1441 tgatgttcga taatagcggc gatataacac cgccctgtgg cacaccttca ctggccgccc
1501 gaaagagacc gacatcgata tgtcccgcct tgatggtttt ccacagcaga gtcatgaaac
1561 gtgcgtcact gatcctgcgg cgtacagcct tcatcagcag tcgatgatgt acggtgtcga
1621 agtaactgga caggtcgcct tcaatcaccc agcgtccccg ggtttcacca cagtctgtga
1681 gctgtaattt caccgtgcgg atcgcgtggt ggacactgcg ctcaggccgg aagccatatg
1741 agagcgtatg aaaatcactn tcccatatcg gctccatcgc catcagcatg gcccgctgaa
1801 caatacgatc ccgcaacgcg gggataccca gtggtcgcag tttgccgttg cttttaggga
1861 tgtaaacccg tctggcgggc aagggctggt agtggcntga gagtaattca tccctgagga
1921 tttgcagctc aacagccagt ctggcctgta gcattgtttt gttcacgcca tcaacgccgg
1981 gggtatgggc cccctttgat gaaagcgtga tccgcgccgc ttcagccagc cattctggtt
2041 gtgttatcag acgcagcagc cgttgaatcc gtagggacgg atcggtggct gcccatgtgg
2101 caagcttgcg ttgcatttcg ctgattatca aaggtcttca cctcgttagg tcagttaatt
2161 cacgtcgcaa acacattcaa actgcttccc ttcgccatgt aatgggcttt ccccatcgcg
2221 gactactacg gaagctccgc cagccagcgc gtcatcggag ccatgccccc ttaacatccg
2281 tcgctgacct tccccggttt acctgcctgg actcaggcat actgaggagg ctgcccgtcg
2341 cactctttat ccttgcttgc cgcaagttgg cagaagtcag caacgcaagt gtgatagacg
2401 ctgctgcccc ggtgtttcgc atacatgtca aaacaccttc gaccggcagt gcttacgtat
2461 cactgccagt tcctcctgca cggcctgtca gatcacgtag gccgtggtga cgttttcaac
2521 ccacagaggc ggattaacgg gttcatgttc ttcagccttt cagtacttaa ccttgaggat
2581 catctcggct tagtgatctc gcctcaatcc ccgttgtcag cgggttacat caccctgcgg
2641 gcatgccgca ggtcactgcc gctcaggttc tccaccgtca cacccggtgg gattgttggg
2701 tttctcatcg tgagttaccg gttcaatatt ccagacagac tcgcggttca tttaagcatc
2761 catgcccgcc ctgaactccg ggcacac
5' end
Intron on the sense strand
5' end
1 gtgtgcccgg agttcagggc gggcatggat gcttaaatga accgcgagtc tgtctggaat
61 attgaaccgg taactcacga tgagaaaccc aacaatccca ccgggtgtga cggtggagaa
121 cctgagcggc agtgacctgc ggcatgcccg cagggtgatg taacccgctg acaacgggga
181 ttgaggcgag atcactaagc cgagatgatc ctcaaggtta agtactgaaa ggctgaagaa
241 catgaacccg ttaatccgcc tctgtgggtt gaaaacgtca ccacggccta cgtgatctga
301 caggccgtgc aggaggaact ggcagtgata cgtaagcact gccggtcgaa ggtgttttga
361 catgtatgcg aaacaccggg gcagcagcgt ctatcacact tgcgttgctg acttctgcca
421 acttgcggca agcaaggata aagagtgcga cgggcagcct cctcagtatg cctgagtcca
481 ggcaggtaaa ccggggaagg tcagcgacgg atgttaaggg ggcatggctc cgatgacgcg
541 ctggctggcg gagcttccgt agtagtccgc gatggggaaa gcccattaca tggcgaaggg
601 aagcagtttg aatgtgtttg cgacgtgaat taactgacct aacgaggtga agacctttga
661 taatcagcga aatgcaacgc aagcttgcca catgggcagc caccgatccg tccctacgga
721 ttcaacggct gctgcgtctg ataacacaac cagaatggct ggctgaagcg gcgcggatca
781 cgctttcatc aaagggggcc catacccccg gcgttgatgg cgtgaacaaa acaatgctac
841 aggccagact ggctgttgag ctgcaaatcc tcagggatga attactctca ngccactacc
901 agcccttgcc cgccagacgg gtttacatcc ctaaaagcaa cggcaaactg cgaccactgg
961 gtatccccgc gttgcgggat cgtattgttc agcgggccat gctgatggcg atggagccga
1021 tatggganag tgattttcat acgctctcat atggcttccg gcctgagcgc agtgtccacc
1081 acgcgatccg cacggtgaaa ttacagctca cagactgtgg tgaaacccgg ggacgctggg
1141 tgattgaagg cgacctgtcc agttacttcg acaccgtaca tcatcgactg ctgatgaagg
1201 ctgtacgccg caggatcagt gacgcacgtt tcatgactct gctgtggaaa accatcaagg
1261 cgggacatat cgatgtcggt ctctttcggg cggccagtga aggtgtgcca cagggcggtg
1321 ttatatcgcc gctattatcg aacatcatgc tgaatgagtt cgatcaatac ctgcatgagc
1381 gctacctgag cgggaaagcc agaaaagatc ggtggtactg gaataacagt atccaacggg
1441 gccgaagtac ggcggtcaga gaaaactggc agtggaaacc cgcggtggcg tactgccgct
1501 atgccgatga ttttgtcctc atcgtcaaag gcaccaaagc acaggcggaa gccatcaggg
1561 aggagtgtcg gggtgtgctc gaaggcagtc tgaaactcag gctgaacatg gataagacta
1621 aaatcaccca tgttaatgac ggctttatct ttctggggca caggatcatt cgcaaacgca
1681 gtcgttatgg cgagatgcga gtggtctcaa cgatcccgca ggagaaagcc agaaacttcg
1741 ccgcatcgct gacagcactg ttatcaggca actacagtga aagcaaagtc gatatggctg
1801 aacaactcaa ccgaaaactg aaaggctggg ccatgttcta tcagttcgtt gattttaagg
1861 ccaaagtctt cagttatatc gaccgtgtcg tgttctggaa gctggctcac tggctggccc
1921 gcaaataccg taacaggtat cgcttccctg atgaggtggt ggtgtaaatc accgaaaccg
1981 ggtcagagca aaacgtgggt tttatttggt aaaaccaatc acggcaagct cagcggcgaa
2041 atactgtacc ggttggtggg gcaaggcaag aagctgttcc gctggcggct acccgaaggt
2101 aatccctatc tgaggacgga gaccagaaac acgtatacat cgcgctttac agaagtggca
2161 atggcgttcg ccagcattta aatggagagc cggatgcgct gaaaggtgca cgtccggttc
2221 ggggaggaga ggcagggaaa tagtccgact acgccctgcc tcttactctt ac
3' end
[Intron and flanking sequence]
121 ttccagcaat cgtcgattgt tataccagtc cacccacgtg agtgtggcca gctccacttc
181 tgtccggttt ttccagctct tacggtgtat tacctccgct ttgtaaaggc ccttgatgat
241 ctctgccatc gcgttgtcat acgagtcacc agtactccct gttgatgcca gcagttttgc
301 ttcttttagt cgcgccttat aggccagtga cacatactga gagcctttat cgctgtgatg
361 gatggtgcca gacggacgac gggcccacaa cgcctgctcc agtgcatcca gcacgaatgt
421 cgtttccata gacgatgaga ctcgtcaccn cacgatgtat ccggcaaaca catcaatgat
481 gaacgccaca tanacgaagc cctgccatgt gctgagtaag agtaagaggc agggcgtagt
541 cggactattt ccctgcctct cctccccgaa ccggacgtgc acctttcagc gcatccggct
601 ctccatttaa atgctggcga acgccattgc cacttctgta aagcgcgatg tatacgtgtt
661 tctggtctcc gtcctcagat agggattacc ttcgggtagc cgccagcgga acagcttctt
721 gccttgcccc accaaccggt acagtatttc gccgctgagc ttgccgtgat tggttttacc
781 aaataaaacc cacgttttgc tctgacccgg tttcggtgat ttacaccacc acctcatcag
841 ggaagcgata cctgttacgg tatttgcggg ccagccagtg agccagcttc cagaacacga
901 cacggtcgat ataactgaag actttggcct taaaatcaac gaactgatag aacatggccc
961 agcctttcag ttttcggttg agttgttcag ccatatcgac tttgctttca ctgtagttgc
1021 ctgataacag tgctgtcagc gatgcggcga agtttctggc tttctcctgc gggatcgttg
1081 agaccactcg catctcgcca taacgactgc gtttgcgaat gatcctgtgc cccagaaaga
1141 taaagccgtc attaacatgg gtgattttag tcttatccat gttcagcctg agtttcagac
1201 tgccttcgag cacaccccga cactcctccc tgatggcttc cgcctgtgct ttggtgcctt
1261 tgacgatgag gacaaaatca tcggcatagc ggcagtacgc caccgcgggt ttccactgcc
1321 agttttctct gaccgccgta cttcggcccc gttggatact gttattccag taccaccgat
1381 cttttctggc tttcccgctc aggtagcgct catgcaggta ttgatcgaac tcattcagca
1441 tgatgttcga taatagcggc gatataacac cgccctgtgg cacaccttca ctggccgccc
1501 gaaagagacc gacatcgata tgtcccgcct tgatggtttt ccacagcaga gtcatgaaac
1561 gtgcgtcact gatcctgcgg cgtacagcct tcatcagcag tcgatgatgt acggtgtcga
1621 agtaactgga caggtcgcct tcaatcaccc agcgtccccg ggtttcacca cagtctgtga
1681 gctgtaattt caccgtgcgg atcgcgtggt ggacactgcg ctcaggccgg aagccatatg
1741 agagcgtatg aaaatcactn tcccatatcg gctccatcgc catcagcatg gcccgctgaa
1801 caatacgatc ccgcaacgcg gggataccca gtggtcgcag tttgccgttg cttttaggga
1861 tgtaaacccg tctggcgggc aagggctggt agtggcntga gagtaattca tccctgagga
1921 tttgcagctc aacagccagt ctggcctgta gcattgtttt gttcacgcca tcaacgccgg
1981 gggtatgggc cccctttgat gaaagcgtga tccgcgccgc ttcagccagc cattctggtt
2041 gtgttatcag acgcagcagc cgttgaatcc gtagggacgg atcggtggct gcccatgtgg
2101 caagcttgcg ttgcatttcg ctgattatca aaggtcttca cctcgttagg tcagttaatt
2161 cacgtcgcaa acacattcaa actgcttccc ttcgccatgt aatgggcttt ccccatcgcg
2221 gactactacg gaagctccgc cagccagcgc gtcatcggag ccatgccccc ttaacatccg
2281 tcgctgacct tccccggttt acctgcctgg actcaggcat actgaggagg ctgcccgtcg
2341 cactctttat ccttgcttgc cgcaagttgg cagaagtcag caacgcaagt gtgatagacg
2401 ctgctgcccc ggtgtttcgc atacatgtca aaacaccttc gaccggcagt gcttacgtat
2461 cactgccagt tcctcctgca cggcctgtca gatcacgtag gccgtggtga cgttttcaac
2521 ccacagaggc ggattaacgg gttcatgttc ttcagccttt cagtacttaa ccttgaggat
2581 catctcggct tagtgatctc gcctcaatcc ccgttgtcag cgggttacat caccctgcgg
2641 gcatgccgca ggtcactgcc gctcaggttc tccaccgtca cacccggtgg gattgttggg
2701 tttctcatcg tgagttaccg gttcaatatt ccagacagac tcgcggttca tttaagcatc
2761 catgcccgcc ctgaactccg ggcacaccgt aagtaaaatc agccacccac agctggtcag
2821 gtcgttctgc cacgaactga cggtttacgc ggtcgcctgc ggcaaaggct ttccggctga
2881 tggtcgtacg gaccttttta ccccggagaa caccggcaag tcccataacc gccatgaggc
2941 gcgccactgt acatctggcc accctgattc cttcccgtaa caactgacgc cagactttac
3001 gcacaccgta taccttgtga ttttcatcgt atacgcgntg natctntttc ttcagccagt
MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQA
RLAVELQILRDELLSXHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEP
IWXSDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLL
MKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQ
YLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKA
QAEAIREECRGVLEGSLKLRLNMDKTKITHVNDGFIFLGHRIIRKRSRYGEMRVVSTI
PQEKARNFAASLTALLSGNYSESKVDMAEQLNRKLKGWAMFYQFVDFKAKVFSYIDRV
VFWKLAHWLARKYRXTGIASLMRWWCKSPKPGQSKTWVLFGKTNHGKLSGEILYRLVG
QGKKLFRWRLPEGNPYLRTETRNTYTSRFTEVAMAFASI

| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragments | Alignment of insertion sequence |
| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |