[Back to introns by organism]   [Back to home page]

Information of S.f.I1 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

Note: Redundant intron copies found in

AF200692 Shigella flexneri (21025-23296)

AE015311 Shigella flexneri (585-2272)

AE016988 Shigella flexneri (150712-152981)

 

[Intron sequence]

               

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

 

Intron on the antisense strand

 

3' end                                          

                                           gtaag agtaagaggc agggcgtagt

 541 cggactattt ccctgcctct cctccccgaa ccggacgtgc acctttcagc gcatccggct

 601 ctccatttaa atgctggcga acgccattgc cacttctgta aagcgcgatg tatacgtgtt

 661 tctggtctcc gtcctcagat agggattacc ttcgggtagc cgccagcgga acagcttctt

 721 gccttgcccc accaaccggt acagtatttc gccgctgagc ttgccgtgat tggttttacc

 781 aaataaaacc cacgttttgc tctgacccgg tttcggtgat ttacaccacc acctcatcag

 841 ggaagcgata cctgttacgg tatttgcggg ccagccagtg agccagcttc cagaacacga

 901 cacggtcgat ataactgaag actttggcct taaaatcaac gaactgatag aacatggccc

 961 agcctttcag ttttcggttg agttgttcag ccatatcgac tttgctttca ctgtagttgc

1021 ctgataacag tgctgtcagc gatgcggcga agtttctggc tttctcctgc gggatcgttg

1081 agaccactcg catctcgcca taacgactgc gtttgcgaat gatcctgtgc cccagaaaga

1141 taaagccgtc attaacatgg gtgattttag tcttatccat gttcagcctg agtttcagac

1201 tgccttcgag cacaccccga cactcctccc tgatggcttc cgcctgtgct ttggtgcctt

1261 tgacgatgag gacaaaatca tcggcatagc ggcagtacgc caccgcgggt ttccactgcc

1321 agttttctct gaccgccgta cttcggcccc gttggatact gttattccag taccaccgat

1381 cttttctggc tttcccgctc aggtagcgct catgcaggta ttgatcgaac tcattcagca

1441 tgatgttcga taatagcggc gatataacac cgccctgtgg cacaccttca ctggccgccc

1501 gaaagagacc gacatcgata tgtcccgcct tgatggtttt ccacagcaga gtcatgaaac

1561 gtgcgtcact gatcctgcgg cgtacagcct tcatcagcag tcgatgatgt acggtgtcga

1621 agtaactgga caggtcgcct tcaatcaccc agcgtccccg ggtttcacca cagtctgtga

1681 gctgtaattt caccgtgcgg atcgcgtggt ggacactgcg ctcaggccgg aagccatatg

1741 agagcgtatg aaaatcactn tcccatatcg gctccatcgc catcagcatg gcccgctgaa

1801 caatacgatc ccgcaacgcg gggataccca gtggtcgcag tttgccgttg cttttaggga

1861 tgtaaacccg tctggcgggc aagggctggt agtggcntga gagtaattca tccctgagga

1921 tttgcagctc aacagccagt ctggcctgta gcattgtttt gttcacgcca tcaacgccgg

1981 gggtatgggc cccctttgat gaaagcgtga tccgcgccgc ttcagccagc cattctggtt

2041 gtgttatcag acgcagcagc cgttgaatcc gtagggacgg atcggtggct gcccatgtgg

2101 caagcttgcg ttgcatttcg ctgattatca aaggtcttca cctcgttagg tcagttaatt

2161 cacgtcgcaa acacattcaa actgcttccc ttcgccatgt aatgggcttt ccccatcgcg

2221 gactactacg gaagctccgc cagccagcgc gtcatcggag ccatgccccc ttaacatccg

2281 tcgctgacct tccccggttt acctgcctgg actcaggcat actgaggagg ctgcccgtcg

2341 cactctttat ccttgcttgc cgcaagttgg cagaagtcag caacgcaagt gtgatagacg

2401 ctgctgcccc ggtgtttcgc atacatgtca aaacaccttc gaccggcagt gcttacgtat

2461 cactgccagt tcctcctgca cggcctgtca gatcacgtag gccgtggtga cgttttcaac

2521 ccacagaggc ggattaacgg gttcatgttc ttcagccttt cagtacttaa ccttgaggat

2581 catctcggct tagtgatctc gcctcaatcc ccgttgtcag cgggttacat caccctgcgg

2641 gcatgccgca ggtcactgcc gctcaggttc tccaccgtca cacccggtgg gattgttggg

2701 tttctcatcg tgagttaccg gttcaatatt ccagacagac tcgcggttca tttaagcatc

2761 catgcccgcc ctgaactccg ggcacac

5' end  

 

Intron on the sense strand

 

5' end  

   1 gtgtgcccgg agttcagggc gggcatggat gcttaaatga accgcgagtc tgtctggaat 

  61 attgaaccgg taactcacga tgagaaaccc aacaatccca ccgggtgtga cggtggagaa 

 121 cctgagcggc agtgacctgc ggcatgcccg cagggtgatg taacccgctg acaacgggga 

 181 ttgaggcgag atcactaagc cgagatgatc ctcaaggtta agtactgaaa ggctgaagaa 

 241 catgaacccg ttaatccgcc tctgtgggtt gaaaacgtca ccacggccta cgtgatctga 

 301 caggccgtgc aggaggaact ggcagtgata cgtaagcact gccggtcgaa ggtgttttga 

 361 catgtatgcg aaacaccggg gcagcagcgt ctatcacact tgcgttgctg acttctgcca 

 421 acttgcggca agcaaggata aagagtgcga cgggcagcct cctcagtatg cctgagtcca 

 481 ggcaggtaaa ccggggaagg tcagcgacgg atgttaaggg ggcatggctc cgatgacgcg 

 541 ctggctggcg gagcttccgt agtagtccgc gatggggaaa gcccattaca tggcgaaggg 

 601 aagcagtttg aatgtgtttg cgacgtgaat taactgacct aacgaggtga agacctttga 

 661 taatcagcga aatgcaacgc aagcttgcca catgggcagc caccgatccg tccctacgga 

 721 ttcaacggct gctgcgtctg ataacacaac cagaatggct ggctgaagcg gcgcggatca 

 781 cgctttcatc aaagggggcc catacccccg gcgttgatgg cgtgaacaaa acaatgctac 

 841 aggccagact ggctgttgag ctgcaaatcc tcagggatga attactctca ngccactacc 

 901 agcccttgcc cgccagacgg gtttacatcc ctaaaagcaa cggcaaactg cgaccactgg 

 961 gtatccccgc gttgcgggat cgtattgttc agcgggccat gctgatggcg atggagccga 

1021 tatggganag tgattttcat acgctctcat atggcttccg gcctgagcgc agtgtccacc 

1081 acgcgatccg cacggtgaaa ttacagctca cagactgtgg tgaaacccgg ggacgctggg 

1141 tgattgaagg cgacctgtcc agttacttcg acaccgtaca tcatcgactg ctgatgaagg 

1201 ctgtacgccg caggatcagt gacgcacgtt tcatgactct gctgtggaaa accatcaagg 

1261 cgggacatat cgatgtcggt ctctttcggg cggccagtga aggtgtgcca cagggcggtg 

1321 ttatatcgcc gctattatcg aacatcatgc tgaatgagtt cgatcaatac ctgcatgagc 

1381 gctacctgag cgggaaagcc agaaaagatc ggtggtactg gaataacagt atccaacggg 

1441 gccgaagtac ggcggtcaga gaaaactggc agtggaaacc cgcggtggcg tactgccgct 

1501 atgccgatga ttttgtcctc atcgtcaaag gcaccaaagc acaggcggaa gccatcaggg 

1561 aggagtgtcg gggtgtgctc gaaggcagtc tgaaactcag gctgaacatg gataagacta 

1621 aaatcaccca tgttaatgac ggctttatct ttctggggca caggatcatt cgcaaacgca 

1681 gtcgttatgg cgagatgcga gtggtctcaa cgatcccgca ggagaaagcc agaaacttcg 

1741 ccgcatcgct gacagcactg ttatcaggca actacagtga aagcaaagtc gatatggctg 

1801 aacaactcaa ccgaaaactg aaaggctggg ccatgttcta tcagttcgtt gattttaagg 

1861 ccaaagtctt cagttatatc gaccgtgtcg tgttctggaa gctggctcac tggctggccc 

1921 gcaaataccg taacaggtat cgcttccctg atgaggtggt ggtgtaaatc accgaaaccg 

1981 ggtcagagca aaacgtgggt tttatttggt aaaaccaatc acggcaagct cagcggcgaa 

2041 atactgtacc ggttggtggg gcaaggcaag aagctgttcc gctggcggct acccgaaggt 

2101 aatccctatc tgaggacgga gaccagaaac acgtatacat cgcgctttac agaagtggca 

2161 atggcgttcg ccagcattta aatggagagc cggatgcgct gaaaggtgca cgtccggttc 

2221 ggggaggaga ggcagggaaa tagtccgact acgccctgcc tcttactctt ac

3' end  

[top]


[Intron and flanking sequence]

 

121  ttccagcaat cgtcgattgt tataccagtc cacccacgtg agtgtggcca gctccacttc

181  tgtccggttt ttccagctct tacggtgtat tacctccgct ttgtaaaggc ccttgatgat

241  ctctgccatc gcgttgtcat acgagtcacc agtactccct gttgatgcca gcagttttgc

301  ttcttttagt cgcgccttat aggccagtga cacatactga gagcctttat cgctgtgatg

361  gatggtgcca gacggacgac gggcccacaa cgcctgctcc agtgcatcca gcacgaatgt

421  cgtttccata gacgatgaga ctcgtcaccn cacgatgtat ccggcaaaca catcaatgat

481  gaacgccaca tanacgaagc cctgccatgt gctgagtaag agtaagaggc agggcgtagt

541  cggactattt ccctgcctct cctccccgaa ccggacgtgc acctttcagc gcatccggct

601  ctccatttaa atgctggcga acgccattgc cacttctgta aagcgcgatg tatacgtgtt

661  tctggtctcc gtcctcagat agggattacc ttcgggtagc cgccagcgga acagcttctt

721  gccttgcccc accaaccggt acagtatttc gccgctgagc ttgccgtgat tggttttacc

781  aaataaaacc cacgttttgc tctgacccgg tttcggtgat ttacaccacc acctcatcag

841  ggaagcgata cctgttacgg tatttgcggg ccagccagtg agccagcttc cagaacacga

901  cacggtcgat ataactgaag actttggcct taaaatcaac gaactgatag aacatggccc

961  agcctttcag ttttcggttg agttgttcag ccatatcgac tttgctttca ctgtagttgc

1021 ctgataacag tgctgtcagc gatgcggcga agtttctggc tttctcctgc gggatcgttg

1081 agaccactcg catctcgcca taacgactgc gtttgcgaat gatcctgtgc cccagaaaga

1141 taaagccgtc attaacatgg gtgattttag tcttatccat gttcagcctg agtttcagac

1201 tgccttcgag cacaccccga cactcctccc tgatggcttc cgcctgtgct ttggtgcctt

1261 tgacgatgag gacaaaatca tcggcatagc ggcagtacgc caccgcgggt ttccactgcc

1321 agttttctct gaccgccgta cttcggcccc gttggatact gttattccag taccaccgat

1381 cttttctggc tttcccgctc aggtagcgct catgcaggta ttgatcgaac tcattcagca

1441 tgatgttcga taatagcggc gatataacac cgccctgtgg cacaccttca ctggccgccc

1501 gaaagagacc gacatcgata tgtcccgcct tgatggtttt ccacagcaga gtcatgaaac

1561 gtgcgtcact gatcctgcgg cgtacagcct tcatcagcag tcgatgatgt acggtgtcga

1621 agtaactgga caggtcgcct tcaatcaccc agcgtccccg ggtttcacca cagtctgtga

1681 gctgtaattt caccgtgcgg atcgcgtggt ggacactgcg ctcaggccgg aagccatatg

1741 agagcgtatg aaaatcactn tcccatatcg gctccatcgc catcagcatg gcccgctgaa

1801 caatacgatc ccgcaacgcg gggataccca gtggtcgcag tttgccgttg cttttaggga

1861 tgtaaacccg tctggcgggc aagggctggt agtggcntga gagtaattca tccctgagga

1921 tttgcagctc aacagccagt ctggcctgta gcattgtttt gttcacgcca tcaacgccgg

1981 gggtatgggc cccctttgat gaaagcgtga tccgcgccgc ttcagccagc cattctggtt

2041 gtgttatcag acgcagcagc cgttgaatcc gtagggacgg atcggtggct gcccatgtgg

2101 caagcttgcg ttgcatttcg ctgattatca aaggtcttca cctcgttagg tcagttaatt

2161 cacgtcgcaa acacattcaa actgcttccc ttcgccatgt aatgggcttt ccccatcgcg

2221 gactactacg gaagctccgc cagccagcgc gtcatcggag ccatgccccc ttaacatccg

2281 tcgctgacct tccccggttt acctgcctgg actcaggcat actgaggagg ctgcccgtcg

2341 cactctttat ccttgcttgc cgcaagttgg cagaagtcag caacgcaagt gtgatagacg

2401 ctgctgcccc ggtgtttcgc atacatgtca aaacaccttc gaccggcagt gcttacgtat

2461 cactgccagt tcctcctgca cggcctgtca gatcacgtag gccgtggtga cgttttcaac

2521 ccacagaggc ggattaacgg gttcatgttc ttcagccttt cagtacttaa ccttgaggat

2581 catctcggct tagtgatctc gcctcaatcc ccgttgtcag cgggttacat caccctgcgg

2641 gcatgccgca ggtcactgcc gctcaggttc tccaccgtca cacccggtgg gattgttggg

2701 tttctcatcg tgagttaccg gttcaatatt ccagacagac tcgcggttca tttaagcatc

2761 catgcccgcc ctgaactccg ggcacaccgt aagtaaaatc agccacccac agctggtcag

2821 gtcgttctgc cacgaactga cggtttacgc ggtcgcctgc ggcaaaggct ttccggctga

2881 tggtcgtacg gaccttttta ccccggagaa caccggcaag tcccataacc gccatgaggc

2941 gcgccactgt acatctggcc accctgattc cttcccgtaa caactgacgc cagactttac

3001 gcacaccgta taccttgtga ttttcatcgt atacgcgntg natctntttc ttcagccagt

[top]


[ORF sequence]

 

MQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQA

RLAVELQILRDELLSXHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEP

IWXSDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLL

MKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQ

YLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKA

QAEAIREECRGVLEGSLKLRLNMDKTKITHVNDGFIFLGHRIIRKRSRYGEMRVVSTI

PQEKARNFAASLTALLSGNYSESKVDMAEQLNRKLKGWAMFYQFVDFKAKVFSYIDRV

VFWKLAHWLARKYRXTGIASLMRWWCKSPKPGQSKTWVLFGKTNHGKLSGEILYRLVG

QGKKLFRWRLPEGNPYLRTETRNTYTSRFTEVAMAFASI 

top]


[Secondary structure]

[top]


 

| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |