[Back to introns by organism] [Back to home page]
Information for Ph.p.I1 intron (Format of information for each intron)
[Intron and flanking sequence]
Sequence from Genbank entry (intron is on the antisense strand).
The boundaries of the intron are marked as red and ORF is marked
as blue, with start and stop codons underlined.
Insertion in ORF is in green.
5' end
gtac gcccagcatg
ggcatgaact aatgaggtga
156421 aagtcctctg taggaaaatc accctttaat attcaactcg ttgatattaa atgttaacta
156481 ctagcgaatg gcaagggctt ttccgcgagg aacggtctga aggaagccgc tagcaaaact
156541 gcgagctgat gaacaagaac atcatatgag gcgtaggtca gaggcgagtt ggcacaagac
156601 gacgaagcca agtgatttaa tgaccaccgt aaatgatgca gttgcgcagt gaaagtccat
156661 gtccttatct ggggagatct gcttaacatg cgatcgttaa cctaaccagc ttggcctctg
156721 gttcaacagg cctggcgtgt aagcgtcatc aacttacgcg agcgaaaccg atgaacaacg
156781 acagcgcttt acgaggtaac tcgttaagtg attaagcaga agtcagcaga cggcatagta
156841 gccaaatgcc catcgtaatg gttgggacac ggtgaaggcc tgaacagtta gaaaaaggag
156901 gagccttgcc acatttgaaa acacgaatgc
cgaccggtag tgacgcgatt cagcatgctg
156961 atatgaagcc agcctttaac caaaatctat tcgaacaaac cttgcagcga gataatttac
157021 aagccgcatg gaaacgtgtt cgagccaata aaggggctgc tggcgtcgat agcatgacta
157081 tcagcgaatt ccccgactgg gttaaatcag gccaatggga cagagttgaa accgttctgc
157141 gagcagagca gtatcaaccc tcacccgtca gacgtgttga gatagataaa cccgatggcg
157201 gtaaacgcca actgggtatc caaaccgtca tttaccgggt gatccaacaa gccatcgctc
157261 aggtactgac acccatcttt gaccccgatt tctcgaacaa cagttacgga tttcgtccgg
157321 gtcgaaacgg gcagcaagcc gtcaggcagg ttcaaagcat
cattaagcaa catgcctgtg
157381 cacttagctc agtacgaatg tataggtgcg tgaataattg cgaggtttaa taacctatta
157441 gggatctctc tttccatttt ccttcgattg aagcggtaac agaattcatc aaggtattct
157501 tgcaagtatt gtcccgacat accatgaaaa gtgccaagca aaaatgtttt taagttaccg
157561 atggctatat gaacccaggg aagccactca tcaaccaact cactgggagt aaccttggct
157621 tcatgttgtt gagtgttgtc tataatattc agcgcaggta gcgcatcagt atggacttct
157681 tgctgctcat ttaagtgctt agcaacaaac ttgttcactg tatcatggca aacactgtct
157741 actgcctgca ttgcaataaa accggctctt ttgcctttgc tttcaaccgc tactataacg
157801 ggagtctttc cttcagcgcc acggccacgc ttaccttttc tcctgcctcc taccaaggcg
157861 tcatcaattt ctataacacc tgaaagccga tacaggctat ctctatgacc cattgctgtt
157921 ctcaatttac tcagaatcaa tcgtgccgtt cgccagttaa cctcgatgag cttgctaagt
157981 cttaatgctg aaatgctgcc tttatctgag cctagaaagt aaatagccca gaaccattta
158041 gttaacggaa tacggctacc atgaaataag gtgtcggcgg ttatcgaggt ttgtttatga
158101 cattggctgc actcataggt attacgagtt gtcacgtcat atccgtggtc acaaccacac
158161 ctagggcaaa caaagccatt aggccatctc atctgcttaa ggtgatttaa gcaatcagct
158221 tcagtcccaa actggcgttg ccattcaaaa aagctagctt ctggcatttt catagcacac
158281 tgcactttta agcgtgattg atactaagaa tataggccaa actgactcat gtgcagaggc
158341 atgttaagca aagacgtcat tacgcagtcg atgttgattt gtcgaaattc tttgaccgag
158401 tcaaccacga cctattgatg actcgccttg gctacaaagt gaaagataag cgtctactta
158461 agttaattag tcgatatctt agagctggcg tcatttgtca atcaaaaggc gataacccgc
158521 tttatatgaa gagtcgagaa ggcgtccctc aaggaggccc attgtcacca ttattggcga
158581 acatcatgct cgatctgctc gacaaagaac ttgagaaacg aggacataaa tttgccagat
158641 acgccgatga cttcaccatt ctggtgaaaa gtcagcatgc gggccaacgc gtgctcctaa
158701 gtatcagtcg ctatttgcaa aaccgcctga aactcacggt aaacacaacc aaaagtcacg
158761 ttgtcaaaac cactgaaagc aaatttctgg gattcacatt ccgagcaaaa cgtattcaat
158821 ggcatcctaa aacactgctg aaatttaagc agcaagttag gcgactgaca aaccgcaact
158881 ggggagtatc gatgaaatat caactcttca aaaccagcca atatctacga ggttggatta
158941 attattttgg tatcgccaac tgctatcaac gctgtgtcga acttgatcat tggatcaggc
159001 gcagagttcg aatggcttat tggcgacagt ggcgaaagcc tcgcactaaa gtaaaaagcc
159061 tgttgaatcg aggcgttcgg attcaatcag ccgttgcgtg tggcattacc agtaaaggcc
159121 catggcgaag ttcaaagaca ccgggaatac agcaagcatt atccaatgct tacctaagat
159181 ctcaggggtt ggttgaacta cgtgatggat ggattagtct tcaccattct aagtgaaacg
159241 ccctgtgcgg atccgcatgc agggtgttgt gggggctgag ggttaaagac cctcggctac
159301 ccgat
3' end
[top]
[Intron and flanking sequence]
155941
cataacactg cgtttattct ctatatttca ataattcaag cacttgctgt tgagtttcgg
156001 cactaaactt atacaaatat tcatccgata aaataccata taatcgcatc gaaacgagct
156061 taaaaagtgc attgctgcgc tttaagtcat cgaacatttc accgattttg ttattttcag
156121 tttgaattat ctgccatagc tcaatatatg aagcatgctc attatcaact cgactttgag
156181 aaatacggtc aactttttta aatatattgt cacaagcaaa agctagaagc tcttccttcg
156241 agttacgaag tactttccaa tccttttctg gtattgagtt catagttcct ccttttgagt
156301 gctaacgcct taataactgg cgaaaaattg tgggctaaaa tcgcgcagcg atagcccgcg
156361 gtttttagtc cagttgatta acttgtgtac gcccagcatg
ggcatgaact aatgaggtga
156421 aagtcctctg taggaaaatc accctttaat attcaactcg ttgatattaa atgttaacta
156481 ctagcgaatg gcaagggctt ttccgcgagg aacggtctga aggaagccgc tagcaaaact
156541 gcgagctgat gaacaagaac atcatatgag gcgtaggtca gaggcgagtt ggcacaagac
156601 gacgaagcca agtgatttaa tgaccaccgt aaatgatgca gttgcgcagt gaaagtccat
156661 gtccttatct ggggagatct gcttaacatg cgatcgttaa cctaaccagc ttggcctctg
156721 gttcaacagg cctggcgtgt aagcgtcatc aacttacgcg agcgaaaccg atgaacaacg
156781 acagcgcttt acgaggtaac tcgttaagtg attaagcaga agtcagcaga cggcatagta
156841 gccaaatgcc catcgtaatg gttgggacac ggtgaaggcc tgaacagtta gaaaaaggag
156901 gagccttgcc acatttgaaa acacgaatgc cgaccggtag tgacgcgatt cagcatgctg
156961 atatgaagcc agcctttaac caaaatctat tcgaacaaac cttgcagcga gataatttac
157021 aagccgcatg gaaacgtgtt cgagccaata aaggggctgc tggcgtcgat agcatgacta
157081 tcagcgaatt ccccgactgg gttaaatcag gccaatggga cagagttgaa accgttctgc
157141 gagcagagca gtatcaaccc tcacccgtca gacgtgttga gatagataaa cccgatggcg
157201 gtaaacgcca actgggtatc caaaccgtca tttaccgggt gatccaacaa gccatcgctc
157261 aggtactgac acccatcttt gaccccgatt tctcgaacaa cagttacgga tttcgtccgg
157321 gtcgaaacgg gcagcaagcc gtcaggcagg ttcaaagcat cattaagcaa catgcctgtg
157381 cacttagctc agtacgaatg tataggtgcg tgaataattg cgaggtttaa taacctatta
157441 gggatctctc tttccatttt ccttcgattg aagcggtaac agaattcatc aaggtattct
157501 tgcaagtatt gtcccgacat accatgaaaa gtgccaagca aaaatgtttt taagttaccg
157561 atggctatat gaacccaggg aagccactca tcaaccaact cactgggagt aaccttggct
157621 tcatgttgtt gagtgttgtc tataatattc agcgcaggta gcgcatcagt atggacttct
157681 tgctgctcat ttaagtgctt agcaacaaac ttgttcactg tatcatggca aacactgtct
157741 actgcctgca ttgcaataaa accggctctt ttgcctttgc tttcaaccgc tactataacg
157801 ggagtctttc cttcagcgcc acggccacgc ttaccttttc tcctgcctcc taccaaggcg
157861 tcatcaattt ctataacacc tgaaagccga tacaggctat ctctatgacc cattgctgtt
157921 ctcaatttac tcagaatcaa tcgtgccgtt cgccagttaa cctcgatgag cttgctaagt
157981 cttaatgctg aaatgctgcc tttatctgag cctagaaagt aaatagccca gaaccattta
158041 gttaacggaa tacggctacc atgaaataag gtgtcggcgg ttatcgaggt ttgtttatga
158101 cattggctgc actcataggt attacgagtt gtcacgtcat atccgtggtc acaaccacac
158161 ctagggcaaa caaagccatt aggccatctc atctgcttaa ggtgatttaa gcaatcagct
158221 tcagtcccaa actggcgttg ccattcaaaa aagctagctt ctggcatttt catagcacac
158281 tgcactttta agcgtgattg atactaagaa tataggccaa actgactcat gtgcagaggc
158341 atgttaagca aagacgtcat tacgcagtcg atgttgattt gtcgaaattc tttgaccgag
158401 tcaaccacga cctattgatg actcgccttg gctacaaagt gaaagataag cgtctactta
158461 agttaattag tcgatatctt agagctggcg tcatttgtca atcaaaaggc gataacccgc
158521 tttatatgaa gagtcgagaa ggcgtccctc aaggaggccc attgtcacca ttattggcga
158581 acatcatgct cgatctgctc gacaaagaac ttgagaaacg aggacataaa tttgccagat
158641 acgccgatga cttcaccatt ctggtgaaaa gtcagcatgc gggccaacgc gtgctcctaa
158701 gtatcagtcg ctatttgcaa aaccgcctga aactcacggt aaacacaacc aaaagtcacg
158761 ttgtcaaaac cactgaaagc aaatttctgg gattcacatt ccgagcaaaa cgtattcaat
158821 ggcatcctaa aacactgctg aaatttaagc agcaagttag gcgactgaca aaccgcaact
158881 ggggagtatc gatgaaatat caactcttca aaaccagcca atatctacga ggttggatta
158941 attattttgg tatcgccaac tgctatcaac gctgtgtcga acttgatcat tggatcaggc
159001 gcagagttcg aatggcttat tggcgacagt ggcgaaagcc tcgcactaaa gtaaaaagcc
159061 tgttgaatcg aggcgttcgg attcaatcag ccgttgcgtg tggcattacc agtaaaggcc
159121 catggcgaag ttcaaagaca ccgggaatac agcaagcatt atccaatgct tacctaagat
159181 ctcaggggtt ggttgaacta cgtgatggat ggattagtct tcaccattct aagtgaaacg
159241 ccctgtgcgg atccgcatgc agggtgttgt gggggctgag ggttaaagac cctcggctac
159301 ccgattatgt ttatttcaag cgaagtacaa cagccaacaa gtttttcgat gctacagtca
159361 aattttgcag cccatcacca atgaacagat aattaccaaa aacatctgat ttaccagcac
159421 ttctagctgt atcctggtga ataatgaaca gtttttctat tggtttaatt tgtcttgttg
159481 ataagtttgc ctttagcatg attagatcat tagtcgataa cggattcgag taataaccgc
159541 tttccaaatc tactaaacct ttgattaatt tagttcttac cggcataaca atgttatatt
159601 cttttcgttt atctcttcca agcgcaaacc gattaccgat tagtaaccca actaaagttc
159661 caagaaatcc agataataat gcttccgtca tatgactcct tacttattat ataaacataa
[top]
Insertion in ORF (insertion within /// boundaries)
MPTGSDAIQHADMKPAFNQNLFEQTLQRDNLQAAWKRVRANKGAAGVDSMTISEFPDW
VKSGQWDRVETVLRAEQYQPSPVRRVEIDKPDGGKRQLGIQTVIYRVIQQAIAQVLTP
IFDPDFSNNSYGFRPGRNGQQAVRQV///QSIIKQHACALSSVRMYRCVNNCEV**PI
RDLSFHFPSIEAVTEFIKVFLQVLSRHTMKSAKQKCF*VTDGYMNPGKPLINQLTGSN
LGFMLLSVVYNIQRR*RISMDFLLLI*VLSNKLVHCIMANTVYCLHCNKTGSFAFAFN
RYYNGSLSFSATATLTFSPASYQGVINFYNT*KPIQAISMTHCCSQFTQNQSCRSPVN
LDELAKS*C*NAAFI*A*KVNSPEPFS*RNTATMK*GVGGYRGLFMTLAALIGITSCH
VISVVTTTPRANKAIRPSHLLVKI*AISFSPKLALPFKKASFWHFHSTLHF*A*LILR
I*AKLTHV///QRHVKQRRHYAVDVDLSKFFDRVNHDLLMTRLGYKVKDKRLLKLISR
YLRAGVICQSKGDNPLYMKSREGVPQGGPLSPLLANIMLDLLDKELEKRGHKFARYAD
DFTILVKSQHAGQRVLLSISRYLQNRLKLTVNTTKSHVVKTTESKFLGFTFRAKRIQW
HPKTLLKFKQQVRRLTNRNWGVSMKYQLFKTSQYLRGWINYFGIANCYQRCVELDHWI
RRRVRMAYWRQWRKPRTKVKSLLNRGVRIQSAVACGITSKGPWRSSKTPGIQQALSNA
YLRSQGLVELRDGWISLHHSK
[top]

[top]
| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragments | Alignment of insertion sequence |
| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |