[Back to introns by organism]   [Back to home page]

Information of N.p.I1 intron  (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

[Intron sequence]

               

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

 

Intron on antisense strand

 

3' end

                                                a ttagagtcga tggggagtat
59281 tacctcccgc acctctcctt cagaaccgga cgtgtagctt tcgctacatc cggctcctgg
59341 atgttctcag cttgcgctgt gcctgagtgg atgtactggt ggcaatgcat gtgtatagcc
59401 acaagattgt tctttttcca gttgtcatgg tttccgtcct tatggtgtag gttaatatca
59461 tcaccaggta agaagttcaa cccgcaatat ccacaaatat ggttttgttt tctaagcgct
59521 gtagccgttt cgccaaaata tagtttgcat tgtctcttac tccagtaggc gatatcatcg
59581 tcaaatggcg acttaccccc tttaacgcat acgaacttgc cctctgtcca agagacgcta
59641 ggaaaggctt tcttaacgag agcttcagcc ctatatttgg tcatcttctt ttcggctaga
59701 aacttcttga aagttgcgtg gttcaagaac caaagtgaat ctttggtgct gtccatattg
59761 catctgtcat ggtagttccg ccaccccctg acgatgggag ctattttttg cgcttttgtt
59821 tccgcaccat aatgagaagt attagcaata cttttgactt tcttgcggaa tgctgtgtag
59881 ttctccactg agggagtaca gacaaacttg ccgcttctct gaaccttgaa gttccatcca
59941 aggaagtcaa acccgtcggt cgctggtgta acctttgtct tttctgggct aatttccatg
60001 ccgcgttcgg ctaaaaattc ctcaatgttc ttgatgaatt tgctagcgtc atcctggggt
60061 tgtaggaaaa taatcatgtc atcggcgtat cggacggcat aaccgcgtcg atagctctta
60121 ttgacatctt cgattccatt gagcgcaatg ttggctagaa gcggactgac cacaccacct
60181 tgcggtgttc cttgttcagg aaatttgagt gatattcctg cttttaggca tcggaatata
60241 ccgagtttta gcgcttgggg tgcgatgagc ctatccatga ttgccttgtg ggagatgcgg
60301 tcaaaacact tcttaatatc aagttcgata actcttttat ccttaccgtt gcaagaactt
60361 ctgagattaa tcctgatata ttgttgagcg tcttttgcac tcctgccggg tcgaaaacca
60421 tagctccttt cattgaaggt cgcttcatgg gctggttcga gtgctagttt tgctaaacac
60481 tgccatgccc tatcagatat agttggcact ttaagtattc tcaccttccc atttttttag
60541 gaatagggat ttctttcaat ccctggtgtt tccattcaaa tgctccgtgt tgcagtaatt
60601 gcagcagttg cattctttcg gtttgtgtca gtgctttttg ccgtctatac ctgccgttcg
60661 ttttccccta ttaagttggg taacttgacg gatagccaac agtttcgctg aggtggactt
60721 cagaattagc ttttgaagag atttggcttt taccagattg ccctctttca tggctttgta
60781 tattcttctt tgtaaccgga atacgttttt ctggattttc ttccagggaa acttattcca
60841 agattcacta gttaaataag aactgtgcct aatcat
ggtt tactcggttg aatgtactct
60901 gaacacccgt cagcagtttt gctgacatct tacccgtcac ttggaattcc gtttgacata
60961 acctactgag gttttcgacg attaaaataa tgcctcagac caatgtgacg tttttactcg
61021 ttccgtagct tagattgttg tggtagttta gagcagtcca attcgcctga ctctatctcg
61081 gaggattgcg gttagaggta cgatactacc gcgagatttt atgagccgtg gtctacattt
61141 tttagttcag gatgtagcca ttcccgtaag acgcttttct aggacttatt ttactcatgc
61201 tacctccccc ttaacgccag tgttgttacc tgcttgtttt acaactgatt gcgtcctgct
61261 ggcagcttca ctctgctggg atgtcgcccg ctcggtgtgc ccaggtaggg agtcacttca
61321 cctctgaagg gtagggtttg cacctacatc taaaaacagt tatagatttt caggctttaa
61381 tcgcctgcga atccacctaa ctcatacccc tctagtcact agaggctcct agttagaacg
61441 agccgcac

5' end  

 

Intron on sense strand

 

5' end

   1 gtgcggctcg ttctaactag gagcctctag tgactagagg ggtatgagtt aggtggattc 

  61 gcaggcgatt aaagcctgaa aatctataac tgtttttaga tgtaggtgca aaccctaccc 

 121 ttcagaggtg aagtgactcc ctacctgggc acaccgagcg ggcgacatcc cagcagagtg  

 181 aagctgccag caggacgcaa tcagttgtaa aacaagcagg taacaacact ggcgttaagg 

 241 gggaggtagc atgagtaaaa taagtcctag aaaagcgtct tacgggaatg gctacatcct 

 301 gaactaaaaa atgtagacca cggctcataa aatctcgcgg tagtatcgta cctctaaccg 

 361 caatcctccg agatagagtc aggcgaattg gactgctcta aactaccaca acaatctaag 

 421 ctacggaacg agtaaaaacg tcacattggt ctgaggcatt attttaatcg tcgaaaacct 

 481 cagtaggtta tgtcaaacgg aattccaagt gacgggtaag atgtcagcaa aactgctgac 

 541 gggtgttcag agtacattca accgagtaaa ccatgattag gcacagttct tatttaacta 

 601 gtgaatcttg gaataagttt ccctggaaga aaatccagaa aaacgtattc cggttacaaa 

 661 gaagaatata caaagccatg aaagagggca atctggtaaa agccaaatct cttcaaaagc 

 721 taattctgaa gtccacctca gcgaaactgt tggctatccg tcaagttacc caacttaata 

 781 ggggaaaacg aacggcaggt atagacggca aaaagcactg acacaaaccg aaagaatgca 

 841 actgctgcaa ttactgcaac acggagcatt tgaatggaaa caccagggat tgaaagaaat 

 901 ccctattcct aaaaaaatgg gaaggtgaga atacttaaag tgccaactat atctgatagg 

 961 gcatggcagt gtttagcaaa actagcactc gaaccagccc atgaagcgac cttcaatgaa 

1021 aggagctatg gttttcgacc cggcaggagt gcaaaagacg ctcaacaata tatcaggatt 

1081 aatctcagaa gttcttgcaa cggtaaggat aaaagagtta tcgaacttga tattaagaag 

1141 tgttttgacc gcatctccca caaggcaatc atggataggc tcatcgcacc ccaagcgcta 

1201 aaactcggta tattccgatg cctaaaagca ggaatatcac tcaaatttcc tgaacaagga 

1261 acaccgcaag gtggtgtggt cagtccgctt ctagccaaca ttgcgctcaa tggaatcgaa 

1321 gatgtcaata agagctatcg acgcggttat gccgtccgat acgccgatga catgattatt 

1381 ttcctacaac cccaggatga cgctagcaaa ttcatcaaga acattgagga atttttagcc 

1441 gaacgcggca tggaaattag cccagaaaag acaaaggtta caccagcgac cgacgggttt 

1501 gacttccttg gatggaactt caaggttcag agaagcggca agtttgtctg tactccctca 

1561 gtggagaact acacagcatt ccgcaagaaa gtcaaaagta ttgctaatac ttctcattat 

1621 ggtgcggaaa caaaagcgca aaaaatagct cccatcgtca gggggtggcg gaactaccat 

1681 gacagatgca atatggacag caccaaagat tcactttggt tcttgaacca cgcaactttc 

1741 aagaagtttc tagccgaaaa gaagatgacc aaatataggg ctgaagctct cgttaagaaa 

1801 gcctttccta gcgtctcttg gacagagggc aagttcgtat gcgttaaagg gggtaagtcg 

1861 ccatttgacg atgatatcgc ctactggagt aagagacaat gcaaactata ttttggcgaa 

1921 acggctacag cgcttagaaa acaaaaccat atttgtggat attgcgggtt gaacttctta 

1981 cctggtgatg atattaacct acaccataag gacggaaacc atgacaactg gaaaaagaac 

2041 aatcttgtgg ctatacacat gcattgccac cagtacatcc actcaggcac agcgcaagct 

2101 gagaacatcc aggagccgga tgtagcgaaa gctacacgtc cggttctgaa ggagaggtgc 

2161 gggaggtaat actccccatc gactctaat

3' end  

[top]


[Intron and flanking sequence]


58741 ctttattttt atatcttact tatcaaggag cagcatttaa tatgtcaaat ctcaacccca
58801 gtcaccctac caaacaactt ctagctggtt actgtggcat tatccttgga ggatttgggg
58861 ttcataaatt tattctagga tacgctccag aaggctttat catgttggtg gtttctgtag
58921 ttgcgggttc tttcacctac ggtattgcct tgttaattat gcaacttgta ggtttaatcg
58981 aaggcatgat ctatttgaat aagcctcctg aagaatttgt aaatacctac tttgtgaata
59041 agcagggctg gttctaaaaa cttaagattt cccattttgg tttctctgta aatttatata
59101 tcttaagtgt atctttagaa taagttagga ctgacgcaaa acgacaagtt ctagggtcaa
59161 taccctgggg aaaacgtttt gtgtctacat ggaaacaagg gtttcagcta ctgtttacat
59221 aagccctgta ggtgtttgaa ttatctcctt atcctttgta ttagagtcga tggggagtat
59281 tacctcccgc acctctcctt cagaaccgga cgtgtagctt tcgctacatc cggctcctgg
59341 atgttctcag cttgcgctgt gcctgagtgg atgtactggt ggcaatgcat gtgtatagcc
59401 acaagattgt tctttttcca gttgtcatgg tttccgtcct tatggtgtag gttaatatca
59461 tcaccaggta agaagttcaa cccgcaatat ccacaaatat ggttttgttt tctaagcgct
59521 gtagccgttt cgccaaaata tagtttgcat tgtctcttac tccagtaggc gatatcatcg
59581 tcaaatggcg acttaccccc tttaacgcat acgaacttgc cctctgtcca agagacgcta
59641 ggaaaggctt tcttaacgag agcttcagcc ctatatttgg tcatcttctt ttcggctaga
59701 aacttcttga aagttgcgtg gttcaagaac caaagtgaat ctttggtgct gtccatattg
59761 catctgtcat ggtagttccg ccaccccctg acgatgggag ctattttttg cgcttttgtt
59821 tccgcaccat aatgagaagt attagcaata cttttgactt tcttgcggaa tgctgtgtag
59881 ttctccactg agggagtaca gacaaacttg ccgcttctct gaaccttgaa gttccatcca
59941 aggaagtcaa acccgtcggt cgctggtgta acctttgtct tttctgggct aatttccatg
60001 ccgcgttcgg ctaaaaattc ctcaatgttc ttgatgaatt tgctagcgtc atcctggggt
60061 tgtaggaaaa taatcatgtc atcggcgtat cggacggcat aaccgcgtcg atagctctta
60121 ttgacatctt cgattccatt gagcgcaatg ttggctagaa gcggactgac cacaccacct
60181 tgcggtgttc cttgttcagg aaatttgagt gatattcctg cttttaggca tcggaatata
60241 ccgagtttta gcgcttgggg tgcgatgagc ctatccatga ttgccttgtg ggagatgcgg
60301 tcaaaacact tcttaatatc aagttcgata actcttttat ccttaccgtt gcaagaactt
60361 ctgagattaa tcctgatata ttgttgagcg tcttttgcac tcctgccggg tcgaaaacca
60421 tagctccttt cattgaaggt cgcttcatgg gctggttcga gtgctagttt tgctaaacac
60481 tgccatgccc tatcagatat agttggcact ttaagtattc tcaccttccc atttttttag
60541 gaatagggat ttctttcaat ccctggtgtt tccattcaaa tgctccgtgt tgcagtaatt
60601 gcagcagttg cattctttcg gtttgtgtca gtgctttttg ccgtctatac ctgccgttcg
60661 ttttccccta ttaagttggg taacttgacg gatagccaac agtttcgctg aggtggactt
60721 cagaattagc ttttgaagag atttggcttt taccagattg ccctctttca tggctttgta
60781 tattcttctt tgtaaccgga atacgttttt ctggattttc ttccagggaa acttattcca
60841 agattcacta gttaaataag aactgtgcct aatcatggtt tactcggttg aatgtactct
60901 gaacacccgt cagcagtttt gctgacatct tacccgtcac ttggaattcc gtttgacata
60961 acctactgag gttttcgacg attaaaataa tgcctcagac caatgtgacg tttttactcg
61021 ttccgtagct tagattgttg tggtagttta gagcagtcca attcgcctga ctctatctcg
61081 gaggattgcg gttagaggta cgatactacc gcgagatttt atgagccgtg gtctacattt
61141 tttagttcag gatgtagcca ttcccgtaag acgcttttct aggacttatt ttactcatgc
61201 tacctccccc ttaacgccag tgttgttacc tgcttgtttt acaactgatt gcgtcctgct
61261 ggcagcttca ctctgctggg atgtcgcccg ctcggtgtgc ccaggtaggg agtcacttca
61321 cctctgaagg gtagggtttg cacctacatc taaaaacagt tatagatttt caggctttaa
61381 tcgcctgcga atccacctaa ctcatacccc tctagtcact agaggctcct agttagaacg
61441 agccgcac
ca aaccaaggga aattagtttg ctggggttta gtaggattct cctgtaaccc
61501 ctaattggta cttattttta tcaccaaggt tatagagcaa tacagttcag ataagaccaa
61561 aatacttgta gagacggcga tttatcgcgt cttaaaaccc acgattttgt agaagtagcg
61621 cttaacccaa acgtattggg ttatagaata agattcttag tttgtccaat ccagcatttg
61681 actggaattc catgttcaaa ttgtgggatg actcgctcct ttatggctat tgcccaaggc
61741 aatttgcttg aagcactggc agaaaattta ttcgatccac ttttatttgc cagttttgtg
61801 attgtggcag ttcatattac tctgaaacta gtcacaaaaa gccgaattac agcattttat
61861 tgtcatccag taaaacaaag aaaattacag ataatagggt tatttatact tttgatttac
61921 tattttctcc tcctgtataa actatcacag acaggagaaa tgtattctta ttttattaac

[top]


[ORF sequence]

 

MIRHSSYLTSESWNKFPWKKIQKNVFRLQRRIYKAMKEGNLVKAKSLQKLILKSTSAK

LLAIRQVTQLNRGKRTAGIDGKKHXTQTERMQLLQLLQHGAFEWKHQGLKEIPIPKXK

NGKVRILKVPTISDRAWQCLAKLALEPAHEATFNERSYGFRPGRSAKDAQQYIRINLR

SSCNGKDKRVIELDIKKCFDRISHKAIMDRLIAPQALKLGIFRCLKAGISLKFPEQGT

PQGGVVSPLLANIALNGIEDVNKSYRRGYAVRYADDMIIFLQPQDDASKFIKNIEEFL

AERGMEISPEKTKVTPATDGFDFLGWNFKVQRSGKFVCTPSVENYTAFRKKVKSIANT

SHYGAETKAQKIAPIVRGWRNYHDRCNMDSTKDSLWFLNHATFKKFLAEKKMTKYRAE

ALVKKAFPSVSWTEGKFVCVKGGKSPFDDDIAYWSKRQCKLYFGETATALRKQNHICG

YCGLNFLPGDDINLHHKDGNHDNWKKNNLVAIHMHCHQYIHSGTAQAENIQEPDVAKA

TRPVLKERCGR

top]


[Secondary structure]

 

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |