[Back to introns by organism]   [Back to home page]

Information of N.e.I1-1 intron  (Format of information for each intron)

[Intron sequence]

[Intron with flanking sequence]

[ORF sequence]

[Secondary structure]

Note: Redundant intron copies found in

BX321863 Nitrosomonas europaea (244947-246953)

 

Note: Multiple Insertions

N.e.I1-1        NZ_AAAY01000001 (2285101-2287107)

N.e.I1-2        NZ_AAAY01000001 (2378153-2380159)

 

[Intron sequence]

 

Sequence from Genbank entry.  Intron is identified in red.  The ORF
is identified in blue with start and stop codons underlined.  

               

5' end

 

2285101 gtgcgcccag catgggcgcg aacttatggg gtgcaagtcc cctgtgggag taccgacttg
2285161 gattcattgt gaaccgagta taaccacgag ccgaaggcaa gggcaagccc gtgagggatt
2285221 gtctggagga agccggagtg caagaggtgc gaggcgacgg acagaaaccg catagaaggc
2285281 cgaggcaggc gaggcgagct ggcacatgac agcgaagccc aacggtacgc cggccacggt
2285341 aaatgcggcg cttgtgcacc gacagttcgc gttcttacct ggggaggtct gtgctgacgg
2285401 cggccagcct gatcgtgacg atgacccggc cggtcccacc gggcaaccaa ggaccaggag
2285461 ccgtggcgag ctgcccgcca cgagcgaaga aaccggcagc gacgcacgat cagacgggca
2285521 gcgcgtccac tggcaacggt gggcgtgacg gtgcagaagt cagcagaagc catagtaggc
2285581 agccaggtgt cgggctgccg aagggctgaa caacgagata ctaaggagca gacgtgaccg
2285641 aactcgacca aaccgatgaa gcccgcatgg gccaggcacg gcggcggtcg gcatgcaccg
2285701 agccttgaac caagacgacg accataacca agacggccaa gacctgctcg aggccgtgct
2285761 ggccagagac aacctggcgc gggcgtggcg cagggtcaaa tccaataggg gggcgccggg
2285821 tatcgacggc gtgaccacgg cggaatggcc cgaacacgcc cgcgcacact ggccagccac
2285881 gcgcgagcag atcgaggccg ggcgatatcg gccgcaaccg gtgcgacggg tggatatccc
2285941 caagcccgat ggcggccaac ggcaactggg catcccgacc gtcacggacc gggtcatcca
2286001 gcaggccatc gcccaggtgc tcataccgat cttcgatccg ggcttttcgg catcgagctt
2286061 cggcttccga ccgggccgca atgcacacca ggcgatccgc caggtgcagg cgcacgtgaa
2286121 ggccggctac cgttgggcgg tagatctgga cttggccagg ttcttcgaca acgtcaacca
2286181 tgacctgttg atgagcctgc tgagtcgaag catcgccgac aagcgactgc tggccctgat
2286241 cgggcgctac ctgcgcgccg gggtgctggt cggggagcac ccccaaccca gcgaggtggg
2286301 cacgccgcaa ggcgggccgc tctcgccgct gttggccaac gtcctgctgc accagttcga
2286361 tctcgaactg gaacgccgcg gacaccgctt cgcccgctac gccgacgatg tgatcatcct
2286421 ggtcaagtcc cgacgcgcgg ccgagcgggt gatgcaaagc ctcacgtact tcctgcaatc
2286481 caccctcaag ctcaccgtga acctggccaa gagccaagtc gcaccgatga gtgaatgcag
2286541 ctttctaggc ttcactcttg tgggcaagaa gatccgctgg acagagaaat ccctggcgaa
2286601 tttcaagcat cgggtgcggc aactcaccgg cagaagttgg ggcgtcagca tggagtaccg
2286661 gctggaaaag ctcggtcagt atctgcgggg atggttcggg tactacggga tcagccagta
2286721 ctaccggccg atcccggaac tggacgaatg gatccgacgt cgggtgcgta tgtgctactg
2286781 gaaacagtgg cgctgggcgc gcactaagat ccggcacctg ctggacttgg gcataccgct
2286841 gaaggctgcc atccaacacg gcgtgagcag cctgagctat tggcgcatgg ccagaactcc
2286901 ggtgacccaa caggccatgt ccaacgactg gctcagggca cagggactgc tcagcatcaa
2286961 agacctgtgg tgcaaagcac agagctacgg gccggacaag ggttga
aggc gtcgtcgtca
2287021 cacgagcacg ctcgttgaac cgcctactgc ggacccgcac ggtgggtggt gtgggggctg
2287081 ggagttaaaa gctcccagct acccgat

3' end  

[top]


[Intron and flanking sequence]

 

2284561 aatacgtgtc gatccgttat accgaacgtc tggcacaagc cggtattgaa ccgtctgtcg
2284621 gcagtcgcgg tgacagttac gataatgcgc tggctgaaac gatcaatggc ctgtacaagg
2284681 ctgaactgat acacaggaga gctccctgga aaaccagggc tgctgtggag ctggcaactt
2284741 tggaatgggt tgcctggtat aaccatcaac gcctgcttgg atctatcggg tatattcctc
2284801 cagctcaggc tgaagaaaac taccgacaaa cccaggataa taagacattg atggatattt
2284861 tactttaacc aaatagcctc ctcgattgtc gggacggttc attttggctc atccaccacg
2284921 acaaagtcga tcttgtgatc gtcacgactc tgtgcgcagc gattggcttt ctccggtgcc
2284981 acattgaacg gagccccagc aatttgtgtt tccagaaggc agataggaga gcgcatgttc
2285041 ttagcagcta acgcctgagt taagccgcgc cgcgaagcgg cgtcggcttg aatgaattgt
2285101 gtgcgcccag catgggcgcg aacttatggg gtgcaagtcc cctgtgggag taccgacttg
2285161 gattcattgt gaaccgagta taaccacgag ccgaaggcaa gggcaagccc gtgagggatt
2285221 gtctggagga agccggagtg caagaggtgc gaggcgacgg acagaaaccg catagaaggc
2285281 cgaggcaggc gaggcgagct ggcacatgac agcgaagccc aacggtacgc cggccacggt
2285341 aaatgcggcg cttgtgcacc gacagttcgc gttcttacct ggggaggtct gtgctgacgg
2285401 cggccagcct gatcgtgacg atgacccggc cggtcccacc gggcaaccaa ggaccaggag
2285461 ccgtggcgag ctgcccgcca cgagcgaaga aaccggcagc gacgcacgat cagacgggca
2285521 gcgcgtccac tggcaacggt gggcgtgacg gtgcagaagt cagcagaagc catagtaggc
2285581 agccaggtgt cgggctgccg aagggctgaa caacgagata ctaaggagca gacgtgaccg
2285641 aactcgacca aaccgatgaa gcccgcatgg gccaggcacg gcggcggtcg gcatgcaccg
2285701 agccttgaac caagacgacg accataacca agacggccaa gacctgctcg aggccgtgct
2285761 ggccagagac aacctggcgc gggcgtggcg cagggtcaaa tccaataggg gggcgccggg
2285821 tatcgacggc gtgaccacgg cggaatggcc cgaacacgcc cgcgcacact ggccagccac
2285881 gcgcgagcag atcgaggccg ggcgatatcg gccgcaaccg gtgcgacggg tggatatccc
2285941 caagcccgat ggcggccaac ggcaactggg catcccgacc gtcacggacc gggtcatcca
2286001 gcaggccatc gcccaggtgc tcataccgat cttcgatccg ggcttttcgg catcgagctt
2286061 cggcttccga ccgggccgca atgcacacca ggcgatccgc caggtgcagg cgcacgtgaa
2286121 ggccggctac cgttgggcgg tagatctgga cttggccagg ttcttcgaca acgtcaacca
2286181 tgacctgttg atgagcctgc tgagtcgaag catcgccgac aagcgactgc tggccctgat
2286241 cgggcgctac ctgcgcgccg gggtgctggt cggggagcac ccccaaccca gcgaggtggg
2286301 cacgccgcaa ggcgggccgc tctcgccgct gttggccaac gtcctgctgc accagttcga
2286361 tctcgaactg gaacgccgcg gacaccgctt cgcccgctac gccgacgatg tgatcatcct
2286421 ggtcaagtcc cgacgcgcgg ccgagcgggt gatgcaaagc ctcacgtact tcctgcaatc
2286481 caccctcaag ctcaccgtga acctggccaa gagccaagtc gcaccgatga gtgaatgcag
2286541 ctttctaggc ttcactcttg tgggcaagaa gatccgctgg acagagaaat ccctggcgaa
2286601 tttcaagcat cgggtgcggc aactcaccgg cagaagttgg ggcgtcagca tggagtaccg
2286661 gctggaaaag ctcggtcagt atctgcgggg atggttcggg tactacggga tcagccagta
2286721 ctaccggccg atcccggaac tggacgaatg gatccgacgt cgggtgcgta tgtgctactg
2286781 gaaacagtgg cgctgggcgc gcactaagat ccggcacctg ctggacttgg gcataccgct
2286841 gaaggctgcc atccaacacg gcgtgagcag cctgagctat tggcgcatgg ccagaactcc
2286901 ggtgacccaa caggccatgt ccaacgactg gctcagggca cagggactgc tcagcatcaa
2286961 agacctgtgg tgcaaagcac agagctacgg gccggacaag ggttgaaggc gtcgtcgtca
2287021 cacgagcacg ctcgttgaac cgcctactgc ggacccgcac ggtgggtggt gtgggggctg
2287081 ggagttaaaa gctcccagct acccgat
tag gcatcaatgc caacaatatc cttgctaact
2287141 ccaagcttgc tagcgacgta gctgataagt tcatcttccg agcggaactc cgcgttgtcg
2287201 ataagctctt ggtggtcttg caggtcgtta ccatcttcgt cttgagatgt gacggagcca
2287261 cagcgaatta tcccatccat gtactcaggg acgttatcgt caagttgtac ggacacataa
2287321 gctatttctt ttgacatagt tttctcctat aagtttgatg gtggatgtct ccaccatagc
2287381 tatttatcac tgcacgtgag agatgcgaac gactgatgcc taacgttgaa ggtaagaggc
2287441 gcgtagccgg cttgccggcg gagcgtccct cttgaccgaa gggttgggcg tcactttgtt
2287501 acgagttcgc ttttcggatt cgtattgctt ctgccactgc attttcgagg tgctgagcaa
2287561 tccatgcgta acgatcctga ctggtagcac ccgcagggtt atagcacttg acgagagagg

[top]


[ORF sequence]

 

MHRALNQDDDHNQDGQDLLEAVLARDNLARAWRRVKSNRGAPGIDGVTTAEWPEHARA

HWPATREQIEAGRYRPQPVRRVDIPKPDGGQRQLGIPTVTDRVIQQAIAQVLIPIFDP

GFSASSFGFRPGRNAHQAIRQVQAHVKAGYRWAVDLDLARFFDNVNHDLLMSLLSRSI

ADKRLLALIGRYLRAGVLVGEHPQPSEVGTPQGGPLSPLLANVLLHQFDLELERRGHR

FARYADDVIILVKSRRAAERVMQSLTYFLQSTLKLTVNLAKSQVAPMSECSFLGFTLV

GKKIRWTEKSLANFKHRVRQLTGRSWGVSMEYRLEKLGQYLRGWFGYYGISQYYRPIP

ELDEWIRRRVRMCYWKQWRWARTKIRHLLDLGIPLKAAIQHGVSSLSYWRMARTPVTQ

QAMSNDWLRAQGLLSIKDLWCKAQSYGPDKG

top]


[Secondary structure]

 

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |