
Information for the N.a.I2 intron


Note: Redundant intron copies found in

NZ_AAAV01000163 Novosphingobium aromaticivorans (52137-54684)

 

[Intron sequence]

 

The sequence is taken from the GenBank entry.  The intron is identified in red.  The ORF
is identified in blue, with start and stop codons underlined.

               

5' end                                          

                                                              gtgcgccgt

53821 tgcggcgctg aaacggagta tcaaagatga tgaggtgagt tgagatacct ctgcccgcca

53881 agcttacctc ggacccaaaa ggtctgaaat tccgaaagga acgggggacc atagcatgct

53941 tggagcaagg ccgaatgcat cccagtcgca aagggatgat gccggtcagg gcaagacgcg

54001 catggtgaaa gctaaactgc ttaaatccta gtctgcagca ttttatgtac gcgaagctgg

54061 cgaaagcgac tgacacgcca gatccggacg ttggcaacgg tctcatgtct tggccgtttg

54121 ttctcaaccc gtccggtgcg ccccttgtcc aggggttcag caggcgaaag ggacacctaa

54181 acgcgtcgca tctccgttca gcgtaactac gggatgccct aaacaggcgg cttcggctga

54241 cctgcatggg tacggagccg ccatagtatt caaatgcccg gggtaatgcc cgggacagct

54301 cccggaaggg agacggtgca accgcataag gaccgggcga ccgggccgaa accggcgacc

54361 accgggagaa gggcggcagt tgcgatactt caaccgcaaa tcatggggtg tgctgatgct

54421 accagatact gtgatggccc ggttggaggc cattccgacg atctctgcgt cgggcaaacg

54481 ggtaaatggg cttcatcgtt tgatgaggtc cccgcttctt tgggagcagg gacttcggaa

54541 aattgcctcc aatcgggggg cgcatgacgc cgggcatcga tggcaagaca ttcgaggatt

54601 tcggtcccga ccgtctcgct ccgttgatcg ccagcgttgc gaccggagcc tacaaaccaa

54661 aacctgtgcg tcgggtgttc atcccgaaag gcaaaggaaa gcggcgtccg ctggggattc

54721 ccacgcgaga cgaccgcctc gtccaggaag tggcacgcca actgctggaa cgaatctatg

54781 agccggtctt ctcgaaggcc tcgcatggat ttcgaccggg aagatcgtgt catacggccc

54841 tcgagcacgt gaaggctgtc tggacgggcg tcaaatggct tgtcgacgtg gatgtcgccg

54901 ggttcttcga gaacatcgac catgacattc tgctgaagct gctccggaaa aggatcgatg

54961 acgaaaggtt catcgacctg atccgcgaca tgctgaaggc aggagtcatg gagggaaggg

55021 ctcacaccca gacctatagc ggcacaccac aaggcgggat cgtctccccg atcctggcca

55081 acatctacct gcacgaactc gatgagttca tggcgggtcg gatcacggcc tttgaaaaag

55141 ggaagacccg cgccacgaac ccggaatacc ggagactggc gggccggatc gccaaacggc

55201 gagaacggct caaacgactg gaagccagtg acaacgctga tcaggtaacg gtgaaggcca

55261 tcttggccga aatcaacacc ttatcaaagc agatgcgttc gttgccgtcg agagacgcca

55321 tggacgccgg gtttcgccga cttcgctact gccgttacgc cgacgatttt cttatcggtg

55381 tgattggcag caaggacgat gcgagagggg tcttcgccga agtcaggacc ttcctgaccg

55441 aggtactggc cttgaccgta tccgaggaga agagcggaat tcgaaaagca agcgatggta

55501 ccaaattcct cggatacgag gtgcggactt acacgggacg ccaatggaca gtgcgaagcc

55561 agaacggcac acagcacttc aagcggcgcc cgccatcgga agtcatgcaa ctcaatgttc

55621 cgtgggatag ggtcactgcg tttgttgccc ggaaggcata cggagaatgg tcccgattga

55681 gggccaaaca ccgcaaccac cttctaagct gtagcgatgt cgagattgtc cttgcctaca

55741 acgccgaact gcgagggttc gcgaactact acgctctggc gcgcgatgtg aaattcaagc

55801 tcaaccggct tgaatacctt cagcgctgga gcatgttcaa aaccttggca agcaagcaca

55861 aatccagtgt gcgagttgtt gccgcccgca tgaggcaagg gctggaatac ctcgccggct

55921 atgaagtcgg cggccagccc cgatcagtca aagtctggaa aatgaccgat ctgaaccgtg

55981 accggataga cccggacaag gtggacgtcc aaccttggac gcaaatcttc tccggctcgc

56041 gaacagattg ggtcgaccgg cagaacgcca cgcaatgcga agcgtgcggc cgatccgacc

56101 tcccctgcca tgttcatcat gtcaggggaa tggccgatgt tgcgcacaga gaccaagcca

56161 cgaggaaagc catagccaga gcgcgcaaga cgaaggttct gtgcgtccct tgccacaagg

56221 cgatccatgg tggcccacta ccggagcaga gaacatgaat ggtattatcg caatggagag

56281 ccgcatgcag cgaaagctgc acgtgcggtt cggtgggggg aacagggctg actcccggag

56341 cagcaccact cctacccaac

3' end  
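The numbered listing above follows the GenBank layout: a coordinate, then blocks of ten bases. A minimal Python sketch for reading such lines into one string and slicing it by entry coordinates is given below. The function names `parse_numbered_sequence` and `subsequence` are hypothetical helpers, not part of any database tool, and the sketch assumes every usable line starts with its coordinate (so the unnumbered partial first line would have to be added by hand).

```python
import re

def parse_numbered_sequence(text):
    """Return (start_coordinate, sequence) from GenBank-style numbered lines.

    Assumption: each sequence line is '<coordinate> <blocks of bases>'.
    Lines without a leading coordinate (labels, blanks, the partial
    unnumbered first line) are skipped.
    """
    start = None
    chunks = []
    for line in text.splitlines():
        m = re.match(r"\s*(\d+)\s+([acgtnACGTN ]+)\s*$", line)
        if not m:
            continue
        if start is None:
            start = int(m.group(1))  # coordinate of the first base read
        chunks.append(m.group(2).replace(" ", "").lower())
    if start is None:
        raise ValueError("no numbered sequence lines found")
    return start, "".join(chunks)

def subsequence(start, seq, first, last):
    """Slice by 1-based inclusive coordinates in the entry's numbering."""
    return seq[first - start:last - start + 1]

listing = """
53821 tgcggcgctg aaacggagta tcaaagatga tgaggtgagt tgagatacct ctgcccgcca

53881 agcttacctc ggacccaaaa ggtctgaaat tccgaaagga acgggggacc atagcatgct
"""
start, seq = parse_numbered_sequence(listing)
print(start, len(seq))
print(subsequence(start, seq, 53881, 53890))
```

The sketch assumes the lines are contiguous; it does not check that each coordinate matches the running length, which a more careful reader would add.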



[Intron and flanking sequence]

 

53281 aatttcgccg ggcccgccgt ctccggcgag atcacccttc atggtgacga gatctacgtt

53341 caggtctcga tcccctgcat ccggcccggc cgcgaagtca tgttccgccg ctgcaagggc

53401 cgtcaggact acctcggcga ccgcaatcac ttctgcgaca tcgccgtcct tgccgctccg

53461 catagcttcc gcgcgctggt cgtccgcgag accggccttt ccatcaaccc acgcagccag

53521 gcgctgctct aggagaaaac ctcatgggat ggctgttcat gtcacgcggc gggatgtcgc

53581 ctttcgccac gccgaaggcc tatctcgaca accaatgcac ctatccgccc gatctggaca

53641 aaggccgcga gaccggcctg cgggtgctca aatcgacggt ccggtccggt gcctattacg

53701 ccgcctgcca gagctacgac gccgaaggtc caaaggagac cttcgcgatc atctgcctgg

53761 tcaaatggaa ccctggcgcg cgcagcggcg agaactttgg ctacaaagac agtgcgccgt

53821 tgcggcgctg aaacggagta tcaaagatga tgaggtgagt tgagatacct ctgcccgcca

53881 agcttacctc ggacccaaaa ggtctgaaat tccgaaagga acgggggacc atagcatgct

53941 tggagcaagg ccgaatgcat cccagtcgca aagggatgat gccggtcagg gcaagacgcg

54001 catggtgaaa gctaaactgc ttaaatccta gtctgcagca ttttatgtac gcgaagctgg

54061 cgaaagcgac tgacacgcca gatccggacg ttggcaacgg tctcatgtct tggccgtttg

54121 ttctcaaccc gtccggtgcg ccccttgtcc aggggttcag caggcgaaag ggacacctaa

54181 acgcgtcgca tctccgttca gcgtaactac gggatgccct aaacaggcgg cttcggctga

54241 cctgcatggg tacggagccg ccatagtatt caaatgcccg gggtaatgcc cgggacagct

54301 cccggaaggg agacggtgca accgcataag gaccgggcga ccgggccgaa accggcgacc

54361 accgggagaa gggcggcagt tgcgatactt caaccgcaaa tcatggggtg tgctgatgct

54421 accagatact gtgatggccc ggttggaggc cattccgacg atctctgcgt cgggcaaacg

54481 ggtaaatggg cttcatcgtt tgatgaggtc cccgcttctt tgggagcagg gacttcggaa

54541 aattgcctcc aatcgggggg cgcatgacgc cgggcatcga tggcaagaca ttcgaggatt

54601 tcggtcccga ccgtctcgct ccgttgatcg ccagcgttgc gaccggagcc tacaaaccaa

54661 aacctgtgcg tcgggtgttc atcccgaaag gcaaaggaaa gcggcgtccg ctggggattc

54721 ccacgcgaga cgaccgcctc gtccaggaag tggcacgcca actgctggaa cgaatctatg

54781 agccggtctt ctcgaaggcc tcgcatggat ttcgaccggg aagatcgtgt catacggccc

54841 tcgagcacgt gaaggctgtc tggacgggcg tcaaatggct tgtcgacgtg gatgtcgccg

54901 ggttcttcga gaacatcgac catgacattc tgctgaagct gctccggaaa aggatcgatg

54961 acgaaaggtt catcgacctg atccgcgaca tgctgaaggc aggagtcatg gagggaaggg

55021 ctcacaccca gacctatagc ggcacaccac aaggcgggat cgtctccccg atcctggcca

55081 acatctacct gcacgaactc gatgagttca tggcgggtcg gatcacggcc tttgaaaaag

55141 ggaagacccg cgccacgaac ccggaatacc ggagactggc gggccggatc gccaaacggc

55201 gagaacggct caaacgactg gaagccagtg acaacgctga tcaggtaacg gtgaaggcca

55261 tcttggccga aatcaacacc ttatcaaagc agatgcgttc gttgccgtcg agagacgcca

55321 tggacgccgg gtttcgccga cttcgctact gccgttacgc cgacgatttt cttatcggtg

55381 tgattggcag caaggacgat gcgagagggg tcttcgccga agtcaggacc ttcctgaccg

55441 aggtactggc cttgaccgta tccgaggaga agagcggaat tcgaaaagca agcgatggta

55501 ccaaattcct cggatacgag gtgcggactt acacgggacg ccaatggaca gtgcgaagcc

55561 agaacggcac acagcacttc aagcggcgcc cgccatcgga agtcatgcaa ctcaatgttc

55621 cgtgggatag ggtcactgcg tttgttgccc ggaaggcata cggagaatgg tcccgattga

55681 gggccaaaca ccgcaaccac cttctaagct gtagcgatgt cgagattgtc cttgcctaca

55741 acgccgaact gcgagggttc gcgaactact acgctctggc gcgcgatgtg aaattcaagc

55801 tcaaccggct tgaatacctt cagcgctgga gcatgttcaa aaccttggca agcaagcaca

55861 aatccagtgt gcgagttgtt gccgcccgca tgaggcaagg gctggaatac ctcgccggct

55921 atgaagtcgg cggccagccc cgatcagtca aagtctggaa aatgaccgat ctgaaccgtg

55981 accggataga cccggacaag gtggacgtcc aaccttggac gcaaatcttc tccggctcgc

56041 gaacagattg ggtcgaccgg cagaacgcca cgcaatgcga agcgtgcggc cgatccgacc

56101 tcccctgcca tgttcatcat gtcaggggaa tggccgatgt tgcgcacaga gaccaagcca

56161 cgaggaaagc catagccaga gcgcgcaaga cgaaggttct gtgcgtccct tgccacaagg

56221 cgatccatgg tggcccacta ccggagcaga gaacatgaat ggtattatcg caatggagag

56281 ccgcatgcag cgaaagctgc acgtgcggtt cggtgggggg aacagggctg actcccggag

56341 cagcaccact cctacccaac tgaccgaaac gatggggccg tatcactacg attgtccggc

56401 ctcgatcctc gacctgttgg gccctcccgg caacgaatat gccgccaaat ggcgcgaggc

56461 ctgccgggcg cgtctcgcgc tgacctcgcg ccgcaagccg cgaccgggcg acacgctggt

56521 gctggccgag ccgctcacgt tcactgacgg gcaaagcgag cgcagcttcc gggtggtcca

56581 gtcgggccgg aagaccatcc tgcgccggat gaacgatggg atgggcgtga agatcagcaa

56641 gctgatgagc cgtgcctgga cgattgtccc ggcccccgcc gccccctcgg cgacgtgact

56701 ggctcgcgag cttgcaccgc catgtcggac cagccacccc ccgaacgctc acctaaacgt



[ORF sequence]

 

MPPIGGRMTPGIDGKTFEDFGPDRLAPLIASVATGAYKPKPVRRVFIPKGKGKRRPLG

IPTRDDRLVQEVARQLLERIYEPVFSKASHGFRPGRSCHTALEHVKAVWTGVKWLVDV

DVAGFFENIDHDILLKLLRKRIDDERFIDLIRDMLKAGVMEGRAHTQTYSGTPQGGIV

SPILANIYLHELDEFMAGRITAFEKGKTRATNPEYRRLAGRIAKRRERLKRLEASDNA

DQVTVKAILAEINTLSKQMRSLPSRDAMDAGFRRLRYCRYADDFLIGVIGSKDDARGV

FAEVRTFLTEVLALTVSEEKSGIRKASDGTKFLGYEVRTYTGRQWTVRSQNGTQHFKR

RPPSEVMQLNVPWDRVTAFVARKAYGEWSRLRAKHRNHLLSCSDVEIVLAYNAELRGF

ANYYALARDVKFKLNRLEYLQRWSMFKTLASKHKSSVRVVAARMRQGLEYLAGYEVGG

QPRSVKVWKMTDLNRDRIDPDKVDVQPWTQIFSGSRTDWVDRQNATQCEACGRSDLPC

HVHHVRGMADVAHRDQATRKAIARARKTKVLCVPCHKAIHGGPLPEQRT
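As a rough cross-check between the nucleotide listing and the protein sequence above, the following Python sketch translates a DNA string codon by codon. It assumes the bacterial genetic code (NCBI translation table 11), in which an initiating TTG or GTG is read as methionine. The function name `translate_orf` and the `START_CODONS` set are illustrative, and the set lists only the common bacterial initiators. The example input is the stretch beginning `ttgcctcca` in the line numbered 54541, which appears to correspond to the first nine residues of the ORF.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses
# the same amino-acid assignments and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial initiators are accepted here.
START_CODONS = {"ATG", "GTG", "TTG"}

def translate_orf(dna):
    """Translate from the first base to the first stop codon.

    The first codon must be an accepted initiator and is reported as M.
    """
    dna = dna.replace(" ", "").upper()
    if dna[:3] not in START_CODONS:
        raise ValueError("sequence does not begin with an accepted start codon")
    protein = ["M"]
    for i in range(3, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the ORF
            break
        protein.append(aa)
    return "".join(protein)

print(translate_orf("ttgcctccaatcggggggcgcatgacg"))  # MPPIGGRMT
```

Combined with a coordinate-based slice of the full listing, the same function could be used to compare a complete translation against the ORF sequence shown here.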



[Secondary structure]

 


