[Back to introns by organism] [Back to home page]
Information of N.a.I2 intron (Format of information for each intron)
[Intron and flanking sequence]
Note: Redundant intron copies found in
NZ_AAAV01000163 Novosphingobium aromaticivorans (52137-54684)
Sequence
from Genbank entry. Intron is identified in red.
The ORF
is identified in blue with start and stop codons
underlined.
5' end
gtgcgccgt
53821 tgcggcgctg aaacggagta tcaaagatga tgaggtgagt tgagatacct ctgcccgcca
53881 agcttacctc ggacccaaaa ggtctgaaat tccgaaagga acgggggacc atagcatgct
53941 tggagcaagg ccgaatgcat cccagtcgca aagggatgat gccggtcagg gcaagacgcg
54001 catggtgaaa gctaaactgc ttaaatccta gtctgcagca ttttatgtac gcgaagctgg
54061 cgaaagcgac tgacacgcca gatccggacg ttggcaacgg tctcatgtct tggccgtttg
54121 ttctcaaccc gtccggtgcg ccccttgtcc aggggttcag caggcgaaag ggacacctaa
54181 acgcgtcgca tctccgttca gcgtaactac gggatgccct aaacaggcgg cttcggctga
54241 cctgcatggg tacggagccg ccatagtatt caaatgcccg gggtaatgcc cgggacagct
54301 cccggaaggg agacggtgca accgcataag gaccgggcga ccgggccgaa accggcgacc
54361 accgggagaa gggcggcagt tgcgatactt caaccgcaaa tcatggggtg tgctgatgct
54421 accagatact gtgatggccc ggttggaggc cattccgacg atctctgcgt cgggcaaacg
54481 ggtaaatggg cttcatcgtt tgatgaggtc cccgcttctt tgggagcagg gacttcggaa
54541 aattgcctcc aatcgggggg cgcatgacgc cgggcatcga tggcaagaca ttcgaggatt
54601 tcggtcccga ccgtctcgct ccgttgatcg ccagcgttgc gaccggagcc tacaaaccaa
54661 aacctgtgcg tcgggtgttc atcccgaaag gcaaaggaaa gcggcgtccg ctggggattc
54721 ccacgcgaga cgaccgcctc gtccaggaag tggcacgcca actgctggaa cgaatctatg
54781 agccggtctt ctcgaaggcc tcgcatggat ttcgaccggg aagatcgtgt catacggccc
54841 tcgagcacgt gaaggctgtc tggacgggcg tcaaatggct tgtcgacgtg gatgtcgccg
54901 ggttcttcga gaacatcgac catgacattc tgctgaagct gctccggaaa aggatcgatg
54961 acgaaaggtt catcgacctg atccgcgaca tgctgaaggc aggagtcatg gagggaaggg
55021 ctcacaccca gacctatagc ggcacaccac aaggcgggat cgtctccccg atcctggcca
55081 acatctacct gcacgaactc gatgagttca tggcgggtcg gatcacggcc tttgaaaaag
55141 ggaagacccg cgccacgaac ccggaatacc ggagactggc gggccggatc gccaaacggc
55201 gagaacggct caaacgactg gaagccagtg acaacgctga tcaggtaacg gtgaaggcca
55261 tcttggccga aatcaacacc ttatcaaagc agatgcgttc gttgccgtcg agagacgcca
55321 tggacgccgg gtttcgccga cttcgctact gccgttacgc cgacgatttt cttatcggtg
55381 tgattggcag caaggacgat gcgagagggg tcttcgccga agtcaggacc ttcctgaccg
55441 aggtactggc cttgaccgta tccgaggaga agagcggaat tcgaaaagca agcgatggta
55501 ccaaattcct cggatacgag gtgcggactt acacgggacg ccaatggaca gtgcgaagcc
55561 agaacggcac acagcacttc aagcggcgcc cgccatcgga agtcatgcaa ctcaatgttc
55621 cgtgggatag ggtcactgcg tttgttgccc ggaaggcata cggagaatgg tcccgattga
55681 gggccaaaca ccgcaaccac cttctaagct gtagcgatgt cgagattgtc cttgcctaca
55741 acgccgaact gcgagggttc gcgaactact acgctctggc gcgcgatgtg aaattcaagc
55801 tcaaccggct tgaatacctt cagcgctgga gcatgttcaa aaccttggca agcaagcaca
55861 aatccagtgt gcgagttgtt gccgcccgca tgaggcaagg gctggaatac ctcgccggct
55921 atgaagtcgg cggccagccc cgatcagtca aagtctggaa aatgaccgat ctgaaccgtg
55981 accggataga cccggacaag gtggacgtcc aaccttggac gcaaatcttc tccggctcgc
56041 gaacagattg ggtcgaccgg cagaacgcca cgcaatgcga agcgtgcggc cgatccgacc
56101 tcccctgcca tgttcatcat gtcaggggaa tggccgatgt tgcgcacaga gaccaagcca
56161 cgaggaaagc catagccaga gcgcgcaaga cgaaggttct gtgcgtccct tgccacaagg
56221 cgatccatgg tggcccacta ccggagcaga gaacatgaat ggtattatcg caatggagag
56281 ccgcatgcag cgaaagctgc acgtgcggtt cggtgggggg aacagggctg actcccggag
56341 cagcaccact cctacccaac
3' end
[Intron and flanking sequence]
53281 aatttcgccg ggcccgccgt ctccggcgag atcacccttc atggtgacga gatctacgtt
53341 caggtctcga tcccctgcat ccggcccggc cgcgaagtca tgttccgccg ctgcaagggc
53401 cgtcaggact acctcggcga ccgcaatcac ttctgcgaca tcgccgtcct tgccgctccg
53461 catagcttcc gcgcgctggt cgtccgcgag accggccttt ccatcaaccc acgcagccag
53521 gcgctgctct aggagaaaac ctcatgggat ggctgttcat gtcacgcggc gggatgtcgc
53581 ctttcgccac gccgaaggcc tatctcgaca accaatgcac ctatccgccc gatctggaca
53641 aaggccgcga gaccggcctg cgggtgctca aatcgacggt ccggtccggt gcctattacg
53701 ccgcctgcca gagctacgac gccgaaggtc caaaggagac cttcgcgatc atctgcctgg
53761 tcaaatggaa ccctggcgcg cgcagcggcg agaactttgg ctacaaagac agtgcgccgt
53821 tgcggcgctg aaacggagta tcaaagatga tgaggtgagt tgagatacct ctgcccgcca
53881 agcttacctc ggacccaaaa ggtctgaaat tccgaaagga acgggggacc atagcatgct
53941 tggagcaagg ccgaatgcat cccagtcgca aagggatgat gccggtcagg gcaagacgcg
54001 catggtgaaa gctaaactgc ttaaatccta gtctgcagca ttttatgtac gcgaagctgg
54061 cgaaagcgac tgacacgcca gatccggacg ttggcaacgg tctcatgtct tggccgtttg
54121 ttctcaaccc gtccggtgcg ccccttgtcc aggggttcag caggcgaaag ggacacctaa
54181 acgcgtcgca tctccgttca gcgtaactac gggatgccct aaacaggcgg cttcggctga
54241 cctgcatggg tacggagccg ccatagtatt caaatgcccg gggtaatgcc cgggacagct
54301 cccggaaggg agacggtgca accgcataag gaccgggcga ccgggccgaa accggcgacc
54361 accgggagaa gggcggcagt tgcgatactt caaccgcaaa tcatggggtg tgctgatgct
54421 accagatact gtgatggccc ggttggaggc cattccgacg atctctgcgt cgggcaaacg
54481 ggtaaatggg cttcatcgtt tgatgaggtc cccgcttctt tgggagcagg gacttcggaa
54541 aattgcctcc aatcgggggg cgcatgacgc cgggcatcga tggcaagaca ttcgaggatt
54601 tcggtcccga ccgtctcgct ccgttgatcg ccagcgttgc gaccggagcc tacaaaccaa
54661 aacctgtgcg tcgggtgttc atcccgaaag gcaaaggaaa gcggcgtccg ctggggattc
54721 ccacgcgaga cgaccgcctc gtccaggaag tggcacgcca actgctggaa cgaatctatg
54781 agccggtctt ctcgaaggcc tcgcatggat ttcgaccggg aagatcgtgt catacggccc
54841 tcgagcacgt gaaggctgtc tggacgggcg tcaaatggct tgtcgacgtg gatgtcgccg
54901 ggttcttcga gaacatcgac catgacattc tgctgaagct gctccggaaa aggatcgatg
54961 acgaaaggtt catcgacctg atccgcgaca tgctgaaggc aggagtcatg gagggaaggg
55021 ctcacaccca gacctatagc ggcacaccac aaggcgggat cgtctccccg atcctggcca
55081 acatctacct gcacgaactc gatgagttca tggcgggtcg gatcacggcc tttgaaaaag
55141 ggaagacccg cgccacgaac ccggaatacc ggagactggc gggccggatc gccaaacggc
55201 gagaacggct caaacgactg gaagccagtg acaacgctga tcaggtaacg gtgaaggcca
55261 tcttggccga aatcaacacc ttatcaaagc agatgcgttc gttgccgtcg agagacgcca
55321 tggacgccgg gtttcgccga cttcgctact gccgttacgc cgacgatttt cttatcggtg
55381 tgattggcag caaggacgat gcgagagggg tcttcgccga agtcaggacc ttcctgaccg
55441 aggtactggc cttgaccgta tccgaggaga agagcggaat tcgaaaagca agcgatggta
55501 ccaaattcct cggatacgag gtgcggactt acacgggacg ccaatggaca gtgcgaagcc
55561 agaacggcac acagcacttc aagcggcgcc cgccatcgga agtcatgcaa ctcaatgttc
55621 cgtgggatag ggtcactgcg tttgttgccc ggaaggcata cggagaatgg tcccgattga
55681 gggccaaaca ccgcaaccac cttctaagct gtagcgatgt cgagattgtc cttgcctaca
55741 acgccgaact gcgagggttc gcgaactact acgctctggc gcgcgatgtg aaattcaagc
55801 tcaaccggct tgaatacctt cagcgctgga gcatgttcaa aaccttggca agcaagcaca
55861 aatccagtgt gcgagttgtt gccgcccgca tgaggcaagg gctggaatac ctcgccggct
55921 atgaagtcgg cggccagccc cgatcagtca aagtctggaa aatgaccgat ctgaaccgtg
55981 accggataga cccggacaag gtggacgtcc aaccttggac gcaaatcttc tccggctcgc
56041 gaacagattg ggtcgaccgg cagaacgcca cgcaatgcga agcgtgcggc cgatccgacc
56101 tcccctgcca tgttcatcat gtcaggggaa tggccgatgt tgcgcacaga gaccaagcca
56161 cgaggaaagc catagccaga gcgcgcaaga cgaaggttct gtgcgtccct tgccacaagg
56221 cgatccatgg tggcccacta ccggagcaga gaacatgaat ggtattatcg caatggagag
56281 ccgcatgcag cgaaagctgc acgtgcggtt cggtgggggg aacagggctg actcccggag
56341 cagcaccact cctacccaac tgaccgaaac gatggggccg tatcactacg attgtccggc
56401 ctcgatcctc gacctgttgg gccctcccgg caacgaatat gccgccaaat ggcgcgaggc
56461 ctgccgggcg cgtctcgcgc tgacctcgcg ccgcaagccg cgaccgggcg acacgctggt
56521 gctggccgag ccgctcacgt tcactgacgg gcaaagcgag cgcagcttcc gggtggtcca
56581 gtcgggccgg aagaccatcc tgcgccggat gaacgatggg atgggcgtga agatcagcaa
56641 gctgatgagc cgtgcctgga cgattgtccc ggcccccgcc gccccctcgg cgacgtgact
56701 ggctcgcgag cttgcaccgc catgtcggac cagccacccc ccgaacgctc acctaaacgt
MPPIGGRMTPGIDGKTFEDFGPDRLAPLIASVATGAYKPKPVRRVFIPKGKGKRRPLG
IPTRDDRLVQEVARQLLERIYEPVFSKASHGFRPGRSCHTALEHVKAVWTGVKWLVDV
DVAGFFENIDHDILLKLLRKRIDDERFIDLIRDMLKAGVMEGRAHTQTYSGTPQGGIV
SPILANIYLHELDEFMAGRITAFEKGKTRATNPEYRRLAGRIAKRRERLKRLEASDNA
DQVTVKAILAEINTLSKQMRSLPSRDAMDAGFRRLRYCRYADDFLIGVIGSKDDARGV
FAEVRTFLTEVLALTVSEEKSGIRKASDGTKFLGYEVRTYTGRQWTVRSQNGTQHFKR
RPPSEVMQLNVPWDRVTAFVARKAYGEWSRLRAKHRNHLLSCSDVEIVLAYNAELRGF
ANYYALARDVKFKLNRLEYLQRWSMFKTLASKHKSSVRVVAARMRQGLEYLAGYEVGG
QPRSVKVWKMTDLNRDRIDPDKVDVQPWTQIFSGSRTDWVDRQNATQCEACGRSDLPC
HVHHVRGMADVAHRDQATRKAIARARKTKVLCVPCHKAIHGGPLPEQRT

| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragments | Alignment of insertion sequence |
| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |