[Back to introns by organism] [Back to home page]

Information of A.g.I1 intron   (Format of information for each intron)

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

Note: Intron is similar to

AY065966 Pseudomonas putida  (2474-4399)

AY029772 Pseudomonas aeruginosa (3515-5441)

AY030343 Serratia marcescens (1893-3818)

 

[Intron sequence]

 

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined. 

 

Intron on antisense strand

 

3' end

                                                                   atc

2881 gggtagccga gggtttctag ccctcagccc ccacaacacc ctgcatgcgg ctccgcacag

2941 ggcgtttcac taagagtggt gaagcttaat ccaaccatcg cgtaattcgt acagaccctg

3001 agattttaag taagcatttg ataacgcctg attaattccc ggagttttag agctacgcca

3061 tgggcctttg ctggtaatac cgcatgcaac tgcagcttgt acatgaacac ccagccgcat

3121 taagtttctt actttagtgc gtggcttgcg ccactgtcgc caataggcca ttcgcaccct

3181 gcgccggatc caatgatcca gctcaacgca gtgctgatag ccactggcaa taccaaagta

3241 gtttatccag ccgcgtaaat attggctgat cttaaatagc tggtatttca ttgacacgcc

3301 ccagttgcga ttcgtgagtt tgcgaacatt ctgcttaaac ttcagcagcg ttttcgggtg

3361 ccaatgaatg cggttcgcct tgaacgtaaa gcccaagaac tggctttcat tggttttaac

3421 cacacgactt ttgtcagtgt tgaccactaa cttcaaacgg ctttgcaagt actgactgat

3481 actgcgcagc actcgttctc cagctcgctg gctcttcact agaatcgtga aatcatccgc

3541 gtagcgggca aacttatgac cgcgtttttc aagttcttta tccaaagagt cgagcatgat

3601 attcgccagt aacggcgaca atggcccacc ttgcggtaca cccactcgac tctcaccttt

3661 gaactgatta tcgataaacc cagctctcag gtaacgttta atcagtctaa gcagacgttt

3721 gtctttcacc ttgtcgccaa ggcgcgtcat cagtagatcg tgattaactc ggtcaaagaa

3781 cttcgacaga tcaacatcca ccgcgaagcg gcgtccttcc ttgatgatgc tctgtacctg

3841 cttaaccgct tgctgcccat tacgacctgg acggaatcca aaactgtgtt cagaaaactc

3901 agggtcgaaa ataggtgtta atacttgggt aatggcttgt tggattacac gatcagtaac

3961 ggttggaata cccagctgac gagtgccgcc atctggttta gctatttcaa cacgccttac

4021 cggtgagggt tgataacagc ctgttaccag ttgttgtttc aatgctttcc agttgccgga

4081 tcttacccaa gcggggaatt catcgatggt catgccatcg atgccagcgg cccctttatt

4141 ggctcgaacg cgtttccatg ccgcagagag gttatcttcc tgcagcacgt gctcgaatag

4201 atcgttgcta aaggctggct tcatatcgct acgctgagtc acgtcatcgc cggacggcgt

4261 tcgtgtgttc aagtgtgaca aggctccccc tttgttctta atgttcaggc cttcaccgtg

4321 tcccaaccat tacgatgggc gtttggctac tatgccgtct gctgacttct gcttaatcac

4381 tcacagagtt gcctctgttc gcgctatcgg tttgcatcta attcgctctc caaggtcgat

4441 acattcctta gagccaaggc acttattaac cagagcctca ctggtggatt accgatcgct

4501 tgttaagcag atctccccag ataagagcat gtactgtcac tgcactgctg catcatttac

4561 ggtggccgtt agatcacatg gcttcgttgt cttgtgccaa ctcgccttca gcctacgcct

4621 catatgatgt ttttgttcat cagctcgcag ctttgcgtcc ggcttcctcc agacagttcc

4681 tcgcggagct gcccttgcca ttcgctagta gttaacatct aataacgatc ataaatcgct

4741 aaggatggtg accttcctac agaggacttt cacctcatta gttcatgccc atgctgggcg

4801 tac

5' end  

 

Intron on sense strand

 

5' end

   1 gtacgcccag catgggcatg aactaatgag gtgaaagtcc tctgtaggaa ggtcaccatc 

  61 cttagcgatt tatgatcgtt attagatgtt aactactagc gaatggcaag ggcagctccg 

 121 cgaggaactg tctggaggaa gccggacgca aagctgcgag ctgatgaaca aaaacatcat 

 181 atgaggcgta ggctgaaggc gagttggcac aagacaacga agccatgtga tctaacggcc 

 241 accgtaaatg atgcagcagt gcagtgacag tacatgctct tatctgggga gatctgctta 

 301 acaagcgatc ggtaatccac cagtgaggct ctggttaata agtgccttgg ctctaaggaa 

 361 tgtatcgacc ttggagagcg aattagatgc aaaccgatag cgcgaacaga ggcaactctg 

 421 tgagtgatta agcagaagtc agcagacggc atagtagcca aacgcccatc gtaatggttg 

 481 ggacacggtg aaggcctgaa cattaagaac aaagggggag ccttgtcaca cttgaacaca 

 541 cgaacgccgt ccggcgatga cgtgactcag cgtagcgata tgaagccagc ctttagcaac 

 601 gatctattcg agcacgtgct gcaggaagat aacctctctg cggcatggaa acgcgttcga 

 661 gccaataaag gggccgctgg catcgatggc atgaccatcg atgaattccc cgcttgggta 

 721 agatccggca actggaaagc attgaaacaa caactggtaa caggctgtta tcaaccctca 

 781 ccggtaaggc gtgttgaaat agctaaacca gatggcggca ctcgtcagct gggtattcca 

 841 accgttactg atcgtgtaat ccaacaagcc attacccaag tattaacacc tattttcgac 

 901 cctgagtttt ctgaacacag ttttggattc cgtccaggtc gtaatgggca gcaagcggtt 

 961 aagcaggtac agagcatcat caaggaagga cgccgcttcg cggtggatgt tgatctgtcg 

1021 aagttctttg accgagttaa tcacgatcta ctgatgacgc gccttggcga caaggtgaaa 

1081 gacaaacgtc tgcttagact gattaaacgt tacctgagag ctgggtttat cgataatcag 

1141 ttcaaaggtg agagtcgagt gggtgtaccg caaggtgggc cattgtcgcc gttactggcg 

1201 aatatcatgc tcgactcttt ggataaagaa cttgaaaaac gcggtcataa gtttgcccgc 

1261 tacgcggatg atttcacgat tctagtgaag agccagcgag ctggagaacg agtgctgcgc 

1321 agtatcagtc agtacttgca aagccgtttg aagttagtgg tcaacactga caaaagtcgt 

1381 gtggttaaaa ccaatgaaag ccagttcttg ggctttacgt tcaaggcgaa ccgcattcat 

1441 tggcacccga aaacgctgct gaagtttaag cagaatgttc gcaaactcac gaatcgcaac 

1501 tggggcgtgt caatgaaata ccagctattt aagatcagcc aatatttacg cggctggata 

1561 aactactttg gtattgccag tggctatcag cactgcgttg agctggatca ttggatccgg 

1621 cgcagggtgc gaatggccta ttggcgacag tggcgcaagc cacgcactaa agtaagaaac 

1681 ttaatgcggc tgggtgttca tgtacaagct gcagttgcat gcggtattac cagcaaaggc 

1741 ccatggcgta gctctaaaac tccgggaatt aatcaggcgt tatcaaatgc ttacttaaaa 

1801 tctcagggtc tgtacgaatt acgcgatggt tggattaagc ttcaccactc ttagtgaaac 

1861 gccctgtgcg gagccgcatg cagggtgttg tgggggctga gggctagaaa ccctcggcta 

1921 cccgat

3'end

[top]


[Intron and flanking sequence]

 

2401 caatttggag aatggcagcg caatgacatt cttgcaggta tcttcgagcc agccacgatc

2461 gacattgatc tggctatctt gctgacaaaa gcaagagaac atagcgttgc cttggtaggt

2521 ccagcggcgg aggaactctt tgatccggtt cctgaacagg atctatttga ggcgctaaat

2581 gaaaccttaa cgctatggaa ctcgccgccc gactgggctg gcgatgagcg aaatgtagtg

2641 cttacgttgt cccgcatttg gtacagcgca gtaaccggca aaatcgcgcc gaaggatgtc

2701 gctgccgact gggcaatgga gcgcctgccg gcccagtatc agcccgtcat acttgaagct

2761 agacaggctt atcttggaca agaagaagat cgcttggcct cccgcgcaga tcagttggaa

2821 gaatttgttc actacgtgaa aggcgagatc accaaggtag tcggcaaata atgtctaatc

2881 gggtagccga gggtttctag ccctcagccc ccacaacacc ctgcatgcgg ctccgcacag

2941 ggcgtttcac taagagtggt gaagcttaat ccaaccatcg cgtaattcgt acagaccctg

3001 agattttaag taagcatttg ataacgcctg attaattccc ggagttttag agctacgcca

3061 tgggcctttg ctggtaatac cgcatgcaac tgcagcttgt acatgaacac ccagccgcat

3121 taagtttctt actttagtgc gtggcttgcg ccactgtcgc caataggcca ttcgcaccct

3181 gcgccggatc caatgatcca gctcaacgca gtgctgatag ccactggcaa taccaaagta

3241 gtttatccag ccgcgtaaat attggctgat cttaaatagc tggtatttca ttgacacgcc

3301 ccagttgcga ttcgtgagtt tgcgaacatt ctgcttaaac ttcagcagcg ttttcgggtg

3361 ccaatgaatg cggttcgcct tgaacgtaaa gcccaagaac tggctttcat tggttttaac

3421 cacacgactt ttgtcagtgt tgaccactaa cttcaaacgg ctttgcaagt actgactgat

3481 actgcgcagc actcgttctc cagctcgctg gctcttcact agaatcgtga aatcatccgc

3541 gtagcgggca aacttatgac cgcgtttttc aagttcttta tccaaagagt cgagcatgat

3601 attcgccagt aacggcgaca atggcccacc ttgcggtaca cccactcgac tctcaccttt

3661 gaactgatta tcgataaacc cagctctcag gtaacgttta atcagtctaa gcagacgttt

3721 gtctttcacc ttgtcgccaa ggcgcgtcat cagtagatcg tgattaactc ggtcaaagaa

3781 cttcgacaga tcaacatcca ccgcgaagcg gcgtccttcc ttgatgatgc tctgtacctg

3841 cttaaccgct tgctgcccat tacgacctgg acggaatcca aaactgtgtt cagaaaactc

3901 agggtcgaaa ataggtgtta atacttgggt aatggcttgt tggattacac gatcagtaac

3961 ggttggaata cccagctgac gagtgccgcc atctggttta gctatttcaa cacgccttac

4021 cggtgagggt tgataacagc ctgttaccag ttgttgtttc aatgctttcc agttgccgga

4081 tcttacccaa gcggggaatt catcgatggt catgccatcg atgccagcgg cccctttatt

4141 ggctcgaacg cgtttccatg ccgcagagag gttatcttcc tgcagcacgt gctcgaatag

4201 atcgttgcta aaggctggct tcatatcgct acgctgagtc acgtcatcgc cggacggcgt

4261 tcgtgtgttc aagtgtgaca aggctccccc tttgttctta atgttcaggc cttcaccgtg

4321 tcccaaccat tacgatgggc gtttggctac tatgccgtct gctgacttct gcttaatcac

4381 tcacagagtt gcctctgttc gcgctatcgg tttgcatcta attcgctctc caaggtcgat

4441 acattcctta gagccaaggc acttattaac cagagcctca ctggtggatt accgatcgct

4501 tgttaagcag atctccccag ataagagcat gtactgtcac tgcactgctg catcatttac

4561 ggtggccgtt agatcacatg gcttcgttgt cttgtgccaa ctcgccttca gcctacgcct

4621 catatgatgt ttttgttcat cagctcgcag ctttgcgtcc ggcttcctcc agacagttcc

4681 tcgcggagct gcccttgcca ttcgctagta gttaacatct aataacgatc ataaatcgct

4741 aaggatggtg accttcctac agaggacttt cacctcatta gttcatgccc atgctgggcg

4801 tacacaattc gttcaagccg acgccgcttc gcggcgcggc ttaactcaag cgttagatgc

4861 actaagcaca taattgctca cagccaaacc tatcaggtca agtctgcttt tattattttt

4921 aagcgtgcat aataagccct acacaaattg ggagatatat catgaaaggc tggctttttc

4981 ttgttatcgc aa

[top]


[ORF sequence]

 

MKPAFSNDLFEHVLQEDNLSAAWKRVRANKGAAGIDGMTIDEFPAWVRSGNWKALKQQ

LVTGCYQPSPVRRVEIAKPDGGTRQLGIPTVTDRVIQQAITQVLTPIFDPEFSEHSFG

FRPGRNGQQAVKQVQSIIKEGRRFAVDVDLSKFFDRVNHDLLMTRLGDKVKDKRLLRL

IKRYLRAGFIDNQFKGESRVGVPQGGPLSPLLANIMLDSLDKELEKRGHKFARYADDF

TILVKSQRAGERVLRSISQYLQSRLKLVVNTDKSRVVKTNESQFLGFTFKANRIHWHP

KTLLKFKQNVRKLTNRNWGVSMKYQLFKISQYLRGWINYFGIASGYQHCVELDHWIRR

RVRMAYWRQWRKPRTKVRNLMRLGVHVQAAVACGITSKGPWRSSKTPGINQALSNAYL

KSQGLYELRDGWIKLHHS

[top]


[Secondary structure]

 

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |