[Back to introns by organism]  [Back to home page]

Information for Al.me.I4 intron  (Format of information for each intron

[Intron sequence]

[Intron and flanking sequence]

[ORF sequence]

[Secondary structure]

 

 

[Intron sequence]

 

Sequence from Genbank entry (intron is on the antisense strand). 

The boundaries of the intron are marked as red and ORF is marked 

as blue, with start and stop codons underlined.

Intron on antisense strand

3' end

                                                                    gtc
19681 gagtagaccc agggaactgc ccccccgagt ctctcacaga acctggactt gaaagtctcc
19741 cttcatccgg ctcttcttaa tcagttcttg ggacatatgc ccatcttcca gtggacaaac
19801 agtgatggtg tttcttctgc aatagagctt aatctcctta aagctgccat aaatttattt
19861 ttatagcgtt tagacttctt catcatccat ctggcaagac tcaaattgac acacgataga
19921 attgcatgaa ctttacttgc tgtaaattta tcgaaatagt tagcccaacc tcgaataata
19981 gggttcattt tctctgctaa ttcggttaaa gacagatagg aattctttct tagtttctta
20041 attttatcca tgaagctctt tatggcaatt ttgctaactg ccggggtgaa tcctgtgaaa
20101 taatttccct tctttgtttt caccgtacga gctctgaatg tatatcctag aaaatcaaag
20161 ttggtgtgga tatggttttg tttcctgttg tcatctttgc agtaaataat cttggtttta
20221 tttggatgga tttctaggtg gcaatcttta aatcgctctc ctagtttaag caagagtact
20281 tctgcttctt ttaaactatg actgtgtatt aatccatcat ctgcgtaacg tacccaaggg
20341 ttatttgagt gttttcgtgt catccagtgg tcaaaagcat agtgcataaa gagatttgca
20401 agcactgcgc tgattactcc accttggggg gttccttttg ttctttcgac gtgtattcca
20461 tctgacataa ccatcggcac ttttagtgtt ctttcaatat ataggataac ccacttctca
20521 tttgtatgtt gcttcactgc tttcatgaga aggtcgtgat taatgttgtc gaatagaccc
20581 acgatgtcaa actcaattaa ccagtcatac tcccaacacc gttttctcgt cacttcgata
20641 gcatctatag cacttttatt ctctctatac ccatatgaat cctcataaaa aatgggttct
20701 actttgggat ttagctccga gaccataaca ttttgtgcga tacgatcgtc aatacttgga
20761 acacctaaga ctctaactcc tccgtttttc ttgggaattt ctacaccgcg tacagcttgc
20821 ggaaagtaac ttcctgaaga catgctattc cataatttat agaggttatc ttccaagttg
20881 ttttcatatt cttggagtgt tacctcatca atgcctggag cacctttgtt agcttctact
20941 tttttaaatg cctcaaacac tgcgcgtttt gtaatttgaa agggtttctc cttcttcat
a
21001 gattcctccc atctgtggtt gacctacatt ttaggattga ttaagtgagt ccctttgctc
21061 cacgactatt accgtcgttt cttcactact acggactcat ccgcccctgt tcacggcttc
21121 gatacttgca tcctaacatt tcttgtgctt ggattcttct cttaacatcc gtcaacaggt
21181 tctcctgttc catctaaaag cctaaactac gctcctgcaa cctatgcacc ggctgtcatc
21241 tggtcagtaa gcagatttct tccagactta tcccagcatt tcccgactac actggttttg
21301 acatgcattt ttcctgataa cgatgcttta atggttattc actttcgttc agcttcgcag
21361 ttcacacttg actgtttata cagccttttt cccataacgc tcaataccga agacattaat
21421 ctacagcacc tatgggcagt ttgaaacccg tatctgcata ccgatttcga aggaccgacc
21481 ttcatctttt agatagcatc atttcagacg ttttaaacct cctttctttg tcttactttg
21541 ttcaggacac ac

5' end

Intron on sense strand

5' end

   1 gtgtgtcctg aacaaagtaa gacaaagaaa ggaggtttaa aacgtctgaa atgatgctat 

  61 ctaaaagatg aaggtcggtc cttcgaaatc ggtatgcaga tacgggtttc aaactgccca 

 121 taggtgctgt agattaatgt cttcggtatt gagcgttatg ggaaaaaggc tgtataaaca 

 181 gtcaagtgtg aactgcgaag ctgaacgaaa gtgaataacc attaaagcat cgttatcagg 

 241 aaaaatgcat gtcaaaacca gtgtagtcgg gaaatgctgg gataagtctg gaagaaatct 

 301 gcttactgac cagatgacag ccggtgcata ggttgcagga gcgtagttta ggcttttaga 

 361 tggaacagga gaacctgttg acggatgtta agagaagaat ccaagcacaa gaaatgttag 

 421 gatgcaagta tcgaagccgt gaacaggggc ggatgagtcc gtagtagtga agaaacgacg 

 481 gtaatagtcg tggagcaaag ggactcactt aatcaatcct aaaatgtagg tcaaccacag 

 541 atgggaggaa tctatgaaga aggagaaacc ctttcaaatt acaaaacgcg cagtgtttga 

 601 ggcatttaaa aaagtagaag ctaacaaagg tgctccaggc attgatgagg taacactcca 

 661 agaatatgaa aacaacttgg aagataacct ctataaatta tggaatagca tgtcttcagg 

 721 aagttacttt ccgcaagctg tacgcggtgt agaaattccc aagaaaaacg gaggagttag 

 781 agtcttaggt gttccaagta ttgacgatcg tatcgcacaa aatgttatgg tctcggagct 

 841 aaatcccaaa gtagaaccca ttttttatga ggattcatat gggtatagag agaataaaag 

 901 tgctatagat gctatcgaag tgacgagaaa acggtgttgg gagtatgact ggttaattga 

 961 gtttgacatc gtgggtctat tcgacaacat taatcacgac cttctcatga aagcagtgaa 

1021 gcaacataca aatgagaagt gggttatcct atatattgaa agaacactaa aagtgccgat 

1081 ggttatgtca gatggaatac acgtcgaaag aacaaaagga accccccaag gtggagtaat 

1141 cagcgcagtg cttgcaaatc tctttatgca ctatgctttt gaccactgga tgacacgaaa 

1201 acactcaaat aacccttggg tacgttacgc agatgatgga ttaatacaca gtcatagttt 

1261 aaaagaagca gaagtactct tgcttaaact aggagagcga tttaaagatt gccacctaga 

1321 aatccatcca aataaaacca agattattta ctgcaaagat gacaacagga aacaaaacca 

1381 tatccacacc aactttgatt ttctaggata tacattcaga gctcgtacgg tgaaaacaaa 

1441 gaagggaaat tatttcacag gattcacccc ggcagttagc aaaattgcca taaagagctt 

1501 catggataaa attaagaaac taagaaagaa ttcctatctg tctttaaccg aattagcaga 

1561 gaaaatgaac cctattattc gaggttgggc taactatttc gataaattta cagcaagtaa 

1621 agttcatgca attctatcgt gtgtcaattt gagtcttgcc agatggatga tgaagaagtc 

1681 taaacgctat aaaaataaat ttatggcagc tttaaggaga ttaagctcta ttgcagaaga 

1741 aacaccatca ctgtttgtcc actggaagat gggcatatgt cccaagaact gattaagaag 

1801 agccggatga agggagactt tcaagtccag gttctgtgag agactcgggg gggcagttcc 

1861 ctgggtctac tcgac

3' end

[top]


[Intron and flanking sequences]

 

19321 gaggtcaata aaccctttga ggcattatct gtctatagaa gcaaggatat cgtcgaaaaa
19381 ggtttcggca accttaaaga gcgattgaat ttcaggagaa tgcaggtttc ttctgaatta
19441 tgcttgaacg gcaagctttt tattgaattt attgcactca tctacttatc ctatgtgaaa
19501 aagaggatgc aagatgcaga tctttttgat aagtggacgc tgcaaggatt attggatgaa
19561 ctcgatgtga ttgagtcatt tgaggcgcct ggacatggtc ggatactagg cgaggtcact
19621 gataaacaga aaaagctcta tgaagccctg gaggtggaac caccctcgtt ataaattgtc
19681 gagtagaccc agggaactgc ccccccgagt ctctcacaga acctggactt gaaagtctcc
19741 cttcatccgg ctcttcttaa tcagttcttg ggacatatgc ccatcttcca gtggacaaac
19801 agtgatggtg tttcttctgc aatagagctt aatctcctta aagctgccat aaatttattt
19861 ttatagcgtt tagacttctt catcatccat ctggcaagac tcaaattgac acacgataga
19921 attgcatgaa ctttacttgc tgtaaattta tcgaaatagt tagcccaacc tcgaataata
19981 gggttcattt tctctgctaa ttcggttaaa gacagatagg aattctttct tagtttctta
20041 attttatcca tgaagctctt tatggcaatt ttgctaactg ccggggtgaa tcctgtgaaa
20101 taatttccct tctttgtttt caccgtacga gctctgaatg tatatcctag aaaatcaaag
20161 ttggtgtgga tatggttttg tttcctgttg tcatctttgc agtaaataat cttggtttta
20221 tttggatgga tttctaggtg gcaatcttta aatcgctctc ctagtttaag caagagtact
20281 tctgcttctt ttaaactatg actgtgtatt aatccatcat ctgcgtaacg tacccaaggg
20341 ttatttgagt gttttcgtgt catccagtgg tcaaaagcat agtgcataaa gagatttgca
20401 agcactgcgc tgattactcc accttggggg gttccttttg ttctttcgac gtgtattcca
20461 tctgacataa ccatcggcac ttttagtgtt ctttcaatat ataggataac ccacttctca
20521 tttgtatgtt gcttcactgc tttcatgaga aggtcgtgat taatgttgtc gaatagaccc
20581 acgatgtcaa actcaattaa ccagtcatac tcccaacacc gttttctcgt cacttcgata
20641 gcatctatag cacttttatt ctctctatac ccatatgaat cctcataaaa aatgggttct
20701 actttgggat ttagctccga gaccataaca ttttgtgcga tacgatcgtc aatacttgga
20761 acacctaaga ctctaactcc tccgtttttc ttgggaattt ctacaccgcg tacagcttgc
20821 ggaaagtaac ttcctgaaga catgctattc cataatttat agaggttatc ttccaagttg
20881 ttttcatatt cttggagtgt tacctcatca atgcctggag cacctttgtt agcttctact
20941 tttttaaatg cctcaaacac tgcgcgtttt gtaatttgaa agggtttctc cttcttcata
21001 gattcctccc atctgtggtt gacctacatt ttaggattga ttaagtgagt ccctttgctc
21061 cacgactatt accgtcgttt cttcactact acggactcat ccgcccctgt tcacggcttc
21121 gatacttgca tcctaacatt tcttgtgctt ggattcttct cttaacatcc gtcaacaggt
21181 tctcctgttc catctaaaag cctaaactac gctcctgcaa cctatgcacc ggctgtcatc
21241 tggtcagtaa gcagatttct tccagactta tcccagcatt tcccgactac actggttttg
21301 acatgcattt ttcctgataa cgatgcttta atggttattc actttcgttc agcttcgcag
21361 ttcacacttg actgtttata cagccttttt cccataacgc tcaataccga agacattaat
21421 ctacagcacc tatgggcagt ttgaaacccg tatctgcata ccgatttcga aggaccgacc
21481 ttcatctttt agatagcatc atttcagacg ttttaaacct cctttctttg tcttactttg
21541 ttcaggacac ac
tcgggaat gcaggtaaca aatagctata catatacata aaactccttt
21601 aaatagcact taattctact atttaaagga gttgttaagt gcaaaagaaa tacatgttct
21661 ttcaaacaaa tataaatatg ataaatatcc aaagcaaaag tgtcacatat taagtaaaca
21721 atcggttggg tactcccctt cttttgtaca aactgctata gggattgttt aattaaatta
21781 aaagaatttt ttgcattttg tgcagctgca caagatagag ctccatcaat aataatagaa
21841 aatggaaaaa ttacgtttgt aaaagcgtat aatgcgtcaa aaactgcatc ctcataatca

[top]


[ORF sequence]

 

MKKEKPFQITKRAVFEAFKKVEANKGAPGIDEVTLQEYENNLEDNLYKLWNSMSSGSY

FPQAVRGVEIPKKNGGVRVLGVPSIDDRIAQNVMVSELNPKVEPIFYEDSYGYRENKS

AIDAIEVTRKRCWEYDWLIEFDIVGLFDNINHDLLMKAVKQHTNEKWVILYIERTLKV

PMVMSDGIHVERTKGTPQGGVISAVLANLFMHYAFDHWMTRKHSNNPWVRYADDGLIH

SHSLKEAEVLLLKLGERFKDCHLEIHPNKTKIIYCKDDNRKQNHIHTNFDFLGYTFRA

RTVKTKKGNYFTGFTPAVSKIAIKSFMDKIKKLRKNSYLSLTELAEKMNPIIRGWANY

FDKFTASKVHAILSCVNLSLARWMMKKSKRYKNKFMAALRRLSSIAEETPSLFVHWKM

GICPKN

[top]


[Secondary structure]

                                                   

[top]


| Introduction | Intron secondary structure | Intron ORF structure | Introns listed by organism | Bacterial intron fragmentsAlignment of insertion sequence |

| Phylogenetic tree of intron ORFs | How to find group II intron | Site map | Contact us |