Information for Sy.fu.I1 intron
Sequence from GenBank entry. Intron is identified in red. The ORF is identified in blue with start and stop codons underlined.
5' end
gtgcgcccgg
18901 cagggcgcgt tcacttggag gtgcaagtcc tctacaggcc cgacaggggg aactgttagc
18961 cggacggcaa gggtgtccat cgcgaggtgg aatctgaagg aagccgcagg caaagcgctg
19021 gctcgacgaa taggaaccgc atatgaggcg gtcatgccgg atgagagggc caatatcttc
19081 aaagtccgat acctgtacgg aaggcgtggg cgtaaatgcg gcgggcataa gcgtgaaggt
19141 gggtgcgcat tacccgggga ggtctgtcga tctgccttgt gctaccgctg tcgagaggcg
19201 gcgggaaggg tcgacagaag tcagcagagg gcatagtagg tccttcgacc ggactgaagg
19261 cccgaacgtg aagtgtgtga cgggagcctt gaatttcgat gaggaatgga gacgcggaca
19321 gccggactgg gacgcccggc acatcttcgg gggggagcgg agggaatccg caaggacccg
19381 ttgcttgtgc gtcaagcctc gcggcaagga gagatgactc ccgccagagg acaatgcagt
19441 tgatggaggc ggtggtcgaa cgcgagaaca tgtttggagc gcttcgccag gtggaggcga
19501 ataaaggttc ggcgggcgtg gacggagtaa gcgttgatgc cctcagagcc tgccttcgtg
19561 aacactggcc gcgcatcaag gaagaactgc ttgagggcag gtaccaaccg caacccgtgc
19621 gaaaggtgga aatacccaag ccgggcggga agggaatgcg acaattgggc attccaacgg
19681 tgatggaccg gctcatacag caggcgctca atcaggtaat gcaacccatc ttcgacccgg
19741 acttttccga gtcgagctac ggattccgac ccggtcgcag tgcgcaccag gccgtgctca
19801 gagcacggga atatgccgca acagaccggc ggtgggtcgt ggacatggac ctggagaagt
19861 tctttgaccg cgtcaatcac gatatcctga tggcgcggct cgcccgcaaa atagcggata
19921 ggagagttct tcaactcatc cgtcgctatc ttcaggcagg gtcgatggtc ggaggtgtcg
19981 tatcgccccg aacggaggga accccacagg ggggaccact ctcaccactt ctttccaaca
20041 tcttgcttga tgacttggat aaagaactgg agcaaagggg ccatgcgttt tgtcggtacg
20101 ctgacgactg caacatctat gtaaagtcaa ggcgcgcagg gcaaagggta ctggaaagtc
20161 tcacccggtt tcttgccaac agactgaagc tcaaggtcaa cgtggacaag agcgcggtag
20221 cgcgtccgtg ggtccgcaag ttcctgggct acagcatgac gttccataaa cggccaaggc
20281 tgagagtggc ccccgccgtg gtggatcgca tgaaagcgaa gctaagggaa cagtttcgga
20341 tgggccgagg acgcaatatc cgccgcgtta tcgaagagct gacacctgtt ttacgaggtt
20401 gggtgaacta tttccggctg tccgaggtga agggaaattt cgaggagctc gatgaatgga
20461 tacgccgcaa gtttcggtgc atcatctggc gacagtggaa aaggacctac actcgggcaa
20521 agaacatgat gaagtgcggt ttgggggaag agagggcttg gcgatcagcc aagaatcaga
20581 gaggcccctg gtggaactcc ggagcctcac acatgaacca atgtttcccc aaacgcttct
20641 ttgaacgact tggactcgtg tcactgctaa gtcagctacg aagacttcaa tgtacttcat
20701 gaaccgccgt gtacggaacc gtacgcacgg tggtgtggga ggacggggaa ggcgacttcc
20761 cctcctaccc gat
3' end
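
The listing above uses a GenBank-style layout: a leading coordinate followed by blocks of ten bases. Recovering the bare intron sequence therefore means dropping the coordinates, the spaces, and the "5' end" / "3' end" labels. The following is a minimal Python sketch of that step, assuming the listing has been copied into a string. The function name `join_numbered_sequence` is hypothetical and not part of the database.

```python
import re

def join_numbered_sequence(text: str) -> str:
    """Collapse GenBank-style numbered lines into one lowercase base string.

    Assumption: any whitespace-separated token made only of a/c/g/t/n is
    sequence; a leading integer on a line is a coordinate and is dropped.
    Label words that happen to use only those letters would be misread,
    so check the input before relying on the result.
    """
    blocks = []
    for line in text.splitlines():
        tokens = line.split()
        if tokens and tokens[0].isdigit():
            tokens = tokens[1:]  # drop the line-start coordinate
        blocks.extend(
            t.lower() for t in tokens if re.fullmatch(r"[acgtnACGTN]+", t)
        )
    return "".join(blocks)

# First two sequence lines of the intron listing, copied from this page.
example = """5' end
gtgcgcccgg
18901 cagggcgcgt tcacttggag gtgcaagtcc tctacaggcc cgacaggggg aactgttagc
"""

seq = join_numbered_sequence(example)
print(len(seq))   # 70
print(seq[:20])   # gtgcgcccggcagggcgcgt
```

Matching whole tokens, rather than stripping digits and spaces globally, keeps words such as "end" out of the result.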
[Intron and flanking sequence]
18541 tcgggatacc gtcggactcg gtgtcgattc ggtcatgaac cctgtgttct ggctggcgag
18601 cacgcctgtg agcgcagggg tgaaggccgg gaaaacggtc aacagcatgt cgttgaggat
18661 aggcgagtac gaggatttca agaaatcggc cctcgatccc tacatctcca tgcgcgaggc
18721 ttacacgcag tatcgtgcgg aggagatggc gaagtagtgt gccccgcggc gccgggcttt
18781 ggcggcccgc ggcgtcgatg gcgctgtggg acggggttgc cgcggcaggg tgatgccgcg
18841 aacgaaggtg catgaaagaa gggcgggatg atccccgccc ttttcttttg gtgcgcccgg
18901 cagggcgcgt tcacttggag gtgcaagtcc tctacaggcc cgacaggggg aactgttagc
18961 cggacggcaa gggtgtccat cgcgaggtgg aatctgaagg aagccgcagg caaagcgctg
19021 gctcgacgaa taggaaccgc atatgaggcg gtcatgccgg atgagagggc caatatcttc
19081 aaagtccgat acctgtacgg aaggcgtggg cgtaaatgcg gcgggcataa gcgtgaaggt
19141 gggtgcgcat tacccgggga ggtctgtcga tctgccttgt gctaccgctg tcgagaggcg
19201 gcgggaaggg tcgacagaag tcagcagagg gcatagtagg tccttcgacc ggactgaagg
19261 cccgaacgtg aagtgtgtga cgggagcctt gaatttcgat gaggaatgga gacgcggaca
19321 gccggactgg gacgcccggc acatcttcgg gggggagcgg agggaatccg caaggacccg
19381 ttgcttgtgc gtcaagcctc gcggcaagga gagatgactc ccgccagagg acaatgcagt
19441 tgatggaggc ggtggtcgaa cgcgagaaca tgtttggagc gcttcgccag gtggaggcga
19501 ataaaggttc ggcgggcgtg gacggagtaa gcgttgatgc cctcagagcc tgccttcgtg
19561 aacactggcc gcgcatcaag gaagaactgc ttgagggcag gtaccaaccg caacccgtgc
19621 gaaaggtgga aatacccaag ccgggcggga agggaatgcg acaattgggc attccaacgg
19681 tgatggaccg gctcatacag caggcgctca atcaggtaat gcaacccatc ttcgacccgg
19741 acttttccga gtcgagctac ggattccgac ccggtcgcag tgcgcaccag gccgtgctca
19801 gagcacggga atatgccgca acagaccggc ggtgggtcgt ggacatggac ctggagaagt
19861 tctttgaccg cgtcaatcac gatatcctga tggcgcggct cgcccgcaaa atagcggata
19921 ggagagttct tcaactcatc cgtcgctatc ttcaggcagg gtcgatggtc ggaggtgtcg
19981 tatcgccccg aacggaggga accccacagg ggggaccact ctcaccactt ctttccaaca
20041 tcttgcttga tgacttggat aaagaactgg agcaaagggg ccatgcgttt tgtcggtacg
20101 ctgacgactg caacatctat gtaaagtcaa ggcgcgcagg gcaaagggta ctggaaagtc
20161 tcacccggtt tcttgccaac agactgaagc tcaaggtcaa cgtggacaag agcgcggtag
20221 cgcgtccgtg ggtccgcaag ttcctgggct acagcatgac gttccataaa cggccaaggc
20281 tgagagtggc ccccgccgtg gtggatcgca tgaaagcgaa gctaagggaa cagtttcgga
20341 tgggccgagg acgcaatatc cgccgcgtta tcgaagagct gacacctgtt ttacgaggtt
20401 gggtgaacta tttccggctg tccgaggtga agggaaattt cgaggagctc gatgaatgga
20461 tacgccgcaa gtttcggtgc atcatctggc gacagtggaa aaggacctac actcgggcaa
20521 agaacatgat gaagtgcggt ttgggggaag agagggcttg gcgatcagcc aagaatcaga
20581 gaggcccctg gtggaactcc ggagcctcac acatgaacca atgtttcccc aaacgcttct
20641 ttgaacgact tggactcgtg tcactgctaa gtcagctacg aagacttcaa tgtacttcat
20701 gaaccgccgt gtacggaacc gtacgcacgg tggtgtggga ggacggggaa ggcgacttcc
20761 cctcctaccc gatggtgtgc ccgtcagatg tagacaaggg gaaggctttg tgccctccca
20821 cccagtgcgc tcatcttgaa tgcaaaattg tttcagtcaa ccgggtcttc gcccgggacc
20881 ggcttttccg gtctctttgc ctcgaaatgc gagcaaatgg cgttcgccag ccctggaatg
20941 gtgtattcgg cgggcatgat gtggactttc aggccgtgtt tcacggcggt ctgcgcggtg
21001 accggtccga tgcaggcgat tgcgacccct tcgaggagcg gcagaatctc gccccggtca
21061 aagagactga agaaattgga gacggtggac gaggaggtga aggtgaggca atgaatttcg
21121 gccttccgga agcgttcggc aatttccggg ccgcgctccc tcgggatgac ggtccgatag
MRNGDADSRTGTPGTSSGGSGGNPQGPVACASSLAARRDDSRQRTMQLMEAVVERENM
FGALRQVEANKGSAGVDGVSVDALRACLREHWPRIKEELLEGRYQPQPVRKVEIPKPG
GKGMRQLGIPTVMDRLIQQALNQVMQPIFDPDFSESSYGFRPGRSAHQAVLRAREYAA
TDRRWVVDMDLEKFFDRVNHDILMARLARKIADRRVLQLIRRYLQAGSMVGGVVSPRT
EGTPQGGPLSPLLSNILLDDLDKELEQRGHAFCRYADDCNIYVKSRRAGQRVLESLTR
FLANRLKLKVNVDKSAVARPWVRKFLGYSMTFHKRPRLRVAPAVVDRMKAKLREQFRM
GRGRNIRRVIEELTPVLRGWVNYFRLSEVKGNFEELDEWIRRKFRCIIWRQWKRTYTR
AKNMMKCGLGEERAWRSAKNQRGPWWNSGASHMNQCFPKRFFERLGLVSLLSQLRRLQ
CTS
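
The amino acid sequence above can be checked against the nucleotide listing by translating from the atg that spans the fourth and fifth blocks of line 19261 through to the tga that starts at the end of line 20641. The sketch below does this for the first and last few codons only. It assumes the standard codon assignments, since the page does not state which translation table was used. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments, listed in t/c/a/g order for each codon position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    dna = dna.lower()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the ORF
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the ORF (from lines 19261 and 19321 of the listing).
print(translate("atgaggaatggagacgcggacagc"))  # MRNGDADS

# Last four codons plus the stop codon (lines 20641 and 20701).
print(translate("caatgtacttcatga"))           # QCTS
```

Both outputs agree with the start (MRNGDADS...) and end (...QCTS) of the ORF translation shown above.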