CFG603
Library CF
(Link to library)
Clone ID CFG603
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U12859-1|Contig-U16301-1
Original site URL
Representative seq. ID CFG603P
(Link to Original site)
Representative DNA sequence
>CFG603 (CFG603Q) /CSM/CF/CFG6-A/CFG603Q.Seq.d/
ATTATAATGTCTGTAACTTTAAAAAATATTATTGCACCAACACCAGCAACTACTCGTGGT
AAATNAGTAGCAATTAATGGTGACCCAAAGGGTGAAAATATTGTATATGCAAGTGGTAGC
AGTATTATTATTANGAAATGTAAAGAATCCAATGGTAGCAGATATTTACTATGAACATCC
ATGTCAAACCACTGTTGCAAAGTATGCACCAAGTGGTAATTATATTGCAAGTGGTGATGT
TCAAGGTAATTTACGTATTTGGGATACATTACAAAAGGAACATATCTTAAAAGCAACTTA
CAAAGTATTGAATGGTGCAATCCTTGATATTGCATGGACATCAGATAATCAACGTTTAGT
TGTAGTTGGTGATGGTAAAGAGAGATTTGGTGCAGCCATCCTTTGGGATAGTGGTTCATC
ATGTGGTGAAATCACTGGTCACTCAAAGATGATTCTCTCTTGTGACATTAAATCAACTCG
TCCATTCAGAGCCGCCACTGGTAGTGAAGATTTTGCAGTCAATTGGTTCGAAGGTCCACC
CATTCAAATTCCAAAAGAAXXXXXXXXXXTGATCAACTCATTACTTGCGCTATGGATGAT
AGTGTTAAAATCTCAAGTATCTCTAAAAAAACCTATGGTGAAAGTATTGGTGTAGATAGT
CCAGCTCAAGCTGTTGCTTTCTCTGGTGATGTTGTCGTTGCTGTCTCAATGAAAACCATT
TATGTCATTAAAGGTGGTAAAATCGTTTCACAAACTGCTGCCACTTGGGAACCAACCTCT
GTTGCCATCAATGATACTGAAGTTTCAGTCGGTGGTAAAGATAACAAGATTCACGTTTTC
ACTCTCAGTGGTAACAATTTAACTGCCAGTCATACTTTAGATAATCATCGTGGTGCTATT
ACCGATCTTTCATACTCACCATGTGGTAAATATTTAGCTTCAGGTTGTTCAAATCGTGAA
GTTATCGTTTGGAGTGGTAAAGAAGCCAAATCTAAAGGTTGGGTTAACCATACCGCTCGT
ATTAATGCTGTCGCTTGGTCTAATGATTCTAAATTTGTTGCCTCTGCTTCACTCGATTCT
CAAATTTACATTTGGAATGTTGAAAATCCAACCGCTTCACCAGTTCAAGTTAAAAACTCT
CATTTAGGTGGTGTCAATGATGTTATTTAGGTTCAAACAANAAATTTTCTCTGCAGGTAA
NAAGGGCNATAAAATT
sequence update 2001. 6. 1
Translated Amino Acid sequence
l*cl*l*killhqhqqllvvnx*QLMVTQRVKILYMQVVAVLLLXNVKNPMVADIYYEHP
CQTTVAKYAPSGNYIASGDVQGNLRIWDTLQKEHILKATYKVLNGAILDIAWTSDNQRLV
VVGDGKERFGAAILWDSGSSCGEITGHSKMILSCDIKSTRPFRAATGSEDFAVNWFEGPP
IQIPKE---

---DQLITCAMDDSVKISSISKKTYGESIGVDSPAQAVAFSGDVVVAVSMKTIYVIKGGK
IVSQTAATWEPTSVAINDTEVSVGGKDNKIHVFTLSGNNLTASHTLDNHRGAITDLSYSP
CGKYLASGCSNREVIVWSGKEAKSKGWVNHTARINAVAWSNDSKFVASASLDSQIYIWNV
ENPTASPVQVKNSHLGGVNDVI*vqtxnflcr*xgx*n


Translated Amino Acid sequence (All Frames)
Frame A:
iimsvtlkniiaptpattrgkxvaingdpkgenivyasgssiiixkckesngsryll*ts
msnhcckvctkw*lyckw*csr*ftylgyitkgtylksnlqsiewcnp*ycmdir*stfs
csw*w*reiwcshplg*wfimw*nhwslkddsll*h*inssiqsrhw**rfcsqlvrrst
hsnskr---

---*sthylryg**c*nlkyl*knlw*kywcr*ssssccflw*ccrcclnenhlch*rw*
nrftncchlgtnlcchq*y*sfsrw*r*qdsrfhsqw*qfncqsyfr*sswcyyrsfilt
mw*ifsfrlfks*syrlew*rsqi*rlg*pyrsy*ccrlv**f*icclcftrfsnlhlec
*ksnrftsss*klsfrwcq*cylgsnxkfslqvxraik

Frame B:
l*cl*l*killhqhqqllvvnx*QLMVTQRVKILYMQVVAVLLLXNVKNPMVADIYYEHP
CQTTVAKYAPSGNYIASGDVQGNLRIWDTLQKEHILKATYKVLNGAILDIAWTSDNQRLV
VVGDGKERFGAAILWDSGSSCGEITGHSKMILSCDIKSTRPFRAATGSEDFAVNWFEGPP
IQIPKE---

---DQLITCAMDDSVKISSISKKTYGESIGVDSPAQAVAFSGDVVVAVSMKTIYVIKGGK
IVSQTAATWEPTSVAINDTEVSVGGKDNKIHVFTLSGNNLTASHTLDNHRGAITDLSYSP
CGKYLASGCSNREVIVWSGKEAKSKGWVNHTARINAVAWSNDSKFVASASLDSQIYIWNV
ENPTASPVQVKNSHLGGVNDVI*vqtxnflcr*xgx*n

Frame C:
ynvcnfkkyyctntsnysw*xssn*w*pkg*kycickw*qyyyxem*riqw*qiftmnih
vkpllqsmhqvviilqvvmfkviyvfgihykrnis*kqltky*mvqslilhghqiinv*l
*lvmvkrdlvqpsfgivvhhvvkslvtqr*fslvtlnqlvhsepplvvkilqsigskvhp
fkfqk---

---insllalwmivlksqvslkkpmvkvlv*ivqlklllslvmlsllsq*kpfmslkvvk
sfhkllplgnqpllpsmilkfqsvvkitrftfslsvvti*lpvil*iiivvllpifhthh
vvni*lqvvqivklsfgvvkkpnlkvgltiplvlmlslglmilnllpllhsilkftfgml
kiqplhqfklktli*vvsmmlfrfkqxifsagxkgxki

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

CFG603 (CFG603Q) /CSM/CF/CFG6-A/CFG603Q.Seq.d/ 2280 0.0
VHN440 (VHN440Q) /CSM/VH/VHN4-B/VHN440Q.Seq.d/ 1223 0.0
CFH889 (CFH889Q) /CSM/CF/CFH8-D/CFH889Q.Seq.d/ 1223 0.0
AFI778 (AFI778Q) /CSM/AF/AFI7-D/AFI778Q.Seq.d/ 1223 0.0
CFA233 (CFA233Q) /CSM/CF/CFA2-B/CFA233Q.Seq.d/ 1215 0.0
AFB543 (AFB543Q) /CSM/AF/AFB5-B/AFB543Q.Seq.d/ 1215 0.0
SFH408 (SFH408Q) /CSM/SF/SFH4-A/SFH408Q.Seq.d/ 1211 0.0
CFC301 (CFC301Q) /CSM/CF/CFC3-A/CFC301Q.Seq.d/ 1211 0.0
VHO132 (VHO132Q) /CSM/VH/VHO1-B/VHO132Q.Seq.d/ 1207 0.0
CFF734 (CFF734Q) /CSM/CF/CFF7-B/CFF734Q.Seq.d/ 1207 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

U36936|U36936.1 Dictyostelium discoideum WD40 repeat protein 2 mRNA, compete cds. 1067 0.0 3
AF045082|AF045082.1 Drosophila sturtevanti 14045-0901.0 cytochrome oxidase II (COII) gene, mitochondrial gene encoding mitochondrial protein, complete cds. 38 0.27 2
AF045083|AF045083.1 Drosophila sturtevanti 14043-0871.2 cytochrome oxidase II (COII) gene, mitochondrial gene encoding mitochondrial protein, complete cds. 38 0.27 2
AF045084|AF045084.1 Drosophila sturtevanti 14043-0871.9 cytochrome oxidase II (COII) gene, mitochondrial gene encoding mitochondrial protein, complete cds. 38 0.27 2
AF461281|AF461281.1 Drosophila eugracilis cytochrome oxidase II (COII) gene, partial cds; mitochondrial gene for mitochondrial product. 38 0.58 2
AY162976|AY162976.1 Drosophila polymorpha isolate H48 cytochrome oxidase subunit II (COII) gene, partial cds; mitochondrial gene for mitochondrial product. 38 0.70 2
AF423987|AF423987.1 Apteropanorpa evansi cytochrome oxidase subunit 2 (COII) gene, partial cds; mitochondrial gene for mitochondrial product. 38 0.95 2
AF474079|AF474079.1 Drosophila eugracilis cytochrome oxidase II (COII) gene, partial cds; mitochondrial gene for mitochondrial product. 38 1.0 2
AF045085|AF045085.1 Drosophila subsaltans cytochrome oxidase II (COII) gene, mitochondrial gene encoding mitochondrial protein, complete cds. 38 1.0 2
AF045093|AF045093.1 Drosophila emarginata 14042-0841.7 cytochrome oxidase II (COII) gene, mitochondrial gene encoding mitochondrial protein, complete cds. 38 1.0 2
dna update 2004. 1.30
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

U36936_1(U36936|pid:none) Dictyostelium discoideum WD40 repeat p... 382 e-104
(P90587) RecName: Full=66 kDa stress protein; AltName: Full=p66;... 209 1e-63
BT045059_1(BT045059|pid:none) Salmo salar clone ssal-rgf-510-334... 183 4e-51
AY394939_1(AY394939|pid:none) Danio rerio clone RK046A1C09 WD re... 182 9e-51
(O93277) RecName: Full=WD repeat-containing protein 1; AltName: ... 185 2e-50
AB084171_1(AB084171|pid:none) Cavia porcellus WDR1 mRNA for WD r... 184 5e-50
(Q5RKI0) RecName: Full=WD repeat-containing protein 1; &AY98648... 184 8e-50
(O75083) RecName: Full=WD repeat-containing protein 1; AltName: ... 184 8e-50
AK004858_1(AK004858|pid:none) Mus musculus adult male liver cDNA... 185 1e-49
CR860191_1(CR860191|pid:none) Pongo abelii mRNA; cDNA DKFZp469E0... 183 1e-49
protein update 2009. 5.14
PSORT

psg: 0.94 gvh: 0.44 alm: 0.31 top: 0.30 tms: 0.07 mit: 0.34 mip: 0.03
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 1.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 1.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 0.00

24.0 %: endoplasmic reticulum
20.0 %: cytoplasmic
20.0 %: mitochondrial
16.0 %: nuclear
8.0 %: Golgi
4.0 %: plasma membrane
4.0 %: vesicles of secretory system
4.0 %: extracellular, including cell wall

>> prediction for CFG603 is end

5' end seq. ID CFG603F
5' end seq.
>CFG603F.Seq
ATTATAATGTCTGTAACTTTAAAAAATATTATTGCACCAACACCAGCAACTACTCGTGGT
AAATNAGTAGCAATTAATGGTGACCCAAAGGGTGAAAATATTGTATATGCAAGTGGTAGC
AGTATTATTATTANGAAATGTAAAGAATCCAATGGTAGCAGATATTTACTATGAACATCC
ATGTCAAACCACTGTTGCAAAGTATGCACCAAGTGGTAATTATATTGCAAGTGGTGATGT
TCAAGGTAATTTACGTATTTGGGATACATTACAAAAGGAACATATCTTAAAAGCAACTTA
CAAAGTATTGAATGGTGCAATCCTTGATATTGCATGGACATCAGATAATCAACGTTTAGT
TGTAGTTGGTGATGGTAAAGAGAGATTTGGTGCAGCCATCCTTTGGGATAGTGGTTCATC
ATGTGGTGAAATCACTGGTCACTCAAAGATGATTCTCTCTTGTGACATTAAATCAACTCG
TCCATTCAGAGCCGCCACTGGTAGTGAAGATTTTGCAGTCAATTGGTTCGAAGGTCCACC
CATTCAAATTCCAAAAGAA----------
Length of 5' end seq. 559
3' end seq. ID CFG603Z
3' end seq.
>CFG603Z.Seq
----------TGATCAACTCATTACTTGCGCTATGGATGATAGTGTTAAAATCTCAAGTA
TCTCTAAAAAAACCTATGGTGAAAGTATTGGTGTAGATAGTCCAGCTCAAGCTGTTGCTT
TCTCTGGTGATGTTGTCGTTGCTGTCTCAATGAAAACCATTTATGTCATTAAAGGTGGTA
AAATCGTTTCACAAACTGCTGCCACTTGGGAACCAACCTCTGTTGCCATCAATGATACTG
AAGTTTCAGTCGGTGGTAAAGATAACAAGATTCACGTTTTCACTCTCAGTGGTAACAATT
TAACTGCCAGTCATACTTTAGATAATCATCGTGGTGCTATTACCGATCTTTCATACTCAC
CATGTGGTAAATATTTAGCTTCAGGTTGTTCAAATCGTGAAGTTATCGTTTGGAGTGGTA
AAGAAGCCAAATCTAAAGGTTGGGTTAACCATACCGCTCGTATTAATGCTGTCGCTTGGT
CTAATGATTCTAAATTTGTTGCCTCTGCTTCACTCGATTCTCAAATTTACATTTGGAATG
TTGAAAATCCAACCGCTTCACCAGTTCAAGTTAAAAACTCTCATTTAGGTGGTGTCAATG
ATGTTATTTAGGTTCAAACAANAAATTTTCTCTGCAGGTAANAAGGGCNATAAAATT
Length of 3' end seq. 647
Connected seq. ID CFG603P
Connected seq.
>CFG603P.Seq
ATTATAATGTCTGTAACTTTAAAAAATATTATTGCACCAACACCAGCAACTACTCGTGGT
AAATNAGTAGCAATTAATGGTGACCCAAAGGGTGAAAATATTGTATATGCAAGTGGTAGC
AGTATTATTATTANGAAATGTAAAGAATCCAATGGTAGCAGATATTTACTATGAACATCC
ATGTCAAACCACTGTTGCAAAGTATGCACCAAGTGGTAATTATATTGCAAGTGGTGATGT
TCAAGGTAATTTACGTATTTGGGATACATTACAAAAGGAACATATCTTAAAAGCAACTTA
CAAAGTATTGAATGGTGCAATCCTTGATATTGCATGGACATCAGATAATCAACGTTTAGT
TGTAGTTGGTGATGGTAAAGAGAGATTTGGTGCAGCCATCCTTTGGGATAGTGGTTCATC
ATGTGGTGAAATCACTGGTCACTCAAAGATGATTCTCTCTTGTGACATTAAATCAACTCG
TCCATTCAGAGCCGCCACTGGTAGTGAAGATTTTGCAGTCAATTGGTTCGAAGGTCCACC
CATTCAAATTCCAAAAGAA----------TGATCAACTCATTACTTGCGCTATGGATGAT
AGTGTTAAAATCTCAAGTATCTCTAAAAAAACCTATGGTGAAAGTATTGGTGTAGATAGT
CCAGCTCAAGCTGTTGCTTTCTCTGGTGATGTTGTCGTTGCTGTCTCAATGAAAACCATT
TATGTCATTAAAGGTGGTAAAATCGTTTCACAAACTGCTGCCACTTGGGAACCAACCTCT
GTTGCCATCAATGATACTGAAGTTTCAGTCGGTGGTAAAGATAACAAGATTCACGTTTTC
ACTCTCAGTGGTAACAATTTAACTGCCAGTCATACTTTAGATAATCATCGTGGTGCTATT
ACCGATCTTTCATACTCACCATGTGGTAAATATTTAGCTTCAGGTTGTTCAAATCGTGAA
GTTATCGTTTGGAGTGGTAAAGAAGCCAAATCTAAAGGTTGGGTTAACCATACCGCTCGT
ATTAATGCTGTCGCTTGGTCTAATGATTCTAAATTTGTTGCCTCTGCTTCACTCGATTCT
CAAATTTACATTTGGAATGTTGAAAATCCAACCGCTTCACCAGTTCAAGTTAAAAACTCT
CATTTAGGTGGTGTCAATGATGTTATTTAGGTTCAAACAANAAATTTTCTCTGCAGGTAA
NAAGGGCNATAAAATT
Length of connected seq. 1206
Full length Seq ID -
Full length Seq. -
Length of full length seq. -