CFG102
Library CF
Clone ID CFG102
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U12859-1|Contig-U16301-1
Original site URL
Representative seq. ID CFG102P
Representative DNA sequence
>CFG102 (CFG102Q) /CSM/CF/CFG1-A/CFG102Q.Seq.d/
GTATTTATATATTTATTTATTTATATAAATAAACTACACATTATAATGTCTGTAACTTTA
AAAAATATTATTGCACCAACACCAGCAACTACTCGTGGTAAATCAGTAGCAATTAATGGT
GACCCAAAGGGTGAAAATATTGTATATGCAAGTGGTAGCAGTATTATTATTAGAAATGTA
AAGAATCCAATGGTAGCAGATATTTACTATGAACATCCATGTCAAACCACTGTTGCAAAG
TATGCACCAAGTGGTAATTATATTGCAAGTGGTGATGTTCAAGGTAATTTACGTATTTGG
GATACATTACAAAAGGAACATATCTTAAAAGCAACTTACAAAGTATTGAATGGTGCAATC
CTTGATATTGCATGGACATCAGATAATCAACGTTTAGTTGTAGTTGGTGATGGTAAAGAG
AGATTTGGTGCAGCCATCCTTTGGGATAGTGGTTCATCATGTGGTGAAATCACTGGTCAC
TCAAAGATGATTCTCTCTTGTGACATTAAATCAACTCGTCCATTCAGAGCCGCCACTGGT
AGTGAAGATTTTGCAGTCAATTGGTTCGAAGGTCCACCATTCAAATTCCAAAAGAATATT
GCCGCCGGTGATTTCACTCGTTTCGTXXXXXXXXXXATCTCTAAAAAAACCTATGGTGAA
AGTATTGGTGTAGATAGTCCAGCTCAAGCTGTTGCTTTCTCTGGTGATGTTGTCGTTGCT
GTCTCAATGAAAACCATTTATGTCATTAAAGGTGGTAAAATCGTTTCACAAACTGCTGCC
ACTTGGGAACCAACCTCTGTTGCCATCAATGATACTGAAGTTTCAGTCGGTGGTAAAGAT
AACAAGATTCACGTTTTCACTCTCAGTGGTAACAATTTAACTGCCAGTCATACTTTAGAT
AATCATCGTGGTGCTATTACCGATCTTTCATACTCACCATGTGGTAAATATTTAGCTTCA
GGTTGTTCAAATCGTGAAGTTATCGTTTGGAGTGGTAAAGAAGCCAAATCTAAAGGTTGG
GTTAACCATACCGCTCGTATTAATGCTGTCGCTTGGTCTAATGATTCTAAATTTGTTGCC
TCTGCTTCACTCGATTCTCAAATTTACATTTGGAATGTTGAAAATCCAACCGCTTCACCA
GTTCAAGTTAAAAACTCTCATTTAGGTGGTGTCAATGATGTTATTTATGGTTCAAACAAT
AAATTTTCTCTGCAGGTAANAAGGGCNATAAAATTNGAATGTATCAAATAAAAAAAAATT
T
sequence update 2001. 6. 1
Translated Amino Acid sequence
VFIYLFIYINKLHIIMSVTLKNIIAPTPATTRGKSVAINGDPKGENIVYASGSSIIIRNV
KNPMVADIYYEHPCQTTVAKYAPSGNYIASGDVQGNLRIWDTLQKEHILKATYKVLNGAI
LDIAWTSDNQRLVVVGDGKERFGAAILWDSGSSCGEITGHSKMILSCDIKSTRPFRAATG
SEDFAVNWFEGPPFKFQKNIAAGDFTRF---

---ISKKTYGESIGVDSPAQAVAFSGDVVVAVSMKTIYVIKGGKIVSQTAATWEPTSVAI
NDTEVSVGGKDNKIHVFTLSGNNLTASHTLDNHRGAITDLSYSPCGKYLASGCSNREVIV
WSGKEAKSKGWVNHTARINAVAWSNDSKFVASASLDSQIYIWNVENPTASPVQVKNSHLG
GVNDVIYGSNNKFSLQVXRAIKXECIK*kki


Translated Amino Acid sequence (All Frames)
Frame A:
VFIYLFIYINKLHIIMSVTLKNIIAPTPATTRGKSVAINGDPKGENIVYASGSSIIIRNV
KNPMVADIYYEHPCQTTVAKYAPSGNYIASGDVQGNLRIWDTLQKEHILKATYKVLNGAI
LDIAWTSDNQRLVVVGDGKERFGAAILWDSGSSCGEITGHSKMILSCDIKSTRPFRAATG
SEDFAVNWFEGPPFKFQKNIAAGDFTRF---

---ISKKTYGESIGVDSPAQAVAFSGDVVVAVSMKTIYVIKGGKIVSQTAATWEPTSVAI
NDTEVSVGGKDNKIHVFTLSGNNLTASHTLDNHRGAITDLSYSPCGKYLASGCSNREVIV
WSGKEAKSKGWVNHTARINAVAWSNDSKFVASASLDSQIYIWNVENPTASPVQVKNSHLG
GVNDVIYGSNNKFSLQVXRAIKXECIK*kki

Frame B:
ylyiylfi*inytl*cl*l*killhqhqqllvvnq*qlmvtqrvkilymqvvavlllem*
riqw*qiftmnihvkpllqsmhqvviilqvvmfkviyvfgihykrnis*kqltky*mvqs
lilhghqiinv*l*lvmvkrdlvqpsfgivvhhvvkslvtqr*fslvtlnqlvhsepplv
vkilqsigskvhhsnskrilppvislvs---

---slkkpmvkvlv*ivqlklllslvmlsllsq*kpfmslkvvksfhkllplgnqpllps
milkfqsvvkitrftfslsvvti*lpvil*iiivvllpifhthhvvni*lqvvqivklsf
gvvkkpnlkvgltiplvlmlslglmilnllpllhsilkftfgmlkiqplhqfklktli*v
vsmmlfmvqtinflcr*xgx*nxnvsnkkkf

Frame C:
iyifiylyk*tthynvcnfkkyyctntsnysw*issn*w*pkg*kycickw*qyyy*kck
esngsryll*tsmsnhcckvctkw*lyckw*csr*ftylgyitkgtylksnlqsiewcnp
*ycmdir*stfscsw*w*reiwcshplg*wfimw*nhwslkddsll*h*inssiqsrhw*
*rfcsqlvrrstiqipkeycrr*fhsfr---

---l*knlw*kywcr*ssssccflw*ccrcclnenhlch*rw*nrftncchlgtnlcchq
*y*sfsrw*r*qdsrfhsqw*qfncqsyfr*sswcyyrsfiltmw*ifsfrlfks*syrl
ew*rsqi*rlg*pyrsy*ccrlv**f*icclcftrfsnlhlec*ksnrftsss*klsfrw
cq*cylwfkq*ifsagxkgxkixmyqikkn
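
The three forward frames listed above can be reproduced approximately with a short script. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the rule that any codon containing a character other than A, C, G, or T becomes `X` are illustrative choices, not necessarily what the database pipeline does. It translates only the first line of the representative sequence and prints uppercase for every frame.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0):
    """Translate dna starting at offset (0, 1, or 2).

    Codons not found in the table (for example those containing N, X,
    or '-') are reported as 'X'. This is an assumption of the sketch.
    """
    dna = dna.upper()
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


# First 60 bases of the representative sequence from this record.
first_line = "GTATTTATATATTTATTTATTTATATAAATAAACTACACATTATAATGTCTGTAACTTTA"

for name, offset in (("A", 0), ("B", 1), ("C", 2)):
    print(f"Frame {name}: {translate(first_line, offset)}")
```

Frame A corresponds to offset 0, Frame B to offset 1, and Frame C to offset 2 of the representative DNA sequence.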

Homology vs CSM-cDNA

Sequences producing significant alignments: Score (bits) E Value

SFH157 (SFH157Q) /CSM/SF/SFH1-C/SFH157Q.Seq.d/ 1189 0.0
CFH883 (CFH883Q) /CSM/CF/CFH8-D/CFH883Q.Seq.d/ 1189 0.0
CFG102 (CFG102Q) /CSM/CF/CFG1-A/CFG102Q.Seq.d/ 1189 0.0
AFF332 (AFF332Q) /CSM/AF/AFF3-B/AFF332Q.Seq.d/ 1189 0.0
CFA233 (CFA233Q) /CSM/CF/CFA2-B/CFA233Q.Seq.d/ 1178 0.0
SHJ331 (SHJ331Q) /CSM/SH/SHJ3-B/SHJ331Q.Seq.d/ 1174 0.0
SFI277 (SFI277Q) /CSM/SF/SFI2-D/SFI277Q.Seq.d/ 1152 0.0
CFE749 (CFE749Q) /CSM/CF/CFE7-C/CFE749Q.Seq.d/ 1138 0.0
AFI778 (AFI778Q) /CSM/AF/AFI7-D/AFI778Q.Seq.d/ 1134 0.0
CFH889 (CFH889Q) /CSM/CF/CFH8-D/CFH889Q.Seq.d/ 1132 0.0

own update 2004.12.25
Homology vs DNA

Sequences producing significant alignments: Score (bits) E Value N

U36936|U36936.1 Dictyostelium discoideum WD40 repeat protein 2 mRNA, complete cds. 1189 0.0 2
AF045085|AF045085.1 Drosophila subsaltans cytochrome oxidase II (COII) gene, mitochondrial gene encoding mitochondrial protein, complete cds. 38 0.044 3
AF474077|AF474077.1 Drosophila ananassae cytochrome oxidase II (COII) gene, partial cds; mitochondrial gene for mitochondrial product. 34 0.16 3
AF045084|AF045084.1 Drosophila sturtevanti 14043-0871.9 cytochrome oxidase II (COII) gene, mitochondrial gene encoding mitochondrial protein, complete cds. 38 0.30 2
AF045083|AF045083.1 Drosophila sturtevanti 14043-0871.2 cytochrome oxidase II (COII) gene, mitochondrial gene encoding mitochondrial protein, complete cds. 38 0.30 2
AF045082|AF045082.1 Drosophila sturtevanti 14045-0901.0 cytochrome oxidase II (COII) gene, mitochondrial gene encoding mitochondrial protein, complete cds. 38 0.30 2
AF461281|AF461281.1 Drosophila eugracilis cytochrome oxidase II (COII) gene, partial cds; mitochondrial gene for mitochondrial product. 38 0.63 2
AC127566|AC127566.3 Mus musculus BAC clone RP24-187F3 from chromosome 5, complete sequence. 38 0.76 7
AY162976|AY162976.1 Drosophila polymorpha isolate H48 cytochrome oxidase subunit II (COII) gene, partial cds; mitochondrial gene for mitochondrial product. 38 0.77 2
AF423987|AF423987.1 Apteropanorpa evansi cytochrome oxidase subunit 2 (COII) gene, partial cds; mitochondrial gene for mitochondrial product. 38 1.0 2
dna update 2004. 3. 6
Homology vs Protein

Sequences producing significant alignments: Score (bits) E Value

U36936_1(U36936|pid:none) Dictyostelium discoideum WD40 repeat p... 397 e-109
(P90587) RecName: Full=66 kDa stress protein; AltName: Full=p66;... 282 2e-74
BT045059_1(BT045059|pid:none) Salmo salar clone ssal-rgf-510-334... 238 2e-61
AK004858_1(AK004858|pid:none) Mus musculus adult male liver cDNA... 238 3e-61
(O93277) RecName: Full=WD repeat-containing protein 1; AltName: ... 238 3e-61
AK004644_1(AK004644|pid:none) Mus musculus adult male lung cDNA,... 237 6e-61
BC049117_1(BC049117|pid:none) Mus musculus, WD repeat domain 1, ... 237 7e-61
AY394939_1(AY394939|pid:none) Danio rerio clone RK046A1C09 WD re... 236 9e-61
(Q5RKI0) RecName: Full=WD repeat-containing protein 1; &AY98648... 236 9e-61
(O75083) RecName: Full=WD repeat-containing protein 1; AltName: ... 236 9e-61
protein update 2009. 5.14
PSORT

psg: 0.90 gvh: 0.37 alm: 0.35 top: 0.63 tms: 0.00 mit: 0.34 mip: 0.06
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

48.0 %: cytoplasmic
20.0 %: mitochondrial
20.0 %: nuclear
8.0 %: cytoskeletal
4.0 %: plasma membrane

>> prediction for CFG102 is cyt

5' end seq. ID CFG102F
5' end seq.
>CFG102F.Seq
GTATTTATATATTTATTTATTTATATAAATAAACTACACATTATAATGTCTGTAACTTTA
AAAAATATTATTGCACCAACACCAGCAACTACTCGTGGTAAATCAGTAGCAATTAATGGT
GACCCAAAGGGTGAAAATATTGTATATGCAAGTGGTAGCAGTATTATTATTAGAAATGTA
AAGAATCCAATGGTAGCAGATATTTACTATGAACATCCATGTCAAACCACTGTTGCAAAG
TATGCACCAAGTGGTAATTATATTGCAAGTGGTGATGTTCAAGGTAATTTACGTATTTGG
GATACATTACAAAAGGAACATATCTTAAAAGCAACTTACAAAGTATTGAATGGTGCAATC
CTTGATATTGCATGGACATCAGATAATCAACGTTTAGTTGTAGTTGGTGATGGTAAAGAG
AGATTTGGTGCAGCCATCCTTTGGGATAGTGGTTCATCATGTGGTGAAATCACTGGTCAC
TCAAAGATGATTCTCTCTTGTGACATTAAATCAACTCGTCCATTCAGAGCCGCCACTGGT
AGTGAAGATTTTGCAGTCAATTGGTTCGAAGGTCCACCATTCAAATTCCAAAAGAATATT
GCCGCCGGTGATTTCACTCGTTTCGT----------
Length of 5' end seq. 626
3' end seq. ID CFG102Z
3' end seq.
>CFG102Z.Seq
----------ATCTCTAAAAAAACCTATGGTGAAAGTATTGGTGTAGATAGTCCAGCTCA
AGCTGTTGCTTTCTCTGGTGATGTTGTCGTTGCTGTCTCAATGAAAACCATTTATGTCAT
TAAAGGTGGTAAAATCGTTTCACAAACTGCTGCCACTTGGGAACCAACCTCTGTTGCCAT
CAATGATACTGAAGTTTCAGTCGGTGGTAAAGATAACAAGATTCACGTTTTCACTCTCAG
TGGTAACAATTTAACTGCCAGTCATACTTTAGATAATCATCGTGGTGCTATTACCGATCT
TTCATACTCACCATGTGGTAAATATTTAGCTTCAGGTTGTTCAAATCGTGAAGTTATCGT
TTGGAGTGGTAAAGAAGCCAAATCTAAAGGTTGGGTTAACCATACCGCTCGTATTAATGC
TGTCGCTTGGTCTAATGATTCTAAATTTGTTGCCTCTGCTTCACTCGATTCTCAAATTTA
CATTTGGAATGTTGAAAATCCAACCGCTTCACCAGTTCAAGTTAAAAACTCTCATTTAGG
TGGTGTCAATGATGTTATTTATGGTTCAAACAATAAATTTTCTCTGCAGGTAANAAGGGC
NATAAAATTNGAATGTATCAAATAAAAAAAAATTT
Length of 3' end seq. 625
Connected seq. ID CFG102P
Connected seq.
>CFG102P.Seq
GTATTTATATATTTATTTATTTATATAAATAAACTACACATTATAATGTCTGTAACTTTA
AAAAATATTATTGCACCAACACCAGCAACTACTCGTGGTAAATCAGTAGCAATTAATGGT
GACCCAAAGGGTGAAAATATTGTATATGCAAGTGGTAGCAGTATTATTATTAGAAATGTA
AAGAATCCAATGGTAGCAGATATTTACTATGAACATCCATGTCAAACCACTGTTGCAAAG
TATGCACCAAGTGGTAATTATATTGCAAGTGGTGATGTTCAAGGTAATTTACGTATTTGG
GATACATTACAAAAGGAACATATCTTAAAAGCAACTTACAAAGTATTGAATGGTGCAATC
CTTGATATTGCATGGACATCAGATAATCAACGTTTAGTTGTAGTTGGTGATGGTAAAGAG
AGATTTGGTGCAGCCATCCTTTGGGATAGTGGTTCATCATGTGGTGAAATCACTGGTCAC
TCAAAGATGATTCTCTCTTGTGACATTAAATCAACTCGTCCATTCAGAGCCGCCACTGGT
AGTGAAGATTTTGCAGTCAATTGGTTCGAAGGTCCACCATTCAAATTCCAAAAGAATATT
GCCGCCGGTGATTTCACTCGTTTCGT----------ATCTCTAAAAAAACCTATGGTGAA
AGTATTGGTGTAGATAGTCCAGCTCAAGCTGTTGCTTTCTCTGGTGATGTTGTCGTTGCT
GTCTCAATGAAAACCATTTATGTCATTAAAGGTGGTAAAATCGTTTCACAAACTGCTGCC
ACTTGGGAACCAACCTCTGTTGCCATCAATGATACTGAAGTTTCAGTCGGTGGTAAAGAT
AACAAGATTCACGTTTTCACTCTCAGTGGTAACAATTTAACTGCCAGTCATACTTTAGAT
AATCATCGTGGTGCTATTACCGATCTTTCATACTCACCATGTGGTAAATATTTAGCTTCA
GGTTGTTCAAATCGTGAAGTTATCGTTTGGAGTGGTAAAGAAGCCAAATCTAAAGGTTGG
GTTAACCATACCGCTCGTATTAATGCTGTCGCTTGGTCTAATGATTCTAAATTTGTTGCC
TCTGCTTCACTCGATTCTCAAATTTACATTTGGAATGTTGAAAATCCAACCGCTTCACCA
GTTCAAGTTAAAAACTCTCATTTAGGTGGTGTCAATGATGTTATTTATGGTTCAAACAAT
AAATTTTCTCTGCAGGTAANAAGGGCNATAAAATTNGAATGTATCAAATAAAAAAAAATT
T
Length of connected seq. 1251
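
The reported lengths are consistent with counting bases only and ignoring the ten `-` placeholders that mark the unsequenced gap between the two reads: 626 + 625 = 1251. A minimal sketch of that arithmetic follows; the helper name `ungapped_length` is hypothetical, and the 3' read length is taken as stated in the record rather than recomputed.

```python
def ungapped_length(seq):
    """Count sequence characters, ignoring '-' gap placeholders and whitespace."""
    return sum(1 for ch in seq if ch not in "-\n\r\t ")


# The 5' read is shown as ten full lines of 60 bases plus this final line.
last_5p_line = "GCCGCCGGTGATTTCACTCGTTTCGT----------"
len_5p = 10 * 60 + ungapped_length(last_5p_line)

# Length of the 3' read as stated in the record.
len_3p = 625

print(len_5p)           # 626
print(len_5p + len_3p)  # 1251
```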
Full length Seq ID -
Full length Seq. -
Length of full length seq. -