CFG634
Library CF
(Link to library)
Clone ID CFG634
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16444-1
Original site URL
Representative seq. ID CFG634E
(Link to Original site)
Representative DNA sequence
>CFG634 (CFG634Q) /CSM/CF/CFG6-B/CFG634Q.Seq.d/
AAACAAAGAGTGGATATTTTCTAAAAAAAATAAAAATAAAAATGGAAATGGAAGGATTTG
ATCATGTTACATTTTGGGTTGGTAATGCATTACAAGCAGCAACTTATTACATTGCTAGAT
TTGGATTTCAAAATTTAGCTTATAGTGGTTTAGAAACTGGTAATCGTCAATTCGCCACTC
ATGTTATCCATCAAAACAACATTATTATGGCTTTTACATCACCATTAACTGGTGACAACA
AAGATTATGCAGACCACATGATGAGACATGGTGATGGTGTCAAAGATATTGCTTTCAACG
TTAAAGATGTACAACACATTTATGATGAAGCAGTTAAAGCAGGAGCTCAATCAGTTAAAG
AACCACATCAAATTAAAGACGAACATGGTATTGTTACATTAGCAACTATCATGAGTCCAT
ATGGTGAAACTACACATACTTTTGTTGATAGATCTCAATATAAAGGTGCATTCTTACCAG
GTTTCACATACAAGGGTCGCTTCAGATCCATTATCAAATATCACCGAACCAGTTGGCCTC
AACTTAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGTCGTTCAA
TGGTACGAAAAGGTTTTACAATTCCACCGTTTCTGGTCAGTTGATGACAAAACCATTCAT
ACCGAATATTCATCATTAAGATCAGTCGTAGTTGCTGATAAGTCTGAAAAAGTTAAATTA
CCAATTAATGAACCAGCCAATGGTATTAGAAAGAGTCAAATTCAAGAATACGTAGATTTC
TACAATGGTGCTGGTGTTCAACATATCGCCTTAAAGACTGATAACATCATTGATGCTATC
TCAAAATTAAGATCTCGTGGTGTCTCTTTCCTCACTGTTCCAAAAACATACTATACATCA
CTCAGAGAGAAATTACAACACTCTTCATTAGAAATTAAAGAAGATTTGGACACTTTAGAG
AAATTACACATTTTAATCGATTATGATGACAAAGGTTATCTTTTACAAATCTTTACAAAT
AATGTTGAAGATAAACCAACTGTTTTCTTTGAAATTATCCAAAGAAACAACCATGATGGT
TTCGGTGCTGGTAACTTTAAATCCCTCTTTGAAGCAATCGAAAGACAACAAGAAACTCGT
GGAAACTTATAGGTGTGTCTCAATTACAAATGTTCTTATATAACTAAAACATTCAATNAT
CAAAATA
sequence update 2001. 6. 9
Translated Amino Acid sequence
kqrvdif*kk*k*kwkwkdlimlhfglvmhykqqlitlldldfki*livv*klvivnspl
mlsikttllwllhhh*lvttkimqtt**dmvmvskillstlkmyntfmmkqlkqelnqlk
nhiklktnmvllh*qls*VHMVKLHILLLIDLNIKVHSYQVSHTRVASDPLSNITEPVGL
NLIDHVVSNHADKMMEPVVQWYEKVLQFHRFWSVDDKTIHTEYSSLRSVVVADKSEKVKL
PINEPANGIRKSQIQEYVDFYNGAGVQHIALKTDNIIDAISKLRSRGVSFLTVPKTYYTS
LREKLQHSSLEIKEDLDTLEKLHILIDYDDKGYLLQIFTNNVEDKPTVFFEIIQRNNHDG
FGAGNFKSLFEAIERQQETRGNL*vclnykcsyitktfnxqn


Translated Amino Acid sequence (All Frames)
Frame A:
kqrvdif*kk*k*kwkwkdlimlhfglvmhykqqlitlldldfki*livv*klvivnspl
mlsikttllwllhhh*lvttkimqtt**dmvmvskillstlkmyntfmmkqlkqelnqlk
nhiklktnmvllh*qls*VHMVKLHILLLIDLNIKVHSYQVSHTRVASDPLSNITEPVGL
NLIDHVVSNHADKMMEPVVQWYEKVLQFHRFWSVDDKTIHTEYSSLRSVVVADKSEKVKL
PINEPANGIRKSQIQEYVDFYNGAGVQHIALKTDNIIDAISKLRSRGVSFLTVPKTYYTS
LREKLQHSSLEIKEDLDTLEKLHILIDYDDKGYLLQIFTNNVEDKPTVFFEIIQRNNHDG
FGAGNFKSLFEAIERQQETRGNL*vclnykcsyitktfnxqn


Frame B:
nkewifskknknkngngri*scyilgw*citssnllhc*iwiskfsl*wfrnw*ssirhs
cypskqhyygfyitinw*qqrlcrphdetw*wcqrycfqr*rctthl**ss*srssis*r
ttsn*rrtwycyisnyhesiw*nytyfc**isi*rciltrfhiqgslqihyqispnqlas
t**imsfqimqik*wnqsfngtkrfynstvsgqlmtkpfipnihh*dqs*llislkklny
qlmnqpmvlervkfknt*istmvlvfnisp*rlitslmlsqn*dlvvslsslfqkhtihh
sernyntlh*klkkiwtl*rnytf*simmtkvifykslqimlkinqlfslklskettmmv
svlvtlnpslkqskdnkklvetyrcvsitnvli*lkhsxiki


Frame C:
tksgyflkkikikmemegfdhvtfwvgnalqaatyyiarfgfqnlaysgletgnrqfath
vihqnniimaftspltgdnkdyadhmmrhgdgvkdiafnvkdvqhiydeavkagaqsvke
phqikdehgivtlatimspygetthtfvdrsqykgaflpgftykgrfrsiikyhrtswpq
lnrscrfkscr*ndgtsrsmvrkgftippflvs**qnhsyrifiikisrsc**v*ks*it
n**tsqwy*kesnsrirrflqwcwcstyrlkd**hh*cylkikiswclfphcsknilyit
qreittlfirn*rrfghfreithfnrl**qrlsftnlyk*c*r*tncfl*nypkkqp*wf
rcw*l*ipl*snrkttrnswkligvsqlqmflyn*niqxsk


Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

CFG634 (CFG634Q) /CSM/CF/CFG6-B/CFG634Q.Seq.d/ 2303 0.0
CFG888 (CFG888Q) /CSM/CF/CFG8-D/CFG888Q.Seq.d/ 2242 0.0
VFC717 (VFC717Q) /CSM/VF/VFC7-A/VFC717Q.Seq.d/ 2234 0.0
SFE524 (SFE524Q) /CSM/SF/SFE5-A/SFE524Q.Seq.d/ 2234 0.0
CHD806 (CHD806Q) /CSM/CH/CHD8-A/CHD806Q.Seq.d/ 2234 0.0
CHD784 (CHD784Q) /CSM/CH/CHD7-D/CHD784Q.Seq.d/ 2234 0.0
CHD202 (CHD202Q) /CSM/CH/CHD2-A/CHD202Q.Seq.d/ 2234 0.0
CFJ240 (CFJ240Q) /CSM/CF/CFJ2-B/CFJ240Q.Seq.d/ 2234 0.0
CFF757 (CFF757Q) /CSM/CF/CFF7-C/CFF757Q.Seq.d/ 2234 0.0
CFF524 (CFF524Q) /CSM/CF/CFF5-A/CFF524Q.Seq.d/ 2234 0.0

own update 2002.11.18
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

AC116978|AC116978.1 Dictyostelium discoideum chromosome 2 map 4846697-4874480 strain AX4, *** SEQUENCING IN PROGRESS ***. 1354 0.0 4
AC117076|AC117076.2 Dictyostelium discoideum chromosome 2 map 5862124-6045772 strain AX4, complete sequence. 1354 0.0 11
BI322343|BI322343.1 kx19h02.y3 Parastrongyloides trichosuri FL pAMP1 v1 Chiapelli McCarter Parastrongyloides trichosuri cDNA 5' similar to SW:HPPD_CAEEL Q22633 4-HYDROXYPHENYLPYRUVATE DIOXYGENASE ;, mRNA sequence. 42 3e-09 4
BI502176|BI502176.1 kt86c08.y1 Strongyloides ratti L2 pAMP1 v1 Chiapelli McCarter Strongyloides ratti cDNA 5' similar to SW:HPPD_CAEEL Q22633 4-HYDROXYPHENYLPYRUVATE DIOXYGENASE ;, mRNA sequence. 36 8e-09 5
L38493|L38493.1 Coccidioides immitis T-cell reactive protein (trcP) gene exons 1-4, complete cds. 52 5e-08 3
BM395514|BM395514.1 50072-2-9-E12.r.1 Chilcoat/Turkewitz cDNA (large fraction) Tetrahymena thermophila cDNA, mRNA sequence. 64 5e-06 1
M59429|M59429.1 T. thermophila F-antigen (tfa) gene, complete cds. 46 3e-05 5
AX417724|AX417724.1 Sequence 15 from Patent WO0231173. 44 0.002 3
AX085149|AX085149.1 Sequence 14 from Patent WO0112827. 44 0.002 3
BI863521|BI863521.1 kx45c02.y1 Parastrongyloides trichosuri FL pAMP1 v1 Chiapelli McCarter Parastrongyloides trichosuri cDNA 5' similar to SW:HPPD_CAEEL Q22633 4-HYDROXYPHENYLPYRUVATE DIOXYGENASE ;, mRNA sequence. 40 0.011 2
dna update 2003.12.19
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(Q76NV5) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 438 0.0
AE014296_3563(AE014296|pid:none) Drosophila melanogaster chromos... 293 e-120
BC077167_1(BC077167|pid:none) Danio rerio 4-hydroxyphenylpyruvat... 281 e-116
BC153801_1(BC153801|pid:none) Xenopus laevis hypothetical protei... 282 e-115
S32821(S32821;S35890;S35889)4-hydroxyphenylpyruvate dioxygenase ... 288 e-115
(Q5EA20) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 279 e-113
(Q6TGZ5) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 285 e-112
(P32754) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 281 e-112
(P32755) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 274 e-111
BC046075_1(BC046075|pid:none) Danio rerio zgc:56326, mRNA (cDNA ... 285 e-111
protein update 2009. 7. 1
PSORT

psg: 0.81 gvh: 0.57 alm: 0.40 top: 0.53 tms: 0.00 mit: 0.26 mip: 0.06
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

40.0 %: nuclear
32.0 %: cytoplasmic
16.0 %: mitochondrial
8.0 %: cytoskeletal
4.0 %: vacuolar

>> prediction for CFG634 is nuc

5' end seq. ID CFG634F
5' end seq.
>CFG634F.Seq
AAACAAAGAGTGGATATTTTCTAAAAAAAATAAAAATAAAAATGGAAATGGAAGGATTTG
ATCATGTTACATTTTGGGTTGGTAATGCATTACAAGCAGCAACTTATTACATTGCTAGAT
TTGGATTTCAAAATTTAGCTTATAGTGGTTTAGAAACTGGTAATCGTCAATTCGCCACTC
ATGTTATCCATCAAAACAACATTATTATGGCTTTTACATCACCATTAACTGGTGACAACA
AAGATTATGCAGACCACATGATGAGACATGGTGATGGTGTCAAAGATATTGCTTTCAACG
TTAAAGATGTACAACACATTTATGATGAAGCAGTTAAAGCAGGAGCTCAATCAGTTAAAG
AACCACATCAAATTAAAGACGAACATGGTATTGTTACATTAGCAACTATCATGAGTCCAT
ATGGTGAAACTACACATACTTTTGTTGATAGATCTCAATATAAAGGTGCATTCTTACCAG
GTTTCACATACAAGGTCGCTTCAGATCCATTATCAAATATCACCGAACCAGTTGGCCTCA
ACTTAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACC----------
Length of 5' end seq. 589
3' end seq. ID CFG634Z
3' end seq.
>CFG634Z.Seq
----------TCACATACAAGGGTCGCTTCAGATCCATTATCAAATATCACCGAACCAGT
TGGCCTCAACTTAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGT
CGTTCAATGGTACGAAAAGGTTTTACAATTCCACCGTTTCTGGTCAGTTGATGACAAAAC
CATTCATACCGAATATTCATCATTAAGATCAGTCGTAGTTGCTGATAAGTCTGAAAAAGT
TAAATTACCAATTAATGAACCAGCCAATGGTATTAGAAAGAGTCAAATTCAAGAATACGT
AGATTTCTACAATGGTGCTGGTGTTCAACATATCGCCTTAAAGACTGATAACATCATTGA
TGCTATCTCAAAATTAAGATCTCGTGGTGTCTCTTTCCTCACTGTTCCAAAAACATACTA
TACATCACTCAGAGAGAAATTACAACACTCTTCATTAGAAATTAAAGAAGATTTGGACAC
TTTAGAGAAATTACACATTTTAATCGATTATGATGACAAAGGTTATCTTTTACAAATCTT
TACAAATAATGTTGAAGATAAACCAACTGTTTTCTTTGAAATTATCCAAAGAAACAACCA
TGATGGTTTCGGTGCTGGTAACTTTAAATCCCTCTTTGAAGCAATCGAAAGACAACAAGA
AACTCGTGGAAACTTATAGGTGTGTCTCAATTACAAATGTTCTTATATAACTAAAACATT
CAATNATCAAAATA
Length of 3' end seq. 724
Connected seq. ID CFG634P
Connected seq.
>CFG634P.Seq
AAACAAAGAGTGGATATTTTCTAAAAAAAATAAAAATAAAAATGGAAATGGAAGGATTTG
ATCATGTTACATTTTGGGTTGGTAATGCATTACAAGCAGCAACTTATTACATTGCTAGAT
TTGGATTTCAAAATTTAGCTTATAGTGGTTTAGAAACTGGTAATCGTCAATTCGCCACTC
ATGTTATCCATCAAAACAACATTATTATGGCTTTTACATCACCATTAACTGGTGACAACA
AAGATTATGCAGACCACATGATGAGACATGGTGATGGTGTCAAAGATATTGCTTTCAACG
TTAAAGATGTACAACACATTTATGATGAAGCAGTTAAAGCAGGAGCTCAATCAGTTAAAG
AACCACATCAAATTAAAGACGAACATGGTATTGTTACATTAGCAACTATCATGAGTCCAT
ATGGTGAAACTACACATACTTTTGTTGATAGATCTCAATATAAAGGTGCATTCTTACCAG
GTTTCACATACAAGGTCGCTTCAGATCCATTATCAAATATCACCGAACCAGTTGGCCTCA
ACTTAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACC----------T
CACATACAAGGGTCGCTTCAGATCCATTATCAAATATCACCGAACCAGTTGGCCTCAACT
TAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGTCGTTCAATGGT
ACGAAAAGGTTTTACAATTCCACCGTTTCTGGTCAGTTGATGACAAAACCATTCATACCG
AATATTCATCATTAAGATCAGTCGTAGTTGCTGATAAGTCTGAAAAAGTTAAATTACCAA
TTAATGAACCAGCCAATGGTATTAGAAAGAGTCAAATTCAAGAATACGTAGATTTCTACA
ATGGTGCTGGTGTTCAACATATCGCCTTAAAGACTGATAACATCATTGATGCTATCTCAA
AATTAAGATCTCGTGGTGTCTCTTTCCTCACTGTTCCAAAAACATACTATACATCACTCA
GAGAGAAATTACAACACTCTTCATTAGAAATTAAAGAAGATTTGGACACTTTAGAGAAAT
TACACATTTTAATCGATTATGATGACAAAGGTTATCTTTTACAAATCTTTACAAATAATG
TTGAAGATAAACCAACTGTTTTCTTTGAAATTATCCAAAGAAACAACCATGATGGTTTCG
GTGCTGGTAACTTTAAATCCCTCTTTGAAGCAATCGAAAGACAACAAGAAACTCGTGGAA
ACTTATAGGTGTGTCTCAATTACAAATGTTCTTATATAACTAAAACATTCAATNATCAAA
ATA
Length of connected seq. 1313
Full length Seq ID CFG634E
Full length Seq.
>CFG634E.Seq
AAACAAAGAGTGGATATTTTCTAAAAAAAATAAAAATAAAAATGGAAATGGAAGGATTTG
ATCATGTTACATTTTGGGTTGGTAATGCATTACAAGCAGCAACTTATTACATTGCTAGAT
TTGGATTTCAAAATTTAGCTTATAGTGGTTTAGAAACTGGTAATCGTCAATTCGCCACTC
ATGTTATCCATCAAAACAACATTATTATGGCTTTTACATCACCATTAACTGGTGACAACA
AAGATTATGCAGACCACATGATGAGACATGGTGATGGTGTCAAAGATATTGCTTTCAACG
TTAAAGATGTACAACACATTTATGATGAAGCAGTTAAAGCAGGAGCTCAATCAGTTAAAG
AACCACATCAAATTAAAGACGAACATGGTATTGTTACATTAGCAACTATCATGAGTCCAT
ATGGTGAAACTACACATACTTTTGTTGATAGATCTCAATATAAAGGTGCATTCTTACCAG
GTTTCACATACAAGGGTCGCTTCAGATCCATTATCAAATATCACCGAACCAGTTGGCCTC
AACTTAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGTCGTTCAA
TGGTACGAAAAGGTTTTACAATTCCACCGTTTCTGGTCAGTTGATGACAAAACCATTCAT
ACCGAATATTCATCATTAAGATCAGTCGTAGTTGCTGATAAGTCTGAAAAAGTTAAATTA
CCAATTAATGAACCAGCCAATGGTATTAGAAAGAGTCAAATTCAAGAATACGTAGATTTC
TACAATGGTGCTGGTGTTCAACATATCGCCTTAAAGACTGATAACATCATTGATGCTATC
TCAAAATTAAGATCTCGTGGTGTCTCTTTCCTCACTGTTCCAAAAACATACTATACATCA
CTCAGAGAGAAATTACAACACTCTTCATTAGAAATTAAAGAAGATTTGGACACTTTAGAG
AAATTACACATTTTAATCGATTATGATGACAAAGGTTATCTTTTACAAATCTTTACAAAT
AATGTTGAAGATAAACCAACTGTTTTCTTTGAAATTATCCAAAGAAACAACCATGATGGT
TTCGGTGCTGGTAACTTTAAATCCCTCTTTGAAGCAATCGAAAGACAACAAGAAACTCGT
GGAAACTTATAGGTGTGTCTCAATTACAAATGTTCTTATATAACTAAAACATTCAATNAT
CAAAATA
Length of full length seq. 1207