CFG782
Library CF
(Link to library)
Clone ID CFG782
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16444-1
Original site URL
Representative seq. ID CFG782P
(Link to Original site)
Representative DNA sequence
>CFG782 (CFG782Q) /CSM/CF/CFG7-D/CFG782Q.Seq.d/
ATTTTTTTATTTTAAACAAAGAGTGGATATTTTCTAAAAAAAATAAAAATAAAAATGGAA
ATGGAAGGATTTGATCATGTTACATTTTGGGTTGGTAATGCATTACAAGCAGCAACTTAT
TACATTGCTAGATTTGGATTTCAAAATTTAGCTTATAGTGGTTTAGAAACTGGTAATCGT
CAATTCGCCACTCATGTTATCCATCAAAACAACATTATTATGGCTTTTACATCACCATTA
ACTGGTGACAACAAAGATTATGCAGACCACATGATGAGACATGGTGATGGTGTCAAAGAT
ATTGCTTTCAACGTTAAAGATGTACAACACATTTATGATGAAGCAGTTAAAGCAGGAGCT
CAATCAGTTAAAGAACCACATCAAATTAAAGACGAACATGGTATTGTTACATTAGCAACT
ATCATGAGTCCATATGGTGAAACTACACATACTTTTGTTGATAGATCTCAATATAAAGGT
GCATTCTTACCAGGTTTCACATACAAGGTCGCTTCAGATCCATTATCAAATATCACCGAA
XXXXXXXXXXTAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGTC
GTTCAATGGTACGAAAAAGGTTTTACCAATTCCCACCGTTTCTGGTCAGTTGATGACAAA
ACCATTCATACCGAATATTCATCATTAAGATCAGTCGTAGTTGCTGATAAGTCTGAAAAA
GTTAAATTACCAATTAATGAACCAGCCAATGGTATTAGAAAGAGTCAAATTCAAGAATAC
GTAGATTTCTACAATGGTGCTGGTGTTCAACATATCGCCTTAAAGACTGATAACATCATT
GATGCTATCTCAAAATTAAGATCTCGTGGTGTCTCTTTCCTCACTGTTCCAAAAACATAC
TATACATCACTCAGAGAGAAATTACAACACTCTTCATTAGAAATTAAAGAAGATTTGGAC
ACTTTAGAGAAATTACACATTTTAATCGATTATGATGACAAAGGTTATCTTTTACAAATC
TTTACAAATAATGTTGAAGATAAACCAACTGTTTTCTTTGAAATTATCCAAAGAAACAAC
CATGATGGTTTCGGTGCTGGTAACTTTAAATCCCTCTTTGAAGCAATCGAAAGACAACAA
GAAACTCGTGGAAACTTATAGGTGTGTCTCAATTACAAATGTTCTTTATTATAACTAAAT
AACATTACAATATATCAAAATAAAAATAA
sequence update 2001. 6. 1
Translated Amino Acid sequence
iflf*TKSGYFLKKIKIKMEMEGFDHVTFWVGNALQAATYYIARFGFQNLAYSGLETGNR
QFATHVIHQNNIIMAFTSPLTGDNKDYADHMMRHGDGVKDIAFNVKDVQHIYDEAVKAGA
QSVKEPHQIKDEHGIVTLATIMSPYGETTHTFVDRSQYKGAFLPGFTYKVASDPLSNITE
---

---IDHVVSNHADKMMEPVVQWYEKGFTNSHRFWSVDDKTIHTEYSSLRSVVVADKSEKV
KLPINEPANGIRKSQIQEYVDFYNGAGVQHIALKTDNIIDAISKLRSRGVSFLTVPKTYY
TSLREKLQHSSLEIKEDLDTLEKLHILIDYDDKGYLLQIFTNNVEDKPTVFFEIIQRNNH
DGFGAGNFKSLFEAIERQQETRGNL*vclnykcsll*lnnitiyqnkn


Translated Amino Acid sequence (All Frames)
Frame A:
iflf*TKSGYFLKKIKIKMEMEGFDHVTFWVGNALQAATYYIARFGFQNLAYSGLETGNR
QFATHVIHQNNIIMAFTSPLTGDNKDYADHMMRHGDGVKDIAFNVKDVQHIYDEAVKAGA
QSVKEPHQIKDEHGIVTLATIMSPYGETTHTFVDRSQYKGAFLPGFTYKVASDPLSNITE
---

---**imsfqimqik*wnqsfngtkkvlpiptvsgqlmtkpfipnihh*dqs*llislkk
lnyqlmnqpmvlervkfknt*istmvlvfnisp*rlitslmlsqn*dlvvslsslfqkht
ihhsernyntlh*klkkiwtl*rnytf*simmtkvifykslqimlkinqlfslklskett
mmvsvlvtlnpslkqskdnkklvetyrcvsitnvlyyn*itlqyikiki

Frame B:
ffyfkqrvdif*kk*k*kwkwkdlimlhfglvmhykqqlitlldldfki*livv*klviv
nsplmlsikttllwllhhh*lvttkimqtt**dmvmvskillstlkmyntfmmkqlkqel
nqlknhiklktnmvllh*qls*vhmvklhilllidlnikvhsyqvshtrslqihyqisp-
--

---nrscrfkscr*ndgtsrsmvrkrfyqfppflvs**qnhsyrifiikisrsc**v*ks
*itn**tsqwy*kesnsrirrflqwcwcstyrlkd**hh*cylkikiswclfphcsknil
yitqreittlfirn*rrfghfreithfnrl**qrlsftnlyk*c*r*tncfl*nypkkqp
*wfrcw*l*ipl*snrkttrnswkligvsqlqmffiitk*hynisk*k*

Frame C:
ffilnkewifskknknkngngri*scyilgw*citssnllhc*iwiskfsl*wfrnw*ss
irhscypskqhyygfyitinw*qqrlcrphdetw*wcqrycfqr*rctthl**ss*srss
is*rttsn*rrtwycyisnyhesiw*nytyfc**isi*rciltrfhiqgrfrsiikyhr-
--

---IDHVVSNHADKMMEPVVQWYEKGFTNSHRFWSVDDKTIHTEYSSLRSVVVADKSEKV
KLPINEPANGIRKSQIQEYVDFYNGAGVQHIALKTDNIIDAISKLRSRGVSFLTVPKTYY
TSLREKLQHSSLEIKEDLDTLEKLHILIDYDDKGYLLQIFTNNVEDKPTVFFEIIQRNNH
DGFGAGNFKSLFEAIERQQETRGNL*vclnykcsll*lnnitiyqnkn

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

CFG782 (CFG782Q) /CSM/CF/CFG7-D/CFG782Q.Seq.d/ 2268 0.0
VFO488 (VFO488Q) /CSM/VF/VFO4-D/VFO488Q.Seq.d/ 1298 0.0
VFA816 (VFA816Q) /CSM/VF/VFA8-A/VFA816Q.Seq.d/ 1298 0.0
SFC722 (SFC722Q) /CSM/SF/SFC7-A/SFC722Q.Seq.d/ 1298 0.0
CFG395 (CFG395Q) /CSM/CF/CFG3-D/CFG395Q.Seq.d/ 1298 0.0
CFB319 (CFB319Q) /CSM/CF/CFB3-A/CFB319Q.Seq.d/ 1298 0.0
AFJ180 (AFJ180Q) /CSM/AF/AFJ1-D/AFJ180Q.Seq.d/ 1291 0.0
VFK407 (VFK407Q) /CSM/VF/VFK4-A/VFK407Q.Seq.d/ 1289 0.0
CFI440 (CFI440Q) /CSM/CF/CFI4-B/CFI440Q.Seq.d/ 1287 0.0
VFL704 (VFL704Q) /CSM/VF/VFL7-A/VFL704Q.Seq.d/ 1285 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

AC116978|AC116978.1 Dictyostelium discoideum chromosome 2 map 4846697-4874480 strain AX4, *** SEQUENCING IN PROGRESS ***. 1110 0.0 5
AC117076|AC117076.2 Dictyostelium discoideum chromosome 2 map 5862124-6045772 strain AX4, complete sequence. 1110 0.0 15
BI322343|BI322343.1 kx19h02.y3 Parastrongyloides trichosuri FL pAMP1 v1 Chiapelli McCarter Parastrongyloides trichosuri cDNA 5' similar to SW:HPPD_CAEEL Q22633 4-HYDROXYPHENYLPYRUVATE DIOXYGENASE ;, mRNA sequence. 42 3e-09 4
BI502176|BI502176.1 kt86c08.y1 Strongyloides ratti L2 pAMP1 v1 Chiapelli McCarter Strongyloides ratti cDNA 5' similar to SW:HPPD_CAEEL Q22633 4-HYDROXYPHENYLPYRUVATE DIOXYGENASE ;, mRNA sequence. 36 9e-09 5
L38493|L38493.1 Coccidioides immitis T-cell reactive protein (trcP) gene exons 1-4, complete cds. 46 3e-06 3
BM395514|BM395514.1 50072-2-9-E12.r.1 Chilcoat/Turkewitz cDNA (large fraction) Tetrahymena thermophila cDNA, mRNA sequence. 64 5e-06 1
M59429|M59429.1 T. thermophila F-antigen (tfa) gene, complete cds. 46 3e-05 5
AC009650|AC009650.3 Homo sapiens chromosome 11 clone RP11-360I13 map 11, WORKING DRAFT SEQUENCE, 11 unordered pieces. 36 6e-05 10
AC116988|AC116988.2 Dictyostelium discoideum chromosome 2 map 6445720-6776760 strain AX4, complete sequence. 34 3e-04 17
AX417724|AX417724.1 Sequence 15 from Patent WO0231173. 44 0.002 3
dna update 2004. 1.30
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(Q76NV5) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 713 0.0
AE014296_3563(AE014296|pid:none) Drosophila melanogaster chromos... 426 e-117
BC077167_1(BC077167|pid:none) Danio rerio 4-hydroxyphenylpyruvat... 424 e-117
BC153801_1(BC153801|pid:none) Xenopus laevis hypothetical protei... 415 e-114
S32821(S32821;S35890;S35889)4-hydroxyphenylpyruvate dioxygenase ... 414 e-114
(Q5EA20) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 409 e-112
(Q6TGZ5) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 407 e-112
BC046075_1(BC046075|pid:none) Danio rerio zgc:56326, mRNA (cDNA ... 404 e-111
(P32754) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 404 e-111
AK149416_1(AK149416|pid:none) Mus musculus adult male liver tumo... 403 e-111
protein update 2009. 5.14
PSORT

psg: 0.64 gvh: 0.43 alm: 0.47 top: 0.53 tms: 0.00 mit: 0.19 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

40.0 %: nuclear
36.0 %: cytoplasmic
12.0 %: mitochondrial
8.0 %: vacuolar
4.0 %: cytoskeletal

>> prediction for CFG782 is nuc

5' end seq. ID CFG782F
5' end seq.
>CFG782F.Seq
ATTTTTTTATTTTAAACAAAGAGTGGATATTTTCTAAAAAAAATAAAAATAAAAATGGAA
ATGGAAGGATTTGATCATGTTACATTTTGGGTTGGTAATGCATTACAAGCAGCAACTTAT
TACATTGCTAGATTTGGATTTCAAAATTTAGCTTATAGTGGTTTAGAAACTGGTAATCGT
CAATTCGCCACTCATGTTATCCATCAAAACAACATTATTATGGCTTTTACATCACCATTA
ACTGGTGACAACAAAGATTATGCAGACCACATGATGAGACATGGTGATGGTGTCAAAGAT
ATTGCTTTCAACGTTAAAGATGTACAACACATTTATGATGAAGCAGTTAAAGCAGGAGCT
CAATCAGTTAAAGAACCACATCAAATTAAAGACGAACATGGTATTGTTACATTAGCAACT
ATCATGAGTCCATATGGTGAAACTACACATACTTTTGTTGATAGATCTCAATATAAAGGT
GCATTCTTACCAGGTTTCACATACAAGGTCGCTTCAGATCCATTATCAAATATCACCGAA
----------
Length of 5' end seq. 540
3' end seq. ID CFG782Z
3' end seq.
>CFG782Z.Seq
----------TAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGTC
GTTCAATGGTACGAAAAAGGTTTTACCAATTCCCACCGTTTCTGGTCAGTTGATGACAAA
ACCATTCATACCGAATATTCATCATTAAGATCAGTCGTAGTTGCTGATAAGTCTGAAAAA
GTTAAATTACCAATTAATGAACCAGCCAATGGTATTAGAAAGAGTCAAATTCAAGAATAC
GTAGATTTCTACAATGGTGCTGGTGTTCAACATATCGCCTTAAAGACTGATAACATCATT
GATGCTATCTCAAAATTAAGATCTCGTGGTGTCTCTTTCCTCACTGTTCCAAAAACATAC
TATACATCACTCAGAGAGAAATTACAACACTCTTCATTAGAAATTAAAGAAGATTTGGAC
ACTTTAGAGAAATTACACATTTTAATCGATTATGATGACAAAGGTTATCTTTTACAAATC
TTTACAAATAATGTTGAAGATAAACCAACTGTTTTCTTTGAAATTATCCAAAGAAACAAC
CATGATGGTTTCGGTGCTGGTAACTTTAAATCCCTCTTTGAAGCAATCGAAAGACAACAA
GAAACTCGTGGAAACTTATAGGTGTGTCTCAATTACAAATGTTCTTTATTATAACTAAAT
AACATTACAATATATCAAAATAAAAATAA
Length of 3' end seq. 679
Connected seq. ID CFG782P
Connected seq.
>CFG782P.Seq
ATTTTTTTATTTTAAACAAAGAGTGGATATTTTCTAAAAAAAATAAAAATAAAAATGGAA
ATGGAAGGATTTGATCATGTTACATTTTGGGTTGGTAATGCATTACAAGCAGCAACTTAT
TACATTGCTAGATTTGGATTTCAAAATTTAGCTTATAGTGGTTTAGAAACTGGTAATCGT
CAATTCGCCACTCATGTTATCCATCAAAACAACATTATTATGGCTTTTACATCACCATTA
ACTGGTGACAACAAAGATTATGCAGACCACATGATGAGACATGGTGATGGTGTCAAAGAT
ATTGCTTTCAACGTTAAAGATGTACAACACATTTATGATGAAGCAGTTAAAGCAGGAGCT
CAATCAGTTAAAGAACCACATCAAATTAAAGACGAACATGGTATTGTTACATTAGCAACT
ATCATGAGTCCATATGGTGAAACTACACATACTTTTGTTGATAGATCTCAATATAAAGGT
GCATTCTTACCAGGTTTCACATACAAGGTCGCTTCAGATCCATTATCAAATATCACCGAA
----------TAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGTC
GTTCAATGGTACGAAAAAGGTTTTACCAATTCCCACCGTTTCTGGTCAGTTGATGACAAA
ACCATTCATACCGAATATTCATCATTAAGATCAGTCGTAGTTGCTGATAAGTCTGAAAAA
GTTAAATTACCAATTAATGAACCAGCCAATGGTATTAGAAAGAGTCAAATTCAAGAATAC
GTAGATTTCTACAATGGTGCTGGTGTTCAACATATCGCCTTAAAGACTGATAACATCATT
GATGCTATCTCAAAATTAAGATCTCGTGGTGTCTCTTTCCTCACTGTTCCAAAAACATAC
TATACATCACTCAGAGAGAAATTACAACACTCTTCATTAGAAATTAAAGAAGATTTGGAC
ACTTTAGAGAAATTACACATTTTAATCGATTATGATGACAAAGGTTATCTTTTACAAATC
TTTACAAATAATGTTGAAGATAAACCAACTGTTTTCTTTGAAATTATCCAAAGAAACAAC
CATGATGGTTTCGGTGCTGGTAACTTTAAATCCCTCTTTGAAGCAATCGAAAGACAACAA
GAAACTCGTGGAAACTTATAGGTGTGTCTCAATTACAAATGTTCTTTATTATAACTAAAT
AACATTACAATATATCAAAATAAAAATAA
Length of connected seq. 1219
Full length Seq ID -
Full length Seq. -
Length of full length seq. -