CHD806
Library CH
(Link to library)
Clone ID CHD806
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16444-1
Original site URL
Representative seq. ID CHD806E
(Link to Original site)
Representative DNA sequence
>CHD806 (CHD806Q) /CSM/CH/CHD8-A/CHD806Q.Seq.d/
AAAAAGAGTGGATATTTTCTAAAAAAAATAAAAATAAAAATGGAAATGGAAGGATTTGAT
CATGTTACATTTTGGGTTGGTAATGCATTACAAGCAGCAACTTATTACATTGCTAGATTT
GGATTTCAAAATTTAGCTTATAGTGGTTTAGAAACTGGTAATCGTCAATTCGCCACTCAT
GTTATCCATCAAAACAACATTATTATGGCTTTTACATCACCATTAACTGGTGACAACAAA
GATTATGCAGACCACATGATGAGACATGGTGATGGTGTCAAAGATATTGCTTTCAACGTT
AAAGATGTACAACACATTTATGATGAAGCAGTTAAAGCAGGAGCTCAATCAGTTAAAGAA
CCACATCAAATTAAAGACGAACATGGTATTGTTACATTAGCAACTATCATGAGTCCATAT
GGTGAAACTACACATACTTTTGTTGATAGATCTCAATATAAAGGTGCATTCTTACCAGGT
TTCACATACAAGGTCGCTTCAGATCCATTATCAAATATCACCGAACCAGTTGGCCTCAAC
TTAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGTCGTTCAATGG
TACGAAAAGGTTTTACAATTCCACCGTTTCTGGTCAGTTGATGACAAAACCATTCATACC
GAATATTCATCATTAAGATCAGTCGTAGTTGCTGATAAGTCTGAAAAAGTTAAATTACCA
ATTAATGAACCAGCCAATGGTATTAGAAAGAGTCAAATTCAAGAATACGTAGATTTCTAC
AATGGTGCTGGTGTTCAACATATCGCCTTAAAGACTGATAACATCATTGATGCTATCTCA
AAATTAAGATCTCGTGGTGTCTCTTTCCTCACTGTTCCAAAAACATACTATACATCACTC
AGAGAGAAATTACAACACTCTTCATTAGAAATTAAAGAAGATTTGGACACTTTAGAGAAA
TTACACATTTTAATCGATTATGATGACAAAGGTTATCTTTTACAAATCTTTACAAATAAT
GTTGAAGATAAACCAACTGTTTTCTTTGAAATTATCCAAAGAAACAACCATGATGGTTTC
GGTGCTGGTAACTTTAAATCCCTCTTTGAAGCAATCGAAAGACAACAAGAAACTCGTGGA
AACTTATAGGTGTGTCTCAATTACAAATGTTCTTTATTATTAACTAAATAACATTACAAT
ATANCNAAATAAATAATAAATAAAACTCTATTTAAAAAGTA
sequence update 2002.10.27
Translated Amino Acid sequence
KKSGYFLKKIKIKMEMEGFDHVTFWVGNALQAATYYIARFGFQNLAYSGLETGNRQFATH
VIHQNNIIMAFTSPLTGDNKDYADHMMRHGDGVKDIAFNVKDVQHIYDEAVKAGAQSVKE
PHQIKDEHGIVTLATIMSPYGETTHTFVDRSQYKGAFLPGFTYKVASDPLSNITEPVGLN
LIDHVVSNHADKMMEPVVQWYEKVLQFHRFWSVDDKTIHTEYSSLRSVVVADKSEKVKLP
INEPANGIRKSQIQEYVDFYNGAGVQHIALKTDNIIDAISKLRSRGVSFLTVPKTYYTSL
REKLQHSSLEIKEDLDTLEKLHILIDYDDKGYLLQIFTNNVEDKPTVFFEIIQRNNHDGF
GAGNFKSLFEAIERQQETRGNL*vclnykcsllltk*hynixk*iinktlfkk


Translated Amino Acid sequence (All Frames)
Frame A:
KKSGYFLKKIKIKMEMEGFDHVTFWVGNALQAATYYIARFGFQNLAYSGLETGNRQFATH
VIHQNNIIMAFTSPLTGDNKDYADHMMRHGDGVKDIAFNVKDVQHIYDEAVKAGAQSVKE
PHQIKDEHGIVTLATIMSPYGETTHTFVDRSQYKGAFLPGFTYKVASDPLSNITEPVGLN
LIDHVVSNHADKMMEPVVQWYEKVLQFHRFWSVDDKTIHTEYSSLRSVVVADKSEKVKLP
INEPANGIRKSQIQEYVDFYNGAGVQHIALKTDNIIDAISKLRSRGVSFLTVPKTYYTSL
REKLQHSSLEIKEDLDTLEKLHILIDYDDKGYLLQIFTNNVEDKPTVFFEIIQRNNHDGF
GAGNFKSLFEAIERQQETRGNL*vclnykcsllltk*hynixk*iinktlfkk


Frame B:
krvdif*kk*k*kwkwkdlimlhfglvmhykqqlitlldldfki*livv*klvivnsplm
lsikttllwllhhh*lvttkimqtt**dmvmvskillstlkmyntfmmkqlkqelnqlkn
hiklktnmvllh*qls*vhmvklhilllidlnikvhsyqvshtrslqihyqispnqlast
**imsfqimqik*wnqsfngtkrfynstvsgqlmtkpfipnihh*dqs*llislkklnyq
lmnqpmvlervkfknt*istmvlvfnisp*rlitslmlsqn*dlvvslsslfqkhtihhs
ernyntlh*klkkiwtl*rnytf*simmtkvifykslqimlkinqlfslklskettmmvs
vlvtlnpslkqskdnkklvetyrcvsitnvlyy*lnnitixxnk**iklylks


Frame C:
kewifskknknkngngri*scyilgw*citssnllhc*iwiskfsl*wfrnw*ssirhsc
ypskqhyygfyitinw*qqrlcrphdetw*wcqrycfqr*rctthl**ss*srssis*rt
tsn*rrtwycyisnyhesiw*nytyfc**isi*rciltrfhiqgrfrsiikyhrtswpql
nrscrfkscr*ndgtsrsmvrkgftippflvs**qnhsyrifiikisrsc**v*ks*itn
**tsqwy*kesnsrirrflqwcwcstyrlkd**hh*cylkikiswclfphcsknilyitq
reittlfirn*rrfghfreithfnrl**qrlsftnlyk*c*r*tncfl*nypkkqp*wfr
cw*l*ipl*snrkttrnswkligvsqlqmffiin*itlqyxxinnk*nsi*kv


Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VFC717 (VFC717Q) /CSM/VF/VFC7-A/VFC717Q.Seq.d/ 2268 0.0
SFE524 (SFE524Q) /CSM/SF/SFE5-A/SFE524Q.Seq.d/ 2268 0.0
CHD806 (CHD806Q) /CSM/CH/CHD8-A/CHD806Q.Seq.d/ 2268 0.0
CHD784 (CHD784Q) /CSM/CH/CHD7-D/CHD784Q.Seq.d/ 2268 0.0
CHD202 (CHD202Q) /CSM/CH/CHD2-A/CHD202Q.Seq.d/ 2268 0.0
CFJ240 (CFJ240Q) /CSM/CF/CFJ2-B/CFJ240Q.Seq.d/ 2268 0.0
CFF757 (CFF757Q) /CSM/CF/CFF7-C/CFF757Q.Seq.d/ 2268 0.0
CFF524 (CFF524Q) /CSM/CF/CFF5-A/CFF524Q.Seq.d/ 2268 0.0
CFF369 (CFF369Q) /CSM/CF/CFF3-C/CFF369Q.Seq.d/ 2268 0.0
CFE672 (CFE672Q) /CSM/CF/CFE6-C/CFE672Q.Seq.d/ 2268 0.0

own update 2002.11.21
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

AC117081|AC117081.2 Dictyostelium discoideum chromosome 2 map 5862124-6045772 strain AX4, complete sequence. 2254 0.0 11
BI322343|BI322343.1 kx19h02.y3 Parastrongyloides trichosuri FL pAMP1 v1 Chiapelli McCarter Parastrongyloides trichosuri cDNA 5' similar to SW:HPPD_CAEEL Q22633 4-HYDROXYPHENYLPYRUVATE DIOXYGENASE ;, mRNA sequence. 42 3e-09 4
BI502176|BI502176.1 kt86c08.y1 Strongyloides ratti L2 pAMP1 v1 Chiapelli McCarter Strongyloides ratti cDNA 5' similar to SW:HPPD_CAEEL Q22633 4-HYDROXYPHENYLPYRUVATE DIOXYGENASE ;, mRNA sequence. 36 1e-08 5
CO013595|CO013595.1 EST801930 Coccidioides posadasii spherule cDNA library, 0.4 to 2.3 kb Coccidioides posadasii cDNA clone CIEBZ67 5' end, mRNA sequence. 52 1e-08 3
CO009511|CO009511.1 EST797846 Coccidioides posadasii spherule cDNA library, 0.4 to 2.3 kb Coccidioides posadasii cDNA clone CIEBC24 5' end, mRNA sequence. 52 1e-08 3
L38493|L38493.1 Coccidioides immitis T-cell reactive protein (trcP) gene exons 1-4, complete cds. 52 6e-08 3
CO006156|CO006156.1 EST794491 Coccidioides posadasii spherule cDNA library, 0.4 to 2.3 kb Coccidioides posadasii cDNA clone CIEAT15 3' end, mRNA sequence. 52 1e-07 2
CO005676|CO005676.1 EST794011 Coccidioides posadasii spherule cDNA library, 0.4 to 2.3 kb Coccidioides posadasii cDNA clone CIEAQ39 3' end, mRNA sequence. 52 2e-07 2
CO009510|CO009510.1 EST797845 Coccidioides posadasii spherule cDNA library, 0.4 to 2.3 kb Coccidioides posadasii cDNA clone CIEBC24 3' end, mRNA sequence. 52 2e-07 2
CO005677|CO005677.1 EST794012 Coccidioides posadasii spherule cDNA library, 0.4 to 2.3 kb Coccidioides posadasii cDNA clone CIEAQ39 5' end, mRNA sequence. 52 3e-07 3
dna update 2004. 9.20
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(Q76NV5) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 745 0.0
AE014296_3563(AE014296|pid:none) Drosophila melanogaster chromos... 446 e-124
BC077167_1(BC077167|pid:none) Danio rerio 4-hydroxyphenylpyruvat... 436 e-121
S32821(S32821;S35890;S35889)4-hydroxyphenylpyruvate dioxygenase ... 434 e-120
BC153801_1(BC153801|pid:none) Xenopus laevis hypothetical protei... 430 e-119
(Q5EA20) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 425 e-117
(Q6TGZ5) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 422 e-116
(P32754) RecName: Full=4-hydroxyphenylpyruvate dioxygenase; ... 422 e-116
BT045841_1(BT045841|pid:none) Salmo salar clone ssal-rgf-534-157... 420 e-116
BC046075_1(BC046075|pid:none) Danio rerio zgc:56326, mRNA (cDNA ... 419 e-115
protein update 2009. 4.12
PSORT

psg: 0.64 gvh: 0.43 alm: 0.47 top: 0.53 tms: 0.00 mit: 0.19 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

40.0 %: nuclear
36.0 %: cytoplasmic
12.0 %: mitochondrial
8.0 %: vacuolar
4.0 %: cytoskeletal

>> prediction for CHD806 is nuc

5' end seq. ID CHD806F
5' end seq.
>CHD806F.Seq
AAAAAGAGTGGATATTTTCTAAAAAAAATAAAAATAAAAATGGAAATGGAAGGATTTGAT
CATGTTACATTTTGGGTTGGTAATGCATTACAAGCAGCAACTTATTACATTGCTAGATTT
GGATTTCAAAATTTAGCTTATAGTGGTTTAGAAACTGGTAATCGTCAATTCGCCACTCAT
GTTATCCATCAAAACAACATTATTATGGCTTTTACATCACCATTAACTGGTGACAACAAA
GATTATGCAGACCACATGATGAGACATGGTGATGGTGTCAAAGATATTGCTTTCAACGTT
AAAGATGTACAACACATTTATGATGAAGCAGTTAAAGCAGGAGCTCAATCAGTTAAAGAA
CCACATCAAATTAAAGACGAACATGGTATTGTTACATTAGCAACTATCATGAGTCCATAT
GGTGAAACTACACATACTTTTGTTGATAGATCTCAATATAAAGGTGCATTCTTACCAGGT
TTCACATACAAGGTCGCTTCAGATCCATTATCAAATATCACCGAACCAGTTGGCCTCAAC
TTAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGTCGTTCAATGG
TACNNNNNNNNNN
Length of 5' end seq. 613
3' end seq. ID CHD806Z
3' end seq.
>CHD806Z.Seq
NNNNNNNNNNATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGTCGT
TCAATGGTACGAAAAGGTTTTACAATTCCACCGTTTCTGGTCAGTTGATGACAAAACCAT
TCATACCGAATATTCATCATTAAGATCAGTCGTAGTTGCTGATAAGTCTGAAAAAGTTAA
ATTACCAATTAATGAACCAGCCAATGGTATTAGAAAGAGTCAAATTCAAGAATACGTAGA
TTTCTACAATGGTGCTGGTGTTCAACATATCGCCTTAAAGACTGATAACATCATTGATGC
TATCTCAAAATTAAGATCTCGTGGTGTCTCTTTCCTCACTGTTCCAAAAACATACTATAC
ATCACTCAGAGAGAAATTACAACACTCTTCATTAGAAATTAAAGAAGATTTGGACACTTT
AGAGAAATTACACATTTTAATCGATTATGATGACAAAGGTTATCTTTTACAAATCTTTAC
AAATAATGTTGAAGATAAACCAACTGTTTTCTTTGAAATTATCCAAAGAAACAACCATGA
TGGTTTCGGTGCTGGTAACTTTAAATCCCTCTTTGAAGCAATCGAAAGACAACAAGAAAC
TCGTGGAAACTTATAGGTGTGTCTCAATTACAAATGTTCTTTATTATTAACTAAATAACA
TTACAATATANCNAAATAAATAATAAATAAAACTCTATTTAAAAAGTA
Length of 3' end seq. 708
Connected seq. ID CHD806P
Connected seq.
>CHD806P.Seq
AAAAAGAGTGGATATTTTCTAAAAAAAATAAAAATAAAAATGGAAATGGAAGGATTTGAT
CATGTTACATTTTGGGTTGGTAATGCATTACAAGCAGCAACTTATTACATTGCTAGATTT
GGATTTCAAAATTTAGCTTATAGTGGTTTAGAAACTGGTAATCGTCAATTCGCCACTCAT
GTTATCCATCAAAACAACATTATTATGGCTTTTACATCACCATTAACTGGTGACAACAAA
GATTATGCAGACCACATGATGAGACATGGTGATGGTGTCAAAGATATTGCTTTCAACGTT
AAAGATGTACAACACATTTATGATGAAGCAGTTAAAGCAGGAGCTCAATCAGTTAAAGAA
CCACATCAAATTAAAGACGAACATGGTATTGTTACATTAGCAACTATCATGAGTCCATAT
GGTGAAACTACACATACTTTTGTTGATAGATCTCAATATAAAGGTGCATTCTTACCAGGT
TTCACATACAAGGTCGCTTCAGATCCATTATCAAATATCACCGAACCAGTTGGCCTCAAC
TTAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGTCGTTCAATGG
TAC----------ATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGT
CGTTCAATGGTACGAAAAGGTTTTACAATTCCACCGTTTCTGGTCAGTTGATGACAAAAC
CATTCATACCGAATATTCATCATTAAGATCAGTCGTAGTTGCTGATAAGTCTGAAAAAGT
TAAATTACCAATTAATGAACCAGCCAATGGTATTAGAAAGAGTCAAATTCAAGAATACGT
AGATTTCTACAATGGTGCTGGTGTTCAACATATCGCCTTAAAGACTGATAACATCATTGA
TGCTATCTCAAAATTAAGATCTCGTGGTGTCTCTTTCCTCACTGTTCCAAAAACATACTA
TACATCACTCAGAGAGAAATTACAACACTCTTCATTAGAAATTAAAGAAGATTTGGACAC
TTTAGAGAAATTACACATTTTAATCGATTATGATGACAAAGGTTATCTTTTACAAATCTT
TACAAATAATGTTGAAGATAAACCAACTGTTTTCTTTGAAATTATCCAAAGAAACAACCA
TGATGGTTTCGGTGCTGGTAACTTTAAATCCCTCTTTGAAGCAATCGAAAGACAACAAGA
AACTCGTGGAAACTTATAGGTGTGTCTCAATTACAAATGTTCTTTATTATTAACTAAATA
ACATTACAATATANCNAAATAAATAATAAATAAAACTCTATTTAAAAAGTA
Length of connected seq. 1301
Full length Seq ID CHD806E
Full length Seq.
>CHD806E.Seq
AAAAAGAGTGGATATTTTCTAAAAAAAATAAAAATAAAAATGGAAATGGAAGGATTTGAT
CATGTTACATTTTGGGTTGGTAATGCATTACAAGCAGCAACTTATTACATTGCTAGATTT
GGATTTCAAAATTTAGCTTATAGTGGTTTAGAAACTGGTAATCGTCAATTCGCCACTCAT
GTTATCCATCAAAACAACATTATTATGGCTTTTACATCACCATTAACTGGTGACAACAAA
GATTATGCAGACCACATGATGAGACATGGTGATGGTGTCAAAGATATTGCTTTCAACGTT
AAAGATGTACAACACATTTATGATGAAGCAGTTAAAGCAGGAGCTCAATCAGTTAAAGAA
CCACATCAAATTAAAGACGAACATGGTATTGTTACATTAGCAACTATCATGAGTCCATAT
GGTGAAACTACACATACTTTTGTTGATAGATCTCAATATAAAGGTGCATTCTTACCAGGT
TTCACATACAAGGTCGCTTCAGATCCATTATCAAATATCACCGAACCAGTTGGCCTCAAC
TTAATAGATCATGTCGTTTCAAATCATGCAGATAAAATGATGGAACCAGTCGTTCAATGG
TACGAAAAGGTTTTACAATTCCACCGTTTCTGGTCAGTTGATGACAAAACCATTCATACC
GAATATTCATCATTAAGATCAGTCGTAGTTGCTGATAAGTCTGAAAAAGTTAAATTACCA
ATTAATGAACCAGCCAATGGTATTAGAAAGAGTCAAATTCAAGAATACGTAGATTTCTAC
AATGGTGCTGGTGTTCAACATATCGCCTTAAAGACTGATAACATCATTGATGCTATCTCA
AAATTAAGATCTCGTGGTGTCTCTTTCCTCACTGTTCCAAAAACATACTATACATCACTC
AGAGAGAAATTACAACACTCTTCATTAGAAATTAAAGAAGATTTGGACACTTTAGAGAAA
TTACACATTTTAATCGATTATGATGACAAAGGTTATCTTTTACAAATCTTTACAAATAAT
GTTGAAGATAAACCAACTGTTTTCTTTGAAATTATCCAAAGAAACAACCATGATGGTTTC
GGTGCTGGTAACTTTAAATCCCTCTTTGAAGCAATCGAAAGACAACAAGAAACTCGTGGA
AACTTATAGGTGTGTCTCAATTACAAATGTTCTTTATTATTAACTAAATAACATTACAAT
ATANCNAAATAAATAATAAATAAAACTCTATTTAAAAAGTA
Length of full length seq. 1241