CHD888
Library CH
(Link to library)
Clone ID CHD888
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U15456-1
Original site URL
Representative seq. ID CHD888P
(Link to Original site)
Representative DNA sequence
>CHD888 (CHD888Q) /CSM/CH/CHD8-D/CHD888Q.Seq.d/
AATTTAAATTAACAAAGATTTTGGCGCACAGATTTCAACAAAGATTTTTTTATTAATCAA
TTCCACATGATTCAATAAAGGGGAAAATGAAATTACTTATTTTAACTTTATTTTTAGCTA
CTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGAAT
CTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTACCA
CAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTACCC
CAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATCAA
AGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGCCT
CAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTATGT
CAGGTTTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATTAT
TCGCTGAAGCCAXXXXXXXXXXTCTTGCTTTCCAATCCATCTCTGTTAATTCAATTCCAC
CAGTCTTTTACAACATGTTATCACAAGGTTTAGTTTCATCAACACTCTTCTCCTTCTGGT
TATCAAGAACTCCAGGTGCCAACGGTGGTGAACTCTCATTCGGTTCAATCGATAACACCA
AATACACTGGTGACATTACCTACGTCCCATTAACCAACGAAACCTATTGGGAATTCGTTA
TGGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGTACTACTTGTCACGCAA
TTTGCGATTCAGGTACATCACTCATTGCTGGTCCAATGGCTGATATTACTGCCCTCAATG
AAAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGATTGTAGCGTTATCA
ACACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTTGTTTTAACTCCAAAAG
AATACGTTTTAGAAGTTACTGAGTTCGGAAAGACTGAATGTTTGAGTGGATTTATGGGTA
TCGAGTTAAACATGGGAAATTTCTGGATCCTTGGTGATGTTTTCATCTCTGCTTACTATA
CTGTATTCGATTTTGGTAATAAACAAGTTGGTTTCGCAACTGCCATTTCAAGGTTAAATT
TTTTTAATTAAATTATATTTAAGNATAGTAAATACAACTAAATAA
sequence update 2002.10.25
Translated Amino Acid sequence
fkltkilahrfqqrffy*SIPHDSIKGKMKLLILTLFLATIVLAQALTVPLNFHQASRES
RRRVPQKWSNRLSALNAGTTIPISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLWIPSK
KCPITVVACDLHNKYNSGASSTYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVKDQLF
AEA---

---LAFQSISVNSIPPVFYNMLSQGLVSSTLFSFWLSRTPGANGGELSFGSIDNTKYTGD
ITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCHAICDSGTSLIAGPMADITALNEKLGA
VILNGEGVFSDCSVINTLPNVTITVAGREFVLTPKEYVLEVTEFGKTECLSGFMGIELNM
GNFWILGDVFISAYYTVFDFGNKQVGFATAISRLNFFN*iifkxskyn*i


Translated Amino Acid sequence (All Frames)
Frame A:
nln*qrfwrtdfnkdffinqfhmiq*rgk*nylf*lyf*lllf*lkl*qyh*tsiklqen
leeefhkngqtdyllsmlvpqsqfqilkmlntmvplplvpqvkpsk*fsilvhptcgfhq
rnvqslllhviyitnitavpqahmsptelispsntvvvlcqvlslkipsllvh*llkiny
slkp---

---scfpihlc*fnstsllqhvitrfsfintllllviknsrcqrw*tlirfnr*hqihw*
hylrpinqrnllgiryg*lcyrwsiswflwyylsrnlrfryithcwsng*yycpq*kirc
chlkw*rcll*l*ryqhltkcyhhrcws*icfnskrirfrsy*vrkd*mfewiygyrvkh
gkfldpw*cfhlcllycirfw**tswfrnchfkvkff*lnyi*x**iqln

Frame B:
i*inkdfgaqistkifllinst*fnkgeneityfnfifsyycfsssfnstiklpssfkri
*kksstkmvkqiicsqcwyhnpnfrf*rcsilwchyhwyprsslqssfrywfiqlvdsik
emsnhcccm*ft*qi*qrclkhicrqrn*fhhpir*wcyvrfclsrfrhcwfinc*rsii
r*s---

---LAFQSISVNSIPPVFYNMLSQGLVSSTLFSFWLSRTPGANGGELSFGSIDNTKYTGD
ITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCHAICDSGTSLIAGPMADITALNEKLGA
VILNGEGVFSDCSVINTLPNVTITVAGREFVLTPKEYVLEVTEFGKTECLSGFMGIELNM
GNFWILGDVFISAYYTVFDFGNKQVGFATAISRLNFFN*iifkxskyn*i

Frame C:
fkltkilahrfqqrffy*SIPHDSIKGKMKLLILTLFLATIVLAQALTVPLNFHQASRES
RRRVPQKWSNRLSALNAGTTIPISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLWIPSK
KCPITVVACDLHNKYNSGASSTYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVKDQLF
AEA---

---llsnpslliqfhqsfttcyhkv*fhqhsspsgyqelqvptvvnshsvqsitpntlvt
lptsh*ptkpignslwmtllsmvnqlvsvvllvtqfaiqvhhsllvqwlillpsmkn*vl
ss*mvkvsslivalstpyqmlpspllvvnlf*lqkntf*kllsserlnv*vdlwvss*tw
eisgslvmfsslltilysilvinklvsqlpfqg*ifliklylxivnttk*

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

CHD888 (CHD888Q) /CSM/CH/CHD8-D/CHD888Q.Seq.d/ 2319 0.0
CHB247 (CHB247Q) /CSM/CH/CHB2-B/CHB247Q.Seq.d/ 1287 0.0
SFA510 (SFA510Q) /CSM/SF/SFA5-A/SFA510Q.Seq.d/ 1269 0.0
VFN427 (VFN427Q) /CSM/VF/VFN4-B/VFN427Q.Seq.d/ 1259 0.0
VFN222 (VFN222Q) /CSM/VF/VFN2-A/VFN222Q.Seq.d/ 1259 0.0
VFK659 (VFK659Q) /CSM/VF/VFK6-C/VFK659Q.Seq.d/ 1259 0.0
VFG865 (VFG865Q) /CSM/VF/VFG8-C/VFG865Q.Seq.d/ 1259 0.0
VFG570 (VFG570Q) /CSM/VF/VFG5-C/VFG570Q.Seq.d/ 1259 0.0
VFF759 (VFF759Q) /CSM/VF/VFF7-C/VFF759Q.Seq.d/ 1259 0.0
VFF495 (VFF495Q) /CSM/VF/VFF4-D/VFF495Q.Seq.d/ 1259 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

Y16962|Y16962.1 Dictyostelium discoideum mRNA for cathepsin D. 1239 0.0 2
AJ243946|AJ243946.1 Dictyostelium discoideum ctsD gene for cathepsin D, exons 1 to 2. 1239 0.0 2
AB106552|AB106552.1 Todarodes pacificus tpaD mRNA for cathepsin D, complete cds. 58 1e-05 2
AL161500|AL161500.2 Arabidopsis thaliana DNA chromosome 4, contig fragment No. 12. 62 2e-05 1
AL765461|AL765461.1 Arabidopsis thaliana T-DNA flanking sequence GK-139E08-012875. 62 2e-05 1
AL762930|AL762930.1 Arabidopsis thaliana T-DNA flanking sequence GK-030C11-011522. 62 2e-05 1
AV567523|AV567523.1 Arabidopsis thaliana cDNA clone:SQL15g03F, 3' end. 62 2e-05 1
AF372974|AF372974.1 Arabidopsis thaliana AT4g04460/T26N6_7 mRNA, complete cds. 62 2e-05 1
AX059531|AX059531.1 Sequence 264 from Patent WO0055325. 62 2e-05 1
AF076243|AF076243.1 Arabidopsis thaliana BAC T26N6 from chromosome IV at 19.3 cM, complete sequence. 62 2e-05 1
dna update 2004. 9.20
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(Q03168) RecName: Full=Lysosomal aspartic protease; EC=... 348 2e-94
EF213114_1(EF213114|pid:none) Penaeus monodon cathepsin D mRNA, ... 343 5e-93
DQ010007_1(DQ010007|pid:none) Bombyx mori CathD mRNA, complete c... 339 1e-91
EF070454_1(EF070454|pid:none) Maconellicoccus hirsutus clone WHM... 337 5e-91
AB106552_1(AB106552|pid:none) Todarodes pacificus tpaD mRNA for ... 333 7e-90
DQ909010_1(DQ909010|pid:none) Clonorchis sinensis aspartic prote... 331 3e-89
FJ168036_1(FJ168036|pid:none) Fasciola hepatica cathepsin D-like... 327 4e-88
DQ131585_1(DQ131585|pid:none) Opisthorchis viverrini cathepsin D... 322 2e-86
AF454831_1(AF454831|pid:none) Apriona germari cathepsin D mRNA, ... 321 3e-86
EF000001_1(EF000001|pid:none) Fasciola hepatica cathepsin D-like... 321 4e-86
protein update 2009. 3.30
PSORT

psg: 0.86 gvh: 0.77 alm: 0.42 top: 0.60 tms: 0.00 mit: 0.32 mip: 0.10
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

32.0 %: extracellular, including cell wall
24.0 %: nuclear
20.0 %: mitochondrial
12.0 %: cytoplasmic
8.0 %: vacuolar
4.0 %: cytoskeletal

>> prediction for CHD888 is exc

5' end seq. ID CHD888F
5' end seq.
>CHD888F.Seq
AATTTAAATTAACAAAGATTTTGGCGCACAGATTTCAACAAAGATTTTTTTATTAATCAA
TTCCACATGATTCAATAAAGGGGAAAATGAAATTACTTATTTTAACTTTATTTTTAGCTA
CTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGAAT
CTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTACCA
CAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTACCC
CAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATCAA
AGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGCCT
CAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTATGT
CAGGTTTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATTAT
TCGCTGAAGCCANNNNNNNNNN
Length of 5' end seq. 562
3' end seq. ID CHD888Z
3' end seq.
>CHD888Z.Seq
NNNNNNNNNNTCTTGCTTTCCAATCCATCTCTGTTAATTCAATTCCACCAGTCTTTTACA
ACATGTTATCACAAGGTTTAGTTTCATCAACACTCTTCTCCTTCTGGTTATCAAGAACTC
CAGGTGCCAACGGTGGTGAACTCTCATTCGGTTCAATCGATAACACCAAATACACTGGTG
ACATTACCTACGTCCCATTAACCAACGAAACCTATTGGGAATTCGTTATGGATGACTTTG
CTATCGATGGTCAATCAGCTGGTTTCTGTGGTACTACTTGTCACGCAATTTGCGATTCAG
GTACATCACTCATTGCTGGTCCAATGGCTGATATTACTGCCCTCAATGAAAAATTAGGTG
CTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGATTGTAGCGTTATCAACACCTTACCAA
ATGTTACCATCACCGTTGCTGGTCGTGAATTTGTTTTAACTCCAAAAGAATACGTTTTAG
AAGTTACTGAGTTCGGAAAGACTGAATGTTTGAGTGGATTTATGGGTATCGAGTTAAACA
TGGGAAATTTCTGGATCCTTGGTGATGTTTTCATCTCTGCTTACTATACTGTATTCGATT
TTGGTAATAAACAAGTTGGTTTCGCAACTGCCATTTCAAGGTTAAATTTTTTTAATTAAA
TTATATTTAAGNATAGTAAATACAACTAAATAA
Length of 3' end seq. 693
Connected seq. ID CHD888P
Connected seq.
>CHD888P.Seq
AATTTAAATTAACAAAGATTTTGGCGCACAGATTTCAACAAAGATTTTTTTATTAATCAA
TTCCACATGATTCAATAAAGGGGAAAATGAAATTACTTATTTTAACTTTATTTTTAGCTA
CTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGAAT
CTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTACCA
CAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTACCC
CAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATCAA
AGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGCCT
CAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTATGT
CAGGTTTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATTAT
TCGCTGAAGCCA----------TCTTGCTTTCCAATCCATCTCTGTTAATTCAATTCCAC
CAGTCTTTTACAACATGTTATCACAAGGTTTAGTTTCATCAACACTCTTCTCCTTCTGGT
TATCAAGAACTCCAGGTGCCAACGGTGGTGAACTCTCATTCGGTTCAATCGATAACACCA
AATACACTGGTGACATTACCTACGTCCCATTAACCAACGAAACCTATTGGGAATTCGTTA
TGGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGTACTACTTGTCACGCAA
TTTGCGATTCAGGTACATCACTCATTGCTGGTCCAATGGCTGATATTACTGCCCTCAATG
AAAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGATTGTAGCGTTATCA
ACACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTTGTTTTAACTCCAAAAG
AATACGTTTTAGAAGTTACTGAGTTCGGAAAGACTGAATGTTTGAGTGGATTTATGGGTA
TCGAGTTAAACATGGGAAATTTCTGGATCCTTGGTGATGTTTTCATCTCTGCTTACTATA
CTGTATTCGATTTTGGTAATAAACAAGTTGGTTTCGCAACTGCCATTTCAAGGTTAAATT
TTTTTAATTAAATTATATTTAAGNATAGTAAATACAACTAAATAA
Length of connected seq. 1235
Full length Seq ID -
Full length Seq. -
Length of full length seq. -