CFC781
Library CF
Clone ID CFC781
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U15456-1
Representative seq. ID CFC781P
Representative DNA sequence
>CFC781 (CFC781Q) /CSM/CF/CFC7-D/CFC781Q.Seq.d/
ATTATTTTTAATTTTTTTTTATTATTTTTATTATTTTTTATACATTAATTTTTTTTTAAA
AAAAAATTATTTAATTTAAATTAACAAAGATTTTGGCGCACAGATTTCAACAAAGATTTT
TTTATTAATCAATTCCACATGATTCAATAAAGGGGAAAATGAAATTACTTATTTTAACTT
TATTTTTAGCTACTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAG
CTTCAAGAGAATCTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCA
ATGCTGGTACCACAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTA
CCATTGGTACCCCAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGT
GGATTCCATCAAAGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATA
ACAGCGGTGCCTCAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTA
GTGGTGCTATGTCAGGTTTTGTCTCTXXXXXXXXXXAAAGATCAATTATTCGCTGAAGCC
ACTGCCGAACCAGGTATTGCTTTCGATTTCGCCAAATTCGATGGTATTTTAGGTCTTGCT
TTCCAATCCATCTCTGTTAATTCAATTCCACCAGTCTTTTACAACATGTTATCACAAGGT
TTAGTTTCATCAACACTCTTCTCCTTCTGGTTATCAAGAACTCCAGGTGCCAACGATGGT
GAACTCTCATTCGGTTCAATCGATAACACCAAATACACTGGTGACATTACCTACGTCCCA
TTAACCAACGAAACCTATTGGGAATTCGTTATGGATGACTTTGCTATCGATGGTCAATCA
GCTGGTTTCTGTGGTACTACTTGTCACGCAATTTGCGATTCAGGTACATCACTCATTGCT
GGTCCAATGGCTGATATTACTGCCCTCAATGAAAAATTAGGTGCTGTCATCTTAAATGGT
GAAGGTGTCTTCTCTGATTGTAGCGTTATCAACACCTTACCAAATGTTACCATCACCGTT
GCTGGTCGTGAATTTGTTTTAACTCCAAAAGAATACGTTTTAGAAGTTACTGAGTTCGGA
AAGACTGAATGTTTGAGTGGATTTATGGGTATCGAGTTAAACATGGGAAATTTCTGGATC
CTTGGTGATGTTTTCATCTCTGCTTACTATACTGTATTCGATTTTGGTAATAAACAAGTT
GGTTTCGCAACTGCCATTCAAGGTTAAATTTTTTTAATTAATTTATATTTAAGATAGAAA
TAAAACTAAATAATAGAACAATATAT
sequence update 2001. 6. 1
Translated Amino Acid sequence
yf*fffiifiifytliff*kkii*fkltkilahrfqqrffy*SIPHDSIKGKMKLLILTL
FLATIVLAQALTVPLNFHQASRESRRRVPQKWSNRLSALNAGTTIPISDFEDAQYYGAIT
IGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGASSTYVANGTDFTIQYGS
GAMSGFVS---

---KDQLFAEATAEPGIAFDFAKFDGILGLAFQSISVNSIPPVFYNMLSQGLVSSTLFSF
WLSRTPGANDGELSFGSIDNTKYTGDITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCH
AICDSGTSLIAGPMADITALNEKLGAVILNGEGVFSDCSVINTLPNVTITVAGREFVLTP
KEYVLEVTEFGKTECLSGFMGIELNMGNFWILGDVFISAYYTVFDFGNKQVGFATAIQG*
iflinlylr*k*n*iieqy


Translated Amino Acid sequence (All Frames)
Frame A:
iifnffllfllffih*fffkkklfnln*qrfwrtdfnkdffinqfhmiq*rgk*nylf*l
yf*lllf*lkl*qyh*tsiklqenleeefhkngqtdyllsmlvpqsqfqilkmlntmvpl
plvpqvkpsk*fsilvhptcgfhqrnvqslllhviyitnitavpqahmsptelispsntv
vvlcqvls---

---KDQLFAEATAEPGIAFDFAKFDGILGLAFQSISVNSIPPVFYNMLSQGLVSSTLFSF
WLSRTPGANDGELSFGSIDNTKYTGDITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCH
AICDSGTSLIAGPMADITALNEKLGAVILNGEGVFSDCSVINTLPNVTITVAGREFVLTP
KEYVLEVTEFGKTECLSGFMGIELNMGNFWILGDVFISAYYTVFDFGNKQVGFATAIQG*
iflinlylr*k*n*iieqy

Frame B:
lfliffyyfyyflyinfflkknyli*inkdfgaqistkifllinst*fnkgeneityfnf
ifsyycfsssfnstiklpssfkri*kksstkmvkqiicsqcwyhnpnfrf*rcsilwchy
hwyprsslqssfrywfiqlvdsikemsnhcccm*ft*qi*qrclkhicrqrn*fhhpir*
wcyvrfcl---

---kinyslkplpnqvllsispnsmvf*vllsnpslliqfhqsfttcyhkv*fhqhssps
gyqelqvptmvnshsvqsitpntlvtlptsh*ptkpignslwmtllsmvnqlvsvvllvt
qfaiqvhhsllvqwlillpsmkn*vlss*mvkvsslivalstpyqmlpspllvvnlf*lq
kntf*kllsserlnv*vdlwvss*tweisgslvmfsslltilysilvinklvsqlpfkvk
ff*liyi*drnktk**nni

Frame C:
yf*fffiifiifytliff*kkii*fkltkilahrfqqrffy*SIPHDSIKGKMKLLILTL
FLATIVLAQALTVPLNFHQASRESRRRVPQKWSNRLSALNAGTTIPISDFEDAQYYGAIT
IGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGASSTYVANGTDFTIQYGS
GAMSGFVS---

---rsiir*shcrtrycfrfrqirwyfrscfpihlc*fnstsllqhvitrfsfintllll
viknsrcqrw*tlirfnr*hqihw*hylrpinqrnllgiryg*lcyrwsiswflwyylsr
nlrfryithcwsng*yycpq*kircchlkw*rcll*l*ryqhltkcyhhrcws*icfnsk
rirfrsy*vrkd*mfewiygyrvkhgkfldpw*cfhlcllycirfw**tswfrnchsrln
ffn*fifkieiklnnrtiy
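
The three forward-frame translations listed above (Frame A, B, C) can be approximated with a short script. The following is a minimal sketch, assuming the standard genetic code was used. The names `translate` and `three_frame_translation` are hypothetical and are not part of the database's own pipeline. The sketch prints uppercase residues throughout and does not reproduce the record's upper/lowercase display convention or its `---` gap marker; codons containing gap characters are shown as `?`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
# Assumption: the record was translated with the standard table.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq):
    """Translate seq codon by codon; codons not in the table become '?'."""
    seq = seq.upper()
    return "".join(CODON_TABLE.get(seq[i:i + 3], "?")
                   for i in range(0, len(seq) - 2, 3))


def three_frame_translation(seq):
    """Return forward translations starting at offsets 0, 1 and 2 (A, B, C)."""
    return {name: translate(seq[offset:]) for offset, name in enumerate("ABC")}


# First 17 bases of the representative sequence shown in this record.
frames = three_frame_translation("ATTATTTTTAATTTTTT")
for name, peptide in frames.items():
    print(name, peptide)
```

For these first bases the sketch gives IIFNF, LFLIF and YF*FF, which agree with the opening residues of Frames A, B and C as listed.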

Homology vs CSM-cDNA

Sequences producing significant alignments: Score (bits), E value

CFC781 (CFC781Q) /CSM/CF/CFC7-D/CFC781Q.Seq.d/ 2339 0.0
VFL410 (VFL410Q) /CSM/VF/VFL4-A/VFL410Q.Seq.d/ 1477 0.0
VFL274 (VFL274Q) /CSM/VF/VFL2-D/VFL274Q.Seq.d/ 1477 0.0
VFG815 (VFG815Q) /CSM/VF/VFG8-A/VFG815Q.Seq.d/ 1477 0.0
VFF141 (VFF141Q) /CSM/VF/VFF1-B/VFF141Q.Seq.d/ 1477 0.0
VFD482 (VFD482Q) /CSM/VF/VFD4-D/VFD482Q.Seq.d/ 1477 0.0
CFC455 (CFC455Q) /CSM/CF/CFC4-C/CFC455Q.Seq.d/ 1477 0.0
VFM519 (VFM519Q) /CSM/VF/VFM5-A/VFM519Q.Seq.d/ 1475 0.0
VFL252 (VFL252Q) /CSM/VF/VFL2-C/VFL252Q.Seq.d/ 1471 0.0
VFD224 (VFD224Q) /CSM/VF/VFD2-A/VFD224Q.Seq.d/ 1469 0.0

own update 2004.12.25
Homology vs DNA

Sequences producing significant alignments: Score (bits), E value, N

Y16962|Y16962.1 Dictyostelium discoideum mRNA for cathepsin D. 1404 0.0 3
AJ243946|AJ243946.1 Dictyostelium discoideum ctsD gene for cathepsin D, exons 1 to 2. 1404 0.0 3
E33916|E33916.1 Candida boidinii strain with lowered protease activity and utilization thereof as host for producing foreign protein. 36 3e-06 5
AF076243|AF076243.1 Arabidopsis thaliana BAC T26N6 from chromosome IV at 19.3 cM, complete sequence. 62 5e-06 4
BX829292|BX829292.1 Arabidopsis thaliana Full-length cDNA Complete sequence from clone GSLTSIL89ZE03 of Silique of strain col-0 of Arabidopsis thaliana (thale cress). 62 2e-05 1
AL765461|AL765461.1 Arabidopsis thaliana T-DNA flanking sequence GK-139E08-012875. 62 2e-05 1
AV567523|AV567523.1 Arabidopsis thaliana cDNA clone:SQL15g03F, 3' end. 62 2e-05 1
AL762930|AL762930.1 Arabidopsis thaliana T-DNA flanking sequence GK-030C11-011522. 62 2e-05 1
AF372974|AF372974.1 Arabidopsis thaliana AT4g04460/T26N6_7 mRNA, complete cds. 62 2e-05 1
BX838383|BX838383.1 Arabidopsis thaliana Full-length cDNA 5PRIM end of clone GSLTSIL89ZD03 of Silique of strain col-0 of Arabidopsis thaliana (thale cress). 62 2e-05 1
dna update 2004. 3. 2
Homology vs Protein

Sequences producing significant alignments: Score (bits), E value

(Q03168) RecName: Full=Lysosomal aspartic protease; EC=... 243 2e-96
EF213114_1(EF213114|pid:none) Penaeus monodon cathepsin D mRNA, ... 240 3e-94
BT080419_1(BT080419|pid:none) Caligus clemensi clone ccle-evs-50... 244 5e-94
EF070454_1(EF070454|pid:none) Maconellicoccus hirsutus clone WHM... 238 2e-93
DQ010007_1(DQ010007|pid:none) Bombyx mori CathD mRNA, complete c... 233 5e-93
FJ168036_1(FJ168036|pid:none) Fasciola hepatica cathepsin D-like... 236 3e-91
AB106552_1(AB106552|pid:none) Todarodes pacificus tpaD mRNA for ... 228 5e-91
DQ909010_1(DQ909010|pid:none) Clonorchis sinensis aspartic prote... 241 6e-91
EF000001_1(EF000001|pid:none) Fasciola hepatica cathepsin D-like... 230 3e-89
DQ131585_1(DQ131585|pid:none) Opisthorchis viverrini cathepsin D... 238 1e-88
protein update 2009. 5.12
PSORT

psg: 0.86 gvh: 0.77 alm: 0.41 top: 0.60 tms: 0.00 mit: 0.32 mip: 0.10
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

32.0 %: extracellular, including cell wall
24.0 %: nuclear
20.0 %: mitochondrial
12.0 %: cytoplasmic
8.0 %: vacuolar
4.0 %: cytoskeletal

>> prediction for CFC781 is exc

5' end seq. ID CFC781F
5' end seq.
>CFC781F.Seq
ATTATTTTTAATTTTTTTTTATTATTTTTATTATTTTTTATACATTAATTTTTTTTTAAA
AAAAAATTATTTAATTTAAATTAACAAAGATTTTGGCGCACAGATTTCAACAAAGATTTT
TTTATTAATCAATTCCACATGATTCAATAAAGGGGAAAATGAAATTACTTATTTTAACTT
TATTTTTAGCTACTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAG
CTTCAAGAGAATCTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCA
ATGCTGGTACCACAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTA
CCATTGGTACCCCAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGT
GGATTCCATCAAAGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATA
ACAGCGGTGCCTCAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTA
GTGGTGCTATGTCAGGTTTTGTCTCT----------
Length of 5' end seq. 566
3' end seq. ID CFC781Z
3' end seq.
>CFC781Z.Seq
----------AAAGATCAATTATTCGCTGAAGCCACTGCCGAACCAGGTATTGCTTTCGA
TTTCGCCAAATTCGATGGTATTTTAGGTCTTGCTTTCCAATCCATCTCTGTTAATTCAAT
TCCACCAGTCTTTTACAACATGTTATCACAAGGTTTAGTTTCATCAACACTCTTCTCCTT
CTGGTTATCAAGAACTCCAGGTGCCAACGATGGTGAACTCTCATTCGGTTCAATCGATAA
CACCAAATACACTGGTGACATTACCTACGTCCCATTAACCAACGAAACCTATTGGGAATT
CGTTATGGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGTACTACTTGTCA
CGCAATTTGCGATTCAGGTACATCACTCATTGCTGGTCCAATGGCTGATATTACTGCCCT
CAATGAAAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGATTGTAGCGT
TATCAACACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTTGTTTTAACTCC
AAAAGAATACGTTTTAGAAGTTACTGAGTTCGGAAAGACTGAATGTTTGAGTGGATTTAT
GGGTATCGAGTTAAACATGGGAAATTTCTGGATCCTTGGTGATGTTTTCATCTCTGCTTA
CTATACTGTATTCGATTTTGGTAATAAACAAGTTGGTTTCGCAACTGCCATTCAAGGTTA
AATTTTTTTAATTAATTTATATTTAAGATAGAAATAAAACTAAATAATAGAACAATATAT
Length of 3' end seq. 770
Connected seq. ID CFC781P
Connected seq.
>CFC781P.Seq
ATTATTTTTAATTTTTTTTTATTATTTTTATTATTTTTTATACATTAATTTTTTTTTAAA
AAAAAATTATTTAATTTAAATTAACAAAGATTTTGGCGCACAGATTTCAACAAAGATTTT
TTTATTAATCAATTCCACATGATTCAATAAAGGGGAAAATGAAATTACTTATTTTAACTT
TATTTTTAGCTACTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAG
CTTCAAGAGAATCTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCA
ATGCTGGTACCACAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTA
CCATTGGTACCCCAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGT
GGATTCCATCAAAGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATA
ACAGCGGTGCCTCAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTA
GTGGTGCTATGTCAGGTTTTGTCTCT----------AAAGATCAATTATTCGCTGAAGCC
ACTGCCGAACCAGGTATTGCTTTCGATTTCGCCAAATTCGATGGTATTTTAGGTCTTGCT
TTCCAATCCATCTCTGTTAATTCAATTCCACCAGTCTTTTACAACATGTTATCACAAGGT
TTAGTTTCATCAACACTCTTCTCCTTCTGGTTATCAAGAACTCCAGGTGCCAACGATGGT
GAACTCTCATTCGGTTCAATCGATAACACCAAATACACTGGTGACATTACCTACGTCCCA
TTAACCAACGAAACCTATTGGGAATTCGTTATGGATGACTTTGCTATCGATGGTCAATCA
GCTGGTTTCTGTGGTACTACTTGTCACGCAATTTGCGATTCAGGTACATCACTCATTGCT
GGTCCAATGGCTGATATTACTGCCCTCAATGAAAAATTAGGTGCTGTCATCTTAAATGGT
GAAGGTGTCTTCTCTGATTGTAGCGTTATCAACACCTTACCAAATGTTACCATCACCGTT
GCTGGTCGTGAATTTGTTTTAACTCCAAAAGAATACGTTTTAGAAGTTACTGAGTTCGGA
AAGACTGAATGTTTGAGTGGATTTATGGGTATCGAGTTAAACATGGGAAATTTCTGGATC
CTTGGTGATGTTTTCATCTCTGCTTACTATACTGTATTCGATTTTGGTAATAAACAAGTT
GGTTTCGCAACTGCCATTCAAGGTTAAATTTTTTTAATTAATTTATATTTAAGATAGAAA
TAAAACTAAATAATAGAACAATATAT
Length of connected seq. 1336
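
The reported lengths are consistent with the connected sequence being the 5' and 3' end reads joined across an unsequenced gap, with the placeholder characters left out of the count (566 + 770 = 1336). The following is a minimal sketch of that check, assuming `-` (and `X` in the representative sequence) mark the gap. The helper name `count_bases` is hypothetical.

```python
def count_bases(fasta_body, gap_chars="-X"):
    """Count sequence characters, skipping whitespace and gap placeholders."""
    return sum(1 for ch in fasta_body
               if not ch.isspace() and ch not in gap_chars)


# Last line of the 5' read and first line of the 3' read, as shown in this record.
five_prime_tail = "GTGGTGCTATGTCAGGTTTTGTCTCT----------"
three_prime_head = "----------AAAGATCAATTATTCGCTGAAGCCACTGCCGAACCAGGTATTGCTTTCGA"

print(count_bases(five_prime_tail))   # 26 bases, 10 gap characters ignored
print(count_bases(three_prime_head))  # 50 bases, 10 gap characters ignored
print(566 + 770)                      # 1336, the reported connected length
```

Applying `count_bases` to the full text of each sequence block, with the `>` header line excluded, should give the lengths stated in the record if the gap assumption holds.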
Full length Seq ID -
Full length Seq. -
Length of full length seq. -