CFF884
Library CF
(Link to library)
Clone ID CFF884
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U15456-1
Original site URL
Representative seq. ID CFF884P
(Link to Original site)
Representative DNA sequence
>CFF884 (CFF884Q) /CSM/CF/CFF8-D/CFF884Q.Seq.d/
TTTTATTTTTTATACATTAATTTTTTTTTAAAAAAAAATTATTTAATTTAAATTAACAAA
GATTTTGGCGCACAGATTTCAACAAAGATTTTTTTATTAATCAATTCCACATGATTCAAT
AAAGGGGAAAATGAAATTACTTATTTTAACTTTATTTTTAGCTACTATTGTTTTAGCTCA
AGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGAATCTAGAAGAAGAGTTCC
ACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTACCACAATCCCAATTTCAGA
TTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTACCCCAGGTCAAGCCTTCAA
AGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATCAAAGAAATGTCCAATCAC
TGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGCCTCAAGCACATATGTCGC
CAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTATGTCAGGTTTTGTCTCTCA
AGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATTATTCGCTGAAGCCAXXXX
XXXXXXGTTAAAGATCAATTATTCGCTGAAGCCACTGCCGAACCAGGTATTGCTTTCGAT
TTCGCCAAATTCGATGGTATTTTAGATCTTGCTTTCCAATCCATCTCTGTTAATTCAATT
CCACCAGTCTTTTACAACATGTTATCACAAGGTTTAGTTTCATCAACACTCTTCTCCTTC
TGGTTATCAAGAACTCCAGGTGCCAACGGTGGTGAACTCTCATTCGGTTCAATCGATAAC
ACCAAATACACTGGTGACATTACCTACGTCCCATTAACCAACGAAACCTATTGGGAATTC
GTTATGGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGTACTACTTGTCAC
GCAATTTGCGATTCAGGTACATCACTCATTGCTGGTCCAATGGCTGATATTACTGCCCTC
AATGAAAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGATTGTAGCGTT
ATCAACACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTTGTTTTAACTCCA
AAAGAATACGTTTTAGAAGTTACTGAGTTCGGAAAGACTGAATGTTTGAGTGGATTTATG
GGTATCGAGTTAAACATGGGAAATTTCTGGATCCTTGGTGATGTTTTCATCTCTGCTTAC
TATACTGTATTCGATTTTGGTAATAAACAAGTTGGTTTCGCAACTGCCATTCAAGGTTAA
ATTTTTTTAATTAATTTATATTTAAGNATAGAAATAAAAC
sequence update 2001. 6. 1
Translated Amino Acid sequence
fifytliff*kkii*fkltkilahrfqqrffy*SIPHDSIKGKMKLLILTLFLATIVLAQ
ALTVPLNFHQASRESRRRVPQKWSNRLSALNAGTTIPISDFEDAQYYGAITIGTPGQAFK
VVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGASSTYVANGTDFTIQYGSGAMSGFVSQ
DSVTVGSLTVKDQLFAEA---

---VKDQLFAEATAEPGIAFDFAKFDGILDLAFQSISVNSIPPVFYNMLSQGLVSSTLFS
FWLSRTPGANGGELSFGSIDNTKYTGDITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTC
HAICDSGTSLIAGPMADITALNEKLGAVILNGEGVFSDCSVINTLPNVTITVAGREFVLT
PKEYVLEVTEFGKTECLSGFMGIELNMGNFWILGDVFISAYYTVFDFGNKQVGFATAIQG
*iflinlylxieik


Translated Amino Acid sequence (All Frames)
Frame A:
fyflyinfflkknyli*inkdfgaqistkifllinst*fnkgeneityfnfifsyycfss
sfnstiklpssfkri*kksstkmvkqiicsqcwyhnpnfrf*rcsilwchyhwyprsslq
ssfrywfiqlvdsikemsnhcccm*ft*qi*qrclkhicrqrn*fhhpir*wcyvrfcls
rfrhcwfinc*rsiir*s---

---VKDQLFAEATAEPGIAFDFAKFDGILDLAFQSISVNSIPPVFYNMLSQGLVSSTLFS
FWLSRTPGANGGELSFGSIDNTKYTGDITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTC
HAICDSGTSLIAGPMADITALNEKLGAVILNGEGVFSDCSVINTLPNVTITVAGREFVLT
PKEYVLEVTEFGKTECLSGFMGIELNMGNFWILGDVFISAYYTVFDFGNKQVGFATAIQG
*iflinlylxieik

Frame B:
fifytliff*kkii*fkltkilahrfqqrffy*SIPHDSIKGKMKLLILTLFLATIVLAQ
ALTVPLNFHQASRESRRRVPQKWSNRLSALNAGTTIPISDFEDAQYYGAITIGTPGQAFK
VVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGASSTYVANGTDFTIQYGSGAMSGFVSQ
DSVTVGSLTVKDQLFAEA---

---lkinyslkplpnqvllsispnsmvf*illsnpslliqfhqsfttcyhkv*fhqhssp
sgyqelqvptvvnshsvqsitpntlvtlptsh*ptkpignslwmtllsmvnqlvsvvllv
tqfaiqvhhsllvqwlillpsmkn*vlss*mvkvsslivalstpyqmlpspllvvnlf*l
qkntf*kllsserlnv*vdlwvss*tweisgslvmfsslltilysilvinklvsqlpfkv
kff*liyi*x*k*n

Frame C:
lffih*fffkkklfnln*qrfwrtdfnkdffinqfhmiq*rgk*nylf*lyf*lllf*lk
l*qyh*tsiklqenleeefhkngqtdyllsmlvpqsqfqilkmlntmvplplvpqvkpsk
*fsilvhptcgfhqrnvqslllhviyitnitavpqahmsptelispsntvvvlcqvlslk
ipsllvh*llkinyslkp---

---*rsiir*shcrtrycfrfrqirwyfrscfpihlc*fnstsllqhvitrfsfintlll
lviknsrcqrw*tlirfnr*hqihw*hylrpinqrnllgiryg*lcyrwsiswflwyyls
rnlrfryithcwsng*yycpq*kircchlkw*rcll*l*ryqhltkcyhhrcws*icfns
krirfrsy*vrkd*mfewiygyrvkhgkfldpw*cfhlcllycirfw**tswfrnchsrl
nffn*fifkxrnk

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

CFF884 (CFF884Q) /CSM/CF/CFF8-D/CFF884Q.Seq.d/ 2440 0.0
VFN222 (VFN222Q) /CSM/VF/VFN2-A/VFN222Q.Seq.d/ 1439 0.0
VFL761 (VFL761Q) /CSM/VF/VFL7-C/VFL761Q.Seq.d/ 1439 0.0
VFL385 (VFL385Q) /CSM/VF/VFL3-D/VFL385Q.Seq.d/ 1439 0.0
VFK659 (VFK659Q) /CSM/VF/VFK6-C/VFK659Q.Seq.d/ 1439 0.0
VFG570 (VFG570Q) /CSM/VF/VFG5-C/VFG570Q.Seq.d/ 1439 0.0
VFG218 (VFG218Q) /CSM/VF/VFG2-A/VFG218Q.Seq.d/ 1439 0.0
VFF759 (VFF759Q) /CSM/VF/VFF7-C/VFF759Q.Seq.d/ 1439 0.0
VFF495 (VFF495Q) /CSM/VF/VFF4-D/VFF495Q.Seq.d/ 1439 0.0
VFE438 (VFE438Q) /CSM/VF/VFE4-B/VFE438Q.Seq.d/ 1439 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

Y16962|Y16962.1 Dictyostelium discoideum mRNA for cathepsin D. 1409 0.0 2
AJ243946|AJ243946.1 Dictyostelium discoideum ctsD gene for cathepsin D, exons 1 to 2. 1409 0.0 3
AX059531|AX059531.1 Sequence 264 from Patent WO0055325. 62 2e-05 1
AF076243|AF076243.1 Arabidopsis thaliana BAC T26N6 from chromosome IV at 19.3 cM, complete sequence. 62 2e-05 1
BE525959|BE525959.1 M64C07STM Arabidopsis developing seed Arabidopsis thaliana cDNA clone 600034526R1 5', mRNA sequence. 62 2e-05 1
BX827174|BX827174.1 Arabidopsis thaliana Full-length cDNA Complete sequence from clone GSLTLS1ZB10 of Adult vegetative tissue of strain col-0 of Arabidopsis thaliana (thale cress). 62 2e-05 1
CD825453|CD825453.1 BN25.060N03F011129 BN25 Brassica napus cDNA clone BN25060N03, mRNA sequence. 62 2e-05 1
CB264579|CB264579.1 51-E015023-035-004-F13-T7R MPIZ-ADIS-035 Arabidopsis thaliana cDNA clone MPIZp2000F134Q 5-PRIME, mRNA sequence. 62 2e-05 1
BX829292|BX829292.1 Arabidopsis thaliana Full-length cDNA Complete sequence from clone GSLTSIL89ZE03 of Silique of strain col-0 of Arabidopsis thaliana (thale cress). 62 2e-05 1
AL161500|AL161500.2 Arabidopsis thaliana DNA chromosome 4, contig fragment No. 12. 62 2e-05 1
dna update 2004. 3. 6
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

EF213114_1(EF213114|pid:none) Penaeus monodon cathepsin D mRNA, ... 241 e-102
(Q03168) RecName: Full=Lysosomal aspartic protease; EC=... 243 e-101
DQ010007_1(DQ010007|pid:none) Bombyx mori CathD mRNA, complete c... 235 1e-99
EF070454_1(EF070454|pid:none) Maconellicoccus hirsutus clone WHM... 239 2e-99
BT080419_1(BT080419|pid:none) Caligus clemensi clone ccle-evs-50... 245 7e-99
AB106552_1(AB106552|pid:none) Todarodes pacificus tpaD mRNA for ... 229 2e-97
DQ909010_1(DQ909010|pid:none) Clonorchis sinensis aspartic prote... 241 2e-97
FJ168036_1(FJ168036|pid:none) Fasciola hepatica cathepsin D-like... 237 3e-96
FN316575_1(FN316575|pid:none) Schistosoma japonicum isolate Anhu... 228 1e-94
DQ131585_1(DQ131585|pid:none) Opisthorchis viverrini cathepsin D... 239 2e-94
protein update 2009. 5.14
PSORT

psg: 0.86 gvh: 0.77 alm: 0.42 top: 0.60 tms: 0.00 mit: 0.32 mip: 0.10
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

32.0 %: extracellular, including cell wall
24.0 %: nuclear
20.0 %: mitochondrial
12.0 %: cytoplasmic
8.0 %: vacuolar
4.0 %: cytoskeletal

>> prediction for CFF884 is exc

5' end seq. ID CFF884F
5' end seq.
>CFF884F.Seq
TTTTATTTTTTATACATTAATTTTTTTTTAAAAAAAAATTATTTAATTTAAATTAACAAA
GATTTTGGCGCACAGATTTCAACAAAGATTTTTTTATTAATCAATTCCACATGATTCAAT
AAAGGGGAAAATGAAATTACTTATTTTAACTTTATTTTTAGCTACTATTGTTTTAGCTCA
AGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGAATCTAGAAGAAGAGTTCC
ACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTACCACAATCCCAATTTCAGA
TTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTACCCCAGGTCAAGCCTTCAA
AGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATCAAAGAAATGTCCAATCAC
TGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGCCTCAAGCACATATGTCGC
CAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTATGTCAGGTTTTGTCTCTCA
AGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATTATTCGCTGAAGCCA----
------
Length of 5' end seq. 596
3' end seq. ID CFF884Z
3' end seq.
>CFF884Z.Seq
----------GTTAAAGATCAATTATTCGCTGAAGCCACTGCCGAACCAGGTATTGCTTT
CGATTTCGCCAAATTCGATGGTATTTTAGATCTTGCTTTCCAATCCATCTCTGTTAATTC
AATTCCACCAGTCTTTTACAACATGTTATCACAAGGTTTAGTTTCATCAACACTCTTCTC
CTTCTGGTTATCAAGAACTCCAGGTGCCAACGGTGGTGAACTCTCATTCGGTTCAATCGA
TAACACCAAATACACTGGTGACATTACCTACGTCCCATTAACCAACGAAACCTATTGGGA
ATTCGTTATGGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGTACTACTTG
TCACGCAATTTGCGATTCAGGTACATCACTCATTGCTGGTCCAATGGCTGATATTACTGC
CCTCAATGAAAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGATTGTAG
CGTTATCAACACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTTGTTTTAAC
TCCAAAAGAATACGTTTTAGAAGTTACTGAGTTCGGAAAGACTGAATGTTTGAGTGGATT
TATGGGTATCGAGTTAAACATGGGAAATTTCTGGATCCTTGGTGATGTTTTCATCTCTGC
TTACTATACTGTATTCGATTTTGGTAATAAACAAGTTGGTTTCGCAACTGCCATTCAAGG
TTAAATTTTTTTAATTAATTTATATTTAAGNATAGAAATAAAAC
Length of 3' end seq. 754
Connected seq. ID CFF884P
Connected seq.
>CFF884P.Seq
TTTTATTTTTTATACATTAATTTTTTTTTAAAAAAAAATTATTTAATTTAAATTAACAAA
GATTTTGGCGCACAGATTTCAACAAAGATTTTTTTATTAATCAATTCCACATGATTCAAT
AAAGGGGAAAATGAAATTACTTATTTTAACTTTATTTTTAGCTACTATTGTTTTAGCTCA
AGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGAATCTAGAAGAAGAGTTCC
ACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTACCACAATCCCAATTTCAGA
TTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTACCCCAGGTCAAGCCTTCAA
AGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATCAAAGAAATGTCCAATCAC
TGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGCCTCAAGCACATATGTCGC
CAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTATGTCAGGTTTTGTCTCTCA
AGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATTATTCGCTGAAGCCA----
------GTTAAAGATCAATTATTCGCTGAAGCCACTGCCGAACCAGGTATTGCTTTCGAT
TTCGCCAAATTCGATGGTATTTTAGATCTTGCTTTCCAATCCATCTCTGTTAATTCAATT
CCACCAGTCTTTTACAACATGTTATCACAAGGTTTAGTTTCATCAACACTCTTCTCCTTC
TGGTTATCAAGAACTCCAGGTGCCAACGGTGGTGAACTCTCATTCGGTTCAATCGATAAC
ACCAAATACACTGGTGACATTACCTACGTCCCATTAACCAACGAAACCTATTGGGAATTC
GTTATGGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGTACTACTTGTCAC
GCAATTTGCGATTCAGGTACATCACTCATTGCTGGTCCAATGGCTGATATTACTGCCCTC
AATGAAAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGATTGTAGCGTT
ATCAACACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTTGTTTTAACTCCA
AAAGAATACGTTTTAGAAGTTACTGAGTTCGGAAAGACTGAATGTTTGAGTGGATTTATG
GGTATCGAGTTAAACATGGGAAATTTCTGGATCCTTGGTGATGTTTTCATCTCTGCTTAC
TATACTGTATTCGATTTTGGTAATAAACAAGTTGGTTTCGCAACTGCCATTCAAGGTTAA
ATTTTTTTAATTAATTTATATTTAAGNATAGAAATAAAAC
Length of connected seq. 1350
Full length Seq ID -
Full length Seq. -
Length of full length seq. -