CFC321
Library CF
(Link to library)
Clone ID CFC321
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16592-1
Original site URL
Representative seq. ID CFC321P
(Link to Original site)
Representative DNA sequence
>CFC321 (CFC321Q) /CSM/CF/CFC3-A/CFC321Q.Seq.d/
AGTCAATATATTAGTGTTTACAATGGTATGGATATTACCATCAACTTTTACAATCAAGAT
AATACATACAATGTAGGTCCAGTTAAATATGATATGGTTTGTACCACTACCCCAGGTAAT
GGTTCATTAGTTAATGTTTTACCAACTGAACCTTCATCATGGGTTTACAATGGTACATCA
ACTGTTAACGGTGTCCAAGTCTTTGGTTACAGTCAAAAGATCACTCAATATGGTCGTACT
GGTTTCTACAACTTTTACGTTGATGCCAACGGTGTTCCAGTTCAATTCTATATGGATGGT
GTCGATTATGTATTTGGTAGTCACCCAGATGTTTACGTATTAAACTTTGATATCTACACC
ACCGATATCAGCTCATACGAATCATACTTTGATATTCCAGTTCTCTGTAATAACGCAAAG
GAAGCCCCAGCTAAAGAAAACCAATTCGATGGTCTTTTCTCATCAATCGGTGATAACTTA
TTAGCCAAAGAAGAACAAGCCTCAAACTTATTCAAAGAATACAAAGCTCAATACAACAAG
GAATACTCAAGCCAAGACGAACATGATGAACGTTTCATTAACTTTAAAGCTGXXXXXXXX
XXATCAAGGTATTTGCGGTTCATGTTGGACTTTTGGTTCAACTGGTTCATTAGAAGGTAC
CAACTGTGTCACCAACGGTGAATTAGTCTCCCTCTCTGAACAACAATTAGTTGATTGTGC
TATCCTTACCGGTAGTCAAGGTTGTGGTGGTGGTTTTGCATCATCTGCATTCCAATACGT
CATGGAAATTGGTAGTCTCGCCACCGAGTCCAACTATCCATACTTAATGCAAAATGGTCT
CTGCAGAGATAGAACTGTCACTCCATCAGGTGTTTCAATCACTGGTTACGTCAATGTTAC
CTCTGGTAGTGAATCTGCCCTTCAAAACGCTATCGCCACCACTGGTCCAGTCGCCATCGC
CATCGATGCCTCTGTTGATGATTTCCGTTACTACATGTCTGGTGTTTACAATAATCCAGC
CTGTAAAAATGGTTTAGATGATTTGGATCACGAAGTTTTAGCTATTGGTTATGGTACTTA
TCAAGGTCAAGATTATTTCTTAGTTAAAAACTCTTGGTCAACTAACTGGGGTATGGATGG
TTATGTTTACATGGCTAGAAATGATAACAATTTATGTGGTGTTTCAAGTCAAGCCACCTA
TCCAATTCCAACAAAGAATTAAATTTCTTCAATAAATCCAATAAATATATATTTTAAAC
sequence update 2001. 6. 1
Translated Amino Acid sequence
SQYISVYNGMDITINFYNQDNTYNVGPVKYDMVCTTTPGNGSLVNVLPTEPSSWVYNGTS
TVNGVQVFGYSQKITQYGRTGFYNFYVDANGVPVQFYMDGVDYVFGSHPDVYVLNFDIYT
TDISSYESYFDIPVLCNNAKEAPAKENQFDGLFSSIGDNLLAKEEQASNLFKEYKAQYNK
EYSSQDEHDERFINFKA---

---QGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQ
YVMEIGSLATESNYPYLMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVA
IAIDASVDDFRYYMSGVYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGM
DGYVYMARNDNNLCGVSSQATYPIPTKN*issinpiniyfk


Translated Amino Acid sequence (All Frames)
Frame A:
SQYISVYNGMDITINFYNQDNTYNVGPVKYDMVCTTTPGNGSLVNVLPTEPSSWVYNGTS
TVNGVQVFGYSQKITQYGRTGFYNFYVDANGVPVQFYMDGVDYVFGSHPDVYVLNFDIYT
TDISSYESYFDIPVLCNNAKEAPAKENQFDGLFSSIGDNLLAKEEQASNLFKEYKAQYNK
EYSSQDEHDERFINFKA---

---ikvfavhvgllvqlvh*kvptvsptvn*spslnnn*livlslpvvkvvvvvlhhlhs
ntswklvvsppsptiht*ckmvsaeielslhqvfqslvtsmlplvvnlpfktlspplvqs
pspsmpllmisvttclvftiiqpvkmv*miwitkf*llvmvlikvkiis*lktlgqltgv
wmvmftwlemitiyvvfqvkppiqfqqrikflq*iq*iyiln

Frame B:
vnilvftmvwilpstftikiihtm*vqlnmiwfvplpqvmvh*lmfyqlnlhhgftmvhq
lltvskslvtvkrslnmvvlvsttftlmptvfqfnsiwmvsimylvvtqmfty*tlistp
pisahtnhtlifqfsvitqrkpqlkktnsmvfshqsvity*pkknkpqtyskntklnttr
ntqaktnmmnvsltlkl---

---srylrfmldfwfnwfirryqlchqr*islpl*ttis*lcypyr*srlwwwfciicip
irhgnw*srhrvqlsilnakwslqr*nchsircfnhwlrqcylw**icpskryrhhwssr
hrhrclc**fpllhvwclq*ssl*kwfr*fgsrsfsywlwylsrsrlfls*kllvn*lgy
gwlclhg*k**qfmwcfksshlsnsnkelnffnksnkyif*

Frame C:
siy*clqwygyyhqllqsr*yiqcrss*i*yglyhypr*wfis*cftn*tfimglqwyin
c*rcpslwlqskdhsiwsywflqllr*cqrcsssilygwcrlciw*sprclrikl*ylhh
ryqliriil*ysssl**rkgsps*rkpirwsflinr**lisqrrtslkliqriqssiqqg
ilkprrt**tfh*l*s---

---QGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQ
YVMEIGSLATESNYPYLMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVA
IAIDASVDDFRYYMSGVYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGM
DGYVYMARNDNNLCGVSSQATYPIPTKN*issinpiniyfk

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

CFC321 (CFC321Q) /CSM/CF/CFC3-A/CFC321Q.Seq.d/ 2436 0.0
VFI581 (VFI581Q) /CSM/VF/VFI5-D/VFI581Q.Seq.d/ 1302 0.0
VFG444 (VFG444Q) /CSM/VF/VFG4-B/VFG444Q.Seq.d/ 1302 0.0
FC-AN13 (FC-AN13Q) /CSM/FC/FC-AN/FC-AN13Q.Seq.d/ 1302 0.0
AFE412 (AFE412Q) /CSM/AF/AFE4-A/AFE412Q.Seq.d/ 1302 0.0
VFM638 (VFM638Q) /CSM/VF/VFM6-B/VFM638Q.Seq.d/ 1296 0.0
SFE845 (SFE845Q) /CSM/SF/SFE8-B/SFE845Q.Seq.d/ 1292 0.0
VFO148 (VFO148Q) /CSM/VF/VFO1-B/VFO148Q.Seq.d/ 1281 0.0
SLE594 (SLE594Q) /CSM/SL/SLE5-D/SLE594Q.Seq.d/ 1279 0.0
CFI141 (CFI141Q) /CSM/CF/CFI1-B/CFI141Q.Seq.d/ 1271 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

S58669|S58669.1 Entamoeba histolytica cysteine proteinase precursor (ACP1) gene, partial cds. 54 9e-06 2
AZ547119|AZ547119.1 ENTFS26TR Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 54 1e-05 2
AZ674505|AZ674505.1 ENTIZ88TF Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 54 1e-05 2
X87214|X87214.1 E.histolytica mRNA for cysteine proteinase. 54 2e-05 2
X87213|X87213.1 E.dispar mRNA for cysteine proteinase. 54 6e-04 2
CR391952|CR391952.2 Zebrafish DNA sequence *** SEQUENCING IN PROGRESS *** from clone CH211-194B7. 40 0.001 7
M27307|M27307.1 Entamoeba histolytica cysteine protease gene, partial cds. 54 0.005 1
AC125515|AC125515.4 Pan troglodytes clone RP43-171A17, WORKING DRAFT SEQUENCE, 7 ordered pieces. 36 0.013 2
AF315312|AF315312.2 Homo sapiens chromosome 8 clone RP1-80K22 map 8q24.3, complete sequence. 36 0.013 2
AC103819|AC103819.3 Homo sapiens chromosome 8, clone CTD-3056O22, complete sequence. 36 0.013 2
dna update 2004. 5.25
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

BC075887_1(BC075887|pid:none) Danio rerio cathepsin L.1, mRNA (c... 219 1e-55
D82884_1(D82884|pid:none) Sitophilus zeamais mRNA for cysteine p... 213 9e-54
AY220615_1(AY220615|pid:none) Hydra vulgaris cathepsin L precurs... 212 2e-53
DQ280314_1(DQ280314|pid:none) Hymeniacidon perlevis cathepsin L ... 211 3e-53
AY363263_1(AY363263|pid:none) Triatoma infestans cathepsin L-lik... 211 6e-53
AY336798_1(AY336798|pid:none) Rhipicephalus haemaphysaloides hae... 210 9e-53
(P13277) RecName: Full=Digestive cysteine proteinase 1; ... 209 2e-52
EF070511_1(EF070511|pid:none) Maconellicoccus hirsutus clone WHM... 209 2e-52
AF194426_1(AF194426|pid:none) Myxine glutinosa clone hicl20 cyst... 209 2e-52
AY795054_1(AY795054|pid:none) Artemia franciscana cathepsin L pr... 209 2e-52
protein update 2009. 5.12
PSORT

psg: 0.75 gvh: 0.31 alm: 0.42 top: 0.53 tms: 0.00 mit: 0.17 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.20 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

44.0 %: cytoplasmic
32.0 %: nuclear
24.0 %: cytoskeletal

>> prediction for CFC321 is cyt

5' end seq. ID CFC321F
5' end seq.
>CFC321F.Seq
AGTCAATATATTAGTGTTTACAATGGTATGGATATTACCATCAACTTTTACAATCAAGAT
AATACATACAATGTAGGTCCAGTTAAATATGATATGGTTTGTACCACTACCCCAGGTAAT
GGTTCATTAGTTAATGTTTTACCAACTGAACCTTCATCATGGGTTTACAATGGTACATCA
ACTGTTAACGGTGTCCAAGTCTTTGGTTACAGTCAAAAGATCACTCAATATGGTCGTACT
GGTTTCTACAACTTTTACGTTGATGCCAACGGTGTTCCAGTTCAATTCTATATGGATGGT
GTCGATTATGTATTTGGTAGTCACCCAGATGTTTACGTATTAAACTTTGATATCTACACC
ACCGATATCAGCTCATACGAATCATACTTTGATATTCCAGTTCTCTGTAATAACGCAAAG
GAAGCCCCAGCTAAAGAAAACCAATTCGATGGTCTTTTCTCATCAATCGGTGATAACTTA
TTAGCCAAAGAAGAACAAGCCTCAAACTTATTCAAAGAATACAAAGCTCAATACAACAAG
GAATACTCAAGCCAAGACGAACATGATGAACGTTTCATTAACTTTAAAGCTG--------
--
Length of 5' end seq. 592
3' end seq. ID CFC321Z
3' end seq.
>CFC321Z.Seq
----------ATCAAGGTATTTGCGGTTCATGTTGGACTTTTGGTTCAACTGGTTCATTA
GAAGGTACCAACTGTGTCACCAACGGTGAATTAGTCTCCCTCTCTGAACAACAATTAGTT
GATTGTGCTATCCTTACCGGTAGTCAAGGTTGTGGTGGTGGTTTTGCATCATCTGCATTC
CAATACGTCATGGAAATTGGTAGTCTCGCCACCGAGTCCAACTATCCATACTTAATGCAA
AATGGTCTCTGCAGAGATAGAACTGTCACTCCATCAGGTGTTTCAATCACTGGTTACGTC
AATGTTACCTCTGGTAGTGAATCTGCCCTTCAAAACGCTATCGCCACCACTGGTCCAGTC
GCCATCGCCATCGATGCCTCTGTTGATGATTTCCGTTACTACATGTCTGGTGTTTACAAT
AATCCAGCCTGTAAAAATGGTTTAGATGATTTGGATCACGAAGTTTTAGCTATTGGTTAT
GGTACTTATCAAGGTCAAGATTATTTCTTAGTTAAAAACTCTTGGTCAACTAACTGGGGT
ATGGATGGTTATGTTTACATGGCTAGAAATGATAACAATTTATGTGGTGTTTCAAGTCAA
GCCACCTATCCAATTCCAACAAAGAATTAAATTTCTTCAATAAATCCAATAAATATATAT
TTTAAAC
Length of 3' end seq. 657
Connected seq. ID CFC321P
Connected seq.
>CFC321P.Seq
AGTCAATATATTAGTGTTTACAATGGTATGGATATTACCATCAACTTTTACAATCAAGAT
AATACATACAATGTAGGTCCAGTTAAATATGATATGGTTTGTACCACTACCCCAGGTAAT
GGTTCATTAGTTAATGTTTTACCAACTGAACCTTCATCATGGGTTTACAATGGTACATCA
ACTGTTAACGGTGTCCAAGTCTTTGGTTACAGTCAAAAGATCACTCAATATGGTCGTACT
GGTTTCTACAACTTTTACGTTGATGCCAACGGTGTTCCAGTTCAATTCTATATGGATGGT
GTCGATTATGTATTTGGTAGTCACCCAGATGTTTACGTATTAAACTTTGATATCTACACC
ACCGATATCAGCTCATACGAATCATACTTTGATATTCCAGTTCTCTGTAATAACGCAAAG
GAAGCCCCAGCTAAAGAAAACCAATTCGATGGTCTTTTCTCATCAATCGGTGATAACTTA
TTAGCCAAAGAAGAACAAGCCTCAAACTTATTCAAAGAATACAAAGCTCAATACAACAAG
GAATACTCAAGCCAAGACGAACATGATGAACGTTTCATTAACTTTAAAGCTG--------
--ATCAAGGTATTTGCGGTTCATGTTGGACTTTTGGTTCAACTGGTTCATTAGAAGGTAC
CAACTGTGTCACCAACGGTGAATTAGTCTCCCTCTCTGAACAACAATTAGTTGATTGTGC
TATCCTTACCGGTAGTCAAGGTTGTGGTGGTGGTTTTGCATCATCTGCATTCCAATACGT
CATGGAAATTGGTAGTCTCGCCACCGAGTCCAACTATCCATACTTAATGCAAAATGGTCT
CTGCAGAGATAGAACTGTCACTCCATCAGGTGTTTCAATCACTGGTTACGTCAATGTTAC
CTCTGGTAGTGAATCTGCCCTTCAAAACGCTATCGCCACCACTGGTCCAGTCGCCATCGC
CATCGATGCCTCTGTTGATGATTTCCGTTACTACATGTCTGGTGTTTACAATAATCCAGC
CTGTAAAAATGGTTTAGATGATTTGGATCACGAAGTTTTAGCTATTGGTTATGGTACTTA
TCAAGGTCAAGATTATTTCTTAGTTAAAAACTCTTGGTCAACTAACTGGGGTATGGATGG
TTATGTTTACATGGCTAGAAATGATAACAATTTATGTGGTGTTTCAAGTCAAGCCACCTA
TCCAATTCCAACAAAGAATTAAATTTCTTCAATAAATCCAATAAATATATATTTTAAAC
Length of connected seq. 1249
Full length Seq ID -
Full length Seq. -
Length of full length seq. -