SFD417
Library SF
(Link to library)
Clone ID SFD417
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U12939-1|Contig-U16593-1
Original site URL
Representative seq. ID SFD417P
(Link to Original site)
Representative DNA sequence
>SFD417 (SFD417Q) /CSM/SF/SFD4-A/SFD417Q.Seq.d/
ACTGGAAAAATTTCACAACCAGCAAAAAATAAAAATAAAAAAAAAAATAAAAAAATAAAA
NGTTGAAGGAATTTTCACAAAGTCTTTAAAAAAATTAAAAAANGAAAAAAAAAAXXXXXX
XXXXGTCCACGACGACGAATCACTCAGATCAATTCCATCAACTGTCGATTGGAGAAATCA
AAACTGTGTTACCCCAGTCAAAGATCAAGGTATTTGCGGTTCATGTTGGACTTTTGGTTC
AACTGGTTCATTAGAAGGTACCAACTGTGTCACCAACGGTGAATTAGTCTCCCTCTCTGA
ACAACAATTAGTTGATTGTGCTATCCTTACCGGTAGTCAAGGTTGTGGTGGCGGTTTTGC
ATCATCTGCATTCCAATACGTCATGGAAATTGGTAGTCTCGCCACCGAGTCCAACTATCC
ATACTTAATGCAAAATGGTCTCTGCAGAGATAGAACTGTCACTCCATCAGGTGTTTCAAT
CACTGGTTACGTCAATGTTACCTCTGGTAGTGAATCTGCCCTTCAAAACGCTATCGCCAC
CACTGGTCCAGTCGCCATCGCCATCGATGCCTCTGTTGATGATTTCCGTTACTACATGTC
TGGTGTTTACAATAATCCAGCCTGTAAAAATGGTTTAGATGATTTGGATCACGAAGTTTT
AGCTATTGGTTATGGTACTTATCAAGGTCAAGATTANTTCTTAGTTANNAACTCTTGGTC
AACTAACTGGGGTATGGACGGTTATGTTTACATGGCTAGAAATNATAACAATTTATGTGG
TGTTTCAAGTCAAGCCACCTATCCAATTCCAACAAAGAATTAAATTTCNTCAATAAATCC
sequence update 2001. 6. 1
Translated Amino Acid sequence
lekfhnqqkikikkkikk*XVEGIFTKSLKKLKXEKK---

---VHDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSL
SEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYPYLMQNGLCRDRTVTPSGV
SITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSGVYNNPACKNGLDDLDHE
VLAIGYGTYQGQDXFLVXNSWSTNWGMDGYVYMARNXNNLCGVSSQATYPIPTKN*issi
n


Translated Amino Acid sequence (All Frames)
Frame A:
tgkisqpaknknkkknkkikx*rnfhkvfkkikkxkkk---

---VHDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSL
SEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYPYLMQNGLCRDRTVTPSGV
SITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSGVYNNPACKNGLDDLDHE
VLAIGYGTYQGQDXFLVXNSWSTNWGMDGYVYMARNXNNLCGVSSQATYPIPTKN*issi
n

Frame B:
lekfhnqqkikikkkikk*XVEGIFTKSLKKLKXEKK---

---stttnhsdqfhqlsigeiktvlpqskikvfavhvgllvqlvh*kvptvsptvn*sps
lnnn*livlslpvvkvvvavlhhlhsntswklvvsppsptiht*ckmvsaeielslhqvf
qslvtsmlplvvnlpfktlspplvqspspsmpllmisvttclvftiiqpvkmv*miwitk
f*llvmvlikvkixs*lxtlgqltgvwtvmftwlexitiyvvfqvkppiqfqqrikfxq*
i

Frame C:
wknfttskk*k*kkk*knkxlkefsqsl*kn*kxkkk---

---prrritqinsincrleksklcypsqrsrylrfmldfwfnwfirryqlchqr*islpl
*ttis*lcypyr*srlwwrfciicipirhgnw*srhrvqlsilnakwslqr*nchsircf
nhwlrqcylw**icpskryrhhwssrhrhrclc**fpllhvwclq*ssl*kwfr*fgsrs
fsywlwylsrsrlxlsxxllvn*lgygrlclhg*kx*qfmwcfksshlsnsnkelnfxnk
s

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

SFD417 (SFD417Q) /CSM/SF/SFD4-A/SFD417Q.Seq.d/ 1390 0.0
VFI581 (VFI581Q) /CSM/VF/VFI5-D/VFI581Q.Seq.d/ 1370 0.0
VFO640 (VFO640Q) /CSM/VF/VFO6-B/VFO640Q.Seq.d/ 1350 0.0
CFG118 (CFG118Q) /CSM/CF/CFG1-A/CFG118Q.Seq.d/ 1350 0.0
SFI335 (SFI335Q) /CSM/SF/SFI3-B/SFI335Q.Seq.d/ 1336 0.0
SFI435 (SFI435Q) /CSM/SF/SFI4-B/SFI435Q.Seq.d/ 1334 0.0
AFJ360 (AFJ360Q) /CSM/AF/AFJ3-C/AFJ360Q.Seq.d/ 1320 0.0
SFL481 (SFL481Q) /CSM/SF/SFL4-D/SFL481Q.Seq.d/ 1312 0.0
VFO148 (VFO148Q) /CSM/VF/VFO1-B/VFO148Q.Seq.d/ 1277 0.0
SFE845 (SFE845Q) /CSM/SF/SFE8-B/SFE845Q.Seq.d/ 1265 0.0

own update 2001.11.26
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

S58669|S58669.1 Entamoeba histolytica cysteine proteinase precursor (ACP1) gene, partial cds. 54 4e-06 2
AZ547119|AZ547119.1 ENTFS26TR Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 54 4e-06 2
AZ674505|AZ674505.1 ENTIZ88TF Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 54 4e-06 2
X87214|X87214.1 E.histolytica mRNA for cysteine proteinase. 54 6e-06 2
CD463608|CD463608.1 ETH1_45_A02.g1_A002 Ethylene-treated seedlings Sorghum bicolor cDNA clone ETH1_45_A02_A002 5', mRNA sequence. 42 9e-05 3
L36205|L36205.1 Dictyostelium discoideum cysteine proteinase CP5 mRNA, complete cds. 42 2e-04 3
X87213|X87213.1 E.dispar mRNA for cysteine proteinase. 54 2e-04 2
BM401017|BM401017.1 5009-0-81-E05.t.1 Chilcoat/Turkewitz cDNA (large fraction) Tetrahymena thermophila cDNA, mRNA sequence. 54 0.003 1
M27307|M27307.1 Entamoeba histolytica cysteine protease gene, partial cds. 54 0.003 1
BQ834906|BQ834906.1 Po_ad_04H08_TEXF1 Psoroptes ovis mixed Psoroptes ovis cDNA clone Po_ad_04H08 5' similar to O46030 CYSTEINE PROTEINASE. Sitophilus zeamais (maize weevil), mRNA sequence. 48 0.010 2
dna update 2003.12.22
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

BC075887_1(BC075887|pid:none) Danio rerio cathepsin L.1, mRNA (c... 242 1e-62
AY220615_1(AY220615|pid:none) Hydra vulgaris cathepsin L precurs... 239 8e-62
AF194426_1(AF194426|pid:none) Myxine glutinosa clone hicl20 cyst... 238 2e-61
D82884_1(D82884|pid:none) Sitophilus zeamais mRNA for cysteine p... 236 7e-61
AY363263_1(AY363263|pid:none) Triatoma infestans cathepsin L-lik... 235 1e-60
DQ459303_1(DQ459303|pid:none) Aedes aegypti cathepsin L (CAT-L1)... 234 2e-60
AY336798_1(AY336798|pid:none) Rhipicephalus haemaphysaloides hae... 234 3e-60
AY795054_1(AY795054|pid:none) Artemia franciscana cathepsin L pr... 233 4e-60
AF147207_1(AF147207|pid:none) Artemia franciscana cathepsin L-li... 232 1e-59
AY336797_1(AY336797|pid:none) Rhipicephalus haemaphysaloides hae... 232 1e-59
protein update 2009. 5.23
PSORT

psg: 0.42 gvh: 0.56 alm: 0.42 top: 0.53 tms: 0.00 mit: 0.23 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.20 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

60.0 %: cytoplasmic
28.0 %: nuclear
4.0 %: extracellular, including cell wall
4.0 %: vacuolar
4.0 %: peroxisomal

>> prediction for SFD417 is cyt

5' end seq. ID SFD417F
5' end seq.
>SFD417F.Seq
ACTGGAAAAATTTCACAACCAGCAAAAAATAAAAATAAAAAAAAAAATAAAAAAATAAAA
NGTTGAAGGAATTTTCACAAAGTCTTTAAAAAAATTAAAAAANGAAAAAAAAAA------
----
Length of 5' end seq. 114
3' end seq. ID SFD417Z
3' end seq.
>SFD417Z.Seq
----------GTCCACGACGACGAATCACTCAGATCAATTCCATCAACTGTCGATTGGAG
AAATCAAAACTGTGTTACCCCAGTCAAAGATCAAGGTATTTGCGGTTCATGTTGGACTTT
TGGTTCAACTGGTTCATTAGAAGGTACCAACTGTGTCACCAACGGTGAATTAGTCTCCCT
CTCTGAACAACAATTAGTTGATTGTGCTATCCTTACCGGTAGTCAAGGTTGTGGTGGCGG
TTTTGCATCATCTGCATTCCAATACGTCATGGAAATTGGTAGTCTCGCCACCGAGTCCAA
CTATCCATACTTAATGCAAAATGGTCTCTGCAGAGATAGAACTGTCACTCCATCAGGTGT
TTCAATCACTGGTTACGTCAATGTTACCTCTGGTAGTGAATCTGCCCTTCAAAACGCTAT
CGCCACCACTGGTCCAGTCGCCATCGCCATCGATGCCTCTGTTGATGATTTCCGTTACTA
CATGTCTGGTGTTTACAATAATCCAGCCTGTAAAAATGGTTTAGATGATTTGGATCACGA
AGTTTTAGCTATTGGTTATGGTACTTATCAAGGTCAAGATTANTTCTTAGTTANNAACTC
TTGGTCAACTAACTGGGGTATGGACGGTTATGTTTACATGGCTAGAAATNATAACAATTT
ATGTGGTGTTTCAAGTCAAGCCACCTATCCAATTCCAACAAAGAATTAAATTTCNTCAAT
AAATCC
Length of 3' end seq. 716
Connected seq. ID SFD417P
Connected seq.
>SFD417P.Seq
ACTGGAAAAATTTCACAACCAGCAAAAAATAAAAATAAAAAAAAAAATAAAAAAATAAAA
NGTTGAAGGAATTTTCACAAAGTCTTTAAAAAAATTAAAAAANGAAAAAAAAAA------
----GTCCACGACGACGAATCACTCAGATCAATTCCATCAACTGTCGATTGGAGAAATCA
AAACTGTGTTACCCCAGTCAAAGATCAAGGTATTTGCGGTTCATGTTGGACTTTTGGTTC
AACTGGTTCATTAGAAGGTACCAACTGTGTCACCAACGGTGAATTAGTCTCCCTCTCTGA
ACAACAATTAGTTGATTGTGCTATCCTTACCGGTAGTCAAGGTTGTGGTGGCGGTTTTGC
ATCATCTGCATTCCAATACGTCATGGAAATTGGTAGTCTCGCCACCGAGTCCAACTATCC
ATACTTAATGCAAAATGGTCTCTGCAGAGATAGAACTGTCACTCCATCAGGTGTTTCAAT
CACTGGTTACGTCAATGTTACCTCTGGTAGTGAATCTGCCCTTCAAAACGCTATCGCCAC
CACTGGTCCAGTCGCCATCGCCATCGATGCCTCTGTTGATGATTTCCGTTACTACATGTC
TGGTGTTTACAATAATCCAGCCTGTAAAAATGGTTTAGATGATTTGGATCACGAAGTTTT
AGCTATTGGTTATGGTACTTATCAAGGTCAAGATTANTTCTTAGTTANNAACTCTTGGTC
AACTAACTGGGGTATGGACGGTTATGTTTACATGGCTAGAAATNATAACAATTTATGTGG
TGTTTCAAGTCAAGCCACCTATCCAATTCCAACAAAGAATTAAATTTCNTCAATAAATCC
Length of connected seq. 830
Full length Seq ID -
Full length Seq. -
Length of full length seq. -