SFI542
Library SF
(Link to library)
Clone ID SFI542
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16593-1
Original site URL
Representative seq. ID
(Link to Original site)
Representative DNA sequence
>SFI542 (SFI542Q) /CSM/SF/SFI5-B/SFI542Q.Seq.d/
XXXXXXXXXXCTGGTTCATTAGAAGGTACCAACTGTGTCACCAACGGTGAATTAGTCTCC
CTCTCTGAACAACAATTAGTTGATTGTGCTATCCTTACCGGTAGTCAAGGTTGTGGTGGT
GGTTTTGCATCATCTGCATTCCAATACGTCATGGAAATTGGTAGTCTCGCCACCGAGTCC
AACTATCCATACTTAATGCAAAATGGTCTCTGCAGAGATAGAACTGTCACTCCATCAGGT
GTTTCAATCACTGGTTACGTCAATGTTACCTCTGGTAGTGAATCTGCCCTTCAAAACGCT
ATCGCCACCACTGGTCCAGTCGCCATCGCCATCGATGCCTCTGTTGATGATTTCCGTTAC
TACATGTCTGGTGTTTACAATAATCCAGCCTGTAAAAATGGTTTAGATGATTTGGATCAC
GAAGTTTTAGCTATTGGTTATGGTACTTATCAAGGTCAAGATTATTTCTTAGTTAAAAAC
TCTTGGTCAACTAACTGGGGTATGGATGGTCTATGNTTACATGGCT
sequence update 2001.11.22
Translated Amino Acid sequence
---GSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESN
YPYLMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYY
MSGVYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGLXLHG


Translated Amino Acid sequence (All Frames)
Frame A:
---wfirryqlchqr*islpl*ttis*lcypyr*srlwwwfciicipirhgnw*srhrvq
lsilnakwslqr*nchsircfnhwlrqcylw**icpskryrhhwssrhrhrclc**fpll
hvwclq*ssl*kwfr*fgsrsfsywlwylsrsrlfls*kllvn*lgygwsmxtw


Frame B:
---GSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESN
YPYLMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYY
MSGVYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGLXLHG


Frame C:
---vh*kvptvsptvn*spslnnn*livlslpvvkvvvvvlhhlhsntswklvvsppspt
iht*ckmvsaeielslhqvfqslvtsmlplvvnlpfktlspplvqspspsmpllmisvtt
clvftiiqpvkmv*miwitkf*llvmvlikvkiis*lktlgqltgvwmvyxyma


Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

SFI542 (SFI542Q) /CSM/SF/SFI5-B/SFI542Q.Seq.d/ 1017 0.0
VFO640 (VFO640Q) /CSM/VF/VFO6-B/VFO640Q.Seq.d/ 1001 0.0
VFO148 (VFO148Q) /CSM/VF/VFO1-B/VFO148Q.Seq.d/ 1001 0.0
VFM638 (VFM638Q) /CSM/VF/VFM6-B/VFM638Q.Seq.d/ 1001 0.0
VFI581 (VFI581Q) /CSM/VF/VFI5-D/VFI581Q.Seq.d/ 1001 0.0
VFH711 (VFH711Q) /CSM/VF/VFH7-A/VFH711Q.Seq.d/ 1001 0.0
VFG444 (VFG444Q) /CSM/VF/VFG4-B/VFG444Q.Seq.d/ 1001 0.0
VFB191 (VFB191Q) /CSM/VF/VFB1-D/VFB191Q.Seq.d/ 1001 0.0
VFB166 (VFB166Q) /CSM/VF/VFB1-C/VFB166Q.Seq.d/ 1001 0.0
SFL481 (SFL481Q) /CSM/SF/SFL4-D/SFL481Q.Seq.d/ 1001 0.0

own update 2001.11.27
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

AZ674505|AZ674505.1 ENTIZ88TF Entamoeba histolytica Sheared DNA Entamoeba histolyticagenomic, genomic survey sequence. 54 0.002 1
M27307|M27307.1 Entamoeba histolytica cysteine protease gene, partial cds. 54 0.002 1
X87214|X87214.1 E.histolytica mRNA for cysteine proteinase. 54 0.002 1
X87213|X87213.1 E.dispar mRNA for cysteine proteinase. 54 0.002 1
AZ547119|AZ547119.1 ENTFS26TR Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 54 0.002 1
S58669|S58669.1 Entamoeba histolytica cysteine proteinase precursor (ACP1) gene, partial cds. 54 0.002 1
AF326781|AF326781.1 Triticum monococcum actin (ACT-1) gene, partial cds; putative chromosome condensation factor (CCF), putative resistance protein (RGA-2), putative resistance protein (RGA2) and putative nodulin-like-like protein (NLL) gene, complete cds; and retrotransposons Josephine, Angela-2, Angela-4, Heidi, Greti, Angela-3, Fatima, Erika-1, Angela-6, Angela-5, Barbara, Isabelle, Erika-2, and Claudia. 40 0.051 2
AY188332|AY188332.1 Triticum monococcum chromosome 5AL clone BAC 609E6, *** SEQUENCING IN PROGRESS ***, 2 ordered pieces. 40 0.092 2
AY063763|AY063763.1 Plasmodium berghei berghepain-2 mRNA, complete cds. 48 0.12 1
BM162482|BM162482.1 EST565005 PyBS Plasmodium yoelii yoelii cDNA clone PYCKT14 5' end, mRNA sequence. 48 0.12 1
dna update 2004. 1. 2
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

BC075887_1(BC075887|pid:none) Danio rerio cathepsin L.1, mRNA (c... 177 1e-43
AY220615_1(AY220615|pid:none) Hydra vulgaris cathepsin L precurs... 167 2e-40
DQ474246_1(DQ474246|pid:none) Lygus lineolaris cathepsin-L mRNA,... 166 2e-40
AF194426_1(AF194426|pid:none) Myxine glutinosa clone hicl20 cyst... 163 3e-39
DQ280314_1(DQ280314|pid:none) Hymeniacidon perlevis cathepsin L ... 163 3e-39
AF147207_1(AF147207|pid:none) Artemia franciscana cathepsin L-li... 163 3e-39
AY363263_1(AY363263|pid:none) Triatoma infestans cathepsin L-lik... 163 3e-39
AY795054_1(AY795054|pid:none) Artemia franciscana cathepsin L pr... 163 3e-39
AF542132_1(AF542132|pid:none) Theromyzon tessulatum cathepsin L ... 162 3e-39
EF070511_1(EF070511|pid:none) Maconellicoccus hirsutus clone WHM... 162 5e-39
protein update 2009. 5.26
PSORT

psg: 0.75 gvh: 0.59 alm: 0.42 top: 0.53 tms: 0.00 mit: 0.15 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.50
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

44.0 %: cytoplasmic
28.0 %: nuclear
12.0 %: Golgi
8.0 %: mitochondrial
4.0 %: cytoskeletal
4.0 %: vesicles of secretory system

>> prediction for SFI542 is cyt

5' end seq. ID -
5' end seq. -
Length of 5' end seq. -
3' end seq. ID SFI542Z
3' end seq.
>SFI542Z.Seq
NNNNNNNNNNCTGGTTCATTAGAAGGTACCAACTGTGTCACCAACGGTGAATTAGTCTCC
CTCTCTGAACAACAATTAGTTGATTGTGCTATCCTTACCGGTAGTCAAGGTTGTGGTGGT
GGTTTTGCATCATCTGCATTCCAATACGTCATGGAAATTGGTAGTCTCGCCACCGAGTCC
AACTATCCATACTTAATGCAAAATGGTCTCTGCAGAGATAGAACTGTCACTCCATCAGGT
GTTTCAATCACTGGTTACGTCAATGTTACCTCTGGTAGTGAATCTGCCCTTCAAAACGCT
ATCGCCACCACTGGTCCAGTCGCCATCGCCATCGATGCCTCTGTTGATGATTTCCGTTAC
TACATGTCTGGTGTTTACAATAATCCAGCCTGTAAAAATGGTTTAGATGATTTGGATCAC
GAAGTTTTAGCTATTGGTTATGGTACTTATCAAGGTCAAGATTATTTCTTAGTTAAAAAC
TCTTGGTCAACTAACTGGGGTATGGATGGTCTATGNTTACATGGCT
Length of 3' end seq. 526
Connected seq. ID -
Connected seq. -
Length of connected seq. -
Full length Seq ID -
Full length Seq. -
Length of full length seq. -