SFL106
Library SF
Clone ID SFL106
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16593-1
Original site URL
Representative seq. ID SFL106Z
Representative DNA sequence
>SFL106 (SFL106Q) /CSM/SF/SFL1-A/SFL106Q.Seq.d/
XXXXXXXXXXGGTTCATTAGAAGGTACCAACTGTGTCACCAACGGTGAATTAGTCTCCCT
CTCTGAACAACAATTAGTTGATTGTGCTATCCTTACCGGTAGTCAAGGTTGTGGTGGTGG
TTTTGCATCATCTGCATTCCAATACGTCATGGAAATTGGTAGTCTCGCCACCGAGTCCAA
CTATCCATACTTAATGCAAAATGGTCTCTGCAGAGATAGAACTGTCACTCCATCAGGTGT
TTCAATCACTGGTTACGTCAATGTTACCTCTGGTAGTGAATCTGCCCTTCAAAACGCTAT
CGCCACCACTGGTCCAGTCGCCATCGCCATCGATGCCTCTGTTGATGATTTCCGTTACTA
CATGTCTGGTGTTTACAATAATCCAGCCTGTAAAAATGGTTTAGATGATTTGGATCACGA
AGTTTTAGCTATTGGTTATGGTACTTATCAAGGTCAAGATTATTTCTTAGTTAAAAACTC
TTGGTCAACTAACTGGGGTATGGATGGTTATGTT
sequence update 2001. 6. 1
Translated Amino Acid sequence
---SLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNY
PYLMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYM
SGVYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGYV


Translated Amino Acid sequence (All Frames)
Frame A:
---vh*kvptvsptvn*spslnnn*livlslpvvkvvvvvlhhlhsntswklvvsppspt
iht*ckmvsaeielslhqvfqslvtsmlplvvnlpfktlspplvqspspsmpllmisvtt
clvftiiqpvkmv*miwitkf*llvmvlikvkiis*lktlgqltgvwmvm


Frame B:
---firryqlchqr*islpl*ttis*lcypyr*srlwwwfciicipirhgnw*srhrvql
silnakwslqr*nchsircfnhwlrqcylw**icpskryrhhwssrhrhrclc**fpllh
vwclq*ssl*kwfr*fgsrsfsywlwylsrsrlfls*kllvn*lgygwlc


Frame C:
---SLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNY
PYLMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYM
SGVYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGYV
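
The frame listings above can be reproduced approximately with a short script. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and assuming that any codon containing a masked or unknown base is shown as "-". The function names `translate` and `three_frames` are hypothetical, and the sketch does not assert which reading offset the record labels as Frame A, B, or C.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, offset=0):
    """Translate dna from the given offset; unknown codons become '-' (assumption)."""
    dna = dna.upper()
    protein = []
    for pos in range(offset, len(dna) - 2, 3):
        protein.append(CODON_TABLE.get(dna[pos:pos + 3], "-"))
    return "".join(protein)


def three_frames(dna):
    """Return the forward translations at offsets 0, 1 and 2."""
    return [translate(dna, offset) for offset in range(3)]


# First 27 unmasked bases of the representative sequence in this record.
fragment = "GGTTCATTAGAAGGTACCAACTGTGTC"
for offset, protein in enumerate(three_frames(fragment)):
    print(offset, protein)
```

On this fragment, the three offsets give residues that match the beginnings of the three frame listings (for example, `SLEGTNCV` appears at offset 0 after a leading `G`).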


Homology vs CSM-cDNA

Sequences producing significant alignments:  Score (bits)  E Value

VFO640 (VFO640Q) /CSM/VF/VFO6-B/VFO640Q.Seq.d/ 999 0.0
VFO148 (VFO148Q) /CSM/VF/VFO1-B/VFO148Q.Seq.d/ 999 0.0
VFM638 (VFM638Q) /CSM/VF/VFM6-B/VFM638Q.Seq.d/ 999 0.0
VFI581 (VFI581Q) /CSM/VF/VFI5-D/VFI581Q.Seq.d/ 999 0.0
VFH711 (VFH711Q) /CSM/VF/VFH7-A/VFH711Q.Seq.d/ 999 0.0
VFG444 (VFG444Q) /CSM/VF/VFG4-B/VFG444Q.Seq.d/ 999 0.0
VFB191 (VFB191Q) /CSM/VF/VFB1-D/VFB191Q.Seq.d/ 999 0.0
VFB166 (VFB166Q) /CSM/VF/VFB1-C/VFB166Q.Seq.d/ 999 0.0
SFL481 (SFL481Q) /CSM/SF/SFL4-D/SFL481Q.Seq.d/ 999 0.0
SFL106 (SFL106Q) /CSM/SF/SFL1-A/SFL106Q.Seq.d/ 999 0.0

own update 2001.11.27
Homology vs DNA

Sequences producing significant alignments:  Score (bits)  E Value  N

AZ674505|AZ674505.1 ENTIZ88TF Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, genomic survey sequence. 54 0.002 1
M27307|M27307.1 Entamoeba histolytica cysteine protease gene, partial cds. 54 0.002 1
X87214|X87214.1 E.histolytica mRNA for cysteine proteinase. 54 0.002 1
X87213|X87213.1 E.dispar mRNA for cysteine proteinase. 54 0.002 1
AZ547119|AZ547119.1 ENTFS26TR Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 54 0.002 1
S58669|S58669.1 Entamoeba histolytica cysteine proteinase precursor (ACP1) gene, partial cds. 54 0.002 1
AF326781|AF326781.1 Triticum monococcum actin (ACT-1) gene, partial cds; putative chromosome condensation factor (CCF), putative resistance protein (RGA-2), putative resistance protein (RGA2) and putative nodulin-like-like protein (NLL) gene, complete cds; and retrotransposons Josephine, Angela-2, Angela-4, Heidi, Greti, Angela-3, Fatima, Erika-1, Angela-6, Angela-5, Barbara, Isabelle, Erika-2, and Claudia. 40 0.049 2
X91645|X91645.1 E.histolytica DNA encoding for cysteine proteinase (1159 bp). 32 0.066 4
AY156067|AY156067.1 Entamoeba histolytica cysteine protease 9 (CP9) gene, complete cds. 38 0.077 3
AY188332|AY188332.1 Triticum monococcum chromosome 5AL clone BAC 609E6, *** SEQUENCING IN PROGRESS ***, 2 ordered pieces. 40 0.089 2
dna update 2004. 2. 4
Homology vs Protein

Sequences producing significant alignments:  Score (bits)  E Value

BC075887_1(BC075887|pid:none) Danio rerio cathepsin L.1, mRNA (c... 180 2e-44
AY220615_1(AY220615|pid:none) Hydra vulgaris cathepsin L precurs... 171 1e-41
DQ474246_1(DQ474246|pid:none) Lygus lineolaris cathepsin-L mRNA,... 170 2e-41
AF194426_1(AF194426|pid:none) Myxine glutinosa clone hicl20 cyst... 167 2e-40
DQ280314_1(DQ280314|pid:none) Hymeniacidon perlevis cathepsin L ... 167 2e-40
AF147207_1(AF147207|pid:none) Artemia franciscana cathepsin L-li... 167 2e-40
AY363263_1(AY363263|pid:none) Triatoma infestans cathepsin L-lik... 167 2e-40
AY795054_1(AY795054|pid:none) Artemia franciscana cathepsin L pr... 167 2e-40
AF542132_1(AF542132|pid:none) Theromyzon tessulatum cathepsin L ... 166 2e-40
EF070511_1(EF070511|pid:none) Maconellicoccus hirsutus clone WHM... 166 3e-40
protein update 2009. 5.28
PSORT

psg: 0.75 gvh: 0.59 alm: 0.42 top: 0.53 tms: 0.00 mit: 0.16 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

56.0 %: cytoplasmic
28.0 %: nuclear
8.0 %: mitochondrial
4.0 %: cytoskeletal
4.0 %: Golgi

>> prediction for SFL106 is cyt

5' end seq. ID -
5' end seq. -
Length of 5' end seq. -
3' end seq. ID SFL106Z
3' end seq.
>SFL106Z.Seq
----------GGTTCATTAGAAGGTACCAACTGTGTCACCAACGGTGAATTAGTCTCCCT
CTCTGAACAACAATTAGTTGATTGTGCTATCCTTACCGGTAGTCAAGGTTGTGGTGGTGG
TTTTGCATCATCTGCATTCCAATACGTCATGGAAATTGGTAGTCTCGCCACCGAGTCCAA
CTATCCATACTTAATGCAAAATGGTCTCTGCAGAGATAGAACTGTCACTCCATCAGGTGT
TTCAATCACTGGTTACGTCAATGTTACCTCTGGTAGTGAATCTGCCCTTCAAAACGCTAT
CGCCACCACTGGTCCAGTCGCCATCGCCATCGATGCCTCTGTTGATGATTTCCGTTACTA
CATGTCTGGTGTTTACAATAATCCAGCCTGTAAAAATGGTTTAGATGATTTGGATCACGA
AGTTTTAGCTATTGGTTATGGTACTTATCAAGGTCAAGATTATTTCTTAGTTAAAAACTC
TTGGTCAACTAACTGGGGTATGGATGGTTATGTT
Length of 3' end seq. 504
Connected seq. ID -
Connected seq. -
Length of connected seq. -
Full length Seq ID -
Full length Seq. -
Length of full length seq. -
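
The 3' end sequence in this record spans 514 characters, of which the first ten are masked ("-"), so the stated length of 504 appears to count only unmasked bases. A minimal sketch of that count follows; the function name `unmasked_length` and the choice of "-" and "X" as mask characters are assumptions for illustration.

```python
def unmasked_length(seq, mask_chars="-X"):
    """Count sequence characters, skipping whitespace and masked positions."""
    return sum(
        1
        for ch in seq.upper()
        if not ch.isspace() and ch not in mask_chars
    )


# Start of the 3' end sequence: ten masked positions, then six bases.
print(unmasked_length("----------GGTTCA"))
```

Note that treating "X" as a mask character would also skip a genuine "X" in a protein sequence, so this sketch is intended for nucleotide input only.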