VSC107
Library VS
(Link to library)
Clone ID VSC107
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U13202-1
Original site URL
Representative seq. ID VSC107Z
(Link to Original site)
Representative DNA sequence
>VSC107 (VSC107Q) /CSM/VS/VSC1-A/VSC107Q.Seq.d/
XXXXXXXXXXTTTTATGGATATGGTTACTTGTGATGAAACTGATAACGGTTGTGAAGGTG
GTGATGCCTTCTCTGCATGGAATTGGTTAAGAAAGCAAGGTGCTGTATCAGAAGAATGTC
TTCCATATACAATTCCAACTTGTCCACCAGCTCAACAACCATGTTTAAATTTTGTCAACA
CTCCATCATGTACTAAAGAGTGTCAATCAAATTCCTCTTTAATTTATTCTCAAGACAAAC
ATAAAATGGCTAAAATTTATTCTTTCGACTCTGATGAAGCAATCATGCAAGAAATTGTTA
CTAATGGTCCAGTCGAAGCTTGTTTCACTGTCTTTGAAGATTTCCTTGCTTACAAATCTG
GTGTTTACGTCCACACAACTGGTAAAGATTTAGGTGGTCACTGTGTTAAACTCGTTGGTT
TCGGTACCTTAAATGGTGTTGATTACTATGCCGCTAACAACCAATGGACAACTTCATGGG
GTGATAATGGAACTTTCTTAATCAAACGTGGTGATTGCGGTATCTCTGATGACGTTGTTG
CTGGTTTACCATAAATAAAAATAAGTTTTAAACATTTTGTAAAA
sequence update 2000. 9.22
Translated Amino Acid sequence
---FMDMVTCDETDNGCEGGDAFSAWNWLRKQGAVSEECLPYTIPTCPPAQQPCLNFVNT
PSCTKECQSNSSLIYSQDKHKMAKIYSFDSDEAIMQEIVTNGPVEACFTVFEDFLAYKSG
VYVHTTGKDLGGHCVKLVGFGTLNGVDYYAANNQWTTSWGDNGTFLIKRGDCGISDDVVA
GLP*ikisfkhfvk


Translated Amino Acid sequence (All Frames)
Frame A:
---FMDMVTCDETDNGCEGGDAFSAWNWLRKQGAVSEECLPYTIPTCPPAQQPCLNFVNT
PSCTKECQSNSSLIYSQDKHKMAKIYSFDSDEAIMQEIVTNGPVEACFTVFEDFLAYKSG
VYVHTTGKDLGGHCVKLVGFGTLNGVDYYAANNQWTTSWGDNGTFLIKRGDCGISDDVVA
GLP*ikisfkhfvk


Frame B:
---lwiwllvmklitvvkvvmpslhgig*eskvlyqknvfhiqfqlvhqlnnhv*ilstl
hhvlksvnqipl*filktnikwlkfilstlmkqsckklllmvqsklvslslkislltnlv
ftstqlvki*vvtvlnslvsvp*mvlitmplttngqlhgvimels*snvviavslmtlll
vyhk*k*vlnil*


Frame C:
---ygygyl**n**rl*rw*cllcmelvkkarccirrmssiynsnlstssttmfkfcqhs
imy*rvsikflfnlfsrqt*ng*nlffrl**snharncy*wssrslfhcl*rfpclqiwc
lrphnw*rfrwslc*trwfrylkwc*llcr*qpmdnfmg**wnflnqtw*lryl**rccw
ftinknkf*tfck


Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VSC107 (VSC107Q) /CSM/VS/VSC1-A/VSC107Q.Seq.d/ 1138 0.0
SSL106 (SSL106Q) /CSM/SS/SSL1-A/SSL106Q.Seq.d/ 1134 0.0
SSK863 (SSK863Q) /CSM/SS/SSK8-C/SSK863Q.Seq.d/ 1134 0.0
SSK787 (SSK787Q) /CSM/SS/SSK7-D/SSK787Q.Seq.d/ 1134 0.0
SSK654 (SSK654Q) /CSM/SS/SSK6-C/SSK654Q.Seq.d/ 1134 0.0
SSE709 (SSE709Q) /CSM/SS/SSE7-A/SSE709Q.Seq.d/ 1134 0.0
SLH613 (SLH613Q) /CSM/SL/SLH6-A/SLH613Q.Seq.d/ 1134 0.0
SSK354 (SSK354Q) /CSM/SS/SSK3-C/SSK354Q.Seq.d/ 1132 0.0
VSB170 (VSB170Q) /CSM/VS/VSB1-C/VSB170Q.Seq.d/ 1128 0.0
SSL776 (SSL776Q) /CSM/SS/SSL7-D/SSL776Q.Seq.d/ 1128 0.0

own update 2002. 8. 8
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

BX248131|BX248131.3 Zebrafish DNA sequence *** SEQUENCING IN PROGRESS *** from clone CH211-236P5. 50 0.031 1
AL022310|AL022310.1 Human DNA sequence from clone 395P12 on chromosome 1q24-25. Contains the TXGP1 gene for tax-transcriptionally activated glycoprotein 1 (34kD) (OX40 ligand, OX40L) and a GOT2 (Aspartate Aminotransferase, mitochondrial precursor, EC 2.6.1.1, Transaminase A, Glutamate Oxaloacetate Transaminase-2) pseudogene. Contains ESTs, STSs and GSSs. 48 0.12 1
AL677696|AL677696.1 Xenopus tropicalis EST, clone TNeu057j18 5'. 48 0.12 1
CF264541|CF264541.1 AGENCOURT_15137220 NICHD_XGC_Emb7 Silurana tropicalis cDNA clone IMAGE:6976704 5', mRNA sequence. 48 0.12 1
AX346142|AX346142.1 Sequence 1213 from Patent WO0200928. 44 0.33 2
AC124778|AC124778.2 Mus musculus chromosome UNK clone RP23-92N3, WORKING DRAFT SEQUENCE, 4 unordered pieces. 46 0.48 1
AC116824|AC116824.4 Mus musculus clone RP24-366J7, WORKING DRAFT SEQUENCE, 7 ordered pieces. 46 0.48 1
AZ448597|AZ448597.1 1M0246E23F Mouse 10kb plasmid UUGC1M library Mus musculus genomic clone UUGC1M0246E23 F, DNA sequence. 46 0.48 1
AC142173|AC142173.2 Rattus norvegicus clone CH230-135C19, WORKING DRAFT SEQUENCE, 64 unordered pieces. 46 0.48 1
AC128755|AC128755.3 Rattus norvegicus clone CH230-183F18, *** SEQUENCING IN PROGRESS ***, 9 unordered pieces. 46 0.48 1
dna update 2003. 8.16
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(Q54QD9) RecName: Full=Cathepsin B; EC=3.4.22.1; AltNam... 389 e-107
BC072490_1(BC072490|pid:none) Rattus norvegicus cathepsin B, mRN... 171 2e-41
(P00787) RecName: Full=Cathepsin B; EC=3.4.22.1; AltNam... 171 2e-41
M11305_1(M11305|pid:none) Rat cathepsin B mRNA, 3' end. 171 2e-41
EU532428_1(EU532428|pid:none) Sus scrofa cathepsin B (CTSB) gene... 170 2e-41
(A1E295) RecName: Full=Cathepsin B; EC=3.4.22.1; Contai... 170 2e-41
AC149038_4(AC149038|pid:none) Medicago truncatula chromosome 7 c... 169 7e-41
BC115254_1(BC115254|pid:none) Danio rerio capthepsin B, b, mRNA ... 168 9e-41
(P07688) RecName: Full=Cathepsin B; EC=3.4.22.1; AltNam... 167 2e-40
AY888604_1(AY888604|pid:none) Synthetic construct Homo sapiens c... 166 6e-40
protein update 2009. 7.28
PSORT

psg: 0.75 gvh: 0.42 alm: 0.46 top: 0.53 tms: 0.00 mit: 0.24 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

48.0 %: cytoplasmic
24.0 %: nuclear
12.0 %: cytoskeletal
8.0 %: mitochondrial
4.0 %: vesicles of secretory system
4.0 %: endoplasmic reticulum

>> prediction for VSC107 is cyt

5' end seq. ID -
5' end seq. -
Length of 5' end seq. -
3' end seq. ID VSC107Z
3' end seq.
>VSC107Z.Seq
----------TTTTATGGATATGGTTACTTGTGATGAAACTGATAACGGTTGTGAAGGTG
GTGATGCCTTCTCTGCATGGAATTGGTTAAGAAAGCAAGGTGCTGTATCAGAAGAATGTC
TTCCATATACAATTCCAACTTGTCCACCAGCTCAACAACCATGTTTAAATTTTGTCAACA
CTCCATCATGTACTAAAGAGTGTCAATCAAATTCCTCTTTAATTTATTCTCAAGACAAAC
ATAAAATGGCTAAAATTTATTCTTTCGACTCTGATGAAGCAATCATGCAAGAAATTGTTA
CTAATGGTCCAGTCGAAGCTTGTTTCACTGTCTTTGAAGATTTCCTTGCTTACAAATCTG
GTGTTTACGTCCACACAACTGGTAAAGATTTAGGTGGTCACTGTGTTAAACTCGTTGGTT
TCGGTACCTTAAATGGTGTTGATTACTATGCCGCTAACAACCAATGGACAACTTCATGGG
GTGATAATGGAACTTTCTTAATCAAACGTGGTGATTGCGGTATCTCTGATGACGTTGTTG
CTGGTTTACCATAAATAAAAATAAGTTTTAAACATTTTGTAAAA
Length of 3' end seq. 574
Connected seq. ID -
Connected seq. -
Length of connected seq. -
Full length Seq ID -
Full length Seq. -
Length of full length seq. -