VSD113
Library VS
Clone ID VSD113
Atlas ID -
NBRP ID
dictyBase ID
Link to Contig Contig-U16602-1
Representative seq. ID VSD113E
Representative DNA sequence
>VSD113 (VSD113Q) /CSM/VS/VSD1-A/VSD113Q.Seq.d/
AATAATAATAAATAAATAAATAAAATGAATATACTTAAAGCTTTAGTAGTTTTATGTTTT
GTTTACGTTTCAATGGGTATCAAAGTTGATACTCAAACTGGTTTGAGATATACCAGAGAA
TCATGTTATAAAAAGAGTGATAATCAACAAAGAGAACTCATTTTATCAACTCAACCAAAA
GATATGAATCTTGAAGTTCCACAATCATGGGATTGGAGAAATGTAAGTGGTGTCAATTAT
CTCACAATGAATCGTAATCAACATATTCCACAATATTGTGGTGGTTGTTGGGCTTTCGCT
TCAACAAGTTCTATCTCTGATAGAATTAAAATTCAACGTAAAGCTGCTTTCCCAGATGTC
AACGTTGCTCCACAACATCTTATTGATTGTAATGGTGGTGGTACATGTGATGGTGGTGAC
CCAGGTGATGCTTTCGCTTTTATCAATGAAAATGGTATTGTTGATGAAACTTGCAAACCA
TACCAAGCTAAAAATTTACCAGACGAATGTTCACCAGCTTGCAAAACCTGTAATCCAGAT
GGTACTTGTCAAGCTATTCCAGTTCATACCAATATCACTGTAACTGAATATGGTTCAGTT
AGAGGTGCTAAAGATATGATGGCTGAAATTTATGCTCGTGGTCCAATCGCTTGTTCAATT
GATGCTACCTCTAAATTAGAAGCCTATACCTCAGGTATCTTCAAGGAATTCAAACTCGAC
CCACTTCCAAATCACATCATCAGTGTTATTGGTTGGGGTGTACAAGACTCAACTCCATAT
TGGATCGTTCGTAATTCTTGGGGTAGTTATTATGGTGAAGGTGGTTTCTTCAACATTGTC
CAAGGTTCTCTTTTCGAAAATTTAGGTATTGAGTTAGACTGTAACTGGGCTGTTCCATCT
GTATCTTTGCTTAAACTAAATAAATAAAATTATCATCATCGATT
sequence update 2001. 3.25
Translated Amino Acid sequence
nnnk*INKMNILKALVVLCFVYVSMGIKVDTQTGLRYTRESCYKKSDNQQRELILSTQPK
DMNLEVPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFPDV
NVAPQHLIDCNGGGTCDGGDPGDAFAFINENGIVDETCKPYQAKNLPDECSPACKTCNPD
GTCQAIPVHTNITVTEYGSVRGAKDMMAEIYARGPIACSIDATSKLEAYTSGIFKEFKLD
PLPNHIISVIGWGVQDSTPYWIVRNSWGSYYGEGGFFNIVQGSLFENLGIELDCNWAVPS
VSLLKLNK*nyhhr


Translated Amino Acid sequence (All Frames)
Frame A:
nnnk*INKMNILKALVVLCFVYVSMGIKVDTQTGLRYTRESCYKKSDNQQRELILSTQPK
DMNLEVPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFPDV
NVAPQHLIDCNGGGTCDGGDPGDAFAFINENGIVDETCKPYQAKNLPDECSPACKTCNPD
GTCQAIPVHTNITVTEYGSVRGAKDMMAEIYARGPIACSIDATSKLEAYTSGIFKEFKLD
PLPNHIISVIGWGVQDSTPYWIVRNSWGSYYGEGGFFNIVQGSLFENLGIELDCNWAVPS
VSLLKLNK*nyhhr


Frame B:
iiink*ik*iylkl**fyvlftfqwvsklilklv*dipenhvikrviinkensfyqlnqk
i*ilkfhnhgigem*vvsiisq*ivinifhnivvvvglslqqvlslielkfnvkllsqms
tllhnillivmvvvhvmvvtqvmlsllsmkmvllmklanhtklkiyqtnvhqlakpviqm
vlvklfqfipisl*lnmvqlevlki*wlkfmlvvqslvqlmlpln*kpipqvssrnsnst
hfqitssvllvgvyktqlhigsfvilgvvimvkvvsstlskvlfski*vls*tvtglfhl
ylcln*inkiiiid


Frame C:
***ink*neyt*sfssfmfclrfngyqs*ysnwfeiyqriml*ke**stkrthfinstkr
yes*sstimglekckwcqlshnes*stystilwwllgfrfnkfyl**n*nst*scfprcq
rcsttsy*l*wwwym*ww*pr*cfrfyq*kwyc**nlqtips*kftrrmftslqnl*srw
ylssysssyqyhcn*iwfs*rc*rydg*nlcswsnrlfn*cyl*irslylrylqgiqtrp
tskshhqcywlgctrlnsildrs*flg*llw*rwflqhcprfsfrkfry*vrl*lgcsic
ifa*tk*iklsssi
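
The three forward frames listed above can be reproduced with a short script. The following is a minimal sketch, assuming the standard genetic code and the forward strand only. The names `translate` and `three_frames` are illustrative and are not part of the database's own tooling, and the record's mixed-case formatting is not reproduced (output is upper case throughout).

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, offset: int = 0) -> str:
    """Translate a DNA string codon by codon, starting at `offset` (0, 1 or 2).

    Whitespace and line breaks are ignored; stop codons become '*',
    and any codon containing a non-ACGT character becomes 'X'.
    A trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frames(dna: str) -> dict:
    """Return the three forward-frame translations keyed 'A', 'B', 'C'."""
    return {name: translate(dna, offset) for offset, name in enumerate("ABC")}


if __name__ == "__main__":
    # First 51 bases of the representative sequence in this record.
    start = "AATAATAATAAATAAATAAATAAAATGAATATACTTAAAGCTTTAGTAGTT"
    for name, protein in three_frames(start).items():
        print(f"Frame {name}: {protein}")
```

Run on the first 51 bases of the representative sequence, frame A gives `NNNK*INKMNILKALVV`, which matches the start of the Frame A listing apart from letter case.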


Homology vs CSM-cDNA

                                                   Score  E
Sequences producing significant alignments:       (bits)  Value

VSD113 (VSD113Q) /CSM/VS/VSD1-A/VSD113Q.Seq.d/      1806  0.0
VFD780 (VFD780Q) /CSM/VF/VFD7-D/VFD780Q.Seq.d/      1800  0.0
VSF638 (VSF638Q) /CSM/VS/VSF6-B/VSF638Q.Seq.d/      1740  0.0
VFA818 (VFA818Q) /CSM/VF/VFA8-A/VFA818Q.Seq.d/      1699  0.0
SSC608 (SSC608Q) /CSM/SS/SSC6-A/SSC608Q.Seq.d/      1608  0.0
VSE347 (VSE347Q) /CSM/VS/VSE3-B/VSE347Q.Seq.d/      1283  0.0
SSA582 (SSA582Q) /CSM/SS/SSA5-D/SSA582Q.Seq.d/      1263  0.0
SSD547 (SSD547Q) /CSM/SS/SSD5-B/SSD547Q.Seq.d/      1037  0.0
FC-BL05 (FC-BL05Q) /CSM/FC/FC-BL/FC-BL05Q.Seq.d/    1019  0.0
VHF749 (VHF749Q) /CSM/VH/VHF7-C/VHF749Q.Seq.d/       769  0.0

own update 2004.12.25
Homology vs DNA

Sequences producing significant alignments:  Score (bits)  E value  N

BG224657|BG224657.1 kp49c02.y1 TBN95TM-SSFH Strongyloides stercoralis cDNA 5' similar to WP:F32B5.8 CE09855 CYSTEINE PROTEINASE ;, mRNA sequence.  50  7e-07  3
BI743363|BI743363.1 kx43b03.y1 Parastrongyloides trichosuri FL pAMP1 v1 Chiapelli McCarter Parastrongyloides trichosuri cDNA 5' similar to TR:O01850 O01850 SIMILAR TO CYSTEINE PROTEINASE. [1] ;, mRNA sequence.  62  1e-05  1
BI743660|BI743660.1 kx50c01.y1 Parastrongyloides trichosuri PA pAMP1 v1 Chiapelli McCarter Parastrongyloides trichosuri cDNA 5' similar to TR:Q9XZI2 Q9XZI2 CATHEPSIN Z1 PREPROPROTEIN. ;, mRNA sequence.  62  1e-05  1
BI501371|BI501371.1 kx31d07.y1 Parastrongyloides trichosuri PA pAMP1 v1 Chiapelli McCarter Parastrongyloides trichosuri cDNA 5' similar to TR:O01850 O01850 SIMILAR TO CYSTEINE PROTEINASE. [1] ;, mRNA sequence.  62  1e-05  1
U71150|U71150.1 Onchocerca volvulus cysteine protease precursor mRNA, complete cds.  46  3e-05  3
BP522506|BP522506.1 Hydra magnipapillata cDNA, clone:hmp_20381.  40  0.001  2
BP509588|BP509588.1 Hydra magnipapillata cDNA, clone:hmp_04626.  38  0.069  2
AC115594|AC115594.2 Dictyostelium discoideum chromosome 2 map 4071862-4101005 strain AX4, complete sequence.  36  0.65  4
AL929507|AL929507.10 Zebrafish DNA sequence *** SEQUENCING IN PROGRESS *** from clone DKEY-7C18.  46  0.79  1
CF514057|CF514057.1 CAbud0007_IVR_A11 Vitis vinifera cv. cabernet sauvignon (Clone 8) Bud - CABUD Vitis vinifera cDNA clone CAbud0007_IVR_A11 3', mRNA sequence.  46  0.79  1
dna update 2003. 9.19
Homology vs Protein

                                                                    Score  E
Sequences producing significant alignments:                        (bits)  Value

AY950579_1(AY950579|pid:none) Paralichthys olivaceus cathepsin Z...   234  2e-60
AY226449_1(AY226449|pid:none) Fundulus heteroclitus cathepsin Z ...   231  2e-59
BT075594_1(BT075594|pid:none) Osmerus mordax clone omor-eva-517-...   230  5e-59
AY890922_1(AY890922|pid:none) Synthetic construct Homo sapiens c...   228  2e-58
(Q9UBR2) RecName: Full=Cathepsin Z; EC=3.4.18.1; AltNam...            228  2e-58
AF009923_1(AF009923|pid:none) Homo sapiens preprocathepsin P mRN...   228  2e-58
AY891244_1(AY891244|pid:none) Synthetic construct Homo sapiens c...   228  2e-58
AY949988_1(AY949988|pid:none) Cyprinus carpio cathepsin Z (CTPZ)...   228  2e-58
BT019915_1(BT019915|pid:none) Homo sapiens cathepsin Z mRNA, com...   228  3e-58
AF032906_1(AF032906|pid:none) Homo sapiens cathepsin Z precursor...   227  4e-58
protein update 2009. 7.29
PSORT

psg: 0.97 gvh: 0.69 alm: 0.44 top: 0.40 tms: 0.00 mit: 0.26 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.20 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

52.0 %: extracellular, including cell wall
12.0 %: vacuolar
8.0 %: cytoplasmic
8.0 %: mitochondrial
8.0 %: nuclear
8.0 %: endoplasmic reticulum
4.0 %: Golgi

>> prediction for VSD113 is exc

5' end seq. ID VSD113F
5' end seq.
>VSD113F.Seq
AATAATAATAAATAAATAAATAAAATGAATATACTTAAAGCTTTAGTAGTTTTATGTTTT
GTTTACGTTTCAATGGGTATCAAAGTTGATACTCAAACTGGTTTGAGATATACCAGAGAA
TCATGTTATAAAAAGAGTGATAATCAACAAAGAGAACTCATTTTATCAACTCAACCAAAA
GATATGAATCTTGAAGTTCCACAATCATGGGATTGGAGAAATGTAAGTGGTGTCAATTAT
CTCACAATGAATCGTAATCAACATATTCCACAATATTGTGGTGGTTGTTGGGCTTTCGCT
TCAACAAGTTCTATCTCTGATAGAATTAAAATTCAACGTAAAGCTGCTTTCCCAGATGTC
AACGTTGCTCCACAACATCTTATTGATTGTAATGGTGGTGGTACATGTGATGGTGGTGAC
CCAGGTGATGCTTTCGCTTTTATCAATGAAAATGGTATTGTTGATGAAACTTGCAAACCA
TACCAAGCTAAAAATTTACCAGACGAATGTTCACCAGCTTGCAAAACCTGTAATCCAGAT
GGTACTTGTCAAGCT----------
Length of 5' end seq. 555
3' end seq. ID VSD113Z
3' end seq.
>VSD113Z.Seq
----------TGTAAGTGGTGTCAATTATCTCACAATGAATCGTAATCAACATATTCCAC
AATATTGTGGTGGTTGTTGGGCTTTCGCTTCAACAAGTTCTATCTCTGATAGAATTAAAA
TTCAACGTAAAGCTGCTTTCCCAGATGTCAACGTTGCTCCACAACATCTTATTGATTGTA
ATGGTGGTGGTACATGTGATGGTGGTGACCCAGGTGATGCTTTCGCTTTTATCAATGAAA
ATGGTATTGTTGATGAAACTTGCAAACCATACCAAGCTAAAAATTTACCAGACGAATGTT
CACCAGCTTGCAAAACCTGTAATCCAGATGGTACTTGTCAAGCTATTCCAGTTCATACCA
ATATCACTGTAACTGAATATGGTTCAGTTAGAGGTGCTAAAGATATGATGGCTGAAATTT
ATGCTCGTGGTCCAATCGCTTGTTCAATTGATGCTACCTCTAAATTAGAAGCCTATACCT
CAGGTATCTTCAAGGAATTCAAACTCGACCCACTTCCAAATCACATCATCAGTGTTATTG
GTTGGGGTGTACAAGACTCAACTCCATATTGGATCGTTCGTAATTCTTGGGGTAGTTATT
ATGGTGAAGGTGGTTTCTTCAACATTGTCCAAGGTTCTCTTTTCGAAAATTTAGGTATTG
AGTTAGACTGTAACTGGGCTGTTCCATCTGTATCTTTGCTTAAACTAAATAAATAAAATT
ATCATCATCGATT
Length of 3' end seq. 723
Connected seq. ID VSD113P
Connected seq.
>VSD113P.Seq
AATAATAATAAATAAATAAATAAAATGAATATACTTAAAGCTTTAGTAGTTTTATGTTTT
GTTTACGTTTCAATGGGTATCAAAGTTGATACTCAAACTGGTTTGAGATATACCAGAGAA
TCATGTTATAAAAAGAGTGATAATCAACAAAGAGAACTCATTTTATCAACTCAACCAAAA
GATATGAATCTTGAAGTTCCACAATCATGGGATTGGAGAAATGTAAGTGGTGTCAATTAT
CTCACAATGAATCGTAATCAACATATTCCACAATATTGTGGTGGTTGTTGGGCTTTCGCT
TCAACAAGTTCTATCTCTGATAGAATTAAAATTCAACGTAAAGCTGCTTTCCCAGATGTC
AACGTTGCTCCACAACATCTTATTGATTGTAATGGTGGTGGTACATGTGATGGTGGTGAC
CCAGGTGATGCTTTCGCTTTTATCAATGAAAATGGTATTGTTGATGAAACTTGCAAACCA
TACCAAGCTAAAAATTTACCAGACGAATGTTCACCAGCTTGCAAAACCTGTAATCCAGAT
GGTACTTGTCAAGCT----------TGTAAGTGGTGTCAATTATCTCACAATGAATCGTA
ATCAACATATTCCACAATATTGTGGTGGTTGTTGGGCTTTCGCTTCAACAAGTTCTATCT
CTGATAGAATTAAAATTCAACGTAAAGCTGCTTTCCCAGATGTCAACGTTGCTCCACAAC
ATCTTATTGATTGTAATGGTGGTGGTACATGTGATGGTGGTGACCCAGGTGATGCTTTCG
CTTTTATCAATGAAAATGGTATTGTTGATGAAACTTGCAAACCATACCAAGCTAAAAATT
TACCAGACGAATGTTCACCAGCTTGCAAAACCTGTAATCCAGATGGTACTTGTCAAGCTA
TTCCAGTTCATACCAATATCACTGTAACTGAATATGGTTCAGTTAGAGGTGCTAAAGATA
TGATGGCTGAAATTTATGCTCGTGGTCCAATCGCTTGTTCAATTGATGCTACCTCTAAAT
TAGAAGCCTATACCTCAGGTATCTTCAAGGAATTCAAACTCGACCCACTTCCAAATCACA
TCATCAGTGTTATTGGTTGGGGTGTACAAGACTCAACTCCATATTGGATCGTTCGTAATT
CTTGGGGTAGTTATTATGGTGAAGGTGGTTTCTTCAACATTGTCCAAGGTTCTCTTTTCG
AAAATTTAGGTATTGAGTTAGACTGTAACTGGGCTGTTCCATCTGTATCTTTGCTTAAAC
TAAATAAATAAAATTATCATCATCGATT
Length of connected seq. 1278
Full length Seq ID VSD113E
Full length Seq.
>VSD113E.Seq
AATAATAATAAATAAATAAATAAAATGAATATACTTAAAGCTTTAGTAGTTTTATGTTTT
GTTTACGTTTCAATGGGTATCAAAGTTGATACTCAAACTGGTTTGAGATATACCAGAGAA
TCATGTTATAAAAAGAGTGATAATCAACAAAGAGAACTCATTTTATCAACTCAACCAAAA
GATATGAATCTTGAAGTTCCACAATCATGGGATTGGAGAAATGTAAGTGGTGTCAATTAT
CTCACAATGAATCGTAATCAACATATTCCACAATATTGTGGTGGTTGTTGGGCTTTCGCT
TCAACAAGTTCTATCTCTGATAGAATTAAAATTCAACGTAAAGCTGCTTTCCCAGATGTC
AACGTTGCTCCACAACATCTTATTGATTGTAATGGTGGTGGTACATGTGATGGTGGTGAC
CCAGGTGATGCTTTCGCTTTTATCAATGAAAATGGTATTGTTGATGAAACTTGCAAACCA
TACCAAGCTAAAAATTTACCAGACGAATGTTCACCAGCTTGCAAAACCTGTAATCCAGAT
GGTACTTGTCAAGCTATTCCAGTTCATACCAATATCACTGTAACTGAATATGGTTCAGTT
AGAGGTGCTAAAGATATGATGGCTGAAATTTATGCTCGTGGTCCAATCGCTTGTTCAATT
GATGCTACCTCTAAATTAGAAGCCTATACCTCAGGTATCTTCAAGGAATTCAAACTCGAC
CCACTTCCAAATCACATCATCAGTGTTATTGGTTGGGGTGTACAAGACTCAACTCCATAT
TGGATCGTTCGTAATTCTTGGGGTAGTTATTATGGTGAAGGTGGTTTCTTCAACATTGTC
CAAGGTTCTCTTTTCGAAAATTTAGGTATTGAGTTAGACTGTAACTGGGCTGTTCCATCT
GTATCTTTGCTTAAACTAAATAAATAAAATTATCATCATCGATT
Length of full length seq. 944
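
The reported lengths are consistent with the full-length sequence being the 5' and 3' end reads merged across a shared region: 555 + 723 - 944 = 334 overlapping bases, whereas the connected sequence (1278 = 555 + 723) simply joins the two reads around the dashed gap marker. Below is a minimal sketch of such a merge, assuming an exact suffix-prefix overlap and treating dashes as padding rather than bases. `strip_padding`, `merge_reads` and `min_overlap` are hypothetical names chosen for illustration, and this is not necessarily how the database assembled the record.

```python
def strip_padding(read: str) -> str:
    """Remove whitespace and '-' padding from a read and upper-case it."""
    return "".join(read.split()).replace("-", "").upper()


def merge_reads(five_prime: str, three_prime: str, min_overlap: int = 20):
    """Merge two reads on their longest exact suffix-prefix overlap.

    Returns (merged_sequence, overlap_length). Raises ValueError when no
    overlap of at least `min_overlap` bases is found.
    """
    left = strip_padding(five_prime)
    right = strip_padding(three_prime)
    # Try the longest possible overlap first and shrink until one matches.
    for size in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return left + right[size:], size
    raise ValueError("no overlap of the required length found")


if __name__ == "__main__":
    # Toy reads, not taken from this record.
    merged, overlap = merge_reads("AAACCCGGGTTT----", "----GGGTTTACGT", min_overlap=3)
    print(merged, overlap)
```

With the toy reads in the example, the merged sequence has length 12 + 10 - 6 = 16, the same arithmetic as 555 + 723 - 334 = 944 for this clone. Real reads can contain sequencing errors, in which case an exact-match merge like this one would fail and an alignment-based method would be needed.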