VSJ209
Library VS
(Link to library)
Clone ID VSJ209
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U15456-1
Original site URL
Representative seq. ID VSJ209E
(Link to Original site)
Representative DNA sequence
>VSJ209 (VSJ209Q) /CSM/VS/VSJ2-A/VSJ209Q.Seq.d/
GAAATTACTTATTTTAACTTTATTTTTAGCTACTATTGTTTTAGCTCAAGCTTTAACAGT
ACCATTAAACTTCCATCAAGCTTCAAGAGAATCTAGAAGAAGAGTTCCACAAAAATGGTC
AAACAGATTATCTGCTCTCAATGCTGGTACCACAATCCCAATTTCAGATTTTGAAGATGC
TCAATACTATGGTGCCATTACCATTGGTACCCCAGGTCAAGCCTTCAAAGTAGTTTTCGA
TACTGGTTCATCCAACTTGTGGATTCCATCAAAGAAATGTCCAATCACTGTTGTTGCATG
TGATTTACATAACAAATATAACAGCGGTGCCTCAAGCACATATGTCGCCAACGGAACTGA
TTTCACCATCCAATACGGTAGTGGTGCTATGTCAGGTTTTGTCTCTCAAGATTCCGTCAC
TGTTGGTTCATTAACTGTTAAAGATCAATTATTCGCTGAAGCCACTGCCGAACCAGGTAT
TGCTTTCGATTTCGCCAAATTCGATGGTATTTTAGGTCTTGCTTTCCAATCCATCTNTNN
NNAATTCAATTCCACCAGTCTTTTACAACATGTTATCACAAGGTTTAGTTTCATCAACAC
TCTTCTCCTTCTGGTTATCAAGAACTCCAGGTGCCAACGGTGGTGAACTCTCATTCGGTT
CAATCGATAACACCAAANACACTGGTGACAATTACCTACGTCCCATTAACCAACGAAACC
TATTGGGAATTCGTTATGGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGT
ACTACTTGTCACGCAATTTGCGATTCAGGTACATCACTCATTGCTGGTCCAATAGCTGAT
ATTACTGCCCTCAATGAAAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCT
GATTGTAGCGTTATCAACACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTT
GTTTTAACTCCAAAAGAATACGTTTTAGAAGTTACTGAGTTCGGAAAGACTGAATGTTTG
AGTGGATTTATGGGTATCGAGTTAAACATGGNNAATTTCTGGATCCTTGGTGATGTTTTC
ATCTCTGCANTACTATACTGTATTCGATTTTGGTAATAAACAAGTTGGTTTCGCAACTGC
CATTCAAGGTTAAATTTTTAGT
sequence update 2001. 3.26
Translated Amino Acid sequence
KLLILTLFLATIVLAQALTVPLNFHQASRESRRRVPQKWSNRLSALNAGTTIPISDFEDA
QYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGASSTYVANGTD
FTIQYGSGAMSGFVSQDSVTVGSLTVKDQLFAEATAEPGIAFDFAKFDGILGLAFQSIXX
XFNSTSLLQHVITRFSFINTLLLLVIKNSRCQRW*tlirfnr*hqxhw*qlptsh*ptkp
ignslwmtllsmvnqlvsvvllvtqfaiqvhhsllvq*lillpsmkn*vlss*mvkvssl
ivalstpyqmlpspllvvnlf*lqkntf*kllsserlnv*vdlwvss*twxisgslvmfs
slxyytvfdfgnkqvgfataiqg*ifs


Translated Amino Acid sequence (All Frames)
Frame A:
eityfnfifsyycfsssfnstiklpssfkri*kksstkmvkqiicsqcwyhnpnfrf*rc
silwchyhwyprsslqssfrywfiqlvdsikemsnhcccm*ft*qi*qrclkhicrqrn*
fhhpir*wcyvrfclsrfrhcwfinc*rsiir*shcrtrycfrfrqirwyfrscfpihlx
xiqfhqsfttcyhkv*fhqhsspsgyqelqvptvvnshsvqsitpxtlvtityvpltnet
ywefvmddfaidgqsagfcgttchaicdsgtsliagpiaditalneklgavilngegvfs
dcsvintlpnvtitvagrefvltpkeyvlevtefgkteclsgfmgielnmxnfwilgdvf
isaxlycirfw**tswfrnchsrlnf*


Frame B:
KLLILTLFLATIVLAQALTVPLNFHQASRESRRRVPQKWSNRLSALNAGTTIPISDFEDA
QYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGASSTYVANGTD
FTIQYGSGAMSGFVSQDSVTVGSLTVKDQLFAEATAEPGIAFDFAKFDGILGLAFQSIXX
XFNSTSLLQHVITRFSFINTLLLLVIKNSRCQRW*tlirfnr*hqxhw*qlptsh*ptkp
ignslwmtllsmvnqlvsvvllvtqfaiqvhhsllvq*lillpsmkn*vlss*mvkvssl
ivalstpyqmlpspllvvnlf*lqkntf*kllsserlnv*vdlwvss*twxisgslvmfs
slxyytvfdfgnkqvgfataiqg*ifs


Frame C:
nylf*lyf*lllf*lkl*qyh*tsiklqenleeefhkngqtdyllsmlvpqsqfqilkml
ntmvplplvpqvkpsk*fsilvhptcgfhqrnvqslllhviyitnitavpqahmspteli
spsntvvvlcqvlslkipsllvh*llkinyslkplpnqvllsispnsmvf*vllsnpsxx
nsippvfynmlsqglvsstlfsfwlsrtpganggelsfgsidntkxtgdnylrpinqrnl
lgiryg*lcyrwsiswflwyylsrnlrfryithcwsns*yycpq*kircchlkw*rcll*
l*ryqhltkcyhhrcws*icfnskrirfrsy*vrkd*mfewiygyrvkhgxfldpw*cfh
lcxtilysilvinklvsqlpfkvkfl


Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VSJ209 (VSJ209Q) /CSM/VS/VSJ2-A/VSJ209Q.Seq.d/ 2250 0.0
VFN380 (VFN380Q) /CSM/VF/VFN3-D/VFN380Q.Seq.d/ 2194 0.0
VFN222 (VFN222Q) /CSM/VF/VFN2-A/VFN222Q.Seq.d/ 2194 0.0
VFM881 (VFM881Q) /CSM/VF/VFM8-D/VFM881Q.Seq.d/ 2194 0.0
VFM519 (VFM519Q) /CSM/VF/VFM5-A/VFM519Q.Seq.d/ 2194 0.0
VFM234 (VFM234Q) /CSM/VF/VFM2-B/VFM234Q.Seq.d/ 2194 0.0
VFL410 (VFL410Q) /CSM/VF/VFL4-A/VFL410Q.Seq.d/ 2194 0.0
VFL385 (VFL385Q) /CSM/VF/VFL3-D/VFL385Q.Seq.d/ 2194 0.0
VFL274 (VFL274Q) /CSM/VF/VFL2-D/VFL274Q.Seq.d/ 2194 0.0
VFL252 (VFL252Q) /CSM/VF/VFL2-C/VFL252Q.Seq.d/ 2194 0.0

own update 2004. 8. 9
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

Y16962|Y16962.1 Dictyostelium discoideum mRNA for cathepsin D. 1063 0.0 4
AJ243946|AJ243946.1 Dictyostelium discoideum ctsD gene for cathepsin D, exons 1 to 2. 950 0.0 5
AL762930|AL762930.1 Arabidopsis thaliana T-DNA flanking sequence GK-030C11-011522. 62 2e-05 1
AV567523|AV567523.1 Arabidopsis thaliana cDNA clone:SQL15g03F, 3' end. 62 2e-05 1
AF372974|AF372974.1 Arabidopsis thaliana AT4g04460/T26N6_7 mRNA, complete cds. 62 2e-05 1
BE525959|BE525959.1 M64C07STM Arabidopsis developing seed Arabidopsis thaliana cDNA clone 600034526R1 5', mRNA sequence. 62 2e-05 1
CD825453|CD825453.1 BN25.060N03F011129 BN25 Brassica napus cDNA clone BN25060N03, mRNA sequence. 62 2e-05 1
AL765461|AL765461.1 Arabidopsis thaliana T-DNA flanking sequence GK-139E08-012875. 62 2e-05 1
E33916|E33916.1 Candida boidinii strain with lowered protease activity and utilization thereof as host for producing foreign protein. 36 4e-05 4
CB814835|CB814835.1 USDA-FP_100247 Adult Alate Brown Citrus Aphid Toxoptera citricida cDNA clone WHWTC-04_A10 5', mRNA sequence. 36 4e-05 3
dna update 2003. 7.18
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(Q03168) RecName: Full=Lysosomal aspartic protease; EC=... 179 7e-87
EF213114_1(EF213114|pid:none) Penaeus monodon cathepsin D mRNA, ... 183 4e-85
DQ010007_1(DQ010007|pid:none) Bombyx mori CathD mRNA, complete c... 184 1e-84
EF070454_1(EF070454|pid:none) Maconellicoccus hirsutus clone WHM... 178 9e-84
FJ168036_1(FJ168036|pid:none) Fasciola hepatica cathepsin D-like... 172 1e-81
DQ909010_1(DQ909010|pid:none) Clonorchis sinensis aspartic prote... 170 2e-81
U90750_1(U90750|pid:none) Schistosoma japonicum aspartic proteas... 172 7e-80
L41346_1(L41346|pid:none) Schistosoma japonicum aspartic protein... 172 7e-80
EF000001_1(EF000001|pid:none) Fasciola hepatica cathepsin D-like... 172 1e-79
U60995_1(U60995|pid:none) Schistosoma mansoni aspartic proteinas... 172 6e-79
protein update 2009. 3.25
PSORT

psg: 0.98 gvh: 0.77 alm: 0.33 top: 0.70 tms: 0.00 mit: 0.37 mip: 0.08
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

44.0 %: extracellular, including cell wall
24.0 %: mitochondrial
12.0 %: cytoplasmic
8.0 %: vacuolar
4.0 %: plasma membrane
4.0 %: nuclear
4.0 %: endoplasmic reticulum

>> prediction for VSJ209 is exc

5' end seq. ID VSJ209F
5' end seq.
>VSJ209F.Seq
GAAATTACTTATTTTAACTTTATTTTTAGCTACTATTGTTTTAGCTCAAGCTTTAACAGT
ACCATTAAACTTCCATCAAGCTTCAAGAGAATCTAGAAGAAGAGTTCCACAAAAATGGTC
AAACAGATTATCTGCTCTCAATGCTGGTACCACAATCCCAATTTCAGATTTTGAAGATGC
TCAATACTATGGTGCCATTACCATTGGTACCCCAGGTCAAGCCTTCAAAGTAGTTTTCGA
TACTGGTTCATCCAACTTGTGGATTCCATCAAAGAAATGTCCAATCACTGTTGTTGCATG
TGATTTACATAACAAATATAACAGCGGTGCCTCAAGCACATATGTCGCCAACGGAACTGA
TTTCACCATCCAATACGGTAGTGGTGCTATGTCAGGTTTTGTCTCTCAAGATTCCGTCAC
TGTTGGTTCATTAACTGTTAAAGATCAATTATTCGCTGAAGCCACTGCCGAACCAGGTAT
TGCTTTCGATTTCGCCAAATTCGATGGTATTTTAGGTCTTGCTTTCCAATCCATCTCTGT
TA----------
Length of 5' end seq. 542
3' end seq. ID VSJ209Z
3' end seq.
>VSJ209Z.Seq
----------TATTCGCTGAAGCCACTGCCGAACCAGGTATTGCTTTCGATTTCGCCAAA
TTCGATGGTATTTTAGGTCTTGCTTTCCAATCCATCTATTGTTAATTCAATTCCACCAGT
CTTTTACAACATGTTATCACAAGGTTTAGTTTCATCAACACTCTTCTCCTTCTGGTTATC
AAGAACTCCAGGTGCCAACGGTGGTGAACTCTCATTCGGTTCAATCGATAACACCAAANA
CACTGGTGACAATTACCTACGTCCCATTAACCAACGAAACCTATTGGGAATTCGTTATGG
ATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGTACTACTTGTCACGCAATTT
GCGATTCAGGTACATCACTCATTGCTGGTCCAATAGCTGATATTACTGCCCTCAATGAAA
AATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGATTGTAGCGTTATCAACA
CCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTTGTTTTAACTCCAAAAGAAT
ACGTTTTAGAAGTTACTGAGTTCGGAAAGACTGAATGTTTGAGTGGATTTATGGGTATCG
AGTTAAACATGGNNAATTTCTGGATCCTTGGTGATGTTTTCATCTCTGCANTACTATACT
GTATTCGATTTTGGTAATAAACAAGTTGGTTTCGCAACTGCCATTCAAGGTTAAATTTTT
AGT
Length of 3' end seq. 713
Connected seq. ID VSJ209P
Connected seq.
>VSJ209P.Seq
GAAATTACTTATTTTAACTTTATTTTTAGCTACTATTGTTTTAGCTCAAGCTTTAACAGT
ACCATTAAACTTCCATCAAGCTTCAAGAGAATCTAGAAGAAGAGTTCCACAAAAATGGTC
AAACAGATTATCTGCTCTCAATGCTGGTACCACAATCCCAATTTCAGATTTTGAAGATGC
TCAATACTATGGTGCCATTACCATTGGTACCCCAGGTCAAGCCTTCAAAGTAGTTTTCGA
TACTGGTTCATCCAACTTGTGGATTCCATCAAAGAAATGTCCAATCACTGTTGTTGCATG
TGATTTACATAACAAATATAACAGCGGTGCCTCAAGCACATATGTCGCCAACGGAACTGA
TTTCACCATCCAATACGGTAGTGGTGCTATGTCAGGTTTTGTCTCTCAAGATTCCGTCAC
TGTTGGTTCATTAACTGTTAAAGATCAATTATTCGCTGAAGCCACTGCCGAACCAGGTAT
TGCTTTCGATTTCGCCAAATTCGATGGTATTTTAGGTCTTGCTTTCCAATCCATCTCTGT
TA----------TATTCGCTGAAGCCACTGCCGAACCAGGTATTGCTTTCGATTTCGCCA
AATTCGATGGTATTTTAGGTCTTGCTTTCCAATCCATCTATTGTTAATTCAATTCCACCA
GTCTTTTACAACATGTTATCACAAGGTTTAGTTTCATCAACACTCTTCTCCTTCTGGTTA
TCAAGAACTCCAGGTGCCAACGGTGGTGAACTCTCATTCGGTTCAATCGATAACACCAAA
NACACTGGTGACAATTACCTACGTCCCATTAACCAACGAAACCTATTGGGAATTCGTTAT
GGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGTACTACTTGTCACGCAAT
TTGCGATTCAGGTACATCACTCATTGCTGGTCCAATAGCTGATATTACTGCCCTCAATGA
AAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGATTGTAGCGTTATCAA
CACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTTGTTTTAACTCCAAAAGA
ATACGTTTTAGAAGTTACTGAGTTCGGAAAGACTGAATGTTTGAGTGGATTTATGGGTAT
CGAGTTAAACATGGNNAATTTCTGGATCCTTGGTGATGTTTTCATCTCTGCANTACTATA
CTGTATTCGATTTTGGTAATAAACAAGTTGGTTTCGCAACTGCCATTCAAGGTTAAATTT
TTAGT
Length of connected seq. 1255
Full length Seq ID VSJ209E
Full length Seq.
>VSJ209E.Seq
GAAATTACTTATTTTAACTTTATTTTTAGCTACTATTGTTTTAGCTCAAGCTTTAACAGT
ACCATTAAACTTCCATCAAGCTTCAAGAGAATCTAGAAGAAGAGTTCCACAAAAATGGTC
AAACAGATTATCTGCTCTCAATGCTGGTACCACAATCCCAATTTCAGATTTTGAAGATGC
TCAATACTATGGTGCCATTACCATTGGTACCCCAGGTCAAGCCTTCAAAGTAGTTTTCGA
TACTGGTTCATCCAACTTGTGGATTCCATCAAAGAAATGTCCAATCACTGTTGTTGCATG
TGATTTACATAACAAATATAACAGCGGTGCCTCAAGCACATATGTCGCCAACGGAACTGA
TTTCACCATCCAATACGGTAGTGGTGCTATGTCAGGTTTTGTCTCTCAAGATTCCGTCAC
TGTTGGTTCATTAACTGTTAAAGATCAATTATTCGCTGAAGCCACTGCCGAACCAGGTAT
TGCTTTCGATTTCGCCAAATTCGATGGTATTTTAGGTCTTGCTTTCCAATCCATCTNTNN
NNAATTCAATTCCACCAGTCTTTTACAACATGTTATCACAAGGTTTAGTTTCATCAACAC
TCTTCTCCTTCTGGTTATCAAGAACTCCAGGTGCCAACGGTGGTGAACTCTCATTCGGTT
CAATCGATAACACCAAANACACTGGTGACAATTACCTACGTCCCATTAACCAACGAAACC
TATTGGGAATTCGTTATGGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGT
ACTACTTGTCACGCAATTTGCGATTCAGGTACATCACTCATTGCTGGTCCAATAGCTGAT
ATTACTGCCCTCAATGAAAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCT
GATTGTAGCGTTATCAACACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTT
GTTTTAACTCCAAAAGAATACGTTTTAGAAGTTACTGAGTTCGGAAAGACTGAATGTTTG
AGTGGATTTATGGGTATCGAGTTAAACATGGNNAATTTCTGGATCCTTGGTGATGTTTTC
ATCTCTGCANTACTATACTGTATTCGATTTTGGTAATAAACAAGTTGGTTTCGCAACTGC
CATTCAAGGTTAAATTTTTAGT
Length of full length seq. 1162