VFA209
Library VF
(Link to library)
Clone ID VFA209
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U15456-1
Original site URL
Representative seq. ID VFA209P
(Link to Original site)
Representative DNA sequence
>VFA209 (VFA209Q) /CSM/VF/VFA2-A/VFA209Q.Seq.d/
AATTGTCACCTGACTTACATAAAAGAAAATGAAATTACTTATTTTAACTTTATTTTTAGC
TACTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGA
ATCTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTAC
CACAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTAC
CCCAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATC
AAAGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGC
CTCAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTAT
GTCAGGTTTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATT
ATTCGCTGAAGCCACTGCCGAACCAGGTATTGCTTTCGATTTCGCCAAATTCGATGGTAT
TTTAGGTCTTGCTTTCCAAXXXXXXXXXXTTATCACAAGGTTTAGTTTCATCAACACTCT
TCCTCNTTCTGGTTATCAAGAACTCCAGGTGCCAACGGTGGTGAACTNTCATTCGGTTCA
ATCGATAACACCAAATACACTGGTGACATTACCTACGTCCCATTAACCAACGAAACCTAT
TGGGAATTCGTTATGGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGTACT
ACTTGTCACGCAATTTGCGATTCAGGTACATCACTCATTGNTGGTCCAATGGCATGATAT
TACTGCCCTCAATGAAAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGA
TTGTAGCGTTATCAACACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTTGT
TTTAACTCCAAAAGAATACGTTTTANAAGTTACTGAGTTCGGAAAGACTGAATGTTTGAG
TGGATTTATGGGTATCGAGTTAAACATGGGAAATTTCTGGATCCTTGGTGATGTTTTCAT
CTCTGCTTACTATACTGTATTCGATTTTGGTAATAAACAAGTNGGTTTCGCAACTGCCAT
TCAAGGTTAGATTTTTTAATTATTTATATTTAAGATAGAAAGNAAACNAAAATAGAACAA
sequence update 2001. 6. 1
Translated Amino Acid sequence
ivt*lt*KKMKLLILTLFLATIVLAQALTVPLNFHQASRESRRRVPQKWSNRLSALNAGT
TIPISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGA
SSTYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVKDQLFAEATAEPGIAFDFAKFDGI
LGLAFQ---

---ITRFSFINTLPXSGYQELQVPTVVNXHSVQSITPNTLVTLPTSH*ptkpignslwmt
llsmvnqlvsvvllvtqfaiqvhhslxvqwhditalneklgavilngegvfsdcsvintl
pnvtitvagrefvltpkeyvlxvtefgkteclsgfmgielnmgnfwilgdvfisayytvf
dfgnkqvgfataiqg*if*lfifkiexkxk*n


Translated Amino Acid sequence (All Frames)
Frame A:
nchltyikeneityfnfifsyycfsssfnstiklpssfkri*kksstkmvkqiicsqcwy
hnpnfrf*rcsilwchyhwyprsslqssfrywfiqlvdsikemsnhcccm*ft*qi*qrc
lkhicrqrn*fhhpir*wcyvrfclsrfrhcwfinc*rsiir*shcrtrycfrfrqirwy
frscfp---

---lsqglvsstlflxlviknsrcqrw*txirfnr*hqihw*hylrpinqrnllgiryg*
lcyrwsiswflwyylsrnlrfryithxwsngmillpsmkn*vlss*mvkvsslivalstp
yqmlpspllvvnlf*lqkntfxkllsserlnv*vdlwvss*tweisgslvmfsslltily
silvinkxvsqlpfkvrffnylylr*kxnxnrt

Frame B:
ivt*lt*KKMKLLILTLFLATIVLAQALTVPLNFHQASRESRRRVPQKWSNRLSALNAGT
TIPISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGA
SSTYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVKDQLFAEATAEPGIAFDFAKFDGI
LGLAFQ---

---yhkv*fhqhsssfwlsrtpganggelsfgsidntkytgdityvpltnetywefvmdd
faidgqsagfcgttchaicdsgtslixgpma*yycpq*kircchlkw*rcll*l*ryqhl
tkcyhhrcws*icfnskrirfxsy*vrkd*mfewiygyrvkhgkfldpw*cfhlcllyci
rfw**tsxfrnchsrldfliiyi*drkxtkieq

Frame C:
lspdlhkrk*nylf*lyf*lllf*lkl*qyh*tsiklqenleeefhkngqtdyllsmlvp
qsqfqilkmlntmvplplvpqvkpsk*fsilvhptcgfhqrnvqslllhviyitnitavp
qahmsptelispsntvvvlcqvlslkipsllvh*llkinyslkplpnqvllsispnsmvf
*vlls---

---ITRFSFINTLPXSGYQELQVPTVVNXHSVQSITPNTLVTLPTSH*ptkpignslwmt
llsmvnqlvsvvllvtqfaiqvhhslxvqwhditalneklgavilngegvfsdcsvintl
pnvtitvagrefvltpkeyvlxvtefgkteclsgfmgielnmgnfwilgdvfisayytvf
dfgnkqvgfataiqg*if*lfifkiexkxk*n

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VFA209 (VFA209Q) /CSM/VF/VFA2-A/VFA209Q.Seq.d/ 2278 0.0
VFJ894 (VFJ894Q) /CSM/VF/VFJ8-D/VFJ894Q.Seq.d/ 1146 0.0
SFG205 (SFG205Q) /CSM/SF/SFG2-A/SFG205Q.Seq.d/ 1124 0.0
SFG219 (SFG219Q) /CSM/SF/SFG2-A/SFG219Q.Seq.d/ 1122 0.0
SFB424 (SFB424Q) /CSM/SF/SFB4-A/SFB424Q.Seq.d/ 1122 0.0
VFK481 (VFK481Q) /CSM/VF/VFK4-D/VFK481Q.Seq.d/ 1120 0.0
VFE217 (VFE217Q) /CSM/VF/VFE2-A/VFE217Q.Seq.d/ 1120 0.0
CFG766 (CFG766Q) /CSM/CF/CFG7-C/CFG766Q.Seq.d/ 1118 0.0
VFM519 (VFM519Q) /CSM/VF/VFM5-A/VFM519Q.Seq.d/ 1116 0.0
VFM401 (VFM401Q) /CSM/VF/VFM4-A/VFM401Q.Seq.d/ 1116 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

Y16962|Y16962.1 Dictyostelium discoideum mRNA for cathepsin D. 1094 0.0 5
AJ243946|AJ243946.1 Dictyostelium discoideum ctsD gene for cathepsin D, exons 1 to 2. 936 0.0 6
CB264579|CB264579.1 51-E015023-035-004-F13-T7R MPIZ-ADIS-035 Arabidopsis thaliana cDNA clone MPIZp2000F134Q 5-PRIME, mRNA sequence. 62 2e-05 1
AL161500|AL161500.2 Arabidopsis thaliana DNA chromosome 4, contig fragment No. 12. 62 2e-05 1
AL765461|AL765461.1 Arabidopsis thaliana T-DNA flanking sequence GK-139E08-012875. 62 2e-05 1
AL762930|AL762930.1 Arabidopsis thaliana T-DNA flanking sequence GK-030C11-011522. 62 2e-05 1
AV567523|AV567523.1 Arabidopsis thaliana cDNA clone:SQL15g03F, 3' end. 62 2e-05 1
AF372974|AF372974.1 Arabidopsis thaliana AT4g04460/T26N6_7 mRNA, complete cds. 62 2e-05 1
CB264640|CB264640.1 48-E014661-035-002-P12-T7R MPIZ-ADIS-035 Arabidopsis thaliana cDNA clone MPIZp2000P122Q 5-PRIME, mRNA sequence. 62 2e-05 1
BE525959|BE525959.1 M64C07STM Arabidopsis developing seed Arabidopsis thaliana cDNA clone 600034526R1 5', mRNA sequence. 62 2e-05 1
dna update 2003. 8.15
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

EF213114_1(EF213114|pid:none) Penaeus monodon cathepsin D mRNA, ... 179 2e-84
BT080419_1(BT080419|pid:none) Caligus clemensi clone ccle-evs-50... 165 2e-83
(Q03168) RecName: Full=Lysosomal aspartic protease; EC=... 169 2e-82
AF420068_1(AF420068|pid:none) Clonorchis sinensis aspartic prote... 167 1e-78
AF454831_1(AF454831|pid:none) Apriona germari cathepsin D mRNA, ... 170 6e-76
EF193385_1(EF193385|pid:none) Musca domestica aspartic proteinas... 171 1e-75
AB078420_1(AB078420|pid:none) Brugia malayi asp-2 mRNA for aspar... 153 2e-75
BC154315_1(BC154315|pid:none) Danio rerio cathepsin D, mRNA (cDN... 168 5e-75
AY050516_1(AY050516|pid:none) Danio rerio cathepsin D precursor ... 168 5e-75
BT043515_1(BT043515|pid:none) Salmo salar clone HM4_0887 catheps... 169 6e-75
protein update 2009. 6.16
PSORT

psg: 0.98 gvh: 0.77 alm: 0.41 top: 0.57 tms: 0.00 mit: 0.36 mip: 0.08
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

48.0 %: extracellular, including cell wall
20.0 %: mitochondrial
12.0 %: cytoplasmic
8.0 %: vacuolar
4.0 %: Golgi
4.0 %: nuclear
4.0 %: endoplasmic reticulum

>> prediction for VFA209 is exc

5' end seq. ID VFA209F
5' end seq.
>VFA209F.Seq
AATTGTCACCTGACTTACATAAAAGAAAATGAAATTACTTATTTTAACTTTATTTTTAGC
TACTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGA
ATCTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTAC
CACAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTAC
CCCAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATC
AAAGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGC
CTCAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTAT
GTCAGGTTTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATT
ATTCGCTGAAGCCACTGCCGAACCAGGTATTGCTTTCGATTTCGCCAAATTCGATGGTAT
TTTAGGTCTTGCTTTCCAA----------
Length of 5' end seq. 559
3' end seq. ID VFA209Z
3' end seq.
>VFA209Z.Seq
----------TTATCACAAGGTTTAGTTTCATCAACACTCTTCCTCNTTCTGGTTATCAA
GAACTCCAGGTGCCAACGGTGGTGAACTNTCATTCGGTTCAATCGATAACACCAAATACA
CTGGTGACATTACCTACGTCCCATTAACCAACGAAACCTATTGGGAATTCGTTATGGATG
ACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGTACTACTTGTCACGCAATTTGCG
ATTCAGGTACATCACTCATTGNTGGTCCAATGGCATGATATTACTGCCCTCAATGAAAAA
TTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGATTGTAGCGTTATCAACACC
TTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTTGTTTTAACTCCAAAAGAATAC
GTTTTANAAGTTACTGAGTTCGGAAAGACTGAATGTTTGAGTGGATTTATGGGTATCGAG
TTAAACATGGGAAATTTCTGGATCCTTGGTGATGTTTTCATCTCTGCTTACTATACTGTA
TTCGATTTTGGTAATAAACAAGTNGGTTTCGCAACTGCCATTCAAGGTTAGATTTTTTAA
TTATTTATATTTAAGATAGAAAGNAAACNAAAATAGAACAA
Length of 3' end seq. 631
Connected seq. ID VFA209P
Connected seq.
>VFA209P.Seq
AATTGTCACCTGACTTACATAAAAGAAAATGAAATTACTTATTTTAACTTTATTTTTAGC
TACTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGA
ATCTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTAC
CACAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTAC
CCCAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATC
AAAGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGC
CTCAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTAT
GTCAGGTTTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATT
ATTCGCTGAAGCCACTGCCGAACCAGGTATTGCTTTCGATTTCGCCAAATTCGATGGTAT
TTTAGGTCTTGCTTTCCAA----------TTATCACAAGGTTTAGTTTCATCAACACTCT
TCCTCNTTCTGGTTATCAAGAACTCCAGGTGCCAACGGTGGTGAACTNTCATTCGGTTCA
ATCGATAACACCAAATACACTGGTGACATTACCTACGTCCCATTAACCAACGAAACCTAT
TGGGAATTCGTTATGGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCTGTGGTACT
ACTTGTCACGCAATTTGCGATTCAGGTACATCACTCATTGNTGGTCCAATGGCATGATAT
TACTGCCCTCAATGAAAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCTTCTCTGA
TTGTAGCGTTATCAACACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTGAATTTGT
TTTAACTCCAAAAGAATACGTTTTANAAGTTACTGAGTTCGGAAAGACTGAATGTTTGAG
TGGATTTATGGGTATCGAGTTAAACATGGGAAATTTCTGGATCCTTGGTGATGTTTTCAT
CTCTGCTTACTATACTGTATTCGATTTTGGTAATAAACAAGTNGGTTTCGCAACTGCCAT
TCAAGGTTAGATTTTTTAATTATTTATATTTAAGATAGAAAGNAAACNAAAATAGAACAA
Length of connected seq. 1190
Full length Seq ID -
Full length Seq. -
Length of full length seq. -