VFF511
Library VF
Clone ID VFF511
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U15456-1
Original site URL
Representative seq. ID VFF511P
Representative DNA sequence
>VFF511 (VFF511Q) /CSM/VF/VFF5-A/VFF511Q.Seq.d/
ACCTGACTTACATAAAAGAAAATGAAATTACTTATTTTAACTTTATTTTTAGCTACTATT
GTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGAATCTAGA
AGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTACCACAATC
CCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTACCCCAGGT
CAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATCAAAGAAA
TGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGCCTCAAGC
ACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTATGTCAGGT
TTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATTATTCGCT
GAAGCCACTGCCGAACCAGGTATTGCTTTCGATTTCGCCAAATTCGATGGTATTTTAGGT
CTTGCTTTXXXXXXXXXXTTTCGATTTCGCCAAATTCGATGGTATTTTAGGTCTTGCTTT
CCAATCCATCTCTGTTAATTCAATTCCACCAGTCTTTTACAACATGTTATCACAAGGTTT
AGTTTCATCAACACTCTTCTCCTTCTGGTTATCAAGAACTCCAGGTGCCAACGGTGGTGA
ACTCTCATTCGGTTCAATCGATAACACCAAATACACTGGTGACATTACCTACGTCCCATT
AACCAACGAAACCTATTGGGAATTCGTTATGGATGACTTTGCTATCGATGGTCAATCAGC
TGGTTTCTGTGGTACTACTTGTCACGCAATTTGCGATTCAGGTACATCACTCATTGCTGG
TCCAATGGCTGATATTACTGCCCTCAATGAAAAATTAGGTGCTGTCATCTTAAATGGTGA
AGGTGTCTTCTCTGATTGTAGCGTTATCAACACCTTACCAAATGTTACCATCACCGTTGC
TGGTCGTGAATTTGTTTTAACTCCAAAAGAATACGTTTTAGAAGTTACTGAGTTCGGAAA
GACTGAATGTTTGAGTGGATTTATGGGTATCGAGTTAAACATGGGAAATTTCTGGATCCT
TGGTGATGTTTTCATCTCTGCTTACTATACTGTATTCGATTTTGGTAATAAACAAGTTGG
TTTCGCAACTGCCATTCAAGGTTAAATTTTTTTAATTAATTTATATTTAAGATAGAAATA
AAACTAAATAATAGAACAA
sequence update 2001. 6. 1
Translated Amino Acid sequence
t*lt*KKMKLLILTLFLATIVLAQALTVPLNFHQASRESRRRVPQKWSNRLSALNAGTTI
PISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGASS
TYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVKDQLFAEATAEPGIAFDFAKFDGILG
LA---

---FDFAKFDGILGLAFQSISVNSIPPVFYNMLSQGLVSSTLFSFWLSRTPGANGGELSF
GSIDNTKYTGDITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCHAICDSGTSLIAGPMA
DITALNEKLGAVILNGEGVFSDCSVINTLPNVTITVAGREFVLTPKEYVLEVTEFGKTEC
LSGFMGIELNMGNFWILGDVFISAYYTVFDFGNKQVGFATAIQG*iflinlylr*k*n*i
ieq


Translated Amino Acid sequence (All Frames)
Frame A:
t*lt*KKMKLLILTLFLATIVLAQALTVPLNFHQASRESRRRVPQKWSNRLSALNAGTTI
PISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGASS
TYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVKDQLFAEATAEPGIAFDFAKFDGILG
LA---

---frfrqirwyfrscfpihlc*fnstsllqhvitrfsfintllllviknsrcqrw*tli
rfnr*hqihw*hylrpinqrnllgiryg*lcyrwsiswflwyylsrnlrfryithcwsng
*yycpq*kircchlkw*rcll*l*ryqhltkcyhhrcws*icfnskrirfrsy*vrkd*m
fewiygyrvkhgkfldpw*cfhlcllycirfw**tswfrnchsrlnffn*fifkieikln
nrt

Frame B:
pdlhkrk*nylf*lyf*lllf*lkl*qyh*tsiklqenleeefhkngqtdyllsmlvpqs
qfqilkmlntmvplplvpqvkpsk*fsilvhptcgfhqrnvqslllhviyitnitavpqa
hmsptelispsntvvvlcqvlslkipsllvh*llkinyslkplpnqvllsispnsmvf*v
ll---

---FDFAKFDGILGLAFQSISVNSIPPVFYNMLSQGLVSSTLFSFWLSRTPGANGGELSF
GSIDNTKYTGDITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCHAICDSGTSLIAGPMA
DITALNEKLGAVILNGEGVFSDCSVINTLPNVTITVAGREFVLTPKEYVLEVTEFGKTEC
LSGFMGIELNMGNFWILGDVFISAYYTVFDFGNKQVGFATAIQG*iflinlylr*k*n*i
ieq

Frame C:
ltyikeneityfnfifsyycfsssfnstiklpssfkri*kksstkmvkqiicsqcwyhnp
nfrf*rcsilwchyhwyprsslqssfrywfiqlvdsikemsnhcccm*ft*qi*qrclkh
icrqrn*fhhpir*wcyvrfclsrfrhcwfinc*rsiir*shcrtrycfrfrqirwyfrs
cf---

---sispnsmvf*vllsnpslliqfhqsfttcyhkv*fhqhsspsgyqelqvptvvnshs
vqsitpntlvtlptsh*ptkpignslwmtllsmvnqlvsvvllvtqfaiqvhhsllvqwl
illpsmkn*vlss*mvkvsslivalstpyqmlpspllvvnlf*lqkntf*kllsserlnv
*vdlwvss*tweisgslvmfsslltilysilvinklvsqlpfkvkff*liyi*drnktk*
*n
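
The three forward-frame translations above can be reproduced approximately with a short script. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `translate` and `three_frames` are illustrative and are not part of the database tooling. Unlike the record, the sketch prints every residue in upper case (it does not mark the predicted ORF), and it emits `X` for any codon containing a non-ACGT character such as the gap placeholder.

```python
# Minimal sketch: translate a DNA string in the three forward reading frames.
# Assumption: standard genetic code (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the 64-entry codon table from the compact TCAG-ordered string.
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna, frame=0):
    """Translate dna from offset frame (0, 1 or 2); '*' marks a stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(frame, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        # Codons with anything other than A/C/G/T (e.g. X or -) become 'X'.
        protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)


def three_frames(dna):
    """Return the Frame A/B/C translations (offsets 0, 1 and 2)."""
    return {name: translate(dna, offset) for offset, name in enumerate("ABC")}


# First 30 nucleotides of the representative sequence in this record.
prefix = "ACCTGACTTACATAAAAGAAAATGAAATTA"
for name, peptide in three_frames(prefix).items():
    print(f"Frame {name}: {peptide}")
# Frame A: T*LT*KKMKL
# Frame B: PDLHKRK*N
# Frame C: LTYIKENEI
```

These prefixes agree, ignoring case, with the first residues listed for Frame A, Frame B and Frame C above.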

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VFF511 (VFF511Q) /CSM/VF/VFF5-A/VFF511Q.Seq.d/ 2434 0.0
VFM519 (VFM519Q) /CSM/VF/VFM5-A/VFM519Q.Seq.d/ 1388 0.0
VFL410 (VFL410Q) /CSM/VF/VFL4-A/VFL410Q.Seq.d/ 1388 0.0
VFL274 (VFL274Q) /CSM/VF/VFL2-D/VFL274Q.Seq.d/ 1388 0.0
VFL252 (VFL252Q) /CSM/VF/VFL2-C/VFL252Q.Seq.d/ 1388 0.0
VFG815 (VFG815Q) /CSM/VF/VFG8-A/VFG815Q.Seq.d/ 1388 0.0
VFF141 (VFF141Q) /CSM/VF/VFF1-B/VFF141Q.Seq.d/ 1388 0.0
VFD482 (VFD482Q) /CSM/VF/VFD4-D/VFD482Q.Seq.d/ 1388 0.0
VFD224 (VFD224Q) /CSM/VF/VFD2-A/VFD224Q.Seq.d/ 1388 0.0
VFD137 (VFD137Q) /CSM/VF/VFD1-B/VFD137Q.Seq.d/ 1388 0.0

own update 2004.12.25
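
For downstream use, the hit rows in the table above can be split into fields. This is a minimal sketch, assuming each row ends with exactly two whitespace-separated numeric columns (bit score, then E value), as in the CSM-cDNA and protein tables of this record; the DNA table carries an extra trailing N column and would need a three-column variant. The helper name `parse_hit` is hypothetical.

```python
def parse_hit(line):
    """Split one hit row into (description, bit score, E value)."""
    # Split from the right so spaces inside the description are preserved.
    description, score, evalue = line.rsplit(None, 2)
    return description, float(score), float(evalue)


# Two rows copied from the CSM-cDNA homology table of this record.
rows = [
    "VFF511 (VFF511Q) /CSM/VF/VFF5-A/VFF511Q.Seq.d/ 2434 0.0",
    "VFM519 (VFM519Q) /CSM/VF/VFM5-A/VFM519Q.Seq.d/ 1388 0.0",
]

for description, score, evalue in map(parse_hit, rows):
    clone_id = description.split()[0]
    print(clone_id, score, evalue)
```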
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

Y16962|Y16962.1 Dictyostelium discoideum mRNA for cathepsin D. 1324 0.0 2
AJ243946|AJ243946.1 Dictyostelium discoideum ctsD gene for cathepsin D, exons 1 to 2. 1324 0.0 3
E33916|E33916.1 Candida boidinii strain with lowered protease activity and utilization thereof as host for producing foreign protein. 36 1e-06 5
AL161500|AL161500.2 Arabidopsis thaliana DNA chromosome 4, contig fragment No. 12. 62 2e-05 1
AL765461|AL765461.1 Arabidopsis thaliana T-DNA flanking sequence GK-139E08-012875. 62 2e-05 1
AV567523|AV567523.1 Arabidopsis thaliana cDNA clone:SQL15g03F, 3' end. 62 2e-05 1
AL762930|AL762930.1 Arabidopsis thaliana T-DNA flanking sequence GK-030C11-011522. 62 2e-05 1
AF372974|AF372974.1 Arabidopsis thaliana AT4g04460/T26N6_7 mRNA, complete cds. 62 2e-05 1
CB264640|CB264640.1 48-E014661-035-002-P12-T7R MPIZ-ADIS-035 Arabidopsis thaliana cDNA clone MPIZp2000P122Q 5-PRIME, mRNA sequence. 62 2e-05 1
BE525959|BE525959.1 M64C07STM Arabidopsis developing seed Arabidopsis thaliana cDNA clone 600034526R1 5', mRNA sequence. 62 2e-05 1
dna update 2003. 9.21
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

AF454831_1(AF454831|pid:none) Apriona germari cathepsin D mRNA, ... 181 1e-88
DQ868657_1(DQ868657|pid:none) Cucumis sativus aspartic proteinas... 183 1e-80
(Q805F2) RecName: Full=Cathepsin E-B; EC=3.4.23.34; Fla... 188 5e-79
BC161297_1(BC161297|pid:none) Xenopus tropicalis hypothetical pr... 189 6e-79
(P16228) RecName: Full=Cathepsin E; EC=3.4.23.34; Flags... 188 9e-78
Y10928_1(Y10928|pid:none) M.musculus gene encoding cathepsin E. 184 4e-76
(P70269) RecName: Full=Cathepsin E; EC=3.4.23.34; Flags... 184 5e-76
EF676352_1(EF676352|pid:none) Picea sitchensis clone WS02732_P05... 159 2e-75
EF678577_1(EF678577|pid:none) Picea sitchensis clone WS02928_J09... 159 2e-75
FN357346_14(FN357346|pid:none) Schistosoma mansoni genome sequen... 162 7e-74
protein update 2009. 6.19
PSORT

psg: 0.98 gvh: 0.77 alm: 0.38 top: 0.57 tms: 0.00 mit: 0.36 mip: 0.08
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

44.0 %: extracellular, including cell wall
20.0 %: mitochondrial
16.0 %: nuclear
12.0 %: vacuolar
4.0 %: cytoplasmic
4.0 %: cytoskeletal

>> prediction for VFF511 is exc

5' end seq. ID VFF511F
5' end seq.
>VFF511F.Seq
ACCTGACTTACATAAAAGAAAATGAAATTACTTATTTTAACTTTATTTTTAGCTACTATT
GTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGAATCTAGA
AGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTACCACAATC
CCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTACCCCAGGT
CAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATCAAAGAAA
TGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGCCTCAAGC
ACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTATGTCAGGT
TTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATTATTCGCT
GAAGCCACTGCCGAACCAGGTATTGCTTTCGATTTCGCCAAATTCGATGGTATTTTAGGT
CTTGCTTT----------
Length of 5' end seq. 548
3' end seq. ID VFF511Z
3' end seq.
>VFF511Z.Seq
----------TTTCGATTTCGCCAAATTCGATGGTATTTTAGGTCTTGCTTTCCAATCCA
TCTCTGTTAATTCAATTCCACCAGTCTTTTACAACATGTTATCACAAGGTTTAGTTTCAT
CAACACTCTTCTCCTTCTGGTTATCAAGAACTCCAGGTGCCAACGGTGGTGAACTCTCAT
TCGGTTCAATCGATAACACCAAATACACTGGTGACATTACCTACGTCCCATTAACCAACG
AAACCTATTGGGAATTCGTTATGGATGACTTTGCTATCGATGGTCAATCAGCTGGTTTCT
GTGGTACTACTTGTCACGCAATTTGCGATTCAGGTACATCACTCATTGCTGGTCCAATGG
CTGATATTACTGCCCTCAATGAAAAATTAGGTGCTGTCATCTTAAATGGTGAAGGTGTCT
TCTCTGATTGTAGCGTTATCAACACCTTACCAAATGTTACCATCACCGTTGCTGGTCGTG
AATTTGTTTTAACTCCAAAAGAATACGTTTTAGAAGTTACTGAGTTCGGAAAGACTGAAT
GTTTGAGTGGATTTATGGGTATCGAGTTAAACATGGGAAATTTCTGGATCCTTGGTGATG
TTTTCATCTCTGCTTACTATACTGTATTCGATTTTGGTAATAAACAAGTTGGTTTCGCAA
CTGCCATTCAAGGTTAAATTTTTTTAATTAATTTATATTTAAGATAGAAATAAAACTAAA
TAATAGAACAA
Length of 3' end seq. 721
Connected seq. ID VFF511P
Connected seq.
>VFF511P.Seq
ACCTGACTTACATAAAAGAAAATGAAATTACTTATTTTAACTTTATTTTTAGCTACTATT
GTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAAGCTTCAAGAGAATCTAGA
AGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTACCACAATC
CCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTACCCCAGGT
CAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATCAAAGAAA
TGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGCCTCAAGC
ACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTATGTCAGGT
TTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAAGATCAATTATTCGCT
GAAGCCACTGCCGAACCAGGTATTGCTTTCGATTTCGCCAAATTCGATGGTATTTTAGGT
CTTGCTTT----------TTTCGATTTCGCCAAATTCGATGGTATTTTAGGTCTTGCTTT
CCAATCCATCTCTGTTAATTCAATTCCACCAGTCTTTTACAACATGTTATCACAAGGTTT
AGTTTCATCAACACTCTTCTCCTTCTGGTTATCAAGAACTCCAGGTGCCAACGGTGGTGA
ACTCTCATTCGGTTCAATCGATAACACCAAATACACTGGTGACATTACCTACGTCCCATT
AACCAACGAAACCTATTGGGAATTCGTTATGGATGACTTTGCTATCGATGGTCAATCAGC
TGGTTTCTGTGGTACTACTTGTCACGCAATTTGCGATTCAGGTACATCACTCATTGCTGG
TCCAATGGCTGATATTACTGCCCTCAATGAAAAATTAGGTGCTGTCATCTTAAATGGTGA
AGGTGTCTTCTCTGATTGTAGCGTTATCAACACCTTACCAAATGTTACCATCACCGTTGC
TGGTCGTGAATTTGTTTTAACTCCAAAAGAATACGTTTTAGAAGTTACTGAGTTCGGAAA
GACTGAATGTTTGAGTGGATTTATGGGTATCGAGTTAAACATGGGAAATTTCTGGATCCT
TGGTGATGTTTTCATCTCTGCTTACTATACTGTATTCGATTTTGGTAATAAACAAGTTGG
TTTCGCAACTGCCATTCAAGGTTAAATTTTTTTAATTAATTTATATTTAAGATAGAAATA
AAACTAAATAATAGAACAA
Length of connected seq. 1269
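
The reported lengths can be cross-checked with a few lines of code. This is a minimal sketch, assuming the reported lengths count nucleotides only and exclude the ten-character gap placeholder (written as `-` in the end and connected sequences and as `X` in the representative sequence); the helper name `ungapped_length` is illustrative.

```python
def ungapped_length(seq, gap_chars="-X"):
    """Count sequence characters, skipping whitespace and gap placeholders."""
    return sum(1 for ch in seq if not ch.isspace() and ch not in gap_chars)


# Last line of the 5' end sequence and first line of the 3' end sequence.
five_prime_tail = "CTTGCTTT----------"
three_prime_head = "----------TTTCGATTTCGCCAAATTCGATGGTATTTTAGGTCTTGCTTTCCAATCCA"

print(ungapped_length(five_prime_tail))   # 8
print(ungapped_length(three_prime_head))  # 50

# Lengths as reported in this record: 5' end, 3' end, connected.
len_5, len_3, len_connected = 548, 721, 1269
print(len_5 + len_3 == len_connected)     # True
```

Applied to the complete sequences, the same function would be expected to return the three reported lengths if the assumption about gap characters holds.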
Full length Seq ID -
Full length Seq. -
Length of full length seq. -