VFH475
Library VF
(Link to library)
Clone ID VFH475
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U15456-1
Original site URL
Representative seq. ID VFH475P
(Link to Original site)
Representative DNA sequence
>VFH475 (VFH475Q) /CSM/VF/VFH4-D/VFH475Q.Seq.d/
AATTGTCACCTGACTTACATAAAAGAAAATGAAATTACTTATTTTAACTTTATTTTTAGC
TACTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAGGCTTCAAGAGA
ATCTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTAC
CACAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTAC
CCCAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATC
AAAGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGC
CTCAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTAT
GTCAGGTTTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAAXXXXXXXX
XXGGTGGTGAACTCTCATTCGGTTCAATCGATAACACCAAATACACTGGTGACATTACCT
ACGTCCCATTAACCAACGAAACCTATTGGGAATTCGTTATGGATGACTTTGCTATCGATG
GTCAATCAGCTGGTTTCTGTGGTACTACTTGTCACGCAATTTGCGATTCAGGTACATCAC
TCATTGCTGGTCCAATGGCTGATATTACTGCCCTCAATGAAAAATTAGGTGCTGTCATCT
TAAATGGTGAAGGTGTCTTACTCTGATTGTAGCGTTATCAACACCTTACCAAATGTTACC
ATCACCGTTGCTGGTCGTGAATTTGTTTTAACTNCCAAAAGAATACGTTTTAGAAGTTAC
TGAGTTCGGAAAGNACTGAATGTTTGAGTGGATTTATGGGTATCGAGTTAAACATGGGAA
ATTTNCTGGATNCCTTGGTGATGTTTTCATACTACTGCTTACTATACTGTATTNCGNATT
TTGGTAATAAACAAGTTGGTTTNCGCAACTGCCATGCAAGAGTTAANTTTTTTAAT
sequence update 2001. 6. 1
Translated Amino Acid sequence
ivt*lt*KKMKLLILTLFLATIVLAQALTVPLNFHQASRESRRRVPQKWSNRLSALNAGT
TIPISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGA
SSTYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVK---

---GGELSFGSIDNTKYTGDITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCHAICDSG
TSLIAGPMADITALNEKLGAVILNGEGVLL*l*ryqhltkcyhhrcws*icfnxqkntf*
kllssexteclsgfmgielnmgnxldxlvmfsyycllycixxfgnkqvgxrncharvxff
n


Translated Amino Acid sequence (All Frames)
Frame A:
nchltyikeneityfnfifsyycfsssfnstiklpsgfkri*kksstkmvkqiicsqcwy
hnpnfrf*rcsilwchyhwyprsslqssfrywfiqlvdsikemsnhcccm*ft*qi*qrc
lkhicrqrn*fhhpir*wcyvrfclsrfrhcwfinc*---

---GGELSFGSIDNTKYTGDITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCHAICDSG
TSLIAGPMADITALNEKLGAVILNGEGVLL*l*ryqhltkcyhhrcws*icfnxqkntf*
kllssexteclsgfmgielnmgnxldxlvmfsyycllycixxfgnkqvgxrncharvxff
n

Frame B:
ivt*lt*KKMKLLILTLFLATIVLAQALTVPLNFHQASRESRRRVPQKWSNRLSALNAGT
TIPISDFEDAQYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGA
SSTYVANGTDFTIQYGSGAMSGFVSQDSVTVGSLTVK---

---vvnshsvqsitpntlvtlptsh*ptkpignslwmtllsmvnqlvsvvllvtqfaiqv
hhsllvqwlillpsmkn*vlss*mvkvsysdcsvintlpnvtitvagrefvltxkrirfr
sy*vrkxlnv*vdlwvss*tweixwxpw*cfhttayytvxrilvinklvxatamqelxfl

Frame C:
lspdlhkrk*nylf*lyf*lllf*lkl*qyh*tsirlqenleeefhkngqtdyllsmlvp
qsqfqilkmlntmvplplvpqvkpsk*fsilvhptcgfhqrnvqslllhviyitnitavp
qahmsptelispsntvvvlcqvlslkipsllvh*ll---

---w*tlirfnr*hqihw*hylrpinqrnllgiryg*lcyrwsiswflwyylsrnlrfry
ithcwsng*yycpq*kircchlkw*rcltlivalstpyqmlpspllvvnlf*lpkeyvle
vtefgkx*mfewiygyrvkhgkfxgxlgdvfillltilyxxfw**tswfxqlpcks*xf*

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VFH475 (VFH475Q) /CSM/VF/VFH4-D/VFH475Q.Seq.d/ 1907 0.0
VFE111 (VFE111Q) /CSM/VF/VFE1-A/VFE111Q.Seq.d/ 936 0.0
VFO736 (VFO736Q) /CSM/VF/VFO7-B/VFO736Q.Seq.d/ 928 0.0
VFO719 (VFO719Q) /CSM/VF/VFO7-A/VFO719Q.Seq.d/ 928 0.0
VFO689 (VFO689Q) /CSM/VF/VFO6-D/VFO689Q.Seq.d/ 928 0.0
VFO526 (VFO526Q) /CSM/VF/VFO5-B/VFO526Q.Seq.d/ 928 0.0
VFO331 (VFO331Q) /CSM/VF/VFO3-B/VFO331Q.Seq.d/ 928 0.0
VFN875 (VFN875Q) /CSM/VF/VFN8-D/VFN875Q.Seq.d/ 928 0.0
VFN873 (VFN873Q) /CSM/VF/VFN8-D/VFN873Q.Seq.d/ 928 0.0
VFN780 (VFN780Q) /CSM/VF/VFN7-D/VFN780Q.Seq.d/ 928 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

Y16962|Y16962.1 Dictyostelium discoideum mRNA for cathepsin D. 914 0.0 9
AJ243946|AJ243946.1 Dictyostelium discoideum ctsD gene for cathepsin D, exons 1 to 2. 755 0.0 10
AX059531|AX059531.1 Sequence 264 from Patent WO0055325. 62 2e-05 1
AL161500|AL161500.2 Arabidopsis thaliana DNA chromosome 4, contig fragment No. 12. 62 2e-05 1
BE525959|BE525959.1 M64C07STM Arabidopsis developing seed Arabidopsis thaliana cDNA clone 600034526R1 5', mRNA sequence. 62 2e-05 1
AL762930|AL762930.1 Arabidopsis thaliana T-DNA flanking sequence GK-030C11-011522. 62 2e-05 1
CD825453|CD825453.1 BN25.060N03F011129 BN25 Brassica napus cDNA clone BN25060N03, mRNA sequence. 62 2e-05 1
AF076243|AF076243.1 Arabidopsis thaliana BAC T26N6 from chromosome IV at 19.3 cM, complete sequence. 62 2e-05 1
AV567523|AV567523.1 Arabidopsis thaliana cDNA clone:SQL15g03F, 3' end. 62 2e-05 1
CB264640|CB264640.1 48-E014661-035-002-P12-T7R MPIZ-ADIS-035 Arabidopsis thaliana cDNA clone MPIZp2000P122Q 5-PRIME, mRNA sequence. 62 2e-05 1
dna update 2004. 1.11
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

DQ010007_1(DQ010007|pid:none) Bombyx mori CathD mRNA, complete c... 144 5e-33
AB106552_1(AB106552|pid:none) Todarodes pacificus tpaD mRNA for ... 144 6e-33
AF454831_1(AF454831|pid:none) Apriona germari cathepsin D mRNA, ... 141 4e-32
EF070454_1(EF070454|pid:none) Maconellicoccus hirsutus clone WHM... 139 1e-31
EF213114_1(EF213114|pid:none) Penaeus monodon cathepsin D mRNA, ... 139 1e-31
FJ654712_1(FJ654712|pid:none) Chrysomela tremulae aspartic prote... 138 3e-31
AJ417035_1(AJ417035|pid:none) Pleurotus ostreatus partial mRNA f... 137 6e-31
(Q03168) RecName: Full=Lysosomal aspartic protease; EC=... 136 1e-30
CP001577_36(CP001577|pid:none) Micromonas sp. RCC299 chromosome ... 134 7e-30
(O93428) RecName: Full=Cathepsin D; EC=3.4.23.5; Flags:... 134 7e-30
protein update 2009. 6.20
PSORT

psg: 0.98 gvh: 0.77 alm: 0.46 top: 0.57 tms: 0.00 mit: 0.36 mip: 0.08
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

48.0 %: extracellular, including cell wall
20.0 %: mitochondrial
12.0 %: cytoplasmic
8.0 %: vacuolar
4.0 %: Golgi
4.0 %: nuclear
4.0 %: endoplasmic reticulum

>> prediction for VFH475 is exc

5' end seq. ID VFH475F
5' end seq.
>VFH475F.Seq
AATTGTCACCTGACTTACATAAAAGAAAATGAAATTACTTATTTTAACTTTATTTTTAGC
TACTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAGGCTTCAAGAGA
ATCTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTAC
CACAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTAC
CCCAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATC
AAAGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGC
CTCAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTAT
GTCAGGTTTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAA--------
--
Length of 5' end seq. 472
3' end seq. ID VFH475Z
3' end seq.
>VFH475Z.Seq
----------GGTGGTGAACTCTCATTCGGTTCAATCGATAACACCAAATACACTGGTGA
CATTACCTACGTCCCATTAACCAACGAAACCTATTGGGAATTCGTTATGGATGACTTTGC
TATCGATGGTCAATCAGCTGGTTTCTGTGGTACTACTTGTCACGCAATTTGCGATTCAGG
TACATCACTCATTGCTGGTCCAATGGCTGATATTACTGCCCTCAATGAAAAATTAGGTGC
TGTCATCTTAAATGGTGAAGGTGTCTTACTCTGATTGTAGCGTTATCAACACCTTACCAA
ATGTTACCATCACCGTTGCTGGTCGTGAATTTGTTTTAACTNCCAAAAGAATACGTTTTA
GAAGTTACTGAGTTCGGAAAGNACTGAATGTTTGAGTGGATTTATGGGTATCGAGTTAAA
CATGGGAAATTTNCTGGATNCCTTGGTGATGTTTTCATACTACTGCTTACTATACTGTAT
TNCGNATTTTGGTAATAAACAAGTTGGTTTNCGCAACTGCCATGCAAGAGTTAANTTTTT
TAAT
Length of 3' end seq. 534
Connected seq. ID VFH475P
Connected seq.
>VFH475P.Seq
AATTGTCACCTGACTTACATAAAAGAAAATGAAATTACTTATTTTAACTTTATTTTTAGC
TACTATTGTTTTAGCTCAAGCTTTAACAGTACCATTAAACTTCCATCAGGCTTCAAGAGA
ATCTAGAAGAAGAGTTCCACAAAAATGGTCAAACAGATTATCTGCTCTCAATGCTGGTAC
CACAATCCCAATTTCAGATTTTGAAGATGCTCAATACTATGGTGCCATTACCATTGGTAC
CCCAGGTCAAGCCTTCAAAGTAGTTTTCGATACTGGTTCATCCAACTTGTGGATTCCATC
AAAGAAATGTCCAATCACTGTTGTTGCATGTGATTTACATAACAAATATAACAGCGGTGC
CTCAAGCACATATGTCGCCAACGGAACTGATTTCACCATCCAATACGGTAGTGGTGCTAT
GTCAGGTTTTGTCTCTCAAGATTCCGTCACTGTTGGTTCATTAACTGTTAAA--------
--GGTGGTGAACTCTCATTCGGTTCAATCGATAACACCAAATACACTGGTGACATTACCT
ACGTCCCATTAACCAACGAAACCTATTGGGAATTCGTTATGGATGACTTTGCTATCGATG
GTCAATCAGCTGGTTTCTGTGGTACTACTTGTCACGCAATTTGCGATTCAGGTACATCAC
TCATTGCTGGTCCAATGGCTGATATTACTGCCCTCAATGAAAAATTAGGTGCTGTCATCT
TAAATGGTGAAGGTGTCTTACTCTGATTGTAGCGTTATCAACACCTTACCAAATGTTACC
ATCACCGTTGCTGGTCGTGAATTTGTTTTAACTNCCAAAAGAATACGTTTTAGAAGTTAC
TGAGTTCGGAAAGNACTGAATGTTTGAGTGGATTTATGGGTATCGAGTTAAACATGGGAA
ATTTNCTGGATNCCTTGGTGATGTTTTCATACTACTGCTTACTATACTGTATTNCGNATT
TTGGTAATAAACAAGTTGGTTTNCGCAACTGCCATGCAAGAGTTAANTTTTTTAAT
Length of connected seq. 1006
Full length Seq ID -
Full length Seq. -
Length of full length seq. -