VFB681
Library VF
(Link to library)
Clone ID VFB681
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16605-1
Original site URL
Representative seq. ID VFB681P
(Link to Original site)
Representative DNA sequence
>VFB681 (VFB681Q) /CSM/VF/VFB6-D/VFB681Q.Seq.d/
TGGCCTACTGGAATTAAATATATATATACTAAAAATGAGAGTTTTATCATTCCTTTGTTT
ATTATTAGTTAGCTACGCTTCTGCTAAACAACAATTCTCTGAATTACAATACAGAAATGC
TTTCACCAACTGGATGCAAGCTCACCAAAGAACTTATTCCTCTGAAGAATTTAATGCTCG
TTATCAAATCTTCAAATCCAATATGGATTATGTACACCAATGGAATTCAAAAGGTGGTGA
AACCGTTTTGGGTTTAAATGTTTTCGCTGATATTACCAACCAAGAATATAGAACTACCTA
CTTGGGTACCCCATTCGATGGTTCAGCCCTCATTGGTACTGAAGAAGAGAAAATCTTCTC
CACCCCAGCCCCAACTGTTGATTGGAGAGCTCAAGGTGCTGTCACACCAATTAAAAATCA
AGGTCAATGTGGTGGCTGCTGGTCATTCTCAACCACTGGTTCAACTGAAGGTGCTCACTT
TATTGCATCTGGAACAAAAAAAGATTTAGTTTCATTATCTGAACAAAACTTGATCGATTG
TTCAAAATCATACGGTAACAATGGTTGTGAAGGTGGTTTAATGACTCTTGCCTTTGAATA
TATCATCAATAACAAAGGTATTGATACTGAAAXXXXXXXXXXTTCAAAACATCAAACATT
GGTGCTCAAATTGTTTCATACCAAAATGTTACCTCTGGTTCTGAAGCTTCATTACAATCA
GCATCAAACAATGCTCCAGTCTCTGTTGCAATTGATGCTTCAAATGAATCCTTTCCAATT
ATATGAATCAGGTATCTACTATGAACCAGCATGTTCTCCAACCCAACTTGATCATGGTGT
TTTAGTTGTTGGTTATGGTTCAGGTTCAAGTTCATCATCTGGTTCATCATCTGGTAAATC
ATCATCATCATCATCAACTGGTGGTAAAACTTCATCCTCATCATCATCAGGTAAAGCTTC
ATCATCATCATCAGGCAAAGCTTCATCATCATCATCATCAGGTAAAACTTCATCTGCTGC
TTCATCAACCTCTGGTTCTCAATCAGGTTCCCAATCAGGTAGCCAATCAGGCCAATCCAC
CGGTTCACAATCAGGTCAAACCTCTGCTTCTGGTCAAGCATCAGCATCAGGTTCTGGTTC
TGGCTCAGGTTCAGGTTCAGGTTCAGGTTCAGGCTCAGGTGCTGTTGAGGCCTCATCTGG
TAACTACTGGATCGTTAAAAACTCATGGGGTACTTCATGGGGTATGGATGGTTACATTTT
TATGAGCAAAGATAGAAATAACAATTGTGGTATCGCAACAATGGCTTCTTTCCCAACTGC
CTCATCAAATTAAAATTTTATTTTTAATTGTCCGACTA
sequence update 2001. 6. 1
Translated Amino Acid sequence
GLLELNIYILKMRVLSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNAR
YQIFKSNMDYVHQWNSKGGETVLGLNVFADITNQEYRTTYLGTPFDGSALIGTEEEKIFS
TPAPTVDWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDC
SKSYGNNGCEGGLMTLAFEYIINNKGIDTE---

---SKHQTLVLKLFHTKMLPLVLKLHYNQHQTMLQSLLQLMLQMNPFQLYESGIYYEPAC
SPTQLDHGVLVVGYGSGSSSSSGSSSGKSSSSSSTGGKTSSSSSSGKASSSSSGKASSSS
SSGKTSSAASSTSGSQSGSQSGSQSGQSTGSQSGQTSASGQASASGSGSGSGSGSGSGSG
SGAVEASSGNYWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMASFPTASSN*nfifncp
t


Translated Amino Acid sequence (All Frames)
Frame A:
wptgikyiytknesfiiplfiis*lrfc*ttil*itiqkcfhqldasspknlfl*ri*cs
lsnlqiqyglctpmefkrw*nrfgfkcfr*yyqpri*nyllgypirwfsphwy*rrenll
hpspnc*lessrcchtn*ksrsmwwllvilnhwfn*rcslyciwnkkrfsfii*tkldrl
fkiir*qwl*rwfndscl*iyhq*qry*y*---

---fktsnigaqivsyqnvtsgseaslqsasnnapvsvaidasnesfpii*iryll*tsm
fsnpt*swcfscwlwfrfkfiiwfiiw*iiiiiinww*nfiliiir*sfiiiirqsfiii
iir*nficcfinlwfsirfpir*pirpihrftirsnlcfwssisirfwfwlrfrfrfrfr
lrcc*gliw*lldr*klmgyfmgygwlhfyeqr*k*qlwyrnngffpncliklkfyf*ls
d

Frame B:
GLLELNIYILKMRVLSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNAR
YQIFKSNMDYVHQWNSKGGETVLGLNVFADITNQEYRTTYLGTPFDGSALIGTEEEKIFS
TPAPTVDWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDC
SKSYGNNGCEGGLMTLAFEYIINNKGIDTE---

---SKHQTLVLKLFHTKMLPLVLKLHYNQHQTMLQSLLQLMLQMNPFQLYESGIYYEPAC
SPTQLDHGVLVVGYGSGSSSSSGSSSGKSSSSSSTGGKTSSSSSSGKASSSSSGKASSSS
SSGKTSSAASSTSGSQSGSQSGSQSGQSTGSQSGQTSASGQASASGSGSGSGSGSGSGSG
SGAVEASSGNYWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMASFPTASSN*nfifncp
t

Frame C:
aywn*iyiy*k*efyhsfvyy*latlllnnnslnyntemlsptgckltkeliplknlmlv
ikssnpiwimytngiqkvvkpfwv*mfslilptknielptwvphsmvqpslvlkkrkssp
pqpqlligelkvlshqlkikvnvvaaghsqplvqlkvltllhleqkki*fhylnkt*siv
qnhtvtmvvkvv**llplnissitkvlilk---

---qnikhwcsncfipkcylwf*sfitisikqcsslccn*cfk*ilsnymnqvstmnqhv
lqpnlimvf*llvmvqvqvhhlvhhlvnhhhhhqlvvklhphhhqvklhhhhqaklhhhh
hqvklhlllhqplvlnqvpnqvanqanppvhnqvkplllvkhqhqvlvlaqvqvqvqvqa
qvllrphlvttgslkthgvlhgvwmvtfl*akieitivvsqqwllsqlphqikilflivr
l

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VFB681 (VFB681Q) /CSM/VF/VFB6-D/VFB681Q.Seq.d/ 1622 0.0
VFN628 (VFN628Q) /CSM/VF/VFN6-B/VFN628Q.Seq.d/ 1189 0.0
VFN294 (VFN294Q) /CSM/VF/VFN2-D/VFN294Q.Seq.d/ 1189 0.0
VFM804 (VFM804Q) /CSM/VF/VFM8-A/VFM804Q.Seq.d/ 1189 0.0
VFL463 (VFL463Q) /CSM/VF/VFL4-C/VFL463Q.Seq.d/ 1189 0.0
VFL389 (VFL389Q) /CSM/VF/VFL3-D/VFL389Q.Seq.d/ 1189 0.0
VFK680 (VFK680Q) /CSM/VF/VFK6-D/VFK680Q.Seq.d/ 1189 0.0
VFK619 (VFK619Q) /CSM/VF/VFK6-A/VFK619Q.Seq.d/ 1189 0.0
VFK133 (VFK133Q) /CSM/VF/VFK1-B/VFK133Q.Seq.d/ 1189 0.0
VFI606 (VFI606Q) /CSM/VF/VFI6-A/VFI606Q.Seq.d/ 1189 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

L36204|L36204.1 Dictyostelium discoideum cysteine proteinase (CP4) mRNA, complete cds. 922 0.0 7
U72746|U72746.1 Dictyostelium discoideum cysteine proteinase (cprG) mRNA, complete cds. 145 e-145 9
L36205|L36205.1 Dictyostelium discoideum cysteine proteinase CP5 mRNA, complete cds. 121 e-139 10
AC117072|AC117072.2 Dictyostelium discoideum chromosome 2 map 3323568-3470138 strain AX4, complete sequence. 121 e-119 13
U72745|U72745.1 Dictyostelium discoideum cysteine proteinase (cprF) mRNA, complete cds. 115 e-119 8
X03344|X03344.1 Dictyostelium discoideum mRNA for cysteine proteinase 2. 76 1e-65 8
M16039|M16039.1 Dictyostelium discoideum pst-cath gene encoding pst-cathepsin, complete cds. 74 2e-52 10
X02407|X02407.1 D.discoideum mRNA for cysteine proteinase 1. 70 2e-18 3
BJ171907|BJ171907.1 Physcomitrella patens subsp. patens cDNA clone:pph30i20, 3' end, single read. 68 3e-12 2
AJ489298|AJ489298.1 Aphis gossypii mRNA for putative cathepsin L (catL gene). 56 4e-10 3
dna update 2003. 9. 5
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

L36204_1(L36204|pid:none) Dictyostelium discoideum cysteine prot... 449 e-125
(Q94504) RecName: Full=Cysteine proteinase 7; EC=3.4.22... 357 7e-97
(Q94503) RecName: Full=Cysteine proteinase 6; EC=3.4.22... 342 2e-92
(P54640) RecName: Full=Cysteine proteinase 5; EC=3.4.22... 340 5e-92
L36205_1(L36205|pid:none) Dictyostelium discoideum cysteine prot... 334 5e-90
(P04989) RecName: Full=Cysteine proteinase 2; EC=3.4.22... 273 8e-72
EF053509_1(EF053509|pid:none) Acanthamoeba castellanii cysteine ... 223 9e-57
AC117076_20(AC117076|pid:none) Dictyostelium discoideum chromoso... 194 6e-48
AY336797_1(AY336797|pid:none) Rhipicephalus haemaphysaloides hae... 148 6e-46
AY220615_1(AY220615|pid:none) Hydra vulgaris cathepsin L precurs... 182 3e-44
protein update 2009. 6.17
PSORT

psg: 0.81 gvh: 0.87 alm: 0.26 top: 0.47 tms: 0.07 mit: 0.24 mip: 0.08
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 1.00 tyr: 0.00 leu: 0.71 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 1.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 0.00

24.0 %: nuclear
20.0 %: cytoplasmic
20.0 %: endoplasmic reticulum
12.0 %: Golgi
12.0 %: mitochondrial
4.0 %: plasma membrane
4.0 %: vesicles of secretory system
4.0 %: peroxisomal

>> prediction for VFB681 is nuc

5' end seq. ID VFB681F
5' end seq.
>VFB681F.Seq
TGGCCTACTGGAATTAAATATATATATACTAAAAATGAGAGTTTTATCATTCCTTTGTTT
ATTATTAGTTAGCTACGCTTCTGCTAAACAACAATTCTCTGAATTACAATACAGAAATGC
TTTCACCAACTGGATGCAAGCTCACCAAAGAACTTATTCCTCTGAAGAATTTAATGCTCG
TTATCAAATCTTCAAATCCAATATGGATTATGTACACCAATGGAATTCAAAAGGTGGTGA
AACCGTTTTGGGTTTAAATGTTTTCGCTGATATTACCAACCAAGAATATAGAACTACCTA
CTTGGGTACCCCATTCGATGGTTCAGCCCTCATTGGTACTGAAGAAGAGAAAATCTTCTC
CACCCCAGCCCCAACTGTTGATTGGAGAGCTCAAGGTGCTGTCACACCAATTAAAAATCA
AGGTCAATGTGGTGGCTGCTGGTCATTCTCAACCACTGGTTCAACTGAAGGTGCTCACTT
TATTGCATCTGGAACAAAAAAAGATTTAGTTTCATTATCTGAACAAAACTTGATCGATTG
TTCAAAATCATACGGTAACAATGGTTGTGAAGGTGGTTTAATGACTCTTGCCTTTGAATA
TATCATCAATAACAAAGGTATTGATACTGAAA----------
Length of 5' end seq. 632
3' end seq. ID VFB681Z
3' end seq.
>VFB681Z.Seq
----------TTCAAAACATCAAACATTGGTGCTCAAATTGTTTCATACCAAAATGTTAC
CTCTGGTTCTGAAGCTTCATTACAATCAGCATCAAACAATGCTCCAGTCTCTGTTGCAAT
TGATGCTTCAAATGAATCCTTTCCAATTATATGAATCAGGTATCTACTATGAACCAGCAT
GTTCTCCAACCCAACTTGATCATGGTGTTTTAGTTGTTGGTTATGGTTCAGGTTCAAGTT
CATCATCTGGTTCATCATCTGGTAAATCATCATCATCATCATCAACTGGTGGTAAAACTT
CATCCTCATCATCATCAGGTAAAGCTTCATCATCATCATCAGGCAAAGCTTCATCATCAT
CATCATCAGGTAAAACTTCATCTGCTGCTTCATCAACCTCTGGTTCTCAATCAGGTTCCC
AATCAGGTAGCCAATCAGGCCAATCCACCGGTTCACAATCAGGTCAAACCTCTGCTTCTG
GTCAAGCATCAGCATCAGGTTCTGGTTCTGGCTCAGGTTCAGGTTCAGGTTCAGGTTCAG
GCTCAGGTGCTGTTGAGGCCTCATCTGGTAACTACTGGATCGTTAAAAACTCATGGGGTA
CTTCATGGGGTATGGATGGTTACATTTTTATGAGCAAAGATAGAAATAACAATTGTGGTA
TCGCAACAATGGCTTCTTTCCCAACTGCCTCATCAAATTAAAATTTTATTTTTAATTGTC
CGACTA
Length of 3' end seq. 716
Connected seq. ID VFB681P
Connected seq.
>VFB681P.Seq
TGGCCTACTGGAATTAAATATATATATACTAAAAATGAGAGTTTTATCATTCCTTTGTTT
ATTATTAGTTAGCTACGCTTCTGCTAAACAACAATTCTCTGAATTACAATACAGAAATGC
TTTCACCAACTGGATGCAAGCTCACCAAAGAACTTATTCCTCTGAAGAATTTAATGCTCG
TTATCAAATCTTCAAATCCAATATGGATTATGTACACCAATGGAATTCAAAAGGTGGTGA
AACCGTTTTGGGTTTAAATGTTTTCGCTGATATTACCAACCAAGAATATAGAACTACCTA
CTTGGGTACCCCATTCGATGGTTCAGCCCTCATTGGTACTGAAGAAGAGAAAATCTTCTC
CACCCCAGCCCCAACTGTTGATTGGAGAGCTCAAGGTGCTGTCACACCAATTAAAAATCA
AGGTCAATGTGGTGGCTGCTGGTCATTCTCAACCACTGGTTCAACTGAAGGTGCTCACTT
TATTGCATCTGGAACAAAAAAAGATTTAGTTTCATTATCTGAACAAAACTTGATCGATTG
TTCAAAATCATACGGTAACAATGGTTGTGAAGGTGGTTTAATGACTCTTGCCTTTGAATA
TATCATCAATAACAAAGGTATTGATACTGAAA----------TTCAAAACATCAAACATT
GGTGCTCAAATTGTTTCATACCAAAATGTTACCTCTGGTTCTGAAGCTTCATTACAATCA
GCATCAAACAATGCTCCAGTCTCTGTTGCAATTGATGCTTCAAATGAATCCTTTCCAATT
ATATGAATCAGGTATCTACTATGAACCAGCATGTTCTCCAACCCAACTTGATCATGGTGT
TTTAGTTGTTGGTTATGGTTCAGGTTCAAGTTCATCATCTGGTTCATCATCTGGTAAATC
ATCATCATCATCATCAACTGGTGGTAAAACTTCATCCTCATCATCATCAGGTAAAGCTTC
ATCATCATCATCAGGCAAAGCTTCATCATCATCATCATCAGGTAAAACTTCATCTGCTGC
TTCATCAACCTCTGGTTCTCAATCAGGTTCCCAATCAGGTAGCCAATCAGGCCAATCCAC
CGGTTCACAATCAGGTCAAACCTCTGCTTCTGGTCAAGCATCAGCATCAGGTTCTGGTTC
TGGCTCAGGTTCAGGTTCAGGTTCAGGTTCAGGCTCAGGTGCTGTTGAGGCCTCATCTGG
TAACTACTGGATCGTTAAAAACTCATGGGGTACTTCATGGGGTATGGATGGTTACATTTT
TATGAGCAAAGATAGAAATAACAATTGTGGTATCGCAACAATGGCTTCTTTCCCAACTGC
CTCATCAAATTAAAATTTTATTTTTAATTGTCCGACTA
Length of connected seq. 1348
Full length Seq ID -
Full length Seq. -
Length of full length seq. -