VFB856
Library VF
(Link to library)
Clone ID VFB856
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16605-1
Original site URL
Representative seq. ID VFB856P
(Link to Original site)
Representative DNA sequence
>VFB856 (VFB856Q) /CSM/VF/VFB8-C/VFB856Q.Seq.d/
ACCATCAATTAAATATATATATACTAAAAATGAGAGTTTTATCATTCCTTTGTTTATTAT
TAGTTAGCTACGCTTCTGCTAAACAACAATTCTCTGAATTACAATACAGAAATGCTTTCA
CCAACTGGATGCAAGCTCACCAAAGAACTTATTCCTCTGAAGAATTTAATGCTCGTTATC
AAATCTTCAAATCCAATATGGATTATGTACACCAATGGAATTCAAAAGGTGGTGAAACCG
TTTTGGGTTTAAATGTTTTCGCTGATATTACCAACCAAGAATATAGAACTACCTACTTGG
GTACCCCATTCGATGGTTCAGCCCTCATTGGTACTGAAGAAGAGAAAATCTTCTCCACCC
CAGCCCCAACTGTTGATTGGAGAGCTCAAGGTGCTGTCACACCAATTAAAAATCAAGGTC
AATGTGGTGGCTGCTGGTCATTCTCAACCACTGGTTCAACTGAAGGTGCTCACTTTATTG
CATCTGGAACAAAAAAAGATTTAGTTTCATTATCTGAACAAAACTTGATCGATTGTTCAA
AATCATACGGTAXXXXXXXXXXTGGTTATGGTTCAGGTTCAAGTTCATCATCTGGTTCAT
CATCTGGTAAATCATCATCATCATCATCAGGCTGGGTGGTAAAACTTCATCCTCATCATC
ATCAGGTAAAGCTTCATCATCATCATCAGGCAAAGCTTCATCATCATCATCATCAGGTAA
AACTTCATCTGCTGCTTCATCAACCTCTGGTTCTCAATCAGGTTCCCAATCAGGTAGCCA
ATCAGGCCAATCCACCGGTTCACAATCAGGTCAAACCTCTGCTTCTGGTCAAGCATCAGC
ATCAGGTTCTGGTTCTGGCTCAGGTTCAGGTTCAGGTTCAGGTTCAGGCTCAGGTGCTGT
TGAGGCCTCATCTGGTAACTACTGGATCGTTAAAAACTCATGGGGTACTTCATGGGGTAT
GGATGGTTACATTTTTATGAGCAAAGATAGAAATAACAATTGTGGTATCGCAACAATGGC
TTCTTTCCCAACTGCCTCATCAAATTAAAATTTTATTTTTAATTGTGCCGACTAACATTA
ATTNGNATTTGTAATAA
sequence update 2001. 6. 1
Translated Amino Acid sequence
HQLNIYILKMRVLSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQ
IFKSNMDYVHQWNSKGGETVLGLNVFADITNQEYRTTYLGTPFDGSALIGTEEEKIFSTP
APTVDWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSK
SYG---

---GYGSGSSSSSGSSSGKSSSSSSGWVVKLHPHHHQVKLHHHHQAKLHHHHHQVKLHLL
LHQPLVLNQVPNQVANQANPPVHNQVKPLLLVKHQHQVLVLAQVQVQVQVQAQVLLRPHL
VTTGSLKTHGVLHGVWMVTFL*akieitivvsqqwllsqlphqikilflivptninxxl*
*


Translated Amino Acid sequence (All Frames)
Frame A:
tin*iyiy*k*efyhsfvyy*latlllnnnslnyntemlsptgckltkeliplknlmlvi
kssnpiwimytngiqkvvkpfwv*mfslilptknielptwvphsmvqpslvlkkrksspp
qpqlligelkvlshqlkikvnvvaaghsqplvqlkvltllhleqkki*fhylnkt*sivq
nhtv---

---wlwfrfkfiiwfiiw*iiiiiirlggktssssssgkasssssgkassssssgktssa
asstsgsqsgsqsgsqsgqstgsqsgqtsasgqasasgsgsgsgsgsgsgsgsgaveass
gnywivknswgtswgmdgyifmskdrnnncgiatmasfptassn*nfifncad*h*xxfv
i

Frame B:
psikyiytknesfiiplfiis*lrfc*ttil*itiqkcfhqldasspknlfl*ri*csls
nlqiqyglctpmefkrw*nrfgfkcfr*yyqpri*nyllgypirwfsphwy*rrenllhp
spnc*lessrcchtn*ksrsmwwllvilnhwfn*rcslyciwnkkrfsfii*tkldrlfk
iir---

---GYGSGSSSSSGSSSGKSSSSSSGWVVKLHPHHHQVKLHHHHQAKLHHHHHQVKLHLL
LHQPLVLNQVPNQVANQANPPVHNQVKPLLLVKHQHQVLVLAQVQVQVQVQAQVLLRPHL
VTTGSLKTHGVLHGVWMVTFL*akieitivvsqqwllsqlphqikilflivptninxxl*
*

Frame C:
HQLNIYILKMRVLSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQ
IFKSNMDYVHQWNSKGGETVLGLNVFADITNQEYRTTYLGTPFDGSALIGTEEEKIFSTP
APTVDWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSK
SYG---

---vmvqvqvhhlvhhlvnhhhhhqagw*nfiliiir*sfiiiirqsfiiiiir*nficc
finlwfsirfpir*pirpihrftirsnlcfwssisirfwfwlrfrfrfrfrlrcc*gliw
*lldr*klmgyfmgygwlhfyeqr*k*qlwyrnngffpncliklkfyf*lcrltlixicn

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VFO306 (VFO306Q) /CSM/VF/VFO3-A/VFO306Q.Seq.d/ 1053 0.0
VFM544 (VFM544Q) /CSM/VF/VFM5-B/VFM544Q.Seq.d/ 1053 0.0
VFM320 (VFM320Q) /CSM/VF/VFM3-A/VFM320Q.Seq.d/ 1053 0.0
VFK619 (VFK619Q) /CSM/VF/VFK6-A/VFK619Q.Seq.d/ 1053 0.0
VFK603 (VFK603Q) /CSM/VF/VFK6-A/VFK603Q.Seq.d/ 1053 0.0
VFK133 (VFK133Q) /CSM/VF/VFK1-B/VFK133Q.Seq.d/ 1053 0.0
VFG572 (VFG572Q) /CSM/VF/VFG5-C/VFG572Q.Seq.d/ 1053 0.0
VFG473 (VFG473Q) /CSM/VF/VFG4-D/VFG473Q.Seq.d/ 1053 0.0
VFG110 (VFG110Q) /CSM/VF/VFG1-A/VFG110Q.Seq.d/ 1053 0.0
VFF738 (VFF738Q) /CSM/VF/VFF7-B/VFF738Q.Seq.d/ 1053 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

L36204|L36204.1 Dictyostelium discoideum cysteine proteinase (CP4) mRNA, complete cds. 922 0.0 6
U72746|U72746.1 Dictyostelium discoideum cysteine proteinase (cprG) mRNA, complete cds. 131 e-108 7
L36205|L36205.1 Dictyostelium discoideum cysteine proteinase CP5 mRNA, complete cds. 121 e-102 6
U72745|U72745.1 Dictyostelium discoideum cysteine proteinase (cprF) mRNA, complete cds. 115 2e-95 7
AC117072|AC117072.2 Dictyostelium discoideum chromosome 2 map 3323568-3470138 strain AX4, complete sequence. 121 1e-93 8
X03344|X03344.1 Dictyostelium discoideum mRNA for cysteine proteinase 2. 76 1e-31 5
M16039|M16039.1 Dictyostelium discoideum pst-cath gene encoding pst-cathepsin, complete cds. 64 1e-25 7
X02407|X02407.1 D.discoideum mRNA for cysteine proteinase 1. 70 2e-12 2
AJ517568|AJ517568.1 Dreissena polymorpha EST, clone atrado65. 68 3e-07 1
CB398740|CB398740.1 OSTR208E2_2 AD-wrmcDNA Caenorhabditis elegans cDNA, mRNA sequence. 54 8e-06 2
dna update 2003. 9.10
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

L36204_1(L36204|pid:none) Dictyostelium discoideum cysteine prot... 360 5e-98
(Q94504) RecName: Full=Cysteine proteinase 7; EC=3.4.22... 275 3e-72
(P54640) RecName: Full=Cysteine proteinase 5; EC=3.4.22... 273 6e-72
L36205_1(L36205|pid:none) Dictyostelium discoideum cysteine prot... 267 6e-70
(Q94503) RecName: Full=Cysteine proteinase 6; EC=3.4.22... 265 2e-69
(P04989) RecName: Full=Cysteine proteinase 2; EC=3.4.22... 194 6e-48
EF053509_1(EF053509|pid:none) Acanthamoeba castellanii cysteine ... 156 1e-36
AC117076_20(AC117076|pid:none) Dictyostelium discoideum chromoso... 152 2e-35
AK226753_1(AK226753|pid:none) Arabidopsis thaliana mRNA for papa... 137 9e-31
AC000132_24(AC000132|pid:none) Sequence of BAC F21M12 from Arabi... 137 9e-31
protein update 2009. 6.17
PSORT

psg: 0.96 gvh: 0.87 alm: 0.42 top: 0.37 tms: 0.00 mit: 0.33 mip: 0.08
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

72.0 %: extracellular, including cell wall
8.0 %: mitochondrial
8.0 %: vacuolar
8.0 %: endoplasmic reticulum
4.0 %: cytoplasmic

>> prediction for VFB856 is exc

5' end seq. ID VFB856F
5' end seq.
>VFB856F.Seq
ACCATCAATTAAATATATATATACTAAAAATGAGAGTTTTATCATTCCTTTGTTTATTAT
TAGTTAGCTACGCTTCTGCTAAACAACAATTCTCTGAATTACAATACAGAAATGCTTTCA
CCAACTGGATGCAAGCTCACCAAAGAACTTATTCCTCTGAAGAATTTAATGCTCGTTATC
AAATCTTCAAATCCAATATGGATTATGTACACCAATGGAATTCAAAAGGTGGTGAAACCG
TTTTGGGTTTAAATGTTTTCGCTGATATTACCAACCAAGAATATAGAACTACCTACTTGG
GTACCCCATTCGATGGTTCAGCCCTCATTGGTACTGAAGAAGAGAAAATCTTCTCCACCC
CAGCCCCAACTGTTGATTGGAGAGCTCAAGGTGCTGTCACACCAATTAAAAATCAAGGTC
AATGTGGTGGCTGCTGGTCATTCTCAACCACTGGTTCAACTGAAGGTGCTCACTTTATTG
CATCTGGAACAAAAAAAGATTTAGTTTCATTATCTGAACAAAACTTGATCGATTGTTCAA
AATCATACGGTA----------
Length of 5' end seq. 552
3' end seq. ID VFB856Z
3' end seq.
>VFB856Z.Seq
----------TGGTTATGGTTCAGGTTCAAGTTCATCATCTGGTTCATCATCTGGTAAAT
CATCATCATCATCATCAGGCTGGGTGGTAAAACTTCATCCTCATCATCATCAGGTAAAGC
TTCATCATCATCATCAGGCAAAGCTTCATCATCATCATCATCAGGTAAAACTTCATCTGC
TGCTTCATCAACCTCTGGTTCTCAATCAGGTTCCCAATCAGGTAGCCAATCAGGCCAATC
CACCGGTTCACAATCAGGTCAAACCTCTGCTTCTGGTCAAGCATCAGCATCAGGTTCTGG
TTCTGGCTCAGGTTCAGGTTCAGGTTCAGGTTCAGGCTCAGGTGCTGTTGAGGCCTCATC
TGGTAACTACTGGATCGTTAAAAACTCATGGGGTACTTCATGGGGTATGGATGGTTACAT
TTTTATGAGCAAAGATAGAAATAACAATTGTGGTATCGCAACAATGGCTTCTTTCCCAAC
TGCCTCATCAAATTAAAATTTTATTTTTAATTGTGCCGACTAACATTAATTNGNATTTGT
AATAA
Length of 3' end seq. 535
Connected seq. ID VFB856P
Connected seq.
>VFB856P.Seq
ACCATCAATTAAATATATATATACTAAAAATGAGAGTTTTATCATTCCTTTGTTTATTAT
TAGTTAGCTACGCTTCTGCTAAACAACAATTCTCTGAATTACAATACAGAAATGCTTTCA
CCAACTGGATGCAAGCTCACCAAAGAACTTATTCCTCTGAAGAATTTAATGCTCGTTATC
AAATCTTCAAATCCAATATGGATTATGTACACCAATGGAATTCAAAAGGTGGTGAAACCG
TTTTGGGTTTAAATGTTTTCGCTGATATTACCAACCAAGAATATAGAACTACCTACTTGG
GTACCCCATTCGATGGTTCAGCCCTCATTGGTACTGAAGAAGAGAAAATCTTCTCCACCC
CAGCCCCAACTGTTGATTGGAGAGCTCAAGGTGCTGTCACACCAATTAAAAATCAAGGTC
AATGTGGTGGCTGCTGGTCATTCTCAACCACTGGTTCAACTGAAGGTGCTCACTTTATTG
CATCTGGAACAAAAAAAGATTTAGTTTCATTATCTGAACAAAACTTGATCGATTGTTCAA
AATCATACGGTA----------TGGTTATGGTTCAGGTTCAAGTTCATCATCTGGTTCAT
CATCTGGTAAATCATCATCATCATCATCAGGCTGGGTGGTAAAACTTCATCCTCATCATC
ATCAGGTAAAGCTTCATCATCATCATCAGGCAAAGCTTCATCATCATCATCATCAGGTAA
AACTTCATCTGCTGCTTCATCAACCTCTGGTTCTCAATCAGGTTCCCAATCAGGTAGCCA
ATCAGGCCAATCCACCGGTTCACAATCAGGTCAAACCTCTGCTTCTGGTCAAGCATCAGC
ATCAGGTTCTGGTTCTGGCTCAGGTTCAGGTTCAGGTTCAGGTTCAGGCTCAGGTGCTGT
TGAGGCCTCATCTGGTAACTACTGGATCGTTAAAAACTCATGGGGTACTTCATGGGGTAT
GGATGGTTACATTTTTATGAGCAAAGATAGAAATAACAATTGTGGTATCGCAACAATGGC
TTCTTTCCCAACTGCCTCATCAAATTAAAATTTTATTTTTAATTGTGCCGACTAACATTA
ATTNGNATTTGTAATAA
Length of connected seq. 1087
Full length Seq ID -
Full length Seq. -
Length of full length seq. -