VFB791
Library VF
Clone ID VFB791
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16285-1
Representative seq. ID VFB791E
Representative DNA sequence
>VFB791 (VFB791Q) /CSM/VF/VFB7-D/VFB791Q.Seq.d/
ATTACATAGTTTTTATAAAAAAGAAAAAAATGAGATTCATCTTATTCTTTGTTTTAATGT
TAACAGCCTTGGCTGCTGGTAGAAGATTATCAGTTGAGGAAAGTCAATTCATTGCCTTCC
AAAATAAATATAATAAAATTTATTCAGCTGAAGAATATTTAGTTAAATTTGAAACCTTCA
AATCAAATTTATTAAATATTGATGCCTTAAACAAACAAGCCACCACCATTGGATCTGATA
CTAAATTTGGTGTCAACAAATTTGCTGATCTCTCAAAAGAAGAATTCAAAAAATATTACT
TAAGCAGCAAAGAAGCCCGTTTAACTGATGACCTCCCAATGTTACCAAACTTATCAGACG
ATATCATTTCAGCAACCCCAGCCGCTTTCGATTGGAGAAATACTGGTGGTTCAACTAAAT
TCCCACAAGGTACTCCAGTTACCGCTGTTAAAAACCAAGGTCAATGTGGTTCATGTTGGT
CATTCTCTACCACTGGTAACGTCGAAGGTCAACACTATTTATCAACTGGTACATTAGTTG
GTCTCTCTGAACAAAATTTAGTCGATTGTGATCATACTTGTATGACCTACGAAAACGAAA
ATGTTTGCAATGCTGGTTGTGATGGTGGTCTCCAACCAAATGCCTACAACTACATCATCA
AAAACGGAGGTATCCAAACCGAAGCTACCTATCCATACACTGCTGTTGATGGAGAATGTA
AATTTAACTCTGCCCAAGTTGGTGCTAAAATTTCATCTTTCACTATGGTTCCACAAAATG
AAACTCAAATTGCTTCCTACTTATTCAACAATGGTCCATTAGCTATTGCAGCTGATGCTG
AAGAATGGCAATTCTATATGGGAGGTGTTTTCGATTTCCCATGTGGTCAAACTTTAGATC
ACGGTATCTTAATTGTTGGTTATGGTGCTCAAGATACCATCGTCGGTAAAAATACTCCAT
ACTGGATCATTAAAAACTCATGGGGTGCCGATTGGGGTGAAGCTGGTTACTTAAAAGTTG
AAAGAAATACTGATAAATGTGGTGTTGCCAATTTCGTTTCTTCATCAATTGTGGTTCATC
AAACTAAAAACAATAAATAAAATTTTATTAAAT
sequence update 2001. 6. 9
Translated Amino Acid sequence
YIVFIKKKKMRFILFFVLMLTALAAGRRLSVEESQFIAFQNKYNKIYSAEEYLVKFETFK
SNLLNIDALNKQATTIGSDTKFGVNKFADLSKEEFKKYYLSSKEARLTDDLPMLPNLSDD
IISATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVG
LSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNYIIKNGGIQTEATYPYTAVDGECK
FNSAQVGAKISSFTMVPQNETQIASYLFNNGPLAIAADAEEWQFYMGGVFDFPCGQTLDH
GILIVGYGAQDTIVGKNTPYWIIKNSWGADWGEAGYLKVERNTDKCGVANFVSSSIVVHQ
TKNNK*nfik


Translated Amino Acid sequence (All Frames)
Frame A:
it*fl*krkk*dssyslf*c*qpwllvedyqlrkvnslpskiniikfiqlkni*lnlkps
nqiy*ilmp*tnkpppldlilnlvstnllisqkknsknit*aakkpv*lmtsqcyqtyqt
isfqqpqplsigeilvvqlnshkvlqlpllktkvnvvhvghslplvtskvntiyqlvh*l
vslnki*siviilv*ptktkmfamlvvmvvsnqmptttssktevskpklpihtlllmenv
nltlpklvlkfhlslwfhkmklkllptystmvh*llqlmlkngnsiwevfsishvvkl*i
tvs*llvmvlkipssvkilhtgslkthgvpigvklvt*klkeilinvvlpisflhqlwfi
klktinkilln


Frame B:
lhsfykkekneihlilcfnvnslgcw*kiis*gksihclpk*i**nlfs*rifs*i*nlq
ikfiky*clkqtshhhwi*y*iwcqqic*slkrriqkillkqqrspfn**ppnvtklirr
yhfsnpsrfrlekywwfn*iptryssyrc*kprsmwfmlvilyhw*rrrstlfinwyisw
sl*tkfsrl*sylydlrkrkclqcwl*wwsptkclqlhhqkrrypnrsylsihcc*wrm*
i*lcpswc*nfifhygstk*nsncflliqqwsisycs*c*rmailygrcfrfpmwsnfrs
rylncwlwcsryhrr*kysildh*klmgcrlg*swllks*kky**mwccqfrffincgss
n*kq*ikfy*


Frame C:
YIVFIKKKKMRFILFFVLMLTALAAGRRLSVEESQFIAFQNKYNKIYSAEEYLVKFETFK
SNLLNIDALNKQATTIGSDTKFGVNKFADLSKEEFKKYYLSSKEARLTDDLPMLPNLSDD
IISATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVG
LSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNYIIKNGGIQTEATYPYTAVDGECK
FNSAQVGAKISSFTMVPQNETQIASYLFNNGPLAIAADAEEWQFYMGGVFDFPCGQTLDH
GILIVGYGAQDTIVGKNTPYWIIKNSWGADWGEAGYLKVERNTDKCGVANFVSSSIVVHQ
TKNNK*nfik
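
The three frames listed above appear to correspond to reading offsets 0, 1, and 2 of the representative DNA sequence. The following is a minimal Python sketch of such a forward three-frame translation, assuming the standard genetic code and ignoring the upper/lower-case convention used in the record. The names `translate` and `translate_frames` are illustrative and are not part of the database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, offset=0):
    """Translate dna starting at the given offset; '*' marks a stop codon."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    # Codons with unexpected characters are reported as 'X'.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def translate_frames(dna):
    """Return translations for frames A, B, C (assumed offsets 0, 1, 2)."""
    return {name: translate(dna, off) for name, off in zip("ABC", range(3))}


# First 30 bases of the representative sequence shown in this record.
prefix = "ATTACATAGTTTTTATAAAAAAGAAAAAAA"
frames = translate_frames(prefix)
print(frames)
```

For this 30-base prefix the sketch gives `IT*FL*KRKK`, `LHSFYKKEK`, and `YIVFIKKKK`, which agree (ignoring case) with the starts of Frames A, B, and C above.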


Homology vs CSM-cDNA

Sequences producing significant alignments: Score (bits) E value

VFB791 (VFB791Q) /CSM/VF/VFB7-D/VFB791Q.Seq.d/ 2147 0.0
VFL786 (VFL786Q) /CSM/VF/VFL7-D/VFL786Q.Seq.d/ 2133 0.0
VFL771 (VFL771Q) /CSM/VF/VFL7-C/VFL771Q.Seq.d/ 2133 0.0
VFL486 (VFL486Q) /CSM/VF/VFL4-D/VFL486Q.Seq.d/ 2133 0.0
VFI486 (VFI486Q) /CSM/VF/VFI4-D/VFI486Q.Seq.d/ 2133 0.0
VFF501 (VFF501Q) /CSM/VF/VFF5-A/VFF501Q.Seq.d/ 2133 0.0
VFD464 (VFD464Q) /CSM/VF/VFD4-C/VFD464Q.Seq.d/ 2133 0.0
VFG680 (VFG680Q) /CSM/VF/VFG6-D/VFG680Q.Seq.d/ 2125 0.0
VFE847 (VFE847Q) /CSM/VF/VFE8-B/VFE847Q.Seq.d/ 2125 0.0
VFD806 (VFD806Q) /CSM/VF/VFD8-A/VFD806Q.Seq.d/ 2125 0.0

own update 2004.12.25
Homology vs DNA

Sequences producing significant alignments: Score (bits) E value N

X02407|X02407.1 D.discoideum mRNA for cysteine proteinase 1. 54 7e-32 7
U72746|U72746.1 Dictyostelium discoideum cysteine proteinase (cprG) mRNA, complete cds. 48 2e-15 5
X03344|X03344.1 Dictyostelium discoideum mRNA for cysteine proteinase 2. 46 1e-14 6
L36205|L36205.1 Dictyostelium discoideum cysteine proteinase CP5 mRNA, complete cds. 48 4e-13 5
L36204|L36204.1 Dictyostelium discoideum cysteine proteinase (CP4) mRNA, complete cds. 52 2e-12 4
BM395468|BM395468.1 50072-2-9-C11.f.1 Chilcoat/Turkewitz cDNA (large fraction) Tetrahymena thermophila cDNA, mRNA sequence. 66 5e-11 2
BM393359|BM393359.1 50071-2-9-C11.f.1 Chilcoat/Turkewitz cDNA (small fraction) Tetrahymena thermophila cDNA, mRNA sequence. 66 5e-11 2
AC117072|AC117072.2 Dictyostelium discoideum chromosome 2 map 3323568-3470138 strain AX4, complete sequence. 48 2e-10 10
M16039|M16039.1 Dictyostelium discoideum pst-cath gene encoding pst-cathepsin, complete cds. 38 9e-09 6
U42758|U42758.1 Naegleria fowleri cysteine proteinase homolog mRNA, partial cds. 50 8e-08 3
dna update 2003. 9. 9
Homology vs Protein

Sequences producing significant alignments: Score (bits) E value

(P04988) RecName: Full=Cysteine proteinase 1; EC=3.4.22... 461 e-128
U42758_1(U42758|pid:none) Naegleria fowleri cysteine proteinase ... 312 2e-83
AC149637_11(AC149637|pid:none) Medicago truncatula clone mth2-18... 283 6e-75
AF411121_1(AF411121|pid:none) Sandersonia aurantiaca cysteine pr... 280 5e-74
AB270920_1(AB270920|pid:none) Phaseolus vulgaris CP2 gene for cy... 276 1e-72
FB844636_1(FB844636|pid:none) Sequence 63909 from Patent WO20080... 276 1e-72
T12040(T12040) cysteine proteinase (EC 3.4.22.-) 2 precursor - k... 274 4e-72
AJ242994_1(AJ242994|pid:none) Nicotiana tabacum mRNA for putativ... 272 1e-71
AJ580823_3(AJ580823|pid:none) Lotus corniculatus var. japonicus ... 271 2e-71
FJ475061_1(FJ475061|pid:none) Arachis hypogaea cysteine protease... 271 3e-71
protein update 2009. 7.10
PSORT

psg: 1.02 gvh: 0.83 alm: 0.45 top: 0.13 tms: 0.00 mit: 0.40 mip: 0.06
nuc: 0.04 erl: 0.00 erm: 0.20 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

60.0 %: extracellular, including cell wall
16.0 %: vacuolar
12.0 %: endoplasmic reticulum
8.0 %: cytoplasmic
4.0 %: Golgi

>> prediction for VFB791 is exc

5' end seq. ID VFB791F
5' end seq.
>VFB791F.Seq
ATTACATAGTTTTTATAAAAAAGAAAAAAATGAGATTCATCTTATTCTTTGTTTTAATGT
TAACAGCCTTGGCTGCTGGTAGAAGATTATCAGTTGAGGAAAGTCAATTCATTGCCTTCC
AAAATAAATATAATAAAATTTATTCAGCTGAAGAATATTTAGTTAAATTTGAAACCTTCA
AATCAAATTTATTAAATATTGATGCCTTAAACAAACAAGCCACCACCATTGGATCTGATA
CTAAATTTGGTGTCAACAAATTTGCTGATCTCTCAAAAGAAGAATTCAAAAAATATTACT
TAAGCAGCAAAGAAGCCCGTTTAACTGATGACCTCCCAATGTTACCAAACTTATCAGACG
ATATCATTTCAGCAACCCCAGCCGCTTTCGATTGGAGAAATACTGGTGGTTCAACTAAAT
TCCCACAAGGTACTCCAGTTACCGCTGTTAAAAACCAAGGTCAATGTGGTTCATGTTGGT
CATTCTCTACCACTGGTAACGTC----------
Length of 5' end seq. 503
3' end seq. ID VFB791Z
3' end seq.
>VFB791Z.Seq
----------AACTAAATTCCCACAAGGTACTCCAGTTACCGCTGTTAAAAACCAAGGTC
AATGTGGTTCATGTTGGTCATTCTCTACCACTGGTAACGTCGAAGGTCAACACTATTTAT
CAACTGGTACATTAGTTGGTCTCTCTGAACAAAATTTAGTCGATTGTGATCATACTTGTA
TGACCTACGAAAACGAAAATGTTTGCAATGCTGGTTGTGATGGTGGTCTCCAACCAAATG
CCTACAACTACATCATCAAAAACGGAGGTATCCAAACCGAAGCTACCTATCCATACACTG
CTGTTGATGGAGAATGTAAATTTAACTCTGCCCAAGTTGGTGCTAAAATTTCATCTTTCA
CTATGGTTCCACAAAATGAAACTCAAATTGCTTCCTACTTATTCAACAATGGTCCATTAG
CTATTGCAGCTGATGCTGAAGAATGGCAATTCTATATGGGAGGTGTTTTCGATTTCCCAT
GTGGTCAAACTTTAGATCACGGTATCTTAATTGTTGGTTATGGTGCTCAAGATACCATCG
TCGGTAAAAATACTCCATACTGGATCATTAAAAACTCATGGGGTGCCGATTGGGGTGAAG
CTGGTTACTTAAAAGTTGAAAGAAATACTGATAAATGTGGTGTTGCCAATTTCGTTTCTT
CATCAATTGTGGTTCATCAAACTAAAAACAATAAATAAAATTTTATTAAAT
Length of 3' end seq. 701
Connected seq. ID VFB791P
Connected seq.
>VFB791P.Seq
ATTACATAGTTTTTATAAAAAAGAAAAAAATGAGATTCATCTTATTCTTTGTTTTAATGT
TAACAGCCTTGGCTGCTGGTAGAAGATTATCAGTTGAGGAAAGTCAATTCATTGCCTTCC
AAAATAAATATAATAAAATTTATTCAGCTGAAGAATATTTAGTTAAATTTGAAACCTTCA
AATCAAATTTATTAAATATTGATGCCTTAAACAAACAAGCCACCACCATTGGATCTGATA
CTAAATTTGGTGTCAACAAATTTGCTGATCTCTCAAAAGAAGAATTCAAAAAATATTACT
TAAGCAGCAAAGAAGCCCGTTTAACTGATGACCTCCCAATGTTACCAAACTTATCAGACG
ATATCATTTCAGCAACCCCAGCCGCTTTCGATTGGAGAAATACTGGTGGTTCAACTAAAT
TCCCACAAGGTACTCCAGTTACCGCTGTTAAAAACCAAGGTCAATGTGGTTCATGTTGGT
CATTCTCTACCACTGGTAACGTC----------AACTAAATTCCCACAAGGTACTCCAGT
TACCGCTGTTAAAAACCAAGGTCAATGTGGTTCATGTTGGTCATTCTCTACCACTGGTAA
CGTCGAAGGTCAACACTATTTATCAACTGGTACATTAGTTGGTCTCTCTGAACAAAATTT
AGTCGATTGTGATCATACTTGTATGACCTACGAAAACGAAAATGTTTGCAATGCTGGTTG
TGATGGTGGTCTCCAACCAAATGCCTACAACTACATCATCAAAAACGGAGGTATCCAAAC
CGAAGCTACCTATCCATACACTGCTGTTGATGGAGAATGTAAATTTAACTCTGCCCAAGT
TGGTGCTAAAATTTCATCTTTCACTATGGTTCCACAAAATGAAACTCAAATTGCTTCCTA
CTTATTCAACAATGGTCCATTAGCTATTGCAGCTGATGCTGAAGAATGGCAATTCTATAT
GGGAGGTGTTTTCGATTTCCCATGTGGTCAAACTTTAGATCACGGTATCTTAATTGTTGG
TTATGGTGCTCAAGATACCATCGTCGGTAAAAATACTCCATACTGGATCATTAAAAACTC
ATGGGGTGCCGATTGGGGTGAAGCTGGTTACTTAAAAGTTGAAAGAAATACTGATAAATG
TGGTGTTGCCAATTTCGTTTCTTCATCAATTGTGGTTCATCAAACTAAAAACAATAAATA
AAATTTTATTAAAT
Length of connected seq. 1204
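
The reported lengths (503 for the 5' end, 701 for the 3' end, 1204 for the connected sequence) are consistent with counting only bases and skipping the `-` padding between the two reads. A minimal Python sketch of such a gap-aware count, assuming FASTA-style input; the function name `ungapped_length` is hypothetical:

```python
def ungapped_length(fasta_text):
    """Count bases in FASTA-style text, ignoring header lines and '-' padding."""
    seq = "".join(
        line.strip()
        for line in fasta_text.splitlines()
        if not line.startswith(">")
    )
    # Only nucleotide letters are counted; '-' and whitespace are skipped.
    return sum(1 for ch in seq.upper() if ch in "ACGTN")


# Last line of the 5' end read as shown in this record: 23 bases plus padding.
sample = ">VFB791F.Seq\nCATTCTCTACCACTGGTAACGTC----------\n"
print(ungapped_length(sample))
```

Applied to the whole connected sequence, a count of this kind should equal the sum of the two read lengths, since the padding contributes nothing.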
Full length Seq ID VFB791E
Full length Seq.
>VFB791E.Seq
ATTACATAGTTTTTATAAAAAAGAAAAAAATGAGATTCATCTTATTCTTTGTTTTAATGT
TAACAGCCTTGGCTGCTGGTAGAAGATTATCAGTTGAGGAAAGTCAATTCATTGCCTTCC
AAAATAAATATAATAAAATTTATTCAGCTGAAGAATATTTAGTTAAATTTGAAACCTTCA
AATCAAATTTATTAAATATTGATGCCTTAAACAAACAAGCCACCACCATTGGATCTGATA
CTAAATTTGGTGTCAACAAATTTGCTGATCTCTCAAAAGAAGAATTCAAAAAATATTACT
TAAGCAGCAAAGAAGCCCGTTTAACTGATGACCTCCCAATGTTACCAAACTTATCAGACG
ATATCATTTCAGCAACCCCAGCCGCTTTCGATTGGAGAAATACTGGTGGTTCAACTAAAT
TCCCACAAGGTACTCCAGTTACCGCTGTTAAAAACCAAGGTCAATGTGGTTCATGTTGGT
CATTCTCTACCACTGGTAACGTCGAAGGTCAACACTATTTATCAACTGGTACATTAGTTG
GTCTCTCTGAACAAAATTTAGTCGATTGTGATCATACTTGTATGACCTACGAAAACGAAA
ATGTTTGCAATGCTGGTTGTGATGGTGGTCTCCAACCAAATGCCTACAACTACATCATCA
AAAACGGAGGTATCCAAACCGAAGCTACCTATCCATACACTGCTGTTGATGGAGAATGTA
AATTTAACTCTGCCCAAGTTGGTGCTAAAATTTCATCTTTCACTATGGTTCCACAAAATG
AAACTCAAATTGCTTCCTACTTATTCAACAATGGTCCATTAGCTATTGCAGCTGATGCTG
AAGAATGGCAATTCTATATGGGAGGTGTTTTCGATTTCCCATGTGGTCAAACTTTAGATC
ACGGTATCTTAATTGTTGGTTATGGTGCTCAAGATACCATCGTCGGTAAAAATACTCCAT
ACTGGATCATTAAAAACTCATGGGGTGCCGATTGGGGTGAAGCTGGTTACTTAAAAGTTG
AAAGAAATACTGATAAATGTGGTGTTGCCAATTTCGTTTCTTCATCAATTGTGGTTCATC
AAACTAAAAACAATAAATAAAATTTTATTAAAT
Length of full length seq. 1113
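
The full-length sequence is 91 bases shorter than the connected sequence (1204 - 1113 = 91), which is consistent with the 5' and 3' reads sharing a 91-base overlap that is collapsed in the full-length entry. The record does not state how the reads were assembled; the following Python sketch only illustrates the idea with an exact suffix/prefix match, a simplification that does not tolerate sequencing mismatches. The function name `merge_by_overlap` and the `min_overlap` parameter are hypothetical.

```python
def merge_by_overlap(left, right, min_overlap=10):
    """Join two reads on their longest exact suffix/prefix overlap.

    Returns (merged_sequence, overlap_length), or (None, 0) when no
    overlap of at least min_overlap bases is found.
    """
    # Drop '-' padding and normalise case before comparing.
    left = left.replace("-", "").upper()
    right = right.replace("-", "").upper()
    for k in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return left + right[k:], k
    return None, 0


# Toy example: the reads share the six bases "GGGTTT".
merged, overlap = merge_by_overlap("AAACCCGGGTTT", "GGGTTTACGT", min_overlap=3)
print(merged, overlap)
```

With the reads in this record, the merged length would be expected to equal 503 + 701 minus the overlap length.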