VFC223
Library VF
Clone ID VFC223
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U10989-1
Original site URL
Representative seq. ID VFC223P
Representative DNA sequence
>VFC223 (VFC223Q) /CSM/VF/VFC2-A/VFC223Q.Seq.d/
CTGTTGGCCTACTGGTTTTAAAAGNATAGCTCTTCAAATATCGAGTTATCATTAATTATT
GCAAGTGATGATGAAACACTTGAATTGGGTATTGATGAAAGTTACTTTTTATTAGTGAAT
CAAGACACTTATCAAATAAAAGCCAATACAATCTATGGTGCAATGAGAGGTTTAGAAACA
TTCAAACAAATGGTAGTTTATGGTGTTGTAGAAAATAGTTACTCATTGACATGTGCTGAA
GTTGTAGACTATCCAACCTATCAATGGAGAGGATTGTTGGTTGATAATGCCCGTCATCTC
CTTCCAAAGAATATGGTACTTCATATTATTGACTCGATGGGTTATAATAAATTCAATACT
ATGCATTGGCATTTAATAGATACTGTTGCATTCCCAGTGGAATCGAAAACCTATCCAAAG
TTAACTGAAGCATTACTTGGACCTGGTGCAATTATTACACATGATGATATTTTAGAXXXX
XXXXXXGTACTTCAACATGGTGTTAAATTTGATAAAGAAACTACTTTGGTTCAAACTTGG
ACAAATATTAATGATCTAAGAGATGTACTAGCCGCTGGTTATAAAACTATAACATCGTTC
TTTTTCTATTTAGATAGACAATCACCAACTGGAAATCATTATCATTATGAATGGCAAGAT
ACTTGGGAAGATTTCTATGCATCAGATCCAAGATTAAATATTACTTCAAATGCTGAAAAT
ATTTTAGGTGGTGAAGCTACTATGTTTGGTGAACAAGTTAGTACCGTCAATTGGGATGCC
AGAGTTTGGCCAAGAGCTATTGGTATCTCTGAAAGATTATGGTCTGCTACTGAAATTAAT
AATATCACTCCTTGCTCTCCCTCGTATTGGCCAATTCTCCNTGTGATATGTCCTCGTCGT
GGTATTTCCNCTGGTCCATTATTNCCCTGATTTTTGCTCATTACCTGATGATTTNNCTTT
TNCTTTAAACCA
sequence update 2001. 6. 1
Translated Amino Acid sequence
llaywf*KXSSSNIELSLIIASDDETLELGIDESYFLLVNQDTYQIKANTIYGAMRGLET
FKQMVVYGVVENSYSLTCAEVVDYPTYQWRGLLVDNARHLLPKNMVLHIIDSMGYNKFNT
MHWHLIDTVAFPVESKTYPKLTEALLGPGAIITHDDIL---

---VLQHGVKFDKETTLVQTWTNINDLRDVLAAGYKTITSFFFYLDRQSPTGNHYHYEWQ
DTWEDFYASDPRLNITSNAENILGGEATMFGEQVSTVNWDARVWPRAIGISERLWSATEI
NNITPCSPSYWPILXVICPRRGISXGPLXP*fllit**fxfxfkp


Translated Amino Acid sequence (All Frames)
Frame A:
llaywf*KXSSSNIELSLIIASDDETLELGIDESYFLLVNQDTYQIKANTIYGAMRGLET
FKQMVVYGVVENSYSLTCAEVVDYPTYQWRGLLVDNARHLLPKNMVLHIIDSMGYNKFNT
MHWHLIDTVAFPVESKTYPKLTEALLGPGAIITHDDIL---

---VLQHGVKFDKETTLVQTWTNINDLRDVLAAGYKTITSFFFYLDRQSPTGNHYHYEWQ
DTWEDFYASDPRLNITSNAENILGGEATMFGEQVSTVNWDARVWPRAIGISERLWSATEI
NNITPCSPSYWPILXVICPRRGISXGPLXP*fllit**fxfxfkp

Frame B:
cwptgfkxialqissyh*llqvmmkhlnwvlmkvtfy**iktlik*kpiqsmvq*ev*kh
snkw*fmvl*kivth*hvlkl*tiqpingedcwlimpvisfqriwyfilltrwviinsil
cigi**illhsqwnrkpiqs*lkhyldlvqllhmmif*---

---yfnmvlnlikkllwfklgqilmi*emy*plvikl*hrsfsi*idnhqleiiiimngk
ilgkismhqiqd*illqmlkif*vvkllclvnklvpsigmpefgqellvslkdyglllkl
iisllalprigqfsx*yvlvvvfplvhyxpdfcslpddxxfxln

Frame C:
vgllvlkx*lfkyrviinyck***nt*igy**kllfisesrhlsnksqynlwcnerfrni
qtngslwccrk*llidmc*scrlsnlsmerivg**cpsspskeygtsyy*ldgl**iqyy
alafnryccipsgienlskvn*sitwtwcnyyt**yfr---

---tstwc*i**rnyfgsnldky**skrctsrwl*nynivlflfr*titnwkslsl*mar
ylgrflcirskikyyfkc*kyfrw*syyvw*ts*yrqlgcqslaksywyl*kimvcy*n*
*yhsllslvlanspcdmssswyfxwsiixlifahylmixlxl*t
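
The three forward-frame translations above can be approximated with a short script. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and assuming that any codon containing an ambiguous base such as N is reported as X. The names `translate` and `three_frames` are illustrative and not part of the database. The sketch does not reproduce the record's upper/lower-case convention or the `---` marker for the gap between the two reads.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), ordered with
# bases T, C, A, G and the third codon position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna, offset=0):
    """Translate dna from the given offset; unknown codons become 'X'."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frames(dna):
    """Return forward frames A, B, C (offsets 0, 1, 2) as a dict."""
    return {name: translate(dna, off) for off, name in enumerate("ABC")}


# First 30 bases of the representative sequence in this record.
prefix = "CTGTTGGCCTACTGGTTTTAAAAGNATAGC"
for name, protein in three_frames(prefix).items():
    print(name, protein)
```

Compared case-insensitively, the output for this prefix agrees with the first residues of Frames A, B, and C listed above.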

Homology vs CSM-cDNA

Sequences producing significant alignments: Score (bits) E Value

VFC223 (VFC223Q) /CSM/VF/VFC2-A/VFC223Q.Seq.d/ 1826 0.0
AFJ603 (AFJ603Q) /CSM/AF/AFJ6-A/AFJ603Q.Seq.d/ 884 0.0
AFN575 (AFN575Q) /CSM/AF/AFN5-D/AFN575Q.Seq.d/ 858 0.0
VHK827 (VHK827Q) /CSM/VH/VHK8-B/VHK827Q.Seq.d/ 846 0.0
VFK348 (VFK348Q) /CSM/VF/VFK3-B/VFK348Q.Seq.d/ 835 0.0
SLI877 (SLI877Q) /CSM/SL/SLI8-D/SLI877Q.Seq.d/ 791 0.0
VFA766 (VFA766Q) /CSM/VF/VFA7-C/VFA766Q.Seq.d/ 771 0.0
VFO641 (VFO641Q) /CSM/VF/VFO6-B/VFO641Q.Seq.d/ 541 e-153
VFM185 (VFM185Q) /CSM/VF/VFM1-D/VFM185Q.Seq.d/ 234 2e-60
VSB886 (VSB886Q) /CSM/VS/VSB8-D/VSB886Q.Seq.d/ 125 1e-27

own update 2004.12.25
Homology vs DNA

Sequences producing significant alignments: Score (bits) E Value N

J04065|J04065.1 D.discoideum beta-N-acetylhexosaminidase (nagA) mRNA, complete cds. 80 3e-59 8
BX571979|BX571979.2 Zebrafish DNA sequence *** SEQUENCING IN PROGRESS *** from clone DKEYP-51C11. 38 0.014 5
D49525|D49525.1 Caenorhabditis elegans gene for TPA-1A; TPA-1B, complete cds. 46 0.81 1
AC006741|AC006741.2 Caenorhabditis elegans clone Y38C1, WORKING DRAFT SEQUENCE, 43 unordered pieces. 46 0.81 1
AF078781|AF078781.1 Caenorhabditis elegans cosmid B0545, complete sequence. 46 0.81 1
AX347157|AX347157.1 Sequence 2228 from Patent WO0200928. 44 0.91 2
AL840626|AL840626.7 Mouse DNA sequence from clone RP23-151M8 on chromosome 2. 36 1.8 5
AC087321|AC087321.20 Homo sapiens 12p BAC RP11-606D9 (Roswell Park Cancer Institute Human BAC Library) complete sequence. 42 2.3 4
AC092454|AC092454.5 Homo sapiens chromosome 12 clone RP11-10A6, WORKING DRAFT SEQUENCE, 2 unordered pieces. 42 2.4 3
BU798521|BU798521.1 SJF2AEF09 SJF Schistosoma japonicum cDNA, mRNA sequence. 44 3.2 1
dna update 2003. 9.11
Homology vs Protein

Sequences producing significant alignments: Score (bits) E Value

(Q54SC9) RecName: Full=Beta-hexosaminidase subunit A2; ... 277 3e-73
(P13723) RecName: Full=Beta-hexosaminidase subunit A1; ... 199 1e-49
(Q54K55) RecName: Full=Beta-hexosaminidase subunit B1; ... 123 1e-26
AM493720_1(AM493720|pid:none) Arabidopsis thaliana mRNA for beta... 117 5e-25
AL132954_14(AL132954|pid:none) Arabidopsis thaliana DNA chromoso... 117 5e-25
AY629244_1(AY629244|pid:none) Oryctolagus cuniculus beta-hexosam... 113 9e-24
AC078977_6(AC078977|pid:none) Oryza sativa (japonica cultivar-gr... 112 3e-23
M19735_1(M19735|pid:none) Homo sapiens beta-hexosaminidase beta ... 111 3e-23
(P07686) RecName: Full=Beta-hexosaminidase subunit beta; ... 111 3e-23
BT042908_1(BT042908|pid:none) Zea mays full-length cDNA clone ZM... 111 4e-23
protein update 2009. 6.17
PSORT

psg: 0.81 gvh: 0.40 alm: 0.39 top: 0.53 tms: 0.00 mit: 0.25 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

52.0 %: cytoplasmic
28.0 %: nuclear
8.0 %: vesicles of secretory system
4.0 %: cytoskeletal
4.0 %: mitochondrial
4.0 %: endoplasmic reticulum

>> prediction for VFC223 is cyt

5' end seq. ID VFC223F
5' end seq.
>VFC223F.Seq
CTGTTGGCCTACTGGTTTTAAAAGNATAGCTCTTCAAATATCGAGTTATCATTAATTATT
GCAAGTGATGATGAAACACTTGAATTGGGTATTGATGAAAGTTACTTTTTATTAGTGAAT
CAAGACACTTATCAAATAAAAGCCAATACAATCTATGGTGCAATGAGAGGTTTAGAAACA
TTCAAACAAATGGTAGTTTATGGTGTTGTAGAAAATAGTTACTCATTGACATGTGCTGAA
GTTGTAGACTATCCAACCTATCAATGGAGAGGATTGTTGGTTGATAATGCCCGTCATCTC
CTTCCAAAGAATATGGTACTTCATATTATTGACTCGATGGGTTATAATAAATTCAATACT
ATGCATTGGCATTTAATAGATACTGTTGCATTCCCAGTGGAATCGAAAACCTATCCAAAG
TTAACTGAAGCATTACTTGGACCTGGTGCAATTATTACACATGATGATATTTTAGA----
------
Length of 5' end seq. 476
3' end seq. ID VFC223Z
3' end seq.
>VFC223Z.Seq
----------GTACTTCAACATGGTGTTAAATTTGATAAAGAAACTACTTTGGTTCAAAC
TTGGACAAATATTAATGATCTAAGAGATGTACTAGCCGCTGGTTATAAAACTATAACATC
GTTCTTTTTCTATTTAGATAGACAATCACCAACTGGAAATCATTATCATTATGAATGGCA
AGATACTTGGGAAGATTTCTATGCATCAGATCCAAGATTAAATATTACTTCAAATGCTGA
AAATATTTTAGGTGGTGAAGCTACTATGTTTGGTGAACAAGTTAGTACCGTCAATTGGGA
TGCCAGAGTTTGGCCAAGAGCTATTGGTATCTCTGAAAGATTATGGTCTGCTACTGAAAT
TAATAATATCACTCCTTGCTCTCCCTCGTATTGGCCAATTCTCCNTGTGATATGTCCTCG
TCGTGGTATTTCCNCTGGTCCATTATTNCCCTGATTTTTGCTCATTACCTGATGATTTNN
CTTTTNCTTTAAACCA
Length of 3' end seq. 486
Connected seq. ID VFC223P
Connected seq.
>VFC223P.Seq
CTGTTGGCCTACTGGTTTTAAAAGNATAGCTCTTCAAATATCGAGTTATCATTAATTATT
GCAAGTGATGATGAAACACTTGAATTGGGTATTGATGAAAGTTACTTTTTATTAGTGAAT
CAAGACACTTATCAAATAAAAGCCAATACAATCTATGGTGCAATGAGAGGTTTAGAAACA
TTCAAACAAATGGTAGTTTATGGTGTTGTAGAAAATAGTTACTCATTGACATGTGCTGAA
GTTGTAGACTATCCAACCTATCAATGGAGAGGATTGTTGGTTGATAATGCCCGTCATCTC
CTTCCAAAGAATATGGTACTTCATATTATTGACTCGATGGGTTATAATAAATTCAATACT
ATGCATTGGCATTTAATAGATACTGTTGCATTCCCAGTGGAATCGAAAACCTATCCAAAG
TTAACTGAAGCATTACTTGGACCTGGTGCAATTATTACACATGATGATATTTTAGA----
------GTACTTCAACATGGTGTTAAATTTGATAAAGAAACTACTTTGGTTCAAACTTGG
ACAAATATTAATGATCTAAGAGATGTACTAGCCGCTGGTTATAAAACTATAACATCGTTC
TTTTTCTATTTAGATAGACAATCACCAACTGGAAATCATTATCATTATGAATGGCAAGAT
ACTTGGGAAGATTTCTATGCATCAGATCCAAGATTAAATATTACTTCAAATGCTGAAAAT
ATTTTAGGTGGTGAAGCTACTATGTTTGGTGAACAAGTTAGTACCGTCAATTGGGATGCC
AGAGTTTGGCCAAGAGCTATTGGTATCTCTGAAAGATTATGGTCTGCTACTGAAATTAAT
AATATCACTCCTTGCTCTCCCTCGTATTGGCCAATTCTCCNTGTGATATGTCCTCGTCGT
GGTATTTCCNCTGGTCCATTATTNCCCTGATTTTTGCTCATTACCTGATGATTTNNCTTT
TNCTTTAAACCA
Length of connected seq. 962
Full length Seq ID -
Full length Seq. -
Length of full length seq. -
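
The length fields in this record appear to count sequenced bases only, excluding the `-` padding that marks the unsequenced gap between the 5' and 3' reads (476 + 486 = 962). A minimal sketch of that count, assuming FASTA-style input in which header lines start with `>`. The helper name `count_bases` is hypothetical.

```python
def count_bases(fasta_text, gap_chars="-"):
    """Count sequence characters, skipping FASTA headers and gap padding."""
    total = 0
    for line in fasta_text.splitlines():
        line = line.strip()
        if not line or line.startswith(">"):
            continue  # skip blank lines and header lines
        total += sum(1 for ch in line if ch not in gap_chars)
    return total


# Last two lines of the 5' end read from this record, including gap padding.
tail_of_5prime = """>VFC223F.Seq (last two lines only)
TTAACTGAAGCATTACTTGGACCTGGTGCAATTATTACACATGATGATATTTTAGA----
------"""
print(count_bases(tail_of_5prime))
```

Ambiguous bases such as N are counted as bases here, which is consistent with the lengths reported for the 3' read.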