VSG208
Library VS
(Link to library)
Clone ID VSG208
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U03971-1
Original site URL
Representative seq. ID VSG208P
(Link to Original site)
Representative DNA sequence
>VSG208 (VSG208Q) /CSM/VS/VSG2-A/VSG208Q.Seq.d/
ANCGAGGTTATATAACAGTCGAGAAGAATTTAACAAAATGTCAGAAATCAAAGAAAAGAG
TTGTTACAAATGTAAAGAAGTTGGTCATATCTCACGTAATTGCCCAAAGAATCCAGAAGC
TGGTGATCGTGCTTGTTATGTTTGTAACGTTGTTGGTCATTTAAGCCGTGAATGTCCACA
AAACCCACAACCAACCTTCGAAAAGAAAGACCCAATTAAATGTTACCAATGTAATGGTTT
TGGTCATTTTGCTAGAGATTGTAGAAGAGGTAGAGATAACAAATGTTACAACTGTGGTGG
TTTAGGTCACATCTCCAAGGATTGTCCATCACCAAGCACCAGAGGTCAAGGGTCGTGATG
CTGCCAAATGTTACAAATGTAACCAACCAGGTCACATCGCCAAAGCTTGCCCAGAAAACC
AATCCGAAAATTAAATTTGAAAATTAATTATTTTCATATCATCATAATCATTCAATACGC
GATATGATGAATTTTTGTTTTTGTTTCTGTATAAAAACAXXXXXXXXXXAGCGAAGGTTA
TATAACAGTCGAGAAGAATTTAACAAAATGTCAGAAATCAAAGAAAAGAGTTGTTACAAA
TGTAAAGAAGTTGGTCATATCTCACGTAATTGCCCAAAGAATCCAGAAGCTGGTGATCGT
GCTTGTTATGTTTGTAACGTTGTTGGTCATTTAAGCCGTGAATGTCCACAAAACCCACAA
CCAACCTTCGAAAAGAAAGACCCAATTAAATGTTACCAATGTAATGGTTTTGGTCATTTT
GCTAGAGATTGTAGAAGAGGTAGAGATAACAAATGTTACAACTGTGGTGGTTTAGGTCAC
ATCTCCAAGGATTGTCCATCACCAAGCACCAGAGGTCAAGGTCGTGATGCTGCCAAATGT
TACAAATGTAACCAACCAGGTCACATCGCCAAAGCTTGCCCAGAAAACCAATCCGAAAAT
TAAATTTGAAAATTAATTATTTTCATATCATCATAATCATTCAATACGCGATATGATGAA
TTTTTGTTTTGTTTCTGTATAAAAACAATTGGTCAATTTAAATTTNATTGTATTTATTAA
TATCCCTCACCAAAATTTANAATTAAAAATAAAAAAAA
sequence update 2001. 3.22
Translated Amino Acid sequence
xevi*qsrri*qnvrnqrkellqm*rswsylt*lpkesrsw*scllcl*rcwsfkp*mst
kpttnlrkerpn*mlpm*wfwsfc*rl*kr*r*qmlqlwwfrshlqglsitkhqrsrvvm
lpnvtnvtnqvtspklaqktnpkikfen*LFSYHHNHSIRDMMNFCFCFCIKT---

---RRLYNSREEFNKMSEIKEKSCYKCKEVGHISRNCPKNPEAGDRACYVCNVVGHLSRE
CPQNPQPTFEKKDPIKCYQCNGFGHFARDCRRGRDNKCYNCGGLGHISKDCPSPSTRGQG
RDAAKCYKCNQPGHIAKACPENQSEN*i*kliifiss*sfntrydeflfcfciktigqfk
fxciy*ypspkfxiknkk


Translated Amino Acid sequence (All Frames)
Frame A:
xevi*qsrri*qnvrnqrkellqm*rswsylt*lpkesrsw*scllcl*rcwsfkp*mst
kpttnlrkerpn*mlpm*wfwsfc*rl*kr*r*qmlqlwwfrshlqglsitkhqrsrvvm
lpnvtnvtnqvtspklaqktnpkikfen*LFSYHHNHSIRDMMNFCFCFCIKT---

---segyitveknltkcqkskkrvvtnvkklvishviaqriqklvivlvmfvtllvi*av
nvhkthnqpskrktqlnvtnvmvlvilleiveeveitnvttvvv*vtsprivhhqapevk
vvmlpnvtnvtnqvtspklaqktnpkikfen*lfsyhhnhsirdmmnfcfvsv*kqlvnl
nxivfiniphqnlxlkikk

Frame B:
xrlynsreefnkmseikekscykckevghisrncpknpeagdracyvcnvvghlsrecpq
npqptfekkdpikcyqcngfghfardcrrgrdnkcyncgglghiskdcpspstrgqgs*c
cqmlqm*ptrshrqslprkpirklnlkinyfhiiiiiqyai**ifvfvsv*k---

---akvi*qsrri*qnvrnqrkellqm*rswsylt*lpkesrsw*scllcl*rcwsfkp*
mstkpttnlrkerpn*mlpm*wfwsfc*rl*kr*r*qmlqlwwfrshlqglsitkhqrsr
s*ccqmlqm*ptrshrqslprkpirklnlkinyfhiiiiiqyai**ifvlflyknnwsi*
ixlyllisltkixn*k*kk

Frame C:
rgyitveknltkcqkskkrvvtnvkklvishviaqriqklvivlvmfvtllvi*avnvhk
thnqpskrktqlnvtnvmvlvilleiveeveitnvttvvv*vtsprivhhqapevkgrda
akcykcnqpghiakacpenqsen*i*kliifiss*sfntrydeflflflykn---

---RRLYNSREEFNKMSEIKEKSCYKCKEVGHISRNCPKNPEAGDRACYVCNVVGHLSRE
CPQNPQPTFEKKDPIKCYQCNGFGHFARDCRRGRDNKCYNCGGLGHISKDCPSPSTRGQG
RDAAKCYKCNQPGHIAKACPENQSEN*i*kliifiss*sfntrydeflfcfciktigqfk
fxciy*ypspkfxiknkk

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VSG208 (VSG208Q) /CSM/VS/VSG2-A/VSG208Q.Seq.d/ 2093 0.0
VSI861 (VSI861Q) /CSM/VS/VSI8-C/VSI861Q.Seq.d/ 1084 0.0
VSE189 (VSE189Q) /CSM/VS/VSE1-D/VSE189Q.Seq.d/ 1021 0.0
VFF106 (VFF106Q) /CSM/VF/VFF1-A/VFF106Q.Seq.d/ 981 0.0
SLD529 (SLD529Q) /CSM/SL/SLD5-B/SLD529Q.Seq.d/ 846 0.0
VSG120 (VSG120Q) /CSM/VS/VSG1-A/VSG120Q.Seq.d/ 54 5e-06
VSH854 (VSH854Q) /CSM/VS/VSH8-C/VSH854Q.Seq.d/ 38 0.28
VSF252 (VSF252Q) /CSM/VS/VSF2-C/VSF252Q.Seq.d/ 38 0.28
VSE744 (VSE744Q) /CSM/VS/VSE7-B/VSE744Q.Seq.d/ 38 0.28
VSD845 (VSD845Q) /CSM/VS/VSD8-B/VSD845Q.Seq.d/ 38 0.28

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

AP003244|AP003244.3 Oryza sativa (japonica cultivar-group) genomic DNA, chromosome 1, PAC clone:P0419B01. 44 0.027 3
U88308|U88308.1 Caenorhabditis elegans cosmid C32E8. 36 0.028 4
CD268269|CD268269.1 tab82d02.x1 Hydra EST -III Hydra magnipapillata cDNA 3' similar to SW:HEXP_LEIMA Q04832 DNA-BINDING PROTEIN HEXBP ;, mRNA sequence. 36 0.043 3
BH959920|BH959920.1 odd60c06.g1 B.oleracea002 Brassica oleracea genomic, DNA sequence. 34 0.049 3
BZ014870|BZ014870.1 oeh50c09.b1 B.oleracea002 Brassica oleracea genomic, DNA sequence. 34 0.052 3
AL049911|AL049911.2 Homo sapiens chromosome 21 PAC RPCIP704C216Q2. 42 0.056 3
BX511259|BX511259.2 Zebrafish DNA sequence *** SEQUENCING IN PROGRESS *** from clone CH211-125E6. 34 0.094 8
AL424647|AL424647.1 T3 end of clone XAZ0AA001F04 of library XAZ0AA from strain CBS 712 of Kluyveromyces marxianus. 36 0.099 3
U10402|U10402.2 Caenorhabditis elegans cosmid C34E10, complete sequence. 42 0.10 3
CB936093|CB936093.1 taa33b08.y2 Hydra EST -III Hydra magnipapillata cDNA 5' similar to TR:Q9W6Q5 Q9W6Q5 CELLULAR NUCLEIC ACID BINDING PROTEIN. ;, mRNA sequence. 36 0.10 3
dna update 2003. 8.24
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

AM270155_34(AM270155|pid:none) Aspergillus niger contig An07c038... 129 2e-28
AP007161_857(AP007161|pid:none) Aspergillus oryzae RIB40 genomic... 127 5e-28
AM920427_519(AM920427|pid:none) Penicillium chrysogenum Wisconsi... 127 9e-28
(Q04832) RecName: Full=DNA-binding protein HEXBP; AltName: Full=... 124 7e-27
CT005272_169(CT005272|pid:none) Leishmania major strain Friedlin... 121 4e-26
EF070485_1(EF070485|pid:none) Maconellicoccus hirsutus clone WHM... 120 8e-26
AM502254_417(AM502254|pid:none) Leishmania infantum chromosome 36. 120 8e-26
L03710_1(L03710|pid:none) Tetrahymena thermophila cnjB (cnjB) ge... 117 5e-25
CU640366_1371(CU640366|pid:none) Podospora anserina genomic DNA ... 117 9e-25
EF638949_1(EF638949|pid:none) Triatoma infestans clone TI-138 E3... 116 1e-24
protein update 2009. 3.22
PSORT

psg: 0.80 gvh: 0.39 alm: 0.52 top: 0.53 tms: 0.00 mit: 0.35 mip: 0.05
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

36.0 %: mitochondrial
32.0 %: cytoplasmic
20.0 %: nuclear
4.0 %: cytoskeletal
4.0 %: endoplasmic reticulum
4.0 %: peroxisomal

>> prediction for VSG208 is mit

5' end seq. ID VSG208F
5' end seq.
>VSG208F.Seq
ANCGAGGTTATATAACAGTCGAGAAGAATTTAACAAAATGTCAGAAATCAAAGAAAAGAG
TTGTTACAAATGTAAAGAAGTTGGTCATATCTCACGTAATTGCCCAAAGAATCCAGAAGC
TGGTGATCGTGCTTGTTATGTTTGTAACGTTGTTGGTCATTTAAGCCGTGAATGTCCACA
AAACCCACAACCAACCTTCGAAAAGAAAGACCCAATTAAATGTTACCAATGTAATGGTTT
TGGTCATTTTGCTAGAGATTGTAGAAGAGGTAGAGATAACAAATGTTACAACTGTGGTGG
TTTAGGTCACATCTCCAAGGATTGTCCATCACCAAGCACCAGAGGTCAAGGGTCGTGATG
CTGCCAAATGTTACAAATGTAACCAACCAGGTCACATCGCCAAAGCTTGCCCAGAAAACC
AATCCGAAAATTAAATTTGAAAATTAATTATTTTCATATCATCATAATCATTCAATACGC
GATATGATGAATTTTTGTTTTTGTTTCTGTATAAAAACA----------
Length of 5' end seq. 519
3' end seq. ID VSG208Z
3' end seq.
>VSG208Z.Seq
----------AGCGAAGGTTATATAACAGTCGAGAAGAATTTAACAAAATGTCAGAAATC
AAAGAAAAGAGTTGTTACAAATGTAAAGAAGTTGGTCATATCTCACGTAATTGCCCAAAG
AATCCAGAAGCTGGTGATCGTGCTTGTTATGTTTGTAACGTTGTTGGTCATTTAAGCCGT
GAATGTCCACAAAACCCACAACCAACCTTCGAAAAGAAAGACCCAATTAAATGTTACCAA
TGTAATGGTTTTGGTCATTTTGCTAGAGATTGTAGAAGAGGTAGAGATAACAAATGTTAC
AACTGTGGTGGTTTAGGTCACATCTCCAAGGATTGTCCATCACCAAGCACCAGAGGTCAA
GGTCGTGATGCTGCCAAATGTTACAAATGTAACCAACCAGGTCACATCGCCAAAGCTTGC
CCAGAAAACCAATCCGAAAATTAAATTTGAAAATTAATTATTTTCATATCATCATAATCA
TTCAATACGCGATATGATGAATTTTTGTTTTGTTTCTGTATAAAAACAATTGGTCAATTT
AAATTTNATTGTATTTATTAATATCCCTCACCAAAATTTANAATTAAAAATAAAAAAAA
Length of 3' end seq. 589
Connected seq. ID VSG208P
Connected seq.
>VSG208P.Seq
ANCGAGGTTATATAACAGTCGAGAAGAATTTAACAAAATGTCAGAAATCAAAGAAAAGAG
TTGTTACAAATGTAAAGAAGTTGGTCATATCTCACGTAATTGCCCAAAGAATCCAGAAGC
TGGTGATCGTGCTTGTTATGTTTGTAACGTTGTTGGTCATTTAAGCCGTGAATGTCCACA
AAACCCACAACCAACCTTCGAAAAGAAAGACCCAATTAAATGTTACCAATGTAATGGTTT
TGGTCATTTTGCTAGAGATTGTAGAAGAGGTAGAGATAACAAATGTTACAACTGTGGTGG
TTTAGGTCACATCTCCAAGGATTGTCCATCACCAAGCACCAGAGGTCAAGGGTCGTGATG
CTGCCAAATGTTACAAATGTAACCAACCAGGTCACATCGCCAAAGCTTGCCCAGAAAACC
AATCCGAAAATTAAATTTGAAAATTAATTATTTTCATATCATCATAATCATTCAATACGC
GATATGATGAATTTTTGTTTTTGTTTCTGTATAAAAACA----------AGCGAAGGTTA
TATAACAGTCGAGAAGAATTTAACAAAATGTCAGAAATCAAAGAAAAGAGTTGTTACAAA
TGTAAAGAAGTTGGTCATATCTCACGTAATTGCCCAAAGAATCCAGAAGCTGGTGATCGT
GCTTGTTATGTTTGTAACGTTGTTGGTCATTTAAGCCGTGAATGTCCACAAAACCCACAA
CCAACCTTCGAAAAGAAAGACCCAATTAAATGTTACCAATGTAATGGTTTTGGTCATTTT
GCTAGAGATTGTAGAAGAGGTAGAGATAACAAATGTTACAACTGTGGTGGTTTAGGTCAC
ATCTCCAAGGATTGTCCATCACCAAGCACCAGAGGTCAAGGTCGTGATGCTGCCAAATGT
TACAAATGTAACCAACCAGGTCACATCGCCAAAGCTTGCCCAGAAAACCAATCCGAAAAT
TAAATTTGAAAATTAATTATTTTCATATCATCATAATCATTCAATACGCGATATGATGAA
TTTTTGTTTTGTTTCTGTATAAAAACAATTGGTCAATTTAAATTTNATTGTATTTATTAA
TATCCCTCACCAAAATTTANAATTAAAAATAAAAAAAA
Length of connected seq. 1108
Full length Seq ID -
Full length Seq. -
Length of full length seq. -