VSG226
Library VS
(Link to library)
Clone ID VSG226
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U09660-1
Original site URL
Representative seq. ID VSG226P
(Link to Original site)
Representative DNA sequence
>VSG226 (VSG226Q) /CSM/VS/VSG2-B/VSG226Q.Seq.d/
GTTCTAGATCGCGXXXXXXXXXXATTGGAAGATTTGAACCTGGAAGAGTATGTTTACCAT
CGATTTCTCTTAAGGCATTTTCAGATTCATCTCTGGTTGAGTATTTAATGAAACCATAAC
CCTTTGGTTTACCATCCTTTTCTCTAACAATGTTCAAATCTTCAATAATACCATAGTTTC
CAAACAAAATACGAATTTGATCTTCATCGGAGGTACCTAACATACCAATGAATAATTTTC
TTTCCATCTTTTCAATTTCATTATCTGAATATTTAACTTGTAATGGTTTATTCATATTTT
CGAGGAAAGTTCCACTTTCGTTTGTGGTATTTAATGCATTATCAGCTTCTTCTTTAGTTG
AGAATGTAATAAAAGCGCAACCTTTTGANACATTTGTTCTTTTATCTTTAANCATAGTAA
TATCGAGGATANTACCAAATTTATTAAAGATTTGAGAGACNCCTTCTTCATTCATTGAAG
ATGGAATATGGCCAACGAAAACAGTGAAACCACCGAGTTGANTTGAGATTGTTGTTGTTG
NTGTTGTTGGAGCGATTGCAT
sequence update 2001. 3.22
Translated Amino Acid sequence
VLDR---

---IGRFEPGRVCLPSISLKAFSDSSLVEYLMKP*pfglpsfsltmfkssiip*fpnkir
i*sssevpnipmnnflsifsislseyltcnglfifsrkvplsfvvfnalsasslvenvik
aqpfxtfvllslxivisrixpnllki*etpssfiedgiwptktvkpps*xeivvvxvvga
ia


Translated Amino Acid sequence (All Frames)
Frame A:
VLDR---

---IGRFEPGRVCLPSISLKAFSDSSLVEYLMKP*pfglpsfsltmfkssiip*fpnkir
i*sssevpnipmnnflsifsislseyltcnglfifsrkvplsfvvfnalsasslvenvik
aqpfxtfvllslxivisrixpnllki*etpssfiedgiwptktvkpps*xeivvvxvvga
ia

Frame B:
f*ia---

---ledlnleeyvyhrfllrhfqihlwlsi**nhnplvyhpfl*qcsnlq*yhsfqtkye
fdlhrryltyq*iiffpsfqfhylni*lvmvysyfrgkfhfrlwylmhyqlll*lrm**k
rnllxhlffyl*x**yrgxyqiy*rferxllhslkmeygqrkq*nhrvxlrllllxller
lh

Frame C:
srs---

---wki*twksmftidfs*gifrfisg*vfnetitlwftilfsnnvqifnntivskqntn
lifiggt*htne*fsfhlfnfii*ifnl*wfihifeesstfvcgi*ciisfffs*ecnks
atf*xicsfifxhsniedxtkfikdlrdxffih*rwnmanensettelx*dcccxccwsd
c

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VSG226 (VSG226Q) /CSM/VS/VSG2-B/VSG226Q.Seq.d/ 971 0.0
VFI818 (VFI818Q) /CSM/VF/VFI8-A/VFI818Q.Seq.d/ 262 5e-69
VFJ840 (VFJ840Q) /CSM/VF/VFJ8-B/VFJ840Q.Seq.d/ 222 4e-57
VFL496 (VFL496Q) /CSM/VF/VFL4-D/VFL496Q.Seq.d/ 111 1e-23
VFL435 (VFL435Q) /CSM/VF/VFL4-B/VFL435Q.Seq.d/ 78 2e-13
VFL443 (VFL443Q) /CSM/VF/VFL4-B/VFL443Q.Seq.d/ 62 1e-08
SLC857 (SLC857Q) /CSM/SL/SLC8-C/SLC857Q.Seq.d/ 56 6e-07
AFN136 (AFN136Q) /CSM/AF/AFN1-B/AFN136Q.Seq.d/ 46 6e-04
VFK165 (VFK165Q) /CSM/VF/VFK1-C/VFK165Q.Seq.d/ 42 0.009
VFG316 (VFG316Q) /CSM/VF/VFG3-A/VFG316Q.Seq.d/ 38 0.14

own update 2009. 4. 4
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

CF181981|CF181981.1 ISO4E9F Iso cDNA Isotricha sp. BBF-2003 cDNA similar to hypothetical protein 3D7, mRNA sequence. 42 0.27 2
CF182017|CF182017.1 ISO5H11R Iso cDNA Isotricha sp. BBF-2003 cDNA similar to hypothetical protein 3D7, mRNA sequence. 42 0.30 2
AB073701|AB073701.1 Borrelia duttonii plasmid DNA, complete sequence, strain:Ly. 32 0.62 5
AC139459|AC139459.1 Homo sapiens chromosome 5 clone RP11-1069F21, WORKING DRAFT SEQUENCE, 4 unordered pieces. 44 0.89 4
AC139457|AC139457.1 Homo sapiens chromosome 5 clone RP11-1042H2, WORKING DRAFT SEQUENCE, 10 unordered pieces. 44 1.2 2
AC140719|AC140719.1 Homo sapiens chromosome 16 clone XXfos-89271H3, WORKING DRAFT SEQUENCE, 55 unordered pieces. 44 1.8 1
AC140134|AC140134.1 Homo sapiens chromosome 5 clone RP11-1319K7, WORKING DRAFT SEQUENCE, 4 unordered pieces. 44 1.8 1
AC139808|AC139808.1 Homo sapiens chromosome 5 clone RP11-1360A14, WORKING DRAFT SEQUENCE, 18 unordered pieces. 44 1.8 1
AC138809|AC138809.1 Homo sapiens chromosome 5 clone RP11-1026N13, WORKING DRAFT SEQUENCE, 2 unordered pieces. 44 1.8 1
AC139284|AC139284.2 Homo sapiens chromosome 5 clone RP11-842E11, WORKING DRAFT SEQUENCE, 1 unordered piece. 44 1.8 1
dna update 2003. 8.24
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(Q6DGV1) RecName: Full=CUG-BP- and ETR-3-like factor 4; ... 132 8e-30
AC005275_3(AC005275|pid:none) Arabidopsis thaliana BAC F4C21 fro... 130 2e-29
AY065261_1(AY065261|pid:none) Arabidopsis thaliana putative ribo... 130 2e-29
(Q7ZWM3) RecName: Full=CUG-BP- and ETR-3-like factor 3-B; ... 130 2e-29
AK316961_1(AK316961|pid:none) Arabidopsis thaliana AT4G03110 mRN... 130 2e-29
CR760829_1(CR760829|pid:none) Xenopus tropicalis finished cDNA, ... 129 4e-29
(Q91579) RecName: Full=CUG-BP- and ETR-3-like factor 3-A; ... 129 4e-29
U16800_1(U16800|pid:none) Xenopus laevis elav-type ribonucleopro... 129 4e-29
BC154063_1(BC154063|pid:none) Xenopus tropicalis trinucleotide r... 129 4e-29
EF520347_1(EF520347|pid:none) Gallus gallus CUG-BP and ETR-3-lik... 129 4e-29
protein update 2009. 3.22
PSORT

psg: 0.72 gvh: 0.48 alm: 0.43 top: 0.53 tms: 0.00 mit: 0.33 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.40 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

32.0 %: nuclear
28.0 %: cytoplasmic
24.0 %: mitochondrial
8.0 %: vacuolar
4.0 %: cytoskeletal
4.0 %: endoplasmic reticulum

>> prediction for VSG226 is nuc

5' end seq. ID VSG226F
5' end seq.
>VSG226F.Seq
GTTCTAGATCGCG----------
Length of 5' end seq. 13
3' end seq. ID VSG226Z
3' end seq.
>VSG226Z.Seq
----------ATTGGAAGATTTGAACCTGGAAGAGTATGTTTACCATCGATTTCTCTTAA
GGCATTTTCAGATTCATCTCTGGTTGAGTATTTAATGAAACCATAACCCTTTGGTTTACC
ATCCTTTTCTCTAACAATGTTCAAATCTTCAATAATACCATAGTTTCCAAACAAAATACG
AATTTGATCTTCATCGGAGGTACCTAACATACCAATGAATAATTTTCTTTCCATCTTTTC
AATTTCATTATCTGAATATTTAACTTGTAATGGTTTATTCATATTTTCGAGGAAAGTTCC
ACTTTCGTTTGTGGTATTTAATGCATTATCAGCTTCTTCTTTAGTTGAGAATGTAATAAA
AGCGCAACCTTTTGANACATTTGTTCTTTTATCTTTAANCATAGTAATATCGAGGATANT
ACCAAATTTATTAAAGATTTGAGAGACNCCTTCTTCATTCATTGAAGATGGAATATGGCC
AACGAAAACAGTGAAACCACCGAGTTGANTTGAGATTGTTGTTGTTGNTGTTGTTGGAGC
GATTGCAT
Length of 3' end seq. 538
Connected seq. ID VSG226P
Connected seq.
>VSG226P.Seq
GTTCTAGATCGCG----------ATTGGAAGATTTGAACCTGGAAGAGTATGTTTACCAT
CGATTTCTCTTAAGGCATTTTCAGATTCATCTCTGGTTGAGTATTTAATGAAACCATAAC
CCTTTGGTTTACCATCCTTTTCTCTAACAATGTTCAAATCTTCAATAATACCATAGTTTC
CAAACAAAATACGAATTTGATCTTCATCGGAGGTACCTAACATACCAATGAATAATTTTC
TTTCCATCTTTTCAATTTCATTATCTGAATATTTAACTTGTAATGGTTTATTCATATTTT
CGAGGAAAGTTCCACTTTCGTTTGTGGTATTTAATGCATTATCAGCTTCTTCTTTAGTTG
AGAATGTAATAAAAGCGCAACCTTTTGANACATTTGTTCTTTTATCTTTAANCATAGTAA
TATCGAGGATANTACCAAATTTATTAAAGATTTGAGAGACNCCTTCTTCATTCATTGAAG
ATGGAATATGGCCAACGAAAACAGTGAAACCACCGAGTTGANTTGAGATTGTTGTTGTTG
NTGTTGTTGGAGCGATTGCAT
Length of connected seq. 551
Full length Seq ID -
Full length Seq. -
Length of full length seq. -