VSG230
Library VS
(Link to library)
Clone ID VSG230
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U05201-1
Original site URL
Representative seq. ID VSG230P
(Link to Original site)
Representative DNA sequence
>VSG230 (VSG230Q) /CSM/VS/VSG2-B/VSG230Q.Seq.d/
GAAAGCAGTAGAGGGTATCATTATTATTCATGATCATGGTCATCCTCATATATTACTTTT
ACAAGATAATAATTATTTTAAATTACCGGGAGGTAAACTTAAACCAGGAGAAAATGAAAT
TGATGGACTTATAAGAAAATTAACAAAAAAACTATCACCAACAGGTACTCCTGTATCAGA
TGCACCTTGGGAGATTGGTGATCATGTTTCAACATGGTGGAGACCAAATTTTGAACCATC
CCTTTTTCCATATATTCCTTCTCATATAACTAAACCAAAAGAATGTAAAAAACTATTTGT
TGTTACCCTTCCTGAAAAATGTAAATTTGCTGTTTCAAATAATTTAAGTTTAATTGCTGT
GTCACTTTATGAAATTTATAATAATTCTCAAAGATATGGTGCTGTAATTTCAAGTATTCC
TGCACTCATTAGTAGATATACTTTCGTTTATCTCAATGTAGATTAGATATCAAAAAAAAA
AXXXXXXXXXXCCACGNGTCCGGAAAANCAGTANAGGGTATCATTATTATTCATGATCAT
GGTCATCCTCATATATTACTTTTACAAGATAATAATTATTTTAAATTACCGGGAGGTAAA
CTTAAACCAGGAGAAAATGAAATTGATGGACTTATAANAAAATTAACAAAAAAACTATCA
CCAACAGGTACTCCTGTATCAGATGCACCTTGGGAGATTGGTGATCATGTTTCAACATGG
TGGAGACCAAATTTTGAACCATCCCTTTTTCCANATATTCCTTCTCATATAACTAAACCA
AAAGAATGTAAAAAACTATTTGTTGTTACCCNTCCTGAAAAATGTAAATTTGCTGTTTCA
AATAATTTAAGTTTAATTGCTGTGTCNCTTTATGAAATTTATAATAATTCTCAAAGANAT
GGTGCTGTAATTTCAAGTATTCCTGCACTCATTAGTAGANANACTTTCGTTTATCTCAAT
GTAGATTAGATATCAAAAAAAAAAATAAAAAAATAAAAAANNATAAAAAATAAAAATAAT
AAAAATA
sequence update 2001. 3.22
Translated Amino Acid sequence
essrgyhyys*swsssyitftr**lf*itgr*t*trrk*n*wtykkinkktitnryscir
ctlgdw*scfnmvetkf*tipfsiysfsyn*tkrm*kticcyps*km*iccfk*fkfncc
vtl*nl**fskiwccnfkyscth**IYFRLSQCRLDIKKK---

---HXSGKXVXGIIIIHDHGHPHILLLQDNNYFKLPGGKLKPGENEIDGLIXKLTKKLSP
TGTPVSDAPWEIGDHVSTWWRPNFEPSLFPXIPSHITKPKECKKLFVVTXPEKCKFAVSN
NLSLIAVSLYEIYNNSQRXGAVISSIPALISRXTFVYLNVD*iskkkikk*kxiknknnk
n


Translated Amino Acid sequence (All Frames)
Frame A:
essrgyhyys*swsssyitftr**lf*itgr*t*trrk*n*wtykkinkktitnryscir
ctlgdw*scfnmvetkf*tipfsiysfsyn*tkrm*kticcyps*km*iccfk*fkfncc
vtl*nl**fskiwccnfkyscth**IYFRLSQCRLDIKKK---

---prvrkxsxgyhyys*swsssyitftr**lf*itgr*t*trrk*n*wtyxkinkktit
nryscirctlgdw*scfnmvetkf*tipfsxysfsyn*tkrm*kticcyps*km*iccfk
*fkfnccvxl*nl**fskxwccnfkyscth**xxfrlsqcrldikkknkkikxxkk*k**
k

Frame B:
kavegiiiihdhghphilllqdnnyfklpggklkpgeneidglirkltkklsptgtpvsd
apweigdhvstwwrpnfepslfpyipshitkpkeckklfvvtlpekckfavsnnlsliav
slyeiynnsqrygavissipalisrytfvylnvd*iskkk---

---HXSGKXVXGIIIIHDHGHPHILLLQDNNYFKLPGGKLKPGENEIDGLIXKLTKKLSP
TGTPVSDAPWEIGDHVSTWWRPNFEPSLFPXIPSHITKPKECKKLFVVTXPEKCKFAVSN
NLSLIAVSLYEIYNNSQRXGAVISSIPALISRXTFVYLNVD*iskkkikk*kxiknknnk
n

Frame C:
kq*rvsllfmimviliyyfykiiiilnyrevnlnqekmklmdl*en*qknyhqqvllyqm
hlgrlvimfqhggdqilnhpffhiflli*lnqknvknylllpflknvnllfqii*v*llc
hfmkfiiilkdmvl*fqvflhslvdilsfism*iryqkk---

---txpexqxrvsllfmimviliyyfykiiiilnyrevnlnqekmklmdl*xn*qknyhq
qvllyqmhlgrlvimfqhggdqilnhpffxiflli*lnqknvknylllpxlknvnllfqi
i*v*llcxfmkfiiilkxmvl*fqvflhslvxxlsfism*iryqkkk*knkkx*kikiik
i

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VSG230 (VSG230Q) /CSM/VS/VSG2-B/VSG230Q.Seq.d/ 892 0.0
SSG192 (SSG192Q) /CSM/SS/SSG1-D/SSG192Q.Seq.d/ 850 0.0
VFD426 (VFD426Q) /CSM/VF/VFD4-B/VFD426Q.Seq.d/ 36 1.0
VFB456 (VFB456Q) /CSM/VF/VFB4-C/VFB456Q.Seq.d/ 36 1.0
SSL109 (SSL109Q) /CSM/SS/SSL1-A/SSL109Q.Seq.d/ 36 1.0
SSA355 (SSA355Q) /CSM/SS/SSA3-C/SSA355Q.Seq.d/ 36 1.0
SLH752 (SLH752Q) /CSM/SL/SLH7-C/SLH752Q.Seq.d/ 36 1.0
SFE373 (SFE373Q) /CSM/SF/SFE3-D/SFE373Q.Seq.d/ 36 1.0
SFE173 (SFE173Q) /CSM/SF/SFE1-D/SFE173Q.Seq.d/ 36 1.0
CHI257 (CHI257Q) /CSM/CH/CHI2-C/CHI257Q.Seq.d/ 36 1.0

own update 2003. 1. 9
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

BG882495|BG882495.1 sae89h08.y1 Gm-c1065 Glycine max cDNA clone GENOME SYSTEMS CLONE ID: Gm-c1065-3423 5' similar to TR:O65606 O65606 HYPOTHETICAL 23.9 KD PROTEIN. ;, mRNA sequence. 38 0.001 3
BE555749|BE555749.1 sp93d11.y1 Gm-c1045 Glycine max cDNA clone GENOME SYSTEMS CLONE ID: Gm-c1045-1198 5' similar to TR:O43809 O43809 MRNA CLEAVAGE FACTOR I 25 KDA SUBUNIT. ;, mRNA sequence. 38 0.002 3
BQ453890|BQ453890.1 sap01f05.y1 Gm-c1081 Glycine max cDNA clone SOYBEAN CLONE ID: Gm-c1081-4090 5' similar to TR:O65606 O65606 HYPOTHETICAL 23.9 KD PROTEIN. ;, mRNA sequence. 38 0.002 3
BQ454009|BQ454009.1 sap03e05.y1 Gm-c1081 Glycine max cDNA clone SOYBEAN CLONE ID: Gm-c1081-4066 5' similar to TR:O65606 O65606 HYPOTHETICAL 23.9 KD PROTEIN. ;, mRNA sequence. 38 0.002 3
AW519531|AW519531.1 up35a01.y1 Soares_mouse_NMGB_bcell Mus musculus cDNA clone IMAGE:2698920 5' similar to TR:O43809 O43809 MRNA CLEAVAGE FACTOR I 25 KDA SUBUNIT. ;, mRNA sequence. 52 0.014 1
BG599917|BG599917.1 EST504812 cSTS Solanum tuberosum cDNA clone cSTS26P21 5' sequence, mRNA sequence. 40 0.024 2
AJ560177|AJ560177.1 Antirrhinum majus EST, clone 018_1_12_i24. 42 0.025 2
BI008046|BI008046.1 MR4-RT0026-050201-101-h07 RT0026 Homo sapiens cDNA, mRNA sequence. 46 0.029 2
AL844507|AL844507.1 Plasmodium falciparum chromosome 8. 42 0.032 4
BE080954|BE080954.1 QV1-BT0631-130300-111-d11 BT0631 Homo sapiens cDNA, mRNA sequence. 46 0.040 2
dna update 2003. 8.24
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

EF081895_1(EF081895|pid:none) Picea sitchensis clone WS02813_J16... 162 2e-38
EF147558_1(EF147558|pid:none) Populus trichocarpa clone WS0123_H... 160 6e-38
AL606641_22(AL606641|pid:none) Oryza sativa genomic DNA, chromos... 157 4e-37
EU960657_1(EU960657|pid:none) Zea mays clone 227114 cleavage and... 157 4e-37
AP008210_2314(AP008210|pid:none) Oryza sativa (japonica cultivar... 157 4e-37
AK118070_1(AK118070|pid:none) Arabidopsis thaliana At4g25550 mRN... 157 5e-37
BT056340_1(BT056340|pid:none) Salmo salar clone ssal-eve-523-200... 150 8e-35
BT057879_1(BT057879|pid:none) Salmo salar clone ssal-evf-568-303... 149 2e-34
AY891817_1(AY891817|pid:none) Synthetic construct Homo sapiens c... 147 4e-34
(Q4KM65) RecName: Full=Cleavage and polyadenylation specificity ... 147 4e-34
protein update 2009. 3.22
PSORT

psg: 0.50 gvh: 0.24 alm: 0.39 top: 0.53 tms: 0.00 mit: 0.25 mip: 0.03
nuc: 0.02 erl: 0.00 erm: 0.20 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

52.0 %: cytoplasmic
32.0 %: nuclear
8.0 %: cytoskeletal
8.0 %: mitochondrial

>> prediction for VSG230 is cyt

5' end seq. ID VSG230F
5' end seq.
>VSG230F.Seq
GAAAGCAGTAGAGGGTATCATTATTATTCATGATCATGGTCATCCTCATATATTACTTTT
ACAAGATAATAATTATTTTAAATTACCGGGAGGTAAACTTAAACCAGGAGAAAATGAAAT
TGATGGACTTATAAGAAAATTAACAAAAAAACTATCACCAACAGGTACTCCTGTATCAGA
TGCACCTTGGGAGATTGGTGATCATGTTTCAACATGGTGGAGACCAAATTTTGAACCATC
CCTTTTTCCATATATTCCTTCTCATATAACTAAACCAAAAGAATGTAAAAAACTATTTGT
TGTTACCCTTCCTGAAAAATGTAAATTTGCTGTTTCAAATAATTTAAGTTTAATTGCTGT
GTCACTTTATGAAATTTATAATAATTCTCAAAGATATGGTGCTGTAATTTCAAGTATTCC
TGCACTCATTAGTAGATATACTTTCGTTTATCTCAATGTAGATTAGATATCAAAAAAAAA
A----------
Length of 5' end seq. 481
3' end seq. ID VSG230Z
3' end seq.
>VSG230Z.Seq
----------CCACGNGTCCGGAAAANCAGTANAGGGTATCATTATTATTCATGATCATG
GTCATCCTCATATATTACTTTTACAAGATAATAATTATTTTAAATTACCGGGAGGTAAAC
TTAAACCAGGAGAAAATGAAATTGATGGACTTATAANAAAATTAACAAAAAAACTATCAC
CAACAGGTACTCCTGTATCAGATGCACCTTGGGAGATTGGTGATCATGTTTCAACATGGT
GGAGACCAAATTTTGAACCATCCCTTTTTCCANATATTCCTTCTCATATAACTAAACCAA
AAGAATGTAAAAAACTATTTGTTGTTACCCNTCCTGAAAAATGTAAATTTGCTGTTTCAA
ATAATTTAAGTTTAATTGCTGTGTCNCTTTATGAAATTTATAATAATTCTCAAAGANATG
GTGCTGTAATTTCAAGTATTCCTGCACTCATTAGTAGANANACTTTCGTTTATCTCAATG
TAGATTAGATATCAAAAAAAAAAATAAAAAAATAAAAAANNATAAAAAATAAAAATAATA
AAAATA
Length of 3' end seq. 536
Connected seq. ID VSG230P
Connected seq.
>VSG230P.Seq
GAAAGCAGTAGAGGGTATCATTATTATTCATGATCATGGTCATCCTCATATATTACTTTT
ACAAGATAATAATTATTTTAAATTACCGGGAGGTAAACTTAAACCAGGAGAAAATGAAAT
TGATGGACTTATAAGAAAATTAACAAAAAAACTATCACCAACAGGTACTCCTGTATCAGA
TGCACCTTGGGAGATTGGTGATCATGTTTCAACATGGTGGAGACCAAATTTTGAACCATC
CCTTTTTCCATATATTCCTTCTCATATAACTAAACCAAAAGAATGTAAAAAACTATTTGT
TGTTACCCTTCCTGAAAAATGTAAATTTGCTGTTTCAAATAATTTAAGTTTAATTGCTGT
GTCACTTTATGAAATTTATAATAATTCTCAAAGATATGGTGCTGTAATTTCAAGTATTCC
TGCACTCATTAGTAGATATACTTTCGTTTATCTCAATGTAGATTAGATATCAAAAAAAAA
A----------CCACGNGTCCGGAAAANCAGTANAGGGTATCATTATTATTCATGATCAT
GGTCATCCTCATATATTACTTTTACAAGATAATAATTATTTTAAATTACCGGGAGGTAAA
CTTAAACCAGGAGAAAATGAAATTGATGGACTTATAANAAAATTAACAAAAAAACTATCA
CCAACAGGTACTCCTGTATCAGATGCACCTTGGGAGATTGGTGATCATGTTTCAACATGG
TGGAGACCAAATTTTGAACCATCCCTTTTTCCANATATTCCTTCTCATATAACTAAACCA
AAAGAATGTAAAAAACTATTTGTTGTTACCCNTCCTGAAAAATGTAAATTTGCTGTTTCA
AATAATTTAAGTTTAATTGCTGTGTCNCTTTATGAAATTTATAATAATTCTCAAAGANAT
GGTGCTGTAATTTCAAGTATTCCTGCACTCATTAGTAGANANACTTTCGTTTATCTCAAT
GTAGATTAGATATCAAAAAAAAAAATAAAAAAATAAAAAANNATAAAAAATAAAAATAAT
AAAAATA
Length of connected seq. 1017
Full length Seq ID -
Full length Seq. -
Length of full length seq. -