SHG431
Library SH
(Link to library)
Clone ID SHG431
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U11926-1
Original site URL
Representative seq. ID SHG431P
(Link to Original site)
Representative DNA sequence
>SHG431 (SHG431Q) /CSM/SH/SHG4-B/SHG431Q.Seq.d/
CTGATGATAACAATGCCTGTACACTTGATTCCTGTTCACCATCAACTGGTGTTTCCCACA
CCCCAATTAACTGTGATGATAGTAATCCATGTACCGTAGACTCATGTTCAAATTCAACCG
GTTGTGTAAACACTCCAGTCAATGTTGATGATAATAATCCATGTACTGTAGATGCTTGTA
CCAAATCAACAGGTGTCACACATACCCCAGTAAATGTAGATGATAACAACAAATGTACAA
TTGATGCATGTACCAAAGAAGGTGGTGTAACTCATACTCCAGTCAACACTGATGATAACA
ATGCCTGTACAATTGATTCTTGTTCACCATCAACCGGTATTTCCCACACCCCAATCAATT
GTGATGATAAAAANGCCTGTACANTTGATTCATGTTCAAATTCAACTGGTTGTGTAAATA
CTCCAATTNNCTGTGACGATAATAATCCATGTACTGTTGATTCTTGNNATGACTTAACTG
GTNGTNGNNATACTCCAATCAATGTTGATGATAATAATACATGTACCATCGATGCCTGTA
CTAAATCAACAGGTGTTACTCATACCCCAGTCAATGTTGATGATAATAACAANATGTACA
ATTGATNCATGTACCAAAGAAAGCGGGGTGACTCATACTCCCGTCAATACTGATGATAAC
AATGNNTGTACCCCTTGATTCCTGNTCANAATCAANCTGGTGGTTCCCATACCCCCAATA
AACTGTGATGATAATAATAAATGTACNTGTTGNATTCATTGTTCCAAAATCNACTGGXXX
XXXXXXXTACAGATGATAATANCAAATGTACATTAGATGCTTGTTCACCAAAGACTGGTG
TAACTCATACACCAATCAATTGTGATGATGGAAACAAATGTACAATCAATAGTTGTTCAC
CATCAGTTGGTTGTATCTCAACACCAGTTTCATGTCCAAAACCAAAAGATAAATGTTCAA
TCTCTCAATGTGATTCAGCCAAAGGTTGCATCGAAGTCCCAATGAATTGTACCTCTGATA
AATGTAATGAAGCATCATGTTGTGATGGTGTTTGTACCTCAAAACCAATTAGCTGTCCAA
AACCAAAGAATAAGTGTCAAGTTGCAAAATGTGATTTAATTAAAGGTTGTACCGTCTCAA
ATGTAGTATGTGATGATGGTAATGCTTGTACCGAAGATTCATGTTGTTCAGACACTGGTA
AATGTCAATTCGAACCAATCAAACTTCCAAAGAATAAAAACAAATGTATCATTTCAAAAT
GTGATCCAATTAAAGGTACAATCACCAACAGTACCGTAAACTGTGAATGTGATGACCTTT
GTAACATTGGTGAATGTTGTGAAGATACAGGAAAATGTAATTACAGACAAAAAGATTGTG
ATGATAATAATCCAAAAACAGCTGATAGTTGCGATTCCAAGACTGGTAAATGTATTAACA
AACCATATAATGTTATCACAAGTGGTTCTAATTTAATCTCTGGTTTAATTGGTGGTCTCA
TTGGTGGTGGTACAGGAGGTAAAGGTGATTGCAAAACTTGTAAAAATTAAATAANACTTT
TATT
sequence update 2002.10.25
Translated Amino Acid sequence
lmitmpvhlipvhhqlvfptpqltvmivihvp*thvqiqpvv*tlqsmlmiiihvl*mlv
pnqqvshipq*m*mittnvqlmhvpkkvv*lilqstlmitmpvqlilvhhqpvfptpqsi
vmikxpvxlihvqiqlvv*ilqxxvtiiihvllilxmt*lvvxilqsmlmiiihvpsmpv
lnqqvllipqsmlmiitxctidxctkesgvthtpvntddnnxctp*FLXXIXLVVPIPPI
NCDDNNKCTCXIHCSKIXW---

---TDDNXKCTLDACSPKTGVTHTPINCDDGNKCTINSCSPSVGCISTPVSCPKPKDKCS
ISQCDSAKGCIEVPMNCTSDKCNEASCCDGVCTSKPISCPKPKNKCQVAKCDLIKGCTVS
NVVCDDGNACTEDSCCSDTGKCQFEPIKLPKNKNKCIISKCDPIKGTITNSTVNCECDDL
CNIGECCEDTGKCNYRQKDCDDNNPKTADSCDSKTGKCINKPYNVITSGSNLISGLIGGL
IGGGTGGKGDCKTCKN*ixll


Translated Amino Acid sequence (All Frames)
Frame A:
lmitmpvhlipvhhqlvfptpqltvmivihvp*thvqiqpvv*tlqsmlmiiihvl*mlv
pnqqvshipq*m*mittnvqlmhvpkkvv*lilqstlmitmpvqlilvhhqpvfptpqsi
vmikxpvxlihvqiqlvv*ilqxxvtiiihvllilxmt*lvvxilqsmlmiiihvpsmpv
lnqqvllipqsmlmiitxctidxctkesgvthtpvntddnnxctp*FLXXIXLVVPIPPI
NCDDNNKCTCXIHCSKIXW---

---yr**xqmyirclftkdwcnsytnql**wkqmynq*lftiswlylntsfmsktkr*mf
nlsm*fsqrlhrspnelyl**m**siml*wclylktn*lsktke*vssckm*fn*rlyrl
kcsm**w*clyrrfmlfrhw*msirtnqtske*kqmyhfkm*sn*rynhqqyrkl*m**p
l*hw*ml*ryrkm*lqtkrl****skns**lrfqdw*my*qti*cyhkwf*fnlwfnwws
hwwwyrr*r*lqnl*klnxtfi

Frame B:
***qclyt*flftinwcfphpn*l****smyrrlmfkfnrlckhssqc****smycrcly
qinrchtypskcr**qqmyn*cmyqrrwcnsyssqh***qclyn*flftinryfphpnql
***kxlyx*fmfkfnwlckysnxl*r**smyc*flx*lnwxxxysnqc****ymyhrcly
*inrcysypsqc****qxvqlxhvpkkag*lilpsilmitmxvpldsxsxsxwwfpypq*
tvmiiinvxvxfivpkst---

---TDDNXKCTLDACSPKTGVTHTPINCDDGNKCTINSCSPSVGCISTPVSCPKPKDKCS
ISQCDSAKGCIEVPMNCTSDKCNEASCCDGVCTSKPISCPKPKNKCQVAKCDLIKGCTVS
NVVCDDGNACTEDSCCSDTGKCQFEPIKLPKNKNKCIISKCDPIKGTITNSTVNCECDDL
CNIGECCEDTGKCNYRQKDCDDNNPKTADSCDSKTGKCINKPYNVITSGSNLISGLIGGL
IGGGTGGKGDCKTCKN*ixll

Frame C:
ddnnactldscspstgvshtpincddsnpctvdscsnstgcvntpvnvddnnpctvdact
kstgvthtpvnvddnnkctidactkeggvthtpvntddnnactidscspstgishtpinc
ddkxactxdscsnstgcvntpixcddnnpctvdsxxdltgxxxtpinvddnntctidact
kstgvthtpvnvddnnxmyn*xmyqrkrgdsysrqy***qxxyplipxxnqxggshtpnk
l*****myxlxslfqnxl---

---qmiixnvh*mlvhqrlv*lihqsivmmetnvqsivvhhqlvvsqhqfhvqnqkinvq
slnviqpkvasksq*ivplinvmkhhvvmvfvpqnqlavqnqrisvklqnvi*lkvvpsq
m*yvmmvmlvpkihvvqtlvnvnsnqsnfqriktnvsfqnviqlkvqsptvp*tvnvmtf
vtlvnvvkiqenvitdkkivmiiiqkqlivaiprlvnvltnhimlsqvvli*slv*lvvs
lvvvqevkviaklvkik*xfy

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

SHG431 (SHG431Q) /CSM/SH/SHG4-B/SHG431Q.Seq.d/ 2910 0.0
SHC394 (SHC394Q) /CSM/SH/SHC3-D/SHC394Q.Seq.d/ 1515 0.0
CHR684 (CHR684Q) /CSM/CH/CHR6-D/CHR684Q.Seq.d/ 1501 0.0
CFC542 (CFC542Q) /CSM/CF/CFC5-B/CFC542Q.Seq.d/ 1499 0.0
CHM421 (CHM421Q) /CSM/CH/CHM4-A/CHM421Q.Seq.d/ 1485 0.0
SHG408 (SHG408Q) /CSM/SH/SHG4-A/SHG408Q.Seq.d/ 1481 0.0
SHA152 (SHA152Q) /CSM/SH/SHA1-C/SHA152Q.Seq.d/ 1475 0.0
CHS294 (CHS294Q) /CSM/CH/CHS2-D/CHS294Q.Seq.d/ 1475 0.0
SHA109 (SHA109Q) /CSM/SH/SHA1-A/SHA109Q.Seq.d/ 1471 0.0
SHG420 (SHG420Q) /CSM/SH/SHG4-A/SHG420Q.Seq.d/ 1469 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

X78948|X78948.1 D.minutum ecmB gene. 82 1e-34 7
AF134683|AF134683.1 Plasmodium falciparum strain UNK1 CG2 omega repeat (cg2) gene, partial cds. 32 7e-05 5
AF134658|AF134658.1 Plasmodium falciparum strain SEN16 from Senegal CG2 omega repeat (cg2) gene, partial cds. 32 9e-05 5
G37776|G37776.1 atp Plasmodium falciparum haploid Plasmodium falciparum STS genomic, sequence tagged site. 30 1e-04 5
AJ310615|AJ310615.1 Plasmodium falciparum partial cg2 gene, isolate M27, omega domain. 36 2e-04 5
AF134686|AF134686.1 Plasmodium falciparum strain IVC10 from Cote d' Ivoire CG2 omega repeat (cg2) gene, partial cds. 32 3e-04 5
AF134662|AF134662.1 Plasmodium falciparum strain UPV4 from Burkina Faso CG2 omega repeat (cg2) gene, partial cds. 32 3e-04 5
AF134666|AF134666.1 Plasmodium falciparum strain IVC3 from Cote d' Ivoire CG2 omega repeat (cg2) gene, partial cds. 32 4e-04 5
AF134664|AF134664.1 Plasmodium falciparum strain IVC17 from Cote d' Ivoire CG2 omega repeat (cg2) gene, partial cds. 32 4e-04 5
CK991559|CK991559.1 EST0111 Eyestalk cDNA library Penaeus monodon cDNA clone ES-N-S03-555-W 5' similar to Histidine-rich glycoprotein precursor (Plasmodium lophurae), mRNA sequence. 30 5e-04 6
dna update 2006. 3.30
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

A27020(A27020)DIF-induced prestalk pDd63 protein precursor - sli... 618 e-175
(P11976) RecName: Full=Prestalk protein; AltName: Full=Extracell... 429 e-118
A26838(A26838)prestalk protein precursor - slime mold (Dictyoste... 423 e-117
S44208(S44208) extracellular matrix protein B - Dictyostelium mi... 275 3e-72
AC117072_61(AC117072|pid:none) Dictyostelium discoideum chromoso... 227 7e-58
AC116984_103(AC116984|pid:none) Dictyostelium discoideum chromos... 220 9e-56
(Q54CH8) RecName: Full=Protein psiN; Flags: Precursor; 120 1e-25
(Q54G85) RecName: Full=Protein psiJ; Flags: Precursor; 107 1e-21
(Q54C31) RecName: Full=Protein psiR; Flags: Precursor; 102 5e-20
(Q94494) RecName: Full=Protein psiH; Flags: Precursor; &U67940_... 101 8e-20
protein update 2009. 4.18
PSORT

psg: 0.91 gvh: 0.61 alm: 0.37 top: 0.50 tms: 0.00 mit: 0.24 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.40 pox: 0.00 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

36.0 %: cytoplasmic
28.0 %: mitochondrial
24.0 %: nuclear
8.0 %: endoplasmic reticulum
4.0 %: vacuolar

>> prediction for SHG431 is cyt

5' end seq. ID SHG431F
5' end seq.
>SHG431F.Seq
CTGATGATAACAATGCCTGTACACTTGATTCCTGTTCACCATCAACTGGTGTTTCCCACA
CCCCAATTAACTGTGATGATAGTAATCCATGTACCGTAGACTCATGTTCAAATTCAACCG
GTTGTGTAAACACTCCAGTCAATGTTGATGATAATAATCCATGTACTGTAGATGCTTGTA
CCAAATCAACAGGTGTCACACATACCCCAGTAAATGTAGATGATAACAACAAATGTACAA
TTGATGCATGTACCAAAGAAGGTGGTGTAACTCATACTCCAGTCAACACTGATGATAACA
ATGCCTGTACAATTGATTCTTGTTCACCATCAACCGGTATTTCCCACACCCCAATCAATT
GTGATGATAAAAANGCCTGTACANTTGATTCATGTTCAAATTCAACTGGTTGTGTAAATA
CTCCAATTNNCTGTGACGATAATAATCCATGTACTGTTGATTCTTGNNATGACTTAACTG
GTNGTNGNNATACTCCAATCAATGTTGATGATAATAATACATGTACCATCGATGCCTGTA
CTAAATCAACAGGTGTTACTCATACCCCAGTCAATGTTGATGATAATAACAANATGTACA
ATTGATNCATGTACCAAAGAAAGCGGGGTGACTCATACTCCCGTCAATACTGATGATAAC
AATGNNTGTACCCCTTGATTCCTGNTCANAATCAANCTGGTGGTTCCCATACCCCCAATA
AACTGTGATGATAATAATAAATGTACNTGTTGNATTCATTGTTCCAAAATCNACTGGNNN
NNNNNNN
Length of 5' end seq. 787
3' end seq. ID SHG431Z
3' end seq.
>SHG431Z.Seq
NNNNNNNNNNTACAGATGATAATANCAAATGTACATTAGATGCTTGTTCACCAAAGACTG
GTGTAACTCATACACCAATCAATTGTGATGATGGAAACAAATGTACAATCAATAGTTGTT
CACCATCAGTTGGTTGTATCTCAACACCAGTTTCATGTCCAAAACCAAAAGATAAATGTT
CAATCTCTCAATGTGATTCAGCCAAAGGTTGCATCGAAGTCCCAATGAATTGTACCTCTG
ATAAATGTAATGAAGCATCATGTTGTGATGGTGTTTGTACCTCAAAACCAATTAGCTGTC
CAAAACCAAAGAATAAGTGTCAAGTTGCAAAATGTGATTTAATTAAAGGTTGTACCGTCT
CAAATGTAGTATGTGATGATGGTAATGCTTGTACCGAAGATTCATGTTGTTCAGACACTG
GTAAATGTCAATTCGAACCAATCAAACTTCCAAAGAATAAAAACAAATGTATCATTTCAA
AATGTGATCCAATTAAAGGTACAATCACCAACAGTACCGTAAACTGTGAATGTGATGACC
TTTGTAACATTGGTGAATGTTGTGAAGATACAGGAAAATGTAATTACAGACAAAAAGATT
GTGATGATAATAATCCAAAAACAGCTGATAGTTGCGATTCCAAGACTGGTAAATGTATTA
ACAAACCATATAATGTTATCACAAGTGGTTCTAATTTAATCTCTGGTTTAATTGGTGGTC
TCATTGGTGGTGGTACAGGAGGTAAAGGTGATTGCAAAACTTGTAAAAATTAAATAANAC
TTTTATT
Length of 3' end seq. 787
Connected seq. ID SHG431P
Connected seq.
>SHG431P.Seq
CTGATGATAACAATGCCTGTACACTTGATTCCTGTTCACCATCAACTGGTGTTTCCCACA
CCCCAATTAACTGTGATGATAGTAATCCATGTACCGTAGACTCATGTTCAAATTCAACCG
GTTGTGTAAACACTCCAGTCAATGTTGATGATAATAATCCATGTACTGTAGATGCTTGTA
CCAAATCAACAGGTGTCACACATACCCCAGTAAATGTAGATGATAACAACAAATGTACAA
TTGATGCATGTACCAAAGAAGGTGGTGTAACTCATACTCCAGTCAACACTGATGATAACA
ATGCCTGTACAATTGATTCTTGTTCACCATCAACCGGTATTTCCCACACCCCAATCAATT
GTGATGATAAAAANGCCTGTACANTTGATTCATGTTCAAATTCAACTGGTTGTGTAAATA
CTCCAATTNNCTGTGACGATAATAATCCATGTACTGTTGATTCTTGNNATGACTTAACTG
GTNGTNGNNATACTCCAATCAATGTTGATGATAATAATACATGTACCATCGATGCCTGTA
CTAAATCAACAGGTGTTACTCATACCCCAGTCAATGTTGATGATAATAACAANATGTACA
ATTGATNCATGTACCAAAGAAAGCGGGGTGACTCATACTCCCGTCAATACTGATGATAAC
AATGNNTGTACCCCTTGATTCCTGNTCANAATCAANCTGGTGGTTCCCATACCCCCAATA
AACTGTGATGATAATAATAAATGTACNTGTTGNATTCATTGTTCCAAAATCNACTGG---
-------TACAGATGATAATANCAAATGTACATTAGATGCTTGTTCACCAAAGACTGGTG
TAACTCATACACCAATCAATTGTGATGATGGAAACAAATGTACAATCAATAGTTGTTCAC
CATCAGTTGGTTGTATCTCAACACCAGTTTCATGTCCAAAACCAAAAGATAAATGTTCAA
TCTCTCAATGTGATTCAGCCAAAGGTTGCATCGAAGTCCCAATGAATTGTACCTCTGATA
AATGTAATGAAGCATCATGTTGTGATGGTGTTTGTACCTCAAAACCAATTAGCTGTCCAA
AACCAAAGAATAAGTGTCAAGTTGCAAAATGTGATTTAATTAAAGGTTGTACCGTCTCAA
ATGTAGTATGTGATGATGGTAATGCTTGTACCGAAGATTCATGTTGTTCAGACACTGGTA
AATGTCAATTCGAACCAATCAAACTTCCAAAGAATAAAAACAAATGTATCATTTCAAAAT
GTGATCCAATTAAAGGTACAATCACCAACAGTACCGTAAACTGTGAATGTGATGACCTTT
GTAACATTGGTGAATGTTGTGAAGATACAGGAAAATGTAATTACAGACAAAAAGATTGTG
ATGATAATAATCCAAAAACAGCTGATAGTTGCGATTCCAAGACTGGTAAATGTATTAACA
AACCATATAATGTTATCACAAGTGGTTCTAATTTAATCTCTGGTTTAATTGGTGGTCTCA
TTGGTGGTGGTACAGGAGGTAAAGGTGATTGCAAAACTTGTAAAAATTAAATAANACTTT
TATT
Length of connected seq. 1554
Full length Seq ID -
Full length Seq. -
Length of full length seq. -