SHF556
Library SH
(Link to library)
Clone ID SHF556
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U11926-1
Original site URL
Representative seq. ID SHF556P
(Link to Original site)
Representative DNA sequence
>SHF556 (SHF556Q) /CSM/SH/SHF5-C/SHF556Q.Seq.d/
TAATAATCCATGTACCATCGATGCCTGTACTAAATCAACAGGTGTTACTCATACCCCAGT
CAATGTAGATGATAATAACAAATGTACAATTGATACATGTACCAAAGAAGGTGGTGTAAC
TCATACTCCAGTCAATACTGATGATAACAATGCCTGTACCCTTGATTCCTGCTCACCATC
AACTGGTGTTTCCCATACCCCAATAAACTGTGATGATAATAATAAATGTACTGTAGATTC
ATGTTCAAACTCAACTGGTTGTGTAAATACTCCAATTAACTGTGACGATAGTAATCCATG
TACCGTAGACTCATGCAATAATTCAACTGGTTGTGTAAACACTCCAGTCAATGTTGATGA
TAATAATCCATGTACCGTCGATGCCTGTACCAAATCAACAGGTGTTACTCATACCCCAGT
CAATGTAGATGATAACAACAAATGTACAATTGATGCATGTACCAAAGAANGTGGTGTAAC
TCATACCCCAGTCAATACTGATGATAACAATGCCTGTACAATTGATGCATGTACAAAAGA
ANGTGGTGTGACTCATACTCCANTCAATACTGATGATAACAATGCTTGTACCCTTGACTC
CTGTTCACCATCAACTGGCGTTTCCCACACCCCCAATTAACTGTGATGATAGTAATCCAT
GTACCGTAAACTCATGTTCNAATTCAACCGGGTGGTGCAACACTCCAATCAATGGTGATG
ATAANAATCCATGTNCTGTNAATCCTGTNCCAAACAACNGGGNGACTCATACCCCATCAT
GTNGATGATACAACAAXXXXXXXXXXCCAATCAATTGTGATGATGGAAACAAATGTACAA
TCAATAGTTGTTCACCATCAGTTGGTTGTATCTCAACACCAGTTTCATGTCCAAAACCAA
AAGATAAATGTTCAATCTCTCAATGTGATTCAGCCAAAGGTTGCATCGAAGTCCCAATGA
ATTGTACCTCTGATAAATGTAATGAAGCATCATGTTGTGATGGTGTTTGTACCTCAAAAC
CAATTAGCTGTCCAAAACCAAAGAATAAGTGTCAAGTTGCAAAATGTGATTTAATTAAAG
GTTGTACCGTCTCAAATGTAGTATGTGATGATGGTAATGCTTGTACCGAAGATTCATGTT
GTTCAGACACTGGTAAATGTCAATTCGAACCAATCAAACTTCCAAAGAATAAAAACAAAT
GTATCATTTCAAAATGTGATCCAATTAAAGGTACAATCACCAACAGTACCGTAAACTGTG
AATGTGATGACCTTTGTAACATTGGTGAATGTTGTGAAGATACAGGAAAATGTAATTACA
GACAAAAAGATTGTGATGATAATAATCCAAAAACAGCTGATAGTTGCGATTCCAAGACTG
GTAAATGTATTAACAAACCATATAATGTTATCACAAGTGGTTCTAATTTAATCTCTGGTT
TAATTGGTGGTCTCATTGGTGGTGGTACAGGAGGTAAAGGTNATTGCAAAACTTGTAAAA
ATTAAATAACTCTTTT
sequence update 2002.10.25
Translated Amino Acid sequence
**smyhrcly*inrcysypsqcr***qmyn*ymyqrrwcnsyssqy***qclyp*fllti
nwcfpypnkl*****mycrfmfklnwlckysn*l*r**smyrrlmq*fnwlckhssqc**
**smyrrclyqinrcysypsqcr**qqmyn*cmyqrxwcnsypsqy***qclyn*cmykr
xwcdsysxqy***qclyp*llftinwrfphpqltvmivihvp*THVXIQPGGATLQSMVM
IXIHVLXILXQTTGXLIPHHVDDTT---

---PINCDDGNKCTINSCSPSVGCISTPVSCPKPKDKCSISQCDSAKGCIEVPMNCTSDK
CNEASCCDGVCTSKPISCPKPKNKCQVAKCDLIKGCTVSNVVCDDGNACTEDSCCSDTGK
CQFEPIKLPKNKNKCIISKCDPIKGTITNSTVNCECDDLCNIGECCEDTGKCNYRQKDCD
DNNPKTADSCDSKTGKCINKPYNVITSGSNLISGLIGGLIGGGTGGKGXCKTCKN*itl


Translated Amino Acid sequence (All Frames)
Frame A:
**smyhrcly*inrcysypsqcr***qmyn*ymyqrrwcnsyssqy***qclyp*fllti
nwcfpypnkl*****mycrfmfklnwlckysn*l*r**smyrrlmq*fnwlckhssqc**
**smyrrclyqinrcysypsqcr**qqmyn*cmyqrxwcnsypsqy***qclyn*cmykr
xwcdsysxqy***qclyp*llftinwrfphpqltvmivihvp*THVXIQPGGATLQSMVM
IXIHVLXILXQTTGXLIPHHVDDTT---

---PINCDDGNKCTINSCSPSVGCISTPVSCPKPKDKCSISQCDSAKGCIEVPMNCTSDK
CNEASCCDGVCTSKPISCPKPKNKCQVAKCDLIKGCTVSNVVCDDGNACTEDSCCSDTGK
CQFEPIKLPKNKNKCIISKCDPIKGTITNSTVNCECDDLCNIGECCEDTGKCNYRQKDCD
DNNPKTADSCDSKTGKCINKPYNVITSGSNLISGLIGGLIGGGTGGKGXCKTCKN*itl

Frame B:
nnpctidactkstgvthtpvnvddnnkctidtctkeggvthtpvntddnnactldscsps
tgvshtpincddnnkctvdscsnstgcvntpincddsnpctvdscnnstgcvntpvnvdd
nnpctvdactkstgvthtpvnvddnnkctidactkexgvthtpvntddnnactidactke
xgvthtpxntddnnactldscspstgvshtpn*l****smyrklmfxfnrvvqhsnqw**
*xsmxcxscxkqxgdsypimxmiqq---

---qsivmmetnvqsivvhhqlvvsqhqfhvqnqkinvqslnviqpkvasksq*ivplin
vmkhhvvmvfvpqnqlavqnqrisvklqnvi*lkvvpsqm*yvmmvmlvpkihvvqtlvn
vnsnqsnfqriktnvsfqnviqlkvqsptvp*tvnvmtfvtlvnvvkiqenvitdkkivm
iiiqkqlivaiprlvnvltnhimlsqvvli*slv*lvvslvvvqevkviaklvkik*lf

Frame C:
iihvpsmpvlnqqvllipqsm*miitnvqlihvpkkvv*lilqsilmitmpvplipahhq
lvfpipq*tvmiiinvl*ihvqtqlvv*ilqltvtivihvp*thaiiqlvv*tlqsmlmi
iihvpsmpvpnqqvllipqsm*mittnvqlmhvpkxvv*lipqsilmitmpvqlmhvqkx
vv*lilxsilmitmlvpltpvhhqlafptppincddsnpctvnscsnstgwcntpingdd
xnpcxvnpvpnnxxthtpscx*yn---

---nql**wkqmynq*lftiswlylntsfmsktkr*mfnlsm*fsqrlhrspnelyl**m
**siml*wclylktn*lsktke*vssckm*fn*rlyrlkcsm**w*clyrrfmlfrhw*m
sirtnqtske*kqmyhfkm*sn*rynhqqyrkl*m**pl*hw*ml*ryrkm*lqtkrl**
**skns**lrfqdw*my*qti*cyhkwf*fnlwfnwwshwwwyrr*rxlqnl*klnnsf

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

SHF556 (SHF556Q) /CSM/SH/SHF5-C/SHF556Q.Seq.d/ 2874 0.0
SHG408 (SHG408Q) /CSM/SH/SHG4-A/SHG408Q.Seq.d/ 1402 0.0
SHC394 (SHC394Q) /CSM/SH/SHC3-D/SHC394Q.Seq.d/ 1402 0.0
SFC263 (SFC263Q) /CSM/SF/SFC2-C/SFC263Q.Seq.d/ 1402 0.0
CHR684 (CHR684Q) /CSM/CH/CHR6-D/CHR684Q.Seq.d/ 1394 0.0
SHG431 (SHG431Q) /CSM/SH/SHG4-B/SHG431Q.Seq.d/ 1388 0.0
CFC542 (CFC542Q) /CSM/CF/CFC5-B/CFC542Q.Seq.d/ 1386 0.0
SHG549 (SHG549Q) /CSM/SH/SHG5-C/SHG549Q.Seq.d/ 1362 0.0
SHA109 (SHA109Q) /CSM/SH/SHA1-A/SHA109Q.Seq.d/ 1362 0.0
CHM421 (CHM421Q) /CSM/CH/CHM4-A/CHM421Q.Seq.d/ 1362 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

X78948|X78948.1 D.minutum ecmB gene. 50 5e-19 7
BU494788|BU494788.1 PfESToab67g03.y1 Plasmodium falciparum 3D7 asexual cDNA Plasmodium falciparum cDNA 5', mRNA sequence. 36 0.012 4
AF134659|AF134659.1 Plasmodium falciparum strain BEN6 from Benin CG2 omega repeat (cg2) gene, partial cds. 32 0.020 4
AF134658|AF134658.1 Plasmodium falciparum strain SEN16 from Senegal CG2 omega repeat (cg2) gene, partial cds. 32 0.022 4
CK991559|CK991559.1 EST0111 Eyestalk cDNA library Penaeus monodon cDNA clone ES-N-S03-555-W 5' similar to Histidine-rich glycoprotein precursor (Plasmodium lophurae), mRNA sequence. 30 0.024 4
AF134660|AF134660.1 Plasmodium falciparum strain COM6 from Comoros CG2 omega repeat (cg2) gene, partial cds. 32 0.028 4
AF134683|AF134683.1 Plasmodium falciparum strain UNK1 CG2 omega repeat (cg2) gene, partial cds. 32 0.032 4
AF134661|AF134661.1 Plasmodium falciparum strain CAF5 from Central African Republic CG2 omega repeat (cg2) gene, partial cds. 32 0.040 4
BQ739629|BQ739629.1 PfESToab48c05.y1 Plasmodium falciparum 3D7 asexual cDNA Plasmodium falciparum cDNA 5' similar to TR:Q9U0J6 Q9U0J6 HYPOTHETICAL 312.5 KD PROTEIN. ;, mRNA sequence. 30 0.042 4
AF134663|AF134663.1 Plasmodium falciparum strain AFR1 from Africa CG2 omega repeat (cg2) gene, partial cds. 32 0.050 4
dna update 2006. 3.18
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

A27020(A27020)DIF-induced prestalk pDd63 protein precursor - sli... 409 0.0
(P11976) RecName: Full=Prestalk protein; AltName: Full=Extracell... 317 e-132
A26838(A26838)prestalk protein precursor - slime mold (Dictyoste... 317 e-132
AC117072_61(AC117072|pid:none) Dictyostelium discoideum chromoso... 234 9e-76
S44208(S44208) extracellular matrix protein B - Dictyostelium mi... 276 1e-72
AC116984_103(AC116984|pid:none) Dictyostelium discoideum chromos... 244 6e-68
(Q54C31) RecName: Full=Protein psiR; Flags: Precursor; 108 1e-39
(Q54C32) RecName: Full=Protein psiQ; Flags: Precursor; 93 2e-29
(Q54CH8) RecName: Full=Protein psiN; Flags: Precursor; 126 2e-28
(Q54G85) RecName: Full=Protein psiJ; Flags: Precursor; 115 4e-24
protein update 2009. 4.18
PSORT

psg: 0.91 gvh: 0.57 alm: 0.33 top: 0.47 tms: 0.00 mit: 0.26 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.40 pox: 0.00 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

40.0 %: cytoplasmic
28.0 %: mitochondrial
24.0 %: nuclear
4.0 %: vacuolar
4.0 %: endoplasmic reticulum

>> prediction for SHF556 is cyt

5' end seq. ID SHF556F
5' end seq.
>SHF556F.Seq
TAATAATCCATGTACCATCGATGCCTGTACTAAATCAACAGGTGTTACTCATACCCCAGT
CAATGTAGATGATAATAACAAATGTACAATTGATACATGTACCAAAGAAGGTGGTGTAAC
TCATACTCCAGTCAATACTGATGATAACAATGCCTGTACCCTTGATTCCTGCTCACCATC
AACTGGTGTTTCCCATACCCCAATAAACTGTGATGATAATAATAAATGTACTGTAGATTC
ATGTTCAAACTCAACTGGTTGTGTAAATACTCCAATTAACTGTGACGATAGTAATCCATG
TACCGTAGACTCATGCAATAATTCAACTGGTTGTGTAAACACTCCAGTCAATGTTGATGA
TAATAATCCATGTACCGTCGATGCCTGTACCAAATCAACAGGTGTTACTCATACCCCAGT
CAATGTAGATGATAACAACAAATGTACAATTGATGCATGTACCAAAGAANGTGGTGTAAC
TCATACCCCAGTCAATACTGATGATAACAATGCCTGTACAATTGATGCATGTACAAAAGA
ANGTGGTGTGACTCATACTCCANTCAATACTGATGATAACAATGCTTGTACCCTTGACTC
CTGTTCACCATCAACTGGCGTTTCCCACACCCCCAATTAACTGTGATGATAGTAATCCAT
GTACCGTAAACTCATGTTCNAATTCAACCGGGTGGTGCAACACTCCAATCAATGGTGATG
ATAANAATCCATGTNCTGTNAATCCTGTNCCAAACAACNGGGNGACTCATACCCCATCAT
GTNGATGATACAACAANNNNNNNNNN
Length of 5' end seq. 806
3' end seq. ID SHF556Z
3' end seq.
>SHF556Z.Seq
NNNNNNNNNNCCAATCAATTGTGATGATGGAAACAAATGTACAATCAATAGTTGTTCACC
ATCAGTTGGTTGTATCTCAACACCAGTTTCATGTCCAAAACCAAAAGATAAATGTTCAAT
CTCTCAATGTGATTCAGCCAAAGGTTGCATCGAAGTCCCAATGAATTGTACCTCTGATAA
ATGTAATGAAGCATCATGTTGTGATGGTGTTTGTACCTCAAAACCAATTAGCTGTCCAAA
ACCAAAGAATAAGTGTCAAGTTGCAAAATGTGATTTAATTAAAGGTTGTACCGTCTCAAA
TGTAGTATGTGATGATGGTAATGCTTGTACCGAAGATTCATGTTGTTCAGACACTGGTAA
ATGTCAATTCGAACCAATCAAACTTCCAAAGAATAAAAACAAATGTATCATTTCAAAATG
TGATCCAATTAAAGGTACAATCACCAACAGTACCGTAAACTGTGAATGTGATGACCTTTG
TAACATTGGTGAATGTTGTGAAGATACAGGAAAATGTAATTACAGACAAAAAGATTGTGA
TGATAATAATCCAAAAACAGCTGATAGTTGCGATTCCAAGACTGGTAAATGTATTAACAA
ACCATATAATGTTATCACAAGTGGTTCTAATTTAATCTCTGGTTTAATTGGTGGTCTCAT
TGGTGGTGGTACAGGAGGTAAAGGTNATTGCAAAACTTGTAAAAATTAAATAACTCTTTT
Length of 3' end seq. 720
Connected seq. ID SHF556P
Connected seq.
>SHF556P.Seq
TAATAATCCATGTACCATCGATGCCTGTACTAAATCAACAGGTGTTACTCATACCCCAGT
CAATGTAGATGATAATAACAAATGTACAATTGATACATGTACCAAAGAAGGTGGTGTAAC
TCATACTCCAGTCAATACTGATGATAACAATGCCTGTACCCTTGATTCCTGCTCACCATC
AACTGGTGTTTCCCATACCCCAATAAACTGTGATGATAATAATAAATGTACTGTAGATTC
ATGTTCAAACTCAACTGGTTGTGTAAATACTCCAATTAACTGTGACGATAGTAATCCATG
TACCGTAGACTCATGCAATAATTCAACTGGTTGTGTAAACACTCCAGTCAATGTTGATGA
TAATAATCCATGTACCGTCGATGCCTGTACCAAATCAACAGGTGTTACTCATACCCCAGT
CAATGTAGATGATAACAACAAATGTACAATTGATGCATGTACCAAAGAANGTGGTGTAAC
TCATACCCCAGTCAATACTGATGATAACAATGCCTGTACAATTGATGCATGTACAAAAGA
ANGTGGTGTGACTCATACTCCANTCAATACTGATGATAACAATGCTTGTACCCTTGACTC
CTGTTCACCATCAACTGGCGTTTCCCACACCCCCAATTAACTGTGATGATAGTAATCCAT
GTACCGTAAACTCATGTTCNAATTCAACCGGGTGGTGCAACACTCCAATCAATGGTGATG
ATAANAATCCATGTNCTGTNAATCCTGTNCCAAACAACNGGGNGACTCATACCCCATCAT
GTNGATGATACAACAA----------CCAATCAATTGTGATGATGGAAACAAATGTACAA
TCAATAGTTGTTCACCATCAGTTGGTTGTATCTCAACACCAGTTTCATGTCCAAAACCAA
AAGATAAATGTTCAATCTCTCAATGTGATTCAGCCAAAGGTTGCATCGAAGTCCCAATGA
ATTGTACCTCTGATAAATGTAATGAAGCATCATGTTGTGATGGTGTTTGTACCTCAAAAC
CAATTAGCTGTCCAAAACCAAAGAATAAGTGTCAAGTTGCAAAATGTGATTTAATTAAAG
GTTGTACCGTCTCAAATGTAGTATGTGATGATGGTAATGCTTGTACCGAAGATTCATGTT
GTTCAGACACTGGTAAATGTCAATTCGAACCAATCAAACTTCCAAAGAATAAAAACAAAT
GTATCATTTCAAAATGTGATCCAATTAAAGGTACAATCACCAACAGTACCGTAAACTGTG
AATGTGATGACCTTTGTAACATTGGTGAATGTTGTGAAGATACAGGAAAATGTAATTACA
GACAAAAAGATTGTGATGATAATAATCCAAAAACAGCTGATAGTTGCGATTCCAAGACTG
GTAAATGTATTAACAAACCATATAATGTTATCACAAGTGGTTCTAATTTAATCTCTGGTT
TAATTGGTGGTCTCATTGGTGGTGGTACAGGAGGTAAAGGTNATTGCAAAACTTGTAAAA
ATTAAATAACTCTTTT
Length of connected seq. 1506
Full length Seq ID -
Full length Seq. -
Length of full length seq. -