SHA152
Library SH
Clone ID SHA152
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U11926-1
Representative seq. ID SHA152P
Representative DNA sequence
>SHA152 (SHA152Q) /CSM/SH/SHA1-C/SHA152Q.Seq.d/
GGGATAATAATCCATGTACCATCGATGCCTGTACTAAATCAACAGGTGTTACTCATACCC
CAGTCAATGTAGATGATAATAACAAATGTACAATTGATACATGTACCAAAGAAGGTGGTG
TAACTCATACTCCAGTCAATACTGATGATAACAATGCCTGTACCCTTGATTCCTGCTCAC
CATCAACTGGTGTTTCCCATACCCCAATAAACTGTGATGATAATAATAAATGTACTGTAG
ATTCATGTTCAAACTCAACTGGTTGTGTAAATACTCCAATTAACTGTGACGATAGTAATC
CATGTACCGTAGACTCATGCANTAATTCAACTGGTTGTGTAAACACTCCAGTCAATGTTG
ATGATAATAATCCATGTACCGTCGATGCCTGTACCAAATCAACAGGTGTTACTCATACCC
CAGTCAATGTAGATGATAACAACAAATGTACAATTGATGCATGTACCAAAGAAGGTGGTG
TAACTCATACCCCAGTCAATACTGATGATAACAATGCCTGTACAATTGATGCATGTACAA
AAGAAAGTGGTGTAACTCATACTCCANTCAATACTGATGATAACAATGCTTGTACCCTTG
ACTCCTGTTCACCATCAACTGGCGTTTCCCACACCCCAATTAACTGTGATGATAGTAATC
CATGTACCCGTAGACTCATGTTCAAATTCAACCGGNTGTTGCAACACTCCAATCAATGGT
GGATGATAATAATCCATGTACTGTAGATTCATGTACCAAACCAACAGGTGTCACTCATAC
CCCAATNAATGTTGNATGATAACAACAATGTNCATTGATGCATGTNCAAANAAGGNGGGG
TACTCATACTCCAXXXXXXXXXXACTCATACACCAATCAATACAGATGATAATAACAAAT
GTACATTAGATGCTTGTTCACCAAAGACTGGTGTAACTCATACACCAATCAATTGTGATG
ATGGAAACAAATGTACAATCAATAGTTGTTCACCATCAGTTGGTTGTATCTCAACACCAG
TTTCATGTCCAAAACCAAAAGATAAATGTTCAATCTCTCAATGTGATTCAGCCAAAGGTT
GCATCGAAGTCCCAATGAATTGTACCTCTGATAAATGTAATGAAGCATCATGTTGTGATG
GTGTTTGTACCTCAAAACCAATTAGCTGTCCAAAACCAAAGAATAAGTGTCAAGTTGCAA
AATGTGATTTAATTAAAGGTTGTACCGTCTCAAATGTAGTATGTGATGATGGTAATGCTT
GTACCGAAGATTCATGTTGTTCAGACACTGGTAAATGTCAATTCGAACCAATCAAACTTC
CAAAGAATAAAAACAAATGTATCATTTCAAAATGTGATCCAATTAAAGGTACAATCACCA
ACAGTACCGTAAACTGTGAATGTGATGACCTTTGTAACATTGGTGAATGTTGTGAAGATA
CAGGAAAATGTAATTACAGACAAAAAGATTGTGATGATAATAATCCAAAAACAGCTGATA
GTTGCGATTCCAAGACTGGTAAATGTATTAACAAACCATATAATGTTATCACAAGTGGTT
CTAATTTAATCTCTGGTTTAATTGGTGGTCTCATTGGTGGTGGTACAGGAGGTAAAGGNA
TTGCAAAACTGTAAAAATTAAATACTCTTTT
sequence update 2002.10.25
Translated Amino Acid sequence
giiihvpsmpvlnqqvllipqsm*miitnvqlihvpkkvv*lilqsilmitmpvplipah
hqlvfpipq*tvmiiinvl*ihvqtqlvv*ilqltvtivihvp*thaxiqlvv*tlqsml
miiihvpsmpvpnqqvllipqsm*mittnvqlmhvpkkvv*lipqsilmitmpvqlmhvq
kkvv*lilxsilmitmlvpltpvhhqlafptpqltvmivihvpvdscsnstgccntping
g***SMYCRFMYQTNRCHSYPNXCXMITTMXIDACXXKXGYSYS---

---THTPINTDDNNKCTLDACSPKTGVTHTPINCDDGNKCTINSCSPSVGCISTPVSCPK
PKDKCSISQCDSAKGCIEVPMNCTSDKCNEASCCDGVCTSKPISCPKPKNKCQVAKCDLI
KGCTVSNVVCDDGNACTEDSCCSDTGKCQFEPIKLPKNKNKCIISKCDPIKGTITNSTVN
CECDDLCNIGECCEDTGKCNYRQKDCDDNNPKTADSCDSKTGKCINKPYNVITSGSNLIS
GLIGGLIGGGTGGKGIAKL*klntl


Translated Amino Acid sequence (All Frames)
Frame A:
giiihvpsmpvlnqqvllipqsm*miitnvqlihvpkkvv*lilqsilmitmpvplipah
hqlvfpipq*tvmiiinvl*ihvqtqlvv*ilqltvtivihvp*thaxiqlvv*tlqsml
miiihvpsmpvpnqqvllipqsm*mittnvqlmhvpkkvv*lipqsilmitmpvqlmhvq
kkvv*lilxsilmitmlvpltpvhhqlafptpqltvmivihvpvdscsnstgccntping
g***SMYCRFMYQTNRCHSYPNXCXMITTMXIDACXXKXGYSYS---

---THTPINTDDNNKCTLDACSPKTGVTHTPINCDDGNKCTINSCSPSVGCISTPVSCPK
PKDKCSISQCDSAKGCIEVPMNCTSDKCNEASCCDGVCTSKPISCPKPKNKCQVAKCDLI
KGCTVSNVVCDDGNACTEDSCCSDTGKCQFEPIKLPKNKNKCIISKCDPIKGTITNSTVN
CECDDLCNIGECCEDTGKCNYRQKDCDDNNPKTADSCDSKTGKCINKPYNVITSGSNLIS
GLIGGLIGGGTGGKGIAKL*klntl

Frame B:
g**smyhrcly*inrcysypsqcr***qmyn*ymyqrrwcnsyssqy***qclyp*fllt
inwcfpypnkl*****mycrfmfklnwlckysn*l*r**smyrrlmx*fnwlckhssqc*
***smyrrclyqinrcysypsqcr**qqmyn*cmyqrrwcnsypsqy***qclyn*cmyk
rkwcnsysxqy***qclyp*llftinwrfphpn*l****smyp*thvqiqpxvatlqsmv
ddnnpctvdsctkptgvthtpxnvx**qqcxlmhvqxrxgthtp---

---lihqsiqmiitnvh*mlvhqrlv*lihqsivmmetnvqsivvhhqlvvsqhqfhvqn
qkinvqslnviqpkvasksq*ivplinvmkhhvvmvfvpqnqlavqnqrisvklqnvi*l
kvvpsqm*yvmmvmlvpkihvvqtlvnvnsnqsnfqriktnvsfqnviqlkvqsptvp*t
vnvmtfvtlvnvvkiqenvitdkkivmiiiqkqlivaiprlvnvltnhimlsqvvli*sl
v*lvvslvvvqevkxlqnckn*ilf

Frame C:
dnnpctidactkstgvthtpvnvddnnkctidtctkeggvthtpvntddnnactldscsp
stgvshtpincddnnkctvdscsnstgcvntpincddsnpctvdscxnstgcvntpvnvd
dnnpctvdactkstgvthtpvnvddnnkctidactkeggvthtpvntddnnactidactk
esgvthtpxntddnnactldscspstgvshtpincddsnpctrrlmfkfnrxlqhsnqww
miiihvl*ihvpnqqvslipqxmlxdnnnvh*cmxkxggvlil---

---sytnqyr***qmyirclftkdwcnsytnql**wkqmynq*lftiswlylntsfmskt
kr*mfnlsm*fsqrlhrspnelyl**m**siml*wclylktn*lsktke*vssckm*fn*
rlyrlkcsm**w*clyrrfmlfrhw*msirtnqtske*kqmyhfkm*sn*rynhqqyrkl
*m**pl*hw*ml*ryrkm*lqtkrl****skns**lrfqdw*my*qti*cyhkwf*fnlw
fnwwshwwwyrr*rxcktvkikysf
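The three frame listings above can be reproduced in outline with a short script. This is a minimal sketch, assuming the standard genetic code, `*` for stop codons, and `x` for any codon that contains an ambiguous base; the function name `translate` and the handling of `N` are illustrative assumptions, and the tool that produced this record may resolve some ambiguous codons differently.

```python
from itertools import product

# Standard genetic code, built in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, offset: int = 0) -> str:
    """Translate one forward frame; offset 0, 1, 2 corresponds to Frame A, B, C."""
    dna = dna.upper()
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        # Assumption: any codon not in the table (e.g. one containing N) becomes 'x'.
        protein.append(CODON_TABLE.get(codon, "x"))
    # Lowercase output, matching the letter case used in the Frame B and C listings.
    return "".join(protein).lower()


# First 30 bases of the representative sequence.
head = "GGGATAATAATCCATGTACCATCGATGCCT"
print(translate(head, 0))       # giiihvpsmp
print(translate(head[:19], 1))  # g**smy
print(translate(head[:19], 2))  # dnnpc
```

The three printed strings agree with the opening residues of Frame A, Frame B, and Frame C as listed in the record.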

Homology vs CSM-cDNA

                                                Score   E
Sequences producing significant alignments:     (bits) Value

SHA152 (SHA152Q) /CSM/SH/SHA1-C/SHA152Q.Seq.d/   3118   0.0
CFC807 (CFC807Q) /CSM/CF/CFC8-A/CFC807Q.Seq.d/   1536   0.0
CFC542 (CFC542Q) /CSM/CF/CFC5-B/CFC542Q.Seq.d/   1528   0.0
CHM421 (CHM421Q) /CSM/CH/CHM4-A/CHM421Q.Seq.d/   1526   0.0
SHA221 (SHA221Q) /CSM/SH/SHA2-A/SHA221Q.Seq.d/   1524   0.0
SHA257 (SHA257Q) /CSM/SH/SHA2-C/SHA257Q.Seq.d/   1522   0.0
SHA109 (SHA109Q) /CSM/SH/SHA1-A/SHA109Q.Seq.d/   1518   0.0
CHS294 (CHS294Q) /CSM/CH/CHS2-D/CHS294Q.Seq.d/   1518   0.0
SHC394 (SHC394Q) /CSM/SH/SHC3-D/SHC394Q.Seq.d/   1507   0.0
CHR684 (CHR684Q) /CSM/CH/CHR6-D/CHR684Q.Seq.d/   1507   0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

X78948|X78948.1 D.minutum ecmB gene. 50 2e-19 8
BX231032|BX231032.1 Danio rerio genomic clone DKEY-254O18, genomic survey sequence. 30 6e-04 6
BU494788|BU494788.1 PfESToab67g03.y1 Plasmodium falciparum 3D7 asexual cDNA Plasmodium falciparum cDNA 5', mRNA sequence. 36 0.016 4
CK991559|CK991559.1 EST0111 Eyestalk cDNA library Penaeus monodon cDNA clone ES-N-S03-555-W 5' similar to Histidine-rich glycoprotein precursor (Plasmodium lophurae), mRNA sequence. 30 0.030 4
BQ739629|BQ739629.1 PfESToab48c05.y1 Plasmodium falciparum 3D7 asexual cDNA Plasmodium falciparum cDNA 5' similar to TR:Q9U0J6 Q9U0J6 HYPOTHETICAL 312.5 KD PROTEIN. ;, mRNA sequence. 30 0.053 4
AC116984|AC116984.2 Dictyostelium discoideum chromosome 2 map 2567470-3108875 strain AX4, complete sequence. 50 0.15 1
AF134683|AF134683.1 Plasmodium falciparum strain UNK1 CG2 omega repeat (cg2) gene, partial cds. 34 0.30 3
AF467890|AF467890.1 Mus musculus chromosome 17 clone 160i3 strain C57BL/6J, *** SEQUENCING IN PROGRESS ***, 4 ordered pieces. 48 0.60 1
AC154238|AC154238.1 Mus musculus chromosome 17 clone RP24-162O23, WORKING DRAFT SEQUENCE, 9 unordered pieces. 48 0.60 1
AC138642|AC138642.10 Mus musculus chromosome 3, clone RP23-21O7, complete sequence. 48 0.60 1
dna update 2005.12.02
Homology vs Protein

                                                                     Score   E
Sequences producing significant alignments:                          (bits) Value

A27020(A27020)DIF-induced prestalk pDd63 protein precursor - sli...   709   0.0
A26838(A26838)prestalk protein precursor - slime mold (Dictyoste...   505   e-141
(P11976) RecName: Full=Prestalk protein; AltName: Full=Extracell...   503   e-140
S44208(S44208) extracellular matrix protein B - Dictyostelium mi...   296   1e-78
AC117072_61(AC117072|pid:none) Dictyostelium discoideum chromoso...   274   7e-72
AC116984_103(AC116984|pid:none) Dictyostelium discoideum chromos...   252   2e-65
(Q54C31) RecName: Full=Protein psiR; Flags: Precursor;                164   6e-39
(Q54CH8) RecName: Full=Protein psiN; Flags: Precursor;                132   5e-29
(Q54C32) RecName: Full=Protein psiQ; Flags: Precursor;                129   3e-28
(Q54G85) RecName: Full=Protein psiJ; Flags: Precursor;                120   2e-25
protein update 2009.04.14
PSORT

psg: 0.63 gvh: 0.46 alm: 0.37 top: 0.53 tms: 0.00 mit: 0.44 mip: 0.03
nuc: 0.00 erl: 0.00 erm: 0.40 pox: 0.60 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

36.0 %: mitochondrial
32.0 %: cytoplasmic
20.0 %: peroxisomal
8.0 %: nuclear
4.0 %: cytoskeletal

>> prediction for SHA152 is mit

5' end seq. ID SHA152F
5' end seq.
>SHA152F.Seq
GGGATAATAATCCATGTACCATCGATGCCTGTACTAAATCAACAGGTGTTACTCATACCC
CAGTCAATGTAGATGATAATAACAAATGTACAATTGATACATGTACCAAAGAAGGTGGTG
TAACTCATACTCCAGTCAATACTGATGATAACAATGCCTGTACCCTTGATTCCTGCTCAC
CATCAACTGGTGTTTCCCATACCCCAATAAACTGTGATGATAATAATAAATGTACTGTAG
ATTCATGTTCAAACTCAACTGGTTGTGTAAATACTCCAATTAACTGTGACGATAGTAATC
CATGTACCGTAGACTCATGCANTAATTCAACTGGTTGTGTAAACACTCCAGTCAATGTTG
ATGATAATAATCCATGTACCGTCGATGCCTGTACCAAATCAACAGGTGTTACTCATACCC
CAGTCAATGTAGATGATAACAACAAATGTACAATTGATGCATGTACCAAAGAAGGTGGTG
TAACTCATACCCCAGTCAATACTGATGATAACAATGCCTGTACAATTGATGCATGTACAA
AAGAAAGTGGTGTAACTCATACTCCANTCAATACTGATGATAACAATGCTTGTACCCTTG
ACTCCTGTTCACCATCAACTGGCGTTTCCCACACCCCAATTAACTGTGATGATAGTAATC
CATGTACCCGTAGACTCATGTTCAAATTCAACCGGNTGTTGCAACACTCCAATCAATGGT
GGATGATAATAATCCATGTACTGTAGATTCATGTACCAAACCAACAGGTGTCACTCATAC
CCCAATNAATGTTGNATGATAACAACAATGTNCATTGATGCATGTNCAAANAAGGNGGGG
TACTCATACTCCANNNNNNNNNN
Length of 5' end seq. 863
3' end seq. ID SHA152Z
3' end seq.
>SHA152Z.Seq
NNNNNNNNNNACTCATACACCAATCAATACAGATGATAATAACAAATGTACATTAGATGC
TTGTTCACCAAAGACTGGTGTAACTCATACACCAATCAATTGTGATGATGGAAACAAATG
TACAATCAATAGTTGTTCACCATCAGTTGGTTGTATCTCAACACCAGTTTCATGTCCAAA
ACCAAAAGATAAATGTTCAATCTCTCAATGTGATTCAGCCAAAGGTTGCATCGAAGTCCC
AATGAATTGTACCTCTGATAAATGTAATGAAGCATCATGTTGTGATGGTGTTTGTACCTC
AAAACCAATTAGCTGTCCAAAACCAAAGAATAAGTGTCAAGTTGCAAAATGTGATTTAAT
TAAAGGTTGTACCGTCTCAAATGTAGTATGTGATGATGGTAATGCTTGTACCGAAGATTC
ATGTTGTTCAGACACTGGTAAATGTCAATTCGAACCAATCAAACTTCCAAAGAATAAAAA
CAAATGTATCATTTCAAAATGTGATCCAATTAAAGGTACAATCACCAACAGTACCGTAAA
CTGTGAATGTGATGACCTTTGTAACATTGGTGAATGTTGTGAAGATACAGGAAAATGTAA
TTACAGACAAAAAGATTGTGATGATAATAATCCAAAAACAGCTGATAGTTGCGATTCCAA
GACTGGTAAATGTATTAACAAACCATATAATGTTATCACAAGTGGTTCTAATTTAATCTC
TGGTTTAATTGGTGGTCTCATTGGTGGTGGTACAGGAGGTAAAGGNATTGCAAAACTGTA
AAAATTAAATACTCTTTT
Length of 3' end seq. 798
Connected seq. ID SHA152P
Connected seq.
>SHA152P.Seq
GGGATAATAATCCATGTACCATCGATGCCTGTACTAAATCAACAGGTGTTACTCATACCC
CAGTCAATGTAGATGATAATAACAAATGTACAATTGATACATGTACCAAAGAAGGTGGTG
TAACTCATACTCCAGTCAATACTGATGATAACAATGCCTGTACCCTTGATTCCTGCTCAC
CATCAACTGGTGTTTCCCATACCCCAATAAACTGTGATGATAATAATAAATGTACTGTAG
ATTCATGTTCAAACTCAACTGGTTGTGTAAATACTCCAATTAACTGTGACGATAGTAATC
CATGTACCGTAGACTCATGCANTAATTCAACTGGTTGTGTAAACACTCCAGTCAATGTTG
ATGATAATAATCCATGTACCGTCGATGCCTGTACCAAATCAACAGGTGTTACTCATACCC
CAGTCAATGTAGATGATAACAACAAATGTACAATTGATGCATGTACCAAAGAAGGTGGTG
TAACTCATACCCCAGTCAATACTGATGATAACAATGCCTGTACAATTGATGCATGTACAA
AAGAAAGTGGTGTAACTCATACTCCANTCAATACTGATGATAACAATGCTTGTACCCTTG
ACTCCTGTTCACCATCAACTGGCGTTTCCCACACCCCAATTAACTGTGATGATAGTAATC
CATGTACCCGTAGACTCATGTTCAAATTCAACCGGNTGTTGCAACACTCCAATCAATGGT
GGATGATAATAATCCATGTACTGTAGATTCATGTACCAAACCAACAGGTGTCACTCATAC
CCCAATNAATGTTGNATGATAACAACAATGTNCATTGATGCATGTNCAAANAAGGNGGGG
TACTCATACTCCA----------ACTCATACACCAATCAATACAGATGATAATAACAAAT
GTACATTAGATGCTTGTTCACCAAAGACTGGTGTAACTCATACACCAATCAATTGTGATG
ATGGAAACAAATGTACAATCAATAGTTGTTCACCATCAGTTGGTTGTATCTCAACACCAG
TTTCATGTCCAAAACCAAAAGATAAATGTTCAATCTCTCAATGTGATTCAGCCAAAGGTT
GCATCGAAGTCCCAATGAATTGTACCTCTGATAAATGTAATGAAGCATCATGTTGTGATG
GTGTTTGTACCTCAAAACCAATTAGCTGTCCAAAACCAAAGAATAAGTGTCAAGTTGCAA
AATGTGATTTAATTAAAGGTTGTACCGTCTCAAATGTAGTATGTGATGATGGTAATGCTT
GTACCGAAGATTCATGTTGTTCAGACACTGGTAAATGTCAATTCGAACCAATCAAACTTC
CAAAGAATAAAAACAAATGTATCATTTCAAAATGTGATCCAATTAAAGGTACAATCACCA
ACAGTACCGTAAACTGTGAATGTGATGACCTTTGTAACATTGGTGAATGTTGTGAAGATA
CAGGAAAATGTAATTACAGACAAAAAGATTGTGATGATAATAATCCAAAAACAGCTGATA
GTTGCGATTCCAAGACTGGTAAATGTATTAACAAACCATATAATGTTATCACAAGTGGTT
CTAATTTAATCTCTGGTTTAATTGGTGGTCTCATTGGTGGTGGTACAGGAGGTAAAGGNA
TTGCAAAACTGTAAAAATTAAATACTCTTTT
Length of connected seq. 1641
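The three reported lengths are mutually consistent if the 10-base N run at the junction end of each read is dropped and the 10-character gap marker is not counted: (863 - 10) + (798 - 10) = 1641. The sketch below illustrates that reading. It is an assumption about how the connected sequence was assembled, not a documented procedure, and the function name `connect_reads` and the toy reads are hypothetical.

```python
def connect_reads(five_prime: str, three_prime: str, gap: str = "-" * 10):
    """Join a 5' and a 3' end read across a masked junction.

    Assumption: the trailing N run of the 5' read and the leading N run of
    the 3' read mark the unsequenced junction and are replaced by `gap`.
    """
    left = five_prime.rstrip("N")    # drop only the trailing N run
    right = three_prime.lstrip("N")  # drop only the leading N run
    connected = left + gap + right
    length = len(left) + len(right)  # gap characters are not counted
    return connected, length


# Toy reads (hypothetical, not taken from the record).
connected, length = connect_reads("ACGTNNNN", "NNNNTTGA")
print(connected, length)  # ACGT----------TTGA 8

# The same arithmetic with the lengths reported in this record.
reported = (863 - 10) + (798 - 10)
print(reported)  # 1641
```

Ambiguous bases inside a read (single `N` calls away from the junction) are kept and counted, since only the runs at the junction ends are stripped.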
Full length Seq ID -
Full length Seq. -
Length of full length seq. -