SHA109
Library SH
(Link to library)
Clone ID SHA109
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U11926-1
Original site URL
Representative seq. ID SHA109P
(Link to Original site)
Representative DNA sequence
>SHA109 (SHA109Q) /CSM/SH/SHA1-A/SHA109Q.Seq.d/
TCAAATTCAACTGGTTGTGTAAATACTCCAATTTCCTGTGACGATAATAATCCATGTACT
GTTGATTCTTGCGATGACTTAACTGGTTGTTGCAATACTCCAATCAATGCCATGTACCAT
CGATGCCTGTACTAAATCAACAGGTGTTACTCATACCCCAGTCAATGTAGATGATAATAA
CAAATGTACAATTGATACATGTACCAAAGAAGGTGGTGTAACTCATACTCCAGTCAATAC
TGATGATAACAATGCCTGTACCCTTGATTCCTGCTCACCATCAACTGGTGTTTCCCATAC
CCCAATAAACTGTGATGATAATAATAAATGTACTGTAGATTCATGTTCAAACTCAACTGG
TTGTGTAAATACTCCAATTAACTGTGACGATAGTAATCCATGTACCGTAGACTCATGCAA
TAATTCAACTGGTTGTGTAAACACTCCAGTCAATGTTGATGATAATAATCCATGTACCGT
CGATGCCTGTACCAAATCAACAGGTGTTACTCATACCCCAGTCAATGTAGATGATAACAA
CAAATGTACAATTGATGCATGTACCAAAGAANGTGGTGTAACTCATACCCCAGTCAATAC
TGATGATAACAATGCCTGTACAATTGATGCATGTACAAAAGAAAGTGGGGTAACTCATAC
TCCAGTCAATACTGATGATAACAATGCTTGTACCCTTGACTTCCTGTTCACCATCAACCT
GGCGTTTCCCACACCCCCAATTAACTGNGATGATAGNAATCCATGTACCGTANACTCATG
TTCCAAATTCAACCGGGTTGGTNNCAACACTCCCATCAATGGTGGANGGANAANAAACCC
ATNTNCCGXXXXXXXXXXACTCATACACCAATCAATACAGATGATAATGACAAATGTACA
TTAGATGCTTGTTCACCAAAGACTGGTGTAACTCATACACCAATCAATTGTGATGATGGA
AACAAATGTACAATCAATAGTTGTTCACCATCAGTTGGTTGTATCTCAACACCAGTTTCA
TGTCCAAAACCAAAAGATAAATGTTCAATCTCTCAATGTGATTCAGCCAAAGGTTGCATC
GAAGCCCCAATGAATTGTACCTCTGATAAATGTAATGAAGCATCATGTTGTGATGGTGTT
TGTACCTCAAAACCAATTAGCTGTCCAAAACCAAAGAATAAGTGTCAAGTTGCAAAATGT
GATTTAATTAAAGGTTGTACCGTCTCAAATGTAGTATGTGATGATGGTAATGCTTGTACC
GAAGATTCATGTTGTTCAGACACTGGTAAATGTCAATTCGAACCAATCAAACTTCCAAAG
AATAAAAACAAATGTATCATTTCAAAATGTGATCCAATTAAAGGTACAATCACCAACAGT
ACCGTAAACTGTGAATGTGATGACCTTTGTAACATTGGTGAATGTTGTGAAGATACAGGA
AAATGTAATTACAGACAAAAAGATTGTGATGATAATAATCCAAAAACAGCTGATAGTTGC
GATTCCAAGACTGGTAAATGTATTAACAAACCATATAATGTTATCACAAGTGGTTCTAAT
TTAATCTCTGGTTTAATTGGTGGTCTCATTGGTGGTGGTACAGGAGGTAAAGGTNATTGC
AAAACTGTAAAAATTAAAAACTCTTTT
sequence update 2002.10.25
Translated Amino Acid sequence
qiqlvv*ilqfpvtiiihvllilamt*LVVAILQSMPCTIDACTKSTGVTHTPVNVDDNN
KCTIDTCTKEGGVTHTPVNTDDNNACTLDSCSPSTGVSHTPINCDDNNKCTVDSCSNSTG
CVNTPINCDDSNPCTVDSCNNSTGCVNTPVNVDDNNPCTVDACTKSTGVTHTPVNVDDNN
KCTIDACTKEXGVTHTPVNTDDNNACTIDACTKESGVTHTPVNTDDNNACTLDFLFTINL
AFPTPPINXDDXNPCTVXSCSKFNRVGXNTPINGGXXXNPXX---

---THTPINTDDNDKCTLDACSPKTGVTHTPINCDDGNKCTINSCSPSVGCISTPVSCPK
PKDKCSISQCDSAKGCIEAPMNCTSDKCNEASCCDGVCTSKPISCPKPKNKCQVAKCDLI
KGCTVSNVVCDDGNACTEDSCCSDTGKCQFEPIKLPKNKNKCIISKCDPIKGTITNSTVN
CECDDLCNIGECCEDTGKCNYRQKDCDDNNPKTADSCDSKTGKCINKPYNVITSGSNLIS
GLIGGLIGGGTGGKGXCKTVKIKNSF


Translated Amino Acid sequence (All Frames)
Frame A:
snstgcvntpiscddnnpctvdscddltgccntpinamyhrcly*inrcysypsqcr***
qmyn*ymyqrrwcnsyssqy***qclyp*flltinwcfpypnkl*****mycrfmfklnw
lckysn*l*r**smyrrlmq*fnwlckhssqc****smyrrclyqinrcysypsqcr**q
qmyn*cmyqrxwcnsypsqy***qclyn*cmykrkwgnsyssqy***qclyp*lpvhhqp
gvshtpn*lx**xsmyrxlmfqiqpgwxqhshqwwxxxkpxx---

---THTPINTDDNDKCTLDACSPKTGVTHTPINCDDGNKCTINSCSPSVGCISTPVSCPK
PKDKCSISQCDSAKGCIEAPMNCTSDKCNEASCCDGVCTSKPISCPKPKNKCQVAKCDLI
KGCTVSNVVCDDGNACTEDSCCSDTGKCQFEPIKLPKNKNKCIISKCDPIKGTITNSTVN
CECDDLCNIGECCEDTGKCNYRQKDCDDNNPKTADSCDSKTGKCINKPYNVITSGSNLIS
GLIGGLIGGGTGGKGXCKTVKIKNSF

Frame B:
qiqlvv*ilqfpvtiiihvllilamt*LVVAILQSMPCTIDACTKSTGVTHTPVNVDDNN
KCTIDTCTKEGGVTHTPVNTDDNNACTLDSCSPSTGVSHTPINCDDNNKCTVDSCSNSTG
CVNTPINCDDSNPCTVDSCNNSTGCVNTPVNVDDNNPCTVDACTKSTGVTHTPVNVDDNN
KCTIDACTKEXGVTHTPVNTDDNNACTIDACTKESGVTHTPVNTDDNNACTLDFLFTINL
AFPTPPINXDDXNPCTVXSCSKFNRVGXNTPINGGXXXNPXX---

---lihqsiqmimtnvh*mlvhqrlv*lihqsivmmetnvqsivvhhqlvvsqhqfhvqn
qkinvqslnviqpkvaskpq*ivplinvmkhhvvmvfvpqnqlavqnqrisvklqnvi*l
kvvpsqm*yvmmvmlvpkihvvqtlvnvnsnqsnfqriktnvsfqnviqlkvqsptvp*t
vnvmtfvtlvnvvkiqenvitdkkivmiiiqkqlivaiprlvnvltnhimlsqvvli*sl
v*lvvslvvvqevkviakl*klktl

Frame C:
kfnwlckysnfl*r**smyc*flr*lnwllqysnqchvpsmpvlnqqvllipqsm*miit
nvqlihvpkkvv*lilqsilmitmpvplipahhqlvfpipq*tvmiiinvl*ihvqtqlv
v*ilqltvtivihvp*thaiiqlvv*tlqsmlmiiihvpsmpvpnqqvllipqsm*mitt
nvqlmhvpkxvv*lipqsilmitmpvqlmhvqkkvg*lilqsilmitmlvpltscspstw
rfphpqltxmixihvpxthvpnstglvxtlpsmvxgxxthxp---

---sytnqyr***qmyirclftkdwcnsytnql**wkqmynq*lftiswlylntsfmskt
kr*mfnlsm*fsqrlhrspnelyl**m**siml*wclylktn*lsktke*vssckm*fn*
rlyrlkcsm**w*clyrrfmlfrhw*msirtnqtske*kqmyhfkm*sn*rynhqqyrkl
*m**pl*hw*ml*ryrkm*lqtkrl****skns**lrfqdw*my*qti*cyhkwf*fnlw
fnwwshwwwyrr*rxlqnckn*klf

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

SHA109 (SHA109Q) /CSM/SH/SHA1-A/SHA109Q.Seq.d/ 3134 0.0
SHA152 (SHA152Q) /CSM/SH/SHA1-C/SHA152Q.Seq.d/ 1518 0.0
SHC394 (SHC394Q) /CSM/SH/SHC3-D/SHC394Q.Seq.d/ 1507 0.0
CHR684 (CHR684Q) /CSM/CH/CHR6-D/CHR684Q.Seq.d/ 1507 0.0
CFC807 (CFC807Q) /CSM/CF/CFC8-A/CFC807Q.Seq.d/ 1507 0.0
CHM421 (CHM421Q) /CSM/CH/CHM4-A/CHM421Q.Seq.d/ 1505 0.0
CFC542 (CFC542Q) /CSM/CF/CFC5-B/CFC542Q.Seq.d/ 1499 0.0
SHA221 (SHA221Q) /CSM/SH/SHA2-A/SHA221Q.Seq.d/ 1495 0.0
SHA257 (SHA257Q) /CSM/SH/SHA2-C/SHA257Q.Seq.d/ 1493 0.0
CHS294 (CHS294Q) /CSM/CH/CHS2-D/CHS294Q.Seq.d/ 1481 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

X78948|X78948.1 D.minutum ecmB gene. 50 3e-24 8
CK991559|CK991559.1 EST0111 Eyestalk cDNA library Penaeus monodon cDNA clone ES-N-S03-555-W 5' similar to Histidine-rich glycoprotein precursor (Plasmodium lophurae), mRNA sequence. 30 0.030 4
BQ739629|BQ739629.1 PfESToab48c05.y1 Plasmodium falciparum 3D7 asexual cDNA Plasmodium falciparum cDNA 5' similar to TR:Q9U0J6 Q9U0J6 HYPOTHETICAL 312.5 KD PROTEIN. ;, mRNA sequence. 30 0.053 4
AC116984|AC116984.2 Dictyostelium discoideum chromosome 2 map 2567470-3108875 strain AX4, complete sequence. 50 0.15 1
AF134683|AF134683.1 Plasmodium falciparum strain UNK1 CG2 omega repeat (cg2) gene, partial cds. 34 0.30 3
BX231032|BX231032.1 Danio rerio genomic clone DKEY-254O18, genomic survey sequence. 30 0.45 5
AF467890|AF467890.1 Mus musculus chromosome 17 clone 160i3 strain C57BL/6J, *** SEQUENCING IN PROGRESS ***, 4 ordered pieces. 48 0.60 1
AC154238|AC154238.1 Mus musculus chromosome 17 clone RP24-162O23, WORKING DRAFT SEQUENCE, 9 unordered pieces. 48 0.60 1
AC138642|AC138642.10 Mus musculus chromosome 3, clone RP23-21O7, complete sequence. 48 0.60 1
AC122491|AC122491.3 Mus musculus BAC clone RP24-390D3 from chromosome 3, complete sequence. 48 0.60 1
dna update 2005.12. 2
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

A27020(A27020)DIF-induced prestalk pDd63 protein precursor - sli... 422 0.0
(P11976) RecName: Full=Prestalk protein; AltName: Full=Extracell... 310 e-141
A26838(A26838)prestalk protein precursor - slime mold (Dictyoste... 310 e-140
S44208(S44208) extracellular matrix protein B - Dictyostelium mi... 267 1e-76
AC117072_61(AC117072|pid:none) Dictyostelium discoideum chromoso... 226 3e-74
AC116984_103(AC116984|pid:none) Dictyostelium discoideum chromos... 224 2e-64
(Q54C31) RecName: Full=Protein psiR; Flags: Precursor; 104 3e-40
(Q54CH8) RecName: Full=Protein psiN; Flags: Precursor; 124 1e-26
(Q54G85) RecName: Full=Protein psiJ; Flags: Precursor; 114 1e-23
(Q94494) RecName: Full=Protein psiH; Flags: Precursor; &U67940_... 101 9e-20
protein update 2009. 4.14
PSORT

psg: 0.88 gvh: 0.52 alm: 0.37 top: 0.50 tms: 0.00 mit: 0.23 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.20 pox: 0.00 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

48.0 %: cytoplasmic
28.0 %: nuclear
12.0 %: mitochondrial
8.0 %: cytoskeletal
4.0 %: vacuolar

>> prediction for SHA109 is cyt

5' end seq. ID SHA109F
5' end seq.
>SHA109F.Seq
TCAAATTCAACTGGTTGTGTAAATACTCCAATTTCCTGTGACGATAATAATCCATGTACT
GTTGATTCTTGCGATGACTTAACTGGTTGTTGCAATACTCCAATCAATGCCATGTACCAT
CGATGCCTGTACTAAATCAACAGGTGTTACTCATACCCCAGTCAATGTAGATGATAATAA
CAAATGTACAATTGATACATGTACCAAAGAAGGTGGTGTAACTCATACTCCAGTCAATAC
TGATGATAACAATGCCTGTACCCTTGATTCCTGCTCACCATCAACTGGTGTTTCCCATAC
CCCAATAAACTGTGATGATAATAATAAATGTACTGTAGATTCATGTTCAAACTCAACTGG
TTGTGTAAATACTCCAATTAACTGTGACGATAGTAATCCATGTACCGTAGACTCATGCAA
TAATTCAACTGGTTGTGTAAACACTCCAGTCAATGTTGATGATAATAATCCATGTACCGT
CGATGCCTGTACCAAATCAACAGGTGTTACTCATACCCCAGTCAATGTAGATGATAACAA
CAAATGTACAATTGATGCATGTACCAAAGAANGTGGTGTAACTCATACCCCAGTCAATAC
TGATGATAACAATGCCTGTACAATTGATGCATGTACAAAAGAAAGTGGGGTAACTCATAC
TCCAGTCAATACTGATGATAACAATGCTTGTACCCTTGACTTCCTGTTCACCATCAACCT
GGCGTTTCCCACACCCCCAATTAACTGNGATGATAGNAATCCATGTACCGTANACTCATG
TTCCAAATTCAACCGGGTTGGTNNCAACACTCCCATCAATGGTGGANGGANAANAAACCC
ATNTNCCGNNNNNNNNNN
Length of 5' end seq. 858
3' end seq. ID SHA109Z
3' end seq.
>SHA109Z.Seq
NNNNNNNNNNACTCATACACCAATCAATACAGATGATAATGACAAATGTACATTAGATGC
TTGTTCACCAAAGACTGGTGTAACTCATACACCAATCAATTGTGATGATGGAAACAAATG
TACAATCAATAGTTGTTCACCATCAGTTGGTTGTATCTCAACACCAGTTTCATGTCCAAA
ACCAAAAGATAAATGTTCAATCTCTCAATGTGATTCAGCCAAAGGTTGCATCGAAGCCCC
AATGAATTGTACCTCTGATAAATGTAATGAAGCATCATGTTGTGATGGTGTTTGTACCTC
AAAACCAATTAGCTGTCCAAAACCAAAGAATAAGTGTCAAGTTGCAAAATGTGATTTAAT
TAAAGGTTGTACCGTCTCAAATGTAGTATGTGATGATGGTAATGCTTGTACCGAAGATTC
ATGTTGTTCAGACACTGGTAAATGTCAATTCGAACCAATCAAACTTCCAAAGAATAAAAA
CAAATGTATCATTTCAAAATGTGATCCAATTAAAGGTACAATCACCAACAGTACCGTAAA
CTGTGAATGTGATGACCTTTGTAACATTGGTGAATGTTGTGAAGATACAGGAAAATGTAA
TTACAGACAAAAAGATTGTGATGATAATAATCCAAAAACAGCTGATAGTTGCGATTCCAA
GACTGGTAAATGTATTAACAAACCATATAATGTTATCACAAGTGGTTCTAATTTAATCTC
TGGTTTAATTGGTGGTCTCATTGGTGGTGGTACAGGAGGTAAAGGTNATTGCAAAACTGT
AAAAATTAAAAACTCTTTT
Length of 3' end seq. 799
Connected seq. ID SHA109P
Connected seq.
>SHA109P.Seq
TCAAATTCAACTGGTTGTGTAAATACTCCAATTTCCTGTGACGATAATAATCCATGTACT
GTTGATTCTTGCGATGACTTAACTGGTTGTTGCAATACTCCAATCAATGCCATGTACCAT
CGATGCCTGTACTAAATCAACAGGTGTTACTCATACCCCAGTCAATGTAGATGATAATAA
CAAATGTACAATTGATACATGTACCAAAGAAGGTGGTGTAACTCATACTCCAGTCAATAC
TGATGATAACAATGCCTGTACCCTTGATTCCTGCTCACCATCAACTGGTGTTTCCCATAC
CCCAATAAACTGTGATGATAATAATAAATGTACTGTAGATTCATGTTCAAACTCAACTGG
TTGTGTAAATACTCCAATTAACTGTGACGATAGTAATCCATGTACCGTAGACTCATGCAA
TAATTCAACTGGTTGTGTAAACACTCCAGTCAATGTTGATGATAATAATCCATGTACCGT
CGATGCCTGTACCAAATCAACAGGTGTTACTCATACCCCAGTCAATGTAGATGATAACAA
CAAATGTACAATTGATGCATGTACCAAAGAANGTGGTGTAACTCATACCCCAGTCAATAC
TGATGATAACAATGCCTGTACAATTGATGCATGTACAAAAGAAAGTGGGGTAACTCATAC
TCCAGTCAATACTGATGATAACAATGCTTGTACCCTTGACTTCCTGTTCACCATCAACCT
GGCGTTTCCCACACCCCCAATTAACTGNGATGATAGNAATCCATGTACCGTANACTCATG
TTCCAAATTCAACCGGGTTGGTNNCAACACTCCCATCAATGGTGGANGGANAANAAACCC
ATNTNCCG----------ACTCATACACCAATCAATACAGATGATAATGACAAATGTACA
TTAGATGCTTGTTCACCAAAGACTGGTGTAACTCATACACCAATCAATTGTGATGATGGA
AACAAATGTACAATCAATAGTTGTTCACCATCAGTTGGTTGTATCTCAACACCAGTTTCA
TGTCCAAAACCAAAAGATAAATGTTCAATCTCTCAATGTGATTCAGCCAAAGGTTGCATC
GAAGCCCCAATGAATTGTACCTCTGATAAATGTAATGAAGCATCATGTTGTGATGGTGTT
TGTACCTCAAAACCAATTAGCTGTCCAAAACCAAAGAATAAGTGTCAAGTTGCAAAATGT
GATTTAATTAAAGGTTGTACCGTCTCAAATGTAGTATGTGATGATGGTAATGCTTGTACC
GAAGATTCATGTTGTTCAGACACTGGTAAATGTCAATTCGAACCAATCAAACTTCCAAAG
AATAAAAACAAATGTATCATTTCAAAATGTGATCCAATTAAAGGTACAATCACCAACAGT
ACCGTAAACTGTGAATGTGATGACCTTTGTAACATTGGTGAATGTTGTGAAGATACAGGA
AAATGTAATTACAGACAAAAAGATTGTGATGATAATAATCCAAAAACAGCTGATAGTTGC
GATTCCAAGACTGGTAAATGTATTAACAAACCATATAATGTTATCACAAGTGGTTCTAAT
TTAATCTCTGGTTTAATTGGTGGTCTCATTGGTGGTGGTACAGGAGGTAAAGGTNATTGC
AAAACTGTAAAAATTAAAAACTCTTTT
Length of connected seq. 1637
Full length Seq ID -
Full length Seq. -
Length of full length seq. -