CHM421
Library CH
(Link to library)
Clone ID CHM421
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U11926-1
Original site URL
Representative seq. ID CHM421P
(Link to Original site)
Representative DNA sequence
>CHM421 (CHM421Q) /CSM/CH/CHM4-A/CHM421Q.Seq.d/
ANTGTAGATGATAACAACAAATGTACAATNGATGCATGTACCAAAGAAAGNGGNGTAACT
CATACCCCAGTCAATACTGATGATAACAATGCCTGTACAATTGATGCATGTACAAAAGAA
AGTGGTGTAACTCATACTCCANTCAATACTGATGATAACAATGCTTGTACCCTTGACTCC
TGTTCACCATCAACTGGCGTTTCCCACACCCCAATTAACTGTGATGATAGTAATCCATGT
ACCGTAGACTCATGTTCAAATTCAACCGGTTGTTGCAACACTCCAATCAATGTTGATGAT
AATAATCCATGTACTGTAGATTCATGTACCAAACCAACAGGTGTCACTCATACCCCANTC
AATGTAGATGATAACAACAAATGTACAATTGATGCATGTACCAAAGAAAGTGGTGTAACT
CATACTCCAGTCAATACTGATGATAACAATGCATGTACCCTTGATTCCTGTTCACCATCA
ACTGGTATTTCCCACGCTCCAATTAACTGTGATGATAGTAATCCATGTACTATAGATTCA
TGCAATAATTCAACTGGTTGTTGCAACACTCCAATCAATGTTGATGATAATAATCCATGT
ACCGTANATTCATGTACTAAATCAAGGTGGTGTCACTCATACTCCANTCATGTCGATGAT
AATAACAAATGTACCACTGATGCATGTACTAAAAAAAGNGGTGTAACTCATACTCCAATC
TCTTGTGATGATAATAATGCCTGTACAATTGATTCATGTTCAAACTCACTNGNTGTGTAA
ACACTCCATTAACTNTGATGATANAAAACCCATGGTNCTGTNGATACATGTACAAAANAA
AAAAGXXXXXXXXXXACTCATACACCAATCAATACAGATGATAATAACAAATGTACATTA
GATGCTTGTTCACCAAAGACTGGTGTAACTCATACACCAATCAATTGTGATGATGGAAAC
AAATGTACAATCAATAGTTGTTCACCATCAGTTGGTTGTATCTCAACACCAGTTTCATGT
CCAAAACCAAAAGATAAATGTTCAATCTCTCAATGTGATTCAGCCAAAGGTTGCATCGAA
GTCCCAATGAATTGTACCTCTGATAAATGTAATGAAGCATCATGTTGTGATGGTGTTTGT
ACCTCAAAACCAATTAGCTGTCCAAAACCAAAGAATAAGTGTCAAGTTGCAAAATGTGAT
TTAATTAAAGGTTGTACCGTCTCAAATGTAGTATGTGATGATGGTAATGCTTGTACCGAA
GATTCATGTTGTTCAGACACTGGTAAATGTCAATTCGAACCAATCAAACTTCCAAAGAAT
AAAAACAAATGTATCATTTCAAAATGTGATCCAATTAAAGGTACAATCACCAACAGTACC
GTAAACTGTGAATGTGATGACCTTTGTAACATTGGTGAATGTTGTGAAGATACAGGAAAA
TGTAATTACAGACAAAAAGATTGTGATGATAATAATCCAAAAACAGCTGATAGTTGCGAT
TCCAAGACTGGTAAATGTATTAACAAACCATATAATGTTATCACAAGTGGTTCTAATTTA
ATCTCTGGTTTAATTGGTGGCCTCATTGGTGGTGGTACAGGAGGTAAAGGGATTGCAAAA
CTTGTAAAAATTAAAAACTCTTTT
sequence update 2002.10.25
Translated Amino Acid sequence
x*mittnvqxmhvpkkxx*lipqsilmitmpvqlmhvqkkvv*lilxsilmitmlvpltp
vhhqlafptpqltvmivihvp*thvqiqpvvatlqsmlmiiihvl*ihvpnqqvslipxs
m*mittnvqlmhvpkkvv*lilqsilmitmhvplipvhhqlvfptlqltvmivihvl*ih
aiiqlvvatlqsmlmiiihvpxihvlnqggvthtpxmsmiitnvplmhvlkkxv*LILQS
LVMIIMPVQLIHVQTHXXCKHSINXDDXKPMVLXIHVQXKK---

---THTPINTDDNNKCTLDACSPKTGVTHTPINCDDGNKCTINSCSPSVGCISTPVSCPK
PKDKCSISQCDSAKGCIEVPMNCTSDKCNEASCCDGVCTSKPISCPKPKNKCQVAKCDLI
KGCTVSNVVCDDGNACTEDSCCSDTGKCQFEPIKLPKNKNKCIISKCDPIKGTITNSTVN
CECDDLCNIGECCEDTGKCNYRQKDCDDNNPKTADSCDSKTGKCINKPYNVITSGSNLIS
GLIGGLIGGGTGGKGIAKLVKIKNSF


Translated Amino Acid sequence (All Frames)
Frame A:
xvddnnkctxdactkexgvthtpvntddnnactidactkesgvthtpxntddnnactlds
cspstgvshtpincddsnpctvdscsnstgccntpinvddnnpctvdsctkptgvthtpx
nvddnnkctidactkesgvthtpvntddnnactldscspstgishapincddsnpctids
cnnstgccntpinvddnnpctvxsctksrwchsysxhvddnnkcttdactkkxgvthtpi
scddnnactidscsnslxv*tlh*l**xkthgxvdtctkxk---

---THTPINTDDNNKCTLDACSPKTGVTHTPINCDDGNKCTINSCSPSVGCISTPVSCPK
PKDKCSISQCDSAKGCIEVPMNCTSDKCNEASCCDGVCTSKPISCPKPKNKCQVAKCDLI
KGCTVSNVVCDDGNACTEDSCCSDTGKCQFEPIKLPKNKNKCIISKCDPIKGTITNSTVN
CECDDLCNIGECCEDTGKCNYRQKDCDDNNPKTADSCDSKTGKCINKPYNVITSGSNLIS
GLIGGLIGGGTGGKGIAKLVKIKNSF

Frame B:
x*mittnvqxmhvpkkxx*lipqsilmitmpvqlmhvqkkvv*lilxsilmitmlvpltp
vhhqlafptpqltvmivihvp*thvqiqpvvatlqsmlmiiihvl*ihvpnqqvslipxs
m*mittnvqlmhvpkkvv*lilqsilmitmhvplipvhhqlvfptlqltvmivihvl*ih
aiiqlvvatlqsmlmiiihvpxihvlnqggvthtpxmsmiitnvplmhvlkkxv*LILQS
LVMIIMPVQLIHVQTHXXCKHSINXDDXKPMVLXIHVQXKK---

---lihqsiqmiitnvh*mlvhqrlv*lihqsivmmetnvqsivvhhqlvvsqhqfhvqn
qkinvqslnviqpkvasksq*ivplinvmkhhvvmvfvpqnqlavqnqrisvklqnvi*l
kvvpsqm*yvmmvmlvpkihvvqtlvnvnsnqsnfqriktnvsfqnviqlkvqsptvp*t
vnvmtfvtlvnvvkiqenvitdkkivmiiiqkqlivaiprlvnvltnhimlsqvvli*sl
v*lvaslvvvqevkglqnl*klktl

Frame C:
cr**qqmynxcmyqrkxxnsypsqy***qclyn*cmykrkwcnsysxqy***qclyp*ll
ftinwrfphpn*l****smyrrlmfkfnrllqhsnqc****smycrfmyqtnrchsypxq
cr**qqmyn*cmyqrkwcnsyssqy***qcmyp*flftinwyfprsn*l****smyyrfm
q*fnwllqhsnqc****smyrxfmy*ikvvslilxscr***qmyh*cmy*kkxcnsysnl
l****clyn*fmfkltxcvntpltxmixnpwxcxymykxkk---

---sytnqyr***qmyirclftkdwcnsytnql**wkqmynq*lftiswlylntsfmskt
kr*mfnlsm*fsqrlhrspnelyl**m**siml*wclylktn*lsktke*vssckm*fn*
rlyrlkcsm**w*clyrrfmlfrhw*msirtnqtske*kqmyhfkm*sn*rynhqqyrkl
*m**pl*hw*ml*ryrkm*lqtkrl****skns**lrfqdw*my*qti*cyhkwf*fnlw
fnwwphwwwyrr*rdcktckn*klf

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

CHM421 (CHM421Q) /CSM/CH/CHM4-A/CHM421Q.Seq.d/ 3065 0.0
CFC542 (CFC542Q) /CSM/CF/CFC5-B/CFC542Q.Seq.d/ 1542 0.0
SHA152 (SHA152Q) /CSM/SH/SHA1-C/SHA152Q.Seq.d/ 1526 0.0
SHC394 (SHC394Q) /CSM/SH/SHC3-D/SHC394Q.Seq.d/ 1520 0.0
CHS294 (CHS294Q) /CSM/CH/CHS2-D/CHS294Q.Seq.d/ 1518 0.0
CHR684 (CHR684Q) /CSM/CH/CHR6-D/CHR684Q.Seq.d/ 1515 0.0
CFC807 (CFC807Q) /CSM/CF/CFC8-A/CFC807Q.Seq.d/ 1515 0.0
SHA221 (SHA221Q) /CSM/SH/SHA2-A/SHA221Q.Seq.d/ 1509 0.0
SHA109 (SHA109Q) /CSM/SH/SHA1-A/SHA109Q.Seq.d/ 1505 0.0
SHA257 (SHA257Q) /CSM/SH/SHA2-C/SHA257Q.Seq.d/ 1501 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

X78948|X78948.1 D.minutum ecmB gene. 70 1e-31 7
AC117072|AC117072.3 Dictyostelium discoideum chromosome 2 map 3879572-4071762 strain AX4, complete sequence. 60 2e-10 14
AE014845|AE014845.1 Plasmodium falciparum 3D7 chromosome 12, section 2 of 9 of the complete sequence. 32 6e-05 18
AC116984|AC116984.2 Dictyostelium discoideum chromosome 2 map 2567470-3108875 strain AX4, complete sequence. 50 9e-05 19
AJ548837|AJ548837.1 Dictyostelium discoideum mRNA for extracellular signalling molecule DicA (dicA gene). 44 1e-04 3
AJ548838|AJ548838.1 Dictyostelium discoideum dicA gene for extracellular signalling molecule DicA, exons 1-2. 44 2e-04 3
CR382399|CR382399.1 Plasmodium falciparum chromosome 6, complete sequence; segment 2/5. 36 0.003 21
AF134658|AF134658.1 Plasmodium falciparum strain SEN16 from Senegal CG2 omega repeat (cg2) gene, partial cds. 32 0.007 4
AC116957|AC116957.2 Dictyostelium discoideum chromosome 2 map 1685067-2090751 strain AX4, complete sequence. 36 0.008 21
AF134660|AF134660.1 Plasmodium falciparum strain COM6 from Comoros CG2 omega repeat (cg2) gene, partial cds. 32 0.009 4
dna update 2005. 4.24
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

A27020(A27020)DIF-induced prestalk pDd63 protein precursor - sli... 746 0.0
A26838(A26838)prestalk protein precursor - slime mold (Dictyoste... 565 e-159
(P11976) RecName: Full=Prestalk protein; AltName: Full=Extracell... 565 e-159
S44208(S44208) extracellular matrix protein B - Dictyostelium mi... 333 8e-90
AC117072_61(AC117072|pid:none) Dictyostelium discoideum chromoso... 331 4e-89
AC116984_103(AC116984|pid:none) Dictyostelium discoideum chromos... 318 3e-85
(Q54C31) RecName: Full=Protein psiR; Flags: Precursor; 186 3e-45
(Q54C32) RecName: Full=Protein psiQ; Flags: Precursor; 157 8e-37
(Q54G85) RecName: Full=Protein psiJ; Flags: Precursor; 143 2e-32
(Q54CH8) RecName: Full=Protein psiN; Flags: Precursor; 141 8e-32
protein update 2009. 4. 8
PSORT

psg: 0.97 gvh: 0.50 alm: 0.30 top: 0.57 tms: 0.07 mit: 0.30 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.60 pox: 0.00 px2: 0.00 vac: 0.33 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 1.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 1.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 0.00

28.0 %: cytoplasmic
28.0 %: nuclear
20.0 %: mitochondrial
8.0 %: plasma membrane
8.0 %: endoplasmic reticulum
4.0 %: vesicles of secretory system
4.0 %: peroxisomal

>> prediction for CHM421 is cyt

5' end seq. ID CHM421F
5' end seq.
>CHM421F.Seq
ANTGTAGATGATAACAACAAATGTACAATNGATGCATGTACCAAAGAAAGNGGNGTAACT
CATACCCCAGTCAATACTGATGATAACAATGCCTGTACAATTGATGCATGTACAAAAGAA
AGTGGTGTAACTCATACTCCANTCAATACTGATGATAACAATGCTTGTACCCTTGACTCC
TGTTCACCATCAACTGGCGTTTCCCACACCCCAATTAACTGTGATGATAGTAATCCATGT
ACCGTAGACTCATGTTCAAATTCAACCGGTTGTTGCAACACTCCAATCAATGTTGATGAT
AATAATCCATGTACTGTAGATTCATGTACCAAACCAACAGGTGTCACTCATACCCCANTC
AATGTAGATGATAACAACAAATGTACAATTGATGCATGTACCAAAGAAAGTGGTGTAACT
CATACTCCAGTCAATACTGATGATAACAATGCATGTACCCTTGATTCCTGTTCACCATCA
ACTGGTATTTCCCACGCTCCAATTAACTGTGATGATAGTAATCCATGTACTATAGATTCA
TGCAATAATTCAACTGGTTGTTGCAACACTCCAATCAATGTTGATGATAATAATCCATGT
ACCGTANATTCATGTACTAAATCAAGGTGGTGTCACTCATACTCCANTCATGTCGATGAT
AATAACAAATGTACCACTGATGCATGTACTAAAAAAAGNGGTGTAACTCATACTCCAATC
TCTTGTGATGATAATAATGCCTGTACAATTGATTCATGTTCAAACTCACTNGNTGTGTAA
ACACTCCATTAACTNTGATGATANAAAACCCATGGTNCTGTNGATACATGTACAAAANAA
AAAAGNNNNNNNNNN
Length of 5' end seq. 855
3' end seq. ID CHM421Z
3' end seq.
>CHM421Z.Seq
NNNNNNNNNNACTCATACACCAATCAATACAGATGATAATAACAAATGTACATTAGATGC
TTGTTCACCAAAGACTGGTGTAACTCATACACCAATCAATTGTGATGATGGAAACAAATG
TACAATCAATAGTTGTTCACCATCAGTTGGTTGTATCTCAACACCAGTTTCATGTCCAAA
ACCAAAAGATAAATGTTCAATCTCTCAATGTGATTCAGCCAAAGGTTGCATCGAAGTCCC
AATGAATTGTACCTCTGATAAATGTAATGAAGCATCATGTTGTGATGGTGTTTGTACCTC
AAAACCAATTAGCTGTCCAAAACCAAAGAATAAGTGTCAAGTTGCAAAATGTGATTTAAT
TAAAGGTTGTACCGTCTCAAATGTAGTATGTGATGATGGTAATGCTTGTACCGAAGATTC
ATGTTGTTCAGACACTGGTAAATGTCAATTCGAACCAATCAAACTTCCAAAGAATAAAAA
CAAATGTATCATTTCAAAATGTGATCCAATTAAAGGTACAATCACCAACAGTACCGTAAA
CTGTGAATGTGATGACCTTTGTAACATTGGTGAATGTTGTGAAGATACAGGAAAATGTAA
TTACAGACAAAAAGATTGTGATGATAATAATCCAAAAACAGCTGATAGTTGCGATTCCAA
GACTGGTAAATGTATTAACAAACCATATAATGTTATCACAAGTGGTTCTAATTTAATCTC
TGGTTTAATTGGTGGCCTCATTGGTGGTGGTACAGGAGGTAAAGGGATTGCAAAACTTGT
AAAAATTAAAAACTCTTTT
Length of 3' end seq. 799
Connected seq. ID CHM421P
Connected seq.
>CHM421P.Seq
ANTGTAGATGATAACAACAAATGTACAATNGATGCATGTACCAAAGAAAGNGGNGTAACT
CATACCCCAGTCAATACTGATGATAACAATGCCTGTACAATTGATGCATGTACAAAAGAA
AGTGGTGTAACTCATACTCCANTCAATACTGATGATAACAATGCTTGTACCCTTGACTCC
TGTTCACCATCAACTGGCGTTTCCCACACCCCAATTAACTGTGATGATAGTAATCCATGT
ACCGTAGACTCATGTTCAAATTCAACCGGTTGTTGCAACACTCCAATCAATGTTGATGAT
AATAATCCATGTACTGTAGATTCATGTACCAAACCAACAGGTGTCACTCATACCCCANTC
AATGTAGATGATAACAACAAATGTACAATTGATGCATGTACCAAAGAAAGTGGTGTAACT
CATACTCCAGTCAATACTGATGATAACAATGCATGTACCCTTGATTCCTGTTCACCATCA
ACTGGTATTTCCCACGCTCCAATTAACTGTGATGATAGTAATCCATGTACTATAGATTCA
TGCAATAATTCAACTGGTTGTTGCAACACTCCAATCAATGTTGATGATAATAATCCATGT
ACCGTANATTCATGTACTAAATCAAGGTGGTGTCACTCATACTCCANTCATGTCGATGAT
AATAACAAATGTACCACTGATGCATGTACTAAAAAAAGNGGTGTAACTCATACTCCAATC
TCTTGTGATGATAATAATGCCTGTACAATTGATTCATGTTCAAACTCACTNGNTGTGTAA
ACACTCCATTAACTNTGATGATANAAAACCCATGGTNCTGTNGATACATGTACAAAANAA
AAAAG----------ACTCATACACCAATCAATACAGATGATAATAACAAATGTACATTA
GATGCTTGTTCACCAAAGACTGGTGTAACTCATACACCAATCAATTGTGATGATGGAAAC
AAATGTACAATCAATAGTTGTTCACCATCAGTTGGTTGTATCTCAACACCAGTTTCATGT
CCAAAACCAAAAGATAAATGTTCAATCTCTCAATGTGATTCAGCCAAAGGTTGCATCGAA
GTCCCAATGAATTGTACCTCTGATAAATGTAATGAAGCATCATGTTGTGATGGTGTTTGT
ACCTCAAAACCAATTAGCTGTCCAAAACCAAAGAATAAGTGTCAAGTTGCAAAATGTGAT
TTAATTAAAGGTTGTACCGTCTCAAATGTAGTATGTGATGATGGTAATGCTTGTACCGAA
GATTCATGTTGTTCAGACACTGGTAAATGTCAATTCGAACCAATCAAACTTCCAAAGAAT
AAAAACAAATGTATCATTTCAAAATGTGATCCAATTAAAGGTACAATCACCAACAGTACC
GTAAACTGTGAATGTGATGACCTTTGTAACATTGGTGAATGTTGTGAAGATACAGGAAAA
TGTAATTACAGACAAAAAGATTGTGATGATAATAATCCAAAAACAGCTGATAGTTGCGAT
TCCAAGACTGGTAAATGTATTAACAAACCATATAATGTTATCACAAGTGGTTCTAATTTA
ATCTCTGGTTTAATTGGTGGCCTCATTGGTGGTGGTACAGGAGGTAAAGGGATTGCAAAA
CTTGTAAAAATTAAAAACTCTTTT
Length of connected seq. 1634
Full length Seq ID -
Full length Seq. -
Length of full length seq. -