CHS106
Library CH
(Link to library)
Clone ID CHS106
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16381-1|Contig-U16460-1
Original site URL
Representative seq. ID CHS106P
(Link to Original site)
Representative DNA sequence
>CHS106 (CHS106Q) /CSM/CH/CHS1-A/CHS106Q.Seq.d/
ATGGTAATAGACAATGTGTTGAAGATCAAATTACATTGCCACCATTTGATAAATGTGATA
ATGTCCATTGTCCAAAAGGATTTAATTGCAAATATGATTGGGAAAAAGATCTTGCTCTTT
GTGTTCCATGGAGACCATATCCACCAGTTTGTAGAACTAGATGTCCAGAAGGTCATGAAT
GTAAAGTTGATGAATGGGGTAAAGAATGTTGCGTAAAGATCAAATGTGATGATATTTGTG
ACTTGCGCTGTCCAAAGGGTCATGAATGCAAGATCAAACATGATGGTAGTAAATGCTGTG
TCCGTTCATGGAGACCAAGACCACATAAACCACATCCACGTCCACCAATCTGCAGATTAA
GATGTCCACCAGGTCATGAATGCAAACATGATGAACATGGTAAAGAATGTTGCGTTAAAA
AACGTCATCATGATAGATGTGACCTCAAATGTAAGAGAGGTTATGAATGTAAAATCAXXX
XXXXXXXGCTTCATCAGCACCAGCCGCCCCAGTTGCACCAGCTGTTTCATCCACTCCAGT
TGAATCAAAGAAAGGTCCAGGTTTAGGTGCAGTTTTCGGTGAACTTAGCAAAGGTGATGG
TGTTACCAGTGGTTTAAAAAAAGTTACCAGCGATATNNAATCCAAAAATTTCACCGNCAA
ATCATCAGTTGTTAAAGCTGCTGATACTAAAGTCGCCANAGTTGATGCTCCATCTAGACC
AGCCGTTTTTGCTNTCCAAGGTAACAAATGGTCCATTGAATATCAAGTTANCAACAAAGA
AATTGTCATTGCCGAGCCAGATAGTCGTCAAACTGTTTACATTTTCCAATGTGTAAACTC
TTTAGTTCAAATCAAAGGTAAAGTTAATGCAATTACTCTTGATGGTTGTAAAAAGACTTC
AATCGTTTTCGAAAATGCCATTTCCTCTTGTGAAGTTGTCAATTGTAATGGTGTTGAAAT
CCAAGTCACTGGTCGTGTACCATCAATTGCTATCGATAAGACAAGTGGTTGTCAA
sequence update 2002.10.25
Translated Amino Acid sequence
GNRQCVEDQITLPPFDKCDNVHCPKGFNCKYDWEKDLALCVPWRPYPPVCRTRCPEGHEC
KVDEWGKECCVKIKCDDICDLRCPKGHECKIKHDGSKCCVRSWRPRPHKPHPRPPICRLR
CPPGHECKHDEHGKECCVKKRHHDRCDLKCKRGYECKI---

---ASSAPAAPVAPAVSSTPVESKKGPGLGAVFGELSKGDGVTSGLKKVTSDXXSKNFTX
KSSVVKAADTKVAXVDAPSRPAVFAXQGNKWSIEYQVXNKEIVIAEPDSRQTVYIFQCVN
SLVQIKGKVNAITLDGCKKTSIVFENAISSCEVVNCNGVEIQVTGRVPSIAIDKTSGCQ


Translated Amino Acid sequence (All Frames)
Frame A:
mvidnvlkiklhchhlinvimsivqkdlianmigkkillfvfhgdhihqfveldvqkvmn
vklmngvknva*rsnvmifvtcavqrvmnarsnmmvvnavsvhgdqdhinhihvhqsad*
dvhqvmnanmmnmvknvalknvimidvtsnvrevmnvks---

---ASSAPAAPVAPAVSSTPVESKKGPGLGAVFGELSKGDGVTSGLKKVTSDXXSKNFTX
KSSVVKAADTKVAXVDAPSRPAVFAXQGNKWSIEYQVXNKEIVIAEPDSRQTVYIFQCVN
SLVQIKGKVNAITLDGCKKTSIVFENAISSCEVVNCNGVEIQVTGRVPSIAIDKTSGCQ

Frame B:
w**tmc*rsnyiati**m**cplskri*lqi*lgkrscslcsmetistsl*n*msrrs*m
*s**mg*rmlrkdqm**yl*lalskgs*mqdqt*w**mlcpfmetktt*ttststnlqik
mstrs*mqt**tw*rmlr*ktss**m*pqm*erl*m*n---

---lhqhqppqlhqlfhplqlnqrkvqv*vqfsvnlakvmvlpvv*kklpaixnpkispx
nhqllkllilkspxlmlhldqpfllskvtngplniklxtkklslpsqivvklftfsnv*t
l*fkskvklmqlllmvvkrlqsfskmpfplvklsivmvlkskslvvyhqllsirqvvv

Frame C:
GNRQCVEDQITLPPFDKCDNVHCPKGFNCKYDWEKDLALCVPWRPYPPVCRTRCPEGHEC
KVDEWGKECCVKIKCDDICDLRCPKGHECKIKHDGSKCCVRSWRPRPHKPHPRPPICRLR
CPPGHECKHDEHGKECCVKKRHHDRCDLKCKRGYECKI---

---fistsrpsctscfihss*ikersrfrcsfr*t*qr*wcyqwfkksyqryxiqkfhrq
iisc*sc*y*srxs*csi*tsrfcxpr*qmvh*issxqqrnchcrar*ssnclhfpmckl
fssnqr*s*cnys*wl*kdfnrfrkchfll*scql*wc*npshwsctincyr*dkwls

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

CHS106 (CHS106Q) /CSM/CH/CHS1-A/CHS106Q.Seq.d/ 1875 0.0
VFL760 (VFL760Q) /CSM/VF/VFL7-C/VFL760Q.Seq.d/ 961 0.0
VFA652 (VFA652Q) /CSM/VF/VFA6-C/VFA652Q.Seq.d/ 961 0.0
VFA148 (VFA148Q) /CSM/VF/VFA1-B/VFA148Q.Seq.d/ 961 0.0
SFE296 (SFE296Q) /CSM/SF/SFE2-D/SFE296Q.Seq.d/ 961 0.0
SFD161 (SFD161Q) /CSM/SF/SFD1-C/SFD161Q.Seq.d/ 961 0.0
CFF143 (CFF143Q) /CSM/CF/CFF1-B/CFF143Q.Seq.d/ 961 0.0
AFE320 (AFE320Q) /CSM/AF/AFE3-A/AFE320Q.Seq.d/ 961 0.0
AFE312 (AFE312Q) /CSM/AF/AFE3-A/AFE312Q.Seq.d/ 961 0.0
AFM838 (AFM838Q) /CSM/AF/AFM8-B/AFM838Q.Seq.d/ 954 0.0

own update 2009. 4. 4
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

U43027|U43027.1 Dictyostelium discoideum cyclase associated protein mRNA, complete cds. 735 0.0 2
X51892|X51892.1 Dictyostelium discoideum SP60 gene for spore coat protein. 938 0.0 1
M26239|M26239.1 D.discoideum spore coat protein SP60 gene, complete cds. 938 0.0 1
AC116977|AC116977.2 Dictyostelium discoideum chromosome 2 map 5515173-5817331 strain AX4, complete sequence. 930 0.0 1
X52105|X52105.1 Dictyostelium discoideum SP60 gene for spore coat protein. 696 0.0 1
U25144|U25144.1 Dictyostelium discoideum spore coat protein SP87 (PspD) gene, complete cds. 66 2e-15 4
AC117267|AC117267.2 Dictyostelium discoideum chromosome 2 map 5836255-5862024 strain AX4, complete sequence. 66 7e-14 5
CZ870853|CZ870853.1 OC__Ba0269E16.f OC__Ba Oryza coarctata genomic clone OC__Ba0269E16 5', genomic survey sequence. 54 0.006 1
AC139914|AC139914.2 Rattus norvegicus clone CH230-428K20, WORKING DRAFT SEQUENCE, 48 unordered pieces. 50 0.092 1
CF611007| ENSANGP00000012398 - Anopheles gambiae, mRNA sequence. 48 0.36 1
dna update 2005.11.26
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(P15270) RecName: Full=Spore coat protein SP60; Flags: Precursor... 293 5e-78
(P54654) RecName: Full=Adenylyl cyclase-associated protein; ... 270 7e-71
X52105_1(X52105|pid:none) Dictyostelium discoideum SP60 gene for... 217 6e-55
AK071446_1(AK071446|pid:none) Oryza sativa Japonica Group cDNA c... 139 1e-31
AB014884_1(AB014884|pid:none) Gossypium hirsutum GhCAP mRNA for ... 137 6e-31
EU106855_1(EU106855|pid:none) Gossypium arboreum strain DPL972 a... 137 6e-31
EU106853_1(EU106853|pid:none) Gossypium arboreum strain DPL971 a... 137 6e-31
AC006638_1(AC006638|pid:none) Caenorhabditis elegans cosmid F41G... 134 6e-30
AC006638_2(AC006638|pid:none) Caenorhabditis elegans cosmid F41G... 134 6e-30
AB014759_1(AB014759|pid:none) Arabidopsis thaliana mRNA for Atca... 132 2e-29
protein update 2009. 4.13
PSORT

psg: 0.75 gvh: 0.38 alm: 0.46 top: 0.53 tms: 0.00 mit: 0.30 mip: 0.02
nuc: 0.02 erl: 0.00 erm: 0.20 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.50
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

48.0 %: nuclear
20.0 %: cytoplasmic
12.0 %: cytoskeletal
8.0 %: Golgi
4.0 %: mitochondrial
4.0 %: vesicles of secretory system
4.0 %: peroxisomal

>> prediction for CHS106 is nuc

5' end seq. ID CHS106F
5' end seq.
>CHS106F.Seq
ATGGTAATAGACAATGTGTTGAAGATCAAATTACATTGCCACCATTTGATAAATGTGATA
ATGTCCATTGTCCAAAAGGATTTAATTGCAAATATGATTGGGAAAAAGATCTTGCTCTTT
GTGTTCCATGGAGACCATATCCACCAGTTTGTAGAACTAGATGTCCAGAAGGTCATGAAT
GTAAAGTTGATGAATGGGGTAAAGAATGTTGCGTAAAGATCAAATGTGATGATATTTGTG
ACTTGCGCTGTCCAAAGGGTCATGAATGCAAGATCAAACATGATGGTAGTAAATGCTGTG
TCCGTTCATGGAGACCAAGACCACATAAACCACATCCACGTCCACCAATCTGCAGATTAA
GATGTCCACCAGGTCATGAATGCAAACATGATGAACATGGTAAAGAATGTTGCGTTAAAA
AACGTCATCATGATAGATGTGACCTCAAATGTAAGAGAGGTTATGAATGTAAAATCANNN
NNNNNNN
Length of 5' end seq. 487
3' end seq. ID CHS106Z
3' end seq.
>CHS106Z.Seq
NNNNNNNNNNGCTTCATCAGCACCAGCCGCCCCAGTTGCACCAGCTGTTTCATCCACTCC
AGTTGAATCAAAGAAAGGTCCAGGTTTAGGTGCAGTTTTCGGTGAACTTAGCAAAGGTGA
TGGTGTTACCAGTGGTTTAAAAAAAGTTACCAGCGATATNNAATCCAAAAATTTCACCGN
CAAATCATCAGTTGTTAAAGCTGCTGATACTAAAGTCGCCANAGTTGATGCTCCATCTAG
ACCAGCCGTTTTTGCTNTCCAAGGTAACAAATGGTCCATTGAATATCAAGTTANCAACAA
AGAAATTGTCATTGCCGAGCCAGATAGTCGTCAAACTGTTTACATTTTCCAATGTGTAAA
CTCTTTAGTTCAAATCAAAGGTAAAGTTAATGCAATTACTCTTGATGGTTGTAAAAAGAC
TTCAATCGTTTTCGAAAATGCCATTTCCTCTTGTGAAGTTGTCAATTGTAATGGTGTTGA
AATCCAAGTCACTGGTCGTGTACCATCAATTGCTATCGATAAGACAAGTGGTTGTCAA
Length of 3' end seq. 538
Connected seq. ID CHS106P
Connected seq.
>CHS106P.Seq
ATGGTAATAGACAATGTGTTGAAGATCAAATTACATTGCCACCATTTGATAAATGTGATA
ATGTCCATTGTCCAAAAGGATTTAATTGCAAATATGATTGGGAAAAAGATCTTGCTCTTT
GTGTTCCATGGAGACCATATCCACCAGTTTGTAGAACTAGATGTCCAGAAGGTCATGAAT
GTAAAGTTGATGAATGGGGTAAAGAATGTTGCGTAAAGATCAAATGTGATGATATTTGTG
ACTTGCGCTGTCCAAAGGGTCATGAATGCAAGATCAAACATGATGGTAGTAAATGCTGTG
TCCGTTCATGGAGACCAAGACCACATAAACCACATCCACGTCCACCAATCTGCAGATTAA
GATGTCCACCAGGTCATGAATGCAAACATGATGAACATGGTAAAGAATGTTGCGTTAAAA
AACGTCATCATGATAGATGTGACCTCAAATGTAAGAGAGGTTATGAATGTAAAATCA---
-------GCTTCATCAGCACCAGCCGCCCCAGTTGCACCAGCTGTTTCATCCACTCCAGT
TGAATCAAAGAAAGGTCCAGGTTTAGGTGCAGTTTTCGGTGAACTTAGCAAAGGTGATGG
TGTTACCAGTGGTTTAAAAAAAGTTACCAGCGATATNNAATCCAAAAATTTCACCGNCAA
ATCATCAGTTGTTAAAGCTGCTGATACTAAAGTCGCCANAGTTGATGCTCCATCTAGACC
AGCCGTTTTTGCTNTCCAAGGTAACAAATGGTCCATTGAATATCAAGTTANCAACAAAGA
AATTGTCATTGCCGAGCCAGATAGTCGTCAAACTGTTTACATTTTCCAATGTGTAAACTC
TTTAGTTCAAATCAAAGGTAAAGTTAATGCAATTACTCTTGATGGTTGTAAAAAGACTTC
AATCGTTTTCGAAAATGCCATTTCCTCTTGTGAAGTTGTCAATTGTAATGGTGTTGAAAT
CCAAGTCACTGGTCGTGTACCATCAATTGCTATCGATAAGACAAGTGGTTGTCAA
Length of connected seq. 1005
Full length Seq ID -
Full length Seq. -
Length of full length seq. -