CHE842
Library CH
Clone ID CHE842
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16419-1
Original site URL
Representative seq. ID CHE842P
Representative DNA sequence
>CHE842 (CHE842Q) /CSM/CH/CHE8-B/CHE842Q.Seq.d/
ATTATTCAATTTTATTTTTAATCAATTCAAATTAAAAAAAAAACAATAACTAACTAAGTT
AAAAATAAAAAAATAATAAAGGATGTTGTTTCTAAAAAATATTGGAGTATTTTTTATGAT
ATTCCTTGTATCAAAATCCTATGCGACGGATTGTAATAAAATTACTAATGAAGAANAATG
TCATAAATCTTCGGAATGTATTGTTATAAATTATACACCATGTTGTGGTGAACAAAAATG
GGCTTGCTCTAAAGGTACATTTGATACTTGTACATATGAAAATAGTTGTTATANAAATTC
ATCCAATAACCAAGTTGTTGAAGTATCAAATAAATGCTTCAATCTTGATGGATTTATTAA
AATTACAACTCCAACCGAGTATTCTTGTTCTGATGCCAANATTAAAGAATGTGCCTTATT
AGGTAAATCATGTAGTTTCCAAAANAACTCTTGTTCAAATCCAACTTCCTGTTGCCCAGG
TGAATCAATTTGTGAAGGTTTAAGCTCTGGTAGTTCAACATCTGGTGGTGGTTCATCAGG
TGGTACATCAGGTGGTAGTTCATCAGGTGGTACATCAGGGGGGTAGTTCATCAGGTGGTA
CATCAGGTAGTTCATCAAGGNGGTAGTTCATCAGGTGGXXXXXXXXXXTTTAAGATGTCC
ACCAAATTCATGAATGTAGATTTAATGACCAAGGTTCATCAATGCTGTGTAAAGGTTCCC
CCATGATAGATGTTCATTAAGATGTCCACATGNCCCATGAATGTAAAGTTGATCNCCAAT
GGTAAAGAATGTTGCGTCAGATCCCATAGACCACCACCACCAGAAGTTTGTTCATTAAGA
TGCCCACCAAAACATGAATGTAAATTTGATGATCATGGTAAAAAATGTTGTGTAAAGATT
CATTGTGATGAAGTTTGTGATTTAGATTGTGGTAGAGGTTTTGAATGTAAAATTAGACAT
GATGGTTCAAAATGTTGTGTCCGTTCAGAAAGACCACACCCACCACAACATGAAAAATGT
AATAAGAGATGCCCACCAGGCCATGAATGTAAAGTTGATCAACATGGAAAAGAATGTTGC
GTTGTTGCCCATAGACCACCACCAAAATGCTCTTTAAGATGTCCACCAAGACATGAATGT
AGAGTCAACCACTTTGGTGAAGAATGTTGTGTTAAAGTTCACCACGATAAATGTTCATTG
AGATGTCCACCAGGCCATGAATGTAAAGTTGATCAACATGGAAAAGAATGTTGCGTTGTT
GCCCATAGACCACCACCAAAATGCTCATTAAGATGTCCACCAAAACATGAATGTAGAATC
AATCACTTTGGTGAAGAATGCTGTGTTAAAAGTAGAAATGATTGTTTAACTTGTGAAGAC
CTAAACTGTGAAAGAAAAGGTTTACATTGTGCCATGAAAACTGTACCAATTGATAAAGAA
AATTGTTGGAAAAAGNNACCAAGTACGTTACTC
sequence update 2002.10.25
Translated Amino Acid sequence
iiqfyf*siqikkktitn*vknkkiikdvvskkywsifydipcikilcdgl**ny**rxm
s*ifgmycyklytmlw*tkmgll*ryi*ylyi*k*llxkfiq*psc*sik*mlqs*wiy*
nynsnrvflf*cqx*rmclir*im*fpkxllfksnfllpr*inl*rfklw*fniwwwfir
wyirw*FIRWYIRGVVHQVVHQVVHQGGSSSG---

---FKMSTKFMNVDLMTKVHQCCVKVPP**mfikmstxpmnvklixngkeccvrshrppp
pevcslrcppkheckfddhgkkccvkihcdevcdldcgrgfeckirhdgskccvrserph
ppqhekcnkrcppgheckvdqhgkeccvvahrpppkcslrcpprhecrvnhfgeeccvkv
hhdkcslrcppgheckvdqhgkeccvvahrpppkcslrcppkhecrinhfgeeccvksrn
dcltcedlncerkglhcamktvpidkencwkkxpstll


Translated Amino Acid sequence (All Frames)
Frame A:
iiqfyf*siqikkktitn*vknkkiikdvvskkywsifydipcikilcdgl**ny**rxm
s*ifgmycyklytmlw*tkmgll*ryi*ylyi*k*llxkfiq*psc*sik*mlqs*wiy*
nynsnrvflf*cqx*rmclir*im*fpkxllfksnfllpr*inl*rfklw*fniwwwfir
wyirw*FIRWYIRGVVHQVVHQVVHQGGSSSG---

---FKMSTKFMNVDLMTKVHQCCVKVPP**mfikmstxpmnvklixngkeccvrshrppp
pevcslrcppkheckfddhgkkccvkihcdevcdldcgrgfeckirhdgskccvrserph
ppqhekcnkrcppgheckvdqhgkeccvvahrpppkcslrcpprhecrvnhfgeeccvkv
hhdkcslrcppgheckvdqhgkeccvvahrpppkcslrcppkhecrinhfgeeccvksrn
dcltcedlncerkglhcamktvpidkencwkkxpstll

Frame B:
lfnfifnqfklkkkq*ltklkikk**rmlflknigvffmiflvsksyatdcnkitneexc
hkssecivinytpccgeqkwacskgtfdtctyenscyxnssnnqvvevsnkcfnldgfik
ittpteyscsdaxikecallgkscsfqxnscsnptsccpgesiceglssgsstsgggssg
gtsggsssggtsgg*firwyir*fikxvvhqv---

---lrcppns*m*i**prfinav*rfphdrcslrcphxp*m*s*spmvknvasdpidhhh
qkfvh*dahqnmnvnlmimvknvv*rfivmkfvi*ivvevlnvkldmmvqnvvsvqkdht
hhnmknvirdahqamnvklinmeknvallpidhhqnal*dvhqdmnvesttlvknvvlkf
ttinvh*dvhqamnvklinmeknvallpidhhqnah*dvhqnmnvesitlvknavlkvem
iv*lvkt*tvkekvyivp*klyqlikkivgkxxqvry

Frame C:
ysilflinsn*kknnn*ls*k*knnkgccf*kileyfl*yslyqnpmrrivikllmkxnv
inlrnvll*iihhvvvnknglalkvhlilvhmkivvixihpitkllkyqinasilmdllk
lqlqpsilvlmpxlknvpy*vnhvvskxtlvqiqlpvaqvnqfvkv*alvvqhlvvvhqv
vhqvvvhqvvhqggsssggtsgsssrx*firw---

---*dvhqihecrfndqgssmlckgspmidvh*dvhmxheckvdxqw*rmlrqip*tttt
rslfikmptkt*m*i**sw*kmlckdsl**sl*frlw*rf*m*n*t*wfkmlcpfrkttp
ttt*km**emptrp*m*s*stwkrmlrccp*tttkmlfkmstkt*m*sqplw*rmlc*ss
pr*mfiemstrp*m*s*stwkrmlrccp*tttkmlikmstkt*m*nqslw*rmlc*k*k*
lfnl*rpkl*kkrftlchenctn**rkllekxtkyvt

Homology vs CSM-cDNA

Sequences producing significant alignments: Score (bits), E value

CHE842 (CHE842Q) /CSM/CH/CHE8-B/CHE842Q.Seq.d/ 2627 0.0
CHG728 (CHG728Q) /CSM/CH/CHG7-B/CHG728Q.Seq.d/ 2494 0.0
AHF431 (AHF431Q) /CSM/AH/AHF4-B/AHF431Q.Seq.d/ 1542 0.0
AHD802 (AHD802Q) /CSM/AH/AHD8-A/AHD802Q.Seq.d/ 1536 0.0
CHM493 (CHM493Q) /CSM/CH/CHM4-D/CHM493Q.Seq.d/ 1534 0.0
CHR563 (CHR563Q) /CSM/CH/CHR5-C/CHR563Q.Seq.d/ 1526 0.0
CHI363 (CHI363Q) /CSM/CH/CHI3-C/CHI363Q.Seq.d/ 1524 0.0
CHN267 (CHN267Q) /CSM/CH/CHN2-C/CHN267Q.Seq.d/ 1522 0.0
CHI435 (CHI435Q) /CSM/CH/CHI4-B/CHI435Q.Seq.d/ 1520 0.0
AHO207 (AHO207Q) /CSM/AH/AHO2-A/AHO207Q.Seq.d/ 1520 0.0

own update 2004.12.25
Homology vs DNA

Sequences producing significant alignments: Score (bits), E value, N

U25144|U25144.1 Dictyostelium discoideum spore coat protein SP87 (PspD) gene, complete cds. 1332 0.0 14
AC117267|AC117267.2 Dictyostelium discoideum chromosome 2 map 5836255-5862024 strain AX4, complete sequence. 1332 0.0 15
M26239|M26239.1 D.discoideum spore coat protein SP60 gene, complete cds. 66 1e-14 4
X51892|X51892.1 Dictyostelium discoideum SP60 gene for spore coat protein. 66 3e-14 4
AC124167|AC124167.2 Tetraodon nigroviridis clone GSTNB-42J2, WORKING DRAFT SEQUENCE, 14 unordered pieces. 38 0.001 6
AC150239|AC150239.2 Sorex araneus clone SA_Ba-527L4, WORKING DRAFT SEQUENCE, 7 ordered pieces. 36 0.027 8
X52105|X52105.1 Dictyostelium discoideum SP60 gene for spore coat protein. 48 0.046 3
AC141955|AC141955.2 Rattus norvegicus clone CH230-418P10, WORKING DRAFT SEQUENCE, 29 unordered pieces. 48 0.17 4
AC146784|AC146784.19 Medicago truncatula clone mth2-172c4, WORKING DRAFT SEQUENCE, 2 ordered pieces. 34 0.27 7
AC127160|AC127160.3 Rattus norvegicus clone CH230-460D14, *** SEQUENCING IN PROGRESS ***, 1 ordered piece. 48 0.41 1
dna update 2004.09.26
Homology vs Protein

Sequences producing significant alignments: Score (bits), E value

AC117267_7(AC117267|pid:none) Dictyostelium discoideum chromosom... 542 e-152
(P15270) RecName: Full=Spore coat protein SP60; Flags: Precursor... 187 6e-46
X52105_1(X52105|pid:none) Dictyostelium discoideum SP60 gene for... 110 2e-22
(Q6TU45) RecName: Full=Probable spore coat protein sigD; AltName... 94 2e-17
(Q54QX2) RecName: Full=Probable spore coat protein DDB_G0283555;... 87 1e-15
(Q04503) RecName: Full=Prespore protein Dp87; Flags: Precursor; 70 1e-10
S07638(S07638;A60942;B60942) spore coat protein SP96 precursor -... 70 2e-10
(P14328) RecName: Full=Spore coat protein SP96; &AC117075_49(AC... 70 2e-10
D13973_1(D13973|pid:none) Dictyostelium discoideum gene for Dp87... 67 2e-09
(P54704) RecName: Full=Spore coat protein SP85; AltName: Full=Ce... 63 2e-08
protein update 2009.03.31
PSORT

psg: 0.80 gvh: 0.39 alm: 0.51 top: 0.53 tms: 0.00 mit: 0.41 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.40 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

36.0 %: mitochondrial
32.0 %: nuclear
24.0 %: cytoplasmic
8.0 %: vacuolar

>> prediction for CHE842 is mit

5' end seq. ID CHE842F
5' end seq.
>CHE842F.Seq
ATTATTCAATTTTATTTTTAATCAATTCAAATTAAAAAAAAAACAATAACTAACTAAGTT
AAAAATAAAAAAATAATAAAGGATGTTGTTTCTAAAAAATATTGGAGTATTTTTTATGAT
ATTCCTTGTATCAAAATCCTATGCGACGGATTGTAATAAAATTACTAATGAAGAANAATG
TCATAAATCTTCGGAATGTATTGTTATAAATTATACACCATGTTGTGGTGAACAAAAATG
GGCTTGCTCTAAAGGTACATTTGATACTTGTACATATGAAAATAGTTGTTATANAAATTC
ATCCAATAACCAAGTTGTTGAAGTATCAAATAAATGCTTCAATCTTGATGGATTTATTAA
AATTACAACTCCAACCGAGTATTCTTGTTCTGATGCCAANATTAAAGAATGTGCCTTATT
AGGTAAATCATGTAGTTTCCAAAANAACTCTTGTTCAAATCCAACTTCCTGTTGCCCAGG
TGAATCAATTTGTGAAGGTTTAAGCTCTGGTAGTTCAACATCTGGTGGTGGTTCATCAGG
TGGTACATCAGGTGGTAGTTCATCAGGTGGTACATCAGGGGGGTAGTTCATCAGGTGGTA
CATCAGGTAGTTCATCAAGGNGGTAGTTCATCAGGTGGNNNNNNNNNN
Length of 5' end seq. 648
3' end seq. ID CHE842Z
3' end seq.
>CHE842Z.Seq
NNNNNNNNNNTTTAAGATGTCCACCAAATTCATGAATGTAGATTTAATGACCAAGGTTCA
TCAATGCTGTGTAAAGGTTCCCCCATGATAGATGTTCATTAAGATGTCCACATGNCCCAT
GAATGTAAAGTTGATCNCCAATGGTAAAGAATGTTGCGTCAGATCCCATAGACCACCACC
ACCAGAAGTTTGTTCATTAAGATGCCCACCAAAACATGAATGTAAATTTGATGATCATGG
TAAAAAATGTTGTGTAAAGATTCATTGTGATGAAGTTTGTGATTTAGATTGTGGTAGAGG
TTTTGAATGTAAAATTAGACATGATGGTTCAAAATGTTGTGTCCGTTCAGAAAGACCACA
CCCACCACAACATGAAAAATGTAATAAGAGATGCCCACCAGGCCATGAATGTAAAGTTGA
TCAACATGGAAAAGAATGTTGCGTTGTTGCCCATAGACCACCACCAAAATGCTCTTTAAG
ATGTCCACCAAGACATGAATGTAGAGTCAACCACTTTGGTGAAGAATGTTGTGTTAAAGT
TCACCACGATAAATGTTCATTGAGATGTCCACCAGGCCATGAATGTAAAGTTGATCAACA
TGGAAAAGAATGTTGCGTTGTTGCCCATAGACCACCACCAAAATGCTCATTAAGATGTCC
ACCAAAACATGAATGTAGAATCAATCACTTTGGTGAAGAATGCTGTGTTAAAAGTAGAAA
TGATTGTTTAACTTGTGAAGACCTAAACTGTGAAAGAAAAGGTTTACATTGTGCCATGAA
AACTGTACCAATTGATAAAGAAAATTGTTGGAAAAAGNNACCAAGTACGTTACTC
Length of 3' end seq. 835
Connected seq. ID CHE842P
Connected seq.
>CHE842P.Seq
ATTATTCAATTTTATTTTTAATCAATTCAAATTAAAAAAAAAACAATAACTAACTAAGTT
AAAAATAAAAAAATAATAAAGGATGTTGTTTCTAAAAAATATTGGAGTATTTTTTATGAT
ATTCCTTGTATCAAAATCCTATGCGACGGATTGTAATAAAATTACTAATGAAGAANAATG
TCATAAATCTTCGGAATGTATTGTTATAAATTATACACCATGTTGTGGTGAACAAAAATG
GGCTTGCTCTAAAGGTACATTTGATACTTGTACATATGAAAATAGTTGTTATANAAATTC
ATCCAATAACCAAGTTGTTGAAGTATCAAATAAATGCTTCAATCTTGATGGATTTATTAA
AATTACAACTCCAACCGAGTATTCTTGTTCTGATGCCAANATTAAAGAATGTGCCTTATT
AGGTAAATCATGTAGTTTCCAAAANAACTCTTGTTCAAATCCAACTTCCTGTTGCCCAGG
TGAATCAATTTGTGAAGGTTTAAGCTCTGGTAGTTCAACATCTGGTGGTGGTTCATCAGG
TGGTACATCAGGTGGTAGTTCATCAGGTGGTACATCAGGGGGGTAGTTCATCAGGTGGTA
CATCAGGTAGTTCATCAAGGNGGTAGTTCATCAGGTGG----------TTTAAGATGTCC
ACCAAATTCATGAATGTAGATTTAATGACCAAGGTTCATCAATGCTGTGTAAAGGTTCCC
CCATGATAGATGTTCATTAAGATGTCCACATGNCCCATGAATGTAAAGTTGATCNCCAAT
GGTAAAGAATGTTGCGTCAGATCCCATAGACCACCACCACCAGAAGTTTGTTCATTAAGA
TGCCCACCAAAACATGAATGTAAATTTGATGATCATGGTAAAAAATGTTGTGTAAAGATT
CATTGTGATGAAGTTTGTGATTTAGATTGTGGTAGAGGTTTTGAATGTAAAATTAGACAT
GATGGTTCAAAATGTTGTGTCCGTTCAGAAAGACCACACCCACCACAACATGAAAAATGT
AATAAGAGATGCCCACCAGGCCATGAATGTAAAGTTGATCAACATGGAAAAGAATGTTGC
GTTGTTGCCCATAGACCACCACCAAAATGCTCTTTAAGATGTCCACCAAGACATGAATGT
AGAGTCAACCACTTTGGTGAAGAATGTTGTGTTAAAGTTCACCACGATAAATGTTCATTG
AGATGTCCACCAGGCCATGAATGTAAAGTTGATCAACATGGAAAAGAATGTTGCGTTGTT
GCCCATAGACCACCACCAAAATGCTCATTAAGATGTCCACCAAAACATGAATGTAGAATC
AATCACTTTGGTGAAGAATGCTGTGTTAAAAGTAGAAATGATTGTTTAACTTGTGAAGAC
CTAAACTGTGAAAGAAAAGGTTTACATTGTGCCATGAAAACTGTACCAATTGATAAAGAA
AATTGTTGGAAAAAGNNACCAAGTACGTTACTC
Length of connected seq. 1463
Full length Seq ID -
Full length Seq. -
Length of full length seq. -
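
Note on the reported lengths: the three values agree if the connected sequence is counted without the masked junction. The 5' end read (648) ends in a run of 10 N, the 3' end read (835) begins with 10 N, and (648 - 10) + (835 - 10) = 1463. The Python sketch below checks that arithmetic. The helper name connected_length is hypothetical, and the rule that the junction gap is not counted is an assumption inferred from the numbers in this record, not documented behaviour of the database.

```python
def connected_length(five_prime: str, three_prime: str) -> int:
    """Count bases in a joined read pair, excluding the masked junction.

    Assumption (inferred from this record, not documented): the run of N
    at the end of the 5' read and at the start of the 3' read marks the
    unsequenced junction and is not counted. Internal N calls are kept.
    """
    left = five_prime.upper().rstrip("N")    # drop trailing mask on the 5' read
    right = three_prime.upper().lstrip("N")  # drop leading mask on the 3' read
    return len(left) + len(right)


# Arithmetic with the lengths reported in this record.
FIVE_PRIME_LEN = 648   # includes 10 trailing N
THREE_PRIME_LEN = 835  # includes 10 leading N
MASK_LEN = 10
reported_total = (FIVE_PRIME_LEN - MASK_LEN) + (THREE_PRIME_LEN - MASK_LEN)
```

To check the full record, pass the CHE842F and CHE842Z sequences (with line breaks removed) to connected_length and compare the result with the reported connected length.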