SHF231
Library SH
Clone ID SHF231
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig -
Representative seq. ID SHF231P
Representative DNA sequence
>SHF231 (SHF231Q) /CSM/SH/SHF2-B/SHF231Q.Seq.d/
GCAACCAAGTGAAATGGGTATGGCAAATATTACAGTTTTCTTAAGAAGTGGTTCTAACCT
ATCACAAACCATTGCAACAACAACAACAGATGCCAATGGAACATACATCTTTACCCATTT
AGCTCCAGGAAATTATTGCGTTTCACTCACTGTACCAAAAGAATTTTATCCAACTCTCTT
AACTCCAACAACATTTAATAGAGGTGATTCTAATGCTATTGCTTGTACCAATGTCTCTGT
ATTCAGTCCAACCGTTCAAAGATTCTCAATTCCAAGTCATACTGAAGATACCTATCAAGA
TGCCACTGTTAATTTTGGTTTAGCCCCATACAAATATGCAGTTGGAACATATATTTGGGT
AGATAAGAATGGTAATGGTGGAGCTGAATCATATGAACCTGCAGTTGAAGGTATTACTGT
TAGAATTTATGATAGTAACTTTAACTTTATAACATCTACCGTAACAAATCCAAGTGGTAT
CTACATTTTCGATAATTTATACCCAGGAGTTTACAATTTAGCAATTACTCCACCAGTTGG
TTTTACAATCTCCAATAATACTCTTCAAGXXXXXXXXXXGCACCAGATATTTTACGTATT
TGTTTGGTCAATGGAGTTTATGTCCCAGAAAGAGATGGTAAATGTGGTGGTGCTATTGGT
TCTCATACAGGTCCATCAGGTTATAAAGGTCAAATTGAAGGTCCAGGTACTGGTGAATTC
TATAATGATAACTTTAGAAAAGGTCAAATTGGTCATGATGATACTGGTGGTCTTTCAGTT
TTCCAAGTTCCAGGTTTCCCAGAGGTTGCATCAGCTTCATTCGATGCTACAACTGTATTC
GAAGGTGTAGTCAAATTTTATAACAATAACAATGGTACCCTTCGCTCTTCATTCCAAGTT
TATCTCACTGATAATTCAGATACAGTCAATCCAGTTACCTTTGGCAAAGCATCGGGTCTT
GGTCAACTTGTAGCCCACTGTTCTCCAAAACCAATTACCCTTGGTAGTATTGTCTTTATT
GATACAAATGGTAATGGTATTCAAGAACCATGGGAGCAAGGTAAACAAGGTGTTGCCGTT
AGTTTATTATTTGCCAATGGTACCTTAATTCAAAAACAATCAACCGATTCATTAGGTCTA
TTTAAATTCATTAATCCACCATACCAAAATCAACAATATATTATTACAGCTGATTCAGTT
ACACTTTCAATTATTCCATCACCTCTACCAACTTCATCTATTCCTTATAACGCTGCAACA
ATGGTTAATGGAAAAGCAACAATTTCAAATATTCTTATTTTAGATCCAATGATAGCTTCA
AGTAGATANATTTAA
sequence update 2002.10.25
Translated Amino Acid sequence
QPSEMGMANITVFLRSGSNLSQTIATTTTDANGTYIFTHLAPGNYCVSLTVPKEFYPTLL
TPTTFNRGDSNAIACTNVSVFSPTVQRFSIPSHTEDTYQDATVNFGLAPYKYAVGTYIWV
DKNGNGGAESYEPAVEGITVRIYDSNFNFITSTVTNPSGIYIFDNLYPGVYNLAITPPVG
FTISNNTLQ---

---APDILRICLVNGVYVPERDGKCGGAIGSHTGPSGYKGQIEGPGTGEFYNDNFRKGQI
GHDDTGGLSVFQVPGFPEVASASFDATTVFEGVVKFYNNNNGTLRSSFQVYLTDNSDTVN
PVTFGKASGLGQLVAHCSPKPITLGSIVFIDTNGNGIQEPWEQGKQGVAVSLLFANGTLI
QKQSTDSLGLFKFINPPYQNQQYIITADSVTLSIIPSPLPTSSIPYNAATMVNGKATISN
ILILDPMIASSRXI*


Translated Amino Acid sequence (All Frames)
Frame A:
atk*ngygkyysflkkwf*pitnhcnnnnrcqwnihlypfssrkllrfthctkrilsnsl
nsnni**r*f*cyclyqclciqsnrskilnsksy*rylsrchc*fwfspiqicswniylg
r*ew*wws*ii*tcs*ryyc*nl***l*lyniyrnkskwylhfr*fiprslqfsnystsw
fynlq*yss---

---APDILRICLVNGVYVPERDGKCGGAIGSHTGPSGYKGQIEGPGTGEFYNDNFRKGQI
GHDDTGGLSVFQVPGFPEVASASFDATTVFEGVVKFYNNNNGTLRSSFQVYLTDNSDTVN
PVTFGKASGLGQLVAHCSPKPITLGSIVFIDTNGNGIQEPWEQGKQGVAVSLLFANGTLI
QKQSTDSLGLFKFINPPYQNQQYIITADSVTLSIIPSPLPTSSIPYNAATMVNGKATISN
ILILDPMIASSRXI*

Frame B:
QPSEMGMANITVFLRSGSNLSQTIATTTTDANGTYIFTHLAPGNYCVSLTVPKEFYPTLL
TPTTFNRGDSNAIACTNVSVFSPTVQRFSIPSHTEDTYQDATVNFGLAPYKYAVGTYIWV
DKNGNGGAESYEPAVEGITVRIYDSNFNFITSTVTNPSGIYIFDNLYPGVYNLAITPPVG
FTISNNTLQ---

---hqifyvfvwsmefmsqkemvnvvvllvliqvhqvikvklkvqvlvnsimitlekvkl
vmmilvvfqfskfqvsqrlhqlhsmlqlyskv*snfititmvpfalhskfisliiqiqsi
qlplakhrvlvnl*ptvlqnqlplvvlslliqmvmvfknhgskvnkvlplvyylpmvp*f
knnqpih*vylnslihhtkinnillqliqlhfqlfhhlyqlhlflitlqqwlmekqqfqi
flf*iq**lqvdxf

Frame C:
nqvkwvwqilqfs*evvltyhkplqqqqqmpmehtslpi*lqeiiafhslyqknfiqls*
lqqhlievilmlllvpmslysvqpfkdsqfqvilkipikmpllilv*phtnmqlehifg*
irmvmvelnhmnlqlkvlllefmivtltl*hlp*qiqvvstfsiiytqefti*qllhqlv
lqspiilfk---

---tryftylfgqwslcprkrw*mwwcywfsyrsirl*rsn*rsryw*il***l*krsnw
s**ywwsfsfpssrfprgcisfircyncirrcsqil*q*qwypslfipslsh**frysqs
sylwqsigswstcsplfsktnypw*ycly*ykw*wysrtmgar*trccr*fiicqwylns
ktinrfirsi*ih*stipkstiyyys*fsytfnysitstnfiysl*rcnng*wksnnfky
syfrsndsfk*ixl
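
Frames A, B and C above correspond to reading the representative DNA sequence from nucleotide offsets 0, 1 and 2. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the standard genetic code (NCBI translation table 1) and reports any codon containing a non-ACGT character (such as N or X) as X. The names `translate` and `three_frames` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases cycling in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0):
    """Translate dna starting at the given offset (0, 1 or 2).

    Stop codons become '*'; codons with ambiguous bases become 'X'.
    A trailing partial codon is ignored.
    """
    dna = dna.upper()
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


def three_frames(dna):
    """Return the forward-strand translations keyed by frame label."""
    return {label: translate(dna, off) for label, off in zip("ABC", range(3))}


# First 60 nucleotides of the representative sequence in this record.
first_line = "GCAACCAAGTGAAATGGGTATGGCAAATATTACAGTTTTCTTAAGAAGTGGTTCTAACCT"
for label, peptide in three_frames(first_line).items():
    print(f"Frame {label}: {peptide}")
```

For the first line of the sequence, the three results agree (ignoring letter case) with the beginnings of Frame A, Frame B and Frame C as listed in this record.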

Homology vs CSM-cDNA

                                                 Score  E
Sequences producing significant alignments:     (bits)  Value

SHF231 (SHF231Q) /CSM/SH/SHF2-B/SHF231Q.Seq.d/    2581  0.0
AHO237 (AHO237Q) /CSM/AH/AHO2-B/AHO237Q.Seq.d/    1477  0.0
AHE421 (AHE421Q) /CSM/AH/AHE4-A/AHE421Q.Seq.d/    1477  0.0
AHD671 (AHD671Q) /CSM/AH/AHD6-C/AHD671Q.Seq.d/    1469  0.0
AHJ422 (AHJ422Q) /CSM/AH/AHJ4-A/AHJ422Q.Seq.d/    1457  0.0
AHN869 (AHN869Q) /CSM/AH/AHN8-C/AHN869Q.Seq.d/    1441  0.0
AHF508 (AHF508Q) /CSM/AH/AHF5-A/AHF508Q.Seq.d/    1435  0.0
AHL335 (AHL335Q) /CSM/AH/AHL3-B/AHL335Q.Seq.d/    1429  0.0
AHL820 (AHL820Q) /CSM/AH/AHL8-A/AHL820Q.Seq.d/    1427  0.0
AHB681 (AHB681Q) /CSM/AH/AHB6-D/AHB681Q.Seq.d/    1427  0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

AX346506|AX346506.1 Sequence 1577 from Patent WO0200928. 34 0.45 4
AL157858|AL157858.5 Human chromosome 14 DNA sequence BAC R-187A6 of library RPCI-11 from chromosome 14 of Homo sapiens (Human). 48 0.52 1
BI074124|BI074124.1 kt40c07.y1 Strongyloides ratti L2 pAMP1 v1 Chiapelli McCarter Strongyloides ratti cDNA 5' similar to TR:Q9U1Q4 Q9U1Q4 Y87G2A.5 PROTEIN. [1] ;, mRNA sequence. 36 1.4 2
AC174974|AC174974.2 Bos taurus clone CH240-173C6, WORKING DRAFT SEQUENCE, 24 unordered pieces. 46 2.0 1
AC160363|AC160363.2 Bos taurus clone CH240-88I19, WORKING DRAFT SEQUENCE, 3 unordered pieces. 46 2.0 1
CZ535376|CZ535376.1 SRAA-aac96g10.b1 Strongyloides ratti whole genome shotgun library (SRAAGSS 004) Strongyloides ratti genomic, genomic survey sequence. 36 2.4 2
CZ534086|CZ534086.1 SRAA-aac89b11.b1 Strongyloides ratti whole genome shotgun library (SRAAGSS 004) Strongyloides ratti genomic, genomic survey sequence. 36 2.4 2
X16522|X16522.1 Dictyostelium discoideum AAC-rich mRNA (AAC11). 38 4.2 3
AC094625|AC094625.8 Rattus norvegicus clone CH230-5D5, *** SEQUENCING IN PROGRESS ***, 5 unordered pieces. 42 4.9 3
DU121930|DU121930.1 KBrH104H11F Brassica rapa BAC library KBrH Brassica rapa genomic clone KBrH104H11, genomic survey sequence. 34 7.9 2
dna update 2006. 3.13
Homology vs Protein

                                                                      Score  E
Sequences producing significant alignments:                          (bits)  Value

CP000686_3542(CP000686|pid:none) Roseiflexus sp. RS-1, complete ...      87  1e-15
CP001337_286(CP001337|pid:none) Chloroflexus aggregans DSM 9485,...      80  2e-13
CP000909_404(CP000909|pid:none) Chloroflexus aurantiacus J-10-fl...      77  1e-12
CP001337_3037(CP001337|pid:none) Chloroflexus aggregans DSM 9485...      74  1e-11
AM076227_1(AM076227|pid:none) Staphylococcus aureus partial sdrD...      73  2e-11
(Q8KWM1) RecName: Full=Serine-aspartate repeat-containing protei...      73  3e-11
(Q7A780) RecName: Full=Serine-aspartate repeat-containing protei...      72  4e-11
AM076203_1(AM076203|pid:none) Staphylococcus aureus partial sdrD...      72  4e-11
AM076200_1(AM076200|pid:none) Staphylococcus aureus partial sdrD...      72  4e-11
AM076196_1(AM076196|pid:none) Staphylococcus aureus partial sdrD...      72  4e-11
protein update 2009. 4.18
PSORT

psg: 0.75 gvh: 0.43 alm: 0.42 top: 0.53 tms: 0.00 mit: 0.30 mip: 0.15
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

48.0 %: nuclear
24.0 %: mitochondrial
20.0 %: cytoplasmic
4.0 %: cytoskeletal
4.0 %: endoplasmic reticulum

>> prediction for SHF231 is nuc

5' end seq. ID SHF231F
5' end seq.
>SHF231F.Seq
GCAACCAAGTGAAATGGGTATGGCAAATATTACAGTTTTCTTAAGAAGTGGTTCTAACCT
ATCACAAACCATTGCAACAACAACAACAGATGCCAATGGAACATACATCTTTACCCATTT
AGCTCCAGGAAATTATTGCGTTTCACTCACTGTACCAAAAGAATTTTATCCAACTCTCTT
AACTCCAACAACATTTAATAGAGGTGATTCTAATGCTATTGCTTGTACCAATGTCTCTGT
ATTCAGTCCAACCGTTCAAAGATTCTCAATTCCAAGTCATACTGAAGATACCTATCAAGA
TGCCACTGTTAATTTTGGTTTAGCCCCATACAAATATGCAGTTGGAACATATATTTGGGT
AGATAAGAATGGTAATGGTGGAGCTGAATCATATGAACCTGCAGTTGAAGGTATTACTGT
TAGAATTTATGATAGTAACTTTAACTTTATAACATCTACCGTAACAAATCCAAGTGGTAT
CTACATTTTCGATAATTTATACCCAGGAGTTTACAATTTAGCAATTACTCCACCAGTTGG
TTTTACAATCTCCAATAATACTCTTCAAGNNNNNNNNNN
Length of 5' end seq. 579
3' end seq. ID SHF231Z
3' end seq.
>SHF231Z.Seq
NNNNNNNNNNGCACCAGATATTTTACGTATTTGTTTGGTCAATGGAGTTTATGTCCCAGA
AAGAGATGGTAAATGTGGTGGTGCTATTGGTTCTCATACAGGTCCATCAGGTTATAAAGG
TCAAATTGAAGGTCCAGGTACTGGTGAATTCTATAATGATAACTTTAGAAAAGGTCAAAT
TGGTCATGATGATACTGGTGGTCTTTCAGTTTTCCAAGTTCCAGGTTTCCCAGAGGTTGC
ATCAGCTTCATTCGATGCTACAACTGTATTCGAAGGTGTAGTCAAATTTTATAACAATAA
CAATGGTACCCTTCGCTCTTCATTCCAAGTTTATCTCACTGATAATTCAGATACAGTCAA
TCCAGTTACCTTTGGCAAAGCATCGGGTCTTGGTCAACTTGTAGCCCACTGTTCTCCAAA
ACCAATTACCCTTGGTAGTATTGTCTTTATTGATACAAATGGTAATGGTATTCAAGAACC
ATGGGAGCAAGGTAAACAAGGTGTTGCCGTTAGTTTATTATTTGCCAATGGTACCTTAAT
TCAAAAACAATCAACCGATTCATTAGGTCTATTTAAATTCATTAATCCACCATACCAAAA
TCAACAATATATTATTACAGCTGATTCAGTTACACTTTCAATTATTCCATCACCTCTACC
AACTTCATCTATTCCTTATAACGCTGCAACAATGGTTAATGGAAAAGCAACAATTTCAAA
TATTCTTATTTTAGATCCAATGATAGCTTCAAGTAGATANATTTAA
Length of 3' end seq. 766
Connected seq. ID SHF231P
Connected seq.
>SHF231P.Seq
GCAACCAAGTGAAATGGGTATGGCAAATATTACAGTTTTCTTAAGAAGTGGTTCTAACCT
ATCACAAACCATTGCAACAACAACAACAGATGCCAATGGAACATACATCTTTACCCATTT
AGCTCCAGGAAATTATTGCGTTTCACTCACTGTACCAAAAGAATTTTATCCAACTCTCTT
AACTCCAACAACATTTAATAGAGGTGATTCTAATGCTATTGCTTGTACCAATGTCTCTGT
ATTCAGTCCAACCGTTCAAAGATTCTCAATTCCAAGTCATACTGAAGATACCTATCAAGA
TGCCACTGTTAATTTTGGTTTAGCCCCATACAAATATGCAGTTGGAACATATATTTGGGT
AGATAAGAATGGTAATGGTGGAGCTGAATCATATGAACCTGCAGTTGAAGGTATTACTGT
TAGAATTTATGATAGTAACTTTAACTTTATAACATCTACCGTAACAAATCCAAGTGGTAT
CTACATTTTCGATAATTTATACCCAGGAGTTTACAATTTAGCAATTACTCCACCAGTTGG
TTTTACAATCTCCAATAATACTCTTCAAG----------GCACCAGATATTTTACGTATT
TGTTTGGTCAATGGAGTTTATGTCCCAGAAAGAGATGGTAAATGTGGTGGTGCTATTGGT
TCTCATACAGGTCCATCAGGTTATAAAGGTCAAATTGAAGGTCCAGGTACTGGTGAATTC
TATAATGATAACTTTAGAAAAGGTCAAATTGGTCATGATGATACTGGTGGTCTTTCAGTT
TTCCAAGTTCCAGGTTTCCCAGAGGTTGCATCAGCTTCATTCGATGCTACAACTGTATTC
GAAGGTGTAGTCAAATTTTATAACAATAACAATGGTACCCTTCGCTCTTCATTCCAAGTT
TATCTCACTGATAATTCAGATACAGTCAATCCAGTTACCTTTGGCAAAGCATCGGGTCTT
GGTCAACTTGTAGCCCACTGTTCTCCAAAACCAATTACCCTTGGTAGTATTGTCTTTATT
GATACAAATGGTAATGGTATTCAAGAACCATGGGAGCAAGGTAAACAAGGTGTTGCCGTT
AGTTTATTATTTGCCAATGGTACCTTAATTCAAAAACAATCAACCGATTCATTAGGTCTA
TTTAAATTCATTAATCCACCATACCAAAATCAACAATATATTATTACAGCTGATTCAGTT
ACACTTTCAATTATTCCATCACCTCTACCAACTTCATCTATTCCTTATAACGCTGCAACA
ATGGTTAATGGAAAAGCAACAATTTCAAATATTCTTATTTTAGATCCAATGATAGCTTCA
AGTAGATANATTTAA
Length of connected seq. 1325
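
The lengths in this record are consistent with the connected sequence being the 5' and 3' end reads joined across an unsequenced gap. The end-read lengths 579 and 766 each include a run of 10 N placeholders, and 1325 = (579 - 10) + (766 - 10) counts called bases only. The sketch below illustrates that bookkeeping. It is an inference from the numbers shown here, not a documented procedure of the database, and `connect_reads` and `GAP_LEN` are hypothetical names.

```python
GAP_LEN = 10  # assumption: width of the gap marker seen in this record


def connect_reads(five_prime, three_prime, gap_len=GAP_LEN):
    """Join two end reads across a gap of unknown sequence.

    Trailing Ns of the 5' read and leading Ns of the 3' read are treated
    as gap placeholders and replaced by a single run of '-'.
    Note: this would also strip a genuine ambiguous N at those positions.
    Returns the connected string and the number of bases outside the gap.
    """
    left = five_prime.upper().rstrip("N")
    right = three_prime.upper().lstrip("N")
    connected = left + "-" * gap_len + right
    return connected, len(left) + len(right)


# The end of the 5' read and the start of the 3' read from this record.
five_tail = "TTTTACAATCTCCAATAATACTCTTCAAGNNNNNNNNNN"
three_head = "NNNNNNNNNNGCACCAGATATTTTACGTATT"

connected, base_count = connect_reads(five_tail, three_head)
print(connected)
print(base_count)
```

Applied to these two fragments, the joined string reproduces the line of the connected sequence that contains the `----------` marker.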
Full length Seq ID -
Full length Seq. -
Length of full length seq. -