SHE755
Library SH
(Link to library)
Clone ID SHE755
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U12126-1|Contig-U16576-1
Original site URL
Representative seq. ID SHE755P
(Link to Original site)
Representative DNA sequence
>SHE755 (SHE755Q) /CSM/SH/SHE7-C/SHE755Q.Seq.d/
CTGTTGGCCTACTGGGNAATTGGCTGATAAATAGNAAAAGNGNAAACAAGNATAAAGAAT
TGGCTGATAAACTTGCAAAAGAAAAAGAAGAAAAAGAAAGAAAAGAGAAAGAATTGGCTG
ATAAATTAGAAAAAGAGAAAAAAGATAAAGNAATTGGCTGATAAAGTTACAAAAGAAAAA
GAAGAAAAAGATAAAAAAGAAAAAGAATTTAAATTAAAATTAGAAAAAGAACAAAAAGAA
AAAGAATTAAAATTAAAACAAGAAAGAGAATTTGCAGAAAAAGAAGAAAGAGATCGTTTA
GAAAGAGAGAAAATTTCAAAATCTATTGAAAAAGAAACAAAATCATCCACAATAACTGAT
CAATTTAAATTATCAATTGAAAAACAACTTCAAAGTCAATTAGAAAATAAAAAGAAACCA
GTACAAGTTACTGAAGATAATAGTAGTAGTGATGGATCAACATCAACTTTAACATCATTA
ACTAAAGATAGAGTTAAAATGAAAGGTAGACAAGCACCAAGTAAAAGTCATAAAGCACCA
GGTTCAACTXXXXXXXXXXATTAATTGGTGNAAATGGTAAATATGCAAACTTTGATTATG
TTTATCAAAATGTACCAACTGATTGGGAAAATCAAATTAAATTATTTGCAATCGTAAATA
CTGGTACCATTATTAGAGCTGATGAAATCTATCGTTTCAGTCAATATGATTTAACTCCAA
GTAAAGTCTATCTATTGGATAATAGAAAGAATGTTTTCGTTTGGTCAGGTTTAAGAGCAC
AAGAAAAAGAAAAAAAGAGAGGTATGGAAATTGCAATTGATTATGTAAAATATTTAGCTG
ATTCTAGAACTGAAAATGATGTTTTATTCATTACTCAAGGTGATGAACCACTTTCTTTCA
CTTGTTATTTCCATTGTTGGGATTCTTTAAGACTTAACAACAGCAACAACAACAACAACA
ACAGCAATGGTTCTTCAAATAATACTGCTGATGGTGCTGATGAACTTGATGATGGTGCAT
CACCAAAGAATGCTGTTAATCTACTAAAGAAATATTATCAAGTTTTACCATTTGAAAAAT
TAATTGAAAAGAATACACCTCCTGAAATTGATCGTTCAGTTTTAGAAATGTATCTTAGTG
ATGAAGATTTGAAAAACATCTTGGTAGTAGAGCTGAATGGGATGCTTTACCAGCTGGAAA
AAAACTGAAAAGAAAAAAGTTT
sequence update 2002.10.25
Translated Amino Acid sequence
llaywxig**ixkxxtxiknwlinlqkkkkkkkekrknwlin*KKRKKIKXLADKVTKEK
EEKDKKEKEFKLKLEKEQKEKELKLKQEREFAEKEERDRLEREKISKSIEKETKSSTITD
QFKLSIEKQLQSQLENKKKPVQVTEDNSSSDGSTSTLTSLTKDRVKMKGRQAPSKSHKAP
GST---

---LIGXNGKYANFDYVYQNVPTDWENQIKLFAIVNTGTIIRADEIYRFSQYDLTPSKVY
LLDNRKNVFVWSGLRAQEKEKKRGMEIAIDYVKYLADSRTENDVLFITQGDEPLSFTCYF
HCWDSLRLNNSNNNNNNSNGSSNNTADGADELDDGASPKNAVNLLKKYYQVLPFEKLIEK
NTPPEIDRSVLEMYLSDEDLKNILVVELNGMLYQLEKN*keks


Translated Amino Acid sequence (All Frames)
Frame A:
llaywxig**ixkxxtxiknwlinlqkkkkkkkekrknwlin*KKRKKIKXLADKVTKEK
EEKDKKEKEFKLKLEKEQKEKELKLKQEREFAEKEERDRLEREKISKSIEKETKSSTITD
QFKLSIEKQLQSQLENKKKPVQVTEDNSSSDGSTSTLTSLTKDRVKMKGRQAPSKSHKAP
GST---

---inwxkw*ickl*lclskctn*lgksn*iicnrkywyhy*s**nlsfqsi*fnsk*sl
sig**kecfrlvrfkstrkrkkerygncn*lckifs*f*n*k*cfihysr**ttffhllf
pllgffkt*qqqqqqqqqqwffk*yc*wc**t**wcitkecc*stkeilssfti*kin*k
eyts*n*sfsfrnvs***rfekhlgsraewdalpagkklkrkkf

Frame B:
cwptgxladk*xkxkqx*rig**tckrkrrkrkkrerig**irkrekr*xnwliklqkkk
kkkikkkknln*n*kknkkkkn*n*nkkenlqkkkkeiv*kerkfqnllkkkqnhpq*li
nlnyqlknnfkvn*kikrnqykllkiivvvmdqhql*hh*lkielk*kvdkhqvkvikhq
vq---

---LIGXNGKYANFDYVYQNVPTDWENQIKLFAIVNTGTIIRADEIYRFSQYDLTPSKVY
LLDNRKNVFVWSGLRAQEKEKKRGMEIAIDYVKYLADSRTENDVLFITQGDEPLSFTCYF
HCWDSLRLNNSNNNNNNSNGSSNNTADGADELDDGASPKNAVNLLKKYYQVLPFEKLIEK
NTPPEIDRSVLEMYLSDEDLKNILVVELNGMLYQLEKN*keks

Frame C:
vgllgnwlinxkxxnkxkeladklakekeekerkekeladklekekkdkxig**sykrkr
rkr*krkri*ikirkrtkrkrikiktrkricrkrrkrsfrkrenfkiy*krnkiihnn*s
i*iin*kttsksirk*ketstsy*r*****wininfniin*r*s*ner*tstk*ks*str
fn---

---*lvxmvnmqtlimfikmyqligkiklnylqs*ilvpllelmksivsvnmi*lqvksi
ywiiermfsfgqv*ehkkkkkrevwklqlim*ni*lilelkmmfysllkvmnhflslvis
ivgil*dlttattttttamvlqiillmvlmnlmmvhhqrmlliy*rniikfyhlkn*lkr
ihllklivqf*kcilvmki*ktsw**s*mgcftswkktekkkv

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

SHE755 (SHE755Q) /CSM/SH/SHE7-C/SHE755Q.Seq.d/ 866 0.0
VHE887 (VHE887Q) /CSM/VH/VHE8-D/VHE887Q.Seq.d/ 468 e-130
SLF402 (SLF402Q) /CSM/SL/SLF4-A/SLF402Q.Seq.d/ 468 e-130
SFC352 (SFC352Q) /CSM/SF/SFC3-C/SFC352Q.Seq.d/ 468 e-130
CHG656 (CHG656Q) /CSM/CH/CHG6-C/CHG656Q.Seq.d/ 468 e-130
CFF854 (CFF854Q) /CSM/CF/CFF8-C/CFF854Q.Seq.d/ 468 e-130
AHQ312 (AHQ312Q) /CSM/AH/AHQ3-A/AHQ312Q.Seq.d/ 468 e-130
AHD852 (AHD852Q) /CSM/AH/AHD8-C/AHD852Q.Seq.d/ 468 e-130
AHA304 (AHA304Q) /CSM/AH/AHA3-A/AHA304Q.Seq.d/ 468 e-130
SLK491 (SLK491Q) /CSM/SL/SLK4-D/SLK491Q.Seq.d/ 462 e-129

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

CP000013|CP000013.1 Borrelia garinii PBi, complete genome. 42 0.088 13
CO120716|CO120716.1 GR__Eb024G21.f GR__Eb Gossypium raimondii cDNA clone GR__Eb024G21 5', mRNA sequence. 50 0.12 1
BX957322|BX957322.7 Zebrafish DNA sequence from clone CH211-179J10 in linkage group 10. 46 0.37 3
BF295885|BF295885.1 029PbE11 Pb cDNA #17, Tommaso Pace, Marta Ponzi, and Clara Frontali Plasmodium berghei cDNA 5', mRNA sequence. 46 0.40 2
BF294538|BF294538.1 006PbB09 Pb cDNA #17, Tommaso Pace, Marta Ponzi, and Clara Frontali Plasmodium berghei cDNA 5', mRNA sequence. 46 0.44 2
AL663105|AL663105.4 Human DNA sequence from clone RP4-743D20 on chromosome 1 Contains a novel gene and a CpG island. 32 0.96 2
AL357112|AL357112.9 Human DNA sequence *** SEQUENCING CANCELLED *** from clone RP4-816I15. 32 0.98 2
CG734269|CG734269.1 RP11-210E16, SP6 RPCI-11 Human Male BAC Library Homo sapiens genomic clone RP11-210E16, genomic survey sequence. 32 1.3 2
BY485289|BY485289.1 Mus musculus bone marrow macrophage cDNA, RIKEN full-length enriched library, clone:G530123E19, 3' end partial sequence. 46 1.9 1
AC138294|AC138294.1 Mus musculus chromosome 10 clone RP23-235D19 map 10, LOW-PASS SEQUENCE SAMPLING. 46 1.9 1
dna update 2006. 3. 8
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(P36418) RecName: Full=Protovillin; AltName: Full=100 kDa actin-... 75 3e-12
AJ427856_1(AJ427856|pid:none) Dictyostelium discoideum ORF encod... 75 4e-12
(Q8WQ85) RecName: Full=Villidin; 75 4e-12
BC168981_1(BC168981|pid:none) Rattus norvegicus villin 1, mRNA (... 73 2e-11
BC015267_1(BC015267|pid:none) Mus musculus villin 1, mRNA (cDNA ... 73 2e-11
AK154851_1(AK154851|pid:none) Mus musculus NOD-derived CD11c +ve... 72 4e-11
AB019233_13(AB019233|pid:none) Arabidopsis thaliana genomic DNA,... 72 5e-11
AF099929_1(AF099929|pid:none) Rattus norvegicus pervin mRNA, com... 69 4e-10
BC111730_1(BC111730|pid:none) Homo sapiens advillin, mRNA (cDNA ... 68 7e-10
AK314362_1(AK314362|pid:none) Homo sapiens cDNA, FLJ95130, Homo ... 68 7e-10
protein update 2009. 4.17
PSORT

psg: 0.27 gvh: -0.01 alm: 0.47 top: 0.53 tms: 0.00 mit: 0.22 mip: 0.02
nuc: 0.11 erl: 0.00 erm: 0.40 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

60.0 %: nuclear
28.0 %: cytoplasmic
8.0 %: mitochondrial
4.0 %: cytoskeletal

>> prediction for SHE755 is nuc

5' end seq. ID SHE755F
5' end seq.
>SHE755F.Seq
CTGTTGGCCTACTGGGNAATTGGCTGATAAATAGNAAAAGNGNAAACAAGNATAAAGAAT
TGGCTGATAAACTTGCAAAAGAAAAAGAAGAAAAAGAAAGAAAAGAGAAAGAATTGGCTG
ATAAATTAGAAAAAGAGAAAAAAGATAAAGNAATTGGCTGATAAAGTTACAAAAGAAAAA
GAAGAAAAAGATAAAAAAGAAAAAGAATTTAAATTAAAATTAGAAAAAGAACAAAAAGAA
AAAGAATTAAAATTAAAACAAGAAAGAGAATTTGCAGAAAAAGAAGAAAGAGATCGTTTA
GAAAGAGAGAAAATTTCAAAATCTATTGAAAAAGAAACAAAATCATCCACAATAACTGAT
CAATTTAAATTATCAATTGAAAAACAACTTCAAAGTCAATTAGAAAATAAAAAGAAACCA
GTACAAGTTACTGAAGATAATAGTAGTAGTGATGGATCAACATCAACTTTAACATCATTA
ACTAAAGATAGAGTTAAAATGAAAGGTAGACAAGCACCAAGTAAAAGTCATAAAGCACCA
GGTTCAACTNNNNNNNNNN
Length of 5' end seq. 559
3' end seq. ID SHE755Z
3' end seq.
>SHE755Z.Seq
NNNNNNNNNNATTAATTGGTGNAAATGGTAAATATGCAAACTTTGATTATGTTTATCAAA
ATGTACCAACTGATTGGGAAAATCAAATTAAATTATTTGCAATCGTAAATACTGGTACCA
TTATTAGAGCTGATGAAATCTATCGTTTCAGTCAATATGATTTAACTCCAAGTAAAGTCT
ATCTATTGGATAATAGAAAGAATGTTTTCGTTTGGTCAGGTTTAAGAGCACAAGAAAAAG
AAAAAAAGAGAGGTATGGAAATTGCAATTGATTATGTAAAATATTTAGCTGATTCTAGAA
CTGAAAATGATGTTTTATTCATTACTCAAGGTGATGAACCACTTTCTTTCACTTGTTATT
TCCATTGTTGGGATTCTTTAAGACTTAACAACAGCAACAACAACAACAACAACAGCAATG
GTTCTTCAAATAATACTGCTGATGGTGCTGATGAACTTGATGATGGTGCATCACCAAAGA
ATGCTGTTAATCTACTAAAGAAATATTATCAAGTTTTACCATTTGAAAAATTAATTGAAA
AGAATACACCTCCTGAAATTGATCGTTCAGTTTTAGAAATGTATCTTAGTGATGAAGATT
TGAAAAACATCTTGGTAGTAGAGCTGAATGGGATGCTTTACCAGCTGGAAAAAAACTGAA
AAGAAAAAAGTTT
Length of 3' end seq. 673
Connected seq. ID SHE755P
Connected seq.
>SHE755P.Seq
CTGTTGGCCTACTGGGNAATTGGCTGATAAATAGNAAAAGNGNAAACAAGNATAAAGAAT
TGGCTGATAAACTTGCAAAAGAAAAAGAAGAAAAAGAAAGAAAAGAGAAAGAATTGGCTG
ATAAATTAGAAAAAGAGAAAAAAGATAAAGNAATTGGCTGATAAAGTTACAAAAGAAAAA
GAAGAAAAAGATAAAAAAGAAAAAGAATTTAAATTAAAATTAGAAAAAGAACAAAAAGAA
AAAGAATTAAAATTAAAACAAGAAAGAGAATTTGCAGAAAAAGAAGAAAGAGATCGTTTA
GAAAGAGAGAAAATTTCAAAATCTATTGAAAAAGAAACAAAATCATCCACAATAACTGAT
CAATTTAAATTATCAATTGAAAAACAACTTCAAAGTCAATTAGAAAATAAAAAGAAACCA
GTACAAGTTACTGAAGATAATAGTAGTAGTGATGGATCAACATCAACTTTAACATCATTA
ACTAAAGATAGAGTTAAAATGAAAGGTAGACAAGCACCAAGTAAAAGTCATAAAGCACCA
GGTTCAACT----------ATTAATTGGTGNAAATGGTAAATATGCAAACTTTGATTATG
TTTATCAAAATGTACCAACTGATTGGGAAAATCAAATTAAATTATTTGCAATCGTAAATA
CTGGTACCATTATTAGAGCTGATGAAATCTATCGTTTCAGTCAATATGATTTAACTCCAA
GTAAAGTCTATCTATTGGATAATAGAAAGAATGTTTTCGTTTGGTCAGGTTTAAGAGCAC
AAGAAAAAGAAAAAAAGAGAGGTATGGAAATTGCAATTGATTATGTAAAATATTTAGCTG
ATTCTAGAACTGAAAATGATGTTTTATTCATTACTCAAGGTGATGAACCACTTTCTTTCA
CTTGTTATTTCCATTGTTGGGATTCTTTAAGACTTAACAACAGCAACAACAACAACAACA
ACAGCAATGGTTCTTCAAATAATACTGCTGATGGTGCTGATGAACTTGATGATGGTGCAT
CACCAAAGAATGCTGTTAATCTACTAAAGAAATATTATCAAGTTTTACCATTTGAAAAAT
TAATTGAAAAGAATACACCTCCTGAAATTGATCGTTCAGTTTTAGAAATGTATCTTAGTG
ATGAAGATTTGAAAAACATCTTGGTAGTAGAGCTGAATGGGATGCTTTACCAGCTGGAAA
AAAACTGAAAAGAAAAAAGTTT
Length of connected seq. 1212
Full length Seq ID -
Full length Seq. -
Length of full length seq. -