SHK802
Library SH
(Link to library)
Clone ID SHK802
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U11477-1
Original site URL
Representative seq. ID SHK802P
(Link to Original site)
Representative DNA sequence
>SHK802 (SHK802Q) /CSM/SH/SHK8-A/SHK802Q.Seq.d/
AACAACAACAACAACAACAACAAGGTAGCGAATCACCAAATTCAACAACACCAAAATCAT
CAACACCAACTCAAACTAGACATCATAGTGTATTATCAAGAAATCCATCATATACATCAT
TGAAAAGTTCAGGTGGATTTTCAAAACCATCACCATCATCATCAACATCGACACCACCAT
CAACATTTGCATCACCATTATCAATTGCTCAAACTTCATCAACCACCACTACCACTACTA
CCACCACCACCACTACATCTTCTTCTTCAACACCACCACCATTAAGAGCAAACAATTCAA
CATCATCAACTCCATTAAATTCAAGTACAGGTATTAAACAAAGTGATATTGTAATTGAAG
GTAATTTAACAGAATTAATACCAGGTTTATTATGGAATTCAAATACAGAGAAATGGTATG
TAGTTTCAGAGGGTATGCTTTATTCATACAAGAATAAATCATTGAAACAATTGGATCCAT
TAGAGACTATTCATTTGGAGAAAGCAATCTCTGCACATAAAACCAAAGAAATTTTCACAA
TTCAAGGTTTTAGTTTCCAATTGTCAACACCAAATCGTATCATTCATTTATTAGCAAAAA
CCAAAGAAGAACGTGGATCATGGTTATCAGTATTACGTCAAAATCTAAAXXXXXXXXXXT
CTTAAAAGAAGGTAGTGAATCCAATGAATTTTGGTCAGCTTTTGAAAGTACAGGTGGTAG
ACAAAAATACTTTAATGATATTATGATTCAAAGTAGTTCAATTCCAACCTCATTCACTTA
TAAACCACGTTTCTTTGTTTGTAGTAATGCATCGGGTATCGTTGAAGTTACAGAGGAATC
ACCATTCTCTCAAGATGATCTAGATATTGGATCCGTTTGTATATTGGATGTACAAAGTCA
TATTTACCTTTGGATTGGTAGTCGTGCAACTCATCGTACTAAACGTGCCTCTATGGAAGT
TGTATTAAACTTTATTGAAACTTCAAAATTGGGTCATTCAAAAGAGCACACTAAAGTACT
AATTGCAACTCCATTCGAAGAACCAATTGGTTTCAAAAGTTATTTCCGTGCTTGGTGTAC
CTCTAAATATCCAAAGAATAAATTACCTTTGGTTGAAAAAGATGGTATCCCAGTTGAACA
AGTACTTAAAGATTATCTCAAAGAAATCTATACCTATGAAGAGTTATTGGCCGATCCATT
ACCTGCTGGCGTTGACTCTACTAAATTGGATACTTATCTCAATGATGAAGATTTTGAAAA
AGTTTTCAAAATGACTAGAACTGAATGGTTAAAGATTCCAGCTTGGAAGAGAGAAGGTAT
TAAAAAAGAATTATTCTTATTTTAAATAATAAAAATAATCTCTATCTATAATATACATTC
ATATAAATACCTATATACATATATTTAATACATATAAATTTACTATTTAAAA
sequence update 2002.10.25
Translated Amino Acid sequence
QQQQQQQGSESPNSTTPKSSTPTQTRHHSVLSRNPSYTSLKSSGGFSKPSPSSSTSTPPS
TFASPLSIAQTSSTTTTTTTTTTTTSSSSTPPPLRANNSTSSTPLNSSTGIKQSDIVIEG
NLTELIPGLLWNSNTEKWYVVSEGMLYSYKNKSLKQLDPLETIHLEKAISAHKTKEIFTI
QGFSFQLSTPNRIIHLLAKTKEERGSWLSVLRQNL---

---LKEGSESNEFWSAFESTGGRQKYFNDIMIQSSSIPTSFTYKPRFFVCSNASGIVEVT
EESPFSQDDLDIGSVCILDVQSHIYLWIGSRATHRTKRASMEVVLNFIETSKLGHSKEHT
KVLIATPFEEPIGFKSYFRAWCTSKYPKNKLPLVEKDGIPVEQVLKDYLKEIYTYEELLA
DPLPAGVDSTKLDTYLNDEDFEKVFKMTRTEWLKIPAWKREGIKKELFLF*iikiisiyn
ihsykylytyi*yi*iyylk


Translated Amino Acid sequence (All Frames)
Frame A:
nnnnnnnkvanhqiqqhqnhqhqlkldiivyyqeihhihh*kvqvdfqnhhhhhqhrhhh
qhlhhhyqllklhqpplpllpppplhlllqhhhh*eqtiqhhqlh*iqvqvlnkvil*lk
vi*qn*yqvyygiqiqrngm*fqrvcfihtrinh*nnwih*rlfiwrkqslhikpkkfsq
fkvlvsncqhqivsfiy*qkpkknvdhgyqyyvki*---

---s*kkvvnpmnfgqllkvqvvdkntlmil*fkvvqfqphslinhvslfvvmhrvslkl
qrnhhslkmi*ildpfvywmykviftfglvvvqlivlnvplwkly*tllklqnwviqkst
lky*lqlhsknqlvskvisvlgvplniqrinylwlkkmvsqlnkylkiiskksipmksyw
pihyllaltllnwilismmkilkkfsk*lelng*rfqlgrekvlkknysyfk**k*slsi
iyihintyihifntykfti*

Frame B:
tttttttr*ritkfnntkiintnsn*ts*ciikksiiyiiekfrwifktitiiinidtti
nicitiincsnfinhhyhyyhhhhyifffntttikskqfniinsikfkyry*tk*ycn*r
*fnrintrfimefkyremvcsfrgyalfiqe*iietigsirdysfgesnlct*nqrnfhn
srf*fpivntksyhsfisknqrrtwimvisitsksk---

---lkrr**iq*ilvsf*kyrw*tkil**yydsk*fnsnlihl*ttflcl**cigyr*sy
rgitilsr*srywirlyigctksylpldw*scnssy*tclygscikly*nfkigsfkrah
*stncnsirrtnwfqklfpclvyl*iske*itfg*krwyps*tst*rlsqrnlyl*rvig
rsitcwr*ly*igylsq**rf*ksfqnd*n*mvkdssleerry*kriililnnknnlyl*
ytfi*ipiyiylihinllfk

Frame C:
QQQQQQQGSESPNSTTPKSSTPTQTRHHSVLSRNPSYTSLKSSGGFSKPSPSSSTSTPPS
TFASPLSIAQTSSTTTTTTTTTTTTSSSSTPPPLRANNSTSSTPLNSSTGIKQSDIVIEG
NLTELIPGLLWNSNTEKWYVVSEGMLYSYKNKSLKQLDPLETIHLEKAISAHKTKEIFTI
QGFSFQLSTPNRIIHLLAKTKEERGSWLSVLRQNL---

---LKEGSESNEFWSAFESTGGRQKYFNDIMIQSSSIPTSFTYKPRFFVCSNASGIVEVT
EESPFSQDDLDIGSVCILDVQSHIYLWIGSRATHRTKRASMEVVLNFIETSKLGHSKEHT
KVLIATPFEEPIGFKSYFRAWCTSKYPKNKLPLVEKDGIPVEQVLKDYLKEIYTYEELLA
DPLPAGVDSTKLDTYLNDEDFEKVFKMTRTEWLKIPAWKREGIKKELFLF*iikiisiyn
ihsykylytyi*yi*iyylk

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

SHK802 (SHK802Q) /CSM/SH/SHK8-A/SHK802Q.Seq.d/ 2085 0.0
AFI116 (AFI116Q) /CSM/AF/AFI1-A/AFI116Q.Seq.d/ 1326 0.0
CHI516 (CHI516Q) /CSM/CH/CHI5-A/CHI516Q.Seq.d/ 1320 0.0
AHI567 (AHI567Q) /CSM/AH/AHI5-C/AHI567Q.Seq.d/ 1320 0.0
SHJ849 (SHJ849Q) /CSM/SH/SHJ8-C/SHJ849Q.Seq.d/ 1289 0.0
AHN388 (AHN388Q) /CSM/AH/AHN3-D/AHN388Q.Seq.d/ 1205 0.0
SSB880 (SSB880Q) /CSM/SS/SSB8-D/SSB880Q.Seq.d/ 1114 0.0
AHC823 (AHC823Q) /CSM/AH/AHC8-A/AHC823Q.Seq.d/ 1068 0.0
AHF158 (AHF158Q) /CSM/AH/AHF1-C/AHF158Q.Seq.d/ 1009 0.0
AHD338 (AHD338Q) /CSM/AH/AHD3-B/AHD338Q.Seq.d/ 975 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

AJ427856|AJ427856.1 Dictyostelium discoideum ORF encoding villidin. 1342 0.0 1
U78754|U78754.1 Dictyostelium discoideum villin (vilA) mRNA, partial cds. 1342 0.0 1
AC148477|AC148477.3 Homo sapiens 12 BAC RP13-895J2 (Roswell Park Cancer Institute Human BAC Library) complete sequence. 50 1e-04 2
BZ393782|BZ393782.1 EINAM68TR EI_10_12_KB Entamoeba invadens genomic clone EINAM68, DNA sequence. 48 1e-04 2
AC177620|AC177620.1 Strongylocentrotus purpuratus clone R3-1020P15, WORKING DRAFT SEQUENCE, 19 unordered pieces. 58 6e-04 1
AC176618|AC176618.1 Strongylocentrotus purpuratus clone R3-3079N11, WORKING DRAFT SEQUENCE, 19 unordered pieces. 58 6e-04 1
AC111852|AC111852.3 Rattus norvegicus clone CH230-12H7, *** SEQUENCING IN PROGRESS ***, 10 unordered pieces. 46 0.002 2
AC016765|AC016765.2 Homo sapiens chromosome 11 clone RP11-555F1, WORKING DRAFT SEQUENCE, 20 unordered pieces. 56 0.002 1
AC108448|AC108448.12 Homo sapiens chromosome 11, clone CTD-2544D21, complete sequence. 56 0.002 1
AC173359|AC173359.2 Strongylocentrotus purpuratus clone R3-3074P23, WORKING DRAFT SEQUENCE, 22 unordered pieces. 50 0.003 3
dna update 2006. 6.24
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

AJ427856_1(AJ427856|pid:none) Dictyostelium discoideum ORF encod... 451 e-125
(Q8WQ85) RecName: Full=Villidin; 451 e-125
BC135895_1(BC135895|pid:none) Xenopus tropicalis hypothetical pr... 114 1e-23
DQ453501_1(DQ453501|pid:none) Heliocidaris erythrogramma advilli... 109 2e-22
DQ453500_1(DQ453500|pid:none) Heliocidaris tuberculata advillin ... 108 5e-22
BC054960_1(BC054960|pid:none) Xenopus laevis villin 1, mRNA (cDN... 107 8e-22
EU334660_1(EU334660|pid:none) Strongylocentrotus purpuratus vill... 103 1e-20
DQ453502_1(DQ453502|pid:none) Strongylocentrotus purpuratus advi... 103 2e-20
AK154851_1(AK154851|pid:none) Mus musculus NOD-derived CD11c +ve... 102 3e-20
BC129018_1(BC129018|pid:none) Xenopus tropicalis supervillin, mR... 102 4e-20
protein update 2009. 5.30
PSORT

psg: 0.75 gvh: 0.41 alm: 0.40 top: 0.53 tms: 0.00 mit: 0.94 mip: 0.16
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

76.0 %: mitochondrial
12.0 %: cytoplasmic
8.0 %: nuclear
4.0 %: cytoskeletal

>> prediction for SHK802 is mit

5' end seq. ID SHK802F
5' end seq.
>SHK802F.Seq
AACAACAACAACAACAACAACAAGGTAGCGAATCACCAAATTCAACAACACCAAAATCAT
CAACACCAACTCAAACTAGACATCATAGTGTATTATCAAGAAATCCATCATATACATCAT
TGAAAAGTTCAGGTGGATTTTCAAAACCATCACCATCATCATCAACATCGACACCACCAT
CAACATTTGCATCACCATTATCAATTGCTCAAACTTCATCAACCACCACTACCACTACTA
CCACCACCACCACTACATCTTCTTCTTCAACACCACCACCATTAAGAGCAAACAATTCAA
CATCATCAACTCCATTAAATTCAAGTACAGGTATTAAACAAAGTGATATTGTAATTGAAG
GTAATTTAACAGAATTAATACCAGGTTTATTATGGAATTCAAATACAGAGAAATGGTATG
TAGTTTCAGAGGGTATGCTTTATTCATACAAGAATAAATCATTGAAACAATTGGATCCAT
TAGAGACTATTCATTTGGAGAAAGCAATCTCTGCACATAAAACCAAAGAAATTTTCACAA
TTCAAGGTTTTAGTTTCCAATTGTCAACACCAAATCGTATCATTCATTTATTAGCAAAAA
CCAAAGAAGAACGTGGATCATGGTTATCAGTATTACGTCAAAATCTAAANNNNNNNNNN
Length of 5' end seq. 659
3' end seq. ID SHK802Z
3' end seq.
>SHK802Z.Seq
NNNNNNNNNNTCTTAAAAGAAGGTAGTGAATCCAATGAATTTTGGTCAGCTTTTGAAAGT
ACAGGTGGTAGACAAAAATACTTTAATGATATTATGATTCAAAGTAGTTCAATTCCAACC
TCATTCACTTATAAACCACGTTTCTTTGTTTGTAGTAATGCATCGGGTATCGTTGAAGTT
ACAGAGGAATCACCATTCTCTCAAGATGATCTAGATATTGGATCCGTTTGTATATTGGAT
GTACAAAGTCATATTTACCTTTGGATTGGTAGTCGTGCAACTCATCGTACTAAACGTGCC
TCTATGGAAGTTGTATTAAACTTTATTGAAACTTCAAAATTGGGTCATTCAAAAGAGCAC
ACTAAAGTACTAATTGCAACTCCATTCGAAGAACCAATTGGTTTCAAAAGTTATTTCCGT
GCTTGGTGTACCTCTAAATATCCAAAGAATAAATTACCTTTGGTTGAAAAAGATGGTATC
CCAGTTGAACAAGTACTTAAAGATTATCTCAAAGAAATCTATACCTATGAAGAGTTATTG
GCCGATCCATTACCTGCTGGCGTTGACTCTACTAAATTGGATACTTATCTCAATGATGAA
GATTTTGAAAAAGTTTTCAAAATGACTAGAACTGAATGGTTAAAGATTCCAGCTTGGAAG
AGAGAAGGTATTAAAAAAGAATTATTCTTATTTTAAATAATAAAAATAATCTCTATCTAT
AATATACATTCATATAAATACCTATATACATATATTTAATACATATAAATTTACTATTTA
AAA
Length of 3' end seq. 783
Connected seq. ID SHK802P
Connected seq.
>SHK802P.Seq
AACAACAACAACAACAACAACAAGGTAGCGAATCACCAAATTCAACAACACCAAAATCAT
CAACACCAACTCAAACTAGACATCATAGTGTATTATCAAGAAATCCATCATATACATCAT
TGAAAAGTTCAGGTGGATTTTCAAAACCATCACCATCATCATCAACATCGACACCACCAT
CAACATTTGCATCACCATTATCAATTGCTCAAACTTCATCAACCACCACTACCACTACTA
CCACCACCACCACTACATCTTCTTCTTCAACACCACCACCATTAAGAGCAAACAATTCAA
CATCATCAACTCCATTAAATTCAAGTACAGGTATTAAACAAAGTGATATTGTAATTGAAG
GTAATTTAACAGAATTAATACCAGGTTTATTATGGAATTCAAATACAGAGAAATGGTATG
TAGTTTCAGAGGGTATGCTTTATTCATACAAGAATAAATCATTGAAACAATTGGATCCAT
TAGAGACTATTCATTTGGAGAAAGCAATCTCTGCACATAAAACCAAAGAAATTTTCACAA
TTCAAGGTTTTAGTTTCCAATTGTCAACACCAAATCGTATCATTCATTTATTAGCAAAAA
CCAAAGAAGAACGTGGATCATGGTTATCAGTATTACGTCAAAATCTAAA----------T
CTTAAAAGAAGGTAGTGAATCCAATGAATTTTGGTCAGCTTTTGAAAGTACAGGTGGTAG
ACAAAAATACTTTAATGATATTATGATTCAAAGTAGTTCAATTCCAACCTCATTCACTTA
TAAACCACGTTTCTTTGTTTGTAGTAATGCATCGGGTATCGTTGAAGTTACAGAGGAATC
ACCATTCTCTCAAGATGATCTAGATATTGGATCCGTTTGTATATTGGATGTACAAAGTCA
TATTTACCTTTGGATTGGTAGTCGTGCAACTCATCGTACTAAACGTGCCTCTATGGAAGT
TGTATTAAACTTTATTGAAACTTCAAAATTGGGTCATTCAAAAGAGCACACTAAAGTACT
AATTGCAACTCCATTCGAAGAACCAATTGGTTTCAAAAGTTATTTCCGTGCTTGGTGTAC
CTCTAAATATCCAAAGAATAAATTACCTTTGGTTGAAAAAGATGGTATCCCAGTTGAACA
AGTACTTAAAGATTATCTCAAAGAAATCTATACCTATGAAGAGTTATTGGCCGATCCATT
ACCTGCTGGCGTTGACTCTACTAAATTGGATACTTATCTCAATGATGAAGATTTTGAAAA
AGTTTTCAAAATGACTAGAACTGAATGGTTAAAGATTCCAGCTTGGAAGAGAGAAGGTAT
TAAAAAAGAATTATTCTTATTTTAAATAATAAAAATAATCTCTATCTATAATATACATTC
ATATAAATACCTATATACATATATTTAATACATATAAATTTACTATTTAAAA
Length of connected seq. 1422
Full length Seq ID -
Full length Seq. -
Length of full length seq. -