SFC182
Library SF
Clone ID SFC182
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U13965-1
Representative seq. ID SFC182E
Representative DNA sequence
>SFC182 (SFC182Q) /CSM/SF/SFC1-D/SFC182Q.Seq.d/
AAATAATTTATATATATAAAAAAATGAGATTTTCATACATCATTTGTTTAATCTTTGTAT
CTTTTTACTTTGCTTCAGTTTGTTTAGGTTCATTCCTTGATAAACCAGTTTTAGATGATA
ACCTCATCAATTCAATCAATAATAATAAAAAATCATCATGGACTGCCCATAGAAATAAAA
ATTTCGAAGGTAAGACTTTTGGTGATATCATTGGTATGATGGGTACTAAAAAAACTGCTG
CTCCATTCAAATTAACTGAAAATGGTGAAGAACTCAAAGGTTCAATCCCAACTTCATTCG
ATTCTCGTGTCCAATGGCCAGACTGTATCCACCCAATCCTCAACCAAGAACAATGTGGTT
CATGTTGGGCCTTTTCTTCATCTGAAGTTTTAAGTGATAGATTATGTATTGCCTCAAATA
ATAAAACTAACCCAGGTGCTCTCAGTCCACAAACTTTAGTTGCTTGTGATGTATATGGTA
ATGATGGTTGTAGTGGTGGTATCCCACAATTAGCTTGGGAATATATGGAACTTAAAGGTT
TACCAACTGACTCATGCGTCCCATACACTGCTGGTAACGGTACTGTCTACTCTTGTCAAA
GATCATGTTCCGATAGTGAAGATTACAGTTTATACAGAGCTAAGCCATTCACCTTAAAGA
CTTGCTCTTCAGTTCAATGTATCCAAGAAAACATTTTAGCTTATGGTCCAATCGTTGGTA
CTATGGAAGTTTATGAAGATTTTATGAGCTACAGCTCAGGTGTTTACGTTATGACTCCAG
GTTCATCTTATTAGGTGGTCATGCTATTAAAATTGTTGGTTGGGGCTTTGATCAAACCTC
TCAATTAAACTACTGGATTGTTGCTAATTCATGGGGTGCTGACTGGGGTCAACAAGGTTT
CTTTTTCATTTCAATGGAAACTTGTTCAATTTCTAGTGATGCAAGTGCTGCCGAAGCCCG
TGTTTAAATTGTTTCAAATAAAATTTCATTTACACATTAATAATTTT
sequence update 2001. 6. 9
Translated Amino Acid sequence
IIYIYKKMRFSYIICLIFVSFYFASVCLGSFLDKPVLDDNLINSINNNKKSSWTAHRNKN
FEGKTFGDIIGMMGTKKTAAPFKLTENGEELKGSIPTSFDSRVQWPDCIHPILNQEQCGS
CWAFSSSEVLSDRLCIASNNKTNPGALSPQTLVACDVYGNDGCSGGIPQLAWEYMELKGL
PTDSCVPYTAGNGTVYSCQRSCSDSEDYSLYRAKPFTLKTCSSVQCIQENILAYGPIVGT
MEVYEDFMSYSSGVYVMTPGSSY*vvmllkllvgalikpln*ttglllihgvltgvnkvs
fsfqwklvqflvmqvlpkpvfklfqikfhlhinnf


Translated Amino Acid sequence (All Frames)
Frame A:
k*fiyikk*dfhtsfv*slylftllqfv*vhslinqf*mitssiqsiiiknhhglpieik
iskvrllvislv*wvlkklllhsn*lkmvknskvqsqlhsilvsngqtvstqsstknnvv
hvgpflhlkf*vidyvlpqiikltqvlsvhkl*llvmymvmmvvvvvshn*lgniwnlkv
yqlthashtllvtvlstlvkdhvpivkitvytelshsp*rlalqfnvskktf*lmvqslv
lwkfmkil*ataqvftl*lqvhlirwscy*ncwlgl*snlsiklldcc*fmgc*lgstrf
lfhfngnlfnf**ckccrspclncfk*nfiytlii


Frame B:
nnlyi*kneifihhlfnlcifllcfslfrfip**tsfr**phqfnq***kiimdcp*k*k
frr*dfw*yhwydgy*knccsiqin*kw*rtqrfnpnfirfscpmarlyppnpqprtmwf
mlglffi*sfk**imyclk**n*prcsqstnfscl*ciw**wl*wwyptislgiygt*rf
tn*lmrpihcw*ryclllskimfr**rlqfiqs*aihlkdllfssmyprkhfslwsnrwy
ygsl*rfyelqlrclrydsrfillgghaikivgwgfdqtsqlnywivanswgadwgqqgf
ffismetcsissdasaaearv*ivsnkisfth**f


Frame C:
IIYIYKKMRFSYIICLIFVSFYFASVCLGSFLDKPVLDDNLINSINNNKKSSWTAHRNKN
FEGKTFGDIIGMMGTKKTAAPFKLTENGEELKGSIPTSFDSRVQWPDCIHPILNQEQCGS
CWAFSSSEVLSDRLCIASNNKTNPGALSPQTLVACDVYGNDGCSGGIPQLAWEYMELKGL
PTDSCVPYTAGNGTVYSCQRSCSDSEDYSLYRAKPFTLKTCSSVQCIQENILAYGPIVGT
MEVYEDFMSYSSGVYVMTPGSSY*vvmllkllvgalikpln*ttglllihgvltgvnkvs
fsfqwklvqflvmqvlpkpvfklfqikfhlhinnf
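
The three frames listed above can be reproduced from the representative DNA sequence with a short script. The following is a minimal sketch, assuming the standard genetic code and assuming that Frames A, B and C correspond to reading offsets 0, 1 and 2; the function names `translate` and `three_frames` are illustrative and are not part of the database. It does not reproduce the record's upper/lower-case convention.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G
# (third base varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, offset=0):
    """Translate dna from the given offset; unknown codons become 'X'."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frames(dna):
    """Return forward-strand translations keyed 'A', 'B', 'C' (offsets 0, 1, 2)."""
    return {name: translate(dna, offset) for offset, name in enumerate("ABC")}


if __name__ == "__main__":
    # First 36 nucleotides of the representative sequence in this record.
    prefix = "AAATAATTTATATATATAAAAAAATGAGATTTTCAT"
    for name, protein in three_frames(prefix).items():
        print(f"Frame {name}: {protein}")
```

Run on the first 36 nucleotides, frame C begins `IIYIYKKMRFS`, which agrees with the start of the Frame C translation in the record.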


Homology vs CSM-cDNA

Sequences producing significant alignments:    Score (bits)    E value

SFC182 (SFC182Q) /CSM/SF/SFC1-D/SFC182Q.Seq.d/ 1913 0.0
VFI135 (VFI135Q) /CSM/VF/VFI1-B/VFI135Q.Seq.d/ 1897 0.0
VFG553 (VFG553Q) /CSM/VF/VFG5-C/VFG553Q.Seq.d/ 1887 0.0
VFD411 (VFD411Q) /CSM/VF/VFD4-A/VFD411Q.Seq.d/ 1885 0.0
VFI602 (VFI602Q) /CSM/VF/VFI6-A/VFI602Q.Seq.d/ 1879 0.0
VFF125 (VFF125Q) /CSM/VF/VFF1-B/VFF125Q.Seq.d/ 1875 0.0
VFM316 (VFM316Q) /CSM/VF/VFM3-A/VFM316Q.Seq.d/ 1859 0.0
VSJ659 (VSJ659Q) /CSM/VS/VSJ6-C/VSJ659Q.Seq.d/ 1853 0.0
VFN281 (VFN281Q) /CSM/VF/VFN2-D/VFN281Q.Seq.d/ 1848 0.0
VHH551 (VHH551Q) /CSM/VH/VHH5-C/VHH551Q.Seq.d/ 1816 0.0

own update 2002.11.29
Homology vs DNA

Sequences producing significant alignments:    Score (bits)    E value    N

CF779981|CF779981.1 tad06b06.x1 Hydra EST -IV Hydra magnipapillata cDNA 3' similar to SW:CATB_BOVIN P07688 CATHEPSIN B PRECURSOR ;, mRNA sequence. 52 1e-06 2
AC116982|AC116982.2 Dictyostelium discoideum chromosome 2 map 3622643-3879522 strain AX4, complete sequence. 34 5e-05 16
AC117176|AC117176.2 Dictyostelium discoideum chromosome 2 map 5018074-5200947 strain AX4, complete sequence. 32 3e-04 14
BX000475|BX000475.6 Zebrafish DNA sequence from clone CH211-112P5. 38 0.006 9
AC116986|AC116986.2 Dictyostelium discoideum chromosome 2 map 2234041-2567370 strain AX4, complete sequence. 36 0.016 16
AF409138|AF409138.1 Lumpy skin disease virus isolate Neethling vaccine LW 1959, complete genome. 38 0.032 10
AE014825|AE014825.1 Plasmodium falciparum 3D7 chromosome 14 section 10 of 13 of the complete sequence. 32 0.039 11
AC117891|AC117891.5 Rattus norvegicus clone CH230-290B16, WORKING DRAFT SEQUENCE, 2 unordered pieces. 40 0.044 7
BX004878|BX004878.8 Zebrafish DNA sequence from clone DKEY-11H2 in linkage group 6. 38 0.044 10
AC117032|AC117032.3 Rattus norvegicus clone CH230-207K17, *** SEQUENCING IN PROGRESS ***, 15 unordered pieces. 36 0.050 11
dna update 2003.12.18
Homology vs Protein

Sequences producing significant alignments:    Score (bits)    E value

AY204512_1(AY204512|pid:none) Sterkiella histriomuscorum catheps... 146 5e-44
DQ363675_1(DQ363675|pid:none) Streblomastix strix cathepsin B ge... 141 1e-43
AF483623_1(AF483623|pid:none) Apriona germari cathepsin B mRNA, ... 128 3e-38
AF358667_1(AF358667|pid:none) Oncorhynchus mykiss procathepsin B... 119 3e-38
EF474111_1(EF474111|pid:none) Monocercomonoides sp. PA cathepsin... 131 4e-38
U51892_1(U51892|pid:none) Ascaris suum cathepsin B-like cysteine... 120 1e-37
EF474109_1(EF474109|pid:none) Monocercomonoides sp. PA cathepsin... 130 1e-37
AY737533_1(AY737533|pid:none) Toxoptera citricida putative cathe... 125 2e-37
AY813193_1(AY813193|pid:none) Schistosoma japonicum clone SJCHGC... 121 3e-37
AY553271_1(AY553271|pid:none) Triatoma sordida cathepsin B-like ... 117 3e-37
protein update 2009. 7. 3
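
Hit lines such as those in the protein table above can be split into a description, a bit score and an E-value for filtering. This is a rough sketch, assuming each line ends with exactly two whitespace-separated numeric fields (score, then E-value); `parse_hit` and `filter_hits` are hypothetical helper names, and lines from the DNA table, which carry an extra N column, would need a different split.

```python
def parse_hit(line):
    """Split a hit line into (description, bit_score, e_value).

    Assumes the last two whitespace-separated fields are the score and E-value.
    """
    description, score, evalue = line.rsplit(None, 2)
    return description, float(score), float(evalue)


def filter_hits(lines, max_evalue):
    """Keep parsed hits whose E-value is at or below max_evalue."""
    hits = [parse_hit(line) for line in lines if line.strip()]
    return [hit for hit in hits if hit[2] <= max_evalue]


if __name__ == "__main__":
    # Two lines copied from the protein homology table in this record.
    table = [
        "AY204512_1(AY204512|pid:none) Sterkiella histriomuscorum catheps... 146 5e-44",
        "AY553271_1(AY553271|pid:none) Triatoma sordida cathepsin B-like ... 117 3e-37",
    ]
    for description, score, evalue in filter_hits(table, 1e-40):
        print(description, score, evalue)
```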
PSORT

psg: 0.98 gvh: 0.61 alm: 0.22 top: 0.13 tms: 0.07 mit: 0.37 mip: 0.03
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 1.00 tyr: 1.15 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 1.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 0.00

20.0 %: Golgi
20.0 %: mitochondrial
20.0 %: nuclear
20.0 %: endoplasmic reticulum
12.0 %: cytoplasmic
4.0 %: extracellular, including cell wall
4.0 %: plasma membrane

>> prediction for SFC182 is gol

5' end seq. ID SFC182F
5' end seq.
>SFC182F.Seq
AAATAATTTATATATATAAAAAAATGAGATTTTCATACATCATTTGTTTAATCTTTGTAT
CTTTTTACTTTGCTTCAGTTTGTTTAGGTTCATTCCTTGATAAACCAGTTTTAGATGATA
ACCTCATCAATTCAATCAATAATAATAAAAAATCATCATGGACTGCCCATAGAAATAAAA
ATTTCGAAGGTAAGACTTTTGGTGATATCATTGGTATGATGGGTACTAAAAAAACTGCTG
CTCCATTCAAATTAACTGAAAATGGTGAAGAACTCAAAGGTTCAATCCCAACTTCATTCG
ATTCTCGTGTCCAATGGCCAGACTGTATCCACCCAATCCTCAACCAAGAACAATGTGGTT
CATGTTGGGCCTTTTCTTCATCTGAAGTTTTAAGTGATAGATTATGTATTGCCTCAAATA
ATAAAACTAACCCAGGTGCTCTCAGTCCACAAACTTTAGTTGCTTGTGATGTATATGGTA
ATGATGGTTGTAGTGGTGGTATCCCACAATTAGCTTGGGAATATATGGAACTTAAAGGTT
TACCAACTGACTCATGCGTCCCATACACTGCTGGTAACGGTACTGTCTACTCTTGTCAAA
GATCATGTTCCGATAGTGAAGATTACAGT----------
Length of 5' end seq. 629
3' end seq. ID SFC182Z
3' end seq.
>SFC182Z.Seq
----------AATGGCCAGACTGTATCCACCCAATCCTCAACCAAGAACAATGTGGTTCA
TGTTGGGCCTTTTCTTCATCTGAAGTTTTAAGTGATAGATTATGTATTGCCTCAAATAAT
AAAACTAACCCAGGTGCTCTCAGTCCACAAACTTTAGTTGCTTGTGATGTATATGGTAAT
GATGGTTGTAGTGGTGGTATCCCACAATTAGCTTGGGAATATATGGAACTTAAAGGTTTA
CCAACTGACTCATGCGTCCCATACACTGCTGGTAACGGTACTGTCTACTCTTGTCAAAGA
TCATGTTCCGATAGTGAAGATTACAGTTTATACAGAGCTAAGCCATTCACCTTAAAGACT
TGCTCTTCAGTTCAATGTATCCAAGAAAACATTTTAGCTTATGGTCCAATCGTTGGTACT
ATGGAAGTTTATGAAGATTTTATGAGCTACAGCTCAGGTGTTTACGTTATGACTCCAGGT
TCATCTTATTAGGTGGTCATGCTATTAAAATTGTTGGTTGGGGCTTTGATCAAACCTCTC
AATTAAACTACTGGATTGTTGCTAATTCATGGGGTGCTGACTGGGGTCAACAAGGTTTCT
TTTTCATTTCAATGGAAACTTGTTCAATTTCTAGTGATGCAAGTGCTGCCGAAGCCCGTG
TTTAAATTGTTTCAAATAAAATTTCATTTACACATTAATAATTTT
Length of 3' end seq. 695
Connected seq. ID SFC182P
Connected seq.
>SFC182P.Seq
AAATAATTTATATATATAAAAAAATGAGATTTTCATACATCATTTGTTTAATCTTTGTAT
CTTTTTACTTTGCTTCAGTTTGTTTAGGTTCATTCCTTGATAAACCAGTTTTAGATGATA
ACCTCATCAATTCAATCAATAATAATAAAAAATCATCATGGACTGCCCATAGAAATAAAA
ATTTCGAAGGTAAGACTTTTGGTGATATCATTGGTATGATGGGTACTAAAAAAACTGCTG
CTCCATTCAAATTAACTGAAAATGGTGAAGAACTCAAAGGTTCAATCCCAACTTCATTCG
ATTCTCGTGTCCAATGGCCAGACTGTATCCACCCAATCCTCAACCAAGAACAATGTGGTT
CATGTTGGGCCTTTTCTTCATCTGAAGTTTTAAGTGATAGATTATGTATTGCCTCAAATA
ATAAAACTAACCCAGGTGCTCTCAGTCCACAAACTTTAGTTGCTTGTGATGTATATGGTA
ATGATGGTTGTAGTGGTGGTATCCCACAATTAGCTTGGGAATATATGGAACTTAAAGGTT
TACCAACTGACTCATGCGTCCCATACACTGCTGGTAACGGTACTGTCTACTCTTGTCAAA
GATCATGTTCCGATAGTGAAGATTACAGT----------AATGGCCAGACTGTATCCACC
CAATCCTCAACCAAGAACAATGTGGTTCATGTTGGGCCTTTTCTTCATCTGAAGTTTTAA
GTGATAGATTATGTATTGCCTCAAATAATAAAACTAACCCAGGTGCTCTCAGTCCACAAA
CTTTAGTTGCTTGTGATGTATATGGTAATGATGGTTGTAGTGGTGGTATCCCACAATTAG
CTTGGGAATATATGGAACTTAAAGGTTTACCAACTGACTCATGCGTCCCATACACTGCTG
GTAACGGTACTGTCTACTCTTGTCAAAGATCATGTTCCGATAGTGAAGATTACAGTTTAT
ACAGAGCTAAGCCATTCACCTTAAAGACTTGCTCTTCAGTTCAATGTATCCAAGAAAACA
TTTTAGCTTATGGTCCAATCGTTGGTACTATGGAAGTTTATGAAGATTTTATGAGCTACA
GCTCAGGTGTTTACGTTATGACTCCAGGTTCATCTTATTAGGTGGTCATGCTATTAAAAT
TGTTGGTTGGGGCTTTGATCAAACCTCTCAATTAAACTACTGGATTGTTGCTAATTCATG
GGGTGCTGACTGGGGTCAACAAGGTTTCTTTTTCATTTCAATGGAAACTTGTTCAATTTC
TAGTGATGCAAGTGCTGCCGAAGCCCGTGTTTAAATTGTTTCAAATAAAATTTCATTTAC
ACATTAATAATTTT
Length of connected seq. 1324
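
The reported lengths can be checked by counting nucleotides only. The sketch below assumes that the run of `-` characters is a join marker between the 5' and 3' reads and is not counted as sequence; `ungapped_length` is an illustrative name. Under that assumption the connected length is the sum of the two end-read lengths given in this record (629 + 695 = 1324).

```python
def ungapped_length(seq):
    """Count nucleotide characters (A, C, G, T, N), ignoring '-' and whitespace."""
    return sum(1 for ch in seq if ch.upper() in "ACGTN")


if __name__ == "__main__":
    # Toy example: two short fragments joined by a ten-dash marker.
    joined = "ACAGT" + "-" * 10 + "AATGG"
    print(ungapped_length(joined))

    # Lengths as stated in the record for the 5' and 3' end sequences.
    five_prime_len, three_prime_len = 629, 695
    print(five_prime_len + three_prime_len)
```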
Full length Seq ID SFC182E
Full length Seq.
>SFC182E.Seq
AAATAATTTATATATATAAAAAAATGAGATTTTCATACATCATTTGTTTAATCTTTGTAT
CTTTTTACTTTGCTTCAGTTTGTTTAGGTTCATTCCTTGATAAACCAGTTTTAGATGATA
ACCTCATCAATTCAATCAATAATAATAAAAAATCATCATGGACTGCCCATAGAAATAAAA
ATTTCGAAGGTAAGACTTTTGGTGATATCATTGGTATGATGGGTACTAAAAAAACTGCTG
CTCCATTCAAATTAACTGAAAATGGTGAAGAACTCAAAGGTTCAATCCCAACTTCATTCG
ATTCTCGTGTCCAATGGCCAGACTGTATCCACCCAATCCTCAACCAAGAACAATGTGGTT
CATGTTGGGCCTTTTCTTCATCTGAAGTTTTAAGTGATAGATTATGTATTGCCTCAAATA
ATAAAACTAACCCAGGTGCTCTCAGTCCACAAACTTTAGTTGCTTGTGATGTATATGGTA
ATGATGGTTGTAGTGGTGGTATCCCACAATTAGCTTGGGAATATATGGAACTTAAAGGTT
TACCAACTGACTCATGCGTCCCATACACTGCTGGTAACGGTACTGTCTACTCTTGTCAAA
GATCATGTTCCGATAGTGAAGATTACAGTTTATACAGAGCTAAGCCATTCACCTTAAAGA
CTTGCTCTTCAGTTCAATGTATCCAAGAAAACATTTTAGCTTATGGTCCAATCGTTGGTA
CTATGGAAGTTTATGAAGATTTTATGAGCTACAGCTCAGGTGTTTACGTTATGACTCCAG
GTTCATCTTATTAGGTGGTCATGCTATTAAAATTGTTGGTTGGGGCTTTGATCAAACCTC
TCAATTAAACTACTGGATTGTTGCTAATTCATGGGGTGCTGACTGGGGTCAACAAGGTTT
CTTTTTCATTTCAATGGAAACTTGTTCAATTTCTAGTGATGCAAGTGCTGCCGAAGCCCG
TGTTTAAATTGTTTCAAATAAAATTTCATTTACACATTAATAATTTT
Length of full length seq. 1007