CFG231
Library CF
(Link to library)
Clone ID CFG231
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16296-1
Original site URL
Representative seq. ID CFG231E
(Link to Original site)
Representative DNA sequence
>CFG231 (CFG231Q) /CSM/CF/CFG2-B/CFG231Q.Seq.d/
ATATTATATATGAAACATTAAAAAAATGAAAGTTATATTATTATTTGTTTTAGCTGTTTT
TACTGTTTTTGTTTCAAGTAGAGGAATTCCATTAGAAGAACAAAGTCAATTCCTTGAATT
TCAAGATAAATTCAATAAAAAATATTCACATGAAGAATATTTGGAAAGATTTGAAATTTT
TAAAAGCAATTTAGGAAAAATTGAAGAATTAAATCTAATAGCCATTAATCACAAAGCTGA
TACTAAATTTGGTGTAAACAAGTTTGCAGATCTTTCCAGTGACGAATTTAAAAATTATTA
TTTAAATAATAAGGAAGCAATATTCACTGATGACCTTCCAGTTGCTGATTATCTTGATGA
TGAATTCCATTAATTCAATTCCAACTGCATTTGATTGGAGAACTAGAGGGTGCTGTTACA
CCTGTAAAAAATCAAGGTCAATGTGGTAGTTGTTGGTCATTTTCAACTANTGGTAATGTT
GAGGGACAACATTTCATTAGTCAGAATAAATTAGTTTCATTATCAGAGCAAAACTTGGTA
GATTGTGATCATGAGTGTATGGAATATGAAGGTGAACAAGCTTGTGATGAGGGTTGTAAT
GGTGGTCTTCAACCAAATGCATATAATTATATCATTAAAAATGGTGGAATTCAAACAGAA
TCTTCATATCCTTACACTGCTGAAACAGGTACACAATGTAACTTTAACTCTGCCAATATT
GGTGCAAAGATTTCCAATTTTACAATGATCCCAAAGAATGAAACTGTAATGGCTGGGTAC
ATCGTTAGTACTGGACCACTCGCAATTGCTGCTGATGCTGTTGAGTGGCAATTTTATATT
GGTGGTGTATTTGATATTCCATGTAATCCAAATTCACTTGATCATGGTATTTTAATTGTT
GGTTACTCTGCTAAAAATACAATTTTCCGTAAAAATATGCCATATTGGATTGTAAAGAAT
TCTTGGGGTGCAGATTGGGGAGAACAAGGATACATTTATTTAAGAAGAGGAAAGAATACA
TGTGGTGTAACAAAACTA
sequence update 2001. 6. 9
Translated Amino Acid sequence
ilymkh*knesyiiicfscfycfcfk*rnsirrtksip*isr*iq*kift*rifgki*nf
*kqfrkn*riksnsh*sqs*y*iwckqvcrsfq*ri*kllfk**gsnih**pssc*ls**
*IPLIQFQLHLIGELEGAVTPVKNQGQCGSCWSFSTXGNVEGQHFISQNKLVSLSEQNLV
DCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANI
GAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIV
GYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVTKL


Translated Amino Acid sequence (All Frames)
Frame A:
ilymkh*knesyiiicfscfycfcfk*rnsirrtksip*isr*iq*kift*rifgki*nf
*kqfrkn*riksnsh*sqs*y*iwckqvcrsfq*ri*kllfk**gsnih**pssc*ls**
*IPLIQFQLHLIGELEGAVTPVKNQGQCGSCWSFSTXGNVEGQHFISQNKLVSLSEQNLV
DCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANI
GAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIV
GYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVTKL


Frame B:
yyi*nikkmkvillfvlavftvfvssrgipleeqsqflefqdkfnkkysheeylerfeif
ksnlgkieelnliainhkadtkfgvnkfadlssdefknyylnnkeaiftddlpvadyldd
efh*fnsnci*len*rvllhl*kikvnvvvvghfqlxvmlrdnislvrin*fhyqsktw*
ivimsvwnmkvnklvmrvvmvvfnqmhiiislkmvefkqnlhiltllkqvhnvtltlpil
vqrfpilq*sqrmkl*wlgtslvldhsqlllmllsgnfilvvylifhviqihlimvf*ll
vtllkiqfsvkichigl*rilgvqigenkdtfi*eeerihvv*qn


Frame C:
iiyetlkk*klyyylf*lfllflfqveefh*knkvnslnfkinsiknihmkniwkdlkfl
kai*eklkn*i**plitklilnlv*tslqifpvtnlkiii*iirkqyslmtfqlliilmm
nsinsiptafdwrtrgccytckksrsmw*llvifnxw*c*gttfh*se*isfiiraklgr
l*s*vygi*r*tsl**gl*wwsstkci*lyh*kwwnsnrifislhc*nrytm*l*lcqyw
ckdfqfyndpke*ncngwvhr*ywttrncc*cc*vailywwci*ysm*skft*swyfncw
llc*kynfp*kyaildckeflgcrlgrtrihlfkkrkeymwcnkt


Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

CFG231 (CFG231Q) /CSM/CF/CFG2-B/CFG231Q.Seq.d/ 1905 0.0
VFF582 (VFF582Q) /CSM/VF/VFF5-D/VFF582Q.Seq.d/ 1857 0.0
SFJ881 (SFJ881Q) /CSM/SF/SFJ8-D/SFJ881Q.Seq.d/ 1850 0.0
SFE852 (SFE852Q) /CSM/SF/SFE8-C/SFE852Q.Seq.d/ 1850 0.0
CFF469 (CFF469Q) /CSM/CF/CFF4-C/CFF469Q.Seq.d/ 1850 0.0
AFI295 (AFI295Q) /CSM/AF/AFI2-D/AFI295Q.Seq.d/ 1850 0.0
VFA268 (VFA268Q) /CSM/VF/VFA2-C/VFA268Q.Seq.d/ 1848 0.0
SFH480 (SFH480Q) /CSM/SF/SFH4-D/SFH480Q.Seq.d/ 1832 0.0
AFE263 (AFE263Q) /CSM/AF/AFE2-C/AFE263Q.Seq.d/ 1828 0.0
AHL332 (AHL332Q) /CSM/AH/AHL3-B/AHL332Q.Seq.d/ 1824 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

X02407|X02407.1 D.discoideum mRNA for cysteine proteinase 1. 1219 0.0 4
AJ510164|AJ510164.1 Cloning vector pDXA-3strep. 111 4e-22 2
X85119|X85119.1 Artificial sequences cloning vector DNA pDXA-3H. 111 4e-22 2
X85122|X85122.1 Artificial sequences cloning vector DNA pDXA-HY. 111 4e-22 2
AJ510165|AJ510165.1 Cloning vector pDXA-3FLAG. 111 4e-22 2
X85118|X85118.1 Artificial sequences cloning vector DNA pDXA-3C. 111 4e-22 2
X85123|X85123.1 Artificial sequences cloning vector DNA pDXA-HC. 111 4e-22 2
AF269236|AF269236.1 Cloning vector pDXA-FLAG, complete sequence. 111 4e-22 2
X85120|X85120.1 Artificial sequences cloning vector DNA pDXD-3H. 111 5e-22 2
AJ510166|AJ510166.1 Cloning vector pDXA-GST. 111 5e-22 2
dna update 2003.12.19
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(P04988) RecName: Full=Cysteine proteinase 1; EC=3.4.22... 442 e-180
U42758_1(U42758|pid:none) Naegleria fowleri cysteine proteinase ... 251 1e-74
FB844544_1(FB844544|pid:none) Sequence 63817 from Patent WO20080... 239 4e-67
FJ609256_1(FJ609256|pid:none) Solanum lycopersicum cysteine prot... 234 2e-66
AJ580823_4(AJ580823|pid:none) Lotus corniculatus var. japonicus ... 236 2e-66
AJ580823_3(AJ580823|pid:none) Lotus corniculatus var. japonicus ... 236 2e-66
BT071299_1(BT071299|pid:none) Picea sitchensis clone WS02822_C20... 229 1e-65
AC149637_11(AC149637|pid:none) Medicago truncatula clone mth2-18... 233 1e-65
AF411121_1(AF411121|pid:none) Sandersonia aurantiaca cysteine pr... 229 4e-65
(P25804) RecName: Full=Cysteine proteinase 15A; EC=3.4.... 223 2e-64
protein update 2009. 6.30
PSORT

psg: 0.83 gvh: 0.50 alm: 0.38 top: 0.53 tms: 0.00 mit: 0.12 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.20 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

52.0 %: cytoplasmic
32.0 %: nuclear
8.0 %: mitochondrial
8.0 %: peroxisomal

>> prediction for CFG231 is cyt

5' end seq. ID CFG231F
5' end seq.
>CFG231F.Seq
ATATTATATATGAAACATTAAAAAAATGAAAGTTATATTATTATTTGTTTTAGCTGTTTT
TACTGTTTTTGTTTCAAGTAGAGGAATTCCATTAGAAGAACAAAGTCAATTCCTTGAATT
TCAAGATAAATTCAATAAAAAATATTCACATGAAGAATATTTGGAAAGATTTGAAATTTT
TAAAAGCAATTTAGGAAAAATTGAAGAATTAAATCTAATAGCCATTAATCACAAAGCTGA
TACTAAATTTGGTGTAAACAAGTTTGCAGATCTTTCCAGTGACGAATTTAAAAATTATTA
TTTAAATAATAAGGAAGCAATATTCACTGATGACCTTCCAGTTGCTGATTATCTTGATGA
TGAATTCATTAATTCAATTCCAACTGCATTTGATTGGAGAACTAGAGGTGCTGTTACACC
TGTAAAAAATCAAGGTCAATGTGGTAGTTGTTGGTCATTTTCAACTACTGGTAATGTTGA
GGGACAACATTTCATTAGTCAGAATAAATTAGTTTCATTATCAGAGCAAAACTTGGTAGA
TTGTGATCATGAGTGTATGGAATATGAAGGTGAACAAGCTTGTGATGAGGGTTGTAATGG
TGGTCTTCAACCAAATGCATATAATTATATCATTAAA----------
Length of 5' end seq. 637
3' end seq. ID CFG231Z
3' end seq.
>CFG231Z.Seq
----------TGATTATCTTGATGATGAATTCCATTAATTCAATTCCAACTGCATTTGAT
TGGAGAACTAGAGGGTGCTGTTACACCTGTAAAAAATCAAGGTCAATGTGGTAGTTGTTG
GTCATTTTCAACTATTGGTAATGTTGAGGGACAACATTTCATTAGTCAGAATAAATTAGT
TTCATTATCAGAGCAAAACTTGGTAGATTGTGATCATGAGTGTATGGAATATGAAGGTGA
ACAAGCTTGTGATGAGGGTTGTAATGGTGGTCTTCAACCAAATGCATATAATTATATCAT
TAAAAATGGTGGAATTCAAACAGAATCTTCATATCCTTACACTGCTGAAACAGGTACACA
ATGTAACTTTAACTCTGCCAATATTGGTGCAAAGATTTCCAATTTTACAATGATCCCAAA
GAATGAAACTGTAATGGCTGGGTACATCGTTAGTACTGGACCACTCGCAATTGCTGCTGA
TGCTGTTGAGTGGCAATTTTATATTGGTGGTGTATTTGATATTCCATGTAATCCAAATTC
ACTTGATCATGGTATTTTAATTGTTGGTTACTCTGCTAAAAATACAATTTTCCGTAAAAA
TATGCCATATTGGATTGTAAAGAATTCTTGGGGTGCAGATTGGGGAGAACAAGGATACAT
TTATTTAAGAAGAGGAAAGAATACATGTGGTGTAACAAAACTA
Length of 3' end seq. 693
Connected seq. ID CFG231P
Connected seq.
>CFG231P.Seq
ATATTATATATGAAACATTAAAAAAATGAAAGTTATATTATTATTTGTTTTAGCTGTTTT
TACTGTTTTTGTTTCAAGTAGAGGAATTCCATTAGAAGAACAAAGTCAATTCCTTGAATT
TCAAGATAAATTCAATAAAAAATATTCACATGAAGAATATTTGGAAAGATTTGAAATTTT
TAAAAGCAATTTAGGAAAAATTGAAGAATTAAATCTAATAGCCATTAATCACAAAGCTGA
TACTAAATTTGGTGTAAACAAGTTTGCAGATCTTTCCAGTGACGAATTTAAAAATTATTA
TTTAAATAATAAGGAAGCAATATTCACTGATGACCTTCCAGTTGCTGATTATCTTGATGA
TGAATTCATTAATTCAATTCCAACTGCATTTGATTGGAGAACTAGAGGTGCTGTTACACC
TGTAAAAAATCAAGGTCAATGTGGTAGTTGTTGGTCATTTTCAACTACTGGTAATGTTGA
GGGACAACATTTCATTAGTCAGAATAAATTAGTTTCATTATCAGAGCAAAACTTGGTAGA
TTGTGATCATGAGTGTATGGAATATGAAGGTGAACAAGCTTGTGATGAGGGTTGTAATGG
TGGTCTTCAACCAAATGCATATAATTATATCATTAAA----------TGATTATCTTGAT
GATGAATTCCATTAATTCAATTCCAACTGCATTTGATTGGAGAACTAGAGGGTGCTGTTA
CACCTGTAAAAAATCAAGGTCAATGTGGTAGTTGTTGGTCATTTTCAACTATTGGTAATG
TTGAGGGACAACATTTCATTAGTCAGAATAAATTAGTTTCATTATCAGAGCAAAACTTGG
TAGATTGTGATCATGAGTGTATGGAATATGAAGGTGAACAAGCTTGTGATGAGGGTTGTA
ATGGTGGTCTTCAACCAAATGCATATAATTATATCATTAAAAATGGTGGAATTCAAACAG
AATCTTCATATCCTTACACTGCTGAAACAGGTACACAATGTAACTTTAACTCTGCCAATA
TTGGTGCAAAGATTTCCAATTTTACAATGATCCCAAAGAATGAAACTGTAATGGCTGGGT
ACATCGTTAGTACTGGACCACTCGCAATTGCTGCTGATGCTGTTGAGTGGCAATTTTATA
TTGGTGGTGTATTTGATATTCCATGTAATCCAAATTCACTTGATCATGGTATTTTAATTG
TTGGTTACTCTGCTAAAAATACAATTTTCCGTAAAAATATGCCATATTGGATTGTAAAGA
ATTCTTGGGGTGCAGATTGGGGAGAACAAGGATACATTTATTTAAGAAGAGGAAAGAATA
CATGTGGTGTAACAAAACTA
Length of connected seq. 1330
Full length Seq ID CFG231E
Full length Seq.
>CFG231E.Seq
ATATTATATATGAAACATTAAAAAAATGAAAGTTATATTATTATTTGTTTTAGCTGTTTT
TACTGTTTTTGTTTCAAGTAGAGGAATTCCATTAGAAGAACAAAGTCAATTCCTTGAATT
TCAAGATAAATTCAATAAAAAATATTCACATGAAGAATATTTGGAAAGATTTGAAATTTT
TAAAAGCAATTTAGGAAAAATTGAAGAATTAAATCTAATAGCCATTAATCACAAAGCTGA
TACTAAATTTGGTGTAAACAAGTTTGCAGATCTTTCCAGTGACGAATTTAAAAATTATTA
TTTAAATAATAAGGAAGCAATATTCACTGATGACCTTCCAGTTGCTGATTATCTTGATGA
TGAATTCCATTAATTCAATTCCAACTGCATTTGATTGGAGAACTAGAGGGTGCTGTTACA
CCTGTAAAAAATCAAGGTCAATGTGGTAGTTGTTGGTCATTTTCAACTANTGGTAATGTT
GAGGGACAACATTTCATTAGTCAGAATAAATTAGTTTCATTATCAGAGCAAAACTTGGTA
GATTGTGATCATGAGTGTATGGAATATGAAGGTGAACAAGCTTGTGATGAGGGTTGTAAT
GGTGGTCTTCAACCAAATGCATATAATTATATCATTAAAAATGGTGGAATTCAAACAGAA
TCTTCATATCCTTACACTGCTGAAACAGGTACACAATGTAACTTTAACTCTGCCAATATT
GGTGCAAAGATTTCCAATTTTACAATGATCCCAAAGAATGAAACTGTAATGGCTGGGTAC
ATCGTTAGTACTGGACCACTCGCAATTGCTGCTGATGCTGTTGAGTGGCAATTTTATATT
GGTGGTGTATTTGATATTCCATGTAATCCAAATTCACTTGATCATGGTATTTTAATTGTT
GGTTACTCTGCTAAAAATACAATTTTCCGTAAAAATATGCCATATTGGATTGTAAAGAAT
TCTTGGGGTGCAGATTGGGGAGAACAAGGATACATTTATTTAAGAAGAGGAAAGAATACA
TGTGGTGTAACAAAACTA
Length of full length seq. 1038