CHR223
Library CH
Clone ID CHR223
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16590-1
Representative seq. ID CHR223P
Representative DNA sequence
>CHR223 (CHR223Q) /CSM/CH/CHR2-A/CHR223Q.Seq.d/
CAATGTCACACCAGNAGGTTTCAAGAGTTGCAATCGTTGGCTTGGGTAAGAAAGAAAATA
ATAATAGCACCACCTATGAAAAGAATGAAAACACTCGTAAAGCAATCGGTAGTGGTGTTA
AAGCATTGAAATCAAAGAATGCCACCCATTTAACCATCGATTCAAACATTGGTGATGCCA
AACAAACTGCAGAGGGTGCATTCCTTTCAAACTTCAAATTTGATTTCAAAACAGGTACTT
CTGGTAAAACAGCCAATTCCACCAATGAATCCATTCAAGTTCAATTATCACCATCACCAT
CAAGTGAAGAATGTTTTAAAGAAGGTAAAATCTTGGCTGAATCTCAAAATTTCGCAAGAG
TTTTAATGGAGACACCAGCAAATCTTTTAACACCAACCAATTTCGTTCAACATGTTAGCA
GTCAAATGAAAGAGTTAATCGATAGTGGAAAGGTGGAGATGATCGTCCGTGAAGAACAAT
GGGTTAAAGATCAAAAGATGGGTATGTTTTGGGGTGTTGCTAAAGGTTCCGATGAACCAT
TGAAATTCTTAGAACTTCACTATCGTGGTGCATCTGCCGATGGTAAGGATTCAATXXXXX
XXXXXGGTGCATCTGCCGATGGTAAGGATTCAATAGTTTATGTTGGTAAAGGTATCACTT
TTGATAGTGGTGGTATTTCAATTAAACCATCAGCAAATATGGGTTTAATGAAGGGTGATA
TGGGTGGTGCTGCCACTGCTGTCTCTGCAATGTTTGGTGTTGCTTCATTGGGTTTAAAGG
TCAATTTAATTACAATCACTCCATTATGTGAAAATATGCCATCAGGTAAAGCAACTAAAC
CAGGTGATATCCTTACCGCTGCAAATGGTAAAACCGTTGAAGTCGACAATACCGATGCCG
AGGGTCGTTTAATCTTGGGTGATGCTTTACATTATGCTTGTTCATTCAAACCAACTCATA
TCATTGATATCGCTACCTTGACTGGTGCCATCGATGTTGCCTTGGGTCAACATTATGCTG
GTTGTTTTACAACCACCGACTCACTTTGGGATCAATTAAATGAATGTGGTAACATTAGTG
GTGAAAGATTATGGAGAATGCCATTGATTCCAGAATATCGTAAACAAATGGAAACCTCAA
AAGTTGCCGATTTAATCAATTCTGCTGGTCGTTCAGGTGGTGCTTGTTGTGCTGCTGGTT
TCCTTAAAGAATTCATTACAGCCGATCAATCTTGGTCTCACCTTGATATTGCTGGTGTTA
TGTCATCATCTGAAGATGGTCCATACATTAGAAAAGGTAGACTGGTAAACCAACTCGTAC
TTTAATAGAATTGCTAAAAAGAATCAACAA
sequence update 2002.10.25
Translated Amino Acid sequence
MSHQXVSRVAIVGLGKKENNNSTTYEKNENTRKAIGSGVKALKSKNATHLTIDSNIGDAK
QTAEGAFLSNFKFDFKTGTSGKTANSTNESIQVQLSPSPSSEECFKEGKILAESQNFARV
LMETPANLLTPTNFVQHVSSQMKELIDSGKVEMIVREEQWVKDQKMGMFWGVAKGSDEPL
KFLELHYRGASADGKDS---

---GASADGKDSIVYVGKGITFDSGGISIKPSANMGLMKGDMGGAATAVSAMFGVASLGL
KVNLITITPLCENMPSGKATKPGDILTAANGKTVEVDNTDAEGRLILGDALHYACSFKPT
HIIDIATLTGAIDVALGQHYAGCFTTTDSLWDQLNECGNISGERLWRMPLIPEYRKQMET
SKVADLINSAGRSGGACCAAGFLKEFITADQSWSHLDIAGVMSSSEDGPYIRKGRLVNQL
VL**nc*kest


Translated Amino Acid sequence (All Frames)
Frame A:
qchtxrfqelqslawvrkkiiiappmkrmktlvkqsvvvlkh*nqrmppi*psiqtlvmp
nklqrvhsfqtsnliskqvllvkqpippmnpfkfnyhhhhqvknvlkkvkswlnlkisqe
f*wrhqqif*hqpisfnmlavk*ks*siverwr*ssvknnglkikrwvcfgvllkvpmnh
*ns*nftivvhlpmvriq---

---GASADGKDSIVYVGKGITFDSGGISIKPSANMGLMKGDMGGAATAVSAMFGVASLGL
KVNLITITPLCENMPSGKATKPGDILTAANGKTVEVDNTDAEGRLILGDALHYACSFKPT
HIIDIATLTGAIDVALGQHYAGCFTTTDSLWDQLNECGNISGERLWRMPLIPEYRKQMET
SKVADLINSAGRSGGACCAAGFLKEFITADQSWSHLDIAGVMSSSEDGPYIRKGRLVNQL
VL**nc*kest

Frame B:
nvtpxgfkscnrwlg*erk***hhl*ke*khs*snr*wc*sieikechpfnhrfkhw*cq
tncrgcipfklqi*fqnryfw*nsqfhq*ihsssiititik*rmf*rr*nlg*iskfrks
fngdtsksfntnqfrstc*qsnervnr*wkggddrp*rtmg*rskdgyvlgcc*rfr*ti
eilrtslswcicrw*gfn---

---vhlpmvriq*fmlvkvsllivvvfqlnhqqiwv**rviwvvlpllslqclvllhwv*
rsi*lqslhyvkichqvkqlnqvislplqmvkplkstipmprvv*swvmlyimlvhsnql
islislp*lvpsmlpwvnimlvvlqppthfgin*mnvvtlvvkdygech*fqnivnkwkp
qklpi*sillvvqvvlvvllvslknslqpinlgltlillvlchhlkmvhtlekvdw*tns
yfnriakknqq

Frame C:
MSHQXVSRVAIVGLGKKENNNSTTYEKNENTRKAIGSGVKALKSKNATHLTIDSNIGDAK
QTAEGAFLSNFKFDFKTGTSGKTANSTNESIQVQLSPSPSSEECFKEGKILAESQNFARV
LMETPANLLTPTNFVQHVSSQMKELIDSGKVEMIVREEQWVKDQKMGMFWGVAKGSDEPL
KFLELHYRGASADGKDS---

---cicrw*gfnslcw*ryhf**wwyfn*tiskygfneg*ygwcchcclcnvwccfigfk
gqfnynhsim*kyair*sn*tr*ypyrckw*nr*srqyrcrgsfnlg*cftlclfiqtns
yh*yryldwchrcclgstlcwlfynhrltlgsik*mw*h*w*kimenaidsris*tngnl
kscrfnqfcwsfrwcllccwfp*rihysrsilvsp*ycwcyvii*rwsih*kr*tgkptr
tliellkrin
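The three frames above can be reproduced from the representative DNA sequence by translating with an offset of 0, 1 or 2 bases. The following is a minimal sketch, not the database's own pipeline: the function name translate_frame is hypothetical, the use of the standard genetic code is an assumption, and the record's upper/lower-case convention is not reproduced (output is all upper case). Any codon that contains a character other than A, C, G or T (for example N, or a gap marker) is reported as X, and stop codons as *.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the record was translated with the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_frame(dna, offset):
    """Translate dna starting at offset (0, 1 or 2); hypothetical helper."""
    dna = dna.upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is dropped.
    for i in range(offset, len(dna) - 2, 3):
        # Codons with ambiguous or gap characters are not in the table.
        residues.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(residues)


# First line of the representative DNA sequence in this record.
first_line = "CAATGTCACACCAGNAGGTTTCAAGAGTTGCAATCGTTGGCTTGGGTAAGAAAGAAAATA"
for name, offset in (("A", 0), ("B", 1), ("C", 2)):
    print("Frame", name, translate_frame(first_line, offset))
```

With this input, offset 2 yields the start of Frame C (MSHQXVSRVAIVGLGKKEN), and offsets 0 and 1 yield the starts of Frames A and B apart from letter case.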

Homology vs CSM-cDNA

Sequences producing significant alignments: Score (bits) E Value

CHR223 (CHR223Q) /CSM/CH/CHR2-A/CHR223Q.Seq.d/ 2611 0.0
VFG447 (VFG447Q) /CSM/VF/VFG4-B/VFG447Q.Seq.d/ 1463 0.0
AFG522 (AFG522Q) /CSM/AF/AFG5-A/AFG522Q.Seq.d/ 1463 0.0
AFM362 (AFM362Q) /CSM/AF/AFM3-C/AFM362Q.Seq.d/ 1449 0.0
CFK183 (CFK183Q) /CSM/CF/CFK1-D/CFK183Q.Seq.d/ 1411 0.0
AFO810 (AFO810Q) /CSM/AF/AFO8-A/AFO810Q.Seq.d/ 1392 0.0
AFE161 (AFE161Q) /CSM/AF/AFE1-C/AFE161Q.Seq.d/ 1390 0.0
SFH132 (SFH132Q) /CSM/SF/SFH1-B/SFH132Q.Seq.d/ 1384 0.0
AFK487 (AFK487Q) /CSM/AF/AFK4-D/AFK487Q.Seq.d/ 1372 0.0
VFF745 (VFF745Q) /CSM/VF/VFF7-B/VFF745Q.Seq.d/ 1360 0.0

own update 2009. 4. 4
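Each hit line in the table above ends with a bit score and an E value, so it can be split from the right. The following is a minimal sketch for reading such lines; parse_hit is a hypothetical helper, and it assumes the two-column layout used in this table (the DNA table carries an extra trailing N column and would need a split on three fields instead). Older BLAST output sometimes abbreviates E values as, for example, e-100; the sketch prefixes a 1 in that case.

```python
def parse_hit(line):
    """Split one hit line into (description, bit score, E value).

    Hypothetical helper; assumes the last two whitespace-separated
    fields are the score and the E value.
    """
    description, score, evalue = line.rsplit(None, 2)
    if evalue.startswith("e"):
        # Abbreviated form such as "e-100" means 1e-100.
        evalue = "1" + evalue
    return description, float(score), float(evalue)


hit = parse_hit("VFG447 (VFG447Q) /CSM/VF/VFG4-B/VFG447Q.Seq.d/ 1463 0.0")
print(hit)
```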
Homology vs DNA

Sequences producing significant alignments: Score (bits) E Value N

AY581145|AY581145.1 Dictyostelium discoideum leucine aminopeptidase (L-A-P) gene, complete cds. 1376 0.0 3
CN558054|CN558054.1 tae47c07.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to SW:AMPL_BOVIN P00727 CYTOSOL AMINOPEPTIDASE ;, mRNA sequence. 36 2e-06 4
DR444142|DR444142.1 AR1005E06 A. gomesiana hemocytes normalized library Acanthoscurria gomesiana cDNA clone AR1005E06 5', mRNA sequence. 60 3e-06 2
AI877971|AI877971.1 fc55g11.y1 Zebrafish WashU MPIMG EST Danio rerio cDNA clone IMAGE:3725348 5' similar to SW:AMPL_BOVIN P00727 CYTOSOL AMINOPEPTIDASE ;, mRNA sequence. 48 1e-04 3
AR376563|AR376563.1 Sequence 1569 from patent US 6605709. 54 0.001 2
AR318728|AR318728.1 Sequence 1278 from patent US 6562958. 46 0.002 3
DR445042|DR445042.1 AR1015F06 A. gomesiana hemocytes normalized library Acanthoscurria gomesiana cDNA clone AR1015F06 5', mRNA sequence. 52 0.031 1
AY583235|AY583235.1 Mycoplasma fermentans strain PG18 putative transport operon, partial sequence; bacteriophage phiMFV1a, complete sequence; thymidine kinase (tmk) gene, complete sequence and leucine aminopeptidase (lap) gene, partial sequence sequence. 48 0.073 3
BX842642|BX842642.1 Mycoplasma mycoides subsp. mycoides SC genomic DNA, complete sequence; segment 1/4. 40 0.12 11
CD202921|CD202921.1 MS1-0139P-V386-A07-U.B MS1-0139 Schistosoma mansoni cDNA clone MS1-0139P-V386-A07.B, mRNA sequence. 50 0.12 1
dna update 2005.11.13
Homology vs Protein

Sequences producing significant alignments: Score (bits) E Value

(Q5V9F0) RecName: Full=Cytosol aminopeptidase; EC=3.4.1... 785 0.0
AF061738_1(AF061738|pid:none) Homo sapiens leucine aminopeptidas... 350 4e-96
AK022055_1(AK022055|pid:none) Homo sapiens cDNA FLJ11993 fis, cl... 350 4e-96
BC068707_1(BC068707|pid:none) Xenopus laevis hypothetical protei... 352 5e-96
(P00727) RecName: Full=Cytosol aminopeptidase; EC=3.4.1... 344 2e-93
NRL(1BLLE) Leucine aminopeptidase (EC 3.4.11.1) complex with ama... 344 2e-93
CT010237_1(CT010237|pid:none) Mus musculus full open reading fra... 342 5e-93
CT010301_1(CT010301|pid:none) Mus musculus full open reading fra... 341 9e-93
AK010502_1(AK010502|pid:none) Mus musculus ES cells cDNA, RIKEN ... 341 9e-93
(Q9CPY7) RecName: Full=Cytosol aminopeptidase; EC=3.4.1... 341 9e-93
protein update 2009. 4.12
PSORT

psg: 0.82 gvh: 0.30 alm: 0.35 top: 0.53 tms: 0.00 mit: 0.35 mip: 0.03
nuc: 0.03 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

36.0 %: nuclear
28.0 %: cytoplasmic
24.0 %: mitochondrial
8.0 %: plasma membrane
4.0 %: peroxisomal

>> prediction for CHR223 is nuc

5' end seq. ID CHR223F
5' end seq.
>CHR223F.Seq
CAATGTCACACCAGNAGGTTTCAAGAGTTGCAATCGTTGGCTTGGGTAAGAAAGAAAATA
ATAATAGCACCACCTATGAAAAGAATGAAAACACTCGTAAAGCAATCGGTAGTGGTGTTA
AAGCATTGAAATCAAAGAATGCCACCCATTTAACCATCGATTCAAACATTGGTGATGCCA
AACAAACTGCAGAGGGTGCATTCCTTTCAAACTTCAAATTTGATTTCAAAACAGGTACTT
CTGGTAAAACAGCCAATTCCACCAATGAATCCATTCAAGTTCAATTATCACCATCACCAT
CAAGTGAAGAATGTTTTAAAGAAGGTAAAATCTTGGCTGAATCTCAAAATTTCGCAAGAG
TTTTAATGGAGACACCAGCAAATCTTTTAACACCAACCAATTTCGTTCAACATGTTAGCA
GTCAAATGAAAGAGTTAATCGATAGTGGAAAGGTGGAGATGATCGTCCGTGAAGAACAAT
GGGTTAAAGATCAAAAGATGGGTATGTTTTGGGGTGTTGCTAAAGGTTCCGATGAACCAT
TGAAATTCTTAGAACTTCACTATCGTGGTGCATCTGCCGATGGTAAGGATTCAATNNNNN
NNNNN
Length of 5' end seq. 605
3' end seq. ID CHR223Z
3' end seq.
>CHR223Z.Seq
NNNNNNNNNNGGTGCATCTGCCGATGGTAAGGATTCAATAGTTTATGTTGGTAAAGGTAT
CACTTTTGATAGTGGTGGTATTTCAATTAAACCATCAGCAAATATGGGTTTAATGAAGGG
TGATATGGGTGGTGCTGCCACTGCTGTCTCTGCAATGTTTGGTGTTGCTTCATTGGGTTT
AAAGGTCAATTTAATTACAATCACTCCATTATGTGAAAATATGCCATCAGGTAAAGCAAC
TAAACCAGGTGATATCCTTACCGCTGCAAATGGTAAAACCGTTGAAGTCGACAATACCGA
TGCCGAGGGTCGTTTAATCTTGGGTGATGCTTTACATTATGCTTGTTCATTCAAACCAAC
TCATATCATTGATATCGCTACCTTGACTGGTGCCATCGATGTTGCCTTGGGTCAACATTA
TGCTGGTTGTTTTACAACCACCGACTCACTTTGGGATCAATTAAATGAATGTGGTAACAT
TAGTGGTGAAAGATTATGGAGAATGCCATTGATTCCAGAATATCGTAAACAAATGGAAAC
CTCAAAAGTTGCCGATTTAATCAATTCTGCTGGTCGTTCAGGTGGTGCTTGTTGTGCTGC
TGGTTTCCTTAAAGAATTCATTACAGCCGATCAATCTTGGTCTCACCTTGATATTGCTGG
TGTTATGTCATCATCTGAAGATGGTCCATACATTAGAAAAGGTAGACTGGTAAACCAACT
CGTACTTTAATAGAATTGCTAAAAAGAATCAACAA
Length of 3' end seq. 755
Connected seq. ID CHR223P
Connected seq.
>CHR223P.Seq
CAATGTCACACCAGNAGGTTTCAAGAGTTGCAATCGTTGGCTTGGGTAAGAAAGAAAATA
ATAATAGCACCACCTATGAAAAGAATGAAAACACTCGTAAAGCAATCGGTAGTGGTGTTA
AAGCATTGAAATCAAAGAATGCCACCCATTTAACCATCGATTCAAACATTGGTGATGCCA
AACAAACTGCAGAGGGTGCATTCCTTTCAAACTTCAAATTTGATTTCAAAACAGGTACTT
CTGGTAAAACAGCCAATTCCACCAATGAATCCATTCAAGTTCAATTATCACCATCACCAT
CAAGTGAAGAATGTTTTAAAGAAGGTAAAATCTTGGCTGAATCTCAAAATTTCGCAAGAG
TTTTAATGGAGACACCAGCAAATCTTTTAACACCAACCAATTTCGTTCAACATGTTAGCA
GTCAAATGAAAGAGTTAATCGATAGTGGAAAGGTGGAGATGATCGTCCGTGAAGAACAAT
GGGTTAAAGATCAAAAGATGGGTATGTTTTGGGGTGTTGCTAAAGGTTCCGATGAACCAT
TGAAATTCTTAGAACTTCACTATCGTGGTGCATCTGCCGATGGTAAGGATTCAAT-----
-----GGTGCATCTGCCGATGGTAAGGATTCAATAGTTTATGTTGGTAAAGGTATCACTT
TTGATAGTGGTGGTATTTCAATTAAACCATCAGCAAATATGGGTTTAATGAAGGGTGATA
TGGGTGGTGCTGCCACTGCTGTCTCTGCAATGTTTGGTGTTGCTTCATTGGGTTTAAAGG
TCAATTTAATTACAATCACTCCATTATGTGAAAATATGCCATCAGGTAAAGCAACTAAAC
CAGGTGATATCCTTACCGCTGCAAATGGTAAAACCGTTGAAGTCGACAATACCGATGCCG
AGGGTCGTTTAATCTTGGGTGATGCTTTACATTATGCTTGTTCATTCAAACCAACTCATA
TCATTGATATCGCTACCTTGACTGGTGCCATCGATGTTGCCTTGGGTCAACATTATGCTG
GTTGTTTTACAACCACCGACTCACTTTGGGATCAATTAAATGAATGTGGTAACATTAGTG
GTGAAAGATTATGGAGAATGCCATTGATTCCAGAATATCGTAAACAAATGGAAACCTCAA
AAGTTGCCGATTTAATCAATTCTGCTGGTCGTTCAGGTGGTGCTTGTTGTGCTGCTGGTT
TCCTTAAAGAATTCATTACAGCCGATCAATCTTGGTCTCACCTTGATATTGCTGGTGTTA
TGTCATCATCTGAAGATGGTCCATACATTAGAAAAGGTAGACTGGTAAACCAACTCGTAC
TTTAATAGAATTGCTAAAAAGAATCAACAA
Length of connected seq. 1340
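The lengths stated in this record are consistent with the connected sequence being the 5' and 3' end reads joined after the 10-character N padding is dropped from each: (605 - 10) + (755 - 10) = 1340, with the 10-character gap marker between the reads not counted. This is an observation about the numbers in this record, not a documented procedure. The following sketch checks the arithmetic; connected_length is a hypothetical helper, and the stand-in reads below only mimic the lengths of the real ones.

```python
def connected_length(five_prime, three_prime, pad="N"):
    """Length of two end reads joined without their N padding.

    Hypothetical helper: strips padding only from the 3' side of the
    5' read and the 5' side of the 3' read, so internal N calls
    (such as the one near the start of CHR223F) still count.
    """
    left = five_prime.upper().rstrip(pad)
    right = three_prime.upper().lstrip(pad)
    return len(left) + len(right)


# Stand-in reads with the same lengths and padding as CHR223F and CHR223Z.
five_prime = "A" * 595 + "N" * 10    # stated length 605
three_prime = "N" * 10 + "A" * 745   # stated length 755
print(connected_length(five_prime, three_prime))
```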
Full length Seq ID -
Full length Seq. -
Length of full length seq. -