SHG203
Library SH
(Link to library)
Clone ID SHG203
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U10823-1
Original site URL
Representative seq. ID SHG203P
(Link to Original site)
Representative DNA sequence
>SHG203 (SHG203Q) /CSM/SH/SHG2-A/SHG203Q.Seq.d/
GTGTTAAAAATATAAACTAAAACTAAAAAAAAAATGAAACTTTTGTCTTCATTAATTATT
TTTTTTGTTATTGTATTATTTTGNGTTGTTGGATCATTATCAGCATCATTATGTAAATAT
CCAGGTTATTCAACTCAAGGNGTTACAAAAACAAATAATGGTTATGAGGCAACACTTAAC
CTTATTTCAGCAGGTCCATATGGTAACGATATANAACAATTAAATTTTCAATTAACTTTT
GAAACTAGTCAAATTTTTANAGTTANAATTACTGACCCAAATAATCAAAGATGGGAAGTC
CCACCAACTGTTAATCAATTAGTTGGANAAAATCCANATTCAACTGATTATATAATTGAA
TTTACAAATAATCCATTTGGTTTTGCAGCAACTCGTATTTCAACTGGNGAAGTTTTATTT
AATACAACTCAACCAAGTGATTGTTCATTTAATGGTTTAATTTATTCAAATTATTATTTA
NAATTAAGTACATCATTCACAGAGAGTAATCCAAATATTTATGGTTTAGGXXXXXXXXXX
GCAATCAATGGTAAATTAACTTTGTTACCATTCTATTACACATTGTTCCATATTTCTCAT
GTTTCTGGTGATCCAGTTGTTAGACCATTATTCTTTGAATATCCATCAGATCCAAATACT
TTTGCAATTGATCAACAATTTTTAGTTGGTACAGGTTTAATGGTATCACCAGTTCTCACT
CAAGGTGCTACCACAGTGAATGCTTACTTCCCAAATGATATCTGGTATGAATATGGTAAT
GGTTCATTGGTTCAATCAGTTGGTACCCATCAAACTTTAAATGCTCCATTCGATGTAATC
AACGTTCATATGCGTGGTGGTAATATCATTCCAACTCAACCAACCTCCTCATATGTTACA
CCAGTTGATGGTATTCCAATTACCACTAAAATCTCTAGAACTTTACCATTTGAATTGATT
ATTGCCTTGGATTCTTCATTACAAGCAACTGGTCAATTATTCTTGGATGATGGTGAATCA
ATTCAAACCTATGTTGATAATAAATACTCTTTCATTCAATTCGATGTTGTCTCCTCACCA
TCTTCATCTGCCTACAAATTACAATCAACCATTCTCAATAACAATTATAATGGTACCGCT
TCTTTAATCATTAATTCTATCCAAATCTACGGTTCGCCATCAGTTCAACAAGTCATTGTT
AATGGTAGCCCAATCAATTCATTTAATGCTGTTTCNGATTCAACTCTCTCTGTTTCAAAT
TTACAACTGCTTTAGATGAATCCTTGAAGTGATTT
sequence update 2002.10.25
Translated Amino Acid sequence
vlki*TKTKKKMKLLSSLIIFFVIVLFXVVGSLSASLCKYPGYSTQGVTKTNNGYEATLN
LISAGPYGNDIXQLNFQLTFETSQIFXVXITDPNNQRWEVPPTVNQLVGXNPXSTDYIIE
FTNNPFGFAATRISTGEVLFNTTQPSDCSFNGLIYSNYYLXLSTSFTESNPNIYGL---

---AINGKLTLLPFYYTLFHISHVSGDPVVRPLFFEYPSDPNTFAIDQQFLVGTGLMVSP
VLTQGATTVNAYFPNDIWYEYGNGSLVQSVGTHQTLNAPFDVINVHMRGGNIIPTQPTSS
YVTPVDGIPITTKISRTLPFELIIALDSSLQATGQLFLDDGESIQTYVDNKYSFIQFDVV
SSPSSSAYKLQSTILNNNYNGTASLIINSIQIYGSPSVQQVIVNGSPINSFNAVSDSTLS
VSNLQLL*mnp*sd


Translated Amino Acid sequence (All Frames)
Frame A:
vlki*TKTKKKMKLLSSLIIFFVIVLFXVVGSLSASLCKYPGYSTQGVTKTNNGYEATLN
LISAGPYGNDIXQLNFQLTFETSQIFXVXITDPNNQRWEVPPTVNQLVGXNPXSTDYIIE
FTNNPFGFAATRISTGEVLFNTTQPSDCSFNGLIYSNYYLXLSTSFTESNPNIYGL---

---AINGKLTLLPFYYTLFHISHVSGDPVVRPLFFEYPSDPNTFAIDQQFLVGTGLMVSP
VLTQGATTVNAYFPNDIWYEYGNGSLVQSVGTHQTLNAPFDVINVHMRGGNIIPTQPTSS
YVTPVDGIPITTKISRTLPFELIIALDSSLQATGQLFLDDGESIQTYVDNKYSFIQFDVV
SSPSSSAYKLQSTILNNNYNGTASLIINSIQIYGSPSVQQVIVNGSPINSFNAVSDSTLS
VSNLQLL*mnp*sd

Frame B:
c*kyklklkkk*nfclh*lfflllyyfxlldhyqhhyvniqviqlkxlqkqimvmrqhlt
lfqqvhmvtixnn*ifn*llklvkflxlxlltqiikdgkshqllin*lxkixiqlii*ln
lqiihlvlqqlvfqlxkfyliqlnqvivhlmv*fiqiiixn*vhhsqrviqifmv*---

---qsmvn*lcyhsithcsiflmflviqlldhyslnihqiqillqlinnf*lvqv*wyhq
fslkvlpq*mltsqmisgmnmvmvhwfnqlvpikl*mlhsm*stficvvvisfqlnqpph
mlhqlmvfqlplkslelyhln*llpwilhykqlvnyswmmvnqfkpmliintlsfnsmls
phhlhlptnynqpfsitiimvpll*slilskstvrhqfnksllmvaqsihlmlfxiqlsl
fqiyncfr*ilevi

Frame C:
vknin*n*kknetfvfinyffcyciilxcwiiisiim*isrlfnsrxyknk*wl*gnt*p
yfsrsiw*ryxtikfsinf*n*snfxsxny*pk*skmgsptnc*siswxksxfn*lyn*i
yk*siwfcsnsyfnwxsfi*ynstk*lfi*wfnlfkllfxikyiihre*skylwfr---

---nqw*infvtillhivpyfscfw*ssc*tiil*isirskyfcn*stifswyrfngits
shsrcyhsecllpk*ylv*iw*wfigsiswypsnfkcsircnqrsyaww*yhsnstnlli
cyts*wysnyh*nl*nfti*idyclgffitsnwsiilg*w*insnlc***ilfhsirccl
ltificlqitinhsq*ql*wyrffnh*fypnlrfaisstshc*w*pnqfi*ccfxfnslc
fkfttaldeslk*f

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

SHG203 (SHG203Q) /CSM/SH/SHG2-A/SHG203Q.Seq.d/ 2341 0.0
AHD417 (AHD417Q) /CSM/AH/AHD4-A/AHD417Q.Seq.d/ 1439 0.0
CHR138 (CHR138Q) /CSM/CH/CHR1-B/CHR138Q.Seq.d/ 1384 0.0
AHR217 (AHR217Q) /CSM/AH/AHR2-A/AHR217Q.Seq.d/ 1374 0.0
AHM502 (AHM502Q) /CSM/AH/AHM5-A/AHM502Q.Seq.d/ 1366 0.0
CHL607 (CHL607Q) /CSM/CH/CHL6-A/CHL607Q.Seq.d/ 1360 0.0
CHS485 (CHS485Q) /CSM/CH/CHS4-D/CHS485Q.Seq.d/ 1348 0.0
SHJ811 (SHJ811Q) /CSM/SH/SHJ8-A/SHJ811Q.Seq.d/ 1330 0.0
AHN661 (AHN661Q) /CSM/AH/AHN6-C/AHN661Q.Seq.d/ 1328 0.0
CHR858 (CHR858Q) /CSM/CH/CHR8-C/CHR858Q.Seq.d/ 1326 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

AC115599|AC115599.2 Dictyostelium discoideum chromosome 2 map 4229098-4354721 strain AX4, complete sequence. 34 0.31 10
AC116330|AC116330.2 Dictyostelium discoideum chromosome 2 map 3191214-3323468 strain AX4, complete sequence. 40 0.44 10
Z70754|Z70754.2 Caenorhabditis elegans Cosmid F58E6. 42 1.4 3
AC116032|AC116032.2 Dictyostelium discoideum chromosome 2 map 4158743-4189373 strain AX4, complete sequence. 36 2.5 4
AC177362|AC177362.1 Strongylocentrotus purpuratus clone R3-52G09, WORKING DRAFT SEQUENCE, 22 unordered pieces. 40 3.3 8
AL449924|AL449924.1 Streptococcus pneumoniae serotype 19F *** SEQUENCING IN PROGRESS *** from clone G54. 36 3.6 4
AE014822|AE014822.1 Plasmodium falciparum 3D7 chromosome 14 section 7 of 13 of the complete sequence. 32 3.6 2
AZ690043|AZ690043.1 ENTMH81TF Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 36 5.2 3
DY772000|DY772000.1 5TH_new03_P06 Bicyclus anynana wings - 5TH instar larvae Bicyclus anynana cDNA 3', mRNA sequence. 36 5.5 2
DY771880|DY771880.1 5TH_new03_J24 Bicyclus anynana wings - 5TH instar larvae Bicyclus anynana cDNA 3', mRNA sequence. 36 5.5 2
dna update 2006. 3.27
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

AB000967_1(AB000967|pid:none) Coturnix japonica GAAI mRNA for ac... 164 6e-39
AF118226_1(AF118226|pid:none) Hordeum vulgare high pI alpha-gluc... 163 1e-38
(Q43763) RecName: Full=Alpha-glucosidase; EC=3.2.1.20; ... 159 1e-37
AB006754_1(AB006754|pid:none) Coturnix japonica GAAII mRNA for a... 154 5e-36
AF016833_1(AF016833|pid:none) Homo sapiens maltase-glucoamylase ... 152 3e-35
(O43451) RecName: Full=Maltase-glucoamylase, intestinal; Include... 150 1e-34
EU937530_1(EU937530|pid:none) Mus musculus sucrase-isomaltase mR... 149 2e-34
AM430903_1(AM430903|pid:none) Vitis vinifera contig VV78X052784.... 149 3e-34
(P70699) RecName: Full=Lysosomal alpha-glucosidase; EC=... 148 3e-34
BC010210_1(BC010210|pid:none) Mus musculus glucosidase, alpha, a... 148 3e-34
protein update 2009. 4.18
PSORT

psg: 1.02 gvh: 0.72 alm: 0.40 top: 0.27 tms: 0.00 mit: 0.33 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

64.0 %: extracellular, including cell wall
12.0 %: cytoplasmic
8.0 %: vacuolar
8.0 %: endoplasmic reticulum
4.0 %: mitochondrial
4.0 %: nuclear

>> prediction for SHG203 is exc

5' end seq. ID SHG203F
5' end seq.
>SHG203F.Seq
GTGTTAAAAATATAAACTAAAACTAAAAAAAAAATGAAACTTTTGTCTTCATTAATTATT
TTTTTTGTTATTGTATTATTTTGNGTTGTTGGATCATTATCAGCATCATTATGTAAATAT
CCAGGTTATTCAACTCAAGGNGTTACAAAAACAAATAATGGTTATGAGGCAACACTTAAC
CTTATTTCAGCAGGTCCATATGGTAACGATATANAACAATTAAATTTTCAATTAACTTTT
GAAACTAGTCAAATTTTTANAGTTANAATTACTGACCCAAATAATCAAAGATGGGAAGTC
CCACCAACTGTTAATCAATTAGTTGGANAAAATCCANATTCAACTGATTATATAATTGAA
TTTACAAATAATCCATTTGGTTTTGCAGCAACTCGTATTTCAACTGGNGAAGTTTTATTT
AATACAACTCAACCAAGTGATTGTTCATTTAATGGTTTAATTTATTCAAATTATTATTTA
NAATTAAGTACATCATTCACAGAGAGTAATCCAAATATTTATGGTTTAGGNNNNNNNNNN
Length of 5' end seq. 540
3' end seq. ID SHG203Z
3' end seq.
>SHG203Z.Seq
NNNNNNNNNNGCAATCAATGGTAAATTAACTTTGTTACCATTCTATTACACATTGTTCCA
TATTTCTCATGTTTCTGGTGATCCAGTTGTTAGACCATTATTCTTTGAATATCCATCAGA
TCCAAATACTTTTGCAATTGATCAACAATTTTTAGTTGGTACAGGTTTAATGGTATCACC
AGTTCTCACTCAAGGTGCTACCACAGTGAATGCTTACTTCCCAAATGATATCTGGTATGA
ATATGGTAATGGTTCATTGGTTCAATCAGTTGGTACCCATCAAACTTTAAATGCTCCATT
CGATGTAATCAACGTTCATATGCGTGGTGGTAATATCATTCCAACTCAACCAACCTCCTC
ATATGTTACACCAGTTGATGGTATTCCAATTACCACTAAAATCTCTAGAACTTTACCATT
TGAATTGATTATTGCCTTGGATTCTTCATTACAAGCAACTGGTCAATTATTCTTGGATGA
TGGTGAATCAATTCAAACCTATGTTGATAATAAATACTCTTTCATTCAATTCGATGTTGT
CTCCTCACCATCTTCATCTGCCTACAAATTACAATCAACCATTCTCAATAACAATTATAA
TGGTACCGCTTCTTTAATCATTAATTCTATCCAAATCTACGGTTCGCCATCAGTTCAACA
AGTCATTGTTAATGGTAGCCCAATCAATTCATTTAATGCTGTTTCNGATTCAACTCTCTC
TGTTTCAAATTTACAACTGCTTTAGATGAATCCTTGAAGTGATTT
Length of 3' end seq. 765
Connected seq. ID SHG203P
Connected seq.
>SHG203P.Seq
GTGTTAAAAATATAAACTAAAACTAAAAAAAAAATGAAACTTTTGTCTTCATTAATTATT
TTTTTTGTTATTGTATTATTTTGNGTTGTTGGATCATTATCAGCATCATTATGTAAATAT
CCAGGTTATTCAACTCAAGGNGTTACAAAAACAAATAATGGTTATGAGGCAACACTTAAC
CTTATTTCAGCAGGTCCATATGGTAACGATATANAACAATTAAATTTTCAATTAACTTTT
GAAACTAGTCAAATTTTTANAGTTANAATTACTGACCCAAATAATCAAAGATGGGAAGTC
CCACCAACTGTTAATCAATTAGTTGGANAAAATCCANATTCAACTGATTATATAATTGAA
TTTACAAATAATCCATTTGGTTTTGCAGCAACTCGTATTTCAACTGGNGAAGTTTTATTT
AATACAACTCAACCAAGTGATTGTTCATTTAATGGTTTAATTTATTCAAATTATTATTTA
NAATTAAGTACATCATTCACAGAGAGTAATCCAAATATTTATGGTTTAGG----------
GCAATCAATGGTAAATTAACTTTGTTACCATTCTATTACACATTGTTCCATATTTCTCAT
GTTTCTGGTGATCCAGTTGTTAGACCATTATTCTTTGAATATCCATCAGATCCAAATACT
TTTGCAATTGATCAACAATTTTTAGTTGGTACAGGTTTAATGGTATCACCAGTTCTCACT
CAAGGTGCTACCACAGTGAATGCTTACTTCCCAAATGATATCTGGTATGAATATGGTAAT
GGTTCATTGGTTCAATCAGTTGGTACCCATCAAACTTTAAATGCTCCATTCGATGTAATC
AACGTTCATATGCGTGGTGGTAATATCATTCCAACTCAACCAACCTCCTCATATGTTACA
CCAGTTGATGGTATTCCAATTACCACTAAAATCTCTAGAACTTTACCATTTGAATTGATT
ATTGCCTTGGATTCTTCATTACAAGCAACTGGTCAATTATTCTTGGATGATGGTGAATCA
ATTCAAACCTATGTTGATAATAAATACTCTTTCATTCAATTCGATGTTGTCTCCTCACCA
TCTTCATCTGCCTACAAATTACAATCAACCATTCTCAATAACAATTATAATGGTACCGCT
TCTTTAATCATTAATTCTATCCAAATCTACGGTTCGCCATCAGTTCAACAAGTCATTGTT
AATGGTAGCCCAATCAATTCATTTAATGCTGTTTCNGATTCAACTCTCTCTGTTTCAAAT
TTACAACTGCTTTAGATGAATCCTTGAAGTGATTT
Length of connected seq. 1285
Full length Seq ID -
Full length Seq. -
Length of full length seq. -