VHB348
Library VH
(Link to library)
Clone ID VHB348
Atlas ID -
NBRP ID
dictyBase ID
Link to Contig Contig-U10996-1
Original site URL
Representative seq. ID VHB348P
(Link to Original site)
Representative DNA sequence
>VHB348 (VHB348Q) /CSM/VH/VHB3-B/VHB348Q.Seq.d/
TGGCCTACTGGTAAAAAAAATTCTAATTTTATTAAAACCCTTTTATTTGTTTAATAAACA
TAATTTAAATAAACTTCAATAATGTCAGATAATGAACAAGAAGAATCATCACAAGTAGTA
TTAAAGGAGGATAAAACCTTTGTTACATTTTTTCAAAGTTTAGTATCCTCTAATGAAGAT
ACAGACACAATTAGATTATTTGATAGAAAAGGATATTACTCAATTCATGGTGAAGATGCA
GTATTTGTAGCAATGATGCATTTTAAATCAAAGAAATCATTAAAATATTGGAGTATTAGT
GATCCAAATCCAAAAAAAGAAAATTAAAATTGATAATGATGGTTCATTAACAACAACTGC
ATCATCATCCCAACAACAACAACAAGAATTAGGATTAGCGGTATTAACAATTAGACAAGG
TTATGAATTTGAAAATATAGTTAAAGAATTATTAGATGAAAAGAAAAAGATTGAAATTTG
GTCAATGAAACCAAATAGTAAACAACAATGGGAACTAATTAAAAAAGGCTCACCAGGTAA
TACACAAATGTTTGAAGATGTTTTATTGAATGGTAATTGTGAAGGATCAGTTATGATGGC
ATTAAAXXXXXXXXXXAGAAATTCAAGATAATGTTAATTTCATTGCAAATGATATTGATT
TAACTCGTGGTCAATCCCAATTTCAAATTATAACAGGACCAAATATGGGTGGTAAATCAA
CATTTATTCGTCAAGTTGGATTAATAGTATTAATGGCACAAATTGGTTGTTTTGTACCAG
CACAAAAAGCAACAATTGCAGTTGTCGATTGTATTTTATCAAGAGTTGGTGCAGGTGATA
GTCAATTACGTGGTGTTTCAACATTTATGGCAGAAATGTTAGAGACATCTTACATTTTAA
AGGTTGCAACTAAAAATTCTTTAATCATTATTGATGAACTTGGTAGAGGTACTTCAACAT
ATGATGGTTTTGGTTTAGCTTGGGGTATTGCAGAGTATATTTGTAATCAAATTGGTGGTT
TCTGTCTATTTGCAACTCATTTCCATGAATTGACAATTCTATCAGATTTACTTCCAATGG
TTAAAAATTTACATGTTTCAGCTTCAACCCAAAACAATACTTTTACTTTACTCTATAAAG
TTGAACAAGGTCCTTGTGATCAAAGTTTTGGTATTCATGTTGCAATTTTAGCAAATTTCC
CTTCACAAGTTATTGAAAATGCAAAACAAAAAGCAAAAGAATTGGAATCTTTTGAATCAA
ATACACTTAAACAAAATCATAATAAATTTTTGGAAGAATTTAAAGAAATTAATTTCAATT
CAAATGATGTAGAAAAATCATTAAGTTTAGTTAATAGTTTATTAAATAAATATTCAATAG
ATATCAATTAATAAAATANAAATTTCTACTAAAAA
sequence update 2002.10.25
Translated Amino Acid sequence
gllvkkilillkpfylfnkhnlnklq*cqimnkknhhk*y*rrikpllhffkv*yplmki
qtqldyliekditqfmvkmqyl*q*cilnqrnh*NIGVLVIQIQKKKIKIDNDGSLTTTA
SSSQQQQQELGLAVLTIRQGYEFENIVKELLDEKKKIEIWSMKPNSKQQWELIKKGSPGN
TQMFEDVLLNGNCEGSVMMAL---

---EIQDNVNFIANDIDLTRGQSQFQIITGPNMGGKSTFIRQVGLIVLMAQIGCFVPAQK
ATIAVVDCILSRVGAGDSQLRGVSTFMAEMLETSYILKVATKNSLIIIDELGRGTSTYDG
FGLAWGIAEYICNQIGGFCLFATHFHELTILSDLLPMVKNLHVSASTQNNTFTLLYKVEQ
GPCDQSFGIHVAILANFPSQVIENAKQKAKELESFESNTLKQNHNKFLEEFKEINFNSND
VEKSLSLVNSLLNKYSIDIN**nxnfy*k


Translated Amino Acid sequence (All Frames)
Frame A:
wptgkknsnfiktllfv**t*fk*tsimsdneqeessqvvlkedktfvtffqslvssned
tdtirlfdrkgyysihgedavfvammhfkskkslkywsisdpnpkken*n***wfinnnc
iiipttttririsginn*trl*i*kys*riir*kekd*nlvnetk**ttmgtn*krltr*
ytnv*rcfiew*l*risydgik---

---rnsr*c*fhck*y*fnswsipisnynrtkygw*iniyssswinsingtnwlfctstk
snncscrlyfikswcr**sitwcfniygrnvrdilhfkgcn*kffnhy**tw*ryfni*w
fwfslgycrvyl*snwwflsicnsfp*idnsirftsng*kftcfsfnpkqyfyftl*s*t
rsl*skfwysccnfskfpftsy*kcktkskrigif*ikyt*tks**ifgri*rn*fqfk*
crkiikfs**fik*ifnryqlikxkfllk

Frame B:
gllvkkilillkpfylfnkhnlnklq*cqimnkknhhk*y*rrikpllhffkv*yplmki
qtqldyliekditqfmvkmqyl*q*cilnqrnh*NIGVLVIQIQKKKIKIDNDGSLTTTA
SSSQQQQQELGLAVLTIRQGYEFENIVKELLDEKKKIEIWSMKPNSKQQWELIKKGSPGN
TQMFEDVLLNGNCEGSVMMAL---

---EIQDNVNFIANDIDLTRGQSQFQIITGPNMGGKSTFIRQVGLIVLMAQIGCFVPAQK
ATIAVVDCILSRVGAGDSQLRGVSTFMAEMLETSYILKVATKNSLIIIDELGRGTSTYDG
FGLAWGIAEYICNQIGGFCLFATHFHELTILSDLLPMVKNLHVSASTQNNTFTLLYKVEQ
GPCDQSFGIHVAILANFPSQVIENAKQKAKELESFESNTLKQNHNKFLEEFKEINFNSND
VEKSLSLVNSLLNKYSIDIN**nxnfy*k

Frame C:
ayw*kkf*fy*npficlinii*infnnvr**trriitssikgg*nlcyifskfsil**ry
rhn*ii**krillnsw*rcsicsndaf*ikeiikiley**skskkrklklimmvh*qqlh
hhpnnnnkn*d*ry*qldkvmnlki*lkny*mkrkrlkfgq*nqivnnngn*lkkahqvi
hkclkmfy*mvivkdql*wh*---

---kfkimlislqmili*lvvnpnfkl*qdqiwvvnqhlfvkld**y*whklvvlyqhkk
qqlqlsivfyqelvqvivnyvvfqhlwqkc*rhltf*rlqlkil*sllmnlvevlqhmmv
lv*lgvlqsifviklvvsvylqlismn*qfyqiyfqwlkiymfqlqpktilllysiklnk
vlvikvlvfmlqf*qislhkllkmqnkkqknwnllnqihlnkiiinfwknlkklisiqmm
*knh*v*livy*iniq*isinkixistk

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VHB348 (VHB348Q) /CSM/VH/VHB3-B/VHB348Q.Seq.d/ 2658 0.0
CHG289 (CHG289Q) /CSM/CH/CHG2-D/CHG289Q.Seq.d/ 1570 0.0
AHL270 (AHL270Q) /CSM/AH/AHL2-C/AHL270Q.Seq.d/ 1550 0.0
CHR180 (CHR180Q) /CSM/CH/CHR1-D/CHR180Q.Seq.d/ 1536 0.0
CHQ653 (CHQ653Q) /CSM/CH/CHQ6-C/CHQ653Q.Seq.d/ 1534 0.0
AHF711 (AHF711Q) /CSM/AH/AHF7-A/AHF711Q.Seq.d/ 1534 0.0
AHF319 (AHF319Q) /CSM/AH/AHF3-A/AHF319Q.Seq.d/ 1534 0.0
AHC623 (AHC623Q) /CSM/AH/AHC6-A/AHC623Q.Seq.d/ 1534 0.0
AHE763 (AHE763Q) /CSM/AH/AHE7-C/AHE763Q.Seq.d/ 1522 0.0
AHD346 (AHD346Q) /CSM/AH/AHD3-B/AHD346Q.Seq.d/ 1522 0.0

own update 2004.12.24
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

AC116960|AC116960.2 Dictyostelium discoideum chromosome 2 map complement(1004496-821614) strain AX4, complete sequence. 1126 0.0 1
CA302967|CA302967.1 taa01d08.y1 Hydra cDNA library Hydra magnipapillata cDNA 5' similar to SW:MSH2_HUMAN P43246 DNA MISMATCH REPAIR PROTEIN MSH2. [1] ;, mRNA sequence. 76 9e-25 5
CX054860|CX054860.1 taj02h05.y2 Hydra EST UCI 5 ALP Hydra magnipapillata cDNA 5' similar to SW:MSH2_MOUSE P43247 DNA MISMATCH REPAIR PROTEIN MSH2. ;, mRNA sequence. 76 4e-21 3
BY949890|BY949890.1 Physcomitrella patens subsp. patens cDNA clone: PPLS029B01, 5'end. 72 1e-20 4
BH165037|BH165037.1 ENTTE68TR Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 64 3e-20 6
CS480546|CS480546.1 Sequence 1 from Patent WO2006134496. 72 7e-20 4
CS458932|CS458932.1 Sequence 1 from Patent EP1734125. 72 7e-20 4
BH136983|BH136983.1 ENTNM86TF Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 64 4e-19 6
BY949900|BY949900.1 Physcomitrella patens subsp. patens cDNA clone: PPLS029B11, 5'end. 72 8e-18 4
DR914248|DR914248.1 EST1105787 Aquilegia cDNA library Aquilegia formosa x Aquilegia pubescens cDNA clone CO1LC44, mRNA sequence. 70 2e-17 3
dna update 2007. 4. 3
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

AC116960_5(AC116960|pid:none) Dictyostelium discoideum chromosom... 478 e-133
BC161846_1(BC161846|pid:none) Rattus norvegicus mutS homolog 2 (... 313 7e-84
(P54275) RecName: Full=DNA mismatch repair protein Msh2; AltName... 310 6e-83
AB179432_1(AB179432|pid:none) Macaca fascicularis testis cDNA cl... 310 1e-82
(Q5XXB5) RecName: Full=DNA mismatch repair protein Msh2; AltName... 310 1e-82
AK222860_1(AK222860|pid:none) Homo sapiens mRNA for mutS homolog... 308 2e-82
CR861269_1(CR861269|pid:none) Pongo abelii mRNA; cDNA DKFZp459K0... 308 2e-82
(P43246) RecName: Full=DNA mismatch repair protein Msh2; AltName... 308 2e-82
AK297763_1(AK297763|pid:none) Homo sapiens cDNA FLJ50998 complet... 308 2e-82
AK296831_1(AK296831|pid:none) Homo sapiens cDNA FLJ57316 complet... 308 2e-82
protein update 2009. 6.27
PSORT

psg: 0.31 gvh: 0.48 alm: 0.32 top: 0.53 tms: 0.00 mit: 0.23 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.12 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

44.0 %: nuclear
40.0 %: cytoplasmic
8.0 %: cytoskeletal
4.0 %: vacuolar
4.0 %: mitochondrial

>> prediction for VHB348 is nuc

5' end seq. ID VHB348F
5' end seq.
>VHB348F.Seq
TGGCCTACTGGTAAAAAAAATTCTAATTTTATTAAAACCCTTTTATTTGTTTAATAAACA
TAATTTAAATAAACTTCAATAATGTCAGATAATGAACAAGAAGAATCATCACAAGTAGTA
TTAAAGGAGGATAAAACCTTTGTTACATTTTTTCAAAGTTTAGTATCCTCTAATGAAGAT
ACAGACACAATTAGATTATTTGATAGAAAAGGATATTACTCAATTCATGGTGAAGATGCA
GTATTTGTAGCAATGATGCATTTTAAATCAAAGAAATCATTAAAATATTGGAGTATTAGT
GATCCAAATCCAAAAAAAGAAAATTAAAATTGATAATGATGGTTCATTAACAACAACTGC
ATCATCATCCCAACAACAACAACAAGAATTAGGATTAGCGGTATTAACAATTAGACAAGG
TTATGAATTTGAAAATATAGTTAAAGAATTATTAGATGAAAAGAAAAAGATTGAAATTTG
GTCAATGAAACCAAATAGTAAACAACAATGGGAACTAATTAAAAAAGGCTCACCAGGTAA
TACACAAATGTTTGAAGATGTTTTATTGAATGGTAATTGTGAAGGATCAGTTATGATGGC
ATTAAANNNNNNNNNN
Length of 5' end seq. 616
3' end seq. ID VHB348Z
3' end seq.
>VHB348Z.Seq
NNNNNNNNNNAGAAATTCAAGATAATGTTAATTTCATTGCAAATGATATTGATTTAACTC
GTGGTCAATCCCAATTTCAAATTATAACAGGACCAAATATGGGTGGTAAATCAACATTTA
TTCGTCAAGTTGGATTAATAGTATTAATGGCACAAATTGGTTGTTTTGTACCAGCACAAA
AAGCAACAATTGCAGTTGTCGATTGTATTTTATCAAGAGTTGGTGCAGGTGATAGTCAAT
TACGTGGTGTTTCAACATTTATGGCAGAAATGTTAGAGACATCTTACATTTTAAAGGTTG
CAACTAAAAATTCTTTAATCATTATTGATGAACTTGGTAGAGGTACTTCAACATATGATG
GTTTTGGTTTAGCTTGGGGTATTGCAGAGTATATTTGTAATCAAATTGGTGGTTTCTGTC
TATTTGCAACTCATTTCCATGAATTGACAATTCTATCAGATTTACTTCCAATGGTTAAAA
ATTTACATGTTTCAGCTTCAACCCAAAACAATACTTTTACTTTACTCTATAAAGTTGAAC
AAGGTCCTTGTGATCAAAGTTTTGGTATTCATGTTGCAATTTTAGCAAATTTCCCTTCAC
AAGTTATTGAAAATGCAAAACAAAAAGCAAAAGAATTGGAATCTTTTGAATCAAATACAC
TTAAACAAAATCATAATAAATTTTTGGAAGAATTTAAAGAAATTAATTTCAATTCAAATG
ATGTAGAAAAATCATTAAGTTTAGTTAATAGTTTATTAAATAAATATTCAATAGATATCA
ATTAATAAAATANAAATTTCTACTAAAAA
Length of 3' end seq. 809
Connected seq. ID VHB348P
Connected seq.
>VHB348P.Seq
TGGCCTACTGGTAAAAAAAATTCTAATTTTATTAAAACCCTTTTATTTGTTTAATAAACA
TAATTTAAATAAACTTCAATAATGTCAGATAATGAACAAGAAGAATCATCACAAGTAGTA
TTAAAGGAGGATAAAACCTTTGTTACATTTTTTCAAAGTTTAGTATCCTCTAATGAAGAT
ACAGACACAATTAGATTATTTGATAGAAAAGGATATTACTCAATTCATGGTGAAGATGCA
GTATTTGTAGCAATGATGCATTTTAAATCAAAGAAATCATTAAAATATTGGAGTATTAGT
GATCCAAATCCAAAAAAAGAAAATTAAAATTGATAATGATGGTTCATTAACAACAACTGC
ATCATCATCCCAACAACAACAACAAGAATTAGGATTAGCGGTATTAACAATTAGACAAGG
TTATGAATTTGAAAATATAGTTAAAGAATTATTAGATGAAAAGAAAAAGATTGAAATTTG
GTCAATGAAACCAAATAGTAAACAACAATGGGAACTAATTAAAAAAGGCTCACCAGGTAA
TACACAAATGTTTGAAGATGTTTTATTGAATGGTAATTGTGAAGGATCAGTTATGATGGC
ATTAAA----------AGAAATTCAAGATAATGTTAATTTCATTGCAAATGATATTGATT
TAACTCGTGGTCAATCCCAATTTCAAATTATAACAGGACCAAATATGGGTGGTAAATCAA
CATTTATTCGTCAAGTTGGATTAATAGTATTAATGGCACAAATTGGTTGTTTTGTACCAG
CACAAAAAGCAACAATTGCAGTTGTCGATTGTATTTTATCAAGAGTTGGTGCAGGTGATA
GTCAATTACGTGGTGTTTCAACATTTATGGCAGAAATGTTAGAGACATCTTACATTTTAA
AGGTTGCAACTAAAAATTCTTTAATCATTATTGATGAACTTGGTAGAGGTACTTCAACAT
ATGATGGTTTTGGTTTAGCTTGGGGTATTGCAGAGTATATTTGTAATCAAATTGGTGGTT
TCTGTCTATTTGCAACTCATTTCCATGAATTGACAATTCTATCAGATTTACTTCCAATGG
TTAAAAATTTACATGTTTCAGCTTCAACCCAAAACAATACTTTTACTTTACTCTATAAAG
TTGAACAAGGTCCTTGTGATCAAAGTTTTGGTATTCATGTTGCAATTTTAGCAAATTTCC
CTTCACAAGTTATTGAAAATGCAAAACAAAAAGCAAAAGAATTGGAATCTTTTGAATCAA
ATACACTTAAACAAAATCATAATAAATTTTTGGAAGAATTTAAAGAAATTAATTTCAATT
CAAATGATGTAGAAAAATCATTAAGTTTAGTTAATAGTTTATTAAATAAATATTCAATAG
ATATCAATTAATAAAATANAAATTTCTACTAAAAA
Length of connected seq. 1405
Full length Seq ID -
Full length Seq. -
Length of full length seq. -