VHA801
Library VH
(Link to library)
Clone ID VHA801
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U15296-1|Contig-U15756-1
Original site URL
Representative seq. ID VHA801P
(Link to Original site)
Representative DNA sequence
>VHA801 (VHA801Q) /CSM/VH/VHA8-A/VHA801Q.Seq.d/
AAGTGGTATGAGTCATGATGAAGATCAATTAATTCCAAATCTTTATCGTTACATTCAACC
ATGGGAACAAGAAATTAAAGATAGTCAACGTGTTTGGGCAGAATATGCAATCAAATATGA
AGAGGCTAAATCACAAAATAAGAATCTTACATTGGAGGATCTTGAAGATAGTTGGGATCG
TGGATACCACGTATCAATACATTATTCCAAAAGAGTCGTCATACTTTAGCTTATGATAAA
GGTTGGAGAGTAAGAACCGATTGGAAACAATATCAAGTCCTAAAGAATAATCCATTCTGG
TGGACTAATCAACGTCATGATGGTAAACTTTGGAATCTTAATAATTATCGTACAGATATC
ATTCAAGCATTGGGTGGTGTTGAAGGTATTCTAGAGCATACTCTATTCAAAGGTACTTAT
TTCCCAACTTGGGAAGGTTTATTTTGGGAGAAAGCATCAGGTTTCGAAGAGTCTATGAAA
TATAAAAAACTTACACACGCTCAACGTTCAGGTCTTAATCAAATTCCAAATCGTCGTTTC
ACACTTTGGTGGTCTCCAACTATCAATCGTAAGAATGTTTATGTTXXXXXXXXXXGCTAA
AGAAATTGGTGGTTTCACTTATGTTTTCCCAAAGAATATCTTAAAGAAATTCATTACCAT
TGCCGATCTTCGTACTCAAATCATGGGTTATTGTTATGGTATCTCACCACCCGATAATCC
ATCAGTAAAAGAAATTCGTTGTATTGTAATGCCACCACAATGGGGTACACCAGTTCATGT
AACCGTACCAAATCAATTACCAGAACATGAATATCTCAAAGATTTAGAACCATTGGGTTG
GATTCACACTCAACCAACTGAATTACCACAACTATCACCACAAGATGTTATCACTCATTC
AAAGATTATGTCTGACAATAAGAGTTGGGATGGTGAGAAAACTGTTATCATCTCTGTCTC
TGTGGCTTGGCCTTGTACTCTAACTGCTTACCATTTGACTCCAAGTGGTTTTGAATGGGG
TAAAAATAATAAAGATTCTCTCAATTATCAAGGTTATCAACCACAATTCTATGAGAAGGT
TCAAATGTTATTATCCGATCGTTTCCTTGGTTTCTATATGGTACCTGATCGTGGTTCTTG
GAATTATAATTTCATGGGTGTTAAACATTCCACAAATATGACCTATGGTTTGAAATTAGA
TTATCCTAAAAACTTTTATGATGAATCTCATAGACCTGCTCATTTCCAAAATTGGACTCA
AATGGCTCCTTCAGCAAATGATGATGAAGAAAATCAACCNAAAANAAAATTTATTTNAAT
AATTATAAAAATAATAATTTTCTTTTAAAAACATTTAATAAATAAATAAAAAAA
sequence update 2002.10.25
Translated Amino Acid sequence
kwyes**rsinskslslhstmgtrn*r*stclgricnqi*rg*itk*esyiggs*r*LGS
WIPRINTLFQKSRHTLAYDKGWRVRTDWKQYQVLKNNPFWWTNQRHDGKLWNLNNYRTDI
IQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESMKYKKLTHAQRSGLNQIPNRRF
TLWWSPTINRKNVYV---

---AKEIGGFTYVFPKNILKKFITIADLRTQIMGYCYGISPPDNPSVKEIRCIVMPPQWG
TPVHVTVPNQLPEHEYLKDLEPLGWIHTQPTELPQLSPQDVITHSKIMSDNKSWDGEKTV
IISVSVAWPCTLTAYHLTPSGFEWGKNNKDSLNYQGYQPQFYEKVQMLLSDRFLGFYMVP
DRGSWNYNFMGVKHSTNMTYGLKLDYPKNFYDESHRPAHFQNWTQMAPSANDDEENQPKX
KFIXIIIKIIIFF*khlink*k


Translated Amino Acid sequence (All Frames)
Frame A:
kwyes**rsinskslslhstmgtrn*r*stclgricnqi*rg*itk*esyiggs*r*LGS
WIPRINTLFQKSRHTLAYDKGWRVRTDWKQYQVLKNNPFWWTNQRHDGKLWNLNNYRTDI
IQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESMKYKKLTHAQRSGLNQIPNRRF
TLWWSPTINRKNVYV---

---AKEIGGFTYVFPKNILKKFITIADLRTQIMGYCYGISPPDNPSVKEIRCIVMPPQWG
TPVHVTVPNQLPEHEYLKDLEPLGWIHTQPTELPQLSPQDVITHSKIMSDNKSWDGEKTV
IISVSVAWPCTLTAYHLTPSGFEWGKNNKDSLNYQGYQPQFYEKVQMLLSDRFLGFYMVP
DRGSWNYNFMGVKHSTNMTYGLKLDYPKNFYDESHRPAHFQNWTQMAPSANDDEENQPKX
KFIXIIIKIIIFF*khlink*k

Frame B:
sgmshdedqlipnlyryiqpweqeikdsqrvwaeyaikyeeaksqnknltledledswdr
gyhvsihyskrvvil*lmikvge*epignniks*riihsgglinvmmvnfgiliiivqis
fkhwvvlkvf*silyskvlisqlgkvyfgrkhqvsksl*niknlhtlnvqvlikfqivvs
hfgglqlsivrmfm---

---lkklvvslmfsqris*rnslplpifvlkswvivmvshhpiihq*kkfvvl*chhngv
hqfm*pyqinyqnmniski*nhwvgftlnqlnyhnyhhkmlsliqrlcltirvgmvrkll
sslslwlglvl*llti*lqvvlngvkiikilsiikvinhnsmrrfkcyypivslvsiwyl
ivvlgiiiswvlnipqi*pmv*n*iilktfmmnlidlliskiglkwllqqmmmkkinxkx
nlfx*l*k**fsfkni**inkk

Frame C:
vv*vmmkin*fqifivtfnhgnkklkivnvfgqnmqsnmkrlnhkirilhwrilkivgiv
dttyqyiipkessyfsl**rlesknrletisspke*silvd*sts*w*tles**lsyryh
ssigwc*rysraysiqrylfpnlgrfilgesirfrrvyei*ktytrstfrs*snskssfh
tlvvsnyqs*eclc---

---*rnwwfhlcfpkeylkeihyhcrssysnhglllwylttr*siskrnslycnattmgy
tsscnrtksitrt*isqrfrtigldshstn*itttittrcyhsfkdyv*q*elgw*ency
hlclcglalysnclpfdskwf*mg*k**rfsqlsrlsttil*egsnviirsfpwflygt*
swflel*fhgc*tfhkydlwfeirls*kll**is*tcsfpkldsngsfsk***rkstxxk
iyxnnyknnnfllktfnk*ikk

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

VHA801 (VHA801Q) /CSM/VH/VHA8-A/VHA801Q.Seq.d/ 2516 0.0
AHF452 (AHF452Q) /CSM/AH/AHF4-C/AHF452Q.Seq.d/ 1348 0.0
AHN642 (AHN642Q) /CSM/AH/AHN6-B/AHN642Q.Seq.d/ 1195 0.0
FC-AI07 (FC-AI07Q) /CSM/FC/FC-AI/FC-AI07Q.Seq.d/ 1191 0.0
AHI237 (AHI237Q) /CSM/AH/AHI2-B/AHI237Q.Seq.d/ 1068 0.0
FC-BT07 (FC-BT07Q) /CSM/FC/FC-BT/FC-BT07Q.Seq.d/ 940 0.0
CFD164 (CFD164Q) /CSM/CF/CFD1-C/CFD164Q.Seq.d/ 737 0.0
SSL142 (SSL142Q) /CSM/SS/SSL1-B/SSL142Q.Seq.d/ 38 0.35
SLE396 (SLE396Q) /CSM/SL/SLE3-D/SLE396Q.Seq.d/ 38 0.35
VSF593 (VSF593Q) /CSM/VS/VSF5-D/VSF593Q.Seq.d/ 36 1.4

own update 2004.12.24
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

AC116956|AC116956.2 Dictyostelium discoideum chromosome 2 map 1418423-1684967 strain AX4, complete sequence. 1364 0.0 1
DY893369|DY893369.1 CeleSEQ12690 Cunninghamella elegans pBluescript (EcoRI-XhoI) Cunninghamella elegans cDNA clone CeleSEQ12690, mRNA sequence. 218 3e-83 3
CZ529473|CZ529473.1 SRAA-aac61d08.g1 Strongyloides ratti whole genome shotgun library (SRAAGSS 004) Strongyloides ratti genomic, genomic survey sequence. 111 6e-57 4
DT614605| CG8877-PA [Drosophila melanogaster], mRNA sequence. 159 1e-48 3
AF397148|AF397148.1 Paramecium tetraurelia macronuclear pre-mRNA processing factor 8 gene, complete cds. 119 3e-43 10
CA217231|CA217231.1 SCSFAD1125C07.g AD1 Saccharum officinarum cDNA clone SCSFAD1125C07 5', mRNA sequence. 180 6e-42 2
DY887341|DY887341.1 CeleSEQ3300 Cunninghamella elegans pBluescript (EcoRI-XhoI) Cunninghamella elegans cDNA clone CeleSEQ3300, mRNA sequence. 117 8e-42 2
CA098022|CA098022.1 SCMCCL6053A12.g CL6 Saccharum officinarum cDNA clone SCMCCL6053A12 5', mRNA sequence. 180 7e-41 1
AC190756|AC190756.3 Zea mays chromosome 9 clone CH201-223D9; ZMMBBc0223D09, *** SEQUENCING IN PROGRESS ***, 7 unordered pieces. 178 8e-41 7
AC190756|AC190756.3 Zea mays chromosome 9 clone CH201-223D9; ZMMBBc0223D09, *** SEQUENCING IN PROGRESS ***, 7 unordered pieces. 178 9e-41 7
dna update 2007. 3.22
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(Q8T295) RecName: Full=Pre-mRNA-processing-splicing factor 8 hom... 508 e-142
AB023482_13(AB023482|pid:none) Oryza sativa Japonica Group genom... 308 e-101
AK099780_1(AK099780|pid:none) Oryza sativa Japonica Group cDNA c... 308 e-101
AE013599_1462(AE013599|pid:none) Drosophila melanogaster chromos... 317 e-100
AL591496_12(AL591496|pid:none) Mouse DNA sequence from clone RP2... 322 e-100
(Q99PV0) RecName: Full=Pre-mRNA-processing-splicing factor 8; Al... 322 e-100
BC064370_1(BC064370|pid:none) Homo sapiens PRP8 pre-mRNA process... 322 e-100
BC045266_1(BC045266|pid:none) Xenopus laevis pre-mRNA processing... 320 e-100
BC034648_1(BC034648|pid:none) Mus musculus pre-mRNA processing f... 322 e-100
AK296344_1(AK296344|pid:none) Homo sapiens cDNA FLJ57882 complet... 318 e-100
protein update 2009. 6.26
PSORT

psg: 0.69 gvh: 0.37 alm: 0.38 top: 0.53 tms: 0.00 mit: 0.41 mip: 0.04
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

52.0 %: mitochondrial
24.0 %: nuclear
16.0 %: cytoplasmic
4.0 %: cytoskeletal
4.0 %: peroxisomal

>> prediction for VHA801 is mit

5' end seq. ID VHA801F
5' end seq.
>VHA801F.Seq
AAGTGGTATGAGTCATGATGAAGATCAATTAATTCCAAATCTTTATCGTTACATTCAACC
ATGGGAACAAGAAATTAAAGATAGTCAACGTGTTTGGGCAGAATATGCAATCAAATATGA
AGAGGCTAAATCACAAAATAAGAATCTTACATTGGAGGATCTTGAAGATAGTTGGGATCG
TGGATACCACGTATCAATACATTATTCCAAAAGAGTCGTCATACTTTAGCTTATGATAAA
GGTTGGAGAGTAAGAACCGATTGGAAACAATATCAAGTCCTAAAGAATAATCCATTCTGG
TGGACTAATCAACGTCATGATGGTAAACTTTGGAATCTTAATAATTATCGTACAGATATC
ATTCAAGCATTGGGTGGTGTTGAAGGTATTCTAGAGCATACTCTATTCAAAGGTACTTAT
TTCCCAACTTGGGAAGGTTTATTTTGGGAGAAAGCATCAGGTTTCGAAGAGTCTATGAAA
TATAAAAAACTTACACACGCTCAACGTTCAGGTCTTAATCAAATTCCAAATCGTCGTTTC
ACACTTTGGTGGTCTCCAACTATCAATCGTAAGAATGTTTATGTTNNNNNNNNNN
Length of 5' end seq. 595
3' end seq. ID VHA801Z
3' end seq.
>VHA801Z.Seq
NNNNNNNNNNGCTAAAGAAATTGGTGGTTTCACTTATGTTTTCCCAAAGAATATCTTAAA
GAAATTCATTACCATTGCCGATCTTCGTACTCAAATCATGGGTTATTGTTATGGTATCTC
ACCACCCGATAATCCATCAGTAAAAGAAATTCGTTGTATTGTAATGCCACCACAATGGGG
TACACCAGTTCATGTAACCGTACCAAATCAATTACCAGAACATGAATATCTCAAAGATTT
AGAACCATTGGGTTGGATTCACACTCAACCAACTGAATTACCACAACTATCACCACAAGA
TGTTATCACTCATTCAAAGATTATGTCTGACAATAAGAGTTGGGATGGTGAGAAAACTGT
TATCATCTCTGTCTCTGTGGCTTGGCCTTGTACTCTAACTGCTTACCATTTGACTCCAAG
TGGTTTTGAATGGGGTAAAAATAATAAAGATTCTCTCAATTATCAAGGTTATCAACCACA
ATTCTATGAGAAGGTTCAAATGTTATTATCCGATCGTTTCCTTGGTTTCTATATGGTACC
TGATCGTGGTTCTTGGAATTATAATTTCATGGGTGTTAAACATTCCACAAATATGACCTA
TGGTTTGAAATTAGATTATCCTAAAAACTTTTATGATGAATCTCATAGACCTGCTCATTT
CCAAAATTGGACTCAAATGGCTCCTTCAGCAAATGATGATGAAGAAAATCAACCNAAAAN
AAAATTTATTTNAATAATTATAAAAATAATAATTTTCTTTTAAAAACATTTAATAAATAA
ATAAAAAAA
Length of 3' end seq. 789
Connected seq. ID VHA801P
Connected seq.
>VHA801P.Seq
AAGTGGTATGAGTCATGATGAAGATCAATTAATTCCAAATCTTTATCGTTACATTCAACC
ATGGGAACAAGAAATTAAAGATAGTCAACGTGTTTGGGCAGAATATGCAATCAAATATGA
AGAGGCTAAATCACAAAATAAGAATCTTACATTGGAGGATCTTGAAGATAGTTGGGATCG
TGGATACCACGTATCAATACATTATTCCAAAAGAGTCGTCATACTTTAGCTTATGATAAA
GGTTGGAGAGTAAGAACCGATTGGAAACAATATCAAGTCCTAAAGAATAATCCATTCTGG
TGGACTAATCAACGTCATGATGGTAAACTTTGGAATCTTAATAATTATCGTACAGATATC
ATTCAAGCATTGGGTGGTGTTGAAGGTATTCTAGAGCATACTCTATTCAAAGGTACTTAT
TTCCCAACTTGGGAAGGTTTATTTTGGGAGAAAGCATCAGGTTTCGAAGAGTCTATGAAA
TATAAAAAACTTACACACGCTCAACGTTCAGGTCTTAATCAAATTCCAAATCGTCGTTTC
ACACTTTGGTGGTCTCCAACTATCAATCGTAAGAATGTTTATGTT----------GCTAA
AGAAATTGGTGGTTTCACTTATGTTTTCCCAAAGAATATCTTAAAGAAATTCATTACCAT
TGCCGATCTTCGTACTCAAATCATGGGTTATTGTTATGGTATCTCACCACCCGATAATCC
ATCAGTAAAAGAAATTCGTTGTATTGTAATGCCACCACAATGGGGTACACCAGTTCATGT
AACCGTACCAAATCAATTACCAGAACATGAATATCTCAAAGATTTAGAACCATTGGGTTG
GATTCACACTCAACCAACTGAATTACCACAACTATCACCACAAGATGTTATCACTCATTC
AAAGATTATGTCTGACAATAAGAGTTGGGATGGTGAGAAAACTGTTATCATCTCTGTCTC
TGTGGCTTGGCCTTGTACTCTAACTGCTTACCATTTGACTCCAAGTGGTTTTGAATGGGG
TAAAAATAATAAAGATTCTCTCAATTATCAAGGTTATCAACCACAATTCTATGAGAAGGT
TCAAATGTTATTATCCGATCGTTTCCTTGGTTTCTATATGGTACCTGATCGTGGTTCTTG
GAATTATAATTTCATGGGTGTTAAACATTCCACAAATATGACCTATGGTTTGAAATTAGA
TTATCCTAAAAACTTTTATGATGAATCTCATAGACCTGCTCATTTCCAAAATTGGACTCA
AATGGCTCCTTCAGCAAATGATGATGAAGAAAATCAACCNAAAANAAAATTTATTTNAAT
AATTATAAAAATAATAATTTTCTTTTAAAAACATTTAATAAATAAATAAAAAAA
Length of connected seq. 1364
Full length Seq ID -
Full length Seq. -
Length of full length seq. -