DK950220
Clone id TST38A01NGRL0008_B08
Library
Length 637
Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0008_B08. 5' end sequence.
Accession
Tissue type prothallia
Developmental stage gametophyte
Contig ID -
Sequence
CACACACACACACACACACACACACACACACACACACACACACACACACACACACACACA
CACGTGGTTTACTGCTATTCGCTTTTAGTGTGGCTCTTGAGGATGACAAGTGTTGAGGAT
ACAAGCATGATGCTCGCCTTTCAGGCGAAGTCTGGAAGCAACTGTGCCGCCTATGCTCCC
GCTCCCTCACCTATGGAGTGTTCCCCAGTAACCATTTCAGCAGAGGATCTGGATCTTCTT
ATCTGCTCGGGGTCGCTCGAACAAGAAATGTTATTGCTCTCTGGACTCGAGCCATCCCCT
TCTGCAATGGCCATGCACGGCGATGAATTTAGCTGCCTACTAGAGCATTTTGATAAGTGG
AGCTCACCTGTAGAGCCCATGTTTTCCACATGGAACCCTCGGTTTTGTTATCAGAAGGAT
ATGGCGGATGTGGCCTTGAGTGAGACTATCCAGTCGGGCAGCCTTAAATCAGGCAAGGAC
CAGAAAAACAGATATGAGGAAAAAGAAGTTTGTGAGAAGCGGAAGACTGCTCTTGCATCC
TCTAATACCCCTTTGAAGGCAGTAACACCTTCGGGAAGGAGGATTTGGAAGGGTGGTCCA
CCGGCCATGACAAGGTCTAGCTCGAAGTCTCGTCTTT
■■Homology search results ■■ -
sp_hit_id Q8BK58
Definition sp|Q8BK58|HBAP1_MOUSE HSPB1-associated protein 1 OS=Mus musculus
Align length 82
Score (bit) 38.5
E-value 0.03
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK950220|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0008_B08, 5'
(637 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

sp|Q8BK58|HBAP1_MOUSE HSPB1-associated protein 1 OS=Mus musculus... 39 0.030
sp|P09514|MCAPS_TYYVF Minor capsid protein OS=Turnip yellows vir... 33 0.95
sp|Q5BKC6|HBAP1_RAT HSPB1-associated protein 1 OS=Rattus norvegi... 32 3.6
sp|Q9LP03|PPR73_ARATH Pentatricopeptide repeat-containing protei... 31 4.7
sp|P74644|KAIA_SYNY3 Circadian clock protein kaiA OS=Synechocyst... 31 4.7
sp|Q6PGN9|PSRC1_HUMAN Proline/serine-rich coiled-coil protein 1 ... 31 6.2
sp|P75503|Y274_MYCPN Uncharacterized protein MG133 homolog OS=My... 30 8.0
sp|Q8K327|ZN828_MOUSE Zinc finger protein 828 OS=Mus musculus GN... 30 8.1
sp|Q6UXM1|LRIG3_HUMAN Leucine-rich repeats and immunoglobulin-li... 30 8.1

>sp|Q8BK58|HBAP1_MOUSE HSPB1-associated protein 1 OS=Mus musculus
GN=Hspbap1 PE=1 SV=1
Length = 483

Score = 38.5 bits (88), Expect = 0.030
Identities = 28/82 (34%), Positives = 39/82 (47%), Gaps = 4/82 (4%)
Frame = -1

Query: 295 MARVQRAITFLVRATPSR*EDPDPLLKWLLGNTP*VREREHRRHSCFQTSPERRASCLYP 116
+ARV+ AIT ++ T EDP WL N V E H +SC+ S A C +
Sbjct: 280 LARVEEAITRMLVCTLKTAEDPHHPRTWL--NPTEVEETSHEVNSCYLNS----AVCAFF 333

Query: 115 QHLSSSRA----TLKANSSKPR 62
H ++A L AN ++PR
Sbjct: 334 DHCEKAKAVELQVLSANGAEPR 355


>sp|P09514|MCAPS_TYYVF Minor capsid protein OS=Turnip yellows virus
(isolate FL-1) GN=ORF3/ORF5 PE=1 SV=2
Length = 670

Score = 33.5 bits (75), Expect = 0.95
Identities = 31/109 (28%), Positives = 46/109 (42%), Gaps = 3/109 (2%)
Frame = +1

Query: 232 DLLICSGSLEQEMLLLSGLEPSPSAMAMHGDEFSCLLEHFDKWSSPVEPMFSTWNPRFCY 411
DL + E+ M SGL P + + + +FD P + W P
Sbjct: 514 DLSTKNSQEEEAMSSESGLRPQLKPPGLPKPQPIRTIRNFD----PTPDLVEAWRPDVNP 569

Query: 412 QKDMADVALSETIQSGSLKSGKDQKNRYEEKEVCEKRK---TALASSNT 549
ADVA + I GS+K G+ ++ K V + RK ++LASS T
Sbjct: 570 GYSKADVAAATIIAGGSIKDGRSMIDK-RNKAVLDGRKSWGSSLASSLT 617


>sp|Q5BKC6|HBAP1_RAT HSPB1-associated protein 1 OS=Rattus norvegicus
GN=Hspbap1 PE=1 SV=1
Length = 479

Score = 31.6 bits (70), Expect = 3.6
Identities = 24/81 (29%), Positives = 36/81 (44%), Gaps = 4/81 (4%)
Frame = -1

Query: 295 MARVQRAITFLVRATPSR*EDPDPLLKWLLGNTP*VREREHRRHSCFQTSPERRASCLYP 116
+ARV+ A+T ++ T EDP WL N V E H +SC+ S A C +
Sbjct: 280 LARVEEAVTRMLVCTLKTAEDPHHPRTWL--NPTEVEETSHEVNSCYLNS----AVCAFF 333

Query: 115 QHLSSSR----ATLKANSSKP 65
H ++ +AN +P
Sbjct: 334 DHCERAKEVEMQAPRANGEEP 354


>sp|Q9LP03|PPR73_ARATH Pentatricopeptide repeat-containing protein
At1g43980, mitochondrial OS=Arabidopsis thaliana
GN=PCMP-E58 PE=2 SV=1
Length = 633

Score = 31.2 bits (69), Expect = 4.7
Identities = 13/35 (37%), Positives = 17/35 (48%)
Frame = +1

Query: 367 PVEPMFSTWNPRFCYQKDMADVALSETIQSGSLKS 471
P EP W P C D+ D L+ET+ L+S
Sbjct: 512 PFEPSSHIWEPILCASLDLGDTRLAETVAKTMLES 546


>sp|P74644|KAIA_SYNY3 Circadian clock protein kaiA OS=Synechocystis
sp. (strain PCC 6803) GN=kaiA PE=3 SV=1
Length = 299

Score = 31.2 bits (69), Expect = 4.7
Identities = 29/85 (34%), Positives = 38/85 (44%), Gaps = 1/85 (1%)
Frame = +1

Query: 97 LRMTSVEDTSMMLAFQAKSGSNCAAYAPAPSPMECSPVTISAEDLDLLICSGSLEQEMLL 276
LR D + FQA CA P ++C V A L +L + EQ LL
Sbjct: 19 LRSIFQGDRHYLSTFQALDDF-CAFLEDKPERIDCLLVYYEANSLPVL--NRLYEQGRLL 75

Query: 277 -LSGLEPSPSAMAMHGDEFSCLLEH 348
+ LEPSPSA+A DE ++ H
Sbjct: 76 PIILLEPSPSALAKTTDEHPTIVYH 100


>sp|Q6PGN9|PSRC1_HUMAN Proline/serine-rich coiled-coil protein 1
OS=Homo sapiens GN=PSRC1 PE=1 SV=1
Length = 363

Score = 30.8 bits (68), Expect = 6.2
Identities = 39/165 (23%), Positives = 62/165 (37%), Gaps = 10/165 (6%)
Frame = +1

Query: 169 AYAPAPSPMECSPVTISAEDLDLLICSGS---LEQEMLLLSGLEPSPSAMAMHGDEFSCL 339
A APAP + S +S E L+ ++ + + E L E + + + S
Sbjct: 54 AVAPAPQGVRLSLGPLSPEKLEEILDEANRLAAQLEQCALQDRESAGEGLGPRRVKPSPR 113

Query: 340 LEHFDKWSSPVEPMFSTWNP--RFCYQKDMADVALSETIQSGSLK-----SGKDQKNRYE 498
E F SPV + T N R L + GS++ SGK N
Sbjct: 114 RETFVLKDSPVRDLLPTVNSLTRSTPSPSSLTPRLRSNDRKGSVRALRATSGKRPSNMKR 173

Query: 499 EKEVCEKRKTALASSNTPLKAVTPSGRRIWKGGPPAMTRSSSKSR 633
E C + + +++PL TP R + GP +S ++R
Sbjct: 174 ESPTCNLFPASKSPASSPLTRSTPPVR--GRAGPSGRAAASEETR 216


>sp|P75503|Y274_MYCPN Uncharacterized protein MG133 homolog
OS=Mycoplasma pneumoniae GN=MPN_274 PE=4 SV=1
Length = 266

Score = 30.4 bits (67), Expect = 8.0
Identities = 10/16 (62%), Positives = 15/16 (93%)
Frame = -3

Query: 518 FSQTSFSSYLFFWSLP 471
FSQT ++SY++FWS+P
Sbjct: 222 FSQTKYNSYVWFWSIP 237


>sp|Q8K327|ZN828_MOUSE Zinc finger protein 828 OS=Mus musculus
GN=Znf828 PE=1 SV=1
Length = 802

Score = 30.4 bits (67), Expect = 8.1
Identities = 27/91 (29%), Positives = 38/91 (41%), Gaps = 8/91 (8%)
Frame = +1

Query: 355 KWSSPVEPMF-STWNP-------RFCYQKDMADVALSETIQSGSLKSGKDQKNRYEEKEV 510
K S PV+PM W P + M+ + ++ SGS K+ ++
Sbjct: 325 KSSKPVQPMSPGPWKPIPSVSPGPWKPAPSMSTASWKSSVSSGSWKTPPTSPESWKSGPP 384

Query: 511 CEKRKTALASSNTPLKAVTPSGRRIWKGGPP 603
E RKTAL S KAV P + + GPP
Sbjct: 385 -ELRKTALPLSPEHWKAVPPVSPELRRPGPP 414


>sp|Q6UXM1|LRIG3_HUMAN Leucine-rich repeats and immunoglobulin-like
domains protein 3 OS=Homo sapiens GN=LRIG3 PE=2 SV=1
Length = 1119

Score = 30.4 bits (67), Expect = 8.1
Identities = 23/69 (33%), Positives = 35/69 (50%)
Frame = +1

Query: 346 HFDKWSSPVEPMFSTWNPRFCYQKDMADVALSETIQSGSLKSGKDQKNRYEEKEVCEKRK 525
H D +SS +P S PR Y K + S + SGS + GK++ + EE +C ++
Sbjct: 1049 HLDAYSSFGQP--SDCQPRAFYLKAHS----SPDLDSGSEEDGKERTDFQEENHICTFKQ 1102

Query: 526 TALASSNTP 552
T L + TP
Sbjct: 1103 T-LENYRTP 1110


tr_hit_id B3FGU1
Definition tr|B3FGU1|B3FGU1_9LUTE Read-through protein P5 OS=Cucurbit aphid-borne yellows virus
Align length 64
Score (bit) 38.1
E-value 0.45
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK950220|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0008_B08, 5'
(637 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

tr|B3FGU1|B3FGU1_9LUTE Read-through protein P5 OS=Cucurbit aphid... 38 0.45
tr|O73685|O73685_CHICK G-protein coupled receptor kinase 1 OS=Ga... 37 1.3
tr|Q65972|Q65972_9LUTE Putative uncharacterized protein OS=Cucur... 36 2.3
tr|Q8CGS6|Q8CGS6_MOUSE DNA polymerase theta short isoform OS=Mus... 36 2.3
tr|Q80XB7|Q80XB7_MOUSE DNA polymerase theta OS=Mus musculus GN=P... 36 2.3
tr|Q7TQC0|Q7TQC0_MOUSE DNA polymerase Q OS=Mus musculus GN=Polq ... 36 2.3
tr|Q3U1F8|Q3U1F8_MOUSE Putative uncharacterized protein (Fragmen... 36 2.3
tr|A2SCZ5|A2SCZ5_METPP Hydrolase, putative OS=Methylibium petrol... 35 3.8
tr|B7T510|B7T510_OREMO Prolactin receptor 1 OS=Oreochromis mossa... 35 5.0
tr|B8MHH6|B8MHH6_9EURO Putative uncharacterized protein OS=Talar... 34 6.5

>tr|B3FGU1|B3FGU1_9LUTE Read-through protein P5 OS=Cucurbit
aphid-borne yellows virus PE=4 SV=1
Length = 668

Score = 38.1 bits (87), Expect = 0.45
Identities = 19/64 (29%), Positives = 31/64 (48%)
Frame = +1

Query: 364 SPVEPMFSTWNPRFCYQKDMADVALSETIQSGSLKSGKDQKNRYEEKEVCEKRKTALASS 543
+P + W P + ADVA + I GS+ G+D R +EK + ++K + SS
Sbjct: 554 NPNPDLVEAWRPDLAPEYSKADVAAATVIAGGSIHEGRDMLRRRDEKVMDSRKKWGVLSS 613

Query: 544 NTPL 555
+ L
Sbjct: 614 ASSL 617


>tr|O73685|O73685_CHICK G-protein coupled receptor kinase 1
OS=Gallus gallus GN=GRK1 PE=2 SV=1
Length = 593

Score = 36.6 bits (83), Expect = 1.3
Identities = 24/75 (32%), Positives = 32/75 (42%)
Frame = +1

Query: 241 ICSGSLEQEMLLLSGLEPSPSAMAMHGDEFSCLLEHFDKWSSPVEPMFSTWNPRFCYQKD 420
+C G L ++ G AM FS L H+ WS P P F +PR Y KD
Sbjct: 456 LCEGLLAKDPQKRLGFRDGNCAMLRSQPVFSAL--HWGSWSGPPPPPFVP-DPRRVYAKD 512

Query: 421 MADVALSETIQSGSL 465
+ DV T++ L
Sbjct: 513 LGDVGAFSTVRGVEL 527


>tr|Q65972|Q65972_9LUTE Putative uncharacterized protein OS=Cucurbit
aphid-borne yellows virus PE=4 SV=2
Length = 667

Score = 35.8 bits (81), Expect = 2.3
Identities = 18/64 (28%), Positives = 29/64 (45%)
Frame = +1

Query: 364 SPVEPMFSTWNPRFCYQKDMADVALSETIQSGSLKSGKDQKNRYEEKEVCEKRKTALASS 543
+P + W P ADVA + + GS+ G+D R E K + ++K + SS
Sbjct: 553 NPGPDLIEVWRPDLAPGYSKADVAAATVLAGGSVHEGRDMLERREAKVMDSRKKWGILSS 612

Query: 544 NTPL 555
+ L
Sbjct: 613 TSSL 616


>tr|Q8CGS6|Q8CGS6_MOUSE DNA polymerase theta short isoform OS=Mus
musculus GN=Polq PE=2 SV=1
Length = 2265

Score = 35.8 bits (81), Expect = 2.3
Identities = 31/111 (27%), Positives = 47/111 (42%), Gaps = 2/111 (1%)
Frame = +1

Query: 253 SLEQEMLLLSGLEPSPSAMAMHG-DEFSCLLEHFDKWSSPVEPMFSTWNPRFCYQKDMAD 429
S+ + + SG PS A G D+ S + K+ EP P C +
Sbjct: 652 SMGRNSIRASGSNDKPSPDAERGIDDCSEHADSLCKFQGNFEPQ----TPSICTARKRTS 707

Query: 430 VALSETIQSGSLKSGKDQKNRYEEKEVCEK-RKTALASSNTPLKAVTPSGR 579
+ +++ + SLK GK + EK RKTAL+ S+ + PSGR
Sbjct: 708 LGINKEMLRKSLKEGKPSTKEVLQTFSSEKTRKTALSFSSEQVNNTLPSGR 758


>tr|Q80XB7|Q80XB7_MOUSE DNA polymerase theta OS=Mus musculus GN=Polq
PE=2 SV=1
Length = 2544

Score = 35.8 bits (81), Expect = 2.3
Identities = 31/111 (27%), Positives = 47/111 (42%), Gaps = 2/111 (1%)
Frame = +1

Query: 253 SLEQEMLLLSGLEPSPSAMAMHG-DEFSCLLEHFDKWSSPVEPMFSTWNPRFCYQKDMAD 429
S+ + + SG PS A G D+ S + K+ EP P C +
Sbjct: 931 SMGRNSIRASGSNDKPSPDAERGIDDCSEHADSLCKFQGNFEPQ----TPSICTARKRTS 986

Query: 430 VALSETIQSGSLKSGKDQKNRYEEKEVCEK-RKTALASSNTPLKAVTPSGR 579
+ +++ + SLK GK + EK RKTAL+ S+ + PSGR
Sbjct: 987 LGINKEMLRKSLKEGKPSTKEVLQTFSSEKTRKTALSFSSEQVNNTLPSGR 1037


>tr|Q7TQC0|Q7TQC0_MOUSE DNA polymerase Q OS=Mus musculus GN=Polq PE=1
SV=1
Length = 2587

Score = 35.8 bits (81), Expect = 2.3
Identities = 31/111 (27%), Positives = 47/111 (42%), Gaps = 2/111 (1%)
Frame = +1

Query: 253 SLEQEMLLLSGLEPSPSAMAMHG-DEFSCLLEHFDKWSSPVEPMFSTWNPRFCYQKDMAD 429
S+ + + SG PS A G D+ S + K+ EP P C +
Sbjct: 974 SMGRNSIRASGSNDKPSPDAERGIDDCSEHADSLCKFQGNFEPQ----TPSICTARKRTS 1029

Query: 430 VALSETIQSGSLKSGKDQKNRYEEKEVCEK-RKTALASSNTPLKAVTPSGR 579
+ +++ + SLK GK + EK RKTAL+ S+ + PSGR
Sbjct: 1030 LGINKEMLRKSLKEGKPSTKEVLQTFSSEKTRKTALSFSSEQVNNTLPSGR 1080


>tr|Q3U1F8|Q3U1F8_MOUSE Putative uncharacterized protein (Fragment)
OS=Mus musculus GN=Polq PE=2 SV=1
Length = 929

Score = 35.8 bits (81), Expect = 2.3
Identities = 31/111 (27%), Positives = 47/111 (42%), Gaps = 2/111 (1%)
Frame = +1

Query: 253 SLEQEMLLLSGLEPSPSAMAMHG-DEFSCLLEHFDKWSSPVEPMFSTWNPRFCYQKDMAD 429
S+ + + SG PS A G D+ S + K+ EP P C +
Sbjct: 404 SMGRNSIRASGSNDKPSPDAERGIDDCSEHADSLCKFQGNFEPQ----TPSICTARKRTS 459

Query: 430 VALSETIQSGSLKSGKDQKNRYEEKEVCEK-RKTALASSNTPLKAVTPSGR 579
+ +++ + SLK GK + EK RKTAL+ S+ + PSGR
Sbjct: 460 LGINKEMLRKSLKEGKPSTKEVLQTFSSEKTRKTALSFSSEQVNNTLPSGR 510


>tr|A2SCZ5|A2SCZ5_METPP Hydrolase, putative OS=Methylibium
petroleiphilum (strain PM1) GN=Mpe_A0472 PE=4 SV=1
Length = 298

Score = 35.0 bits (79), Expect = 3.8
Identities = 26/70 (37%), Positives = 35/70 (50%), Gaps = 4/70 (5%)
Frame = -3

Query: 611 VMAGGPPFQILLPEGVTAFKGVLEDARAVFRFSQTSFSSYLFFW----SLPDLRLPDWIV 444
++AG P F++ +G TA VL D R S F+ L F + P RLPDW+
Sbjct: 152 LLAGNPRFELRELKGHTASDLVLVD-----RISGVVFAGGLVFVDRVPTTPHARLPDWLA 206

Query: 443 SLKATSAISF 414
SL A +A F
Sbjct: 207 SLDALAAQPF 216


>tr|B7T510|B7T510_OREMO Prolactin receptor 1 OS=Oreochromis
mossambicus PE=2 SV=1
Length = 630

Score = 34.7 bits (78), Expect = 5.0
Identities = 19/58 (32%), Positives = 30/58 (51%)
Frame = +1

Query: 88 VWLLRMTSVEDTSMMLAFQAKSGSNCAAYAPAPSPMECSPVTISAEDLDLLICSGSLE 261
V LL+ VE+ SM + A+S + P P CSP+ +S +D +L SG ++
Sbjct: 564 VLLLQREVVEEESMEMGGAAESCYTSSIAFTTPKPTACSPIVLSVQDERVLAVSGYVD 621


>tr|B8MHH6|B8MHH6_9EURO Putative uncharacterized protein
OS=Talaromyces stipitatus ATCC 10500 GN=TSTA_010730 PE=4
SV=1
Length = 1462

Score = 34.3 bits (77), Expect = 6.5
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 3/107 (2%)
Frame = +1

Query: 217 SAEDLDLLICSGSLEQEMLLLSGLEPSPSAMAMHGDEFSCLLEHFDKWSSPVEPMFSTW- 393
S D+D S E E L+ P +A+ H D FS ++ P E W
Sbjct: 431 SLRDVDEAKASRRAEIERRCLALHPPLTAAVLSHMDSFSAAIQ------IPHELTDKDWE 484

Query: 394 --NPRFCYQKDMADVALSETIQSGSLKSGKDQKNRYEEKEVCEKRKT 528
PR Q++MA+ E +Q L K ++ R +E + E +++
Sbjct: 485 YLKPRLLAQREMAESKEIERLQESQLLQAKTEERRQQEARLKEDKQS 531