DK957075
Clone id TST39A01NGRL0027_G16
Library
Length 570
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0027_G16. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID
Sequence
GAGAGAGCGAGAGAGAAAGAGGGTTTTCTTCTTCCCGCTTCTCCACATACACTCACTCCC
CCTCCCAGCGCGTGGCGGCCGCTTCTGCTCCTGAAGTCTGCAGCTTCCGGCTCCATTTTC
GAGCTCGCCATTGTTCCTGTACCTTGCCCCTCTACAGAAATTCGGCTTATTGCGGCTTCC
ATGGCTCTGGCTGAAGTGGTGGCCGCGGCGCCTGACTCGGCTCACCATACAGGTGATCCG
GTTCGGACTATGGGTGGTGGTTTCAAAGGGGACCTACCTCAGGAACATGTTTCCGACGAC
AAGAATGGGGATGCTACTCAGTTTAAAAGATTAGATGGAGCAGCTGAAGCGGCCAATGCT
GGCTATAGTATACCTGTCTTGGCAAGCTATGATGGAGGTGTGGGATGTGATACAAGACAC
GTATTCTCTGGCATGCTTGCTGAGAATGGGCAGTCTATGTACTATGCCCCTGGTTATGAG
TTTCCACAGCAATCTCCTTATTGCTCACAGCCTCAAGGGGGCTTACATGTCAAATATGGG
ATCTGGAACAACGGGGGTACTGTGGACCGC
■■Homology search results ■■ -
sp_hit_id Q68749
Definition sp|Q68749|POLG_HCVBB Genome polyprotein OS=Hepatitis C virus genotype 2c (isolate BEBE1)
Align length 109
Score (bit) 33.5
E-value 0.77
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK957075|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0027_G16, 5'
(570 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

sp|Q68749|POLG_HCVBB Genome polyprotein OS=Hepatitis C virus gen... 33 0.77
sp|Q8AZM0|POLS_BSNV Structural polyprotein OS=Blotched snakehead... 31 3.8
sp|Q7XWS7|FH12_ORYSJ Formin-like protein 12 OS=Oryza sativa subs... 31 3.8
sp|Q64347|CLCN1_MOUSE Chloride channel protein, skeletal muscle ... 31 3.8
sp|Q54KT2|Y8896_DICDI Putative uncharacterized protein DDB_G0287... 31 4.9
sp|Q6UXD1|HRCT1_HUMAN Histidine-rich carboxyl terminus protein 1... 31 4.9
sp|Q6ZU45|YA021_HUMAN Putative C-type lectin domain-containing p... 30 6.5
sp|Q6GMB0|DCC1_XENLA Sister chromatid cohesion protein DCC1 OS=X... 30 6.5
sp|Q8BYG0|TTC24_MOUSE Tetratricopeptide repeat protein 24 OS=Mus... 30 8.4
sp|Q4WXZ5|RNY1_ASPFU Ribonuclease T2-like OS=Aspergillus fumigat... 30 8.5
sp|P35523|CLCN1_HUMAN Chloride channel protein, skeletal muscle ... 30 8.5

>sp|Q68749|POLG_HCVBB Genome polyprotein OS=Hepatitis C virus genotype
2c (isolate BEBE1) PE=3 SV=3
Length = 3037

Score = 33.5 bits (75), Expect = 0.77
Identities = 28/109 (25%), Positives = 40/109 (36%), Gaps = 3/109 (2%)
Frame = +1

Query: 22 GFLLPASPHTLTPPPSAWRPLLLLKSAASGSIFELAIVPVPCPSTEIRLIXXXXXXXXXX 201
G LP T PPP R ++L +S ++ ELAI CP
Sbjct: 2317 GCALPPPGTTPVPPPRRRRAVVLDQSNVGEALKELAIKSFGCPPP--------------- 2361

Query: 202 XXXXXXXHHTGDPVRTMGGGFKGDL---PQEHVSDDKNGDATQFKRLDG 339
+GDP + GGG G+ P + D + G + L+G
Sbjct: 2362 ---------SGDPGHSTGGGTTGETSKSPPDEPDDSEAGSVSSMPPLEG 2401


>sp|Q8AZM0|POLS_BSNV Structural polyprotein OS=Blotched snakehead
virus PE=1 SV=1
Length = 1069

Score = 31.2 bits (69), Expect = 3.8
Identities = 21/54 (38%), Positives = 25/54 (46%)
Frame = -3

Query: 232 LYGEPSQAPRPPLQPEPWKPQ*AEFL*RGKVQEQWRARKWSRKLQTSGAEAAAT 71
+YG P+QAP PP E E RG Q Q R + SG+ AAAT
Sbjct: 983 IYGSPNQAPAPPEFVEEVAAVLMENNGRGPNQAQMRELRLKALTMKSGSGAAAT 1036


>sp|Q7XWS7|FH12_ORYSJ Formin-like protein 12 OS=Oryza sativa subsp.
japonica GN=FH12 PE=3 SV=3
Length = 1669

Score = 31.2 bits (69), Expect = 3.8
Identities = 13/26 (50%), Positives = 14/26 (53%)
Frame = -3

Query: 259 HHP*SEPDHLYGEPSQAPRPPLQPEP 182
HHP P +L GE AP PP P P
Sbjct: 1043 HHPPERPHYLPGEVGGAPSPPSPPPP 1068


>sp|Q64347|CLCN1_MOUSE Chloride channel protein, skeletal muscle
OS=Mus musculus GN=Clcn1 PE=2 SV=2
Length = 994

Score = 31.2 bits (69), Expect = 3.8
Identities = 19/52 (36%), Positives = 26/52 (50%)
Frame = +1

Query: 7 AREKEGFLLPASPHTLTPPPSAWRPLLLLKSAASGSIFELAIVPVPCPSTEI 162
A E+E ++P P T PPPS P L + A G + EL +V P E+
Sbjct: 922 APERE-VMVPTMPETPVPPPSPEAPSCLAPARAEGELEELEMVGSLEPEEEL 972


>sp|Q54KT2|Y8896_DICDI Putative uncharacterized protein DDB_G0287191
OS=Dictyostelium discoideum GN=DDB_G0287191 PE=4 SV=1
Length = 72

Score = 30.8 bits (68), Expect = 4.9
Identities = 23/83 (27%), Positives = 30/83 (36%)
Frame = -3

Query: 421 RVLYHIPHLHHSLPRQVYYSQHWPLQLLHLIF*TE*HPHSCRRKHVPEVGPL*NHHP*SE 242
R+ YH + HHS P ++ + H HS H +HH S
Sbjct: 4 RLNYHHHNYHHSYPHHHHHHNY--------------HYHSYPHHH--------HHH--SH 39

Query: 241 PDHLYGEPSQAPRPPLQPEPWKP 173
H + PS P PP P P P
Sbjct: 40 HHHHHHLPSSPPSPPSPPSPPSP 62


>sp|Q6UXD1|HRCT1_HUMAN Histidine-rich carboxyl terminus protein 1
OS=Homo sapiens GN=HRCT1 PE=2 SV=1
Length = 115

Score = 30.8 bits (68), Expect = 4.9
Identities = 17/50 (34%), Positives = 23/50 (46%)
Frame = -3

Query: 376 QVYYSQHWPLQLLHLIF*TE*HPHSCRRKHVPEVGPL*NHHP*SEPDHLY 227
+V +Q WP + + H H HVP VG +HHP P HL+
Sbjct: 52 RVRRAQPWPFRRRGHLGIFHHHRHPGHVSHVPNVGLHHHHHPRHTPHHLH 101


>sp|Q6ZU45|YA021_HUMAN Putative C-type lectin domain-containing
protein FLJ44005 OS=Homo sapiens PE=2 SV=1
Length = 233

Score = 30.4 bits (67), Expect = 6.5
Identities = 18/38 (47%), Positives = 20/38 (52%), Gaps = 2/38 (5%)
Frame = -1

Query: 138 RNNG--ELENGAGSCRLQEQKRPPRAGRGSECMWRSGK 31
RN G ELE AGS R +EQ P GR W SG+
Sbjct: 7 RNRGLRELEEVAGSSRTREQLVPRLGGRAGYDWWASGE 44


>sp|Q6GMB0|DCC1_XENLA Sister chromatid cohesion protein DCC1
OS=Xenopus laevis GN=dscc1 PE=2 SV=1
Length = 390

Score = 30.4 bits (67), Expect = 6.5
Identities = 20/88 (22%), Positives = 37/88 (42%), Gaps = 1/88 (1%)
Frame = +1

Query: 292 SDDKNGDATQFKRLDGAAEAANAGYSIPVLASY-DGGVGCDTRHVFSGMLAENGQSMYYA 468
S+ +GD + + D + AG S+ + D V C + +A+ + +
Sbjct: 33 SEFTSGDYSLMELDDTLCKQIEAGDSLVIRGDKSDHAVLCSQDKTYDLKIADTSNLLLFI 92

Query: 469 PGYEFPQQSPYCSQPQGGLHVKYGIWNN 552
PG + P Q P QP +H + ++N
Sbjct: 93 PGCKLPDQLPADQQPLSVIHCEIAGFSN 120


>sp|Q8BYG0|TTC24_MOUSE Tetratricopeptide repeat protein 24 OS=Mus
musculus GN=Ttc24 PE=2 SV=1
Length = 334

Score = 30.0 bits (66), Expect = 8.4
Identities = 28/86 (32%), Positives = 41/86 (47%), Gaps = 2/86 (2%)
Frame = -3

Query: 307 HSCRRKHVPEVGPL*NHHP*SEPDHLYGEPSQAPRPPLQPEPWKPQ*AEFL*R--GKVQE 134
H K+ E L H P S + L + + A R + E K A FL GK+Q
Sbjct: 168 HDQALKYYKEALALCQHEPSSVRERLVAKLADAMRTFVAQE--KIAQARFLPSAPGKLQT 225

Query: 133 QWRARKWSRKLQTSGAEAAATRWEGE 56
+A K S ++Q+S +A ++WEGE
Sbjct: 226 SRKA-KTSARVQSSAEDAQESQWEGE 250


>sp|Q4WXZ5|RNY1_ASPFU Ribonuclease T2-like OS=Aspergillus fumigatus
GN=rny1 PE=3 SV=2
Length = 408

Score = 30.0 bits (66), Expect = 8.5
Identities = 12/41 (29%), Positives = 20/41 (48%)
Frame = +1

Query: 415 RHVFSGMLAENGQSMYYAPGYEFPQQSPYCSQPQGGLHVKY 537
+H+ + G S + P +E Q S CS+P+ H +Y
Sbjct: 9 QHILKALTGSLGLSTIFEPDHEASQNSFQCSKPELSCHAQY 49


tr_hit_id B4N2C7
Definition tr|B4N2C7|B4N2C7_DROWI GK16142 OS=Drosophila willistoni
Align length 115
Score (bit) 38.5
E-value 0.26
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK957075|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0027_G16, 5'
(570 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

tr|B4N2C7|B4N2C7_DROWI GK16142 OS=Drosophila willistoni GN=GK161... 39 0.26
tr|B4LKR2|B4LKR2_DROVI GJ20116 OS=Drosophila virilis GN=GJ20116 ... 37 0.77
tr|B4KYB6|B4KYB6_DROMO GI12500 OS=Drosophila mojavensis GN=GI125... 36 1.3
tr|B4J7V6|B4J7V6_DROGR GH20604 OS=Drosophila grimshawi GN=GH2060... 36 1.3
tr|B4KQW7|B4KQW7_DROMO GI20446 OS=Drosophila mojavensis GN=GI204... 36 1.7
tr|A6G5F2|A6G5F2_9DELT Putative uncharacterized protein OS=Plesi... 35 3.8
tr|A1ALZ0|A1ALZ0_PELPD Glutamate synthase (Ferredoxin) OS=Peloba... 35 3.9
tr|A6N3K6|A6N3K6_9PLAN Polysulfide reductase subunit C (Fragment... 34 5.0
tr|B4N5E4|B4N5E4_DROWI GK20557 OS=Drosophila willistoni GN=GK205... 34 5.0
tr|A3Z252|A3Z252_9SYNE Deoxyribodipyrimidine photolyase OS=Synec... 34 5.0
tr|A9U768|A9U768_PHYPA Predicted protein (Fragment) OS=Physcomit... 34 5.0
tr|Q2BC21|Q2BC21_9BACI Helicase, SWF/SNF family protein OS=Bacil... 34 6.6
tr|A1CP48|A1CP48_ASPCL Putative uncharacterized protein OS=Asper... 34 6.6
tr|A7S4H3|A7S4H3_NEMVE Predicted protein OS=Nematostella vectens... 33 8.6

>tr|B4N2C7|B4N2C7_DROWI GK16142 OS=Drosophila willistoni GN=GK16142
PE=4 SV=1
Length = 809

Score = 38.5 bits (88), Expect = 0.26
Identities = 30/115 (26%), Positives = 45/115 (39%), Gaps = 7/115 (6%)
Frame = -3

Query: 505 SNKEIAVETHNQGHST*TAHSQQACQRIRVLYHIPHLHHS-----LPRQVYYSQHWPLQL 341
+N + V+ H Q H+T H A +H+ H HH+ P ++ H +
Sbjct: 170 TNSAVNVKPHTQFHNTLAHHMTVAHHAAAAAHHV-HAHHAPHPHPHPHHSHHHHHHHAAM 228

Query: 340 LHLIF*T--E*HPHSCRRKHVPEVGPL*NHHP*SEPDHLYGEPSQAPRPPLQPEP 182
H + HPH+ HVP VG + + P P P P L P+P
Sbjct: 229 AHHLLANGFHPHPHALALAHVPVVGGQQSTAAVAPP-----APPTLPPPTLMPQP 278


>tr|B4LKR2|B4LKR2_DROVI GJ20116 OS=Drosophila virilis GN=GJ20116
PE=4 SV=1
Length = 793

Score = 37.0 bits (84), Expect = 0.77
Identities = 24/78 (30%), Positives = 31/78 (39%), Gaps = 3/78 (3%)
Frame = -3

Query: 397 LHHSLPRQVYYSQHWPLQLLHLIF*TE*HPHSCRRKHVPEVGPL*NHHP*SEPDHLYGEP 218
L H P Y ++ P H + E P H P P+ HP + P HL+G P
Sbjct: 210 LDHRRPPIDPYDRYGPPIHPHAVHPREYRPMHHEYPHPPRGPPMHRGHPHAHPHHLHGHP 269

Query: 217 ---SQAPRPPLQPEPWKP 173
AP P+ P P P
Sbjct: 270 PPHQYAPMRPMAPRPHVP 287


>tr|B4KYB6|B4KYB6_DROMO GI12500 OS=Drosophila mojavensis GN=GI12500
PE=4 SV=1
Length = 788

Score = 36.2 bits (82), Expect = 1.3
Identities = 19/65 (29%), Positives = 27/65 (41%)
Frame = -3

Query: 526 VSPLEAVSNKEIAVETHNQGHST*TAHSQQACQRIRVLYHIPHLHHSLPRQVYYSQHWPL 347
+SP+E + K+ + H H QQ Q + YH H Q +Y QH+
Sbjct: 301 LSPMEIKTEKQQRQQQQQHQHQQQQQHQQQQQQHQQQQYHQQQQHQQQQHQQHYQQHYQQ 360

Query: 346 QLLHL 332
Q HL
Sbjct: 361 QQQHL 365


>tr|B4J7V6|B4J7V6_DROGR GH20604 OS=Drosophila grimshawi GN=GH20604
PE=4 SV=1
Length = 793

Score = 36.2 bits (82), Expect = 1.3
Identities = 24/78 (30%), Positives = 30/78 (38%), Gaps = 3/78 (3%)
Frame = -3

Query: 397 LHHSLPRQVYYSQHWPLQLLHLIF*TE*HPHSCRRKHVPEVGPL*NHHP*SEPDHLYGEP 218
L H P Y ++ P H E P H P P+ HP + P HL+G P
Sbjct: 210 LDHRRPPVDPYDRYGPPLHPHAAHPREYRPMHHEYPHPPRGPPMHRGHPHTHPHHLHGHP 269

Query: 217 ---SQAPRPPLQPEPWKP 173
AP P+ P P P
Sbjct: 270 PPHQYAPMRPMAPRPHVP 287


>tr|B4KQW7|B4KQW7_DROMO GI20446 OS=Drosophila mojavensis GN=GI20446
PE=4 SV=1
Length = 791

Score = 35.8 bits (81), Expect = 1.7
Identities = 23/78 (29%), Positives = 31/78 (39%), Gaps = 3/78 (3%)
Frame = -3

Query: 397 LHHSLPRQVYYSQHWPLQLLHLIF*TE*HPHSCRRKHVPEVGPL*NHHP*SEPDHLYGEP 218
L H P Y ++ P H + E P H P P+ HP + P H++G P
Sbjct: 210 LDHRRPPIDPYDRYGPPIHPHSVHPREYRPMHHEYPHPPRGPPIHRGHPHAHPHHMHGHP 269

Query: 217 ---SQAPRPPLQPEPWKP 173
AP P+ P P P
Sbjct: 270 PPHQYAPMRPMAPRPHVP 287


>tr|A6G5F2|A6G5F2_9DELT Putative uncharacterized protein
OS=Plesiocystis pacifica SIR-1 GN=PPSIR1_03463 PE=4 SV=1
Length = 480

Score = 34.7 bits (78), Expect = 3.8
Identities = 11/18 (61%), Positives = 15/18 (83%)
Frame = -3

Query: 226 GEPSQAPRPPLQPEPWKP 173
G+P+ AP PP +PEPW+P
Sbjct: 154 GDPADAPEPPPEPEPWEP 171


>tr|A1ALZ0|A1ALZ0_PELPD Glutamate synthase (Ferredoxin) OS=Pelobacter
propionicus (strain DSM 2379) GN=Ppro_0729 PE=4 SV=1
Length = 1507

Score = 34.7 bits (78), Expect = 3.9
Identities = 21/71 (29%), Positives = 32/71 (45%), Gaps = 2/71 (2%)
Frame = +1

Query: 352 ANAGYSIPVLASYDGGVGCDTRHV--FSGMLAENGQSMYYAPGYEFPQQSPYCSQPQGGL 525
A AG I ++ YDGG G +H F G+ AE G + +C+ Q G+
Sbjct: 1010 AKAGADIITISGYDGGTGAARKHAIKFVGLPAEIG------------VREAHCALVQAGM 1057

Query: 526 HVKYGIWNNGG 558
+ +W +GG
Sbjct: 1058 RDRVELWADGG 1068


>tr|A6N3K6|A6N3K6_9PLAN Polysulfide reductase subunit C (Fragment)
OS=planctomycete Zi62 GN=psrC PE=4 SV=1
Length = 308

Score = 34.3 bits (77), Expect = 5.0
Identities = 27/112 (24%), Positives = 43/112 (38%), Gaps = 14/112 (12%)
Frame = +2

Query: 248 LWVVVSKGTYLRNMFPTTRMGMLLSLKD*MEQLKRPMLAIVY----LSWQAMMEVWDVIQ 415
LW V + TY GM+ L ++ K P+L IVY L W W +
Sbjct: 85 LWDVFAVSTYATVSVIFWYTGMIPDLATLRDRTKNPILRIVYSVLSLGWTGSARKWSRYE 144

Query: 416 DTYSLACLLRMGSLCTMPLVMSF----------HSNLLIAHSLKGAYMSNMG 541
Y+L L + ++ ++SF H+ + + + GA S G
Sbjct: 145 KAYTLFAALAAPLVLSVHTIVSFDFAVSQLPGWHTTIFPPYFVAGAVFSGFG 196


>tr|B4N5E4|B4N5E4_DROWI GK20557 OS=Drosophila willistoni GN=GK20557
PE=4 SV=1
Length = 880

Score = 34.3 bits (77), Expect = 5.0
Identities = 26/76 (34%), Positives = 29/76 (38%), Gaps = 2/76 (2%)
Frame = -3

Query: 403 PHLHHSLPRQVYYSQHWPLQLLHLIF*TE*HPHSCRRKHV-PEVGPL*NHHP*SEPDHLY 227
PHLH P Y QH P H HPH+ H P P + HP + P H
Sbjct: 742 PHLHQGHPHFAYQPQHHPHFYPHQ------HPHAHAHAHAHPHPHPHPHPHPHAHPPH-P 794

Query: 226 GEPSQAPRPPLQ-PEP 182
P Q P Q P P
Sbjct: 795 SHPHQQQHPHQQHPHP 810


>tr|A3Z252|A3Z252_9SYNE Deoxyribodipyrimidine photolyase
OS=Synechococcus sp. WH 5701 GN=WH5701_10699 PE=3 SV=1
Length = 508

Score = 34.3 bits (77), Expect = 5.0
Identities = 15/37 (40%), Positives = 20/37 (54%)
Frame = +1

Query: 52 LTPPPSAWRPLLLLKSAASGSIFELAIVPVPCPSTEI 162
+ PPP A +PL L A S EL + P PCP ++
Sbjct: 154 MVPPPEALQPLAGLDPGAIPSASELGLAPDPCPGRQV 190