DK953330
Clone id TST39A01NGRL0017_H20
Library
Length 628
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0017_H20. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID -
Sequence
ATGGCTACCTCCTCCGTCATATGGGCCCTCTTGGGCGTTTTCGCCCTCTTCTTGACACTC
TCCTCTGCCGATAACTCTCATGTGTTAAGCTACTCCCCTTCCGATCTCCACTCCCAAGCC
AAGCTTGTTAGCCTCTTTGACGCATGGAACATGAAGCATGGAAAACATTACACAGCCGCC
CAGAAGTTAGAGAAGCACAAGAGATTCCACATCTTCAGAGACAACCTGATGCGCATAGAG
GCGCACAACAGCAAGGGATCTACTTTTAAGCTTGGTCTCAACCGTTTCGCCGATTTGACT
CAAGATGAATTCAAGCAGAGCCGGCGTCTTGGTCTCAAGCTTCCTTCTGTCAAGCTTGGA
TCCCTCCGCAGGCGGTCCCACTTCCATCACAAGTCTGAGACCCCTATGGTAACAGCTGAA
TCTTTGGACTGGAGAACCCTTGGCGCCGTTACCCCAGTGAAAGATCAGGGCATGTGTGGA
AGCTGCTGGGCTTTCTCTGCCACAGGAGCTATTGAAGGAGCCAACGCTGTTGCAACAGGA
AACCTTGTCAGTGTTTCGGAGGAAGAGCTTGTGACATGCAGCAGTGAGAGTGGATGTGAT
GGGGGGCTGATGGATGATGCCTTTGAAT
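
The BLASTX alignments in this record report the query in frame +1, so the conceptual translation of the sequence above should start at its first base. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and are not part of the record or of BLAST.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated with
# bases in the order T, C, A, G (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, frame: int = 1) -> str:
    """Translate DNA in forward frame 1, 2 or 3; unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()  # drop line breaks and spaces
    start = frame - 1
    codons = (seq[i:i + 3] for i in range(start, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 60 bases of the sequence in this record.
first_line = "ATGGCTACCTCCTCCGTCATATGGGCCCTCTTGGGCGTTTTCGCCCTCTTCTTGACACTC"
print(translate(first_line))  # MATSSVIWALLGVFALFLTL
```

The printed peptide matches the start of the query lines shown in the alignments that begin at query position 1.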
Homology search results
sp_hit_id P43297
Definition sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana
Align length 205
Score (bit) 174.0
E-value 3.0e-43
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK953330|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0017_H20, 5'
(628 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters


Sequences producing significant alignments:    Score (bits)    E-value

sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 174 3e-43
sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis ... 167 3e-41
sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis ... 167 5e-41
sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2 158 2e-38
sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS... 154 5e-37
sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2 152 1e-36
sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. ... 151 3e-36
sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1 148 3e-35
sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2 145 2e-34
sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=A... 142 1e-33
sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 ... 141 2e-33
sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. j... 141 3e-33
sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. G... 141 3e-33
sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=A... 140 4e-33
sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1 139 9e-33
sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=A... 139 9e-33
sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=A... 139 2e-32
sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Br... 138 3e-32
sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium dis... 137 3e-32
sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 ... 137 3e-32
sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2... 136 8e-32
sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulg... 136 1e-31
sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulg... 136 1e-31
sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp.... 134 5e-31
sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium dis... 133 9e-31
sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis t... 132 2e-30
sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1 131 2e-30
sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4 131 2e-30
sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersi... 129 9e-30
sp|Q94503|CYSP6_DICDI Cysteine proteinase 6 OS=Dictyostelium dis... 129 1e-29
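
Each row of the hit list above ends with a bit score and an E-value, preceded by a UniProt-style identifier of the form `db|accession|entry_name`. As a rough sketch of how such rows could be read programmatically (the regular expression and the function name `parse_hit` are assumptions made for illustration, not part of BLAST):

```python
import re

# One summary row: "sp|ACCESSION|ENTRY description... BITS EVALUE"
HIT_LINE = re.compile(
    r"^(?P<db>sp|tr)\|(?P<accession>[^|]+)\|(?P<entry>\S+)\s+"
    r"(?P<description>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_hit(line: str) -> dict:
    """Split one hit-list row into its fields; raise ValueError if it does not fit."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        raise ValueError(f"not a hit-list row: {line!r}")
    hit = match.groupdict()
    hit["bits"] = float(hit["bits"])
    hit["evalue"] = float(hit["evalue"])
    return hit


row = "sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 174 3e-43"
hit = parse_hit(row)
print(hit["accession"], hit["bits"], hit["evalue"])
```

The description is matched lazily so that only the last two whitespace-separated tokens are taken as the score and E-value, which keeps truncated descriptions ending in `...` intact.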

>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis
thaliana GN=RD21A PE=1 SV=1
Length = 462

Score = 174 bits (442), Expect = 3e-43
Identities = 93/205 (45%), Positives = 134/205 (65%), Gaps = 8/205 (3%)
Frame = +1
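
The percentages on the `Identities` line are derived from the fractions printed next to them. The sketch below parses that line and checks each percentage against its fraction; it assumes the percentages are truncated rather than rounded, which is consistent with the values in this report (for example, 8/205 is about 3.9% and is printed as 3%). The names `STAT` and `parse_stats` are illustrative.

```python
import re

# Matches e.g. "Identities = 93/205 (45%)"
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line: str) -> dict:
    """Return {name: (count, alignment_length, reported_percent)}."""
    return {
        name.lower(): (int(count), int(length), int(percent))
        for name, count, length, percent in STAT.findall(line)
    }


line = "Identities = 93/205 (45%), Positives = 134/205 (65%), Gaps = 8/205 (3%)"
stats = parse_stats(line)
for name, (count, length, percent) in stats.items():
    # Integer division reproduces the truncated percentage.
    print(name, count, length, percent, count * 100 // length == percent)
```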

Query: 37 VFALFLTLSSADNSHVLSY------SPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKH 198
+F + +SSA + ++SY S + S+A+++S+++AW +KHGK + +EK
Sbjct: 11 LFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKD 70

Query: 199 KRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRS 378
+RF IF+DNL ++ HN K +++LGL RFADLT DE++ S+ LG K+ K G RR+
Sbjct: 71 RRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR-SKYLGAKME--KKGE--RRT 125

Query: 379 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 558
+++ ES+DWR GAV VKDQG CGSCWAFS GA+EG N + TG+L+++S
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185

Query: 559 EEELVTC--SSESGCDGGLMDDAFE 627
E+ELV C S GC+GGLMD AFE
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFE 210
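
In a BLASTX alignment such as the one above, query coordinates count nucleotides while subject coordinates count amino-acid residues. The following sketch shows how the two scales relate for a forward-frame hit; the helper names `forward_frame` and `codons_spanned` are hypothetical and only cover positive frames.

```python
def forward_frame(query_start: int) -> int:
    """Forward reading frame (1, 2 or 3) implied by a 1-based nucleotide start."""
    return (query_start - 1) % 3 + 1


def codons_spanned(query_start: int, query_end: int) -> int:
    """Number of translated residues covered by a nucleotide range (inclusive)."""
    length = query_end - query_start + 1
    if length % 3 != 0:
        raise ValueError("range does not cover a whole number of codons")
    return length // 3


# Coordinates taken from the alignment above: Query 37..627, Sbjct 11..210.
print(forward_frame(37))        # 1
print(codons_spanned(37, 627))  # 197
print(210 - 11 + 1)             # 200
```

The two sides cover different numbers of residues (197 and 200) because gap characters pad each of them to the common alignment length of 205.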


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis
thaliana GN=XCP2 PE=1 SV=2
Length = 356

Score = 167 bits (424), Expect = 3e-41
Identities = 89/201 (44%), Positives = 124/201 (61%), Gaps = 2/201 (0%)
Frame = +1

Query: 31 LGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHKRFH 210
L +L L+ +S+ + ++ YSP DL S KL+ LF+ W K Y + EK RF
Sbjct: 16 LSAASLSLSFASSHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVE--EKFLRFE 73

Query: 211 IFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRSHFHH 390
+F+DNL I+ N KG ++ LGLN FADL+ +EFK+ LGLK V+ R + F +
Sbjct: 74 VFKDNLKHIDETNKKGKSYWLGLNEFADLSHEEFKKMY-LGLKTDIVRRDEERSYAEFAY 132

Query: 391 KSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEEL 570
+ + +S+DWR GAV VK+QG CGSCWAFS A+EG N + TGNL ++SE+EL
Sbjct: 133 RDVEAV--PKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQEL 190

Query: 571 VTCSS--ESGCDGGLMDDAFE 627
+ C + +GC+GGLMD AFE
Sbjct: 191 IDCDTTYNNGCNGGLMDYAFE 211


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis
thaliana GN=XCP1 PE=1 SV=1
Length = 355

Score = 167 bits (422), Expect = 5e-41
Identities = 93/212 (43%), Positives = 133/212 (62%), Gaps = 13/212 (6%)
Frame = +1

Query: 31 LGVFALFLTLSS--------ADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQK 186
L F+L + +S+ A + ++ Y+P L + KL+ LF++W +H K Y + +
Sbjct: 8 LSKFSLLVAISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVE- 66

Query: 187 LEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSL 366
EK RF +FR+NLM I+ N++ +++ LGLN FADLT +EFK R LGL P
Sbjct: 67 -EKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFK-GRYLGLAKPQFS---- 120

Query: 367 RRR---SHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVAT 537
R+R ++F ++ T + +S+DWR GAV PVKDQG CGSCWAFS A+EG N + T
Sbjct: 121 RKRQPSANFRYRDITDL--PKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITT 178

Query: 538 GNLVSVSEEELVTCSS--ESGCDGGLMDDAFE 627
GNL S+SE+EL+ C + SGC+GGLMD AF+
Sbjct: 179 GNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQ 210


>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352

Score = 158 bits (399), Expect = 2e-38
Identities = 94/206 (45%), Positives = 121/206 (58%), Gaps = 4/206 (1%)
Frame = +1

Query: 1 MATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAA 180
M++ S I L + + LSSAD + + YS DL S +L+ LFD+W +KH K Y +
Sbjct: 4 MSSISKIIFLATCLIIHMGLSSAD-FYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESI 62

Query: 181 QKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLG 360
EK RF IFRDNLM I+ N K +++ LGLN FADL+ DEFK+ K
Sbjct: 63 D--EKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKK------KYVGFVAE 114

Query: 361 SLRRRSHFHHKSET-PMVT--AESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAV 531
HF ++ T VT +S+DWR GAVTPVK+QG CGSCWAFS +EG N +
Sbjct: 115 DFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKI 174

Query: 532 ATGNLVSVSEEELVTCSSES-GCDGG 606
TGNL+ +SE+ELV C S GC GG
Sbjct: 175 VTGNLLELSEQELVDCDKHSYGCKGG 200


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1
OS=Arabidopsis thaliana GN=GCP1 PE=2 SV=2
Length = 376

Score = 154 bits (388), Expect = 5e-37
Identities = 85/219 (38%), Positives = 132/219 (60%), Gaps = 10/219 (4%)
Frame = +1

Query: 1 MATSSVIWALLGVFALFLTLSSAD----NSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKH 168
MA S+ + +LL ++ + ++L+S D N H+ S + ++ S++ W+ +HGK
Sbjct: 1 MAPSTKVLSLLLLYVV-VSLASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKT 59

Query: 169 YTAAQKL--EKHKRFHIFRDNLMRIEAHN--SKGSTFKLGLNRFADLTQDEFKQSRRLGL 336
+ ++ KRF+IF+DNL I+ HN +K +T+KLGL +F DLT DE+++
Sbjct: 60 NNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGAR 119

Query: 337 KLPSVKLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIE 516
P+ ++ + + + + E++DWR GAV P+KDQG CGSCWAFS T A+E
Sbjct: 120 TEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVE 179

Query: 517 GANAVATGNLVSVSEEELVTC--SSESGCDGGLMDDAFE 627
G N + TG L+S+SE+ELV C S GC+GGLMD AF+
Sbjct: 180 GINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQ 218


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362

Score = 152 bits (385), Expect = 1e-36
Identities = 90/214 (42%), Positives = 126/214 (58%), Gaps = 5/214 (2%)
Frame = +1

Query: 1 MATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAA 180
MAT ++W +L F+L L ++++ + H DL S+ L L++ W H+T +
Sbjct: 1 MATKKLLWVVLS-FSLVLGVANSFDFH-----DKDLASEESLWDLYERWR----SHHTVS 50

Query: 181 QKL-EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKL--PSV 351
+ L EKHKRF++F+ NLM + N +KL LN+FAD+T EF+ S G K+ P +
Sbjct: 51 RSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFR-STYAGSKVNHPRM 109

Query: 352 KLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAV 531
G+ F + E + S+DWR GAVT VKDQG CGSCWAFS A+EG N +
Sbjct: 110 FRGTPHENGAFMY--EKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167

Query: 532 ATGNLVSVSEEELVTCSSE--SGCDGGLMDDAFE 627
T LV++SE+ELV C E GC+GGLM+ AFE
Sbjct: 168 KTNKLVALSEQELVDCDKEENQGCNGGLMESAFE 201


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp.
japonica GN=Os04g0650000 PE=1 SV=2
Length = 458

Score = 151 bits (381), Expect = 3e-36
Identities = 88/200 (44%), Positives = 123/200 (61%), Gaps = 6/200 (3%)
Frame = +1

Query: 46 LFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHKRFHIFRDN 225
L L+L++AD S ++SY S+ + L+ W +HGK Y A E+ +R+ FRDN
Sbjct: 14 LLLSLAAADMS-IVSYGE---RSEEEARRLYAEWKAEHGKSYNAVG--EEERRYAAFRDN 67

Query: 226 LMRIEAHNSKGS----TFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRSHFHHK 393
L I+ HN+ +F+LGLNRFADLT +E++ + LGL+ + R+ +
Sbjct: 68 LRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRDTY-LGLRNKPRR----ERKVSDRYL 122

Query: 394 SETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELV 573
+ ES+DWRT GAV +KDQG CGSCWAFSA A+EG N + TG+L+S+SE+ELV
Sbjct: 123 AADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELV 182

Query: 574 TC--SSESGCDGGLMDDAFE 627
C S GC+GGLMD AF+
Sbjct: 183 DCDTSYNEGCNGGLMDYAFD 202


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362

Score = 148 bits (373), Expect = 3e-35
Identities = 89/214 (41%), Positives = 123/214 (57%), Gaps = 5/214 (2%)
Frame = +1

Query: 1 MATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAA 180
MA ++W +L +L L L A++ + DL S+ L L++ W H+T +
Sbjct: 1 MAMKKLLWVVL---SLSLVLGVANS---FDFHEKDLESEESLWDLYERWR----SHHTVS 50

Query: 181 QKL-EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKL 357
+ L EKHKRF++F+ N+M + N +KL LN+FAD+T EF+ S G K+ K+
Sbjct: 51 RSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFR-STYAGSKVNHHKM 109

Query: 358 --GSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAV 531
GS F ++ + S+DWR GAVT VKDQG CGSCWAFS A+EG N +
Sbjct: 110 FRGSQHGSGTFMYEKVGSVPA--SVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQI 167

Query: 532 ATGNLVSVSEEELVTCSSE--SGCDGGLMDDAFE 627
T LVS+SE+ELV C E GC+GGLM+ AFE
Sbjct: 168 KTNKLVSLSEQELVDCDKEENQGCNGGLMESAFE 201


>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348

Score = 145 bits (366), Expect = 2e-34
Identities = 84/207 (40%), Positives = 116/207 (56%), Gaps = 2/207 (0%)
Frame = +1

Query: 13 SVIWALLGVFALFLTLS-SADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKL 189
S+ L LF+ +S S + ++ YS DL S +L+ LF++W + H K Y
Sbjct: 6 SISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVD-- 63

Query: 190 EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLR 369
EK RF IF+DNL I+ N K +++ LGLN FADL+ DEF + + S+ ++
Sbjct: 64 EKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKY-----VGSLIDATIE 118

Query: 370 RRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLV 549
+ +E + E++DWR GAVTPV+ QG CGSCWAFSA +EG N + TG LV
Sbjct: 119 QSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLV 178

Query: 550 SVSEEELVTCSSES-GCDGGLMDDAFE 627
+SE+ELV C S GC GG A E
Sbjct: 179 ELSEQELVDCERRSHGCKGGYPPYALE 205


>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960
OS=Arabidopsis thaliana GN=At3g43960 PE=2 SV=1
Length = 376

Score = 142 bits (359), Expect = 1e-33
Identities = 88/214 (41%), Positives = 129/214 (60%), Gaps = 5/214 (2%)
Frame = +1

Query: 1 MATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAA 180
MA S ALL + L +++S V++ + S ++ +++++++ W +++GK+Y
Sbjct: 1 MAISFRTLALLTLSVLLISISLG----VVTATESQ-RNEGEVLTMYEQWLVENGKNYNGL 55

Query: 181 QKLEKHKRFHIFRDNLMRIEAHNSKGS-TFKLGLNRFADLTQDEFKQSRRLGLKLPSVKL 357
EK +RF IF+DNL RIE HNS + +++ GLN+F+DLT DEF Q+ LG K+ L
Sbjct: 56 G--EKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEF-QASYLGGKMEKKSL 112

Query: 358 GSLRRRSHFHHKSETPMVTAESLDWRTLGAVTP-VKDQGMCGSCWAFSATGAIEGANAVA 534
+ R + P + +DWR GAV P VK QG CGSCWAF+ATGA+EG N +
Sbjct: 113 SDVAERYQYKEGDVLP----DEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQIT 168

Query: 535 TGNLVSVSEEELVTC---SSESGCDGGLMDDAFE 627
TG LVS+SE+EL+ C + GC GG AFE
Sbjct: 169 TGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFE 202


tr_hit_id Q94BX1
Definition tr|Q94BX1|Q94BX1_ARATH F2G19.31/F2G19.31 OS=Arabidopsis thaliana
Align length 205
Score (bit) 176.0
E-value 8.0e-43
Report
BLASTX 2.2.19 [Nov-02-2008]


Query= DK953330|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0017_H20, 5'
(628 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters


Sequences producing significant alignments:    Score (bits)    E-value

tr|Q94BX1|Q94BX1_ARATH F2G19.31/F2G19.31 OS=Arabidopsis thaliana... 176 8e-43
tr|Q56XI5|Q56XI5_ARATH Cysteine proteinase RD21A OS=Arabidopsis ... 174 3e-42
tr|O24323|O24323_PHAVU Cysteine proteinase OS=Phaseolus vulgaris... 172 1e-41
tr|Q6Y1E9|Q6Y1E9_TRIRP Cysteine protease 14 OS=Trifolium repens ... 172 2e-41
tr|Q84M29|Q84M29_HELAN Cysteine protease-1 OS=Helianthus annuus ... 171 3e-41
tr|A5HIJ6|A5HIJ6_ACTDE Cysteine protease Cp6 OS=Actinidia delici... 171 3e-41
tr|Q6Y1F0|Q6Y1F0_TRIRP Cysteine protease 14 OS=Trifolium repens ... 171 3e-41
tr|A9NW12|A9NW12_PICSI Putative uncharacterized protein OS=Picea... 169 1e-40
tr|Q0WT15|Q0WT15_ARATH Putative cysteine proteinase OS=Arabidops... 167 4e-40
tr|Q9ST61|Q9ST61_SOLTU Cysteine protease OS=Solanum tuberosum GN... 167 5e-40
tr|Q3E9R1|Q3E9R1_ARATH Uncharacterized protein At4g35350.2 OS=Ar... 167 6e-40
tr|B4ESE6|B4ESE6_HORVD Papain-like cysteine proteinase OS=Hordeu... 166 8e-40
tr|Q41064|Q41064_PEA Thiolprotease OS=Pisum sativum GN=tpp PE=2 ... 166 1e-39
tr|Q75NB3|Q75NB3_DIACA Cysteine proteinase OS=Dianthus caryophyl... 165 2e-39
tr|O49877|O49877_SOLLC CYP1 (Cysteine protease TDI-65) OS=Solanu... 165 2e-39
tr|Q6F6A9|Q6F6A9_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 164 3e-39
tr|Q2AAC8|Q2AAC8_9ASTR Cysteine proteinase OS=Platycodon grandif... 164 3e-39
tr|Q155L4|Q155L4_HEVBR Cysteine protease OS=Hevea brasiliensis G... 164 3e-39
tr|A7P8S5|A7P8S5_VITVI Chromosome chr3 scaffold_8, whole genome ... 164 4e-39
tr|Q40922|Q40922_PSEMZ Pseudotzain OS=Pseudotsuga menziesii GN=P... 163 7e-39
tr|B4ESE7|B4ESE7_HORVD Papain-like cysteine proteinase OS=Hordeu... 163 7e-39
tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea... 163 7e-39
tr|Q84M26|Q84M26_HELAN Cysteine protease-4 OS=Helianthus annuus ... 163 9e-39
tr|A9NV34|A9NV34_PICSI Putative uncharacterized protein OS=Picea... 163 9e-39
tr|Q52QX8|Q52QX8_MANES Cysteine protease CP1 OS=Manihot esculent... 162 1e-38
tr|A9PFF7|A9PFF7_POPTR Putative uncharacterized protein OS=Popul... 162 1e-38
tr|A9P285|A9P285_PICSI Putative uncharacterized protein OS=Picea... 162 2e-38
tr|A5HIJ2|A5HIJ2_ACTDE Cysteine protease Cp2 OS=Actinidia delici... 162 2e-38
tr|A5B6Y2|A5B6Y2_VITVI Putative uncharacterized protein OS=Vitis... 162 2e-38
tr|Q94HK7|Q94HK7_ORYSA Putative cysteine proteinase OS=Oryza sat... 161 3e-38

>tr|Q94BX1|Q94BX1_ARATH F2G19.31/F2G19.31 OS=Arabidopsis thaliana
PE=2 SV=1
Length = 462

Score = 176 bits (447), Expect = 8e-43
Identities = 94/205 (45%), Positives = 135/205 (65%), Gaps = 8/205 (3%)
Frame = +1

Query: 37 VFALFLTLSSADNSHVLSY------SPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKH 198
+F +T+SSA + ++SY S + S+A+++S+++AW +KHGK + +EK
Sbjct: 11 LFLAMVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKD 70

Query: 199 KRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRS 378
+RF IF+DNL ++ HN K +++LGL RFADLT DE++ S+ LG K+ K G RR+
Sbjct: 71 RRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR-SKYLGAKME--KKGE--RRT 125

Query: 379 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 558
+++ ES+DWR GAV VKDQG CGSCWAFS GA+EG N + TG+L+++S
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185

Query: 559 EEELVTC--SSESGCDGGLMDDAFE 627
E+ELV C S GC+GGLMD AFE
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFE 210


>tr|Q56XI5|Q56XI5_ARATH Cysteine proteinase RD21A OS=Arabidopsis
thaliana GN=At1g47128 PE=2 SV=1
Length = 433

Score = 174 bits (442), Expect = 3e-42
Identities = 93/205 (45%), Positives = 134/205 (65%), Gaps = 8/205 (3%)
Frame = +1

Query: 37 VFALFLTLSSADNSHVLSY------SPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKH 198
+F + +SSA + ++SY S + S+A+++S+++AW +KHGK + +EK
Sbjct: 11 LFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKD 70

Query: 199 KRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRS 378
+RF IF+DNL ++ HN K +++LGL RFADLT DE++ S+ LG K+ K G RR+
Sbjct: 71 RRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR-SKYLGAKME--KKGE--RRT 125

Query: 379 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 558
+++ ES+DWR GAV VKDQG CGSCWAFS GA+EG N + TG+L+++S
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185

Query: 559 EEELVTC--SSESGCDGGLMDDAFE 627
E+ELV C S GC+GGLMD AFE
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFE 210


>tr|O24323|O24323_PHAVU Cysteine proteinase OS=Phaseolus vulgaris
PE=3 SV=1
Length = 455

Score = 172 bits (436), Expect = 1e-41
Identities = 99/205 (48%), Positives = 133/205 (64%), Gaps = 8/205 (3%)
Frame = +1

Query: 37 VFALFLTLSSADNSHVLSYS-----PSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHK 201
+FALF LSSA + ++SY + + ++ SL++ W +KHGK Y A EK K
Sbjct: 3 LFALF-ALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALG--EKDK 59

Query: 202 RFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKL-PSVKLGSLRRRS 378
RF IF+DNL I+ N++ T+KLGLNRFADLT +E++ +R LG K+ P+ +LG
Sbjct: 60 RFQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYR-ARYLGTKIDPNRRLGRTPSNR 118

Query: 379 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 558
+ ET +S+DWR GAV PVKDQ CGSCWAFSA GA+EG N + TG+L+S+S
Sbjct: 119 YAPRVGET---LPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLS 175

Query: 559 EEELVTCSS--ESGCDGGLMDDAFE 627
E+ELV C + GC+GGLMD AFE
Sbjct: 176 EQELVDCDTGYNMGCNGGLMDYAFE 200


>tr|Q6Y1E9|Q6Y1E9_TRIRP Cysteine protease 14 OS=Trifolium repens
PE=2 SV=1
Length = 351

Score = 172 bits (435), Expect = 2e-41
Identities = 95/207 (45%), Positives = 126/207 (60%), Gaps = 2/207 (0%)
Frame = +1

Query: 10 SSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKL 189
SS L LFL+L+ + ++ YS DL S KL+ LF++W +HGK Y +
Sbjct: 5 SSKTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIE-- 62

Query: 190 EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLR 369
EK RF +F+DNL I+ N S + LGLN FADL+ EFK ++ LGLK V L R
Sbjct: 63 EKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFK-NKYLGLK---VNLSQRR 118

Query: 370 RRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLV 549
S+ + + +S+DWR GAVTPVK+QG CGSCWAFS A+EG N + TGNL
Sbjct: 119 ESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 178

Query: 550 SVSEEELVTCSS--ESGCDGGLMDDAF 624
S+SE+EL+ C + +GC+GGLMD AF
Sbjct: 179 SLSEQELIDCDTTYNNGCNGGLMDYAF 205


>tr|Q84M29|Q84M29_HELAN Cysteine protease-1 OS=Helianthus annuus
GN=scp1 PE=2 SV=1
Length = 461

Score = 171 bits (434), Expect = 3e-41
Identities = 95/212 (44%), Positives = 135/212 (63%), Gaps = 3/212 (1%)
Frame = +1

Query: 1 MATSSVIWALLGVFALFLTLSSADNSHVLSYSPS-DLHSQAKLVSLFDAWNMKHGKHYTA 177
MAT S + + A+ +++ + D +H+ S S S L + ++ +L+++W +KHGK Y A
Sbjct: 6 MATLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKTYNA 65

Query: 178 AQKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKL 357
EK +RF IF+DNL I+ HNS T+KLGLN+FADLT +E++ + + K
Sbjct: 66 LG--EKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKKK 123

Query: 358 GSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVAT 537
S + + ++S + E +DWR GAVT VKDQG CGSCWAFS TG++EG N + T
Sbjct: 124 LSKMKSDRYAYRSGDSL--PEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIVT 181

Query: 538 GNLVSVSEEELVTC--SSESGCDGGLMDDAFE 627
G+L+SVSE+ELV C S GC+GGLMD AFE
Sbjct: 182 GDLISVSEQELVNCDTSYNQGCNGGLMDYAFE 213


>tr|A5HIJ6|A5HIJ6_ACTDE Cysteine protease Cp6 OS=Actinidia deliciosa
PE=2 SV=1
Length = 461

Score = 171 bits (434), Expect = 3e-41
Identities = 100/207 (48%), Positives = 130/207 (62%), Gaps = 6/207 (2%)
Frame = +1

Query: 25 ALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHKR 204
ALL +F+LF S+ D S + S S + ++++++++W +KHGK Y A EK KR
Sbjct: 11 ALLLLFSLFALSSALDMSIIGELSSS--RTDDEVMAMYESWLVKHGKSYNAIG--EKEKR 66

Query: 205 FHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRSHF 384
F IF+DNL I+ HN++ T+K+GLNRFADLT DE++ S LG + GS RR S
Sbjct: 67 FQIFKDNLRFIDEHNAESRTYKVGLNRFADLTNDEYR-SMYLG-----ARTGSRRRLSTQ 120

Query: 385 HHKSETPMVTAESL----DWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVS 552
V ESL DWR GAV VKDQG CGSCWAFS A+EG N + TG+L+S
Sbjct: 121 KRSDRYVPVAGESLPDSVDWREKGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLIS 180

Query: 553 VSEEELVTC--SSESGCDGGLMDDAFE 627
+SE+ELV C S GC+GGLMD AFE
Sbjct: 181 LSEQELVDCDTSYNEGCNGGLMDYAFE 207


>tr|Q6Y1F0|Q6Y1F0_TRIRP Cysteine protease 14 OS=Trifolium repens
PE=3 SV=1
Length = 351

Score = 171 bits (433), Expect = 3e-41
Identities = 95/207 (45%), Positives = 126/207 (60%), Gaps = 2/207 (0%)
Frame = +1

Query: 10 SSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKL 189
SS L LFL+L+ + ++ YS DL S KL+ LF++W +HGK Y +
Sbjct: 5 SSKTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIE-- 62

Query: 190 EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLR 369
EK RF +F+DNL I+ N S + LGLN FADL+ EFK ++ LGLK V L R
Sbjct: 63 EKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFK-NKYLGLK---VDLSQRR 118

Query: 370 RRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLV 549
S+ + + +S+DWR GAVTPVK+QG CGSCWAFS A+EG N + TGNL
Sbjct: 119 ESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 178

Query: 550 SVSEEELVTCSS--ESGCDGGLMDDAF 624
S+SE+EL+ C + +GC+GGLMD AF
Sbjct: 179 SLSEQELIDCDTTYNNGCNGGLMDYAF 205


>tr|A9NW12|A9NW12_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 294

Score = 169 bits (428), Expect = 1e-40
Identities = 104/205 (50%), Positives = 130/205 (63%), Gaps = 5/205 (2%)
Frame = +1

Query: 28 LLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHKRF 207
+L + L L SS ++Y+P DL S+ L+SLFD W HGK YTA Q+ RF
Sbjct: 7 ILKLVMLLLVFSSVT---AITYNPRDL-SENGLLSLFDRWCNHHGKTYTAKQR---PLRF 59

Query: 208 HIFRDNLMRIEAHNSKGS-TFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRR--RS 378
+F++NL I HNS+G+ TF LGLN F+DLT DEF+ ++++GL+ L S RR +S
Sbjct: 60 QVFKENLFYISEHNSRGNHTFWLGLNAFSDLTSDEFR-TQQMGLRGHPPSLKSRRREPKS 118

Query: 379 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 558
P SLDWR AVT VKDQG CG CWAFSATGAIEG N + TG+LVS+S
Sbjct: 119 GLLELYNIP----SSLDWRDKDAVTGVKDQGACGDCWAFSATGAIEGINKIVTGSLVSLS 174

Query: 559 EEELVTC--SSESGCDGGLMDDAFE 627
E+EL C S SGCDGGLMD AF+
Sbjct: 175 EQELCDCDTSYNSGCDGGLMDYAFQ 199


>tr|Q0WT15|Q0WT15_ARATH Putative cysteine proteinase OS=Arabidopsis
thaliana GN=At1g20850 PE=2 SV=1
Length = 356

Score = 167 bits (424), Expect = 4e-40
Identities = 89/201 (44%), Positives = 124/201 (61%), Gaps = 2/201 (0%)
Frame = +1

Query: 31 LGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHKRFH 210
L +L L+ +S+ + ++ YSP DL S KL+ LF+ W K Y + EK RF
Sbjct: 16 LSAASLSLSFASSHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVE--EKFLRFE 73

Query: 211 IFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRSHFHH 390
+F+DNL I+ N KG ++ LGLN FADL+ +EFK+ LGLK V+ R + F +
Sbjct: 74 VFKDNLKHIDETNKKGKSYWLGLNEFADLSHEEFKKMY-LGLKTDIVRRDEERSYAEFAY 132

Query: 391 KSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEEL 570
+ + +S+DWR GAV VK+QG CGSCWAFS A+EG N + TGNL ++SE+EL
Sbjct: 133 RDVEAV--PKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQEL 190

Query: 571 VTCSS--ESGCDGGLMDDAFE 627
+ C + +GC+GGLMD AFE
Sbjct: 191 IDCDTTYNNGCNGGLMDYAFE 211


>tr|Q9ST61|Q9ST61_SOLTU Cysteine protease OS=Solanum tuberosum
GN=cyp PE=2 SV=1
Length = 466

Score = 167 bits (423), Expect = 5e-40
Identities = 100/216 (46%), Positives = 132/216 (61%), Gaps = 8/216 (3%)
Frame = +1

Query: 4 ATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLH--SQAKLVSLFDAWNMKHGKHYTA 177
A SS + L + +F TLSSA + ++SY + +H S ++ +L+++W ++HGK Y A
Sbjct: 3 AHSSTLTISLLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNA 62

Query: 178 AQKLEKHKRFHIFRDNLMRIEAHNS-KGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVK 354
EK KRF IF+DNL I+ NS ++KLGL +FADLT +E++ S LG K
Sbjct: 63 LG--EKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYR-SIYLGTK----S 115

Query: 355 LGSLRRRSHFHHKSETPMV---TAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGAN 525
G R+ S P V ES+DWR G + VKDQG CGSCWAFSA A+E N
Sbjct: 116 SGDRRKLSKNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESIN 175

Query: 526 AVATGNLVSVSEEELVTC--SSESGCDGGLMDDAFE 627
A+ TGNL+S+SE+ELV C S GCDGGLMD AFE
Sbjct: 176 AIVTGNLISLSEQELVDCDKSYNEGCDGGLMDYAFE 211