DK952021
Clone id TST38A01NGRL0012_P13
Library
Length 653
Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0012_P13. 5' end sequence.
Accession
Tissue type prothallia
Developmental stage gametophyte
Contig ID -
Sequence
CAAGCAGAGCCGGCGTCTTGGTCTCAAGCTTCCTTCTGTCAAGCTTGGATCCCTCCGCAG
GCGGTCCCACTTCCATCACAAGTCTGAGACCCCTATGGTAACAGCTGAATCTTTGGACTG
GAGAACCCTTGGCGCCGTTACCCCAGTGAAAGATCAGGGCATGTGTGGAAGCTGCTGGGC
TTTCTCTGCCACAGGAGCTATTGAAGGAGCCAACGCTGTTGCAACAGGAAACCTTGTCAG
TGTTTCGGAGGAAGAGCTTGTGACATGCAGCAGTGAGAGTGGATGTGATGGGGGGCTGAT
GGATGATGCCTTTGAATGGGTTATTGACAATGGCGGGATTGCCACAGAAGATAATTATCC
TTATCTAAGCTACAGTGGCTCCTCTGGTGCTTGCGACACCAAAATTGAAGAGGAAGAAAA
AGCTGTGTCTATTGATGGTTATGCTGATGTGCAACCCTACAGCGAAGCAGCACTTTTAGA
AGCTGTCGCCAAGCAGCCCGTTAGTGTAGCCATTGAAGCCTCTTCTTGGGATCTTCAGCT
TTATGTTGAGGGTGTGTACAATGGCACCTGCTCAAGCGATCCTTATGATATCAACCATGG
TGTATTGCTCGTTGGATATGGCTCTGAGAATGGCGTTGATTATTGGATCGTGA
Homology search results
sp_hit_id P43297
Definition sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana
Align length 218
Score (bit) 209.0
E-value 1.0e-53
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK952021|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0012_P13, 5'
(653 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters




Sequences producing significant alignments: Score (bits) E value

sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 209 1e-53
sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. ... 206 1e-52
sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=A... 206 1e-52
sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. j... 205 2e-52
sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=A... 202 1e-51
sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinas... 201 3e-51
sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS... 199 1e-50
sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=A... 197 4e-50
sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis ... 197 5e-50
sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2 194 3e-49
sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1 193 6e-49
sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Br... 191 3e-48
sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 ... 191 3e-48
sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis ... 187 3e-47
sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. G... 185 2e-46
sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp.... 182 1e-45
sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1 182 1e-45
sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulg... 179 1e-44
sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulg... 179 1e-44
sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1 177 4e-44
sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max P... 177 6e-44
sp|P09648|CATL1_CHICK Cathepsin L1 (Fragments) OS=Gallus gallus ... 176 1e-43
sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata P... 174 4e-43
sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4 174 4e-43
sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1 174 5e-43
sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium dis... 171 3e-42
sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1 169 9e-42
sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus... 169 2e-41
sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=C... 169 2e-41
sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata P... 168 2e-41
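
The rows of the hit table above can be read programmatically. The following is a minimal Python sketch, assuming each row has the layout seen in this report (`db|accession|entry_name`, a possibly truncated description, a bit score, an E-value). The name `parse_hit_line` and the regular expression are illustrative assumptions, not part of BLAST.

```python
import re

# Assumed row layout, taken from the rows in this report:
#   sp|P43297|RD21A_ARATH <description ...> <bit score> <E-value>
HIT_LINE = re.compile(
    r"^(?P<db>sp|tr)\|(?P<accession>[^|]+)\|(?P<entry>\S+)\s+"
    r"(?P<description>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_hit_line(line):
    """Return a dict for one hit-table row, or None if the line is not a row."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    evalue_text = match["evalue"]
    # Some BLAST versions abbreviate very small E-values as "e-101";
    # prefix a mantissa of 1 so float() can parse them.
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    try:
        evalue = float(evalue_text)
    except ValueError:
        return None
    return {
        "db": match["db"],
        "accession": match["accession"],
        "entry": match["entry"],
        "description": match["description"],
        "bits": float(match["bits"]),
        "evalue": evalue,
    }


row = parse_hit_line(
    "sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 209 1e-53"
)
print(row["accession"], row["bits"], row["evalue"])
```

Lines that do not start with `sp|` or `tr|`, such as the table header, return `None`, so the function can be applied to every line of the report and the `None` results discarded.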

>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis
thaliana GN=RD21A PE=1 SV=1
Length = 462

Score = 209 bits (532), Expect = 1e-53
Identities = 110/218 (50%), Positives = 146/218 (66%), Gaps = 2/218 (0%)
Frame = +2

Query: 5 QSRRLGLKLPSVKLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAF 184
+S+ LG K+ K G RR+ +++ ES+DWR GAV VKDQG CGSCWAF
Sbjct: 109 RSKYLGAKME--KKGE--RRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAF 164

Query: 185 SATGAIEGANAVATGNLVSVSEEELVTC--SSESGCDGGLMDDAFEWVIDNGGIATEDNY 358
S GA+EG N + TG+L+++SE+ELV C S GC+GGLMD AFE++I NGGI T+ +Y
Sbjct: 165 STIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDY 224

Query: 359 PYLSYSGSSGACDTKIEEEEKAVSIDGYADVQPYSEAALLEAVAKQPVSVAIEASSWDLQ 538
P Y G G CD +I + K V+ID Y DV YSE +L +AVA QP+S+AIEA Q
Sbjct: 225 P---YKGVDGTCD-QIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQ 280

Query: 539 LYVEGVYNGTCSSDPYDINHGVLLVGYGSENGVDYWIV 652
LY G+++G+C + ++HGV+ VGYG+ENG DYWIV
Sbjct: 281 LYDSGIFDGSCGT---QLDHGVVAVGYGTENGKDYWIV 315


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp.
japonica GN=Os04g0650000 PE=1 SV=2
Length = 458

Score = 206 bits (523), Expect = 1e-52
Identities = 104/184 (56%), Positives = 128/184 (69%), Gaps = 2/184 (1%)
Frame = +2

Query: 107 ESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTC--SSES 280
ES+DWRT GAV +KDQG CGSCWAFSA A+EG N + TG+L+S+SE+ELV C S
Sbjct: 131 ESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNE 190

Query: 281 GCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSIDGYADVQPY 460
GC+GGLMD AF+++I+NGGI TED+YP Y G CD + K V+ID Y DV P
Sbjct: 191 GCNGGLMDYAFDFIINNGGIDTEDDYP---YKGKDERCDVN-RKNAKVVTIDSYEDVTPN 246

Query: 461 SEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSENGVD 640
SE +L +AVA QPVSVAIEA QLY G++ G C + ++HGV VGYG+ENG D
Sbjct: 247 SETSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGT---ALDHGVAAVGYGTENGKD 303

Query: 641 YWIV 652
YWIV
Sbjct: 304 YWIV 307


>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320
OS=Arabidopsis thaliana GN=At4g11320 PE=2 SV=1
Length = 371

Score = 206 bits (523), Expect = 1e-52
Identities = 94/193 (48%), Positives = 142/193 (73%), Gaps = 1/193 (0%)
Frame = +2

Query: 77 HKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEE 256
+K+ V +S+DWR GAVT VKDQG+C SCWAFS GA+EG N + TG LV++SE++
Sbjct: 136 YKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 195

Query: 257 LVTCSSE-SGCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSI 433
L+ C+ E +GC GG ++ A+E++++NGG+ T+++YP Y +G C+ +++E+ K V I
Sbjct: 196 LINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYP---YKALNGVCEGRLKEDNKNVMI 252

Query: 434 DGYADVQPYSEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLV 613
DGY ++ EAAL++AVA QPV+ +++SS + QLY GV++GTC + ++NHGV++V
Sbjct: 253 DGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGT---NLNHGVVVV 309

Query: 614 GYGSENGVDYWIV 652
GYG+ENG DYWIV
Sbjct: 310 GYGTENGRDYWIV 322


>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp.
japonica GN=Os04g0670200 PE=1 SV=2
Length = 466

Score = 205 bits (521), Expect = 2e-52
Identities = 102/185 (55%), Positives = 130/185 (70%), Gaps = 3/185 (1%)
Frame = +2

Query: 107 ESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTCSS---E 277
ES+DWR GAV PVK+QG CGSCWAFSA +E N + TG ++++SE+ELV CS+
Sbjct: 143 ESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQN 202

Query: 278 SGCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSIDGYADVQP 457
SGC+GGLMDDAF+++I NGGI TED+YPY + G CD E K VSIDG+ DV
Sbjct: 203 SGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGK---CDIN-RENAKVVSIDGFEDVPQ 258

Query: 458 YSEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSENGV 637
E +L +AVA QPVSVAIEA + QLY GV++G C + ++HGV+ VGYG++NG
Sbjct: 259 NDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGT---SLDHGVVAVGYGTDNGK 315

Query: 638 DYWIV 652
DYWIV
Sbjct: 316 DYWIV 320


>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310
OS=Arabidopsis thaliana GN=At4g11310 PE=2 SV=1
Length = 364

Score = 202 bits (514), Expect = 1e-51
Identities = 93/193 (48%), Positives = 139/193 (72%), Gaps = 1/193 (0%)
Frame = +2

Query: 77 HKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEE 256
+K+ V +S+DWR GAVT VKDQG C SCWAFS GA+EG N + TG LV++SE++
Sbjct: 129 YKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 188

Query: 257 LVTCSSE-SGCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSI 433
L+ C+ E +GC GG ++ A+E+++ NGG+ T+++YP Y +G CD +++E K V I
Sbjct: 189 LINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYP---YKAVNGVCDGRLKENNKNVMI 245

Query: 434 DGYADVQPYSEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLV 613
DGY ++ E+AL++AVA QPV+ I++SS + QLY GV++G+C + ++NHGV++V
Sbjct: 246 DGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGT---NLNHGVVVV 302

Query: 614 GYGSENGVDYWIV 652
GYG+ENG DYW+V
Sbjct: 303 GYGTENGRDYWLV 315


>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase
(Fragment) OS=Solanum lycopersicum PE=2 SV=1
Length = 346

Score = 201 bits (511), Expect = 3e-51
Identities = 101/184 (54%), Positives = 127/184 (69%), Gaps = 2/184 (1%)
Frame = +2

Query: 107 ESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTC--SSES 280
ES+DWR G + VKDQG CGSCWAFSA A+E NA+ TGNL+S+SE+ELV C S
Sbjct: 20 ESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSYNE 79

Query: 281 GCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSIDGYADVQPY 460
GCDGGLMD AFE+VI NGGI TE++YP Y +G CD + + K V ID Y DV
Sbjct: 80 GCDGGLMDYAFEFVIKNGGIDTEEDYP---YKERNGVCD-QYRKNAKVVKIDSYEDVPVN 135

Query: 461 SEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSENGVD 640
+E AL +AVA QPVS+A+EA D Q Y G++ G C + ++HGV++ GYG+ENG+D
Sbjct: 136 NEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGT---AVDHGVVIAGYGTENGMD 192

Query: 641 YWIV 652
YWIV
Sbjct: 193 YWIV 196


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1
OS=Arabidopsis thaliana GN=GCP1 PE=2 SV=2
Length = 376

Score = 199 bits (505), Expect = 1e-50
Identities = 98/184 (53%), Positives = 129/184 (70%), Gaps = 2/184 (1%)
Frame = +2

Query: 107 ESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTC--SSES 280
E++DWR GAV P+KDQG CGSCWAFS T A+EG N + TG L+S+SE+ELV C S
Sbjct: 147 ETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQ 206

Query: 281 GCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSIDGYADVQPY 460
GC+GGLMD AF++++ NGG+ TE +YP Y G G C++ + + + VSIDGY DV
Sbjct: 207 GCNGGLMDYAFQFIMKNGGLNTEKDYP---YRGFGGKCNSFL-KNSRVVSIDGYEDVPTK 262

Query: 461 SEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSENGVD 640
E AL +A++ QPVSVAIEA Q Y G++ G+C + +++H V+ VGYGSENGVD
Sbjct: 263 DETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGT---NLDHAVVAVGYGSENGVD 319

Query: 641 YWIV 652
YWIV
Sbjct: 320 YWIV 323


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400
OS=Arabidopsis thaliana GN=At3g19400 PE=2 SV=1
Length = 362

Score = 197 bits (501), Expect = 4e-50
Identities = 99/188 (52%), Positives = 126/188 (67%), Gaps = 3/188 (1%)
Frame = +2

Query: 98 VTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTCSS- 274
V + +DWR GAV VKDQG CGSCWAFSA GA+EG N + TG L+S+SE+ELV C
Sbjct: 129 VLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRG 188

Query: 275 --ESGCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSIDGYAD 448
+GCDGG+M+ AFE+++ NGGI T+ +YPY + G C+ + V+IDGY D
Sbjct: 189 FVNAGCDGGIMNYAFEFIMKNGGIETDQDYPY--NANDLGLCNADKNNNTRVVTIDGYED 246

Query: 449 VQPYSEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSE 628
V E +L +AVA QPVSVAIEASS QLY GV GTC ++HGV++VGYGS
Sbjct: 247 VPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVMTGTCG---ISLDHGVVVVGYGST 303

Query: 629 NGVDYWIV 652
+G DYWI+
Sbjct: 304 SGEDYWII 311


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis
thaliana GN=XCP1 PE=1 SV=1
Length = 355

Score = 197 bits (500), Expect = 5e-50
Identities = 108/221 (48%), Positives = 142/221 (64%), Gaps = 5/221 (2%)
Frame = +2

Query: 5 QSRRLGLKLPSVKLGSLRRR---SHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSC 175
+ R LGL P R+R ++F ++ T + +S+DWR GAV PVKDQG CGSC
Sbjct: 108 KGRYLGLAKPQFS----RKRQPSANFRYRDITDL--PKSVDWRKKGAVAPVKDQGQCGSC 161

Query: 176 WAFSATGAIEGANAVATGNLVSVSEEELVTCSS--ESGCDGGLMDDAFEWVIDNGGIATE 349
WAFS A+EG N + TGNL S+SE+EL+ C + SGC+GGLMD AF+++I GG+ E
Sbjct: 162 WAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKE 221

Query: 350 DNYPYLSYSGSSGACDTKIEEEEKAVSIDGYADVQPYSEAALLEAVAKQPVSVAIEASSW 529
D+YPYL G C + E+ E+ V+I GY DV + +L++A+A QPVSVAIEAS
Sbjct: 222 DDYPYLM---EEGICQEQKEDVER-VTISGYEDVPENDDESLVKALAHQPVSVAIEASGR 277

Query: 530 DLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSENGVDYWIV 652
D Q Y GV+NG C + D++HGV VGYGS G DY IV
Sbjct: 278 DFQFYKGGVFNGKCGT---DLDHGVAAVGYGSSKGSDYVIV 315


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362

Score = 194 bits (493), Expect = 3e-49
Identities = 101/184 (54%), Positives = 124/184 (67%), Gaps = 3/184 (1%)
Frame = +2

Query: 110 SLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTCSSE--SG 283
S+DWR GAVT VKDQG CGSCWAFS A+EG N + T LV++SE+ELV C E G
Sbjct: 131 SVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQG 190

Query: 284 CDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSIDGYADVQPYS 463
C+GGLM+ AFE++ GGI TE NYP Y G CD + + AVSIDG+ +V
Sbjct: 191 CNGGLMESAFEFIKQKGGITTESNYP---YKAQEGTCDAS-KVNDLAVSIDGHENVPAND 246

Query: 464 EAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSE-NGVD 640
E ALL+AVA QPVSVAI+A D Q Y EGV+ G CS+ D+NHGV +VGYG+ +G +
Sbjct: 247 EDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCST---DLNHGVAIVGYGTTVDGTN 303

Query: 641 YWIV 652
YWIV
Sbjct: 304 YWIV 307


tr_hit_id Q84Y03
Definition tr|Q84Y03|Q84Y03_GOSHI Cysteine protease OS=Gossypium hirsutum
Align length 182
Score (bit) 229.0
E-value 1.0e-58
Report
BLASTX 2.2.19 [Nov-02-2008]



Query= DK952021|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0012_P13, 5'
(653 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters




Sequences producing significant alignments: Score (bits) E value

tr|Q84Y03|Q84Y03_GOSHI Cysteine protease OS=Gossypium hirsutum P... 229 1e-58
tr|Q9SP93|Q9SP93_9ASTR Thiol protease OS=Matricaria chamomilla G... 226 7e-58
tr|A5HIJ5|A5HIJ5_ACTDE Cysteine protease Cp5 OS=Actinidia delici... 225 2e-57
tr|A9PHQ3|A9PHQ3_POPTR Putative uncharacterized protein OS=Popul... 224 5e-57
tr|Q7XYU8|Q7XYU8_ANTAD Cysteine protease OS=Anthurium andraeanum... 223 1e-56
tr|Q93XQ9|Q93XQ9_IPOBA Putative cysteine protease OS=Ipomoea bat... 219 9e-56
tr|A9P285|A9P285_PICSI Putative uncharacterized protein OS=Picea... 216 1e-54
tr|A9TY71|A9TY71_PHYPA Predicted protein OS=Physcomitrella paten... 214 4e-54
tr|Q9SX19|Q9SX19_MAIZE Cysteine protease Mir1 OS=Zea mays GN=mir... 211 4e-53
tr|A9RNZ6|A9RNZ6_PHYPA Predicted protein OS=Physcomitrella paten... 210 5e-53
tr|A5HIJ2|A5HIJ2_ACTDE Cysteine protease Cp2 OS=Actinidia delici... 210 5e-53
tr|O24323|O24323_PHAVU Cysteine proteinase OS=Phaseolus vulgaris... 210 7e-53
tr|A2PZE2|A2PZE2_IPONI Cysteine proteinase OS=Ipomoea nil GN=In2... 210 7e-53
tr|A1KXJ7|A1KXJ7_ELAGV Oil palm polygalacturonase allergen PEST4... 210 7e-53
tr|Q94BX1|Q94BX1_ARATH F2G19.31/F2G19.31 OS=Arabidopsis thaliana... 209 1e-52
tr|Q56XI5|Q56XI5_ARATH Cysteine proteinase RD21A OS=Arabidopsis ... 209 1e-52
tr|Q6F6A9|Q6F6A9_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 209 2e-52
tr|Q6F6A3|Q6F6A3_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 209 2e-52
tr|B4FS90|B4FS90_MAIZE Cysteine protease 1 OS=Zea mays PE=2 SV=1 209 2e-52
tr|B8LMA7|B8LMA7_PICSI Putative uncharacterized protein OS=Picea... 208 3e-52
tr|B6TMV9|B6TMV9_MAIZE Cysteine protease 1 OS=Zea mays PE=2 SV=1 207 3e-52
tr|Q0WXG7|Q0WXG7_WHEAT Triticain beta OS=Triticum aestivum PE=2 ... 207 5e-52
tr|Q155L4|Q155L4_HEVBR Cysteine protease OS=Hevea brasiliensis G... 207 6e-52
tr|B7XDE9|B7XDE9_RAPSA Daikon cysteine protease RD21 (Fragment) ... 207 6e-52
tr|A9S553|A9S553_PHYPA Predicted protein OS=Physcomitrella paten... 207 6e-52
tr|A6N8F8|A6N8F8_ELAGV Cysteine proteinase OS=Elaeis guineensis ... 206 8e-52
tr|Q0WYE6|Q0WYE6_WHEAT RD21A-like cysteine protease (Fragment) O... 206 1e-51
tr|Q52QX8|Q52QX8_MANES Cysteine protease CP1 OS=Manihot esculent... 206 1e-51
tr|Q0J9I6|Q0J9I6_ORYSJ Os04g0650000 protein OS=Oryza sativa subs... 206 1e-51
tr|B8AVK3|B8AVK3_ORYSI Putative uncharacterized protein OS=Oryza... 206 1e-51

>tr|Q84Y03|Q84Y03_GOSHI Cysteine protease OS=Gossypium hirsutum PE=2
SV=1
Length = 389

Score = 229 bits (583), Expect = 1e-58
Identities = 114/182 (62%), Positives = 142/182 (78%), Gaps = 1/182 (0%)
Frame = +2

Query: 110 SLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTC-SSESGC 286
SLDWR G VT VKDQG CGSCWAFS+TGA+EG NA+ TG+L+S+SE+ELV C +S GC
Sbjct: 143 SLDWRNYGVVTAVKDQGSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTSNYGC 202

Query: 287 DGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSIDGYADVQPYSE 466
+GG MD AFEWVI+NGGI +E +YP Y+G G C+T +EE K VSIDGY DV+ S+
Sbjct: 203 EGGYMDYAFEWVINNGGIDSESDYP---YTGVDGTCNT-TKEETKVVSIDGYQDVE-QSD 257

Query: 467 AALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSENGVDYW 646
+ALL AVA+QPVSV I+ S+ D QLY G+Y+G+CS DP DI+H VL+VGYGSE+ +YW
Sbjct: 258 SALLCAVAQQPVSVGIDGSAIDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYGSEDSEEYW 317

Query: 647 IV 652
IV
Sbjct: 318 IV 319


>tr|Q9SP93|Q9SP93_9ASTR Thiol protease OS=Matricaria chamomilla
GN=ctp PE=2 SV=1
Length = 501

Score = 226 bits (577), Expect = 7e-58
Identities = 112/217 (51%), Positives = 152/217 (70%), Gaps = 2/217 (0%)
Frame = +2

Query: 8 SRRLGLKLPSVKLGSLRRRSHFHHKS-ETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAF 184
S+ G + +K+G ++R ++ + P SLDWR G VTP+KDQG CGSCWAF
Sbjct: 115 SKVKGSRSNELKMGGVKRNMSVSSRTCDAPT----SLDWRDKGVVTPMKDQGQCGSCWAF 170

Query: 185 SATGAIEGANAVATGNLVSVSEEELVTCSS-ESGCDGGLMDDAFEWVIDNGGIATEDNYP 361
S +G+IE ANA+ATG+L+ +SE+ELV C + + GCDGG MD A+ W+I NGG+ +ED+YP
Sbjct: 171 SVSGSIESANAIATGDLIRLSEQELVDCDTYDYGCDGGNMDTAYRWIIKNGGLDSEDDYP 230

Query: 362 YLSYSGSSGACDTKIEEEEKAVSIDGYADVQPYSEAALLEAVAKQPVSVAIEASSWDLQL 541
Y S +G G CD K + + VS+D Y +V+ +E A+L AVA PV++ I S++D QL
Sbjct: 231 YTSSNGRDGKCD-KTKSAKSVVSLDSYVEVES-NEDAVLCAVATTPVTIGIVGSAYDFQL 288

Query: 542 YVEGVYNGTCSSDPYDINHGVLLVGYGSENGVDYWIV 652
Y GVYNG CSS PYDI+H VL+VGYGS++G DYWIV
Sbjct: 289 YTGGVYNGQCSSKPYDIDHAVLIVGYGSQDGKDYWIV 325


>tr|A5HIJ5|A5HIJ5_ACTDE Cysteine protease Cp5 OS=Actinidia deliciosa
PE=2 SV=1
Length = 509

Score = 225 bits (574), Expect = 2e-57
Identities = 117/214 (54%), Positives = 149/214 (69%), Gaps = 4/214 (1%)
Frame = +2

Query: 23 LKLPSVKLGSLRRRSHFHHKSETPMVTAE---SLDWRTLGAVTPVKDQGMCGSCWAFSAT 193
+K P+ K ++ RR + + + SLDWR G VT VKDQG CGSCWAFS+T
Sbjct: 118 VKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDCGSCWAFSST 177

Query: 194 GAIEGANAVATGNLVSVSEEELVTC-SSESGCDGGLMDDAFEWVIDNGGIATEDNYPYLS 370
GAIEG NA+A G+L+S+SE+ELV C S+ GC+GG MD AFEWV+ NGGI TE +YP
Sbjct: 178 GAIEGINALANGDLISLSEQELVDCDSTNDGCEGGYMDYAFEWVMSNGGIDTETDYP--- 234

Query: 371 YSGSSGACDTKIEEEEKAVSIDGYADVQPYSEAALLEAVAKQPVSVAIEASSWDLQLYVE 550
Y+G G C+T +EE KAVSIDGY DV E+AL AV KQP+SV I+ + D QLY
Sbjct: 235 YTGEDGTCNT-TKEETKAVSIDGYEDVAE-EESALFCAVLKQPISVGIDGGAIDFQLYTG 292

Query: 551 GVYNGTCSSDPYDINHGVLLVGYGSENGVDYWIV 652
G+Y+G CS DP DI+H VL+VGYG+E+G +YWI+
Sbjct: 293 GIYDGDCSDDPDDIDHAVLVVGYGAESGEEYWII 326


>tr|A9PHQ3|A9PHQ3_POPTR Putative uncharacterized protein OS=Populus
trichocarpa PE=2 SV=1
Length = 498

Score = 224 bits (570), Expect = 5e-57
Identities = 113/201 (56%), Positives = 145/201 (72%), Gaps = 3/201 (1%)
Frame = +2

Query: 59 RRSHFHHKS-ETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNL 235
+R H H ++ + P SLDWR G VT VKDQG CGSCW+FS TGAIE NA+ TG+L
Sbjct: 126 KRKHRHLQTCDAP----SSLDWRNKGVVTAVKDQGDCGSCWSFSTTGAIEAINAIVTGDL 181

Query: 236 VSVSEEELVTCSSES--GCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIE 409
+S+SE+ELV C + + GC+GG MD AF+WVI NGGI TE +YPY +G G C+T +
Sbjct: 182 ISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGIDTEADYPY---TGVDGTCNTA-K 237

Query: 410 EEEKAVSIDGYADVQPYSEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYD 589
EE+K VSI+GY DV P S++ALL A +QP+SV ++ S+ D QLY G+Y+G CS DP D
Sbjct: 238 EEKKVVSIEGYVDVDP-SDSALLCATVQQPISVGMDGSALDFQLYTGGIYDGDCSGDPND 296

Query: 590 INHGVLLVGYGSENGVDYWIV 652
I+H +L+VGYGSEN DYWIV
Sbjct: 297 IDHAILIVGYGSENDEDYWIV 317


>tr|Q7XYU8|Q7XYU8_ANTAD Cysteine protease OS=Anthurium andraeanum
PE=2 SV=1
Length = 502

Score = 223 bits (567), Expect = 1e-56
Identities = 113/182 (62%), Positives = 136/182 (74%), Gaps = 1/182 (0%)
Frame = +2

Query: 110 SLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTC-SSESGC 286
SLDWR GAVT VK+QG CGSCWAFS+TGA+EG NA+ TG L+S+SE+ELV C ++ GC
Sbjct: 149 SLDWRKRGAVTAVKNQGDCGSCWAFSSTGAMEGINAITTGELISLSEQELVDCDTTNEGC 208

Query: 287 DGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSIDGYADVQPYSE 466
DGG MD AFEWVI+NGGI +E NYPY + S C+T +EE K VSIDGY DV SE
Sbjct: 209 DGGYMDYAFEWVINNGGIDSEANYPYTGQADS--VCNT-TKEEIKVVSIDGYEDVAT-SE 264

Query: 467 AALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSENGVDYW 646
+ALL A +QPVSV I+ SS D QLY G+Y+G CS +P DI+H VL+VGYG + G DYW
Sbjct: 265 SALLCAAVQQPVSVGIDGSSLDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYGQQGGTDYW 324

Query: 647 IV 652
IV
Sbjct: 325 IV 326


>tr|Q93XQ9|Q93XQ9_IPOBA Putative cysteine protease OS=Ipomoea
batatas PE=2 SV=1
Length = 462

Score = 219 bits (559), Expect = 9e-56
Identities = 111/184 (60%), Positives = 132/184 (71%), Gaps = 2/184 (1%)
Frame = +2

Query: 107 ESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTC--SSES 280
+S+DWR GAV VKDQG CGSCWAFS A+EG N + TG L+S+SE+ELV C S
Sbjct: 141 DSIDWREKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNE 200

Query: 281 GCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSIDGYADVQPY 460
GC+GGLMD AFE++I NGGI TE +YP Y+G G CD + + K VSIDGY DV PY
Sbjct: 201 GCNGGLMDYAFEFIIKNGGIDTEADYP---YTGRYGRCD-QTRKNAKVVSIDGYEDVTPY 256

Query: 461 SEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSENGVD 640
EAAL EAVA QPVSVAIEA D QLY G++ G+C + D++HGV VGYG+ENGVD
Sbjct: 257 DEAALKEAVAGQPVSVAIEAGGRDFQLYSSGIFTGSCGT---DLDHGVTAVGYGTENGVD 313

Query: 641 YWIV 652
YWIV
Sbjct: 314 YWIV 317


>tr|A9P285|A9P285_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 367

Score = 216 bits (550), Expect = 1e-54
Identities = 110/184 (59%), Positives = 133/184 (72%), Gaps = 1/184 (0%)
Frame = +2

Query: 104 AESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTCSSES- 280
A SLDWR GAVT VKDQ CGSCWAFS TGAIEG N ++TG LVS+SE+ELV C + +
Sbjct: 143 ASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACDATNY 202

Query: 281 GCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSIDGYADVQPY 460
GC+GG MD AF WVI NGGI TE +Y SY+G C+T +E +K VSIDGY DV P
Sbjct: 203 GCEGGDMDYAFTWVIQNGGIDTEKDY---SYTGVDSTCNTN-KEAKKIVSIDGYTDVSP- 257

Query: 461 SEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSENGVD 640
++ALL A QPVSV I+ S+ D QLY G+Y+G CS +P DI+H VL+VGY ++NG D
Sbjct: 258 DDSALLCAAGSQPVSVGIDGSAIDFQLYTGGIYDGDCSGNPDDIDHAVLVVGYSAKNGKD 317

Query: 641 YWIV 652
YWIV
Sbjct: 318 YWIV 321


>tr|A9TY71|A9TY71_PHYPA Predicted protein OS=Physcomitrella patens
subsp. patens GN=PHYPADRAFT_63513 PE=3 SV=1
Length = 461

Score = 214 bits (545), Expect = 4e-54
Identities = 114/202 (56%), Positives = 140/202 (69%), Gaps = 3/202 (1%)
Frame = +2

Query: 56 RRRSHFHHK-SETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGN 232
+RR+ F + SE P ES+DWR GAVT VKDQG CGSCWAFSA G++EG NA+ G
Sbjct: 125 KRRTGFRYADSEAP----ESVDWRKNGAVTSVKDQGSCGSCWAFSAVGSVEGINAIRNGE 180

Query: 233 LVSVSEEELVTCSSE--SGCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKI 406
VS+SE+ELV C E GC+GGLMD AF+++I NGGI TE +YP Y G G CD
Sbjct: 181 AVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGIDTEKDYP---YKGFDGRCDNS- 236

Query: 407 EEEEKAVSIDGYADVQPYSEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPY 586
++ V+IDGY DV E AL +AVA QPVSVAIEA D QLY +GV++G C +
Sbjct: 237 KKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYAQGVFSGECGT--- 293

Query: 587 DINHGVLLVGYGSENGVDYWIV 652
D++HGVL VGYG+E+GVDYWIV
Sbjct: 294 DLDHGVLAVGYGTEDGVDYWIV 315


>tr|Q9SX19|Q9SX19_MAIZE Cysteine protease Mir1 OS=Zea mays GN=mir1
PE=2 SV=1
Length = 398

Score = 211 bits (536), Expect = 4e-53
Identities = 104/183 (56%), Positives = 132/183 (72%), Gaps = 1/183 (0%)
Frame = +2

Query: 107 ESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTC-SSESG 283
+++DWR LGAVT VKDQ CG CWAFSA AIEG NA+ATGNLVS+SE+E++ C + +SG
Sbjct: 159 DAIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQDSG 218

Query: 284 CDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKIEEEEKAVSIDGYADVQPYS 463
CDGG M++AF +VI NGGI TE +YP++ G+ G CD E+ EK +IDG +V +
Sbjct: 219 CDGGQMENAFRFVIGNGGIDTEADYPFI---GTDGTCDASKEKNEKVATIDGLVEVASNN 275

Query: 464 EAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPYDINHGVLLVGYGSENGVDY 643
E AL EAVA QPVSVAI+AS Q Y G++NG C + ++HGV VGYGSE+G DY
Sbjct: 276 ETALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGT---SLDHGVTAVGYGSESGKDY 332

Query: 644 WIV 652
WIV
Sbjct: 333 WIV 335


>tr|A9RNZ6|A9RNZ6_PHYPA Predicted protein OS=Physcomitrella patens
subsp. patens GN=PHYPADRAFT_204314 PE=3 SV=1
Length = 454

Score = 210 bits (535), Expect = 5e-53
Identities = 112/202 (55%), Positives = 139/202 (68%), Gaps = 3/202 (1%)
Frame = +2

Query: 56 RRRSHFHHK-SETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGN 232
+R++ F + SE P ES+DWR GAVT VKDQG CGSCWAFSA G++EG NA+ TG
Sbjct: 118 KRKTGFRYADSEAP----ESVDWRKKGAVTTVKDQGSCGSCWAFSAIGSVEGINAIRTGE 173

Query: 233 LVSVSEEELVTCSSE--SGCDGGLMDDAFEWVIDNGGIATEDNYPYLSYSGSSGACDTKI 406
VS+SE+ELV C E GC+GGLMD AF+++++NGGI TE++YP Y G G CD
Sbjct: 174 AVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGIDTENDYP---YKGLDGRCDNN- 229

Query: 407 EEEEKAVSIDGYADVQPYSEAALLEAVAKQPVSVAIEASSWDLQLYVEGVYNGTCSSDPY 586
++ V+IDGY DV E AL +AVA QPVSVAIEA D QLY GV+ G C +
Sbjct: 230 KKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGVFTGECGT--- 286

Query: 587 DINHGVLLVGYGSENGVDYWIV 652
D++HGVL VGYGSE +DYWIV
Sbjct: 287 DLDHGVLAVGYGSEGSLDYWIV 308