DK962477
Clone id TST39A01NGRL0013_P18
Library
Length 652
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0013_P18. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID
Sequence
CAGCTTTACATAAGACCGGGTCCTCCGTCTGCCCTGCTCTCTGTTGCTTCTACCACCATG
GCCACCTGACAGTGAAAGACCTACTCGTTACTGACAGGCCAAGCTCGCCATGGCAGCTCC
AACCCTGGAGCTCCTCCTCTTTTCCTTCCTTCTCCTCCTCTCTGGGCTCGCTTCTGGCGA
TGTCCTCCAATATTCCGAAGAGGATTTACTCTCTGAAAGCCGCCTCTCTACCCTCTTCGA
CGCCTGGAACGCCAAGCATGGCAAACATTACCCCGCCCTCGGCTCCCCCCAAAAGGAGAA
GCGCTTTGAAATCTTCAAAGAGAATCTTGCCCATATCCAGCAGCACAACAGCAAGGGCAC
CTCCTCTTACACACTGGGCCTCACCCGCTTTGCAGATCTCACTAATGAGGAGTTCAAAGC
GCTTAATTACTTCGGAGTGCGGCCTGTCCTCTCTACAAGGAACCACAGCTCTTTAGCAGC
TGCTAATCACAGGCGCAGAGTACACTCGTGCGATTCAAGCGATTTGGATACTGCCTTCGA
TTGGCGTGATGAGGATGCTGTGGAGAGCGTCAAGGATCAAGGCAGCTGCGGTAGCTGCTG
GGCTTTTGCGGCTGTGGGGGCCATAGAGAGTGCCAATGCCATATCTATGGGC
Homology search results
sp_hit_id P43297
Definition sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana
Align length 139
Score (bit) 118.0
E-value 2.0e-26
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK962477|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0013_P18, 5'
(652 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters




Sequences producing significant alignments: Score (bits) E-value

sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 118 2e-26
sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis ... 117 7e-26
sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=A... 114 6e-25
sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. ... 110 6e-24
sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=A... 106 1e-22
sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2 105 3e-22
sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=A... 104 5e-22
sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1 103 1e-21
sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis ... 102 2e-21
sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=A... 101 4e-21
sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4 100 9e-21
sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS... 99 1e-20
sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. j... 99 2e-20
sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1 99 2e-20
sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Br... 96 2e-19
sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium dis... 95 3e-19
sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 ... 94 6e-19
sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2 94 8e-19
sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2 92 2e-18
sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1 92 2e-18
sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. ... 92 3e-18
sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp.... 91 5e-18
sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis t... 91 7e-18
sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium dis... 89 2e-17
sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 ... 89 3e-17
sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. G... 88 4e-17
sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulg... 88 4e-17
sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max P... 87 6e-17
sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Dro... 87 8e-17
sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2... 87 1e-16
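
Each line of the hit list above ends with the bit score and the E-value, preceded by a `db|accession|entry_name` identifier and a truncated description. A minimal Python sketch for pulling those fields apart follows. It assumes the whitespace-separated layout seen in this record, and the function name `parse_hit_line` is illustrative, not part of BLAST or any library.

```python
def parse_hit_line(line):
    """Split one summary line of the hit list into its fields.

    Assumes the layout seen in this record:
    'db|accession|entry_name description ... score evalue'
    """
    # The last two whitespace-separated tokens are the bit score and E-value.
    head, score, evalue = line.rsplit(None, 2)
    # The first token is the identifier; the rest is the description.
    identifier, _, description = head.partition(" ")
    db, accession, entry_name = identifier.split("|")
    return {
        "db": db,
        "accession": accession,
        "entry_name": entry_name,
        "description": description,
        "bit_score": float(score),
        "evalue": float(evalue),
    }


hit = parse_hit_line(
    "sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 118 2e-26"
)
print(hit["accession"], hit["bit_score"], hit["evalue"])
```

Splitting from the right keeps the parser independent of how many words the description contains.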

>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis
thaliana GN=RD21A PE=1 SV=1
Length = 462

Score = 118 bits (296), Expect = 2e-26
Identities = 61/139 (43%), Positives = 82/139 (58%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
++AW KHGK +K++RFEIFK+NL + +HN K S Y LGLTRFADLTN+E+
Sbjct: 50 YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLS-YRLGLTRFADLTNDEY 108

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
++ Y G + +SL R + +L + DWR + AV VKDQG CGS
Sbjct: 109 RS-KYLGAKMEKKGERRTSL-------RYEARVGDELPESIDWRKKGAVAEVKDQGGCGS 160

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+ +GA+E N I G
Sbjct: 161 CWAFSTIGAVEGINQIVTG 179
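
The alignment above is reported in frame +2 and starts at query nucleotide 236. The following Python sketch shows how the frame follows from the start coordinate and how the first aligned residues can be reproduced from the nucleotide sequence in this record. It assumes the standard genetic code; the names `forward_frame` and `translate` are illustrative helpers, not BLAST functions.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def forward_frame(start):
    """Forward reading frame (1, 2 or 3) of a 1-based nucleotide start."""
    return (start - 1) % 3 + 1


def translate(nt):
    """Translate complete codons only; unknown codons become 'X'."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, usable, 3))


# Query nucleotides 236-262, copied from the Sequence field of this record.
segment = "TTCGACGCCTGGAACGCCAAGCATGGC"
print(forward_frame(236))   # 2
print(translate(segment))   # FDAWNAKHG
```

The aligned query range 236 to 652 spans 417 nucleotides, that is 139 codons, which agrees with the alignment length of 139 reported for this hit.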


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis
thaliana GN=XCP1 PE=1 SV=1
Length = 355

Score = 117 bits (292), Expect = 7e-26
Identities = 66/140 (47%), Positives = 88/140 (62%), Gaps = 1/140 (0%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
F++W ++H K Y ++ +K RFE+F+ENL HI Q N++ +SY LGL FADLT+EEF
Sbjct: 51 FESWMSEHSKAYKSV--EEKVHRFEVFRENLMHIDQRNNE-INSYWLGLNEFADLTHEEF 107

Query: 416 KALNYFGV-RPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCG 592
K Y G+ +P S + S AN R R D +DL + DWR + AV VKDQG CG
Sbjct: 108 KG-RYLGLAKPQFSRKRQPS---ANFRYR----DITDLPKSVDWRKKGAVAPVKDQGQCG 159

Query: 593 SCWAFAAVGAIESANAISMG 652
SCWAF+ V A+E N I+ G
Sbjct: 160 SCWAFSTVAAVEGINQITTG 179
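
The percentages printed next to the Identities, Positives and Gaps counts in these reports appear to be truncated rather than rounded: 88/140 is about 62.9% but is shown as 62%, and 82/139 in the first hit is about 59.0% but is shown as 58%. A small Python sketch of that arithmetic, assuming integer truncation (an inference from the numbers in this record, not a statement about BLAST internals):

```python
def blast_percent(count, align_length):
    """Integer percentage, truncated toward zero."""
    return count * 100 // align_length


# Values from the XCP1_ARATH alignment above (140 aligned columns).
print(blast_percent(66, 140))  # identities
print(blast_percent(88, 140))  # positives
print(blast_percent(1, 140))   # gaps
```

When comparing hits, the raw counts are therefore more precise than the printed percentages.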


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400
OS=Arabidopsis thaliana GN=At3g19400 PE=2 SV=1
Length = 362

Score = 114 bits (284), Expect = 6e-25
Identities = 60/139 (43%), Positives = 86/139 (61%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
++ W ++ K+Y LG +KE+RF+IFK+NL + +HNS ++ +GLTRFADLTNEEF
Sbjct: 44 YEQWLVENRKNYNGLG--EKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEF 101

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
+A+ L + + + R ++ + L DWR AV SVKDQG+CGS
Sbjct: 102 RAI-------YLRKKMERTKDSVKTERYLYK-EGDVLPDEVDWRANGAVVSVKDQGNCGS 153

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+AVGA+E N I+ G
Sbjct: 154 CWAFSAVGAVEGINQITTG 172


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp.
japonica GN=Os04g0650000 PE=1 SV=2
Length = 458

Score = 110 bits (275), Expect = 6e-24
Identities = 59/139 (42%), Positives = 82/139 (58%), Gaps = 3/139 (2%)
Frame = +2

Query: 245 WNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSK---GTSSYTLGLTRFADLTNEEF 415
W A+HGK Y A+G ++E+R+ F++NL +I +HN+ G S+ LGL RFADLTNEE+
Sbjct: 43 WKAEHGKSYNAVG--EEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEY 100

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
+ Y G+R S R + D+ L + DWR + AV +KDQG CGS
Sbjct: 101 RD-TYLGLRNKPRRERKVS-------DRYLAADNEALPESVDWRTKGAVAEIKDQGGCGS 152

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+A+ A+E N I G
Sbjct: 153 CWAFSAIAAVEGINQIVTG 171


>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960
OS=Arabidopsis thaliana GN=At3g43960 PE=2 SV=1
Length = 376

Score = 106 bits (264), Expect = 1e-22
Identities = 60/140 (42%), Positives = 85/140 (60%), Gaps = 1/140 (0%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
++ W ++GK+Y LG +KE+RF+IFK+NL I++HNS SY GL +F+DLT +EF
Sbjct: 41 YEQWLVENGKNYNGLG--EKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEF 98

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDA-VESVKDQGSCG 592
+A +Y G + + ++ S +A R + L DWR+ A V VK QG CG
Sbjct: 99 QA-SYLGGK--MEKKSLSDVA-----ERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECG 150

Query: 593 SCWAFAAVGAIESANAISMG 652
SCWAFAA GA+E N I+ G
Sbjct: 151 SCWAFAATGAVEGINQITTG 170


>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345

Score = 105 bits (261), Expect = 3e-22
Identities = 55/139 (39%), Positives = 81/139 (58%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
F+ W A++G+ Y + +K RF+IFK N+ HI+ N++ +SYTLG+ +F D+TN EF
Sbjct: 37 FEEWMAEYGRVYK--DNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNEF 94

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
A Y G+ L+ + ++ D S + + DWRD AV SVK+QG CGS
Sbjct: 95 VA-QYTGLSLPLNIKREPVVS-------FDDVDISSVPQSIDWRDSGAVTSVKNQGRCGS 146

Query: 596 CWAFAAVGAIESANAISMG 652
CWAFA++ +ES I G
Sbjct: 147 CWAFASIATVESIYKIKRG 165


>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320
OS=Arabidopsis thaliana GN=At4g11320 PE=2 SV=1
Length = 371

Score = 104 bits (259), Expect = 5e-22
Identities = 60/139 (43%), Positives = 82/139 (58%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
F++W KHGK Y ++ +KE+R IF++NL I N++ S Y LGL RFADL+ E+
Sbjct: 56 FESWMVKHGKVYDSVA--EKERRLTIFEDNLRFITNRNAENLS-YRLGLNRFADLSLHEY 112

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
+ + G P RNH + ++N R + D L + DWR+E AV VKDQG C S
Sbjct: 113 GEICH-GADP-RPPRNHVFMTSSN---RYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRS 167

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+ VGA+E N I G
Sbjct: 168 CWAFSTVGAVEGLNKIVTG 186


>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380

Score = 103 bits (256), Expect = 1e-21
Identities = 55/139 (39%), Positives = 81/139 (58%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
+++W K+GK Y +LG + E+RFEIFKE L I +HN+ SY +GL +FADLT+EEF
Sbjct: 42 YESWLIKYGKSYNSLG--EWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEF 99

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
++ Y G ++ ++ + + + RV L + DWR AV +K QG CG
Sbjct: 100 RS-TYLG----FTSGSNKTKVSNRYEPRV----GQVLPSYVDWRSAGAVVDIKSQGECGG 150

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+A+ +E N I G
Sbjct: 151 CWAFSAIATVEGINKIVTG 169


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis
thaliana GN=XCP2 PE=1 SV=2
Length = 356

Score = 102 bits (254), Expect = 2e-21
Identities = 60/139 (43%), Positives = 79/139 (56%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
F+ W + K Y + +K RFE+FK+NL HI + N KG SY LGL FADL++EEF
Sbjct: 51 FENWISNFEKAYETV--EEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLSHEEF 107

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
K + Y G++ + R+ A R V + S DWR + AV VK+QGSCGS
Sbjct: 108 KKM-YLGLKTDIVRRDEERSYAEFAYRDVEAVPKS-----VDWRKKGAVAEVKNQGSCGS 161

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+ V A+E N I G
Sbjct: 162 CWAFSTVAAVEGINKIVTG 180


>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310
OS=Arabidopsis thaliana GN=At4g11310 PE=2 SV=1
Length = 364

Score = 101 bits (251), Expect = 4e-21
Identities = 59/139 (42%), Positives = 82/139 (58%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
F++W KHGK Y ++ +KE+R IF++NL I N++ S Y LGLT FADL+ E+
Sbjct: 49 FESWMVKHGKVYGSVA--EKERRLTIFEDNLRFINNRNAENLS-YRLGLTGFADLSLHEY 105

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
K + + G P RNH + +++ R + L + DWR+E AV VKDQG C S
Sbjct: 106 KEVCH-GADP-RPPRNHVFMTSSD---RYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRS 160

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+ VGA+E N I G
Sbjct: 161 CWAFSTVGAVEGLNKIVTG 179


tr_hit_id Q6F6A6
Definition tr|Q6F6A6|Q6F6A6_DAUCA Cysteine protease OS=Daucus carota
Align length 139
Score (bit) 130.0
E-value 5.0e-29
Report
BLASTX 2.2.19 [Nov-02-2008]


Query= DK962477|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0013_P18, 5'
(652 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters




Sequences producing significant alignments: Score (bits) E-value

tr|Q6F6A6|Q6F6A6_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 130 5e-29
tr|Q93XQ9|Q93XQ9_IPOBA Putative cysteine protease OS=Ipomoea bat... 128 3e-28
tr|A6N8F9|A6N8F9_ELAGV Cysteine proteinase OS=Elaeis guineensis ... 128 3e-28
tr|Q41064|Q41064_PEA Thiolprotease OS=Pisum sativum GN=tpp PE=2 ... 128 3e-28
tr|A9PFF7|A9PFF7_POPTR Putative uncharacterized protein OS=Popul... 127 5e-28
tr|Q9ST61|Q9ST61_SOLTU Cysteine protease OS=Solanum tuberosum GN... 126 1e-27
tr|O49877|O49877_SOLLC CYP1 (Cysteine protease TDI-65) OS=Solanu... 125 3e-27
tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea... 125 3e-27
tr|A6N8F8|A6N8F8_ELAGV Cysteine proteinase OS=Elaeis guineensis ... 124 5e-27
tr|Q84M29|Q84M29_HELAN Cysteine protease-1 OS=Helianthus annuus ... 123 9e-27
tr|Q8W182|Q8W182_BRAOL Senescence-associated cysteine protease (... 123 1e-26
tr|A5HIJ2|A5HIJ2_ACTDE Cysteine protease Cp2 OS=Actinidia delici... 123 1e-26
tr|A5B6Y2|A5B6Y2_VITVI Putative uncharacterized protein OS=Vitis... 123 1e-26
tr|A1KXJ7|A1KXJ7_ELAGV Oil palm polygalacturonase allergen PEST4... 123 1e-26
tr|Q6F6A3|Q6F6A3_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 122 1e-26
tr|Q2AAC8|Q2AAC8_9ASTR Cysteine proteinase OS=Platycodon grandif... 122 2e-26
tr|Q7XYU7|Q7XYU7_ANTAD Senescence-associated cysteine protease O... 122 2e-26
tr|A5HIJ6|A5HIJ6_ACTDE Cysteine protease Cp6 OS=Actinidia delici... 121 3e-26
tr|O24137|O24137_TOBAC Cysteine proteinase OS=Nicotiana tabacum ... 121 4e-26
tr|Q155L4|Q155L4_HEVBR Cysteine protease OS=Hevea brasiliensis G... 120 6e-26
tr|A9NV34|A9NV34_PICSI Putative uncharacterized protein OS=Picea... 120 7e-26
tr|Q9FMH8|Q9FMH8_ARATH Cysteine protease component of protease-i... 120 9e-26
tr|Q84YH7|Q84YH7_TOBAC CPR1-like cysteine proteinase OS=Nicotian... 120 9e-26
tr|Q5K4K7|Q5K4K7_GOSHI Cysteine proteinase OS=Gossypium hirsutum... 120 9e-26
tr|B2LSD2|B2LSD2_MUCPR Mucunain OS=Mucuna pruriens PE=2 SV=1 120 9e-26
tr|Q6F6A9|Q6F6A9_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 119 1e-25
tr|A9TQW9|A9TQW9_PHYPA Predicted protein OS=Physcomitrella paten... 119 2e-25
tr|Q94BX1|Q94BX1_ARATH F2G19.31/F2G19.31 OS=Arabidopsis thaliana... 118 3e-25
tr|Q8W181|Q8W181_BRAOL Senescence-associated cysteine protease O... 118 3e-25
tr|Q56XI5|Q56XI5_ARATH Cysteine proteinase RD21A OS=Arabidopsis ... 118 3e-25

>tr|Q6F6A6|Q6F6A6_DAUCA Cysteine protease OS=Daucus carota
GN=DcCysP5 PE=2 SV=1
Length = 437

Score = 130 bits (328), Expect = 5e-29
Identities = 66/139 (47%), Positives = 89/139 (64%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
+++W KHGK Y ALG +KE RF+IFK+NL +I HN+ SY LGL RFADLTNEE+
Sbjct: 49 YNSWLVKHGKSYNALG--EKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADLTNEEY 106

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
+A Y G + +R + R + +L + DWR++ AV +VKDQGSCGS
Sbjct: 107 RA-KYLGTK----SRESRPKLSKGPSDRYAPVEGEELPDSIDWREKGAVAAVKDQGSCGS 161

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+A+GA+E N I+ G
Sbjct: 162 CWAFSAIGAVEGINQITTG 180


>tr|Q93XQ9|Q93XQ9_IPOBA Putative cysteine protease OS=Ipomoea
batatas PE=2 SV=1
Length = 462

Score = 128 bits (322), Expect = 3e-28
Identities = 65/139 (46%), Positives = 86/139 (61%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
+++W +HGK Y LG +K+KRFEIFK+NL +I + NS+G SY LGL RFADLTNEE+
Sbjct: 49 YESWLVEHGKSYNGLGG-EKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNRFADLTNEEY 107

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
++ Y G + T +A RR L + DWR++ AV VKDQGSCGS
Sbjct: 108 RS-TYLGAK----TDARRRIAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKDQGSCGS 162

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+ + A+E N I G
Sbjct: 163 CWAFSTIAAVEGINQIVTG 181


>tr|A6N8F9|A6N8F9_ELAGV Cysteine proteinase OS=Elaeis guineensis
var. tenera GN=CPRF PE=2 SV=1
Length = 469

Score = 128 bits (322), Expect = 3e-28
Identities = 71/142 (50%), Positives = 90/142 (63%), Gaps = 3/142 (2%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSK---GTSSYTLGLTRFADLTN 406
+ AW A+H + Y AL + E+R EIF++NL I QHN+ G S+ LGLTRFADLTN
Sbjct: 47 YQAWKAQHARSYNALD--EDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRFADLTN 104

Query: 407 EEFKALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGS 586
EE+++ Y GVR S R +S +N R R S D DL + DWRD+ AV VKDQGS
Sbjct: 105 EEYRS-TYLGVRTAGSRRRRNSTVGSN-RYRFRSSD--DLPDSIDWRDKGAVVDVKDQGS 160

Query: 587 CGSCWAFAAVGAIESANAISMG 652
CGSCWAF+ + A+E N I G
Sbjct: 161 CGSCWAFSTIAAVEGINHIVTG 182


>tr|Q41064|Q41064_PEA Thiolprotease OS=Pisum sativum GN=tpp PE=2
SV=2
Length = 464

Score = 128 bits (321), Expect = 3e-28
Identities = 69/139 (49%), Positives = 88/139 (63%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
++ W KHGK+Y ALG +KEKRFEIFK+NL I +HNSK S + LGL RFADLTNEE+
Sbjct: 47 YEEWLVKHGKNYNALG--EKEKRFEIFKDNLGFIDEHNSKNLS-FRLGLNRFADLTNEEY 103

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
+ + G R + RN + N R + L + DWR E AV VKDQGSCGS
Sbjct: 104 RT-RFLGTRINPNRRNRKVNSQTN---RYATRVGDKLPESVDWRKEGAVVGVKDQGSCGS 159

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+A+ A+E N ++ G
Sbjct: 160 CWAFSAIAAVEGVNKLATG 178


>tr|A9PFF7|A9PFF7_POPTR Putative uncharacterized protein OS=Populus
trichocarpa PE=2 SV=1
Length = 465

Score = 127 bits (320), Expect = 5e-28
Identities = 71/139 (51%), Positives = 88/139 (63%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
++ W KHGK+Y ALG +KEKRFEIFK+NL I QHNS+ + YT+GL RFADLTNEEF
Sbjct: 51 YEEWLVKHGKNYNALG--EKEKRFEIFKDNLMFIDQHNSENRT-YTVGLNRFADLTNEEF 107

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
+++ Y G R T + L + R DS L + DWR E AV VKDQG CGS
Sbjct: 108 RSM-YLGTR----TGHKKRLPKTSDRYAPRVGDS--LPDSVDWRKEGAVAEVKDQGGCGS 160

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+ + A+E N I G
Sbjct: 161 CWAFSTIAAVEGINKIVTG 179


>tr|Q9ST61|Q9ST61_SOLTU Cysteine protease OS=Solanum tuberosum
GN=cyp PE=2 SV=1
Length = 466

Score = 126 bits (317), Expect = 1e-27
Identities = 66/139 (47%), Positives = 90/139 (64%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
+++W +HGK Y ALG +K+KRF+IFK+NL +I + NS SY LGLT+FADLTNEE+
Sbjct: 49 YESWLIEHGKSYNALG--EKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEY 106

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
+++ Y G + R S + + +V L + DWRD+ + VKDQGSCGS
Sbjct: 107 RSI-YLGTKSSGDRRKLSKNKSDRYLPKV----GDSLPESVDWRDKGVLVGVKDQGSCGS 161

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+AV A+ES NAI G
Sbjct: 162 CWAFSAVAAMESINAIVTG 180


>tr|O49877|O49877_SOLLC CYP1 (Cysteine protease TDI-65) OS=Solanum
lycopersicum GN=C14 PE=2 SV=1
Length = 466

Score = 125 bits (313), Expect = 3e-27
Identities = 66/139 (47%), Positives = 92/139 (66%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
+++W +HGK Y ALG +K+KRF+IFK+NL +I + NS SY LGLT+FADLTNEE+
Sbjct: 49 YESWLIEHGKSYNALG--EKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEY 106

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
+++ Y G + S+ + L+ R + S L + DWR++ + VKDQGSCGS
Sbjct: 107 RSI-YLGTK---SSGDRKKLSKNKSDRYLPKVGDS-LPESIDWREKGVLVGVKDQGSCGS 161

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+AV A+ES NAI G
Sbjct: 162 CWAFSAVAAMESINAIVTG 180


>tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 463

Score = 125 bits (313), Expect = 3e-27
Identities = 65/139 (46%), Positives = 88/139 (63%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
++ W A+H K Y LG +K+ RF +FK+N +I QHN++G SY LGL +FADL++EEF
Sbjct: 44 YELWLAQHKKAYNGLG--EKQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEF 101

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
KA Y G + L T+ S + + R D DL + DWR++ AV +VKDQGSCGS
Sbjct: 102 KA-TYLGAK--LDTKKRLSNSPSP---RYQYSDGEDLPESIDWREKGAVTAVKDQGSCGS 155

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+ V A+E N I G
Sbjct: 156 CWAFSTVAAVEGINQIVTG 174


>tr|A6N8F8|A6N8F8_ELAGV Cysteine proteinase OS=Elaeis guineensis
var. tenera GN=CPRZ PE=2 SV=1
Length = 470

Score = 124 bits (311), Expect = 5e-27
Identities = 67/142 (47%), Positives = 89/142 (62%), Gaps = 3/142 (2%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSK---GTSSYTLGLTRFADLTN 406
++ W AKHG+ Y ALG +KE+RFEIFK+N+ I HN+ G S+ LGL RFAD+TN
Sbjct: 50 YEGWLAKHGRAYNALG--EKERRFEIFKDNVLFIDAHNAAADAGHRSFRLGLNRFADMTN 107

Query: 407 EEFKALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGS 586
EE++A+ Y G RP R + + + +R DL + DWR + AV +VKDQGS
Sbjct: 108 EEYRAV-YLGTRPA-GHRRRARVGSDRYRYNA----GEDLPESVDWRAKGAVAAVKDQGS 161

Query: 587 CGSCWAFAAVGAIESANAISMG 652
CGSCWAF+ V A+E N I G
Sbjct: 162 CGSCWAFSTVAAVEGINKIVTG 183


>tr|Q84M29|Q84M29_HELAN Cysteine protease-1 OS=Helianthus annuus
GN=scp1 PE=2 SV=1
Length = 461

Score = 123 bits (309), Expect = 9e-27
Identities = 62/139 (44%), Positives = 88/139 (63%)
Frame = +2

Query: 236 FDAWNAKHGKHYPALGSPQKEKRFEIFKENLAHIQQHNSKGTSSYTLGLTRFADLTNEEF 415
+++W KHGK Y ALG +K++RF+IFK+NL I +HNS G +Y LGL +FADLTNEE+
Sbjct: 52 YESWLVKHGKTYNALG--EKDRRFQIFKDNLRFIDEHNS-GDHTYKLGLNKFADLTNEEY 108

Query: 416 KALNYFGVRPVLSTRNHSSLAAANHRRRVHSCDSSDLDTAFDWRDEDAVESVKDQGSCGS 595
+ + Y G++ + + S + + + R L DWR++ AV VKDQGSCGS
Sbjct: 109 R-MTYTGIKTIDDKKKLSKMKSDRYAYR----SGDSLPEYVDWREQGAVTDVKDQGSCGS 163

Query: 596 CWAFAAVGAIESANAISMG 652
CWAF+ G++E N I G
Sbjct: 164 CWAFSTTGSVEGVNKIVTG 182