DK953689 |
Clone id |
TST39A01NGRL0018_H03 |
Library |
TST39 |
Length |
613 |
Definition |
Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0018_H03. 5' end sequence. |
Accession |
DK953689 |
Tissue type |
prothallia with plantlets |
Developmental stage |
gametophytes with sporophytes |
Contig ID |
- |
Sequence |
CATAGCCCAAGGAAGAAGAAATCTGTATCCCTGCCTTAATGGCTACCTCCTCCGTCATAT GGGCCCTCTTGGGCGTTTTCGCCCTCTTCTTGACACTCTCCTCTGCCGATAACTCTCATG TGTTAAGCTACTCCCCTTCCGATCTCCACTCCCAAGCCAAGCTTGTTAGCCTCTTTGACG CATGGAACATGAAGCATGGAAAACATTACACAGCCGCCCAGAAGTTAGAGAAGCACAAGA GATTCCACATCTTCAGAGACAACCTGATGCGCATAGAGGCGCACAACAGCAAGGGATCTA CTTTTAAGCTTGGTCTCAACCGTTTCGCCGATTTGACTCAAGATGAATTCAAGCAGAGCC GGCGTCTTGGTCTCAAGCTTCCTTCTGTCAAGCTTGGATCCCTCCGCAGGCGGTCCCACT TCCATCACAAGTCTGAGACCCCTATGGTAACAGCTGAATCTTTGGACTGGAGAACCCTTG GCGCCGTTACCCCAGTGAAAGATCAGGGCATGTGTGGAAGCTGCTGGGCTTTCTCTGCCA CAGGAGCTATTGAAGGAGCCAACGCTGTTGCAACAGGAAACCTTGTCAGTGTTTCGGAGG AAAAGCTTGTGAC |
■■Homology search results ■■ |
- |
Swiss-Prot (release 56.9) |
Link to BlastX Result : Swiss-Prot |
sp_hit_id |
P43297 |
Definition |
sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana |
Align length |
185 |
Score (bit) |
153.0 |
E-value |
8.0e-37 |
Report |
BLASTX 2.2.19 [Nov-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.
Query= DK953689|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0018_H03, 5' (613 letters)
Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765 total letters
Searching..................................................done
Score E Sequences producing significant alignments: (bits) Value
sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 153 8e-37 sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2 147 3e-35 sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis ... 147 4e-35 sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis ... 145 2e-34 sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2 133 6e-31 sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS... 133 6e-31 sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=A... 133 8e-31 sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 ... 131 3e-30 sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. ... 130 4e-30 sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2 130 5e-30 sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1 127 3e-29 sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1 125 1e-28 sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=A... 122 1e-27 sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=A... 121 3e-27 sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=A... 121 3e-27 sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. G... 119 9e-27 sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Br... 117 4e-26 sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2... 117 6e-26 sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1 117 6e-26 sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4 117 6e-26 sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 ... 115 1e-25 sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. j... 115 2e-25 sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersi... 114 4e-25 sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulg... 114 5e-25 sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulg... 114 5e-25 sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium dis... 113 7e-25 sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 ... 113 9e-25 sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis t... 112 1e-24 sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium dis... 112 1e-24 sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. ... 111 3e-24
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1 SV=1 Length = 462
Score = 153 bits (386), Expect = 8e-37 Identities = 80/185 (43%), Positives = 121/185 (65%), Gaps = 6/185 (3%) Frame = +3
Query: 75 VFALFLTLSSADNSHVLSY------SPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKH 236 +F + +SSA + ++SY S + S+A+++S+++AW +KHGK + +EK Sbjct: 11 LFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKD 70
Query: 237 KRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRS 416 +RF IF+DNL ++ HN K +++LGL RFADLT DE++ S+ LG K+ K G RR+ Sbjct: 71 RRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR-SKYLGAKME--KKGE--RRT 125
Query: 417 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 596 +++ ES+DWR GAV VKDQG CGSCWAFS GA+EG N + TG+L+++S Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185
Query: 597 EEKLV 611 E++LV Sbjct: 186 EQELV 190
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2 Length = 352
Score = 147 bits (372), Expect = 3e-35 Identities = 88/196 (44%), Positives = 116/196 (59%), Gaps = 3/196 (1%) Frame = +3
Query: 33 ALMATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYT 212 A M++ S I L + + LSSAD + + YS DL S +L+ LFD+W +KH K Y Sbjct: 2 ATMSSISKIIFLATCLIIHMGLSSAD-FYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYE 60
Query: 213 AAQKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVK 392 + EK RF IFRDNLM I+ N K +++ LGLN FADL+ DEFK+ K Sbjct: 61 SID--EKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKK------KYVGFV 112
Query: 393 LGSLRRRSHFHHKSET-PMVT--AESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGAN 563 HF ++ T VT +S+DWR GAVTPVK+QG CGSCWAFS +EG N Sbjct: 113 AEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGIN 172
Query: 564 AVATGNLVSVSEEKLV 611 + TGNL+ +SE++LV Sbjct: 173 KIVTGNLLELSEQELV 188
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1 SV=2 Length = 356
Score = 147 bits (371), Expect = 4e-35 Identities = 84/196 (42%), Positives = 118/196 (60%), Gaps = 3/196 (1%) Frame = +3
Query: 33 ALMATSSVIWALLGVFALFLTLSSADNSH---VLSYSPSDLHSQAKLVSLFDAWNMKHGK 203 AL + S ++ L + A L+LS A +SH ++ YSP DL S KL+ LF+ W K Sbjct: 2 ALSSPSRILCFALALSAASLSLSFA-SSHDYSIVGYSPEDLESHDKLIELFENWISNFEK 60
Query: 204 HYTAAQKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLP 383 Y + EK RF +F+DNL I+ N KG ++ LGLN FADL+ +EFK+ LGLK Sbjct: 61 AYETVE--EKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADLSHEEFKKMY-LGLKTD 117
Query: 384 SVKLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGAN 563 V+ R + F ++ + +S+DWR GAV VK+QG CGSCWAFS A+EG N Sbjct: 118 IVRRDEERSYAEFAYRDVEAV--PKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGIN 175
Query: 564 AVATGNLVSVSEEKLV 611 + TGNL ++SE++L+ Sbjct: 176 KIVTGNLTTLSEQELI 191
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1 SV=1 Length = 355
Score = 145 bits (366), Expect = 2e-34 Identities = 81/192 (42%), Positives = 119/192 (61%), Gaps = 11/192 (5%) Frame = +3
Query: 69 LGVFALFLTLSS--------ADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQK 224 L F+L + +S+ A + ++ Y+P L + KL+ LF++W +H K Y + + Sbjct: 8 LSKFSLLVAISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVE- 66
Query: 225 LEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSL 404 EK RF +FR+NLM I+ N++ +++ LGLN FADLT +EFK R LGL P Sbjct: 67 -EKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFK-GRYLGLAKPQFS---- 120
Query: 405 RRR---SHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVAT 575 R+R ++F ++ T + +S+DWR GAV PVKDQG CGSCWAFS A+EG N + T Sbjct: 121 RKRQPSANFRYRDITDL--PKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITT 178
Query: 576 GNLVSVSEEKLV 611 GNL S+SE++L+ Sbjct: 179 GNLSSLSEQELI 190
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2 Length = 348
Score = 133 bits (335), Expect = 6e-31 Identities = 75/188 (39%), Positives = 108/188 (57%), Gaps = 1/188 (0%) Frame = +3
Query: 51 SVIWALLGVFALFLTLS-SADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKL 227 S+ L LF+ +S S + ++ YS DL S +L+ LF++W + H K Y Sbjct: 6 SISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVD-- 63
Query: 228 EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLR 407 EK RF IF+DNL I+ N K +++ LGLN FADL+ DEF + + S+ ++ Sbjct: 64 EKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKY-----VGSLIDATIE 118
Query: 408 RRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLV 587 + +E + E++DWR GAVTPV+ QG CGSCWAFSA +EG N + TG LV Sbjct: 119 QSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLV 178
Query: 588 SVSEEKLV 611 +SE++LV Sbjct: 179 ELSEQELV 186
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana GN=GCP1 PE=2 SV=2 Length = 376
Score = 133 bits (335), Expect = 6e-31 Identities = 73/199 (36%), Positives = 119/199 (59%), Gaps = 8/199 (4%) Frame = +3
Query: 39 MATSSVIWALLGVFALFLTLSSAD----NSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKH 206 MA S+ + +LL ++ + ++L+S D N H+ S + ++ S++ W+ +HGK Sbjct: 1 MAPSTKVLSLLLLYVV-VSLASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKT 59
Query: 207 YTAAQKL--EKHKRFHIFRDNLMRIEAHN--SKGSTFKLGLNRFADLTQDEFKQSRRLGL 374 + ++ KRF+IF+DNL I+ HN +K +T+KLGL +F DLT DE+++ Sbjct: 60 NNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGAR 119
Query: 375 KLPSVKLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIE 554 P+ ++ + + + + E++DWR GAV P+KDQG CGSCWAFS T A+E Sbjct: 120 TEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVE 179
Query: 555 GANAVATGNLVSVSEEKLV 611 G N + TG L+S+SE++LV Sbjct: 180 GINKIVTGELISLSEQELV 198
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana GN=At3g43960 PE=2 SV=1 Length = 376
Score = 133 bits (334), Expect = 8e-31 Identities = 79/193 (40%), Positives = 120/193 (62%), Gaps = 2/193 (1%) Frame = +3
Query: 39 MATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAA 218 MA S ALL + L +++S V++ + S ++ +++++++ W +++GK+Y Sbjct: 1 MAISFRTLALLTLSVLLISISLG----VVTATESQ-RNEGEVLTMYEQWLVENGKNYNGL 55
Query: 219 QKLEKHKRFHIFRDNLMRIEAHNSKGS-TFKLGLNRFADLTQDEFKQSRRLGLKLPSVKL 395 EK +RF IF+DNL RIE HNS + +++ GLN+F+DLT DEF Q+ LG K+ L Sbjct: 56 G--EKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEF-QASYLGGKMEKKSL 112
Query: 396 GSLRRRSHFHHKSETPMVTAESLDWRTLGAVTP-VKDQGMCGSCWAFSATGAIEGANAVA 572 + R + P + +DWR GAV P VK QG CGSCWAF+ATGA+EG N + Sbjct: 113 SDVAERYQYKEGDVLP----DEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQIT 168
Query: 573 TGNLVSVSEEKLV 611 TG LVS+SE++L+ Sbjct: 169 TGELVSLSEQELI 181
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3 Length = 348
Score = 131 bits (329), Expect = 3e-30 Identities = 74/174 (42%), Positives = 102/174 (58%) Frame = +3
Query: 90 LTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHKRFHIFRDNLM 269 ++LS D S ++ YS DL S +L+ LF++W +KH K+Y EK RF IF+DNL Sbjct: 21 MSLSYCDFS-IVGYSQDDLTSTERLIQLFNSWMLKHNKNYKNVD--EKLYRFEIFKDNLK 77
Query: 270 RIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRSHFHHKSETPMV 449 I+ N + + LGLN F+DL+ DEFK+ + S+ + +E + Sbjct: 78 YIDERNKMINGYWLGLNEFSDLSNDEFKEKY-----VGSLPEDYTNQPYDEEFVNEDIVD 132
Query: 450 TAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEKLV 611 ES+DWR GAVTPVK QG C SCWAFS +EG N + TGNLV +SE++LV Sbjct: 133 LPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKIKTGNLVELSEQELV 186
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000 PE=1 SV=2 Length = 458
Score = 130 bits (328), Expect = 4e-30 Identities = 76/180 (42%), Positives = 110/180 (61%), Gaps = 4/180 (2%) Frame = +3
Query: 84 LFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHKRFHIFRDN 263 L L+L++AD S ++SY S+ + L+ W +HGK Y A E+ +R+ FRDN Sbjct: 14 LLLSLAAADMS-IVSYGE---RSEEEARRLYAEWKAEHGKSYNAVG--EEERRYAAFRDN 67
Query: 264 LMRIEAHNSKGS----TFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRSHFHHK 431 L I+ HN+ +F+LGLNRFADLT +E++ + LGL+ + R+ + Sbjct: 68 LRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRDTY-LGLRNKPRR----ERKVSDRYL 122
Query: 432 SETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEKLV 611 + ES+DWRT GAV +KDQG CGSCWAFSA A+EG N + TG+L+S+SE++LV Sbjct: 123 AADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELV 182
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2 Length = 362
Score = 130 bits (327), Expect = 5e-30 Identities = 78/194 (40%), Positives = 113/194 (58%), Gaps = 3/194 (1%) Frame = +3
Query: 39 MATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAA 218 MAT ++W +L F+L L ++++ + H DL S+ L L++ W H+T + Sbjct: 1 MATKKLLWVVLS-FSLVLGVANSFDFH-----DKDLASEESLWDLYERWR----SHHTVS 50
Query: 219 QKL-EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKL--PSV 389 + L EKHKRF++F+ NLM + N +KL LN+FAD+T EF+ S G K+ P + Sbjct: 51 RSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFR-STYAGSKVNHPRM 109
Query: 390 KLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAV 569 G+ F + E + S+DWR GAVT VKDQG CGSCWAFS A+EG N + Sbjct: 110 FRGTPHENGAFMY--EKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167
Query: 570 ATGNLVSVSEEKLV 611 T LV++SE++LV Sbjct: 168 KTNKLVALSEQELV 181
|
TrEMBL (release 39.9) |
Link to BlastX Result : TrEMBL |
tr_hit_id |
Q94BX1 |
Definition |
tr|Q94BX1|Q94BX1_ARATH F2G19.31/F2G19.31 OS=Arabidopsis thaliana |
Align length |
185 |
Score (bit) |
155.0 |
E-value |
2.0e-36 |
Report |
BLASTX 2.2.19 [Nov-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.
Query= DK953689|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0018_H03, 5' (613 letters)
Database: uniprot_trembl.fasta 7,341,751 sequences; 2,391,615,440 total letters
Searching..................................................done
Score E Sequences producing significant alignments: (bits) Value
tr|Q94BX1|Q94BX1_ARATH F2G19.31/F2G19.31 OS=Arabidopsis thaliana... 155 2e-36 tr|Q56XI5|Q56XI5_ARATH Cysteine proteinase RD21A OS=Arabidopsis ... 153 9e-36 tr|Q6Y1E9|Q6Y1E9_TRIRP Cysteine protease 14 OS=Trifolium repens ... 152 1e-35 tr|Q6Y1F0|Q6Y1F0_TRIRP Cysteine protease 14 OS=Trifolium repens ... 151 3e-35 tr|O24323|O24323_PHAVU Cysteine proteinase OS=Phaseolus vulgaris... 151 3e-35 tr|A5HIJ6|A5HIJ6_ACTDE Cysteine protease Cp6 OS=Actinidia delici... 150 7e-35 tr|Q84M29|Q84M29_HELAN Cysteine protease-1 OS=Helianthus annuus ... 149 1e-34 tr|Q9SMI1|Q9SMI1_CARPA Chymopapain isoform II OS=Carica papaya G... 147 4e-34 tr|Q9SMI0|Q9SMI0_CARPA Chymopapain isoform III OS=Carica papaya ... 147 4e-34 tr|Q0WT15|Q0WT15_ARATH Putative cysteine proteinase OS=Arabidops... 147 5e-34 tr|A9NW12|A9NW12_PICSI Putative uncharacterized protein OS=Picea... 145 1e-33 tr|Q3E9R1|Q3E9R1_ARATH Uncharacterized protein At4g35350.2 OS=Ar... 145 2e-33 tr|A9P285|A9P285_PICSI Putative uncharacterized protein OS=Picea... 145 2e-33 tr|A7P8S5|A7P8S5_VITVI Chromosome chr3 scaffold_8, whole genome ... 145 2e-33 tr|Q41064|Q41064_PEA Thiolprotease OS=Pisum sativum GN=tpp PE=2 ... 144 3e-33 tr|Q155L4|Q155L4_HEVBR Cysteine protease OS=Hevea brasiliensis G... 144 3e-33 tr|B4ESE6|B4ESE6_HORVD Papain-like cysteine proteinase OS=Hordeu... 144 3e-33 tr|Q9ST61|Q9ST61_SOLTU Cysteine protease OS=Solanum tuberosum GN... 144 4e-33 tr|Q84M26|Q84M26_HELAN Cysteine protease-4 OS=Helianthus annuus ... 143 7e-33 tr|Q6F6A9|Q6F6A9_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 143 7e-33 tr|Q6ZHP9|Q6ZHP9_ORYSJ Os02g0715000 protein OS=Oryza sativa subs... 143 9e-33 tr|Q40922|Q40922_PSEMZ Pseudotzain OS=Pseudotsuga menziesii GN=P... 143 9e-33 tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea... 143 9e-33 tr|A7LHN5|A7LHN5_PINPS Cysteine protease (Fragment) OS=Pinus pin... 143 9e-33 tr|Q5G0K2|Q5G0K2_PINTA Cysteine protease (Fragment) OS=Pinus tae... 142 1e-32 tr|Q2AAC8|Q2AAC8_9ASTR Cysteine proteinase OS=Platycodon grandif... 142 1e-32 tr|A3AAP5|A3AAP5_ORYSJ Putative uncharacterized protein OS=Oryza... 142 1e-32 tr|A2X8X3|A2X8X3_ORYSI Putative uncharacterized protein OS=Oryza... 142 1e-32 tr|Q5G0J0|Q5G0J0_PINTA Cysteine protease (Fragment) OS=Pinus tae... 142 2e-32 tr|B4ESE7|B4ESE7_HORVD Papain-like cysteine proteinase OS=Hordeu... 142 2e-32
>tr|Q94BX1|Q94BX1_ARATH F2G19.31/F2G19.31 OS=Arabidopsis thaliana PE=2 SV=1 Length = 462
Score = 155 bits (391), Expect = 2e-36 Identities = 81/185 (43%), Positives = 122/185 (65%), Gaps = 6/185 (3%) Frame = +3
Query: 75 VFALFLTLSSADNSHVLSY------SPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKH 236 +F +T+SSA + ++SY S + S+A+++S+++AW +KHGK + +EK Sbjct: 11 LFLAMVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKD 70
Query: 237 KRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRS 416 +RF IF+DNL ++ HN K +++LGL RFADLT DE++ S+ LG K+ K G RR+ Sbjct: 71 RRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR-SKYLGAKME--KKGE--RRT 125
Query: 417 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 596 +++ ES+DWR GAV VKDQG CGSCWAFS GA+EG N + TG+L+++S Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185
Query: 597 EEKLV 611 E++LV Sbjct: 186 EQELV 190
>tr|Q56XI5|Q56XI5_ARATH Cysteine proteinase RD21A OS=Arabidopsis thaliana GN=At1g47128 PE=2 SV=1 Length = 433
Score = 153 bits (386), Expect = 9e-36 Identities = 80/185 (43%), Positives = 121/185 (65%), Gaps = 6/185 (3%) Frame = +3
Query: 75 VFALFLTLSSADNSHVLSY------SPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKH 236 +F + +SSA + ++SY S + S+A+++S+++AW +KHGK + +EK Sbjct: 11 LFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKD 70
Query: 237 KRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRS 416 +RF IF+DNL ++ HN K +++LGL RFADLT DE++ S+ LG K+ K G RR+ Sbjct: 71 RRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR-SKYLGAKME--KKGE--RRT 125
Query: 417 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 596 +++ ES+DWR GAV VKDQG CGSCWAFS GA+EG N + TG+L+++S Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185
Query: 597 EEKLV 611 E++LV Sbjct: 186 EQELV 190
>tr|Q6Y1E9|Q6Y1E9_TRIRP Cysteine protease 14 OS=Trifolium repens PE=2 SV=1 Length = 351
Score = 152 bits (384), Expect = 1e-35 Identities = 84/188 (44%), Positives = 113/188 (60%) Frame = +3
Query: 48 SSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKL 227 SS L LFL+L+ + ++ YS DL S KL+ LF++W +HGK Y + Sbjct: 5 SSKTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIE-- 62
Query: 228 EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLR 407 EK RF +F+DNL I+ N S + LGLN FADL+ EFK ++ LGLK V L R Sbjct: 63 EKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFK-NKYLGLK---VNLSQRR 118
Query: 408 RRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLV 587 S+ + + +S+DWR GAVTPVK+QG CGSCWAFS A+EG N + TGNL Sbjct: 119 ESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 178
Query: 588 SVSEEKLV 611 S+SE++L+ Sbjct: 179 SLSEQELI 186
>tr|Q6Y1F0|Q6Y1F0_TRIRP Cysteine protease 14 OS=Trifolium repens PE=3 SV=1 Length = 351
Score = 151 bits (382), Expect = 3e-35 Identities = 84/188 (44%), Positives = 113/188 (60%) Frame = +3
Query: 48 SSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKL 227 SS L LFL+L+ + ++ YS DL S KL+ LF++W +HGK Y + Sbjct: 5 SSKTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIE-- 62
Query: 228 EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLR 407 EK RF +F+DNL I+ N S + LGLN FADL+ EFK ++ LGLK V L R Sbjct: 63 EKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFK-NKYLGLK---VDLSQRR 118
Query: 408 RRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLV 587 S+ + + +S+DWR GAVTPVK+QG CGSCWAFS A+EG N + TGNL Sbjct: 119 ESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 178
Query: 588 SVSEEKLV 611 S+SE++L+ Sbjct: 179 SLSEQELI 186
>tr|O24323|O24323_PHAVU Cysteine proteinase OS=Phaseolus vulgaris PE=3 SV=1 Length = 455
Score = 151 bits (382), Expect = 3e-35 Identities = 87/185 (47%), Positives = 120/185 (64%), Gaps = 6/185 (3%) Frame = +3
Query: 75 VFALFLTLSSADNSHVLSYS-----PSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHK 239 +FALF LSSA + ++SY + + ++ SL++ W +KHGK Y A EK K Sbjct: 3 LFALF-ALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALG--EKDK 59
Query: 240 RFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKL-PSVKLGSLRRRS 416 RF IF+DNL I+ N++ T+KLGLNRFADLT +E++ +R LG K+ P+ +LG Sbjct: 60 RFQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYR-ARYLGTKIDPNRRLGRTPSNR 118
Query: 417 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 596 + ET +S+DWR GAV PVKDQ CGSCWAFSA GA+EG N + TG+L+S+S Sbjct: 119 YAPRVGET---LPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLS 175
Query: 597 EEKLV 611 E++LV Sbjct: 176 EQELV 180
>tr|A5HIJ6|A5HIJ6_ACTDE Cysteine protease Cp6 OS=Actinidia deliciosa PE=2 SV=1 Length = 461
Score = 150 bits (378), Expect = 7e-35 Identities = 87/187 (46%), Positives = 117/187 (62%), Gaps = 4/187 (2%) Frame = +3
Query: 63 ALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHKR 242 ALL +F+LF S+ D S + S S + ++++++++W +KHGK Y A EK KR Sbjct: 11 ALLLLFSLFALSSALDMSIIGELSSS--RTDDEVMAMYESWLVKHGKSYNAIG--EKEKR 66
Query: 243 FHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRSHF 422 F IF+DNL I+ HN++ T+K+GLNRFADLT DE++ S LG + GS RR S Sbjct: 67 FQIFKDNLRFIDEHNAESRTYKVGLNRFADLTNDEYR-SMYLG-----ARTGSRRRLSTQ 120
Query: 423 HHKSETPMVTAESL----DWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVS 590 V ESL DWR GAV VKDQG CGSCWAFS A+EG N + TG+L+S Sbjct: 121 KRSDRYVPVAGESLPDSVDWREKGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLIS 180
Query: 591 VSEEKLV 611 +SE++LV Sbjct: 181 LSEQELV 187
>tr|Q84M29|Q84M29_HELAN Cysteine protease-1 OS=Helianthus annuus GN=scp1 PE=2 SV=1 Length = 461
Score = 149 bits (377), Expect = 1e-34 Identities = 82/192 (42%), Positives = 122/192 (63%), Gaps = 1/192 (0%) Frame = +3
Query: 39 MATSSVIWALLGVFALFLTLSSADNSHVLSYSPS-DLHSQAKLVSLFDAWNMKHGKHYTA 215 MAT S + + A+ +++ + D +H+ S S S L + ++ +L+++W +KHGK Y A Sbjct: 6 MATLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKTYNA 65
Query: 216 AQKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKL 395 EK +RF IF+DNL I+ HNS T+KLGLN+FADLT +E++ + + K Sbjct: 66 LG--EKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKKK 123
Query: 396 GSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVAT 575 S + + ++S + E +DWR GAVT VKDQG CGSCWAFS TG++EG N + T Sbjct: 124 LSKMKSDRYAYRSGDSL--PEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIVT 181
Query: 576 GNLVSVSEEKLV 611 G+L+SVSE++LV Sbjct: 182 GDLISVSEQELV 193
>tr|Q9SMI1|Q9SMI1_CARPA Chymopapain isoform II OS=Carica papaya GN=chymoII PE=2 SV=1 Length = 352
Score = 147 bits (372), Expect = 4e-34 Identities = 88/196 (44%), Positives = 116/196 (59%), Gaps = 3/196 (1%) Frame = +3
Query: 33 ALMATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYT 212 A M++ S I L + + LSSAD + + YS DL S +L+ LFD+W +KH K Y Sbjct: 2 ATMSSISKIIFLATCLIIHMGLSSAD-FYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYE 60
Query: 213 AAQKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVK 392 + EK RF IFRDNLM I+ N K +++ LGLN FADL+ DEFK+ K Sbjct: 61 SID--EKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKK------KYVGFV 112
Query: 393 LGSLRRRSHFHHKSET-PMVT--AESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGAN 563 HF ++ T VT +S+DWR GAVTPVK+QG CGSCWAFS +EG N Sbjct: 113 AEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGIN 172
Query: 564 AVATGNLVSVSEEKLV 611 + TGNL+ +SE++LV Sbjct: 173 KIVTGNLLELSEQELV 188
>tr|Q9SMI0|Q9SMI0_CARPA Chymopapain isoform III OS=Carica papaya GN=chymoIII PE=2 SV=1 Length = 361
Score = 147 bits (372), Expect = 4e-34 Identities = 88/196 (44%), Positives = 116/196 (59%), Gaps = 3/196 (1%) Frame = +3
Query: 33 ALMATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYT 212 A M++ S I L + + LSSAD + + YS DL S +L+ LFD+W +KH K Y Sbjct: 2 ATMSSISKIIFLATCLIIHMGLSSAD-FYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYE 60
Query: 213 AAQKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVK 392 + EK RF IFRDNLM I+ N K +++ LGLN FADL+ DEFK+ K Sbjct: 61 SID--EKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKK------KYVGFV 112
Query: 393 LGSLRRRSHFHHKSET-PMVT--AESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGAN 563 HF ++ T VT +S+DWR GAVTPVK+QG CGSCWAFS +EG N Sbjct: 113 AEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGIN 172
Query: 564 AVATGNLVSVSEEKLV 611 + TGNL+ +SE++LV Sbjct: 173 KIVTGNLLELSEQELV 188
>tr|Q0WT15|Q0WT15_ARATH Putative cysteine proteinase OS=Arabidopsis thaliana GN=At1g20850 PE=2 SV=1 Length = 356
Score = 147 bits (371), Expect = 5e-34 Identities = 84/196 (42%), Positives = 118/196 (60%), Gaps = 3/196 (1%) Frame = +3
Query: 33 ALMATSSVIWALLGVFALFLTLSSADNSH---VLSYSPSDLHSQAKLVSLFDAWNMKHGK 203 AL + S ++ L + A L+LS A +SH ++ YSP DL S KL+ LF+ W K Sbjct: 2 ALSSPSRILCFALALSAASLSLSFA-SSHDYSIVGYSPEDLESHDKLIELFENWISNFEK 60
Query: 204 HYTAAQKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLP 383 Y + EK RF +F+DNL I+ N KG ++ LGLN FADL+ +EFK+ LGLK Sbjct: 61 AYETVE--EKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADLSHEEFKKMY-LGLKTD 117
Query: 384 SVKLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGAN 563 V+ R + F ++ + +S+DWR GAV VK+QG CGSCWAFS A+EG N Sbjct: 118 IVRRDEERSYAEFAYRDVEAV--PKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGIN 175
Query: 564 AVATGNLVSVSEEKLV 611 + TGNL ++SE++L+ Sbjct: 176 KIVTGNLTTLSEQELI 191
|