DK952607
Clone id TST38A01NGRL0014_I23
Library
Length 625
Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0014_I23. 5' end sequence.
Accession
Tissue type prothallia
Developmental stage gametophyte
Contig ID
Sequence
AGAGATGGCGTTCTCATATTCTTCGACAATCCGGGGCTTTTTGTGCCTGTCAGTACTCGT
GCTCGTCATCTGCTTCGCTTCTGCGCGCCATCTTGAGTACAATGAAGACGATCTCGCTTC
CGAAGATCGCCTGCTGCAGCTCTTCGAGAAATGGGCAACCAAGCACTCTAAGAACTACAC
CTCCCCCCATGAATCCTCTCAGAAGCACTCGCGCTTTCAAGTCTTCAAGCAGAACCTTGC
TTACATTCACCAGCAGAATAGCAACAAACAGAAGGAGTCTTCCCACAGGCTGGGCTTGAC
CCGCTTCGCAGATCTCACCCTTAACGAGTTTAAAGCTCGACATTTTGGCTTCAGAAACCG
CCCCAGCCCTGTTCCCCTTCAGGAATACAGCTCTGTCTGCGATACCAAGAAACTCCCTGC
ATCTGTTGATTGGAGAAAGCATGGTGCTGTTACCCCAGTTAAAGATCAAGGAACATGCGG
AAGCTGTTGGGCTTTCTCGTCTGTTGGTGCTATTGAGGGTGCACATGCTATAGCCATCGG
GGAGCTTGTGAGCTTGTCTGAACAGGAGCTTGTCAGCTGTGTTCACACTAACTTTGGCTG
CCATGGTGGCCTCATGAACCCCGCA
■■Homology search results ■■ -
sp_hit_id O65493
Definition sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana
Align length 217
Score (bit) 154.0
E-value 4.0e-37
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK952607|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0014_I23, 5'
(625 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis ... 154 4e-37
sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis ... 141 3e-33
sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2 140 4e-33
sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2 139 9e-33
sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2 139 9e-33
sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. G... 139 1e-32
sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1 138 2e-32
sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 ... 137 4e-32
sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1 135 2e-31
sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 ... 134 3e-31
sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS... 133 8e-31
sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulg... 132 1e-30
sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulg... 132 1e-30
sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 132 2e-30
sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. ... 132 2e-30
sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Br... 127 4e-29
sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=A... 123 9e-28
sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2... 122 1e-27
sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=A... 119 1e-26
sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium dis... 119 1e-26
sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=A... 119 1e-26
sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=A... 118 2e-26
sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2 117 5e-26
sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis t... 116 1e-25
sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium dis... 116 1e-25
sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1 115 1e-25
sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. j... 114 3e-25
sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1 114 3e-25
sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1 113 7e-25
sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4 112 1e-24

>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis
thaliana GN=XCP1 PE=1 SV=1
Length = 355

Score = 154 bits (389), Expect = 4e-37
Identities = 97/217 (44%), Positives = 125/217 (57%), Gaps = 10/217 (4%)
Frame = +2

Query: 5 MAFSYSSTIRGFLCLSVLV-LVICFASARHLE---YNEDDLASEDRLLQLFEKWATKHSK 172
MAFS S + L +++ ++C A AR Y + L + D+LL+LFE W ++HSK
Sbjct: 1 MAFSAPSLSKFSLLVAISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSK 60

Query: 173 NYTSPHESSQKHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG- 349
Y S E K RF+VF++NL +I Q+N+ + +S+ LGL FADLT EFK R+ G
Sbjct: 61 AYKSVEE---KVHRFEVFRENLMHIDQRNN---EINSYWLGLNEFADLTHEEFKGRYLGL 114

Query: 350 ----FRNRPSPVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXX 517
F + P Y + D LP SVDWRK GAV PVKDQG CGSCWAFS+V
Sbjct: 115 AKPQFSRKRQPSANFRYRDITD---LPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVE 171

Query: 518 XXXXXXXXXLVSLSEQELVSCVHT-NFGCHGGLMNPA 625
L SLSEQEL+ C T N GC+GGLM+ A
Sbjct: 172 GINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis
thaliana GN=XCP2 PE=1 SV=2
Length = 356

Score = 141 bits (355), Expect = 3e-33
Identities = 92/216 (42%), Positives = 120/216 (55%), Gaps = 9/216 (4%)
Frame = +2

Query: 5 MAFSYSSTIRGF-LCLSVLVLVICFASARHLE---YNEDDLASEDRLLQLFEKWATKHSK 172
MA S S I F L LS L + FAS+ Y+ +DL S D+L++LFE W + K
Sbjct: 1 MALSSPSRILCFALALSAASLSLSFASSHDYSIVGYSPEDLESHDKLIELFENWISNFEK 60

Query: 173 NYTSPHESSQKHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGF 352
Y + E K RF+VFK NL +I + N +K S+ LGL FADL+ EFK + G
Sbjct: 61 AYETVEE---KFLRFEVFKDNLKHIDETN---KKGKSYWLGLNEFADLSHEEFKKMYLGL 114

Query: 353 RN----RPSPVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXX 520
+ R E++ D + +P SVDWRK GAV VK+QG+CGSCWAFS+V
Sbjct: 115 KTDIVRRDEERSYAEFA-YRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEG 173

Query: 521 XXXXXXXXLVSLSEQELVSCVHT-NFGCHGGLMNPA 625
L +LSEQEL+ C T N GC+GGLM+ A
Sbjct: 174 INKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYA 209


>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348

Score = 140 bits (354), Expect = 4e-33
Identities = 84/193 (43%), Positives = 109/193 (56%), Gaps = 3/193 (1%)
Frame = +2

Query: 41 LCLSVLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQ 220
+CL V + V F + Y++DDL S +RL+QLF W H+K Y + E K RF+
Sbjct: 15 ICLFVHMSV-SFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDE---KLYRFE 70

Query: 221 VFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQEYSSVC 400
+FK NL YI + N +K +S+ LGL FADL+ +EF ++ G + Q Y
Sbjct: 71 IFKDNLNYIDETN---KKNNSYWLGLNEFADLSNDEFNEKYVG--SLIDATIEQSYDEEF 125

Query: 401 ---DTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQEL 571
DT LP +VDWRK GAVTPV+ QG+CGSCWAFS+V LV LSEQEL
Sbjct: 126 INEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQEL 185

Query: 572 VSCVHTNFGCHGG 610
V C + GC GG
Sbjct: 186 VDCERRSHGCKGG 198


>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352

Score = 139 bits (351), Expect = 9e-33
Identities = 78/175 (44%), Positives = 105/175 (60%), Gaps = 4/175 (2%)
Frame = +2

Query: 98 YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQNLAYIHQQNSNKQKE 277
Y++DDL S +RL+QLF+ W KH+K Y S E K RF++F+ NL YI + N +K
Sbjct: 33 YSQDDLTSIERLIQLFDSWMLKHNKIYESIDE---KIYRFEIFRDNLMYIDETN---KKN 86

Query: 278 SSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQEYSSVCDTKK----LPASVDWRKHG 445
+S+ LGL FADL+ +EFK ++ GF L+ + + T K P S+DWR G
Sbjct: 87 NSYWLGLNGFADLSNDEFKKKYVGFVAEDF-TGLEHFDNEDFTYKHVTNYPQSIDWRAKG 145

Query: 446 AVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSCVHTNFGCHGG 610
AVTPVK+QG CGSCWAFS++ L+ LSEQELV C ++GC GG
Sbjct: 146 AVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKHSYGCKGG 200


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362

Score = 139 bits (351), Expect = 9e-33
Identities = 84/200 (42%), Positives = 110/200 (55%), Gaps = 9/200 (4%)
Frame = +2

Query: 53 VLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQ 232
VL + A ++++ DLASE+ L L+E+W + H T +KH RF VFK
Sbjct: 10 VLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHH----TVSRSLGEKHKRFNVFKA 65

Query: 233 NLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--------FRNRPSPVPLQEY 388
NL ++H N+NK + ++L L +FAD+T +EF++ + G FR P Y
Sbjct: 66 NLMHVH--NTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMY 122

Query: 389 SSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQE 568
V +P SVDWRK GAVT VKDQG CGSCWAFS+V LV+LSEQE
Sbjct: 123 EKVVS---VPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQE 179

Query: 569 LVSC-VHTNFGCHGGLMNPA 625
LV C N GC+GGLM A
Sbjct: 180 LVDCDKEENQGCNGGLMESA 199


>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp.
GN=SEN102 PE=2 SV=1
Length = 360

Score = 139 bits (350), Expect = 1e-32
Identities = 83/203 (40%), Positives = 122/203 (60%), Gaps = 7/203 (3%)
Frame = +2

Query: 38 FLCLSVLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRF 217
F+ L+++ L + A+ + + E DLASED L L+EKW T H T + +K+ RF
Sbjct: 6 FIALALVALSF-LSIAQSIPFTEKDLASEDSLWNLYEKWRTHH----TVARDLDEKNRRF 60

Query: 218 QVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--FRNRPSPVPLQEYS 391
VFK+N+ +IH+ N++K++ ++L L +F D+T EF++++ G ++ S +Q+ +
Sbjct: 61 NVFKENVKFIHE--FNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNT 118

Query: 392 SVC---DTKKLPA-SVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLS 559
+ LPA S+DWR GAVT VKDQG CGSCWAFS++ LVSLS
Sbjct: 119 GSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLS 178

Query: 560 EQELVSC-VHTNFGCHGGLMNPA 625
EQELV C N GC+GGLM+ A
Sbjct: 179 EQELVDCDTSYNEGCNGGLMDYA 201


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362

Score = 138 bits (348), Expect = 2e-32
Identities = 84/200 (42%), Positives = 110/200 (55%), Gaps = 9/200 (4%)
Frame = +2

Query: 53 VLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQ 232
VL L + A +++E DL SE+ L L+E+W + H T +KH RF VFK
Sbjct: 10 VLSLSLVLGVANSFDFHEKDLESEESLWDLYERWRSHH----TVSRSLGEKHKRFNVFKA 65

Query: 233 NLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--------FRNRPSPVPLQEY 388
N+ ++H N+NK + ++L L +FAD+T +EF++ + G FR Y
Sbjct: 66 NVMHVH--NTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMY 122

Query: 389 SSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQE 568
V +PASVDWRK GAVT VKDQG CGSCWAFS++ LVSLSEQE
Sbjct: 123 EKV---GSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQE 179

Query: 569 LVSC-VHTNFGCHGGLMNPA 625
LV C N GC+GGLM A
Sbjct: 180 LVDCDKEENQGCNGGLMESA 199


>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1
SV=3
Length = 348

Score = 137 bits (345), Expect = 4e-32
Identities = 78/174 (44%), Positives = 101/174 (58%), Gaps = 3/174 (1%)
Frame = +2

Query: 98 YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQNLAYIHQQNSNKQKE 277
Y++DDL S +RL+QLF W KH+KNY + E K RF++FK NL YI ++N +
Sbjct: 33 YSQDDLTSTERLIQLFNSWMLKHNKNYKNVDE---KLYRFEIFKDNLKYIDERN---KMI 86

Query: 278 SSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQEYSSVC---DTKKLPASVDWRKHGA 448
+ + LGL F+DL+ +EFK ++ G + P Q Y D LP SVDWR GA
Sbjct: 87 NGYWLGLNEFSDLSNDEFKEKYVG--SLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGA 144

Query: 449 VTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSCVHTNFGCHGG 610
VTPVK QG C SCWAFS+V LV LSEQELV C ++GC+ G
Sbjct: 145 VTPVKHQGYCESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQSYGCNRG 198


>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345

Score = 135 bits (340), Expect = 2e-31
Identities = 78/192 (40%), Positives = 110/192 (57%), Gaps = 2/192 (1%)
Frame = +2

Query: 41 LCLSVLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQ 220
+CL V + + F + Y+++DL S +RL+QLFE W KH+K Y + E K RF+
Sbjct: 15 ICLFVY-MGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDE---KIYRFE 70

Query: 221 VFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQEYSSVC 400
+FK NL YI + N +K +S+ LGL FAD++ +EFK ++ G Y V
Sbjct: 71 IFKDNLKYIDETN---KKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVL 127

Query: 401 DTK--KLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELV 574
+ +P VDWR+ GAVTPVK+QG+CGSCWAFS+V L SEQEL+
Sbjct: 128 NDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELL 187

Query: 575 SCVHTNFGCHGG 610
C ++GC+GG
Sbjct: 188 DCDRRSYGCNGG 199


>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1
SV=1
Length = 360

Score = 134 bits (338), Expect = 3e-31
Identities = 87/202 (43%), Positives = 114/202 (56%), Gaps = 9/202 (4%)
Frame = +2

Query: 47 LSVLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVF 226
L L L + A +++E +L SE+ L L+E+W + H+ + S HE K RF VF
Sbjct: 6 LLALSLALVLAITESFDFHEKELESEESLWGLYERWRSHHTVS-RSLHE---KQKRFNVF 61

Query: 227 KQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--------FRNRPSPVPLQ 382
K N ++H N+NK + ++L L +FAD+T +EF+ + G FR P
Sbjct: 62 KHNAMHVH--NANKM-DKPYKLKLNKFADMTNHEFRNTYSGSKVKHHRMFRGGPRGNGTF 118

Query: 383 EYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSE 562
Y V DT +PASVDWRK GAVT VKDQG CGSCWAFS++ LVSLSE
Sbjct: 119 MYEKV-DT--VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSE 175

Query: 563 QELVSC-VHTNFGCHGGLMNPA 625
QELV C N GC+GGLM+ A
Sbjct: 176 QELVDCDTDQNQGCNGGLMDYA 197


tr_hit_id Q3E9R1
Definition tr|Q3E9R1|Q3E9R1_ARATH Uncharacterized protein At4g35350.2 OS=Arabidopsis thaliana
Align length 217
Score (bit) 154.0
E-value 4.0e-36
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK952607|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0014_I23, 5'
(625 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

tr|Q3E9R1|Q3E9R1_ARATH Uncharacterized protein At4g35350.2 OS=Ar... 154 4e-36
tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea... 151 4e-35
tr|B4ESE7|B4ESE7_HORVD Papain-like cysteine proteinase OS=Hordeu... 150 6e-35
tr|Q6F6A3|Q6F6A3_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 150 8e-35
tr|Q6ZHP9|Q6ZHP9_ORYSJ Os02g0715000 protein OS=Oryza sativa subs... 149 1e-34
tr|A3AAP5|A3AAP5_ORYSJ Putative uncharacterized protein OS=Oryza... 149 1e-34
tr|A2X8X3|A2X8X3_ORYSI Putative uncharacterized protein OS=Oryza... 149 1e-34
tr|A9P285|A9P285_PICSI Putative uncharacterized protein OS=Picea... 149 2e-34
tr|B4ESE6|B4ESE6_HORVD Papain-like cysteine proteinase OS=Hordeu... 147 4e-34
tr|Q94HK7|Q94HK7_ORYSA Putative cysteine proteinase OS=Oryza sat... 146 1e-33
tr|Q7XBA4|Q7XBA4_ORYSJ Os05g0108600 protein OS=Oryza sativa subs... 146 1e-33
tr|Q40922|Q40922_PSEMZ Pseudotzain OS=Pseudotsuga menziesii GN=P... 146 1e-33
tr|A2XZJ0|A2XZJ0_ORYSI Putative uncharacterized protein OS=Oryza... 146 1e-33
tr|Q948S1|Q948S1_DAUCA Cysteine proteinase (Fragment) OS=Daucus ... 145 1e-33
tr|B6TLC8|B6TLC8_MAIZE Xylem cysteine proteinase 2 OS=Zea mays P... 145 1e-33
tr|B0ZRH2|B0ZRH2_PINSY Cysteine protease (Fragment) OS=Pinus syl... 145 1e-33
tr|Q94DH7|Q94DH7_ORYSJ cDNA clone:001-029-D05, full insert seque... 145 3e-33
tr|Q41064|Q41064_PEA Thiolprotease OS=Pisum sativum GN=tpp PE=2 ... 145 3e-33
tr|Q0JFN1|Q0JFN1_ORYSJ Os01g0971400 protein (Fragment) OS=Oryza ... 145 3e-33
tr|A2WZK0|A2WZK0_ORYSI Putative uncharacterized protein OS=Oryza... 145 3e-33
tr|A9NV34|A9NV34_PICSI Putative uncharacterized protein OS=Picea... 144 3e-33
tr|A7Y7Y0|A7Y7Y0_SOLLC KDEL-tailed cysteine endopeptidase OS=Sol... 144 6e-33
tr|A1Y2K8|A1Y2K8_9ROSI VXH-C (Fragment) OS=Vasconcellea x heilbo... 144 6e-33
tr|Q6RCL8|Q6RCL8_IRIHO Putative cysteine protease 2 OS=Iris holl... 143 7e-33
tr|A9TQ45|A9TQ45_PHYPA Predicted protein OS=Physcomitrella paten... 143 7e-33
tr|A9NW12|A9NW12_PICSI Putative uncharacterized protein OS=Picea... 143 7e-33
tr|Q8LNY4|Q8LNY4_ZINEL Cysteine proteinase OS=Zinnia elegans GN=... 143 1e-32
tr|B6TIC7|B6TIC7_MAIZE Putative uncharacterized protein OS=Zea m... 142 2e-32
tr|Q41690|Q41690_9FABA Cysteinyl endopeptidase OS=Vigna radiata ... 142 2e-32
tr|A1Y2K4|A1Y2K4_9ROSI VXH-A (Fragment) OS=Vasconcellea x heilbo... 142 2e-32

>tr|Q3E9R1|Q3E9R1_ARATH Uncharacterized protein At4g35350.2
OS=Arabidopsis thaliana GN=At4g35350 PE=3 SV=1
Length = 288

Score = 154 bits (389), Expect = 4e-36
Identities = 97/217 (44%), Positives = 125/217 (57%), Gaps = 10/217 (4%)
Frame = +2

Query: 5 MAFSYSSTIRGFLCLSVLV-LVICFASARHLE---YNEDDLASEDRLLQLFEKWATKHSK 172
MAFS S + L +++ ++C A AR Y + L + D+LL+LFE W ++HSK
Sbjct: 1 MAFSAPSLSKFSLLVAISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSK 60

Query: 173 NYTSPHESSQKHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG- 349
Y S E K RF+VF++NL +I Q+N+ + +S+ LGL FADLT EFK R+ G
Sbjct: 61 AYKSVEE---KVHRFEVFRENLMHIDQRNN---EINSYWLGLNEFADLTHEEFKGRYLGL 114

Query: 350 ----FRNRPSPVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXX 517
F + P Y + D LP SVDWRK GAV PVKDQG CGSCWAFS+V
Sbjct: 115 AKPQFSRKRQPSANFRYRDITD---LPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVE 171

Query: 518 XXXXXXXXXLVSLSEQELVSCVHT-NFGCHGGLMNPA 625
L SLSEQEL+ C T N GC+GGLM+ A
Sbjct: 172 GINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208


>tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 463

Score = 151 bits (381), Expect = 4e-35
Identities = 92/207 (44%), Positives = 116/207 (56%), Gaps = 12/207 (5%)
Frame = +2

Query: 41 LCLSVLVLVICFASARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQK 205
L +VL L SA + Y+ DL +D +++L+E W +H K Y E K
Sbjct: 5 LLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGE---K 61

Query: 206 HSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGF------RNRPS 367
+RF VFK N YIHQ N+ Q S++LGL +FADL+ EFKA + G R S
Sbjct: 62 QNRFSVFKDNFLYIHQHNN--QGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSNS 119

Query: 368 PVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXL 547
P P +YS D + LP S+DWR+ GAVT VKDQG+CGSCWAFS+V L
Sbjct: 120 PSPRYQYS---DGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNL 176

Query: 548 VSLSEQELVSC-VHTNFGCHGGLMNPA 625
SLSEQELV C N GC+GGLM+ A
Sbjct: 177 TSLSEQELVDCDTSYNQGCNGGLMDYA 203


>tr|B4ESE7|B4ESE7_HORVD Papain-like cysteine proteinase OS=Hordeum
vulgare var. distichum GN=pap-5 PE=2 SV=1
Length = 351

Score = 150 bits (379), Expect = 6e-35
Identities = 95/207 (45%), Positives = 122/207 (58%), Gaps = 14/207 (6%)
Frame = +2

Query: 47 LSVLVLVICFAS--ARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQK 205
LSV VL++C + AR+ + Y+E+DL+S DRL++LFEKW KH K Y S E K
Sbjct: 5 LSVAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEE---K 61

Query: 206 HSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQE 385
RF+VFK NL I + N ++ +S+ LGL FADLT +EFK + G SP P +
Sbjct: 62 LHRFEVFKDNLKLIDEIN---REVTSYWLGLNEFADLTHDEFKTTYLGL----SPPPARR 114

Query: 386 YSSVC------DTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXL 547
SS LP +VDWRK GAVT VK+QG CGSCWAFS+V L
Sbjct: 115 SSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNL 174

Query: 548 VSLSEQELVSC-VHTNFGCHGGLMNPA 625
+LSEQEL+ C V N GC+GG+M+ A
Sbjct: 175 TALSEQELIDCSVDGNSGCNGGMMDYA 201


>tr|Q6F6A3|Q6F6A3_DAUCA Cysteine protease OS=Daucus carota
GN=DcCysP8 PE=2 SV=1
Length = 460

Score = 150 bits (378), Expect = 8e-35
Identities = 85/206 (41%), Positives = 121/206 (58%), Gaps = 7/206 (3%)
Frame = +2

Query: 29 IRGFLCLSVLVLVICFASARHLEYNEDDL--ASEDRLLQLFEKWATKHSKNYTSPHESSQ 202
I L LS+L + A + Y++ +++D ++ +E W KH K+Y + E Q
Sbjct: 4 ILSLLSLSLLAAAVTAADMSIITYDQTHAVGSTDDVIMAAYESWLVKHGKSYNALGEKEQ 63

Query: 203 KHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPL- 379
RFQ+FK N YI +QN+ K + S +LGL RFADLT E+++++ G R + S +
Sbjct: 64 ---RFQIFKDNFLYIDEQNAAKDR--SFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVS 118

Query: 380 ---QEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLV 550
Q Y+S+ + LP SVDWR+HGAV VKDQG CGSCWAFS++ L+
Sbjct: 119 GKSQRYASLAG-ESLPESVDWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLI 177

Query: 551 SLSEQELVSCVHT-NFGCHGGLMNPA 625
+LSEQELV C + N GC+GGLM+ A
Sbjct: 178 TLSEQELVDCDRSYNEGCNGGLMDDA 203


>tr|Q6ZHP9|Q6ZHP9_ORYSJ Os02g0715000 protein OS=Oryza sativa subsp.
japonica GN=OJ1191_G08.11 PE=2 SV=1
Length = 366

Score = 149 bits (377), Expect = 1e-34
Identities = 87/199 (43%), Positives = 118/199 (59%), Gaps = 10/199 (5%)
Frame = +2

Query: 53 VLVLVICFASARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRF 217
+L V C A+A H + Y+++DLA ++L+ LF W+ KHSK Y SP E K R+
Sbjct: 20 LLGFVACSATASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE---KVKRY 76

Query: 218 QVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQEYSSV 397
++FK+NL +I + N ++ S+ LGL FAD+ EFKA + G + + Q + S
Sbjct: 77 EIFKRNLRHIVETN---RRNGSYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGST 133

Query: 398 ----CDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQ 565
+ LP +VDWRK GAVTPVK+QG CGSCWAFS+V LVSLSEQ
Sbjct: 134 TFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQ 193

Query: 566 ELVSCVHT-NFGCHGGLMN 619
EL+ C +T N GC GGLM+
Sbjct: 194 ELMDCDNTFNHGCRGGLMD 212


>tr|A3AAP5|A3AAP5_ORYSJ Putative uncharacterized protein OS=Oryza
sativa subsp. japonica GN=OsJ_007867 PE=3 SV=1
Length = 357

Score = 149 bits (377), Expect = 1e-34
Identities = 87/199 (43%), Positives = 118/199 (59%), Gaps = 10/199 (5%)
Frame = +2

Query: 53 VLVLVICFASARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRF 217
+L V C A+A H + Y+++DLA ++L+ LF W+ KHSK Y SP E K R+
Sbjct: 11 LLGFVACSATASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE---KVKRY 67

Query: 218 QVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQEYSSV 397
++FK+NL +I + N ++ S+ LGL FAD+ EFKA + G + + Q + S
Sbjct: 68 EIFKRNLRHIVETN---RRNGSYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGST 124

Query: 398 ----CDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQ 565
+ LP +VDWRK GAVTPVK+QG CGSCWAFS+V LVSLSEQ
Sbjct: 125 TFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQ 184

Query: 566 ELVSCVHT-NFGCHGGLMN 619
EL+ C +T N GC GGLM+
Sbjct: 185 ELMDCDNTFNHGCRGGLMD 203


>tr|A2X8X3|A2X8X3_ORYSI Putative uncharacterized protein OS=Oryza
sativa subsp. indica GN=OsI_08685 PE=3 SV=1
Length = 357

Score = 149 bits (377), Expect = 1e-34
Identities = 87/199 (43%), Positives = 118/199 (59%), Gaps = 10/199 (5%)
Frame = +2

Query: 53 VLVLVICFASARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRF 217
+L V C A+A H + Y+++DLA ++L+ LF W+ KHSK Y SP E K R+
Sbjct: 11 LLGFVACSATASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE---KVKRY 67

Query: 218 QVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQEYSSV 397
++FK+NL +I + N ++ S+ LGL FAD+ EFKA + G + + Q + S
Sbjct: 68 EIFKRNLRHIVETN---RRNGSYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGST 124

Query: 398 ----CDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQ 565
+ LP +VDWRK GAVTPVK+QG CGSCWAFS+V LVSLSEQ
Sbjct: 125 TFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQ 184

Query: 566 ELVSCVHT-NFGCHGGLMN 619
EL+ C +T N GC GGLM+
Sbjct: 185 ELMDCDNTFNHGCRGGLMD 203


>tr|A9P285|A9P285_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 367

Score = 149 bits (375), Expect = 2e-34
Identities = 90/210 (42%), Positives = 114/210 (54%), Gaps = 17/210 (8%)
Frame = +2

Query: 47 LSVLVLVICFASARHL---EYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRF 217
L + +IC SA Y D+ S + L++LF++W +H K Y S HE +K R
Sbjct: 8 LLISATIICLVSAAKAVQHSYEVGDINSGNGLVRLFDRWLGRHGKLYGS-HE--EKARRL 64

Query: 218 QVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRP----------- 364
Q+F+ NL YIH N N SS RLGL +FADLT EFK R+FG ++
Sbjct: 65 QIFRTNLQYIHAHNKNSN--SSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRRTELEG 122

Query: 365 ---SPVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXX 535
PV Q S + + +S+DWRK GAVT VKDQ CGSCWAFS+
Sbjct: 123 AELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFIS 182

Query: 536 XXXLVSLSEQELVSCVHTNFGCHGGLMNPA 625
LVSLSEQELV+C TN+GC GG M+ A
Sbjct: 183 TGKLVSLSEQELVACDATNYGCEGGDMDYA 212


>tr|B4ESE6|B4ESE6_HORVD Papain-like cysteine proteinase OS=Hordeum
vulgare var. distichum GN=pap-4 PE=2 SV=1
Length = 356

Score = 147 bits (372), Expect = 4e-34
Identities = 93/204 (45%), Positives = 121/204 (59%), Gaps = 11/204 (5%)
Frame = +2

Query: 47 LSVLVLVICFAS--ARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQK 205
LS +L++C + AR+ + Y+E+DL+S +RL++LFEKW KH K Y S E K
Sbjct: 10 LSGALLLLCVGACVARNSDFSIVGYSEEDLSSNERLVELFEKWLAKHQKAYASFEE---K 66

Query: 206 HSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQE 385
RF+VFK NL +I + N ++ +S+ LGL FADLT +EFKA + G P+
Sbjct: 67 LHRFEVFKDNLKHIDKIN---REVTSYWLGLNEFADLTHDEFKAAYLGLDAAPARRGSSR 123

Query: 386 ---YSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSL 556
Y V LP SVDWRK GAVT VK+QG CGSCWAFS+V L +L
Sbjct: 124 SFRYEDV-SASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTAL 182

Query: 557 SEQELVSC-VHTNFGCHGGLMNPA 625
SEQEL+ C V N GC+GGLM+ A
Sbjct: 183 SEQELIDCSVDGNSGCNGGLMDYA 206


>tr|Q94HK7|Q94HK7_ORYSA Putative cysteine proteinase OS=Oryza sativa
PE=3 SV=1
Length = 358

Score = 146 bits (368), Expect = 1e-33
Identities = 86/183 (46%), Positives = 108/183 (59%), Gaps = 7/183 (3%)
Frame = +2

Query: 98 YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQNLAYIHQQNSNKQKE 277
Y+E+DLAS DRL++LFEKW K+ K Y S E K RF+VFK NL +I N +K
Sbjct: 36 YSEEDLASHDRLIELFEKWVAKYRKAYASFEE---KVRRFEVFKDNLNHIDDIN---KKV 89

Query: 278 SSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQEYSSV------CDTKKLPASVDWRK 439
+S+ LGL FADLT +EFKA + G P+ + YSS ++P +DWRK
Sbjct: 90 TSYWLGLNEFADLTHDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRK 149

Query: 440 HGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSC-VHTNFGCHGGLM 616
AVT VK+QG CGSCWAFS+V L SLSEQEL+ C N GC+GGLM
Sbjct: 150 KNAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLM 209

Query: 617 NPA 625
+ A
Sbjct: 210 DYA 212