DK951843
Clone id TST38A01NGRL0012_H16
Library
Length 508
Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0012_H16. 5' end sequence.
Accession
Tissue type prothallia
Developmental stage gametophyte
Contig ID -
Sequence
GCCACAGTTGCAATTCCATCACAAGCATATGTATATGGTGAACCGGAGGAACAAACGGGA
TCAGAAAATAGTCGGGGAGATTCGTTGGAAAATTCGGCAGGGGTGAATCATATAAAGCCC
AGTCCTAGCAGTGTTAAAGCCAAGATCAAAGCCTTTGAAACAACCAGGCCAGAGGGTGCC
ACCAAACAAGGCCTTACACTCCCTTCAACCGTTGAGGTGCACATGTCGAGCTCTCAATTC
AGAAGCTCTCCAAGAAGAAATGCTGAATGTGAGGCAGAGCAGGCAGAATTACCCAGAAAG
CATCTTAAACTCATGGATGCCGATAGAACAGGGAGTGAAGAAGTTGAATCTGGAAATGGA
ATGGCAAGTGTGGAACCTTTCAAGACGGGTCCTGAGAGTGATTTGGCTGTTTCAGGGGCT
TCAGGAGCAGATTCTTGTAATAAAGAAAAGGATTGTTTGCCTGAAAGTGAAGGTAAAGAG
ACAGAACGCAGTCTGGAGGCTGTAACCA
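
The alignments reported below are all in Frame = +1, meaning the query is translated starting from its first nucleotide. As a minimal sketch (not the code BLASTX itself uses), the following translates the first 60 nucleotides of the sequence above with the standard genetic code. The names `translate` and `CODON_TABLE` are illustrative and not part of the record.

```python
# Standard genetic code, with codons enumerated in base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt, frame=1):
    """Translate a nucleotide string in forward frame 1, 2 or 3."""
    start = frame - 1
    codons = (nt[i:i + 3] for i in range(start, len(nt) - 2, 3))
    # Codons containing anything other than A, C, G, T become "X".
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 60 nt of the sequence in this record.
first_line = "GCCACAGTTGCAATTCCATCACAAGCATATGTATATGGTGAACCGGAGGAACAAACGGGA"
print(translate(first_line))  # ATVAIPSQAYVYGEPEEQTG
```

The fourth residue onward (AIPSQAYVYGEPEEQTG) is the text shown on the Query rows that start at nucleotide 10 in the alignments below, which is consistent with the +1 frame.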
Homology search results
sp_hit_id Q8NI08
Definition sp|Q8NI08|NCOA7_HUMAN Nuclear receptor coactivator 7 OS=Homo sapiens
Align length 167
Score (bit) 37.0
E-value 0.054
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK951843|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0012_H16, 5'
(508 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters




Score E
Sequences producing significant alignments: (bits) Value

sp|Q8NI08|NCOA7_HUMAN Nuclear receptor coactivator 7 OS=Homo sap... 37 0.054
sp|Q13428|TCOF_HUMAN Treacle protein OS=Homo sapiens GN=TCOF1 PE... 37 0.070
sp|Q8BHB9|CLIC6_MOUSE Chloride intracellular channel protein 6 O... 36 0.12
sp|P98193|DMP1_RAT Dentin matrix acidic phosphoprotein 1 OS=Ratt... 34 0.35
sp|P19334|TRP_DROME Transient receptor potential protein OS=Dros... 34 0.46
sp|Q8N806|UBR7_HUMAN Putative E3 ubiquitin-protein ligase UBR7 O... 33 0.59
sp|P12297|SUWA_DROME Protein suppressor of white apricot OS=Dros... 33 0.78
sp|P0C7A5|SEMG2_PONPY Semenogelin-2 OS=Pongo pygmaeus GN=SEMG2 P... 33 1.0
sp|P0C7A4|SEMG2_PONAB Semenogelin-2 OS=Pongo abelii GN=SEMG2 PE=... 33 1.0
sp|Q55242|SACB_STRSL Levansucrase OS=Streptococcus salivarius GN... 33 1.0
sp|Q8TEK3|DOT1L_HUMAN Histone-lysine N-methyltransferase, H3 lys... 33 1.0
sp|O95568|CA156_HUMAN UPF0558 protein C1orf156 OS=Homo sapiens G... 32 1.3
sp|P46285|S17P_WHEAT Sedoheptulose-1,7-bisphosphatase, chloropla... 32 1.7
sp|Q9USH9|YJQ1_SCHPO Uncharacterized ABC transporter ATP-binding... 32 1.7
sp|P27123|NASP_RABIT Nuclear autoantigenic sperm protein (Fragme... 32 1.7
sp|A6SIJ6|END3_BOTFB Actin cytoskeleton-regulatory complex prote... 32 1.7
sp|Q9Z0P4|PALM_MOUSE Paralemmin OS=Mus musculus GN=Palm PE=1 SV=1 32 2.3
sp|Q6GQX2|K1602_MOUSE Uncharacterized protein KIAA1602 homolog O... 32 2.3
sp|Q7LL22|YF71_SCHPO Uncharacterized protein C3G9.01 OS=Schizosa... 31 2.9
sp|Q96AY4|TTC28_HUMAN Tetratricopeptide repeat protein 28 OS=Hom... 31 2.9
sp|Q6CDX0|SMI1_YARLI KNR4/SMI1 homolog OS=Yarrowia lipolytica GN... 31 2.9
sp|O60437|PEPL_HUMAN Periplakin OS=Homo sapiens GN=PPL PE=1 SV=2 31 2.9
sp|Q926W4|NOC_LISIN Nucleoid occlusion protein OS=Listeria innoc... 31 2.9
sp|Q02383|SEMG2_HUMAN Semenogelin-2 OS=Homo sapiens GN=SEMG2 PE=... 31 3.9
sp|Q9VFS5|PP4R3_DROME Serine/threonine-protein phosphatase 4 reg... 31 3.9
sp|Q9I7U4|TITIN_DROME Titin OS=Drosophila melanogaster GN=sls PE... 30 5.0
sp|Q68D10|SPT2_HUMAN Protein SPT2 homolog OS=Homo sapiens GN=SPT... 30 5.0
sp|Q6CHN0|SLA1_YARLI Actin cytoskeleton-regulatory complex prote... 30 5.0
sp|Q68FE6|FA65A_MOUSE Protein FAM65A OS=Mus musculus GN=Fam65a P... 30 5.0
sp|A7E7N7|END3_SCLS1 Actin cytoskeleton-regulatory complex prote... 30 5.0
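
Each line of the hit list above carries a database prefix, an accession, an entry name, a (possibly truncated) description, a bit score and an E-value. The sketch below shows one way such a line could be split into fields, assuming the single-space layout seen in this record. `HIT_LINE` and `parse_hit` are hypothetical names, and the handling of E-values written without a leading digit (such as `e-100`) is a precaution for legacy BLAST output that does not occur in this report.

```python
import re

# db | accession | entry name, free-text description, bit score, E-value.
HIT_LINE = re.compile(
    r"^(sp|tr)\|([A-Z0-9]+)\|(\S+)\s+(.*\S)\s+(\d+)\s+(\S+)$"
)


def parse_hit(line):
    """Return the fields of one hit-list line, or None if it does not match."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    db, accession, entry, description, bits, evalue = m.groups()
    # Precaution: very small E-values may be printed as "e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return {
        "db": db,
        "accession": accession,
        "entry": entry,
        "description": description,
        "bits": int(bits),
        "evalue": float(evalue),
    }


hit = parse_hit(
    "sp|Q8NI08|NCOA7_HUMAN Nuclear receptor coactivator 7 OS=Homo sap... 37 0.054"
)
print(hit["accession"], hit["bits"], hit["evalue"])
```

Because descriptions are cut off with `...` in the hit list, the full definition has to be taken from the `>` header of the corresponding alignment block rather than from these lines.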

>sp|Q8NI08|NCOA7_HUMAN Nuclear receptor coactivator 7 OS=Homo
sapiens GN=NCOA7 PE=1 SV=2
Length = 942

Score = 37.0 bits (84), Expect = 0.054
Identities = 43/167 (25%), Positives = 66/167 (39%), Gaps = 25/167 (14%)
Frame = +1

Query: 40 EPEEQTGSENSRGDS-----LENSAGVNHIKPSPSSVKAKIKAFETTRPEGATKQGLTLP 204
E +Q G + DS LE S G KPS SSV K+K +++R +T
Sbjct: 331 EKRQQNGEKIMTSDSRPIVPLEKSTGHTPTKPSGSSVSEKLKKLDSSRETSHGSPTVTKL 390

Query: 205 STVEVHMSSSQFRSSPRRNAECE----AEQAELPR------------KHLKLMDADRTGS 336
S E +SS F S+ + N E + EL K +D +
Sbjct: 391 SK-EPSDTSSAFESTAKENFLGEDDDFVDLEELSSQTGGGMHKKDTLKECLSLDPEERKK 449

Query: 337 EEVESGNGMASVEPFKT----GPESDLAVSGASGADSCNKEKDCLPE 465
E + N ++ G E+D+ + GA ++C K+ D +PE
Sbjct: 450 AESQINNSAVEMQVQSALAFLGTENDVELKGALDLETCEKQ-DIMPE 495
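
The statistics line of the alignment above gives identities, positives and gaps as counts over the alignment length, followed by a percentage. The following sketch (with the hypothetical names `STAT` and `parse_stats`) extracts those counts and recomputes the percentages. In this report the printed values agree with integer truncation rather than rounding: 43/167 is about 25.7% but is printed as 25%.

```python
import re

# Matches e.g. "Identities = 43/167 (25%)"; the Gaps field may be absent.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Map each statistic name to (count, alignment length, printed percent)."""
    stats = {}
    for name, num, den, pct in STAT.findall(line):
        stats[name.lower()] = (int(num), int(den), int(pct))
    return stats


line = "Identities = 43/167 (25%), Positives = 66/167 (39%), Gaps = 25/167 (14%)"
for name, (num, den, pct) in parse_stats(line).items():
    # Integer division reproduces the truncated percentage printed by BLAST.
    print(name, f"{num}/{den}", pct, 100 * num // den)
```

Keeping the raw counts rather than the printed percentage avoids losing precision when hits are later filtered by identity.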


>sp|Q13428|TCOF_HUMAN Treacle protein OS=Homo sapiens GN=TCOF1 PE=1
SV=2
Length = 1488

Score = 36.6 bits (83), Expect = 0.070
Identities = 36/162 (22%), Positives = 72/162 (44%), Gaps = 8/162 (4%)
Frame = +1

Query: 10 AIPSQAYVYGEPEEQTGSENSRGDSLENSAGVNHIKPSPSS--VKAKIKAFETTRPEGA- 180
A+ + A PEE +GS DS E + + KPS + ++A + + + +GA
Sbjct: 853 AVATAAQAQTGPEEDSGSSEEESDSEEEAETLAQAKPSGKTHQIRAALAPAKESPRKGAA 912

Query: 181 -TKQGLTLPSTVEVHMSSSQFRSSPRRNAECEA----EQAELPRKHLKLMDADRTGSEEV 345
T G T PS + SS +++ EA A++ + L +D +R+ +
Sbjct: 913 PTPPGKTGPSAAQAGKQDDSGSSSEESDSDGEAPAAVTSAQVIKPPLIFVDPNRSPAGPA 972

Query: 346 ESGNGMASVEPFKTGPESDLAVSGASGADSCNKEKDCLPESE 471
+ A + T ++ + S A + S ++++D +P ++
Sbjct: 973 AT---PAQAQAASTPRKARASESTARSSSSESEDEDVIPATQ 1011



Score = 30.0 bits (66), Expect = 6.6
Identities = 33/160 (20%), Positives = 60/160 (37%), Gaps = 13/160 (8%)
Frame = +1

Query: 37 GEPEEQTGSENSRGDSLENSAGVNHIKPSPSSVKAKIKAFETTRPEGATKQGLTLPSTVE 216
G+ EE + S + DS E + + PS +++A E K P
Sbjct: 404 GKREEDSQSSSEESDSEEEAPA----QAKPSGKAPQVRAASAPAKESPRKGAAPAPPRKT 459

Query: 217 VHMSSSQFRSSPRRNAECEAEQAELPRKHLKLMDADRTG----SEEVES---------GN 357
++ ++ +E+++ R+ L M+A + S +V+ G
Sbjct: 460 GPAAAQVQVGKQEEDSRSSSEESDSDREALAAMNAAQVKPLGKSPQVKPASTMGMGPLGK 519

Query: 358 GMASVEPFKTGPESDLAVSGASGADSCNKEKDCLPESEGK 477
G V P K GP + A G DS + ++ S+G+
Sbjct: 520 GAGPVPPGKVGPATPSAQVGKWEEDSESSSEESSDSSDGE 559


>sp|Q8BHB9|CLIC6_MOUSE Chloride intracellular channel protein 6
OS=Mus musculus GN=Clic6 PE=2 SV=1
Length = 596

Score = 35.8 bits (81), Expect = 0.12
Identities = 35/120 (29%), Positives = 51/120 (42%), Gaps = 7/120 (5%)
Frame = +1

Query: 166 RPEGATKQGLTLPSTVEVHMSSSQFRSSPRRNAECEAEQAELPRKHLKLMDADRTGSEE- 342
+PEGAT +G P ++ E E AE PR ++A +G EE
Sbjct: 17 QPEGATIEGPGEPGAADLE------------GREASEEAAEAPRDLGAGVEARASGKEEG 64

Query: 343 ---VESGNGMASVEPFKTGPESDLAVSGASGADSCNKEKDCLPES---EGKETERSLEAV 504
+ G G A + +TGPE++ GASGA + + PE +G E S + V
Sbjct: 65 GCGQDEGTGGAQAQDPRTGPEAE--TPGASGAPGEAEAAERDPEGAIPQGAEEAPSAQQV 122


>sp|P98193|DMP1_RAT Dentin matrix acidic phosphoprotein 1 OS=Rattus
norvegicus GN=Dmp1 PE=2 SV=1
Length = 489

Score = 34.3 bits (77), Expect = 0.35
Identities = 31/139 (22%), Positives = 59/139 (42%), Gaps = 26/139 (18%)
Frame = +1

Query: 37 GEPEEQTGSEN-------SRGDSLENSAGVNHIKPSPSSVKAKIKAFETTRPEGATKQG- 192
GEP +++ SE+ SRGD+ +N++ + S SS + ++ F ++ + +QG
Sbjct: 315 GEPSQESSSESQEGVASESRGDNPDNTSQTGDQRDSESSEEDRLNTFSSSESQSTEEQGD 374

Query: 193 ------LTL------------PSTVEVHMSSSQFRSSPRRNAECEAEQAELPRKHLKLMD 318
L+L S+ E S S R S + ++ E + + D
Sbjct: 375 SESNESLSLSEESQESAQDEDSSSQEGLQSQSASRESRSQESQSEQDSRSEENRDSDSQD 434

Query: 319 ADRTGSEEVESGNGMASVE 375
+ R+ E +G+ +S E
Sbjct: 435 SSRSKEESNSTGSTSSSEE 453


>sp|P19334|TRP_DROME Transient receptor potential protein
OS=Drosophila melanogaster GN=trp PE=1 SV=3
Length = 1275

Score = 33.9 bits (76), Expect = 0.46
Identities = 34/143 (23%), Positives = 58/143 (40%), Gaps = 1/143 (0%)
Frame = +1

Query: 37 GEPEEQT-GSENSRGDSLENSAGVNHIKPSPSSVKAKIKAFETTRPEGATKQGLTLPSTV 213
G P+ Q G+ + G+S + A KP + A K E+ +PE A K+ + +
Sbjct: 1040 GAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAP-KPGESAKPEAAAKKEESSKTEA 1098

Query: 214 EVHMSSSQFRSSPRRNAECEAEQAELPRKHLKLMDADRTGSEEVESGNGMASVEPFKTGP 393
+++ S +A +A+ KL E ++ NG + + K+GP
Sbjct: 1099 SKPAATNGAAKSAAPSAPSDAKPDS------KLKPGAAGAPEATKATNGASKPDEKKSGP 1152

Query: 394 ESDLAVSGASGADSCNKEKDCLP 462
E +G S K+KD P
Sbjct: 1153 EEPKKAAGDSKPGDDAKDKDKKP 1175


>sp|Q8N806|UBR7_HUMAN Putative E3 ubiquitin-protein ligase UBR7
OS=Homo sapiens GN=UBR7 PE=1 SV=2
Length = 425

Score = 33.5 bits (75), Expect = 0.59
Identities = 24/80 (30%), Positives = 34/80 (42%), Gaps = 1/80 (1%)
Frame = +1

Query: 121 SPSSVKAK-IKAFETTRPEGATKQGLTLPSTVEVHMSSSQFRSSPRRNAECEAEQAELPR 297
S SVK + + A T PEG G+ L + E H S F +RN C+ ++
Sbjct: 49 SQGSVKRQALYACSTCTPEGEEPAGICLACSYECHGSHKLFELYTKRNFRCDCGNSKFKN 108

Query: 298 KHLKLMDADRTGSEEVESGN 357
KL+ +V SGN
Sbjct: 109 LECKLL----PDKAKVNSGN 124


>sp|P12297|SUWA_DROME Protein suppressor of white apricot
OS=Drosophila melanogaster GN=su(w[a]) PE=1 SV=3
Length = 963

Score = 33.1 bits (74), Expect = 0.78
Identities = 37/152 (24%), Positives = 58/152 (38%), Gaps = 15/152 (9%)
Frame = +1

Query: 43 PEEQTGSENSRGDSLENSAGVNHIKPS-PSSVKAKIKAFE--------------TTRPEG 177
P+E + E S N+AGV H++P P SV+ IK E T P
Sbjct: 593 PQEASDEETS-----SNAAGVEHVRPGMPDSVQRAIKQVETQLLARTAGQKGNITASPSC 647

Query: 178 ATKQGLTLPSTVEVHMSSSQFRSSPRRNAECEAEQAELPRKHLKLMDADRTGSEEVESGN 357
++ Q + V +Q +Q +L RK L ++ E G+
Sbjct: 648 SSPQKEQRQAEERVKDKLAQIAREKLNGMISREKQLQLERKRKALAFLNQIKGEGAIVGS 707

Query: 358 GMASVEPFKTGPESDLAVSGASGADSCNKEKD 453
+ V GP + +GA+ ADS ++ D
Sbjct: 708 AVPVV-----GPNPPESAAGAATADSGDESGD 734


>sp|P0C7A5|SEMG2_PONPY Semenogelin-2 OS=Pongo pygmaeus GN=SEMG2 PE=2
SV=1
Length = 581

Score = 32.7 bits (73), Expect = 1.0
Identities = 42/178 (23%), Positives = 74/178 (41%), Gaps = 13/178 (7%)
Frame = +1

Query: 7 VAIPSQAYVYGEPEEQTGSENSRGDSLENSAGVNHIKPSPSSVKAKIKAFETTRPEGATK 186
V IPSQA YG E + ++S + ++G I+ S I+ E + G ++
Sbjct: 389 VRIPSQAQEYGHKENKISYQSSSTEERRLNSGEKDIQKGVSKGSISIQTEE--KIHGKSQ 446

Query: 187 QGLTLPSTVEVH------MSSSQFRSSPRR------NAECEAEQAELPRKHLKLMDADRT 330
+T+PS + H MS + RR N + + Q+ + + KL++
Sbjct: 447 DQVTIPSQDQEHGHKENKMSYQSSSTEERRLNYGGKNTQKDVSQSSISFQTEKLVE---- 502

Query: 331 GSEEVESGNGMASVEPFKTGPESDLAVSGASGADSCNKEKDCLP-ESEGKETERSLEA 501
G ++++ N S G SG S ++E+D L E +G+ + S A
Sbjct: 503 GKSQIQTPNP-------NQDQWSGQNAKGKSG-QSADREQDLLSHEQKGRYQQESSAA 552


>sp|P0C7A4|SEMG2_PONAB Semenogelin-2 OS=Pongo abelii GN=SEMG2 PE=3
SV=1
Length = 581

Score = 32.7 bits (73), Expect = 1.0
Identities = 42/178 (23%), Positives = 74/178 (41%), Gaps = 13/178 (7%)
Frame = +1

Query: 7 VAIPSQAYVYGEPEEQTGSENSRGDSLENSAGVNHIKPSPSSVKAKIKAFETTRPEGATK 186
V IPSQA YG E + ++S + ++G I+ S I+ E + G ++
Sbjct: 389 VRIPSQAQEYGHKENKISYQSSSTEERRLNSGEKDIQKGVSKGSISIQTEE--KIHGKSQ 446

Query: 187 QGLTLPSTVEVH------MSSSQFRSSPRR------NAECEAEQAELPRKHLKLMDADRT 330
+T+PS + H MS + RR N + + Q+ + + KL++
Sbjct: 447 DQVTIPSQDQEHGHKENKMSYQSSSTEERRLNYGGKNTQKDVSQSSISFQTEKLVE---- 502

Query: 331 GSEEVESGNGMASVEPFKTGPESDLAVSGASGADSCNKEKDCLP-ESEGKETERSLEA 501
G ++++ N S G SG S ++E+D L E +G+ + S A
Sbjct: 503 GKSQIQTPNP-------NQDQWSGQNAKGKSG-QSADREQDLLSHEQKGRYQQESSAA 552


>sp|Q55242|SACB_STRSL Levansucrase OS=Streptococcus salivarius
GN=ftf PE=3 SV=1
Length = 969

Score = 32.7 bits (73), Expect = 1.0
Identities = 38/145 (26%), Positives = 53/145 (36%), Gaps = 1/145 (0%)
Frame = +1

Query: 4 TVAIPSQAYVYGEPEEQTGSENSRGDSLENSAGVNHIKPSPSSVKAKIKAFETTRPEGAT 183
T P+ A PE T S ++ + A ++ S + + K T+P T
Sbjct: 56 TETAPAVATATATPETSTASLTVASETATSVATSEAVESSVAHSEVATKPVTETQPSNTT 115

Query: 184 KQGLTLPSTVEVHMSSSQFRSSPRRNAECEAEQAELPRKHLKLMDAD-RTGSEEVESGNG 360
PS VE SS+ SS A P + + A T VE+
Sbjct: 116 ------PSVVEEKASSTVVTSS---------SDATTPSATVAAVSAPAHTSEAAVEAPTS 160

Query: 361 MASVEPFKTGPESDLAVSGASGADS 435
AS E T E DL VS S A++
Sbjct: 161 TASSEAADTHTEVDLKVSENSAANA 185


tr_hit_id Q1JT76
Definition tr|Q1JT76|Q1JT76_TOXGO Calpain-7, putative OS=Toxoplasma gondii RH
Align length 109
Score (bit) 39.3
E-value 0.11
Report
BLASTX 2.2.19 [Nov-02-2008]


Query= DK951843|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0012_H16, 5'
(508 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters




Score E
Sequences producing significant alignments: (bits) Value

tr|Q1JT76|Q1JT76_TOXGO Calpain-7, putative OS=Toxoplasma gondii ... 39 0.11
tr|B6KQR0|B6KQR0_TOXGO Calpain family cysteine protease domain-c... 39 0.11
tr|Q9U5D8|Q9U5D8_9HEMI Vitellogenin-1 OS=Plautia stali GN=Vg-1 P... 39 0.15
tr|A0DYV0|A0DYV0_PARTE Chromosome undetermined scaffold_7, whole... 39 0.15
tr|B6LN36|B6LN36_BRAFL Putative uncharacterized protein OS=Branc... 39 0.19
tr|Q1RM01|Q1RM01_DANRE Zgc:136639 OS=Danio rerio GN=zgc:136639 P... 38 0.25
tr|B0S8H1|B0S8H1_DANRE Novel protein similar to vertebrate myelo... 38 0.25
tr|A3A568|A3A568_ORYSJ Putative uncharacterized protein OS=Oryza... 38 0.25
tr|Q59FZ2|Q59FZ2_HUMAN TCOF1 protein variant (Fragment) OS=Homo ... 38 0.25
tr|B4DRA2|B4DRA2_HUMAN cDNA FLJ57828, highly similar to Treacle ... 38 0.25
tr|A0JLU0|A0JLU0_HUMAN TCOF1 protein (Fragment) OS=Homo sapiens ... 38 0.25
tr|Q2HAN2|Q2HAN2_CHAGB Putative uncharacterized protein OS=Chaet... 38 0.32
tr|B0WEX5|B0WEX5_CULQU Polypeptide of 976 aa OS=Culex quinquefas... 37 0.42
tr|A8NEM8|A8NEM8_COPC7 Putative uncharacterized protein OS=Copri... 37 0.42
tr|B4MDN9|B4MDN9_DROVI GJ16287 OS=Drosophila virilis GN=GJ16287 ... 37 0.55
tr|B3L9Y6|B3L9Y6_PLAKH Putative uncharacterized protein OS=Plasm... 37 0.55
tr|Q2S466|Q2S466_SALRD Putative uncharacterized protein OS=Salin... 37 0.72
tr|A2X335|A2X335_ORYSI Putative uncharacterized protein OS=Oryza... 37 0.72
tr|A5K3X9|A5K3X9_PLAVI Putative uncharacterized protein OS=Plasm... 37 0.72
tr|B4E111|B4E111_HUMAN cDNA FLJ57346, highly similar to Homo sap... 37 0.72
tr|B3CLH4|B3CLH4_WOLPP Putative uncharacterized protein OS=Wolba... 36 0.94
tr|B6Y913|B6Y913_9RICK Putative uncharacterized protein OS=Wolba... 36 0.94
tr|Q4DYU5|Q4DYU5_TRYCR Mucin-associated surface protein (MASP), ... 36 0.94
tr|Q0CLI7|Q0CLI7_ASPTN Putative uncharacterized protein OS=Asper... 36 0.94
tr|B2AYD6|B2AYD6_PODAN Predicted CDS Pa_1_10750 OS=Podospora ans... 36 0.94
tr|Q8H8Q5|Q8H8Q5_ORYSJ Putative uncharacterized protein OSJNBa00... 36 1.2
tr|Q962H8|Q962H8_TOXGO Membrane skeleton protein IMC2A OS=Toxopl... 36 1.2
tr|A7E6A1|A7E6A1_SCLS1 Putative uncharacterized protein OS=Scler... 36 1.2
tr|Q4DXA6|Q4DXA6_TRYCR Mucin-associated surface protein (MASP), ... 35 1.6
tr|Q22SA1|Q22SA1_TETTH Putative uncharacterized protein OS=Tetra... 35 1.6

>tr|Q1JT76|Q1JT76_TOXGO Calpain-7, putative OS=Toxoplasma gondii RH
GN=TgIa.1460 PE=3 SV=1
Length = 2101

Score = 39.3 bits (90), Expect = 0.11
Identities = 32/109 (29%), Positives = 48/109 (44%), Gaps = 3/109 (2%)
Frame = +1

Query: 139 AKIKAFETTRPEGATKQGLT-LPSTVEVHMSSSQFRSSPRRNAECEAEQAELPRKH--LK 309
+K ++ R T+ GL+ P+TVE ++ + S A CEA E PRK ++
Sbjct: 575 SKSQSDSVVRRSEGTRAGLSSTPATVEGTPREAKRQGSRGSEARCEAADGE-PRKRGGIR 633

Query: 310 LMDADRTGSEEVESGNGMASVEPFKTGPESDLAVSGASGADSCNKEKDC 456
+AD G EE E AS +P P + G +G + K C
Sbjct: 634 AREADEPGKEEEEVSADAASEKPVDGTPGGVSSPEGRAGQGTQEPGKKC 682


>tr|B6KQR0|B6KQR0_TOXGO Calpain family cysteine protease
domain-containing protein OS=Toxoplasma gondii ME49
GN=TGME49_093820 PE=4 SV=1
Length = 2196

Score = 39.3 bits (90), Expect = 0.11
Identities = 32/109 (29%), Positives = 48/109 (44%), Gaps = 3/109 (2%)
Frame = +1

Query: 139 AKIKAFETTRPEGATKQGLT-LPSTVEVHMSSSQFRSSPRRNAECEAEQAELPRKH--LK 309
+K ++ R T+ GL+ P+TVE ++ + S A CEA E PRK ++
Sbjct: 575 SKSQSDSVVRRSEGTRAGLSSTPATVEGTPREAKRQGSRGSEARCEAADGE-PRKRGGIR 633

Query: 310 LMDADRTGSEEVESGNGMASVEPFKTGPESDLAVSGASGADSCNKEKDC 456
+AD G EE E AS +P P + G +G + K C
Sbjct: 634 AREADEPGKEEEEVSADAASEKPVDGTPGGVSSPEGRAGQGTQEPGKKC 682


>tr|Q9U5D8|Q9U5D8_9HEMI Vitellogenin-1 OS=Plautia stali GN=Vg-1 PE=2
SV=1
Length = 1907

Score = 38.9 bits (89), Expect = 0.15
Identities = 33/145 (22%), Positives = 56/145 (38%), Gaps = 1/145 (0%)
Frame = +1

Query: 28 YVYGEPEEQTGSENSRGDSLENSAGVNHIKPSPSSVKAKIKAFETTRPEGATKQGLTLPS 207
Y YG+P+ SE S S +S+ + + S SS +P AT G S
Sbjct: 324 YEYGKPQNGESSEESSSSSSSSSSSSSSSESSSSSSDESNWNQSLRKPASATSTGSLSSS 383

Query: 208 TVEVHMSSSQFRSSPRRNAECEAEQAELPRKHLKLMDAD-RTGSEEVESGNGMASVEPFK 384
+ V SSS + ++ P H ++ + S S + +S P +
Sbjct: 384 SSSVSSSSSSSEETNYNGGLKRKTRSVTPPPHSSSSSSESSSASSSSSSSSDESSNSPIR 443

Query: 385 TGPESDLAVSGASGADSCNKEKDCL 459
G S + S +S + S + ++ L
Sbjct: 444 QGASSSSSSSSSSESSSISSSEEYL 468


>tr|A0DYV0|A0DYV0_PARTE Chromosome undetermined scaffold_7, whole
genome shotgun sequence OS=Paramecium tetraurelia
GN=GSPATT00003185001 PE=4 SV=1
Length = 508

Score = 38.9 bits (89), Expect = 0.15
Identities = 25/88 (28%), Positives = 44/88 (50%)
Frame = +1

Query: 82 SLENSAGVNHIKPSPSSVKAKIKAFETTRPEGATKQGLTLPSTVEVHMSSSQFRSSPRRN 261
+LE+S NH++P+ + K ++K F+ P ++ L PS + + + Q +SSP R
Sbjct: 302 NLEHSRN-NHLQPTTHTPKQELKIFQNRFPSNSSNLELVQPS-IPITNNQQQIQSSPIRI 359

Query: 262 AECEAEQAELPRKHLKLMDADRTGSEEV 345
LP + K +ADR E++
Sbjct: 360 QSQVVPPQYLPINNPKQQEADRLRQEQI 387


>tr|B6LN36|B6LN36_BRAFL Putative uncharacterized protein
OS=Branchiostoma floridae GN=BRAFLDRAFT_120071 PE=4 SV=1
Length = 618

Score = 38.5 bits (88), Expect = 0.19
Identities = 40/150 (26%), Positives = 58/150 (38%)
Frame = +1

Query: 37 GEPEEQTGSENSRGDSLENSAGVNHIKPSPSSVKAKIKAFETTRPEGATKQGLTLPSTVE 216
G+ E QT + G + SP S K KIK + ++ + T+ + T E
Sbjct: 249 GQMEPQTPGKTVNGGDMS----------SPVSKKKKIKKQQQSQEQSVTESPKSSKQTPE 298

Query: 217 VHMSSSQFRSSPRRNAECEAEQAELPRKHLKLMDADRTGSEEVESGNGMASVEPFKTGPE 396
+ S SSP NAE ++ QA P S N V+ P
Sbjct: 299 KQLDSDLKLSSP--NAEAKSNQATTP---------------SPVSKNKRKIVQATPEHPL 341

Query: 397 SDLAVSGASGADSCNKEKDCLPESEGKETE 486
SDL VS + + NK D L + + K+ E
Sbjct: 342 SDLPVSPKQDSAASNKTTDALLKGDLKQVE 371


>tr|Q1RM01|Q1RM01_DANRE Zgc:136639 OS=Danio rerio GN=zgc:136639 PE=2
SV=1
Length = 570

Score = 38.1 bits (87), Expect = 0.25
Identities = 40/140 (28%), Positives = 56/140 (40%), Gaps = 3/140 (2%)
Frame = +1

Query: 40 EPEEQTGSENSRGDSLENSAGVNHIKPSPSSVKAKIKAFETTRPEGATKQGLTLPSTVE- 216
E + Q S+ S + +S SPSS + + P QG TL S VE
Sbjct: 360 EVKRQPDSDESNSEDEASSKSEQSAPSSPSSSSSSSSSDSDFEPSQKQGQG-TLRSMVED 418

Query: 217 VHMSSSQFRSSPRRNAECEAEQAELPRKHLKL--MDADRTGSEEVESGNGMASVEPFKTG 390
+H S SS +E E P H MD++ G+EE + A K
Sbjct: 419 MHSEGSDDDSS----SEVETPMKTTPFNHDSRLSMDSESDGNEESRPPSQEAPSPSLKLS 474

Query: 391 PESDLAVSGASGADSCNKEK 450
++L + G DSCN+EK
Sbjct: 475 -SANLKMLGKKSPDSCNREK 493


>tr|B0S8H1|B0S8H1_DANRE Novel protein similar to vertebrate
myeloid/lymphoid or mixed-lineage leukemia (Trithorax
homolog, Drosophila); translocated to, 1 (MLLT1,
zgc:136639) OS=Danio rerio GN=DKEY-5I22.1 PE=4 SV=1
Length = 567

Score = 38.1 bits (87), Expect = 0.25
Identities = 40/140 (28%), Positives = 56/140 (40%), Gaps = 3/140 (2%)
Frame = +1

Query: 40 EPEEQTGSENSRGDSLENSAGVNHIKPSPSSVKAKIKAFETTRPEGATKQGLTLPSTVE- 216
E + Q S+ S + +S SPSS + + P QG TL S VE
Sbjct: 357 EVKRQPDSDESNSEDEASSKSEQSAPSSPSSSSSSSSSDSDFEPSQKQGQG-TLRSMVED 415

Query: 217 VHMSSSQFRSSPRRNAECEAEQAELPRKHLKL--MDADRTGSEEVESGNGMASVEPFKTG 390
+H S SS +E E P H MD++ G+EE + A K
Sbjct: 416 MHSEGSDDDSS----SEVETPMKTTPFNHDSRLSMDSESDGNEESRPPSQEAPSPSLKLS 471

Query: 391 PESDLAVSGASGADSCNKEK 450
++L + G DSCN+EK
Sbjct: 472 -SANLKMLGKKSPDSCNREK 490


>tr|A3A568|A3A568_ORYSJ Putative uncharacterized protein OS=Oryza
sativa subsp. japonica GN=OsJ_005940 PE=4 SV=1
Length = 1777

Score = 38.1 bits (87), Expect = 0.25
Identities = 37/170 (21%), Positives = 72/170 (42%), Gaps = 20/170 (11%)
Frame = +1

Query: 58 GSENSRGDSLENSAGVNHIKPSPSSVKAKIKAFETTRPEGA---------TKQGLTLPST 210
GS +S ENS+ N + SP++ A KA ++ P + + + P T
Sbjct: 1307 GSPAISSNSAENSSNPNSLSASPATTPAAAKAVLSSAPIASQTVRKALSYKEVAIAAPGT 1366

Query: 211 VEVHMSSSQFRSSPRRNAECEAEQAELPRK---HL--------KLMDADRTGSEEVESGN 357
+ ++ +Q +A E A+ P++ HL ++ D T E+G
Sbjct: 1367 LVKALNDAQTEEKDATDAGANIETAKAPKESNGHLSKEKDGAVQVSPKDSTSQGSKETGE 1426

Query: 358 GMASVEPFKTGPESDLAVSGASGADSCNKEKDCLPESEGKETERSLEAVT 507
G +S E + ++G++ +++ ++K L S+ + +SL VT
Sbjct: 1427 GKSS----NPDDEQTVVLAGSNQSETQPEKKRDLVASDVSSSSQSLTTVT 1472


>tr|Q59FZ2|Q59FZ2_HUMAN TCOF1 protein variant (Fragment) OS=Homo
sapiens PE=2 SV=1
Length = 712

Score = 38.1 bits (87), Expect = 0.25
Identities = 36/162 (22%), Positives = 73/162 (45%), Gaps = 8/162 (4%)
Frame = +1

Query: 10 AIPSQAYVYGEPEEQTGSENSRGDSLENSAGVNHIKPSPSS--VKAKIKAFETTRPEGA- 180
A+ + A PEE +GS DS E + + +KPS + ++A + + + +GA
Sbjct: 453 AVATAAQAQTGPEEDSGSSEEESDSEEEAETLAQVKPSGKTHQIRAALAPAKESPRKGAA 512

Query: 181 -TKQGLTLPSTVEVHMSSSQFRSSPRRNAECEA----EQAELPRKHLKLMDADRTGSEEV 345
T G T PS + SS +++ EA A++ + L +D +R+ +
Sbjct: 513 PTPPGKTGPSAAQAGKQDDSGSSSEESDSDGEAPAAVTSAQVIKPPLIFVDPNRSPAGPA 572

Query: 346 ESGNGMASVEPFKTGPESDLAVSGASGADSCNKEKDCLPESE 471
+ A + T ++ + S A + S ++++D +P ++
Sbjct: 573 AT---PAQAQAASTPRKARASESTARSSSSESEDEDVIPATQ 611


>tr|B4DRA2|B4DRA2_HUMAN cDNA FLJ57828, highly similar to Treacle
protein (Fragment) OS=Homo sapiens PE=2 SV=1
Length = 923

Score = 38.1 bits (87), Expect = 0.25
Identities = 36/162 (22%), Positives = 73/162 (45%), Gaps = 8/162 (4%)
Frame = +1

Query: 10 AIPSQAYVYGEPEEQTGSENSRGDSLENSAGVNHIKPSPSS--VKAKIKAFETTRPEGA- 180
A+ + A PEE +GS DS E + + +KPS + ++A + + + +GA
Sbjct: 362 AVATAAQAQTGPEEDSGSSEEESDSEEEAETLAQVKPSGKTHQIRAALAPAKESPRKGAA 421

Query: 181 -TKQGLTLPSTVEVHMSSSQFRSSPRRNAECEA----EQAELPRKHLKLMDADRTGSEEV 345
T G T PS + SS +++ EA A++ + L +D +R+ +
Sbjct: 422 PTPPGKTGPSAAQAGKQDDSGSSSEESDSDGEAPAAVTSAQVIKPPLIFVDPNRSPAGPA 481

Query: 346 ESGNGMASVEPFKTGPESDLAVSGASGADSCNKEKDCLPESE 471
+ A + T ++ + S A + S ++++D +P ++
Sbjct: 482 AT---PAQAQAASTPRKARASESTARSSSSESEDEDVIPATQ 520