BLASTX 2.0.8 [Jan-05-1999]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 12VII-25F
(459 letters)
Database: Non-redundant GenBank CDS
translations+PDB+SwissProt+SPupdate+PIR
369,800 sequences; 113,023,754 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|4377464|emb|CAA26447| (X02600) unidentified open reading fram... 32 3.0
gi|4106802|gb|AAD03019| (AF079175) p67 surface antigen [Theileri... 32 3.0
gi|161880 (M67476) p67 gene product [Theileria parva] 32 3.0
gi|1511628 (U40703) sporozoite surface protein p67 [Theileria pa... 32 3.0
gi|4106806|gb|AAD03021| (AF079177) p67 surface antigen [Theileri... 31 4.0
gi|3193286 (AF069298) T14P8.22 gene product [Arabidopsis thaliana] 31 4.0
gi|4049815 (AF063866) ORF MSV171 ATP/GTP binding motif homolog (... 31 5.2
gi|4388834|gb|AAD19789| (AC006528) putative disease resistance p... 31 5.2
gi|731732|sp|P38853|YHV8_YEAST HYPOTHETICAL 131.1 KD PROTEIN IN ... 31 6.8
gi|2264408 (U70138) Na-K-Cl cotransporter [Bos taurus] 31 6.8
gi|224534|prf||1107279B ORF g [Drosophila sp.] 31 6.8
gi|3876935|emb|CAA92125| (Z68106) F41E7.2 [Caenorhabditis elegans] 31 6.8
gi|1723892|sp|P53147|YGJ6_YEAST HYPOTHETICAL 31.3 KD HOMEOBOX PR... 31 6.8
gi|74599|pir||OFFFCP retrovirus-related polyprotein - fruit fly ... 31 6.8
gi|950319 (M11240) unknown protein [Drosophila melanogaster] 31 6.8
gi|391657|dbj|BAA01703| (D10880) ORF [Drosophila simulans] 31 6.8
gi|1169016|sp|P04146|COPI_DROME COPIA PROTEIN >gi|7744|emb|CAA28... 31 6.8
gi|1572773 (U70850) contains strong similarity to multiple C2H2-... 30 8.9
gi|4106804|gb|AAD03020| (AF079176) p67 surface antigen [Theileri... 30 8.9
gi|3386548 (AF079504) H-protein promoter binding factor-2b [Arab... 30 8.9
gi|4538666|emb|CAB39360| (AL049474) hypothetical protein [Schizo... 30 8.9
gi|2429532 (AF025471) No definition line found [Caenorhabditis e... 30 8.9
>gi|4377464|emb|CAA26447| (X02600) unidentified open reading frame I (1289 aa) [Drosophila
melanogaster]
Length = 1288
Score = 31.7 bits (70), Expect = 3.0
Identities = 18/53 (33%), Positives = 23/53 (42%)
Query: 12 ISCPRKVKMDSNIIPQEKSEDLKRNVEIIFSAIGNQLNSTSINGHTTYIYTPN 170
I K D I +SE LK EI ++ N LN +N HT + PN
Sbjct: 840 IGIDNPTKNDGIEIINRRSERLKTKPEISYNEEDNSLNKVVLNAHTIFNDVPN 892
>gi|4106802|gb|AAD03019| (AF079175) p67 surface antigen [Theileria parva]
Length = 750
Score = 31.7 bits (70), Expect = 3.0
Identities = 21/68 (30%), Positives = 29/68 (41%)
Query: 198 NTNNDKSIDISSHIPVFSIGTQPSKQKSPVLNEQIEPTLNVRLPEEDEPKDINKYIXXPX 377
N ++ S DIS IP +PV E I PTL + EE P D++ +
Sbjct: 140 NEDSTLSTDISPTIP------------TPVSEEIITPTLQAQTKEEVPPADLSDQVSSNG 187
Query: 378 XSTEPEXN 401
+E E N
Sbjct: 188 SDSEEEDN 195
>gi|161880 (M67476) p67 gene product [Theileria parva]
Length = 709
Score = 31.7 bits (70), Expect = 3.0
Identities = 21/72 (29%), Positives = 31/72 (42%)
Query: 186 STVDNTNNDKSIDISSHIPVFSIGTQPSKQKSPVLNEQIEPTLNVRLPEEDEPKDINKYI 365
S +N ++ S D+S IP +PV E I PTL + EE P D++ +
Sbjct: 135 SEEENEDSTLSTDVSPTIP------------TPVSEEIITPTLQAQTKEEVPPADLSDQV 182
Query: 366 XXPXXSTEPEXN 401
+E E N
Sbjct: 183 PSNGSDSEEEDN 194
>gi|1511628 (U40703) sporozoite surface protein p67 [Theileria parva]
Length = 752
Score = 31.7 bits (70), Expect = 3.0
Identities = 21/72 (29%), Positives = 31/72 (42%)
Query: 186 STVDNTNNDKSIDISSHIPVFSIGTQPSKQKSPVLNEQIEPTLNVRLPEEDEPKDINKYI 365
S +N ++ S D+S IP +PV E I PTL + EE P D++ +
Sbjct: 135 SEEENEDSTVSTDVSPTIP------------TPVSEEIITPTLQAQTKEEVPPADLSDQV 182
Query: 366 XXPXXSTEPEXN 401
+E E N
Sbjct: 183 PSNGSDSEEEDN 194
>gi|4106806|gb|AAD03021| (AF079177) p67 surface antigen [Theileria parva]
Length = 709
Score = 31.3 bits (69), Expect = 4.0
Identities = 21/73 (28%), Positives = 30/73 (40%)
Query: 186 STVDNTNNDKSIDISSHIPVFSIGTQPSKQKSPVLNEQIEPTLNVRLPEEDEPKDINKYI 365
S +N + S D+S IP +PV E I PTL + EE P D++ +
Sbjct: 135 SEEENEDGTLSTDVSPAIP------------TPVSEEIITPTLQAQTKEEVPPADLSDQV 182
Query: 366 XXPXXSTEPEXND 404
+E E D
Sbjct: 183 PSNGSDSEEEDED 195
>gi|3193286 (AF069298) T14P8.22 gene product [Arabidopsis thaliana]
Length = 336
Score = 31.3 bits (69), Expect = 4.0
Identities = 21/80 (26%), Positives = 35/80 (43%)
Query: 3 NSTISCPRKVKMDSNIIPQEKSEDLKRNVEIIFSAIGNQLNSTSINGHTTYIYTPNIIFI 182
NS R++ D N+ P+ + DL N E + +G+ L +TS +YT I
Sbjct: 111 NSDFGLERELGPDQNLDPKPTTTDLALNDEEVSKPVGSGLETTSFWSLYDDLYTDTIPAP 170
Query: 183 PSTVDNTNNDKSIDISSHIP 242
P + ++ I+ S P
Sbjct: 171 PPEDSIDDQEEEIETSEIRP 190
>gi|4049815 (AF063866) ORF MSV171 ATP/GTP binding motif homolog (vaccinia
A32L), similar to GB:D11079 [Melanoplus sanguinipes
entomopoxvirus]
Length = 244
Score = 30.9 bits (68), Expect = 5.2
Identities = 31/88 (35%), Positives = 44/88 (49%), Gaps = 13/88 (14%)
Query: 27 KVKMDSNIIPQEKSEDLKRNVEIIFSAIGNQL-----NSTSINGH----TTYIYTPNIIF 179
++K +N +KS K N IIF +GN N T+ H T ++ I
Sbjct: 88 EIKKFTNKTLNDKSYGEKFNTLIIFDDVGNNFRNKLKNFTNECRHAFISTIFLVHKEIHL 147
Query: 180 IPSTVDNTN----NDKSIDISSHIPVFSIGTQPSKQKSPVL 290
P T D+ N K+ID+ IP SI + SK P+L
Sbjct: 148 DPDTRDSMKFFVINKKTIDLKYIIPNTSIRKEISKDIVPLL 188
>gi|4388834|gb|AAD19789| (AC006528) putative disease resistance protein RPP1, 3' partial
[Arabidopsis thaliana]
Length = 952
Score = 30.9 bits (68), Expect = 5.2
Identities = 14/43 (32%), Positives = 27/43 (62%)
Query: 102 SAIGNQLNSTSINGHTTYIYTPNIIFIPSTVDNTNNDKSIDIS 230
S+IGN +N +I+ + + N++ +PS++ N N K +D+S
Sbjct: 739 SSIGNAINLQTID----FSHCENLVELPSSIGNATNLKELDLS 777
>gi|731732|sp|P38853|YHV8_YEAST HYPOTHETICAL 131.1 KD PROTEIN IN REC104-SOL3 INTERGENIC REGION
>gi|626671|pir||S46769 hypothetical protein YHR158c -
yeast (Saccharomyces cerevisiae) >gi|500665 (U10397)
Yhr158cp [Saccharomyces cerevisiae]
Length = 1164
Score = 30.5 bits (67), Expect = 6.8
Identities = 28/88 (31%), Positives = 46/88 (51%), Gaps = 3/88 (3%)
Query: 66 SEDLKRNVEIIFSAIGNQLNSTSINGHTTYIYTPNII--FIPSTVDNTNNDKSIDISSHI 239
++ L R V + +Q S +N + I PNI+ ++PS T K I+S+
Sbjct: 512 ADRLNREVHNRNVSTEHQNQSHPVNSESHLIAEPNILTPYVPSESSQTPVMK---ITSNK 568
Query: 240 PVFSIGTQPSKQKSPVLNEQIEPTL-NVRLP 329
P + P+ QK P L+E ++PT+ N R+P
Sbjct: 569 PFDT----PTIQKEPDLSETMDPTVGNQRIP 595
>gi|2264408 (U70138) Na-K-Cl cotransporter [Bos taurus]
Length = 1201
Score = 30.5 bits (67), Expect = 6.8
Identities = 20/60 (33%), Positives = 34/60 (56%), Gaps = 10/60 (16%)
Query: 174 IFIPSTVDNTNNDKS----------IDISSHIPVFSIGTQPSKQKSPVLNEQIEPTLNVR 323
+FI ++ ++D+ ID S + + I T+P K+ +E IEP R
Sbjct: 1045 VFIGGKINRIDHDRRAMPTLLTKFRIDFSDIMVLGDINTKPKKENIVAFDEMIEP---YR 1101
Query: 324 LPEEDEPKDI 353
L E+D+ +DI
Sbjct: 1102 LHEDDKEQDI 1111
>gi|224534|prf||1107279B ORF g [Drosophila sp.]
Length = 1410
Score = 30.5 bits (67), Expect = 6.8
Identities = 17/53 (32%), Positives = 23/53 (43%)
Query: 12 ISCPRKVKMDSNIIPQEKSEDLKRNVEIIFSAIGNQLNSTSINGHTTYIYTPN 170
I K D I +SE LK +I ++ N LN +N HT + PN
Sbjct: 841 IGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPN 893
>gi|3876935|emb|CAA92125| (Z68106) F41E7.2 [Caenorhabditis elegans]
Length = 241
Score = 30.5 bits (67), Expect = 6.8
Identities = 17/76 (22%), Positives = 38/76 (49%)
Query: 45 NIIPQEKSEDLKRNVEIIFSAIGNQLNSTSINGHTTYIYTPNIIFIPSTVDNTNNDKSID 224
NI +++ N+E+ ++ N N++ N + IY N F+P +++N + S
Sbjct: 166 NITNSTILDEMDPNLELKKTSSSNSTNNSRTNLNNRIIYELNPNFVPISIENGYSRASYQ 225
Query: 225 ISSHIPVFSIGTQPSK 272
S++ ++ G + +K
Sbjct: 226 HSNNTKTYASGNRINK 241
>gi|1723892|sp|P53147|YGJ6_YEAST HYPOTHETICAL 31.3 KD HOMEOBOX PROTEIN IN PRP20-VPS45 INTERGENIC
REGION >gi|2131583|pir||S64103 hypothetical protein
YGL096w - yeast (Saccharomyces cerevisiae)
>gi|1322631|emb|CAA96802| (Z72618) ORF YGL096w
[Saccharomyces cerevisiae]
Length = 276
Score = 30.5 bits (67), Expect = 6.8
Identities = 25/79 (31%), Positives = 42/79 (52%), Gaps = 1/79 (1%)
Query: 87 VEIIFSAIGNQLNSTSINGHTTYIYTPNIIFIPST-VDNTNNDKSIDISSHIPVFSIGTQ 263
++++F ++ N+ N T + +Y PN F+P T + + + +SS PVF IG
Sbjct: 17 IQVLFESL-NRENETKPHFEERRLYQPNPSFVPRTNIAVGSPVNPVPVSS--PVFFIG-- 71
Query: 264 PSKQKSPVLNEQIEPTLNVR 323
PS Q+S + N T N+R
Sbjct: 72 PSPQRS-IQNHNAIMTQNIR 90
>gi|74599|pir||OFFFCP retrovirus-related polyprotein - fruit fly (Drosophila
melanogaster) transposon copia >gi|950318 (M11240)
unknown protein [Drosophila melanogaster]
>gi|1491679|emb|CAA26444| (X02599) 31 KD polyprotein
[Drosophila melanogaster]
Length = 1409
Score = 30.5 bits (67), Expect = 6.8
Identities = 17/53 (32%), Positives = 23/53 (43%)
Query: 12 ISCPRKVKMDSNIIPQEKSEDLKRNVEIIFSAIGNQLNSTSINGHTTYIYTPN 170
I K D I +SE LK +I ++ N LN +N HT + PN
Sbjct: 840 IGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPN 892
>gi|950319 (M11240) unknown protein [Drosophila melanogaster]
Length = 1017
Score = 30.5 bits (67), Expect = 6.8
Identities = 17/53 (32%), Positives = 23/53 (43%)
Query: 12 ISCPRKVKMDSNIIPQEKSEDLKRNVEIIFSAIGNQLNSTSINGHTTYIYTPN 170
I K D I +SE LK +I ++ N LN +N HT + PN
Sbjct: 448 IGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPN 500
>gi|391657|dbj|BAA01703| (D10880) ORF [Drosophila simulans]
Length = 1409
Score = 30.5 bits (67), Expect = 6.8
Identities = 17/53 (32%), Positives = 23/53 (43%)
Query: 12 ISCPRKVKMDSNIIPQEKSEDLKRNVEIIFSAIGNQLNSTSINGHTTYIYTPN 170
I K D I +SE LK +I ++ N LN +N HT + PN
Sbjct: 840 IGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPN 892
>gi|1169016|sp|P04146|COPI_DROME COPIA PROTEIN >gi|7744|emb|CAA28054| (X04456) ORF [Drosophila
melanogaster]
Length = 1409
Score = 30.5 bits (67), Expect = 6.8
Identities = 17/53 (32%), Positives = 23/53 (43%)
Query: 12 ISCPRKVKMDSNIIPQEKSEDLKRNVEIIFSAIGNQLNSTSINGHTTYIYTPN 170
I K D I +SE LK +I ++ N LN +N HT + PN
Sbjct: 840 IGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPN 892
>gi|1572773 (U70850) contains strong similarity to multiple C2H2-type
zinc-fingers (PS:PS00028) and a homeobox (PS:PS00027)
[Caenorhabditis elegans]
Length = 680
Score = 30.1 bits (66), Expect = 8.9
Identities = 22/90 (24%), Positives = 39/90 (42%)
Query: 156 IYTPNIIFIPSTVDNTNNDKSIDISSHIPVFSIGTQPSKQKSPVLNEQIEPTLNVRLPEE 335
+ TP + F+PST N + S+ ++ GT P+ + P EP + V
Sbjct: 103 LQTPQVSFLPSTAANNMDYMSLLQANLFQSLENGTSPTPTQEPSAPASPEPKIEV----V 158
Query: 336 DEPKDINKYIXXPXXSTEPEXNDKIIDECL 425
DEP+ ++ TE + D + +E +
Sbjct: 159 DEPEVSSE--VKTEVKTEVKTEDSVPEESI 186
>gi|4106804|gb|AAD03020| (AF079176) p67 surface antigen [Theileria parva]
Length = 751
Score = 30.1 bits (66), Expect = 8.9
Identities = 21/74 (28%), Positives = 34/74 (45%)
Query: 186 STVDNTNNDKSIDISSHIPVFSIGTQPSKQKSPVLNEQIEPTLNVRLPEEDEPKDINKYI 365
S +N ++ S D+S IP +PV E I PTL + EE P D++ +
Sbjct: 135 SEQENEDSTLSTDVSPTIP------------TPVSEEIITPTLQAQTKEEVPPADLSDQV 182
Query: 366 XXPXXSTEPEXNDK 407
P ++ E ++K
Sbjct: 183 --PSDGSDSEEDNK 194
>gi|3386548 (AF079504) H-protein promoter binding factor-2b [Arabidopsis
thaliana]
Length = 400
Score = 30.1 bits (66), Expect = 8.9
Identities = 15/47 (31%), Positives = 26/47 (54%), Gaps = 1/47 (2%)
Query: 222 DISSHIPVFSIGTQPSKQKSPVLNEQIEPTLN-VRLPEEDEPKDINKY 362
D SS PV + + PSK+ S + PT+ +R+P + ++ NK+
Sbjct: 27 DPSSLSPVHDVSSDPSKEDSSSSSSSCSPTIGPIRVPVKKSEQESNKF 74
>gi|4538666|emb|CAB39360| (AL049474) hypothetical protein [Schizosaccharomyces pombe]
Length = 452
Score = 30.1 bits (66), Expect = 8.9
Identities = 19/79 (24%), Positives = 33/79 (41%)
Query: 84 NVEIIFSAIGNQLNSTSINGHTTYIYTPNIIFIPSTVDNTNNDKSIDISSHIPVFSIGTQ 263
+V I+ S +NS S++ +TTY Y + + S+V S +S IP
Sbjct: 330 SVVIVQSETSCSINSASMSSNTTYFYWNSTSSLSSSVFTNTTSSSNSTNSSIPTTYPSNS 389
Query: 264 PSKQKSPVLNEQIEPTLNV 320
+ Q +P +N+
Sbjct: 390 TTYQNITTSYPWSQPVVNI 408
>gi|2429532 (AF025471) No definition line found [Caenorhabditis elegans]
Length = 379
Score = 30.1 bits (66), Expect = 8.9
Identities = 19/71 (26%), Positives = 34/71 (47%)
Query: 123 NSTSINGHTTYIYTPNIIFIPSTVDNTNNDKSIDISSHIPVFSIGTQPSKQKSPVLNEQI 302
N S G +T+I +I + ND ID++ H + SI +P + K+ L+ I
Sbjct: 175 NRDSSMGSSTFIS-----WIDMEKEYVTNDDCIDVTIHAKIISITDEPCEMKTFALSHTI 229
Query: 303 EPTLNVRLPEE 335
+ ++R E+
Sbjct: 230 KNMSSIREGED 240
Database: Non-redundant GenBank CDS
translations+PDB+SwissProt+SPupdate+PIR
Posted date: Apr 17, 1999 5:33 AM
Number of letters in database: 113,023,754
Number of sequences in database: 369,800
Lambda K H
0.318 0.135 0.00
Gapped
Lambda K H
0.270 0.0470 4.94e-324
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 83789069
Number of Sequences: 369800
Number of extensions: 1682941
Number of successful extensions: 4388
Number of sequences better than 10.0: 44
Number of HSP's better than 10.0 without gapping: 8
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 4379
Number of HSP's gapped (non-prelim): 22
length of query: 153
length of database: 113023754
effective HSP length: 52
effective length of query: 100
effective length of database: 93794154
effective search space: 9379415400
effective search space used: 9379415400
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 66 (30.1 bits)