BLASTX 2.0.8 [Jan-05-1999]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= 12VII-16F
         (903 letters)

Database: Non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR
           368,476 sequences; 112,640,273 total letters

Searching...................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gi|97162|pir||S16288 hypothetical protein - Haemophilus influenzae   61  1e-08
gi|4102010 (AF007429) putative transposase [Haemophilus paragall...  57  1e-07
gi|1333760|emb|CAA41166| (X58176) ORF2 [Haemophilus influenzae]      53  3e-06
gi|3212219 (U32812) H. influenzae predicted coding region HI1328...  45  8e-04
gi|97161|pir||S24059 gene IS1016-V6 protein (insertion sequence ...  45  8e-04
gi|2829877 (AC002396) Hypothetical protein [Arabidopsis thaliana]    34  1.5
gi|4337200|gb|AAD18114| (AC006403) putative NAM protein [Arabido...  33  2.5
gi|1915984 (U65919) Tritrichomonas foetus serine/threonine prote...  32  5.6
gi|3323272 (AE001263) lipase, putative [Treponema pallidum]          31  9.7

>gi|97162|pir||S16288 hypothetical protein - Haemophilus influenzae Length = 217 Score = 60.9 bits (145), Expect = 1e-08 Identities = 51/193 (26%), Positives = 89/193 (45%), Gaps = 2/193 (1%) Query: 282 EILYQFSCRRTVADSAETLGHSKTTIMSFYKLFRASLTYFLEKTSEKLGGDGIIIHFDET 461 ++L F T +A+ LG + + Y+ R +++Y L ++++ D I +E+ Sbjct: 15 KLLEFFVLEVTARAAADLLGIQANSAIFLYRKIREAISYHLALEADEVVDDQI--ELNES 72 Query: 462 PMTTRHGNTGATRSSNTVWVVGAVDIHSRKCFLSFLPSRSREDLFRFLEEWILPGSVIHT 641 H ++ V V G + K F + + L + I P S ++T Sbjct: 73 YFGGHHKGKRGRGAAGKVAVFGLLK-RQGKIFTVVVENTKTGTLMPVIVRKIKPDSWVYT 131 Query: 642 DSHRSYLTLSSLGYTHFTVNHSRELVSRTGIHTNWIEGIFGVLKKLRRKYDSNWSGVD-- 815 D++RSY L + + H +NHS EL + H N IE + K++ RKY +G+D Sbjct: 132 DTYRSYDALDASKFHHERINHS-ELFAVKQNHINGIENFWSQAKRILRKY----NGIDRK 186 Query: 816 NLNIFLSEFCFRYTY 860 N +FL E FR+ + Sbjct: 187 NFPLFLKECEFRFNF 201
>gi|4102010 (AF007429) putative transposase [Haemophilus paragallinarum] Length = 216 Score = 57.4 bits (136), Expect = 1e-07 Identities = 48/196 (24%), Positives = 85/196 (42%) Query: 312 TVADSAETLGHSKTTIMSFYKLFRASLTYFLEKTSEKLGGDGIIIHFDETPMTTRHGNTG 491 T +AE + +KTT ++ R L Y E G+ I DE+ Sbjct: 23 TARTAAELVNVNKTTAAYYFHRLR-QLIYQNSLHLEMFEGE---IEADESYFGGARKGKR 78 Query: 492 ATRSSNTVWVVGAVDIHSRKCFLSFLPSRSREDLFRFLEEWILPGSVIHTDSHRSYLTLS 671 ++ + V G + + K + +P+ L + E + P S+++TD++RSY L Sbjct: 79 GRGAAGKIAVFGLLK-RNGKVYTVAVPNTQSATLLPIIREQVKPDSIVYTDNYRSYDVLD 137 Query: 672 SLGYTHFTVNHSRELVSRTGIHTNWIEGIFGVLKKLRRKYDSNWSGVDNLNIFLSEFCFR 851 ++HF +NHS H N IE + K+ RK+ N + ++L E +R Sbjct: 138 VSEFSHFRINHSTHFAENHN-HINGIENFWSQAKRHLRKF--NGIPKAHFELYLKECEWR 194 Query: 852 YTYNGWDRRKAVLKLL 899 + Y+ + ++LK L Sbjct: 195 FNYSNIKSQISILKQL 210
>gi|1333760|emb|CAA41166| (X58176) ORF2 [Haemophilus influenzae] Length = 164 Score = 52.7 bits (124), Expect = 3e-06 Identities = 46/171 (26%), Positives = 77/171 (44%) Query: 324 SAETLGHSKTTIMSFYKLFRASLTYFLEKTSEKLGGDGIIIHFDETPMTTRHGNTGATRS 503 +A+ LG + + FY+ R ++Y L ++++ DG I DE+ + + Sbjct: 2 AADLLGIQANSAILFYRKIREVISYHLALEADEVV-DGQI-ELDESYFGSHRKGKRGRGT 59 Query: 504 SNTVWVVGAVDIHSRKCFLSFLPSRSREDLFRFLEEWILPGSVIHTDSHRSYLTLSSLGY 683 + V V G + K F + + E L + I P S ++TD++RSY L + Sbjct: 60 AGKVAVFGLLK-RQGKVFTVVVENTRTETLMPVIVRKIKPDSRVYTDTYRSYDALDVSKF 118 Query: 684 THFTVNHSRELVSRTGIHTNWIEGIFGVLKKLRRKYDSNWSGVDNLNIFLS 836 H +NHS EL + H N IE + K++ RKY +G+D S Sbjct: 119 HHERINHS-ELFAVKQNHINGIENFWSQAKRILRKY----NGIDQKTFLYS 164
>gi|3212219 (U32812) H. influenzae predicted coding region HI1328.1 [Haemophilus influenzae Rd] Length = 123 Score = 44.9 bits (104), Expect = 8e-04 Identities = 29/117 (24%), Positives = 59/117 (49%), Gaps = 2/117 (1%) Query: 549 KCFLSFLPSRSREDLFRFLEEWILPGSVIHTDSHRSYLTLSSLGYTHFTVNHSRELVSRT 728 K + +P+ L + E + P S+++TD+ RSY L ++HF +NHS Sbjct: 4 KVYTVVVPNVQSATLLPIIREKVKPDSIVYTDTFRSYDVLDVSEFSHFRINHSTHFAE-- 61 Query: 729 GIHTNWIEGIFGVLKKLRRKYDSNWSGV--DNLNIFLSEFCFRYTYNGWDRRKAVLKLL 899 + N+I GI G +++ ++G+ ++ ++L E +R+ + + ++LK L Sbjct: 62 --NHNYINGI-GNFWNHAKRHLQKFNGIPKEHFELYLKECEWRFNNSEIKSQISILKQL 117
>gi|97161|pir||S24059 gene IS1016-V6 protein (insertion sequence IS1016) - Haemophilus influenzae >gi|43593|emb|CAA42428| (X59756) IS1016-V6 [Haemophilus influenzae] Length = 191 Score = 44.9 bits (104), Expect = 8e-04 Identities = 43/165 (26%), Positives = 72/165 (43%) Query: 282 EILYQFSCRRTVADSAETLGHSKTTIMSFYKLFRASLTYFLEKTSEKLGGDGIIIHFDET 461 ++L F T +A+ LG + + FY+ R ++Y L ++++ DG I DE+ Sbjct: 15 KLLEFFVLEVTARAAADLLGIQANSAILFYRKIREVISYHLALEADEVF-DGQI-ELDES 72 Query: 462 PMTTRHGNTGATRSSNTVWVVGAVDIHSRKCFLSFLPSRSREDLFRFLEEWILPGSVIHT 641 ++ V V G + K F + + E L + I P S ++T Sbjct: 73 YFGGHRKGKRGLGAAGKVAVFGLLK-RQGKVFTVVVENTKTETLMPVIVRKIKPHSWVYT 131 Query: 642 DSHRSYLTLSSLGYTHFTVNHSRELVSRTGIHTNWIEGIFGVLKK 776 +++RSY L + H +NHS EL + H N IE + KK Sbjct: 132 NTYRSYDALDVSKFHHERINHS-ELFAVKQNHINGIENFWSQAKK 175
>gi|2829877 (AC002396) Hypothetical protein [Arabidopsis thaliana] Length = 252 Score = 34.0 bits (76), Expect = 1.5 Identities = 22/74 (29%), Positives = 35/74 (46%), Gaps = 4/74 (5%) Query: 216 TNYNILKDTIFNGTKLQYGEVLEILYQFSCRRT----VADSAETLGHSKTTIMSFYKLFR 383 T+Y I F+ +YGE+L++L F RR V E L ++ + +F Sbjct: 122 TSYLIAVKEAFHDEPAKYGEMLKLLKDFKARRVDAACVIARVEELMKDHLNLLFGFCVFL 181 Query: 384 ASLTYFLEKTSEKLGGDG 437 ++ T F K + GDG Sbjct: 182 SATTSFTTKLKARFQGDG 199
>gi|4337200|gb|AAD18114| (AC006403) putative NAM protein [Arabidopsis thaliana] Length = 316 Score = 33.2 bits (74), Expect = 2.5 Identities = 20/79 (25%), Positives = 37/79 (46%) Query: 492 ATRSSNTVWVVGAVDIHSRKCFLSFLPSRSREDLFRFLEEWILPGSVIHTDSHRSYLTLS 671 A R T WV+ +HS+ + + S++D EW++ T++ + Y++ S Sbjct: 128 APRGEKTCWVMHEYRLHSKSSYRT-----SKQD------EWVVCRVFKKTEATKKYISTS 176 Query: 672 SLGYTHFTVNHSRELVSRT 728 S +H NH+R + T Sbjct: 177 SSSTSHHHNNHTRASILST 195
>gi|1915984 (U65919) Tritrichomonas foetus serine/threonine protein kinase [Tritrichomonas foetus] Length = 468 Score = 32.1 bits (71), Expect = 5.6 Identities = 28/75 (37%), Positives = 36/75 (47%), Gaps = 7/75 (9%) Query: 500 IFKHC-LGCWCG------RYPQ*EMFLIIFTIEIKRRSI*IFRRVDFTRICNTHRFPQIV 658 I KH GC+C R + E F+ I R + IF +VD ICN +R Sbjct: 43 IAKHTPTGCFCAAKIVDLRNTKTEEFVGIM------REVSIFMQVDHPNICNLYRLSTAN 96 Query: 659 FNIIFFGIYAFHG*PFERVSIK 724 +IFF YA G E V+ K Sbjct: 97 NQLIFFMEYASRGTLLEYVNAK 118
>gi|3323272 (AE001263) lipase, putative [Treponema pallidum] Length = 345 Score = 31.3 bits (69), Expect = 9.7 Identities = 13/31 (41%), Positives = 17/31 (53%) Query: 121 LYLCGNIPXDWKGLTISLFVMFFKSSNSLRF 29 L+LCG + W GL S+F FF + L F Sbjct: 4 LFLCGCVMKKWCGLAFSIFFSFFLRAQDLTF 34

  Database: Non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR
    Posted date:  Apr 12, 1999 12:54 PM
  Number of letters in database: 112,640,273
  Number of sequences in database:  368,476

Lambda     K      H
   0.318    0.135     0.00

Gapped
Lambda     K      H
   0.270   0.0470 4.94e-324

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 182228317
Number of Sequences: 368476
Number of extensions: 3888658
Number of successful extensions: 8609
Number of sequences better than 10.0: 18
Number of HSP's better than 10.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 8601
Number of HSP's gapped (non-prelim): 10
length of query: 301
length of database: 112640273
effective HSP length: 54
effective length of query: 246
effective length of database: 92742569
effective search space: 22814671974
effective search space used: 22814671974
frameshift window, decay const: 50,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 69 (31.3 bits)