BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1052 Length=129 Score E Sequences producing significant alignments: (Bits) Value gi|15608192|ref|NP_215568.1| hypothetical protein Rv1052 [Mycoba... 264 3e-69 gi|167967674|ref|ZP_02549951.1| hypothetical protein MtubH3_0640... 167 5e-40 gi|189347716|ref|YP_001944245.1| mobile mystery protein B [Chlor... 43.5 0.011 gi|189499707|ref|YP_001959177.1| mobile mystery protein B [Chlor... 39.3 0.19 gi|319954701|ref|YP_004165968.1| mobile mystery protein b [Cellu... 39.3 0.21 gi|121607008|ref|YP_994815.1| mobile mystery protein B [Verminep... 37.7 0.52 gi|338731216|ref|YP_004660608.1| sugar isomerase (SIS) [Thermoto... 36.6 1.2 gi|310789339|gb|EFQ24872.1| peptidase family C50 [Glomerella gra... 35.4 2.5 gi|192359940|ref|YP_001981706.1| Fic family protein [Cellvibrio ... 35.0 3.4 gi|312963984|ref|ZP_07778455.1| mobile mystery protein B [Pseudo... 35.0 3.5 gi|338740684|ref|YP_004677646.1| serine-glyoxylate aminotransfer... 34.7 4.4 gi|326496719|dbj|BAJ98386.1| predicted protein [Hordeum vulgare ... 34.7 4.9 gi|293602773|ref|ZP_06685213.1| fic family toxin-antitoxin syste... 34.3 5.8 >gi|15608192|ref|NP_215568.1| hypothetical protein Rv1052 [Mycobacterium tuberculosis H37Rv] gi|15840484|ref|NP_335521.1| hypothetical protein MT1083 [Mycobacterium tuberculosis CDC1551] gi|31792243|ref|NP_854736.1| hypothetical protein Mb1081 [Mycobacterium bovis AF2122/97] 57 more sequence titlesLength=129 Score = 264 bits (674), Expect = 3e-69, Method: Compositional matrix adjust. Identities = 129/129 (100%), Positives = 129/129 (100%), Gaps = 0/129 (0%) Query 1 MDCCEERGVARHKGLSQVGTPGCPRWSQAVSCRCSAYREAAVTAVQMPLTPGYGETPLPH 60 MDCCEERGVARHKGLSQVGTPGCPRWSQAVSCRCSAYREAAVTAVQMPLTPGYGETPLPH Sbjct 1 MDCCEERGVARHKGLSQVGTPGCPRWSQAVSCRCSAYREAAVTAVQMPLTPGYGETPLPH 60 Query 61 DELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGSLSLDELLSDHFVRD 120 DELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGSLSLDELLSDHFVRD Sbjct 61 DELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGSLSLDELLSDHFVRD 120 Query 121 LHARMFGPV 129 LHARMFGPV Sbjct 121 LHARMFGPV 129 >gi|167967674|ref|ZP_02549951.1| hypothetical protein MtubH3_06406 [Mycobacterium tuberculosis H37Ra] gi|254231342|ref|ZP_04924669.1| hypothetical protein TBCG_01040 [Mycobacterium tuberculosis C] gi|254550040|ref|ZP_05140487.1| hypothetical protein Mtube_06213 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|294995245|ref|ZP_06800936.1| hypothetical protein Mtub2_12222 [Mycobacterium tuberculosis 210] gi|297633584|ref|ZP_06951364.1| hypothetical protein MtubK4_05648 [Mycobacterium tuberculosis KZN 4207] gi|297730570|ref|ZP_06959688.1| hypothetical protein MtubKR_05733 [Mycobacterium tuberculosis KZN R506] gi|313657898|ref|ZP_07814778.1| hypothetical protein MtubKV_05728 [Mycobacterium tuberculosis KZN V2475] gi|124600401|gb|EAY59411.1| hypothetical protein TBCG_01040 [Mycobacterium tuberculosis C] Length=83 Score = 167 bits (423), Expect = 5e-40, Method: Compositional matrix adjust. Identities = 83/83 (100%), Positives = 83/83 (100%), Gaps = 0/83 (0%) Query 47 MPLTPGYGETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGS 106 MPLTPGYGETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGS Sbjct 1 MPLTPGYGETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGS 60 Query 107 LSLDELLSDHFVRDLHARMFGPV 129 LSLDELLSDHFVRDLHARMFGPV Sbjct 61 LSLDELLSDHFVRDLHARMFGPV 83 >gi|189347716|ref|YP_001944245.1| mobile mystery protein B [Chlorobium limicola DSM 245] gi|189341863|gb|ACD91266.1| mobile mystery protein B [Chlorobium limicola DSM 245] Length=198 Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust. Identities = 29/80 (37%), Positives = 44/80 (55%), Gaps = 18/80 (22%) Query 54 GETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGSLSL---- 109 G+TP+ DE A LL + T++++ + EQ +Q AV+ SLS Sbjct 10 GQTPISEDEKAGLLIPTIS------TKSELDEFEQKNIEQ--------AVQWSLSRRFRQ 55 Query 110 DELLSDHFVRDLHARMFGPV 129 +E+L++ FVR+LH RMFG V Sbjct 56 EEILTERFVRELHRRMFGSV 75 >gi|189499707|ref|YP_001959177.1| mobile mystery protein B [Chlorobium phaeobacteroides BS1] gi|189495148|gb|ACE03696.1| mobile mystery protein B [Chlorobium phaeobacteroides BS1] Length=198 Score = 39.3 bits (90), Expect = 0.19, Method: Compositional matrix adjust. Identities = 26/80 (33%), Positives = 42/80 (53%), Gaps = 18/80 (22%) Query 54 GETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGSLSL---- 109 G+TP+ D+ LL + TR ++ + EQ L + AV+ +LS Sbjct 10 GQTPIEEDDKQGLLVPTIS------TRGELDEFEQ--------LNIEQAVQWTLSRRFRQ 55 Query 110 DELLSDHFVRDLHARMFGPV 129 +++L++ FVRD+H RMFG V Sbjct 56 EDILTERFVRDVHKRMFGNV 75 >gi|319954701|ref|YP_004165968.1| mobile mystery protein b [Cellulophaga algicola DSM 14237] gi|319423361|gb|ADV50470.1| mobile mystery protein B [Cellulophaga algicola DSM 14237] Length=198 Score = 39.3 bits (90), Expect = 0.21, Method: Compositional matrix adjust. Identities = 20/46 (44%), Positives = 29/46 (64%), Gaps = 4/46 (8%) Query 88 QGLQDQVFDLLMPTAVEGS----LSLDELLSDHFVRDLHARMFGPV 129 QG D+ L + AVE + L L+++L++ FVRDLH RM+G V Sbjct 30 QGELDEFEQLNIEKAVEWTIHTKLKLEKILTEKFVRDLHKRMYGDV 75 >gi|121607008|ref|YP_994815.1| mobile mystery protein B [Verminephrobacter eiseniae EF01-2] gi|121551648|gb|ABM55797.1| mobile mystery protein B [Verminephrobacter eiseniae EF01-2] Length=196 Score = 37.7 bits (86), Expect = 0.52, Method: Compositional matrix adjust. Identities = 25/76 (33%), Positives = 38/76 (50%), Gaps = 12/76 (15%) Query 54 GETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGSLSLDELL 113 G+TPL DE A L+PE + +T+ D+ D EQ ++L ++L Sbjct 10 GQTPLDPDEKAGLIPEHL------VTKGDLNDWEQE------NILRAVRWLKRARAPDVL 57 Query 114 SDHFVRDLHARMFGPV 129 S+ F R+LH +MFG Sbjct 58 SEGFCRELHKQMFGKT 73 >gi|338731216|ref|YP_004660608.1| sugar isomerase (SIS) [Thermotoga thermarum DSM 5069] gi|335365567|gb|AEH51512.1| sugar isomerase (SIS) [Thermotoga thermarum DSM 5069] Length=598 Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 22/64 (35%), Positives = 37/64 (58%), Gaps = 10/64 (15%) Query 68 PEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGSLSLDELLSDHFVRDLHARMF- 126 P+++E++D I AD Y L G ++ P A+EG+L + E++ +H V LH+ F Sbjct 444 PQILELIDSGIDFADSYMLGYG-------IMHPIALEGALKIKEVVYNH-VEGLHSSEFK 495 Query 127 -GPV 129 GP+ Sbjct 496 HGPL 499 >gi|310789339|gb|EFQ24872.1| peptidase family C50 [Glomerella graminicola M1.001] Length=2134 Score = 35.4 bits (80), Expect = 2.5, Method: Compositional matrix adjust. Identities = 22/67 (33%), Positives = 38/67 (57%), Gaps = 7/67 (10%) Query 55 ETPLPHDELAALLPEVVEVLDKPITRADVYDLEQ------GLQDQVFDLLMPTAVEGSLS 108 +TPLPH++L+ALL EV + L + + DV +LE+ + V +L +GS Sbjct 507 QTPLPHEQLSALLREVTDALQGTL-KGDVTELEELMVNLSAARRSVVGILASKTGDGSED 565 Query 109 LDELLSD 115 + EL+++ Sbjct 566 VHELMNN 572 >gi|192359940|ref|YP_001981706.1| Fic family protein [Cellvibrio japonicus Ueda107] gi|190686105|gb|ACE83783.1| Fic family protein [Cellvibrio japonicus Ueda107] Length=200 Score = 35.0 bits (79), Expect = 3.4, Method: Compositional matrix adjust. Identities = 30/77 (39%), Positives = 37/77 (49%), Gaps = 14/77 (18%) Query 54 GETPLPHDELAAL-LPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGSLSLDEL 112 G TPL DELA L P V TR ++ LEQ + L + +G E+ Sbjct 10 GATPLDPDELAGLKFPHVS-------TRGELDHLEQANIESGL-LWLKRGRKG-----EV 56 Query 113 LSDHFVRDLHARMFGPV 129 L D FVR LH R+FG V Sbjct 57 LDDQFVRTLHKRLFGEV 73 >gi|312963984|ref|ZP_07778455.1| mobile mystery protein B [Pseudomonas fluorescens WH6] gi|311282019|gb|EFQ60629.1| mobile mystery protein B [Pseudomonas fluorescens WH6] Length=195 Score = 35.0 bits (79), Expect = 3.5, Method: Compositional matrix adjust. Identities = 25/83 (31%), Positives = 41/83 (50%), Gaps = 14/83 (16%) Query 44 AVQMPLTPGYGETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAV 103 A+++ L PG +TPL DE+A L P + A +L++ + A Sbjct 2 AIELELKPG--QTPLDPDEVAGLKPRHI---------ATQGELDEWEAQNILKASRWVAR 50 Query 104 EGSLSLDELLSDHFVRDLHARMF 126 + L ++L+DHF R+LH +MF Sbjct 51 QKKL---DVLNDHFCRELHVKMF 70 >gi|338740684|ref|YP_004677646.1| serine-glyoxylate aminotransferase [Hyphomicrobium sp. MC1] gi|337761247|emb|CCB67080.1| serine-glyoxylate aminotransferase [Hyphomicrobium sp. MC1] Length=403 Score = 34.7 bits (78), Expect = 4.4, Method: Composition-based stats. Identities = 18/52 (35%), Positives = 25/52 (49%), Gaps = 4/52 (7%) Query 1 MDCCEERGV----ARHKGLSQVGTPGCPRWSQAVSCRCSAYREAAVTAVQMP 48 +D E G+ ARH+ L++ W + CR + Y AVT VQMP Sbjct 262 IDMINEEGLENVFARHRRLAEATRRAVRAWGFEIQCRDAKYYSPAVTTVQMP 313 >gi|326496719|dbj|BAJ98386.1| predicted protein [Hordeum vulgare subsp. vulgare] Length=730 Score = 34.7 bits (78), Expect = 4.9, Method: Composition-based stats. Identities = 23/53 (44%), Positives = 27/53 (51%), Gaps = 4/53 (7%) Query 5 EERGVARHKGL--SQVGTPGCPRWSQAVSCRCSAYREAAVTAVQMPLTPGYGE 55 E R VAR+ GL + VG PGCP + R SA + A V V P P GE Sbjct 512 ERRPVARYTGLKHANVGNPGCPSTPERCLSRGSAAKHADV--VITPCRPSTGE 562 >gi|293602773|ref|ZP_06685213.1| fic family toxin-antitoxin system [Achromobacter piechaudii ATCC 43553] gi|292818789|gb|EFF77830.1| fic family toxin-antitoxin system [Achromobacter piechaudii ATCC 43553] Length=196 Score = 34.3 bits (77), Expect = 5.8, Method: Compositional matrix adjust. Identities = 24/73 (33%), Positives = 36/73 (50%), Gaps = 12/73 (16%) Query 54 GETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMPTAVEGSLSLDELL 113 G+TPL DE A L+PE + T+ D+ D EQ +LL + ++L Sbjct 10 GQTPLDPDERAGLIPEHLS------TKGDLNDWEQE------NLLRAVRWLKRVRSQDVL 57 Query 114 SDHFVRDLHARMF 126 ++ F R LH +MF Sbjct 58 TEGFCRTLHLKMF 70 Lambda K H 0.320 0.137 0.423 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 128283502052 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40