BLASTP 2.2.25+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters


Query= Rv0061c
Length=112

                                                                     Score    E
Sequences producing significant alignments:                         (Bits)  Value

gi|15839438|ref|NP_334475.1| hypothetical protein MT0066.1 [Myco...  229    7e-59
gi|254233460|ref|ZP_04926786.1| hypothetical protein TBCG_00060 ...  227    4e-58
gi|308232627|ref|ZP_07664136.1| hypothetical protein TMAG_00731 ...  188    2e-46
gi|183983814|ref|YP_001852105.1| hypothetical protein MMAR_3839 ...  148    2e-34
gi|118619016|ref|YP_907348.1| hypothetical protein MUL_3771 [Myc...  146    8e-34
gi|183983815|ref|YP_001852106.1| hypothetical protein MMAR_3840 ...  146    1e-33
gi|326905821|gb|EGE52754.1| hypothetical protein TBPG_03787 [Myc...  135    2e-30
gi|240170091|ref|ZP_04748750.1| hypothetical protein MkanA1_1231...  112    2e-23
gi|120404998|ref|YP_954827.1| hypothetical protein Mvan_4044 [My...  73.9   8e-12
gi|31793488|ref|NP_855981.1| glycine rich protein [Mycobacterium...  57.4   8e-07
gi|289754410|ref|ZP_06513788.1| glycine rich protein [Mycobacter...  57.0   8e-07
gi|15841802|ref|NP_336839.1| hypothetical protein MT2365.1 [Myco...  57.0   8e-07
gi|254387214|ref|ZP_05002480.1| conserved hypothetical protein [...  35.8   2.0
gi|224125880|ref|XP_002319698.1| predicted protein [Populus tric...  34.3   5.8
gi|342320551|gb|EGU12491.1| Other/SCY1 protein kinase [Rhodotoru...  34.3   6.8
gi|302534563|ref|ZP_07286905.1| conserved hypothetical protein [...  33.9   7.8


>gi|15839438|ref|NP_334475.1| hypothetical protein MT0066.1 [Mycobacterium tuberculosis CDC1551]
 gi|148821251|ref|YP_001286005.1| hypothetical protein TBFG_10060 [Mycobacterium tuberculosis F11]
 gi|253796976|ref|YP_003029977.1| hypothetical protein TBMG_00060 [Mycobacterium tuberculosis KZN 1435]
 20 more sequence titles
Length=126

 Score = 229 bits (585), Expect = 7e-59, Method: Compositional matrix adjust.
 Identities = 112/112 (100%), Positives = 112/112 (100%), Gaps = 0/112 (0%)

Query  1    MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK  60
            MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK
Sbjct  15   MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK  74

Query  61   YPDGSFWHQWMQTWFTGPQFYFDCVSGGEPLPGPPPPGGCGGAIPSEQPNAP  112
            YPDGSFWHQWMQTWFTGPQFYFDCVSGGEPLPGPPPPGGCGGAIPSEQPNAP
Sbjct  75   YPDGSFWHQWMQTWFTGPQFYFDCVSGGEPLPGPPPPGGCGGAIPSEQPNAP  126


>gi|254233460|ref|ZP_04926786.1| hypothetical protein TBCG_00060 [Mycobacterium tuberculosis C]
 gi|289445587|ref|ZP_06435331.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
 gi|124603253|gb|EAY61528.1| hypothetical protein TBCG_00060 [Mycobacterium tuberculosis C]
 gi|289418545|gb|EFD15746.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=112

 Score = 227 bits (579), Expect = 4e-58, Method: Compositional matrix adjust.
 Identities = 112/112 (100%), Positives = 112/112 (100%), Gaps = 0/112 (0%)

Query  1    MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK  60
            MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK
Sbjct  1    MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK  60

Query  61   YPDGSFWHQWMQTWFTGPQFYFDCVSGGEPLPGPPPPGGCGGAIPSEQPNAP  112
            YPDGSFWHQWMQTWFTGPQFYFDCVSGGEPLPGPPPPGGCGGAIPSEQPNAP
Sbjct  61   YPDGSFWHQWMQTWFTGPQFYFDCVSGGEPLPGPPPPGGCGGAIPSEQPNAP  112


>gi|308232627|ref|ZP_07664136.1| hypothetical protein TMAG_00731 [Mycobacterium tuberculosis SUMu001]
 gi|308213493|gb|EFO72892.1| hypothetical protein TMAG_00731 [Mycobacterium tuberculosis SUMu001]
Length=93

 Score = 188 bits (478), Expect = 2e-46, Method: Compositional matrix adjust.
 Identities = 92/93 (99%), Positives = 93/93 (100%), Gaps = 0/93 (0%)

Query  20   VFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEKYPDGSFWHQWMQTWFTGPQ  79
            +FPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEKYPDGSFWHQWMQTWFTGPQ
Sbjct  1    MFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEKYPDGSFWHQWMQTWFTGPQ  60

Query  80   FYFDCVSGGEPLPGPPPPGGCGGAIPSEQPNAP  112
            FYFDCVSGGEPLPGPPPPGGCGGAIPSEQPNAP
Sbjct  61   FYFDCVSGGEPLPGPPPPGGCGGAIPSEQPNAP  93


>gi|183983814|ref|YP_001852105.1| hypothetical protein MMAR_3839 [Mycobacterium marinum M]
 gi|183177140|gb|ACC42250.1| conserved hypothetical secreted protein [Mycobacterium marinum M]
Length=114

 Score = 148 bits (374), Expect = 2e-34, Method: Compositional matrix adjust.
 Identities = 67/85 (79%), Positives = 76/85 (90%), Gaps = 0/85 (0%)

Query  1    MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK  60
            M LK +RL  AILG  AAL+F  +VA+ADPPDPHQPDMTKGYCPGGRWG+G+LAVCDGEK
Sbjct  1    MMLKLSRLGAAILGGVAALMFSTAVATADPPDPHQPDMTKGYCPGGRWGWGELAVCDGEK  60

Query  61   YPDGSFWHQWMQTWFTGPQFYFDCV  85
            YPDGSFWHQWM+T+ TGPQFY+DCV
Sbjct  61   YPDGSFWHQWMRTYMTGPQFYYDCV  85


>gi|118619016|ref|YP_907348.1| hypothetical protein MUL_3771 [Mycobacterium ulcerans Agy99]
 gi|118571126|gb|ABL05877.1| conserved hypothetical secreted protein [Mycobacterium ulcerans Agy99]
Length=136

 Score = 146 bits (369), Expect = 8e-34, Method: Compositional matrix adjust.
 Identities = 66/85 (78%), Positives = 75/85 (89%), Gaps = 0/85 (0%)

Query  1    MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK  60
            M LK +RL  AILG  AAL+F  +VA+A PPDPHQPDMTKGYCPGGRWG+G+LAVCDGEK
Sbjct  17   MMLKLSRLGAAILGGVAALMFSTAVATAGPPDPHQPDMTKGYCPGGRWGWGELAVCDGEK  76

Query  61   YPDGSFWHQWMQTWFTGPQFYFDCV  85
            YPDGSFWHQWM+T+ TGPQFY+DCV
Sbjct  77   YPDGSFWHQWMRTYMTGPQFYYDCV  101


>gi|183983815|ref|YP_001852106.1| hypothetical protein MMAR_3840 [Mycobacterium marinum M]
 gi|183177141|gb|ACC42251.1| conserved hypothetical secreted protein [Mycobacterium marinum M]
Length=120

 Score = 146 bits (368), Expect = 1e-33, Method: Compositional matrix adjust.
 Identities = 66/85 (78%), Positives = 75/85 (89%), Gaps = 0/85 (0%)

Query  1    MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK  60
            M LK +RL  AILG  AAL+F  +VA+A PPDPHQPDMTKGYCPGGRWG+G+LAVCDGEK
Sbjct  1    MMLKLSRLGAAILGGVAALMFSTAVATAGPPDPHQPDMTKGYCPGGRWGWGELAVCDGEK  60

Query  61   YPDGSFWHQWMQTWFTGPQFYFDCV  85
            YPDGSFWHQWM+T+ TGPQFY+DCV
Sbjct  61   YPDGSFWHQWMRTYMTGPQFYYDCV  85


>gi|326905821|gb|EGE52754.1| hypothetical protein TBPG_03787 [Mycobacterium tuberculosis W-148]
Length=66

 Score = 135 bits (339), Expect = 2e-30, Method: Compositional matrix adjust.
 Identities = 66/66 (100%), Positives = 66/66 (100%), Gaps = 0/66 (0%)

Query  1    MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK  60
            MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK
Sbjct  1    MKLKFARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEK  60

Query  61   YPDGSF  66
            YPDGSF
Sbjct  61   YPDGSF  66


>gi|240170091|ref|ZP_04748750.1| hypothetical protein MkanA1_12311 [Mycobacterium kansasii ATCC 12478]
Length=77

 Score = 112 bits (281), Expect = 2e-23, Method: Compositional matrix adjust.
 Identities = 63/75 (84%), Positives = 70/75 (94%), Gaps = 0/75 (0%)

Query  38   MTKGYCPGGRWGFGDLAVCDGEKYPDGSFWHQWMQTWFTGPQFYFDCVSGGEPLPGPPPP  97
            MT GYCPGGRWGFG+LAVCDGEKYPDGSFWHQWM+T+ TGPQ+Y+DCVSG EPLPGPPPP
Sbjct  1    MTMGYCPGGRWGFGELAVCDGEKYPDGSFWHQWMRTYMTGPQWYYDCVSGDEPLPGPPPP  60

Query  98   GGCGGAIPSEQPNAP  112
            GGC GAIP +QP+AP
Sbjct  61   GGCDGAIPPDQPDAP  75


>gi|120404998|ref|YP_954827.1| hypothetical protein Mvan_4044 [Mycobacterium vanbaalenii PYR-1]
 gi|119957816|gb|ABM14821.1| hypothetical protein Mvan_4044 [Mycobacterium vanbaalenii PYR-1]
Length=128

 Score = 73.9 bits (180), Expect = 8e-12, Method: Compositional matrix adjust.
 Identities = 43/102 (43%), Positives = 59/102 (58%), Gaps = 5/102 (4%)

Query  6    ARLSTAILGCAAALVFPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEKYPDGS  65
            A+L  A+L   + ++    VA ADP    +P++  G CPGG  G   +A C+GEK+PDGS
Sbjct  29   AKLYVAVLVALSCVLAAPGVAEADPT--QKPNIATGDCPGGTGGILAVAWCNGEKFPDGS  86

Query  66   FWHQWMQT--WFTGPQFYFDCV-SGGEPLPGPPPPGGCGGAI  104
            +WH    T   F  P+F  +CV +   P   P PPGGCGGA+
Sbjct  87   YWHNVAMTGGTFATPRFEMNCVINDAFPSGTPAPPGGCGGAV  128


>gi|31793488|ref|NP_855981.1| glycine rich protein [Mycobacterium bovis AF2122/97]
 gi|57116965|ref|YP_177666.1| glycine rich protein [Mycobacterium tuberculosis H37Rv]
 gi|121638191|ref|YP_978415.1| hypothetical protein BCG_2326c [Mycobacterium bovis BCG str. Pasteur 1173P2]
 49 more sequence titles
Length=143

 Score = 57.4 bits (137), Expect = 8e-07, Method: Compositional matrix adjust.
 Identities = 42/109 (39%), Positives = 54/109 (50%), Gaps = 8/109 (7%)

Query  2    KLKFARLSTAILGCAAALVFPASVASADPP-----DPHQPDMTKGYCPGGRWGFGDLAV-  55
            K+    ++ AI     A+ F    A+A+P      DPH P+   GYCPGG +G
Sbjct  33   KMYKNSIAIAIGTLTMAVEFSMVSANAEPAPPPGQDPHMPNSAMGYCPGGGFGGITGWGY  92

Query  56   CDGEKYPDGSFWHQ-WMQTWFTGPQFYFDCV-SGGEPLPGPPPPGGCGG  102
            CDG +YPDGS+WHQ  +   F G      CV   G P+P    PG CGG
Sbjct  93   CDGIRYPDGSYWHQVRVPAPFVGTTLTLSCVIDDGSPVPPLAAPGSCGG  141


>gi|289754410|ref|ZP_06513788.1| glycine rich protein [Mycobacterium tuberculosis EAS054]
 gi|289694997|gb|EFD62426.1| glycine rich protein [Mycobacterium tuberculosis EAS054]
Length=143

 Score = 57.0 bits (136), Expect = 8e-07, Method: Compositional matrix adjust.
 Identities = 42/109 (39%), Positives = 54/109 (50%), Gaps = 8/109 (7%)

Query  2    KLKFARLSTAILGCAAALVFPASVASADPP-----DPHQPDMTKGYCPGGRWGFGDLAV-  55
            K+    ++ AI     A+ F    A+A+P      DPH P+   GYCPGG +G
Sbjct  33   KMYKNSIAIAIGTLTMAVEFSMVSANAEPAPPPGQDPHMPNSAMGYCPGGGFGGITGWGY  92

Query  56   CDGEKYPDGSFWHQ-WMQTWFTGPQFYFDCV-SGGEPLPGPPPPGGCGG  102
            CDG +YPDGS+WHQ  +   F G      CV   G P+P    PG CGG
Sbjct  93   CDGIRYPDGSYWHQVRVPAPFVGTTLTLSCVIDDGSPVPPLAAPGSCGG  141


>gi|15841802|ref|NP_336839.1| hypothetical protein MT2365.1 [Mycobacterium tuberculosis CDC1551]
 gi|13882064|gb|AAK46653.1| hypothetical protein MT2365.1 [Mycobacterium tuberculosis CDC1551]
Length=133

 Score = 57.0 bits (136), Expect = 8e-07, Method: Compositional matrix adjust.
 Identities = 42/109 (39%), Positives = 54/109 (50%), Gaps = 8/109 (7%)

Query  2    KLKFARLSTAILGCAAALVFPASVASADPP-----DPHQPDMTKGYCPGGRWGFGDLAV-  55
            K+    ++ AI     A+ F    A+A+P      DPH P+   GYCPGG +G
Sbjct  23   KMYKNSIAIAIGTLTMAVEFSMVSANAEPAPPPGQDPHMPNSAMGYCPGGGFGGITGWGY  82

Query  56   CDGEKYPDGSFWHQ-WMQTWFTGPQFYFDCV-SGGEPLPGPPPPGGCGG  102
            CDG +YPDGS+WHQ  +   F G      CV   G P+P    PG CGG
Sbjct  83   CDGIRYPDGSYWHQVRVPAPFVGTTLTLSCVIDDGSPVPPLAAPGSCGG  131


>gi|254387214|ref|ZP_05002480.1| conserved hypothetical protein [Streptomyces sp. Mg1]
 gi|194346025|gb|EDX26991.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length=411

 Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust.
 Identities = 20/47 (43%), Positives = 22/47 (47%), Gaps = 8/47 (17%)

Query  62   PDGSFWHQWMQTWFTGPQFYFDCVSGG------EPLPGPPPPGGCGG  102
            PD   WH+W   W    Q   DCV  G      +PL   PPPGG GG
Sbjct  260  PDLFAWHKWKLGWLDASQV--DCVQSGSSLHTLQPLAEAPPPGGTGG  304


>gi|224125880|ref|XP_002319698.1| predicted protein [Populus trichocarpa]
 gi|222858074|gb|EEE95621.1| predicted protein [Populus trichocarpa]
Length=273

 Score = 34.3 bits (77), Expect = 5.8, Method: Compositional matrix adjust.
 Identities = 17/48 (36%), Positives = 23/48 (48%), Gaps = 4/48 (8%)

Query  21   FPASVASADPPDPHQPDMTKGYCPGGRWGFGDLAVCDGEKYPDGSFWH  68
            FP  V    P DP++P     YC   +  FGD+  CD E    G ++H
Sbjct  201  FPVPVEVEQPIDPNEP----TYCVCHQVSFGDMIACDNENCQGGEWFH  244


>gi|342320551|gb|EGU12491.1| Other/SCY1 protein kinase [Rhodotorula glutinis ATCC 204091]
Length=986

 Score = 34.3 bits (77), Expect = 6.8, Method: Compositional matrix adjust.
 Identities = 12/27 (45%), Positives = 14/27 (52%), Gaps = 0/27 (0%)

Query  42   YCPGGRWGFGDLAVCDGEKYPDGSFWH  68
              PGG W  G + VC     PDG+ WH
Sbjct  171  VTPGGEWKLGGMEVCSRLDEPDGAMWH  197


>gi|302534563|ref|ZP_07286905.1| conserved hypothetical protein [Streptomyces sp. C]
 gi|302443458|gb|EFL15274.1| conserved hypothetical protein [Streptomyces sp. C]
Length=413

 Score = 33.9 bits (76), Expect = 7.8, Method: Compositional matrix adjust.
 Identities = 20/47 (43%), Positives = 22/47 (47%), Gaps = 8/47 (17%)

Query  62   PDGSFWHQWMQTWFTGPQFYFDCVSGG------EPLPGPPPPGGCGG  102
            PD   WH+W   W    Q   DCV  G      +PL   PPPGG GG
Sbjct  262  PDLFAWHKWKLGWLDPAQV--DCVRSGTSLHTLQPLSQVPPPGGTGG  306



Lambda      K        H
   0.320    0.142    0.511

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Effective search space used: 128047486336


  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
  excluding environmental samples from WGS projects
    Posted date: Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database: 15,229,318


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40