BLASTP 2.2.25+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters

Query= Rv2348c
Length=108

                                                                     Score   E
Sequences producing significant alignments:                          (Bits)  Value

gi|15609485|ref|NP_216864.1| hypothetical protein Rv2348c [Mycob...  205     1e-51
gi|289553865|ref|ZP_06443075.1| hypothetical protein TBXG_01615 ...  202     1e-50
gi|148823549|ref|YP_001288303.1| hypothetical protein TBFG_12372...  191     3e-47
gi|167969904|ref|ZP_02552181.1| hypothetical protein MtubH3_1849...  167     6e-40
gi|298525833|ref|ZP_07013242.1| predicted protein [Mycobacterium...  137     5e-31
gi|240173289|ref|ZP_04751947.1| hypothetical protein MkanA1_2851...  119     2e-25
gi|183983635|ref|YP_001851926.1| hypothetical protein MMAR_3655 ...  115     3e-24
gi|41408233|ref|NP_961069.1| hypothetical protein MAP2135c [Myco...  97.4    6e-19
gi|296166321|ref|ZP_06848758.1| conserved hypothetical protein [...  85.9    2e-15
gi|118464811|ref|YP_881258.1| hypothetical protein MAV_2040 [Myc...  70.9    7e-11
gi|342857483|ref|ZP_08714139.1| hypothetical protein MCOL_01365 ...  68.2    4e-10
gi|254774764|ref|ZP_05216280.1| hypothetical protein MaviaA2_088...  46.2    0.002
gi|254819375|ref|ZP_05224376.1| hypothetical protein MintA_05595...  39.3    0.21

>gi|15609485|ref|NP_216864.1| hypothetical protein Rv2348c [Mycobacterium tuberculosis H37Rv]
 gi|15841855|ref|NP_336892.1| hypothetical protein MT2413 [Mycobacterium tuberculosis CDC1551]
 gi|148662176|ref|YP_001283699.1| hypothetical protein MRA_2368 [Mycobacterium tuberculosis H37Ra]
 55 more sequence titles
Length=108

 Score = 205 bits (522), Expect = 1e-51, Method: Compositional matrix adjust.
 Identities = 107/108 (99%), Positives = 108/108 (100%), Gaps = 0/108 (0%)

Query  1    VLLPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAA  60
            +LLPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAA
Sbjct  1    MLLPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAA  60

Query  61   EGADAEAAAMDEWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG  108
            EGADAEAAAMDEWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG
Sbjct  61   EGADAEAAAMDEWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG  108

>gi|289553865|ref|ZP_06443075.1| hypothetical protein TBXG_01615 [Mycobacterium tuberculosis KZN 605]
 gi|289438497|gb|EFD20990.1| hypothetical protein TBXG_01615 [Mycobacterium tuberculosis KZN 605]
Length=106

 Score = 202 bits (515), Expect = 1e-50, Method: Compositional matrix adjust.
 Identities = 105/106 (99%), Positives = 106/106 (100%), Gaps = 0/106 (0%)

Query  3    LPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEG  62
            +PLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEG
Sbjct  1    MPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEG  60

Query  63   ADAEAAAMDEWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG  108
            ADAEAAAMDEWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG
Sbjct  61   ADAEAAAMDEWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG  106

>gi|148823549|ref|YP_001288303.1| hypothetical protein TBFG_12372 [Mycobacterium tuberculosis F11]
 gi|253798578|ref|YP_003031579.1| hypothetical protein TBMG_01630 [Mycobacterium tuberculosis KZN 1435]
 gi|254365126|ref|ZP_04981172.1| hypothetical protein TBHG_02287 [Mycobacterium tuberculosis str. Haarlem]
 gi|294994550|ref|ZP_06800241.1| hypothetical protein Mtub2_08538 [Mycobacterium tuberculosis 210]
 gi|134150640|gb|EBA42685.1| hypothetical protein TBHG_02287 [Mycobacterium tuberculosis str. Haarlem]
 gi|148722076|gb|ABR06701.1| hypothetical protein TBFG_12372 [Mycobacterium tuberculosis F11]
 gi|253320081|gb|ACT24684.1| hypothetical protein TBMG_01630 [Mycobacterium tuberculosis KZN 1435]
Length=100

 Score = 191 bits (485), Expect = 3e-47, Method: Compositional matrix adjust.
 Identities = 99/100 (99%), Positives = 100/100 (100%), Gaps = 0/100 (0%)

Query  9    LPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAA  68
            +PPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAA
Sbjct  1    MPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAA  60

Query  69   AMDEWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG  108
            AMDEWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG
Sbjct  61   AMDEWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG  100

>gi|167969904|ref|ZP_02552181.1| hypothetical protein MtubH3_18499 [Mycobacterium tuberculosis H37Ra]
 gi|254551396|ref|ZP_05141843.1| hypothetical protein Mtube_13199 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM']
 gi|297634946|ref|ZP_06952726.1| hypothetical protein MtubK4_12526 [Mycobacterium tuberculosis KZN 4207]
 gi|297731937|ref|ZP_06961055.1| hypothetical protein MtubKR_12648 [Mycobacterium tuberculosis KZN R506]
 gi|313659272|ref|ZP_07816152.1| hypothetical protein MtubKV_12663 [Mycobacterium tuberculosis KZN V2475]
Length=86

 Score = 167 bits (422), Expect = 6e-40, Method: Compositional matrix adjust.
 Identities = 86/86 (100%), Positives = 86/86 (100%), Gaps = 0/86 (0%)

Query  23   MLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAAAMDEWDEWQAWNEW  82
            MLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAAAMDEWDEWQAWNEW
Sbjct  1    MLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAAAMDEWDEWQAWNEW  60

Query  83   VAENAEPRFEVPRSSSSVIPHSPAAG  108
            VAENAEPRFEVPRSSSSVIPHSPAAG
Sbjct  61   VAENAEPRFEVPRSSSSVIPHSPAAG  86

>gi|298525833|ref|ZP_07013242.1| predicted protein [Mycobacterium tuberculosis 94_M4241A]
 gi|298495627|gb|EFI30921.1| predicted protein [Mycobacterium tuberculosis 94_M4241A]
Length=76

 Score = 137 bits (346), Expect = 5e-31, Method: Compositional matrix adjust.
 Identities = 72/73 (99%), Positives = 73/73 (100%), Gaps = 0/73 (0%)

Query  1    VLLPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAA  60
            +LLPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAA
Sbjct  1    MLLPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAA  60

Query  61   EGADAEAAAMDEW  73
            EGADAEAAAMDEW
Sbjct  61   EGADAEAAAMDEW  73

>gi|240173289|ref|ZP_04751947.1| hypothetical protein MkanA1_28511 [Mycobacterium kansasii ATCC 12478]
Length=116

 Score = 119 bits (298), Expect = 2e-25, Method: Compositional matrix adjust.
 Identities = 66/97 (69%), Positives = 70/97 (73%), Gaps = 1/97 (1%)

Query  12   DAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAAAMD  71
            DAV AKR ESGMLGGLSVPLSWG AVPPDDYDHWA   E      V  A   +   ++ D
Sbjct  21   DAVSAKRGESGMLGGLSVPLSWGTAVPPDDYDHWAKEDEAAEVAVVPGAVDPEPAESSTD  80

Query  72   EWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG  108
            EWDEW  WNEW A NAEPRFEVPR SS V+PHSPAAG
Sbjct  81   EWDEWAEWNEWEAANAEPRFEVPR-SSRVVPHSPAAG  116

>gi|183983635|ref|YP_001851926.1| hypothetical protein MMAR_3655 [Mycobacterium marinum M]
 gi|183176961|gb|ACC42071.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=105

 Score = 115 bits (287), Expect = 3e-24, Method: Compositional matrix adjust.
 Identities = 66/97 (69%), Positives = 71/97 (74%), Gaps = 3/97 (3%)

Query  12   DAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAAAMD  71
            DAV AKR ESG+L GLSVPLSWG AVPPDDYDHWAP PE     +    E  DA AA  D
Sbjct  12   DAVAAKRGESGLLCGLSVPLSWGTAVPPDDYDHWAPEPE--EGAEAVVEENVDAAAAGTD  69

Query  72   EWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG  108
            EWDEW  W EW A NAEP FE+PR +SSVIP+SPAAG
Sbjct  70   EWDEWAEWREWEAANAEPHFEMPR-TSSVIPNSPAAG  105

>gi|41408233|ref|NP_961069.1| hypothetical protein MAP2135c [Mycobacterium avium subsp. paratuberculosis K-10]
 gi|41396588|gb|AAS04452.1| hypothetical protein MAP_2135c [Mycobacterium avium subsp. paratuberculosis K-10]
Length=143

 Score = 97.4 bits (241), Expect = 6e-19, Method: Compositional matrix adjust.
 Identities = 60/108 (56%), Positives = 65/108 (61%), Gaps = 13/108 (12%)

Query  1    VLLPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAA  60
            VLLPLG PLP D V A R ESG+LGGLSVPL WGVAVPPDDYDHWAP PE  A+   + A
Sbjct  49   VLLPLGSPLPDDTVSAVRGESGVLGGLSVPLKWGVAVPPDDYDHWAPKPEASAEAVEETA  108

Query  61   EGADAEAAAMDEWDEWQAWNEWVAENAEPRFEVPRSSSSVIPHSPAAG  108
            E     A A   W  W          A PR      ++ VIPHSPAAG
Sbjct  109  EMPRPTAVADGVWSGWDG-------EAVPR------TAGVIPHSPAAG  143

>gi|296166321|ref|ZP_06848758.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614]
 gi|295898330|gb|EFG77899.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614]
Length=82

 Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust.
 Identities = 52/87 (60%), Positives = 58/87 (67%), Gaps = 6/87 (6%)

Query  23   MLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADV-DVQAAEGADAEAAAMDEWDEWQAWNE  81
            MLGGLSVPL WGVAVPPDDYDHWAP  E  AD  D  A   A A  +  ++W+EW+ W
Sbjct  1    MLGGLSVPLKWGVAVPPDDYDHWAPKTEANADAGDPVADTPAPAAVSDANQWNEWKRWE-  59

Query  82   WVAENAEPRFEVPRSSSSVIPHSPAAG  108
                 AEP FE+PR S  VIPHSPAAG
Sbjct  60   ---GEAEPHFEMPR-SGGVIPHSPAAG  82

>gi|118464811|ref|YP_881258.1| hypothetical protein MAV_2040 [Mycobacterium avium 104]
 gi|118166098|gb|ABK66995.1| conserved hypothetical protein [Mycobacterium avium 104]
 gi|336461680|gb|EGO40543.1| hypothetical protein MAPs_28430 [Mycobacterium avium subsp. paratuberculosis S397]
Length=73

 Score = 70.9 bits (172), Expect = 7e-11, Method: Compositional matrix adjust.
 Identities = 45/86 (53%), Positives = 49/86 (57%), Gaps = 13/86 (15%)

Query  23   MLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAAAMDEWDEWQAWNEW  82
            MLGGLSVPL WGVAVPPDDYDHWAP PE  A+   + AE     A A   W  W
Sbjct  1    MLGGLSVPLKWGVAVPPDDYDHWAPKPEASAEAVEETAEMPRPTAVADGVWSGWDG----  56

Query  83   VAENAEPRFEVPRSSSSVIPHSPAAG  108
                A PR      ++ VIPHSPAAG
Sbjct  57   ---EAVPR------TAGVIPHSPAAG  73

>gi|342857483|ref|ZP_08714139.1| hypothetical protein MCOL_01365 [Mycobacterium colombiense CECT 3035]
 gi|342134816|gb|EGT87982.1| hypothetical protein MCOL_01365 [Mycobacterium colombiense CECT 3035]
Length=73

 Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
 Identities = 44/86 (52%), Positives = 48/86 (56%), Gaps = 13/86 (15%)

Query  23   MLGGLSVPLSWGVAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAAAMDEWDEWQAWNEW  82
            MLGGLSVPL WGV VPPDDYDHWAP P+ GA+   + A+     A A   W  W
Sbjct  1    MLGGLSVPLKWGVVVPPDDYDHWAPKPDAGAEAPEEMADVPRPTAVADGVWTGWDG----  56

Query  83   VAENAEPRFEVPRSSSSVIPHSPAAG  108
                      VPR S  VIPHSPAAG
Sbjct  57   --------DTVPR-SPGVIPHSPAAG  73

>gi|254774764|ref|ZP_05216280.1| hypothetical protein MaviaA2_08843 [Mycobacterium avium subsp. avium ATCC 25291]
Length=61

 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
 Identities = 33/74 (45%), Positives = 38/74 (52%), Gaps = 13/74 (17%)

Query  35   VAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAAAMDEWDEWQAWNEWVAENAEPRFEVP  94
            +AVPPDDYDHWAP PE  A+   + AE     A A   W  W          A PR
Sbjct  1    MAVPPDDYDHWAPKPEASAEAVEETAEMPRPTAVADGVWSGWDG-------EAVPR----  49

Query  95   RSSSSVIPHSPAAG  108
              ++ VIPHSPAAG
Sbjct  50   --TAGVIPHSPAAG  61

>gi|254819375|ref|ZP_05224376.1| hypothetical protein MintA_05595 [Mycobacterium intracellulare ATCC 13950]
Length=61

 Score = 39.3 bits (90), Expect = 0.21, Method: Compositional matrix adjust.
 Identities = 27/74 (37%), Positives = 32/74 (44%), Gaps = 13/74 (17%)

Query  35   VAVPPDDYDHWAPAPEDGADVDVQAAEGADAEAAAMDEWDEWQAWNEWVAENAEPRFEVP  94
            + VPPDDYDHWAP  +  AD   +  +     A A   W  W+              E
Sbjct  1    MVVPPDDYDHWAPKSDASADTTEETLDAPRPTAVADGVWSGWEG-------------EAA  47

Query  95   RSSSSVIPHSPAAG  108
                 VIPHSPAAG
Sbjct  48   ARPVGVIPHSPAAG  61

Lambda      K        H
   0.311    0.131    0.426

Gapped
Lambda      K        H
   0.267    0.0410   0.140

Effective search space used: 129996839040

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
    Posted date: Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database: 15,229,318

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40