BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3103c Length=145 Score E Sequences producing significant alignments: (Bits) Value gi|15842674|ref|NP_337711.1| hypothetical protein MT3186.1 [Myco... 272 1e-71 gi|15610240|ref|NP_217619.1| hypothetical protein Rv3103c [Mycob... 271 3e-71 gi|183981544|ref|YP_001849835.1| hypothetical protein MMAR_1529 ... 100 7e-20 gi|296169217|ref|ZP_06850870.1| conserved hypothetical protein [... 99.4 2e-19 gi|118617902|ref|YP_906234.1| hypothetical protein MUL_2410 [Myc... 98.6 3e-19 gi|254776427|ref|ZP_05217943.1| hypothetical protein MaviaA2_174... 94.0 7e-18 gi|41409271|ref|NP_962107.1| hypothetical protein MAP3173c [Myco... 94.0 7e-18 gi|118465405|ref|YP_883157.1| hypothetical protein MAV_4004 [Myc... 93.6 8e-18 gi|167969711|ref|ZP_02551988.1| hypothetical proline rich protei... 90.1 9e-17 gi|126434179|ref|YP_001069870.1| hypothetical protein Mjls_1581 ... 89.0 2e-16 gi|108798580|ref|YP_638777.1| hypothetical protein Mmcs_1610 [My... 89.0 2e-16 gi|145225028|ref|YP_001135706.1| hypothetical protein Mflv_4449 ... 85.5 2e-15 gi|333991483|ref|YP_004524097.1| hypothetical protein JDM601_284... 79.0 2e-13 gi|342861023|ref|ZP_08717672.1| hypothetical protein MCOL_19167 ... 77.4 6e-13 gi|120402910|ref|YP_952739.1| hypothetical protein Mvan_1913 [My... 75.5 3e-12 gi|118469794|ref|YP_886448.1| hypothetical protein MSMEG_2088 [M... 66.2 2e-09 >gi|15842674|ref|NP_337711.1| hypothetical protein MT3186.1 [Mycobacterium tuberculosis CDC1551] gi|253797796|ref|YP_003030797.1| hypothetical protein TBMG_00863 [Mycobacterium tuberculosis KZN 1435] gi|254365731|ref|ZP_04981776.1| hypothetical proline-rich protein [Mycobacterium tuberculosis str. Haarlem] 10 more sequence titlesLength=158 Score = 272 bits (695), Expect = 1e-71, Method: Compositional matrix adjust. Identities = 145/145 (100%), Positives = 145/145 (100%), Gaps = 0/145 (0%) Query 1 VKLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFV 60 VKLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFV Sbjct 14 VKLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFV 73 Query 61 PDPDYTWVPRTRVQPPTVKATPTTTSSTPPVSPPETTTDSAVPPPFELPPPFGPGTTTPT 120 PDPDYTWVPRTRVQPPTVKATPTTTSSTPPVSPPETTTDSAVPPPFELPPPFGPGTTTPT Sbjct 74 PDPDYTWVPRTRVQPPTVKATPTTTSSTPPVSPPETTTDSAVPPPFELPPPFGPGTTTPT 133 Query 121 PPAPLPQPGPGPTAGTYPKSEPPTR 145 PPAPLPQPGPGPTAGTYPKSEPPTR Sbjct 134 PPAPLPQPGPGPTAGTYPKSEPPTR 158 >gi|15610240|ref|NP_217619.1| hypothetical protein Rv3103c [Mycobacterium tuberculosis H37Rv] gi|31794282|ref|NP_856775.1| hypothetical protein Mb3130c [Mycobacterium bovis AF2122/97] gi|121638988|ref|YP_979212.1| hypothetical protein BCG_3128c [Mycobacterium bovis BCG str. Pasteur 1173P2] 61 more sequence titles Length=145 Score = 271 bits (692), Expect = 3e-71, Method: Compositional matrix adjust. Identities = 144/145 (99%), Positives = 145/145 (100%), Gaps = 0/145 (0%) Query 1 VKLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFV 60 +KLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFV Sbjct 1 MKLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFV 60 Query 61 PDPDYTWVPRTRVQPPTVKATPTTTSSTPPVSPPETTTDSAVPPPFELPPPFGPGTTTPT 120 PDPDYTWVPRTRVQPPTVKATPTTTSSTPPVSPPETTTDSAVPPPFELPPPFGPGTTTPT Sbjct 61 PDPDYTWVPRTRVQPPTVKATPTTTSSTPPVSPPETTTDSAVPPPFELPPPFGPGTTTPT 120 Query 121 PPAPLPQPGPGPTAGTYPKSEPPTR 145 PPAPLPQPGPGPTAGTYPKSEPPTR Sbjct 121 PPAPLPQPGPGPTAGTYPKSEPPTR 145 >gi|183981544|ref|YP_001849835.1| hypothetical protein MMAR_1529 [Mycobacterium marinum M] gi|183174870|gb|ACC39980.1| conserved hypothetical membrane protein [Mycobacterium marinum M] Length=166 Score = 100 bits (249), Expect = 7e-20, Method: Compositional matrix adjust. Identities = 49/71 (70%), Positives = 57/71 (81%), Gaps = 1/71 (1%) Query 5 NQKRHWPGYLFG-RIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFVPDP 63 N+KR P LFG RIR ST+VL+ AFLAVWW+Y+TY PQ +PP+QVVPPGFVPDP Sbjct 19 NRKRGTPRQLFGGRIRLSTVVLMVAFLAVWWLYDTYNPQHSAGKTTPPSQVVPPGFVPDP 78 Query 64 DYTWVPRTRVQ 74 +YTWVPRTRVQ Sbjct 79 NYTWVPRTRVQ 89 >gi|296169217|ref|ZP_06850870.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896115|gb|EFG75782.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=160 Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 51/73 (70%), Positives = 58/73 (80%), Gaps = 5/73 (6%) Query 4 SNQKRHWPGYLFG-RIRTSTLVLIAAFLAVWWIYETYRPQ-APGPGDSPPTQVVPPGFVP 61 S R WP Y+FG R+RTSTLVLI AF AVWW+Y+TYRP+ AP P P QVVPPGFVP Sbjct 15 SAADRRWPHYMFGGRVRTSTLVLIVAFFAVWWVYDTYRPEPAPKP---PAQQVVPPGFVP 71 Query 62 DPDYTWVPRTRVQ 74 DP+YTWVPR+RVQ Sbjct 72 DPNYTWVPRSRVQ 84 >gi|118617902|ref|YP_906234.1| hypothetical protein MUL_2410 [Mycobacterium ulcerans Agy99] gi|118570012|gb|ABL04763.1| conserved hypothetical membrane protein [Mycobacterium ulcerans Agy99] Length=166 Score = 98.6 bits (244), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 48/71 (68%), Positives = 56/71 (79%), Gaps = 1/71 (1%) Query 5 NQKRHWPGYLFG-RIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFVPDP 63 N+KR P LFG RIR ST+VL+ AFLAVWW+Y+TY PQ +PP+QVVPPG VPDP Sbjct 19 NRKRGTPRQLFGGRIRLSTVVLMVAFLAVWWLYDTYNPQHSAGKTTPPSQVVPPGLVPDP 78 Query 64 DYTWVPRTRVQ 74 +YTWVPRTRVQ Sbjct 79 NYTWVPRTRVQ 89 >gi|254776427|ref|ZP_05217943.1| hypothetical protein MaviaA2_17409 [Mycobacterium avium subsp. avium ATCC 25291] Length=166 Score = 94.0 bits (232), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 47/73 (65%), Positives = 57/73 (79%), Gaps = 5/73 (6%) Query 4 SNQKRHWPGYLFG-RIRTSTLVLIAAFLAVWWIYETYRPQ-APGPGDSPPTQVVPPGFVP 61 + + WP Y+FG R+RTST VLI AFL VWW+Y+TYRP+ AP P P QVVPPGFVP Sbjct 17 TEAEHRWPQYVFGGRMRTSTFVLIVAFLLVWWVYDTYRPEPAPKP---PAQQVVPPGFVP 73 Query 62 DPDYTWVPRTRVQ 74 DP+YTWVPR+R+Q Sbjct 74 DPNYTWVPRSRLQ 86 >gi|41409271|ref|NP_962107.1| hypothetical protein MAP3173c [Mycobacterium avium subsp. paratuberculosis K-10] gi|41398091|gb|AAS05721.1| hypothetical protein MAP_3173c [Mycobacterium avium subsp. paratuberculosis K-10] gi|336459373|gb|EGO38316.1| hypothetical protein MAPs_04720 [Mycobacterium avium subsp. paratuberculosis S397] Length=166 Score = 94.0 bits (232), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 47/73 (65%), Positives = 57/73 (79%), Gaps = 5/73 (6%) Query 4 SNQKRHWPGYLFG-RIRTSTLVLIAAFLAVWWIYETYRPQ-APGPGDSPPTQVVPPGFVP 61 + + WP Y+FG R+RTST VLI AFL VWW+Y+TYRP+ AP P P QVVPPGFVP Sbjct 17 TEAEHRWPQYVFGGRMRTSTFVLIVAFLLVWWVYDTYRPEPAPKP---PAQQVVPPGFVP 73 Query 62 DPDYTWVPRTRVQ 74 DP+YTWVPR+R+Q Sbjct 74 DPNYTWVPRSRLQ 86 >gi|118465405|ref|YP_883157.1| hypothetical protein MAV_4004 [Mycobacterium avium 104] gi|118166692|gb|ABK67589.1| conserved hypothetical protein [Mycobacterium avium 104] Length=166 Score = 93.6 bits (231), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 47/73 (65%), Positives = 57/73 (79%), Gaps = 5/73 (6%) Query 4 SNQKRHWPGYLFG-RIRTSTLVLIAAFLAVWWIYETYRPQ-APGPGDSPPTQVVPPGFVP 61 + + WP Y+FG R+RTST VLI AFL VWW+Y+TYRP+ AP P P QVVPPGFVP Sbjct 17 TEAEHRWPQYVFGGRMRTSTFVLIVAFLLVWWVYDTYRPEPAPKP---PAQQVVPPGFVP 73 Query 62 DPDYTWVPRTRVQ 74 DP+YTWVPR+R+Q Sbjct 74 DPNYTWVPRSRLQ 86 >gi|167969711|ref|ZP_02551988.1| hypothetical proline rich protein [Mycobacterium tuberculosis H37Ra] Length=47 Score = 90.1 bits (222), Expect = 9e-17, Method: Compositional matrix adjust. Identities = 39/41 (96%), Positives = 40/41 (98%), Gaps = 0/41 (0%) Query 1 VKLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYRP 41 +KLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYR Sbjct 1 MKLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYETYRA 41 >gi|126434179|ref|YP_001069870.1| hypothetical protein Mjls_1581 [Mycobacterium sp. JLS] gi|126233979|gb|ABN97379.1| conserved hypothetical protein [Mycobacterium sp. JLS] Length=161 Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 45/76 (60%), Positives = 56/76 (74%), Gaps = 2/76 (2%) Query 3 LSNQKRHWPGYL-FGRIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFVP 61 + N KR WP Y+ GR+RTSTL LI AF+A++W+Y+ Y P P +P QVVPPGFVP Sbjct 10 MQNDKRVWPRYMPGGRVRTSTLGLIVAFIALFWLYQVYEPPV-RPAQNPAQQVVPPGFVP 68 Query 62 DPDYTWVPRTRVQPPT 77 DPDYTWVPRT+V+ P Sbjct 69 DPDYTWVPRTQVEAPV 84 >gi|108798580|ref|YP_638777.1| hypothetical protein Mmcs_1610 [Mycobacterium sp. MCS] gi|119867680|ref|YP_937632.1| hypothetical protein Mkms_1635 [Mycobacterium sp. KMS] gi|108768999|gb|ABG07721.1| conserved hypothetical protein [Mycobacterium sp. MCS] gi|119693769|gb|ABL90842.1| conserved hypothetical protein [Mycobacterium sp. KMS] Length=161 Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 45/76 (60%), Positives = 56/76 (74%), Gaps = 2/76 (2%) Query 3 LSNQKRHWPGYL-FGRIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFVP 61 + N KR WP Y+ GR+RTSTL LI AF+A++W+Y+ Y P P +P QVVPPGFVP Sbjct 10 MQNDKRVWPRYMPGGRVRTSTLGLIVAFIALFWLYQVYEPPV-RPAQNPAQQVVPPGFVP 68 Query 62 DPDYTWVPRTRVQPPT 77 DPDYTWVPRT+V+ P Sbjct 69 DPDYTWVPRTQVEAPV 84 >gi|145225028|ref|YP_001135706.1| hypothetical protein Mflv_4449 [Mycobacterium gilvum PYR-GCK] gi|315445397|ref|YP_004078276.1| hypothetical protein Mspyr1_38480 [Mycobacterium sp. Spyr1] gi|145217514|gb|ABP46918.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK] gi|315263700|gb|ADU00442.1| hypothetical protein Mspyr1_38480 [Mycobacterium sp. Spyr1] Length=178 Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 39/74 (53%), Positives = 52/74 (71%), Gaps = 1/74 (1%) Query 2 KLSNQKRHWPGYLFG-RIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFV 60 +L + RH YLFG R+R ST+ L+ F A++W+ + Y+P+ P P P QVVPPGFV Sbjct 9 RLQPKNRHSRAYLFGGRMRVSTVGLVLVFFALYWVNQNYQPEPPAPAMDPAQQVVPPGFV 68 Query 61 PDPDYTWVPRTRVQ 74 PDP+YTWVPRT V+ Sbjct 69 PDPNYTWVPRTNVE 82 >gi|333991483|ref|YP_004524097.1| hypothetical protein JDM601_2843 [Mycobacterium sp. JDM601] gi|333487451|gb|AEF36843.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=169 Score = 79.0 bits (193), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 49/87 (57%), Positives = 58/87 (67%), Gaps = 9/87 (10%) Query 4 SNQKRHWPGYLF-GRIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFVPD 62 ++ WP LF GR+RTST++LI AF+AVWW+Y+TYRPQ P PPGF+PD Sbjct 14 DGRRWRWPAQLFNGRVRTSTVLLIIAFVAVWWVYDTYRPQPTPPAAPQVV---PPGFIPD 70 Query 63 PDYTWVPRTRVQPPTVKATPTTTSSTP 89 P YTWVPRTRVQ PT TT S TP Sbjct 71 PAYTWVPRTRVQQPT-----TTVSETP 92 >gi|342861023|ref|ZP_08717672.1| hypothetical protein MCOL_19167 [Mycobacterium colombiense CECT 3035] gi|342131467|gb|EGT84737.1| hypothetical protein MCOL_19167 [Mycobacterium colombiense CECT 3035] Length=166 Score = 77.4 bits (189), Expect = 6e-13, Method: Compositional matrix adjust. Identities = 47/75 (63%), Positives = 59/75 (79%), Gaps = 5/75 (6%) Query 4 SNQKRHWPGYLFG-RIRTSTLVLIAAFLAVWWIYETYRPQ-APGPGDSPPTQVVPPGFVP 61 S+ + WP ++FG R+RTST VL+ AFL VWW+Y+TYRP+ AP P P Q+VPPGFVP Sbjct 15 SDAEHRWPKHMFGGRMRTSTFVLVVAFLVVWWVYDTYRPEPAPKP---PAQQLVPPGFVP 71 Query 62 DPDYTWVPRTRVQPP 76 DP+YTWVPR+RVQ P Sbjct 72 DPNYTWVPRSRVQAP 86 >gi|120402910|ref|YP_952739.1| hypothetical protein Mvan_1913 [Mycobacterium vanbaalenii PYR-1] gi|119955728|gb|ABM12733.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1] Length=172 Score = 75.5 bits (184), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 60/118 (51%), Positives = 73/118 (62%), Gaps = 5/118 (4%) Query 2 KLSNQKRHWPGYLFG-RIRTSTLVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFV 60 + + R WP YL G RIR ST LI AFLA++W+ + Y+P+ P P P QVVPPGFV Sbjct 6 RRDGESRGWPTYLLGGRIRASTAGLILAFLALFWVNQNYQPELPAPTPDPAQQVVPPGFV 65 Query 61 PDPDYTWVPRTRVQP--PTVKATPTTTSSTPPVSPPETTTDSAVPPPFE--LPPPFGP 114 PDP+YTWVPRT V P P V T TT++T +PPETTT + P P P GP Sbjct 66 PDPNYTWVPRTNVAPRQPEVTTTTPTTTTTTTTTPPETTTATTTAEPTPSTTPGPLGP 123 >gi|118469794|ref|YP_886448.1| hypothetical protein MSMEG_2088 [Mycobacterium smegmatis str. MC2 155] gi|118171081|gb|ABK71977.1| hypothetical proline-rich protein [Mycobacterium smegmatis str. MC2 155] Length=146 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 32/51 (63%), Positives = 40/51 (79%), Gaps = 3/51 (5%) Query 23 LVLIAAFLAVWWIYETYRPQAPGPGDSPPTQVVPPGFVPDPDYTWVPRTRV 73 +VLI AF A+WW+ +TY+P+ P + QVVPPGFVPDPDYTWVPRT+V Sbjct 1 MVLIVAFFALWWLQQTYQPE---PARTETPQVVPPGFVPDPDYTWVPRTKV 48 Lambda K H 0.313 0.136 0.457 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 128154014136 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40