BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1535 Length=78 Score E Sequences producing significant alignments: (Bits) Value gi|15608673|ref|NP_216051.1| hypothetical protein Rv1535 [Mycoba... 153 8e-36 gi|118464089|ref|YP_882418.1| hypothetical protein MAV_3236 [Myc... 103 6e-21 gi|240167917|ref|ZP_04746576.1| hypothetical protein MkanA1_0128... 102 2e-20 gi|41407343|ref|NP_960179.1| hypothetical protein MAP1245 [Mycob... 102 2e-20 gi|254821463|ref|ZP_05226464.1| hypothetical protein MintA_16125... 96.3 1e-18 gi|296164407|ref|ZP_06846982.1| conserved hypothetical protein [... 95.5 2e-18 gi|342860884|ref|ZP_08717534.1| hypothetical protein MCOL_18473 ... 94.4 5e-18 gi|118617176|ref|YP_905508.1| hypothetical protein MUL_1522 [Myc... 82.4 2e-14 gi|296168757|ref|ZP_06850448.1| conserved hypothetical protein [... 75.1 4e-12 gi|41410350|ref|NP_963186.1| hypothetical protein MAP4252c [Myco... 57.0 1e-06 gi|336460685|gb|EGO39575.1| hypothetical protein MAPs_37760 [Myc... 56.6 1e-06 gi|342861889|ref|ZP_08718534.1| hypothetical protein MCOL_23485 ... 54.7 4e-06 gi|118465576|ref|YP_883514.1| hypothetical protein MAV_4379 [Myc... 49.7 1e-04 gi|333989037|ref|YP_004521651.1| hypothetical protein JDM601_039... 49.3 2e-04 gi|118466971|ref|YP_883856.1| hypothetical protein MAV_4729 [Myc... 46.2 0.002 gi|336460344|gb|EGO39244.1| hypothetical protein MAPs_41400 [Myc... 46.2 0.002 gi|342859084|ref|ZP_08715738.1| hypothetical protein MCOL_09408 ... 45.4 0.003 gi|254818864|ref|ZP_05223865.1| hypothetical protein MintA_03009... 41.6 0.041 gi|333992558|ref|YP_004525172.1| hypothetical protein JDM601_391... 35.8 2.0 gi|320324856|gb|EFW80928.1| Tat pathway signal sequence domain-c... 34.7 4.5 gi|320329221|gb|EFW85218.1| Tat pathway signal sequence domain-c... 34.7 4.6 >gi|15608673|ref|NP_216051.1| hypothetical protein Rv1535 [Mycobacterium tuberculosis H37Rv] gi|15841002|ref|NP_336039.1| hypothetical protein MT1586 [Mycobacterium tuberculosis CDC1551] gi|31792721|ref|NP_855214.1| hypothetical protein Mb1562 [Mycobacterium bovis AF2122/97] 76 more sequence titlesLength=78 Score = 153 bits (386), Expect = 8e-36, Method: Compositional matrix adjust. Identities = 78/78 (100%), Positives = 78/78 (100%), Gaps = 0/78 (0%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIP 60 MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIP Sbjct 1 MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIP 60 Query 61 LRHLYAALWRVGLLEVQA 78 LRHLYAALWRVGLLEVQA Sbjct 61 LRHLYAALWRVGLLEVQA 78 >gi|118464089|ref|YP_882418.1| hypothetical protein MAV_3236 [Mycobacterium avium 104] gi|254775684|ref|ZP_05217200.1| hypothetical protein MaviaA2_13600 [Mycobacterium avium subsp. avium ATCC 25291] gi|118165376|gb|ABK66273.1| conserved hypothetical protein [Mycobacterium avium 104] Length=78 Score = 103 bits (258), Expect = 6e-21, Method: Compositional matrix adjust. Identities = 54/78 (70%), Positives = 59/78 (76%), Gaps = 0/78 (0%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIP 60 MTAA +++V TVA AP LRVVRD A RR PFG GGDPLVDGAARLLSIP Sbjct 1 MTAAFYDEVATVAPAPTLRVVRDTDTIAAPAPKTRRGSGAPFGAGGDPLVDGAARLLSIP 60 Query 61 LRHLYAALWRVGLLEVQA 78 LRH+YAALWRVG+LEVQA Sbjct 61 LRHVYAALWRVGVLEVQA 78 >gi|240167917|ref|ZP_04746576.1| hypothetical protein MkanA1_01285 [Mycobacterium kansasii ATCC 12478] Length=82 Score = 102 bits (254), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 52/81 (65%), Positives = 64/81 (80%), Gaps = 4/81 (4%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPP----APASKKVARRLDAQPFGTGGDPLVDGAARL 56 MTA L+++VVT AP+LRVVR+ P APA +VARR PFG GGDPLVDGAARL Sbjct 1 MTAVLYDEVVTATPAPRLRVVREGAPIETSAPAKVQVARRTANPPFGVGGDPLVDGAARL 60 Query 57 LSIPLRHLYAALWRVGLLEVQ 77 L+IP+RH+YAALWR+G+LEV+ Sbjct 61 LTIPIRHVYAALWRIGVLEVR 81 >gi|41407343|ref|NP_960179.1| hypothetical protein MAP1245 [Mycobacterium avium subsp. paratuberculosis K-10] gi|41395695|gb|AAS03562.1| hypothetical protein MAP_1245 [Mycobacterium avium subsp. paratuberculosis K-10] gi|336457954|gb|EGO36943.1| hypothetical protein MAPs_17880 [Mycobacterium avium subsp. paratuberculosis S397] Length=78 Score = 102 bits (253), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 53/78 (68%), Positives = 58/78 (75%), Gaps = 0/78 (0%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIP 60 MTAA +++V VA AP LRVVRD A RR PFG GGDPLVDGAARLLSIP Sbjct 1 MTAAFYDEVAAVAPAPTLRVVRDTDTIAAPAPKTRRGSGAPFGAGGDPLVDGAARLLSIP 60 Query 61 LRHLYAALWRVGLLEVQA 78 LRH+YAALWRVG+LEVQA Sbjct 61 LRHVYAALWRVGVLEVQA 78 >gi|254821463|ref|ZP_05226464.1| hypothetical protein MintA_16125 [Mycobacterium intracellulare ATCC 13950] Length=81 Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 53/80 (67%), Positives = 60/80 (75%), Gaps = 2/80 (2%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPP--APASKKVARRLDAQPFGTGGDPLVDGAARLLS 58 MTAAL++DVVTVA A LRVVRDVP P + RR D FG GGDPL+ GA RLLS Sbjct 1 MTAALYDDVVTVAPARTLRVVRDVPEIHVPEPVEEPRRSDFPLFGPGGDPLIHGACRLLS 60 Query 59 IPLRHLYAALWRVGLLEVQA 78 IPLRH+YAALWRVG+LEV+ Sbjct 61 IPLRHVYAALWRVGVLEVRG 80 >gi|296164407|ref|ZP_06846982.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295900247|gb|EFG79678.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=79 Score = 95.5 bits (236), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 54/80 (68%), Positives = 59/80 (74%), Gaps = 3/80 (3%) Query 1 MTAALHNDVVTVASAPKLRVVRDVP--PAPASKKVARRLDAQPFGTGGDPLVDGAARLLS 58 MT ALH+DV TVA L+VVRD PA A KKV RR+ Q GGDPLVDGA RLLS Sbjct 1 MTVALHDDVTTVAPTRTLKVVRDTDTVPAAAPKKVPRRIATQ-LLLGGDPLVDGATRLLS 59 Query 59 IPLRHLYAALWRVGLLEVQA 78 IPLRH YAALWRVG+LE+QA Sbjct 60 IPLRHAYAALWRVGVLEIQA 79 >gi|342860884|ref|ZP_08717534.1| hypothetical protein MCOL_18473 [Mycobacterium colombiense CECT 3035] gi|342131908|gb|EGT85165.1| hypothetical protein MCOL_18473 [Mycobacterium colombiense CECT 3035] Length=83 Score = 94.4 bits (233), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 54/82 (66%), Positives = 63/82 (77%), Gaps = 4/82 (4%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPP----APASKKVARRLDAQPFGTGGDPLVDGAARL 56 MTAA +++V+TVA AP L VVRD APA ARR D PFG GGDPLVDGAARL Sbjct 2 MTAAFYDEVLTVAPAPTLTVVRDTDTIPAPAPAPVAKARRSDHAPFGAGGDPLVDGAARL 61 Query 57 LSIPLRHLYAALWRVGLLEVQA 78 LS+PLRH+YAALWRVG+L+V+A Sbjct 62 LSVPLRHVYAALWRVGVLDVRA 83 >gi|118617176|ref|YP_905508.1| hypothetical protein MUL_1522 [Mycobacterium ulcerans Agy99] gi|183982361|ref|YP_001850652.1| hypothetical protein MMAR_2348 [Mycobacterium marinum M] gi|118569286|gb|ABL04037.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] gi|183175687|gb|ACC40797.1| conserved hypothetical protein [Mycobacterium marinum M] Length=77 Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 48/78 (62%), Positives = 58/78 (75%), Gaps = 1/78 (1%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIP 60 MT A+ ++VVTV+ AP+L VVRD P PA + + F GGDPLV GAARLLS+P Sbjct 1 MTVAVFDEVVTVSPAPRLTVVRDGTPPPAPAAKKKSANRSDF-VGGDPLVAGAARLLSVP 59 Query 61 LRHLYAALWRVGLLEVQA 78 +RH+YAALWRVGLLEVQA Sbjct 60 VRHVYAALWRVGLLEVQA 77 >gi|296168757|ref|ZP_06850448.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896559|gb|EFG76202.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=71 Score = 75.1 bits (183), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 43/78 (56%), Positives = 53/78 (68%), Gaps = 7/78 (8%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIP 60 MTA L +DV+TVA AP L + PAP +R +P +G DP+VD AARLLSIP Sbjct 1 MTAVLFDDVITVAPAPSLTLAAAPEPAP------KRRRREPIASG-DPMVDTAARLLSIP 53 Query 61 LRHLYAALWRVGLLEVQA 78 LRH+YAALWR G++EV A Sbjct 54 LRHVYAALWRAGVIEVVA 71 >gi|41410350|ref|NP_963186.1| hypothetical protein MAP4252c [Mycobacterium avium subsp. paratuberculosis K-10] gi|41399184|gb|AAS06802.1| hypothetical protein MAP_4252c [Mycobacterium avium subsp. paratuberculosis K-10] Length=89 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 35/78 (45%), Positives = 41/78 (53%), Gaps = 14/78 (17%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIP 60 MTA +DVVT P R R PAPA G PL+D RLL+IP Sbjct 26 MTAIRCDDVVTATRVPAARQARPPRPAPAD--------------SGSPLMDMTTRLLAIP 71 Query 61 LRHLYAALWRVGLLEVQA 78 L LYA LWRVG++EV+A Sbjct 72 LHQLYALLWRVGIIEVRA 89 >gi|336460685|gb|EGO39575.1| hypothetical protein MAPs_37760 [Mycobacterium avium subsp. paratuberculosis S397] Length=64 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 35/78 (45%), Positives = 41/78 (53%), Gaps = 14/78 (17%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIP 60 MTA +DVVT P R R PAPA G PL+D RLL+IP Sbjct 1 MTAIRCDDVVTATRVPAARQARPPRPAPAD--------------SGSPLMDMTTRLLAIP 46 Query 61 LRHLYAALWRVGLLEVQA 78 L LYA LWRVG++EV+A Sbjct 47 LHQLYALLWRVGIIEVRA 64 >gi|342861889|ref|ZP_08718534.1| hypothetical protein MCOL_23485 [Mycobacterium colombiense CECT 3035] gi|342130706|gb|EGT84010.1| hypothetical protein MCOL_23485 [Mycobacterium colombiense CECT 3035] Length=65 Score = 54.7 bits (130), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 34/77 (45%), Positives = 47/77 (62%), Gaps = 13/77 (16%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIP 60 MTA ++DVV P APA ++V + P G+G +PL+D A+LLSIP Sbjct 1 MTAVRYDDVVPA------------PLAPAPRRVPQARPQAPAGSG-NPLMDMTAQLLSIP 47 Query 61 LRHLYAALWRVGLLEVQ 77 L LYAALWR+G++EV+ Sbjct 48 LHQLYAALWRMGVIEVR 64 >gi|118465576|ref|YP_883514.1| hypothetical protein MAV_4379 [Mycobacterium avium 104] gi|254776808|ref|ZP_05218324.1| hypothetical protein MaviaA2_19376 [Mycobacterium avium subsp. avium ATCC 25291] gi|118166863|gb|ABK67760.1| conserved hypothetical protein [Mycobacterium avium 104] Length=64 Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 21/34 (62%), Positives = 27/34 (80%), Gaps = 0/34 (0%) Query 45 GGDPLVDGAARLLSIPLRHLYAALWRVGLLEVQA 78 G+PL+D RLL+IPL LYA LWRVG++EV+A Sbjct 31 SGNPLMDMTTRLLAIPLHQLYAVLWRVGIIEVRA 64 >gi|333989037|ref|YP_004521651.1| hypothetical protein JDM601_0397 [Mycobacterium sp. JDM601] gi|333485005|gb|AEF34397.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=67 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 26/54 (49%), Positives = 34/54 (63%), Gaps = 3/54 (5%) Query 26 PAPASKKVARRLDAQ---PFGTGGDPLVDGAARLLSIPLRHLYAALWRVGLLEV 76 PAP + A+++D + P DP+VD A RLLS PL +YA LWRVG+L V Sbjct 12 PAPRAISTAKKVDTRRGAPAAPASDPVVDMATRLLSGPLHQMYALLWRVGVLTV 65 >gi|118466971|ref|YP_883856.1| hypothetical protein MAV_4729 [Mycobacterium avium 104] gi|254777168|ref|ZP_05218684.1| hypothetical protein MaviaA2_21214 [Mycobacterium avium subsp. avium ATCC 25291] gi|118168258|gb|ABK69155.1| conserved hypothetical protein [Mycobacterium avium 104] Length=64 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. Identities = 20/34 (59%), Positives = 26/34 (77%), Gaps = 0/34 (0%) Query 44 TGGDPLVDGAARLLSIPLRHLYAALWRVGLLEVQ 77 TGGDPLV A +L +PL LYA LWRVG++E++ Sbjct 4 TGGDPLVSSFASVLKVPLVELYALLWRVGVVEIR 37 >gi|336460344|gb|EGO39244.1| hypothetical protein MAPs_41400 [Mycobacterium avium subsp. paratuberculosis S397] Length=64 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. Identities = 20/34 (59%), Positives = 26/34 (77%), Gaps = 0/34 (0%) Query 44 TGGDPLVDGAARLLSIPLRHLYAALWRVGLLEVQ 77 TGGDPLV A +L +PL LYA LWRVG++E++ Sbjct 4 TGGDPLVSSFASVLKVPLVELYALLWRVGVVEIR 37 >gi|342859084|ref|ZP_08715738.1| hypothetical protein MCOL_09408 [Mycobacterium colombiense CECT 3035] gi|342133325|gb|EGT86528.1| hypothetical protein MCOL_09408 [Mycobacterium colombiense CECT 3035] Length=88 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust. Identities = 20/34 (59%), Positives = 26/34 (77%), Gaps = 0/34 (0%) Query 44 TGGDPLVDGAARLLSIPLRHLYAALWRVGLLEVQ 77 TGGDPLV A +L +PL LYA LWRVG++E++ Sbjct 4 TGGDPLVSSFASVLRVPLLELYALLWRVGVVEIR 37 >gi|254818864|ref|ZP_05223865.1| hypothetical protein MintA_03009 [Mycobacterium intracellulare ATCC 13950] Length=72 Score = 41.6 bits (96), Expect = 0.041, Method: Compositional matrix adjust. Identities = 29/78 (38%), Positives = 38/78 (49%), Gaps = 13/78 (16%) Query 1 MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIP 60 MTA +++VV SAP AP RR P G+ L+D +L+ Sbjct 8 MTAVRYDEVVPAPSAP----------APGHAARPRR---HPSAGSGNALMDLTTHVLTAT 54 Query 61 LRHLYAALWRVGLLEVQA 78 L LYAALWR G++EV A Sbjct 55 LHQLYAALWRAGVIEVTA 72 >gi|333992558|ref|YP_004525172.1| hypothetical protein JDM601_3918 [Mycobacterium sp. JDM601] gi|333488526|gb|AEF37918.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=56 Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust. Identities = 14/32 (44%), Positives = 22/32 (69%), Gaps = 0/32 (0%) Query 46 GDPLVDGAARLLSIPLRHLYAALWRVGLLEVQ 77 +PL + A L++P+ LYA LWR G+L+V+ Sbjct 8 AEPLAEVTATALTVPIVELYAVLWRAGVLDVE 39 >gi|320324856|gb|EFW80928.1| Tat pathway signal sequence domain-containing protein [Pseudomonas syringae pv. glycinea str. B076] Length=544 Score = 34.7 bits (78), Expect = 4.5, Method: Compositional matrix adjust. Identities = 20/59 (34%), Positives = 29/59 (50%), Gaps = 3/59 (5%) Query 14 SAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIPLRHLYAALWRVG 72 + P+ RV+ D+ PA A+K R +D G DP D A S+PL ++ A G Sbjct 107 AQPRFRVIGDLQPANAAKLYLRVMDGH---AGNDPHSDTADYEFSLPLDEVWGAFGNAG 162 >gi|320329221|gb|EFW85218.1| Tat pathway signal sequence domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] Length=544 Score = 34.7 bits (78), Expect = 4.6, Method: Compositional matrix adjust. Identities = 20/59 (34%), Positives = 29/59 (50%), Gaps = 3/59 (5%) Query 14 SAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLVDGAARLLSIPLRHLYAALWRVG 72 + P+ RV+ D+ PA A+K R +D G DP D A S+PL ++ A G Sbjct 107 AQPRFRVIGDLQPANAAKLYLRVMDGH---AGNDPHSDTADYEFSLPLDEVWGAFGNAG 162 Lambda K H 0.321 0.136 0.403 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 129734191374 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40