BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0028 Length=101 Score E Sequences producing significant alignments: (Bits) Value gi|15607170|ref|NP_214542.1| hypothetical protein Rv0028 [Mycoba... 206 8e-52 gi|298527426|ref|ZP_07014835.1| conserved hypothetical protein [... 204 5e-51 gi|240172402|ref|ZP_04751061.1| hypothetical protein MkanA1_2400... 184 4e-45 gi|342860244|ref|ZP_08716896.1| hypothetical protein MCOL_15245 ... 182 1e-44 gi|296167097|ref|ZP_06849507.1| conserved hypothetical protein [... 182 2e-44 gi|254819080|ref|ZP_05224081.1| hypothetical protein MintA_04093... 182 2e-44 gi|254773091|ref|ZP_05214607.1| hypothetical protein MaviaA2_001... 181 4e-44 gi|183980082|ref|YP_001848373.1| hypothetical protein MMAR_0047 ... 179 1e-43 gi|41406138|ref|NP_958974.1| hypothetical protein MAP0040 [Mycob... 178 3e-43 gi|118615960|ref|YP_904292.1| hypothetical protein MUL_0046 [Myc... 177 3e-43 gi|15839402|ref|NP_334439.1| hypothetical protein MT0030.1 [Myco... 147 4e-34 gi|333988683|ref|YP_004521297.1| hypothetical protein JDM601_004... 144 4e-33 gi|118463831|ref|YP_879346.1| hypothetical protein MAV_0046 [Myc... 122 2e-26 gi|119855151|ref|YP_935756.1| hypothetical protein Mkms_5766 [My... 37.7 0.53 gi|240170252|ref|ZP_04748911.1| hypothetical protein MkanA1_1313... 35.8 2.3 gi|240168346|ref|ZP_04747005.1| hypothetical protein MkanA1_0347... 34.3 6.3 >gi|15607170|ref|NP_214542.1| hypothetical protein Rv0028 [Mycobacterium tuberculosis H37Rv] gi|31791205|ref|NP_853698.1| hypothetical protein Mb0029 [Mycobacterium bovis AF2122/97] gi|121635938|ref|YP_976161.1| hypothetical protein BCG_0059 [Mycobacterium bovis BCG str. Pasteur 1173P2] 78 more sequence titlesLength=101 Score = 206 bits (524), Expect = 8e-52, Method: Compositional matrix adjust. Identities = 101/101 (100%), Positives = 101/101 (100%), Gaps = 0/101 (0%) Query 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL Sbjct 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 Query 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR 101 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR Sbjct 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR 101 >gi|298527426|ref|ZP_07014835.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|298497220|gb|EFI32514.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] Length=101 Score = 204 bits (518), Expect = 5e-51, Method: Compositional matrix adjust. Identities = 100/101 (99%), Positives = 100/101 (99%), Gaps = 0/101 (0%) Query 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 MTDANPAFDTVHPSGHILVRSCRGGYMHSV LSEAAMETDAETLAEAILLTADVSCLKAL Sbjct 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVPLSEAAMETDAETLAEAILLTADVSCLKAL 60 Query 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR 101 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR Sbjct 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR 101 >gi|240172402|ref|ZP_04751061.1| hypothetical protein MkanA1_24008 [Mycobacterium kansasii ATCC 12478] Length=101 Score = 184 bits (467), Expect = 4e-45, Method: Compositional matrix adjust. Identities = 89/101 (89%), Positives = 93/101 (93%), Gaps = 0/101 (0%) Query 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 MTDANPAFDTVHPSGHILVRSCRGGYMHSV+LSE AMETDAETLAE IL TADVSCLKAL Sbjct 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVTLSEGAMETDAETLAEGILRTADVSCLKAL 60 Query 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR 101 LEVR+EI+AAGHTPSA VPT DL+ AIEKLLAHQLRRR R Sbjct 61 LEVRDEIIAAGHTPSASVPTDQDLDAAIEKLLAHQLRRRRR 101 >gi|342860244|ref|ZP_08716896.1| hypothetical protein MCOL_15245 [Mycobacterium colombiense CECT 3035] gi|342132622|gb|EGT85851.1| hypothetical protein MCOL_15245 [Mycobacterium colombiense CECT 3035] Length=103 Score = 182 bits (462), Expect = 1e-44, Method: Compositional matrix adjust. Identities = 87/100 (87%), Positives = 95/100 (95%), Gaps = 0/100 (0%) Query 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 MTDANPAFDTVHPSGHILVRSCRGGYMHSV+LSE AM+TDAETLA+ ILLTADVSCLKAL Sbjct 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVALSEEAMQTDAETLAQGILLTADVSCLKAL 60 Query 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRN 100 LE+R+EIVAAGHTPSA+VPT DL+ AIEKLLAH+LRRRN Sbjct 61 LEIRDEIVAAGHTPSAEVPTPRDLDAAIEKLLAHKLRRRN 100 >gi|296167097|ref|ZP_06849507.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295897539|gb|EFG77135.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=103 Score = 182 bits (462), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 88/101 (88%), Positives = 95/101 (95%), Gaps = 0/101 (0%) Query 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 MTDANPAFDTVHPSGHILVRSCRGGYMHSV+LSE AMETDAETLA ILLTADVSCLKAL Sbjct 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVALSEEAMETDAETLARGILLTADVSCLKAL 60 Query 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR 101 LE+R+EIVAAGHTPSA+VPT DL+ AIEKLLAH+LRRR+R Sbjct 61 LEIRDEIVAAGHTPSAEVPTPRDLDAAIEKLLAHKLRRRHR 101 >gi|254819080|ref|ZP_05224081.1| hypothetical protein MintA_04093 [Mycobacterium intracellulare ATCC 13950] Length=103 Score = 182 bits (461), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 87/100 (87%), Positives = 95/100 (95%), Gaps = 0/100 (0%) Query 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 MTDANPAFDTVHPSGHILVRSCRGGYMHSV+LSE AMETDAETLA+ I+LTADVSCLKAL Sbjct 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVALSEEAMETDAETLAQGIVLTADVSCLKAL 60 Query 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRN 100 LE+R+EIVAAGHTPSA+VPT DL+ AIEKLLAH+LRRRN Sbjct 61 LEIRDEIVAAGHTPSAEVPTPRDLDAAIEKLLAHKLRRRN 100 >gi|254773091|ref|ZP_05214607.1| hypothetical protein MaviaA2_00196 [Mycobacterium avium subsp. avium ATCC 25291] Length=103 Score = 181 bits (458), Expect = 4e-44, Method: Compositional matrix adjust. Identities = 87/100 (87%), Positives = 94/100 (94%), Gaps = 0/100 (0%) Query 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 MTDANPAFDTVHPSGHILVRSCRGGYMHSV+LSE AMETDAETLA+ ILLTADVSCLKAL Sbjct 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVALSEEAMETDAETLAQGILLTADVSCLKAL 60 Query 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRN 100 LE+R+EIVAAGHTPSA+VPT DL+ AIEKLLAH+LRRR Sbjct 61 LEIRDEIVAAGHTPSAEVPTPRDLDAAIEKLLAHKLRRRT 100 >gi|183980082|ref|YP_001848373.1| hypothetical protein MMAR_0047 [Mycobacterium marinum M] gi|183173408|gb|ACC38518.1| conserved hypothetical protein [Mycobacterium marinum M] Length=101 Score = 179 bits (454), Expect = 1e-43, Method: Compositional matrix adjust. Identities = 88/101 (88%), Positives = 92/101 (92%), Gaps = 0/101 (0%) Query 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 MTDANPAFDTVHPSGHILVRSCRGGYMHSV+LSE AMETDA LAE ILLTADVSCLKAL Sbjct 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVALSEGAMETDAVALAEGILLTADVSCLKAL 60 Query 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR 101 LEVR EIVAAGHTPSA+VPT DL+VAIE+LLAHQLR R R Sbjct 61 LEVREEIVAAGHTPSAEVPTNRDLDVAIERLLAHQLRPRRR 101 >gi|41406138|ref|NP_958974.1| hypothetical protein MAP0040 [Mycobacterium avium subsp. paratuberculosis K-10] gi|41394486|gb|AAS02357.1| hypothetical protein MAP_0040 [Mycobacterium avium subsp. paratuberculosis K-10] gi|336459445|gb|EGO38387.1| Protein of unknown function (DUF2694) [Mycobacterium avium subsp. paratuberculosis S397] Length=103 Score = 178 bits (451), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 86/100 (86%), Positives = 93/100 (93%), Gaps = 0/100 (0%) Query 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 MTDANPAFDTVHPSGHILVRSC GGYMHSV+LSE AMETDAETLA+ ILLTADVSCLKAL Sbjct 1 MTDANPAFDTVHPSGHILVRSCGGGYMHSVALSEEAMETDAETLAQGILLTADVSCLKAL 60 Query 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRN 100 LE+R+EIVAAGHTPSA+VPT DL+ AIEKLLAH+LRRR Sbjct 61 LEIRDEIVAAGHTPSAEVPTPRDLDAAIEKLLAHKLRRRT 100 >gi|118615960|ref|YP_904292.1| hypothetical protein MUL_0046 [Mycobacterium ulcerans Agy99] gi|118568070|gb|ABL02821.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] Length=134 Score = 177 bits (450), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 87/99 (88%), Positives = 91/99 (92%), Gaps = 0/99 (0%) Query 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 MTDANPAFDTVHPSGHILVRSCRGGYMHSV+LSE AMETDA LAE ILLTADVSCLKAL Sbjct 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVALSEGAMETDAAALAEGILLTADVSCLKAL 60 Query 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRR 99 LEVR EIVAAGHTPSA+VPT DL+VAIE+LLAHQLR R Sbjct 61 LEVREEIVAAGHTPSAEVPTNRDLDVAIERLLAHQLRPR 99 >gi|15839402|ref|NP_334439.1| hypothetical protein MT0030.1 [Mycobacterium tuberculosis CDC1551] gi|13879073|gb|AAK44253.1| hypothetical protein MT0030.1 [Mycobacterium tuberculosis CDC1551] Length=75 Score = 147 bits (372), Expect = 4e-34, Method: Compositional matrix adjust. Identities = 75/75 (100%), Positives = 75/75 (100%), Gaps = 0/75 (0%) Query 27 MHSVSLSEAAMETDAETLAEAILLTADVSCLKALLEVRNEIVAAGHTPSAQVPTTDDLNV 86 MHSVSLSEAAMETDAETLAEAILLTADVSCLKALLEVRNEIVAAGHTPSAQVPTTDDLNV Sbjct 1 MHSVSLSEAAMETDAETLAEAILLTADVSCLKALLEVRNEIVAAGHTPSAQVPTTDDLNV 60 Query 87 AIEKLLAHQLRRRNR 101 AIEKLLAHQLRRRNR Sbjct 61 AIEKLLAHQLRRRNR 75 >gi|333988683|ref|YP_004521297.1| hypothetical protein JDM601_0044 [Mycobacterium sp. JDM601] gi|333484652|gb|AEF34044.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=108 Score = 144 bits (363), Expect = 4e-33, Method: Compositional matrix adjust. Identities = 70/101 (70%), Positives = 82/101 (82%), Gaps = 0/101 (0%) Query 1 MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKAL 60 MT+ NPAFDT HPSG +L RSCRGGY+HSV+LSEAAM DA+ LAEAI+L ADVS LKA Sbjct 1 MTEPNPAFDTTHPSGDVLFRSCRGGYLHSVALSEAAMTADADRLAEAIVLAADVSYLKAA 60 Query 61 LEVRNEIVAAGHTPSAQVPTTDDLNVAIEKLLAHQLRRRNR 101 LE+R EIV+ GH+PSA VPTTDDL VA E+LL H+L +R Sbjct 61 LEIRGEIVSTGHSPSAAVPTTDDLRVATERLLNHRLHAGHR 101 >gi|118463831|ref|YP_879346.1| hypothetical protein MAV_0046 [Mycobacterium avium 104] gi|118165118|gb|ABK66015.1| conserved hypothetical protein [Mycobacterium avium 104] Length=77 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 61/74 (83%), Positives = 68/74 (92%), Gaps = 0/74 (0%) Query 27 MHSVSLSEAAMETDAETLAEAILLTADVSCLKALLEVRNEIVAAGHTPSAQVPTTDDLNV 86 MHSV+LSE AMETDAETLA+ ILLTADVSCLKALLE+R+EIVAAGHTPSA+VPT DL+ Sbjct 1 MHSVALSEEAMETDAETLAQGILLTADVSCLKALLEIRDEIVAAGHTPSAEVPTPRDLDA 60 Query 87 AIEKLLAHQLRRRN 100 AIEKLLAH+LRRR Sbjct 61 AIEKLLAHKLRRRT 74 >gi|119855151|ref|YP_935756.1| hypothetical protein Mkms_5766 [Mycobacterium sp. KMS] gi|119697869|gb|ABL94941.1| hypothetical protein Mkms_5766 [Mycobacterium sp. KMS] Length=102 Score = 37.7 bits (86), Expect = 0.53, Method: Compositional matrix adjust. Identities = 28/87 (33%), Positives = 41/87 (48%), Gaps = 2/87 (2%) Query 7 AFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKALLEVRNE 66 AF P +LVR G + V L AM + LA+ I+ ADV+ L+ + +R + Sbjct 13 AFLARTPDDAVLVRVAVKGSILGVQLEPKAMRDNMHELAQRIMACADVAYLQGQVALREQ 72 Query 67 IVAAGHTP--SAQVPTTDDLNVAIEKL 91 + A P A PT DL A ++L Sbjct 73 MEHAKLDPVCYADFPTERDLAAARDRL 99 >gi|240170252|ref|ZP_04748911.1| hypothetical protein MkanA1_13138 [Mycobacterium kansasii ATCC 12478] Length=210 Score = 35.8 bits (81), Expect = 2.3, Method: Compositional matrix adjust. Identities = 32/108 (30%), Positives = 47/108 (44%), Gaps = 19/108 (17%) Query 8 FDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKA-------L 60 F +P G + V + GG +H V LSE LAE I + AD++ KA + Sbjct 75 FTVTNPQGSVSVSALLGGIIHQVELSEEVTTMAEPKLAEEIFVIADLARQKARAAQYTFM 134 Query 61 LEVRNEIVAAGHTPSAQ----------VPTTDDLNVAIEKLLAHQLRR 98 L+ EI H SAQ +PT ++ A +K+ A + R Sbjct 135 LQSVREIKNEQH--SAQLLEFVGTTLNLPTPEEAAAAEKKVFATRYGR 180 >gi|240168346|ref|ZP_04747005.1| hypothetical protein MkanA1_03477 [Mycobacterium kansasii ATCC 12478] Length=180 Score = 34.3 bits (77), Expect = 6.3, Method: Compositional matrix adjust. Identities = 20/76 (27%), Positives = 36/76 (48%), Gaps = 7/76 (9%) Query 8 FDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKA-------L 60 F +P G + V + GG + +++++ A LAE I + AD++ KA + Sbjct 64 FTVTNPQGSVSVTAMMGGIIQKITVTDKASRMTESGLAEEIFVIADLARQKARAAQHTFM 123 Query 61 LEVRNEIVAAGHTPSA 76 +E NE+ G +A Sbjct 124 MESMNELAGDGEEANA 139 Lambda K H 0.317 0.129 0.361 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 128767090968 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40