BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0559c Length=112 Score E Sequences producing significant alignments: (Bits) Value gi|15607699|ref|NP_215073.1| hypothetical protein Rv0559c [Mycob... 234 2e-60 gi|118616464|ref|YP_904796.1| hypothetical protein MUL_0661 [Myc... 179 9e-44 gi|240169239|ref|ZP_04747898.1| putative secreted protein [Mycob... 149 2e-34 gi|342858930|ref|ZP_08715584.1| hypothetical protein MCOL_08638 ... 148 2e-34 gi|15828218|ref|NP_302481.1| hypothetical protein [Mycobacterium... 145 2e-33 gi|296168325|ref|ZP_06850249.1| conserved hypothetical protein [... 144 4e-33 gi|118462525|ref|YP_883712.1| hypothetical protein MAV_4583 [Myc... 143 1e-32 gi|254777015|ref|ZP_05218531.1| hypothetical protein MaviaA2_204... 143 1e-32 gi|41410154|ref|NP_962990.1| hypothetical protein MAP4056c [Myco... 139 1e-31 gi|108802032|ref|YP_642229.1| hypothetical protein Mmcs_5069 [My... 112 1e-23 gi|333990720|ref|YP_004523334.1| hypothetical protein JDM601_208... 110 6e-23 gi|169628633|ref|YP_001702282.1| hypothetical protein MAB_1543 [... 101 3e-20 gi|145224021|ref|YP_001134699.1| hypothetical protein Mflv_3436 ... 99.8 1e-19 gi|118467680|ref|YP_887797.1| hypothetical protein MSMEG_3493 [M... 96.3 2e-18 gi|120404169|ref|YP_953998.1| hypothetical protein Mvan_3190 [My... 87.8 4e-16 gi|118466065|ref|YP_882736.1| hypothetical protein MAV_3558 [Myc... 39.3 0.20 gi|41408884|ref|NP_961720.1| hypothetical protein MAP2786c [Myco... 38.9 0.26 gi|342860702|ref|ZP_08717352.1| hypothetical protein MCOL_17563 ... 38.1 0.42 gi|118462859|ref|YP_883960.1| hypothetical protein MAV_4836 [Myc... 37.7 0.49 gi|254777273|ref|ZP_05218789.1| hypothetical protein MaviaA2_217... 37.7 0.50 gi|296169915|ref|ZP_06851526.1| conserved hypothetical protein [... 37.7 0.55 gi|296164252|ref|ZP_06846842.1| conserved hypothetical protein [... 37.4 0.78 gi|41408599|ref|NP_961435.1| hypothetical protein MAP2501 [Mycob... 35.8 2.2 gi|336458546|gb|EGO37514.1| Protein of unknown function (DUF732)... 35.8 2.3 gi|296164250|ref|ZP_06846841.1| conserved hypothetical protein [... 35.4 2.7 gi|315443073|ref|YP_004075952.1| hypothetical protein Mspyr1_144... 35.4 3.1 gi|254774297|ref|ZP_05215813.1| hypothetical protein MaviaA2_064... 35.4 3.1 gi|118466879|ref|YP_882536.1| hypothetical protein MAV_3354 [Myc... 34.7 5.4 gi|145222608|ref|YP_001133286.1| hypothetical protein Mflv_2020 ... 34.3 5.6 gi|333988948|ref|YP_004521562.1| hypothetical protein JDM601_030... 34.3 6.2 gi|240169738|ref|ZP_04748397.1| hypothetical protein MkanA1_1052... 33.9 8.3 gi|41407621|ref|NP_960457.1| hypothetical protein MAP1523 [Mycob... 33.9 8.6 >gi|15607699|ref|NP_215073.1| hypothetical protein Rv0559c [Mycobacterium tuberculosis H37Rv] gi|15839957|ref|NP_334994.1| hypothetical protein MT0585 [Mycobacterium tuberculosis CDC1551] gi|31791741|ref|NP_854234.1| hypothetical protein Mb0574c [Mycobacterium bovis AF2122/97] 80 more sequence titlesLength=112 Score = 234 bits (598), Expect = 2e-60, Method: Compositional matrix adjust. Identities = 112/112 (100%), Positives = 112/112 (100%), Gaps = 0/112 (0%) Query 1 MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLS 60 MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLS Sbjct 1 MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLS 60 Query 61 RGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRAGTR 112 RGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRAGTR Sbjct 61 RGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRAGTR 112 >gi|118616464|ref|YP_904796.1| hypothetical protein MUL_0661 [Mycobacterium ulcerans Agy99] gi|183980929|ref|YP_001849220.1| hypothetical protein MMAR_0908 [Mycobacterium marinum M] gi|118568574|gb|ABL03325.1| conserved protein [Mycobacterium ulcerans Agy99] gi|183174255|gb|ACC39365.1| conserved protein [Mycobacterium marinum M] Length=111 Score = 179 bits (455), Expect = 9e-44, Method: Compositional matrix adjust. Identities = 95/110 (87%), Positives = 99/110 (90%), Gaps = 0/110 (0%) Query 1 MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLS 60 MKG +LA + VAAV LAAPAQADDYDAPFNNT+H FGIYGPQDYNAWLAKISCER+ Sbjct 1 MKGRRLATIAATAVAAVGLAAPAQADDYDAPFNNTLHSFGIYGPQDYNAWLAKISCERMD 60 Query 61 RGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRAG 110 RGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCP HVGVLQR G Sbjct 61 RGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPAHVGVLQRVG 110 >gi|240169239|ref|ZP_04747898.1| putative secreted protein [Mycobacterium kansasii ATCC 12478] Length=115 Score = 149 bits (375), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 70/86 (82%), Positives = 73/86 (85%), Gaps = 0/86 (0%) Query 26 DDYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLSRGVDGDAYKSATFLQRNLPRGTTQG 85 DDYD F T++ FG+YGPQD AWLAKISCERL RGVD AY SATFLQRNL RGTTQG Sbjct 29 DDYDYTFKQTVNGFGVYGPQDQLAWLAKISCERLERGVDHTAYNSATFLQRNLARGTTQG 88 Query 86 QAFQFLGAAIDHYCPEHVGVLQRAGT 111 QAFQFLGAAIDHYCPE VGVLQRAGT Sbjct 89 QAFQFLGAAIDHYCPEQVGVLQRAGT 114 >gi|342858930|ref|ZP_08715584.1| hypothetical protein MCOL_08638 [Mycobacterium colombiense CECT 3035] gi|342133171|gb|EGT86374.1| hypothetical protein MCOL_08638 [Mycobacterium colombiense CECT 3035] Length=115 Score = 148 bits (374), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 72/110 (66%), Positives = 81/110 (74%), Gaps = 17/110 (15%) Query 19 LAAPAQAD-----------------DYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLSR 61 L+ PAQAD DYDAPF NT++ FGIY PQD AWL KI+CER+ R Sbjct 5 LSVPAQADPPPTAPYPTPRTPSPPSDYDAPFKNTVNGFGIYQPQDQLAWLGKITCERIGR 64 Query 62 GVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRAGT 111 GVDGDAYKSATF+QRNLPRGTTQGQAFQFLGAA+DHYCP+ VG +Q AG Sbjct 65 GVDGDAYKSATFIQRNLPRGTTQGQAFQFLGAAVDHYCPDQVGFVQAAGN 114 >gi|15828218|ref|NP_302481.1| hypothetical protein [Mycobacterium leprae TN] gi|221230695|ref|YP_002504111.1| putative secreted protein [Mycobacterium leprae Br4923] gi|13093911|emb|CAC31790.1| putative secreted protein [Mycobacterium leprae] gi|219933802|emb|CAR72372.1| putative secreted protein [Mycobacterium leprae Br4923] Length=112 Score = 145 bits (367), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 78/112 (70%), Positives = 90/112 (81%), Gaps = 0/112 (0%) Query 1 MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLS 60 MKGT LA V M AA LAAPA ADDYDAPFN +H +GIYG QDYNAWL KI+C+RL+ Sbjct 1 MKGTGLAANVAMAAAATVLAAPALADDYDAPFNAQLHSYGIYGAQDYNAWLGKIACQRLA 60 Query 61 RGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRAGTR 112 +GVDGD KSATF+QRNLP TT+GQ+ QFLGAAI+HYCP +G+LQRAG R Sbjct 61 KGVDGDVNKSATFIQRNLPLYTTEGQSLQFLGAAINHYCPNQIGILQRAGAR 112 >gi|296168325|ref|ZP_06850249.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896756|gb|EFG76389.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=117 Score = 144 bits (364), Expect = 4e-33, Method: Compositional matrix adjust. Identities = 77/116 (67%), Positives = 93/116 (81%), Gaps = 5/116 (4%) Query 1 MKGTKLAVVVGMTVAAVSLAAPAQA-----DDYDAPFNNTIHRFGIYGPQDYNAWLAKIS 55 MKGTKLA + G+ AAV+LAAPA A +D D F NT++ FGIY PQD AW+ KI+ Sbjct 1 MKGTKLATIGGLAAAAVALAAPALASPPPSNDGDVSFKNTVNGFGIYQPQDQLAWMGKIT 60 Query 56 CERLSRGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRAGT 111 C+R+ +GVDGD YKSATF+QRNLPRGTTQGQAFQFLGAA+DHYCP+ +G +QRAGT Sbjct 61 CDRIGKGVDGDPYKSATFIQRNLPRGTTQGQAFQFLGAAVDHYCPDQIGFVQRAGT 116 >gi|118462525|ref|YP_883712.1| hypothetical protein MAV_4583 [Mycobacterium avium 104] gi|118163812|gb|ABK64709.1| conserved hypothetical protein [Mycobacterium avium 104] Length=134 Score = 143 bits (360), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 73/133 (55%), Positives = 88/133 (67%), Gaps = 23/133 (17%) Query 1 MKGTKLAVV------VGMTVAAVSLAAPAQAD-----------------DYDAPFNNTIH 37 MKGT LA + A L+APAQAD DYDAPF NT++ Sbjct 1 MKGTMLATIGAAAVAAFALAAPAQLSAPAQADPPPTAPYPTPRTPSPPSDYDAPFKNTVN 60 Query 38 RFGIYGPQDYNAWLAKISCERLSRGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDH 97 FGIY PQD AWL KI+C+RL GVD DA++SATF+QRNLPRGT++GQ+ QFLGAA+DH Sbjct 61 GFGIYQPQDQLAWLGKITCDRLDHGVDHDAHQSATFIQRNLPRGTSEGQSLQFLGAAVDH 120 Query 98 YCPEHVGVLQRAG 110 YCPEH+ V+Q AG Sbjct 121 YCPEHIDVVQAAG 133 >gi|254777015|ref|ZP_05218531.1| hypothetical protein MaviaA2_20431 [Mycobacterium avium subsp. avium ATCC 25291] Length=134 Score = 143 bits (360), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 73/133 (55%), Positives = 88/133 (67%), Gaps = 23/133 (17%) Query 1 MKGTKLAVV------VGMTVAAVSLAAPAQAD-----------------DYDAPFNNTIH 37 MKGT LA + A L+APAQAD DYDAPF NT++ Sbjct 1 MKGTMLATIGAAAVAAFALAAPAQLSAPAQADPPPTAPYPTPRTPAPPSDYDAPFKNTVN 60 Query 38 RFGIYGPQDYNAWLAKISCERLSRGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDH 97 FGIY PQD AWL KI+C+RL GVD DA++SATF+QRNLPRGT++GQ+ QFLGAA+DH Sbjct 61 GFGIYQPQDQLAWLGKITCDRLDHGVDHDAHQSATFIQRNLPRGTSEGQSLQFLGAAVDH 120 Query 98 YCPEHVGVLQRAG 110 YCPEH+ V+Q AG Sbjct 121 YCPEHIDVVQAAG 133 >gi|41410154|ref|NP_962990.1| hypothetical protein MAP4056c [Mycobacterium avium subsp. paratuberculosis K-10] gi|41398987|gb|AAS06606.1| hypothetical protein MAP_4056c [Mycobacterium avium subsp. paratuberculosis K-10] gi|336460467|gb|EGO39362.1| Protein of unknown function (DUF732) [Mycobacterium avium subsp. paratuberculosis S397] Length=130 Score = 139 bits (351), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 66/109 (61%), Positives = 80/109 (74%), Gaps = 17/109 (15%) Query 19 LAAPAQAD-----------------DYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLSR 61 L+APAQAD DYDAPF NT++ FGIY PQD AWL KI+C+RL Sbjct 21 LSAPAQADPPPTAPYPTPRTPSPPSDYDAPFKNTVNGFGIYQPQDQLAWLGKITCDRLDH 80 Query 62 GVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRAG 110 GVD DA++SATF+QRNLPRGT++GQ+ QFLGAA+DHYCPEH+ V+Q AG Sbjct 81 GVDHDAHQSATFIQRNLPRGTSEGQSLQFLGAAVDHYCPEHIDVVQAAG 129 >gi|108802032|ref|YP_642229.1| hypothetical protein Mmcs_5069 [Mycobacterium sp. MCS] gi|119871184|ref|YP_941136.1| hypothetical protein Mkms_5157 [Mycobacterium sp. KMS] gi|126438011|ref|YP_001073702.1| hypothetical protein Mjls_5448 [Mycobacterium sp. JLS] gi|108772451|gb|ABG11173.1| protein of unknown function DUF732 [Mycobacterium sp. MCS] gi|119697273|gb|ABL94346.1| protein of unknown function DUF732 [Mycobacterium sp. KMS] gi|126237811|gb|ABO01212.1| protein of unknown function DUF732 [Mycobacterium sp. JLS] Length=123 Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 52/105 (50%), Positives = 74/105 (71%), Gaps = 1/105 (0%) Query 9 VVGMTVAAVSLAAPAQAD-DYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLSRGVDGDA 67 ++ + VA ++LA PA A D D F+N++ GIYG +DYNAW+AKI+CER+ RGVD DA Sbjct 17 ILAVAVAGMALATPAVAHADPDTDFSNSLQTIGIYGQKDYNAWIAKIACERIDRGVDRDA 76 Query 68 YKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRAGTR 112 + SATF+ R LP+G+T QA++F+G A YCP H +L +A + Sbjct 77 FASATFVGRQLPKGSTTEQAWKFIGLAYPTYCPAHQALLYQAAEK 121 >gi|333990720|ref|YP_004523334.1| hypothetical protein JDM601_2080 [Mycobacterium sp. JDM601] gi|333486688|gb|AEF36080.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=111 Score = 110 bits (275), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 51/86 (60%), Positives = 62/86 (73%), Gaps = 0/86 (0%) Query 27 DYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLSRGVDGDAYKSATFLQRNLPRGTTQGQ 86 D D F N +H +GIYGP+DYNAWL KI CERL +GVDGDA KS F+ NL +GTTQ Q Sbjct 24 DPDIDFANELHSYGIYGPRDYNAWLGKIVCERLDKGVDGDAAKSVRFITPNLQKGTTQPQ 83 Query 87 AFQFLGAAIDHYCPEHVGVLQRAGTR 112 +QFLGAAI YCP+ VL++A + Sbjct 84 VWQFLGAAIHTYCPDKRPVLEQAAAQ 109 >gi|169628633|ref|YP_001702282.1| hypothetical protein MAB_1543 [Mycobacterium abscessus ATCC 19977] gi|169240600|emb|CAM61628.1| Conserved hypothetical protein [Mycobacterium abscessus] Length=113 Score = 101 bits (252), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 44/84 (53%), Positives = 56/84 (67%), Gaps = 0/84 (0%) Query 29 DAPFNNTIHRFGIYGPQDYNAWLAKISCERLSRGVDGDAYKSATFLQRNLPRGTTQGQAF 88 D FN T+H +GIY P D A+L KI+C RL G+D DAY+S FL +NL R ++ Q + Sbjct 29 DVSFNETLHSYGIYAPPDKTAYLGKITCHRLDTGLDKDAYQSTDFLAKNLDRKSSTEQKW 88 Query 89 QFLGAAIDHYCPEHVGVLQRAGTR 112 QFL A+ID YCPE VL+RA R Sbjct 89 QFLSASIDEYCPEQRPVLERAANR 112 >gi|145224021|ref|YP_001134699.1| hypothetical protein Mflv_3436 [Mycobacterium gilvum PYR-GCK] gi|315444358|ref|YP_004077237.1| hypothetical protein Mspyr1_27730 [Mycobacterium sp. Spyr1] gi|145216507|gb|ABP45911.1| protein of unknown function DUF732 [Mycobacterium gilvum PYR-GCK] gi|315262661|gb|ADT99402.1| Protein of unknown function (DUF732) [Mycobacterium sp. Spyr1] Length=109 Score = 99.8 bits (247), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 47/99 (48%), Positives = 65/99 (66%), Gaps = 1/99 (1%) Query 11 GMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLSRGVDGDAYKS 70 MT+ A+ AAPA AD D F N ++ +GIYG +DYNAW+ KI+C+RL VD D + S Sbjct 12 AMTLGALGTAAPAVADP-DTAFANELNTYGIYGQKDYNAWIGKIACKRLRTNVDPDVFAS 70 Query 71 ATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRA 109 A F+ L G+T QA+QFL AA+ YCP+ + +L +A Sbjct 71 AKFVHNQLETGSTTDQAYQFLAAALRTYCPDRLPILNQA 109 >gi|118467680|ref|YP_887797.1| hypothetical protein MSMEG_3493 [Mycobacterium smegmatis str. MC2 155] gi|118168967|gb|ABK69863.1| putative secreted protein [Mycobacterium smegmatis str. MC2 155] Length=113 Score = 96.3 bits (238), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 40/84 (48%), Positives = 59/84 (71%), Gaps = 0/84 (0%) Query 27 DYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLSRGVDGDAYKSATFLQRNLPRGTTQGQ 86 D D F N +H +GIYGP+DYNAW+ KI+C+RL G+D +A S F+ + L R +T Q Sbjct 25 DPDTDFANELHTYGIYGPRDYNAWIGKIACKRLHNGLDRNANDSVNFVTKQLDRDSTTAQ 84 Query 87 AFQFLGAAIDHYCPEHVGVLQRAG 110 ++QFLG AI++YCP+ V+++A Sbjct 85 SWQFLGTAINYYCPDQRFVMEQAA 108 >gi|120404169|ref|YP_953998.1| hypothetical protein Mvan_3190 [Mycobacterium vanbaalenii PYR-1] gi|119956987|gb|ABM13992.1| protein of unknown function DUF732 [Mycobacterium vanbaalenii PYR-1] Length=108 Score = 87.8 bits (216), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 35/78 (45%), Positives = 56/78 (72%), Gaps = 0/78 (0%) Query 32 FNNTIHRFGIYGPQDYNAWLAKISCERLSRGVDGDAYKSATFLQRNLPRGTTQGQAFQFL 91 F N ++ +GIYG +DYNAW+ KI+C+R++ GVD +A++SA F+ L +G T QA++FL Sbjct 29 FTNELNTYGIYGQKDYNAWIGKIACKRITTGVDANAFESADFVHNQLEKGATTEQAWKFL 88 Query 92 GAAIDHYCPEHVGVLQRA 109 A + YCP+ + +L +A Sbjct 89 AAGLRTYCPDKLPILDQA 106 >gi|118466065|ref|YP_882736.1| hypothetical protein MAV_3558 [Mycobacterium avium 104] gi|118167352|gb|ABK68249.1| conserved hypothetical protein [Mycobacterium avium 104] Length=122 Score = 39.3 bits (90), Expect = 0.20, Method: Compositional matrix adjust. Identities = 32/107 (30%), Positives = 45/107 (43%), Gaps = 6/107 (5%) Query 5 KLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGI-YGPQDYNAWLAKISCERLSRGV 63 KL ++V A + LA PA+AD D F ++ + GI YG D A K C L G Sbjct 10 KLFLIVAGFAAVIGLAVPARADSTDDAFVASLDKAGIKYGDADKAAGAGKWVCTTLQGGK 69 Query 64 D-GDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRA 109 D + NL + A F A++ YCP+ + A Sbjct 70 QMSDVVSTLQSKNSNL----SDDHANTFAAIAVNAYCPDQASSITPA 112 >gi|41408884|ref|NP_961720.1| hypothetical protein MAP2786c [Mycobacterium avium subsp. paratuberculosis K-10] gi|254776000|ref|ZP_05217516.1| hypothetical protein MaviaA2_15210 [Mycobacterium avium subsp. avium ATCC 25291] gi|41397243|gb|AAS05103.1| hypothetical protein MAP_2786c [Mycobacterium avium subsp. paratuberculosis K-10] gi|336458835|gb|EGO37792.1| Protein of unknown function (DUF732) [Mycobacterium avium subsp. paratuberculosis S397] Length=114 Score = 38.9 bits (89), Expect = 0.26, Method: Compositional matrix adjust. Identities = 31/100 (31%), Positives = 43/100 (43%), Gaps = 6/100 (6%) Query 5 KLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGI-YGPQDYNAWLAKISCERLSRGV 63 KL ++V A + LA PA+AD D F ++ + GI YG D A K C L G Sbjct 2 KLFLIVAGFAAVIGLAVPARADSTDDAFVASLDKAGIKYGDADKAAGAGKWVCTTLQGGK 61 Query 64 D-GDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEH 102 D + NL + A F A++ YCP+ Sbjct 62 QMSDVVSTLQSKNSNL----SDDHANTFAAIAVNAYCPDQ 97 >gi|342860702|ref|ZP_08717352.1| hypothetical protein MCOL_17563 [Mycobacterium colombiense CECT 3035] gi|342131726|gb|EGT84983.1| hypothetical protein MCOL_17563 [Mycobacterium colombiense CECT 3035] Length=114 Score = 38.1 bits (87), Expect = 0.42, Method: Compositional matrix adjust. Identities = 34/97 (36%), Positives = 45/97 (47%), Gaps = 17/97 (17%) Query 22 PAQADDYDAPFNNTIHRFGIYGPQDYNAWL---AKISCERLSRGVDGDAYKSATFLQRNL 78 PA+ADD D F + F + GP+ AK C L RG D D T Q+ Sbjct 22 PARADDTDDNFWAIVQTFEM-GPKVTKGVALSQAKTFCGDLRRG-DSDE----TPAQQTA 75 Query 79 PRG--------TTQGQAFQFLGAAIDHYCPEHVGVLQ 107 G TT+ +A +FLG AI+ YCPE + +Q Sbjct 76 DEGAWLANWANTTEYKAGEFLGVAINTYCPEQMSAVQ 112 >gi|118462859|ref|YP_883960.1| hypothetical protein MAV_4836 [Mycobacterium avium 104] gi|118164146|gb|ABK65043.1| conserved hypothetical protein [Mycobacterium avium 104] Length=112 Score = 37.7 bits (86), Expect = 0.49, Method: Compositional matrix adjust. Identities = 36/102 (36%), Positives = 49/102 (49%), Gaps = 10/102 (9%) Query 7 AVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKIS---CERLSRGV 63 A++ G +AAV A PAQAD A F N +HR GI+ +A L + C+++S G Sbjct 14 ALLGGPLLAAVVWAGPAQAD--AAGFLNDMHRDGIHAVTGGDAALLQAGLNICQQISWGA 71 Query 64 DGDAYKSATFLQRN----LPRGTTQGQAFQFLGAAIDHYCPE 101 A LQR+ P G T QA +G A+ CP Sbjct 72 P-PAQLEGLALQRSDDRQGPGGLTPQQADDLVGYAMRDLCPS 112 >gi|254777273|ref|ZP_05218789.1| hypothetical protein MaviaA2_21751 [Mycobacterium avium subsp. avium ATCC 25291] gi|336460227|gb|EGO39130.1| Protein of unknown function (DUF732) [Mycobacterium avium subsp. paratuberculosis S397] Length=103 Score = 37.7 bits (86), Expect = 0.50, Method: Compositional matrix adjust. Identities = 35/103 (34%), Positives = 51/103 (50%), Gaps = 12/103 (11%) Query 7 AVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKIS---CERLSRG- 62 A++ G +AAV A PAQAD A F N +HR GI+ +A L + C+++S G Sbjct 5 ALLGGPLLAAVVWAGPAQAD--AAGFLNDMHRDGIHAVTGGDAALLQAGLNICQQISWGA 62 Query 63 ----VDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPE 101 ++G A + + R P G T QA +G A+ CP Sbjct 63 PPAQLEGLALQRSD--DRQGPGGLTPQQADDLVGYAMRDLCPS 103 >gi|296169915|ref|ZP_06851526.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295895427|gb|EFG75130.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=135 Score = 37.7 bits (86), Expect = 0.55, Method: Compositional matrix adjust. Identities = 29/108 (27%), Positives = 53/108 (50%), Gaps = 4/108 (3%) Query 3 GTKLAVVVGMTVAAVSL-AAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLSR 61 G AV + +AAVS +A A AD D F ++ +GI + ++ +A R+ + Sbjct 20 GKWPAVCAVLLLAAVSFWSATASADPADDAFIASLANYGI---EVRDSGIAIAMGHRVCK 76 Query 62 GVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVLQRA 109 G D +A S ++ + +A F+G ++ YCP++ G + R+ Sbjct 77 GFDNNANASVLAMKVKRDTDLSMKEASYFVGLSVAAYCPQYRGQIDRS 124 >gi|296164252|ref|ZP_06846842.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295900391|gb|EFG79807.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=105 Score = 37.4 bits (85), Expect = 0.78, Method: Compositional matrix adjust. Identities = 29/98 (30%), Positives = 44/98 (45%), Gaps = 5/98 (5%) Query 6 LAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGI-YGPQDYNAWLAKISCERLSRGVD 64 LA V + AA+ A PA AD D F + + GI + +D A + R+ G+D Sbjct 7 LATSVILLAAALQFATPASADQTDDDFLAALKKHGIVFANRD----AAIATGHRVCTGLD 62 Query 65 GDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEH 102 D + L + A F GAA++ YCP++ Sbjct 63 ADQTPANLVLSLVKDTDLSAHVAGYFFGAAVNSYCPQY 100 >gi|41408599|ref|NP_961435.1| hypothetical protein MAP2501 [Mycobacterium avium subsp. paratuberculosis K-10] gi|41396957|gb|AAS04818.1| hypothetical protein MAP_2501 [Mycobacterium avium subsp. paratuberculosis K-10] Length=114 Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust. Identities = 34/105 (33%), Positives = 40/105 (39%), Gaps = 8/105 (7%) Query 4 TKLAVVVGMTVAAVSLAAPAQADDY---DAPFNNTIHRFGIYGPQDYNAWL-AKISCERL 59 T L + +AAV LA PA A D F + G+ P A C L Sbjct 11 TVLTTAAAIGLAAVGLAGPAAARTVQTGDEMFLAQMRSLGVSFPSPQEAVREGHQVCAEL 70 Query 60 SRGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVG 104 S G A A F Q NL T QA + AA YCP+ G Sbjct 71 SAGKTPTAVTVAVFRQTNL----TPPQAAGLVTAATQAYCPQFSG 111 >gi|336458546|gb|EGO37514.1| Protein of unknown function (DUF732) [Mycobacterium avium subsp. paratuberculosis S397] Length=110 Score = 35.8 bits (81), Expect = 2.3, Method: Compositional matrix adjust. Identities = 34/105 (33%), Positives = 40/105 (39%), Gaps = 8/105 (7%) Query 4 TKLAVVVGMTVAAVSLAAPAQADDY---DAPFNNTIHRFGIYGPQDYNAWL-AKISCERL 59 T L + +AAV LA PA A D F + G+ P A C L Sbjct 8 TVLTTAAAIGLAAVGLAGPAAARTVQTGDEMFLAQMRSLGVSFPSPQEAVREGHQVCAEL 67 Query 60 SRGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVG 104 S G A A F Q NL T QA + AA YCP+ G Sbjct 68 SAGKTPTAVTVAVFRQTNL----TPPQAAGLVTAATQAYCPQFSG 108 >gi|296164250|ref|ZP_06846841.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295900406|gb|EFG79821.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=96 Score = 35.4 bits (80), Expect = 2.7, Method: Compositional matrix adjust. Identities = 26/94 (28%), Positives = 41/94 (44%), Gaps = 3/94 (3%) Query 9 VVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLSRGVDGDAY 68 ++ + AA+ A PA AD D F + + GI N A + R+ G+D D Sbjct 1 MILLCAAALQFATPASADQTDDDFLAALKKHGIVFT---NRDAAIATGHRVCTGLDADQT 57 Query 69 KSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEH 102 + L + A F GAA++ YCP++ Sbjct 58 PANLILSLVKDTDLSAHVAGYFFGAAVNSYCPQY 91 >gi|315443073|ref|YP_004075952.1| hypothetical protein Mspyr1_14440 [Mycobacterium sp. Spyr1] gi|315261376|gb|ADT98117.1| Protein of unknown function (DUF732) [Mycobacterium sp. Spyr1] Length=116 Score = 35.4 bits (80), Expect = 3.1, Method: Compositional matrix adjust. Identities = 30/96 (32%), Positives = 43/96 (45%), Gaps = 7/96 (7%) Query 17 VSLAAPA-QADDYDAPFNNTIHRFGIY--GPQDYNAW---LAKISCERLSRGVDGDAYKS 70 +S+A PA A D DA F + G+ PQ+ + I E + G D A Sbjct 20 MSVAVPAAHATDADAQFLAVVAELGLRFDTPQEATEAGDNVCDIVAEGTANGFD-PARIR 78 Query 71 ATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVL 106 A + L G Q QA + + AA+ YCP + GV+ Sbjct 79 AAIVDSLLTEGVNQDQAVRLMVAAVGAYCPTYNGVV 114 >gi|254774297|ref|ZP_05215813.1| hypothetical protein MaviaA2_06470 [Mycobacterium avium subsp. avium ATCC 25291] Length=111 Score = 35.4 bits (80), Expect = 3.1, Method: Compositional matrix adjust. Identities = 34/105 (33%), Positives = 40/105 (39%), Gaps = 8/105 (7%) Query 4 TKLAVVVGMTVAAVSLAAPAQADDY---DAPFNNTIHRFGIYGPQDYNAWL-AKISCERL 59 T L + +AAV LA PA A D F + G+ P A C L Sbjct 8 TVLTTAAAIGLAAVGLAGPAAARAVQTGDEMFLAQMRSLGVSFPSPQEAVREGHQVCAEL 67 Query 60 SRGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVG 104 S G A A F Q NL T QA + AA YCP+ G Sbjct 68 SAGKTPTAVTVAVFRQTNL----TPPQAAGLVTAATQAYCPQFSG 108 >gi|118466879|ref|YP_882536.1| hypothetical protein MAV_3354 [Mycobacterium avium 104] gi|118168166|gb|ABK69063.1| conserved hypothetical protein [Mycobacterium avium 104] Length=117 Score = 34.7 bits (78), Expect = 5.4, Method: Compositional matrix adjust. Identities = 32/97 (33%), Positives = 42/97 (44%), Gaps = 4/97 (4%) Query 8 VVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLA-KISCERLSRGVDGD 66 +V + AV A ADD + F +H+ GI A A K C L+RGV G Sbjct 14 IVTAVPAHAVPGEDEAAADDNNEVFIADLHKVGISFQDPGQAVSAGKAVCGLLARGVSG- 72 Query 67 AYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHV 103 + L+ N P TT G A QF + YCP + Sbjct 73 -LQLLNDLRDNNPALTTNGAA-QFATISAKSYCPRQL 107 >gi|145222608|ref|YP_001133286.1| hypothetical protein Mflv_2020 [Mycobacterium gilvum PYR-GCK] gi|145215094|gb|ABP44498.1| protein of unknown function DUF732 [Mycobacterium gilvum PYR-GCK] Length=116 Score = 34.3 bits (77), Expect = 5.6, Method: Compositional matrix adjust. Identities = 30/96 (32%), Positives = 43/96 (45%), Gaps = 7/96 (7%) Query 17 VSLAAPA-QADDYDAPFNNTIHRFGIY--GPQDYNAW---LAKISCERLSRGVDGDAYKS 70 +S+A PA A D DA F + G+ PQ+ + I E + G D A Sbjct 20 MSVAVPAAHATDADAQFLAVVAELGLRFDTPQEATEAGDNVCDIVAEGTANGFD-PARIR 78 Query 71 ATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHVGVL 106 A + L G Q QA + + AA+ YCP + GV+ Sbjct 79 AAIVDSLLTDGVNQDQAVRLMVAAVGAYCPTYNGVV 114 >gi|333988948|ref|YP_004521562.1| hypothetical protein JDM601_0308 [Mycobacterium sp. JDM601] gi|333484916|gb|AEF34308.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=100 Score = 34.3 bits (77), Expect = 6.2, Method: Compositional matrix adjust. Identities = 30/102 (30%), Positives = 44/102 (44%), Gaps = 4/102 (3%) Query 1 MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLA-KISCERL 59 M+ K V V + LAAPA A+ DA F + + GI P +A + C + Sbjct 1 MRARKPTVAVFAMAGLIGLAAPAAAEPGDAGFLGALDQMGISYPNPADAVAGGQAVCRYI 60 Query 60 SRGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPE 101 + +G + A +N T +A QF+G A YC E Sbjct 61 A---EGHSVNQAAKGVKNANPSLTLTKASQFVGIARATYCAE 99 >gi|240169738|ref|ZP_04748397.1| hypothetical protein MkanA1_10522 [Mycobacterium kansasii ATCC 12478] Length=104 Score = 33.9 bits (76), Expect = 8.3, Method: Compositional matrix adjust. Identities = 33/93 (36%), Positives = 44/93 (48%), Gaps = 7/93 (7%) Query 14 VAAVSLAAPAQADD-YDAPFNNTIHRFGIYGPQDYNAWLA--KISCERLSRGVDGDAYKS 70 V + LAAPA+ADD D F + GI Q+ A +A K C+ G A + Sbjct 6 VTVIGLAAPARADDKQDQAFLVALGAAGITY-QNAGATIATGKKVCDMAKEG--KSAIEV 62 Query 71 ATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHV 103 T+LQ P+ TQ A +F A YCPE + Sbjct 63 VTYLQGLSPQ-LTQPNAARFTAIAAGVYCPEQL 94 >gi|41407621|ref|NP_960457.1| hypothetical protein MAP1523 [Mycobacterium avium subsp. paratuberculosis K-10] gi|41395974|gb|AAS03840.1| hypothetical protein MAP_1523 [Mycobacterium avium subsp. paratuberculosis K-10] gi|336457573|gb|EGO36578.1| Protein of unknown function (DUF732) [Mycobacterium avium subsp. paratuberculosis S397] Length=106 Score = 33.9 bits (76), Expect = 8.6, Method: Compositional matrix adjust. Identities = 31/102 (31%), Positives = 48/102 (48%), Gaps = 6/102 (5%) Query 5 KLAVVVGMTVAAVSLAAPAQADDY--DAPFNNTIHRFGI-YGPQDYNAWLAKISCERLSR 61 ++ ++G+ AA+ APA+A+ DA F ++ + GI Y AK C + R Sbjct 3 RVLALLGIGAAALGYTAPARAEPQGDDAAFLASLDQSGITYSSPVQVIASAKAVCGLMGR 62 Query 62 GVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCPEHV 103 G G + T ++ P G T A QF A + YCP H+ Sbjct 63 GETG--LQVVTDIKDQNP-GMTMDGAAQFAALASNAYCPHHL 101 Lambda K H 0.321 0.135 0.414 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 128047486336 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40