BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3572 Length=176 Score E Sequences producing significant alignments: (Bits) Value gi|15610708|ref|NP_218089.1| hypothetical protein Rv3572 [Mycoba... 363 6e-99 gi|308232503|ref|ZP_07416263.2| hypothetical protein TMAG_00062 ... 337 5e-91 gi|183985035|ref|YP_001853326.1| hypothetical protein MMAR_5067 ... 322 2e-86 gi|118619322|ref|YP_907654.1| hypothetical protein MUL_4143 [Myc... 315 2e-84 gi|240169796|ref|ZP_04748455.1| hypothetical protein MkanA1_1081... 299 1e-79 gi|296166621|ref|ZP_06849049.1| conserved hypothetical protein [... 253 1e-65 gi|254822728|ref|ZP_05227729.1| hypothetical protein MintA_22554... 247 6e-64 gi|118467177|ref|YP_879869.1| hypothetical protein MAV_0589 [Myc... 240 7e-62 gi|41406593|ref|NP_959429.1| hypothetical protein MAP0495c [Myco... 239 2e-61 gi|254773546|ref|ZP_05215062.1| hypothetical protein MaviaA2_025... 238 3e-61 gi|342862404|ref|ZP_08719044.1| hypothetical protein MCOL_26053 ... 236 7e-61 gi|336458489|gb|EGO37462.1| hypothetical protein MAPs_12770 [Myc... 236 8e-61 gi|333992496|ref|YP_004525110.1| hypothetical protein JDM601_385... 189 1e-46 gi|169627685|ref|YP_001701334.1| hypothetical protein MAB_0582c ... 123 8e-27 gi|120406258|ref|YP_956087.1| hypothetical protein Mvan_5310 [My... 119 1e-25 gi|145222053|ref|YP_001132731.1| hypothetical protein Mflv_1461 ... 111 5e-23 gi|126437660|ref|YP_001073351.1| hypothetical protein Mjls_5096 ... 108 3e-22 gi|108801674|ref|YP_641871.1| hypothetical protein Mmcs_4711 [My... 108 3e-22 gi|118469878|ref|YP_890264.1| hypothetical protein MSMEG_6040 [M... 90.5 9e-17 gi|164660500|ref|XP_001731373.1| hypothetical protein MGL_1556 [... 35.4 3.7 gi|323371174|gb|ADX59436.1| beta-galactosidase [Coffea arabica] 34.3 7.3 gi|121702677|ref|XP_001269603.1| rerric reductase like transmemb... 34.3 7.6 >gi|15610708|ref|NP_218089.1| hypothetical protein Rv3572 [Mycobacterium tuberculosis H37Rv] gi|15843185|ref|NP_338222.1| hypothetical protein MT3677 [Mycobacterium tuberculosis CDC1551] gi|31794749|ref|NP_857242.1| hypothetical protein Mb3603 [Mycobacterium bovis AF2122/97] 72 more sequence titlesLength=176 Score = 363 bits (931), Expect = 6e-99, Method: Compositional matrix adjust. Identities = 176/176 (100%), Positives = 176/176 (100%), Gaps = 0/176 (0%) Query 1 MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDCHPNGSCDFVA 60 MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDCHPNGSCDFVA Sbjct 1 MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDCHPNGSCDFVA 60 Query 61 GANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYFG 120 GANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYFG Sbjct 61 GANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYFG 120 Query 121 EGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 EGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS Sbjct 121 EGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 >gi|308232503|ref|ZP_07416263.2| hypothetical protein TMAG_00062 [Mycobacterium tuberculosis SUMu001] gi|308378373|ref|ZP_07482365.2| hypothetical protein TMIG_03195 [Mycobacterium tuberculosis SUMu009] gi|308380736|ref|ZP_07490923.2| hypothetical protein TMKG_00810 [Mycobacterium tuberculosis SUMu011] gi|308213798|gb|EFO73197.1| hypothetical protein TMAG_00062 [Mycobacterium tuberculosis SUMu001] gi|308352796|gb|EFP41647.1| hypothetical protein TMIG_03195 [Mycobacterium tuberculosis SUMu009] gi|308360601|gb|EFP49452.1| hypothetical protein TMKG_00810 [Mycobacterium tuberculosis SUMu011] Length=163 Score = 337 bits (863), Expect = 5e-91, Method: Compositional matrix adjust. Identities = 163/163 (100%), Positives = 163/163 (100%), Gaps = 0/163 (0%) Query 14 MLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDCHPNGSCDFVAGANLRTPDGPTGF 73 MLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDCHPNGSCDFVAGANLRTPDGPTGF Sbjct 1 MLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDCHPNGSCDFVAGANLRTPDGPTGF 60 Query 74 PPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYFGEGPPDKYQTTGVI 133 PPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYFGEGPPDKYQTTGVI Sbjct 61 PPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYFGEGPPDKYQTTGVI 120 Query 134 DSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 DSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS Sbjct 121 DSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 163 >gi|183985035|ref|YP_001853326.1| hypothetical protein MMAR_5067 [Mycobacterium marinum M] gi|183178361|gb|ACC43471.1| conserved hypothetical secreted protein [Mycobacterium marinum M] Length=176 Score = 322 bits (824), Expect = 2e-86, Method: Compositional matrix adjust. Identities = 152/176 (87%), Positives = 161/176 (92%), Gaps = 0/176 (0%) Query 1 MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDCHPNGSCDFVA 60 M RL+ G + LMLT LP+PTSAA SNTATTLFPVDE TQLETH+F+DCHPNGSCDFVA Sbjct 1 MRRLMTGLAIPALMLTFLPSPTSAAASNTATTLFPVDETTQLETHSFVDCHPNGSCDFVA 60 Query 61 GANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYFG 120 GANLRTP+GPTGFPPGLWARQTTEIRS+NRLAYLDAHATSQFERVMK GSDVITTVYFG Sbjct 61 GANLRTPEGPTGFPPGLWARQTTEIRSSNRLAYLDAHATSQFERVMKQAGSDVITTVYFG 120 Query 121 EGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 EGPPDKYQTTGVIDS +W TGQPMT+VNV VCTHMQVVYPGVNLTSPSTCAQ NFS Sbjct 121 EGPPDKYQTTGVIDSASWETGQPMTNVNVYVCTHMQVVYPGVNLTSPSTCAQTNFS 176 >gi|118619322|ref|YP_907654.1| hypothetical protein MUL_4143 [Mycobacterium ulcerans Agy99] gi|118571432|gb|ABL06183.1| conserved hypothetical secreted protein [Mycobacterium ulcerans Agy99] Length=176 Score = 315 bits (807), Expect = 2e-84, Method: Compositional matrix adjust. Identities = 150/176 (86%), Positives = 159/176 (91%), Gaps = 0/176 (0%) Query 1 MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDCHPNGSCDFVA 60 M RL+ G + LMLT LP+PTSAA SNTATTLFPVDE TQLETH+F+DCHPNGS DFVA Sbjct 1 MRRLMTGLAIPALMLTFLPSPTSAAASNTATTLFPVDETTQLETHSFVDCHPNGSRDFVA 60 Query 61 GANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYFG 120 GANLRTP+GPTGFPPGLWARQTTEIRS+NRLAYL AHATSQFERVMK GSDVITTVYFG Sbjct 61 GANLRTPEGPTGFPPGLWARQTTEIRSSNRLAYLGAHATSQFERVMKQAGSDVITTVYFG 120 Query 121 EGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 EGPPDKYQTTGVIDS +W TGQPMT+VNV VCTHMQVVYPGVNLTSPSTCAQ NFS Sbjct 121 EGPPDKYQTTGVIDSASWETGQPMTNVNVYVCTHMQVVYPGVNLTSPSTCAQTNFS 176 >gi|240169796|ref|ZP_04748455.1| hypothetical protein MkanA1_10812 [Mycobacterium kansasii ATCC 12478] Length=180 Score = 299 bits (766), Expect = 1e-79, Method: Compositional matrix adjust. Identities = 145/180 (81%), Positives = 157/180 (88%), Gaps = 4/180 (2%) Query 1 MTRLIPGCTLV----GLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDCHPNGSC 56 M RL+PG +V G++L + PAP S A SNTATTLFPVD+ TQLETHTF++CHPNGSC Sbjct 1 MKRLLPGAPIVAAITGVLLAVAPAPQSQAASNTATTLFPVDDTTQLETHTFVNCHPNGSC 60 Query 57 DFVAGANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITT 116 DFVAGA+LRTPDG TGFPP LWARQTTEIR TNRL YLDAHAT QFERVMKAGGSDVITT Sbjct 61 DFVAGADLRTPDGVTGFPPDLWARQTTEIRPTNRLTYLDAHATGQFERVMKAGGSDVITT 120 Query 117 VYFGEGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 VYFGEGPPDKYQTTGVIDST+WSTGQP T+ VIVCTH+QVVY GVNLTSPSTCAQ NFS Sbjct 121 VYFGEGPPDKYQTTGVIDSTSWSTGQPSTNAGVIVCTHIQVVYSGVNLTSPSTCAQTNFS 180 >gi|296166621|ref|ZP_06849049.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295898024|gb|EFG77602.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=177 Score = 253 bits (645), Expect = 1e-65, Method: Compositional matrix adjust. Identities = 130/177 (74%), Positives = 144/177 (82%), Gaps = 1/177 (0%) Query 1 MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDC-HPNGSCDFV 59 M RLI +VG LT PAP + A SNTATTLFP+D QLET T L+C +G CDF Sbjct 1 MKRLITVFAVVGSALTFAPAPAAGAASNTATTLFPLDGPNQLETRTVLNCFKSDGHCDFT 60 Query 60 AGANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYF 119 AGA++ TPDG TGFPPGLWARQTTEIRS+NRLAYLDAHATSQ+ERVMK+ GSD ITTVYF Sbjct 61 AGADMLTPDGVTGFPPGLWARQTTEIRSSNRLAYLDAHATSQYERVMKSMGSDEITTVYF 120 Query 120 GEGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 GEGPPDKYQT+G IDST+W TGQP TD NVIVCTH+QVVYPG+N TSPSTCAQ FS Sbjct 121 GEGPPDKYQTSGRIDSTDWQTGQPKTDNNVIVCTHIQVVYPGINTTSPSTCAQTTFS 177 >gi|254822728|ref|ZP_05227729.1| hypothetical protein MintA_22554 [Mycobacterium intracellulare ATCC 13950] Length=180 Score = 247 bits (630), Expect = 6e-64, Method: Compositional matrix adjust. Identities = 121/177 (69%), Positives = 137/177 (78%), Gaps = 1/177 (0%) Query 1 MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDC-HPNGSCDFV 59 M RL+ G +V + LLPA AA SNTATTLFP+D QLET L+C +G CDF Sbjct 4 MMRLVSGFAVVAALPMLLPAGVCAAASNTATTLFPLDGPNQLETRVILNCFKSDGHCDFT 63 Query 60 AGANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYF 119 AGA++ TPDG TGFPPGLWARQTTEIRS+NRLAYLDAHA Q+ERV K+ GSD ITT+Y Sbjct 64 AGADMLTPDGVTGFPPGLWARQTTEIRSSNRLAYLDAHADGQYERVHKSMGSDEITTIYM 123 Query 120 GEGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 GEGPP+KYQT G IDST+W TGQP TD NVI CTH+QVVY GVN+TSPSTCAQ FS Sbjct 124 GEGPPEKYQTNGRIDSTDWQTGQPKTDNNVIACTHIQVVYSGVNITSPSTCAQTTFS 180 >gi|118467177|ref|YP_879869.1| hypothetical protein MAV_0589 [Mycobacterium avium 104] gi|118168464|gb|ABK69361.1| conserved hypothetical protein [Mycobacterium avium 104] Length=177 Score = 240 bits (612), Expect = 7e-62, Method: Compositional matrix adjust. Identities = 117/177 (67%), Positives = 135/177 (77%), Gaps = 1/177 (0%) Query 1 MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDC-HPNGSCDFV 59 M RL+ G + + LPA AA SNTATTLFP+D QLET L+C +G CDF Sbjct 1 MMRLVSGFAVAAALPMFLPAGVCAAASNTATTLFPLDGPNQLETRALLNCFKSDGHCDFT 60 Query 60 AGANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYF 119 AGA++ TPDG TGFPPGLWARQT+EIRS+NRLAYLDAHA Q+ERV K+ GSD ITT+Y Sbjct 61 AGADMLTPDGVTGFPPGLWARQTSEIRSSNRLAYLDAHADGQYERVHKSMGSDEITTIYM 120 Query 120 GEGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 GEGPPDKY+T G IDST+W TGQP TD NVI+CTH+QVVY GVN+TSPSTCA FS Sbjct 121 GEGPPDKYETHGRIDSTDWQTGQPKTDNNVILCTHIQVVYSGVNITSPSTCAVTTFS 177 >gi|41406593|ref|NP_959429.1| hypothetical protein MAP0495c [Mycobacterium avium subsp. paratuberculosis K-10] gi|41394942|gb|AAS02812.1| hypothetical protein MAP_0495c [Mycobacterium avium subsp. paratuberculosis K-10] Length=186 Score = 239 bits (609), Expect = 2e-61, Method: Compositional matrix adjust. Identities = 116/177 (66%), Positives = 134/177 (76%), Gaps = 1/177 (0%) Query 1 MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDC-HPNGSCDFV 59 M RL+ G + + LPA AA SNTATTLFP+D QLET L+C +G CDF Sbjct 10 MMRLVSGFAVAAALPMFLPAGVCAAASNTATTLFPLDGPNQLETRALLNCFKSDGHCDFT 69 Query 60 AGANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYF 119 AGA++ TPDG TGFPPGLWARQT+EIRS+NRLAYLDAHA Q+ERV K+ GSD ITT+Y Sbjct 70 AGADMLTPDGVTGFPPGLWARQTSEIRSSNRLAYLDAHADGQYERVHKSMGSDEITTIYM 129 Query 120 GEGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 GEGPPDKY+ G IDST+W TGQP TD NVI+CTH+QVVY GVN+TSPSTCA FS Sbjct 130 GEGPPDKYEAHGRIDSTDWQTGQPKTDNNVILCTHIQVVYSGVNITSPSTCAVTTFS 186 >gi|254773546|ref|ZP_05215062.1| hypothetical protein MaviaA2_02565 [Mycobacterium avium subsp. avium ATCC 25291] Length=176 Score = 238 bits (606), Expect = 3e-61, Method: Compositional matrix adjust. Identities = 116/175 (67%), Positives = 134/175 (77%), Gaps = 1/175 (0%) Query 3 RLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDC-HPNGSCDFVAG 61 RL+ G + + LPA AA SNTATTLFP+D QLET L+C +G CDF AG Sbjct 2 RLVSGFAVAAALPMFLPAGVCAAASNTATTLFPLDGPNQLETRALLNCFKSDGHCDFTAG 61 Query 62 ANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYFGE 121 A++ TPDG TGFPPGLWARQT+EIRS+NRLAYLDAHA Q+ERV K+ GSD ITT+Y GE Sbjct 62 ADMLTPDGVTGFPPGLWARQTSEIRSSNRLAYLDAHADGQYERVHKSMGSDEITTIYMGE 121 Query 122 GPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 GPPDKY+T G IDST+W TGQP TD NVI+CTH+QVVY GVN+TSPSTCA FS Sbjct 122 GPPDKYETHGRIDSTDWQTGQPKTDNNVILCTHIQVVYSGVNITSPSTCAVTTFS 176 >gi|342862404|ref|ZP_08719044.1| hypothetical protein MCOL_26053 [Mycobacterium colombiense CECT 3035] gi|342130075|gb|EGT83409.1| hypothetical protein MCOL_26053 [Mycobacterium colombiense CECT 3035] Length=176 Score = 236 bits (603), Expect = 7e-61, Method: Compositional matrix adjust. Identities = 117/175 (67%), Positives = 134/175 (77%), Gaps = 1/175 (0%) Query 3 RLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDC-HPNGSCDFVAG 61 RL+ G V ++ LPA AA SNTATTLFP+D QLET L+C +G CDF AG Sbjct 2 RLVWGFAGVAVLPMFLPAAVCAASSNTATTLFPLDGPNQLETRVVLNCFKADGHCDFTAG 61 Query 62 ANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYFGE 121 A++ TP G TGFPPGLWARQTTEIRS+NRLAYLDAHA Q+ERVMK+ GSD ITT+Y GE Sbjct 62 ADMLTPGGVTGFPPGLWARQTTEIRSSNRLAYLDAHADGQYERVMKSMGSDEITTIYMGE 121 Query 122 GPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 GPP+KYQT G IDST+W TGQP TD NVI C+H+QVVY GVN+TSPSTCA FS Sbjct 122 GPPEKYQTNGRIDSTDWQTGQPKTDNNVIACSHIQVVYSGVNITSPSTCAVTTFS 176 >gi|336458489|gb|EGO37462.1| hypothetical protein MAPs_12770 [Mycobacterium avium subsp. paratuberculosis S397] Length=177 Score = 236 bits (603), Expect = 8e-61, Method: Compositional matrix adjust. Identities = 115/177 (65%), Positives = 134/177 (76%), Gaps = 1/177 (0%) Query 1 MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDC-HPNGSCDFV 59 M RL+ G + + LPA AA SNTATTLFP+D QLET L+C +G CDF Sbjct 1 MMRLVSGFAVAAALPMFLPAGVCAAASNTATTLFPLDGPNQLETRALLNCFKSDGHCDFT 60 Query 60 AGANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYF 119 AGA++ TPDG TGFPPGLWARQT+EIRS+NRLAYLDAHA Q+E+V K+ GSD ITT+Y Sbjct 61 AGADMLTPDGVTGFPPGLWARQTSEIRSSNRLAYLDAHADGQYEQVHKSMGSDEITTIYM 120 Query 120 GEGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 GEGPPDKY+ G IDST+W TGQP TD NVI+CTH+QVVY GVN+TSPSTCA FS Sbjct 121 GEGPPDKYEAHGRIDSTDWQTGQPKTDNNVILCTHIQVVYSGVNITSPSTCAVTTFS 177 >gi|333992496|ref|YP_004525110.1| hypothetical protein JDM601_3857 [Mycobacterium sp. JDM601] gi|333488465|gb|AEF37857.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=185 Score = 189 bits (480), Expect = 1e-46, Method: Compositional matrix adjust. Identities = 97/185 (53%), Positives = 121/185 (66%), Gaps = 10/185 (5%) Query 1 MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDCHPNGSCDFVA 60 M RL+ V L+ L+P SAA S+T T+L PV+E QL+TH+++DC C F Sbjct 1 MKRLLMVPAAVTLLAALIPPAISAAASDTKTSLMPVNEFDQLQTHSWVDCAAPPLCRFTV 60 Query 61 GANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAH---ATSQFE-------RVMKAGG 110 G L+TP+G GFPP LWARQ+TEIRS+ R AYLD H F+ +V K G Sbjct 61 GVRLQTPEGLAGFPPDLWARQSTEIRSSKRTAYLDVHTDGGEGWFKDRGGPGTKVFKDGT 120 Query 111 SDVITTVYFGEGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTC 170 VI ++Y+GEGPP+KYQT G ID +STG+P TD NVIVCTH+QV Y GVNLT+PSTC Sbjct 121 GAVIQSMYYGEGPPEKYQTNGSIDVLEYSTGRPKTDANVIVCTHVQVTYSGVNLTTPSTC 180 Query 171 AQANF 175 AQ F Sbjct 181 AQTVF 185 >gi|169627685|ref|YP_001701334.1| hypothetical protein MAB_0582c [Mycobacterium abscessus ATCC 19977] gi|169239652|emb|CAM60680.1| Conserved hypothetical protein [Mycobacterium abscessus] Length=181 Score = 123 bits (309), Expect = 8e-27, Method: Compositional matrix adjust. Identities = 65/151 (44%), Positives = 85/151 (57%), Gaps = 2/151 (1%) Query 28 NTATTLFPVDEVTQLETHTFLDCHP-NGSCDFVAGANLRTPDGPTGFPPGLWARQTTEIR 86 TA PVD+ QLE H DC C + A NLR P+G GFP LWARQTTE+R Sbjct 31 KTADVSTPVDDARQLEVHATADCRKAERQCYYTATVNLRGPNGIEGFPGDLWARQTTELR 90 Query 87 STNRLAYLDAHATSQFERVMKAGGSD-VITTVYFGEGPPDKYQTTGVIDSTNWSTGQPMT 145 +++RL YL V GGS +TTVY G G P+++Q +G T+W+TGQP Sbjct 91 TSDRLNYLWVQWADNPNTVEHNGGSTWTLTTVYIGGGTPERFQISGTTQPTDWATGQPKL 150 Query 146 DVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 D + IVC+ ++ + G + SP CA A FS Sbjct 151 DADYIVCSRIEANFDGRYVISPDACAVARFS 181 >gi|120406258|ref|YP_956087.1| hypothetical protein Mvan_5310 [Mycobacterium vanbaalenii PYR-1] gi|119959076|gb|ABM16081.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1] Length=205 Score = 119 bits (299), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 67/172 (39%), Positives = 94/172 (55%), Gaps = 16/172 (9%) Query 20 APTSAAGSNTATTLFPVDEVTQLETHTFLDCHP-NGSCDFVAGANLRTPDGPTGFPPGLW 78 +P + A SNTAT+ VD TQ++ H +C P C F ANL TP+GPTGFP W Sbjct 34 SPPATAASNTATSSIAVDPATQIQMHVTANCDPAQNRCFFNTSANLLTPNGPTGFPGDTW 93 Query 79 ARQTTEIRSTNRLAYLDA---------------HATSQFERVMKAGGSDVITTVYFGEGP 123 ARQT +RS ++ + +A + + R++K+ ++ YFG GP Sbjct 94 ARQTVTLRSNSQDVWQEAWYSAPAGTPRELKGANHNNVLSRMLKSLRDVEVSVTYFGGGP 153 Query 124 PDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANF 175 +++ T G T W+TG P + IVC+ +QVVY GVNLT+PS CAQ F Sbjct 154 IERFTTDGDSVPTEWTTGLPAKGSDFIVCSQIQVVYGGVNLTTPSACAQTTF 205 >gi|145222053|ref|YP_001132731.1| hypothetical protein Mflv_1461 [Mycobacterium gilvum PYR-GCK] gi|315446205|ref|YP_004079084.1| hypothetical protein Mspyr1_47060 [Mycobacterium sp. Spyr1] gi|145214539|gb|ABP43943.1| hypothetical protein Mflv_1461 [Mycobacterium gilvum PYR-GCK] gi|315264508|gb|ADU01250.1| hypothetical protein Mspyr1_47060 [Mycobacterium sp. Spyr1] Length=193 Score = 111 bits (277), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 65/182 (36%), Positives = 93/182 (52%), Gaps = 17/182 (9%) Query 10 LVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDCHP-NGSCDFVAGANLRTPD 68 L + + L AP + A S+ + PV ++E H DC P C + ANL TP Sbjct 12 LASMAMALTGAPGAGAASDAGVSSIPVGP-DRIEMHVSADCVPAQNRCFYRTSANLLTPG 70 Query 69 GPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQ---------------FERVMKAGGSDV 113 GPTGFP WARQT +RS ++ + +A ++ R++K+ Sbjct 71 GPTGFPQDTWARQTITLRSDSQDVWQEAWYSAPAGVPRELKGANHNNVLSRMLKSLREVE 130 Query 114 ITTVYFGEGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQA 173 ++ YFG GP +++ T G T W+TG P IVC+ +QVVYPGVNLT+P+ CAQ Sbjct 131 VSVTYFGGGPIERFTTDGDSVPTAWTTGLPAKGSRFIVCSQIQVVYPGVNLTTPTACAQP 190 Query 174 NF 175 F Sbjct 191 IF 192 >gi|126437660|ref|YP_001073351.1| hypothetical protein Mjls_5096 [Mycobacterium sp. JLS] gi|126237460|gb|ABO00861.1| conserved hypothetical protein [Mycobacterium sp. JLS] Length=190 Score = 108 bits (271), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 63/159 (40%), Positives = 85/159 (54%), Gaps = 16/159 (10%) Query 32 TLFPVDEVTQLETHTFLDCHPNGSCDFVAGANLRTPDGPTGFPPGLWARQTTEIRSTNRL 91 T VD T++E H +C N C F ANL TP GPTGFP W+RQT +RS +R Sbjct 32 TSIAVDPATKIEMHVNANCVGN-RCMFDTTANLMTPGGPTGFPGDAWSRQTITLRSNDRN 90 Query 92 AYLDAHATSQF---ERVMKAGGSDVITTVY------------FGEGPPDKYQTTGVIDST 136 + +A ++ V A ++V++ +Y FG GP ++Y+ G T Sbjct 91 VWQEAEYSAPAGTPREVKGANHNNVLSKMYKSLRDVEISVSSFGGGPIERYRIDGASMPT 150 Query 137 NWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANF 175 W+TGQ T IVC+H+QVVY GVNLT+P CAQ F Sbjct 151 EWATGQLATRAAFIVCSHIQVVYGGVNLTTPDACAQTTF 189 >gi|108801674|ref|YP_641871.1| hypothetical protein Mmcs_4711 [Mycobacterium sp. MCS] gi|119870825|ref|YP_940777.1| hypothetical protein Mkms_4797 [Mycobacterium sp. KMS] gi|108772093|gb|ABG10815.1| conserved hypothetical protein [Mycobacterium sp. MCS] gi|119696914|gb|ABL93987.1| conserved hypothetical protein [Mycobacterium sp. KMS] Length=190 Score = 108 bits (271), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 63/159 (40%), Positives = 85/159 (54%), Gaps = 16/159 (10%) Query 32 TLFPVDEVTQLETHTFLDCHPNGSCDFVAGANLRTPDGPTGFPPGLWARQTTEIRSTNRL 91 T VD T++E H +C N C F ANL TP GPTGFP W+RQT +RS +R Sbjct 32 TSIAVDPATKIEMHVNANCVGN-RCMFDTTANLMTPGGPTGFPGDAWSRQTITLRSNDRN 90 Query 92 AYLDAHATSQF---ERVMKAGGSDVITTVY------------FGEGPPDKYQTTGVIDST 136 + +A ++ V A ++V++ +Y FG GP ++Y+ G T Sbjct 91 VWQEAEYSAPAGTPREVKGANHNNVLSKMYKSLRDVEISVTSFGGGPIERYRIDGASMPT 150 Query 137 NWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANF 175 W+TGQ T IVC+H+QVVY GVNLT+P CAQ F Sbjct 151 EWATGQLATRAAFIVCSHIQVVYGGVNLTTPDACAQTTF 189 >gi|118469878|ref|YP_890264.1| hypothetical protein MSMEG_6040 [Mycobacterium smegmatis str. MC2 155] gi|118171165|gb|ABK72061.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length=191 Score = 90.5 bits (223), Expect = 9e-17, Method: Compositional matrix adjust. Identities = 56/165 (34%), Positives = 76/165 (47%), Gaps = 16/165 (9%) Query 28 NTATTLFPVDEVTQLETHTFLDCH-PNGSCDFVAGANLRTPDGPTGFPPGLWARQTTEIR 86 ++ T P+D T +E H +C C F ANL PTGFP WARQT +R Sbjct 27 DSGHTSVPIDPATAVEMHVDANCTLATSRCTFKTWANLLAGGAPTGFPGDTWARQTITLR 86 Query 87 STNRLAYLDA--HATSQFERVMKAGGSD-------------VITTVYFGEGPPDKYQTTG 131 S++R + +A A + R K D I+ YFG GP +++ G Sbjct 87 SSDRAVWQEAFYSAPAGMPREDKGANHDNVLSALKKSLNTVEISVTYFGGGPIERFTVDG 146 Query 132 VIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSPSTCAQANFS 176 +W G P TD I C+ +QVV G N+T+P CAQ FS Sbjct 147 DSTPIDWYKGHPNTDATFIACSVIQVVRGGSNVTTPPACAQTKFS 191 >gi|164660500|ref|XP_001731373.1| hypothetical protein MGL_1556 [Malassezia globosa CBS 7966] gi|159105273|gb|EDP44159.1| hypothetical protein MGL_1556 [Malassezia globosa CBS 7966] Length=309 Score = 35.4 bits (80), Expect = 3.7, Method: Compositional matrix adjust. Identities = 31/89 (35%), Positives = 43/89 (49%), Gaps = 13/89 (14%) Query 41 QLETHTFLDCHPNGSCDFVAGANL----RTPDGPT-GFPPGLWARQTTEIRSTNRLAYLD 95 +L+ T L C +G CD+VA L PD PP L + T I + ++LA D Sbjct 59 RLKMATDLLCRASGVCDYVATQLLPSGTARPDSALPSLPPEL---RHTGILACSKLAMAD 115 Query 96 AHATSQFERVMKAGGSDVITTVYFGEGPP 124 AHA + + + A G+D IT GPP Sbjct 116 AHALAIRKLLASAVGTDQITP-----GPP 139 >gi|323371174|gb|ADX59436.1| beta-galactosidase [Coffea arabica] Length=338 Score = 34.3 bits (77), Expect = 7.3, Method: Compositional matrix adjust. Identities = 26/107 (25%), Positives = 45/107 (43%), Gaps = 12/107 (11%) Query 61 GANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATSQFERVMKAGGSDVITTVYFG 120 G P+ P P +W T +++ + + Y H + F R GS + T Y+ Sbjct 238 GETFVGPNSPN--KPAIWTDNWTSLKNGSFVNYYMYHGGTNFGRT----GSAFVLTSYYD 291 Query 121 EGPPDKYQTTGVIDSTNWSTGQPMTDVNVIVCTHMQVVYPGVNLTSP 167 E P D+Y G+I W + ++ ++ + Q + GV SP Sbjct 292 EAPIDEY---GLIRQPKWGH---LKQLHSVIKSCSQTLLHGVISVSP 332 >gi|121702677|ref|XP_001269603.1| rerric reductase like transmembrane component, putative [Aspergillus clavatus NRRL 1] gi|119397746|gb|EAW08177.1| rerric reductase like transmembrane component, putative [Aspergillus clavatus NRRL 1] Length=216 Score = 34.3 bits (77), Expect = 7.6, Method: Compositional matrix adjust. Identities = 13/25 (52%), Positives = 17/25 (68%), Gaps = 0/25 (0%) Query 70 PTGFPPGLWARQTTEIRSTNRLAYL 94 PTG PGLW R+ T ++S R+ YL Sbjct 150 PTGSSPGLWVRKATSVQSKRRVFYL 174 Lambda K H 0.317 0.132 0.411 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 149040830208 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40