BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3491 Length=192 Score E Sequences producing significant alignments: (Bits) Value gi|15610627|ref|NP_218008.1| hypothetical protein Rv3491 [Mycoba... 395 2e-108 gi|254234077|ref|ZP_04927402.1| hypothetical protein TBCG_03425 ... 386 7e-106 gi|240170045|ref|ZP_04748704.1| hypothetical protein MkanA1_1208... 327 4e-88 gi|296166525|ref|ZP_06848956.1| conserved hypothetical protein [... 320 5e-86 gi|342862230|ref|ZP_08718872.1| hypothetical protein MCOL_25191 ... 319 2e-85 gi|254821777|ref|ZP_05226778.1| hypothetical protein MintA_17727... 316 1e-84 gi|183984947|ref|YP_001853238.1| hypothetical protein MMAR_4979 ... 313 7e-84 gi|41406670|ref|NP_959506.1| hypothetical protein MAP0572c [Myco... 305 2e-81 gi|118467277|ref|YP_879944.1| hypothetical protein MAV_0665 [Myc... 304 4e-81 gi|336458377|gb|EGO37353.1| hypothetical protein MAPs_13560 [Myc... 302 1e-80 gi|289445088|ref|ZP_06434832.1| conserved hypothetical protein [... 209 2e-52 gi|169629539|ref|YP_001703188.1| hypothetical protein MAB_2453c ... 174 4e-42 gi|118469188|ref|YP_888628.1| hypothetical protein MSMEG_4353 [M... 99.8 2e-19 gi|333992262|ref|YP_004524876.1| hypothetical protein JDM601_362... 97.4 9e-19 gi|145223349|ref|YP_001134027.1| hypothetical protein Mflv_2762 ... 90.9 8e-17 gi|108800356|ref|YP_640553.1| hypothetical protein Mmcs_3390 [My... 84.3 7e-15 gi|120404731|ref|YP_954560.1| hypothetical protein Mvan_3772 [My... 79.7 2e-13 gi|288960916|ref|YP_003451255.1| major facilitator superfamily p... 41.6 0.066 gi|115676832|ref|XP_801712.2| PREDICTED: similar to SWI/SNF rela... 38.1 0.66 gi|108800646|ref|YP_640843.1| hypothetical protein Mmcs_3681 [My... 37.7 0.79 gi|120405115|ref|YP_954944.1| hypothetical protein Mvan_4161 [My... 36.2 2.4 gi|315443541|ref|YP_004076420.1| hypothetical protein Mspyr1_192... 35.8 3.3 gi|307710462|ref|ZP_07646899.1| endonuclease and methylase LlaGI... 35.8 3.4 >gi|15610627|ref|NP_218008.1| hypothetical protein Rv3491 [Mycobacterium tuberculosis H37Rv] gi|15843103|ref|NP_338140.1| hypothetical protein MT3595 [Mycobacterium tuberculosis CDC1551] gi|31794667|ref|NP_857160.1| hypothetical protein Mb3521 [Mycobacterium bovis AF2122/97] 76 more sequence titlesLength=192 Score = 395 bits (1015), Expect = 2e-108, Method: Compositional matrix adjust. Identities = 191/192 (99%), Positives = 192/192 (100%), Gaps = 0/192 (0%) Query 1 VNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ 60 +NIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ Sbjct 1 MNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ 60 Query 61 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQAD 120 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQAD Sbjct 61 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQAD 120 Query 121 GVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQR 180 GVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQR Sbjct 121 GVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQR 180 Query 181 YPLYCNNIAMCY 192 YPLYCNNIAMCY Sbjct 181 YPLYCNNIAMCY 192 >gi|254234077|ref|ZP_04927402.1| hypothetical protein TBCG_03425 [Mycobacterium tuberculosis C] gi|124599606|gb|EAY58710.1| hypothetical protein TBCG_03425 [Mycobacterium tuberculosis C] Length=192 Score = 386 bits (992), Expect = 7e-106, Method: Compositional matrix adjust. Identities = 187/190 (99%), Positives = 188/190 (99%), Gaps = 0/190 (0%) Query 3 IRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQPS 62 + GLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQPS Sbjct 3 LAAGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQPS 62 Query 63 GTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQADGV 122 GTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQADGV Sbjct 63 GTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQADGV 122 Query 123 ACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQRYP 182 ACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQRYP Sbjct 123 ACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQRYP 182 Query 183 LYCNNIAMCY 192 LYCNNIAMCY Sbjct 183 LYCNNIAMCY 192 >gi|240170045|ref|ZP_04748704.1| hypothetical protein MkanA1_12081 [Mycobacterium kansasii ATCC 12478] Length=192 Score = 327 bits (839), Expect = 4e-88, Method: Compositional matrix adjust. Identities = 149/192 (78%), Positives = 172/192 (90%), Gaps = 0/192 (0%) Query 1 VNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ 60 +NIR GLAAGA I SAVAL AL SG+PA A+ PPPDG+YSF++AGVSGVTWTITALCDQ Sbjct 1 MNIRHGLAAGAGIVSAVALATALESGEPAHAVAPPPDGAYSFSEAGVSGVTWTITALCDQ 60 Query 61 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQAD 120 PSGTRNMNDYSDP+ WAF CALN+ STTPQQITR D+LQNFS RAR++S+LWTFQVNQA+ Sbjct 61 PSGTRNMNDYSDPVTWAFQCALNIASTTPQQITRADKLQNFSARARLTSVLWTFQVNQAE 120 Query 121 GVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQR 180 GV CPDGSTAPS+ETYAFSDE ++GTHTT+HGAVCGL P + K+PFSLQL+GPPPSP++R Sbjct 121 GVLCPDGSTAPSTETYAFSDENMSGTHTTLHGAVCGLAPDMKKEPFSLQLVGPPPSPIER 180 Query 181 YPLYCNNIAMCY 192 YPL CN IA+CY Sbjct 181 YPLRCNQIAICY 192 >gi|296166525|ref|ZP_06848956.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295898137|gb|EFG77712.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=192 Score = 320 bits (821), Expect = 5e-86, Method: Compositional matrix adjust. Identities = 145/192 (76%), Positives = 174/192 (91%), Gaps = 0/192 (0%) Query 1 VNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ 60 + +R GLAAGA I SAVA+ IAL SGDPA A+GPPPDG+Y++N+AGVSG TWT+TALCDQ Sbjct 1 MKLRRGLAAGAGIFSAVAVAIALDSGDPAHAVGPPPDGNYTYNEAGVSGATWTVTALCDQ 60 Query 61 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQAD 120 PSGTRNMNDYSDPIV+A NCALN+VS+TP++IT D+LQNFSGRARMSSMLWTF+V ++D Sbjct 61 PSGTRNMNDYSDPIVFAMNCALNIVSSTPERITPADKLQNFSGRARMSSMLWTFKVEKSD 120 Query 121 GVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQR 180 GV+CP G TAPS++TYAFSDE ++GTHT++HGAVCGLQP++ KQPFSLQL GPPPSP++R Sbjct 121 GVSCPGGGTAPSTDTYAFSDENMSGTHTSLHGAVCGLQPEMKKQPFSLQLAGPPPSPIER 180 Query 181 YPLYCNNIAMCY 192 YPLYCN IAMCY Sbjct 181 YPLYCNGIAMCY 192 >gi|342862230|ref|ZP_08718872.1| hypothetical protein MCOL_25191 [Mycobacterium colombiense CECT 3035] gi|342130308|gb|EGT83628.1| hypothetical protein MCOL_25191 [Mycobacterium colombiense CECT 3035] Length=192 Score = 319 bits (817), Expect = 2e-85, Method: Compositional matrix adjust. Identities = 146/192 (77%), Positives = 172/192 (90%), Gaps = 0/192 (0%) Query 1 VNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ 60 + +R G+AAGA I +AVA+GIAL SG+PA+ALGPP DG+YSFN+AGVSGVTWT+TA+CDQ Sbjct 1 MKLRHGIAAGAGIFTAVAVGIALQSGEPAKALGPPADGNYSFNEAGVSGVTWTVTAICDQ 60 Query 61 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQAD 120 PSGTRNMNDYSDPIV+A CALNVVS T ++I+ D+LQNFSGRARMSSMLWTFQV + D Sbjct 61 PSGTRNMNDYSDPIVFAMQCALNVVSFTDEKISAADKLQNFSGRARMSSMLWTFQVTKDD 120 Query 121 GVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQR 180 GV+CP G TAPS+ET+AFSDET++GTHTT+HG VCGLQP++ K+PFSL LIGPPPSPVQR Sbjct 121 GVSCPGGGTAPSTETFAFSDETMSGTHTTIHGPVCGLQPEMKKEPFSLSLIGPPPSPVQR 180 Query 181 YPLYCNNIAMCY 192 YPLYCN IAMCY Sbjct 181 YPLYCNGIAMCY 192 >gi|254821777|ref|ZP_05226778.1| hypothetical protein MintA_17727 [Mycobacterium intracellulare ATCC 13950] Length=192 Score = 316 bits (809), Expect = 1e-84, Method: Compositional matrix adjust. Identities = 142/192 (74%), Positives = 172/192 (90%), Gaps = 0/192 (0%) Query 1 VNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ 60 + +R GLAAGA + +AVA GIAL SG+PA+ALGPP DG+Y++N+AGVSGVTWTI+ALCDQ Sbjct 1 MKLRHGLAAGAAVFTAVATGIALQSGEPAKALGPPADGTYAYNEAGVSGVTWTISALCDQ 60 Query 61 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQAD 120 PSGTRNMNDYSDPIV+A NCALN+VS+T +QI+ D+LQNFSGRARMSSMLWTF+V + + Sbjct 61 PSGTRNMNDYSDPIVFAMNCALNIVSSTNEQISAADKLQNFSGRARMSSMLWTFKVQKDN 120 Query 121 GVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQR 180 GVACP G TAPS+ET+AFSDET++GTHTT+HG VCGLQP++ KQPFSL + GPPPSP++R Sbjct 121 GVACPGGGTAPSTETFAFSDETMSGTHTTLHGPVCGLQPEMKKQPFSLAMTGPPPSPIER 180 Query 181 YPLYCNNIAMCY 192 YPLYCN IAMCY Sbjct 181 YPLYCNGIAMCY 192 >gi|183984947|ref|YP_001853238.1| hypothetical protein MMAR_4979 [Mycobacterium marinum M] gi|183178273|gb|ACC43383.1| conserved hypothetical secreted protein [Mycobacterium marinum M] Length=193 Score = 313 bits (802), Expect = 7e-84, Method: Compositional matrix adjust. Identities = 144/193 (75%), Positives = 168/193 (88%), Gaps = 1/193 (0%) Query 1 VNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ 60 + IR GLA GA I SAVALG+ L S PA ALGPPPDG+Y+ N+AGVSGVTWTITALCDQ Sbjct 1 MKIRHGLAVGAGIVSAVALGLTLESESPAHALGPPPDGAYTLNEAGVSGVTWTITALCDQ 60 Query 61 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQ-ITRTDRLQNFSGRARMSSMLWTFQVNQA 119 PSGTRNMNDYSDP+VWAFNCALNVVS TP ITR ++LQNFSGRAR+SS+LWTF V + Sbjct 61 PSGTRNMNDYSDPVVWAFNCALNVVSATPNAPITRAEKLQNFSGRARLSSVLWTFTVTKQ 120 Query 120 DGVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQ 179 DGV CP G TAPS++TYAFSDET++GTHTT+H AVCG+QP + K+PFSLQ++GPPPSP++ Sbjct 121 DGVVCPGGGTAPSTDTYAFSDETMSGTHTTLHDAVCGMQPDMKKEPFSLQMVGPPPSPIE 180 Query 180 RYPLYCNNIAMCY 192 RYPLYCN+IAMCY Sbjct 181 RYPLYCNDIAMCY 193 >gi|41406670|ref|NP_959506.1| hypothetical protein MAP0572c [Mycobacterium avium subsp. paratuberculosis K-10] gi|41395019|gb|AAS02889.1| hypothetical protein MAP_0572c [Mycobacterium avium subsp. paratuberculosis K-10] Length=192 Score = 305 bits (781), Expect = 2e-81, Method: Compositional matrix adjust. Identities = 142/192 (74%), Positives = 173/192 (91%), Gaps = 0/192 (0%) Query 1 VNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ 60 + +R GLAAGA + AV GIA+ SG PA+A+GPP DG+Y++N+AGVSGVTWTI+ALCDQ Sbjct 1 MKLRHGLAAGAAVVMAVGTGIAVQSGQPAKAVGPPADGTYNYNEAGVSGVTWTISALCDQ 60 Query 61 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQAD 120 PSGTRNMNDYSDPIV+AFNCALN+VS+TP+QI+ D+LQNFSGRARMSSMLWTF+V + + Sbjct 61 PSGTRNMNDYSDPIVFAFNCALNIVSSTPEQISAADKLQNFSGRARMSSMLWTFKVQKDN 120 Query 121 GVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQR 180 GV+CP G TAP++ETYAFSDET++GTHTT+HGAVCGLQP++ KQPFSLQ+ GPPPSP++R Sbjct 121 GVSCPGGGTAPTTETYAFSDETMSGTHTTLHGAVCGLQPEMKKQPFSLQMTGPPPSPIER 180 Query 181 YPLYCNNIAMCY 192 YPLYCN IAMCY Sbjct 181 YPLYCNGIAMCY 192 >gi|118467277|ref|YP_879944.1| hypothetical protein MAV_0665 [Mycobacterium avium 104] gi|254773621|ref|ZP_05215137.1| hypothetical protein MaviaA2_02950 [Mycobacterium avium subsp. avium ATCC 25291] gi|118168564|gb|ABK69461.1| conserved hypothetical protein [Mycobacterium avium 104] Length=192 Score = 304 bits (779), Expect = 4e-81, Method: Compositional matrix adjust. Identities = 141/192 (74%), Positives = 173/192 (91%), Gaps = 0/192 (0%) Query 1 VNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ 60 + +R GLAAGA + AV G+A+ SG PA+A+GPP DG+Y++N+AGVSGVTWTI+ALCDQ Sbjct 1 MKLRHGLAAGAAVVMAVGTGLAVQSGQPAKAVGPPADGTYNYNEAGVSGVTWTISALCDQ 60 Query 61 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQAD 120 PSGTRNMNDYSDPIV+AFNCALN+VS+TP+QI+ D+LQNFSGRARMSSMLWTF+V + + Sbjct 61 PSGTRNMNDYSDPIVFAFNCALNIVSSTPEQISAADKLQNFSGRARMSSMLWTFKVQKDN 120 Query 121 GVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQR 180 GV+CP G TAP++ETYAFSDET++GTHTT+HGAVCGLQP++ KQPFSLQ+ GPPPSP++R Sbjct 121 GVSCPGGGTAPTTETYAFSDETMSGTHTTLHGAVCGLQPEMKKQPFSLQMTGPPPSPIER 180 Query 181 YPLYCNNIAMCY 192 YPLYCN IAMCY Sbjct 181 YPLYCNGIAMCY 192 >gi|336458377|gb|EGO37353.1| hypothetical protein MAPs_13560 [Mycobacterium avium subsp. paratuberculosis S397] Length=192 Score = 302 bits (774), Expect = 1e-80, Method: Compositional matrix adjust. Identities = 141/187 (76%), Positives = 170/187 (91%), Gaps = 0/187 (0%) Query 6 GLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQPSGTR 65 GLAAGA + AV GIA+ SG PA+A+GPP DG+Y++N+AGVSGVTWTI+ALCDQPSGTR Sbjct 6 GLAAGAAVVMAVGTGIAVQSGQPAKAVGPPADGTYNYNEAGVSGVTWTISALCDQPSGTR 65 Query 66 NMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQADGVACP 125 NMNDYSDPIV+AFNCALN+VS+TP+QI+ D+LQNFSGRARMSSMLWTF+V + +GV+CP Sbjct 66 NMNDYSDPIVFAFNCALNIVSSTPEQISAADKLQNFSGRARMSSMLWTFKVQKDNGVSCP 125 Query 126 DGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQRYPLYC 185 G TAP++ETYAFSDET++GTHTT+HGAVCGLQP++ KQPFSLQ+ GPPPSP++RYPLYC Sbjct 126 GGGTAPTTETYAFSDETMSGTHTTLHGAVCGLQPEMKKQPFSLQMTGPPPSPIERYPLYC 185 Query 186 NNIAMCY 192 N IAMCY Sbjct 186 NGIAMCY 192 >gi|289445088|ref|ZP_06434832.1| conserved hypothetical protein [Mycobacterium tuberculosis T46] gi|289418007|gb|EFD15247.1| conserved hypothetical protein [Mycobacterium tuberculosis T46] Length=107 Score = 209 bits (531), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 101/102 (99%), Positives = 102/102 (100%), Gaps = 0/102 (0%) Query 1 VNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ 60 +NIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ Sbjct 1 MNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQ 60 Query 61 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFS 102 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFS Sbjct 61 PSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFS 102 >gi|169629539|ref|YP_001703188.1| hypothetical protein MAB_2453c [Mycobacterium abscessus ATCC 19977] gi|169241506|emb|CAM62534.1| Conserved hypothetical protein [Mycobacterium abscessus] Length=187 Score = 174 bits (442), Expect = 4e-42, Method: Compositional matrix adjust. Identities = 81/156 (52%), Positives = 104/156 (67%), Gaps = 0/156 (0%) Query 37 DGSYSFNQAGVSGVTWTITALCDQPSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTD 96 DG+Y + G VTWTI+A+C Q SGTR +DY+DP + A C L V S TP + +D Sbjct 32 DGTYRMIEDGAGEVTWTISAVCVQASGTRVQSDYTDPTIQADGCTLTVTSATPDIVKHSD 91 Query 97 RLQNFSGRARMSSMLWTFQVNQADGVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCG 156 RLQNFS AR+ WTF + G+ CPDGS AP+ E Y F+DETL GTHTT+H AVCG Sbjct 92 RLQNFSTPARLVDDQWTFLHARPQGMTCPDGSYAPTQEKYVFNDETLAGTHTTIHSAVCG 151 Query 157 LQPKLSKQPFSLQLIGPPPSPVQRYPLYCNNIAMCY 192 QP ++K+PFSLQ + P PV+R+P YC+ A C+ Sbjct 152 GQPGMTKKPFSLQFVKPLDVPVERFPQYCDGFAHCW 187 >gi|118469188|ref|YP_888628.1| hypothetical protein MSMEG_4353 [Mycobacterium smegmatis str. MC2 155] gi|118170475|gb|ABK71371.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length=181 Score = 99.8 bits (247), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 65/182 (36%), Positives = 87/182 (48%), Gaps = 22/182 (12%) Query 11 AVICSAVAL------GIALHSGDPARALGPPPDGSYSFNQAGVSGVTWTITALCDQPSGT 64 AVI S AL G+ + PA+A P G +++NQ G+ TWTI PS Sbjct 8 AVIASVPALVMGLWMGVGVGFAGPAQA-DPVIQGVFNYNQEGLGAATWTI-----YPSCV 61 Query 65 RNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQADGVAC 124 + D +P+ C L+V + G A+M+ +W F N +G+ C Sbjct 62 PVVGDLREPLYLPVACVLHVQGSPGVS----------GGDAKMAGGMWQFSTNIKEGIKC 111 Query 125 PDGSTAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQRYPLY 184 DGS AP ETY F D LTGT + AVCGL P + + PF+L P P PV+ YPL Sbjct 112 SDGSWAPIVETYRFDDVALTGTREVSNTAVCGLAPAIHRFPFTLTFKEPLPIPVKTYPLD 171 Query 185 CN 186 C Sbjct 172 CE 173 >gi|333992262|ref|YP_004524876.1| hypothetical protein JDM601_3622 [Mycobacterium sp. JDM601] gi|333488230|gb|AEF37622.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=180 Score = 97.4 bits (241), Expect = 9e-19, Method: Compositional matrix adjust. Identities = 68/185 (37%), Positives = 88/185 (48%), Gaps = 24/185 (12%) Query 13 ICSAVALGIALHSGDPARALGPPPDGS----YSFNQAGVSGVTWTITALCDQPSGTRNMN 68 I A + G+A+ +G AL P+G+ Y + Q S TW I LC + Sbjct 13 IAMAASAGLAVEAGADDDAL---PEGTLEGIYKYTQDDGSTATWRIYPLC-----VPTVG 64 Query 69 DYSDPIVWAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQADGVACPDGS 128 D P+ A C L V S P +G R+S+ WT+ GV CPDG Sbjct 65 DGRVPLHLAVGCKLQVESDGPP---------GQAGAYRLSNGQWTYHTPLLAGVRCPDGR 115 Query 129 TAPSSETYAFSDETLTGTHTTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQRYPLYC--N 186 TA + E Y F D +L GT+T H AVCG QP L PF+L + P P+PV YPL C N Sbjct 116 TAGTEEMYRF-DTSLRGTYTQSHNAVCGWQPGLKTHPFTLTFVSPLPNPVIHYPLTCQDN 174 Query 187 NIAMC 191 I +C Sbjct 175 PIHLC 179 >gi|145223349|ref|YP_001134027.1| hypothetical protein Mflv_2762 [Mycobacterium gilvum PYR-GCK] gi|315443808|ref|YP_004076687.1| hypothetical protein Mspyr1_22020 [Mycobacterium sp. Spyr1] gi|145215835|gb|ABP45239.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK] gi|315262111|gb|ADT98852.1| hypothetical protein Mspyr1_22020 [Mycobacterium sp. Spyr1] Length=171 Score = 90.9 bits (224), Expect = 8e-17, Method: Compositional matrix adjust. Identities = 58/159 (37%), Positives = 78/159 (50%), Gaps = 18/159 (11%) Query 28 PARALGPPPDGSYSFNQAGVSGVTWTITALCDQPSGTRNMNDYSDPIVWAFNCALNVVST 87 PA A P G Y++ G +G TWTI C G D +P+ C L+V Sbjct 23 PAAADTPVMQGDYTYVDEGRTG-TWTIFPTCVPIVG-----DLREPLYLPVGCTLHV--- 73 Query 88 TPQQITRTDRLQNFSGRARMSSMLWTFQVNQADGVACPDGSTAPSSETYAFSDETLTGTH 147 TP R G AR++ LWT+ N +G+ CPDGS AP++E +F++ + GT Sbjct 74 TPSGGVR-------GGDARLTGGLWTYTTNNREGLQCPDGSWAPTTEDVSFNETS--GTR 124 Query 148 TTVHGAVCGLQPKLSKQPFSLQLIGPPPSPVQRYPLYCN 186 H CGLQP + PF+L P P PV +YPL C Sbjct 125 RITHAGACGLQPGIIDVPFTLSFKAPLPIPVDQYPLICE 163 >gi|108800356|ref|YP_640553.1| hypothetical protein Mmcs_3390 [Mycobacterium sp. MCS] gi|119869484|ref|YP_939436.1| hypothetical protein Mkms_3452 [Mycobacterium sp. KMS] gi|126435978|ref|YP_001071669.1| hypothetical protein Mjls_3400 [Mycobacterium sp. JLS] gi|108770775|gb|ABG09497.1| conserved hypothetical protein [Mycobacterium sp. MCS] gi|119695573|gb|ABL92646.1| conserved hypothetical protein [Mycobacterium sp. KMS] gi|126235778|gb|ABN99178.1| conserved hypothetical protein [Mycobacterium sp. JLS] Length=177 Score = 84.3 bits (207), Expect = 7e-15, Method: Compositional matrix adjust. Identities = 59/173 (35%), Positives = 82/173 (48%), Gaps = 19/173 (10%) Query 17 VALGIALHSGDPARALGPPPDGSYSFNQA-GVSGVTWTITALCDQPSGTRNMNDYSDPIV 75 +ALG+A+ + PA+A G Y++ QA G+SG WTI PS + D +P+ Sbjct 13 LALGVAVGAAGPAQAEQQVLQGIYTYTQADGLSG-EWTIF-----PSCVPTVGDLREPLY 66 Query 76 WAFNCALNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQADGVACPDGSTAPSSET 135 C L+V S Q G A +++ +W + ADG+ C DGS E Sbjct 67 LPVACRLHVTSFEGVQ----------GGDAVLTNGVWAYDTTIADGITCSDGSPLQVVEI 116 Query 136 YAFSDETLTGTHTTVHGAVCG--LQPKLSKQPFSLQLIGPPPSPVQRYPLYCN 186 Y F+ ET++GT T H C L + K PFSL P PV +YPL C Sbjct 117 YEFNTETMSGTRTNTHAPGCNGELPATMIKTPFSLAYKTELPIPVDKYPLICE 169 >gi|120404731|ref|YP_954560.1| hypothetical protein Mvan_3772 [Mycobacterium vanbaalenii PYR-1] gi|119957549|gb|ABM14554.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1] Length=172 Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 49/149 (33%), Positives = 70/149 (47%), Gaps = 16/149 (10%) Query 38 GSYSFNQAGVSGVTWTITALCDQPSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDR 97 G Y++ G +G TWTI C G D +P+ C L+V + + +T Sbjct 32 GVYNYTSEGQTG-TWTIYPTCVPVVG-----DLREPLYLPVGCRLHVNPS--RGLT---- 79 Query 98 LQNFSGRARMSSMLWTFQVNQADGVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVCGL 157 G A + +W F V + +G+ CPDGS P+ E F D T++GT H CG Sbjct 80 ----GGDAALVGGVWGFSVPKTEGMQCPDGSWEPTVEVIRFDDVTMSGTRKITHNGNCGT 135 Query 158 QPKLSKQPFSLQLIGPPPSPVQRYPLYCN 186 P + + PF+L P P PV +YPL C Sbjct 136 APGIIEIPFTLSFKEPLPIPVDQYPLICE 164 >gi|288960916|ref|YP_003451255.1| major facilitator superfamily protein [Azospirillum sp. B510] gi|288913224|dbj|BAI74711.1| major facilitator superfamily protein [Azospirillum sp. B510] Length=838 Score = 41.6 bits (96), Expect = 0.066, Method: Compositional matrix adjust. Identities = 32/84 (39%), Positives = 41/84 (49%), Gaps = 7/84 (8%) Query 68 NDYSDPIVWAFNCA-LNVVSTTPQQITRTDRLQNFSGRARMSSMLWTFQVNQADGVAC-- 124 D SD + F A VVS+ + ++NF G R S L T+ + QADGVA Sbjct 48 RDCSDALFSTFGVAGRKVVSSIEYAVKYGKTIENFFGMERALSQLKTY-LPQADGVAVAL 106 Query 125 PDGSTAPSSETYAFSD---ETLTG 145 PDG A + TY F D + LTG Sbjct 107 PDGQVAYAIGTYGFKDRIGDRLTG 130 >gi|115676832|ref|XP_801712.2| PREDICTED: similar to SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily c, member 2 isoform 2 [Strongylocentrotus purpuratus] gi|115930457|ref|XP_001192167.1| PREDICTED: similar to SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily c, member 2 [Strongylocentrotus purpuratus] Length=1192 Score = 38.1 bits (87), Expect = 0.66, Method: Composition-based stats. Identities = 25/97 (26%), Positives = 43/97 (45%), Gaps = 16/97 (16%) Query 27 DPARALGPPPDGSYSFNQAGVSGVTWTITALCDQPSGTRNMNDYSDPIVWAFNCALNVVS 86 + A +GPPP + L D PSG + + S+PI+W C +N+ Sbjct 532 NKATPMGPPPTSHFH--------------VLADTPSGLQPVQA-SNPIIWCIQCMMNMTD 576 Query 87 TTPQQITRTDRLQNFSGRARMSSMLWTFQVNQADGVA 123 + T++ L NF R M + + Q+++A G + Sbjct 577 KDGAKDTKSTDLTNFGLRPDMYATKKS-QISKAKGTS 612 >gi|108800646|ref|YP_640843.1| hypothetical protein Mmcs_3681 [Mycobacterium sp. MCS] gi|119869785|ref|YP_939737.1| hypothetical protein Mkms_3754 [Mycobacterium sp. KMS] gi|126436270|ref|YP_001071961.1| hypothetical protein Mjls_3694 [Mycobacterium sp. JLS] gi|108771065|gb|ABG09787.1| conserved hypothetical protein [Mycobacterium sp. MCS] gi|119695874|gb|ABL92947.1| conserved hypothetical protein [Mycobacterium sp. KMS] gi|126236070|gb|ABN99470.1| conserved hypothetical protein [Mycobacterium sp. JLS] Length=146 Score = 37.7 bits (86), Expect = 0.79, Method: Compositional matrix adjust. Identities = 18/55 (33%), Positives = 33/55 (60%), Gaps = 1/55 (1%) Query 101 FSGRARMSSMLWTFQVNQADGVACPDGSTAPSSETYAFSDETLTGTHTTVHGAVC 155 + G A++ + +T V+ DG+ CP G + P+ ETY++ + +L GT +V+ C Sbjct 76 YEGTAQLVNGRYTMTVDVPDGLRCP-GHSMPTRETYSWDEVSLVGTIDSVYNVGC 129 >gi|120405115|ref|YP_954944.1| hypothetical protein Mvan_4161 [Mycobacterium vanbaalenii PYR-1] gi|119957933|gb|ABM14938.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1] Length=145 Score = 36.2 bits (82), Expect = 2.4, Method: Compositional matrix adjust. Identities = 19/62 (31%), Positives = 30/62 (49%), Gaps = 1/62 (1%) Query 94 RTDRLQNFSGRARMSSMLWTFQVNQADGVACPDGSTAPSSETYAFSDETLTGTHTTVHGA 153 R Q + + + +TF V+ DG+ CP G P+ ETY++ TL GT + + Sbjct 68 RLAYYQYYESKVWLVDGRYTFTVDVPDGLQCP-GHVMPTRETYSWDANTLFGTIDSKYNV 126 Query 154 VC 155 C Sbjct 127 GC 128 >gi|315443541|ref|YP_004076420.1| hypothetical protein Mspyr1_19240 [Mycobacterium sp. Spyr1] gi|315261844|gb|ADT98585.1| hypothetical protein Mspyr1_19240 [Mycobacterium sp. Spyr1] Length=146 Score = 35.8 bits (81), Expect = 3.3, Method: Compositional matrix adjust. Identities = 23/85 (28%), Positives = 38/85 (45%), Gaps = 6/85 (7%) Query 76 WAFNCALNVVSTTPQQIT-----RTDRLQNFSGRARMSSMLWTFQVNQADGVACPDGSTA 130 W + +L + TP + R + + +A + +TF V+ DG+ CP G Sbjct 46 WFWFVSLCIPEKTPDCVNVAARPRLQFYEYYESKAWLVDGRYTFTVDVPDGLQCP-GHVM 104 Query 131 PSSETYAFSDETLTGTHTTVHGAVC 155 P+ ETY++ TL GT + C Sbjct 105 PTRETYSWDAVTLLGTIDSQFAVGC 129 >gi|307710462|ref|ZP_07646899.1| endonuclease and methylase LlaGI [Streptococcus mitis SK564] gi|307618725|gb|EFN97864.1| endonuclease and methylase LlaGI [Streptococcus mitis SK564] Length=1565 Score = 35.8 bits (81), Expect = 3.4, Method: Composition-based stats. Identities = 21/70 (30%), Positives = 34/70 (49%), Gaps = 4/70 (5%) Query 36 PDGSYSFNQAGVSGVTWTITAL---CDQPSG-TRNMNDYSDPIVWAFNCALNVVSTTPQQ 91 P+ +Y + G S + W I D+ SG T + NDYSD + FN L +++ + Q Sbjct 1492 PEKAYEYVVNGRSAIEWIIDQYQVKTDRKSGITDDPNDYSDDEKYIFNLLLRIINVSVQT 1551 Query 92 ITRTDRLQNF 101 + + L F Sbjct 1552 VDLINSLPKF 1561 Lambda K H 0.318 0.132 0.421 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 192573564720 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40