BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0361
Length=275
Score E
Sequences producing significant alignments: (Bits) Value
gi|15607502|ref|NP_214875.1| hypothetical protein Rv0361 [Mycoba... 562 2e-158
gi|298523838|ref|ZP_07011247.1| conserved hypothetical protein [... 487 7e-136
gi|240173044|ref|ZP_04751702.1| hypothetical protein MkanA1_2727... 382 2e-104
gi|118618268|ref|YP_906600.1| hypothetical protein MUL_2842 [Myc... 372 4e-101
gi|254822873|ref|ZP_05227874.1| hypothetical protein MintA_23289... 352 4e-95
gi|342859136|ref|ZP_08715790.1| hypothetical protein MCOL_09668 ... 345 4e-93
gi|254777220|ref|ZP_05218736.1| hypothetical protein MaviaA2_214... 344 7e-93
gi|41409961|ref|NP_962797.1| hypothetical protein MAP3863c [Myco... 344 8e-93
gi|118467008|ref|YP_883911.1| hypothetical protein MAV_4784 [Myc... 342 5e-92
gi|15827062|ref|NP_301325.1| hypothetical protein ML0285 [Mycoba... 320 2e-85
gi|296167839|ref|ZP_06850023.1| conserved hypothetical protein [... 312 4e-83
gi|118473560|ref|YP_885159.1| hypothetical protein MSMEG_0753 [M... 255 6e-66
gi|108797484|ref|YP_637681.1| hypothetical protein Mmcs_0504 [My... 239 2e-61
gi|120401686|ref|YP_951515.1| hypothetical protein Mvan_0671 [My... 226 2e-57
gi|315442208|ref|YP_004075087.1| hypothetical protein Mspyr1_054... 225 6e-57
gi|145220838|ref|YP_001131516.1| hypothetical protein Mflv_0233 ... 225 6e-57
gi|333988998|ref|YP_004521612.1| hypothetical protein JDM601_035... 216 3e-54
gi|169631331|ref|YP_001704980.1| hypothetical protein MAB_4253 [... 210 2e-52
gi|312141297|ref|YP_004008633.1| membrane protein [Rhodococcus e... 90.5 2e-16
gi|54027352|ref|YP_121594.1| hypothetical protein nfa53780 [Noca... 75.1 1e-11
gi|226365021|ref|YP_002782804.1| hypothetical protein ROP_56120 ... 70.9 2e-10
gi|111022503|ref|YP_705475.1| hypothetical protein RHA1_ro05537 ... 70.1 4e-10
gi|343928554|ref|ZP_08768001.1| hypothetical protein GOALK_118_0... 61.2 2e-07
gi|229493098|ref|ZP_04386893.1| conserved hypothetical protein [... 58.5 1e-06
gi|226304902|ref|YP_002764860.1| hypothetical protein RER_14130 ... 58.2 1e-06
gi|296138250|ref|YP_003645493.1| hypothetical protein Tpau_0513 ... 52.8 5e-05
gi|326384115|ref|ZP_08205798.1| hypothetical protein SCNU_14329 ... 48.5 0.001
gi|296165723|ref|ZP_06848238.1| conserved hypothetical protein [... 44.3 0.019
gi|120401701|ref|YP_951530.1| hypothetical protein Mvan_0686 [My... 44.3 0.020
gi|262203913|ref|YP_003275121.1| hypothetical protein Gbro_4062 ... 38.5 1.0
gi|303321466|ref|XP_003070727.1| peroxisomal membrane protein PA... 36.6 3.7
gi|320040193|gb|EFW22126.1| peroxisomal membrane protein Pex13 [... 36.6 3.8
gi|269954828|ref|YP_003324617.1| serine/threonine protein kinase... 36.6 4.4
gi|302535587|ref|ZP_07287929.1| conserved hypothetical protein [... 36.6 4.5
gi|226315188|ref|YP_002775084.1| hypothetical protein BBR47_5603... 35.8 6.6
gi|66570962|emb|CAH10285.1| merzoite surface protein 1 [Plasmodi... 35.8 6.7
>gi|15607502|ref|NP_214875.1| hypothetical protein Rv0361 [Mycobacterium tuberculosis H37Rv]
gi|15839747|ref|NP_334784.1| hypothetical protein MT0377 [Mycobacterium tuberculosis CDC1551]
gi|31791538|ref|NP_854031.1| hypothetical protein Mb0368 [Mycobacterium bovis AF2122/97]
78 more sequence titles
Length=275
Score = 562 bits (1448), Expect = 2e-158, Method: Compositional matrix adjust.
Identities = 275/275 (100%), Positives = 275/275 (100%), Gaps = 0/275 (0%)
Query 1 MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQP 60
MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQP
Sbjct 1 MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQP 60
Query 61 EAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQSIPP 120
EAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQSIPP
Sbjct 61 EAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQSIPP 120
Query 121 RTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIA 180
RTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIA
Sbjct 121 RTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIA 180
Query 181 IQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEA 240
IQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEA
Sbjct 181 IQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEA 240
Query 241 NVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 275
NVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN
Sbjct 241 NVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 275
>gi|298523838|ref|ZP_07011247.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298493632|gb|EFI28926.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=239
Score = 487 bits (1254), Expect = 7e-136, Method: Compositional matrix adjust.
Identities = 238/239 (99%), Positives = 239/239 (100%), Gaps = 0/239 (0%)
Query 37 VPHDAETETVVITTSDNDAAVTQPEAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQ 96
+PHDAETETVVITTSDNDAAVTQPEAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQ
Sbjct 1 MPHDAETETVVITTSDNDAAVTQPEAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQ 60
Query 97 APTTPPRMPTGMPPKTAVPQSIPPRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLL 156
APTTPPRMPTGMPPKTAVPQSIPPRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLL
Sbjct 61 APTTPPRMPTGMPPKTAVPQSIPPRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLL 120
Query 157 TRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRV 216
TRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRV
Sbjct 121 TRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRV 180
Query 217 SAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 275
SAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN
Sbjct 181 SAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 239
>gi|240173044|ref|ZP_04751702.1| hypothetical protein MkanA1_27276 [Mycobacterium kansasii ATCC
12478]
Length=272
Score = 382 bits (982), Expect = 2e-104, Method: Compositional matrix adjust.
Identities = 202/277 (73%), Positives = 222/277 (81%), Gaps = 7/277 (2%)
Query 1 MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSD--NDAAVT 58
M NAPEPDR E+G + AG D G TE PL+P DAETETVVI+TSD ++
Sbjct 1 MPNAPEPDRGDTETGDQSAG----DEGAAGTEGQPLIPDDAETETVVISTSDPKDNPGSA 56
Query 59 QPEAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQSI 118
+ RERRFTAPGFDAKETQVI T+ E ATEVF T+ P PP G P KTA+PQSI
Sbjct 57 NADLPRERRFTAPGFDAKETQVIATSPEPATEVFHTSPVPPGPPPPIGGPP-KTAMPQSI 115
Query 119 PPRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLD 178
PPR +RQR WGW LA+VVIVLA+AAIAILGTVL TRGKHSK+SQE+QVRQ IQS D
Sbjct 116 PPRGGKPPLRQRNWGWVLAIVVIVLAVAAIAILGTVLFTRGKHSKVSQEEQVRQTIQSFD 175
Query 179 IAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHA 238
+AIQTGDLT LRS+TCG+TRDGYVDYDER W+ETYRRVSAAKQYPVIASIDQVVVNG HA
Sbjct 176 VAIQTGDLTTLRSITCGTTRDGYVDYDERSWSETYRRVSAAKQYPVIASIDQVVVNGQHA 235
Query 239 EANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 275
EANVTTFMA+DPQVRSTRSLDLQFRDDQWKICQS SN
Sbjct 236 EANVTTFMAYDPQVRSTRSLDLQFRDDQWKICQSPSN 272
>gi|118618268|ref|YP_906600.1| hypothetical protein MUL_2842 [Mycobacterium ulcerans Agy99]
gi|183980696|ref|YP_001848987.1| hypothetical protein MMAR_0671 [Mycobacterium marinum M]
gi|118570378|gb|ABL05129.1| conserved membrane protein [Mycobacterium ulcerans Agy99]
gi|183174022|gb|ACC39132.1| conserved membrane protein [Mycobacterium marinum M]
Length=271
Score = 372 bits (955), Expect = 4e-101, Method: Compositional matrix adjust.
Identities = 202/282 (72%), Positives = 220/282 (79%), Gaps = 18/282 (6%)
Query 1 MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQP 60
M NAPEPDR E+ R A GE TES+PL+P D ETETVVI+ +D VT P
Sbjct 1 MPNAPEPDRDGTET-------RDAAAGEAGTESFPLIPDDTETETVVISKAD---PVTDP 50
Query 61 E-----AQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVP 115
E AQ ERRFTAPGFDAKETQVI TA E ATEVFQT+QAP P P KTAVP
Sbjct 51 EPSGADAQPERRFTAPGFDAKETQVIATAPEPATEVFQTHQAPPAGPPPIGMPP-KTAVP 109
Query 116 QSIPPRTEA--TSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQA 173
QSIPPR +++QR WGW LA+VVIVLALAAIAILGTVLLTR KH +SQED+VRQ
Sbjct 110 QSIPPRDSGRPAALKQRNWGWVLAIVVIVLALAAIAILGTVLLTRNKHPNVSQEDRVRQT 169
Query 174 IQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVV 233
IQ D A+QTGDLTALRS+TCG+TRDGYVDYDER W ETYRRVSAAKQYPVIASIDQVVV
Sbjct 170 IQHFDAAVQTGDLTALRSITCGTTRDGYVDYDERSWDETYRRVSAAKQYPVIASIDQVVV 229
Query 234 NGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 275
NG HAEAN+TTFMA+DPQVRSTRSLDLQFRDDQWKICQS S+
Sbjct 230 NGQHAEANITTFMAYDPQVRSTRSLDLQFRDDQWKICQSPSS 271
>gi|254822873|ref|ZP_05227874.1| hypothetical protein MintA_23289 [Mycobacterium intracellulare
ATCC 13950]
Length=293
Score = 352 bits (903), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 191/296 (65%), Positives = 213/296 (72%), Gaps = 24/296 (8%)
Query 1 MSNAPEPDRS-----------------AGESGSEPAGERSADPGEERTESYPLVPHDAET 43
M N EPDR + E G+E E + G++ TES P+VP+DAET
Sbjct 1 MPNPSEPDRGGPPNRPGFDPRESRNDPSDEWGAESGPEAGFETGDDATESVPMVPNDAET 60
Query 44 ETVVITTSDNDAAVTQ-PEA-QRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTP 101
ETVVI D + + PEA QRERRFTAPGFDAKET +I T E ATEVF T P
Sbjct 61 ETVVINKPDPRSGPQEVPEAPQRERRFTAPGFDAKETTIISTTPEPATEVFAT---PGQD 117
Query 102 PRMPTGMPPKTAVPQSIPPRTEAT--SVRQRTWGWALAVVVIVLALAAIAILGTVLLTRG 159
G PPK AVPQSIPPR + RQ WGW LA+VVIVLALAAIAILGTVLLTRG
Sbjct 118 GTAQFGAPPKAAVPQSIPPRLGGKLRTSRQFNWGWILAIVVIVLALAAIAILGTVLLTRG 177
Query 160 KHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAA 219
KH+ +SQEDQVR I + D+AIQ GDLT LR++TCG+TRDGY DYDER W ETYRRVSAA
Sbjct 178 KHTGVSQEDQVRHTIGNFDVAIQRGDLTTLRTITCGTTRDGYADYDERSWDETYRRVSAA 237
Query 220 KQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 275
KQYPVIASIDQVVVNG HAEANVTTFMA+DPQVRSTRSLDLQ+RDDQWKICQS+S
Sbjct 238 KQYPVIASIDQVVVNGQHAEANVTTFMAYDPQVRSTRSLDLQYRDDQWKICQSASG 293
>gi|342859136|ref|ZP_08715790.1| hypothetical protein MCOL_09668 [Mycobacterium colombiense CECT
3035]
gi|342133377|gb|EGT86580.1| hypothetical protein MCOL_09668 [Mycobacterium colombiense CECT
3035]
Length=290
Score = 345 bits (886), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 181/274 (67%), Positives = 205/274 (75%), Gaps = 6/274 (2%)
Query 6 EPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQ-PEA-Q 63
+P S E E E D G + TES P+VP D+ETETVVI D + + P+ Q
Sbjct 19 DPRESRNEPSDEWGAESGFDSGGDATESVPMVPGDSETETVVINRPDPHSGPQETPDGPQ 78
Query 64 RERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQSIPPRTE 123
RERRFTAPGFDAKET +I T E ATEVF T P G+PPK AVPQSIPPR
Sbjct 79 RERRFTAPGFDAKETAIISTTAEPATEVFAT--PPGQDGTAQFGVPPKPAVPQSIPPRLG 136
Query 124 AT--SVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIAI 181
+ RQ WGW LA+VVIVLALAAIAILGTVLLTRGKH+ +SQEDQVR IQ+ D+AI
Sbjct 137 GKLRTSRQINWGWVLALVVIVLALAAIAILGTVLLTRGKHTNVSQEDQVRHTIQNFDVAI 196
Query 182 QTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEAN 241
Q GDLT LR++TCG+TRDGY DYDE W ETYRRVSAAKQYPVIASIDQVVVNG HAEAN
Sbjct 197 QRGDLTTLRTITCGTTRDGYADYDEHAWDETYRRVSAAKQYPVIASIDQVVVNGQHAEAN 256
Query 242 VTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 275
VTTFMA+DPQ+RSTRS+DLQ+RDDQWK+CQS+S
Sbjct 257 VTTFMAYDPQLRSTRSMDLQYRDDQWKVCQSASG 290
>gi|254777220|ref|ZP_05218736.1| hypothetical protein MaviaA2_21484 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=293
Score = 344 bits (883), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 184/271 (68%), Positives = 204/271 (76%), Gaps = 17/271 (6%)
Query 13 ESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQ-PEA-QRERRFTA 70
E+G EP G D E+ TES P+VP DAETETVVI D + + P+ QRERRFTA
Sbjct 32 ETGEEPDG--GFDSSEDATESVPMVPSDAETETVVINKPDPPSGPQETPDGPQRERRFTA 89
Query 71 PGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPT----GMPPKTAVPQSIPPRT--EA 124
PGFDAKET +I T E ATE F PP G+PPK AVPQSIPPR +
Sbjct 90 PGFDAKETTIIATTPEPATEAF-------VPPGQDNTTQFGVPPKAAVPQSIPPRLGGKL 142
Query 125 TSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIAIQTG 184
+ RQ WGW LA+VVIVLALAAIAILGTVLLTRG+H+ +SQEDQVR I + D AIQ G
Sbjct 143 RTSRQFNWGWILALVVIVLALAAIAILGTVLLTRGRHTNVSQEDQVRHTIGNFDAAIQRG 202
Query 185 DLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEANVTT 244
DLT LRS+TCG+TRDGY DYDE WAETYRRVSAAKQYPVIASIDQVVVNG HAEANVTT
Sbjct 203 DLTTLRSITCGTTRDGYADYDEHAWAETYRRVSAAKQYPVIASIDQVVVNGQHAEANVTT 262
Query 245 FMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 275
FMA+DPQVRSTRSLDLQ+RDDQWKICQS+S
Sbjct 263 FMAYDPQVRSTRSLDLQYRDDQWKICQSASG 293
>gi|41409961|ref|NP_962797.1| hypothetical protein MAP3863c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41398794|gb|AAS06413.1| hypothetical protein MAP_3863c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336460397|gb|EGO39297.1| hypothetical protein MAPs_41950 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=293
Score = 344 bits (883), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 184/271 (68%), Positives = 204/271 (76%), Gaps = 17/271 (6%)
Query 13 ESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQ-PEA-QRERRFTA 70
E+G EP G D E+ TES P+VP DAETETVVI D + + P+ QRERRFTA
Sbjct 32 ETGEEPDG--GFDSSEDATESVPMVPSDAETETVVINKPDPPSGPQETPDGPQRERRFTA 89
Query 71 PGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPT----GMPPKTAVPQSIPPRT--EA 124
PGFDAKET +I T E ATE F PP G+PPK AVPQSIPPR +
Sbjct 90 PGFDAKETTIIATTPEPATEAF-------VPPGQDNTTQFGVPPKAAVPQSIPPRLGGKL 142
Query 125 TSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIAIQTG 184
+ RQ WGW LA+VVIVLALAAIAILGTVLLTRG+H+ +SQEDQVR I + D AIQ G
Sbjct 143 RTSRQINWGWILALVVIVLALAAIAILGTVLLTRGRHTNVSQEDQVRHTIGNFDAAIQRG 202
Query 185 DLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEANVTT 244
DLT LRS+TCG+TRDGY DYDE WAETYRRVSAAKQYPVIASIDQVVVNG HAEANVTT
Sbjct 203 DLTTLRSITCGTTRDGYADYDEHAWAETYRRVSAAKQYPVIASIDQVVVNGQHAEANVTT 262
Query 245 FMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 275
FMA+DPQVRSTRSLDLQ+RDDQWKICQS+S
Sbjct 263 FMAYDPQVRSTRSLDLQYRDDQWKICQSASG 293
>gi|118467008|ref|YP_883911.1| hypothetical protein MAV_4784 [Mycobacterium avium 104]
gi|118168295|gb|ABK69192.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=292
Score = 342 bits (876), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 183/271 (68%), Positives = 204/271 (76%), Gaps = 18/271 (6%)
Query 13 ESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQ-PEA-QRERRFTA 70
E+G EP G + E+ TES P+VP DAETETVVI D + + P+ QRERRFTA
Sbjct 32 ETGEEPDGGFDS---EDATESVPMVPSDAETETVVINKPDPPSGPQETPDGPQRERRFTA 88
Query 71 PGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPT----GMPPKTAVPQSIPPRT--EA 124
PGFDAKET +I T E ATE F PP G+PPK AVPQSIPPR +
Sbjct 89 PGFDAKETTIIATTPEPATEAF-------VPPGQDNTTQFGVPPKAAVPQSIPPRLGGKL 141
Query 125 TSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIAIQTG 184
+ RQ WGW LA+VVIVLALAAIAILGTVLLTRG+H+ +SQEDQVR I + D AIQ G
Sbjct 142 RTSRQINWGWILALVVIVLALAAIAILGTVLLTRGRHTNVSQEDQVRHTIGNFDAAIQRG 201
Query 185 DLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEANVTT 244
DLT LRS+TCG+TRDGY DYDE WAETYRRVSAAKQYPVIASIDQVVVNG HAEANVTT
Sbjct 202 DLTTLRSITCGTTRDGYADYDEHAWAETYRRVSAAKQYPVIASIDQVVVNGQHAEANVTT 261
Query 245 FMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 275
FMA+DPQVRSTRSLDLQ+RDDQWKICQS+S
Sbjct 262 FMAYDPQVRSTRSLDLQYRDDQWKICQSASG 292
>gi|15827062|ref|NP_301325.1| hypothetical protein ML0285 [Mycobacterium leprae TN]
gi|221229540|ref|YP_002502956.1| hypothetical protein MLBr_00285 [Mycobacterium leprae Br4923]
gi|3129997|emb|CAA18949.1| putative membrane protein [Mycobacterium leprae]
gi|13092610|emb|CAC29793.1| putative membrane protein [Mycobacterium leprae]
gi|219932647|emb|CAR70378.1| putative membrane protein [Mycobacterium leprae Br4923]
Length=292
Score = 320 bits (820), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 161/251 (65%), Positives = 188/251 (75%), Gaps = 3/251 (1%)
Query 27 GEERTESYPLVPHDAETETVVITTSDNDAAVTQPEAQRERRFTAPGFDAKETQVIVTAHE 86
GEE YPL D+ETE +V+T ++ D ERRFTAPGFDA+ T ++ TA +
Sbjct 43 GEEIAAGYPLAHSDSETEAMVLTKTEPDQDPGADRQHHERRFTAPGFDARATAIMATAPD 102
Query 87 AATEVFQTNQAPTTPPRMPTGMPPKTAVPQSIPP--RTEATSVRQRTWGWALAVVVIVLA 144
ATE + + PP G+ PK AVPQSIPP T+ S R WGW +A++++VLA
Sbjct 103 PATEAIHPPLSSSDPPGH-LGISPKAAVPQSIPPVLGTKLRSARHFHWGWVVALLMMVLA 161
Query 145 LAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDY 204
LAAIAILGTVLLTRGKH K S +QVR AIQS D+A+QTG+LTALRS+TCG+TRDGYV+Y
Sbjct 162 LAAIAILGTVLLTRGKHVKASPAEQVRHAIQSFDVAVQTGNLTALRSITCGTTRDGYVEY 221
Query 205 DERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRD 264
DE W ETY RVSAAKQYPVIASIDQVVVNG HAEAN+TTFMA+DPQVRSTRSLDLQF D
Sbjct 222 DESSWDETYHRVSAAKQYPVIASIDQVVVNGQHAEANITTFMAYDPQVRSTRSLDLQFCD 281
Query 265 DQWKICQSSSN 275
DQWKICQS S
Sbjct 282 DQWKICQSPSG 292
>gi|296167839|ref|ZP_06850023.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897010|gb|EFG76632.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=276
Score = 312 bits (799), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 185/281 (66%), Positives = 205/281 (73%), Gaps = 11/281 (3%)
Query 1 MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVI----TTSDNDAA 56
M PEPDR G + + P E +A GE+ TES PLVP D+ETETVVI + D
Sbjct 1 MPQPPEPDR--GGAPNTPGHEWAAPSGEDATESVPLVPSDSETETVVIRPDAGGNPADPG 58
Query 57 VTQPEAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQ 116
QRERRFTAPGFDAKET VI T E ATEVF + P P P G P K AVPQ
Sbjct 59 EYADGQQRERRFTAPGFDAKETAVISTTQEPATEVFAS--PPGGMPAQP-GTPAKPAVPQ 115
Query 117 SIPPRTEAT--SVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAI 174
SIPPR + RQ WGW LA+V++VLALAAIAILGTVLLTR KH+K+SQEDQVR I
Sbjct 116 SIPPRLGGKLRTSRQFNWGWVLALVLVVLALAAIAILGTVLLTRSKHTKVSQEDQVRTTI 175
Query 175 QSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVN 234
+S D AIQ GDLT LR++TCG+TRDGY DYDE W ETYRRVSAAKQYPVIASIDQVVVN
Sbjct 176 ESFDTAIQKGDLTTLRTITCGTTRDGYADYDEHAWDETYRRVSAAKQYPVIASIDQVVVN 235
Query 235 GAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSSSN 275
G HAEANVTTFMA+DP +RSTRSLDLQ+RDD WKICQS S
Sbjct 236 GQHAEANVTTFMAYDPSLRSTRSLDLQYRDDHWKICQSPSG 276
>gi|118473560|ref|YP_885159.1| hypothetical protein MSMEG_0753 [Mycobacterium smegmatis str.
MC2 155]
gi|118174847|gb|ABK75743.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=266
Score = 255 bits (651), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 148/280 (53%), Positives = 179/280 (64%), Gaps = 25/280 (8%)
Query 1 MSNAPEPDRS---AGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAV 57
M N EP+ + E +EP+ + + P E+ T S H+ TE + T A
Sbjct 1 MPNPSEPNEDHTPSSEDPTEPSRQSADAPTEKVTLSSE---HEPATEVFGLPTEPGQAT- 56
Query 58 TQPEAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQS 117
P+ ERRFTAP TQ I T + TEVF AP P G P K PQ
Sbjct 57 --PQGD-ERRFTAPSSFDGSTQKIDTPPDPETEVF----AP------PPGDPNKPVAPQV 103
Query 118 IPPRTEAT-----SVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQ 172
IPPR +A + +R+WGW +AVV+++ AL AIAILGTVLLTR S SQED+VR+
Sbjct 104 IPPRDDAARPQAPATARRSWGWVIAVVLVIAALVAIAILGTVLLTRDSASAGSQEDRVRE 163
Query 173 AIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVV 232
IQ D AIQ GDL LRS+TCG+TRD YV+Y+E+ WAET+ RV+AAKQYPV+ASIDQV+
Sbjct 164 TIQKFDSAIQRGDLATLRSITCGTTRDNYVNYNEKAWAETHERVAAAKQYPVVASIDQVI 223
Query 233 VNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQS 272
VN HAEANVTTFMAF PQ RSTRS DLQFRDD+WKICQS
Sbjct 224 VNDDHAEANVTTFMAFAPQTRSTRSFDLQFRDDEWKICQS 263
>gi|108797484|ref|YP_637681.1| hypothetical protein Mmcs_0504 [Mycobacterium sp. MCS]
gi|119866569|ref|YP_936521.1| hypothetical protein Mkms_0515 [Mycobacterium sp. KMS]
gi|126433105|ref|YP_001068796.1| hypothetical protein Mjls_0493 [Mycobacterium sp. JLS]
gi|108767903|gb|ABG06625.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119692658|gb|ABL89731.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126232905|gb|ABN96305.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=277
Score = 239 bits (611), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 141/248 (57%), Positives = 169/248 (69%), Gaps = 23/248 (9%)
Query 40 DAETETVVITTSDNDAAVTQPEAQRERRFTAP-GFDAKETQVIVTAHEAATEVFQTNQAP 98
D E T VI +S A +P ++ ERRFTAP GFDA TQ I + ATEVF
Sbjct 39 DHEPATEVIASSSPGADQFEP-SEGERRFTAPSGFDAGSTQKIDPPPDPATEVF------ 91
Query 99 TTPPRMPTGMPP---------KTAVPQSIPPRTEATSVRQ---RTWGWALAVVVIVLALA 146
PR G+P K A PQ IPPR +A Q R+WGW +AVV+++ AL
Sbjct 92 ---PRSEGGLPGSGTDPFAAQKAAGPQVIPPRGDAPRPPQQGRRSWGWVVAVVLVIAALV 148
Query 147 AIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDE 206
AIAILGT+LLTRG S SQEDQVR I+ D+AIQ GDL LR +TCG+ RD YV+YD+
Sbjct 149 AIAILGTILLTRGSGSTASQEDQVRATIEQFDVAIQNGDLATLRGITCGAKRDSYVNYDD 208
Query 207 RDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQ 266
+ WAET++RV+AAKQYPV+ASIDQ+VVNG HAEANVT FMA+ PQ RSTRS DL+FRDDQ
Sbjct 209 KAWAETHKRVAAAKQYPVVASIDQIVVNGDHAEANVTAFMAYAPQTRSTRSFDLEFRDDQ 268
Query 267 WKICQSSS 274
WKICQ+ S
Sbjct 269 WKICQAPS 276
>gi|120401686|ref|YP_951515.1| hypothetical protein Mvan_0671 [Mycobacterium vanbaalenii PYR-1]
gi|119954504|gb|ABM11509.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=262
Score = 226 bits (577), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 142/277 (52%), Positives = 175/277 (64%), Gaps = 22/277 (7%)
Query 1 MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQP 60
MSNA PD S DP E+ L D+E E + + P
Sbjct 1 MSNAGGPDNS------------DQDPSSAEPETEVLSGADSEHEPATEVFAPAEQQDEDP 48
Query 61 EAQRERRFTAP-GFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGM--PPKTAVPQS 117
+ ERRFTAP GFDA TQVI + ATE F + P PT P K A PQ
Sbjct 49 DQPGERRFTAPSGFDAGSTQVINRPTDPATEQFSVHD-----PGAPTEAFAPQKPAAPQM 103
Query 118 IPPRTEATS--VRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQ 175
IPPR +A ++R WGW +A+V++V ALAA+AILGTVLLTRG S +SQED+VR I+
Sbjct 104 IPPRGDAPRPPKKKRNWGWVVAIVLVVAALAAVAILGTVLLTRGSGSSVSQEDRVRSTIE 163
Query 176 SLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNG 235
+ D AI+ GDL LRS+TCG+T + Y + D++ W ET+RRV+ A +YPV+ASIDQ+VVNG
Sbjct 164 NYDAAIEKGDLATLRSITCGTTAEAYNNIDDKKWIETHRRVADAGRYPVVASIDQIVVNG 223
Query 236 AHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQS 272
HAEANVTTFMAF PQ RSTRS DLQFRDD+WKICQ+
Sbjct 224 DHAEANVTTFMAFAPQTRSTRSFDLQFRDDEWKICQA 260
>gi|315442208|ref|YP_004075087.1| hypothetical protein Mspyr1_05430 [Mycobacterium sp. Spyr1]
gi|315260511|gb|ADT97252.1| hypothetical protein Mspyr1_05430 [Mycobacterium sp. Spyr1]
Length=253
Score = 225 bits (573), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 136/273 (50%), Positives = 175/273 (65%), Gaps = 23/273 (8%)
Query 1 MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQP 60
MSNAP PD GE + A+ E TE P + E T V ++
Sbjct 1 MSNAPGPD---GED------DAPAESSEAETEVLPGTDAEHEPATEVFAPAEPQDGDPSD 51
Query 61 EAQRERRFTAP-GFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQSIP 119
+A ERRFTAP GFDA TQVI HE TE F ++ P P +PP+ P+ P
Sbjct 52 DAG-ERRFTAPSGFDAGSTQVINRPHELPTEAFAAHK-----PAAPQMIPPRGETPK--P 103
Query 120 PRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDI 179
P+T +R WGW +A+++++ ALAA A++GT+LLTRG + +SQED VR AIQ+ D
Sbjct 104 PKT-----GRRNWGWVIAIILVIAALAAAAVVGTLLLTRGSATSVSQEDSVRTAIQNYDA 158
Query 180 AIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAE 239
AI+ GDL LRS+TCG+T + Y +DER W +T+ RV+ A +YPV+ASID++V+NG HAE
Sbjct 159 AIEKGDLATLRSITCGATAESYNKFDERQWKDTHSRVAEAGRYPVVASIDEIVINGDHAE 218
Query 240 ANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQS 272
ANVTTFMAF PQ RSTRS DLQFRDDQWKICQ+
Sbjct 219 ANVTTFMAFAPQTRSTRSFDLQFRDDQWKICQA 251
>gi|145220838|ref|YP_001131516.1| hypothetical protein Mflv_0233 [Mycobacterium gilvum PYR-GCK]
gi|145213324|gb|ABP42728.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=259
Score = 225 bits (573), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 136/273 (50%), Positives = 175/273 (65%), Gaps = 23/273 (8%)
Query 1 MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITTSDNDAAVTQP 60
MSNAP PD GE + A+ E TE P + E T V ++
Sbjct 7 MSNAPGPD---GED------DAPAESSEAETEVLPGTDAEHEPATEVFAPAEPQDGDPSD 57
Query 61 EAQRERRFTAP-GFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQSIP 119
+A ERRFTAP GFDA TQVI HE TE F ++ P P +PP+ P+ P
Sbjct 58 DAG-ERRFTAPSGFDAGSTQVINRPHELPTEAFAAHK-----PAAPQMIPPRGETPK--P 109
Query 120 PRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDI 179
P+T +R WGW +A+++++ ALAA A++GT+LLTRG + +SQED VR AIQ+ D
Sbjct 110 PKT-----GRRNWGWVIAIILVIAALAAAAVVGTLLLTRGSATSVSQEDSVRTAIQNYDA 164
Query 180 AIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAE 239
AI+ GDL LRS+TCG+T + Y +DER W +T+ RV+ A +YPV+ASID++V+NG HAE
Sbjct 165 AIEKGDLATLRSITCGATAESYNKFDERQWKDTHSRVAEAGRYPVVASIDEIVINGDHAE 224
Query 240 ANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQS 272
ANVTTFMAF PQ RSTRS DLQFRDDQWKICQ+
Sbjct 225 ANVTTFMAFAPQTRSTRSFDLQFRDDQWKICQA 257
>gi|333988998|ref|YP_004521612.1| hypothetical protein JDM601_0358 [Mycobacterium sp. JDM601]
gi|333484966|gb|AEF34358.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=296
Score = 216 bits (550), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 129/241 (54%), Positives = 161/241 (67%), Gaps = 25/241 (10%)
Query 40 DAETETVVITTSDNDAAVT----QPEAQRERRFTAPGFDAKETQVIVTAHEAATEVF--Q 93
D + T VITT +AA T P+ ERRFTAPGFD T+V+ + +A TE+ Q
Sbjct 70 DGDAVTEVITT---EAAPTTTDSSPDGPPERRFTAPGFDGA-TEVMPSVGDADTELIARQ 125
Query 94 TNQAPTTPPRMPTGMPPKTAVPQSIPPRTEA--TSVRQRTWGWALAVVVIVLALAAIAIL 151
A A+PQ IPPR + R+WGW LA+++I++ LA +A+L
Sbjct 126 AGDA-------------GKAIPQQIPPRLGGRLPATVSRSWGWVLALILIIVVLAVVAVL 172
Query 152 GTVLLTRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAE 211
GT+ LTR SQE++VR I + DIA+Q GDL ALRSLTCG RD YV+YD++ W E
Sbjct 173 GTLWLTRDDRQAASQEERVRGTILNFDIAMQNGDLAALRSLTCGDIRDRYVNYDQKAWDE 232
Query 212 TYRRVSAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQ 271
TYRR++AAKQYPV+ASID+VVVN HAEANVT FMA+ P+VRSTRS DLQFRDDQWKICQ
Sbjct 233 TYRRIAAAKQYPVVASIDEVVVNDGHAEANVTAFMAYAPRVRSTRSFDLQFRDDQWKICQ 292
Query 272 S 272
S
Sbjct 293 S 293
>gi|169631331|ref|YP_001704980.1| hypothetical protein MAB_4253 [Mycobacterium abscessus ATCC 19977]
gi|169243298|emb|CAM64326.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=262
Score = 210 bits (535), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/264 (49%), Positives = 174/264 (66%), Gaps = 28/264 (10%)
Query 13 ESGSEPAGERSA---DPGEERTESYPLVPHDAETETVVITTSDNDAAVTQPEAQRERRFT 69
ES +P G +A E+ T++ P+ P D + +TVV+ + D R+T
Sbjct 22 ESAEQPEGAANAPEVSDTEDNTDTGPVPPLD-DAQTVVMAPAKPDVP----------RYT 70
Query 70 APGFDAKETQVI-VTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQSIPPRTEATSVR 128
APGFDA +T++I + TE Q + T PK A P++I PR E
Sbjct 71 APGFDANKTEMIDPVGDDPKTEFIQP---------LATQARPKAATPETIAPRKE----E 117
Query 129 QRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTA 188
+R+WGW +A+ ++V ALAA+ +L V+++R K SQE+ VR +IQ+ D AIQTG+L A
Sbjct 118 KRSWGWVIALALVVAALAAVIVLAAVIISRTSTPKASQEELVRNSIQNYDNAIQTGNLAA 177
Query 189 LRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEANVTTFMAF 248
LR++TCG TRDGYV Y + +W++TY++V+AAKQYPV+ASID+VVVNG HAEANVT+FMAF
Sbjct 178 LRTITCGETRDGYVRYPDGEWSQTYQKVAAAKQYPVVASIDEVVVNGEHAEANVTSFMAF 237
Query 249 DPQVRSTRSLDLQFRDDQWKICQS 272
PQ RS+RS DLQFRD+QWKICQS
Sbjct 238 APQTRSSRSFDLQFRDNQWKICQS 261
>gi|312141297|ref|YP_004008633.1| membrane protein [Rhodococcus equi 103S]
gi|325673869|ref|ZP_08153559.1| hypothetical protein HMPREF0724_11341 [Rhodococcus equi ATCC
33707]
gi|311890636|emb|CBH49954.1| putative membrane protein [Rhodococcus equi 103S]
gi|325555134|gb|EGD24806.1| hypothetical protein HMPREF0724_11341 [Rhodococcus equi ATCC
33707]
Length=175
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/177 (35%), Positives = 86/177 (49%), Gaps = 10/177 (5%)
Query 95 NQAPTTPPRMPTGMPPKTAVPQSIPPRTEATSVRQRTWGWALA-VVVIVLALAAIAILGT 153
N PT+P P + P P T+ R GW +A + +V LAA+ + G
Sbjct 5 NDNPTSPAAGPQRIRPTA-------PSTKKDRRRGGNRGWIVASTLAVVGVLAALGVTGF 57
Query 154 VLLTRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETY 213
V+ G E +VR A+ + A+ +G+L AL+S TCG+ D Y D D+A +
Sbjct 58 VIARTGAQDP--DESRVRNAVDTFAQALDSGNLGALQSSTCGTLADFYRDIPPTDFAGVH 115
Query 214 RRVSAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKIC 270
R V A PV+ S+D V + G A A VT DP S R+LDL+ D WK+C
Sbjct 116 RDVVAHGGVPVVTSVDTVQITGDTAIAQVTAHTEADPSDASPRTLDLERVDGTWKVC 172
>gi|54027352|ref|YP_121594.1| hypothetical protein nfa53780 [Nocardia farcinica IFM 10152]
gi|54018860|dbj|BAD60230.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=268
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/117 (35%), Positives = 58/117 (50%), Gaps = 4/117 (3%)
Query 154 VLLTRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETY 213
+ L RG S EDQVR AI + A+ GDL ALR TCG + Y + +A +
Sbjct 149 MALLRGD----SPEDQVRAAIDNYTSALHDGDLAALRESTCGPLHEFYRNITPEQFASVH 204
Query 214 RRVSAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKIC 270
++ + PV+ +D + + A A T + DP RS R+ DL+ D WK+C
Sbjct 205 QQSRERRSIPVVDGVDAIKITDNTALAQATVYTEADPGNRSARTFDLEKTDSGWKVC 261
>gi|226365021|ref|YP_002782804.1| hypothetical protein ROP_56120 [Rhodococcus opacus B4]
gi|226243511|dbj|BAH53859.1| hypothetical membrane protein [Rhodococcus opacus B4]
Length=381
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/163 (27%), Positives = 78/163 (48%), Gaps = 2/163 (1%)
Query 108 MPPKTAVPQSIPPRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQE 167
+PP+ + PQ + ++ W A + + + A + +++ ++ S E
Sbjct 218 LPPQQSAPQRV--AAAEKPAKKHGKAWLFAAIAAGVIVVAAIAVAGIVVYNNNQAENSPE 275
Query 168 DQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIAS 227
QV+ I + A+ GDL LR+ TCGS + Y ++D+AE ++ + PV+
Sbjct 276 AQVQGTIDTFVAALAQGDLATLRTSTCGSLAEYYQGISDQDFAEVHQVAVTQQNIPVVGG 335
Query 228 IDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKIC 270
+D V + G A A V A +P +S R+ +L+ D WKIC
Sbjct 336 VDAVQITGDTAIAQVKAHTAANPSEQSWRTFNLEKVDGTWKIC 378
>gi|111022503|ref|YP_705475.1| hypothetical protein RHA1_ro05537 [Rhodococcus jostii RHA1]
gi|110822033|gb|ABG97317.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=588
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 37/110 (34%), Positives = 58/110 (53%), Gaps = 0/110 (0%)
Query 161 HSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAK 220
++ S E QV+ I + A+ GDL LR+ TCGS + Y ++D+AE ++ +
Sbjct 476 QAENSPEAQVQGTIDTFVAALTQGDLATLRTSTCGSLAEYYQGISDQDFAEVHQVAVNQQ 535
Query 221 QYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKIC 270
PV+ +D V + G A A V A +P +S R+ +L+ D WKIC
Sbjct 536 NIPVVGGVDAVQITGDSAIAQVKAHTAANPGEQSWRTFNLEKVDGTWKIC 585
>gi|343928554|ref|ZP_08768001.1| hypothetical protein GOALK_118_00450 [Gordonia alkanivorans NBRC
16433]
gi|343761565|dbj|GAA14927.1| hypothetical protein GOALK_118_00450 [Gordonia alkanivorans NBRC
16433]
Length=700
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/159 (28%), Positives = 71/159 (45%), Gaps = 5/159 (3%)
Query 115 PQSIPPRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAI 174
PQ IP + ++R G +AV I++ +AA+ +G + + + D+ +
Sbjct 543 PQVIPGSRPTRTTKKRGMGPLIAVAAILVVIAAV--IGGIFAYQ-NMTATPPADEAAEVA 599
Query 175 QSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASID--QVV 232
A+ GDL LRS+TCG Y D+D+ + +TY A + +I+ +VV
Sbjct 600 LDYTTALYEGDLETLRSVTCGELHAFYEDFDDAAYQKTYDAQKARNELVQTQAINAVRVV 659
Query 233 VNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQ 271
G A V P T +L+LQ D WK+C
Sbjct 660 EGGELAVVEVVAVHTSAPDAPETVTLNLQREGDDWKVCN 698
>gi|229493098|ref|ZP_04386893.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229320128|gb|EEN85954.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=284
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/107 (33%), Positives = 56/107 (53%), Gaps = 1/107 (0%)
Query 167 EDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIA 226
E QV+ AI + A+QTGDL LR+ TCG+ + Y + +A+ + A K P +
Sbjct 178 EAQVQTAISTYVDALQTGDLATLRTSTCGALGEYYRTIPDAAFAQVHDNAVAQKTIPQVG 237
Query 227 SIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSS 273
++D V + A A V + + +S R+ DL+ +D WK+C S
Sbjct 238 AVDAVRITDDTAIAQVQASLPSTGE-QSWRTFDLERQDGTWKVCDPS 283
>gi|226304902|ref|YP_002764860.1| hypothetical protein RER_14130 [Rhodococcus erythropolis PR4]
gi|226184017|dbj|BAH32121.1| hypothetical protein RER_14130 [Rhodococcus erythropolis PR4]
Length=307
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/107 (33%), Positives = 56/107 (53%), Gaps = 1/107 (0%)
Query 167 EDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIA 226
E QV+ AI + A+QTGDL LR+ TCG+ + Y + +A+ + A K P +
Sbjct 201 EAQVQTAISTYVDALQTGDLATLRTSTCGALGEYYRTIPDAAFAQVHDNAVAQKTIPQVG 260
Query 227 SIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQSS 273
++D V + A A V + + +S R+ DL+ +D WK+C S
Sbjct 261 AVDAVRITDDTAIAQVQASLPSTGE-QSWRTFDLERQDGTWKVCDPS 306
>gi|296138250|ref|YP_003645493.1| hypothetical protein Tpau_0513 [Tsukamurella paurometabola DSM
20162]
gi|296026384|gb|ADG77154.1| hypothetical protein Tpau_0513 [Tsukamurella paurometabola DSM
20162]
Length=575
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/176 (28%), Positives = 77/176 (44%), Gaps = 11/176 (6%)
Query 102 PRMPTGMPPKTAVPQSIPPRTEATSVRQ---RTWGWALAVVVIVLALAAIAILGTVLLTR 158
P +P P + +VP+ P T+ Q W W A VV+V+A +L L++
Sbjct 398 PALPADNPTERSVPEYYAPTTQRPLPEQSGGNRWKWIAAGVVVVIAAIVGIVL---LVSG 454
Query 159 GKHSKMSQEDQVRQAIQSLDI--AIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRV 216
G + +V+ A + D AI +GDL ALR+ TCG+ + Y ++ +
Sbjct 455 GAGGADQTDPRVQAATSTTDFVGAINSGDLNALRAQTCGAAKQYYDRISTDEYQRVHDNA 514
Query 217 SAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDD--QWKIC 270
A P + + + VNG AE V P + +L L +D+ WK+C
Sbjct 515 KADGLLPELDGLQAIDVNGDRAEVQVEVHYTGQPDAKVKNTLTL-AKDEAGAWKVC 569
>gi|326384115|ref|ZP_08205798.1| hypothetical protein SCNU_14329 [Gordonia neofelifaecis NRRL
B-59395]
gi|326197275|gb|EGD54466.1| hypothetical protein SCNU_14329 [Gordonia neofelifaecis NRRL
B-59395]
Length=351
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/107 (27%), Positives = 46/107 (43%), Gaps = 0/107 (0%)
Query 165 SQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPV 224
S E +V +A + A+ +GDL LR +TCG Y + + + Y +
Sbjct 243 SPEHKVAEAAKDYQNAMTSGDLDKLREVTCGEEYAYYSKIPDAAFQKAYEAQKNRNELMT 302
Query 225 IASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFRDDQWKICQ 271
+ V +NG A V + + DP + + LQ D WK+C+
Sbjct 303 FDDVKAVEINGDTARVGVDMYPSNDPSKTAPAQITLQNVDGTWKVCK 349
>gi|296165723|ref|ZP_06848238.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898896|gb|EFG78387.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=150
Score = 44.3 bits (103), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/113 (32%), Positives = 53/113 (47%), Gaps = 18/113 (15%)
Query 165 SQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDGY----VDYDERDWAETYRRVSAAK 220
S EDQVR+ + + A T + TA L C + R + +DY ++ R S
Sbjct 51 SAEDQVRETVTAFQDAYNTQNWTAYTELMCVAMRAKFTGPVLDYVKKG-----RSESGLT 105
Query 221 QYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDLQFR-DDQWKICQS 272
SI V VNG A A++T+ + TRS+ L + +D WKICQ+
Sbjct 106 H----VSITSVTVNGDTATASMTS----SNEALGTRSVSLPLKLEDGWKICQT 150
>gi|120401701|ref|YP_951530.1| hypothetical protein Mvan_0686 [Mycobacterium vanbaalenii PYR-1]
gi|119954519|gb|ABM11524.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=147
Score = 44.3 bits (103), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 31/141 (22%), Positives = 58/141 (42%), Gaps = 5/141 (3%)
Query 133 GWALAVVVIVLALAAIAILGTVLLTRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSL 192
G A+A + L++ ++G L ++ E + +A + A+Q D L++
Sbjct 10 GRAVAPFLGALSIIVAVVIGIWLFNLFSGDGLTDEQLIARAASGQNDALQRADYADLQAF 69
Query 193 TCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQV 252
TC R DE + + R + + + VVV+G A A++T + DP
Sbjct 70 TCTEAR-----ADEAEVLDRQRDSVEKRGNRFVERVAGVVVDGDRASADITYYFEKDPDA 124
Query 253 RSTRSLDLQFRDDQWKICQSS 273
+ T + WK+C +
Sbjct 125 KETLEMTFAREGGTWKVCSTG 145
>gi|262203913|ref|YP_003275121.1| hypothetical protein Gbro_4062 [Gordonia bronchialis DSM 43247]
gi|262087260|gb|ACY23228.1| hypothetical protein Gbro_4062 [Gordonia bronchialis DSM 43247]
Length=694
Score = 38.5 bits (88), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/91 (26%), Positives = 38/91 (42%), Gaps = 0/91 (0%)
Query 180 AIQTGDLTALRSLTCGSTRDGYVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAE 239
A+ +GDL LR +TCG + Y + + + + I V V+G A
Sbjct 601 ALSSGDLNTLREVTCGREQAFYQSQQPDQYQKIFNAQRDRNELIKFGEIKAVRVDGNTAV 660
Query 240 ANVTTFMAFDPQVRSTRSLDLQFRDDQWKIC 270
+ PQ + +++LQ D WK+C
Sbjct 661 VELPVAPGNRPQEQELTTINLQKSGDDWKVC 691
>gi|303321466|ref|XP_003070727.1| peroxisomal membrane protein PAS20, putative [Coccidioides posadasii
C735 delta SOWgp]
gi|240110424|gb|EER28582.1| peroxisomal membrane protein PAS20, putative [Coccidioides posadasii
C735 delta SOWgp]
Length=449
Score = 36.6 bits (83), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 31/98 (32%), Positives = 46/98 (47%), Gaps = 17/98 (17%)
Query 172 QAIQSLDIAIQTGDLTALRSLT--CGS--------TRDGYVDYDERDWAETYRRVSAAKQ 221
QA +D+A++ GD+ A+ S T GS RDG V Y + +T +R Q
Sbjct 324 QAAVGIDLAVRKGDIVAVLSKTDPMGSPSEWWRCRARDGSVGYLPGPYLQTIQR---KPQ 380
Query 222 YPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLD 259
I +D NG + A + TF AF Q ++T+ D
Sbjct 381 QRAITEVD----NGPVSAAQINTFEAFPEQTKATKGED 414
>gi|320040193|gb|EFW22126.1| peroxisomal membrane protein Pex13 [Coccidioides posadasii str.
Silveira]
Length=449
Score = 36.6 bits (83), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 31/98 (32%), Positives = 46/98 (47%), Gaps = 17/98 (17%)
Query 172 QAIQSLDIAIQTGDLTALRSLT--CGS--------TRDGYVDYDERDWAETYRRVSAAKQ 221
QA +D+A++ GD+ A+ S T GS RDG V Y + +T +R Q
Sbjct 324 QAAVGIDLAVRKGDIVAVLSKTDPMGSPSEWWRCRARDGSVGYLPGPYLQTIQR---KPQ 380
Query 222 YPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLD 259
I +D NG + A + TF AF Q ++T+ D
Sbjct 381 QRAITEVD----NGPVSAAQINTFEAFPEQTKATKGED 414
>gi|269954828|ref|YP_003324617.1| serine/threonine protein kinase with PASTA sensor(s) [Xylanimonas
cellulosilytica DSM 15894]
gi|269303509|gb|ACZ29059.1| serine/threonine protein kinase with PASTA sensor(s) [Xylanimonas
cellulosilytica DSM 15894]
Length=693
Score = 36.6 bits (83), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 31/98 (32%), Positives = 49/98 (50%), Gaps = 8/98 (8%)
Query 68 FTAPGFDAKETQVIVTAHEAATEVFQTNQAPTTPPRMPTGMPPKTAVPQSIPPRTEATSV 127
F APGF TQ++ T AAT+ + P P G P A P + +A +
Sbjct 314 FGAPGFGEAGTQLL-TPETAATQQWGATGLPQAMPAAGVGGP---AGPTGPSRQEQADAR 369
Query 128 RQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKMS 165
R++T W L + + VLA+AAI T++L G+ +++
Sbjct 370 RKKTLMWTL-ITIGVLAVAAIV---TIILLNGRQQEVA 403
>gi|302535587|ref|ZP_07287929.1| conserved hypothetical protein [Streptomyces sp. C]
gi|302444482|gb|EFL16298.1| conserved hypothetical protein [Streptomyces sp. C]
Length=584
Score = 36.6 bits (83), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 26/85 (31%), Positives = 40/85 (48%), Gaps = 5/85 (5%)
Query 105 PTGMPPKTAVPQSIPPRTEATSVRQRTWGWALAVVVIVLALAAIAILGTVLLTRGKHSKM 164
P G+P AVP PP ++ + R WG A A++++ L L + + T+L HSK
Sbjct 360 PVGLPTTQAVPSVPPPPLQSRTGRALKWGVA-ALLIVALGLGSWQLADTLL----DHSKS 414
Query 165 SQEDQVRQAIQSLDIAIQTGDLTAL 189
D + Q + A Q +L L
Sbjct 415 GGGDNTQTQQQGPNDAAQKKELKPL 439
>gi|226315188|ref|YP_002775084.1| hypothetical protein BBR47_56030 [Brevibacillus brevis NBRC 100599]
gi|226098138|dbj|BAH46580.1| hypothetical protein [Brevibacillus brevis NBRC 100599]
Length=134
Score = 35.8 bits (81), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 30/130 (24%), Positives = 58/130 (45%), Gaps = 13/130 (10%)
Query 146 AAIAILGTVLLTRG-----KHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDG 200
AAI +LG +++T G ++ +++E++V++ +Q A++ GD + T D
Sbjct 8 AAILLLGAIVVTGGIWTSNTYASVNEENEVKETVQIYLEALENGDTPTMVEYTIDERFD- 66
Query 201 YVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDPQVRSTRSLDL 260
+++D + Y K + +D V V E TF + V +L +
Sbjct 67 ----NDKDKQKVYESFGTQK---MDIDMDSVSVEKIEDEKMSVTFHYSNEHVSEDITLPV 119
Query 261 QFRDDQWKIC 270
+DQWK+
Sbjct 120 VKENDQWKVV 129
>gi|66570962|emb|CAH10285.1| merzoite surface protein 1 [Plasmodium reichenowi]
Length=1739
Score = 35.8 bits (81), Expect = 6.7, Method: Composition-based stats.
Identities = 25/87 (29%), Positives = 34/87 (40%), Gaps = 5/87 (5%)
Query 38 PHDAETETVVITTSDNDAAVTQPEAQRERRFTAPGF-DAKETQVIVTAHEAATEVFQTNQ 96
P + T + + A TQP T P + TQ + T Q +
Sbjct 748 PSQPSSATTTPPSQPSSATTTQPPQPSSATTTPPSQPSSATTQPPQPSSATTTPPPQESS 807
Query 97 APTTPPRMP----TGMPPKTAVPQSIP 119
A TTPP P T +PP+ VPQ+ P
Sbjct 808 ATTTPPSQPSSATTTLPPQPTVPQAQP 834
Lambda K H
0.313 0.126 0.361
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 432410969436
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40