BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0877
Length=262
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value
gi|323720586|gb|EGB29664.1| hypothetical protein TMMG_02866 [Myc... 526 1e-147
gi|15608017|ref|NP_215392.1| hypothetical protein Rv0877 [Mycoba... 524 6e-147
gi|340625889|ref|YP_004744341.1| hypothetical protein MCAN_08781... 523 1e-146
gi|339293889|gb|AEJ46000.1| hypothetical protein CCDC5079_0810 [... 507 7e-142
gi|296169642|ref|ZP_06851260.1| conserved hypothetical protein [... 450 9e-125
gi|183984622|ref|YP_001852913.1| hypothetical protein MMAR_4655 ... 443 1e-122
gi|118616147|ref|YP_904479.1| hypothetical protein MUL_0273 [Myc... 439 3e-121
gi|41406914|ref|NP_959750.1| hypothetical protein MAP0816 [Mycob... 439 3e-121
gi|118464954|ref|YP_880265.1| hypothetical protein MAV_1007 [Myc... 438 3e-121
gi|254773891|ref|ZP_05215407.1| hypothetical protein MaviaA2_043... 437 5e-121
gi|240171962|ref|ZP_04750621.1| hypothetical protein MkanA1_2179... 433 1e-119
gi|336461243|gb|EGO40118.1| Protein of unknown function (DUF3027... 426 2e-117
gi|342860489|ref|ZP_08717140.1| hypothetical protein MCOL_16481 ... 425 3e-117
gi|15828150|ref|NP_302413.1| hypothetical protein ML2142 [Mycoba... 424 7e-117
gi|254823197|ref|ZP_05228198.1| hypothetical protein MintA_24929... 419 2e-115
gi|108801434|ref|YP_641631.1| hypothetical protein Mmcs_4471 [My... 385 3e-105
gi|118473623|ref|YP_889924.1| hypothetical protein MSMEG_5691 [M... 375 3e-102
gi|315442737|ref|YP_004075616.1| hypothetical protein Mspyr1_109... 371 7e-101
gi|120405988|ref|YP_955817.1| hypothetical protein Mvan_5039 [My... 369 3e-100
gi|145222303|ref|YP_001132981.1| hypothetical protein Mflv_1712 ... 367 8e-100
gi|333989443|ref|YP_004522057.1| hypothetical protein JDM601_080... 350 1e-94
gi|111021935|ref|YP_704907.1| hypothetical protein RHA1_ro04968 ... 313 1e-83
gi|169627975|ref|YP_001701624.1| hypothetical protein MAB_0876 [... 307 8e-82
gi|226364445|ref|YP_002782227.1| hypothetical protein ROP_50350 ... 305 4e-81
gi|312138426|ref|YP_004005762.1| hypothetical protein REQ_09730 ... 305 5e-81
gi|229488311|ref|ZP_04382177.1| conserved hypothetical protein [... 296 2e-78
gi|54022610|ref|YP_116852.1| hypothetical protein nfa6430 [Nocar... 296 2e-78
gi|226308118|ref|YP_002768078.1| hypothetical protein RER_46310 ... 293 2e-77
gi|333918514|ref|YP_004492095.1| hypothetical protein AS9A_0843 ... 263 2e-68
gi|300790292|ref|YP_003770583.1| hypothetical protein AMED_8485 ... 254 1e-65
gi|296138561|ref|YP_003645804.1| hypothetical protein Tpau_0829 ... 248 5e-64
gi|319949873|ref|ZP_08023882.1| hypothetical protein ES5_10287 [... 246 2e-63
gi|257057359|ref|YP_003135191.1| hypothetical protein Svir_34000... 243 1e-62
gi|326384171|ref|ZP_08205853.1| hypothetical protein SCNU_14606 ... 239 2e-61
gi|302530548|ref|ZP_07282890.1| conserved hypothetical protein [... 234 8e-60
gi|317509529|ref|ZP_07967132.1| hypothetical protein HMPREF9336_... 233 3e-59
gi|331699389|ref|YP_004335628.1| hypothetical protein Psed_5647 ... 230 1e-58
gi|296395230|ref|YP_003660114.1| hypothetical protein Srot_2852 ... 229 3e-58
gi|343928207|ref|ZP_08767662.1| hypothetical protein GOALK_110_0... 229 3e-58
gi|262201220|ref|YP_003272428.1| hypothetical protein Gbro_1239 ... 227 2e-57
gi|256374638|ref|YP_003098298.1| hypothetical protein Amir_0485 ... 225 5e-57
gi|284989396|ref|YP_003407950.1| hypothetical protein Gobs_0815 ... 222 4e-56
gi|134097208|ref|YP_001102869.1| hypothetical protein SACE_0598 ... 219 3e-55
gi|291005335|ref|ZP_06563308.1| hypothetical protein SeryN2_1251... 219 3e-55
gi|258654837|ref|YP_003203993.1| hypothetical protein Namu_4728 ... 207 2e-51
gi|302865011|ref|YP_003833648.1| hypothetical protein Micau_0505... 205 5e-51
gi|334338037|ref|YP_004543189.1| hypothetical protein Isova_2591... 204 1e-50
gi|330465298|ref|YP_004403041.1| hypothetical protein VAB18032_0... 203 2e-50
gi|325002210|ref|ZP_08123322.1| hypothetical protein PseP1_25761... 203 3e-50
gi|152967524|ref|YP_001363308.1| hypothetical protein Krad_3581 ... 202 5e-50
>gi|323720586|gb|EGB29664.1| hypothetical protein TMMG_02866 [Mycobacterium tuberculosis CDC1551A]
Length=277
Score = 526 bits (1355), Expect = 1e-147, Method: Compositional matrix adjust.
Identities = 262/262 (100%), Positives = 262/262 (100%), Gaps = 0/262 (0%)
Query 1 VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH
Sbjct 16 VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 75
Query 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP
Sbjct 76 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 135
Query 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG
Sbjct 136 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 195
Query 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG
Sbjct 196 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 255
Query 241 STPIYEPYDDGVLDIIEKPAES 262
STPIYEPYDDGVLDIIEKPAES
Sbjct 256 STPIYEPYDDGVLDIIEKPAES 277
>gi|15608017|ref|NP_215392.1| hypothetical protein Rv0877 [Mycobacterium tuberculosis H37Rv]
gi|15840291|ref|NP_335328.1| hypothetical protein MT0900 [Mycobacterium tuberculosis CDC1551]
gi|31792065|ref|NP_854558.1| hypothetical protein Mb0901 [Mycobacterium bovis AF2122/97]
77 more sequence titles
Length=262
Score = 524 bits (1349), Expect = 6e-147, Method: Compositional matrix adjust.
Identities = 261/262 (99%), Positives = 262/262 (100%), Gaps = 0/262 (0%)
Query 1 VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
+TGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH
Sbjct 1 MTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
Query 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP
Sbjct 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
Query 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG
Sbjct 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
Query 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG
Sbjct 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
Query 241 STPIYEPYDDGVLDIIEKPAES 262
STPIYEPYDDGVLDIIEKPAES
Sbjct 241 STPIYEPYDDGVLDIIEKPAES 262
>gi|340625889|ref|YP_004744341.1| hypothetical protein MCAN_08781 [Mycobacterium canettii CIPT
140010059]
gi|340004079|emb|CCC43216.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=262
Score = 523 bits (1347), Expect = 1e-146, Method: Compositional matrix adjust.
Identities = 260/262 (99%), Positives = 262/262 (100%), Gaps = 0/262 (0%)
Query 1 VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
+TGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLG+SYEDGNAATH
Sbjct 1 MTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGLSYEDGNAATH 60
Query 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP
Sbjct 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
Query 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG
Sbjct 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
Query 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG
Sbjct 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
Query 241 STPIYEPYDDGVLDIIEKPAES 262
STPIYEPYDDGVLDIIEKPAES
Sbjct 241 STPIYEPYDDGVLDIIEKPAES 262
>gi|339293889|gb|AEJ46000.1| hypothetical protein CCDC5079_0810 [Mycobacterium tuberculosis
CCDC5079]
gi|339297530|gb|AEJ49640.1| hypothetical protein CCDC5180_0803 [Mycobacterium tuberculosis
CCDC5180]
Length=253
Score = 507 bits (1305), Expect = 7e-142, Method: Compositional matrix adjust.
Identities = 252/253 (99%), Positives = 253/253 (100%), Gaps = 0/253 (0%)
Query 10 VATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGY 69
+ATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGY
Sbjct 1 MATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGY 60
Query 70 QGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKD 129
QGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKD
Sbjct 61 QGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKD 120
Query 130 DPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKR 189
DPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKR
Sbjct 121 DPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKR 180
Query 190 VCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYD 249
VCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYD
Sbjct 181 VCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYD 240
Query 250 DGVLDIIEKPAES 262
DGVLDIIEKPAES
Sbjct 241 DGVLDIIEKPAES 253
>gi|296169642|ref|ZP_06851260.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295895639|gb|EFG75335.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=353
Score = 450 bits (1158), Expect = 9e-125, Method: Compositional matrix adjust.
Identities = 217/256 (85%), Positives = 235/256 (92%), Gaps = 0/256 (0%)
Query 1 VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
VTGP EESAVATVA+WPEGLAAVL A DQARAAVVEFSG E+VGDYLGVSYED AATH
Sbjct 7 VTGPIEESAVATVAEWPEGLAAVLTAAVDQARAAVVEFSGAESVGDYLGVSYEDPAAATH 66
Query 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
RF+AHLPGYQGWQWAVVVA++ GADHAT+SEVVL+PGPTALLAP WVPW+QRVRPGDLSP
Sbjct 67 RFLAHLPGYQGWQWAVVVAAHPGADHATVSEVVLIPGPTALLAPAWVPWDQRVRPGDLSP 126
Query 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
GDLLAPA DDPRLVPGY ASGD QVDE AAEIGLGRRWVMSAWGRA +A+RWH GD+GP
Sbjct 127 GDLLAPAADDPRLVPGYVASGDDQVDEAAAEIGLGRRWVMSAWGRAAAAERWHTGDHGPD 186
Query 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
S MARSTKRVCRDCGFFLPL+GSLGAMFGVCGNELSADG VVD+QYGCGAHSDT APAG
Sbjct 187 SPMARSTKRVCRDCGFFLPLSGSLGAMFGVCGNELSADGQVVDKQYGCGAHSDTPAPAGT 246
Query 241 STPIYEPYDDGVLDII 256
+P+YEP+DDGVLD++
Sbjct 247 GSPMYEPFDDGVLDVV 262
>gi|183984622|ref|YP_001852913.1| hypothetical protein MMAR_4655 [Mycobacterium marinum M]
gi|183177948|gb|ACC43058.1| conserved protein [Mycobacterium marinum M]
Length=267
Score = 443 bits (1139), Expect = 1e-122, Method: Compositional matrix adjust.
Identities = 220/265 (84%), Positives = 234/265 (89%), Gaps = 3/265 (1%)
Query 1 VTGPTE---ESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNA 57
+TGP E ES+ VA+WPE LA L GA AR AV EFSGPEAVGDYLGVSYED NA
Sbjct 1 MTGPFEDSTESSATAVAEWPEELAPTLTGAVALAREAVEEFSGPEAVGDYLGVSYEDPNA 60
Query 58 ATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGD 117
ATHRF+AHLPGYQGWQWA VVASY GAD T+SEVVLVPGPTALLAP WVPWEQRVRPGD
Sbjct 61 ATHRFLAHLPGYQGWQWAAVVASYVGADRVTVSEVVLVPGPTALLAPAWVPWEQRVRPGD 120
Query 118 LSPGDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDY 177
LSPGDLLAPAKDDPRLVPGY ASGDAQVDETAAE+GLGRRWVMS WGRA++AQRWHDGD
Sbjct 121 LSPGDLLAPAKDDPRLVPGYAASGDAQVDETAAEVGLGRRWVMSGWGRAEAAQRWHDGDC 180
Query 178 GPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAP 237
GP S MARSTKRVCRDCGFFLPLAGSLGAMFGVCGNEL+ADG VVD+QYGCGAHSDT AP
Sbjct 181 GPDSPMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELAADGRVVDKQYGCGAHSDTPAP 240
Query 238 AGGSTPIYEPYDDGVLDIIEKPAES 262
AG +P+YEPYDDGVLD++EKPAES
Sbjct 241 AGTGSPMYEPYDDGVLDVVEKPAES 265
>gi|118616147|ref|YP_904479.1| hypothetical protein MUL_0273 [Mycobacterium ulcerans Agy99]
gi|118568257|gb|ABL03008.1| conserved protein [Mycobacterium ulcerans Agy99]
Length=267
Score = 439 bits (1128), Expect = 3e-121, Method: Compositional matrix adjust.
Identities = 218/265 (83%), Positives = 232/265 (88%), Gaps = 3/265 (1%)
Query 1 VTGPTE---ESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNA 57
+TGP E ES+ VA+WPE LA L GA AR AV EFSG EAVGDYLGVSYED NA
Sbjct 1 MTGPFEDSTESSATAVAEWPEELAPTLTGAVALAREAVEEFSGAEAVGDYLGVSYEDPNA 60
Query 58 ATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGD 117
ATHRF+AHLPGYQGWQWA VVASY GAD T+SEVVLVPGPTALLAP WVPWEQRVRPGD
Sbjct 61 ATHRFLAHLPGYQGWQWAAVVASYVGADRVTVSEVVLVPGPTALLAPAWVPWEQRVRPGD 120
Query 118 LSPGDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDY 177
LSPGDLLAPAKDDPRLVPGY ASGDAQVDETAAE+GLGRRWVMS WGRA++AQRWHDGD
Sbjct 121 LSPGDLLAPAKDDPRLVPGYAASGDAQVDETAAEVGLGRRWVMSGWGRAEAAQRWHDGDC 180
Query 178 GPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAP 237
GP S MARSTKRVCRDCGFFLPLAGSLG MFGVCGNEL+ADG VVD+QYGCGAHSDT AP
Sbjct 181 GPDSPMARSTKRVCRDCGFFLPLAGSLGVMFGVCGNELAADGRVVDKQYGCGAHSDTPAP 240
Query 238 AGGSTPIYEPYDDGVLDIIEKPAES 262
AG +P+YEPYDDGVLD++EKPAES
Sbjct 241 AGTGSPMYEPYDDGVLDVVEKPAES 265
>gi|41406914|ref|NP_959750.1| hypothetical protein MAP0816 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41395264|gb|AAS03133.1| hypothetical protein MAP_0816 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=310
Score = 439 bits (1128), Expect = 3e-121, Method: Compositional matrix adjust.
Identities = 215/255 (85%), Positives = 229/255 (90%), Gaps = 0/255 (0%)
Query 1 VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
VT P EES++AT +WPEGLAAVL GAADQARAAVVEFSG E VGDYL V YED AATH
Sbjct 17 VTTPLEESSMATAGEWPEGLAAVLTGAADQARAAVVEFSGAETVGDYLAVGYEDPYAATH 76
Query 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
RF+AHLPGYQGWQWAVVVA+Y GADHATISEVVLVPGPTALLAP+WVPWEQRVRPGDLSP
Sbjct 77 RFLAHLPGYQGWQWAVVVAAYPGADHATISEVVLVPGPTALLAPEWVPWEQRVRPGDLSP 136
Query 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
GDLLAPA +DPRLVPGYTASGD QVDETAAEIGLGRRWVMSA GRA++A+RWH G YGP
Sbjct 137 GDLLAPAANDPRLVPGYTASGDPQVDETAAEIGLGRRWVMSAEGRAEAAERWHTGAYGPD 196
Query 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
S MARSTKRVCRDCGFFLPL+GSLG MFGVCGNELSADGHVVD YGCGAHSDT APAG
Sbjct 197 SPMARSTKRVCRDCGFFLPLSGSLGRMFGVCGNELSADGHVVDMHYGCGAHSDTPAPAGT 256
Query 241 STPIYEPYDDGVLDI 255
+P YEPYDDG+LD+
Sbjct 257 GSPAYEPYDDGLLDV 271
>gi|118464954|ref|YP_880265.1| hypothetical protein MAV_1007 [Mycobacterium avium 104]
gi|118166241|gb|ABK67138.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=297
Score = 438 bits (1127), Expect = 3e-121, Method: Compositional matrix adjust.
Identities = 216/255 (85%), Positives = 230/255 (91%), Gaps = 0/255 (0%)
Query 1 VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
VT P EES++AT +WPEGLAAVL GAADQARAAVVEFSG E VGDYLGV YED AATH
Sbjct 17 VTTPLEESSMATAGEWPEGLAAVLTGAADQARAAVVEFSGAETVGDYLGVGYEDPYAATH 76
Query 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
RF+AHLPGYQGWQWAVVVA+Y GADHATISEVVLVPGPTALLAP+WVPWEQRVRPGDLSP
Sbjct 77 RFLAHLPGYQGWQWAVVVAAYPGADHATISEVVLVPGPTALLAPEWVPWEQRVRPGDLSP 136
Query 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
GDLLAPA +DPRLVPGYTASGD QVDETAAEIGLGRRWVMSA GRA++A+RWH G YGP
Sbjct 137 GDLLAPAANDPRLVPGYTASGDPQVDETAAEIGLGRRWVMSAEGRAEAAERWHTGAYGPD 196
Query 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
S MARSTKRVCRDCGFFLPL+GSLG MFGVCGNELSADGHVVD YGCGAHSDT APAG
Sbjct 197 SPMARSTKRVCRDCGFFLPLSGSLGRMFGVCGNELSADGHVVDMHYGCGAHSDTPAPAGT 256
Query 241 STPIYEPYDDGVLDI 255
+P YEPYDDG+LD+
Sbjct 257 GSPAYEPYDDGLLDV 271
>gi|254773891|ref|ZP_05215407.1| hypothetical protein MaviaA2_04342 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=287
Score = 437 bits (1125), Expect = 5e-121, Method: Compositional matrix adjust.
Identities = 216/255 (85%), Positives = 230/255 (91%), Gaps = 0/255 (0%)
Query 1 VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
VT P EES++AT +WPEGLAAVL GAADQARAAVVEFSG E VGDYLGV YED AATH
Sbjct 7 VTTPLEESSMATAGEWPEGLAAVLTGAADQARAAVVEFSGAETVGDYLGVGYEDPYAATH 66
Query 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
RF+AHLPGYQGWQWAVVVA+Y GADHATISEVVLVPGPTALLAP+WVPWEQRVRPGDLSP
Sbjct 67 RFLAHLPGYQGWQWAVVVAAYPGADHATISEVVLVPGPTALLAPEWVPWEQRVRPGDLSP 126
Query 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
GDLLAPA +DPRLVPGYTASGD QVDETAAEIGLGRRWVMSA GRA++A+RWH G YGP
Sbjct 127 GDLLAPAANDPRLVPGYTASGDPQVDETAAEIGLGRRWVMSAEGRAEAAERWHTGAYGPD 186
Query 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
S MARSTKRVCRDCGFFLPL+GSLG MFGVCGNELSADGHVVD YGCGAHSDT APAG
Sbjct 187 SPMARSTKRVCRDCGFFLPLSGSLGRMFGVCGNELSADGHVVDMHYGCGAHSDTPAPAGT 246
Query 241 STPIYEPYDDGVLDI 255
+P YEPYDDG+LD+
Sbjct 247 GSPAYEPYDDGLLDV 261
>gi|240171962|ref|ZP_04750621.1| hypothetical protein MkanA1_21790 [Mycobacterium kansasii ATCC
12478]
Length=267
Score = 433 bits (1114), Expect = 1e-119, Method: Compositional matrix adjust.
Identities = 222/265 (84%), Positives = 243/265 (92%), Gaps = 3/265 (1%)
Query 1 VTGPTE---ESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNA 57
+TGP E ESAVATVA+WP+ LA+VL GA DQARAAV EFSGP+AVGDYLGV YED NA
Sbjct 1 MTGPLEKSVESAVATVAEWPQDLASVLTGAVDQARAAVAEFSGPDAVGDYLGVGYEDPNA 60
Query 58 ATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGD 117
ATHRF+AHLPGYQGWQWAVVVA+++GAD ATISEVVLVPGPTALLAP WVPWE+RV+PGD
Sbjct 61 ATHRFLAHLPGYQGWQWAVVVAAHAGADRATISEVVLVPGPTALLAPPWVPWERRVQPGD 120
Query 118 LSPGDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDY 177
LSPGDLLAPAKDDPRLVPGY+ASGD QVDETAAEIG GRRWV+SAWGRA +A+RWH+GDY
Sbjct 121 LSPGDLLAPAKDDPRLVPGYSASGDPQVDETAAEIGFGRRWVLSAWGRAGAAERWHNGDY 180
Query 178 GPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAP 237
GP SAMARSTKRVCRDCGFFLPLAG+LGAMFGVCGNELSADGHVVD+ YGCGAHSDT AP
Sbjct 181 GPDSAMARSTKRVCRDCGFFLPLAGALGAMFGVCGNELSADGHVVDKHYGCGAHSDTPAP 240
Query 238 AGGSTPIYEPYDDGVLDIIEKPAES 262
AG +P+Y+PYDDGVLDI EKP ES
Sbjct 241 AGSGSPMYDPYDDGVLDIWEKPPES 265
>gi|336461243|gb|EGO40118.1| Protein of unknown function (DUF3027) [Mycobacterium avium subsp.
paratuberculosis S397]
Length=285
Score = 426 bits (1095), Expect = 2e-117, Method: Compositional matrix adjust.
Identities = 209/246 (85%), Positives = 222/246 (91%), Gaps = 0/246 (0%)
Query 10 VATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGY 69
+AT +WPEGLAAVL GAADQARAAVVEFSG E VGDYL V YED AATHRF+AHLPGY
Sbjct 1 MATAGEWPEGLAAVLTGAADQARAAVVEFSGAETVGDYLAVGYEDPYAATHRFLAHLPGY 60
Query 70 QGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKD 129
QGWQWAVVVA+Y GADHATISEVVLVPGPTALLAP+WVPWEQRVRPGDLSPGDLLAPA +
Sbjct 61 QGWQWAVVVAAYPGADHATISEVVLVPGPTALLAPEWVPWEQRVRPGDLSPGDLLAPAAN 120
Query 130 DPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKR 189
DPRLVPGYTASGD QVDETAAEIGLGRRWVMSA GRA++A+RWH G YGP S MARSTKR
Sbjct 121 DPRLVPGYTASGDPQVDETAAEIGLGRRWVMSAEGRAEAAERWHTGAYGPDSPMARSTKR 180
Query 190 VCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYD 249
VCRDCGFFLPL+GSLG MFGVCGNELSADGHVVD YGCGAHSDT APAG +P YEPYD
Sbjct 181 VCRDCGFFLPLSGSLGRMFGVCGNELSADGHVVDMHYGCGAHSDTPAPAGTGSPAYEPYD 240
Query 250 DGVLDI 255
DG+LD+
Sbjct 241 DGLLDV 246
>gi|342860489|ref|ZP_08717140.1| hypothetical protein MCOL_16481 [Mycobacterium colombiense CECT
3035]
gi|342132144|gb|EGT85385.1| hypothetical protein MCOL_16481 [Mycobacterium colombiense CECT
3035]
Length=339
Score = 425 bits (1093), Expect = 3e-117, Method: Compositional matrix adjust.
Identities = 216/259 (84%), Positives = 232/259 (90%), Gaps = 0/259 (0%)
Query 1 VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
+ PT E AVATVADWPEGLA VL GAA+ ARAAVVEFSGPE VGDYLGV YED N ATH
Sbjct 11 IEEPTVEFAVATVADWPEGLATVLTGAAEAARAAVVEFSGPEMVGDYLGVGYEDPNTATH 70
Query 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
RF+AHLPGYQGWQWAVVVA+Y GADHATISEVVLVPGPTALLAP+WVPWEQRVRPGDLSP
Sbjct 71 RFLAHLPGYQGWQWAVVVAAYPGADHATISEVVLVPGPTALLAPEWVPWEQRVRPGDLSP 130
Query 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
GDLLAPA DDPRLVPGYTASGD QVDETAAEIGLGRRWVMS GRA++A+RW GDYGP
Sbjct 131 GDLLAPAADDPRLVPGYTASGDPQVDETAAEIGLGRRWVMSVEGRAEAAERWRTGDYGPD 190
Query 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
S MARSTKRVCRDCGFFLPL+GSLGA+FGVCGNELSADGH+VD+ YGCGAHSDT APAG
Sbjct 191 SPMARSTKRVCRDCGFFLPLSGSLGALFGVCGNELSADGHIVDKLYGCGAHSDTPAPAGT 250
Query 241 STPIYEPYDDGVLDIIEKP 259
+P YEPYDDG+LD+ + P
Sbjct 251 GSPAYEPYDDGMLDVTQAP 269
>gi|15828150|ref|NP_302413.1| hypothetical protein ML2142 [Mycobacterium leprae TN]
gi|221230627|ref|YP_002504043.1| hypothetical protein MLBr_02142 [Mycobacterium leprae Br4923]
gi|2440100|emb|CAB16669.1| hypothetical protein MLCB57.29 [Mycobacterium leprae]
gi|13093704|emb|CAC31097.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219933734|emb|CAR72239.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=269
Score = 424 bits (1090), Expect = 7e-117, Method: Compositional matrix adjust.
Identities = 207/257 (81%), Positives = 225/257 (88%), Gaps = 0/257 (0%)
Query 1 VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
+TGP E+SAVATVA+WPE LAAVL AAD ARAA+ EFSG VGDYLGV YED NAATH
Sbjct 1 MTGPVEDSAVATVAEWPEELAAVLTNAADDARAAIEEFSGSVTVGDYLGVGYEDPNAATH 60
Query 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
RF+AHLPGYQGWQWAVVVA+Y GA+HAT+SEVVLVPGPTALLAP+WVPWEQRVRPGDL P
Sbjct 61 RFLAHLPGYQGWQWAVVVAAYPGAEHATVSEVVLVPGPTALLAPEWVPWEQRVRPGDLGP 120
Query 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
GDLLAP +D RLVPGY ASGD VDE AAEIGLGRRWVMS WGR+ +A+RWH GDYGP
Sbjct 121 GDLLAPTSEDLRLVPGYNASGDPAVDEIAAEIGLGRRWVMSVWGRSAAAERWHGGDYGPD 180
Query 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
S MARSTKRVCRDCGFFL L GSLGAMFGVCGNE+SADGHVVD+ YGCGAHSDT APAG
Sbjct 181 SPMARSTKRVCRDCGFFLSLVGSLGAMFGVCGNEMSADGHVVDKLYGCGAHSDTPAPAGS 240
Query 241 STPIYEPYDDGVLDIIE 257
+ +YEPYDDGVLDI+E
Sbjct 241 GSSVYEPYDDGVLDILE 257
>gi|254823197|ref|ZP_05228198.1| hypothetical protein MintA_24929 [Mycobacterium intracellulare
ATCC 13950]
Length=329
Score = 419 bits (1077), Expect = 2e-115, Method: Compositional matrix adjust.
Identities = 219/261 (84%), Positives = 236/261 (91%), Gaps = 0/261 (0%)
Query 1 VTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
+T PTE S+VATV +WPEGLAAVL GAADQARAAV EFSGPE VGDYLGV YED N ATH
Sbjct 7 MTRPTEGSSVATVDEWPEGLAAVLTGAADQARAAVAEFSGPEMVGDYLGVGYEDPNTATH 66
Query 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
RF+AHLPGYQGWQWAVVVA+Y+GADHAT+SEVVLVPGPTALLAP+WVPWE RVRPGDLSP
Sbjct 67 RFLAHLPGYQGWQWAVVVAAYAGADHATVSEVVLVPGPTALLAPEWVPWEHRVRPGDLSP 126
Query 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
GDLLAPA DDPRLVPGYTASGD QVDETAAEIGLGRRWVMS GRA +A+RWH GDYGP
Sbjct 127 GDLLAPAADDPRLVPGYTASGDPQVDETAAEIGLGRRWVMSGEGRAAAAERWHTGDYGPD 186
Query 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
S MARSTKRVCRDCGFFLPL+GSLGAMFGVCGNELSADGH+VD+QYGCGAHSDT APAG
Sbjct 187 SPMARSTKRVCRDCGFFLPLSGSLGAMFGVCGNELSADGHIVDKQYGCGAHSDTPAPAGT 246
Query 241 STPIYEPYDDGVLDIIEKPAE 261
+P YEPYDDG+LD+ + P E
Sbjct 247 GSPAYEPYDDGLLDVTQAPVE 267
>gi|108801434|ref|YP_641631.1| hypothetical protein Mmcs_4471 [Mycobacterium sp. MCS]
gi|119870587|ref|YP_940539.1| hypothetical protein Mkms_4558 [Mycobacterium sp. KMS]
gi|126437419|ref|YP_001073110.1| hypothetical protein Mjls_4854 [Mycobacterium sp. JLS]
gi|108771853|gb|ABG10575.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119696676|gb|ABL93749.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126237219|gb|ABO00620.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=257
Score = 385 bits (990), Expect = 3e-105, Method: Compositional matrix adjust.
Identities = 182/248 (74%), Positives = 213/248 (86%), Gaps = 0/248 (0%)
Query 11 ATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQ 70
A VA P L AVL GA DQARAA+ EFSGP+ +G+YLG S+ED +ATHRF+A +PGY+
Sbjct 9 ADVAQRPADLEAVLMGAVDQARAALAEFSGPDTIGEYLGASFEDPTSATHRFLADMPGYR 68
Query 71 GWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDD 130
GWQWAVVVA+Y GA+ ATISE+VLVPGPTALLAP WVPW+ RVRPGDL PGDLLAP ++D
Sbjct 69 GWQWAVVVAAYPGAEQATISELVLVPGPTALLAPKWVPWQDRVRPGDLGPGDLLAPPRED 128
Query 131 PRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRV 190
PRLVPG+ ASGD Q+DETAAE+GLGRR V+S WGR +AQRWHDGD+GPGSAMARSTKRV
Sbjct 129 PRLVPGHVASGDPQIDETAAEVGLGRRQVLSRWGRIDAAQRWHDGDFGPGSAMARSTKRV 188
Query 191 CRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDD 250
CRDCGF+LPL+GSLG MFGVC NE+SADGHVVD +YGCGAHSDT AP +P+Y+PYDD
Sbjct 189 CRDCGFYLPLSGSLGVMFGVCANEMSADGHVVDSEYGCGAHSDTPAPQLTGSPLYDPYDD 248
Query 251 GVLDIIEK 258
GV+D+ +K
Sbjct 249 GVIDLADK 256
>gi|118473623|ref|YP_889924.1| hypothetical protein MSMEG_5691 [Mycobacterium smegmatis str.
MC2 155]
gi|118174910|gb|ABK75806.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=261
Score = 375 bits (964), Expect = 3e-102, Method: Compositional matrix adjust.
Identities = 179/247 (73%), Positives = 211/247 (86%), Gaps = 1/247 (0%)
Query 11 ATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQ 70
A AD P GL +VLRGA D ARAA+ EFSG + VG+YLGV+ ED ++ATHRF+A++PGY+
Sbjct 16 AAAADAP-GLESVLRGAVDVARAALTEFSGADTVGEYLGVTLEDPSSATHRFLANMPGYR 74
Query 71 GWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDD 130
GWQWAVVVA+Y GAD ATISE+VLVPGPTALLAP WVPW++R+RPGDL PGDLLAP DD
Sbjct 75 GWQWAVVVAAYPGADRATISELVLVPGPTALLAPKWVPWQERIRPGDLGPGDLLAPPPDD 134
Query 131 PRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRV 190
PRLVPGY+A+GD +DETA E+G GRR V+S WGRA +AQRWHDG++GP S MARSTKRV
Sbjct 135 PRLVPGYSATGDPLIDETALELGFGRRQVLSEWGRAAAAQRWHDGEFGPNSPMARSTKRV 194
Query 191 CRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDD 250
CRDCGF+LPL G+LG MFGVC NELSADGHVVD +YGCGAHSDT AP G +P+YEP+DD
Sbjct 195 CRDCGFYLPLGGALGRMFGVCANELSADGHVVDSEYGCGAHSDTPAPPGTGSPLYEPFDD 254
Query 251 GVLDIIE 257
GVLD+ +
Sbjct 255 GVLDLAD 261
>gi|315442737|ref|YP_004075616.1| hypothetical protein Mspyr1_10980 [Mycobacterium sp. Spyr1]
gi|315261040|gb|ADT97781.1| hypothetical protein Mspyr1_10980 [Mycobacterium sp. Spyr1]
Length=308
Score = 371 bits (952), Expect = 7e-101, Method: Compositional matrix adjust.
Identities = 173/243 (72%), Positives = 206/243 (85%), Gaps = 0/243 (0%)
Query 20 LAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVA 79
L AVL GA D ARAA+VEFSG ++VG+YLG +ED +ATHRF+A LPGY+GWQWAVVVA
Sbjct 28 LEAVLLGAVDDARAAIVEFSGEDSVGEYLGAGFEDSTSATHRFLAELPGYRGWQWAVVVA 87
Query 80 SYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTA 139
+ GA ATISEVVLVPGPTALLAP WVPWE+RVRPGDLSPGDLLAP DDPRL PGY A
Sbjct 88 ACPGAGRATISEVVLVPGPTALLAPQWVPWEERVRPGDLSPGDLLAPPADDPRLAPGYAA 147
Query 140 SGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLP 199
+GD Q+DE A E+GLGRR V+S WGR +AQRWHDG+YGPGSAMAR+T+R+CRDCG+++P
Sbjct 148 TGDPQIDEVAVEVGLGRRQVLSLWGRNDTAQRWHDGEYGPGSAMARATRRMCRDCGYYVP 207
Query 200 LAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIEKP 259
L G+LG MFGVC NE +ADGHVVD ++GCGAHSDT A AG +P+Y+PYDDGVLD++++P
Sbjct 208 LGGALGVMFGVCANEYAADGHVVDAEFGCGAHSDTPAAAGTGSPLYDPYDDGVLDVVDRP 267
Query 260 AES 262
S
Sbjct 268 QAS 270
>gi|120405988|ref|YP_955817.1| hypothetical protein Mvan_5039 [Mycobacterium vanbaalenii PYR-1]
gi|119958806|gb|ABM15811.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=275
Score = 369 bits (947), Expect = 3e-100, Method: Compositional matrix adjust.
Identities = 172/239 (72%), Positives = 206/239 (87%), Gaps = 0/239 (0%)
Query 20 LAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVA 79
L A+L GA ++ARAA+VEFSG VG+YLG +ED ++ATHRF+A +PGY+GWQWAVVVA
Sbjct 35 LEALLLGAVEEARAAIVEFSGDGTVGEYLGAGFEDPSSATHRFLAEMPGYRGWQWAVVVA 94
Query 80 SYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTA 139
+ GA HATISEVVLVPGPTALLAP WVPW++R+RPGDLSPGDLLAP +DPRLVPGYTA
Sbjct 95 ACPGAAHATISEVVLVPGPTALLAPKWVPWDERIRPGDLSPGDLLAPPAEDPRLVPGYTA 154
Query 140 SGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLP 199
+GD Q+DE A E+GLGRR V+S WGR +AQRWHDGDYGPGSAMAR+T+RVCRDCGF+LP
Sbjct 155 TGDPQIDEVAVEVGLGRRQVLSLWGRNDTAQRWHDGDYGPGSAMARATRRVCRDCGFYLP 214
Query 200 LAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIEK 258
L G+LG +FGVC NE +ADGHVVD +YGCGAHSDT A G +P+++PYDDGVLD++EK
Sbjct 215 LGGALGVLFGVCANEYAADGHVVDAEYGCGAHSDTPAAPGNGSPLFDPYDDGVLDLVEK 273
>gi|145222303|ref|YP_001132981.1| hypothetical protein Mflv_1712 [Mycobacterium gilvum PYR-GCK]
gi|145214789|gb|ABP44193.1| hypothetical protein Mflv_1712 [Mycobacterium gilvum PYR-GCK]
Length=308
Score = 367 bits (943), Expect = 8e-100, Method: Compositional matrix adjust.
Identities = 173/243 (72%), Positives = 205/243 (85%), Gaps = 0/243 (0%)
Query 20 LAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVA 79
L AVL GA D ARAA+VEFSG ++VG+YLG +ED +ATHRF+A LPGY+GWQWAVVVA
Sbjct 28 LEAVLLGAVDDARAAIVEFSGEDSVGEYLGAGFEDSTSATHRFLAELPGYRGWQWAVVVA 87
Query 80 SYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTA 139
+ GA ATISEVVLVPGPTALLAP WVPWE+RVRPGDLSPGDLLAP DDPRL PGY A
Sbjct 88 ACPGAGRATISEVVLVPGPTALLAPQWVPWEERVRPGDLSPGDLLAPPADDPRLAPGYAA 147
Query 140 SGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLP 199
+GD Q+DE A E+GLGRR V+S WGR +AQRWHDG+YGPGSAMAR+T R+CRDCG+++P
Sbjct 148 TGDPQIDEVAVEVGLGRRQVLSLWGRNDTAQRWHDGEYGPGSAMARATWRMCRDCGYYVP 207
Query 200 LAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIEKP 259
L G+LG MFGVC NE +ADGHVVD ++GCGAHSDT A AG +P+Y+PYDDGVLD++++P
Sbjct 208 LGGALGVMFGVCANEYAADGHVVDAEFGCGAHSDTPAAAGTGSPLYDPYDDGVLDVVDRP 267
Query 260 AES 262
S
Sbjct 268 QAS 270
>gi|333989443|ref|YP_004522057.1| hypothetical protein JDM601_0804 [Mycobacterium sp. JDM601]
gi|333485412|gb|AEF34804.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=295
Score = 350 bits (897), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 175/238 (74%), Positives = 203/238 (86%), Gaps = 0/238 (0%)
Query 20 LAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVA 79
+ AVL A D+ARAAVVEFSG + VG++LGV YED AATHRF A LPGYQGWQWAVVVA
Sbjct 1 MPAVLAAAVDEARAAVVEFSGADTVGEHLGVDYEDATAATHRFGAVLPGYQGWQWAVVVA 60
Query 80 SYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTA 139
++ GA H+T+SEVVLVPGP ALL+P+WVPW+QRVRPGDL PGDLLAP DDPRLVPGYT+
Sbjct 61 AFPGAAHSTVSEVVLVPGPGALLSPEWVPWDQRVRPGDLGPGDLLAPPADDPRLVPGYTS 120
Query 140 SGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLP 199
SGD +DE A E+GLGRRW++S GRA++AQRWHDGDYGP SAMARSTKRVCRDCGF+LP
Sbjct 121 SGDPDLDEVAGELGLGRRWLLSPLGRAEAAQRWHDGDYGPDSAMARSTKRVCRDCGFYLP 180
Query 200 LAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIE 257
LAG LG FGVC NE+SADG VVD YGCGAHSDT PAG +P+++PYDDGVL+I++
Sbjct 181 LAGVLGTGFGVCCNEMSADGRVVDSGYGCGAHSDTPVPAGTGSPVHDPYDDGVLEIVD 238
>gi|111021935|ref|YP_704907.1| hypothetical protein RHA1_ro04968 [Rhodococcus jostii RHA1]
gi|110821465|gb|ABG96749.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=281
Score = 313 bits (803), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 149/236 (64%), Positives = 183/236 (78%), Gaps = 1/236 (0%)
Query 23 VLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYS 82
VL A + AR A+VE VGDYLGV+ ED AATHRF+A LPGY+GWQWAVVVA+
Sbjct 22 VLTDAVELARTALVELQ-EGGVGDYLGVTAEDACAATHRFVADLPGYRGWQWAVVVAADP 80
Query 83 GADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGD 142
ADHAT+SE+ L+PGP AL+AP+W+PW+QR+RPGDLS GDLLAP DPRLVPGY A+GD
Sbjct 81 EADHATVSELALLPGPDALVAPEWIPWDQRIRPGDLSAGDLLAPPAGDPRLVPGYVATGD 140
Query 143 AQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAG 202
++DE A E+GLGR+ VMS GR +A+RWHDGDYGP S MA++ C CGF+LPLAG
Sbjct 141 PEIDEVALELGLGRKQVMSLEGRVDAAERWHDGDYGPDSEMAKAAPSTCGLCGFYLPLAG 200
Query 203 SLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIEK 258
SL A FGVCGNE++ADGHVVD YGCGAHSDT P G +P ++ YDDG ++++E+
Sbjct 201 SLHASFGVCGNEMAADGHVVDATYGCGAHSDTLLPTGAGSPRFDAYDDGAVEVVER 256
>gi|169627975|ref|YP_001701624.1| hypothetical protein MAB_0876 [Mycobacterium abscessus ATCC 19977]
gi|169239942|emb|CAM60970.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=245
Score = 307 bits (787), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 152/234 (65%), Positives = 179/234 (77%), Gaps = 3/234 (1%)
Query 24 LRGAADQARAAVVEFSGPEAVGDYLGVS--YEDGNAATHRFIAHLPGYQGWQWAVVVASY 81
L +QAR A+VEFSG E VG YLG S Y D +A THRF A LPGY+GW W VV+A+
Sbjct 11 LYTCIEQARTAIVEFSG-ETVGKYLGASSEYSDLHALTHRFEAELPGYRGWHWEVVMAAA 69
Query 82 SGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASG 141
GA AT+SEVVLVPG AL P+W+PWE+R+RPGDL PGDLLAP +DPRLVPGYT SG
Sbjct 70 PGATVATVSEVVLVPGAEALRPPNWIPWEERIRPGDLGPGDLLAPPPNDPRLVPGYTDSG 129
Query 142 DAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLA 201
D QV ETA EIGLGR+ V+S GR +AQRW DG++GP + MAR+T+RVCR CGF+LPLA
Sbjct 130 DPQVTETAGEIGLGRKQVLSLAGRIDAAQRWFDGEWGPDAEMARATRRVCRSCGFYLPLA 189
Query 202 GSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDI 255
GSLG FGVC N +SADG VV +YGCGAHSDT P + P+YEP+DDGVLD+
Sbjct 190 GSLGVAFGVCANSMSADGRVVHIEYGCGAHSDTPQPVNSAMPLYEPFDDGVLDL 243
>gi|226364445|ref|YP_002782227.1| hypothetical protein ROP_50350 [Rhodococcus opacus B4]
gi|226242934|dbj|BAH53282.1| hypothetical protein [Rhodococcus opacus B4]
Length=277
Score = 305 bits (782), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 146/236 (62%), Positives = 182/236 (78%), Gaps = 1/236 (0%)
Query 23 VLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYS 82
VL A + AR AVVE VGDYLGV+ ED AATHRF+A LPGY+GWQWAVVVA+
Sbjct 22 VLTDAVELARTAVVELQ-EGGVGDYLGVTSEDECAATHRFVADLPGYRGWQWAVVVAADP 80
Query 83 GADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGD 142
+D AT+SE+ L+PGP AL+APDW+PW+QR+RPGDLS GDLLAP DPRLVPGY A+GD
Sbjct 81 ESDRATVSELALLPGPDALVAPDWIPWDQRIRPGDLSAGDLLAPPAGDPRLVPGYVATGD 140
Query 143 AQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAG 202
++D+ A E+GLGR+ +S GR +AQRWHDGDYGP S MA++ C CGF+LPLAG
Sbjct 141 PEIDDAALELGLGRKQALSLEGRLDAAQRWHDGDYGPDSEMAKAAPSTCGLCGFYLPLAG 200
Query 203 SLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIEK 258
+L FGVCGNE++ADGHVVD YGCGAHSDT+ P+G +P Y+ YDDG ++++E+
Sbjct 201 ALHGAFGVCGNEMAADGHVVDVMYGCGAHSDTSLPSGAGSPQYDAYDDGAVEVVEQ 256
>gi|312138426|ref|YP_004005762.1| hypothetical protein REQ_09730 [Rhodococcus equi 103S]
gi|325674577|ref|ZP_08154264.1| hypothetical protein HMPREF0724_12046 [Rhodococcus equi ATCC
33707]
gi|311887765|emb|CBH47077.1| conserved hypothetical protein [Rhodococcus equi 103S]
gi|325554163|gb|EGD23838.1| hypothetical protein HMPREF0724_12046 [Rhodococcus equi ATCC
33707]
Length=299
Score = 305 bits (781), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 146/235 (63%), Positives = 180/235 (77%), Gaps = 1/235 (0%)
Query 23 VLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYS 82
VL A + AR A+V+ G VG YLGV+ ED AATHRF LPGY+GWQWAVVVA+
Sbjct 32 VLADAVELARTALVDL-GEGGVGRYLGVTAEDDCAATHRFDTELPGYRGWQWAVVVAAVP 90
Query 83 GADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGD 142
G+DH T+SE+ L+PGP AL+AP+W+PW+QRVRPGDLSPGDLLAP ++DPRLVPGY A+GD
Sbjct 91 GSDHVTVSELALLPGPDALVAPEWLPWDQRVRPGDLSPGDLLAPRENDPRLVPGYVATGD 150
Query 143 AQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAG 202
++D+ A E+GLGR+ VMS GR A RWHDGDYGP S MA++ C CGF+LPLAG
Sbjct 151 PEIDDVAFEVGLGRKQVMSREGRLDCAARWHDGDYGPDSEMAKAAPSTCDLCGFYLPLAG 210
Query 203 SLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIE 257
SL A FGVCGNE++ADGH+V +GCGAHSDTT P G TP Y+ YDDG ++ ++
Sbjct 211 SLHAAFGVCGNEMAADGHIVHAAHGCGAHSDTTLPTGAGTPRYDAYDDGAIEAVQ 265
>gi|229488311|ref|ZP_04382177.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229323815|gb|EEN89570.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=290
Score = 296 bits (758), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 149/239 (63%), Positives = 178/239 (75%), Gaps = 1/239 (0%)
Query 23 VLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYS 82
VL A + AR AVV+ VG YLGV+ ED AATHRF A +PGY+GWQWAVVVA+
Sbjct 39 VLADAVELARKAVVDLH-EGGVGAYLGVTSEDEFAATHRFAADIPGYRGWQWAVVVAAGP 97
Query 83 GADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGD 142
HATISE+ L+PGP AL+AP+W+PW+QR+RPGDLS GDLLAP +DPRLVPGY A+GD
Sbjct 98 EDTHATISELALLPGPDALVAPEWLPWDQRIRPGDLSVGDLLAPPAEDPRLVPGYVATGD 157
Query 143 AQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAG 202
+VDE A EIGLGR+ VMS GR +AQRW DGD+GP S MA++ C CGFFLPLAG
Sbjct 158 PEVDEVALEIGLGRKQVMSLEGRLDAAQRWFDGDFGPESEMAKAAPSTCGLCGFFLPLAG 217
Query 203 SLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIEKPAE 261
SL A FGVCGNELSADG VV YGCGAHSDT+ P G +P ++ +DDG +++I PA
Sbjct 218 SLHAAFGVCGNELSADGRVVSVSYGCGAHSDTSLPLGAGSPQFDAFDDGAVELIAVPAR 276
>gi|54022610|ref|YP_116852.1| hypothetical protein nfa6430 [Nocardia farcinica IFM 10152]
gi|54014118|dbj|BAD55488.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=378
Score = 296 bits (758), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 146/234 (63%), Positives = 177/234 (76%), Gaps = 2/234 (0%)
Query 23 VLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYS 82
VL A D AR A+VE P+AVG YLGV+ ED AATHRF A LPGY+GWQWAVVVA+
Sbjct 14 VLADAVDLARRALVELE-PDAVGAYLGVTAEDETAATHRFEATLPGYRGWQWAVVVAAPP 72
Query 83 GADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGD 142
GA HAT+SE L+PGP AL+AP++VPWEQR+RPGDL+PGDLLAP DDPRLVPGY A+GD
Sbjct 73 GAAHATVSESALLPGPEALVAPEFVPWEQRIRPGDLAPGDLLAPPADDPRLVPGYVANGD 132
Query 143 AQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAG 202
++DE A E+GLGR VMS GR ++A+RW+ ++GP + MA++ C CGF+LPLAG
Sbjct 133 PEIDELAREVGLGRTKVMSLEGRLEAAERWY-AEHGPDTEMAKAAPATCGTCGFYLPLAG 191
Query 203 SLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDII 256
SL A FGVCGN + ADGHVV +YGCGAHSD P G +P YE YDD +D+I
Sbjct 192 SLRAAFGVCGNAMGADGHVVHVEYGCGAHSDVELPTGDGSPRYEAYDDAAVDVI 245
>gi|226308118|ref|YP_002768078.1| hypothetical protein RER_46310 [Rhodococcus erythropolis PR4]
gi|226187235|dbj|BAH35339.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=273
Score = 293 bits (750), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/238 (62%), Positives = 177/238 (75%), Gaps = 1/238 (0%)
Query 23 VLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYS 82
VL A + AR AVV+ VG YLGV+ ED AATHRF A +PGY+GWQWAVVVA+
Sbjct 22 VLADAVELARTAVVDLH-EGGVGAYLGVTSEDEFAATHRFAADIPGYRGWQWAVVVAAGP 80
Query 83 GADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGD 142
AT+SE+ L+PGP AL+AP+W+PW+QR+RPGDLS GDLLAP +DPRLVPGY A+GD
Sbjct 81 EDTRATVSELALLPGPDALVAPEWLPWDQRIRPGDLSVGDLLAPPAEDPRLVPGYVATGD 140
Query 143 AQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAG 202
+VDE A EIGLGR+ VMS GR +AQRW DGD+GP S MA++ C CGFFLPLAG
Sbjct 141 PEVDEVALEIGLGRKQVMSLEGRLDAAQRWFDGDFGPESEMAKAAPSTCGLCGFFLPLAG 200
Query 203 SLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIEKPA 260
SL A FGVCGNELSADG VV YGCGAHSDT+ P G +P ++ +DDG +++I PA
Sbjct 201 SLHAAFGVCGNELSADGRVVSVSYGCGAHSDTSLPLGAGSPQFDAFDDGAVELIAVPA 258
>gi|333918514|ref|YP_004492095.1| hypothetical protein AS9A_0843 [Amycolicicoccus subflavus DQS3-9A1]
gi|333480735|gb|AEF39295.1| hypothetical protein AS9A_0843 [Amycolicicoccus subflavus DQS3-9A1]
Length=242
Score = 263 bits (672), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/235 (59%), Positives = 169/235 (72%), Gaps = 0/235 (0%)
Query 23 VLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYS 82
+L A D AR AV + AVG++ GV E AA+HRF A LPGY+GW+W VVVA+
Sbjct 1 MLADAIDLARDAVRAIADEAAVGEHHGVVPEGEWAASHRFAASLPGYRGWEWNVVVAACP 60
Query 83 GADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGD 142
GA AT+SE+ L+PG ALLAP+WVPWE R+ GDL PGDLL P D RLVPGY +GD
Sbjct 61 GAATATVSELALLPGADALLAPEWVPWEDRIESGDLMPGDLLPPKHHDERLVPGYIETGD 120
Query 143 AQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAG 202
VDE AAEIG GR VMS GR +A+RW +GD+GP +AMA + C CGF+LPLAG
Sbjct 121 PAVDEAAAEIGFGRPQVMSLEGRLAAAERWTEGDFGPHAAMAAAAPGTCGTCGFYLPLAG 180
Query 203 SLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIE 257
SL A FGVCGNELSADGHVV ++GCGAHSDT P+G +P Y+PYDDGV+++++
Sbjct 181 SLRASFGVCGNELSADGHVVHARFGCGAHSDTELPSGAGSPQYDPYDDGVVEVMD 235
>gi|300790292|ref|YP_003770583.1| hypothetical protein AMED_8485 [Amycolatopsis mediterranei U32]
gi|299799806|gb|ADJ50181.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340531973|gb|AEK47178.1| hypothetical protein RAM_43555 [Amycolatopsis mediterranei S699]
Length=596
Score = 254 bits (648), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/239 (54%), Positives = 165/239 (70%), Gaps = 2/239 (0%)
Query 24 LRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSG 83
L A + ARAAV+E + E +G ++GV+ ED A+H F A +PGY GW+W+V VA+
Sbjct 16 LAEAVEFARAAVLEDAPEEQLGAHVGVTREDAVTASHLFEAQVPGYGGWRWSVTVAAAGE 75
Query 84 ADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDA 143
+ T+SEVVLVPGP+AL+AP WVPWE+RVR GDL GD+ A++DPRLVP Y S D
Sbjct 76 DEPVTVSEVVLVPGPSALVAPAWVPWERRVRAGDLGVGDIFPTAENDPRLVPAYLQSDDP 135
Query 144 QVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGS 203
V+E A E GLGR +S +GR ++A RWH G++GP S MARS VC CGFF+PLAGS
Sbjct 136 AVEEVAHEAGLGRVHALSRFGRTEAAARWHAGEFGPRSDMARSAPDVCGTCGFFVPLAGS 195
Query 204 LGAMFGVCGNELS-ADGHVVDRQYGCGAHSDTTAPAGGSTPIYE-PYDDGVLDIIEKPA 260
L +FGVCGN+++ ADGHVVD +YGCGAHS+ S P+ E YDD +LD PA
Sbjct 196 LRGVFGVCGNDIAPADGHVVDVEYGCGAHSEVEVEVTSSVPVAELVYDDSLLDFAPAPA 254
>gi|296138561|ref|YP_003645804.1| hypothetical protein Tpau_0829 [Tsukamurella paurometabola DSM
20162]
gi|296026695|gb|ADG77465.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=242
Score = 248 bits (634), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 131/235 (56%), Positives = 158/235 (68%), Gaps = 4/235 (1%)
Query 27 AADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADH 86
A D ARAA+ G AVG ++ ED A H F A LP Y+GWQW V+A+ G +
Sbjct 10 AVDLARAALEADEG-AAVGAHVATVVEDEFAVAHYFEADLPAYRGWQWCAVLAATPGGE- 67
Query 87 ATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVD 146
T+SE L+PGP +L AP+WVPW+QRVR GDL PGD+L P +DDPR+ PGY SGD + D
Sbjct 68 PTVSETALLPGPDSLTAPEWVPWDQRVRAGDLHPGDVLPPREDDPRIEPGYLLSGDPEAD 127
Query 147 ETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGA 206
A EIG+ VMS GR +A+RW G YGP S MARST+ C DC F+LPLAG+L A
Sbjct 128 AVAGEIGVNLERVMSRVGRVDAAERWALGPYGPDSEMARSTRYHCGDCAFYLPLAGALRA 187
Query 207 MFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIEKPAE 261
FGVCGNE SADGHVV YGCGAHS AP+G +P YEPYDDG ++ + PAE
Sbjct 188 SFGVCGNEFSADGHVVHAHYGCGAHSSVPAPSGQGSPAYEPYDDGAVEKV--PAE 240
>gi|319949873|ref|ZP_08023882.1| hypothetical protein ES5_10287 [Dietzia cinnamea P4]
gi|319436463|gb|EFV91574.1| hypothetical protein ES5_10287 [Dietzia cinnamea P4]
Length=303
Score = 246 bits (628), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 130/261 (50%), Positives = 168/261 (65%), Gaps = 7/261 (2%)
Query 5 TEESAVATVADWPEGLAAVLR----GAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATH 60
T++ A D G + LR A D ARAA EF+ VG++LG + E G TH
Sbjct 10 TDQGATVAGTDEESGFDSELRERLASAVDVARAATEEFA-ITGVGEHLGTTVEAGYTTTH 68
Query 61 RFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSP 120
RF + LPGY+GW WA V+A G + T+ E+ L+PG AL+AP+WVPWE+R+RPGDL
Sbjct 69 RFASELPGYRGWYWACVLALVPGGE-VTVDEIALLPGDDALVAPEWVPWEKRIRPGDLGA 127
Query 121 GDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPG 180
GDLL PA+DD RLVPGY +SGD +DE A IGLGR +S GR+ +A+RW + GP
Sbjct 128 GDLLPPAEDDERLVPGYVSSGDEALDEAAGPIGLGRPRHLSWQGRSAAAERW-TAERGPD 186
Query 181 SAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG 240
+ +ARS K C CGF +PLAGSLG MFG+C NE SADG V +YGCGAHS+T P
Sbjct 187 TEIARSAKHHCGTCGFLVPLAGSLGTMFGICANEYSADGQTVHLEYGCGAHSETQVPKDT 246
Query 241 STPIYEPYDDGVLDIIEKPAE 261
+ P+ E YDD +D++ P +
Sbjct 247 TPPVPEAYDDAAVDVVVLPQQ 267
>gi|257057359|ref|YP_003135191.1| hypothetical protein Svir_34000 [Saccharomonospora viridis DSM
43017]
gi|256587231|gb|ACU98364.1| hypothetical protein Svir_34000 [Saccharomonospora viridis DSM
43017]
Length=329
Score = 243 bits (621), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 124/234 (53%), Positives = 157/234 (68%), Gaps = 2/234 (0%)
Query 24 LRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSG 83
L A + AR A V+ +G + VGDY+G +ED + TH F + LPGY+GW+W+V VA+
Sbjct 16 LLDAVEPAREAAVQEAGDDNVGDYVGAVHEDAVSVTHLFDSTLPGYRGWRWSVTVATADE 75
Query 84 ADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDA 143
TISEVVL PGP A++AP WVPWE+RVRPGDL GD+ DDPRL P Y D
Sbjct 76 HAPVTISEVVLTPGPDAIVAPRWVPWERRVRPGDLGVGDIFPTPPDDPRLAPAYATLDDP 135
Query 144 QVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGS 203
+ +E E+GLGR VMS +GR ++A RW+ +YGP S MARS C CGF+L LAGS
Sbjct 136 EAEEAVREVGLGRVRVMSRYGRQEAATRWYRSEYGPRSDMARSAPAACGTCGFYLQLAGS 195
Query 204 LGAMFGVCGNELS-ADGHVVDRQYGCGAHSDTTAPAGGSTPIYE-PYDDGVLDI 255
L + FGVCGNE+S ADGHVV ++GCGAHS+ AG S P+ E YDD +LD+
Sbjct 196 LRSAFGVCGNEISPADGHVVHAEFGCGAHSEVRLEAGSSVPVAELVYDDSLLDM 249
>gi|326384171|ref|ZP_08205853.1| hypothetical protein SCNU_14606 [Gordonia neofelifaecis NRRL
B-59395]
gi|326197036|gb|EGD54228.1| hypothetical protein SCNU_14606 [Gordonia neofelifaecis NRRL
B-59395]
Length=260
Score = 239 bits (611), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 121/221 (55%), Positives = 152/221 (69%), Gaps = 6/221 (2%)
Query 42 EAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTAL 101
E G++LG E AATH F A +PGY+GWQW VVVA + T+SE VL+PG AL
Sbjct 26 ETPGEHLGARAEGEYAATHYFAAQVPGYRGWQWCVVVAGAPDSAELTVSETVLLPGDGAL 85
Query 102 LAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDA------QVDETAAEIGLG 155
LAP+WVPW RV PGDL PGDLLA DDPRLVPG + D QV + +AEIGLG
Sbjct 86 LAPEWVPWVDRVAPGDLGPGDLLAAPVDDPRLVPGQIDTLDIDPIDADQVGQVSAEIGLG 145
Query 156 RRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNEL 215
R+ ++S GRA +AQRW+DG++GP SAMAR+ + C CGF+LP+AG+L FGVC NEL
Sbjct 146 RKRLLSFDGRADAAQRWYDGEFGPDSAMARNARHSCGTCGFYLPIAGALHGAFGVCANEL 205
Query 216 SADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDII 256
+ADG V +YGCGAHSD AG +P Y+ +DDG ++I+
Sbjct 206 AADGRAVSAEYGCGAHSDVRPAAGNGSPAYDAFDDGAVEIV 246
>gi|302530548|ref|ZP_07282890.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302439443|gb|EFL11259.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=389
Score = 234 bits (598), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 122/234 (53%), Positives = 153/234 (66%), Gaps = 2/234 (0%)
Query 24 LRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSG 83
L A + AR AV+ + + VG+++GV ED +A+H F A +PGY GW+W+V VA
Sbjct 16 LADAVEVAREAVLAEAPADQVGEHVGVEREDAVSASHLFEASVPGYGGWRWSVTVAVAGP 75
Query 84 ADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDA 143
+ T+SE+VL PGP AL+AP WVPWEQRVR GDL GDL KDDPRL P Y S D
Sbjct 76 DEPVTVSELVLQPGPEALVAPAWVPWEQRVRAGDLGVGDLFPADKDDPRLSPAYLQSDDP 135
Query 144 QVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGS 203
V+E A E+GLGR V+S +GR +A RWH G++GP S MARS C CGFF+PLAGS
Sbjct 136 AVEEAAMEVGLGRVHVLSRYGRLDAAARWHSGEFGPRSDMARSAPATCGTCGFFVPLAGS 195
Query 204 LGAMFGVCGNELS-ADGHVVDRQYGCGAHSDTTAPAGGSTPIYE-PYDDGVLDI 255
L FG C N+++ ADGHVVD YGCGAHS+ S P+ E YDD ++D
Sbjct 196 LRGSFGACTNDIAPADGHVVDVAYGCGAHSEVRVEVTSSVPVAELVYDDSLIDF 249
>gi|317509529|ref|ZP_07967132.1| hypothetical protein HMPREF9336_03504 [Segniliparus rugosus ATCC
BAA-974]
gi|316252175|gb|EFV11642.1| hypothetical protein HMPREF9336_03504 [Segniliparus rugosus ATCC
BAA-974]
Length=257
Score = 233 bits (593), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 124/253 (50%), Positives = 158/253 (63%), Gaps = 5/253 (1%)
Query 10 VATVADWP--EGLAAVLRGAADQARAAVVEFSG---PEAVGDYLGVSYEDGNAATHRFIA 64
+ T+ D P E L A L A D A A+ E G VG +LG + E + TH F +
Sbjct 2 LITMVDAPADEELLARLEQAVDLAADALREQFGDGDSPVVGAHLGSAREGSTSVTHLFAS 61
Query 65 HLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLL 124
LPGY+GW+W+ V+A G D T+SEV L+PGP AL+AP+WVPWE+RV+ GDL GDLL
Sbjct 62 LLPGYRGWRWSAVLAGCPGQDEITVSEVALLPGPDALIAPEWVPWERRVQAGDLGVGDLL 121
Query 125 APAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMA 184
DD RLVP + +SGD +DE + + +GR +S GRAQ+AQRWHDG +GP + MA
Sbjct 122 PTPADDRRLVPAHASSGDEDLDELMSLVDIGRSKTLSIMGRAQAAQRWHDGLFGPSAPMA 181
Query 185 RSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPI 244
+ C CGFF+PL GSL A FGVC NE SADG VVD YGCGAHSD + S P
Sbjct 182 VVAPKRCAQCGFFIPLEGSLSASFGVCANEYSADGRVVDANYGCGAHSDVVVESDLSIPQ 241
Query 245 YEPYDDGVLDIIE 257
+GV +++E
Sbjct 242 KAQVAEGVAELVE 254
>gi|331699389|ref|YP_004335628.1| hypothetical protein Psed_5647 [Pseudonocardia dioxanivorans
CB1190]
gi|326954078|gb|AEA27775.1| hypothetical protein Psed_5647 [Pseudonocardia dioxanivorans
CB1190]
Length=277
Score = 230 bits (587), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 134/251 (54%), Positives = 164/251 (66%), Gaps = 13/251 (5%)
Query 23 VLRGAADQARAAVVEFSGPEA--------VGDYLGVSYEDGNAA-THRFIAHLPGYQGWQ 73
+L A D ARAA VE +G +A VG++L EDG AA TH F A PGY+GW+
Sbjct 19 LLADAVDLARAAAVEEAGSDANPTEAAAAVGEHLCGVAEDGGAAYTHFFAATRPGYRGWR 78
Query 74 WAVVVASYSGAD-HATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPR 132
WAV +A+ S D AT+SEVV++PGP AL+AP WVPW++RVRPGDL GDLL DD R
Sbjct 79 WAVTLAAGSAEDGTATVSEVVMLPGPDALVAPTWVPWQERVRPGDLGVGDLLPSPPDDAR 138
Query 133 LVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCR 192
LVPGY S D V+E A E+GLGR V+S +GR ++A RW +G GP + +ARS VC
Sbjct 139 LVPGYVESDDPAVEEVALEVGLGRTRVLSRFGRTEAAARWQEGPRGPSAPIARSAPGVCG 198
Query 193 DCGFFLPLAGSLGAMFGVCGNELS-ADGHVVDRQYGCGAHSDTTAPAGGSTPIYE-PYDD 250
CGFF+PLAGSL FGVC NE S DG VV +YGCGAHSD G + YDD
Sbjct 199 TCGFFVPLAGSLRGGFGVCANEFSPGDGAVVAVEYGCGAHSDVVVEPGSPVQVAALVYDD 258
Query 251 GV-LDIIEKPA 260
GV L+ +E+PA
Sbjct 259 GVDLETVERPA 269
>gi|296395230|ref|YP_003660114.1| hypothetical protein Srot_2852 [Segniliparus rotundus DSM 44985]
gi|296182377|gb|ADG99283.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=258
Score = 229 bits (585), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 114/215 (54%), Positives = 139/215 (65%), Gaps = 0/215 (0%)
Query 43 AVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALL 102
AVG YLG E + TH F + LPGY+GW+W+VVVA G D TISE L+PGP AL
Sbjct 39 AVGAYLGSLREGSTSVTHLFESLLPGYRGWRWSVVVAGCEGQDEITISEFALLPGPDALT 98
Query 103 APDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSA 162
AP+WVPWE+RVR GDL GDLL +DD RLVP + +SGD +DE + I GR +S
Sbjct 99 APEWVPWERRVRAGDLGVGDLLPTPQDDSRLVPAHASSGDEDLDELMSPIDTGRSRTLSI 158
Query 163 WGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVV 222
GRAQ+AQRWHDG +GP + MA + C CGF++PL GSL FGVC NE SADG VV
Sbjct 159 LGRAQAAQRWHDGLFGPSAPMAVVAPKRCSHCGFYVPLQGSLSVSFGVCANEYSADGRVV 218
Query 223 DRQYGCGAHSDTTAPAGGSTPIYEPYDDGVLDIIE 257
D YGCGAHSD + P +GV +++E
Sbjct 219 DANYGCGAHSDVVVEPEAAIPPKPHLAEGVAELVE 253
>gi|343928207|ref|ZP_08767662.1| hypothetical protein GOALK_110_00480 [Gordonia alkanivorans NBRC
16433]
gi|343761905|dbj|GAA14588.1| hypothetical protein GOALK_110_00480 [Gordonia alkanivorans NBRC
16433]
Length=262
Score = 229 bits (584), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 126/244 (52%), Positives = 157/244 (65%), Gaps = 13/244 (5%)
Query 24 LRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSG 83
L A D ARAA+V+ + G + E AA H F A L GY+GWQW VV+A G
Sbjct 10 LLAAVDIARAALVDEG--QQPGAHRRSVAEGEWAAAHYFDAELAGYRGWQWCVVLAGSPG 67
Query 84 ADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDA 143
+D T+SEVVL+PG +LLAP WVPW +RV GDL+PGDLLA DDPRLVP +GD
Sbjct 68 SDEITLSEVVLLPGDGSLLAPPWVPWAERVASGDLAPGDLLAAEPDDPRLVPNQIDTGDE 127
Query 144 -----------QVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCR 192
+ + A EIGLGRR ++S GRA +AQRW+DGD+GP S MA++ C
Sbjct 128 FRFDSESEDPDDIGQIAGEIGLGRRRLLSYDGRADAAQRWYDGDFGPSSEMAQAAPFPCC 187
Query 193 DCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDDGV 252
CGF++PLAG+L A FG C NE +ADG VV +YGCGAHSD AP G +P Y+ YDDG
Sbjct 188 TCGFYVPLAGALRAGFGACTNEYAADGRVVSAEYGCGAHSDVQAPKGEGSPAYDAYDDGA 247
Query 253 LDII 256
L++I
Sbjct 248 LEVI 251
>gi|262201220|ref|YP_003272428.1| hypothetical protein Gbro_1239 [Gordonia bronchialis DSM 43247]
gi|262084567|gb|ACY20535.1| hypothetical protein Gbro_1239 [Gordonia bronchialis DSM 43247]
Length=299
Score = 227 bits (578), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 112/212 (53%), Positives = 146/212 (69%), Gaps = 12/212 (5%)
Query 57 AATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPG 116
AA H F A L GY+GWQW VV+A+ G++ T+SEVVL+PG +LLAP+WVPW R+ G
Sbjct 75 AAAHYFDADLAGYRGWQWCVVLATSPGSEEITVSEVVLLPGEGSLLAPEWVPWVDRIATG 134
Query 117 DLSPGDLLAPAKDDPRLVPGYTASGDA-----------QVDETAAEIGLGRRWVMSAWGR 165
DL+PGDLLA DDPRLVP +GD + + + EIGLGRR ++S GR
Sbjct 135 DLTPGDLLAAEPDDPRLVPNQVDTGDEFRFSSDEIDPDEFGQLSGEIGLGRRRLLSPQGR 194
Query 166 AQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQ 225
+AQRW+DGD+GPGSAMA++ + C CGF++PL+G+L A FG C NE +ADG VV +
Sbjct 195 DDAAQRWYDGDFGPGSAMAQAAQFSCCTCGFYIPLSGALHAAFGACANEFAADGRVVSAE 254
Query 226 YGCGAHSDTTAPAGGS-TPIYEPYDDGVLDII 256
YGCGAHSD P GG +P Y+ YDDG L+++
Sbjct 255 YGCGAHSDVPPPKGGDGSPAYDAYDDGALEVV 286
>gi|256374638|ref|YP_003098298.1| hypothetical protein Amir_0485 [Actinosynnema mirum DSM 43827]
gi|255918941|gb|ACU34452.1| hypothetical protein Amir_0485 [Actinosynnema mirum DSM 43827]
Length=308
Score = 225 bits (573), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 118/227 (52%), Positives = 150/227 (67%), Gaps = 2/227 (0%)
Query 31 ARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADHATIS 90
AR A E +G E VG ++GV ED + TH F A GY+GW+WAV +A+ ++S
Sbjct 28 ARDAAQEEAGSEHVGAHVGVLVEDETSVTHFFEADHAGYRGWRWAVTLATAGEGSPVSVS 87
Query 91 EVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDETAA 150
E VL+PG AL+AP WVPW +RVR GDL GDLL + DDPRL PGY S D V+E A
Sbjct 88 EAVLLPGNDALVAPQWVPWNERVRAGDLGVGDLLPASPDDPRLAPGYVGSEDPAVEEVAL 147
Query 151 EIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGV 210
E+GLGR V+S G +A+RW GD+GP S MARS C CGF+L +AG+LGA FGV
Sbjct 148 EVGLGRVRVLSREGVLDAAERWRGGDFGPRSEMARSAPAACGTCGFYLRVAGALGAAFGV 207
Query 211 CGNELS-ADGHVVDRQYGCGAHSDTTAPAGGSTPIYE-PYDDGVLDI 255
CGNEL+ ADGHVV +YGCGAHS+ G + P+ + YDD +L++
Sbjct 208 CGNELTPADGHVVHVEYGCGAHSEVEVEGGSAVPVADVVYDDALLEV 254
>gi|284989396|ref|YP_003407950.1| hypothetical protein Gobs_0815 [Geodermatophilus obscurus DSM
43160]
gi|284062641|gb|ADB73579.1| conserved hypothetical protein [Geodermatophilus obscurus DSM
43160]
Length=267
Score = 222 bits (566), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 126/248 (51%), Positives = 153/248 (62%), Gaps = 18/248 (7%)
Query 27 AADQARAAVVEFSG-PEAVGDYLGVSYED----------GNAATHRFIAHLPGYQGWQWA 75
A +QARAA VE +G P+ VG++LG + E G TH F + LPGY GW WA
Sbjct 18 AVEQARAAAVETAGSPDLVGEHLGATPEAPSAVPVGEDLGEVVTHSFASRLPGYVGWYWA 77
Query 76 VVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVP 135
V +A G + T+ EVVL+PG ALLAP WVPW +R+RPGDLS GD+L +DDPRLVP
Sbjct 78 VTLARVPGEEQVTVDEVVLLPGEQALLAPAWVPWHERLRPGDLSVGDVLPSTEDDPRLVP 137
Query 136 GYTASGDAQVDE----TAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVC 191
GYT D+ D A+E+GLGR VMS GR ++A RW G++GP SAMAR C
Sbjct 138 GYTTDDDSADDPEGSVVASEVGLGRERVMSREGREETAARWSAGEFGPRSAMARQAPGPC 197
Query 192 RDCGFFLPLAGSLGAMFGVCGNELS-ADGHVVDRQYGCGAHSDTTAPAGGSTPIYEP--Y 248
CGFFLPLAGSL FG CGN + ADG VV YGCGAHS T A +T + Y
Sbjct 198 GTCGFFLPLAGSLRHGFGACGNVYAPADGRVVTVDYGCGAHSQATLAADDATEVVRSARY 257
Query 249 DDGVLDII 256
D G D++
Sbjct 258 DTGTFDVL 265
>gi|134097208|ref|YP_001102869.1| hypothetical protein SACE_0598 [Saccharopolyspora erythraea NRRL
2338]
gi|133909831|emb|CAL99943.1| hypothetical protein SACE_0598 [Saccharopolyspora erythraea NRRL
2338]
Length=351
Score = 219 bits (559), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/223 (52%), Positives = 148/223 (67%), Gaps = 3/223 (1%)
Query 35 VVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVL 94
V+ G + VG+++G E +A TH F ++ PGY GW+WAV VA+ G + T+SEVVL
Sbjct 126 TVDAEGGDPVGEHVGFEPEGEHALTHYFESNYPGYVGWRWAVTVAAVPG-EPVTVSEVVL 184
Query 95 VPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGL 154
+PGP+AL AP+WVPW QRVRPGDL GDLL DD RL P Y A+ D V+ A E+GL
Sbjct 185 LPGPSALTAPEWVPWAQRVRPGDLGAGDLLPSGPDDHRLAPAYLANDDPAVESLAREVGL 244
Query 155 GRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNE 214
GR V+S GR ++A+RWH G++GP S +AR C CGFF+ LAGS+GA FGVC NE
Sbjct 245 GRERVLSREGRLEAAERWHGGEFGPHSEIARLAPGACGTCGFFMQLAGSMGAAFGVCANE 304
Query 215 LS-ADGHVVDRQYGCGAHSDTTAPAGGSTPIYE-PYDDGVLDI 255
++ ADG VV ++GCGAHS+ + P+ E YDD LD
Sbjct 305 IAPADGRVVHAEFGCGAHSEAEVDTSSTVPVAEVVYDDATLDF 347
>gi|291005335|ref|ZP_06563308.1| hypothetical protein SeryN2_12517 [Saccharopolyspora erythraea
NRRL 2338]
Length=368
Score = 219 bits (558), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/223 (52%), Positives = 148/223 (67%), Gaps = 3/223 (1%)
Query 35 VVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVL 94
V+ G + VG+++G E +A TH F ++ PGY GW+WAV VA+ G + T+SEVVL
Sbjct 143 TVDAEGGDPVGEHVGFEPEGEHALTHYFESNYPGYVGWRWAVTVAAVPG-EPVTVSEVVL 201
Query 95 VPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGL 154
+PGP+AL AP+WVPW QRVRPGDL GDLL DD RL P Y A+ D V+ A E+GL
Sbjct 202 LPGPSALTAPEWVPWAQRVRPGDLGAGDLLPSGPDDHRLAPAYLANDDPAVESLAREVGL 261
Query 155 GRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNE 214
GR V+S GR ++A+RWH G++GP S +AR C CGFF+ LAGS+GA FGVC NE
Sbjct 262 GRERVLSREGRLEAAERWHGGEFGPHSEIARLAPGACGTCGFFMQLAGSMGAAFGVCANE 321
Query 215 LS-ADGHVVDRQYGCGAHSDTTAPAGGSTPIYE-PYDDGVLDI 255
++ ADG VV ++GCGAHS+ + P+ E YDD LD
Sbjct 322 IAPADGRVVHAEFGCGAHSEAEVDTSSTVPVAEVVYDDATLDF 364
>gi|258654837|ref|YP_003203993.1| hypothetical protein Namu_4728 [Nakamurella multipartita DSM
44233]
gi|258558062|gb|ACV81004.1| hypothetical protein Namu_4728 [Nakamurella multipartita DSM
44233]
Length=407
Score = 207 bits (526), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 114/207 (56%), Positives = 128/207 (62%), Gaps = 3/207 (1%)
Query 30 QARAAVVEFSGPEA-VGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADHAT 88
ARAA VE +G EA VGDYLG ED A + F GY+GW W V +A A H T
Sbjct 44 MARAAAVEEAGIEAAVGDYLGARAEDAVATSASFATTDRGYRGWYWLVTIAVVE-ATHPT 102
Query 89 ISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDET 148
ISEVVL+PG ALLAP WVPW+QRVR GDL GDLL +D RLVPGY S D V E
Sbjct 103 ISEVVLLPGEGALLAPAWVPWDQRVRAGDLGVGDLLPTTPEDDRLVPGYLDSDDPAVREV 162
Query 149 AAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAMF 208
E G GR V+ GR +A RWHDG +GPG MA+ C CGF++ L G LG F
Sbjct 163 EYEFGFGRVRVLGRLGRDDAATRWHDGPFGPGEPMAQQAPAACGTCGFYIRLEGLLGQAF 222
Query 209 GVCGNELS-ADGHVVDRQYGCGAHSDT 234
G C NE S ADG VVD YGCGAHS+T
Sbjct 223 GACTNEFSPADGRVVDAAYGCGAHSET 249
>gi|302865011|ref|YP_003833648.1| hypothetical protein Micau_0505 [Micromonospora aurantiaca ATCC
27029]
gi|315501508|ref|YP_004080395.1| hypothetical protein ML5_0696 [Micromonospora sp. L5]
gi|302567870|gb|ADL44072.1| Protein of unknown function DUF3027 [Micromonospora aurantiaca
ATCC 27029]
gi|315408127|gb|ADU06244.1| Protein of unknown function DUF3027 [Micromonospora sp. L5]
Length=279
Score = 205 bits (522), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 115/242 (48%), Positives = 149/242 (62%), Gaps = 6/242 (2%)
Query 20 LAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVA 79
L V A + AR A+ E P +GD+L E TH F + GY+GW+WAV V
Sbjct 18 LDQVCAAAVEVARDAITEVE-PTDIGDHLQAVAEGDRVVTHYFECRMAGYRGWRWAVTVT 76
Query 80 SYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTA 139
+ H TI E VL+PGP ALLAP W+PW++R++PGDL PGDLL DD RL PGY
Sbjct 77 RVPRSRHVTICETVLLPGPDALLAPGWLPWQERLKPGDLGPGDLLPTPADDERLAPGYLL 136
Query 140 SGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRV--CRDCGFF 197
S D V+ETA E+GLGR V+S GRA++AQRW+DGD+GP +A++ + C CGF+
Sbjct 137 SDDPAVEETAWELGLGRARVLSREGRAEAAQRWYDGDHGPDAAISAAAPAAARCGTCGFY 196
Query 198 LPLAGSLGAMFGVCGNELSA-DGHVVDRQYGCGAHSDTTAPAGGSTPIYEP--YDDGVLD 254
LPLAGSL FG CGN + DG VV +GCGAHS+T A ++ P YDD ++
Sbjct 197 LPLAGSLRLAFGACGNFYAPDDGRVVSADHGCGAHSETMIEAAETSVDELPTVYDDSAVE 256
Query 255 II 256
+
Sbjct 257 AM 258
>gi|334338037|ref|YP_004543189.1| hypothetical protein Isova_2591 [Isoptericola variabilis 225]
gi|334108405|gb|AEG45295.1| hypothetical protein Isova_2591 [Isoptericola variabilis 225]
Length=282
Score = 204 bits (518), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 113/245 (47%), Positives = 151/245 (62%), Gaps = 4/245 (1%)
Query 22 AVLRGAADQARAAVVEFS-GPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVAS 80
AVL GA D ARAA + + P VG++LG E +HRF A +PGY+GW W V +A
Sbjct 23 AVLLGAVDLARAAAEDVAESPSDVGEHLGAVAEGERLMSHRFAAAMPGYRGWHWTVTLAR 82
Query 81 YSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTAS 140
AT+ EV L+PG A+LAP+WVPW +R+RPGDL PGD L DDPR+ PG+T
Sbjct 83 VPRGRSATVCEVELLPGDEAILAPEWVPWSERLRPGDLGPGDTLPYRPDDPRVEPGWTEM 142
Query 141 GDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPL 200
GD ++D+ A E+ L R V+S GR +A+RW+ G +GP + +A ++ C C F +PL
Sbjct 143 GDEEIDDVARELALTRARVLSPAGRDAAAERWYRGSHGPTAPVAVASAAECMTCAFLVPL 202
Query 201 AGSLGAMFGVCGNELSA-DGHVVDRQYGCGAHSDTTAPAGGST-PIYEPY-DDGVLDIIE 257
+G LG +FGVC NE S DG VV +GCGAHS+T G+ P P DD V++ +E
Sbjct 203 SGPLGQLFGVCANEWSPDDGKVVSFDHGCGAHSETDVERPGTEWPADAPLIDDHVIEPVE 262
Query 258 KPAES 262
A S
Sbjct 263 LRAAS 267
>gi|330465298|ref|YP_004403041.1| hypothetical protein VAB18032_06585 [Verrucosispora maris AB-18-032]
gi|328808269|gb|AEB42441.1| hypothetical protein VAB18032_06585 [Verrucosispora maris AB-18-032]
Length=272
Score = 203 bits (517), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 111/222 (50%), Positives = 138/222 (63%), Gaps = 4/222 (1%)
Query 20 LAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVA 79
L V A + ARA + E E VG++L E TH F L GY+GW+WAV V
Sbjct 12 LDQVCAAAVEVARAGITEVDATE-VGEHLQAVAEGDRLVTHYFECLLAGYRGWRWAVTVT 70
Query 80 SYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTA 139
+ TI E VL+PGP ALLAP W+PW++R++PGDL PGDLL DD RL PGY
Sbjct 71 RVPRSRTVTICETVLLPGPDALLAPGWLPWQERLKPGDLGPGDLLPTPADDERLAPGYLL 130
Query 140 SGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRV--CRDCGFF 197
S D V+ETA E+GLGR V+S GRA++AQRW+DGD+GP + ++ S C CGF+
Sbjct 131 SDDPAVEETAWELGLGRARVLSREGRAEAAQRWYDGDHGPSAPISTSAPAAARCGTCGFY 190
Query 198 LPLAGSLGAMFGVCGNELSA-DGHVVDRQYGCGAHSDTTAPA 238
LPLAG L FGVCGN + DG VV +GCGAHS+T A
Sbjct 191 LPLAGQLRQCFGVCGNFYAPDDGRVVSTDHGCGAHSETLVEA 232
>gi|325002210|ref|ZP_08123322.1| hypothetical protein PseP1_25761 [Pseudonocardia sp. P1]
Length=198
Score = 203 bits (516), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 106/191 (56%), Positives = 131/191 (69%), Gaps = 6/191 (3%)
Query 66 LPGYQGWQWAVVVASYSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLA 125
LPGY+GW+WAV +AS + T+SE VL+PG TAL+APDWVPW+QR+RP DL PGDLL
Sbjct 1 LPGYRGWRWAVTIASAGEGEPVTVSETVLLPGDTALVAPDWVPWDQRIRPDDLKPGDLLP 60
Query 126 PAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMAR 185
+ DDPRLVPGY SGD VD+ A E GLGR V+S GR+ +A+RW DG++GP + +AR
Sbjct 61 VSPDDPRLVPGYLDSGDPAVDDLAREAGLGRERVLSPLGRSDAAERWTDGEHGPSADLAR 120
Query 186 STKRVCRDCGFFLPLAGSLGAMFGVCGNELS-ADGHVVDRQYGCGAHSDTTAPAGGSTPI 244
S C CGF +PLAG+L FG C N S ADG VV +GCGAHS +G +P+
Sbjct 121 SAPASCGTCGFLVPLAGALHGSFGACANASSPADGQVVAVGFGCGAHSSVAETSG--SPV 178
Query 245 YEP---YDDGV 252
Y YDDGV
Sbjct 179 YVSALVYDDGV 189
>gi|152967524|ref|YP_001363308.1| hypothetical protein Krad_3581 [Kineococcus radiotolerans SRS30216]
gi|151362041|gb|ABS05044.1| conserved hypothetical protein [Kineococcus radiotolerans SRS30216]
Length=332
Score = 202 bits (513), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 103/208 (50%), Positives = 134/208 (65%), Gaps = 2/208 (0%)
Query 29 DQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADHAT 88
D AR A E + ++VG++L + E +H F LPGY+GW+W V V S A + T
Sbjct 95 DLARTAAEEVAEDDSVGEHLEATAEGDRIVSHAFACLLPGYRGWRWTVTVTRASRARNVT 154
Query 89 ISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDET 148
++EV L+PG AL AP+W+PW +R+ PGD+ P D L DDP L GY A+GDA+ DE
Sbjct 155 VNEVCLLPGEDALRAPEWLPWSERIAPGDVGPQDTLPRKADDPLLEQGYEATGDAEADEL 214
Query 149 AA-EIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAM 207
A E+GLGR V+S GR ++A RW+DGD GP + +A + K C CGFFLPLAGSL +
Sbjct 215 ALWELGLGRERVLSPLGRDEAATRWYDGDRGPRTEIAENAKAPCSTCGFFLPLAGSLRQV 274
Query 208 FGVCGNELS-ADGHVVDRQYGCGAHSDT 234
FGVC NE S D VV +GCGAHS+T
Sbjct 275 FGVCANEWSPEDARVVSADHGCGAHSET 302
Lambda K H
0.316 0.134 0.428
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 391676602750
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
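
Note on the statistics above: the following is a minimal sketch, not part of the BLAST output, of how the bit scores and Expect values in this report relate to the printed gapped parameters (Lambda = 0.267, K = 0.0410) and the effective search space (391676602750). It assumes the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = N * 2^(-S'). The function and constant names are hypothetical, chosen only for illustration. Because the report prints Lambda and K to a few significant digits, the recomputed values are approximate.

```python
import math

# Gapped Karlin-Altschul parameters copied from the statistics block of this report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
# "Effective search space used" from the same block.
EFFECTIVE_SEARCH_SPACE = 391676602750


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: S' = (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate the E-value from a bit score: E = search_space * 2**(-S')."""
    return search_space * 2.0 ** (-bits)


# Raw scores (the numbers in parentheses on the "Score =" lines) of three hits above.
for raw in (897, 803, 513):
    bits = bit_score(raw)
    print(f"raw={raw}  bits={bits:.1f}  E~{expect_value(bits):.1e}")
```

For the raw scores 897, 803 and 513, the integer part of the recomputed bit score matches the reported 350, 313 and 202 bits, which suggests the report truncates rather than rounds the bit score. The recomputed E-values should fall within the same order of magnitude as the reported 1e-94, 1e-83 and 5e-50; exact agreement is not expected given the rounding of the printed parameters.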