BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3542c
Length=311
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610678|ref|NP_218059.1| hypothetical protein Rv3542c [Mycob... 627 6e-178
gi|340628506|ref|YP_004746958.1| hypothetical protein MCAN_35531... 626 1e-177
gi|289572194|ref|ZP_06452421.1| conserved hypothetical protein [... 625 2e-177
gi|289445134|ref|ZP_06434878.1| conserved hypothetical protein [... 624 6e-177
gi|294993660|ref|ZP_06799351.1| hypothetical protein Mtub2_03906... 622 3e-176
gi|308374914|ref|ZP_07667898.1| hypothetical protein TMGG_01078 ... 575 4e-162
gi|183984997|ref|YP_001853288.1| hypothetical protein MMAR_5029 ... 540 9e-152
gi|296166571|ref|ZP_06849001.1| conserved hypothetical protein [... 534 7e-150
gi|118619288|ref|YP_907620.1| hypothetical protein MUL_4103 [Myc... 531 6e-149
gi|240172832|ref|ZP_04751491.1| hypothetical protein MkanA1_2620... 530 2e-148
gi|41406623|ref|NP_959459.1| hypothetical protein MAP0525 [Mycob... 511 4e-143
gi|342862368|ref|ZP_08719008.1| hypothetical protein MCOL_25873 ... 509 3e-142
gi|254822624|ref|ZP_05227625.1| hypothetical protein MintA_22024... 506 1e-141
gi|120406203|ref|YP_956032.1| hypothetical protein Mvan_5255 [My... 495 3e-138
gi|118468265|ref|YP_890218.1| hypothetical protein MSMEG_5992 [M... 474 9e-132
gi|145222103|ref|YP_001132781.1| hypothetical protein Mflv_1511 ... 470 1e-130
gi|126437623|ref|YP_001073314.1| hypothetical protein Mjls_5059 ... 464 6e-129
gi|108801637|ref|YP_641834.1| hypothetical protein Mmcs_4674 [My... 463 1e-128
gi|333992329|ref|YP_004524943.1| hypothetical protein JDM601_368... 454 7e-126
gi|169627719|ref|YP_001701368.1| hypothetical protein MAB_0616 [... 448 6e-124
gi|312138120|ref|YP_004005456.1| hypothetical protein REQ_06540 ... 422 5e-116
gi|325674975|ref|ZP_08154662.1| hypothetical protein HMPREF0724_... 421 6e-116
gi|229494536|ref|ZP_04388299.1| conserved hypothetical protein [... 420 1e-115
gi|343925845|ref|ZP_08765360.1| hypothetical protein GOALK_050_0... 419 4e-115
gi|326384606|ref|ZP_08206285.1| hypothetical protein SCNU_16778 ... 417 8e-115
gi|226304199|ref|YP_002764157.1| hypothetical protein RER_07100 ... 417 1e-114
gi|262200883|ref|YP_003272091.1| hypothetical protein Gbro_0885 ... 416 2e-114
gi|111021458|ref|YP_704430.1| hypothetical protein RHA1_ro04486 ... 413 1e-113
gi|226363824|ref|YP_002781606.1| hypothetical protein ROP_44140 ... 411 5e-113
gi|54022415|ref|YP_116657.1| hypothetical protein nfa4510 [Nocar... 390 1e-106
gi|319948637|ref|ZP_08022761.1| hypothetical protein ES5_04623 [... 382 4e-104
gi|296141527|ref|YP_003648770.1| hypothetical protein Tpau_3856 ... 321 8e-86
gi|300784417|ref|YP_003764708.1| hypothetical protein AMED_2511 ... 319 4e-85
gi|302525652|ref|ZP_07277994.1| conserved hypothetical protein [... 313 2e-83
gi|333918579|ref|YP_004492160.1| hypothetical protein AS9A_0908 ... 306 2e-81
gi|159038394|ref|YP_001537647.1| hypothetical protein Sare_2821 ... 289 4e-76
gi|145595148|ref|YP_001159445.1| hypothetical protein Strop_2623... 279 4e-73
gi|311744300|ref|ZP_07718104.1| conserved hypothetical protein [... 270 2e-70
gi|271969393|ref|YP_003343589.1| hypothetical protein Sros_8196 ... 269 5e-70
gi|148556220|ref|YP_001263802.1| hypothetical protein Swit_3318 ... 228 1e-57
gi|269127688|ref|YP_003301058.1| hypothetical protein Tcur_3482 ... 208 9e-52
gi|312141661|ref|YP_004008997.1| hypothetical protein REQ_43550 ... 207 2e-51
gi|111022789|ref|YP_705761.1| hypothetical protein RHA1_ro05826 ... 207 2e-51
gi|325673082|ref|ZP_08152776.1| hypothetical protein HMPREF0724_... 206 4e-51
gi|229488465|ref|ZP_04382331.1| conserved hypothetical protein [... 200 2e-49
gi|226308563|ref|YP_002768523.1| hypothetical protein RER_50760 ... 199 4e-49
gi|288918163|ref|ZP_06412519.1| protein of unknown function DUF3... 197 2e-48
gi|254384132|ref|ZP_04999477.1| conserved hypothetical protein [... 197 3e-48
gi|157961959|ref|YP_001501993.1| hypothetical protein Spea_2138 ... 196 6e-48
gi|329895411|ref|ZP_08270992.1| hypothetical protein IMCC3088_14... 190 2e-46
>gi|15610678|ref|NP_218059.1| hypothetical protein Rv3542c [Mycobacterium tuberculosis H37Rv]
gi|15843154|ref|NP_338191.1| hypothetical protein MT3646 [Mycobacterium tuberculosis CDC1551]
gi|31794718|ref|NP_857211.1| hypothetical protein Mb3572c [Mycobacterium bovis AF2122/97]
45 more sequence titles
Length=311
Score = 627 bits (1617), Expect = 6e-178, Method: Compositional matrix adjust.
Identities = 310/311 (99%), Positives = 311/311 (100%), Gaps = 0/311 (0%)
Query 1 VTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP 60
+TGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP
Sbjct 1 MTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP 60
Query 61 GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ 120
GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ
Sbjct 61 GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ 120
Query 121 VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDL 180
VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDL
Sbjct 121 VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDL 180
Query 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR
Sbjct 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
Query 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP 300
GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP
Sbjct 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP 300
Query 301 DWSLYAWEPDE 311
DWSLYAWEPDE
Sbjct 301 DWSLYAWEPDE 311
>gi|340628506|ref|YP_004746958.1| hypothetical protein MCAN_35531 [Mycobacterium canettii CIPT
140010059]
gi|340006696|emb|CCC45884.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=311
Score = 626 bits (1615), Expect = 1e-177, Method: Compositional matrix adjust.
Identities = 309/311 (99%), Positives = 311/311 (100%), Gaps = 0/311 (0%)
Query 1 VTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP 60
+TGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP
Sbjct 1 MTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP 60
Query 61 GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ 120
GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ
Sbjct 61 GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ 120
Query 121 VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDL 180
VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVP+DL
Sbjct 121 VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPEDL 180
Query 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR
Sbjct 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
Query 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP 300
GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP
Sbjct 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP 300
Query 301 DWSLYAWEPDE 311
DWSLYAWEPDE
Sbjct 301 DWSLYAWEPDE 311
>gi|289572194|ref|ZP_06452421.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339633540|ref|YP_004725182.1| hypothetical protein MAF_35540 [Mycobacterium africanum GM041182]
gi|289536625|gb|EFD41203.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339332896|emb|CCC28623.1| conserved hypothetical protein [Mycobacterium africanum GM041182]
Length=311
Score = 625 bits (1613), Expect = 2e-177, Method: Compositional matrix adjust.
Identities = 309/311 (99%), Positives = 311/311 (100%), Gaps = 0/311 (0%)
Query 1 VTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP 60
+TGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP
Sbjct 1 MTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP 60
Query 61 GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ 120
GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ
Sbjct 61 GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ 120
Query 121 VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDL 180
VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDL
Sbjct 121 VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDL 180
Query 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLR+PPVPAVWQDKSVPINYVVSSGR
Sbjct 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRYPPVPAVWQDKSVPINYVVSSGR 240
Query 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP 300
GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP
Sbjct 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP 300
Query 301 DWSLYAWEPDE 311
DWSLYAWEPDE
Sbjct 301 DWSLYAWEPDE 311
>gi|289445134|ref|ZP_06434878.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289418092|gb|EFD15293.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=311
Score = 624 bits (1609), Expect = 6e-177, Method: Compositional matrix adjust.
Identities = 309/311 (99%), Positives = 310/311 (99%), Gaps = 0/311 (0%)
Query 1 VTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP 60
+TGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP
Sbjct 1 MTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP 60
Query 61 GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ 120
GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ
Sbjct 61 GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ 120
Query 121 VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDL 180
VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDL
Sbjct 121 VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDL 180
Query 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
DPDAMM PSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR
Sbjct 181 DPDAMMCPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
Query 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP 300
GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP
Sbjct 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP 300
Query 301 DWSLYAWEPDE 311
DWSLYAWEPDE
Sbjct 301 DWSLYAWEPDE 311
>gi|294993660|ref|ZP_06799351.1| hypothetical protein Mtub2_03906 [Mycobacterium tuberculosis
210]
gi|297733208|ref|ZP_06962326.1| hypothetical protein MtubKR_19050 [Mycobacterium tuberculosis
KZN R506]
gi|306974012|ref|ZP_07486673.1| hypothetical protein TMJG_00788 [Mycobacterium tuberculosis SUMu010]
22 more sequence titles
Length=308
Score = 622 bits (1603), Expect = 3e-176, Method: Compositional matrix adjust.
Identities = 307/308 (99%), Positives = 308/308 (100%), Gaps = 0/308 (0%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
+SDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV
Sbjct 1 MSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 60
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI
Sbjct 61 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 120
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPD 183
SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPD
Sbjct 121 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPD 180
Query 184 AMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGTV 243
AMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGTV
Sbjct 181 AMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGTV 240
Query 244 FSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPDWS 303
FSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPDWS
Sbjct 241 FSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPDWS 300
Query 304 LYAWEPDE 311
LYAWEPDE
Sbjct 301 LYAWEPDE 308
>gi|308374914|ref|ZP_07667898.1| hypothetical protein TMGG_01078 [Mycobacterium tuberculosis SUMu007]
gi|308348056|gb|EFP36907.1| hypothetical protein TMGG_01078 [Mycobacterium tuberculosis SUMu007]
Length=283
Score = 575 bits (1481), Expect = 4e-162, Method: Compositional matrix adjust.
Identities = 282/283 (99%), Positives = 283/283 (100%), Gaps = 0/283 (0%)
Query 29 VNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKDDPLG 88
+NQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKDDPLG
Sbjct 1 MNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKDDPLG 60
Query 89 PIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWFINQHI 148
PIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWFINQHI
Sbjct 61 PIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWFINQHI 120
Query 149 VWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRI 208
VWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRI
Sbjct 121 VWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRI 180
Query 209 QRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVEL 268
QRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVEL
Sbjct 181 QRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVEL 240
Query 269 EEGVRMLGELRGADPARVAIGMPVRATYIDFPDWSLYAWEPDE 311
EEGVRMLGELRGADPARVAIGMPVRATYIDFPDWSLYAWEPDE
Sbjct 241 EEGVRMLGELRGADPARVAIGMPVRATYIDFPDWSLYAWEPDE 283
>gi|183984997|ref|YP_001853288.1| hypothetical protein MMAR_5029 [Mycobacterium marinum M]
gi|183178323|gb|ACC43433.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=316
Score = 540 bits (1392), Expect = 9e-152, Method: Compositional matrix adjust.
Identities = 267/315 (85%), Positives = 284/315 (91%), Gaps = 8/315 (2%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
+SDI E VAQ+KAAGPSKPR RDPVNQPMINNWVEAIGD+NPIY DDAAARA GHPGIV
Sbjct 1 MSDIHEIVAQVKAAGPSKPRAGRDPVNQPMINNWVEAIGDQNPIYADDAAARAVGHPGIV 60
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAMIQVWTMMGLGGVR DDPL PIIKLFDDAGYIGVVATNCEQTYHRYL PGE VSI
Sbjct 61 APPAMIQVWTMMGLGGVRADDDPLPPIIKLFDDAGYIGVVATNCEQTYHRYLQPGELVSI 120
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKP--AGSPSSVPDDLD 181
SAELGDVVGPKQTALGEGWFINQHI+WQVGDEDVAEMNWRILKFKP +P++VPDDLD
Sbjct 121 SAELGDVVGPKQTALGEGWFINQHIIWQVGDEDVAEMNWRILKFKPRAESAPTNVPDDLD 180
Query 182 PDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRG 241
PDAMMRPSSSRDTAFFWDGVKAHELRIQR DGSL+HPPVPAVWQDK+ PI+YVVS+G G
Sbjct 181 PDAMMRPSSSRDTAFFWDGVKAHELRIQRRPDGSLQHPPVPAVWQDKAEPIDYVVSTGTG 240
Query 242 TVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF-- 299
TVFSFVVHHAPKVPGRT+PFVIALVEL+EGVRMLGELR +P +V IGMPVR YIDF
Sbjct 241 TVFSFVVHHAPKVPGRTLPFVIALVELDEGVRMLGELRNVEPEQVEIGMPVRTVYIDFPA 300
Query 300 ----PDWSLYAWEPD 310
P+W+LYAWEPD
Sbjct 301 GASGPEWTLYAWEPD 315
>gi|296166571|ref|ZP_06849001.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898057|gb|EFG77633.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=317
Score = 534 bits (1375), Expect = 7e-150, Method: Compositional matrix adjust.
Identities = 260/315 (83%), Positives = 285/315 (91%), Gaps = 9/315 (2%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
++DI EAVA+IKAAG KPR RDPVNQPMINNWVEAIGDRNPIYVD+AAARA GHPGIV
Sbjct 1 MTDIHEAVAEIKAAGGPKPRAGRDPVNQPMINNWVEAIGDRNPIYVDEAAARAVGHPGIV 60
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAMIQVWTM GLGG RPKDDP+GPI++LFDDAGYIGVVATNCEQTYHRYL PGE VSI
Sbjct 61 APPAMIQVWTMFGLGGERPKDDPMGPIMQLFDDAGYIGVVATNCEQTYHRYLQPGEHVSI 120
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSP---SSVPDDL 180
++E+GDV+GPKQT LGEG+FINQHI+W+VGDEDVAEMNWRILKFKP S SSVP DL
Sbjct 121 TSEMGDVIGPKQTGLGEGYFINQHIIWRVGDEDVAEMNWRILKFKPRESSEPTSSVPADL 180
Query 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
DPD MMRP+SSRDTAFFW+GVKAHELRIQR DGSL+HPPVPAVWQDK+ PI+YVV+SG+
Sbjct 181 DPDVMMRPASSRDTAFFWEGVKAHELRIQRRPDGSLQHPPVPAVWQDKTAPIDYVVASGK 240
Query 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF- 299
GTVFSFVVHHAPKVPGRT+PFVIALVELEEGVRMLGELRG DPA+V IGMPVRATYIDF
Sbjct 241 GTVFSFVVHHAPKVPGRTLPFVIALVELEEGVRMLGELRGVDPAKVKIGMPVRATYIDFP 300
Query 300 -----PDWSLYAWEP 309
P+W+LYAWEP
Sbjct 301 AGDSGPEWTLYAWEP 315
>gi|118619288|ref|YP_907620.1| hypothetical protein MUL_4103 [Mycobacterium ulcerans Agy99]
gi|118571398|gb|ABL06149.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=316
Score = 531 bits (1367), Expect = 6e-149, Method: Compositional matrix adjust.
Identities = 263/315 (84%), Positives = 281/315 (90%), Gaps = 8/315 (2%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
+SDI+E VAQ+KAAGPSKPR RDPVNQPMINNWVEAIGD+NPIY DD AARA GHPGIV
Sbjct 1 MSDIREIVAQVKAAGPSKPRAGRDPVNQPMINNWVEAIGDQNPIYADDPAARAVGHPGIV 60
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAMIQVWTMMGL GVR DDPL PIIKLFDDAGYIGVVATNCEQTYHRYL GE VSI
Sbjct 61 APPAMIQVWTMMGLSGVRADDDPLPPIIKLFDDAGYIGVVATNCEQTYHRYLQLGELVSI 120
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKP--AGSPSSVPDDLD 181
SAELGDVVGPKQTALGEGWFINQHI+WQVGDEDVAEMNWRILKFKP +P++VPDDLD
Sbjct 121 SAELGDVVGPKQTALGEGWFINQHIIWQVGDEDVAEMNWRILKFKPRTESAPTNVPDDLD 180
Query 182 PDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRG 241
PDAMMRPSSSRDTAFFWDGVKAHELRIQR DGSL+HPP+PAVWQDK+ PI+YVVS G G
Sbjct 181 PDAMMRPSSSRDTAFFWDGVKAHELRIQRRPDGSLQHPPLPAVWQDKAEPIDYVVSIGTG 240
Query 242 TVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF-- 299
TVFSFVVHHAPKVPGRT+PFVIALVEL+EGVRMLGELR +P +V IGMPVR YIDF
Sbjct 241 TVFSFVVHHAPKVPGRTLPFVIALVELDEGVRMLGELRNVEPEQVEIGMPVRTVYIDFPA 300
Query 300 ----PDWSLYAWEPD 310
P+W+LYAWEPD
Sbjct 301 GASGPEWTLYAWEPD 315
>gi|240172832|ref|ZP_04751491.1| hypothetical protein MkanA1_26202 [Mycobacterium kansasii ATCC
12478]
Length=315
Score = 530 bits (1364), Expect = 2e-148, Method: Compositional matrix adjust.
Identities = 260/314 (83%), Positives = 281/314 (90%), Gaps = 7/314 (2%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
++DI+EAV QIKAAG SKPR ARDPVNQPMINNWVEAIGDRNPIYVDD AARA GHPGIV
Sbjct 1 MTDIREAVTQIKAAGSSKPRAARDPVNQPMINNWVEAIGDRNPIYVDDDAARAVGHPGIV 60
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAMIQVWTM GLGG RPKDDPLGP++KLFDDAGYIGVVATNCEQ YHRYL PGEQV+I
Sbjct 61 APPAMIQVWTMYGLGGQRPKDDPLGPVMKLFDDAGYIGVVATNCEQIYHRYLRPGEQVTI 120
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPS-SVPDDLDP 182
+AELGDVVGPKQTALGEGWF+NQH VW+VGDEDVAEM+WRILKF+PA +P +VPDDLDP
Sbjct 121 TAELGDVVGPKQTALGEGWFVNQHTVWRVGDEDVAEMDWRILKFRPATTPGVAVPDDLDP 180
Query 183 DAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGT 242
DAMMRPS SRDTAFFW+GV AHELRIQR DG+L+HPPVPAVWQ+ S PI+Y VSSGRG
Sbjct 181 DAMMRPSLSRDTAFFWEGVAAHELRIQRRPDGTLQHPPVPAVWQEPSAPIDYAVSSGRGV 240
Query 243 VFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF--- 299
VFSFVVHHAPKVPGR VPFVIALVELEEGVRMLGELR DP+RV IG PVRA YIDF
Sbjct 241 VFSFVVHHAPKVPGRIVPFVIALVELEEGVRMLGELRNVDPSRVEIGTPVRAMYIDFPAG 300
Query 300 ---PDWSLYAWEPD 310
P+W+LYAWEPD
Sbjct 301 DSGPEWTLYAWEPD 314
>gi|41406623|ref|NP_959459.1| hypothetical protein MAP0525 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118463131|ref|YP_879899.1| hypothetical protein MAV_0619 [Mycobacterium avium 104]
gi|254773576|ref|ZP_05215092.1| hypothetical protein MaviaA2_02715 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|41394972|gb|AAS02842.1| hypothetical protein MAP_0525 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118164418|gb|ABK65315.1| conserved hypothetical protein [Mycobacterium avium 104]
gi|336458413|gb|EGO37387.1| putative nucleic-acid-binding protein containing a Zn-ribbon
[Mycobacterium avium subsp. paratuberculosis S397]
Length=317
Score = 511 bits (1317), Expect = 4e-143, Method: Compositional matrix adjust.
Identities = 257/316 (82%), Positives = 286/316 (91%), Gaps = 9/316 (2%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
++DI EA+A+IKAAG KPR RDPVNQPMINNWVEAIGDRNPIYVD+AAARA GHPGIV
Sbjct 1 MTDIAEAIAEIKAAGGPKPRAGRDPVNQPMINNWVEAIGDRNPIYVDEAAARAVGHPGIV 60
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAMIQVWTM GLGG RPKDDP+GPI++LFD+AGY+GVVATNCEQTYHRYL PGEQVSI
Sbjct 61 APPAMIQVWTMFGLGGERPKDDPMGPIMELFDNAGYVGVVATNCEQTYHRYLRPGEQVSI 120
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGS---PSSVPDDL 180
++E+GDV+GPKQTALGEGWFINQHI+W+VGDEDVAEMNWRILKF+P S S VPDDL
Sbjct 121 TSEMGDVIGPKQTALGEGWFINQHIIWRVGDEDVAEMNWRILKFRPRESDAAASPVPDDL 180
Query 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
DP MMRP+SSRDTAFFW+GVKAHELRIQR DGSL+HPPVPAVWQDK+ PI+YVV+SG+
Sbjct 181 DPATMMRPASSRDTAFFWEGVKAHELRIQRRPDGSLQHPPVPAVWQDKTAPIDYVVASGK 240
Query 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF- 299
GTV+SFVVHHAPKVPGRT+PFVIALVEL EGVRMLGELRG DPARV IG+PVRATY+DF
Sbjct 241 GTVYSFVVHHAPKVPGRTLPFVIALVELAEGVRMLGELRGIDPARVRIGLPVRATYLDFP 300
Query 300 -----PDWSLYAWEPD 310
P+WSLYAWEPD
Sbjct 301 ADDNGPEWSLYAWEPD 316
>gi|342862368|ref|ZP_08719008.1| hypothetical protein MCOL_25873 [Mycobacterium colombiense CECT
3035]
gi|342130039|gb|EGT83373.1| hypothetical protein MCOL_25873 [Mycobacterium colombiense CECT
3035]
Length=320
Score = 509 bits (1310), Expect = 3e-142, Method: Compositional matrix adjust.
Identities = 247/315 (79%), Positives = 278/315 (89%), Gaps = 9/315 (2%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
++DI+EAVA+I A +KPR ARDPVNQP INNWVEA+GD NPIYVD+AAA+AAGHPGIV
Sbjct 1 MTDIKEAVAEITATVVAKPRDARDPVNQPTINNWVEALGDANPIYVDEAAAKAAGHPGIV 60
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAMIQVWTM GLGG RP DDP+GPI++LFDDAGYIGVVATNCEQTYHRYL PGE+V++
Sbjct 61 APPAMIQVWTMFGLGGERPTDDPMGPIMQLFDDAGYIGVVATNCEQTYHRYLRPGERVTV 120
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKP---AGSPSSVPDDL 180
+E+ DVVGPKQT LGEGWFINQHI W+VGDE+VAEM WRILKF+P +G+ SVP DL
Sbjct 121 HSEMRDVVGPKQTGLGEGWFINQHITWRVGDENVAEMEWRILKFRPREDSGATGSVPVDL 180
Query 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
D DAMMRP+ SRDTAFFW+GVKAHELRIQRLADG+L+HPPVPAVWQDK+ PI+Y V+SGR
Sbjct 181 DADAMMRPAVSRDTAFFWEGVKAHELRIQRLADGTLQHPPVPAVWQDKAEPIDYAVASGR 240
Query 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF- 299
GTVFSFVVHHAPKVPGRT+PFVIAL+ELEEGVRMLGELR D A V IGMPVRATYIDF
Sbjct 241 GTVFSFVVHHAPKVPGRTLPFVIALIELEEGVRMLGELRNVDHAEVKIGMPVRATYIDFP 300
Query 300 -----PDWSLYAWEP 309
P+WSLYAWEP
Sbjct 301 AGESGPEWSLYAWEP 315
>gi|254822624|ref|ZP_05227625.1| hypothetical protein MintA_22024 [Mycobacterium intracellulare
ATCC 13950]
Length=320
Score = 506 bits (1304), Expect = 1e-141, Method: Compositional matrix adjust.
Identities = 246/315 (79%), Positives = 277/315 (88%), Gaps = 9/315 (2%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
++DI+EAVA+I A + PR ARDPVNQP INNWVEA+GDRNPIYVD+AAARAAGHPGIV
Sbjct 1 MTDIKEAVAEITATVVAAPREARDPVNQPTINNWVEALGDRNPIYVDEAAARAAGHPGIV 60
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAMIQVWTM GLGG RP DDP+GPI++LFD+AGYIGVVATNCEQTYHRYL PGEQV++
Sbjct 61 APPAMIQVWTMFGLGGERPTDDPMGPIMQLFDEAGYIGVVATNCEQTYHRYLRPGEQVTV 120
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKP---AGSPSSVPDDL 180
+E+ DVVGPKQT LGEGWFINQHI W+VGDEDVAEM WRILKF+P +G+ +SVP DL
Sbjct 121 VSEMRDVVGPKQTGLGEGWFINQHITWRVGDEDVAEMAWRILKFRPREDSGATASVPQDL 180
Query 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
D DAMMRP+ SRDTAFFW+GV AHELRIQRL DG+L+HPPVPAVWQDK+ PI++ V+SGR
Sbjct 181 DADAMMRPAVSRDTAFFWEGVAAHELRIQRLPDGALQHPPVPAVWQDKAEPIDFAVASGR 240
Query 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF- 299
GTVFS+VVHHAPKVPGRT+PFVIALVELEEGVRMLGELR D A V IGMPVRATYIDF
Sbjct 241 GTVFSYVVHHAPKVPGRTLPFVIALVELEEGVRMLGELRNVDHAEVKIGMPVRATYIDFP 300
Query 300 -----PDWSLYAWEP 309
P+WSLYAWEP
Sbjct 301 AGESGPEWSLYAWEP 315
>gi|120406203|ref|YP_956032.1| hypothetical protein Mvan_5255 [Mycobacterium vanbaalenii PYR-1]
gi|119959021|gb|ABM16026.1| protein of unknown function DUF35 [Mycobacterium vanbaalenii
PYR-1]
Length=324
Score = 495 bits (1275), Expect = 3e-138, Method: Compositional matrix adjust.
Identities = 239/315 (76%), Positives = 273/315 (87%), Gaps = 8/315 (2%)
Query 3 GVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGI 62
G+SD+Q A+A+IKAAG SKPR RDPVNQPMI++WV+AIGD+NPIYVDD AA+AAGHPGI
Sbjct 8 GMSDLQSAIAEIKAAGRSKPRAGRDPVNQPMIHHWVDAIGDKNPIYVDDEAAKAAGHPGI 67
Query 63 VAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVS 122
VAPPAMIQVWTM GLG R DDPL ++KLFDDAGY+GVVATNCEQTYHRYL PGE+V+
Sbjct 68 VAPPAMIQVWTMGGLGQGRSDDDPLSKMMKLFDDAGYVGVVATNCEQTYHRYLQPGEEVT 127
Query 123 ISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPA--GSPSSVPDDL 180
I+AEL DV+GPKQTALGEG+FINQ I WQVGDEDVAEM WRI+KFKPA + VP+DL
Sbjct 128 IAAELTDVIGPKQTALGEGFFINQKITWQVGDEDVAEMMWRIMKFKPAERAKGAQVPEDL 187
Query 181 DPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR 240
DPD +MRP+SSRDT FFWDGV AHELRIQR DG+L+HPPVPAVWQ + PI+Y V+SG
Sbjct 188 DPDKLMRPASSRDTQFFWDGVNAHELRIQRRPDGTLQHPPVPAVWQSQDAPIDYAVASGN 247
Query 241 GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP 300
GTV+S+VVHHAPKVPGRT+PFVIALVEL+EGVRMLGELRG DP +V IG+PVRATYIDFP
Sbjct 248 GTVYSYVVHHAPKVPGRTLPFVIALVELDEGVRMLGELRGVDPDQVEIGLPVRATYIDFP 307
Query 301 D------WSLYAWEP 309
D W+LYAWEP
Sbjct 308 DSEVSPAWTLYAWEP 322
>gi|118468265|ref|YP_890218.1| hypothetical protein MSMEG_5992 [Mycobacterium smegmatis str.
MC2 155]
gi|118169552|gb|ABK70448.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=321
Score = 474 bits (1219), Expect = 9e-132, Method: Compositional matrix adjust.
Identities = 232/313 (75%), Positives = 261/313 (84%), Gaps = 8/313 (2%)
Query 5 SDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVA 64
D+Q + +IKA G S PR RDPVNQPMI++WV+AIGD+NPIYVD AA+AAGHPGIVA
Sbjct 7 EDLQAGIEKIKAEGKSDPRKGRDPVNQPMIHHWVDAIGDKNPIYVDQEAAKAAGHPGIVA 66
Query 65 PPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSIS 124
PPAMIQVWTM GLG R DDPL I+KLFDDAGY+GVVATNCEQTYHRYL PGE+++I
Sbjct 67 PPAMIQVWTMGGLGAGRSDDDPLSKIMKLFDDAGYVGVVATNCEQTYHRYLQPGEELTIH 126
Query 125 AELGDVVGPKQTALGEGWFINQHIVWQVGD-EDVAEMNWRILKFKP-AGSPSSVPDDLDP 182
AE+ DVVGPKQTALGEG+FINQ I W + E VAEMNWRI+KFKP +VPDDLDP
Sbjct 127 AEITDVVGPKQTALGEGYFINQLITWTTEEGEAVAEMNWRIMKFKPREDQKPAVPDDLDP 186
Query 183 DAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGT 242
D +MRP+SSRDT FFWDGV AHELRIQR DG+L+HPPVPAVWQDK PI+YVV+SG GT
Sbjct 187 DKLMRPASSRDTQFFWDGVNAHELRIQRRPDGTLQHPPVPAVWQDKEQPIDYVVASGNGT 246
Query 243 VFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPD- 301
VFS+VVHHAPKVPGRT+PFVIALVELEEGVRMLGELR DP +V IGMPVRAT+IDFPD
Sbjct 247 VFSYVVHHAPKVPGRTLPFVIALVELEEGVRMLGELRNVDPDQVQIGMPVRATFIDFPDS 306
Query 302 -----WSLYAWEP 309
W+LYAWEP
Sbjct 307 EISPAWTLYAWEP 319
>gi|145222103|ref|YP_001132781.1| hypothetical protein Mflv_1511 [Mycobacterium gilvum PYR-GCK]
gi|315442543|ref|YP_004075422.1| nucleic-acid-binding protein containing a Zn-ribbon [Mycobacterium
sp. Spyr1]
gi|145214589|gb|ABP43993.1| protein of unknown function DUF35 [Mycobacterium gilvum PYR-GCK]
gi|315260846|gb|ADT97587.1| predicted nucleic-acid-binding protein containing a Zn-ribbon
[Mycobacterium sp. Spyr1]
Length=317
Score = 470 bits (1210), Expect = 1e-130, Method: Compositional matrix adjust.
Identities = 232/313 (75%), Positives = 264/313 (85%), Gaps = 8/313 (2%)
Query 5 SDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVA 64
+D+Q + +I AAG S+PR RDPVNQPMI++WV+AIGDRNPIYVDD AARAAGHPGIVA
Sbjct 3 ADLQAGIEKITAAGRSEPRAGRDPVNQPMIHHWVDAIGDRNPIYVDDEAARAAGHPGIVA 62
Query 65 PPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSIS 124
PPAMIQVWTM GLG R DDPL I++LFD+AGY+GVVATNCEQTYHRYL PGE+++I
Sbjct 63 PPAMIQVWTMGGLGQGRSDDDPLSKIMELFDEAGYVGVVATNCEQTYHRYLQPGEELTIH 122
Query 125 AELGDVVGPKQTALGEGWFINQHIVWQVGD-EDVAEMNWRILKFKP-AGSPSSVPDDLDP 182
A++ DVVGPKQTALGEG+FINQ I W VGD E VAEMNWRI+KFKP A VP+DLDP
Sbjct 123 ADITDVVGPKQTALGEGFFINQLITWTVGDGETVAEMNWRIMKFKPRAEDKPPVPEDLDP 182
Query 183 DAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGT 242
D +MRP+SS+DT FFWDGV AHELRIQR DGSL+HPPVPAVW DK P +YVV+SG GT
Sbjct 183 DKLMRPASSKDTRFFWDGVNAHELRIQRRPDGSLQHPPVPAVWADKDAPTDYVVASGNGT 242
Query 243 VFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPD- 301
VFSFVVHHAP+VPGR++PFVIALVEL+EGVRMLGELR DPA V IGMPVRATY+DFPD
Sbjct 243 VFSFVVHHAPQVPGRSLPFVIALVELDEGVRMLGELRNVDPADVQIGMPVRATYLDFPDS 302
Query 302 -----WSLYAWEP 309
W+LYAWEP
Sbjct 303 DVSPAWTLYAWEP 315
>gi|126437623|ref|YP_001073314.1| hypothetical protein Mjls_5059 [Mycobacterium sp. JLS]
gi|126237423|gb|ABO00824.1| protein of unknown function DUF35 [Mycobacterium sp. JLS]
Length=317
Score = 464 bits (1195), Expect = 6e-129, Method: Compositional matrix adjust.
Identities = 226/313 (73%), Positives = 264/313 (85%), Gaps = 8/313 (2%)
Query 5 SDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVA 64
+D+Q + +IKA G S+PR RDPVNQPMI++WV+AIGD+NPIYVDD AA+AAGHPGIVA
Sbjct 3 ADLQPDIEKIKAEGRSEPRAGRDPVNQPMIHHWVDAIGDKNPIYVDDEAAKAAGHPGIVA 62
Query 65 PPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSIS 124
PPAMIQVWTM GLG R DDPL ++ LFD+AGY+GVVATNCEQTYHRYL PGE+++I
Sbjct 63 PPAMIQVWTMGGLGQGRSDDDPLSKMMNLFDEAGYVGVVATNCEQTYHRYLQPGEEITIH 122
Query 125 AELGDVVGPKQTALGEGWFINQHIVWQV-GDEDVAEMNWRILKFKP-AGSPSSVPDDLDP 182
A++ DVVGPKQTALGEG+FINQ I W V G E VAEMNWRI+KFKP A +VP+DLDP
Sbjct 123 ADITDVVGPKQTALGEGYFINQLITWTVDGGETVAEMNWRIMKFKPKADDRPAVPEDLDP 182
Query 183 DAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGT 242
D +MRP+SSRDT FFWDGV AHELRIQ DG+L+HPPVPAVW DK P++YVV++G GT
Sbjct 183 DKLMRPASSRDTQFFWDGVNAHELRIQTRPDGTLQHPPVPAVWADKDAPVDYVVAAGTGT 242
Query 243 VFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPD- 301
V+S+VVHHAPKVPGR++PFVIALVEL+EGVRMLGELRG DP+ V IGMPVRATYIDFPD
Sbjct 243 VYSYVVHHAPKVPGRSLPFVIALVELDEGVRMLGELRGVDPSEVQIGMPVRATYIDFPDS 302
Query 302 -----WSLYAWEP 309
W+LYAWEP
Sbjct 303 DVSPAWTLYAWEP 315
>gi|108801637|ref|YP_641834.1| hypothetical protein Mmcs_4674 [Mycobacterium sp. MCS]
gi|119870788|ref|YP_940740.1| hypothetical protein Mkms_4760 [Mycobacterium sp. KMS]
gi|108772056|gb|ABG10778.1| protein of unknown function DUF35 [Mycobacterium sp. MCS]
gi|119696877|gb|ABL93950.1| protein of unknown function DUF35 [Mycobacterium sp. KMS]
Length=318
Score = 463 bits (1192), Expect = 1e-128, Method: Compositional matrix adjust.
Identities = 224/312 (72%), Positives = 264/312 (85%), Gaps = 8/312 (2%)
Query 6 DIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAP 65
D+Q + +IKA G S+PR RDPVNQPMI++WV+AIGD+NPIYVD+ AA+AAGHPGIVAP
Sbjct 5 DLQPDIEKIKAEGKSEPRAGRDPVNQPMIHHWVDAIGDKNPIYVDEEAAKAAGHPGIVAP 64
Query 66 PAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISA 125
PAMIQVWTM GLG R DDPL ++ LFDDAGY+GVVATNCEQTYHRYL PGE+++I A
Sbjct 65 PAMIQVWTMGGLGQGRSDDDPLSKMMNLFDDAGYVGVVATNCEQTYHRYLQPGEEITIHA 124
Query 126 ELGDVVGPKQTALGEGWFINQHIVWQV-GDEDVAEMNWRILKFKP-AGSPSSVPDDLDPD 183
++ +VVGPKQTALGEG+FINQ I W V G E VAEMNWRI+KFKP A +VP+DLDPD
Sbjct 125 DITEVVGPKQTALGEGYFINQLITWTVDGGETVAEMNWRIMKFKPTADDRPAVPEDLDPD 184
Query 184 AMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGTV 243
+MRP+SS+DT FFWDGV AHELRIQ+ DG+L+HPPVPAVW DK P++YVV++G GTV
Sbjct 185 KLMRPASSKDTQFFWDGVNAHELRIQKRPDGTLQHPPVPAVWADKDAPVDYVVAAGTGTV 244
Query 244 FSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPD-- 301
+S+VVHHAPKVPGR++PFVIALVEL+EGVRMLGELRG DP+ V IGMPVRATYIDFPD
Sbjct 245 YSYVVHHAPKVPGRSLPFVIALVELDEGVRMLGELRGVDPSEVQIGMPVRATYIDFPDSD 304
Query 302 ----WSLYAWEP 309
W+LYAWEP
Sbjct 305 VSPAWTLYAWEP 316
>gi|333992329|ref|YP_004524943.1| hypothetical protein JDM601_3689 [Mycobacterium sp. JDM601]
gi|333488297|gb|AEF37689.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=322
Score = 454 bits (1169), Expect = 7e-126, Method: Compositional matrix adjust.
Identities = 224/318 (71%), Positives = 265/318 (84%), Gaps = 10/318 (3%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
+SD+ + QI A+G S+PR RDPVNQPMI +WV+AIGD NPIY+D+ AA+AAGHPGIV
Sbjct 1 MSDLSAGIEQILASGRSEPRTGRDPVNQPMIRHWVDAIGDENPIYIDEEAAKAAGHPGIV 60
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAMIQVWTMMGLGG RP DDPLG +++LFD AGY+GVVATNCEQTYHRYL PGE+VS+
Sbjct 61 APPAMIQVWTMMGLGGERPADDPLGKVMELFDGAGYVGVVATNCEQTYHRYLRPGEEVSV 120
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKP----AGSPSSVPDD 179
+AE+ D+VGPKQT LGEG+FI Q I W VGDE+VA+MNWRILKF P +P++ P+D
Sbjct 121 TAEVTDIVGPKQTGLGEGYFITQKIRWWVGDENVADMNWRILKFVPKDAGDAAPAAAPED 180
Query 180 LDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSG 239
LDP MMRPS S+D+A+FW+GV AHELRIQ+ DGSL+HPPVPA+W+DK+ ++VV+SG
Sbjct 181 LDPAKMMRPSWSKDSAYFWEGVVAHELRIQQRPDGSLQHPPVPAIWKDKTETTDFVVASG 240
Query 240 RGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF 299
GTVFSFVVHHAPKVPGR+VPFVIALVELEEGVRMLGELRG D A V IGMPVRA Y+DF
Sbjct 241 TGTVFSFVVHHAPKVPGRSVPFVIALVELEEGVRMLGELRGVDAAAVRIGMPVRAIYLDF 300
Query 300 ------PDWSLYAWEPDE 311
P W+LYAWEP E
Sbjct 301 PAGDAGPAWTLYAWEPVE 318
>gi|169627719|ref|YP_001701368.1| hypothetical protein MAB_0616 [Mycobacterium abscessus ATCC 19977]
gi|169239686|emb|CAM60714.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=316
Score = 448 bits (1152), Expect = 6e-124, Method: Compositional matrix adjust.
Identities = 221/312 (71%), Positives = 250/312 (81%), Gaps = 7/312 (2%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
V DIQE A+IKA GPS PRLARDPVNQ MINNWVEA+GDRNPIYVD+ AA+AAGHPGIV
Sbjct 3 VVDIQEGFAKIKADGPSAPRLARDPVNQAMINNWVEAMGDRNPIYVDEEAAKAAGHPGIV 62
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAM QVWTM GL G R DDPLG ++ L D AGY VVATN EQ YHRYL GEQV+
Sbjct 63 APPAMAQVWTMAGLYGERSGDDPLGRVMALLDSAGYESVVATNYEQIYHRYLELGEQVTT 122
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKP-AGSPSSVPDDLDP 182
+ E+ +V GPKQTALGE WFIN H W VGDE V EMNWRI+KF+P A + VP DLDP
Sbjct 123 TNEVTEVFGPKQTALGESWFINIHAEWHVGDELVNEMNWRIMKFRPGAKKAADVPQDLDP 182
Query 183 DAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGT 242
+MRPS SRD+A+FW+GV AHELR+Q+ DGSL+HPPVPAVWQDK PI Y V+SGRGT
Sbjct 183 SKLMRPSWSRDSAYFWEGVAAHELRLQKRPDGSLQHPPVPAVWQDKEAPIEYQVASGRGT 242
Query 243 VFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF--- 299
V+S+VVHHAP+VPGRT+PFVIALVEL+EGVRMLGELRG P V IG+PVRATY+DF
Sbjct 243 VYSYVVHHAPQVPGRTLPFVIALVELDEGVRMLGELRGVSPDDVQIGLPVRATYLDFPAE 302
Query 300 ---PDWSLYAWE 308
P W+LYAWE
Sbjct 303 GEDPAWTLYAWE 314
>gi|312138120|ref|YP_004005456.1| hypothetical protein REQ_06540 [Rhodococcus equi 103S]
gi|311887459|emb|CBH46771.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=335
Score = 422 bits (1084), Expect = 5e-116, Method: Compositional matrix adjust.
Identities = 208/325 (64%), Positives = 245/325 (76%), Gaps = 19/325 (5%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
+ I QIKA G SKPR RDP+N PMI NW+EAIGD NPIYVD+ AA AAGH GIV
Sbjct 7 TAQILAGAEQIKAGGASKPRAGRDPINMPMIRNWLEAIGDENPIYVDEDAAVAAGHGGIV 66
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAM+QVWTM GLG VR +DDPLG + ++ DDAGY VVATNC+Q YHRYL PGE+V+I
Sbjct 67 APPAMVQVWTMRGLGAVREEDDPLGRMTQILDDAGYTSVVATNCDQIYHRYLRPGEEVTI 126
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDED-----VAEMNWRILKFK------PAGS 172
+ L +VVGPK+T LGEGWF W+V ED VAEM +RILKF PA
Sbjct 127 ESTLEEVVGPKRTGLGEGWFFTTRNFWKVAREDGAGEIVAEMMFRILKFMPPAKDVPASE 186
Query 173 PSSVPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPI 232
+SVP+DLDP MMRP+ SRDT FFWDGV AHELRIQ+ DGSL+HPPVPA+W+DK+
Sbjct 187 SASVPEDLDPTRMMRPTPSRDTQFFWDGVAAHELRIQQRPDGSLQHPPVPALWKDKAETT 246
Query 233 NYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPV 292
+YVV+SGRGTVFSFVVHHAP+VPGR++PFV+ALVELEEGVRMLGELR DP+ V +GMPV
Sbjct 247 DYVVASGRGTVFSFVVHHAPRVPGRSLPFVVALVELEEGVRMLGELRDVDPSEVVVGMPV 306
Query 293 RATYIDFPD--------WSLYAWEP 309
+A ++DFP W+LYAW P
Sbjct 307 QAQFLDFPGDGETGNEPWTLYAWRP 331
>gi|325674975|ref|ZP_08154662.1| hypothetical protein HMPREF0724_12444 [Rhodococcus equi ATCC
33707]
gi|325554561|gb|EGD24236.1| hypothetical protein HMPREF0724_12444 [Rhodococcus equi ATCC
33707]
Length=335
Score = 421 bits (1083), Expect = 6e-116, Method: Compositional matrix adjust.
Identities = 209/325 (65%), Positives = 244/325 (76%), Gaps = 19/325 (5%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
+ I QIKA G SKPR RDP+N PMI NW+EAIGD NPIYVD+ AA AAGH GIV
Sbjct 7 TAQILAGAEQIKAGGASKPRAGRDPINMPMIRNWLEAIGDENPIYVDEDAAVAAGHGGIV 66
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAM QVWTM GLG VR +DDPLG + ++ DDAGY VVATNC+Q YHRYL PGE+V+I
Sbjct 67 APPAMAQVWTMRGLGAVREEDDPLGRMTQILDDAGYTSVVATNCDQIYHRYLRPGEEVTI 126
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDED-----VAEMNWRILKFKP------AGS 172
+ L +VVGPK+T LGEGWF W+V ED VAEM +RILKF P A
Sbjct 127 ESTLEEVVGPKRTGLGEGWFFTTRNFWKVAREDGTGEIVAEMMFRILKFMPPAKDVPASE 186
Query 173 PSSVPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPI 232
+SVP+DLDP MMRP+ SRDT FFWDGV AHELRIQ+ DGSL+HPPVPA+W+DK+
Sbjct 187 SASVPEDLDPTRMMRPTPSRDTQFFWDGVAAHELRIQQRPDGSLQHPPVPALWKDKAEAT 246
Query 233 NYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPV 292
+YVV+SGRGTVFSFVVHHAPKVPGR++PFV+ALVELEEGVRMLGELR DP+ V +GMPV
Sbjct 247 DYVVASGRGTVFSFVVHHAPKVPGRSLPFVVALVELEEGVRMLGELRDVDPSEVVVGMPV 306
Query 293 RATYIDFPD--------WSLYAWEP 309
+A ++DFP W+LYAW P
Sbjct 307 QAQFLDFPGDDETGNEPWTLYAWRP 331
>gi|229494536|ref|ZP_04388299.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229318898|gb|EEN84756.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=332
Score = 420 bits (1080), Expect = 1e-115, Method: Compositional matrix adjust.
Identities = 207/314 (66%), Positives = 243/314 (78%), Gaps = 11/314 (3%)
Query 7 IQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPP 66
I A +++ GPS PR RDPVN PMI NWVEAIGD NPIYVD++AARAAGH GIVAPP
Sbjct 5 ILTAAERVREDGPSAPRAGRDPVNLPMIRNWVEAIGDENPIYVDESAARAAGHDGIVAPP 64
Query 67 AMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAE 126
AM QVWTM GLG +R DDPLG + + D+AG+ VVATNC+Q YHRYL PGE+V+IS+
Sbjct 65 AMAQVWTMRGLGAIREADDPLGRMTDILDEAGFTSVVATNCDQIYHRYLKPGEEVTISSV 124
Query 127 LGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSS---VPDDLDPD 183
L DVVGPK+T LGEGWF W+VG+E V+EM +RILKF P S SS VP DLD
Sbjct 125 LEDVVGPKKTGLGEGWFFTTRSFWRVGEELVSEMMFRILKFAPPVSESSSTDVPADLDAS 184
Query 184 AMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGTV 243
M+RP++S DT FFWDGV AHELRIQ ADGS++HPPVPA+W+DKS +YVV+SGRGTV
Sbjct 185 RMLRPTASLDTQFFWDGVAAHELRIQLRADGSVQHPPVPAIWKDKSEQTDYVVASGRGTV 244
Query 244 FSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPD-- 301
FSFVVHHAPKVPGR++PFV+ALVEL EGVRMLGELRG DP+ V+IGM V A Y+DFP
Sbjct 245 FSFVVHHAPKVPGRSLPFVVALVELPEGVRMLGELRGIDPSSVSIGMEVEAVYLDFPGDD 304
Query 302 ------WSLYAWEP 309
W+LYAW+P
Sbjct 305 ETGGTPWTLYAWQP 318
>gi|343925845|ref|ZP_08765360.1| hypothetical protein GOALK_050_01400 [Gordonia alkanivorans NBRC
16433]
gi|343764196|dbj|GAA12286.1| hypothetical protein GOALK_050_01400 [Gordonia alkanivorans NBRC
16433]
Length=350
Score = 419 bits (1076), Expect = 4e-115, Method: Compositional matrix adjust.
Identities = 213/327 (66%), Positives = 246/327 (76%), Gaps = 24/327 (7%)
Query 7 IQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPP 66
I+ A I G S P RDP+NQPMINNWVEA+GD NPIYVDDAAARAAGHPG+VAPP
Sbjct 17 IRAAAQTIMDDGASAPVRGRDPINQPMINNWVEAMGDENPIYVDDAAARAAGHPGVVAPP 76
Query 67 AMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAE 126
AM QVWTM GL GVR DDPLG +LFDDAG+ VVATNC+ YHRY+ PGEQVS+SAE
Sbjct 77 AMAQVWTMRGLNGVRTADDPLGRATQLFDDAGFTSVVATNCDTVYHRYIRPGEQVSLSAE 136
Query 127 LGDVVGPKQTALGEGWFINQHIVWQVGDED-----VAEMNWRILKFKPAGSP------SS 175
L ++VGPK TALGEGWF VW V DE+ VAEM +RILKF+PA SS
Sbjct 137 LIEIVGPKNTALGEGWFFTTRNVWSVPDENGVDEVVAEMRFRILKFRPAAKDAAPEPVSS 196
Query 176 VPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQ----DKSVP 231
+PDDLDP +++PS SRDTAFFW+GV AHELRIQR +G+LRHPPVPA W+ D SVP
Sbjct 197 IPDDLDPARLLKPSVSRDTAFFWEGVAAHELRIQRDGEGALRHPPVPATWKPREADGSVP 256
Query 232 -INYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGM 290
+YVV+SG GTV+S+VVH APKVPGR +PFV+ALVEL+EGVRMLGELRG +P V IGM
Sbjct 257 ETDYVVASGTGTVYSYVVHRAPKVPGRQLPFVVALVELDEGVRMLGELRGVEPEDVEIGM 316
Query 291 PVRATYIDF--------PDWSLYAWEP 309
PVR T++DF P W+LYAWEP
Sbjct 317 PVRVTFLDFPARDDAGTPAWTLYAWEP 343
>gi|326384606|ref|ZP_08206285.1| hypothetical protein SCNU_16778 [Gordonia neofelifaecis NRRL
B-59395]
gi|326196740|gb|EGD53935.1| hypothetical protein SCNU_16778 [Gordonia neofelifaecis NRRL
B-59395]
Length=342
Score = 417 bits (1073), Expect = 8e-115, Method: Compositional matrix adjust.
Identities = 206/326 (64%), Positives = 240/326 (74%), Gaps = 21/326 (6%)
Query 7 IQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPP 66
I+ A +I A G S PR RDP+NQPMINNW+EA+GD NPIYVDD AARAAGHPG+VAPP
Sbjct 14 IRAAADEIAARGASAPRAGRDPINQPMINNWLEAMGDDNPIYVDDDAARAAGHPGVVAPP 73
Query 67 AMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAE 126
AM QVWTM GL GVR DDPLG +LFD AGY VVATNC+ YHRY PGE+V+IS+E
Sbjct 74 AMAQVWTMRGLHGVRTDDDPLGLATELFDQAGYTSVVATNCDTVYHRYTRPGEEVTISSE 133
Query 127 LGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDD------- 179
L VVGPKQTALGEGWF W VGDE VAEM++RILKF+PA + DD
Sbjct 134 LTQVVGPKQTALGEGWFFTTRNTWNVGDETVAEMDFRILKFRPAAKDARGADDAPAEQFE 193
Query 180 -LDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKS-----VPIN 233
LDP ++RPSSSRDTAFFWDGV AHELRIQ++ DGSLRHPP+PA W ++ +
Sbjct 194 GLDPTRLIRPSSSRDTAFFWDGVAAHELRIQQVQDGSLRHPPIPATWTERGDDGEFAGTD 253
Query 234 YVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVR 293
YVV++G GTV+S+VVHHAP+VPGR +PFV+ALVEL+EGVRMLG+LRG DP V IGMPV
Sbjct 254 YVVAAGSGTVYSYVVHHAPRVPGRELPFVVALVELDEGVRMLGQLRGVDPESVTIGMPVE 313
Query 294 ATYIDFPD--------WSLYAWEPDE 311
++DFP WSLYAW E
Sbjct 314 VEFLDFPGDDDTGGAPWSLYAWRAKE 339
>gi|226304199|ref|YP_002764157.1| hypothetical protein RER_07100 [Rhodococcus erythropolis PR4]
gi|226183314|dbj|BAH31418.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=335
Score = 417 bits (1071), Expect = 1e-114, Method: Compositional matrix adjust.
Identities = 206/314 (66%), Positives = 243/314 (78%), Gaps = 11/314 (3%)
Query 7 IQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPP 66
I A +++ GPS PR RDPVN PMI NWVEAIGD NPIYVD++AARAAGH GIVAPP
Sbjct 5 ILTAAERVREDGPSAPRAGRDPVNLPMIRNWVEAIGDENPIYVDESAARAAGHDGIVAPP 64
Query 67 AMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAE 126
AM QVWTM GLG +R DDPLG + + D+AG+ VVATNC+Q YHRYL PGE+V+IS+
Sbjct 65 AMAQVWTMRGLGAIREADDPLGRMTDILDEAGFTSVVATNCDQIYHRYLKPGEEVTISSV 124
Query 127 LGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGS--PSS-VPDDLDPD 183
L DVVGPK+T LGEGWF W+VG+E V+EM +RILKF P S PS+ VP DLD
Sbjct 125 LEDVVGPKKTGLGEGWFFTTRSFWRVGEELVSEMMFRILKFAPPVSERPSTDVPADLDAS 184
Query 184 AMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGTV 243
M+RP++S DT FFWDGV AHELRIQ ADGS++HPPVPA+W+DKS +YVV+SGRGTV
Sbjct 185 RMLRPTASLDTQFFWDGVAAHELRIQLRADGSVQHPPVPAIWKDKSEQTDYVVASGRGTV 244
Query 244 FSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPD-- 301
FSFVVHHAPKVPGR++PFV+ALVEL EGVRMLGELRG DP+ +IGM V A Y+DFP
Sbjct 245 FSFVVHHAPKVPGRSLPFVVALVELPEGVRMLGELRGVDPSSASIGMEVEAIYLDFPGDD 304
Query 302 ------WSLYAWEP 309
W+LYAW+P
Sbjct 305 ETGGTPWTLYAWQP 318
>gi|262200883|ref|YP_003272091.1| hypothetical protein Gbro_0885 [Gordonia bronchialis DSM 43247]
gi|262084230|gb|ACY20198.1| protein of unknown function DUF35 [Gordonia bronchialis DSM 43247]
Length=333
Score = 416 bits (1070), Expect = 2e-114, Method: Compositional matrix adjust.
Identities = 212/327 (65%), Positives = 244/327 (75%), Gaps = 20/327 (6%)
Query 1 VTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP 60
T I+ A I GPS P RDP+NQPMINNWVEA+GD NPIY D++AARAAGHP
Sbjct 2 TTSADTIRAAAQTIIDDGPSAPIAGRDPINQPMINNWVEAMGDENPIYTDESAARAAGHP 61
Query 61 GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ 120
GIVAPPAM QVWTM GL G R DDPLG +LFDDAGY VVATNC+ YHRY GE+
Sbjct 62 GIVAPPAMAQVWTMRGLHGARTDDDPLGRASQLFDDAGYTSVVATNCDTVYHRYTQLGEE 121
Query 121 VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKF-----KPAGSPSS 175
VS+SAEL DVVGPK TALGEGWF VW V DE VAEM +RILKF KPAG +
Sbjct 122 VSLSAELLDVVGPKNTALGEGWFFTTRNVWSVDDEVVAEMRFRILKFRPGQQKPAG--PT 179
Query 176 VPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVW----QDKSVP 231
VP+DLDP M++PS+SRDTAFFW+GV AHELRIQ+ DG+LRHPP+PA W +D SVP
Sbjct 180 VPEDLDPSKMLKPSASRDTAFFWEGVAAHELRIQQTGDGTLRHPPIPATWKPRGEDGSVP 239
Query 232 -INYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGM 290
+YVV++G GT++SFVVH APKVPGR +PFV+ALVEL+EGVRMLGELRG DPA V IGM
Sbjct 240 ETDYVVAAGTGTIYSFVVHRAPKVPGRALPFVVALVELDEGVRMLGELRGVDPADVRIGM 299
Query 291 PVRATYIDF--------PDWSLYAWEP 309
PV T++DF P W+LYAW+P
Sbjct 300 PVTVTFLDFPADDAAGQPAWTLYAWQP 326
>gi|111021458|ref|YP_704430.1| hypothetical protein RHA1_ro04486 [Rhodococcus jostii RHA1]
gi|110820988|gb|ABG96272.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=336
Score = 413 bits (1062), Expect = 1e-113, Method: Compositional matrix adjust.
Identities = 205/324 (64%), Positives = 243/324 (75%), Gaps = 22/324 (6%)
Query 7 IQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPP 66
I A ++++AGPS PR RDP+N PMI NWVEAIGDRNPIYVD+AAARAAGH GIVAPP
Sbjct 6 ILAAADEVRSAGPSAPRPGRDPINLPMIRNWVEAIGDRNPIYVDEAAARAAGHDGIVAPP 65
Query 67 AMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAE 126
AM QVWTM GL VR DDPLG + + D+AG+ VVATNC+Q YHRYL GE+V+I+
Sbjct 66 AMAQVWTMQGLNAVRQPDDPLGRMTTILDEAGFTSVVATNCDQIYHRYLRVGEEVTITTT 125
Query 127 LGDVVGPKQTALGEGWFINQHIVWQVG-----DEDVAEMNWRILKFKP---------AGS 172
L D+VGPK+T LGEGWF +W+V DE VAEM +RILKF P A
Sbjct 126 LEDLVGPKKTGLGEGWFFTTRSLWRVAGDDGTDEVVAEMLFRILKFAPKPAEPVSPSANG 185
Query 173 PSSVPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPI 232
+ DDLDP +MRPS+S+DT FFWDGV AHELRIQ+ DGSL+HPPVPA+W+DK+
Sbjct 186 QGASFDDLDPTKLMRPSASQDTQFFWDGVAAHELRIQQREDGSLQHPPVPALWKDKTETT 245
Query 233 NYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPV 292
+YVV+SGRGTVFS+VVHHAPKVPGR++PFV+ALVELEEGVRMLGELRG DPA V +GMPV
Sbjct 246 DYVVASGRGTVFSYVVHHAPKVPGRSLPFVVALVELEEGVRMLGELRGVDPAEVTVGMPV 305
Query 293 RATYIDFPD--------WSLYAWE 308
A Y+DFP W+LYAW+
Sbjct 306 EAIYLDFPGDDETGGTPWTLYAWQ 329
>gi|226363824|ref|YP_002781606.1| hypothetical protein ROP_44140 [Rhodococcus opacus B4]
gi|226242313|dbj|BAH52661.1| hypothetical protein [Rhodococcus opacus B4]
Length=332
Score = 411 bits (1057), Expect = 5e-113, Method: Compositional matrix adjust.
Identities = 206/319 (65%), Positives = 242/319 (76%), Gaps = 17/319 (5%)
Query 7 IQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPP 66
I A ++++AGPS PR RDP+N PMI NWVEAIGDRNPIYVD+AAARAAGH GIVAPP
Sbjct 6 ILAAAEEVRSAGPSAPRPGRDPINLPMIRNWVEAIGDRNPIYVDEAAARAAGHDGIVAPP 65
Query 67 AMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAE 126
AM QVWTM GL VR DDPLG + + D+AG+ VVATNC+Q YHRYLL GE+V+I+
Sbjct 66 AMAQVWTMQGLNAVRQTDDPLGRMTTILDEAGFTSVVATNCDQIYHRYLLVGEEVTITTT 125
Query 127 LGDVVGPKQTALGEGWFINQHIVWQ-VGDEDVAEMNWRILKFKPAG---SPSSVPD---- 178
L DVVGPK+T LGEG+F N VW+ E VAEM +RILKF P PS
Sbjct 126 LEDVVGPKKTGLGEGFFFNTRSVWRDAAGEPVAEMLFRILKFVPKTDRPGPSGNGQGASY 185
Query 179 -DLDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVS 237
DLDP MMRPS+S+DT FFWDG+ AHELRIQ+ DGSL+HPPVPA+W+DK+ +YVV+
Sbjct 186 VDLDPTKMMRPSASQDTQFFWDGIAAHELRIQQREDGSLQHPPVPALWKDKAEATDYVVA 245
Query 238 SGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYI 297
SGRGTVFS+VVHHAPKVPGR++PFV+ALVELEEGVRMLGELRG +PA V IGMPV A Y+
Sbjct 246 SGRGTVFSYVVHHAPKVPGRSLPFVVALVELEEGVRMLGELRGVEPAEVTIGMPVEAIYL 305
Query 298 DFPD--------WSLYAWE 308
DFP W+LYAW+
Sbjct 306 DFPGDEETGGAPWTLYAWQ 324
>gi|54022415|ref|YP_116657.1| hypothetical protein nfa4510 [Nocardia farcinica IFM 10152]
gi|54013923|dbj|BAD55293.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=310
Score = 390 bits (1003), Expect = 1e-106, Method: Compositional matrix adjust.
Identities = 193/295 (66%), Positives = 220/295 (75%), Gaps = 5/295 (1%)
Query 7 IQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPP 66
I A QI AAG PRLARDPVNQPMINNWVEAIGD NPIYVD+AAARAAGHPGIVAPP
Sbjct 10 IVAAAEQIIAAGECAPRLARDPVNQPMINNWVEAIGDTNPIYVDEAAARAAGHPGIVAPP 69
Query 67 AMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAE 126
AM QVWTM GLGGVRP DDP+ L D AGY VV TNCEQ YHRYL+PGEQV++++
Sbjct 70 AMAQVWTMFGLGGVRPPDDPMAQTNDLLDAAGYTSVVGTNCEQIYHRYLVPGEQVTVTSR 129
Query 127 LGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPDAMM 186
L + GPK+T LGEGWF+ +W VGDE V EM +RILKF P + + +
Sbjct 130 LDSISGPKRTGLGEGWFVTFRTLWYVGDELVTEMLFRILKFAPRPAAEQPAGE-----RV 184
Query 187 RPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGTVFSF 246
+P S DT FFW G K ELRIQRL DG+LRHPP+PAVW+DKS +YVV+SGRGTVFS+
Sbjct 185 KPLVSHDTEFFWAGTKLGELRIQRLPDGTLRHPPIPAVWKDKSEQTDYVVASGRGTVFSY 244
Query 247 VVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPD 301
VVHHAPKVPGR +PFV+ALVELEEGVRMLGELRG DP V +G+PV + D
Sbjct 245 VVHHAPKVPGRQLPFVVALVELEEGVRMLGELRGIDPGEVRVGLPVEVAFEQLDD 299
>gi|319948637|ref|ZP_08022761.1| hypothetical protein ES5_04623 [Dietzia cinnamea P4]
gi|319437718|gb|EFV92714.1| hypothetical protein ES5_04623 [Dietzia cinnamea P4]
Length=367
Score = 382 bits (981), Expect = 4e-104, Method: Compositional matrix adjust.
Identities = 196/333 (59%), Positives = 225/333 (68%), Gaps = 27/333 (8%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
S I A ++KA+G S R RDP+N PMI NW EAIGD NPIY + AA AAGH G+V
Sbjct 16 TSRILAAADEVKASGGSARRAGRDPINLPMIRNWTEAIGDTNPIYESEEAAVAAGHGGLV 75
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAM QVWTM GLG R DDPLG + + D G+ VVATNC+ TYHRY PGE+V+I
Sbjct 76 APPAMAQVWTMRGLGKAREADDPLGRMTDILDAEGFTSVVATNCDSTYHRYTRPGEEVTI 135
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKP-------------- 169
+ L DVVGPK T LGEGWF W+VGDE VAEM++RILKF+P
Sbjct 136 QSVLVDVVGPKTTGLGEGWFFTTRNYWRVGDEVVAEMDFRILKFRPPAKTAEPAASEEQI 195
Query 170 AGSPSSVPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKS 229
S S DDL ++RPS S DT FFWDGV AHELRIQ+ DGSL+HPPVPA+W+DKS
Sbjct 196 VASSGSATDDLVVGKVLRPSVSHDTQFFWDGVAAHELRIQKREDGSLQHPPVPALWKDKS 255
Query 230 VPINYVVSSGRGTVFSFVVHHAPKVPGRT-VPFVIALVELEEGVRMLGELRGADPARVAI 288
+YVVSSGRGTV+SFV HHAP+VPGRT PFVIALVELEEGVRMLGELR DPA V I
Sbjct 256 EQTDYVVSSGRGTVYSFVTHHAPRVPGRTKFPFVIALVELEEGVRMLGELRDVDPAEVRI 315
Query 289 GMPVRATYIDFPD------------WSLYAWEP 309
GM V ++D P W+LYAW P
Sbjct 316 GMEVEVEFLDMPGDESEDAAFGTDPWTLYAWRP 348
>gi|296141527|ref|YP_003648770.1| hypothetical protein Tpau_3856 [Tsukamurella paurometabola DSM
20162]
gi|296029661|gb|ADG80431.1| protein of unknown function DUF35 [Tsukamurella paurometabola
DSM 20162]
Length=322
Score = 321 bits (823), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 177/321 (56%), Positives = 214/321 (67%), Gaps = 14/321 (4%)
Query 1 VTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP 60
++ + A +I+A G S PR RDPVNQPMINNW EAIGD NP+Y D A +
Sbjct 1 MSTTEQVLAAAERIRALGESAPRPGRDPVNQPMINNWTEAIGDTNPLYSDPEFAMRSRFG 60
Query 61 GIVAPPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQ 120
G VAPPAM QVW+M GL G R DDPLG + DDAGY VVATN QTYHR+L GE+
Sbjct 61 GSVAPPAMAQVWSMNGLHGTRADDDPLGLMTAALDDAGYTSVVATNSNQTYHRHLRLGEE 120
Query 121 VSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGS-----PSS 175
V+ S+ L DVVGPKQTALGEGWF +W+VGDE VAEM +R+LKF+P + P+S
Sbjct 121 VTASSRLEDVVGPKQTALGEGWFFTTRTLWRVGDEVVAEMVFRLLKFRPPSAEPPEIPAS 180
Query 176 VPDDLDPDA------MMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPA-VWQDK 228
P DA ++RP SRDTAF+W+G K ELRIQR +LRHPP P D
Sbjct 181 GPSGRIVDASGYAGGVLRPVISRDTAFYWEGAKLGELRIQRWGT-TLRHPPGPMDPGGDL 239
Query 229 SVPINYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAI 288
+ +Y VS G GTVFSFVVHH PKVPG+ +PFV+ALVEL+EGVR+LGEL DPA V I
Sbjct 240 AATPDYAVSPGTGTVFSFVVHHHPKVPGKRLPFVVALVELDEGVRVLGELIDVDPAAVRI 299
Query 289 GMPVRATYIDF-PDWSLYAWE 308
G+PV+A ++ D +L AWE
Sbjct 300 GLPVKAVFLKVDDDLTLPAWE 320
>gi|300784417|ref|YP_003764708.1| hypothetical protein AMED_2511 [Amycolatopsis mediterranei U32]
gi|299793931|gb|ADJ44306.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340525838|gb|AEK41043.1| hypothetical protein RAM_12765 [Amycolatopsis mediterranei S699]
Length=302
Score = 319 bits (817), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 169/294 (58%), Positives = 206/294 (71%), Gaps = 11/294 (3%)
Query 18 GPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGL 77
G S PRLARDPVNQ M+NNWVEAIGDRNP+Y D A A+ H G+VAPPAM QVWTM GL
Sbjct 14 GESAPRLARDPVNQAMVNNWVEAIGDRNPVYTDPEYAAASVHKGLVAPPAMAQVWTMNGL 73
Query 78 GGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTA 137
G R DDPLG ++ + D+AG+ VVATN EQTYHRYL PGE V+ + L VVGPK+TA
Sbjct 74 HGTRADDDPLGAMMAVLDEAGFTSVVATNSEQTYHRYLRPGEHVAAATRLETVVGPKRTA 133
Query 138 LGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPDAMMRPSSSRDTAFF 197
LGEGWF+ W VG E VAEM +R+LKF+P ++RP SRDT FF
Sbjct 134 LGEGWFVTTRTTWSVGSEAVAEMVFRVLKFRPPAPKPPP------APVLRPVISRDTEFF 187
Query 198 WDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPI--NYVVSSGRGTVFSFVVHHAPKVP 255
WDG++ ELRIQR + +LRHPP P + D S+ +YVV+SGRGTV+SFVVHH P VP
Sbjct 188 WDGLREGELRIQRWGE-TLRHPPGP-MPPDGSLDTKPDYVVASGRGTVYSFVVHHHPPVP 245
Query 256 GRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF-PDWSLYAWE 308
G+ +PFV+ALVELEEGVR++ EL A P V IG+PV A ++ D +L AW+
Sbjct 246 GKELPFVVALVELEEGVRVMAELLDAAPEEVHIGLPVVAAFVRVDDDLTLPAWK 299
>gi|302525652|ref|ZP_07277994.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302434547|gb|EFL06363.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=302
Score = 313 bits (802), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 169/294 (58%), Positives = 203/294 (70%), Gaps = 11/294 (3%)
Query 18 GPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGL 77
G S PR ARDPVNQ M+NNWVEAIGD NP+Y D A A+ H G+VAPPAM QVWTM GL
Sbjct 14 GESAPRYARDPVNQAMVNNWVEAIGDANPVYTDAEFAEASVHKGLVAPPAMAQVWTMPGL 73
Query 78 GGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTA 137
GVR DDPLG I+K+ DDAGY +VATN EQTYHRYL PGE VS + L DV GPK+T
Sbjct 74 HGVRGDDDPLGLILKVLDDAGYTSIVATNSEQTYHRYLRPGEHVSTTNVLEDVTGPKRTG 133
Query 138 LGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPDAMMRPSSSRDTAFF 197
LGEGWF+ W V E VA+M +R+LKF+P ++RP S DT FF
Sbjct 134 LGEGWFVTTRTNWYVDGELVADMVFRVLKFRPREPEPPP------SPVLRPVISHDTKFF 187
Query 198 WDGVKAHELRIQRLADGSLRHP--PVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVP 255
WDG++ ELRIQR D LRHP P+P +VP +YVV+SGRGTV+SFVVHH P VP
Sbjct 188 WDGLREGELRIQRWGD-VLRHPPGPMPPDGSLDTVP-DYVVASGRGTVYSFVVHHHPAVP 245
Query 256 GRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPD-WSLYAWE 308
G+ +PFV+ALVELEEGVR++ EL A+P V IG+PV A ++ D +L AW+
Sbjct 246 GKRLPFVVALVELEEGVRLMAELLDAEPDEVHIGLPVTAAFVRVDDELTLPAWK 299
>gi|333918579|ref|YP_004492160.1| hypothetical protein AS9A_0908 [Amycolicicoccus subflavus DQS3-9A1]
gi|333480800|gb|AEF39360.1| hypothetical protein AS9A_0908 [Amycolicicoccus subflavus DQS3-9A1]
Length=289
Score = 306 bits (785), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 162/286 (57%), Positives = 194/286 (68%), Gaps = 3/286 (1%)
Query 25 ARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKD 84
ARDPVNQ MINNWVEAIGD NP+Y D A A+ H G+VAPPAM QVWTM GL G R D
Sbjct 3 ARDPVNQAMINNWVEAIGDNNPVYTDPDFAAASIHGGVVAPPAMAQVWTMRGLHGKRNAD 62
Query 85 DPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWFI 144
DPLG ++ D G+ VV TN EQTYHRYL PGE+V+ S +L D+ GPK+TALG GWF+
Sbjct 63 DPLGAMLAALDKEGFTSVVGTNTEQTYHRYLRPGERVTASTKLRDINGPKETALGPGWFV 122
Query 145 NQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPDAMMRPSSSRDTAFFWDGVKAH 204
W VG+E VAEM +R+LKF+P S + +++RP SRDTAFFW+G K
Sbjct 123 TTVTTWSVGEEPVAEMVFRVLKFRPKSSARIIDAPEYAASVLRPVISRDTAFFWEGTKKG 182
Query 205 ELRIQRLADGSLRHPP-VPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVPGRTVPFVI 263
ELRIQR D LRHPP A D +Y+V+ G GTV+SFVVHH PKVPG+ PFV+
Sbjct 183 ELRIQRWGD-ELRHPPGAVAPNGDLEATPDYIVARGTGTVYSFVVHHHPKVPGKKTPFVV 241
Query 264 ALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF-PDWSLYAWE 308
ALVEL+EGVR+LGEL P V IG VRA ++ D +L AWE
Sbjct 242 ALVELDEGVRVLGELINVAPEDVKIGGSVRAVFLRIDDDVTLPAWE 287
>gi|159038394|ref|YP_001537647.1| hypothetical protein Sare_2821 [Salinispora arenicola CNS-205]
gi|157917229|gb|ABV98656.1| protein of unknown function DUF35 [Salinispora arenicola CNS-205]
Length=315
Score = 289 bits (740), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 160/316 (51%), Positives = 199/316 (63%), Gaps = 28/316 (8%)
Query 10 AVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMI 69
A +I+AAG PR ARDPVN PMI NW+EA+GD NP+Y VAPPAM
Sbjct 8 AAERIRAAGAGPPRRARDPVNLPMIRNWLEAMGDDNPVYERTG----------VAPPAMT 57
Query 70 QVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGD 129
QVWTM GL DPL + + D AG+ VVATNCEQTYHRYL GEQV + + L D
Sbjct 58 QVWTMRGLHPAAEPADPLSAMSAVLDRAGFTSVVATNCEQTYHRYLRHGEQVEVRSRLLD 117
Query 130 VVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPS--------------- 174
V GPK+TALGEGWF+ W VG+E VA M +RILKF+P+ P+
Sbjct 118 VSGPKRTALGEGWFVTTDSTWYVGEEAVASMTFRILKFRPSSPPTVSDAAAPPVPAVAPE 177
Query 175 SVPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLAD-GSLRHPPVPAVWQDKSVPIN 233
+ P + PD ++RP +RDTAFFW G A ELR+QR G+LRHPP PA + + +
Sbjct 178 AGPAAVAPD-VLRPVVTRDTAFFWAGTAAGELRLQRCGGCGALRHPPGPACPRCGADRRD 236
Query 234 YVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVR 293
+VV+ G G VFSFVVHH P VPGR +P V+ALVEL EGVRM+GE+ GADPA+V IG V+
Sbjct 237 HVVARGTGEVFSFVVHHHPPVPGRRLPIVVALVELTEGVRMVGEIPGADPAQVRIGAAVQ 296
Query 294 ATYIDF-PDWSLYAWE 308
++ + +L AW
Sbjct 297 VDFVRVDAELTLPAWR 312
>gi|145595148|ref|YP_001159445.1| hypothetical protein Strop_2623 [Salinispora tropica CNB-440]
gi|145304485|gb|ABP55067.1| protein of unknown function DUF35 [Salinispora tropica CNB-440]
Length=305
Score = 279 bits (714), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 163/310 (53%), Positives = 197/310 (64%), Gaps = 21/310 (6%)
Query 7 IQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPP 66
I A QI+AAG R ARDPVN PMI NW+EAIGD NP Y D VAPP
Sbjct 5 IIAAAEQIRAAGAGPARSARDPVNLPMIRNWLEAIGDGNPRYEQDG----------VAPP 54
Query 67 AMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAE 126
AM QVWTM GL DPL + + D AG+ VVATNCEQTYHRYL GE+V + +
Sbjct 55 AMTQVWTMRGLRPAAEPADPLSAMSAVLDQAGFTSVVATNCEQTYHRYLRDGERVEVRSR 114
Query 127 LGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVP-DDLDPDA- 184
L DV GPK+TALGEGWF+ W VG+E VA M +R+LKF+P GS +VP ++ P A
Sbjct 115 LVDVSGPKRTALGEGWFVTTDSTWYVGEEAVASMTFRVLKFRPPGSAPTVPASEVAPAAA 174
Query 185 --MMRPSSSRDTAFFWDGVKAHELRIQRLAD-GSLRHPPVPAVWQDKSVPINYVVSSGRG 241
++RP +RDTAFFW G A ELRIQR + G+LRHPP P Q +++V+SG G
Sbjct 175 SDVLRPVVTRDTAFFWAGAAAGELRIQRCGECGALRHPPGPVCRQCGVDRQDHLVASGTG 234
Query 242 TVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYI---- 297
VFSFVVH P VPGR +P V+A+VEL EGVRM+GE+ GADPA+V IG VR ++
Sbjct 235 EVFSFVVHRHPPVPGRQLPIVVAVVELTEGVRMVGEVVGADPAQVRIGAAVRVDFVRVDA 294
Query 298 --DFPDWSLY 305
P W L
Sbjct 295 ELTLPVWRLL 304
>gi|311744300|ref|ZP_07718104.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
gi|311312473|gb|EFQ82386.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
Length=431
Score = 270 bits (691), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 158/310 (51%), Positives = 190/310 (62%), Gaps = 22/310 (7%)
Query 4 VSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIV 63
V + V +IKA G S+PR DPVN PMI +W EAIGD NP R A P
Sbjct 6 VETMLARVEEIKARGRSRPRQGPDPVNAPMIRHWAEAIGDTNP--------RWAPGPDTE 57
Query 64 APPAMIQVWTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSI 123
APPAM QVWTM GL R DPL + D+AG+ V+ TNC+QTY R L PGE VSI
Sbjct 58 APPAMAQVWTMYGLNPERMPGDPLPDTMAALDEAGFTAVLGTNCDQTYARTLRPGEVVSI 117
Query 124 SAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKP-AGSPSSVPDDLDP 182
S L DVVGPK T +GEGWFI VW VG+E VA M +R+LKFKP G P+ +D
Sbjct 118 STRLADVVGPKMTGVGEGWFITTESVWTVGEEQVATMRFRVLKFKPGTGRPT-----VDR 172
Query 183 DAMMRPSSSRDTAFFWDGVKAHELRIQRL-ADGSLRHPPVPAVWQDKSVPINYVVSSGRG 241
RP +RDTAFFW+G A ELRIQ A G+LRHPP P ++ +VV+SGRG
Sbjct 173 SRTTRPMVNRDTAFFWEGTAAGELRIQTCTACGALRHPPGPVCPSCHAMDRGHVVASGRG 232
Query 242 TVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATY--IDF 299
V SF+VHHAP VPG+ +P +ALV+L+EGVRM+GE+ G VAIG V + ID
Sbjct 233 HVHSFLVHHAPVVPGKALPLTLALVDLDEGVRMVGEVAG----EVAIGDAVEVWFDRID- 287
Query 300 PDWSLYAWEP 309
D +L W P
Sbjct 288 DDLTLAKWRP 297
>gi|271969393|ref|YP_003343589.1| hypothetical protein Sros_8196 [Streptosporangium roseum DSM
43021]
gi|270512568|gb|ACZ90846.1| conserved hypothetical protein [Streptosporangium roseum DSM
43021]
Length=319
Score = 269 bits (687), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 148/314 (48%), Positives = 198/314 (64%), Gaps = 11/314 (3%)
Query 1 VTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHP 60
+ +++ Q AV +++ A DPVN PMI +W A+GD NP+Y D AA + H
Sbjct 13 LMALAERQIAVGEVRGVA------AADPVNLPMIRHWAAAMGDANPVYTDAEAAAGSVHG 66
Query 61 GIVAPPAMIQVWTMMGLG-GVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGE 119
+VAPPAMIQVWTM G+ R P+ +++ D GY GVVATNCEQTYHRYL GE
Sbjct 67 EVVAPPAMIQVWTMPGMDRDGRQATTPVDDVLRALDGNGYTGVVATNCEQTYHRYLRLGE 126
Query 120 QVSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDD 179
++ + + GPK+TALGEG+F+ +I W DE VA+M +R+LKF+P + P+
Sbjct 127 RLVPATRFAGLSGPKKTALGEGYFVTWNITWYSQDEPVADMLFRVLKFRPRERKTK-PEA 185
Query 180 LDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLAD-GSLRHPPVPAVWQDKSVPINYVVSS 238
+P ++P+ +RDTAFFW+GV ELRIQ+ AD G LRHPP P +SV +VV+S
Sbjct 186 REPYP-LKPAINRDTAFFWEGVGRGELRIQKCADCGELRHPPGPVCPSCRSVERTHVVAS 244
Query 239 GRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYID 298
G G V+S+VVHH P VPGR+ PFV+A+VEL EGVR++G + P V IGM +R TY
Sbjct 245 GAGEVYSYVVHHNPPVPGRSTPFVVAVVELPEGVRIVGNVVDCPPEEVGIGMQLRVTYRQ 304
Query 299 FPD-WSLYAWEPDE 311
D L W P E
Sbjct 305 MDDELILPMWVPQE 318
>gi|148556220|ref|YP_001263802.1| hypothetical protein Swit_3318 [Sphingomonas wittichii RW1]
gi|148501410|gb|ABQ69664.1| protein of unknown function DUF35 [Sphingomonas wittichii RW1]
Length=312
Score = 228 bits (580), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 131/285 (46%), Positives = 171/285 (60%), Gaps = 8/285 (2%)
Query 20 SKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLG- 78
S P ARDPVN PMI W EA+G+ NP+Y+D+ AARAAGHPGIVAPPAM+ VW M
Sbjct 17 SPPATARDPVNLPMIRRWCEAMGETNPVYLDEVAARAAGHPGIVAPPAMLDVWAMAPYRP 76
Query 79 GVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTAL 138
G R ++ + ++ LFDDAGY GVVATN Q Y RYL GE V+ + + DV K+TAL
Sbjct 77 GGRTAEEGMA-VLSLFDDAGYTGVVATNVRQDYGRYLTLGEVVTSTPIVADVSDEKRTAL 135
Query 139 GEGWFINQHIVWQVGDEDVAEMNWRILKFKP-AGSPSSVPDDLDPDAMM----RPSSSRD 193
G G F+ + V E V M +R+LKFKP P S P + + + RP+ + D
Sbjct 136 GIGHFVTLAYDFTVAGEPVGRMTFRVLKFKPNLAQPQSRPAEPAAETKVWPHPRPAITHD 195
Query 194 TAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPK 253
AFFW+GV +L Q +G LRHPP P S+ V +SGRGT+ SFVV H P+
Sbjct 196 NAFFWEGVNRRKLLFQS-CEGRLRHPPGPMDPLTGSLDFEIVEASGRGTIHSFVVMHQPR 254
Query 254 VPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYID 298
+PG P + LVELEEGVR++ + A + IG V A +++
Sbjct 255 LPGFDYPLPVVLVELEEGVRIVANMIDAAAEAIEIGRAVVADFVE 299
>gi|269127688|ref|YP_003301058.1| hypothetical protein Tcur_3482 [Thermomonospora curvata DSM 43183]
gi|268312646|gb|ACY99020.1| protein of unknown function DUF35 [Thermomonospora curvata DSM
43183]
Length=319
Score = 208 bits (530), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 118/297 (40%), Positives = 175/297 (59%), Gaps = 12/297 (4%)
Query 21 KPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGV 80
+PR +DPVN PMI +WVEA+GD NP+Y+D+ AARA G +VAP +M+Q WTM G
Sbjct 20 EPRRGQDPVNVPMIRHWVEAMGDTNPVYLDEEAARATGRETVVAPASMMQAWTMRGYAAT 79
Query 81 ---RPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTA 137
P+ + + L + GY VVAT+ E +HR L+PG+ +S+ ++ + K+TA
Sbjct 80 VNPEPEAGGMEELTALLAEGGYTSVVATDSEFEFHRELVPGDHISVQEQVESISPEKKTA 139
Query 138 LGEGWFINQHIVW--QVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPDAMM-RPSSSRDT 194
LGEG FI + Q G E VA WR+L+F+P + + + P A+ RP+ +RD
Sbjct 140 LGEGRFITTLRTYRDQRG-EVVATQRWRLLRFRPKKTEQT---EQKPKALRPRPAINRDN 195
Query 195 AFFWDGVKAHELRIQRLADG-SLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPK 253
AF+++ K L IQR A +LRHPP P S + V ++G G V+S++V H P
Sbjct 196 AFWFEAAKQRRLVIQRCAACKTLRHPPGPCCPHCGSFDWDTVEAAGTGQVYSYIVAHHPP 255
Query 254 VPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF-PDWSLYAWEP 309
P +P+V+ALVEL EG R++ L G P ++ IGMPV +++ P+ +L + P
Sbjct 256 HPAFEMPYVVALVELTEGTRLVTNLVGIAPDKIEIGMPVVLDWLEADPELTLPVFRP 312
>gi|312141661|ref|YP_004008997.1| hypothetical protein REQ_43550 [Rhodococcus equi 103S]
gi|311891000|emb|CBH50319.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=332
Score = 207 bits (526), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 123/303 (41%), Positives = 169/303 (56%), Gaps = 20/303 (6%)
Query 27 DPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKDDP 86
D VN PMI +WVE +GD +PIY+DD AARA G G VAP M QVWTM +DP
Sbjct 24 DEVNTPMIRHWVETMGDTSPIYLDDDAARATGRDGAVAPALMAQVWTMRTFNDKMANNDP 83
Query 87 L---GPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWF 143
+I DD G+ VVAT+ + + L PG++VS++ + + K+T LG G F
Sbjct 84 SQVWTDLIATLDDEGFTSVVATDSDFEFFTELRPGDRVSLTEVIESISEEKKTGLGVGHF 143
Query 144 INQHIVWQVG---------DEDVAEMNWRILKFKPAGS--PSSVPDDLD----PDAMMRP 188
+ ++ G DE VA WR L+FKP GS P+ P + P RP
Sbjct 144 VTTLKTYRRGGAANGHSGDDEVVATQRWRTLRFKPKGSEKPAEKPAEKSGGPIPGLRPRP 203
Query 189 SSSRDTAFFWDGVKAHELRIQRLAD-GSLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFV 247
+ + D AF+++ K H L IQR + G LRHP P +S+ + V +SGRG V+SFV
Sbjct 204 ALNADNAFWFEAAKEHRLVIQRCTNCGVLRHPTGPMCGACQSLNWDTVDASGRGIVYSFV 263
Query 248 VHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF-PDWSLYA 306
V+H P++PG P ++A +ELEEG R++ + G P V IGMPV +ID PD +L A
Sbjct 264 VNHHPQIPGFEYPLIVATIELEEGTRLIANMTGIAPDDVEIGMPVEMDWIDADPDLTLPA 323
Query 307 WEP 309
+ P
Sbjct 324 FRP 326
>gi|111022789|ref|YP_705761.1| hypothetical protein RHA1_ro05826 [Rhodococcus jostii RHA1]
gi|110822319|gb|ABG97603.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=312
Score = 207 bits (526), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 122/289 (43%), Positives = 168/289 (59%), Gaps = 12/289 (4%)
Query 27 DPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKDDP 86
DPVN PMI +WVEA+GD +P+Y+DDAAARA G G VAP M+QVWTM DP
Sbjct 24 DPVNTPMIRHWVEAMGDTSPVYLDDAAARATGRGGAVAPAQMLQVWTMRTFADKMSGADP 83
Query 87 ---LGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWF 143
+I D+AG+ VVAT+ +Q + L PG++VS + + V KQT LG G F
Sbjct 84 SPVCTELIATLDEAGFTSVVATDSDQEFFAELRPGDRVSTTEVVESVSAEKQTGLGAGHF 143
Query 144 INQHIVWQVGD-EDVAEMNWRILKFKPAGSPSSVPDDLDPDAMMRPSSSRDTAFFWDGVK 202
+ ++ GD E V WR L+F+P P+ P RP+ + D AF++ +
Sbjct 144 VTTLKTYRNGDGEIVGTQRWRTLRFRPTERPAPRP------LRPRPALNDDNAFWFAAAR 197
Query 203 AHELRIQRLAD-GSLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVPGRTVPF 261
H L IQR A G LRHP P + +S + V +SGRGTV++FVV+H PK+ P
Sbjct 198 EHRLVIQRCARCGVLRHPTGPMCGECRSTEWDTVDASGRGTVYTFVVNHHPKIDAFDYPL 257
Query 262 VIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF-PDWSLYAWEP 309
V+A++ELEEG R++ + G +P V IGMPV +ID P+ +L A+ P
Sbjct 258 VVAVIELEEGTRLIANMTGVEPEHVHIGMPVELDWIDADPELTLPAFRP 306
>gi|325673082|ref|ZP_08152776.1| hypothetical protein HMPREF0724_10557 [Rhodococcus equi ATCC
33707]
gi|325556335|gb|EGD26003.1| hypothetical protein HMPREF0724_10557 [Rhodococcus equi ATCC
33707]
Length=332
Score = 206 bits (524), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 121/303 (40%), Positives = 167/303 (56%), Gaps = 20/303 (6%)
Query 27 DPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKDDP 86
D VN PMI +WVE +GD +PIY+DD AARA G G VAP M QVWTM +DP
Sbjct 24 DEVNTPMIRHWVETMGDTSPIYLDDDAARATGRDGAVAPALMAQVWTMRTFNDKMANNDP 83
Query 87 L---GPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWF 143
+I DD G+ VVAT+ + + L PG++VS++ + + K+T LG G F
Sbjct 84 SQVWTDLIATLDDEGFTSVVATDSDFEFFTELRPGDRVSLTEVIESISEEKKTGLGVGHF 143
Query 144 INQHIVWQVG---------DEDVAEMNWRILKFKPAGSPSSVPDDLD------PDAMMRP 188
+ ++ G DE VA WR L+FKP GS + + P RP
Sbjct 144 VTTLKTYRRGGAANGHSGDDEVVATQRWRTLRFKPKGSEKAAEKPAEKSGGPIPGLRPRP 203
Query 189 SSSRDTAFFWDGVKAHELRIQRLAD-GSLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFV 247
+ + D AF+++ K H L IQR + G LRHP P +S+ + V +SGRG V+SFV
Sbjct 204 ALNADNAFWFEAAKEHRLVIQRCTNCGVLRHPTGPMCGACQSLNWDTVDASGRGIVYSFV 263
Query 248 VHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF-PDWSLYA 306
V+H P++PG P ++A +ELEEG R++ + G P V IGMPV +ID PD +L A
Sbjct 264 VNHHPQIPGFEYPLIVATIELEEGTRLIANMTGIAPDDVEIGMPVELDWIDADPDLTLPA 323
Query 307 WEP 309
+ P
Sbjct 324 FRP 326
>gi|229488465|ref|ZP_04382331.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229323969|gb|EEN89724.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=315
Score = 200 bits (509), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 115/290 (40%), Positives = 165/290 (57%), Gaps = 11/290 (3%)
Query 27 DPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKDDP 86
D VN PMI +W E +GD NP+Y+DD AARA G G+VAP M+QVW M +DP
Sbjct 24 DEVNIPMIRHWAETMGDTNPVYLDDEAARATGRDGVVAPAQMLQVWIMRTFTDKMAGNDP 83
Query 87 L---GPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWF 143
+I D AGY VVAT+ +Q + L PG++V+ + + + K+T LG G F
Sbjct 84 SQVWTDLIATLDAAGYSSVVATDSDQEFFAELRPGDRVNYTEVIESISPEKKTGLGVGHF 143
Query 144 INQHIVWQVGDEDV-AEMNWRILKFKPAGSPS-SVPDDLDPDAMMRPSSSRDTAFFWDGV 201
+ ++ GD+ V WR L+FKP S + P L P RP+ + D AF+++
Sbjct 144 VTTLKTYRNGDDQVVGTQRWRTLRFKPQESAAPEAPRALRP----RPALNADNAFWFEAA 199
Query 202 KAHELRIQRLADG-SLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVPGRTVP 260
K H L IQR +LRHP P + +S+ + + +SGRGTV+SFVV+H PK+ P
Sbjct 200 KEHRLVIQRCTSCKTLRHPTGPMCGECQSLEWDTIDASGRGTVYSFVVNHHPKIDAFDYP 259
Query 261 FVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF-PDWSLYAWEP 309
++A++ELEEG R++ + G P V + MPV +ID PD +L A+ P
Sbjct 260 LIVAVIELEEGTRLIANMVGIAPEDVVVDMPVELDWIDADPDLTLPAFRP 309
>gi|226308563|ref|YP_002768523.1| hypothetical protein RER_50760 [Rhodococcus erythropolis PR4]
gi|226187680|dbj|BAH35784.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=315
Score = 199 bits (507), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 114/290 (40%), Positives = 166/290 (58%), Gaps = 11/290 (3%)
Query 27 DPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKDDP 86
D VN PMI +W E +GD NP+Y+DD AARA G G+VAP M+QVW M +DP
Sbjct 24 DEVNIPMIRHWAETMGDTNPVYLDDDAARATGRDGVVAPAQMLQVWIMRTFTDKMAGNDP 83
Query 87 L---GPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWF 143
+I D AGY VVAT+ +Q + L PG++V+ + + + K+T LG G F
Sbjct 84 SQVWTDLIATLDAAGYSSVVATDSDQEFFAELRPGDRVNYTEVIESISPEKKTGLGVGHF 143
Query 144 INQHIVWQVGDEDV-AEMNWRILKFKPAGSPS-SVPDDLDPDAMMRPSSSRDTAFFWDGV 201
+ ++ GD++V WR L+FKP + + P L P RP+ + D AF+++
Sbjct 144 VTTLKTYRNGDDEVVGTQRWRTLRFKPQEAKAPEAPRALRP----RPALNADNAFWFEAA 199
Query 202 KAHELRIQRLADG-SLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVPGRTVP 260
K H L IQR +LRHP P + +S+ + + +SGRGTV+SFVV+H PK+ P
Sbjct 200 KEHRLVIQRCTSCKTLRHPTGPMCGECQSLEWDTIDASGRGTVYSFVVNHHPKIDAFDYP 259
Query 261 FVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF-PDWSLYAWEP 309
++A++ELEEG R++ + G P V + MPV +ID PD +L A+ P
Sbjct 260 LIVAVIELEEGTRLIANMVGIAPEDVVVDMPVELDWIDADPDLTLPAFRP 309
>gi|288918163|ref|ZP_06412519.1| protein of unknown function DUF35 [Frankia sp. EUN1f]
gi|288350439|gb|EFC84660.1| protein of unknown function DUF35 [Frankia sp. EUN1f]
Length=571
Score = 197 bits (501), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 124/318 (39%), Positives = 167/318 (53%), Gaps = 35/318 (11%)
Query 12 AQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQV 71
A++ A P R+AR PVN+ MI +W +AIGD NP Y D A A H G+VAPPAM+ V
Sbjct 48 AEVGARAPR--RVARHPVNEAMIAHWCDAIGDANPAYTDAAYAATTVHGGLVAPPAMLDV 105
Query 72 WTMMGLGGVRPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVV 131
W GL R DDP +I + AG++ VVA N E HRYL PG+ + L DV
Sbjct 106 WDRHGLLERREADDPRARVIATLEQAGFVSVVAVNAELEIHRYLRPGDLLGNVQTLTDVS 165
Query 132 GPKQTALGEGWFI-NQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPDAMMRPSS 190
K+TALG G F+ ++H E V ++ +RILKF+P GS P +P++ PS
Sbjct 166 AEKRTALGVGHFVTSRHRFTDQAGEHVGDLQFRILKFRP-GSGQRPPAAANPNS-ANPSG 223
Query 191 -----------------------------SRDTAFFWDGVKAHELRIQRL-ADGSLRHPP 220
+RD FFWDG +A ELRIQR G+L PP
Sbjct 224 GSSVTTRRGPSGPSGPPDPDPARRPRPAINRDNQFFWDGTRARELRIQRCDGCGALFSPP 283
Query 221 VPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRG 280
W + ++V+SGRG ++S V H P PG P + LVEL EG R++ +L G
Sbjct 284 TARCWHCGGFELGWIVASGRGRLYSHAVVHHPPAPGFRYPLPVGLVELAEGTRVVADLVG 343
Query 281 ADPARVAIGMPVRATYID 298
PA + +GMPV T+++
Sbjct 344 CPPAGLRVGMPVELTWVE 361
>gi|254384132|ref|ZP_04999477.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194343022|gb|EDX23988.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length=313
Score = 197 bits (500), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 113/277 (41%), Positives = 151/277 (55%), Gaps = 13/277 (4%)
Query 25 ARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGVRPKD 84
A+DPVN+PMI +W EA+GD +P Y P +APP M+Q WTM GL G +
Sbjct 30 AKDPVNEPMIRHWCEAMGDTSPSYTG---------PDAIAPPTMLQAWTMGGLSGHSDRS 80
Query 85 DPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWFI 144
++ L D AG VVAT+CEQ Y R L PG+ V+ A + V K T LG G F+
Sbjct 81 CAYDQLLALLDGAGCTSVVATDCEQEYLRPLRPGDAVTFDAVIESVSARKTTKLGTGHFV 140
Query 145 NQHIVWQVGDEDVAEMNWRILKFKPAGSPSSVPDDLDPDAMM---RPSSSRDTAFFWDGV 201
+ + E +RILK++PA + + L P RP +RD FW+GV
Sbjct 141 TTRMDVRANGEPAGTHRFRILKYRPAATTAQPARPLPPKQAARRPRPVINRDNHGFWEGV 200
Query 202 KAHELRIQRLAD-GSLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFVVHHAPKVPGRTVP 260
+AH L IQR + +LR P +P S + V +SG GTVFS+VV H P P P
Sbjct 201 RAHRLLIQRCSSCATLRFPWLPGCNACASPEWDTVEASGAGTVFSYVVMHHPPFPAFDPP 260
Query 261 FVIALVELEEGVRMLGELRGADPARVAIGMPVRATYI 297
+ +ALVEL EGVRM+G + G +V IGMPVR ++
Sbjct 261 YAVALVELAEGVRMIGNITGVPYDKVRIGMPVRLEFL 297
>gi|157961959|ref|YP_001501993.1| hypothetical protein Spea_2138 [Shewanella pealeana ATCC 700345]
gi|157846959|gb|ABV87458.1| protein of unknown function DUF35 [Shewanella pealeana ATCC 700345]
Length=321
Score = 196 bits (497), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 115/303 (38%), Positives = 164/303 (55%), Gaps = 19/303 (6%)
Query 25 ARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGV---- 80
A D +N M+ W E +G NP+Y D+ A GIVAPP+M+Q W + G V
Sbjct 17 AWDAINPAMVRQWCEVMGIDNPLYTDEQYALTTEFEGIVAPPSMLQTWCLAGFNDVCAPG 76
Query 81 RPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGE 140
+++P G ++KL + GY VVA N E + RYL GE++ + +L + K T LG
Sbjct 77 SSEENPYG-VLKLLEAQGYPAVVAVNSELDFKRYLKMGEKLCYTTKLESISEEKTTGLGV 135
Query 141 GWFINQHIVW----QVGDED-VAEMNWRILKFKPAGSPSSVPD---DLDPDAMM----RP 188
G+F+ + + G+E+ V + +R+ KFKPA S P+ D P A + +P
Sbjct 136 GYFVTFIMTYFSTHNDGEEELVGTLLFRVFKFKPAARAESTPEVKQDPKPAAKVYKRPKP 195
Query 189 SSSRDTAFFWDGVKAHELRIQRL-ADGSLRHPPVPAVWQDKSVPINYVVSSGRGTVFSFV 247
S D FFWDG HELRIQ L+HPP P +S +++ V SG+G++FSFV
Sbjct 196 GISDDNRFFWDGCNQHELRIQHCTCCDKLQHPPAPICMHCQSFELDHKVVSGKGSIFSFV 255
Query 248 VHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFP-DWSLYA 306
V H P+VP P IALVELEEGVR+ L G +P V IG V+ + + D +L
Sbjct 256 VMHYPEVPPFEHPNPIALVELEEGVRLTAGLIGIEPEDVVIGQKVKVEFTTYDGDLTLPM 315
Query 307 WEP 309
++P
Sbjct 316 FKP 318
>gi|329895411|ref|ZP_08270992.1| hypothetical protein IMCC3088_1452 [gamma proteobacterium IMCC3088]
gi|328922294|gb|EGG29640.1| hypothetical protein IMCC3088_1452 [gamma proteobacterium IMCC3088]
Length=318
Score = 190 bits (483), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 113/303 (38%), Positives = 166/303 (55%), Gaps = 13/303 (4%)
Query 15 KAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTM 74
K GP K D V +I +W EA+GD NP+Y A AAG+ ++APP M+Q W M
Sbjct 13 KTVGPFK---GWDTVQPALIRHWCEAMGDNNPVYRSREVALAAGYTDVIAPPTMLQAWGM 69
Query 75 MGLGGVRPK-DDPLGP--IIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVV 131
G P+ D P ++ + GY VVA NCEQ Y L G+ V + + +
Sbjct 70 KGYNERFPEGSDNANPFEVLVFLESQGYPAVVAVNCEQEYGEPLAIGDSVHFYSRIEAIS 129
Query 132 GPKQTALGEGWFINQHIVWQVG-DEDVAEMNWRILKFKPAGSPSSVPDD--LDPDAM--M 186
K TALGEG+F+ + ++ DE V M +R+ KFK S + D P A+ M
Sbjct 130 ELKTTALGEGYFVTEVSTYKNQRDEVVGVMTFRVFKFKAHESQQARASDSPQKPKAIKRM 189
Query 187 RPSSSRDTAFFWDGVKAHELRIQRLADG-SLRHPPVPAVWQDKSVPINYVVSSGRGTVFS 245
RP + DTAFFW+GV ELRIQ+ +LRHPP P +S+ +Y V+SG+G ++S
Sbjct 190 RPVRNYDTAFFWEGVDQGELRIQKCTGCQTLRHPPAPMCPACQSLEWDYCVASGKGHIYS 249
Query 246 FVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDF-PDWSL 304
+VV+H P++P P + LV+L+EG R++ L G + IG VRA +++ P ++L
Sbjct 250 YVVNHYPEIPPFDYPNPVVLVQLDEGTRLVSNLVGTKARDIKIGTRVRAEFVEVEPGFTL 309
Query 305 YAW 307
+ +
Sbjct 310 HQF 312
Lambda K H
0.319 0.138 0.435
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 533708119992
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40