BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3916c
Length=244
Sequences producing significant alignments:    Score (Bits)    E Value
gi|15611052|ref|NP_218433.1| hypothetical protein Rv3916c [Mycob... 491 3e-137
gi|339296721|gb|AEJ48832.1| hypothetical protein CCDC5079_3643 [... 489 1e-136
gi|342862344|ref|ZP_08718985.1| hypothetical protein MCOL_25758 ... 416 1e-114
gi|296167151|ref|ZP_06849558.1| conserved hypothetical protein [... 416 2e-114
gi|41410440|ref|NP_963276.1| hypothetical protein MAP4342c [Myco... 410 8e-113
gi|118464843|ref|YP_884414.1| hypothetical protein MAV_5304 [Myc... 410 1e-112
gi|240168400|ref|ZP_04747059.1| hypothetical protein MkanA1_0375... 409 1e-112
gi|254818670|ref|ZP_05223671.1| hypothetical protein MintA_02039... 406 2e-111
gi|183985450|ref|YP_001853741.1| hypothetical protein MMAR_5480 ... 405 2e-111
gi|118620071|ref|YP_908403.1| hypothetical protein MUL_5069 [Myc... 398 3e-109
gi|15828463|ref|NP_302726.1| hypothetical protein ML2705 [Mycoba... 379 3e-103
gi|315446818|ref|YP_004079697.1| hypothetical protein Mspyr1_533... 341 6e-92
gi|145221430|ref|YP_001132108.1| hypothetical protein Mflv_0836 ... 340 2e-91
gi|108802364|ref|YP_642561.1| hypothetical protein Mmcs_5405 [My... 338 5e-91
gi|118469409|ref|YP_891130.1| hypothetical protein MSMEG_6936 [M... 332 2e-89
gi|120406999|ref|YP_956828.1| hypothetical protein Mvan_6070 [My... 330 1e-88
gi|333992980|ref|YP_004525594.1| hypothetical protein JDM601_433... 315 3e-84
gi|169632021|ref|YP_001705670.1| hypothetical protein MAB_4948c ... 314 8e-84
gi|54027639|ref|YP_121881.1| hypothetical protein nfa56650 [Noca... 231 5e-59
gi|325677533|ref|ZP_08157197.1| hypothetical protein HMPREF0724_... 218 6e-55
gi|226362885|ref|YP_002780665.1| hypothetical protein ROP_34730 ... 218 6e-55
gi|111020642|ref|YP_703614.1| hypothetical protein RHA1_ro03653 ... 218 7e-55
gi|312142019|ref|YP_004009355.1| hypothetical protein REQ_47370 ... 216 2e-54
gi|226309508|ref|YP_002769470.1| hypothetical protein RER_60230 ... 215 5e-54
gi|229491158|ref|ZP_04384986.1| conserved hypothetical protein [... 214 1e-53
gi|296141899|ref|YP_003649142.1| hypothetical protein Tpau_4235 ... 192 3e-47
gi|343928726|ref|ZP_08768171.1| hypothetical protein GOALK_120_0... 188 7e-46
gi|326383900|ref|ZP_08205584.1| hypothetical protein SCNU_13243 ... 183 2e-44
gi|262204654|ref|YP_003275862.1| hypothetical protein Gbro_4856 ... 178 5e-43
gi|317509419|ref|ZP_07967037.1| hypothetical protein HMPREF9336_... 178 8e-43
gi|325003246|ref|ZP_08124358.1| hypothetical protein PseP1_30967... 176 2e-42
gi|256381065|ref|YP_003104725.1| hypothetical protein Amir_7089 ... 174 1e-41
gi|333922229|ref|YP_004495810.1| hypothetical protein AS9A_4578 ... 173 3e-41
gi|331700383|ref|YP_004336622.1| hypothetical protein Psed_6681 ... 172 3e-41
gi|302870719|ref|YP_003839356.1| hypothetical protein Micau_6287... 172 4e-41
gi|302531347|ref|ZP_07283689.1| conserved hypothetical protein [... 172 5e-41
gi|152968444|ref|YP_001364228.1| hypothetical protein Krad_4505 ... 171 1e-40
gi|271970543|ref|YP_003344739.1| hypothetical protein Sros_9378 ... 170 2e-40
gi|300791155|ref|YP_003771446.1| hypothetical protein AMED_9356 ... 169 3e-40
gi|257057906|ref|YP_003135738.1| acetyltransferase (GNAT) family... 168 6e-40
gi|86743214|ref|YP_483614.1| hypothetical protein Francci3_4539 ... 167 1e-39
gi|238061900|ref|ZP_04606609.1| hypothetical protein MCAG_02866 ... 167 2e-39
gi|330470821|ref|YP_004408564.1| hypothetical protein VAB18032_0... 167 2e-39
gi|258655500|ref|YP_003204656.1| hypothetical protein Namu_5404 ... 166 3e-39
gi|340532855|gb|AEK48060.1| hypothetical protein RAM_47975 [Amyc... 165 6e-39
gi|159040580|ref|YP_001539833.1| hypothetical protein Sare_5100 ... 164 1e-38
gi|269129156|ref|YP_003302526.1| hypothetical protein Tcur_4974 ... 162 3e-38
gi|312200970|ref|YP_004021031.1| GCN5-related N-acetyltransferas... 162 4e-38
gi|254384783|ref|ZP_05000120.1| conserved hypothetical protein [... 162 5e-38
gi|158319048|ref|YP_001511556.1| hypothetical protein Franean1_7... 162 5e-38
>gi|15611052|ref|NP_218433.1| hypothetical protein Rv3916c [Mycobacterium tuberculosis H37Rv]
gi|15843549|ref|NP_338586.1| hypothetical protein MT4035 [Mycobacterium tuberculosis CDC1551]
gi|31795089|ref|NP_857582.1| hypothetical protein Mb3947c [Mycobacterium bovis AF2122/97]
81 more sequence titles
Length=244
Score = 491 bits (1265), Expect = 3e-137, Method: Compositional matrix adjust.
Identities = 243/244 (99%), Positives = 244/244 (100%), Gaps = 0/244 (0%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+SARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC
Sbjct 1 MSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD
Sbjct 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI
Sbjct 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA
Sbjct 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
Query 241 GNTS 244
GNTS
Sbjct 241 GNTS 244
>gi|339296721|gb|AEJ48832.1| hypothetical protein CCDC5079_3643 [Mycobacterium tuberculosis
CCDC5079]
Length=244
Score = 489 bits (1259), Expect = 1e-136, Method: Compositional matrix adjust.
Identities = 242/243 (99%), Positives = 243/243 (100%), Gaps = 0/243 (0%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+SARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC
Sbjct 1 MSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD
Sbjct 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI
Sbjct 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA
Sbjct 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
Query 241 GNT 243
GNT
Sbjct 241 GNT 243
>gi|342862344|ref|ZP_08718985.1| hypothetical protein MCOL_25758 [Mycobacterium colombiense CECT
3035]
gi|342130201|gb|EGT83529.1| hypothetical protein MCOL_25758 [Mycobacterium colombiense CECT
3035]
Length=245
Score = 416 bits (1070), Expect = 1e-114, Method: Compositional matrix adjust.
Identities = 204/235 (87%), Positives = 214/235 (92%), Gaps = 0/235 (0%)
Query 8 LRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQVATAV 67
LRLEAFEQLPKHARRCVFWEVDPA LG DHLADPEFEKEAWLSMVMLEWGSCGQVATAV
Sbjct 3 LRLEAFEQLPKHARRCVFWEVDPATLGNQDHLADPEFEKEAWLSMVMLEWGSCGQVATAV 62
Query 68 PDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLPHSLI 127
PDERSHAEPPCLGYV YAPP AVPRAQRFPT PVSADAVLLTSMGIE G A DDLPH LI
Sbjct 63 PDERSHAEPPCLGYVFYAPPRAVPRAQRFPTGPVSADAVLLTSMGIEPGPAADDLPHGLI 122
Query 128 ARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDANFLM 187
ARVI+ELVRRGVRALEAFGRTPAA++LQ+P V PDVRPVLEA+GDC V+HC+IDA FL
Sbjct 123 ARVIDELVRRGVRALEAFGRTPAASELQDPHLVGPDVRPVLEAVGDCSVDHCVIDAEFLK 182
Query 188 DVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTAGN 242
DVGFVVVAPH YFPRLRLELDKGLGWKAEVEAALERLLENA L++P+ AGST N
Sbjct 183 DVGFVVVAPHTYFPRLRLELDKGLGWKAEVEAALERLLENAHLEQPVGAGSTTAN 237
>gi|296167151|ref|ZP_06849558.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897473|gb|EFG77072.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=283
Score = 416 bits (1068), Expect = 2e-114, Method: Compositional matrix adjust.
Identities = 202/243 (84%), Positives = 217/243 (90%), Gaps = 0/243 (0%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S RIT LRLEAFEQLPKHARRCV+WEVDPA LG DHLADPEFEKEAWLSMVMLEWGSC
Sbjct 1 MSVRITPLRLEAFEQLPKHARRCVYWEVDPATLGNQDHLADPEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQVATA D+RS +EPP LGYV YAPP AVPRAQRFPTAPVSADAVLLTSMGIE GQ +
Sbjct 61 GQVATAATDDRSQSEPPVLGYVFYAPPRAVPRAQRFPTAPVSADAVLLTSMGIEPGQTAE 120
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
DLPH L+ARVI+ELVRRGVRALEAFGRTPAA +LQ+P A PDVRPVLEA+GDC V+HC+
Sbjct 121 DLPHGLLARVIDELVRRGVRALEAFGRTPAAAELQDPLAAGPDVRPVLEAVGDCSVDHCV 180
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
IDA L D GFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENA+LQEP+ AG+ A
Sbjct 181 IDAQLLEDAGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENAQLQEPVGAGTAA 240
Query 241 GNT 243
GNT
Sbjct 241 GNT 243
>gi|41410440|ref|NP_963276.1| hypothetical protein MAP4342c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|254777653|ref|ZP_05219169.1| hypothetical protein MaviaA2_23691 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|41399274|gb|AAS06892.1| hypothetical protein MAP_4342c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336459807|gb|EGO38721.1| hypothetical protein MAPs_46980 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=250
Score = 410 bits (1054), Expect = 8e-113, Method: Compositional matrix adjust.
Identities = 200/242 (83%), Positives = 218/242 (91%), Gaps = 0/242 (0%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+SARIT LRLEAFEQLPKHARRCVFWEVDPA+LG DHLAD EFEKEAWLSMVMLEWG C
Sbjct 1 MSARITPLRLEAFEQLPKHARRCVFWEVDPAVLGNHDHLADAEFEKEAWLSMVMLEWGCC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQVATA+PDERS AEPPCLGYV YAPP AVPRAQRFPT PVSADAVLLTSMGIE G A D
Sbjct 61 GQVATAIPDERSQAEPPCLGYVFYAPPRAVPRAQRFPTGPVSADAVLLTSMGIEPGPAAD 120
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
DLPH+L+ARVI+ELVRRGVRALEAFGRTPAA++LQ+P V PD+RPVLEA+GDC V+HC+
Sbjct 121 DLPHALLARVIDELVRRGVRALEAFGRTPAASELQDPRLVGPDLRPVLEAVGDCSVDHCV 180
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
+DA FL D GFVVVAPH YFPRLRLELDKGLGWKAEVEAALERLLE+ARL++P+ A ST
Sbjct 181 MDAEFLKDAGFVVVAPHTYFPRLRLELDKGLGWKAEVEAALERLLESARLEQPVGAASTP 240
Query 241 GN 242
N
Sbjct 241 AN 242
>gi|118464843|ref|YP_884414.1| hypothetical protein MAV_5304 [Mycobacterium avium 104]
gi|118166130|gb|ABK67027.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=243
Score = 410 bits (1053), Expect = 1e-112, Method: Compositional matrix adjust.
Identities = 200/242 (83%), Positives = 218/242 (91%), Gaps = 0/242 (0%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+SARIT LRLEAFEQLPKHARRCVFWEVDPA+LG DHLAD EFEKEAWLSMVMLEWG C
Sbjct 1 MSARITPLRLEAFEQLPKHARRCVFWEVDPAVLGNHDHLADAEFEKEAWLSMVMLEWGCC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQVATA+PDERS AEPPCLGYV YAPP AVPRAQRFPT PVSADAVLLTSMGIE G A D
Sbjct 61 GQVATAIPDERSQAEPPCLGYVFYAPPRAVPRAQRFPTGPVSADAVLLTSMGIEPGPAAD 120
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
DLPH+L+ARVI+ELVRRGVRALEAFGRTPAA++LQ+P V PD+RPVLEA+GDC V+HC+
Sbjct 121 DLPHALLARVIDELVRRGVRALEAFGRTPAASELQDPRLVGPDLRPVLEAVGDCSVDHCV 180
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
+DA FL D GFVVVAPH YFPRLRLELDKGLGWKAEVEAALERLLE+ARL++P+ A ST
Sbjct 181 MDAEFLKDAGFVVVAPHTYFPRLRLELDKGLGWKAEVEAALERLLESARLEQPVGAASTP 240
Query 241 GN 242
N
Sbjct 241 AN 242
>gi|240168400|ref|ZP_04747059.1| hypothetical protein MkanA1_03757 [Mycobacterium kansasii ATCC
12478]
Length=253
Score = 409 bits (1052), Expect = 1e-112, Method: Compositional matrix adjust.
Identities = 199/235 (85%), Positives = 214/235 (92%), Gaps = 0/235 (0%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+SARITALRLEAFEQLPKHARRCVFWEVDPA LG D HLADPEFEKEAWLSMVMLEWGSC
Sbjct 1 MSARITALRLEAFEQLPKHARRCVFWEVDPATLGNDHHLADPEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQVATA+PDERS AEPPCLGYV YAPP AVPRA RFPTAPVSADAVLLTSMGIERGQA D
Sbjct 61 GQVATAIPDERSDAEPPCLGYVFYAPPRAVPRAHRFPTAPVSADAVLLTSMGIERGQAPD 120
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
DLPHSLIA V++ELVRRGVRALEAFGRT DLQ+PG + P+VRPVLE +GDC V+HC+
Sbjct 121 DLPHSLIAGVVDELVRRGVRALEAFGRTVEVADLQDPGLIDPEVRPVLEVVGDCSVDHCV 180
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIA 235
IDA+FL D+GFVVVAPH YFPRLRLELDKG GWKAEVEAALERLLENA+LQ+P+
Sbjct 181 IDADFLTDMGFVVVAPHRYFPRLRLELDKGFGWKAEVEAALERLLENAQLQQPVG 235
>gi|254818670|ref|ZP_05223671.1| hypothetical protein MintA_02039 [Mycobacterium intracellulare
ATCC 13950]
Length=245
Score = 406 bits (1043), Expect = 2e-111, Method: Compositional matrix adjust.
Identities = 198/235 (85%), Positives = 211/235 (90%), Gaps = 0/235 (0%)
Query 8 LRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQVATAV 67
LRLEAFEQLPKHARRCVFWEVDPA LG DHL DPEFEKEAWLSMVMLEWGSCGQVATA+
Sbjct 3 LRLEAFEQLPKHARRCVFWEVDPATLGNQDHLTDPEFEKEAWLSMVMLEWGSCGQVATAI 62
Query 68 PDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLPHSLI 127
PDERS AEPPCLGYV YAPP AVPRAQRFPT PVSADAV+LTSMGIE G A DDLPH LI
Sbjct 63 PDERSQAEPPCLGYVFYAPPRAVPRAQRFPTGPVSADAVMLTSMGIEPGPAADDLPHGLI 122
Query 128 ARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDANFLM 187
ARVI+ELVRRGVRALEAFGRTPAA++LQ+P V PDVR VLEA+GDC VE C++DA FL
Sbjct 123 ARVIDELVRRGVRALEAFGRTPAASELQDPRLVGPDVRAVLEAVGDCSVERCVMDAEFLK 182
Query 188 DVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTAGN 242
D GFVVVAPH YFPRLRLELDKGLGWKAEVEAALERLLE+A L++P+ AGSTAGN
Sbjct 183 DAGFVVVAPHTYFPRLRLELDKGLGWKAEVEAALERLLESAHLEQPVGAGSTAGN 237
>gi|183985450|ref|YP_001853741.1| hypothetical protein MMAR_5480 [Mycobacterium marinum M]
gi|183178776|gb|ACC43886.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=243
Score = 405 bits (1042), Expect = 2e-111, Method: Compositional matrix adjust.
Identities = 194/240 (81%), Positives = 218/240 (91%), Gaps = 0/240 (0%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+SARITALRLEAFEQLPKHARRCVFWEVDPA LG +DHLADPEFEKEAWLSMVMLEWGSC
Sbjct 1 MSARITALRLEAFEQLPKHARRCVFWEVDPATLGNNDHLADPEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQ+ATA+PDERS AEP CLGYV YAPP AVPRA RFP+ PVSADA+LLTSMGIE G+ +
Sbjct 61 GQIATAIPDERSDAEPACLGYVFYAPPRAVPRAHRFPSGPVSADAILLTSMGIEAGEDTE 120
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
DL HSLIA VI+ELVRRGVRA+EAFGRT AA +LQ+ AVTP+++PVL ALGDC VEHC+
Sbjct 121 DLSHSLIAGVIDELVRRGVRAVEAFGRTAAAAELQDSNAVTPELQPVLAALGDCSVEHCM 180
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
+DA+FL+DVGFVVV PHPYFPRLRLELDKGLGWKAEVEAALERLLENA+LQ+P+ AG+ +
Sbjct 181 LDADFLIDVGFVVVGPHPYFPRLRLELDKGLGWKAEVEAALERLLENAQLQQPVGAGAAS 240
>gi|118620071|ref|YP_908403.1| hypothetical protein MUL_5069 [Mycobacterium ulcerans Agy99]
gi|118572181|gb|ABL06932.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=243
Score = 398 bits (1023), Expect = 3e-109, Method: Compositional matrix adjust.
Identities = 191/240 (80%), Positives = 215/240 (90%), Gaps = 0/240 (0%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+SARITALRLEAFEQLP HARRCVFWEVDPA LG +DHLADPEFEKEAWLSMVMLEWGSC
Sbjct 1 MSARITALRLEAFEQLPNHARRCVFWEVDPATLGNNDHLADPEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQ+ATA+PDERS AEP CLGYV YAPP AVPRA RFP+ PVSADA+LLTSMGIE G+ D
Sbjct 61 GQIATAIPDERSDAEPACLGYVFYAPPRAVPRAHRFPSGPVSADAILLTSMGIEAGEDTD 120
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
DL HSLIA VI+ELVRRGVRA+EAFGRT AA +LQ+ A TP+++PVL ALGDC VEHC+
Sbjct 121 DLSHSLIAGVIDELVRRGVRAVEAFGRTTAAAELQDSNAATPELQPVLAALGDCSVEHCM 180
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
+DA+FL+DVGFVVV PHPYFPRLRLELDK LGWKAEVEAALERLLENA+L++P+ AG+ +
Sbjct 181 LDADFLIDVGFVVVGPHPYFPRLRLELDKRLGWKAEVEAALERLLENAQLRQPVGAGAAS 240
>gi|15828463|ref|NP_302726.1| hypothetical protein ML2705 [Mycobacterium leprae TN]
gi|221230940|ref|YP_002504356.1| hypothetical protein MLBr_02705 [Mycobacterium leprae Br4923]
gi|886317|gb|AAB53133.1| L222-ORF1; putative [Mycobacterium leprae]
gi|13093893|emb|CAC32237.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219934047|emb|CAR72805.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=259
Score = 379 bits (972), Expect = 3e-103, Method: Compositional matrix adjust.
Identities = 190/250 (76%), Positives = 207/250 (83%), Gaps = 6/250 (2%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+SA+IT LRLEAFEQLPKHARRCVFWEVDPA LG DHL D EFEKEAWLSMVMLEWGSC
Sbjct 1 MSAQITPLRLEAFEQLPKHARRCVFWEVDPATLGNQDHLVDLEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDE------RSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIE 114
GQVATA+ DE H EPPCLGY+LYAPP VPRA RFPTAPVSADAVLLTSMG+E
Sbjct 61 GQVATAIMDECRQSDAFKHLEPPCLGYMLYAPPRVVPRAYRFPTAPVSADAVLLTSMGVE 120
Query 115 RGQADDDLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDC 174
GQ LP SLI++VI+ELVRRGVRALEAFGRT AT+LQ+P V PDVRPVLEALGDC
Sbjct 121 PGQVAAGLPQSLISQVIDELVRRGVRALEAFGRTEVATELQDPRTVAPDVRPVLEALGDC 180
Query 175 CVEHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPI 234
V+HCII A+FL VGFVVVAPH YFPRLRLELDKG GWKAEVEAALERLL +A+LQ+P+
Sbjct 181 SVDHCIIAADFLKAVGFVVVAPHQYFPRLRLELDKGFGWKAEVEAALERLLADAQLQQPV 240
Query 235 AAGSTAGNTS 244
AG+ S
Sbjct 241 GAGAVVKQHS 250
>gi|315446818|ref|YP_004079697.1| hypothetical protein Mspyr1_53380 [Mycobacterium sp. Spyr1]
gi|315265121|gb|ADU01863.1| hypothetical protein Mspyr1_53380 [Mycobacterium sp. Spyr1]
Length=249
Score = 341 bits (874), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 170/248 (69%), Positives = 197/248 (80%), Gaps = 7/248 (2%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
++ RIT LRLEAFEQLPKHARRCV+WEVDP + G D LADPEFEKEAWLSMVMLEWGSC
Sbjct 1 MATRITPLRLEAFEQLPKHARRCVYWEVDPPVGGGGDQLADPEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERSHAEP-------PCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGI 113
GQ+A + S EP PCLGYV YAPP +VPRA RFPT PVSADAVLLT++GI
Sbjct 61 GQLAVECRTDPSDGEPLPVADDDPCLGYVFYAPPRSVPRAVRFPTGPVSADAVLLTTLGI 120
Query 114 ERGQADDDLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGD 173
E GQ D LPH+LIA V+ +LVRRGVRALEAFGRT AA++L +V DV PV EALGD
Sbjct 121 ESGQNSDTLPHTLIAAVVADLVRRGVRALEAFGRTAAASELTGLPSVPQDVLPVTEALGD 180
Query 174 CCVEHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEP 233
C VE C++DA+ LMD GFVVV+ H YFPRLRLEL++GLGWKA VEAALERLLE+A+L++P
Sbjct 181 CSVEQCVLDADLLMDAGFVVVSHHTYFPRLRLELEQGLGWKAGVEAALERLLESAQLEQP 240
Query 234 IAAGSTAG 241
+ AG+ G
Sbjct 241 VGAGAGVG 248
>gi|145221430|ref|YP_001132108.1| hypothetical protein Mflv_0836 [Mycobacterium gilvum PYR-GCK]
gi|145213916|gb|ABP43320.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=249
Score = 340 bits (871), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 169/248 (69%), Positives = 197/248 (80%), Gaps = 7/248 (2%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
++ RIT LRLEAFEQLPKHARRCV+WEVDP + G D LADPEFEKEAWLSMVMLEWGSC
Sbjct 1 MATRITPLRLEAFEQLPKHARRCVYWEVDPPVGGGGDQLADPEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERSHAEP-------PCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGI 113
GQ+A + S +P PCLGYV YAPP +VPRA RFPT PVSADAVLLT++GI
Sbjct 61 GQLAVECRTDPSDGDPLPVADDDPCLGYVFYAPPRSVPRAVRFPTGPVSADAVLLTTLGI 120
Query 114 ERGQADDDLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGD 173
E GQ D L H+LIA V+ +LVRRGVRALEAFGRT AA++L +V DV PV EALGD
Sbjct 121 ESGQNSDTLAHTLIAAVVADLVRRGVRALEAFGRTAAASELTGLPSVPQDVLPVTEALGD 180
Query 174 CCVEHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEP 233
C VE C++DA+ LMD GFVVV+ HPYFPRLRLEL++GLGWKA VEAALERLLE+A+L++P
Sbjct 181 CSVEQCVLDADLLMDAGFVVVSHHPYFPRLRLELEQGLGWKAGVEAALERLLESAQLEQP 240
Query 234 IAAGSTAG 241
+ AG+ G
Sbjct 241 VGAGAGVG 248
>gi|108802364|ref|YP_642561.1| hypothetical protein Mmcs_5405 [Mycobacterium sp. MCS]
gi|119871517|ref|YP_941469.1| hypothetical protein Mkms_5494 [Mycobacterium sp. KMS]
gi|126438344|ref|YP_001074035.1| hypothetical protein Mjls_5781 [Mycobacterium sp. JLS]
gi|108772783|gb|ABG11505.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119697606|gb|ABL94679.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126238144|gb|ABO01545.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=249
Score = 338 bits (867), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 166/243 (69%), Positives = 197/243 (82%), Gaps = 8/243 (3%)
Query 4 RITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQV 63
RIT LRLEAFEQLPKHARRCVFWEVDP+ LG++DHL+DPEFEKEAWLSMVMLEWGSCGQV
Sbjct 4 RITPLRLEAFEQLPKHARRCVFWEVDPSTLGREDHLSDPEFEKEAWLSMVMLEWGSCGQV 63
Query 64 ATAVPDERS--------HAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIER 115
A P+ S AE PC+GY YAPP AVPRA+ FPT PVSADAVLLT++G+E+
Sbjct 64 AVRCPEAMSDEAAATDPSAEEPCVGYAFYAPPRAVPRARLFPTGPVSADAVLLTTVGVEQ 123
Query 116 GQADDDLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCC 175
G LPH+L+ V+ +LVRRGVRALEAFGRT AA +L +P V ++ PV+EALGDC
Sbjct 124 GDDTTGLPHTLLTSVVGDLVRRGVRALEAFGRTEAAAELIDPRLVPDELTPVVEALGDCS 183
Query 176 VEHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIA 235
V C++DA+FL VGF VV+PH YFPRLRLEL++GLGWKA+VEAALERLLE+A+LQ+P+
Sbjct 184 VHQCMLDADFLEQVGFTVVSPHRYFPRLRLELEQGLGWKADVEAALERLLESAQLQQPVG 243
Query 236 AGS 238
AGS
Sbjct 244 AGS 246
>gi|118469409|ref|YP_891130.1| hypothetical protein MSMEG_6936 [Mycobacterium smegmatis str.
MC2 155]
gi|118170696|gb|ABK71592.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=250
Score = 332 bits (852), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 162/247 (66%), Positives = 194/247 (79%), Gaps = 9/247 (3%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S RIT LRLE FEQLPKHARRCVFWEVDP+ + +DHLADPEFEKEAWLSMVMLEWGSC
Sbjct 1 MSTRITPLRLEGFEQLPKHARRCVFWEVDPSTVAGEDHLADPEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERS---------HAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSM 111
GQ+A P R + PCLGY YAPP++VPRA+ FPTAPVSADA+LLT++
Sbjct 61 GQLAVQAPRGRDLEDDLDAVITGDEPCLGYAFYAPPASVPRARLFPTAPVSADAILLTTV 120
Query 112 GIERGQADDDLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEAL 171
G++ + +D+ L++ VI +LVRRGVRALEAF TPA T+L + A+ P++ PV++ L
Sbjct 121 GVDSAECAEDMSAGLLSAVITDLVRRGVRALEAFAYTPALTELDDLAALPPELAPVVKVL 180
Query 172 GDCCVEHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQ 231
GDC V C++DA FL DVGF VVAPHPYFPRLRLELDKGLGWKAEVEAALERLLE+ARL+
Sbjct 181 GDCTVGQCMLDAGFLTDVGFTVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLESARLE 240
Query 232 EPIAAGS 238
P+ AGS
Sbjct 241 APVGAGS 247
>gi|120406999|ref|YP_956828.1| hypothetical protein Mvan_6070 [Mycobacterium vanbaalenii PYR-1]
gi|119959817|gb|ABM16822.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=246
Score = 330 bits (846), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 164/244 (68%), Positives = 194/244 (80%), Gaps = 6/244 (2%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
++ARIT LRLEAFEQLPKHARRCV+WEVDP I+ + DHL+DPEFEKEAWLSMVMLEWGSC
Sbjct 1 MAARITPLRLEAFEQLPKHARRCVYWEVDPGIVDRGDHLSDPEFEKEAWLSMVMLEWGSC 60
Query 61 GQV-----ATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIER 115
GQ+ TA E EP CLGY YAPP +VPRA RFPT PVSADAVLLT++GIE
Sbjct 61 GQLVVEHRGTAAVGEDPGDEP-CLGYAFYAPPRSVPRAGRFPTGPVSADAVLLTTLGIEP 119
Query 116 GQADDDLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCC 175
GQ +L SLI V+ +LVRRGVRALEAFGRT A DL + +V DVRPV+E LGDC
Sbjct 120 GQGSAELSQSLITAVVGDLVRRGVRALEAFGRTSAVDDLTDRASVPADVRPVMETLGDCS 179
Query 176 VEHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIA 235
VE C++DA+ LMD GFVVV+ H YFPRLRLEL++GLGWKA VEAALE LL++A+L++P+
Sbjct 180 VEQCVLDADLLMDAGFVVVSHHAYFPRLRLELEQGLGWKAGVEAALELLLQSAQLEQPVG 239
Query 236 AGST 239
AG++
Sbjct 240 AGTS 243
>gi|333992980|ref|YP_004525594.1| hypothetical protein JDM601_4339 [Mycobacterium sp. JDM601]
gi|333488947|gb|AEF38339.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=238
Score = 315 bits (808), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 159/217 (74%), Positives = 175/217 (81%), Gaps = 0/217 (0%)
Query 28 VDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQVATAVPDERSHAEPPCLGYVLYAPP 87
+DPA LG+DDHL+DPEFEKEAWLSMVMLEWG CGQVAT E PCLGYVLYAPP
Sbjct 1 MDPATLGRDDHLSDPEFEKEAWLSMVMLEWGCCGQVATPSAAAGGADESPCLGYVLYAPP 60
Query 88 SAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLPHSLIARVIEELVRRGVRALEAFGR 147
AVPRA RFPTAPVSADAVLLTS+G+E D LP LIA +EEL+RRGVRALEAFGR
Sbjct 61 RAVPRAHRFPTAPVSADAVLLTSIGVEPAPMADGLPRELIAGAVEELIRRGVRALEAFGR 120
Query 148 TPAATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDANFLMDVGFVVVAPHPYFPRLRLEL 207
T A DL +P V PDV PVLEA+GDC VEHCII+A+FL DVGF VVAPH YFPRLRLEL
Sbjct 121 TAAVGDLLDPRNVPPDVAPVLEAVGDCTVEHCIIEADFLTDVGFTVVAPHRYFPRLRLEL 180
Query 208 DKGLGWKAEVEAALERLLENARLQEPIAAGSTAGNTS 244
DKGLGWKAEVEAALERLLE+A+L P+ A + AG+ S
Sbjct 181 DKGLGWKAEVEAALERLLESAQLHAPVGASAPAGSVS 217
>gi|169632021|ref|YP_001705670.1| hypothetical protein MAB_4948c [Mycobacterium abscessus ATCC
19977]
gi|169243988|emb|CAM65016.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=238
Score = 314 bits (804), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 159/241 (66%), Positives = 180/241 (75%), Gaps = 6/241 (2%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+SARI LRL+ FEQLPKHARRCVFWEVDPA +G HL+DPEFEKEAWLSMVMLEWGSC
Sbjct 1 MSARIVPLRLDGFEQLPKHARRCVFWEVDPATVGDGQHLSDPEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQVA P R P GY LYAPP VPRA+ FPTAPVSADA+LLTS+G+E G D
Sbjct 61 GQVAVTGPQSR----PTTAGYALYAPPGVVPRARLFPTAPVSADAILLTSLGVEPGHESD 116
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
LPHS+IA V+ ELVRRGVRALEAFGRT A DL P + LG+C +E C+
Sbjct 117 GLPHSIIANVVAELVRRGVRALEAFGRTAEALDLCEGSLARHSEAP--DVLGECTIEQCM 174
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
ID +FL DVGF VVAPH +FPRLRLELD+GLGWKAEVEAALERLLE+ ++ + AG
Sbjct 175 IDVDFLKDVGFTVVAPHQHFPRLRLELDRGLGWKAEVEAALERLLESVQIPQHAGAGPVV 234
Query 241 G 241
G
Sbjct 235 G 235
>gi|54027639|ref|YP_121881.1| hypothetical protein nfa56650 [Nocardia farcinica IFM 10152]
gi|54019147|dbj|BAD60517.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=255
Score = 231 bits (590), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 128/248 (52%), Positives = 156/248 (63%), Gaps = 18/248 (7%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
VS +TAL L+ ++LP HARRCVFWE+DPA+ +DP FEKEAWLS VMLEWGSC
Sbjct 15 VSTSVTALTLDGLDKLPAHARRCVFWEIDPAVAADSHGFSDPVFEKEAWLSTVMLEWGSC 74
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQVA H + G LY+PP+AVPRA FPT+PVS DA+LLT++ E DD
Sbjct 75 GQVA--------HVDGKAAGCALYSPPTAVPRATLFPTSPVSPDAILLTTLCTEPAHRDD 126
Query 121 DLPHSLIARVIEELVRRGVRALEAFG--RTPAATDLQNPGA--------VTPDVRPVLEA 170
D+ H L+ V+ +LVRRGVRALEAFG PA+ L + A + VR
Sbjct 127 DIAHRLLQAVVSDLVRRGVRALEAFGIRSGPASKPLSDRLAGSMRLMERIGGPVRGKSAP 186
Query 171 LGDCCVEHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARL 230
DC E C+I+A+ L D GF VVAPH FPRLRLELD GWK +VE AL++LL A L
Sbjct 187 SADCSPETCMIEADLLEDFGFEVVAPHHRFPRLRLELDSDHGWKEDVERALDQLLAAASL 246
Query 231 QEPIAAGS 238
P AG+
Sbjct 247 TVPTRAGA 254
>gi|325677533|ref|ZP_08157197.1| hypothetical protein HMPREF0724_14980 [Rhodococcus equi ATCC
33707]
gi|325551780|gb|EGD21478.1| hypothetical protein HMPREF0724_14980 [Rhodococcus equi ATCC
33707]
Length=242
Score = 218 bits (555), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 128/235 (55%), Positives = 155/235 (66%), Gaps = 23/235 (9%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILG---KDDHLADPEFEKEAWLSMVMLEW 57
VS +T+L L+ ++L HARRCVFWE DPA + + + DPEFEKEAWLSMVML+W
Sbjct 14 VSTSVTSLTLDGLDKLSSHARRCVFWETDPAAVRAARETGNFYDPEFEKEAWLSMVMLQW 73
Query 58 GSCGQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQ 117
GSCGQVA V D+ + G LYAPPS VPRA FPT+PVSADAVLLT+M +E
Sbjct 74 GSCGQVAM-VDDKPA-------GCALYAPPSMVPRADLFPTSPVSADAVLLTTMRLEPIG 125
Query 118 ADDDLPHSLIARVIEELVRRGVRALEAFG-RTPAATDLQNPGAVTPDVRPVLEALGDCCV 176
+ L +LI + +LVRRGVRALEAFG R A +D+ P A DC
Sbjct 126 DEHGLGATLIQAAVGDLVRRGVRALEAFGIRGEAPSDV-----------PTATAALDCSP 174
Query 177 EHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQ 231
+ C+I A+FL DVGF V+APH FPRLRLELD+ WKA+VEAAL+RLLE A L
Sbjct 175 QECMISADFLEDVGFEVIAPHHRFPRLRLELDRDHLWKADVEAALDRLLEVAALS 229
>gi|226362885|ref|YP_002780665.1| hypothetical protein ROP_34730 [Rhodococcus opacus B4]
gi|226241372|dbj|BAH51720.1| hypothetical protein [Rhodococcus opacus B4]
Length=232
Score = 218 bits (555), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 120/231 (52%), Positives = 149/231 (65%), Gaps = 12/231 (5%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S +T+L L+ ++L HARRCVFWE+DPA + D EFEKEAWLSMVMLEWGSC
Sbjct 1 MSTHVTSLTLDGLDKLSAHARRCVFWEMDPAAIHSSRGFCDQEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQV AV D + +G LYAPP VPRAQ PTAPV ADAVLLTS+ +E +
Sbjct 61 GQV--AVMDGKP------VGSALYAPPRTVPRAQLLPTAPVGADAVLLTSLRLEPAGEEQ 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
+L +LI V+ +LVRRGVRALEAFG + + G + A +C E C+
Sbjct 113 NLGTTLIQAVVADLVRRGVRALEAFG----IRNTEETGPIDTHGVASATAARECSPEECM 168
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQ 231
I A+FL D GF +VAPH FPRLRLEL++ GWK +VEAALERL+ A +
Sbjct 169 IPADFLEDNGFEIVAPHHRFPRLRLELNRDHGWKEDVEAALERLIHTASVS 219
>gi|111020642|ref|YP_703614.1| hypothetical protein RHA1_ro03653 [Rhodococcus jostii RHA1]
gi|110820172|gb|ABG95456.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=232
Score = 218 bits (555), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 120/231 (52%), Positives = 148/231 (65%), Gaps = 12/231 (5%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S +T+L L+ ++L HARRCVFWE+DPA + D EFEKEAWLSMVMLEWGSC
Sbjct 1 MSTHVTSLTLDGLDKLSAHARRCVFWEMDPAAIHSSRGFCDQEFEKEAWLSMVMLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQV AV D + +G LYAPP VPRAQ PTAPV ADAVLLTS+ +E +
Sbjct 61 GQV--AVMDGKP------VGSALYAPPRTVPRAQLLPTAPVGADAVLLTSLRLEPAGEEQ 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
+L +LI V+ +LVRRGVRALEAFG + G + A +C E C+
Sbjct 113 NLGTTLIQAVVADLVRRGVRALEAFG----IRNTDGTGPIDTHGMASATAARECSPEECM 168
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQ 231
I A+FL D GF +VAPH FPRLRLEL++ GWK +VEAALERL+ A +
Sbjct 169 IPADFLEDNGFEIVAPHHRFPRLRLELNRDHGWKEDVEAALERLIHTATVS 219
>gi|312142019|ref|YP_004009355.1| hypothetical protein REQ_47370 [Rhodococcus equi 103S]
gi|311891358|emb|CBH50679.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=229
Score = 216 bits (551), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/235 (54%), Positives = 155/235 (66%), Gaps = 23/235 (9%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILG---KDDHLADPEFEKEAWLSMVMLEW 57
+S +T+L L+ ++L HARRCVFWE DPA + + + DPEFEKEAWLSMVML+W
Sbjct 1 MSTSVTSLTLDGLDKLSSHARRCVFWETDPAAVRAARETGNFYDPEFEKEAWLSMVMLQW 60
Query 58 GSCGQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQ 117
GSCGQVA V D+ + G LYAPPS VPRA FPT+PVSADAVLLT+M +E
Sbjct 61 GSCGQVAM-VDDKPA-------GCALYAPPSMVPRADLFPTSPVSADAVLLTTMRLEPIG 112
Query 118 ADDDLPHSLIARVIEELVRRGVRALEAFG-RTPAATDLQNPGAVTPDVRPVLEALGDCCV 176
+ L +LI + +LVRRGVRALEAFG R A +D+ P A DC
Sbjct 113 DEHGLGATLIQAAVGDLVRRGVRALEAFGIRGEAPSDV-----------PTATAALDCSP 161
Query 177 EHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQ 231
+ C+I A+FL DVGF ++APH FPRLRLELD+ WKA+VEAAL+RLLE A L
Sbjct 162 QECMISADFLEDVGFEMIAPHHRFPRLRLELDRDHLWKADVEAALDRLLEVAALS 216
>gi|226309508|ref|YP_002769470.1| hypothetical protein RER_60230 [Rhodococcus erythropolis PR4]
gi|226188627|dbj|BAH36731.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=227
Score = 215 bits (547), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 124/227 (55%), Positives = 152/227 (67%), Gaps = 15/227 (6%)
Query 5 ITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQVA 64
IT+L L++ +QL HARRCVFWE+DP L D EFEKEAWLSMVMLEWGSCGQV
Sbjct 6 ITSLTLDSLDQLSAHARRCVFWEMDPGALHDARGFCDQEFEKEAWLSMVMLEWGSCGQV- 64
Query 65 TAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLPH 124
AV D + LG LYAPP +PRAQ FPT+PVS+DAVLLTS+ +E +++L
Sbjct 65 -AVRDGKP------LGSALYAPPRMIPRAQLFPTSPVSSDAVLLTSLRLEPSGIEEELGP 117
Query 125 SLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDAN 184
SL+A V+ +LVRRGVRALEAFG +D P + DV AL +C C+I A
Sbjct 118 SLLAAVVTDLVRRGVRALEAFG---IRSDDLGPAS---DVASATAAL-ECSPAECMISAE 170
Query 185 FLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQ 231
FL D GF VVAPH +PRLRLEL++ WK +VE AL+RLL+ A L+
Sbjct 171 FLEDYGFEVVAPHHRYPRLRLELNRDHEWKVDVEEALDRLLKAAALE 217
>gi|229491158|ref|ZP_04384986.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229321896|gb|EEN87689.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=227
Score = 214 bits (544), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 124/229 (55%), Positives = 151/229 (66%), Gaps = 19/229 (8%)
Query 5 ITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQVA 64
IT+L L++ +QL HARRCVFWE+DP L D EFEKEAWLSMVMLEWGSCGQV
Sbjct 6 ITSLTLDSLDQLSAHARRCVFWEMDPGALHDARGFCDQEFEKEAWLSMVMLEWGSCGQV- 64
Query 65 TAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLPH 124
AV D + LG LYAPP +PRAQ FPT+PVS+DAVLLTS+ +E +++L
Sbjct 65 -AVRDGKP------LGSALYAPPRMIPRAQLFPTSPVSSDAVLLTSLRLEPSGIEEELGP 117
Query 125 SLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTP--DVRPVLEALGDCCVEHCIID 182
SL+A V+ +LVRRGVRALEAFG G + P DV AL +C C+I
Sbjct 118 SLLAAVVTDLVRRGVRALEAFG--------IRSGDLGPASDVASATAAL-ECSPAECMIS 168
Query 183 ANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQ 231
A FL D GF VVAPH +PRLRLEL++ WK +VE AL+RLL+ A L+
Sbjct 169 AEFLEDYGFEVVAPHHRYPRLRLELNRDHEWKVDVEEALDRLLKAAALE 217
>gi|296141899|ref|YP_003649142.1| hypothetical protein Tpau_4235 [Tsukamurella paurometabola DSM
20162]
gi|296030033|gb|ADG80803.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=239
Score = 192 bits (489), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 115/222 (52%), Positives = 139/222 (63%), Gaps = 16/222 (7%)
Query 5 ITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQVA 64
I L L F+ LPKH RRCV+WEV P + + L D EF+KEAWLSM+MLEWGSCGQVA
Sbjct 5 IVPLTLGGFDDLPKHVRRCVYWEVAP----EAETLMDTEFDKEAWLSMLMLEWGSCGQVA 60
Query 65 TA-VPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLP 123
A PD S +G YAPP +VPRA FPTAPVS DAVLLT +G E G ++ +
Sbjct 61 IAHAPDGTSR----FVGVAFYAPPRSVPRAGTFPTAPVSPDAVLLTWVGAEPG-VEERVR 115
Query 124 HSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDA 183
L+ V +LVRRGVRA+EAFG L G T V ++ C + DA
Sbjct 116 EELVTAVCTDLVRRGVRAVEAFGL------LTPVGQSTESVAAQIDCGACGCKTAPLTDA 169
Query 184 NFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLL 225
+FL +GF VAPH +PR+RLEL +GLGWKA VE ALE+LL
Sbjct 170 DFLERMGFETVAPHHRYPRMRLELSEGLGWKAGVEHALEQLL 211
>gi|343928726|ref|ZP_08768171.1| hypothetical protein GOALK_120_01540 [Gordonia alkanivorans NBRC
16433]
gi|343761475|dbj|GAA15097.1| hypothetical protein GOALK_120_01540 [Gordonia alkanivorans NBRC
16433]
Length=286
Score = 188 bits (477), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 116/282 (42%), Positives = 155/282 (55%), Gaps = 50/282 (17%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S I L L +FE LP H RRCVFWEV+P G+ + EF+KEAW+S ++LEWG+C
Sbjct 1 MSVSIVRLELGSFESLPHHTRRCVFWEVEPTTNGES---YESEFDKEAWISGLLLEWGAC 57
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
GQVA +G YAPP+ VPR+Q FPT+PVS DAVLLTS+ E G +
Sbjct 58 GQVAI------ESTTNSVIGTAFYAPPNRVPRSQHFPTSPVSHDAVLLTSIRTEPGH--E 109
Query 121 DLPHSLIARVIEELVRRGVRALEAFG----------------RTPAAT------------ 152
++ L+ V+ +L+RRGVRA+E+FG TP+ +
Sbjct 110 EVATILLDAVVGDLIRRGVRAVESFGLVRNGAGGAEFGSTLGATPSGSSETGGAVAGGGL 169
Query 153 ---DLQNPGAVTPDVRPVLE-ALGDCCVEHCIIDANFLMDVGFVVVAPHPYFPRLRLELD 208
D + R +LE + D C C+IDA+FL + GF VV+ H FPR RLELD
Sbjct 170 ADLDFWTDEEIIEVAREILEDSQADLCTT-CMIDASFLKNSGFDVVSSHHRFPRFRLELD 228
Query 209 KGLGWKAEVEAALERLLENA------RLQEPIAAGSTAGNTS 244
+GLGWK EVE+ALE+L+ A R + + GS G S
Sbjct 229 QGLGWKFEVESALEKLVVMAEIDLIGRQRTAVPVGSGRGRVS 270
>gi|326383900|ref|ZP_08205584.1| hypothetical protein SCNU_13243 [Gordonia neofelifaecis NRRL
B-59395]
gi|326197359|gb|EGD54549.1| hypothetical protein SCNU_13243 [Gordonia neofelifaecis NRRL
B-59395]
Length=253
Score = 183 bits (464), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 113/239 (48%), Positives = 147/239 (62%), Gaps = 31/239 (12%)
Query 5 ITALRLEAFEQLPKHARRCVFWEVD-----PAI-------LGKDDHLA-DPEFEKEAWLS 51
+ L LE FE LP H+RRCVFWEVD PAI G+ D + + EF+KEAWLS
Sbjct 5 VVPLDLENFETLPLHSRRCVFWEVDRAGGSPAIDAVIADAGGRIDAVGPESEFDKEAWLS 64
Query 52 MVMLEWGSCGQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSM 111
+MLEWG C QVA ER +G Y+PP VPRAQ FPTAPV ADAVLLT++
Sbjct 65 GLMLEWGVCCQVAVESSTER------VVGAAFYSPPGRVPRAQHFPTAPVGADAVLLTTI 118
Query 112 GIERGQADDDLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEAL 171
+E G + +S++ V+ +LVRRGVRA+EAFG + + ++ D+ +L
Sbjct 119 RMEPGFESE--ANSVLDAVVADLVRRGVRAVEAFGFS------GDDESLAMDLVTLLLGS 170
Query 172 G---DCCVEHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLEN 227
G D C CI+ + L + GF VVA PY PRLRLELD+GLGWK++VE AL +L+E+
Sbjct 171 GLAADVC-RRCILPTDLLTNFGFEVVAEDPYLPRLRLELDEGLGWKSQVERALRKLVES 228
>gi|262204654|ref|YP_003275862.1| hypothetical protein Gbro_4856 [Gordonia bronchialis DSM 43247]
gi|262088001|gb|ACY23969.1| hypothetical protein Gbro_4856 [Gordonia bronchialis DSM 43247]
Length=261
Score = 178 bits (452), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 111/249 (45%), Positives = 144/249 (58%), Gaps = 39/249 (15%)
Query 8 LRLEAFEQLPKHARRCVFWEVDPAILGK---DDHLAD-----PEFEKEAWLSMVMLEWGS 59
L L++FE LP H RRCVFWEVDPA + D AD EF+KEAW+S ++LEWG+
Sbjct 3 LDLDSFESLPLHTRRCVFWEVDPANSNRSSADAVFADLGSFESEFDKEAWISGLLLEWGT 62
Query 60 CGQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQAD 119
CGQVA +G YAPP+ VPR+ FPT+PVS DAVLLTS+ E G
Sbjct 63 CGQVAI------DSTTKTVVGTAFYAPPNRVPRSVAFPTSPVSHDAVLLTSIRTEPGH-- 114
Query 120 DDLPHSLIARVIEELVRRGVRALEAFG---------------------RTPAATDLQ--N 156
++ L+ V+ +L+RRGVRA+EAFG P+A +L+
Sbjct 115 EEAATLLLDAVLADLIRRGVRAVEAFGLVRGGSPTPDQQSSEARRAPEELPSALELEAWT 174
Query 157 PGAVTPDVRPVLEALGDCCVEHCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAE 216
++ R +L+ D C+IDA FL D F VV+ HP FPR RLELD+GLGWK E
Sbjct 175 DQSIVDVAREILDGPMDGLCTACMIDAGFLKDSAFDVVSSHPRFPRFRLELDEGLGWKFE 234
Query 217 VEAALERLL 225
VE+ALE+L+
Sbjct 235 VESALEKLV 243
>gi|317509419|ref|ZP_07967037.1| hypothetical protein HMPREF9336_03409 [Segniliparus rugosus ATCC
BAA-974]
gi|316252248|gb|EFV11700.1| hypothetical protein HMPREF9336_03409 [Segniliparus rugosus ATCC
BAA-974]
Length=223
Score = 178 bits (451), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 107/238 (45%), Positives = 137/238 (58%), Gaps = 26/238 (10%)
Query 4 RITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQV 63
R+ L LE LPKHAR+C+FWE D GK D FEKEAWLS V+L+WG+CGQ+
Sbjct 11 RVVPLTLERSALLPKHARQCLFWEFDSKT-GKQIEGFDAGFEKEAWLSSVLLQWGTCGQL 69
Query 64 ATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLP 123
A DE E +G + YAPPS V RA FPTAPVS DAVL+T G++ G +
Sbjct 70 AVVGEDE----EERGVGQICYAPPSMVSRAAEFPTAPVSHDAVLVTYAGVDEGHDFAAIG 125
Query 124 HSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDA 183
L+ I +L RRGVRA+EAFGR AA D R + C+
Sbjct 126 QRLLLASIADLARRGVRAIEAFGREEAA---------EADERTM----------RCVNPT 166
Query 184 NFLMDVGFVVVAPHPYFPRLRLEL-DKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
+F + GF V A H ++PRLRLE+ GL W+A VEAAL L+E AR++ P+ G+T+
Sbjct 167 DFFLGGGFTVAAAHKHYPRLRLEIGGSGLLWRASVEAALAELVEEARVR-PVLVGATS 223
>gi|325003246|ref|ZP_08124358.1| hypothetical protein PseP1_30967 [Pseudonocardia sp. P1]
Length=239
Score = 176 bits (446), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/226 (45%), Positives = 131/226 (58%), Gaps = 31/226 (13%)
Query 5 ITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQVA 64
+ AL L+ LPK R CV+WE+ PA+ + + + EKEAWLS V+LEWGSCG+V
Sbjct 1 MAALNLDNLGDLPKRCRNCVYWELSPALADQAEGYGTTDLEKEAWLSEVLLEWGSCGRVV 60
Query 65 TAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLPH 124
+ GYVL+APP++VPRA PT PVSADAVLLT+M + A + L
Sbjct 61 --------YVGGAPAGYVLFAPPASVPRATEMPTGPVSADAVLLTTMQVLPEFAGEGLGR 112
Query 125 SLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDAN 184
+L V++EL RRGV+A+EAFG D RP EA C++ A+
Sbjct 113 ALAQAVVKELTRRGVKAVEAFG----------------DARPGTEA-------DCVMPAD 149
Query 185 FLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARL 230
FL VGF V PH +PRLR+EL GL WK++VEAALE+L +
Sbjct 150 FLRSVGFKTVRPHHRWPRLRMELRSGLEWKSDVEAALEQLFNTVTI 195
>gi|256381065|ref|YP_003104725.1| hypothetical protein Amir_7089 [Actinosynnema mirum DSM 43827]
gi|255925368|gb|ACU40879.1| hypothetical protein Amir_7089 [Actinosynnema mirum DSM 43827]
Length=210
Score = 174 bits (441), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 99/227 (44%), Positives = 129/227 (57%), Gaps = 30/227 (13%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S R+ + L+ E L KH R CVFWE+ P + + + D EFEKEAW+S V+LEWGSC
Sbjct 1 MSRRVVGVTLDNLEHLSKHGRTCVFWELAPHLKEQAEEFGDTEFEKEAWVSSVLLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++ + + G V YAPP+AVPR+ FPT+PVS DAVL+TS+ +
Sbjct 61 GRII--------YCDGIPAGSVFYAPPAAVPRSLAFPTSPVSPDAVLMTSLEVLPEFRGG 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L L+ V ++L RRGV+A+EAFG + D P VTP
Sbjct 113 GLARVLVQGVAKDLTRRGVKAIEAFGDNQPSED--KPSCVTP------------------ 152
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLEN 227
A+FL+ VGF V PHP +PRLRLEL WK +VEAALERLL
Sbjct 153 --ADFLLQVGFKTVRPHPRWPRLRLELRSASSWKEDVEAALERLLNT 197
>gi|333922229|ref|YP_004495810.1| hypothetical protein AS9A_4578 [Amycolicicoccus subflavus DQS3-9A1]
gi|333484450|gb|AEF43010.1| hypothetical protein AS9A_4578 [Amycolicicoccus subflavus DQS3-9A1]
Length=207
Score = 173 bits (438), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 97/204 (48%), Positives = 124/204 (61%), Gaps = 7/204 (3%)
Query 28 VDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQVATAVPDERSHAEPPCLGYVLYAPP 87
+DP + D E EKEAWLS V+L WGSCGQ+ E +H P G LYAPP
Sbjct 1 MDPGAVIDTQAFCDTELEKEAWLSSVLLNWGSCGQLLYLNQGEAAHL-PKVSGCALYAPP 59
Query 88 SAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLPHSLIARVIEELVRRGVRALEAFGR 147
S VPRA FPT+PVSADA+LLT++ ++ + LI V+++L++RGVRA+EAFG
Sbjct 60 SVVPRAGLFPTSPVSADAILLTTLYVDGIAEAEGFHEVLIRGVLDDLIKRGVRAIEAFGH 119
Query 148 TPAATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDANFLMDVGFVVVAPHPYFPRLRLEL 207
++ + V GDC E C+I A L+D F VVAPH YFPR RLEL
Sbjct 120 ------IREGECTSHAYSLVHRKPGDCTPETCMISAERLLDAEFKVVAPHHYFPRFRLEL 173
Query 208 DKGLGWKAEVEAALERLLENARLQ 231
D+ GWKA+VEAAL RLLE++ L
Sbjct 174 DRDHGWKADVEAALMRLLESSTLS 197
>gi|331700383|ref|YP_004336622.1| hypothetical protein Psed_6681 [Pseudonocardia dioxanivorans
CB1190]
gi|326955072|gb|AEA28769.1| hypothetical protein Psed_6681 [Pseudonocardia dioxanivorans
CB1190]
Length=214
Score = 172 bits (436), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 97/230 (43%), Positives = 130/230 (57%), Gaps = 31/230 (13%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S + ++ L+ +LPK R CVFWE+ + + EFEKEAW+S V+LEWGSC
Sbjct 1 MSWHVASITLDNLHELPKRCRTCVFWELSDHLGKQARDFGSTEFEKEAWVSGVLLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++ H + GYV+YAPPSAVPRA PT PVSADAVLLT+M + A +
Sbjct 61 GKIV--------HVKGAPAGYVMYAPPSAVPRAAEMPTGPVSADAVLLTTMQVLPEFAGE 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L L V+++L RRGV+A+E FG D RP E C+
Sbjct 113 GLGRMLAQAVVKDLTRRGVKAVEVFG----------------DARPGTEP-------SCV 149
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARL 230
I A FL VGF + PHP +PRLR+EL + WK +VEAALE++L + +
Sbjct 150 IPAEFLRGVGFKTIRPHPRWPRLRMELRAAMEWKEDVEAALEQILGSVTI 199
>gi|302870719|ref|YP_003839356.1| hypothetical protein Micau_6287 [Micromonospora aurantiaca ATCC
27029]
gi|315506956|ref|YP_004085843.1| hypothetical protein ML5_6249 [Micromonospora sp. L5]
gi|302573578|gb|ADL49780.1| hypothetical protein Micau_6287 [Micromonospora aurantiaca ATCC
27029]
gi|315413575|gb|ADU11692.1| hypothetical protein ML5_6249 [Micromonospora sp. L5]
Length=221
Score = 172 bits (436), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 91/225 (41%), Positives = 126/225 (56%), Gaps = 26/225 (11%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S R+ +L L+ E LP+ R CV+WE+DP + DP EKEAW+S +LEWGSC
Sbjct 1 MSRRLVSLTLDTLEDLPRSCRSCVYWELDPVSAERACAAGDPGLEKEAWVSQTLLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++ + + G+V+YAPP+ VPR+ FPT+PVSADA LL + + AD
Sbjct 61 GKLV--------YVDGMPAGFVMYAPPAYVPRSMAFPTSPVSADAALLMTAHVVPAFADG 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L L+ V +L +RG++A+EAFG D +P C+
Sbjct 113 GLGRMLVQGVARDLTKRGIKAIEAFGDAKFGDDADDPA------------------RACV 154
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLL 225
A+F + VGF V PHP +PRLRLEL L WK++VE ALE+LL
Sbjct 155 APADFFLSVGFKTVRPHPRYPRLRLELRTALSWKSDVEYALEKLL 199
>gi|302531347|ref|ZP_07283689.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302440242|gb|EFL12058.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=214
Score = 172 bits (435), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 98/225 (44%), Positives = 133/225 (60%), Gaps = 26/225 (11%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S R+ + L+ E LPK R+CV+WE+ P + + D E EKEAW+S V+LEWGSC
Sbjct 1 MSRRVVGVTLDNLEHLPKSCRQCVYWELAPHLKAQADEYGSTEVEKEAWVSSVLLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++ +++ +G+VLYAPP+AVPR+ FPT+P SADAVLLT+ +
Sbjct 61 GRIV--------YSDTLPVGFVLYAPPNAVPRSLAFPTSPPSADAVLLTAFQVLPEFRGG 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L L+ V ++L +RGVRA+EAFG A D ++P LG C+
Sbjct 113 GLGRMLVQAVAKDLTKRGVRAIEAFGD--ATPDDEDP-------------LGQ---HSCV 154
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLL 225
+ A FL VGF V PH +PRLRLEL + WK +VEAALERLL
Sbjct 155 LPAAFLQSVGFKTVRPHRKYPRLRLELRSAITWKEDVEAALERLL 199
>gi|152968444|ref|YP_001364228.1| hypothetical protein Krad_4505 [Kineococcus radiotolerans SRS30216]
gi|151362961|gb|ABS05964.1| conserved hypothetical protein [Kineococcus radiotolerans SRS30216]
Length=244
Score = 171 bits (432), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 103/234 (45%), Positives = 136/234 (59%), Gaps = 25/234 (10%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
V R+ L L+ LPK +R+CVFWE+D + P+FEKEAW+S V+L+WG C
Sbjct 25 VGRRMAPLTLDTVADLPKQSRQCVFWELDAVAAQRAAEAGYPDFEKEAWISSVLLQWGPC 84
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++ V D+ + G+V+YAPP VPR+ FPT+PVS DAVLLT+ IE +
Sbjct 85 GRLVY-VDDQPA-------GFVVYAPPVYVPRSTGFPTSPVSGDAVLLTTGWIEPPFRGE 136
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCV---E 177
L L+ ++L +RGV+A+EAFG PA AV D R DC E
Sbjct 137 GLARMLLQGAAKDLTQRGVKAVEAFGGGPA--------AVGGDGR------DDCAHDSDE 182
Query 178 HCIIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQ 231
C++ A+ L VGF VV PH +PRLRLEL L W+ +VEAALERLL R+Q
Sbjct 183 ACVLPAHLLESVGFTVVRPHHRYPRLRLELKTALSWREDVEAALERLLAGVRVQ 236
>gi|271970543|ref|YP_003344739.1| hypothetical protein Sros_9378 [Streptosporangium roseum DSM
43021]
gi|270513718|gb|ACZ91996.1| hypothetical protein Sros_9378 [Streptosporangium roseum DSM
43021]
Length=206
Score = 170 bits (430), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/225 (43%), Positives = 128/225 (57%), Gaps = 31/225 (13%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S R+ + L+ + LP+ RRCVFWE+DP + + DP EKEAW+S +LEWGSC
Sbjct 1 MSRRLANVTLDNLDDLPRRCRRCVFWELDPVNGNRAVEVGDPGLEKEAWISSTLLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++ + + G+VLYAPP VPR+ FPT+PVSADAVLL + I +
Sbjct 61 GKIV--------YVDGVAAGFVLYAPPHYVPRSVAFPTSPVSADAVLLMTAHIVPEFSGG 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L L+ V ++L RRGVRA+EAFG + PGA C+
Sbjct 113 GLGRMLVQGVAKDLTRRGVRAIEAFG----DLKWEQPGA-------------------CL 149
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLL 225
+ A++L+ VGF V PH FPRLRLEL + W+ +VE ALERLL
Sbjct 150 MPADYLLSVGFKTVRPHLRFPRLRLELKTAVSWREDVEVALERLL 194
>gi|300791155|ref|YP_003771446.1| hypothetical protein AMED_9356 [Amycolatopsis mediterranei U32]
gi|299800669|gb|ADJ51044.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
Length=214
Score = 169 bits (429), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 97/225 (44%), Positives = 129/225 (58%), Gaps = 26/225 (11%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S R+ + L+ E LPK RRCV+WE+ P + + + E EKEAW+S V+LEWGSC
Sbjct 1 MSRRVVGVTLDNLEHLPKSCRRCVYWELAPHLKHQAEEFGATEVEKEAWVSSVLLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++ +++ +G+VLYAPP+AVPRA FPT+P SADAVLLT+ +
Sbjct 61 GRIV--------YSDTLPVGFVLYAPPNAVPRALAFPTSPPSADAVLLTAFQVLPEFRGG 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L L+ V ++L +RGVRA+EAFG P PD C+
Sbjct 113 GLGRMLVQAVAKDLTKRGVRAIEAFGDA-------RPDEADPD-----------GGHSCV 154
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLL 225
+ A FL VGF V PH +PRLRLEL + WK +VEAALERLL
Sbjct 155 LPAAFLQSVGFKTVRPHQKWPRLRLELRSAITWKEDVEAALERLL 199
>gi|257057906|ref|YP_003135738.1| acetyltransferase (GNAT) family protein [Saccharomonospora viridis
DSM 43017]
gi|256587778|gb|ACU98911.1| acetyltransferase (GNAT) family protein [Saccharomonospora viridis
DSM 43017]
Length=207
Score = 168 bits (426), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 98/220 (45%), Positives = 130/220 (60%), Gaps = 30/220 (13%)
Query 7 ALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQVATA 66
+ L+ EQLP RRCV+WEV P + + + + E EKEAW+S V+LEWGSCG++
Sbjct 2 GVTLDNLEQLPLSCRRCVYWEVAPHLKEQAEQFGETEVEKEAWVSSVLLEWGSCGRLV-- 59
Query 67 VPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLPHSL 126
++ +G+VLYAPP+AVPRA FPT+P S DAVLLT+ + +L
Sbjct 60 ------YSGDLLVGFVLYAPPNAVPRAGAFPTSPPSPDAVLLTAFYVLPEFRGSGFGRAL 113
Query 127 IARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEH-CIIDANF 185
+ + +L +RGVRA+EAFG D +P E D EH C++ A F
Sbjct 114 VQAAVADLTKRGVRAIEAFG----------------DAQP--ETEDD---EHICVVPAAF 152
Query 186 LMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLL 225
L VGF V PHP +PRLRLEL G+ WKA+VEAALE+LL
Sbjct 153 LRSVGFKTVRPHPRWPRLRLELRSGISWKADVEAALEKLL 192
>gi|86743214|ref|YP_483614.1| hypothetical protein Francci3_4539 [Frankia sp. CcI3]
gi|86570076|gb|ABD13885.1| conserved hypothetical protein [Frankia sp. CcI3]
Length=212
Score = 167 bits (424), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 94/242 (39%), Positives = 135/242 (56%), Gaps = 32/242 (13%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S RI + L+ + LP RRCVFWE+DP + + + EKEAW+S+ +LEWGSC
Sbjct 1 MSRRIANITLDNIDDLPLPCRRCVFWELDPVARSRAEEAGGTDLEKEAWVSLALLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++A + + G+V++APP+ VPR+ FPT+PVS DAVLL + I
Sbjct 61 GKIA--------YIDNVPAGFVMFAPPAYVPRSVAFPTSPVSPDAVLLMTASIVNEFTGQ 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L L+ V ++++RRG +A+EAFG DLQN G CI
Sbjct 113 GLGRILVQSVAKDVIRRGFKAVEAFG------DLQNSGT------------------RCI 148
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTA 240
+ A++L+ VGF V PH +PRLRLE+ + W+ +VE ALERLL + + + S A
Sbjct 149 LPADYLLAVGFKTVRPHHRWPRLRLEVKNAVSWREDVEVALERLLGSMTPEGMLRKVSQA 208
Query 241 GN 242
GN
Sbjct 209 GN 210
>gi|238061900|ref|ZP_04606609.1| hypothetical protein MCAG_02866 [Micromonospora sp. ATCC 39149]
gi|237883711|gb|EEP72539.1| hypothetical protein MCAG_02866 [Micromonospora sp. ATCC 39149]
Length=221
Score = 167 bits (422), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 91/227 (41%), Positives = 128/227 (57%), Gaps = 27/227 (11%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S R+ +L L+ E LP+ R+CV+WE+DP + DP EKEAW+S +LEWGSC
Sbjct 1 MSRRLVSLTLDTLEDLPRPCRQCVYWELDPVSADRACAAGDPGLEKEAWVSQTLLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++A + + G+V+YAPP+ VPRA FPT+PVSADA LL + + A
Sbjct 61 GKLA--------YVDGMPAGFVMYAPPAYVPRAMAFPTSPVSADAALLMTAHVVAPFAGG 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L L+ V +L +RG++A+EAFG + G+ C+
Sbjct 113 GLGRMLVQGVARDLTKRGIKAIEAFGDAKFGDEGDLAGS-------------------CV 153
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLEN 227
A+F + VGF V PHP +PRLRLEL L WK++VE ALE+LL +
Sbjct 154 APADFFLSVGFKTVRPHPRYPRLRLELRTALSWKSDVEYALEKLLGS 200
>gi|330470821|ref|YP_004408564.1| hypothetical protein VAB18032_04410 [Verrucosispora maris AB-18-032]
gi|328813792|gb|AEB47964.1| hypothetical protein VAB18032_04410 [Verrucosispora maris AB-18-032]
Length=221
Score = 167 bits (422), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 90/227 (40%), Positives = 126/227 (56%), Gaps = 27/227 (11%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S R+ +L L+ E LP+ R+CV+WE+DP + DP EKEAW+S +LEWGSC
Sbjct 1 MSRRLVSLTLDTLEDLPRPCRQCVYWELDPVSADRACAAGDPGLEKEAWVSQTLLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++ + + G+V+YAPP+ VPR+ FPT+PVSADA LL + + A
Sbjct 61 GKLI--------YVDGMPAGFVMYAPPAYVPRSMAFPTSPVSADAALLMTANVVPAFAGG 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L L+ V +L +RG++A+EAFG GA C+
Sbjct 113 GLGRMLVQGVARDLTKRGIKAIEAFGDAKFGDAADPAGA-------------------CV 153
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLEN 227
A+F + VGF V PHP +PRLRLEL L WK++VE ALE+LL +
Sbjct 154 APADFFLSVGFKTVRPHPRYPRLRLELRTALSWKSDVEYALEKLLGS 200
>gi|258655500|ref|YP_003204656.1| hypothetical protein Namu_5404 [Nakamurella multipartita DSM
44233]
gi|258558725|gb|ACV81667.1| hypothetical protein Namu_5404 [Nakamurella multipartita DSM
44233]
Length=269
Score = 166 bits (420), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 106/226 (47%), Positives = 128/226 (57%), Gaps = 11/226 (4%)
Query 5 ITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQVA 64
+ L + +P RRCV WE++ + EFEKE WLS VML WGS GQ+
Sbjct 5 LVPLSMSTIGLIPGRCRRCVAWELEAPAARLAADSGEAEFEKEVWLSGVMLTWGSAGQIV 64
Query 65 TAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLPH 124
T DE EP +G+ LYAPP+AVP A FPTAPVS DAVLLT+ IE L
Sbjct 65 TV--DE----EP--VGFALYAPPTAVPGAAAFPTAPVSPDAVLLTTARIEPAYRQQGLAR 116
Query 125 SLIARVIEELVRRGVRALEAFGR--TPAATDLQNPG-AVTPDVRPVLEALGDCCVEHCII 181
L V+ L RRGVRA+E FGR PAA D + + P R A D + C++
Sbjct 117 FLFEGVVGTLTRRGVRAIELFGREDDPAAGDDRAENLSDRPADRWAEHAADDADIPGCVL 176
Query 182 DANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLEN 227
A F VGFV VAPH +PRLRLEL + +GWKAEVEAALE L +
Sbjct 177 PAGFARAVGFVEVAPHHRYPRLRLELGRDIGWKAEVEAALEELFAS 222
>gi|340532855|gb|AEK48060.1| hypothetical protein RAM_47975 [Amycolatopsis mediterranei S699]
Length=209
Score = 165 bits (417), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 95/219 (44%), Positives = 125/219 (58%), Gaps = 26/219 (11%)
Query 7 ALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQVATA 66
+ L+ E LPK RRCV+WE+ P + + + E EKEAW+S V+LEWGSCG++
Sbjct 2 GVTLDNLEHLPKSCRRCVYWELAPHLKHQAEEFGATEVEKEAWVSSVLLEWGSCGRIV-- 59
Query 67 VPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLPHSL 126
+++ +G+VLYAPP+AVPRA FPT+P SADAVLLT+ + L L
Sbjct 60 ------YSDTLPVGFVLYAPPNAVPRALAFPTSPPSADAVLLTAFQVLPEFRGGGLGRML 113
Query 127 IARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDANFL 186
+ V ++L +RGVRA+EAFG P PD C++ A FL
Sbjct 114 VQAVAKDLTKRGVRAIEAFGDA-------RPDEADPD-----------GGHSCVLPAAFL 155
Query 187 MDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLL 225
VGF V PH +PRLRLEL + WK +VEAALERLL
Sbjct 156 QSVGFKTVRPHQKWPRLRLELRSAITWKEDVEAALERLL 194
>gi|159040580|ref|YP_001539833.1| hypothetical protein Sare_5100 [Salinispora arenicola CNS-205]
gi|157919415|gb|ABW00843.1| conserved hypothetical protein [Salinispora arenicola CNS-205]
Length=221
Score = 164 bits (414), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 88/227 (39%), Positives = 124/227 (55%), Gaps = 27/227 (11%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S R+ +L L+ E LP+ R+CV+WE+DP + DP EKEAW+S +LEWG+C
Sbjct 1 MSRRLVSLTLDTLEDLPRPCRQCVYWELDPVSADRARAAGDPGLEKEAWVSQTLLEWGAC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++ + + G+VLYAPP+ VPR+ FPT+PVS DA LL + + A
Sbjct 61 GKLV--------YVDGMPAGFVLYAPPAYVPRSMAFPTSPVSPDAALLMTAKVVPAFAGG 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L L+ V +L +RG++A+EAFG D C+
Sbjct 113 GLGRMLVQGVARDLTKRGIKAIEAFGDAKFGD-------------------ADDSARACV 153
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLEN 227
A++ + VGF V PHP FPRLRLEL L WK++VE ALE+LL +
Sbjct 154 APADYFLSVGFKTVRPHPRFPRLRLELRTALSWKSDVEYALEKLLGS 200
>gi|269129156|ref|YP_003302526.1| hypothetical protein Tcur_4974 [Thermomonospora curvata DSM 43183]
gi|268314114|gb|ACZ00489.1| hypothetical protein Tcur_4974 [Thermomonospora curvata DSM 43183]
Length=205
Score = 162 bits (411), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 94/225 (42%), Positives = 125/225 (56%), Gaps = 32/225 (14%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S R+ + L+ LP+ R CVFWE+DP + DP EKEAW+S +LEWGSC
Sbjct 1 MSRRLVNVTLDNLGDLPRRCRGCVFWELDPVAAERAAESGDPALEKEAWVSSTLLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++ + + G+VLYAPP VPR+ FPT+PVSADAVLL + + +
Sbjct 61 GKIV--------YVDGTPAGFVLYAPPLYVPRSLAFPTSPVSADAVLLMTAHVLPEFSGG 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L L+ V ++LVRRGVRA+EAFG DL+ C+
Sbjct 113 GLGRMLVQGVAKDLVRRGVRAIEAFG------DLKGEEG------------------GCM 148
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLL 225
+ A++L+ VGF V PH FPRLRLEL L W+ +VE ALERLL
Sbjct 149 VPADYLLAVGFKTVRPHHRFPRLRLELKSALSWREDVEVALERLL 193
>gi|312200970|ref|YP_004021031.1| GCN5-related N-acetyltransferase [Frankia sp. EuI1c]
gi|311232306|gb|ADP85161.1| GCN5-related N-acetyltransferase [Frankia sp. EuI1c]
Length=218
Score = 162 bits (410), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 91/242 (38%), Positives = 135/242 (56%), Gaps = 36/242 (14%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S R+ + L+ + LP+ RRCVFWE+DP + + + EKEAW+S +LEWGSC
Sbjct 1 MSRRVANITLDNIDDLPQRCRRCVFWELDPVARSRAEEAGGTDIEKEAWVSSALLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G++ + + G+ +YAPP+ VPR+ FPT+PVSADAVLL + I A
Sbjct 61 GKIV--------YVDNVPAGFAMYAPPAYVPRSIAFPTSPVSADAVLLMTAKIVDEFAGQ 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDL-QNPGAVTPDVRPVLEALGDCCVEHC 179
L L+ ++++++RRG RA+EAFG DL Q+ G C
Sbjct 113 GLGRVLVQAMVKDVIRRGYRAIEAFG------DLRQDEGT------------------KC 148
Query 180 IIDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENAR---LQEPIAA 236
++ A++L+ VGF V PH +PRLRLE+ + W+ +VE ALERLL + + PI
Sbjct 149 VVPADYLLSVGFKTVRPHRRWPRLRLEVKNAVTWREDVEVALERLLGSMNPEGILRPIGG 208
Query 237 GS 238
G+
Sbjct 209 GT 210
>gi|254384783|ref|ZP_05000120.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194343665|gb|EDX24631.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length=205
Score = 162 bits (409), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 96/231 (42%), Positives = 128/231 (56%), Gaps = 33/231 (14%)
Query 4 RITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSCGQV 63
R+ L L+ + LP+ R CVFWE+DP PE EKEAW+S V+LEWGSCG+V
Sbjct 4 RLVPLTLDNLQDLPRRCRSCVFWELDPVSGEAAVKAGTPELEKEAWISAVLLEWGSCGRV 63
Query 64 ATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADDDLP 123
+ + +G+V+YAPP+ VPR+ FPT+PVS DAV L + I G L
Sbjct 64 V--------YVDEVPVGFVMYAPPAYVPRSTAFPTSPVSPDAVQLITAWIMPGYQGQGLG 115
Query 124 HSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDA 183
++ V ++L+RRG RA+EAFG D + G C++ A
Sbjct 116 RVMVQTVAKDLLRRGFRAIEAFG------DARWDGPA------------------CLLPA 151
Query 184 NFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLLENARLQEPI 234
+ L+ VGF V PHP PRLRLEL L WK +VE AL+RLL AR +EP+
Sbjct 152 DHLLSVGFKTVRPHPVHPRLRLELRSTLSWKEDVELALDRLLGAAR-KEPV 201
>gi|158319048|ref|YP_001511556.1| hypothetical protein Franean1_7331 [Frankia sp. EAN1pec]
gi|158114453|gb|ABW16650.1| conserved hypothetical protein [Frankia sp. EAN1pec]
Length=212
Score = 162 bits (409), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 91/225 (41%), Positives = 126/225 (56%), Gaps = 32/225 (14%)
Query 1 VSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWLSMVMLEWGSC 60
+S RI + L+ + LP RRCVFWE+DP + + + EKEAW+S +LEWGSC
Sbjct 1 MSRRIANITLDNIDDLPLPCRRCVFWELDPVARSRAEEAGGTDIEKEAWVSSALLEWGSC 60
Query 61 GQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAPVSADAVLLTSMGIERGQADD 120
G+V + + G+V++APP+ VPR+ FPT+PVS DAVLL + I +
Sbjct 61 GKVV--------YIDNVPAGFVMFAPPAYVPRSVAFPTSPVSPDAVLLMTAQIVQEFTGQ 112
Query 121 DLPHSLIARVIEELVRRGVRALEAFGRTPAATDLQNPGAVTPDVRPVLEALGDCCVEHCI 180
L LI V +E+ RRG RALEAFG DL++ G C+
Sbjct 113 GLGRVLIQSVAKEITRRGYRALEAFG------DLRDSGT------------------RCV 148
Query 181 IDANFLMDVGFVVVAPHPYFPRLRLELDKGLGWKAEVEAALERLL 225
+ A++L+ VGF V PH +PRLRLE+ + W+ +VE ALERLL
Sbjct 149 VPADYLLAVGFKTVRPHHRWPRLRLEVKNAVSWREDVEVALERLL 193
Lambda K H
0.320 0.136 0.420
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 340053351120
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40