BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2004c
Length=498
Score E
Sequences producing significant alignments: (Bits) Value
gi|15841486|ref|NP_336523.1| hypothetical protein MT2060 [Mycoba... 1004 0.0
gi|15609141|ref|NP_216520.1| hypothetical protein Rv2004c [Mycob... 1002 0.0
gi|340627014|ref|YP_004745466.1| hypothetical protein MCAN_20241... 993 0.0
gi|240171510|ref|ZP_04750169.1| hypothetical protein MkanA1_1950... 644 0.0
gi|296169102|ref|ZP_06850761.1| conserved hypothetical protein [... 605 7e-171
gi|183983459|ref|YP_001851750.1| hypothetical protein MMAR_3476 ... 601 1e-169
gi|169630981|ref|YP_001704630.1| hypothetical protein MAB_3902c ... 507 2e-141
gi|333990933|ref|YP_004523547.1| hypothetical protein JDM601_229... 472 7e-131
gi|118473212|ref|YP_888231.1| hypothetical protein MSMEG_3942 [M... 470 3e-130
gi|120402395|ref|YP_952224.1| hypothetical protein Mvan_1384 [My... 468 2e-129
gi|145220867|ref|YP_001131545.1| hypothetical protein Mflv_0263 ... 467 2e-129
gi|108798047|ref|YP_638244.1| hypothetical protein Mmcs_1074 [My... 448 1e-123
gi|296394769|ref|YP_003659653.1| hypothetical protein Srot_2377 ... 428 1e-117
gi|317507619|ref|ZP_07965332.1| hypothetical protein HMPREF9336_... 416 7e-114
gi|325676765|ref|ZP_08156438.1| hypothetical protein HMPREF0724_... 393 4e-107
gi|312139781|ref|YP_004007117.1| hypothetical protein REQ_23910 ... 388 1e-105
gi|226306904|ref|YP_002766864.1| hypothetical protein RER_34170 ... 380 4e-103
gi|111017077|ref|YP_700049.1| hypothetical protein RHA1_ro00055 ... 366 6e-99
gi|271966203|ref|YP_003340399.1| gluconate kinase [Streptosporan... 364 2e-98
gi|329938537|ref|ZP_08287962.1| hypothetical protein SGM_3454 [S... 360 3e-97
gi|254381619|ref|ZP_04996983.1| conserved hypothetical protein [... 350 3e-94
gi|290955585|ref|YP_003486767.1| hypothetical protein SCAB_10221... 350 3e-94
gi|331698605|ref|YP_004334844.1| gluconate kinase [Pseudonocardi... 341 2e-91
gi|134100921|ref|YP_001106582.1| hypothetical protein SACE_4388 ... 340 3e-91
gi|21218720|ref|NP_624499.1| hypothetical protein SCO0163 [Strep... 338 9e-91
gi|289774177|ref|ZP_06533555.1| conserved hypothetical protein [... 337 3e-90
gi|134099015|ref|YP_001104676.1| hypothetical protein SACE_2453 ... 328 1e-87
gi|291006888|ref|ZP_06564861.1| hypothetical protein SeryN2_2040... 328 1e-87
gi|269127024|ref|YP_003300394.1| hypothetical protein Tcur_2811 ... 323 4e-86
gi|302557027|ref|ZP_07309369.1| conserved hypothetical protein [... 310 3e-82
gi|336179936|ref|YP_004585311.1| hypothetical protein FsymDg_411... 300 3e-79
gi|111223194|ref|YP_713988.1| hypothetical protein FRAAL3784 [Fr... 286 5e-75
gi|312198390|ref|YP_004018451.1| hypothetical protein FraEuI1c_4... 286 7e-75
gi|158318664|ref|YP_001511172.1| hypothetical protein Franean1_6... 283 4e-74
gi|319948330|ref|ZP_08022476.1| gluconate kinase [Dietzia cinnam... 277 3e-72
gi|86740954|ref|YP_481354.1| hypothetical protein Francci3_2257 ... 270 6e-70
gi|288922596|ref|ZP_06416775.1| conserved hypothetical protein [... 257 3e-66
gi|288919770|ref|ZP_06414096.1| conserved hypothetical protein [... 256 8e-66
gi|158318668|ref|YP_001511176.1| hypothetical protein Franean1_6... 249 1e-63
gi|288919156|ref|ZP_06413494.1| conserved hypothetical protein [... 241 2e-61
gi|288923414|ref|ZP_06417540.1| conserved hypothetical protein [... 231 2e-58
gi|158315848|ref|YP_001508356.1| hypothetical protein Franean1_4... 230 4e-58
gi|342857399|ref|ZP_08714055.1| hypothetical protein MCOL_00935 ... 229 8e-58
gi|254774818|ref|ZP_05216334.1| hypothetical protein MaviaA2_091... 228 2e-57
gi|218781915|ref|YP_002433233.1| gluconate kinase [Desulfatibaci... 226 6e-57
gi|269836440|ref|YP_003318668.1| Uma3 [Sphaerobacter thermophilu... 221 2e-55
gi|158313805|ref|YP_001506313.1| hypothetical protein Franean1_1... 217 3e-54
gi|254823381|ref|ZP_05228382.1| hypothetical protein MintA_25859... 212 1e-52
gi|186681115|ref|YP_001864311.1| hypothetical protein Npun_F0614... 210 4e-52
gi|86741938|ref|YP_482338.1| hypothetical protein Francci3_3252 ... 208 2e-51
>gi|15841486|ref|NP_336523.1| hypothetical protein MT2060 [Mycobacterium tuberculosis CDC1551]
gi|308369585|ref|ZP_07418361.2| hypothetical protein TMBG_00544 [Mycobacterium tuberculosis SUMu002]
gi|13881727|gb|AAK46337.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
gi|308327062|gb|EFP15913.1| hypothetical protein TMBG_00544 [Mycobacterium tuberculosis SUMu002]
Length=502
Score = 1004 bits (2595), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 498/498 (100%), Positives = 498/498 (100%), Gaps = 0/498 (0%)
Query 1 MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER 60
MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER
Sbjct 5 MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER 64
Query 61 ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL 120
ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL
Sbjct 65 ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL 124
Query 121 DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV 180
DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV
Sbjct 125 DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV 184
Query 181 DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA 240
DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA
Sbjct 185 DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA 244
Query 241 AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP 300
AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP
Sbjct 245 AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP 304
Query 301 EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL 360
EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL
Sbjct 305 EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL 364
Query 361 RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR 420
RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR
Sbjct 365 RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR 424
Query 421 RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA 480
RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA
Sbjct 425 RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA 484
Query 481 GPRERSVGQAYHIWRSAI 498
GPRERSVGQAYHIWRSAI
Sbjct 485 GPRERSVGQAYHIWRSAI 502
>gi|15609141|ref|NP_216520.1| hypothetical protein Rv2004c [Mycobacterium tuberculosis H37Rv]
gi|31793184|ref|NP_855677.1| hypothetical protein Mb2027c [Mycobacterium bovis AF2122/97]
gi|121637888|ref|YP_978111.1| hypothetical protein BCG_2021c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
77 more sequence titles
Length=498
Score = 1002 bits (2591), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 498/498 (100%), Positives = 498/498 (100%), Gaps = 0/498 (0%)
Query 1 MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER 60
MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER
Sbjct 1 MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER 60
Query 61 ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL 120
ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL
Sbjct 61 ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL 120
Query 121 DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV 180
DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV
Sbjct 121 DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV 180
Query 181 DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA 240
DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA
Sbjct 181 DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA 240
Query 241 AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP 300
AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP
Sbjct 241 AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP 300
Query 301 EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL 360
EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL
Sbjct 301 EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL 360
Query 361 RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR 420
RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR
Sbjct 361 RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR 420
Query 421 RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA 480
RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA
Sbjct 421 RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA 480
Query 481 GPRERSVGQAYHIWRSAI 498
GPRERSVGQAYHIWRSAI
Sbjct 481 GPRERSVGQAYHIWRSAI 498
>gi|340627014|ref|YP_004745466.1| hypothetical protein MCAN_20241 [Mycobacterium canettii CIPT
140010059]
gi|340005204|emb|CCC44356.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=498
Score = 993 bits (2568), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 495/498 (99%), Positives = 495/498 (99%), Gaps = 0/498 (0%)
Query 1 MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER 60
MDSPTNDGTCD HPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER
Sbjct 1 MDSPTNDGTCDDHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER 60
Query 61 ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL 120
ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL
Sbjct 61 ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL 120
Query 121 DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV 180
DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHA KVVSGDVIRRIEHMV
Sbjct 121 DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHAGKVVSGDVIRRIEHMV 180
Query 181 DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA 240
DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA
Sbjct 181 DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA 240
Query 241 AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP 300
AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP
Sbjct 241 AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP 300
Query 301 EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL 360
EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL
Sbjct 301 EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL 360
Query 361 RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR 420
RDCGVI GEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR
Sbjct 361 RDCGVIIGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR 420
Query 421 RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA 480
RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA
Sbjct 421 RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA 480
Query 481 GPRERSVGQAYHIWRSAI 498
GPRERSVGQAYHIWRSAI
Sbjct 481 GPRERSVGQAYHIWRSAI 498
>gi|240171510|ref|ZP_04750169.1| hypothetical protein MkanA1_19506 [Mycobacterium kansasii ATCC
12478]
Length=506
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/490 (68%), Positives = 376/490 (77%), Gaps = 3/490 (0%)
Query 8 GTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFE 67
GT DA P++DV ETHT VVVLAGDRA+KAKKPV+TDF DFRT EQRERAC+RE E
Sbjct 7 GTADAATTAGVPYVDVHETHTGVVVLAGDRAYKAKKPVLTDFLDFRTPEQRERACLREVE 66
Query 68 LNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAG---LPVEGALDAIA 124
LNSRL+ SYLGIAHLSDP+GG AEPVVVMRRYRD RLA + G V LD IA
Sbjct 67 LNSRLSPDSYLGIAHLSDPAGGPAEPVVVMRRYRDSDRLAWLAEHGGSETSVRELLDTIA 126
Query 125 EVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFV 184
VLARFH+ A+R+ ID QGE GA+ RRW ENL ELR +A V S + I +IE +V EFV
Sbjct 127 AVLARFHEHAERSPLIDAQGEAGAINRRWTENLTELRRYAGTVFSDESIGQIEQLVAEFV 186
Query 185 SGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLA 244
GR+VLF RI EGCIVDGH DLLADDIF V PALLDCLEF+D+LRY+DRIDDAAFLA
Sbjct 187 CGRDVLFNRRIAEGCIVDGHGDLLADDIFCVADGPALLDCLEFDDQLRYVDRIDDAAFLA 246
Query 245 MDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAA 304
MDLEFLGR DLG+YFL Y SGDTAP L DFYIAYRAVVRAKV+CVR SQGK +AA
Sbjct 247 MDLEFLGRNDLGEYFLERYLAHSGDTAPKPLHDFYIAYRAVVRAKVDCVRLSQGKSASAA 306
Query 305 DAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCG 364
DA RHL IAT+HL+ VRLALVGGNPGTGKST+AR +AE VGAQVISTDDVRR+LR+ G
Sbjct 307 DAARHLAIATRHLRQGAVRLALVGGNPGTGKSTVARALAERVGAQVISTDDVRRQLREWG 366
Query 365 VITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAA 424
I GE GVLD+GLYS NV AVY+ ALR+ARL L +G VILDGTW DPQ+RA A RLAA
Sbjct 367 AIAGESGVLDAGLYSPRNVTAVYEVALRRARLSLANGRPVILDGTWRDPQLRAQAHRLAA 426
Query 425 DTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRE 484
+ HS +VE C+A VD ADR+ R GNSDAT +IAA LAA+ WDT H +DT+ P E
Sbjct 427 EAHSPLVELLCTAPVDTAADRVRTRQPGNSDATPQIAATLAAQHNGWDTAHPVDTSRPLE 486
Query 485 RSVGQAYHIW 494
SV +A+ +W
Sbjct 487 FSVREAHDVW 496
>gi|296169102|ref|ZP_06850761.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295896222|gb|EFG75884.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=486
Score = 605 bits (1559), Expect = 7e-171, Method: Compositional matrix adjust.
Identities = 318/481 (67%), Positives = 370/481 (77%), Gaps = 5/481 (1%)
Query 19 PFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYL 78
P++D+ ETHT VV+LAGDRA+KAKKPV+TDF DFRT +QRE AC RE ELNSRL+ +SYL
Sbjct 2 PYLDLHETHTGVVILAGDRAYKAKKPVLTDFLDFRTPQQREHACRREVELNSRLSPESYL 61
Query 79 GIAHLSDPSGGHAEPVVVMRRYRDKQRLASMV--TAGLPVEGALDAIAEVLARFHQRAQR 136
GIA LSDP+GG EPV+VMRRYRD+ RLA+M + V GALDAIA VLARFH+ A R
Sbjct 62 GIAQLSDPAGGPPEPVIVMRRYRDEDRLAAMAFRDSDGHVRGALDAIAAVLARFHRDAGR 121
Query 137 NRCIDTQGEVGAVARRWHENLAELRHHADKVVSG---DVIRRIEHMVDEFVSGREVLFAG 193
+ I QGE AV RRWH+NL+ELR +AD G + + RIE +VDEF++GR LF
Sbjct 122 SAAISAQGEARAVGRRWHDNLSELRRYADAATPGVAAEAVSRIERLVDEFLAGRAPLFGA 181
Query 194 RIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRK 253
R+ +GCIVDGH DLLADDIF VDG+PALLDCLEF+D+LRY+D IDDAAFLAMDLEFLGRK
Sbjct 182 RVAQGCIVDGHGDLLADDIFWVDGKPALLDCLEFDDKLRYVDCIDDAAFLAMDLEFLGRK 241
Query 254 DLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIA 313
DL D+FL YA + DTAP SLR FYIAYRAVVRAKV+CVR SQG+ AA DA RHL +A
Sbjct 242 DLADHFLERYAQHAKDTAPPSLRAFYIAYRAVVRAKVDCVRLSQGRHAAAEDAARHLAMA 301
Query 314 TQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVL 373
T HL+ VRLALVGGNPGTGKST+ARG+AE VGA+VISTDDVRR LRD G I GEPGVL
Sbjct 302 TGHLEAGAVRLALVGGNPGTGKSTVARGLAERVGARVISTDDVRRELRDAGAIAGEPGVL 361
Query 374 DSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEF 433
++GLY V AVY+ AL +AR LL GHSVILDGTW DP R A+RLAA+THSA+VEF
Sbjct 362 NAGLYRPDQVAAVYETALSRARQLLSEGHSVILDGTWRDPGTREAAQRLAAETHSALVEF 421
Query 434 RCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHI 493
CSA V ADRI R GNS+ T EIAAALAA A W HRIDT+ + G+A+ +
Sbjct 422 VCSAAAGVAADRIKTRRSGNSEVTPEIAAALAAGHAAWVGAHRIDTSRSPDLVAGEAHDL 481
Query 494 W 494
W
Sbjct 482 W 482
>gi|183983459|ref|YP_001851750.1| hypothetical protein MMAR_3476 [Mycobacterium marinum M]
gi|183176785|gb|ACC41895.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=482
Score = 601 bits (1549), Expect = 1e-169, Method: Compositional matrix adjust.
Identities = 312/479 (66%), Positives = 360/479 (76%), Gaps = 3/479 (0%)
Query 23 VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH 82
+ ETHT VVVL G+RA+KAKKPV+TDF DFRTAEQRERAC RE ELNSRLA SYLG+AH
Sbjct 1 MHETHTGVVVLVGERAYKAKKPVLTDFLDFRTAEQRERACAREVELNSRLAPTSYLGVAH 60
Query 83 LSDPSGGHAEPVVVMRRYRDKQRLASMVTAG---LPVEGALDAIAEVLARFHQRAQRNRC 139
+DP+GG AEP+VVMRRY D RLA +++G V LD IA VLARFH+ A+R+
Sbjct 61 CTDPTGGPAEPLVVMRRYHDSDRLAYQISSGGSDESVRALLDTIATVLARFHEGAERSPT 120
Query 140 IDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGC 199
I+T GE A+ RR+ +NLAEL +A + I RIE +V F+SGRE L A RI +GC
Sbjct 121 INTAGEPAAIGRRFGDNLAELHRYAGTSFPDESIGRIEDLVAAFISGRETLLAQRIAQGC 180
Query 200 IVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYF 259
IVDGH DLLADDIF +G PALLDCLEF+D LRY+DR+DDAAFLAMDLEFLGRKDLG+YF
Sbjct 181 IVDGHGDLLADDIFCAEGGPALLDCLEFDDRLRYVDRVDDAAFLAMDLEFLGRKDLGEYF 240
Query 260 LAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQH 319
L Y SGD APASLRDFYIAYRAVVRAK +CVR SQGKPEAAADA RHL +AT+HL+
Sbjct 241 LDRYLAHSGDVAPASLRDFYIAYRAVVRAKTDCVRLSQGKPEAAADAARHLELATRHLET 300
Query 320 ATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS 379
VRLALVGGNPGTGKSTLAR +AE VGAQVISTDDVR+ LRD G I GE GVLD GLY+
Sbjct 301 GAVRLALVGGNPGTGKSTLARALAEQVGAQVISTDDVRKELRDRGDIHGESGVLDEGLYT 360
Query 380 RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV 439
R NV VY L +AR L G SVILDGTW DPQ RA A L TH+A+VE C+ V
Sbjct 361 RDNVTVVYDLVLSRARRCLQEGRSVILDGTWRDPQSRARAHHLGGQTHAALVELLCTLPV 420
Query 440 DVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSAI 498
D+ ADRI RA GNS+ TAEIAA +AA+ A WDT +DT+ P E S+ +A+ W AI
Sbjct 421 DMAADRISTRAPGNSEVTAEIAATMAAQHAGWDTALPMDTSRPIEFSLNEAHDAWCRAI 479
>gi|169630981|ref|YP_001704630.1| hypothetical protein MAB_3902c [Mycobacterium abscessus ATCC
19977]
gi|169242948|emb|CAM63976.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=519
Score = 507 bits (1305), Expect = 2e-141, Method: Compositional matrix adjust.
Identities = 264/481 (55%), Positives = 332/481 (70%), Gaps = 4/481 (0%)
Query 22 DVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIA 81
++RETHT +V+L G A+K KKPV+T+F DF T E RERAC RE LNSR++ SYLG++
Sbjct 30 EIRETHTGIVILVGGMAYKIKKPVITNFLDFSTPELRERACAREVALNSRISQDSYLGVS 89
Query 82 HLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCID 141
HL+DP GG EPVVVMRRY D RL++M + + LDAIAEVLA FH+RA R+ ID
Sbjct 90 HLTDPDGGSGEPVVVMRRYPDAARLSAMAKSKRVTKAHLDAIAEVLAGFHKRADRSPSID 149
Query 142 TQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIV 201
G + A+ RW + L L +A V++ D +R + + +F+SGR VLFA R+ +G I+
Sbjct 150 EAGSLDAIVDRWDDTLTALEKYAGTVLAADDVRLVRTLATQFISGRAVLFAQRVADGRII 209
Query 202 DGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLA 261
DGH DLLA DIF + P LLDCLEF+D LR++D +DDAAFLAMDLEFLGR DLGDYF+
Sbjct 210 DGHGDLLASDIFCLPNGPVLLDCLEFDDRLRHVDGLDDAAFLAMDLEFLGRPDLGDYFMN 269
Query 262 GYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHAT 321
Y S DTAP LR FYIAYRA+VRAKV+C+RF+QG+ AA A +H+ +A HL AT
Sbjct 270 RYVELSADTAPEPLRHFYIAYRALVRAKVDCIRFTQGQRSAAGRAAKHVAMALSHLGAAT 329
Query 322 VRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRA 381
VRL LVGG PG GKSTLAR ++E +GAQVISTD+VR++L GVI+G GVLD+GLYS
Sbjct 330 VRLVLVGGGPGAGKSTLARRISEDIGAQVISTDEVRQQLHRLGVISGGKGVLDAGLYSTE 389
Query 382 NVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDV 441
NV AVY LR+ARL L GH+VILDGTW P+ R A +LA + + +VEF C A +
Sbjct 390 NVGAVYDAVLRRARLALAGGHTVILDGTWRSPRHRLRAHQLAYEAGAPMVEFLCLAPLVT 449
Query 442 MADRIVARAGGNSDATAEIAAALAARQA----DWDTGHRIDTAGPRERSVGQAYHIWRSA 497
R+ AR G SDAT +IAAAL A A W H IDT P +RS +A + R A
Sbjct 450 AQHRVAARHDGVSDATGDIAAALGAEFAGPDRGWGEAHVIDTRLPLDRSTAEAEELCRQA 509
Query 498 I 498
+
Sbjct 510 L 510
>gi|333990933|ref|YP_004523547.1| hypothetical protein JDM601_2293 [Mycobacterium sp. JDM601]
gi|333486901|gb|AEF36293.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=429
Score = 472 bits (1214), Expect = 7e-131, Method: Compositional matrix adjust.
Identities = 248/419 (60%), Positives = 298/419 (72%), Gaps = 0/419 (0%)
Query 80 IAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRC 139
+AHL P+G AEPV+VMRRY D++RLASMV PVE LD IA +LA FH +R+
Sbjct 1 MAHLQGPAGAPAEPVIVMRRYHDEERLASMVKRAEPVERVLDRIAGLLADFHDHGERSPT 60
Query 140 IDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGC 199
I QG+ AV +RW +N L HA V + +RR++ + E++SGR LF R+++GC
Sbjct 61 ISRQGDPEAVRQRWDDNFPTLHQHAGTAVPSETVRRVQGLGAEYLSGRAGLFTRRVEQGC 120
Query 200 IVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYF 259
IVDGHADLLADDIF VD P LLDCLEF DELRY+DRIDDAAFLAMDLEFLGRKDLGD+F
Sbjct 121 IVDGHADLLADDIFWVDDRPVLLDCLEFSDELRYVDRIDDAAFLAMDLEFLGRKDLGDHF 180
Query 260 LAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQH 319
L YA SGDTA SLRDFYIAYRAVVRAKV+CVR +QGK +AA A HL IA +HL+
Sbjct 181 LERYAACSGDTAARSLRDFYIAYRAVVRAKVDCVRLTQGKRGSAAAAADHLDIALRHLED 240
Query 320 ATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS 379
VRL LVGG PGTGKSTLA +AE VGA V+STDDVRR LR G ++GE G L +GLY+
Sbjct 241 GAVRLVLVGGGPGTGKSTLAGALAERVGAVVVSTDDVRRELRSSGQLSGETGNLGAGLYA 300
Query 380 RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV 439
ANV AVY L++A LG G SV+LDGTW D + RA ARRLA D H+A E RC +
Sbjct 301 PANVAAVYHAVLQRAGRHLGDGVSVVLDGTWRDAETRAEARRLADDKHAAFGEIRCVVPI 360
Query 440 DVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSAI 498
+V A+R+ RA GNSDAT +IA L A WDT H +DT+ P + V +A+ WR+AI
Sbjct 361 EVAAERVRTRAAGNSDATPQIAGVLGADDFRWDTAHHVDTSKPLDECVREAHEQWRAAI 419
>gi|118473212|ref|YP_888231.1| hypothetical protein MSMEG_3942 [Mycobacterium smegmatis str.
MC2 155]
gi|118174499|gb|ABK75395.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=498
Score = 470 bits (1209), Expect = 3e-130, Method: Compositional matrix adjust.
Identities = 269/476 (57%), Positives = 326/476 (69%), Gaps = 2/476 (0%)
Query 23 VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH 82
+RETHT +V L GD A+KAKKPV TDF DF TA+QRE AC+RE ELNSRLA SYLG+AH
Sbjct 25 IRETHTGLVALIGDLAYKAKKPVRTDFLDFTTAQQREAACLREVELNSRLAPNSYLGVAH 84
Query 83 LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT 142
L P EPVVVMRRYRD RL+++VT G V LD IAE+LARFH+ A R ID
Sbjct 85 LVGPGDRPDEPVVVMRRYRDADRLSTLVTRGAEVNDQLDVIAEILARFHRDAGRGAVIDD 144
Query 143 QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD 202
Q +V RW ENL EL + +V + + + + +++SGR LFA RI +G IVD
Sbjct 145 QARATSVWARWDENLTELARMS--LVPPEQLSEVRRLASQYLSGRAELFAERIADGRIVD 202
Query 203 GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG 262
GHADLLADDIF PA+LDCLEF+D LRY+D +DDAAFLAMDLEFLG +L +F+
Sbjct 203 GHADLLADDIFCTPEGPAILDCLEFDDTLRYVDGVDDAAFLAMDLEFLGSPELSAFFVDR 262
Query 263 YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV 322
Y + DTAP SL DFY+AYRAVVRAKVEC+R QG+PEAA DA RH+ IA L+ ATV
Sbjct 263 YRHHAHDTAPQSLMDFYVAYRAVVRAKVECIRVGQGRPEAATDACRHIDIALDRLRAATV 322
Query 323 RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN 382
+L +VGG PGTGK+T++R +AE +GA VISTDDVRR L++ GVI G G LD+GLY+ N
Sbjct 323 QLVIVGGGPGTGKTTVSRALAEELGAVVISTDDVRRYLQESGVIGGAAGELDTGLYAPKN 382
Query 383 VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM 442
V AVY E L +AR L G SVILDGTW D R A LA++T +VEF CS V
Sbjct 383 VAAVYDEVLARARHALTHGRSVILDGTWRDVGRRQRAHLLASETAVPVVEFTCSLPVVAA 442
Query 443 ADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSAI 498
+RI +R+G SDAT EIA ALA + A GH IDT+ P SV +A + AI
Sbjct 443 GERIASRSGTTSDATPEIADALAEQGAGIVHGHSIDTSRPLRESVTEAQRVCCLAI 498
>gi|120402395|ref|YP_952224.1| hypothetical protein Mvan_1384 [Mycobacterium vanbaalenii PYR-1]
gi|119955213|gb|ABM12218.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=490
Score = 468 bits (1203), Expect = 2e-129, Method: Compositional matrix adjust.
Identities = 262/472 (56%), Positives = 319/472 (68%), Gaps = 1/472 (0%)
Query 22 DVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIA 81
+V ETHT +V L GDRAFK KKPVVTDF DF TA++RE AC RE ELN RLA+ SYLG+
Sbjct 15 EVHETHTGLVALVGDRAFKIKKPVVTDFLDFSTAQKRETACRREIELNRRLASSSYLGVG 74
Query 82 HLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCID 141
H PSG AEPV+VMRRY D +RLA +V +G+PVE L AIA++LARFH RA+R ID
Sbjct 75 HFQPPSGD-AEPVIVMRRYPDTERLAELVRSGVPVETWLTAIADLLARFHSRAERGDAID 133
Query 142 TQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIV 201
+ ++ RW +NL ELR HA VV I +E + +++GRE L+ RI +V
Sbjct 134 REATARVLSDRWQQNLTELRRHAGTVVDDGQIAEVERLASAYLAGREPLYQARIAAHRVV 193
Query 202 DGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLA 261
DGH DLL+ DIF P LLDCLEF+D LRY+D IDDA FLAMDLEFLGR+DL D+FL
Sbjct 194 DGHGDLLSQDIFCTAEGPMLLDCLEFDDRLRYVDGIDDAGFLAMDLEFLGRRDLADFFLD 253
Query 262 GYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHAT 321
Y R+ D A SLR F IAYRAVVRAKV+CVR QG EA DA RHL IA HL+
Sbjct 254 EYCRRADDPAAHSLRHFCIAYRAVVRAKVDCVRVDQGHAEAIPDAQRHLAIALAHLRSGR 313
Query 322 VRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRA 381
V+L +VGG PGTGK+TLAR +A+ V AQ+ISTD+VRR L GV+ G G L++GLY+
Sbjct 314 VQLVVVGGGPGTGKTTLARALAQCVDAQLISTDEVRRELVGSGVVHGRAGELNTGLYTPE 373
Query 382 NVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDV 441
N+ AVY E L +AR LG+GHSVI+DGTW D R A +AA T+S+IVE RC+ V
Sbjct 374 NLSAVYDEVLSRARAWLGAGHSVIVDGTWRDAGHRQRAHAVAAQTYSSIVELRCTLPVAE 433
Query 442 MADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHI 493
RI R SDAT +AA L+ ++DW H IDTAGP SV A +
Sbjct 434 AERRIAGRGATASDATPAMAAELSRWESDWPGAHPIDTAGPLADSVAAARQV 485
>gi|145220867|ref|YP_001131545.1| hypothetical protein Mflv_0263 [Mycobacterium gilvum PYR-GCK]
gi|315442178|ref|YP_004075057.1| hypothetical protein Mspyr1_05130 [Mycobacterium sp. Spyr1]
gi|145213353|gb|ABP42757.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315260481|gb|ADT97222.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1]
Length=490
Score = 467 bits (1202), Expect = 2e-129, Method: Compositional matrix adjust.
Identities = 263/480 (55%), Positives = 313/480 (66%), Gaps = 10/480 (2%)
Query 23 VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH 82
+ ETHT +VVL G+RA+KAKK V TDF DF T EQR RA E LN RLA QSYLG+
Sbjct 11 IHETHTGLVVLLGERAYKAKKAVKTDFLDFSTVEQRARALHHEVTLNRRLAPQSYLGVGE 70
Query 83 LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT 142
+ P G EPV+VMRRY D RL S+V G P+ L IA LA FH+ A+R ID
Sbjct 71 FAMP-GAQPEPVIVMRRYPDSARLTSLVAQGKPLTAELREIAGRLADFHRDARRGPDIDA 129
Query 143 QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD 202
QG AV RW +NL EL HAD V + I + F++GR VL A RI +G IVD
Sbjct 130 QGRPEAVWERWAQNLTELGRHADVVFDRADLDEIHALAQRFLAGRSVLMAQRIAQGRIVD 189
Query 203 GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG 262
GHADLL DDIF + PA+LDCLEF+D LRY+D +DDAAFLAMDLEF GR +LGD FL
Sbjct 190 GHADLLTDDIFCMPDGPAMLDCLEFDDLLRYVDGVDDAAFLAMDLEFHGRGNLGDEFLRE 249
Query 263 YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV 322
Y R+ D AP SL++FYIAYRAVVRAKV+CVR QG PEAA DA RHL +A L+ V
Sbjct 250 YVARAADPAPRSLQNFYIAYRAVVRAKVDCVRVEQGHPEAADDARRHLHLAADRLRDGAV 309
Query 323 RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN 382
RL +VGG PG+GK+T++R +AE++GAQVISTDDVRR LRD GVI+G G LDSGLY+ +
Sbjct 310 RLVIVGGGPGSGKTTVSRALAEVLGAQVISTDDVRRELRDAGVISGAVGALDSGLYAPES 369
Query 383 VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM 442
V VY E LR+A + G SVILDGTW D + ARRLA T + ++EF C V+
Sbjct 370 VARVYDEVLRRAEAAVTGGCSVILDGTWRDEEETGRARRLADSTATPLIEFTCVLPVEEA 429
Query 443 ADRIVARAGGNSDATAEIAAALAARQADWDTG----HRIDTAGPRERSVGQAYHIWRSAI 498
RI AR SDAT +IA ALA R TG H +DT P SV +A I R I
Sbjct 430 GARIRARTQTTSDATPQIAEALAGR-----TGVAGRHPLDTGRPLAESVAEAQRICRKVI 484
>gi|108798047|ref|YP_638244.1| hypothetical protein Mmcs_1074 [Mycobacterium sp. MCS]
gi|119867142|ref|YP_937094.1| hypothetical protein Mkms_1090 [Mycobacterium sp. KMS]
gi|126433707|ref|YP_001069398.1| hypothetical protein Mjls_1101 [Mycobacterium sp. JLS]
gi|108768466|gb|ABG07188.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119693231|gb|ABL90304.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126233507|gb|ABN96907.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=523
Score = 448 bits (1152), Expect = 1e-123, Method: Compositional matrix adjust.
Identities = 260/492 (53%), Positives = 309/492 (63%), Gaps = 13/492 (2%)
Query 2 DSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERA 61
DSP ND + V ETHT VV+L G++A+K KKPV TDF DF EQRER
Sbjct 40 DSPVNDMAAE-----------VYETHTGVVLLLGEKAYKIKKPVTTDFLDFSAPEQRERV 88
Query 62 CIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALD 121
C RE ELNSRLA SYLG+AH+ P EPVVVMRRY D+ RL SMV G E L
Sbjct 89 CAREVELNSRLAPGSYLGVAHMHGPGHDVPEPVVVMRRYPDRYRLRSMVIRGESTENHLT 148
Query 122 AIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVD 181
+A +LARFH A R ID AV RW ENL EL H A VVS + + +
Sbjct 149 MLASMLARFHATADRRADIDACATAAAVRARWCENLDELDHSAGAVVSAQTVDEVRRLAL 208
Query 182 EFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAA 241
++ GR+ LFAGRI + IVDGH DLLADD+F P LDCLEF+D LR++D +DDAA
Sbjct 209 RYLDGRDALFAGRIADRRIVDGHGDLLADDVFCTPDGPVPLDCLEFDDRLRFVDGVDDAA 268
Query 242 FLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPE 301
FLAMDLEFLGR+DL D+FL Y +GD+AP SL DFYIAYRAVVRAKV+C++ QG +
Sbjct 269 FLAMDLEFLGRRDLADHFLDQYQELAGDSAPRSLVDFYIAYRAVVRAKVDCIKVGQGHED 328
Query 302 AAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLR 361
AAADA HL IA HL+ ATVRL LVGG PGTGK+TL+ + E VGA VISTD+VRR L+
Sbjct 329 AAADAGWHLDIAANHLKAATVRLVLVGGGPGTGKTTLSGALGESVGAHVISTDNVRRELQ 388
Query 362 DCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARR 421
D GV+ G G L+SGLYS NV VY L +A +LL G SV+LDGTW DP R AR
Sbjct 389 DSGVVHGAAGALESGLYSPENVALVYDTVLHRAAVLLAHGESVVLDGTWRDPGHRRAARD 448
Query 422 LAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAG 481
A + + +VE C + RI R SDAT +IAA + W HR+DT
Sbjct 449 CADRSSAVLVELACDTELSAAQTRITHRTSTTSDATPQIAADITTPV--WHGAHRVDTGR 506
Query 482 PRERSVGQAYHI 493
P SV +A I
Sbjct 507 PLADSVAEAQQI 518
>gi|296394769|ref|YP_003659653.1| hypothetical protein Srot_2377 [Segniliparus rotundus DSM 44985]
gi|296181916|gb|ADG98822.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=522
Score = 428 bits (1100), Expect = 1e-117, Method: Compositional matrix adjust.
Identities = 236/468 (51%), Positives = 295/468 (64%), Gaps = 0/468 (0%)
Query 19 PFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYL 78
P+ DV ETH+ VV L GDRA+K KKP+ T F DFR E RERACIRE ELN R + YL
Sbjct 18 PYADVAETHSGVVFLVGDRAYKLKKPIATAFLDFRRTEDRERACIREVELNRRFSPDVYL 77
Query 79 GIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNR 138
G+AHL++P GG EPVVVMRR ++ RL+ + + L +A LA FH+ A+R+
Sbjct 78 GVAHLTEPGGGPDEPVVVMRRMPEEARLSLLALGQADAKEGLGELARKLASFHRLARRSA 137
Query 139 CIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEG 198
ID +G A RRW NL E+R A G +I R+E + +++ GRE LFA RI +
Sbjct 138 QIDAEGTACATRRRWQANLTEIRGFAAAAEHGWLIDRVERLAADYLHGREPLFADRIAQR 197
Query 199 CIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDY 258
I+DGH DL+ADD+FL+ P +LDCL+F+D LR++D DDAAFL MDLE LGR DL
Sbjct 198 RIIDGHGDLIADDVFLLPDGPRVLDCLDFDDRLRFVDGADDAAFLVMDLEHLGRADLAGG 257
Query 259 FLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQ 318
FL GY + D AP SL D YIAYRA+VRAKVE +RF QG E+ +A +HL A+ HL+
Sbjct 258 FLGGYLAAAEDPAPRSLVDHYIAYRALVRAKVEFLRFEQGCDESRREARQHLAEASAHLE 317
Query 319 HATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLY 378
VRL LVGG PGTGKSTLA +AE VGAQVIS+D VR L+ +I GE G SGLY
Sbjct 318 RGAVRLMLVGGLPGTGKSTLANALAERVGAQVISSDLVRHELKTARMIAGELGQYASGLY 377
Query 379 SRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSAT 438
S VYQ +AR L G SVILD +WG R A++L +T +A+V RC+
Sbjct 378 SPELSSMVYQVMFERARDALSHGESVILDASWGAAGERERAQQLGRETDAAVVALRCTTP 437
Query 439 VDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERS 486
DV RI R G SDATA+IA A+A W +DT+ P E S
Sbjct 438 PDVAERRIAHRRAGFSDATADIARAMATDAGAWTQATDVDTSAPLETS 485
>gi|317507619|ref|ZP_07965332.1| hypothetical protein HMPREF9336_01704 [Segniliparus rugosus ATCC
BAA-974]
gi|316254096|gb|EFV13453.1| hypothetical protein HMPREF9336_01704 [Segniliparus rugosus ATCC
BAA-974]
Length=492
Score = 416 bits (1068), Expect = 7e-114, Method: Compositional matrix adjust.
Identities = 233/472 (50%), Positives = 296/472 (63%), Gaps = 0/472 (0%)
Query 23 VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH 82
+RETH+ VV+LAG+RA+K KKPV T F DF T E RE AC RE ELN RL+ YLG+AH
Sbjct 13 MRETHSGVVLLAGERAYKFKKPVTTAFLDFSTHEAREFACAREAELNRRLSPDVYLGVAH 72
Query 83 LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT 142
L+DP GG AEPVVVMRR + RL+++V G PV L +A LA FH+ AQR +
Sbjct 73 LTDPVGGPAEPVVVMRRMPETARLSTLVGQGAPVGQGLAELALALAGFHRWAQRGPQVAA 132
Query 143 QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD 202
Q V A+ RW NLAE+ + ++ I+ + ++ GR+ LFA RI G IVD
Sbjct 133 QASVKAIRGRWQANLAEISLFSAAAQHDGLLDHIQQLALRYLRGRKELFAERIARGRIVD 192
Query 203 GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG 262
GH DL+ADD+FL+ P LDCL+F+D LR++D DDAAFLAMDLE+LGR DLG FL
Sbjct 193 GHGDLIADDVFLLPEGPRALDCLDFDDRLRFVDGADDAAFLAMDLEYLGRPDLGQSFLEQ 252
Query 263 YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV 322
Y + D AP SL YIAYRA+VRAKV+ VR QG ++ A+A+RHL +A HL+
Sbjct 253 YLAEAEDDAPRSLLHHYIAYRALVRAKVDYVRLGQGHAQSRAEALRHLRLAADHLERGAA 312
Query 323 RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN 382
RL L+GG PGTGKSTLA +A VGA V+S+D+VR L++ G I GE G GLY+
Sbjct 313 RLVLIGGLPGTGKSTLAAALAGEVGAAVVSSDEVRHELKESGEIAGEAGQYGRGLYAPEA 372
Query 383 VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM 442
VY+ L +AR L G SV+LD +W + R A RLA ++ +A+VE RC A DV
Sbjct 373 AAKVYRTMLDRARGALAGGASVVLDASWVQAEQRELAARLAEESSAALVELRCVAPQDVA 432
Query 443 ADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIW 494
RI AR SDATAEIA +AA W T +DTA ++ A W
Sbjct 433 WRRIAARRQSRSDATAEIARDMAADMRPWPTSSEVDTAAEPGAALRAALEAW 484
>gi|325676765|ref|ZP_08156438.1| hypothetical protein HMPREF0724_14221 [Rhodococcus equi ATCC
33707]
gi|325552313|gb|EGD22002.1| hypothetical protein HMPREF0724_14221 [Rhodococcus equi ATCC
33707]
Length=513
Score = 393 bits (1009), Expect = 4e-107, Method: Compositional matrix adjust.
Identities = 228/484 (48%), Positives = 292/484 (61%), Gaps = 7/484 (1%)
Query 17 DEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQS 76
D+PF + ETH+ VV+L GDR +K KKP+ T+F DFR+ E R AC E ELN RLA
Sbjct 26 DQPFAGLHETHSGVVILLGDRVYKIKKPIRTEFLDFRSREARLAACRNEVELNRRLAPDV 85
Query 77 YLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAG-LPVEGALDAIAEVLARFHQRAQ 135
YLG+ L GG EP VVMRR + RL+++ A A+DAIA ++A FH+RA
Sbjct 86 YLGVGELGGTEGGDGEPTVVMRRMPESARLSTLARASSTQCPSAVDAIARIVADFHRRAA 145
Query 136 RNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRI 195
R ID +G AV RRWH+N+ E R VV+ D + IE VD ++ GR LFA RI
Sbjct 146 RGPRIDREGTADAVRRRWHDNIRETRELPRAVVAEDRLAAIERTVDRYLDGRGPLFAQRI 205
Query 196 KEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDL 255
+GCIVDGHADLL+DDIF ++ P +LDCLEF+ LRYLDRIDD A LAMDLEF GR DL
Sbjct 206 TDGCIVDGHADLLSDDIFCLEDGPRILDCLEFDARLRYLDRIDDIACLAMDLEFQGRPDL 265
Query 256 GDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQ 315
+ Y +TAP SL YIAYRA +RAKV+CVR QG+ +A DA RH +A Q
Sbjct 266 ARRLVLRYRDALTETAPDSLVHHYIAYRAFMRAKVDCVRHLQGRASSADDAARHTALAEQ 325
Query 316 HLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCG-VITGEPGVLD 374
HL A RL LVGG P TGKST+A +AE VGA++IS+D VRR L T +PG
Sbjct 326 HLDRARCRLVLVGGLPATGKSTVAARLAETVGAELISSDHVRRHLFAADRTATPDPG-YR 384
Query 375 SGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFR 434
SG YS + VY L +AR LL G SV+LD +W + R A A + +V+ +
Sbjct 385 SGRYSPDSTGRVYDSMLDRARELLAGGRSVVLDASWTHREHRLRAAETAVAVCADLVQLQ 444
Query 435 CSATVDVMADRIVARAGG----NSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA 490
C+A ++ R+ RA +S+AT +A A+A W R+DT+GP + S+ A
Sbjct 445 CTAPAELTEHRLRERAASRRDHDSEATPAVAVAMAHDADSWPAATRVDTSGPLDASLSVA 504
Query 491 YHIW 494
W
Sbjct 505 AAEW 508
>gi|312139781|ref|YP_004007117.1| hypothetical protein REQ_23910 [Rhodococcus equi 103S]
gi|311889120|emb|CBH48433.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=493
Score = 388 bits (997), Expect = 1e-105, Method: Compositional matrix adjust.
Identities = 230/484 (48%), Positives = 292/484 (61%), Gaps = 7/484 (1%)
Query 17 DEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQS 76
D+PF + ETH+ VV+L GDR +K KKP+ T+F DFR+ E R AC E ELN RLA
Sbjct 6 DQPFAGLHETHSGVVILLGDRVYKIKKPIRTEFLDFRSREARLAACRNEVELNRRLAPDV 65
Query 77 YLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAG-LPVEGALDAIAEVLARFHQRAQ 135
YLG+ L DP GG EP VVMRR + RL+++ A A+DAIA ++A FH+RA
Sbjct 66 YLGVGELGDPEGGDGEPTVVMRRMPESARLSTLARASSTQCPSAVDAIARIVADFHRRAA 125
Query 136 RNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRI 195
R ID +G AV RRWH+N+ E R VV+ D + IE VD + GR LFA RI
Sbjct 126 RGPRIDREGTADAVRRRWHDNIRETRELPRAVVAEDRLAAIERTVDRYFDGRGPLFAQRI 185
Query 196 KEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDL 255
+GCIVDGHADLL+DDIF ++ P +LDCLEF+ LRYLDRIDD A LAMDLEF GR DL
Sbjct 186 TDGCIVDGHADLLSDDIFCLEDGPRILDCLEFDARLRYLDRIDDIACLAMDLEFQGRPDL 245
Query 256 GDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQ 315
+ Y +TAP SL YIAYRA +RAKV+CVR QG+ +A DA RH +A Q
Sbjct 246 AQRLVLRYRDALTETAPDSLVHHYIAYRAFMRAKVDCVRHLQGRAASADDAARHTALAEQ 305
Query 316 HLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCG-VITGEPGVLD 374
HL A RL LVGG P TGKST+A +AE VGA++IS+D VRR L T PG
Sbjct 306 HLDRARCRLVLVGGLPATGKSTVAARLAETVGAELISSDHVRRHLFAADRTATPYPG-YR 364
Query 375 SGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFR 434
SG YS + VY L +AR LL G SV+LD +W + R A A + +V+ +
Sbjct 365 SGRYSPDSTGRVYDSMLDRARELLAGGRSVVLDASWTHREHRLRAAETAVAVCADLVQLQ 424
Query 435 CSATVDVMADRIVARAGG----NSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA 490
C+A ++ R+ RA +S+AT +A A+A W R+DT+GP + S+ A
Sbjct 425 CTAPAELTEHRLRERAASRRDHDSEATPAVAVAMAHDADSWPAATRVDTSGPLDASLSVA 484
Query 491 YHIW 494
W
Sbjct 485 AAEW 488
>gi|226306904|ref|YP_002766864.1| hypothetical protein RER_34170 [Rhodococcus erythropolis PR4]
gi|226186021|dbj|BAH34125.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=495
Score = 380 bits (975), Expect = 4e-103, Method: Compositional matrix adjust.
Identities = 216/479 (46%), Positives = 278/479 (59%), Gaps = 0/479 (0%)
Query 20 FIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLG 79
++DV+ET T VVVL GDRA+K KK + T F DF +RE A RE LN R+ Y G
Sbjct 7 YLDVKETTTGVVVLVGDRAYKIKKAISTPFLDFSEPSRREDALQRELTLNQRICEGVYRG 66
Query 80 IAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRC 139
++H+ DP +EP++VM R D++RL+++ V ALD IA ++ FH+RA R+
Sbjct 67 VSHVVDPVDQSSEPILVMVRMPDERRLSALAATNRDVAHALDEIAAAVSDFHRRAARSVR 126
Query 140 IDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGC 199
I QG V RW NLAELR ++ + +E M ++ GR LF RI EGC
Sbjct 127 ISEQGTSTGVGHRWVSNLAELRQLCSGLLPPCRLDHLESMSTRYLCGRRSLFDSRIAEGC 186
Query 200 IVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYF 259
IVDGH DLLADDIF++ P +LDCL+F+D+LR++D +DD+ FLAMDLEFLG +DL F
Sbjct 187 IVDGHGDLLADDIFVLPDGPKILDCLDFDDQLRFVDILDDSCFLAMDLEFLGYEDLASEF 246
Query 260 LAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQH 319
LA +S D P SL D YIAYRA VRAKV+ +R QG + A RHL +A HL +
Sbjct 247 LASIVTKSNDQPPISLIDHYIAYRATVRAKVDALRLQQGDSNSQAALERHLDLAENHLCN 306
Query 320 ATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS 379
VRL LVGG PGTGK+TL+ +A GA VIS+D VRR L D G + G SGLYS
Sbjct 307 GEVRLCLVGGFPGTGKTTLSLALAAHTGATVISSDRVRRELVDAGALRGSADAYQSGLYS 366
Query 380 RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV 439
+V VY E L +AR L G SV+LD TW + R A L T S ++ S +
Sbjct 367 PDSVHTVYSEMLDRARDHLSMGESVVLDATWARSRHRREAELLCTSTASTLISLSTSTPL 426
Query 440 DVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSAI 498
V RI R SDAT AAA+A W IDT + S A +WR ++
Sbjct 427 SVAVQRIATRTNTLSDATTATAAAIAQGHDPWPESTAIDTDTSIDVSAESAVDVWRRSV 485
>gi|111017077|ref|YP_700049.1| hypothetical protein RHA1_ro00055 [Rhodococcus jostii RHA1]
gi|110816607|gb|ABG91891.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=504
Score = 366 bits (939), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 221/473 (47%), Positives = 288/473 (61%), Gaps = 1/473 (0%)
Query 25 ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS 84
ETHTA VV+ GD FKAKKP+ T F DF TAE+R AC RE LN RL YLG+A L+
Sbjct 25 ETHTAYVVMVGDVVFKAKKPIRTAFADFGTAERRRAACEREVTLNRRLCPDVYLGVAELT 84
Query 85 DPSGGHAEPVVVMRRYRDKQRLASMVTAGLP-VEGALDAIAEVLARFHQRAQRNRCIDTQ 143
DP+GG E +V MRR +RLA +V G +DAIA V+ARFH A+++ ID
Sbjct 85 DPAGGPTEALVKMRRMPSDRRLARLVGGGGDDTTAQVDAIAAVVARFHAGAEQSAEIDCD 144
Query 144 GEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDG 203
GAVA RW N++E+ + V+ + ++ ++ GR+ LF R++EG IVDG
Sbjct 145 ATPGAVAARWRANVSEVTSYRCDVLPAADVHEVQARALRYLKGRKRLFEYRVREGRIVDG 204
Query 204 HADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGY 263
H DLLADDIF + P +LDCL+F+D LR++D IDDAA LAMDLE+LGR+DLG FL Y
Sbjct 205 HGDLLADDIFCLADGPRILDCLDFDDHLRHVDCIDDAACLAMDLEYLGREDLGSRFLDRY 264
Query 264 AVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVR 323
+ D PASL+ YIAYRA VRAKV C+R++QG A DA H +A + L+ TVR
Sbjct 265 CAAARDEPPASLQHHYIAYRAFVRAKVACLRYTQGSRAAGEDARAHCNLALRQLRAGTVR 324
Query 324 LALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANV 383
+ALVGG PGTGKSTL+R +A++ G+ VIS+D VR+ L + + GLYS
Sbjct 325 MALVGGLPGTGKSTLSRKLADVTGSVVISSDHVRKELDGLDPHSRQVAGFGEGLYSGTMT 384
Query 384 VAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVMA 443
Y E LR+AR L +G SV+LD +W R A +A+ THS +VE C A +
Sbjct 385 DRTYAEVLRRARDHLTAGRSVVLDASWTQSMRRERAALVASCTHSDLVELECRAPRAMAI 444
Query 444 DRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRS 496
RI +R G+SDAT + A+AA A W + +DT SV A IW S
Sbjct 445 ARIGSRPTGDSDATPAVYDAMAASAAAWPSATAVDTDTAAGDSVQTAERIWHS 497
>gi|271966203|ref|YP_003340399.1| gluconate kinase [Streptosporangium roseum DSM 43021]
gi|270509378|gb|ACZ87656.1| gluconate kinase [Streptosporangium roseum DSM 43021]
Length=533
Score = 364 bits (934), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 210/474 (45%), Positives = 277/474 (59%), Gaps = 0/474 (0%)
Query 20 FIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLG 79
+ ++ETH VVVL GD AFK KKPV F DF T + RER C E ELN RLA Y G
Sbjct 40 WAQIKETHIGVVVLLGDHAFKLKKPVNFGFVDFTTRQARERICHEEVELNRRLAPDVYEG 99
Query 80 IAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRC 139
+A + G E +VVMRR +++RLA+M+ +G PVE L IA ++A H R++ +
Sbjct 100 VADVLGTDGQVCEHLVVMRRMPEERRLAAMIDSGKPVEEHLRQIARMVASMHGRSRHSPQ 159
Query 140 IDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGC 199
ID QG A+ RW + ++R + V+ +V+ IE + F+ GR LF RI EG
Sbjct 160 IDQQGSGQALRSRWSASFDQVRALPEPVLGPEVVGEIERLTLRFLDGRGPLFTARIDEGR 219
Query 200 IVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYF 259
IVDGH DLLA+DIF +D P +LDCLEF++ LR++D +DD AFLAMDLE LG L + F
Sbjct 220 IVDGHGDLLAEDIFCLDDGPRILDCLEFDERLRFVDGLDDVAFLAMDLERLGAPRLAEVF 279
Query 260 LAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQH 319
L Y +GD AP SL Y+AYRA VRAKV C+R QG AA +A R + +HLQ
Sbjct 280 LHQYTEFTGDPAPPSLWHHYVAYRAFVRAKVACLRRGQGDSGAAWEARRFADLTLRHLQA 339
Query 320 ATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS 379
TV L LVGG PG GKSTLA +A+ +G V+++D VR+ + G+Y
Sbjct 340 GTVPLILVGGAPGAGKSTLAAALADRLGYTVLNSDRVRKEMAGISPDQSASAPFGEGIYD 399
Query 380 RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV 439
+ Y E L +A LL G VILD +WG RA A R+A T S +V RC+A
Sbjct 400 PEHTERTYDELLSRAGKLLERGEPVILDASWGGAGHRAAADRVAQRTSSDLVALRCTALP 459
Query 440 DVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHI 493
V A+R+ R G SDA I AA+AAR A W IDT+ E+++ +A +
Sbjct 460 QVAAERLARRTGAVSDADQAIGAAVAARMAPWPDAVEIDTSASPEQALERALAV 513
>gi|329938537|ref|ZP_08287962.1| hypothetical protein SGM_3454 [Streptomyces griseoaurantiacus
M045]
gi|329302510|gb|EGG46401.1| hypothetical protein SGM_3454 [Streptomyces griseoaurantiacus
M045]
Length=508
Score = 360 bits (924), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 207/468 (45%), Positives = 270/468 (58%), Gaps = 0/468 (0%)
Query 23 VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH 82
VR+THTAV+ D +K KK V F D+ ++ R AC RE +LN R A YLG+
Sbjct 18 VRKTHTAVLFFMEDHVYKVKKRVDLGFLDYTSSTARRTACEREIDLNRRFAPDVYLGLGE 77
Query 83 LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT 142
L P EP+VVMRR D RLA +V G PV AL +A LA +H A R I+
Sbjct 78 LRTPGEEEPEPLVVMRRMPDDLRLAHLVGTGAPVGDALRVVARQLAAWHAVAPRGPDIEE 137
Query 143 QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD 202
QG A+ RW + A++ + + D IE +V E+++GRE LF RI++G +VD
Sbjct 138 QGTRDALTSRWESSFAQVDAMTAEGLESDAPAEIERLVREYLAGREPLFDMRIEQGRVVD 197
Query 203 GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG 262
GH DLLADDIF D P +LDCLEF+D LRY+D +DDAAFLAMDLE LG + +FLA
Sbjct 198 GHGDLLADDIFCFDDGPRILDCLEFDDHLRYVDGLDDAAFLAMDLELLGAPESAAFFLAR 257
Query 263 YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV 322
Y SGD AP SL Y+AYRA VRAKV ++ QG P A A R + + +HL+ + V
Sbjct 258 YGEYSGDPAPPSLWHHYVAYRAFVRAKVSMIQARQGAPGTRAAAQRFIAMTLRHLRTSAV 317
Query 323 RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN 382
L LVGG PGTGKSTL+ +A+ +GA ++S+D +R+ L V GLY+
Sbjct 318 GLTLVGGLPGTGKSTLSGALADRLGAVLLSSDRLRKELAGLPVEQTATAAYGQGLYTPEW 377
Query 383 VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM 442
Y L +A LL G SV+LD TW DP R ARR A +A+ C DV
Sbjct 378 TARTYAALLDRAAALLARGESVVLDATWTDPAQREAARRTAETASAALTALHCHVPRDVA 437
Query 443 ADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA 490
ADRI+ RA G SDA +A A+++R+ W +DT+GP +V QA
Sbjct 438 ADRILTRAPGASDADIGVADAMSSREPPWSGAVPVDTSGPLGSAVAQA 485
>gi|254381619|ref|ZP_04996983.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194340528|gb|EDX21494.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length=508
Score = 350 bits (899), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 204/472 (44%), Positives = 273/472 (58%), Gaps = 0/472 (0%)
Query 19 PFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYL 78
P +V ETHTAV+ GDRA+K KKPV F D+ T R AC +E LN R A YL
Sbjct 14 PRAEVCETHTAVLFFVGDRAYKVKKPVDLGFLDYTTTAARRAACEQEVALNRRFAPDVYL 73
Query 79 GIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNR 138
G+ P EP+VVMRR ++RL+ +V+ G V+ AL ++A +LA H A R
Sbjct 74 GLGEFRGPDADTPEPLVVMRRMPAERRLSLLVSQGADVDEALRSVARLLASRHADAPRGP 133
Query 139 CIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEG 198
ID QG A++ RW + ++R + D + E +V +++GRE LF RI++G
Sbjct 134 DIDEQGRRDALSARWEASFTQVRELTEDGRLLDGVAETERLVRRYLAGREELFDVRIEQG 193
Query 199 CIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDY 258
+VDGH DLLA DIF +D P +LDCLEF+D LR +D +DDAAFLAMDLE G + +
Sbjct 194 RVVDGHGDLLAQDIFCLDDGPRVLDCLEFDDRLRSVDGLDDAAFLAMDLEQTGAPEAAAF 253
Query 259 FLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQ 318
FLA Y SGD AP SL Y+AYRA VRAKV ++ +QG A A A R L +HL+
Sbjct 254 FLARYGEYSGDPAPPSLWHHYVAYRAFVRAKVSLIQAAQGAHGAEAAARRLLTTTLRHLR 313
Query 319 HATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLY 378
+ V L LVGG PG+GKSTL+ +A+ +G ++S+D +R+ L GLY
Sbjct 314 TSAVGLTLVGGLPGSGKSTLSGALADRLGVTLLSSDRLRKELAGMPAEESASAGYGEGLY 373
Query 379 SRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSAT 438
+ Y E L +A +LL G SV+LD TW D RA A R+A T + +V C A
Sbjct 374 TPEWTARTYAELLDRASVLLAMGESVVLDATWSDAGQRAAALRMAERTSADLVALHCQAP 433
Query 439 VDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA 490
+V A R+ RA G SDAT E+A A+AA + W+ +DT G E +V QA
Sbjct 434 GEVSAARLTTRAPGASDATPEVARAMAAVEPPWEEAVPVDTGGSLEAAVIQA 485
>gi|290955585|ref|YP_003486767.1| hypothetical protein SCAB_10221 [Streptomyces scabiei 87.22]
gi|260645111|emb|CBG68197.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length=501
Score = 350 bits (899), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 199/466 (43%), Positives = 274/466 (59%), Gaps = 0/466 (0%)
Query 25 ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS 84
ETHTA+V AGDRA+K KK V F D+ + R AC+RE LN R A YLG+ +
Sbjct 3 ETHTAIVFFAGDRAYKVKKAVDLGFVDYTDRQARRAACVREVALNRRFAPDVYLGVGEVV 62
Query 85 DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQG 144
P +EP+VVMRR +RL+++V AG V+ L A+A LA +H A R R +D QG
Sbjct 63 APDAEVSEPLVVMRRMPAGRRLSALVRAGADVDEVLRAVARRLAAWHATAPRGRDVDEQG 122
Query 145 EVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGH 204
A+A RW + ++R + D + ++ +V +++GRE LF RI++ +VDGH
Sbjct 123 TRDALASRWEASFEQVRATTEGGSGFDGVPEVQRLVRRYLAGREALFDSRIEQRRVVDGH 182
Query 205 ADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYA 264
DLLA+DIF +D P +LDCLEF+D LRY+D +DDAAFLAMDLE LG FLA Y
Sbjct 183 GDLLAEDIFCLDDGPRVLDCLEFDDHLRYVDGLDDAAFLAMDLEQLGAPAAAARFLARYG 242
Query 265 VRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRL 324
SGD AP SL Y+AYRA VRAKV ++ QG P + A R + +HL+ + V L
Sbjct 243 EYSGDPAPPSLWHHYVAYRAFVRAKVSLIQAEQGAPGVRSAARRLVSTTLRHLRTSAVGL 302
Query 325 ALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANVV 384
LVGG PG+GKSTL+ +A+ +G ++S+D +R+ L + P + GLY+
Sbjct 303 TLVGGLPGSGKSTLSGALADRLGVTLLSSDRLRKELAGIPPESPAPAAYEEGLYTPEWTA 362
Query 385 AVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVMAD 444
Y L +A LL G SV+LD TW ++RA A R+A T + +V C +V A
Sbjct 363 RTYDILLDRAAALLSRGESVVLDATWSAAELRAAAGRVAERTCADLVALHCQVPDEVAAA 422
Query 445 RIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA 490
R+ R+ G SDA +A ALAAR+ W +DT+GP E +V +A
Sbjct 423 RLSTRSPGPSDADLGVADALAAREPPWPDAVVVDTSGPLESAVSRA 468
>gi|331698605|ref|YP_004334844.1| gluconate kinase [Pseudonocardia dioxanivorans CB1190]
gi|326953294|gb|AEA26991.1| gluconate kinase [Pseudonocardia dioxanivorans CB1190]
Length=492
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 212/477 (45%), Positives = 270/477 (57%), Gaps = 5/477 (1%)
Query 23 VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH 82
V ETH +V+L GDRA+K KKPV T FCDF T+ R A RE LN RLA YLG A
Sbjct 13 VHETHVGIVLLVGDRAYKVKKPVRTSFCDFSTSALRRVAIERELRLNRRLAPDVYLGTAR 72
Query 83 LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT 142
L G AEPV+VMRR D +RL+ +V+ G V L +A +LA H R++R+ I
Sbjct 73 LD--GAGCAEPVLVMRRMPDDRRLSRLVSDGCDVTDHLRRLARMLAALHARSERSASITA 130
Query 143 QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD 202
A+ RW NLA + + ++ + F++GR LF R EG +VD
Sbjct 131 DASADALLERWRANLAGMEALRGNALDPGLLDGAGRLAARFLAGRRPLFVRRAAEGRVVD 190
Query 203 GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG 262
GH DLLADD+F +D P LDCL+F+D LR++D +DDAAFLAMDLE LG YFL
Sbjct 191 GHGDLLADDVFCLDDGPRALDCLDFDDSLRHVDGLDDAAFLAMDLERLGADAAARYFLHA 250
Query 263 YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPE--AAADAVRHLIIATQHLQHA 320
YA + D AP +L Y+AYRA VRA V VR Q +P+ A ADA R L + HL+
Sbjct 251 YAGFAADPAPDTLVHHYVAYRAGVRATVAAVRHLQ-QPDVGADADAARLLTLTMAHLRAG 309
Query 321 TVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSR 380
RL LVGG PGTGKSTLA G+A+ GA ++S+D VR+ L T SGLY+
Sbjct 310 APRLVLVGGLPGTGKSTLAAGLADRTGAVLVSSDRVRKELAGMAPSTSAAAPFGSGLYAP 369
Query 381 ANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVD 440
+ A Y+E L +A LL G SV+LD +W + R A +A D + +V+ RC A D
Sbjct 370 EHTGATYRELLTRAAALLSLGESVVLDASWTCARRRTEAAAVADDRAAELVQVRCVAPSD 429
Query 441 VMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSA 497
V A RI AR G SDAT +AA +AA W +DT P R V A W +A
Sbjct 430 VAAARIRARHGSASDATPAVAAQMAATVDAWPDALEVDTTAPAARCVDMAREAWDAA 486
>gi|134100921|ref|YP_001106582.1| hypothetical protein SACE_4388 [Saccharopolyspora erythraea NRRL
2338]
gi|291003466|ref|ZP_06561439.1| hypothetical protein SeryN2_02972 [Saccharopolyspora erythraea
NRRL 2338]
gi|133913544|emb|CAM03657.1| hypothetical protein SACE_4388 [Saccharopolyspora erythraea NRRL
2338]
Length=495
Score = 340 bits (872), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 194/474 (41%), Positives = 266/474 (57%), Gaps = 0/474 (0%)
Query 20 FIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLG 79
++ ETH +V AGDR +K KKPV F DFR + R AC RE ELN RLA YL
Sbjct 14 YLSTAETHIGALVFAGDRVYKLKKPVDLGFVDFRDRKTRLWACRRELELNRRLAPDVYLD 73
Query 80 IAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRC 139
+ + P + +V+MRR + LA++V AG PVE + +A+ LA FH A+R
Sbjct 74 VLDVGPPGHEPCDHLVLMRRMPADRSLAALVEAGTPVEDEVREVAKKLAGFHSCAERGDE 133
Query 140 IDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGC 199
I +G AVA RW+ N+ LR ++ ++ I F+ GR L R+ EG
Sbjct 134 IAAEGAPDAVAGRWNTNVDGLRSFGGDLLDEQLLDEIAEHGRVFLEGRAPLLNRRVAEGR 193
Query 200 IVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYF 259
IVDGH D+L++D+F +D P +LDC+EF+D LR LD +DDA LAMDLE+ G +L + F
Sbjct 194 IVDGHGDVLSEDVFCLDDGPRILDCIEFDDRLRRLDAVDDAVCLAMDLEYRGAPELAERF 253
Query 260 LAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQH 319
L YA S D P LR FY AYRA+VR KV C + S G EAA+DA HL +A H++
Sbjct 254 LDWYAQFSDDAVPPGLRHFYTAYRALVRTKVACAKHSPGDEEAASDAREHLALAADHIRR 313
Query 320 ATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS 379
A RL LVGG PG+GK+TL+ +A+ +GA ++S+D VR+ + D P SGLYS
Sbjct 314 AVPRLVLVGGLPGSGKTTLSERIADRLGAVLLSSDRVRKEIADLSPAEPAPAEYRSGLYS 373
Query 380 RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV 439
+ Y E +R++ LLG G SV+LD +W R A A + +V RC A
Sbjct 374 AEHTERTYDELVRRSGELLGYGESVVLDASWTREHHRERAVEAANQARAIVVPLRCQAPE 433
Query 440 DVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHI 493
+R+ R SDA E+A ++A+ DW + IDT G E S +A +
Sbjct 434 STTVERLRGRHATASDADEEVARSIASDADDWPSAWPIDTTGSPEDSADEAVAV 487
>gi|21218720|ref|NP_624499.1| hypothetical protein SCO0163 [Streptomyces coelicolor A3(2)]
gi|5748625|emb|CAB53130.1| conserved hypothetical protein SCJ1.12 [Streptomyces coelicolor
A3(2)]
Length=508
Score = 338 bits (868), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 204/468 (44%), Positives = 266/468 (57%), Gaps = 0/468 (0%)
Query 23 VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH 82
V ETHTA+V DR +K KKPV F D+ T R C RE ELN R A YLG+
Sbjct 18 VCETHTAMVFFVEDRVYKRKKPVDLGFLDYTTRSSRRAVCEREIELNRRFAPDVYLGLGE 77
Query 83 LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT 142
L P AEP+VVMRR D +RL+ +V G PV L A+A LA +H A R I
Sbjct 78 LRTPGEQEAEPLVVMRRMPDDRRLSHLVRTGAPVADDLRAVARHLAAWHSAAPRGPAIAE 137
Query 143 QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD 202
QG A+A RW + ++ A K + D +E +V +++GR+ LF RI++G ++D
Sbjct 138 QGTRDALAARWEASFTQVDVLAAKGPTRDEAGEVERLVRRYLAGRKPLFGLRIEQGRVLD 197
Query 203 GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG 262
GH DLLADD+F + P +LDCLEF+D LRY+D +DDAAFLAMDLE LG + +FLA
Sbjct 198 GHGDLLADDVFCLGDGPRILDCLEFDDALRYVDGLDDAAFLAMDLESLGAPESAAFFLAQ 257
Query 263 YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV 322
Y SGD AP SL Y+AYRA VRAKV ++ QG P A A A R + +A +HL+ + V
Sbjct 258 YGEYSGDPAPPSLWHHYVAYRAFVRAKVSLIQARQGAPGAHATARRLVRMALRHLRASAV 317
Query 323 RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN 382
L LV G PGTGKSTL+ +A+ +GA ++S+D +R+ + GLY+
Sbjct 318 GLTLVAGLPGTGKSTLSGALADRLGAVLLSSDRLRKEMAGLSPQQTASADYGEGLYTPEW 377
Query 383 VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM 442
Y E L +A LL G SV+LD TW D R AR A + +V C DV
Sbjct 378 TARTYAELLDRAAALLALGESVVLDATWIDSAQREAARHTAESAGADLVALHCHVPDDVT 437
Query 443 ADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA 490
A R+ RA G SDA +A A+AA + W +DT G E +VGQA
Sbjct 438 AARLSTRAPGASDADLGVAEAMAAEEQPWSGAVGVDTGGSLEAAVGQA 485
>gi|289774177|ref|ZP_06533555.1| conserved hypothetical protein [Streptomyces lividans TK24]
gi|289704376|gb|EFD71805.1| conserved hypothetical protein [Streptomyces lividans TK24]
Length=508
Score = 337 bits (864), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 204/468 (44%), Positives = 266/468 (57%), Gaps = 0/468 (0%)
Query 23 VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH 82
V ETHTA+V DR +K KKPV F D+ T R C RE ELN R A YLG+
Sbjct 18 VCETHTAMVFFVEDRVYKRKKPVDLGFLDYTTRSSRRAVCEREIELNRRFAPDVYLGLGE 77
Query 83 LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT 142
L P AEP+VVMRR D +RL+ +V G PV L A+A LA +H A R I
Sbjct 78 LRTPGEQGAEPLVVMRRMPDDRRLSHLVRTGAPVADDLRAVARHLAAWHSAAPRGPAIAE 137
Query 143 QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD 202
QG A+A RW + ++ A K + D +E +V +++GR+ LF RI++G ++D
Sbjct 138 QGTRDALAARWEASFTQVDVLAAKGPTRDETGEVERLVRRYLAGRKPLFDLRIEQGRVLD 197
Query 203 GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG 262
GH DLLADD+F + P +LDCLEF+D LRY+D +DDAAFLAMDLE LG + +FLA
Sbjct 198 GHGDLLADDVFCLGDGPRILDCLEFDDSLRYVDGLDDAAFLAMDLESLGAPESAAFFLAQ 257
Query 263 YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV 322
Y SGD AP SL Y+AYRA VRAKV ++ QG P A A A R + +A +HL+ + V
Sbjct 258 YGEFSGDPAPPSLWHHYVAYRAFVRAKVSLIQARQGAPGAHATARRLVRMALRHLRASAV 317
Query 323 RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN 382
L LV G PGTGKSTL+ +A+ +GA ++S+D +R+ + GLY+
Sbjct 318 GLTLVAGLPGTGKSTLSGALADRLGAVLLSSDRLRKEMAGLSPQQTASADYGEGLYTPEW 377
Query 383 VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM 442
Y E L +A LL G SV+LD TW D R AR A + +V C DV
Sbjct 378 TARTYAELLDRAAALLALGESVVLDATWIDSAQREAARHTAESAGADLVALHCHVPDDVT 437
Query 443 ADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA 490
A R+ RA G SDA +A A+AA + W +DT G E +VGQA
Sbjct 438 AARLSTRAPGASDADLGVAEAMAAEEQPWSGAVGVDTGGSLEAAVGQA 485
>gi|134099015|ref|YP_001104676.1| hypothetical protein SACE_2453 [Saccharopolyspora erythraea NRRL
2338]
gi|133911638|emb|CAM01751.1| hypothetical protein SACE_2453 [Saccharopolyspora erythraea NRRL
2338]
Length=505
Score = 328 bits (842), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 209/483 (44%), Positives = 271/483 (57%), Gaps = 5/483 (1%)
Query 16 TDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQ 75
TD + E+H AVV GDRA+K KKPV F DF T + RE AC RE ELN RLA
Sbjct 18 TDVGAAGIAESHCAVVAFIGDRAYKVKKPVDFGFLDFSTVQARETACRRELELNRRLAPD 77
Query 76 SYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQ 135
YL + + D +GG + +VVMRR +RL+ +V G V L+ +A +LA FH A+
Sbjct 78 VYLDLCRVLDGTGGTCDWIVVMRRMPPSRRLSELVRTGADVRPDLEKLARLLASFHSTAR 137
Query 136 RNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRI 195
+ +G A+ RRW +N A V+ I + +V GR L RI
Sbjct 138 SGPDVAAEGRASALRRRWVDNFAGAERFVGTVLDRGQFDEIVGLALAYVDGRGRLLDERI 197
Query 196 KEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDL 255
G +VDGH DLLA+DIF + P +LDCLEF+D LRY+D +DDAAFLAMDLE LG L
Sbjct 198 GRGYVVDGHGDLLAEDIFCLPDGPRVLDCLEFDDRLRYVDGLDDAAFLAMDLERLGAPRL 257
Query 256 GDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQ 315
FL Y SG SL Y AYRA VRAKV C+R +QG EAA A + IA +
Sbjct 258 AHQFLRWYREFSGAQVADSLAHHYTAYRAFVRAKVACLRAAQGAAEAADAAQQLSGIAVR 317
Query 316 HLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDS 375
HL+ VRL LVGG PGTGK+TLA G+A+ +GA ++ TD +R+ + + G
Sbjct 318 HLRAGQVRLLLVGGLPGTGKTTLAGGLADQLGAVLLRTDVIRKEMPGADDLATHAG-YGQ 376
Query 376 GLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRC 435
GLY+ + V Y+ L + R LL G +V+LD +W R AR +A DTHSA+ E RC
Sbjct 377 GLYNGSQVHGTYEAMLTRCRALLERGETVVLDASWSSAGERESARSIAQDTHSALAELRC 436
Query 436 SATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWR 495
A +V RI R G SDATA++A A++ DW +DT P + S +A R
Sbjct 437 VAPREVAEARIAGRYGDVSDATADVAVAMSRHFDDWPQATDVDTTRPPDESAREA----R 492
Query 496 SAI 498
SAI
Sbjct 493 SAI 495
>gi|291006888|ref|ZP_06564861.1| hypothetical protein SeryN2_20403 [Saccharopolyspora erythraea
NRRL 2338]
Length=510
Score = 328 bits (841), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 209/483 (44%), Positives = 271/483 (57%), Gaps = 5/483 (1%)
Query 16 TDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQ 75
TD + E+H AVV GDRA+K KKPV F DF T + RE AC RE ELN RLA
Sbjct 23 TDVGAAGIAESHCAVVAFIGDRAYKVKKPVDFGFLDFSTVQARETACRRELELNRRLAPD 82
Query 76 SYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQ 135
YL + + D +GG + +VVMRR +RL+ +V G V L+ +A +LA FH A+
Sbjct 83 VYLDLCRVLDGTGGTCDWIVVMRRMPPSRRLSELVRTGADVRPDLEKLARLLASFHSTAR 142
Query 136 RNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRI 195
+ +G A+ RRW +N A V+ I + +V GR L RI
Sbjct 143 SGPDVAAEGRASALRRRWVDNFAGAERFVGTVLDRGQFDEIVGLALAYVDGRGRLLDERI 202
Query 196 KEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDL 255
G +VDGH DLLA+DIF + P +LDCLEF+D LRY+D +DDAAFLAMDLE LG L
Sbjct 203 GRGYVVDGHGDLLAEDIFCLPDGPRVLDCLEFDDRLRYVDGLDDAAFLAMDLERLGAPRL 262
Query 256 GDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQ 315
FL Y SG SL Y AYRA VRAKV C+R +QG EAA A + IA +
Sbjct 263 AHQFLRWYREFSGAQVADSLAHHYTAYRAFVRAKVACLRAAQGAAEAADAAQQLSGIAVR 322
Query 316 HLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDS 375
HL+ VRL LVGG PGTGK+TLA G+A+ +GA ++ TD +R+ + + G
Sbjct 323 HLRAGQVRLLLVGGLPGTGKTTLAGGLADQLGAVLLRTDVIRKEMPGADDLATHAG-YGQ 381
Query 376 GLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRC 435
GLY+ + V Y+ L + R LL G +V+LD +W R AR +A DTHSA+ E RC
Sbjct 382 GLYNGSQVHGTYEAMLTRCRALLERGETVVLDASWSSAGERESARSIAQDTHSALAELRC 441
Query 436 SATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWR 495
A +V RI R G SDATA++A A++ DW +DT P + S +A R
Sbjct 442 VAPREVAEARIAGRYGDVSDATADVAVAMSRHFDDWPQATDVDTTRPPDESAREA----R 497
Query 496 SAI 498
SAI
Sbjct 498 SAI 500
>gi|269127024|ref|YP_003300394.1| hypothetical protein Tcur_2811 [Thermomonospora curvata DSM 43183]
gi|268311982|gb|ACY98356.1| conserved hypothetical protein [Thermomonospora curvata DSM 43183]
Length=533
Score = 323 bits (828), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 192/424 (46%), Positives = 248/424 (59%), Gaps = 0/424 (0%)
Query 23 VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH 82
V ETHT +V AG+RA+K KKPV F D T +R R C RE ELN R A YLG+A
Sbjct 17 VSETHTGIVFFAGERAYKVKKPVDLGFVDLTTRRERRRVCHREVELNRRFAGDVYLGVAE 76
Query 83 LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT 142
LS P EP+VVMRR +RLA++V A PVE L A+A LA +H +A R I
Sbjct 77 LSGPGDEPPEPIVVMRRMPAGRRLATLVAARRPVEEPLRAVARTLAAWHAQAPRGPHISE 136
Query 143 QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD 202
QG A+ +RW ++ ++R + + IE V F++GRE LF RI+ G IVD
Sbjct 137 QGSRDALRQRWRDSFEQVRPFHGRSIGAAEAAEIEERVLRFLAGREPLFRSRIQAGRIVD 196
Query 203 GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG 262
GH DL+A DIF +D P +LDCLEF+D LR+LD +DDA+FLAMDLE LG L + FL
Sbjct 197 GHGDLMATDIFCLDDGPRILDCLEFDDRLRWLDGLDDASFLAMDLERLGAPGLAERFLYW 256
Query 263 YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV 322
YA + D APASLR Y+AYRA VRAKV C+R +QG AAA + +HL+ V
Sbjct 257 YAEYAADPAPASLRHHYVAYRAFVRAKVACLRHAQGDAAAAAQIDPLTELTLRHLRAGAV 316
Query 323 RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN 382
L LVGG PGTGKSTLAR + + +G V+++D VR+ L +G+YS A+
Sbjct 317 GLILVGGLPGTGKSTLARSLGDRLGCAVLNSDVVRKELAGIPPDQSAAAPYGTGIYSPAH 376
Query 383 VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM 442
Y L +A LL G SV+LD +W + R AR LA TH+ + RC A +
Sbjct 377 TERTYATLLGRAETLLEQGESVVLDASWTVAEHRTLARLLARRTHADLFALRCEAPPALA 436
Query 443 ADRI 446
R+
Sbjct 437 EQRM 440
>gi|302557027|ref|ZP_07309369.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
gi|302474645|gb|EFL37738.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
Length=448
Score = 310 bits (795), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 187/431 (44%), Positives = 249/431 (58%), Gaps = 0/431 (0%)
Query 19 PFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYL 78
P ++ ETHTAVV+L G+ A+K KKPV F D T RE AC +E LN R A YL
Sbjct 18 PRAEMHETHTAVVLLFGEHAYKIKKPVDLGFLDHTTQAAREAACAQEVALNRRFAEDVYL 77
Query 79 GIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNR 138
G+ L P AEP+VVMRR +RL+ +V G V+ L A+A LA H A R+
Sbjct 78 GVGELRMPHTDEAEPLVVMRRMPADRRLSRLVREGADVDDVLRAVARQLAARHADAPRSP 137
Query 139 CIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEG 198
+D QG A+ RW + A++R +V D + E +V +++GRE LF RI+EG
Sbjct 138 EVDAQGTRDALLSRWEASFAQVRALDGEVPLPDGLDETERLVRRYLAGREALFDTRIREG 197
Query 199 CIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDY 258
IVDGH DL+A+D+F +D P +LDCLEF+D LR++D +DDAAFLAMDLE LG + +
Sbjct 198 RIVDGHGDLMAEDVFCLDDGPRILDCLEFDDRLRHVDGLDDAAFLAMDLEQLGVPESAAH 257
Query 259 FLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQ 318
FLA Y+ SGD AP SL Y++YRA VRAKV ++ QG P A A A R +HL+
Sbjct 258 FLARYSEYSGDPAPPSLWHHYVSYRAFVRAKVSLIQSRQGAPGAGAAARRLATATLRHLR 317
Query 319 HATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLY 378
+ V L LVGG PG+GKSTLA +A+ +G ++S+D +R+ L GLY
Sbjct 318 TSAVGLTLVGGLPGSGKSTLAGALADRLGVTLLSSDRLRKELAGIPAEQSAAAPYGEGLY 377
Query 379 SRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSAT 438
+ Y E L +A LL +G SV+LD TW DP R A R A T + +V C
Sbjct 378 TPEWTDRTYTELLDRAAALLSAGESVVLDATWSDPGRREAALRTAERTRADLVALHCRVP 437
Query 439 VDVMADRIVAR 449
+V R+ R
Sbjct 438 GEVSRARLTTR 448
>gi|336179936|ref|YP_004585311.1| hypothetical protein FsymDg_4119 [Frankia symbiont of Datisca
glomerata]
gi|334860916|gb|AEH11390.1| hypothetical protein FsymDg_4119 [Frankia symbiont of Datisca
glomerata]
Length=539
Score = 300 bits (768), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 207/471 (44%), Positives = 265/471 (57%), Gaps = 4/471 (0%)
Query 21 IDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGI 80
++ ETH+A + +GDR FK KKP+ F DFRT + RE AC E ELN RLA YLG+
Sbjct 59 LETVETHSATLYFSGDRVFKVKKPLDLGFLDFRTRQAREAACRAEVELNRRLAPDVYLGM 118
Query 81 AHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCI 140
A + D +G + +VVMRR +RL+SM++ G V+G L A+A +LA FHQR + I
Sbjct 119 ADIHDNAGTLVDHMVVMRRMPANRRLSSMISMGRRVDGQLRALARLLAAFHQRCPTSPEI 178
Query 141 DTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCI 200
G + W E+L + + ++ VI + + +++GR L A R + G I
Sbjct 179 AEAGSPATLDGLWQESLTGIAPFSGMLIDSSVIDELGRLAPRYLAGRAALLAERQRAGWI 238
Query 201 VDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFL 260
DGH DLLADDI+ +D P +LDC+EF+ LR D + D AFLAMDLE LG D + FL
Sbjct 239 RDGHGDLLADDIYCLDDGPRVLDCIEFDRRLRVGDVLGDIAFLAMDLERLGAADAAERFL 298
Query 261 AGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHA 320
Y +G+ P SLR YIAYRA+VRAKV C+R QG PEAA A R IA HL+
Sbjct 299 TWYGEFAGEKHPPSLRHLYIAYRALVRAKVSCIRARQGAPEAARQARRLARIALLHLRRG 358
Query 321 TVRLALVGGNPGTGKSTLA-RGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS 379
VRL L+GG PGTGKSTLA R V G ++ +D VR+ L T P L G Y
Sbjct 359 RVRLVLIGGLPGTGKSTLAGRLVDTEDGWVLLRSDVVRKELAGLPADTQIPAGLFEGHYD 418
Query 380 RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV 439
A Y E LR+AR L G SV+LD +W RA A LA T S +VE RC +
Sbjct 419 AQTTDATYTELLRRARHALERGESVVLDASWSTAAHRAAAAALAEQTSSDLVELRCVTSP 478
Query 440 DVMADRIVARAGGN---SDATAEIAAALAARQADWDTGHRIDTAGPRERSV 487
+V A RI RA SDAT + A+AAR W T I TA P +V
Sbjct 479 EVAAARIARRAAAGEDPSDATLAVHQAMAARAQPWPTASVIQTAVPISEAV 529
>gi|111223194|ref|YP_713988.1| hypothetical protein FRAAL3784 [Frankia alni ACN14a]
gi|111150726|emb|CAJ62427.1| conserved hypothetical protein; putative reductase and kinase
domains [Frankia alni ACN14a]
Length=518
Score = 286 bits (733), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 195/492 (40%), Positives = 258/492 (53%), Gaps = 38/492 (7%)
Query 22 DVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIA 81
+ RETH+AV+ L DR +K KKPV F DF RE C+ E LN RLA YLG+A
Sbjct 17 ETRETHSAVLYLTADRVYKRKKPVNLGFLDFTDRRTREAVCLAEVALNRRLAPDVYLGVA 76
Query 82 HLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCID 141
L D SG + +VVMRR +RL+++V G + AL ++A LA FHQR + + I
Sbjct 77 DLRDDSGEVIDHLVVMRRMPTSRRLSTLVRRGHLLGPALRSVARALAVFHQRCETSPLIA 136
Query 142 TQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIV 201
T GE + W E +A + + + V+ I + +++GR L A R + G I
Sbjct 137 TAGEQATLEDLWREGMAGIAAYRGTQLDAAVVDDIGRLALRYLAGRAELLAERTRAGWIR 196
Query 202 DGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLA 261
DGH DLLADDIF +D P +LDC+EF+ LR+ D + D AFLAMDLE LG + FL
Sbjct 197 DGHGDLLADDIFCLDDGPRILDCIEFDPRLRFGDVLGDVAFLAMDLERLGAPEEAAEFLE 256
Query 262 GYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHAT 321
Y SG+ P SL+ FY+AYRA VRAKV C+R QG P+AA +A R L +A +HL+
Sbjct 257 AYREFSGEVHPRSLQHFYVAYRAFVRAKVACIRGGQGDPDAAENARRLLAVAHRHLRAGR 316
Query 322 VRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGV--------- 372
VRL +VGG PGTGK+TLA +AE+ V+ D+ R+ + GEP
Sbjct 317 VRLVVVGGLPGTGKTTLASRLAEVGDGWVLLRSDIIRQ-----ELVGEPPAEAPHQQPAA 371
Query 373 ---------------------LDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWG 411
G Y+ A Y E LR+AR L G SV+LD +W
Sbjct 372 DTAPPAADAGADADAGGFDAQFGVGRYAPEITDATYAEMLRRARAALCRGESVVLDASWS 431
Query 412 DPQMRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGN---SDATAEIAAALAARQ 468
+ R A +AAD + +VE C +V A RI R S+AT I A+AAR
Sbjct 432 SARHRDAAAAVAADVCADLVELHCVTAPEVAAARIARRMAAGPDPSEATVAIHRAMAARA 491
Query 469 ADWDTGHRIDTA 480
W I TA
Sbjct 492 DPWPRAAVIRTA 503
>gi|312198390|ref|YP_004018451.1| hypothetical protein FraEuI1c_4588 [Frankia sp. EuI1c]
gi|311229726|gb|ADP82581.1| hypothetical protein FraEuI1c_4588 [Frankia sp. EuI1c]
Length=509
Score = 286 bits (731), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 207/494 (42%), Positives = 263/494 (54%), Gaps = 21/494 (4%)
Query 5 TNDGTCDAHPVTDEPFID--------VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAE 56
T DGT A P D V ETH+A + D +K KKPV F DF T
Sbjct 2 TGDGTATAFQARPAPRWDLAALPPGEVVETHSATLTFVDDLVYKVKKPVDLGFLDFSTRA 61
Query 57 QRERACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPV 116
+R AC +E LN RLA YL +A L D G + VVMRR +++RL S++ G PV
Sbjct 62 KRLAACEQEVALNRRLAPDVYLAVADLVDDRGRVVDHAVVMRRLPERRRLTSLIQRGQPV 121
Query 117 EGALDAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRI 176
AL AIA LA FH + I G +A W E + ++ + D ++ G VI I
Sbjct 122 GAALRAIARRLAAFHAACATSEQIAAAGSSATLAGLWREGVDQVAQYRDNILDGTVIAEI 181
Query 177 EHMVDEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDR 236
+ + +++GR L R+ G + DGH DL ADDIF + P +LDC+EF+ LR D
Sbjct 182 DRLSARYLAGRAPLLRARMAAGLVRDGHGDLQADDIFCLPDGPRILDCIEFDQRLRVGDV 241
Query 237 IDDAAFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFS 296
+ D AFLAMDLE LG + D L Y +G+T P SL Y+AYRA VR KV CVR
Sbjct 242 LGDVAFLAMDLERLGARAEADQLLGWYQEFAGETHPPSLAHLYVAYRAFVRTKVTCVRAG 301
Query 297 QGKPEAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELV-GAQVISTDD 355
QG P+AA A R +A HL+ VRLALVGG PGTGKSTLA +A+ G ++ +D
Sbjct 302 QGDPDAADLARRLADLALDHLRRGRVRLALVGGLPGTGKSTLAGRLADTEDGWVLLRSDT 361
Query 356 VRRRLRDCGVITGEP-------GVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDG 408
VR+ L G+ T P G GLYS A AVY E LR+A L G +V+LD
Sbjct 362 VRKEL--AGLPTDRPASPKLYEGGPFRGLYSPAATEAVYAELLRRAGHALARGDNVLLDA 419
Query 409 TWGDPQMRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGN---SDATAEIAAALA 465
+W D RA A RLAA H+ ++E RC +V A RI RA SDATA I ALA
Sbjct 420 SWSDAADRAAAARLAAAAHADLIELRCVTAPEVAAARIARRAAARTDASDATAAIHGALA 479
Query 466 ARQADWDTGHRIDT 479
R W + + T
Sbjct 480 TRADPWPSAAVVHT 493
>gi|158318664|ref|YP_001511172.1| hypothetical protein Franean1_6932 [Frankia sp. EAN1pec]
gi|158114069|gb|ABW16266.1| conserved hypothetical protein [Frankia sp. EAN1pec]
Length=523
Score = 283 bits (725), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 188/481 (40%), Positives = 246/481 (52%), Gaps = 15/481 (3%)
Query 25 ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS 84
ETHTA++ L DR +K +KPV F D RT R AC E LN RLA YLG+A +
Sbjct 44 ETHTAILFLTEDRVYKLRKPVDLGFVDLRTRHARLTACEDEVRLNRRLAPDVYLGVADIR 103
Query 85 DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQG 144
D G + +VVMRR +RL+ +V G + G L IA +A FH+R + + I G
Sbjct 104 DEEGHPRDHMVVMRRMPADRRLSELVRGGADLTGELRVIARTMAAFHERCETSPEISRAG 163
Query 145 EVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGH 204
+ + W E + + ++ + I + +++GR L A R + G I DGH
Sbjct 164 GLANLEALWLEAMDAVAPFRGSILDAGTVDEIGRLALRYLAGRAPLLAERQRAGRIRDGH 223
Query 205 ADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYA 264
DLLADDI+ +D P +LDC+ F+ LR D + D AFLAMDLE LG FL Y
Sbjct 224 GDLLADDIYCLDDGPRVLDCINFDRRLRVGDVLADVAFLAMDLERLGAPAAARTFLDAYR 283
Query 265 VRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRL 324
SG+T PASL YIAYRA VR ++ C+R QG PEAA +A R IA HL+ VRL
Sbjct 284 EFSGETHPASLEHLYIAYRAFVRVRIACIRDHQGDPEAAEEARRLADIALAHLRRGRVRL 343
Query 325 ALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRR----LRDCGVITGEPGVLDSGLYSR 380
LVGG PGTGKSTLA G+A+ V+ DV R+ L + PG +G+Y
Sbjct 344 VLVGGLPGTGKSTLASGLADGQDEWVLLRSDVVRKELAGLAPDIAVDVAPG---AGIYGV 400
Query 381 ANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVD 440
Y E + +AR L G SV+LD +W R A A T + + + RC A
Sbjct 401 EATEHSYAELIARARQALERGQSVVLDASWSSGLFRELAAETAKATGADLAQVRCVAPAP 460
Query 441 VMADRIVAR--------AGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYH 492
V RI +R SDAT I AA+A R W IDT G E +V A
Sbjct 461 VAVARIESRRSMRARGTGADASDATGVIHAAMADRADLWPAAFEIDTTGSVEETVAAARR 520
Query 493 I 493
+
Sbjct 521 V 521
>gi|319948330|ref|ZP_08022476.1| gluconate kinase [Dietzia cinnamea P4]
gi|319438012|gb|EFV92986.1| gluconate kinase [Dietzia cinnamea P4]
Length=509
Score = 277 bits (709), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 196/489 (41%), Positives = 256/489 (53%), Gaps = 28/489 (5%)
Query 25 ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS 84
ETH+A++ L GD A K +KPV F D T R RE ELNSRLA Y G+ +
Sbjct 19 ETHSALIFLWGDEAHKVRKPVDLGFLDNTTVGARGEQSRREVELNSRLAPDVYRGVLEVR 78
Query 85 DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDA-----------IAEVLARFHQR 133
P G + VV MRR ++ LAS+V L E L +A +AR H
Sbjct 79 GPDGEVVDHVVWMRRLPARRSLASLVR--LRAESGLGGGDTDIVVGVTEVARQIARLHAA 136
Query 134 AQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAG 193
R+ ID G AVA W +L LR + +++ IE + +++ GR L
Sbjct 137 GPRSEEIDAAGTPAAVAGLWARSLEHLRRLDVGRDAPEIVDDIESLATDYLRGRGPLLES 196
Query 194 RIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRK 253
R+ +G IVDGH DLLA D++L+D P ++DCLEF+D LRY D + D FLAMDL+ G +
Sbjct 197 RVADGRIVDGHGDLLAADVYLLDDGPRVIDCLEFDDLLRYGDAVLDIGFLAMDLDASGAR 256
Query 254 DLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQ-----GKPEAAADAVR 308
DL L Y SGD AP SL YI YRA+VR+KV +R Q G A A+
Sbjct 257 DLAVVLLGAYREASGDDAPPSLVHHYIGYRALVRSKVTAIRAEQAADGDGGRRDARRALE 316
Query 309 HLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITG 368
+A L A VRL LVGG G+GKSTLA +AE +GA+++ +D VR + G
Sbjct 317 LADLAVDSLLRARVRLVLVGGVSGSGKSTLAAPLAEALGAELLRSDVVRSNVVRAQAARG 376
Query 369 EPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHS 428
YS V AVY E L +A L G SV+LD TW +P+ RA A +AAD H+
Sbjct 377 R------DRYSEEAVGAVYAEMLDRAAASLALGRSVVLDATWLEPRRRAEAETVAADAHA 430
Query 429 AIVEFRCSATVDVMADRIV--ARAGGN-SDATAEIAAALAARQADWDTGHRIDTAGPRER 485
+VE C+A D + RI ARAG + S+AT E+ A AR A W +DT G R
Sbjct 431 ELVEISCTAPRDELVRRITDRARAGSDPSEATIEVLDAQLARPAAWPEAIEVDTVGLDVR 490
Query 486 SVGQAYHIW 494
G+A W
Sbjct 491 D-GEAVRRW 498
>gi|86740954|ref|YP_481354.1| hypothetical protein Francci3_2257 [Frankia sp. CcI3]
gi|86567816|gb|ABD11625.1| conserved hypothetical protein [Frankia sp. CcI3]
Length=584
Score = 270 bits (689), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 195/548 (36%), Positives = 269/548 (50%), Gaps = 66/548 (12%)
Query 1 MDSPTNDGTCDAHP---------VTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCD 51
M SPTN + + P +++ + ETH+A + LA DR +K KKPV F D
Sbjct 24 MLSPTNPPSPASTPAQPRLRALYLSELETFETYETHSATLHLAADRVYKRKKPVNLGFLD 83
Query 52 FRTAEQRERACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVT 111
F RE C E LN RLA YLG+A L D +G + +VVMRR +RL+++V
Sbjct 84 FTDRRTRESVCRSEVALNRRLAPDVYLGVADLLDDTGEVIDHLVVMRRMPASRRLSTLVR 143
Query 112 AGLPVEGALDAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGD 171
V AL +A LA FHQR + + I G+ + W E L + + ++
Sbjct 144 RRSRVGPALRTVARALAVFHQRCETSPEIAVAGQRATLEGLWREGLEGISPYRGTLLDAA 203
Query 172 VIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDEL 231
V+ I + +++GRE L R++ G I DGH DLLADDI+ + P +LDC+EF+ L
Sbjct 204 VVDEIGELALRYLAGRETLLGDRVRAGWIRDGHGDLLADDIYCLGDGPRILDCIEFDPRL 263
Query 232 RYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVE 291
R+ D + D AFLAMDLE LG + FL Y SG+ P SL+ Y+AYRA VRAKV
Sbjct 264 RFGDVLGDVAFLAMDLERLGAPEEAAEFLDAYREFSGEVHPRSLQHLYVAYRAFVRAKVT 323
Query 292 CVRFSQGKPEAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELV-GAQV 350
C+R QG P+AA +A R L +A +HL+ V+L +VGG PGTGK+TLA +A + G +
Sbjct 324 CIRGGQGDPDAAEEARRLLAVAHRHLRAGRVQLVVVGGLPGTGKTTLAGRLAGVGDGWVL 383
Query 351 ISTDDVRRRL------------------------------------RDCGV---ITGEPG 371
+ +D +R+ L RD G T +P
Sbjct 384 LRSDVIRQELTGMPLREGGPAADTTAGGYASALRNASGTATRTGARRDAGTGAAATSDPA 443
Query 372 VLD--------------SGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRA 417
D +G Y+ A Y E LR+A L G V+LD +W + R
Sbjct 444 TSDPADGDPATSDPRFGTGRYAPEITDATYAEMLRRAEAALARGERVVLDASWSSARHRR 503
Query 418 CARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSD---ATAEIAAALAARQADWDTG 474
A LAA + +VE C +V A RI RA +D AT I A+AAR W +
Sbjct 504 AAAELAASVCADLVELHCVTAPEVAAARIGRRAAAGTDPSEATMAIHRAMAARADPWPSA 563
Query 475 HRIDTAGP 482
+ TA P
Sbjct 564 TVVRTAVP 571
>gi|288922596|ref|ZP_06416775.1| conserved hypothetical protein [Frankia sp. EUN1f]
gi|288346070|gb|EFC80420.1| conserved hypothetical protein [Frankia sp. EUN1f]
Length=516
Score = 257 bits (657), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 171/411 (42%), Positives = 224/411 (55%), Gaps = 5/411 (1%)
Query 25 ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS 84
ETHT+V+V GDR +K KKP F DFRT E R AC E ELN RLA YLG+A +
Sbjct 50 ETHTSVLVFLGDRVYKTKKPADLGFLDFRTREARRDACHSEVELNRRLAPDVYLGVADVV 109
Query 85 DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQG 144
P G + +VVMRR +RL+ +V AG V G L +A +LA FH R + + ID
Sbjct 110 GPDGELCDHLVVMRRLPADRRLSGLVAAGRDVTGELRTVARLLADFHSRCETSAQIDDAA 169
Query 145 EVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGH 204
A+ R W E +A ++ + V+ I I + +++GRE L R + G I DGH
Sbjct 170 SPAALRRLWEEGMAGVQPYVGTVLDPATIDAIGRLATRYLAGREPLLRQRQRRGLIRDGH 229
Query 205 ADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYA 264
DLLADDI+ +D P +LDCLEF+ LR D + D AFLAMDLE LGR DL +FL Y
Sbjct 230 GDLLADDIYCLDDGPRILDCLEFDQRLRVGDILADIAFLAMDLERLGRPDLAAFFLERYR 289
Query 265 VRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRL 324
+ ++ P SL Y+AYRA VR KV C R +QG AAA+A +A L+ VRL
Sbjct 290 EYAAESHPRSLEHLYVAYRAFVRCKVACTRHAQGDRSAAAEARTLAALALASLRRGRVRL 349
Query 325 ALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANVV 384
LVGG P +G+S LA +AE G ++ + G P D A+
Sbjct 350 VLVGGPPESGRSQLATALAEAEGWTLLRAETTATADTATGAADSGP---DGATGDDAD-- 404
Query 385 AVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRC 435
A Y E LRKAR+ G +V+LD W + R A LA T + +V+ RC
Sbjct 405 AGYDELLRKARIAAERGETVVLDAPWALRRDRHRAAALAQATAADLVQLRC 455
>gi|288919770|ref|ZP_06414096.1| conserved hypothetical protein [Frankia sp. EUN1f]
gi|288348870|gb|EFC83121.1| conserved hypothetical protein [Frankia sp. EUN1f]
Length=543
Score = 256 bits (653), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 181/494 (37%), Positives = 247/494 (50%), Gaps = 25/494 (5%)
Query 25 ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS 84
ETHT+++ L DR +K +KPV F FR+ + R+ AC E LN RLA YLG+A +
Sbjct 48 ETHTSILFLTDDRVYKVRKPVDLGFVSFRSRQARQAACENEVRLNRRLAPDVYLGVADIR 107
Query 85 DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQG 144
D +G + +VVMRR + LA +V G V G + AIA LA FH+R + + I G
Sbjct 108 DEAGQMRDHMVVMRRLPAGRCLAELVRGGADVTGEVRAIARQLAAFHERCETSPEISRAG 167
Query 145 EVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGH 204
V + W E + + ++ V+ I + ++ GR L R + G + DGH
Sbjct 168 GVAELEALWLEAMDGVAPFRGSILDAPVVDEIGRLALRYLIGRVPLLVERQRAGRVRDGH 227
Query 205 ADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYA 264
DLLA+DI+ +D P +LDC+ F+ LR D + D AFLAMDLE LG + FL Y
Sbjct 228 GDLLAEDIYCLDDGPRVLDCINFDHRLRVGDVLADVAFLAMDLERLGAPEAARTFLDAYR 287
Query 265 VRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRL 324
SG+T PASL YIAYRA VR ++ C+R QG+P AA +A R IA +HL+ A VRL
Sbjct 288 EFSGETHPASLEHLYIAYRAFVRTRIACIRHHQGEPGAADEARRLAAIALRHLRQARVRL 347
Query 325 ALVGGNPGTGKSTLARGVAELVGAQVI------------STDDVRRRLRDCGVITGEPGV 372
LV G PGTG STLAR +AE G V+ T R D G
Sbjct 348 VLVSGLPGTGTSTLARNLAEGEGEWVLLAREDPAGAPGRGTAAQRADGPDRGAAADWGSA 407
Query 373 LDSGLYSRANVVAVY-----QEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTH 427
D G + A + E + +AR L G SV+LD W + + A +T
Sbjct 408 ADWGSAADWGSAADWGGPGLAELVEQARRALVRGQSVVLDAPWPSRESQDLVAEAADETG 467
Query 428 SAIVEFRCSATVDVMADRIVAR------AGGNSDATAEIAAAL-AARQAD-WDTGHRIDT 479
+ +V RC A V R+ +R G + A++AA L AA D W H +DT
Sbjct 468 ADLVRLRCVAPPRVAVARVASRQAVQASGSGAEMSHADVAAYLEAATHFDLWPAAHNLDT 527
Query 480 AGPRERSVGQAYHI 493
+ +V A I
Sbjct 528 TATIQETVEAARRI 541
>gi|158318668|ref|YP_001511176.1| hypothetical protein Franean1_6936 [Frankia sp. EAN1pec]
gi|158114073|gb|ABW16270.1| conserved hypothetical protein [Frankia sp. EAN1pec]
Length=494
Score = 249 bits (635), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 162/412 (40%), Positives = 221/412 (54%), Gaps = 15/412 (3%)
Query 25 ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS 84
ETHT+V+V GDR +K KKP F DFRT E R+ AC E +LN RLA YLG+A +
Sbjct 39 ETHTSVLVFLGDRVYKTKKPADLGFLDFRTREARQAACHAEVDLNRRLAPDVYLGVADVI 98
Query 85 DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQG 144
P G + +VVMRR +RL+++VTAG V G L A+A +LA FH R + I G
Sbjct 99 GPDGNACDHMVVMRRLPAARRLSALVTAGGDVTGELRAVARLLADFHTRCDTSARITEAG 158
Query 145 EVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGH 204
+ W E + ++ + V+ + I + ++ GR+ L R G I DGH
Sbjct 159 SPATLRGLWEEGIRGVQPYLGSVLDASTVDAIGRLAGRYLDGRQPLLRERQHRGLIRDGH 218
Query 205 ADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYA 264
DLLADDI+ +D P +LDCLEF+ LR D + D AFLAMDL+ LGR DL +FL Y
Sbjct 219 GDLLADDIYCLDDGPRVLDCLEFDQRLRVGDVLADVAFLAMDLKRLGRPDLASFFLDRYR 278
Query 265 VRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRL 324
S ++ P SL + Y+AYRA VR KV C R +QG A A+A +A +L+H VRL
Sbjct 279 EYSAESHPRSLENLYVAYRAFVRCKVACTRHAQGDESAGAEARALASLALANLRHGRVRL 338
Query 325 ALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANVV 384
LVGG +G+S LA +A+ G ++ + G G D G
Sbjct 339 VLVGGTRDSGRSELAADLADAEGWTLLRAEPT-----GSGPDGASSGTTDLG-------- 385
Query 385 AVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCS 436
Y E LR+A + G +V+LD W + R A LA T + +V+ RC+
Sbjct 386 --YDELLRRAGIAAERGETVVLDAPWTLRRDRDRAAALADATAADLVQLRCA 435
>gi|288919156|ref|ZP_06413494.1| conserved hypothetical protein [Frankia sp. EUN1f]
gi|288349403|gb|EFC83642.1| conserved hypothetical protein [Frankia sp. EUN1f]
Length=542
Score = 241 bits (615), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 144/336 (43%), Positives = 192/336 (58%), Gaps = 2/336 (0%)
Query 2 DSPTNDGTCDA--HPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRE 59
D+PT GT A P + ETHT+V++ GDR +K KKP F DFRT + R
Sbjct 24 DAPTAPGTPTAPGMPGAVATPAALVETHTSVLIFLGDRVYKVKKPADLGFLDFRTRQARL 83
Query 60 RACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGA 119
AC E +LN RLA YLG+A + P G + +VVMRR +RL+++V AG+ V
Sbjct 84 AACQAEVDLNRRLAPDVYLGVADVQGPDGALCDHMVVMRRLPADRRLSTLVAAGVDVADD 143
Query 120 LDAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHM 179
L A A +LA FH R + + I G + W E+L + + V+ G I I +
Sbjct 144 LRATARLLAAFHTRCETSAEIADAGSSATLGGLWEESLRGVEPYLGTVLDGATIDAIGRL 203
Query 180 VDEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDD 239
F++GRE L R + G + DGH DLLA DI+ ++ P +LDCLEF+ LR D + D
Sbjct 204 AARFLAGREPLLRERQRLGLVRDGHGDLLAGDIYCLEDGPRILDCLEFDQRLRVGDVLGD 263
Query 240 AAFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGK 299
FLAMDLE LGR DL + LA Y + ++ P SL D YIAYRA+VR KV C R++QG
Sbjct 264 IGFLAMDLESLGRPDLAAFLLAHYRQYAAESHPRSLADLYIAYRALVRCKVACTRYAQGV 323
Query 300 PEAAADAVRHLIIATQHLQHATVRLALVGGNPGTGK 335
AAA+A +A HL+ VRL LVGG PG+G+
Sbjct 324 EPAAAEARALAALALSHLRQGRVRLVLVGGAPGSGR 359
>gi|288923414|ref|ZP_06417540.1| conserved hypothetical protein [Frankia sp. EUN1f]
gi|288345237|gb|EFC79640.1| conserved hypothetical protein [Frankia sp. EUN1f]
Length=517
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 160/434 (37%), Positives = 218/434 (51%), Gaps = 21/434 (4%)
Query 23 VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH 82
+ ET T+V+V GDR +K KK F DFRT E R AC E LN RLA YLG+A
Sbjct 28 LSETLTSVLVFLGDRVYKIKKTADLGFLDFRTREARLAACQAEVNLNRRLAPDVYLGVAD 87
Query 83 LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT 142
+ P G + +VVMRR ++RL++++ AG V G L A+A +LA FH RA + I
Sbjct 88 ILGPDGTALDHMVVMRRLPAERRLSALLAAGSDVTGPLRAVARLLADFHARAATSPEITE 147
Query 143 QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD 202
G + W E L + V+ I I H+ + ++ GR L R ++G I D
Sbjct 148 AGSTANLRWLWDEVLESIEPFLGPVLDTTTIDAIRHLANRYLDGRAPLLRERQRDGRIRD 207
Query 203 GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG 262
GH DLLADDI+ +D P +LDCLEF+ LR D + D AFLAMDLE LGR DL + L
Sbjct 208 GHGDLLADDIYCLDDGPRILDCLEFDRRLRVGDVLSDIAFLAMDLERLGRPDLSRFLLDQ 267
Query 263 YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV 322
Y + T P SL YIAYRA ++ C +++QG AAA+A +A L+ +
Sbjct 268 YRAYTAVTHPLSLESLYIAYRAFTMCRIACTQYAQGATAAAAEARALASLALASLRRGRI 327
Query 323 RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCG------------------ 364
RL LVGG TG+S +A G+AE G ++ V R L
Sbjct 328 RLILVGGAADTGRSAVAAGLAESEGWTLLRAASVERELAHLAPAGWTDTPTTAATTTATT 387
Query 365 ---VITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARR 421
T + + A AV E LR+AR + G +V++D W R A
Sbjct 388 TTATATAGTATATTTARTAATATAVRDELLRRARTAVERGETVVIDAPWARRHDREQAAA 447
Query 422 LAADTHSAIVEFRC 435
LA T + +++ RC
Sbjct 448 LARATFTDLIQLRC 461
>gi|158315848|ref|YP_001508356.1| hypothetical protein Franean1_4063 [Frankia sp. EAN1pec]
gi|158111253|gb|ABW13450.1| conserved hypothetical protein [Frankia sp. EAN1pec]
Length=575
Score = 230 bits (587), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 172/447 (39%), Positives = 233/447 (53%), Gaps = 22/447 (4%)
Query 21 IDVRET---HTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSY 77
+D RET TAVV L DRA+K ++ V F D+R+ R AC E LN RLA Y
Sbjct 38 LDSRETVQTPTAVVFLTEDRAYKLRRAVNHGFVDYRSRRARLIACEDEVRLNRRLAPDVY 97
Query 78 LGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRN 137
LG+A + D +G + +VVMRR +RL++++TA V G L +A+ +A FH+ +
Sbjct 98 LGVADIRDETGALRDHMVVMRRLPADRRLSALMTAD--VSGELRELAQRIAAFHEGCETT 155
Query 138 RCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKE 197
I G + A+ W E + L +++ + I + +++GR L A R
Sbjct 156 PEITRTGGLCALEALWLEAMDGLAPFRGRILDAATVDEIGRLALRYLTGRGPLLAERQAA 215
Query 198 GCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGD 257
G I DGH DLLADDI+ ++ P +L+C+ + LR D + DAA LAMDLE LG
Sbjct 216 GRIRDGHGDLLADDIYCLNDGPRVLNCVNVDPALRAGDVLGDAASLAMDLERLGNATAAR 275
Query 258 YFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHL 317
FL Y SG+T P SL D YIAYRAVVRAK CVR QG P AA +A R +A +HL
Sbjct 276 TFLDAYREFSGETHPTSLEDLYIAYRAVVRAKTACVRDHQGDPAAADEARRLTDLALRHL 335
Query 318 QHATVRLALVGGNPGTGKSTLARGVAELVGAQ----VISTDDVRRRLRDCGVITGEPGVL 373
+ RL LVGG PGTGKSTLA + LV + ++S+ VR G E
Sbjct 336 RRGRPRLILVGGLPGTGKSTLA---SHLVSGEDDWVLLSSAAVRGEPVGAGATAPESAST 392
Query 374 D----------SGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLA 423
+G Y Y E L +AR L G SV++D +W +MRA A LA
Sbjct 393 SASDSAGTEPAAGCYGADATEHSYVEVLTRARHALERGRSVVIDASWSSRRMRARAAELA 452
Query 424 ADTHSAIVEFRCSATVDVMADRIVARA 450
A+ + +++ RC V RI RA
Sbjct 453 AECDADLMQLRCVVPPRVAVARIADRA 479
>gi|342857399|ref|ZP_08714055.1| hypothetical protein MCOL_00935 [Mycobacterium colombiense CECT
3035]
gi|342134732|gb|EGT87898.1| hypothetical protein MCOL_00935 [Mycobacterium colombiense CECT
3035]
Length=533
Score = 229 bits (584), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 176/507 (35%), Positives = 237/507 (47%), Gaps = 28/507 (5%)
Query 17 DEPFIDVR--ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAA 74
D P DVR ETH++ V LAGD A+K KKPV F DF + E+R C E LN R +
Sbjct 27 DPPAADVRLHETHSSWVFLAGDYAYKLKKPVNLGFLDFTSIERRRADCEEELRLNRRFSP 86
Query 75 QSYLGIAHLSDPSG--------GHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEV 126
Q YLG+ +++ G G EP V MRR ++ L + + G I
Sbjct 87 QMYLGVVEVTEGDGHYRIGGETGSGEPAVWMRRLPEEGMLPAKLARGDVDMRLARRIGRT 146
Query 127 LARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSG 186
LA+ H RA+ ID G +V W EN ++ + VS DV I VD+FV
Sbjct 147 LAKLHSRAETGADIDVYGRPSSVIANWRENFDQIGPFVGRTVSSDVNEDIRAYVDQFVHE 206
Query 187 REVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMD 246
R L R+ +G + DGH DL A I + DG+ L D L+F R D + AFLAMD
Sbjct 207 RASLLERRVADGHVRDGHGDLHAASICIDDGQILLFDSLQFAPRYRCADVASEVAFLAMD 266
Query 247 LEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADA 306
E+ GR DL F+ Y SGD A L DFYI YRA VR KV +R +Q + A+
Sbjct 267 FEYHGRADLAWGFVESYVRASGDDGLAGLLDFYICYRAYVRGKVRSLRLAQAE-HASGGQ 325
Query 307 VRHLIIATQHL-----QHA----TVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVR 357
R LI ++ HA + + G P +GK+TLAR +A +G +S+D R
Sbjct 326 TRQLIAESRAYFDLAWAHAGGLPRAPMVVTMGLPASGKTTLARALAGRLGLVHLSSDVAR 385
Query 358 RRLRDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDP---- 413
+R+ SGLY A + Y R A L G V++D T+G+P
Sbjct 386 KRMAGIEPTRHGNDEFGSGLYDPAMTRSTYAALRRDAARWLRRGRGVVVDATFGNPRERS 445
Query 414 QMRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDT 473
Q++ A RL AD H + E AT+ +R G SDA E+ L A D
Sbjct 446 QVQQLAHRLGADLHVVLCEAD-DATLMARLERRATEQGVVSDARIELWPELRAAFTPPDE 504
Query 474 GH---RIDTAGPRERSVGQAYHIWRSA 497
R+D E +V Q + R+A
Sbjct 505 QPSLLRVDATRDMEETVEQTLTLLRAA 531
>gi|254774818|ref|ZP_05216334.1| hypothetical protein MaviaA2_09125 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=535
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 173/508 (35%), Positives = 241/508 (48%), Gaps = 32/508 (6%)
Query 17 DEPFIDVR--ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAA 74
D P D+R ETH++ V+LAG A+K KKPV F DF + EQR C E LN R +
Sbjct 27 DPPADDLRLHETHSSWVILAGPYAYKLKKPVNLGFLDFTSIEQRRADCDEELRLNRRFSP 86
Query 75 QSYLGIAHLSDPSG--------GHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEV 126
Q YLG+ +++ +G G EP V MRR + L + + G I
Sbjct 87 QVYLGVVDITEQNGHYRVGGEAGSGEPAVWMRRLPEDGMLPAKLAGGDVDTRLARRIGRT 146
Query 127 LARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSG 186
LA+ H RA+ I+ G +V W EN ++ + +S ++ I V EFV
Sbjct 147 LAKLHGRAETGPDIEAYGSPSSVIANWQENFDQMGPFIGRTISPEINNEIRSYVQEFVLR 206
Query 187 REVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMD 246
+ L R+ EG + DGH DL A + + DG+ L D L+F R D + +FLAMD
Sbjct 207 QAALLERRVTEGHVRDGHGDLHAASVCIADGQIVLFDSLQFAPRYRCADLASEVSFLAMD 266
Query 247 LEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQ------GKP 300
E+ GR DL F+ Y SGD SL DFY+ YRA VR KV +R +Q G+
Sbjct 267 FEYHGRGDLAWAFVDSYVRASGDDELPSLLDFYMCYRAYVRGKVRSLRLAQTEKVPGGEQ 326
Query 301 EA-AADAVRHLIIATQHLQHATVRLALVG-GNPGTGKSTLARGVAELVGAQVISTDDVRR 358
EA A++ + +A H L +V G P +GK+TLAR +A +G +S+D R+
Sbjct 327 EALIAESRGYFDLAWAHAGGLPRPLMVVTMGLPASGKTTLARALAGRLGLVHLSSDVARK 386
Query 359 RLRDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDP----Q 414
R+ SGLY+ A Y R A L GH V++D T+G+P Q
Sbjct 387 RMAGIPPTRRGSDEFGSGLYNPAMTRNTYAALRRDAARWLRRGHGVVVDATFGNPGERAQ 446
Query 415 MRACARRLAADTHSAIVEFRCSATVDVMADRIVARA---GGNSDATAEIAAALAARQADW 471
+R A RL D H + C A D + R+ RA G SDA E+ L A
Sbjct 447 LRQLAHRLGVDLHLVL----CDADDDTLIARLKRRATEQGVVSDARIELWPQLRAAFTPP 502
Query 472 D---TGHRIDTAGPRERSVGQAYHIWRS 496
D + R+D E +V QA + R+
Sbjct 503 DEQASVLRVDATRDTEETVEQALGLLRA 530
>gi|218781915|ref|YP_002433233.1| gluconate kinase [Desulfatibacillum alkenivorans AK-01]
gi|218763299|gb|ACL05765.1| gluconate kinase [Desulfatibacillum alkenivorans AK-01]
Length=534
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 151/470 (33%), Positives = 221/470 (48%), Gaps = 10/470 (2%)
Query 21 IDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGI 80
++V TH ++V LAGD AFK KKP+ F DF T E+R++AC E LN RLA + YL +
Sbjct 39 VEVHRTHISLVFLAGDFAFKVKKPLDLGFLDFSTLEKRKKACEDELILNRRLAPEIYLAV 98
Query 81 AHLS---------DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFH 131
+ PSG E V M+R ++ G E A++ + ++A FH
Sbjct 99 VPIFMDGQGALTLSPSGRPVEYAVKMKRLNQCGMFDVLLEQGKLDEKAMEELGGIMANFH 158
Query 132 QRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLF 191
RA ++ A+ W E+LA++R H +V+ + + +E FV L
Sbjct 159 ARADARPSVNAYAFPEAILNMWAEDLAQVREHIPRVIPPEPMDLVEAFSKSFVQNNAALL 218
Query 192 AGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLG 251
RI+E I D H DL +I L G+ + DC+EF ++ R +D + AFLAMDLE G
Sbjct 219 LERIRENRIRDCHGDLHLQNICLNKGKVVVFDCIEFNEKFRCMDVASEIAFLAMDLECRG 278
Query 252 RKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLI 311
L F A Y + D L FY YRA+VRAK+ C+R + G+P ++
Sbjct 279 ATALARAFTASYIEHAQDPNLKKLLHFYKCYRALVRAKIMCIR-ANGEPLGDMANQYAML 337
Query 312 IATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPG 371
A L + G G+GKS +A+ +A L GA V ++D +R+ + P
Sbjct 338 AARYAAPFPRPTLICMAGITGSGKSGVAQEMANLTGAAVFASDVIRKTMFGFEPTEKIPE 397
Query 372 VLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIV 431
+Y + VYQ L +AR LG G SVILD T+ Q R A LA + +
Sbjct 398 PAVKEVYGQGASQKVYQSMLDRARENLGEGKSVILDATFTLSQGRKAAYDLARECGANFF 457
Query 432 EFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAG 481
CS D+ +RI RA + A A++ +W I +G
Sbjct 458 LVVCSLPEDIAKERISGRAKDAQSVSDGTLAVYKAQKKEWQPIEGIPESG 507
>gi|269836440|ref|YP_003318668.1| Uma3 [Sphaerobacter thermophilus DSM 20745]
gi|269785703|gb|ACZ37846.1| Uma3 [Sphaerobacter thermophilus DSM 20745]
Length=538
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 162/477 (34%), Positives = 232/477 (49%), Gaps = 31/477 (6%)
Query 13 HPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRL 72
HPVT+ I + ETH ++V LA D +K KKPV F DF T E+R C E LN RL
Sbjct 27 HPVTE---IAIEETHASIVFLADDLVYKIKKPVDFGFLDFSTLERRRHFCHEEIRLNRRL 83
Query 73 AAQSYLGIAHLSDPSG--------GHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIA 124
+ YL + + + G E V M R D Q L ++T G E A+A
Sbjct 84 SQGVYLDVVPVVEVGGRLQLFGDGPEVEYAVKMNRLPDNQMLNYLITTGTVDERVFPALA 143
Query 125 EVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFV 184
+ LA F++ A +D G A EN+ + + + +++ R I+ M F
Sbjct 144 DRLAAFYREAATGPGVDEWGTAEAAHFSIRENVEQTQPYVGTIIAPVQHRLIDEMSARFF 203
Query 185 SGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEP---ALLDCLEFEDELRYLDRIDDAA 241
+ LF RI G I +GH DL I + P ++DC+EF LR D D A
Sbjct 204 AEHAELFQQRIAAGRIREGHGDLHLAHICVQGLRPEELQIIDCVEFNPRLRCGDIAVDIA 263
Query 242 FLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPE 301
FLAMDL++ GR DL + A R GD L F+ YRA VR KV C R + PE
Sbjct 264 FLAMDLDYHGRPDLSRSLVNMLAERLGDDDLPLLVHFFSVYRAHVRNKVACFRLDEIAPE 323
Query 302 ------AAADAVRHLIIATQHL-QHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTD 354
++A R++ +AT +L + L LVGG GTGKS +A +A +GA + S+D
Sbjct 324 LPEYVAVKSEAERYIDLATSYLVEPERPTLFLVGGLSGTGKSVIAYRLARALGASLSSSD 383
Query 355 DVRRRLRDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQ 414
VR+ + V + EP +G+Y + YQE L +AR+ L +G S +LD T+ DP
Sbjct 384 VVRKEIAGRPVESHEPVPYGTGIYEPSLTARTYQELLDRARVALTAGRSAVLDATFLDPS 443
Query 415 MRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADW 471
R AR +AA+ ++ C A V+ +R+ R+G +D + +ADW
Sbjct 444 WREAARDMAAELGVDLLLIECQAPPAVVEERLARRSGLMADPS----------EADW 490
>gi|158313805|ref|YP_001506313.1| hypothetical protein Franean1_1970 [Frankia sp. EAN1pec]
gi|158109210|gb|ABW11407.1| conserved hypothetical protein [Frankia sp. EAN1pec]
Length=529
Score = 217 bits (553), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 179/502 (36%), Positives = 254/502 (51%), Gaps = 36/502 (7%)
Query 23 VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH 82
V ET +AV+ L GDR +K KKPV D R+ R AC E ELN LA+ YLG+A
Sbjct 25 VVETGSAVLCLHGDRVYKRKKPVEPGLLDLRSRAARLAACRAEVELNRWLASDVYLGVAD 84
Query 83 LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT 142
+ G + VV+RR +RL+++V V+ L A+A +A FH+R + I
Sbjct 85 VLGDGGEVCDHAVVLRRMPTGRRLSALVRHADRVDDQLRAVARTVAAFHERCGTSEVIGR 144
Query 143 QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD 202
G+ AVA +W E L L KV+ DV+ I + +++GR L A R + G I D
Sbjct 145 SGDAEAVAGQWKETLGGLEPFQGKVIDADVVDEIGRLALRYLAGRGPLLAERRRAGRIRD 204
Query 203 GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG 262
GH DL A++I +D P +L+ +E + LR D + D A L MDLE LG + + +
Sbjct 205 GHGDLRAENIHCLDDGPRILNRVESDPRLRAGDVLGDVAVLVMDLERLGSPEDAERLMRW 264
Query 263 YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADA-----------VRHLI 311
Y S P SL FYIAYRA A+V C+R+ + EA A+A R L
Sbjct 265 YRDFSAQAHPPSLEHFYIAYRAFTEARVTCLRYRRILAEAGAEAGPGPGAEAGERARRLA 324
Query 312 -IATQHLQHATVRLALVGGNPGTGKSTLARGVAEL-VGAQVISTDDVRRRL-----RDCG 364
IA +HL+ A VRL LVGG PGTGKSTLAR +A+ G ++ +D VR L D
Sbjct 325 DIAYRHLRRARVRLVLVGGLPGTGKSTLARRLADADDGRLLLRSDAVRAELAADGHADPD 384
Query 365 VITGEPGVLD----------SGLYSRANVVA--VYQEALRKARLLLGSGHSVILDGTWGD 412
P + D S ++ ++ + Y L +AR L G +VI+D +W D
Sbjct 385 TPGSGPAIPDRPAVPADLGASFIWPLSSEITARTYTVLLSRARRALERGETVIIDASWSD 444
Query 413 PQMRACARRLAADTHSAIVEFRCSATVDVMADRIVAR--AGGNSDATAEIAAALAARQAD 470
+ RA A RLA +T + +E RC + +V A R+ R A + AT+ + A+++
Sbjct 445 GRHRAAAARLARETAAEFLELRCVTSPEVAATRLTRRDSASDPAGATSAVHRAMSSWAEP 504
Query 471 WDTGHRIDTAGPRERSVGQAYH 492
W T I T P V + +H
Sbjct 505 WPTARVIQTTVP----VAEVFH 522
>gi|254823381|ref|ZP_05228382.1| hypothetical protein MintA_25859 [Mycobacterium intracellulare
ATCC 13950]
Length=533
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 174/509 (35%), Positives = 239/509 (47%), Gaps = 32/509 (6%)
Query 17 DEPFIDVR--ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAA 74
D P DVR ETH++ V+LAG A+K KKPV F DF + E+R C E LN R +
Sbjct 27 DPPAHDVRLHETHSSWVLLAGPYAYKLKKPVDLGFLDFTSIERRRADCDEELRLNRRFSP 86
Query 75 QSYLGIAHLSDPSG--------GHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEV 126
Q YLG+ +++ G G EP V MRR ++ L + + G I
Sbjct 87 QMYLGVVEVTEQDGRYRIGGKSGSGEPAVWMRRLPEEGMLPAKLARGEVDLRLARRIGRT 146
Query 127 LARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSG 186
LA+ H R + ID G +VA W EN ++ + +S DV I VDEF+
Sbjct 147 LAKLHGRTETGPDIDAYGSPSSVAANWQENFDQISPFVGRTISSDVNDHIRRYVDEFLRT 206
Query 187 REVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMD 246
R + R+ +G + DGH DL A I + DG+ L D L+F R D + AFLAMD
Sbjct 207 RAPVLERRVADGHVRDGHGDLHAASICIDDGQILLFDSLQFAPRYRCADLASEVAFLAMD 266
Query 247 LEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADA 306
LE+ GR DL F+ Y SGD L DFY YRA VR KV +R +Q + + D
Sbjct 267 LEYHGRADLAWGFVDSYVRASGDDGLLDLLDFYACYRAYVRGKVRSLRLAQTEQASGGDN 326
Query 307 VRHLIIATQHL-----QHA----TVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVR 357
R LI ++ HA + + G P +GK+TLAR +A +G +S+D R
Sbjct 327 -RELIAESRAYFDLAWAHAGGLPRPPMVVTMGLPASGKTTLARALAGRLGLVHLSSDMAR 385
Query 358 RRLRDCGVITGEPGV--LDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDP-- 413
+R+ G+ + G SGLY A + Y R A L G V +D T+G+P
Sbjct 386 KRM--AGIEPTQRGSDEFGSGLYDPAMTRSTYAALRRDAARWLRRGRGVAVDATFGNPRE 443
Query 414 --QMRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADW 471
QMR A RL AD + + AT+ +R G SDA E+ L A
Sbjct 444 RAQMRQLAHRLGADLRVVLCDAD-DATLIARLERRATEKGVVSDARIELWPELRAAFTPP 502
Query 472 DTGH---RIDTAGPRERSVGQAYHIWRSA 497
D R+D E +V QA + R++
Sbjct 503 DEQPSVLRVDATRDSEETVEQALALLRAS 531
>gi|186681115|ref|YP_001864311.1| hypothetical protein Npun_F0614 [Nostoc punctiforme PCC 73102]
gi|186463567|gb|ACC79368.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length=509
Score = 210 bits (535), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 152/465 (33%), Positives = 228/465 (50%), Gaps = 20/465 (4%)
Query 13 HPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRL 72
H VT EP I++ +TH + V+L GD A+K KKPV F DF T E+R+ C E LN R
Sbjct 21 HAVT-EP-IELIQTHVSYVLLTGDYAYKLKKPVNFGFLDFSTLEKRQHFCHEELRLNQRG 78
Query 73 AAQSYLGIAHLS-----DPSGGHAEPV---VVMRRYRDKQRLASMVTAGLPVEGALDAIA 124
A + YL + ++ GG E V + MR++ + L+++ G E LD +
Sbjct 79 AGELYLEVLPITLVGEQYQLGGTVEAVEYVLKMRQFPQESLLSTLFEQGKLNEARLDELG 138
Query 125 EVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFV 184
V+A++H AQ N I + GEV V + EN + ++ + + + D+F
Sbjct 139 RVVAQYHAEAQTNDYIRSFGEVPKVRAAFDENYQQTENYIGGPQTQEQFTETKQYTDKFF 198
Query 185 SGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLA 244
+ R LFA RI I + H DL +I L + + L DC+EF + R++D + D AF
Sbjct 199 AERPELFASRIHNNYIRECHGDLHLRNIALWNDKILLFDCIEFNEPFRFVDVMYDVAFTV 258
Query 245 MDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQG------ 298
MDLE RKDLG+ FL Y ++GD + Y++ +A VRAKV
Sbjct 259 MDLEARQRKDLGNAFLNAYIEQTGDWEGLQVLPLYLSRQAYVRAKVTSFLLDDPGVPAAV 318
Query 299 KPEAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRR 358
K EA A + A ++ + L L+ G G+GKST AR +A GA + +D VR+
Sbjct 319 KEEATKTASEYYKQAWEYTKPKVGELILMSGLSGSGKSTTARHLARQQGAIHLRSDAVRK 378
Query 359 RLRDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRAC 418
L G+ E G D LY+ Y L +L G SVILD + +R
Sbjct 379 HL--GGIPLWEKGGDD--LYTPEMTEKTYTRLLELGIILAKQGFSVILDAKYDKQHLRQE 434
Query 419 ARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAA 463
A A + +C+A ++V+ +R+ R G +DATA++ A+
Sbjct 435 AIAQATKHEIPLQIIQCTAPLEVLKERLNNRTGDIADATADLLAS 479
>gi|86741938|ref|YP_482338.1| hypothetical protein Francci3_3252 [Frankia sp. CcI3]
gi|86568800|gb|ABD12609.1| conserved hypothetical protein [Frankia sp. CcI3]
Length=534
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 173/476 (37%), Positives = 234/476 (50%), Gaps = 11/476 (2%)
Query 15 VTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAA 74
V P V ET +V+V GDR FK KKPV DFR + R AC E LN RLA
Sbjct 45 VASGPPARVVETARSVLVFLGDRVFKVKKPVDLGAVDFRGRQARLAACEAEVRLNRRLAP 104
Query 75 QSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRA 134
YLG+A + P G + +VVMRR + +RL+++ G V + A+ VL FH R
Sbjct 105 DVYLGVADVIGPDGEPCDHMVVMRRLPEARRLSTLAEGGTEVRAEIHALTRVLVDFHARC 164
Query 135 QRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGR 194
+ + I G + + RW A ++ VS ++ + + ++ GR+ L R
Sbjct 165 ETSSRIAEAGGLDRLRGRWDACFARVQRDHGAAVSASILDHVNRLAVRYLDGRDELLRER 224
Query 195 IKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKD 254
+ G I DGH DL A DIF +D P +LDCLEFE LR D + DA LA DLE+LGR+D
Sbjct 225 REAGRIRDGHGDLSAADIFCLDDGPRVLDCLEFEPGLRAADVLADACALAADLEWLGRRD 284
Query 255 LGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIAT 314
L FL Y +G+T P SL DFY A A+ R + C R + G+ AAA+A +A
Sbjct 285 LARLFLDHYREMAGETHPRSLEDFYWALAALGRCQAACQRVAAGE-NAAAEARAFADLAL 343
Query 315 QHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVI---STDDVRRRLRDCGVITGEPG 371
L+ VRL LVGG GTGKSTLA G+A V+ + +
Sbjct 344 ARLRWGRVRLVLVGGQRGTGKSTLAGGLAGTERWTVLRFDDAAADLAASANRHDLAAGGW 403
Query 372 VLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIV 431
G +V AV+QE LR+A L G SV++D W RA A +A + +V
Sbjct 404 ADAGGWVPADDVDAVHQELLRQAGTALRRGESVVVDAPWNRHSQRAQAADVARRAFADLV 463
Query 432 EFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQAD-------WDTGHRIDTA 480
+ RC+A D+ A R R+ + AT+ + R AD W IDTA
Sbjct 464 QLRCTAPPDLAATRTDRRSPATTAATSATGSVGLGRLADTVSRIEPWPEAKIIDTA 519
Lambda K H
0.322 0.136 0.405
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 1071489888984
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40