BLASTP 2.2.25+


Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.



Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters



Query= Rv2004c

Length=498
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|15841486|ref|NP_336523.1|  hypothetical protein MT2060 [Mycoba...  1004    0.0   
gi|15609141|ref|NP_216520.1|  hypothetical protein Rv2004c [Mycob...  1002    0.0   
gi|340627014|ref|YP_004745466.1|  hypothetical protein MCAN_20241...   993    0.0   
gi|240171510|ref|ZP_04750169.1|  hypothetical protein MkanA1_1950...   644    0.0   
gi|296169102|ref|ZP_06850761.1|  conserved hypothetical protein [...   605    7e-171
gi|183983459|ref|YP_001851750.1|  hypothetical protein MMAR_3476 ...   601    1e-169
gi|169630981|ref|YP_001704630.1|  hypothetical protein MAB_3902c ...   507    2e-141
gi|333990933|ref|YP_004523547.1|  hypothetical protein JDM601_229...   472    7e-131
gi|118473212|ref|YP_888231.1|  hypothetical protein MSMEG_3942 [M...   470    3e-130
gi|120402395|ref|YP_952224.1|  hypothetical protein Mvan_1384 [My...   468    2e-129
gi|145220867|ref|YP_001131545.1|  hypothetical protein Mflv_0263 ...   467    2e-129
gi|108798047|ref|YP_638244.1|  hypothetical protein Mmcs_1074 [My...   448    1e-123
gi|296394769|ref|YP_003659653.1|  hypothetical protein Srot_2377 ...   428    1e-117
gi|317507619|ref|ZP_07965332.1|  hypothetical protein HMPREF9336_...   416    7e-114
gi|325676765|ref|ZP_08156438.1|  hypothetical protein HMPREF0724_...   393    4e-107
gi|312139781|ref|YP_004007117.1|  hypothetical protein REQ_23910 ...   388    1e-105
gi|226306904|ref|YP_002766864.1|  hypothetical protein RER_34170 ...   380    4e-103
gi|111017077|ref|YP_700049.1|  hypothetical protein RHA1_ro00055 ...   366    6e-99 
gi|271966203|ref|YP_003340399.1|  gluconate kinase [Streptosporan...   364    2e-98 
gi|329938537|ref|ZP_08287962.1|  hypothetical protein SGM_3454 [S...   360    3e-97 
gi|254381619|ref|ZP_04996983.1|  conserved hypothetical protein [...   350    3e-94 
gi|290955585|ref|YP_003486767.1|  hypothetical protein SCAB_10221...   350    3e-94 
gi|331698605|ref|YP_004334844.1|  gluconate kinase [Pseudonocardi...   341    2e-91 
gi|134100921|ref|YP_001106582.1|  hypothetical protein SACE_4388 ...   340    3e-91 
gi|21218720|ref|NP_624499.1|  hypothetical protein SCO0163 [Strep...   338    9e-91 
gi|289774177|ref|ZP_06533555.1|  conserved hypothetical protein [...   337    3e-90 
gi|134099015|ref|YP_001104676.1|  hypothetical protein SACE_2453 ...   328    1e-87 
gi|291006888|ref|ZP_06564861.1|  hypothetical protein SeryN2_2040...   328    1e-87 
gi|269127024|ref|YP_003300394.1|  hypothetical protein Tcur_2811 ...   323    4e-86 
gi|302557027|ref|ZP_07309369.1|  conserved hypothetical protein [...   310    3e-82 
gi|336179936|ref|YP_004585311.1|  hypothetical protein FsymDg_411...   300    3e-79 
gi|111223194|ref|YP_713988.1|  hypothetical protein FRAAL3784 [Fr...   286    5e-75 
gi|312198390|ref|YP_004018451.1|  hypothetical protein FraEuI1c_4...   286    7e-75 
gi|158318664|ref|YP_001511172.1|  hypothetical protein Franean1_6...   283    4e-74 
gi|319948330|ref|ZP_08022476.1|  gluconate kinase [Dietzia cinnam...   277    3e-72 
gi|86740954|ref|YP_481354.1|  hypothetical protein Francci3_2257 ...   270    6e-70 
gi|288922596|ref|ZP_06416775.1|  conserved hypothetical protein [...   257    3e-66 
gi|288919770|ref|ZP_06414096.1|  conserved hypothetical protein [...   256    8e-66 
gi|158318668|ref|YP_001511176.1|  hypothetical protein Franean1_6...   249    1e-63 
gi|288919156|ref|ZP_06413494.1|  conserved hypothetical protein [...   241    2e-61 
gi|288923414|ref|ZP_06417540.1|  conserved hypothetical protein [...   231    2e-58 
gi|158315848|ref|YP_001508356.1|  hypothetical protein Franean1_4...   230    4e-58 
gi|342857399|ref|ZP_08714055.1|  hypothetical protein MCOL_00935 ...   229    8e-58 
gi|254774818|ref|ZP_05216334.1|  hypothetical protein MaviaA2_091...   228    2e-57 
gi|218781915|ref|YP_002433233.1|  gluconate kinase [Desulfatibaci...   226    6e-57 
gi|269836440|ref|YP_003318668.1|  Uma3 [Sphaerobacter thermophilu...   221    2e-55 
gi|158313805|ref|YP_001506313.1|  hypothetical protein Franean1_1...   217    3e-54 
gi|254823381|ref|ZP_05228382.1|  hypothetical protein MintA_25859...   212    1e-52 
gi|186681115|ref|YP_001864311.1|  hypothetical protein Npun_F0614...   210    4e-52 
gi|86741938|ref|YP_482338.1|  hypothetical protein Francci3_3252 ...   208    2e-51 


>gi|15841486|ref|NP_336523.1| hypothetical protein MT2060 [Mycobacterium tuberculosis CDC1551]
 gi|308369585|ref|ZP_07418361.2| hypothetical protein TMBG_00544 [Mycobacterium tuberculosis SUMu002]
 gi|13881727|gb|AAK46337.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
 gi|308327062|gb|EFP15913.1| hypothetical protein TMBG_00544 [Mycobacterium tuberculosis SUMu002]
Length=502

 Score = 1004 bits (2595),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 498/498 (100%), Positives = 498/498 (100%), Gaps = 0/498 (0%)

Query  1    MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER  60
            MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER
Sbjct  5    MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER  64

Query  61   ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL  120
            ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL
Sbjct  65   ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL  124

Query  121  DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV  180
            DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV
Sbjct  125  DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV  184

Query  181  DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA  240
            DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA
Sbjct  185  DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA  244

Query  241  AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP  300
            AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP
Sbjct  245  AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP  304

Query  301  EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL  360
            EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL
Sbjct  305  EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL  364

Query  361  RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR  420
            RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR
Sbjct  365  RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR  424

Query  421  RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA  480
            RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA
Sbjct  425  RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA  484

Query  481  GPRERSVGQAYHIWRSAI  498
            GPRERSVGQAYHIWRSAI
Sbjct  485  GPRERSVGQAYHIWRSAI  502


>gi|15609141|ref|NP_216520.1| hypothetical protein Rv2004c [Mycobacterium tuberculosis H37Rv]
 gi|31793184|ref|NP_855677.1| hypothetical protein Mb2027c [Mycobacterium bovis AF2122/97]
 gi|121637888|ref|YP_978111.1| hypothetical protein BCG_2021c [Mycobacterium bovis BCG str. 
Pasteur 1173P2]
 77 more sequence titles
 Length=498

 Score = 1002 bits (2591),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 498/498 (100%), Positives = 498/498 (100%), Gaps = 0/498 (0%)

Query  1    MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER  60
            MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER
Sbjct  1    MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER  60

Query  61   ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL  120
            ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL
Sbjct  61   ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL  120

Query  121  DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV  180
            DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV
Sbjct  121  DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV  180

Query  181  DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA  240
            DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA
Sbjct  181  DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA  240

Query  241  AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP  300
            AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP
Sbjct  241  AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP  300

Query  301  EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL  360
            EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL
Sbjct  301  EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL  360

Query  361  RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR  420
            RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR
Sbjct  361  RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR  420

Query  421  RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA  480
            RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA
Sbjct  421  RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA  480

Query  481  GPRERSVGQAYHIWRSAI  498
            GPRERSVGQAYHIWRSAI
Sbjct  481  GPRERSVGQAYHIWRSAI  498


>gi|340627014|ref|YP_004745466.1| hypothetical protein MCAN_20241 [Mycobacterium canettii CIPT 
140010059]
 gi|340005204|emb|CCC44356.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=498

 Score =  993 bits (2568),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 495/498 (99%), Positives = 495/498 (99%), Gaps = 0/498 (0%)

Query  1    MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER  60
            MDSPTNDGTCD HPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER
Sbjct  1    MDSPTNDGTCDDHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRER  60

Query  61   ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL  120
            ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL
Sbjct  61   ACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGAL  120

Query  121  DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMV  180
            DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHA KVVSGDVIRRIEHMV
Sbjct  121  DAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHAGKVVSGDVIRRIEHMV  180

Query  181  DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA  240
            DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA
Sbjct  181  DEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA  240

Query  241  AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP  300
            AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP
Sbjct  241  AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKP  300

Query  301  EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL  360
            EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL
Sbjct  301  EAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRL  360

Query  361  RDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR  420
            RDCGVI GEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR
Sbjct  361  RDCGVIIGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACAR  420

Query  421  RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA  480
            RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA
Sbjct  421  RLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA  480

Query  481  GPRERSVGQAYHIWRSAI  498
            GPRERSVGQAYHIWRSAI
Sbjct  481  GPRERSVGQAYHIWRSAI  498


>gi|240171510|ref|ZP_04750169.1| hypothetical protein MkanA1_19506 [Mycobacterium kansasii ATCC 
12478]
Length=506

 Score =  644 bits (1660),  Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 331/490 (68%), Positives = 376/490 (77%), Gaps = 3/490 (0%)

Query  8    GTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFE  67
            GT DA      P++DV ETHT VVVLAGDRA+KAKKPV+TDF DFRT EQRERAC+RE E
Sbjct  7    GTADAATTAGVPYVDVHETHTGVVVLAGDRAYKAKKPVLTDFLDFRTPEQRERACLREVE  66

Query  68   LNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAG---LPVEGALDAIA  124
            LNSRL+  SYLGIAHLSDP+GG AEPVVVMRRYRD  RLA +   G     V   LD IA
Sbjct  67   LNSRLSPDSYLGIAHLSDPAGGPAEPVVVMRRYRDSDRLAWLAEHGGSETSVRELLDTIA  126

Query  125  EVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFV  184
             VLARFH+ A+R+  ID QGE GA+ RRW ENL ELR +A  V S + I +IE +V EFV
Sbjct  127  AVLARFHEHAERSPLIDAQGEAGAINRRWTENLTELRRYAGTVFSDESIGQIEQLVAEFV  186

Query  185  SGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLA  244
             GR+VLF  RI EGCIVDGH DLLADDIF V   PALLDCLEF+D+LRY+DRIDDAAFLA
Sbjct  187  CGRDVLFNRRIAEGCIVDGHGDLLADDIFCVADGPALLDCLEFDDQLRYVDRIDDAAFLA  246

Query  245  MDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAA  304
            MDLEFLGR DLG+YFL  Y   SGDTAP  L DFYIAYRAVVRAKV+CVR SQGK  +AA
Sbjct  247  MDLEFLGRNDLGEYFLERYLAHSGDTAPKPLHDFYIAYRAVVRAKVDCVRLSQGKSASAA  306

Query  305  DAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCG  364
            DA RHL IAT+HL+   VRLALVGGNPGTGKST+AR +AE VGAQVISTDDVRR+LR+ G
Sbjct  307  DAARHLAIATRHLRQGAVRLALVGGNPGTGKSTVARALAERVGAQVISTDDVRRQLREWG  366

Query  365  VITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAA  424
             I GE GVLD+GLYS  NV AVY+ ALR+ARL L +G  VILDGTW DPQ+RA A RLAA
Sbjct  367  AIAGESGVLDAGLYSPRNVTAVYEVALRRARLSLANGRPVILDGTWRDPQLRAQAHRLAA  426

Query  425  DTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRE  484
            + HS +VE  C+A VD  ADR+  R  GNSDAT +IAA LAA+   WDT H +DT+ P E
Sbjct  427  EAHSPLVELLCTAPVDTAADRVRTRQPGNSDATPQIAATLAAQHNGWDTAHPVDTSRPLE  486

Query  485  RSVGQAYHIW  494
             SV +A+ +W
Sbjct  487  FSVREAHDVW  496


>gi|296169102|ref|ZP_06850761.1| conserved hypothetical protein [Mycobacterium parascrofulaceum 
ATCC BAA-614]
 gi|295896222|gb|EFG75884.1| conserved hypothetical protein [Mycobacterium parascrofulaceum 
ATCC BAA-614]
Length=486

 Score =  605 bits (1559),  Expect = 7e-171, Method: Compositional matrix adjust.
 Identities = 318/481 (67%), Positives = 370/481 (77%), Gaps = 5/481 (1%)

Query  19   PFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYL  78
            P++D+ ETHT VV+LAGDRA+KAKKPV+TDF DFRT +QRE AC RE ELNSRL+ +SYL
Sbjct  2    PYLDLHETHTGVVILAGDRAYKAKKPVLTDFLDFRTPQQREHACRREVELNSRLSPESYL  61

Query  79   GIAHLSDPSGGHAEPVVVMRRYRDKQRLASMV--TAGLPVEGALDAIAEVLARFHQRAQR  136
            GIA LSDP+GG  EPV+VMRRYRD+ RLA+M    +   V GALDAIA VLARFH+ A R
Sbjct  62   GIAQLSDPAGGPPEPVIVMRRYRDEDRLAAMAFRDSDGHVRGALDAIAAVLARFHRDAGR  121

Query  137  NRCIDTQGEVGAVARRWHENLAELRHHADKVVSG---DVIRRIEHMVDEFVSGREVLFAG  193
            +  I  QGE  AV RRWH+NL+ELR +AD    G   + + RIE +VDEF++GR  LF  
Sbjct  122  SAAISAQGEARAVGRRWHDNLSELRRYADAATPGVAAEAVSRIERLVDEFLAGRAPLFGA  181

Query  194  RIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRK  253
            R+ +GCIVDGH DLLADDIF VDG+PALLDCLEF+D+LRY+D IDDAAFLAMDLEFLGRK
Sbjct  182  RVAQGCIVDGHGDLLADDIFWVDGKPALLDCLEFDDKLRYVDCIDDAAFLAMDLEFLGRK  241

Query  254  DLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIA  313
            DL D+FL  YA  + DTAP SLR FYIAYRAVVRAKV+CVR SQG+  AA DA RHL +A
Sbjct  242  DLADHFLERYAQHAKDTAPPSLRAFYIAYRAVVRAKVDCVRLSQGRHAAAEDAARHLAMA  301

Query  314  TQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVL  373
            T HL+   VRLALVGGNPGTGKST+ARG+AE VGA+VISTDDVRR LRD G I GEPGVL
Sbjct  302  TGHLEAGAVRLALVGGNPGTGKSTVARGLAERVGARVISTDDVRRELRDAGAIAGEPGVL  361

Query  374  DSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEF  433
            ++GLY    V AVY+ AL +AR LL  GHSVILDGTW DP  R  A+RLAA+THSA+VEF
Sbjct  362  NAGLYRPDQVAAVYETALSRARQLLSEGHSVILDGTWRDPGTREAAQRLAAETHSALVEF  421

Query  434  RCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHI  493
             CSA   V ADRI  R  GNS+ T EIAAALAA  A W   HRIDT+   +   G+A+ +
Sbjct  422  VCSAAAGVAADRIKTRRSGNSEVTPEIAAALAAGHAAWVGAHRIDTSRSPDLVAGEAHDL  481

Query  494  W  494
            W
Sbjct  482  W  482


>gi|183983459|ref|YP_001851750.1| hypothetical protein MMAR_3476 [Mycobacterium marinum M]
 gi|183176785|gb|ACC41895.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=482

 Score =  601 bits (1549),  Expect = 1e-169, Method: Compositional matrix adjust.
 Identities = 312/479 (66%), Positives = 360/479 (76%), Gaps = 3/479 (0%)

Query  23   VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH  82
            + ETHT VVVL G+RA+KAKKPV+TDF DFRTAEQRERAC RE ELNSRLA  SYLG+AH
Sbjct  1    MHETHTGVVVLVGERAYKAKKPVLTDFLDFRTAEQRERACAREVELNSRLAPTSYLGVAH  60

Query  83   LSDPSGGHAEPVVVMRRYRDKQRLASMVTAG---LPVEGALDAIAEVLARFHQRAQRNRC  139
             +DP+GG AEP+VVMRRY D  RLA  +++G     V   LD IA VLARFH+ A+R+  
Sbjct  61   CTDPTGGPAEPLVVMRRYHDSDRLAYQISSGGSDESVRALLDTIATVLARFHEGAERSPT  120

Query  140  IDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGC  199
            I+T GE  A+ RR+ +NLAEL  +A      + I RIE +V  F+SGRE L A RI +GC
Sbjct  121  INTAGEPAAIGRRFGDNLAELHRYAGTSFPDESIGRIEDLVAAFISGRETLLAQRIAQGC  180

Query  200  IVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYF  259
            IVDGH DLLADDIF  +G PALLDCLEF+D LRY+DR+DDAAFLAMDLEFLGRKDLG+YF
Sbjct  181  IVDGHGDLLADDIFCAEGGPALLDCLEFDDRLRYVDRVDDAAFLAMDLEFLGRKDLGEYF  240

Query  260  LAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQH  319
            L  Y   SGD APASLRDFYIAYRAVVRAK +CVR SQGKPEAAADA RHL +AT+HL+ 
Sbjct  241  LDRYLAHSGDVAPASLRDFYIAYRAVVRAKTDCVRLSQGKPEAAADAARHLELATRHLET  300

Query  320  ATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS  379
              VRLALVGGNPGTGKSTLAR +AE VGAQVISTDDVR+ LRD G I GE GVLD GLY+
Sbjct  301  GAVRLALVGGNPGTGKSTLARALAEQVGAQVISTDDVRKELRDRGDIHGESGVLDEGLYT  360

Query  380  RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV  439
            R NV  VY   L +AR  L  G SVILDGTW DPQ RA A  L   TH+A+VE  C+  V
Sbjct  361  RDNVTVVYDLVLSRARRCLQEGRSVILDGTWRDPQSRARAHHLGGQTHAALVELLCTLPV  420

Query  440  DVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSAI  498
            D+ ADRI  RA GNS+ TAEIAA +AA+ A WDT   +DT+ P E S+ +A+  W  AI
Sbjct  421  DMAADRISTRAPGNSEVTAEIAATMAAQHAGWDTALPMDTSRPIEFSLNEAHDAWCRAI  479


>gi|169630981|ref|YP_001704630.1| hypothetical protein MAB_3902c [Mycobacterium abscessus ATCC 
19977]
 gi|169242948|emb|CAM63976.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=519

 Score =  507 bits (1305),  Expect = 2e-141, Method: Compositional matrix adjust.
 Identities = 264/481 (55%), Positives = 332/481 (70%), Gaps = 4/481 (0%)

Query  22   DVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIA  81
            ++RETHT +V+L G  A+K KKPV+T+F DF T E RERAC RE  LNSR++  SYLG++
Sbjct  30   EIRETHTGIVILVGGMAYKIKKPVITNFLDFSTPELRERACAREVALNSRISQDSYLGVS  89

Query  82   HLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCID  141
            HL+DP GG  EPVVVMRRY D  RL++M  +    +  LDAIAEVLA FH+RA R+  ID
Sbjct  90   HLTDPDGGSGEPVVVMRRYPDAARLSAMAKSKRVTKAHLDAIAEVLAGFHKRADRSPSID  149

Query  142  TQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIV  201
              G + A+  RW + L  L  +A  V++ D +R +  +  +F+SGR VLFA R+ +G I+
Sbjct  150  EAGSLDAIVDRWDDTLTALEKYAGTVLAADDVRLVRTLATQFISGRAVLFAQRVADGRII  209

Query  202  DGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLA  261
            DGH DLLA DIF +   P LLDCLEF+D LR++D +DDAAFLAMDLEFLGR DLGDYF+ 
Sbjct  210  DGHGDLLASDIFCLPNGPVLLDCLEFDDRLRHVDGLDDAAFLAMDLEFLGRPDLGDYFMN  269

Query  262  GYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHAT  321
             Y   S DTAP  LR FYIAYRA+VRAKV+C+RF+QG+  AA  A +H+ +A  HL  AT
Sbjct  270  RYVELSADTAPEPLRHFYIAYRALVRAKVDCIRFTQGQRSAAGRAAKHVAMALSHLGAAT  329

Query  322  VRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRA  381
            VRL LVGG PG GKSTLAR ++E +GAQVISTD+VR++L   GVI+G  GVLD+GLYS  
Sbjct  330  VRLVLVGGGPGAGKSTLARRISEDIGAQVISTDEVRQQLHRLGVISGGKGVLDAGLYSTE  389

Query  382  NVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDV  441
            NV AVY   LR+ARL L  GH+VILDGTW  P+ R  A +LA +  + +VEF C A +  
Sbjct  390  NVGAVYDAVLRRARLALAGGHTVILDGTWRSPRHRLRAHQLAYEAGAPMVEFLCLAPLVT  449

Query  442  MADRIVARAGGNSDATAEIAAALAARQA----DWDTGHRIDTAGPRERSVGQAYHIWRSA  497
               R+ AR  G SDAT +IAAAL A  A     W   H IDT  P +RS  +A  + R A
Sbjct  450  AQHRVAARHDGVSDATGDIAAALGAEFAGPDRGWGEAHVIDTRLPLDRSTAEAEELCRQA  509

Query  498  I  498
            +
Sbjct  510  L  510


>gi|333990933|ref|YP_004523547.1| hypothetical protein JDM601_2293 [Mycobacterium sp. JDM601]
 gi|333486901|gb|AEF36293.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=429

 Score =  472 bits (1214),  Expect = 7e-131, Method: Compositional matrix adjust.
 Identities = 248/419 (60%), Positives = 298/419 (72%), Gaps = 0/419 (0%)

Query  80   IAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRC  139
            +AHL  P+G  AEPV+VMRRY D++RLASMV    PVE  LD IA +LA FH   +R+  
Sbjct  1    MAHLQGPAGAPAEPVIVMRRYHDEERLASMVKRAEPVERVLDRIAGLLADFHDHGERSPT  60

Query  140  IDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGC  199
            I  QG+  AV +RW +N   L  HA   V  + +RR++ +  E++SGR  LF  R+++GC
Sbjct  61   ISRQGDPEAVRQRWDDNFPTLHQHAGTAVPSETVRRVQGLGAEYLSGRAGLFTRRVEQGC  120

Query  200  IVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYF  259
            IVDGHADLLADDIF VD  P LLDCLEF DELRY+DRIDDAAFLAMDLEFLGRKDLGD+F
Sbjct  121  IVDGHADLLADDIFWVDDRPVLLDCLEFSDELRYVDRIDDAAFLAMDLEFLGRKDLGDHF  180

Query  260  LAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQH  319
            L  YA  SGDTA  SLRDFYIAYRAVVRAKV+CVR +QGK  +AA A  HL IA +HL+ 
Sbjct  181  LERYAACSGDTAARSLRDFYIAYRAVVRAKVDCVRLTQGKRGSAAAAADHLDIALRHLED  240

Query  320  ATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS  379
              VRL LVGG PGTGKSTLA  +AE VGA V+STDDVRR LR  G ++GE G L +GLY+
Sbjct  241  GAVRLVLVGGGPGTGKSTLAGALAERVGAVVVSTDDVRRELRSSGQLSGETGNLGAGLYA  300

Query  380  RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV  439
             ANV AVY   L++A   LG G SV+LDGTW D + RA ARRLA D H+A  E RC   +
Sbjct  301  PANVAAVYHAVLQRAGRHLGDGVSVVLDGTWRDAETRAEARRLADDKHAAFGEIRCVVPI  360

Query  440  DVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSAI  498
            +V A+R+  RA GNSDAT +IA  L A    WDT H +DT+ P +  V +A+  WR+AI
Sbjct  361  EVAAERVRTRAAGNSDATPQIAGVLGADDFRWDTAHHVDTSKPLDECVREAHEQWRAAI  419


>gi|118473212|ref|YP_888231.1| hypothetical protein MSMEG_3942 [Mycobacterium smegmatis str. 
MC2 155]
 gi|118174499|gb|ABK75395.1| conserved hypothetical protein [Mycobacterium smegmatis str. 
MC2 155]
Length=498

 Score =  470 bits (1209),  Expect = 3e-130, Method: Compositional matrix adjust.
 Identities = 269/476 (57%), Positives = 326/476 (69%), Gaps = 2/476 (0%)

Query  23   VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH  82
            +RETHT +V L GD A+KAKKPV TDF DF TA+QRE AC+RE ELNSRLA  SYLG+AH
Sbjct  25   IRETHTGLVALIGDLAYKAKKPVRTDFLDFTTAQQREAACLREVELNSRLAPNSYLGVAH  84

Query  83   LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT  142
            L  P     EPVVVMRRYRD  RL+++VT G  V   LD IAE+LARFH+ A R   ID 
Sbjct  85   LVGPGDRPDEPVVVMRRYRDADRLSTLVTRGAEVNDQLDVIAEILARFHRDAGRGAVIDD  144

Query  143  QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD  202
            Q    +V  RW ENL EL   +  +V  + +  +  +  +++SGR  LFA RI +G IVD
Sbjct  145  QARATSVWARWDENLTELARMS--LVPPEQLSEVRRLASQYLSGRAELFAERIADGRIVD  202

Query  203  GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG  262
            GHADLLADDIF     PA+LDCLEF+D LRY+D +DDAAFLAMDLEFLG  +L  +F+  
Sbjct  203  GHADLLADDIFCTPEGPAILDCLEFDDTLRYVDGVDDAAFLAMDLEFLGSPELSAFFVDR  262

Query  263  YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV  322
            Y   + DTAP SL DFY+AYRAVVRAKVEC+R  QG+PEAA DA RH+ IA   L+ ATV
Sbjct  263  YRHHAHDTAPQSLMDFYVAYRAVVRAKVECIRVGQGRPEAATDACRHIDIALDRLRAATV  322

Query  323  RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN  382
            +L +VGG PGTGK+T++R +AE +GA VISTDDVRR L++ GVI G  G LD+GLY+  N
Sbjct  323  QLVIVGGGPGTGKTTVSRALAEELGAVVISTDDVRRYLQESGVIGGAAGELDTGLYAPKN  382

Query  383  VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM  442
            V AVY E L +AR  L  G SVILDGTW D   R  A  LA++T   +VEF CS  V   
Sbjct  383  VAAVYDEVLARARHALTHGRSVILDGTWRDVGRRQRAHLLASETAVPVVEFTCSLPVVAA  442

Query  443  ADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSAI  498
             +RI +R+G  SDAT EIA ALA + A    GH IDT+ P   SV +A  +   AI
Sbjct  443  GERIASRSGTTSDATPEIADALAEQGAGIVHGHSIDTSRPLRESVTEAQRVCCLAI  498


>gi|120402395|ref|YP_952224.1| hypothetical protein Mvan_1384 [Mycobacterium vanbaalenii PYR-1]
 gi|119955213|gb|ABM12218.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=490

 Score =  468 bits (1203),  Expect = 2e-129, Method: Compositional matrix adjust.
 Identities = 262/472 (56%), Positives = 319/472 (68%), Gaps = 1/472 (0%)

Query  22   DVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIA  81
            +V ETHT +V L GDRAFK KKPVVTDF DF TA++RE AC RE ELN RLA+ SYLG+ 
Sbjct  15   EVHETHTGLVALVGDRAFKIKKPVVTDFLDFSTAQKRETACRREIELNRRLASSSYLGVG  74

Query  82   HLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCID  141
            H   PSG  AEPV+VMRRY D +RLA +V +G+PVE  L AIA++LARFH RA+R   ID
Sbjct  75   HFQPPSGD-AEPVIVMRRYPDTERLAELVRSGVPVETWLTAIADLLARFHSRAERGDAID  133

Query  142  TQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIV  201
             +     ++ RW +NL ELR HA  VV    I  +E +   +++GRE L+  RI    +V
Sbjct  134  REATARVLSDRWQQNLTELRRHAGTVVDDGQIAEVERLASAYLAGREPLYQARIAAHRVV  193

Query  202  DGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLA  261
            DGH DLL+ DIF     P LLDCLEF+D LRY+D IDDA FLAMDLEFLGR+DL D+FL 
Sbjct  194  DGHGDLLSQDIFCTAEGPMLLDCLEFDDRLRYVDGIDDAGFLAMDLEFLGRRDLADFFLD  253

Query  262  GYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHAT  321
             Y  R+ D A  SLR F IAYRAVVRAKV+CVR  QG  EA  DA RHL IA  HL+   
Sbjct  254  EYCRRADDPAAHSLRHFCIAYRAVVRAKVDCVRVDQGHAEAIPDAQRHLAIALAHLRSGR  313

Query  322  VRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRA  381
            V+L +VGG PGTGK+TLAR +A+ V AQ+ISTD+VRR L   GV+ G  G L++GLY+  
Sbjct  314  VQLVVVGGGPGTGKTTLARALAQCVDAQLISTDEVRRELVGSGVVHGRAGELNTGLYTPE  373

Query  382  NVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDV  441
            N+ AVY E L +AR  LG+GHSVI+DGTW D   R  A  +AA T+S+IVE RC+  V  
Sbjct  374  NLSAVYDEVLSRARAWLGAGHSVIVDGTWRDAGHRQRAHAVAAQTYSSIVELRCTLPVAE  433

Query  442  MADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHI  493
               RI  R    SDAT  +AA L+  ++DW   H IDTAGP   SV  A  +
Sbjct  434  AERRIAGRGATASDATPAMAAELSRWESDWPGAHPIDTAGPLADSVAAARQV  485


>gi|145220867|ref|YP_001131545.1| hypothetical protein Mflv_0263 [Mycobacterium gilvum PYR-GCK]
 gi|315442178|ref|YP_004075057.1| hypothetical protein Mspyr1_05130 [Mycobacterium sp. Spyr1]
 gi|145213353|gb|ABP42757.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
 gi|315260481|gb|ADT97222.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1]
Length=490

 Score =  467 bits (1202),  Expect = 2e-129, Method: Compositional matrix adjust.
 Identities = 263/480 (55%), Positives = 313/480 (66%), Gaps = 10/480 (2%)

Query  23   VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH  82
            + ETHT +VVL G+RA+KAKK V TDF DF T EQR RA   E  LN RLA QSYLG+  
Sbjct  11   IHETHTGLVVLLGERAYKAKKAVKTDFLDFSTVEQRARALHHEVTLNRRLAPQSYLGVGE  70

Query  83   LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT  142
             + P G   EPV+VMRRY D  RL S+V  G P+   L  IA  LA FH+ A+R   ID 
Sbjct  71   FAMP-GAQPEPVIVMRRYPDSARLTSLVAQGKPLTAELREIAGRLADFHRDARRGPDIDA  129

Query  143  QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD  202
            QG   AV  RW +NL EL  HAD V     +  I  +   F++GR VL A RI +G IVD
Sbjct  130  QGRPEAVWERWAQNLTELGRHADVVFDRADLDEIHALAQRFLAGRSVLMAQRIAQGRIVD  189

Query  203  GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG  262
            GHADLL DDIF +   PA+LDCLEF+D LRY+D +DDAAFLAMDLEF GR +LGD FL  
Sbjct  190  GHADLLTDDIFCMPDGPAMLDCLEFDDLLRYVDGVDDAAFLAMDLEFHGRGNLGDEFLRE  249

Query  263  YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV  322
            Y  R+ D AP SL++FYIAYRAVVRAKV+CVR  QG PEAA DA RHL +A   L+   V
Sbjct  250  YVARAADPAPRSLQNFYIAYRAVVRAKVDCVRVEQGHPEAADDARRHLHLAADRLRDGAV  309

Query  323  RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN  382
            RL +VGG PG+GK+T++R +AE++GAQVISTDDVRR LRD GVI+G  G LDSGLY+  +
Sbjct  310  RLVIVGGGPGSGKTTVSRALAEVLGAQVISTDDVRRELRDAGVISGAVGALDSGLYAPES  369

Query  383  VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM  442
            V  VY E LR+A   +  G SVILDGTW D +    ARRLA  T + ++EF C   V+  
Sbjct  370  VARVYDEVLRRAEAAVTGGCSVILDGTWRDEEETGRARRLADSTATPLIEFTCVLPVEEA  429

Query  443  ADRIVARAGGNSDATAEIAAALAARQADWDTG----HRIDTAGPRERSVGQAYHIWRSAI  498
              RI AR    SDAT +IA ALA R     TG    H +DT  P   SV +A  I R  I
Sbjct  430  GARIRARTQTTSDATPQIAEALAGR-----TGVAGRHPLDTGRPLAESVAEAQRICRKVI  484


>gi|108798047|ref|YP_638244.1| hypothetical protein Mmcs_1074 [Mycobacterium sp. MCS]
 gi|119867142|ref|YP_937094.1| hypothetical protein Mkms_1090 [Mycobacterium sp. KMS]
 gi|126433707|ref|YP_001069398.1| hypothetical protein Mjls_1101 [Mycobacterium sp. JLS]
 gi|108768466|gb|ABG07188.1| conserved hypothetical protein [Mycobacterium sp. MCS]
 gi|119693231|gb|ABL90304.1| conserved hypothetical protein [Mycobacterium sp. KMS]
 gi|126233507|gb|ABN96907.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=523

 Score =  448 bits (1152),  Expect = 1e-123, Method: Compositional matrix adjust.
 Identities = 260/492 (53%), Positives = 309/492 (63%), Gaps = 13/492 (2%)

Query  2    DSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERA  61
            DSP ND   +           V ETHT VV+L G++A+K KKPV TDF DF   EQRER 
Sbjct  40   DSPVNDMAAE-----------VYETHTGVVLLLGEKAYKIKKPVTTDFLDFSAPEQRERV  88

Query  62   CIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALD  121
            C RE ELNSRLA  SYLG+AH+  P     EPVVVMRRY D+ RL SMV  G   E  L 
Sbjct  89   CAREVELNSRLAPGSYLGVAHMHGPGHDVPEPVVVMRRYPDRYRLRSMVIRGESTENHLT  148

Query  122  AIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVD  181
             +A +LARFH  A R   ID      AV  RW ENL EL H A  VVS   +  +  +  
Sbjct  149  MLASMLARFHATADRRADIDACATAAAVRARWCENLDELDHSAGAVVSAQTVDEVRRLAL  208

Query  182  EFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAA  241
             ++ GR+ LFAGRI +  IVDGH DLLADD+F     P  LDCLEF+D LR++D +DDAA
Sbjct  209  RYLDGRDALFAGRIADRRIVDGHGDLLADDVFCTPDGPVPLDCLEFDDRLRFVDGVDDAA  268

Query  242  FLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPE  301
            FLAMDLEFLGR+DL D+FL  Y   +GD+AP SL DFYIAYRAVVRAKV+C++  QG  +
Sbjct  269  FLAMDLEFLGRRDLADHFLDQYQELAGDSAPRSLVDFYIAYRAVVRAKVDCIKVGQGHED  328

Query  302  AAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLR  361
            AAADA  HL IA  HL+ ATVRL LVGG PGTGK+TL+  + E VGA VISTD+VRR L+
Sbjct  329  AAADAGWHLDIAANHLKAATVRLVLVGGGPGTGKTTLSGALGESVGAHVISTDNVRRELQ  388

Query  362  DCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARR  421
            D GV+ G  G L+SGLYS  NV  VY   L +A +LL  G SV+LDGTW DP  R  AR 
Sbjct  389  DSGVVHGAAGALESGLYSPENVALVYDTVLHRAAVLLAHGESVVLDGTWRDPGHRRAARD  448

Query  422  LAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAG  481
             A  + + +VE  C   +     RI  R    SDAT +IAA +      W   HR+DT  
Sbjct  449  CADRSSAVLVELACDTELSAAQTRITHRTSTTSDATPQIAADITTPV--WHGAHRVDTGR  506

Query  482  PRERSVGQAYHI  493
            P   SV +A  I
Sbjct  507  PLADSVAEAQQI  518


>gi|296394769|ref|YP_003659653.1| hypothetical protein Srot_2377 [Segniliparus rotundus DSM 44985]
 gi|296181916|gb|ADG98822.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=522

 Score =  428 bits (1100),  Expect = 1e-117, Method: Compositional matrix adjust.
 Identities = 236/468 (51%), Positives = 295/468 (64%), Gaps = 0/468 (0%)

Query  19   PFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYL  78
            P+ DV ETH+ VV L GDRA+K KKP+ T F DFR  E RERACIRE ELN R +   YL
Sbjct  18   PYADVAETHSGVVFLVGDRAYKLKKPIATAFLDFRRTEDRERACIREVELNRRFSPDVYL  77

Query  79   GIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNR  138
            G+AHL++P GG  EPVVVMRR  ++ RL+ +       +  L  +A  LA FH+ A+R+ 
Sbjct  78   GVAHLTEPGGGPDEPVVVMRRMPEEARLSLLALGQADAKEGLGELARKLASFHRLARRSA  137

Query  139  CIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEG  198
             ID +G   A  RRW  NL E+R  A     G +I R+E +  +++ GRE LFA RI + 
Sbjct  138  QIDAEGTACATRRRWQANLTEIRGFAAAAEHGWLIDRVERLAADYLHGREPLFADRIAQR  197

Query  199  CIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDY  258
             I+DGH DL+ADD+FL+   P +LDCL+F+D LR++D  DDAAFL MDLE LGR DL   
Sbjct  198  RIIDGHGDLIADDVFLLPDGPRVLDCLDFDDRLRFVDGADDAAFLVMDLEHLGRADLAGG  257

Query  259  FLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQ  318
            FL GY   + D AP SL D YIAYRA+VRAKVE +RF QG  E+  +A +HL  A+ HL+
Sbjct  258  FLGGYLAAAEDPAPRSLVDHYIAYRALVRAKVEFLRFEQGCDESRREARQHLAEASAHLE  317

Query  319  HATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLY  378
               VRL LVGG PGTGKSTLA  +AE VGAQVIS+D VR  L+   +I GE G   SGLY
Sbjct  318  RGAVRLMLVGGLPGTGKSTLANALAERVGAQVISSDLVRHELKTARMIAGELGQYASGLY  377

Query  379  SRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSAT  438
            S      VYQ    +AR  L  G SVILD +WG    R  A++L  +T +A+V  RC+  
Sbjct  378  SPELSSMVYQVMFERARDALSHGESVILDASWGAAGERERAQQLGRETDAAVVALRCTTP  437

Query  439  VDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERS  486
             DV   RI  R  G SDATA+IA A+A     W     +DT+ P E S
Sbjct  438  PDVAERRIAHRRAGFSDATADIARAMATDAGAWTQATDVDTSAPLETS  485


>gi|317507619|ref|ZP_07965332.1| hypothetical protein HMPREF9336_01704 [Segniliparus rugosus ATCC 
BAA-974]
 gi|316254096|gb|EFV13453.1| hypothetical protein HMPREF9336_01704 [Segniliparus rugosus ATCC 
BAA-974]
Length=492

 Score =  416 bits (1068),  Expect = 7e-114, Method: Compositional matrix adjust.
 Identities = 233/472 (50%), Positives = 296/472 (63%), Gaps = 0/472 (0%)

Query  23   VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH  82
            +RETH+ VV+LAG+RA+K KKPV T F DF T E RE AC RE ELN RL+   YLG+AH
Sbjct  13   MRETHSGVVLLAGERAYKFKKPVTTAFLDFSTHEAREFACAREAELNRRLSPDVYLGVAH  72

Query  83   LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT  142
            L+DP GG AEPVVVMRR  +  RL+++V  G PV   L  +A  LA FH+ AQR   +  
Sbjct  73   LTDPVGGPAEPVVVMRRMPETARLSTLVGQGAPVGQGLAELALALAGFHRWAQRGPQVAA  132

Query  143  QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD  202
            Q  V A+  RW  NLAE+   +       ++  I+ +   ++ GR+ LFA RI  G IVD
Sbjct  133  QASVKAIRGRWQANLAEISLFSAAAQHDGLLDHIQQLALRYLRGRKELFAERIARGRIVD  192

Query  203  GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG  262
            GH DL+ADD+FL+   P  LDCL+F+D LR++D  DDAAFLAMDLE+LGR DLG  FL  
Sbjct  193  GHGDLIADDVFLLPEGPRALDCLDFDDRLRFVDGADDAAFLAMDLEYLGRPDLGQSFLEQ  252

Query  263  YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV  322
            Y   + D AP SL   YIAYRA+VRAKV+ VR  QG  ++ A+A+RHL +A  HL+    
Sbjct  253  YLAEAEDDAPRSLLHHYIAYRALVRAKVDYVRLGQGHAQSRAEALRHLRLAADHLERGAA  312

Query  323  RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN  382
            RL L+GG PGTGKSTLA  +A  VGA V+S+D+VR  L++ G I GE G    GLY+   
Sbjct  313  RLVLIGGLPGTGKSTLAAALAGEVGAAVVSSDEVRHELKESGEIAGEAGQYGRGLYAPEA  372

Query  383  VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM  442
               VY+  L +AR  L  G SV+LD +W   + R  A RLA ++ +A+VE RC A  DV 
Sbjct  373  AAKVYRTMLDRARGALAGGASVVLDASWVQAEQRELAARLAEESSAALVELRCVAPQDVA  432

Query  443  ADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIW  494
              RI AR    SDATAEIA  +AA    W T   +DTA     ++  A   W
Sbjct  433  WRRIAARRQSRSDATAEIARDMAADMRPWPTSSEVDTAAEPGAALRAALEAW  484


>gi|325676765|ref|ZP_08156438.1| hypothetical protein HMPREF0724_14221 [Rhodococcus equi ATCC 
33707]
 gi|325552313|gb|EGD22002.1| hypothetical protein HMPREF0724_14221 [Rhodococcus equi ATCC 
33707]
Length=513

 Score =  393 bits (1009),  Expect = 4e-107, Method: Compositional matrix adjust.
 Identities = 228/484 (48%), Positives = 292/484 (61%), Gaps = 7/484 (1%)

Query  17   DEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQS  76
            D+PF  + ETH+ VV+L GDR +K KKP+ T+F DFR+ E R  AC  E ELN RLA   
Sbjct  26   DQPFAGLHETHSGVVILLGDRVYKIKKPIRTEFLDFRSREARLAACRNEVELNRRLAPDV  85

Query  77   YLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAG-LPVEGALDAIAEVLARFHQRAQ  135
            YLG+  L    GG  EP VVMRR  +  RL+++  A       A+DAIA ++A FH+RA 
Sbjct  86   YLGVGELGGTEGGDGEPTVVMRRMPESARLSTLARASSTQCPSAVDAIARIVADFHRRAA  145

Query  136  RNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRI  195
            R   ID +G   AV RRWH+N+ E R     VV+ D +  IE  VD ++ GR  LFA RI
Sbjct  146  RGPRIDREGTADAVRRRWHDNIRETRELPRAVVAEDRLAAIERTVDRYLDGRGPLFAQRI  205

Query  196  KEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDL  255
             +GCIVDGHADLL+DDIF ++  P +LDCLEF+  LRYLDRIDD A LAMDLEF GR DL
Sbjct  206  TDGCIVDGHADLLSDDIFCLEDGPRILDCLEFDARLRYLDRIDDIACLAMDLEFQGRPDL  265

Query  256  GDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQ  315
                +  Y     +TAP SL   YIAYRA +RAKV+CVR  QG+  +A DA RH  +A Q
Sbjct  266  ARRLVLRYRDALTETAPDSLVHHYIAYRAFMRAKVDCVRHLQGRASSADDAARHTALAEQ  325

Query  316  HLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCG-VITGEPGVLD  374
            HL  A  RL LVGG P TGKST+A  +AE VGA++IS+D VRR L       T +PG   
Sbjct  326  HLDRARCRLVLVGGLPATGKSTVAARLAETVGAELISSDHVRRHLFAADRTATPDPG-YR  384

Query  375  SGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFR  434
            SG YS  +   VY   L +AR LL  G SV+LD +W   + R  A   A    + +V+ +
Sbjct  385  SGRYSPDSTGRVYDSMLDRARELLAGGRSVVLDASWTHREHRLRAAETAVAVCADLVQLQ  444

Query  435  CSATVDVMADRIVARAGG----NSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA  490
            C+A  ++   R+  RA      +S+AT  +A A+A     W    R+DT+GP + S+  A
Sbjct  445  CTAPAELTEHRLRERAASRRDHDSEATPAVAVAMAHDADSWPAATRVDTSGPLDASLSVA  504

Query  491  YHIW  494
               W
Sbjct  505  AAEW  508


>gi|312139781|ref|YP_004007117.1| hypothetical protein REQ_23910 [Rhodococcus equi 103S]
 gi|311889120|emb|CBH48433.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=493

 Score =  388 bits (997),  Expect = 1e-105, Method: Compositional matrix adjust.
 Identities = 230/484 (48%), Positives = 292/484 (61%), Gaps = 7/484 (1%)

Query  17   DEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQS  76
            D+PF  + ETH+ VV+L GDR +K KKP+ T+F DFR+ E R  AC  E ELN RLA   
Sbjct  6    DQPFAGLHETHSGVVILLGDRVYKIKKPIRTEFLDFRSREARLAACRNEVELNRRLAPDV  65

Query  77   YLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAG-LPVEGALDAIAEVLARFHQRAQ  135
            YLG+  L DP GG  EP VVMRR  +  RL+++  A       A+DAIA ++A FH+RA 
Sbjct  66   YLGVGELGDPEGGDGEPTVVMRRMPESARLSTLARASSTQCPSAVDAIARIVADFHRRAA  125

Query  136  RNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRI  195
            R   ID +G   AV RRWH+N+ E R     VV+ D +  IE  VD +  GR  LFA RI
Sbjct  126  RGPRIDREGTADAVRRRWHDNIRETRELPRAVVAEDRLAAIERTVDRYFDGRGPLFAQRI  185

Query  196  KEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDL  255
             +GCIVDGHADLL+DDIF ++  P +LDCLEF+  LRYLDRIDD A LAMDLEF GR DL
Sbjct  186  TDGCIVDGHADLLSDDIFCLEDGPRILDCLEFDARLRYLDRIDDIACLAMDLEFQGRPDL  245

Query  256  GDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQ  315
                +  Y     +TAP SL   YIAYRA +RAKV+CVR  QG+  +A DA RH  +A Q
Sbjct  246  AQRLVLRYRDALTETAPDSLVHHYIAYRAFMRAKVDCVRHLQGRAASADDAARHTALAEQ  305

Query  316  HLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCG-VITGEPGVLD  374
            HL  A  RL LVGG P TGKST+A  +AE VGA++IS+D VRR L       T  PG   
Sbjct  306  HLDRARCRLVLVGGLPATGKSTVAARLAETVGAELISSDHVRRHLFAADRTATPYPG-YR  364

Query  375  SGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFR  434
            SG YS  +   VY   L +AR LL  G SV+LD +W   + R  A   A    + +V+ +
Sbjct  365  SGRYSPDSTGRVYDSMLDRARELLAGGRSVVLDASWTHREHRLRAAETAVAVCADLVQLQ  424

Query  435  CSATVDVMADRIVARAGG----NSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA  490
            C+A  ++   R+  RA      +S+AT  +A A+A     W    R+DT+GP + S+  A
Sbjct  425  CTAPAELTEHRLRERAASRRDHDSEATPAVAVAMAHDADSWPAATRVDTSGPLDASLSVA  484

Query  491  YHIW  494
               W
Sbjct  485  AAEW  488


>gi|226306904|ref|YP_002766864.1| hypothetical protein RER_34170 [Rhodococcus erythropolis PR4]
 gi|226186021|dbj|BAH34125.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=495

 Score =  380 bits (975),  Expect = 4e-103, Method: Compositional matrix adjust.
 Identities = 216/479 (46%), Positives = 278/479 (59%), Gaps = 0/479 (0%)

Query  20   FIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLG  79
            ++DV+ET T VVVL GDRA+K KK + T F DF    +RE A  RE  LN R+    Y G
Sbjct  7    YLDVKETTTGVVVLVGDRAYKIKKAISTPFLDFSEPSRREDALQRELTLNQRICEGVYRG  66

Query  80   IAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRC  139
            ++H+ DP    +EP++VM R  D++RL+++      V  ALD IA  ++ FH+RA R+  
Sbjct  67   VSHVVDPVDQSSEPILVMVRMPDERRLSALAATNRDVAHALDEIAAAVSDFHRRAARSVR  126

Query  140  IDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGC  199
            I  QG    V  RW  NLAELR     ++    +  +E M   ++ GR  LF  RI EGC
Sbjct  127  ISEQGTSTGVGHRWVSNLAELRQLCSGLLPPCRLDHLESMSTRYLCGRRSLFDSRIAEGC  186

Query  200  IVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYF  259
            IVDGH DLLADDIF++   P +LDCL+F+D+LR++D +DD+ FLAMDLEFLG +DL   F
Sbjct  187  IVDGHGDLLADDIFVLPDGPKILDCLDFDDQLRFVDILDDSCFLAMDLEFLGYEDLASEF  246

Query  260  LAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQH  319
            LA    +S D  P SL D YIAYRA VRAKV+ +R  QG   + A   RHL +A  HL +
Sbjct  247  LASIVTKSNDQPPISLIDHYIAYRATVRAKVDALRLQQGDSNSQAALERHLDLAENHLCN  306

Query  320  ATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS  379
              VRL LVGG PGTGK+TL+  +A   GA VIS+D VRR L D G + G      SGLYS
Sbjct  307  GEVRLCLVGGFPGTGKTTLSLALAAHTGATVISSDRVRRELVDAGALRGSADAYQSGLYS  366

Query  380  RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV  439
              +V  VY E L +AR  L  G SV+LD TW   + R  A  L   T S ++    S  +
Sbjct  367  PDSVHTVYSEMLDRARDHLSMGESVVLDATWARSRHRREAELLCTSTASTLISLSTSTPL  426

Query  440  DVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSAI  498
             V   RI  R    SDAT   AAA+A     W     IDT    + S   A  +WR ++
Sbjct  427  SVAVQRIATRTNTLSDATTATAAAIAQGHDPWPESTAIDTDTSIDVSAESAVDVWRRSV  485


>gi|111017077|ref|YP_700049.1| hypothetical protein RHA1_ro00055 [Rhodococcus jostii RHA1]
 gi|110816607|gb|ABG91891.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=504

 Score =  366 bits (939),  Expect = 6e-99, Method: Compositional matrix adjust.
 Identities = 221/473 (47%), Positives = 288/473 (61%), Gaps = 1/473 (0%)

Query  25   ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS  84
            ETHTA VV+ GD  FKAKKP+ T F DF TAE+R  AC RE  LN RL    YLG+A L+
Sbjct  25   ETHTAYVVMVGDVVFKAKKPIRTAFADFGTAERRRAACEREVTLNRRLCPDVYLGVAELT  84

Query  85   DPSGGHAEPVVVMRRYRDKQRLASMVTAGLP-VEGALDAIAEVLARFHQRAQRNRCIDTQ  143
            DP+GG  E +V MRR    +RLA +V  G       +DAIA V+ARFH  A+++  ID  
Sbjct  85   DPAGGPTEALVKMRRMPSDRRLARLVGGGGDDTTAQVDAIAAVVARFHAGAEQSAEIDCD  144

Query  144  GEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDG  203
               GAVA RW  N++E+  +   V+    +  ++     ++ GR+ LF  R++EG IVDG
Sbjct  145  ATPGAVAARWRANVSEVTSYRCDVLPAADVHEVQARALRYLKGRKRLFEYRVREGRIVDG  204

Query  204  HADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGY  263
            H DLLADDIF +   P +LDCL+F+D LR++D IDDAA LAMDLE+LGR+DLG  FL  Y
Sbjct  205  HGDLLADDIFCLADGPRILDCLDFDDHLRHVDCIDDAACLAMDLEYLGREDLGSRFLDRY  264

Query  264  AVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVR  323
               + D  PASL+  YIAYRA VRAKV C+R++QG   A  DA  H  +A + L+  TVR
Sbjct  265  CAAARDEPPASLQHHYIAYRAFVRAKVACLRYTQGSRAAGEDARAHCNLALRQLRAGTVR  324

Query  324  LALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANV  383
            +ALVGG PGTGKSTL+R +A++ G+ VIS+D VR+ L      + +      GLYS    
Sbjct  325  MALVGGLPGTGKSTLSRKLADVTGSVVISSDHVRKELDGLDPHSRQVAGFGEGLYSGTMT  384

Query  384  VAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVMA  443
               Y E LR+AR  L +G SV+LD +W     R  A  +A+ THS +VE  C A   +  
Sbjct  385  DRTYAEVLRRARDHLTAGRSVVLDASWTQSMRRERAALVASCTHSDLVELECRAPRAMAI  444

Query  444  DRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRS  496
             RI +R  G+SDAT  +  A+AA  A W +   +DT      SV  A  IW S
Sbjct  445  ARIGSRPTGDSDATPAVYDAMAASAAAWPSATAVDTDTAAGDSVQTAERIWHS  497


>gi|271966203|ref|YP_003340399.1| gluconate kinase [Streptosporangium roseum DSM 43021]
 gi|270509378|gb|ACZ87656.1| gluconate kinase [Streptosporangium roseum DSM 43021]
Length=533

 Score =  364 bits (934),  Expect = 2e-98, Method: Compositional matrix adjust.
 Identities = 210/474 (45%), Positives = 277/474 (59%), Gaps = 0/474 (0%)

Query  20   FIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLG  79
            +  ++ETH  VVVL GD AFK KKPV   F DF T + RER C  E ELN RLA   Y G
Sbjct  40   WAQIKETHIGVVVLLGDHAFKLKKPVNFGFVDFTTRQARERICHEEVELNRRLAPDVYEG  99

Query  80   IAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRC  139
            +A +    G   E +VVMRR  +++RLA+M+ +G PVE  L  IA ++A  H R++ +  
Sbjct  100  VADVLGTDGQVCEHLVVMRRMPEERRLAAMIDSGKPVEEHLRQIARMVASMHGRSRHSPQ  159

Query  140  IDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGC  199
            ID QG   A+  RW  +  ++R   + V+  +V+  IE +   F+ GR  LF  RI EG 
Sbjct  160  IDQQGSGQALRSRWSASFDQVRALPEPVLGPEVVGEIERLTLRFLDGRGPLFTARIDEGR  219

Query  200  IVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYF  259
            IVDGH DLLA+DIF +D  P +LDCLEF++ LR++D +DD AFLAMDLE LG   L + F
Sbjct  220  IVDGHGDLLAEDIFCLDDGPRILDCLEFDERLRFVDGLDDVAFLAMDLERLGAPRLAEVF  279

Query  260  LAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQH  319
            L  Y   +GD AP SL   Y+AYRA VRAKV C+R  QG   AA +A R   +  +HLQ 
Sbjct  280  LHQYTEFTGDPAPPSLWHHYVAYRAFVRAKVACLRRGQGDSGAAWEARRFADLTLRHLQA  339

Query  320  ATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS  379
             TV L LVGG PG GKSTLA  +A+ +G  V+++D VR+ +               G+Y 
Sbjct  340  GTVPLILVGGAPGAGKSTLAAALADRLGYTVLNSDRVRKEMAGISPDQSASAPFGEGIYD  399

Query  380  RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV  439
              +    Y E L +A  LL  G  VILD +WG    RA A R+A  T S +V  RC+A  
Sbjct  400  PEHTERTYDELLSRAGKLLERGEPVILDASWGGAGHRAAADRVAQRTSSDLVALRCTALP  459

Query  440  DVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHI  493
             V A+R+  R G  SDA   I AA+AAR A W     IDT+   E+++ +A  +
Sbjct  460  QVAAERLARRTGAVSDADQAIGAAVAARMAPWPDAVEIDTSASPEQALERALAV  513


>gi|329938537|ref|ZP_08287962.1| hypothetical protein SGM_3454 [Streptomyces griseoaurantiacus 
M045]
 gi|329302510|gb|EGG46401.1| hypothetical protein SGM_3454 [Streptomyces griseoaurantiacus 
M045]
Length=508

 Score =  360 bits (924),  Expect = 3e-97, Method: Compositional matrix adjust.
 Identities = 207/468 (45%), Positives = 270/468 (58%), Gaps = 0/468 (0%)

Query  23   VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH  82
            VR+THTAV+    D  +K KK V   F D+ ++  R  AC RE +LN R A   YLG+  
Sbjct  18   VRKTHTAVLFFMEDHVYKVKKRVDLGFLDYTSSTARRTACEREIDLNRRFAPDVYLGLGE  77

Query  83   LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT  142
            L  P     EP+VVMRR  D  RLA +V  G PV  AL  +A  LA +H  A R   I+ 
Sbjct  78   LRTPGEEEPEPLVVMRRMPDDLRLAHLVGTGAPVGDALRVVARQLAAWHAVAPRGPDIEE  137

Query  143  QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD  202
            QG   A+  RW  + A++     + +  D    IE +V E+++GRE LF  RI++G +VD
Sbjct  138  QGTRDALTSRWESSFAQVDAMTAEGLESDAPAEIERLVREYLAGREPLFDMRIEQGRVVD  197

Query  203  GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG  262
            GH DLLADDIF  D  P +LDCLEF+D LRY+D +DDAAFLAMDLE LG  +   +FLA 
Sbjct  198  GHGDLLADDIFCFDDGPRILDCLEFDDHLRYVDGLDDAAFLAMDLELLGAPESAAFFLAR  257

Query  263  YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV  322
            Y   SGD AP SL   Y+AYRA VRAKV  ++  QG P   A A R + +  +HL+ + V
Sbjct  258  YGEYSGDPAPPSLWHHYVAYRAFVRAKVSMIQARQGAPGTRAAAQRFIAMTLRHLRTSAV  317

Query  323  RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN  382
             L LVGG PGTGKSTL+  +A+ +GA ++S+D +R+ L    V          GLY+   
Sbjct  318  GLTLVGGLPGTGKSTLSGALADRLGAVLLSSDRLRKELAGLPVEQTATAAYGQGLYTPEW  377

Query  383  VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM  442
                Y   L +A  LL  G SV+LD TW DP  R  ARR A    +A+    C    DV 
Sbjct  378  TARTYAALLDRAAALLARGESVVLDATWTDPAQREAARRTAETASAALTALHCHVPRDVA  437

Query  443  ADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA  490
            ADRI+ RA G SDA   +A A+++R+  W     +DT+GP   +V QA
Sbjct  438  ADRILTRAPGASDADIGVADAMSSREPPWSGAVPVDTSGPLGSAVAQA  485


>gi|254381619|ref|ZP_04996983.1| conserved hypothetical protein [Streptomyces sp. Mg1]
 gi|194340528|gb|EDX21494.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length=508

 Score =  350 bits (899),  Expect = 3e-94, Method: Compositional matrix adjust.
 Identities = 204/472 (44%), Positives = 273/472 (58%), Gaps = 0/472 (0%)

Query  19   PFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYL  78
            P  +V ETHTAV+   GDRA+K KKPV   F D+ T   R  AC +E  LN R A   YL
Sbjct  14   PRAEVCETHTAVLFFVGDRAYKVKKPVDLGFLDYTTTAARRAACEQEVALNRRFAPDVYL  73

Query  79   GIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNR  138
            G+     P     EP+VVMRR   ++RL+ +V+ G  V+ AL ++A +LA  H  A R  
Sbjct  74   GLGEFRGPDADTPEPLVVMRRMPAERRLSLLVSQGADVDEALRSVARLLASRHADAPRGP  133

Query  139  CIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEG  198
             ID QG   A++ RW  +  ++R   +     D +   E +V  +++GRE LF  RI++G
Sbjct  134  DIDEQGRRDALSARWEASFTQVRELTEDGRLLDGVAETERLVRRYLAGREELFDVRIEQG  193

Query  199  CIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDY  258
             +VDGH DLLA DIF +D  P +LDCLEF+D LR +D +DDAAFLAMDLE  G  +   +
Sbjct  194  RVVDGHGDLLAQDIFCLDDGPRVLDCLEFDDRLRSVDGLDDAAFLAMDLEQTGAPEAAAF  253

Query  259  FLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQ  318
            FLA Y   SGD AP SL   Y+AYRA VRAKV  ++ +QG   A A A R L    +HL+
Sbjct  254  FLARYGEYSGDPAPPSLWHHYVAYRAFVRAKVSLIQAAQGAHGAEAAARRLLTTTLRHLR  313

Query  319  HATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLY  378
             + V L LVGG PG+GKSTL+  +A+ +G  ++S+D +R+ L               GLY
Sbjct  314  TSAVGLTLVGGLPGSGKSTLSGALADRLGVTLLSSDRLRKELAGMPAEESASAGYGEGLY  373

Query  379  SRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSAT  438
            +       Y E L +A +LL  G SV+LD TW D   RA A R+A  T + +V   C A 
Sbjct  374  TPEWTARTYAELLDRASVLLAMGESVVLDATWSDAGQRAAALRMAERTSADLVALHCQAP  433

Query  439  VDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA  490
             +V A R+  RA G SDAT E+A A+AA +  W+    +DT G  E +V QA
Sbjct  434  GEVSAARLTTRAPGASDATPEVARAMAAVEPPWEEAVPVDTGGSLEAAVIQA  485


>gi|290955585|ref|YP_003486767.1| hypothetical protein SCAB_10221 [Streptomyces scabiei 87.22]
 gi|260645111|emb|CBG68197.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length=501

 Score =  350 bits (899),  Expect = 3e-94, Method: Compositional matrix adjust.
 Identities = 199/466 (43%), Positives = 274/466 (59%), Gaps = 0/466 (0%)

Query  25   ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS  84
            ETHTA+V  AGDRA+K KK V   F D+   + R  AC+RE  LN R A   YLG+  + 
Sbjct  3    ETHTAIVFFAGDRAYKVKKAVDLGFVDYTDRQARRAACVREVALNRRFAPDVYLGVGEVV  62

Query  85   DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQG  144
             P    +EP+VVMRR    +RL+++V AG  V+  L A+A  LA +H  A R R +D QG
Sbjct  63   APDAEVSEPLVVMRRMPAGRRLSALVRAGADVDEVLRAVARRLAAWHATAPRGRDVDEQG  122

Query  145  EVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGH  204
               A+A RW  +  ++R   +     D +  ++ +V  +++GRE LF  RI++  +VDGH
Sbjct  123  TRDALASRWEASFEQVRATTEGGSGFDGVPEVQRLVRRYLAGREALFDSRIEQRRVVDGH  182

Query  205  ADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYA  264
             DLLA+DIF +D  P +LDCLEF+D LRY+D +DDAAFLAMDLE LG       FLA Y 
Sbjct  183  GDLLAEDIFCLDDGPRVLDCLEFDDHLRYVDGLDDAAFLAMDLEQLGAPAAAARFLARYG  242

Query  265  VRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRL  324
              SGD AP SL   Y+AYRA VRAKV  ++  QG P   + A R +    +HL+ + V L
Sbjct  243  EYSGDPAPPSLWHHYVAYRAFVRAKVSLIQAEQGAPGVRSAARRLVSTTLRHLRTSAVGL  302

Query  325  ALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANVV  384
             LVGG PG+GKSTL+  +A+ +G  ++S+D +R+ L      +  P   + GLY+     
Sbjct  303  TLVGGLPGSGKSTLSGALADRLGVTLLSSDRLRKELAGIPPESPAPAAYEEGLYTPEWTA  362

Query  385  AVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVMAD  444
              Y   L +A  LL  G SV+LD TW   ++RA A R+A  T + +V   C    +V A 
Sbjct  363  RTYDILLDRAAALLSRGESVVLDATWSAAELRAAAGRVAERTCADLVALHCQVPDEVAAA  422

Query  445  RIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA  490
            R+  R+ G SDA   +A ALAAR+  W     +DT+GP E +V +A
Sbjct  423  RLSTRSPGPSDADLGVADALAAREPPWPDAVVVDTSGPLESAVSRA  468


>gi|331698605|ref|YP_004334844.1| gluconate kinase [Pseudonocardia dioxanivorans CB1190]
 gi|326953294|gb|AEA26991.1| gluconate kinase [Pseudonocardia dioxanivorans CB1190]
Length=492

 Score =  341 bits (875),  Expect = 2e-91, Method: Compositional matrix adjust.
 Identities = 212/477 (45%), Positives = 270/477 (57%), Gaps = 5/477 (1%)

Query  23   VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH  82
            V ETH  +V+L GDRA+K KKPV T FCDF T+  R  A  RE  LN RLA   YLG A 
Sbjct  13   VHETHVGIVLLVGDRAYKVKKPVRTSFCDFSTSALRRVAIERELRLNRRLAPDVYLGTAR  72

Query  83   LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT  142
            L     G AEPV+VMRR  D +RL+ +V+ G  V   L  +A +LA  H R++R+  I  
Sbjct  73   LD--GAGCAEPVLVMRRMPDDRRLSRLVSDGCDVTDHLRRLARMLAALHARSERSASITA  130

Query  143  QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD  202
                 A+  RW  NLA +       +   ++     +   F++GR  LF  R  EG +VD
Sbjct  131  DASADALLERWRANLAGMEALRGNALDPGLLDGAGRLAARFLAGRRPLFVRRAAEGRVVD  190

Query  203  GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG  262
            GH DLLADD+F +D  P  LDCL+F+D LR++D +DDAAFLAMDLE LG      YFL  
Sbjct  191  GHGDLLADDVFCLDDGPRALDCLDFDDSLRHVDGLDDAAFLAMDLERLGADAAARYFLHA  250

Query  263  YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPE--AAADAVRHLIIATQHLQHA  320
            YA  + D AP +L   Y+AYRA VRA V  VR  Q +P+  A ADA R L +   HL+  
Sbjct  251  YAGFAADPAPDTLVHHYVAYRAGVRATVAAVRHLQ-QPDVGADADAARLLTLTMAHLRAG  309

Query  321  TVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSR  380
              RL LVGG PGTGKSTLA G+A+  GA ++S+D VR+ L      T       SGLY+ 
Sbjct  310  APRLVLVGGLPGTGKSTLAAGLADRTGAVLVSSDRVRKELAGMAPSTSAAAPFGSGLYAP  369

Query  381  ANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVD  440
             +  A Y+E L +A  LL  G SV+LD +W   + R  A  +A D  + +V+ RC A  D
Sbjct  370  EHTGATYRELLTRAAALLSLGESVVLDASWTCARRRTEAAAVADDRAAELVQVRCVAPSD  429

Query  441  VMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWRSA  497
            V A RI AR G  SDAT  +AA +AA    W     +DT  P  R V  A   W +A
Sbjct  430  VAAARIRARHGSASDATPAVAAQMAATVDAWPDALEVDTTAPAARCVDMAREAWDAA  486


>gi|134100921|ref|YP_001106582.1| hypothetical protein SACE_4388 [Saccharopolyspora erythraea NRRL 
2338]
 gi|291003466|ref|ZP_06561439.1| hypothetical protein SeryN2_02972 [Saccharopolyspora erythraea 
NRRL 2338]
 gi|133913544|emb|CAM03657.1| hypothetical protein SACE_4388 [Saccharopolyspora erythraea NRRL 
2338]
Length=495

 Score =  340 bits (872),  Expect = 3e-91, Method: Compositional matrix adjust.
 Identities = 194/474 (41%), Positives = 266/474 (57%), Gaps = 0/474 (0%)

Query  20   FIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLG  79
            ++   ETH   +V AGDR +K KKPV   F DFR  + R  AC RE ELN RLA   YL 
Sbjct  14   YLSTAETHIGALVFAGDRVYKLKKPVDLGFVDFRDRKTRLWACRRELELNRRLAPDVYLD  73

Query  80   IAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRC  139
            +  +  P     + +V+MRR    + LA++V AG PVE  +  +A+ LA FH  A+R   
Sbjct  74   VLDVGPPGHEPCDHLVLMRRMPADRSLAALVEAGTPVEDEVREVAKKLAGFHSCAERGDE  133

Query  140  IDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGC  199
            I  +G   AVA RW+ N+  LR     ++   ++  I      F+ GR  L   R+ EG 
Sbjct  134  IAAEGAPDAVAGRWNTNVDGLRSFGGDLLDEQLLDEIAEHGRVFLEGRAPLLNRRVAEGR  193

Query  200  IVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYF  259
            IVDGH D+L++D+F +D  P +LDC+EF+D LR LD +DDA  LAMDLE+ G  +L + F
Sbjct  194  IVDGHGDVLSEDVFCLDDGPRILDCIEFDDRLRRLDAVDDAVCLAMDLEYRGAPELAERF  253

Query  260  LAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQH  319
            L  YA  S D  P  LR FY AYRA+VR KV C + S G  EAA+DA  HL +A  H++ 
Sbjct  254  LDWYAQFSDDAVPPGLRHFYTAYRALVRTKVACAKHSPGDEEAASDAREHLALAADHIRR  313

Query  320  ATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS  379
            A  RL LVGG PG+GK+TL+  +A+ +GA ++S+D VR+ + D       P    SGLYS
Sbjct  314  AVPRLVLVGGLPGSGKTTLSERIADRLGAVLLSSDRVRKEIADLSPAEPAPAEYRSGLYS  373

Query  380  RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV  439
              +    Y E +R++  LLG G SV+LD +W     R  A   A    + +V  RC A  
Sbjct  374  AEHTERTYDELVRRSGELLGYGESVVLDASWTREHHRERAVEAANQARAIVVPLRCQAPE  433

Query  440  DVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHI  493
                +R+  R    SDA  E+A ++A+   DW +   IDT G  E S  +A  +
Sbjct  434  STTVERLRGRHATASDADEEVARSIASDADDWPSAWPIDTTGSPEDSADEAVAV  487


>gi|21218720|ref|NP_624499.1| hypothetical protein SCO0163 [Streptomyces coelicolor A3(2)]
 gi|5748625|emb|CAB53130.1| conserved hypothetical protein SCJ1.12 [Streptomyces coelicolor 
A3(2)]
Length=508

 Score =  338 bits (868),  Expect = 9e-91, Method: Compositional matrix adjust.
 Identities = 204/468 (44%), Positives = 266/468 (57%), Gaps = 0/468 (0%)

Query  23   VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH  82
            V ETHTA+V    DR +K KKPV   F D+ T   R   C RE ELN R A   YLG+  
Sbjct  18   VCETHTAMVFFVEDRVYKRKKPVDLGFLDYTTRSSRRAVCEREIELNRRFAPDVYLGLGE  77

Query  83   LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT  142
            L  P    AEP+VVMRR  D +RL+ +V  G PV   L A+A  LA +H  A R   I  
Sbjct  78   LRTPGEQEAEPLVVMRRMPDDRRLSHLVRTGAPVADDLRAVARHLAAWHSAAPRGPAIAE  137

Query  143  QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD  202
            QG   A+A RW  +  ++   A K  + D    +E +V  +++GR+ LF  RI++G ++D
Sbjct  138  QGTRDALAARWEASFTQVDVLAAKGPTRDEAGEVERLVRRYLAGRKPLFGLRIEQGRVLD  197

Query  203  GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG  262
            GH DLLADD+F +   P +LDCLEF+D LRY+D +DDAAFLAMDLE LG  +   +FLA 
Sbjct  198  GHGDLLADDVFCLGDGPRILDCLEFDDALRYVDGLDDAAFLAMDLESLGAPESAAFFLAQ  257

Query  263  YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV  322
            Y   SGD AP SL   Y+AYRA VRAKV  ++  QG P A A A R + +A +HL+ + V
Sbjct  258  YGEYSGDPAPPSLWHHYVAYRAFVRAKVSLIQARQGAPGAHATARRLVRMALRHLRASAV  317

Query  323  RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN  382
             L LV G PGTGKSTL+  +A+ +GA ++S+D +R+ +               GLY+   
Sbjct  318  GLTLVAGLPGTGKSTLSGALADRLGAVLLSSDRLRKEMAGLSPQQTASADYGEGLYTPEW  377

Query  383  VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM  442
                Y E L +A  LL  G SV+LD TW D   R  AR  A    + +V   C    DV 
Sbjct  378  TARTYAELLDRAAALLALGESVVLDATWIDSAQREAARHTAESAGADLVALHCHVPDDVT  437

Query  443  ADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA  490
            A R+  RA G SDA   +A A+AA +  W     +DT G  E +VGQA
Sbjct  438  AARLSTRAPGASDADLGVAEAMAAEEQPWSGAVGVDTGGSLEAAVGQA  485


>gi|289774177|ref|ZP_06533555.1| conserved hypothetical protein [Streptomyces lividans TK24]
 gi|289704376|gb|EFD71805.1| conserved hypothetical protein [Streptomyces lividans TK24]
Length=508

 Score =  337 bits (864),  Expect = 3e-90, Method: Compositional matrix adjust.
 Identities = 204/468 (44%), Positives = 266/468 (57%), Gaps = 0/468 (0%)

Query  23   VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH  82
            V ETHTA+V    DR +K KKPV   F D+ T   R   C RE ELN R A   YLG+  
Sbjct  18   VCETHTAMVFFVEDRVYKRKKPVDLGFLDYTTRSSRRAVCEREIELNRRFAPDVYLGLGE  77

Query  83   LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT  142
            L  P    AEP+VVMRR  D +RL+ +V  G PV   L A+A  LA +H  A R   I  
Sbjct  78   LRTPGEQGAEPLVVMRRMPDDRRLSHLVRTGAPVADDLRAVARHLAAWHSAAPRGPAIAE  137

Query  143  QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD  202
            QG   A+A RW  +  ++   A K  + D    +E +V  +++GR+ LF  RI++G ++D
Sbjct  138  QGTRDALAARWEASFTQVDVLAAKGPTRDETGEVERLVRRYLAGRKPLFDLRIEQGRVLD  197

Query  203  GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG  262
            GH DLLADD+F +   P +LDCLEF+D LRY+D +DDAAFLAMDLE LG  +   +FLA 
Sbjct  198  GHGDLLADDVFCLGDGPRILDCLEFDDSLRYVDGLDDAAFLAMDLESLGAPESAAFFLAQ  257

Query  263  YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV  322
            Y   SGD AP SL   Y+AYRA VRAKV  ++  QG P A A A R + +A +HL+ + V
Sbjct  258  YGEFSGDPAPPSLWHHYVAYRAFVRAKVSLIQARQGAPGAHATARRLVRMALRHLRASAV  317

Query  323  RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN  382
             L LV G PGTGKSTL+  +A+ +GA ++S+D +R+ +               GLY+   
Sbjct  318  GLTLVAGLPGTGKSTLSGALADRLGAVLLSSDRLRKEMAGLSPQQTASADYGEGLYTPEW  377

Query  383  VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM  442
                Y E L +A  LL  G SV+LD TW D   R  AR  A    + +V   C    DV 
Sbjct  378  TARTYAELLDRAAALLALGESVVLDATWIDSAQREAARHTAESAGADLVALHCHVPDDVT  437

Query  443  ADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQA  490
            A R+  RA G SDA   +A A+AA +  W     +DT G  E +VGQA
Sbjct  438  AARLSTRAPGASDADLGVAEAMAAEEQPWSGAVGVDTGGSLEAAVGQA  485


>gi|134099015|ref|YP_001104676.1| hypothetical protein SACE_2453 [Saccharopolyspora erythraea NRRL 
2338]
 gi|133911638|emb|CAM01751.1| hypothetical protein SACE_2453 [Saccharopolyspora erythraea NRRL 
2338]
Length=505

 Score =  328 bits (842),  Expect = 1e-87, Method: Compositional matrix adjust.
 Identities = 209/483 (44%), Positives = 271/483 (57%), Gaps = 5/483 (1%)

Query  16   TDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQ  75
            TD     + E+H AVV   GDRA+K KKPV   F DF T + RE AC RE ELN RLA  
Sbjct  18   TDVGAAGIAESHCAVVAFIGDRAYKVKKPVDFGFLDFSTVQARETACRRELELNRRLAPD  77

Query  76   SYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQ  135
             YL +  + D +GG  + +VVMRR    +RL+ +V  G  V   L+ +A +LA FH  A+
Sbjct  78   VYLDLCRVLDGTGGTCDWIVVMRRMPPSRRLSELVRTGADVRPDLEKLARLLASFHSTAR  137

Query  136  RNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRI  195
                +  +G   A+ RRW +N A        V+       I  +   +V GR  L   RI
Sbjct  138  SGPDVAAEGRASALRRRWVDNFAGAERFVGTVLDRGQFDEIVGLALAYVDGRGRLLDERI  197

Query  196  KEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDL  255
              G +VDGH DLLA+DIF +   P +LDCLEF+D LRY+D +DDAAFLAMDLE LG   L
Sbjct  198  GRGYVVDGHGDLLAEDIFCLPDGPRVLDCLEFDDRLRYVDGLDDAAFLAMDLERLGAPRL  257

Query  256  GDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQ  315
               FL  Y   SG     SL   Y AYRA VRAKV C+R +QG  EAA  A +   IA +
Sbjct  258  AHQFLRWYREFSGAQVADSLAHHYTAYRAFVRAKVACLRAAQGAAEAADAAQQLSGIAVR  317

Query  316  HLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDS  375
            HL+   VRL LVGG PGTGK+TLA G+A+ +GA ++ TD +R+ +     +    G    
Sbjct  318  HLRAGQVRLLLVGGLPGTGKTTLAGGLADQLGAVLLRTDVIRKEMPGADDLATHAG-YGQ  376

Query  376  GLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRC  435
            GLY+ + V   Y+  L + R LL  G +V+LD +W     R  AR +A DTHSA+ E RC
Sbjct  377  GLYNGSQVHGTYEAMLTRCRALLERGETVVLDASWSSAGERESARSIAQDTHSALAELRC  436

Query  436  SATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWR  495
             A  +V   RI  R G  SDATA++A A++    DW     +DT  P + S  +A    R
Sbjct  437  VAPREVAEARIAGRYGDVSDATADVAVAMSRHFDDWPQATDVDTTRPPDESAREA----R  492

Query  496  SAI  498
            SAI
Sbjct  493  SAI  495


>gi|291006888|ref|ZP_06564861.1| hypothetical protein SeryN2_20403 [Saccharopolyspora erythraea 
NRRL 2338]
Length=510

 Score =  328 bits (841),  Expect = 1e-87, Method: Compositional matrix adjust.
 Identities = 209/483 (44%), Positives = 271/483 (57%), Gaps = 5/483 (1%)

Query  16   TDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQ  75
            TD     + E+H AVV   GDRA+K KKPV   F DF T + RE AC RE ELN RLA  
Sbjct  23   TDVGAAGIAESHCAVVAFIGDRAYKVKKPVDFGFLDFSTVQARETACRRELELNRRLAPD  82

Query  76   SYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQ  135
             YL +  + D +GG  + +VVMRR    +RL+ +V  G  V   L+ +A +LA FH  A+
Sbjct  83   VYLDLCRVLDGTGGTCDWIVVMRRMPPSRRLSELVRTGADVRPDLEKLARLLASFHSTAR  142

Query  136  RNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRI  195
                +  +G   A+ RRW +N A        V+       I  +   +V GR  L   RI
Sbjct  143  SGPDVAAEGRASALRRRWVDNFAGAERFVGTVLDRGQFDEIVGLALAYVDGRGRLLDERI  202

Query  196  KEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDL  255
              G +VDGH DLLA+DIF +   P +LDCLEF+D LRY+D +DDAAFLAMDLE LG   L
Sbjct  203  GRGYVVDGHGDLLAEDIFCLPDGPRVLDCLEFDDRLRYVDGLDDAAFLAMDLERLGAPRL  262

Query  256  GDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQ  315
               FL  Y   SG     SL   Y AYRA VRAKV C+R +QG  EAA  A +   IA +
Sbjct  263  AHQFLRWYREFSGAQVADSLAHHYTAYRAFVRAKVACLRAAQGAAEAADAAQQLSGIAVR  322

Query  316  HLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDS  375
            HL+   VRL LVGG PGTGK+TLA G+A+ +GA ++ TD +R+ +     +    G    
Sbjct  323  HLRAGQVRLLLVGGLPGTGKTTLAGGLADQLGAVLLRTDVIRKEMPGADDLATHAG-YGQ  381

Query  376  GLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRC  435
            GLY+ + V   Y+  L + R LL  G +V+LD +W     R  AR +A DTHSA+ E RC
Sbjct  382  GLYNGSQVHGTYEAMLTRCRALLERGETVVLDASWSSAGERESARSIAQDTHSALAELRC  441

Query  436  SATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYHIWR  495
             A  +V   RI  R G  SDATA++A A++    DW     +DT  P + S  +A    R
Sbjct  442  VAPREVAEARIAGRYGDVSDATADVAVAMSRHFDDWPQATDVDTTRPPDESAREA----R  497

Query  496  SAI  498
            SAI
Sbjct  498  SAI  500


>gi|269127024|ref|YP_003300394.1| hypothetical protein Tcur_2811 [Thermomonospora curvata DSM 43183]
 gi|268311982|gb|ACY98356.1| conserved hypothetical protein [Thermomonospora curvata DSM 43183]
Length=533

 Score =  323 bits (828),  Expect = 4e-86, Method: Compositional matrix adjust.
 Identities = 192/424 (46%), Positives = 248/424 (59%), Gaps = 0/424 (0%)

Query  23   VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH  82
            V ETHT +V  AG+RA+K KKPV   F D  T  +R R C RE ELN R A   YLG+A 
Sbjct  17   VSETHTGIVFFAGERAYKVKKPVDLGFVDLTTRRERRRVCHREVELNRRFAGDVYLGVAE  76

Query  83   LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT  142
            LS P     EP+VVMRR    +RLA++V A  PVE  L A+A  LA +H +A R   I  
Sbjct  77   LSGPGDEPPEPIVVMRRMPAGRRLATLVAARRPVEEPLRAVARTLAAWHAQAPRGPHISE  136

Query  143  QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD  202
            QG   A+ +RW ++  ++R    + +       IE  V  F++GRE LF  RI+ G IVD
Sbjct  137  QGSRDALRQRWRDSFEQVRPFHGRSIGAAEAAEIEERVLRFLAGREPLFRSRIQAGRIVD  196

Query  203  GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG  262
            GH DL+A DIF +D  P +LDCLEF+D LR+LD +DDA+FLAMDLE LG   L + FL  
Sbjct  197  GHGDLMATDIFCLDDGPRILDCLEFDDRLRWLDGLDDASFLAMDLERLGAPGLAERFLYW  256

Query  263  YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV  322
            YA  + D APASLR  Y+AYRA VRAKV C+R +QG   AAA       +  +HL+   V
Sbjct  257  YAEYAADPAPASLRHHYVAYRAFVRAKVACLRHAQGDAAAAAQIDPLTELTLRHLRAGAV  316

Query  323  RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRAN  382
             L LVGG PGTGKSTLAR + + +G  V+++D VR+ L              +G+YS A+
Sbjct  317  GLILVGGLPGTGKSTLARSLGDRLGCAVLNSDVVRKELAGIPPDQSAAAPYGTGIYSPAH  376

Query  383  VVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVM  442
                Y   L +A  LL  G SV+LD +W   + R  AR LA  TH+ +   RC A   + 
Sbjct  377  TERTYATLLGRAETLLEQGESVVLDASWTVAEHRTLARLLARRTHADLFALRCEAPPALA  436

Query  443  ADRI  446
              R+
Sbjct  437  EQRM  440


>gi|302557027|ref|ZP_07309369.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
 gi|302474645|gb|EFL37738.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
Length=448

 Score =  310 bits (795),  Expect = 3e-82, Method: Compositional matrix adjust.
 Identities = 187/431 (44%), Positives = 249/431 (58%), Gaps = 0/431 (0%)

Query  19   PFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYL  78
            P  ++ ETHTAVV+L G+ A+K KKPV   F D  T   RE AC +E  LN R A   YL
Sbjct  18   PRAEMHETHTAVVLLFGEHAYKIKKPVDLGFLDHTTQAAREAACAQEVALNRRFAEDVYL  77

Query  79   GIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNR  138
            G+  L  P    AEP+VVMRR    +RL+ +V  G  V+  L A+A  LA  H  A R+ 
Sbjct  78   GVGELRMPHTDEAEPLVVMRRMPADRRLSRLVREGADVDDVLRAVARQLAARHADAPRSP  137

Query  139  CIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEG  198
             +D QG   A+  RW  + A++R    +V   D +   E +V  +++GRE LF  RI+EG
Sbjct  138  EVDAQGTRDALLSRWEASFAQVRALDGEVPLPDGLDETERLVRRYLAGREALFDTRIREG  197

Query  199  CIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDY  258
             IVDGH DL+A+D+F +D  P +LDCLEF+D LR++D +DDAAFLAMDLE LG  +   +
Sbjct  198  RIVDGHGDLMAEDVFCLDDGPRILDCLEFDDRLRHVDGLDDAAFLAMDLEQLGVPESAAH  257

Query  259  FLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQ  318
            FLA Y+  SGD AP SL   Y++YRA VRAKV  ++  QG P A A A R      +HL+
Sbjct  258  FLARYSEYSGDPAPPSLWHHYVSYRAFVRAKVSLIQSRQGAPGAGAAARRLATATLRHLR  317

Query  319  HATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLY  378
             + V L LVGG PG+GKSTLA  +A+ +G  ++S+D +R+ L               GLY
Sbjct  318  TSAVGLTLVGGLPGSGKSTLAGALADRLGVTLLSSDRLRKELAGIPAEQSAAAPYGEGLY  377

Query  379  SRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSAT  438
            +       Y E L +A  LL +G SV+LD TW DP  R  A R A  T + +V   C   
Sbjct  378  TPEWTDRTYTELLDRAAALLSAGESVVLDATWSDPGRREAALRTAERTRADLVALHCRVP  437

Query  439  VDVMADRIVAR  449
             +V   R+  R
Sbjct  438  GEVSRARLTTR  448


>gi|336179936|ref|YP_004585311.1| hypothetical protein FsymDg_4119 [Frankia symbiont of Datisca 
glomerata]
 gi|334860916|gb|AEH11390.1| hypothetical protein FsymDg_4119 [Frankia symbiont of Datisca 
glomerata]
Length=539

 Score =  300 bits (768),  Expect = 3e-79, Method: Compositional matrix adjust.
 Identities = 207/471 (44%), Positives = 265/471 (57%), Gaps = 4/471 (0%)

Query  21   IDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGI  80
            ++  ETH+A +  +GDR FK KKP+   F DFRT + RE AC  E ELN RLA   YLG+
Sbjct  59   LETVETHSATLYFSGDRVFKVKKPLDLGFLDFRTRQAREAACRAEVELNRRLAPDVYLGM  118

Query  81   AHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCI  140
            A + D +G   + +VVMRR    +RL+SM++ G  V+G L A+A +LA FHQR   +  I
Sbjct  119  ADIHDNAGTLVDHMVVMRRMPANRRLSSMISMGRRVDGQLRALARLLAAFHQRCPTSPEI  178

Query  141  DTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCI  200
               G    +   W E+L  +   +  ++   VI  +  +   +++GR  L A R + G I
Sbjct  179  AEAGSPATLDGLWQESLTGIAPFSGMLIDSSVIDELGRLAPRYLAGRAALLAERQRAGWI  238

Query  201  VDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFL  260
             DGH DLLADDI+ +D  P +LDC+EF+  LR  D + D AFLAMDLE LG  D  + FL
Sbjct  239  RDGHGDLLADDIYCLDDGPRVLDCIEFDRRLRVGDVLGDIAFLAMDLERLGAADAAERFL  298

Query  261  AGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHA  320
              Y   +G+  P SLR  YIAYRA+VRAKV C+R  QG PEAA  A R   IA  HL+  
Sbjct  299  TWYGEFAGEKHPPSLRHLYIAYRALVRAKVSCIRARQGAPEAARQARRLARIALLHLRRG  358

Query  321  TVRLALVGGNPGTGKSTLA-RGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYS  379
             VRL L+GG PGTGKSTLA R V    G  ++ +D VR+ L      T  P  L  G Y 
Sbjct  359  RVRLVLIGGLPGTGKSTLAGRLVDTEDGWVLLRSDVVRKELAGLPADTQIPAGLFEGHYD  418

Query  380  RANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATV  439
                 A Y E LR+AR  L  G SV+LD +W     RA A  LA  T S +VE RC  + 
Sbjct  419  AQTTDATYTELLRRARHALERGESVVLDASWSTAAHRAAAAALAEQTSSDLVELRCVTSP  478

Query  440  DVMADRIVARAGGN---SDATAEIAAALAARQADWDTGHRIDTAGPRERSV  487
            +V A RI  RA      SDAT  +  A+AAR   W T   I TA P   +V
Sbjct  479  EVAAARIARRAAAGEDPSDATLAVHQAMAARAQPWPTASVIQTAVPISEAV  529


>gi|111223194|ref|YP_713988.1| hypothetical protein FRAAL3784 [Frankia alni ACN14a]
 gi|111150726|emb|CAJ62427.1| conserved hypothetical protein; putative reductase and kinase 
domains [Frankia alni ACN14a]
Length=518

 Score =  286 bits (733),  Expect = 5e-75, Method: Compositional matrix adjust.
 Identities = 195/492 (40%), Positives = 258/492 (53%), Gaps = 38/492 (7%)

Query  22   DVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIA  81
            + RETH+AV+ L  DR +K KKPV   F DF     RE  C+ E  LN RLA   YLG+A
Sbjct  17   ETRETHSAVLYLTADRVYKRKKPVNLGFLDFTDRRTREAVCLAEVALNRRLAPDVYLGVA  76

Query  82   HLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCID  141
             L D SG   + +VVMRR    +RL+++V  G  +  AL ++A  LA FHQR + +  I 
Sbjct  77   DLRDDSGEVIDHLVVMRRMPTSRRLSTLVRRGHLLGPALRSVARALAVFHQRCETSPLIA  136

Query  142  TQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIV  201
            T GE   +   W E +A +  +    +   V+  I  +   +++GR  L A R + G I 
Sbjct  137  TAGEQATLEDLWREGMAGIAAYRGTQLDAAVVDDIGRLALRYLAGRAELLAERTRAGWIR  196

Query  202  DGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLA  261
            DGH DLLADDIF +D  P +LDC+EF+  LR+ D + D AFLAMDLE LG  +    FL 
Sbjct  197  DGHGDLLADDIFCLDDGPRILDCIEFDPRLRFGDVLGDVAFLAMDLERLGAPEEAAEFLE  256

Query  262  GYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHAT  321
             Y   SG+  P SL+ FY+AYRA VRAKV C+R  QG P+AA +A R L +A +HL+   
Sbjct  257  AYREFSGEVHPRSLQHFYVAYRAFVRAKVACIRGGQGDPDAAENARRLLAVAHRHLRAGR  316

Query  322  VRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGV---------  372
            VRL +VGG PGTGK+TLA  +AE+    V+   D+ R+      + GEP           
Sbjct  317  VRLVVVGGLPGTGKTTLASRLAEVGDGWVLLRSDIIRQ-----ELVGEPPAEAPHQQPAA  371

Query  373  ---------------------LDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWG  411
                                    G Y+     A Y E LR+AR  L  G SV+LD +W 
Sbjct  372  DTAPPAADAGADADAGGFDAQFGVGRYAPEITDATYAEMLRRARAALCRGESVVLDASWS  431

Query  412  DPQMRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGN---SDATAEIAAALAARQ  468
              + R  A  +AAD  + +VE  C    +V A RI  R       S+AT  I  A+AAR 
Sbjct  432  SARHRDAAAAVAADVCADLVELHCVTAPEVAAARIARRMAAGPDPSEATVAIHRAMAARA  491

Query  469  ADWDTGHRIDTA  480
              W     I TA
Sbjct  492  DPWPRAAVIRTA  503


>gi|312198390|ref|YP_004018451.1| hypothetical protein FraEuI1c_4588 [Frankia sp. EuI1c]
 gi|311229726|gb|ADP82581.1| hypothetical protein FraEuI1c_4588 [Frankia sp. EuI1c]
Length=509

 Score =  286 bits (731),  Expect = 7e-75, Method: Compositional matrix adjust.
 Identities = 207/494 (42%), Positives = 263/494 (54%), Gaps = 21/494 (4%)

Query  5    TNDGTCDAHPVTDEPFID--------VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAE  56
            T DGT  A      P  D        V ETH+A +    D  +K KKPV   F DF T  
Sbjct  2    TGDGTATAFQARPAPRWDLAALPPGEVVETHSATLTFVDDLVYKVKKPVDLGFLDFSTRA  61

Query  57   QRERACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPV  116
            +R  AC +E  LN RLA   YL +A L D  G   +  VVMRR  +++RL S++  G PV
Sbjct  62   KRLAACEQEVALNRRLAPDVYLAVADLVDDRGRVVDHAVVMRRLPERRRLTSLIQRGQPV  121

Query  117  EGALDAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRI  176
              AL AIA  LA FH     +  I   G    +A  W E + ++  + D ++ G VI  I
Sbjct  122  GAALRAIARRLAAFHAACATSEQIAAAGSSATLAGLWREGVDQVAQYRDNILDGTVIAEI  181

Query  177  EHMVDEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDR  236
            + +   +++GR  L   R+  G + DGH DL ADDIF +   P +LDC+EF+  LR  D 
Sbjct  182  DRLSARYLAGRAPLLRARMAAGLVRDGHGDLQADDIFCLPDGPRILDCIEFDQRLRVGDV  241

Query  237  IDDAAFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFS  296
            + D AFLAMDLE LG +   D  L  Y   +G+T P SL   Y+AYRA VR KV CVR  
Sbjct  242  LGDVAFLAMDLERLGARAEADQLLGWYQEFAGETHPPSLAHLYVAYRAFVRTKVTCVRAG  301

Query  297  QGKPEAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELV-GAQVISTDD  355
            QG P+AA  A R   +A  HL+   VRLALVGG PGTGKSTLA  +A+   G  ++ +D 
Sbjct  302  QGDPDAADLARRLADLALDHLRRGRVRLALVGGLPGTGKSTLAGRLADTEDGWVLLRSDT  361

Query  356  VRRRLRDCGVITGEP-------GVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDG  408
            VR+ L   G+ T  P       G    GLYS A   AVY E LR+A   L  G +V+LD 
Sbjct  362  VRKEL--AGLPTDRPASPKLYEGGPFRGLYSPAATEAVYAELLRRAGHALARGDNVLLDA  419

Query  409  TWGDPQMRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGN---SDATAEIAAALA  465
            +W D   RA A RLAA  H+ ++E RC    +V A RI  RA      SDATA I  ALA
Sbjct  420  SWSDAADRAAAARLAAAAHADLIELRCVTAPEVAAARIARRAAARTDASDATAAIHGALA  479

Query  466  ARQADWDTGHRIDT  479
             R   W +   + T
Sbjct  480  TRADPWPSAAVVHT  493


>gi|158318664|ref|YP_001511172.1| hypothetical protein Franean1_6932 [Frankia sp. EAN1pec]
 gi|158114069|gb|ABW16266.1| conserved hypothetical protein [Frankia sp. EAN1pec]
Length=523

 Score =  283 bits (725),  Expect = 4e-74, Method: Compositional matrix adjust.
 Identities = 188/481 (40%), Positives = 246/481 (52%), Gaps = 15/481 (3%)

Query  25   ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS  84
            ETHTA++ L  DR +K +KPV   F D RT   R  AC  E  LN RLA   YLG+A + 
Sbjct  44   ETHTAILFLTEDRVYKLRKPVDLGFVDLRTRHARLTACEDEVRLNRRLAPDVYLGVADIR  103

Query  85   DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQG  144
            D  G   + +VVMRR    +RL+ +V  G  + G L  IA  +A FH+R + +  I   G
Sbjct  104  DEEGHPRDHMVVMRRMPADRRLSELVRGGADLTGELRVIARTMAAFHERCETSPEISRAG  163

Query  145  EVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGH  204
             +  +   W E +  +      ++    +  I  +   +++GR  L A R + G I DGH
Sbjct  164  GLANLEALWLEAMDAVAPFRGSILDAGTVDEIGRLALRYLAGRAPLLAERQRAGRIRDGH  223

Query  205  ADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYA  264
             DLLADDI+ +D  P +LDC+ F+  LR  D + D AFLAMDLE LG       FL  Y 
Sbjct  224  GDLLADDIYCLDDGPRVLDCINFDRRLRVGDVLADVAFLAMDLERLGAPAAARTFLDAYR  283

Query  265  VRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRL  324
              SG+T PASL   YIAYRA VR ++ C+R  QG PEAA +A R   IA  HL+   VRL
Sbjct  284  EFSGETHPASLEHLYIAYRAFVRVRIACIRDHQGDPEAAEEARRLADIALAHLRRGRVRL  343

Query  325  ALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRR----LRDCGVITGEPGVLDSGLYSR  380
             LVGG PGTGKSTLA G+A+     V+   DV R+    L     +   PG   +G+Y  
Sbjct  344  VLVGGLPGTGKSTLASGLADGQDEWVLLRSDVVRKELAGLAPDIAVDVAPG---AGIYGV  400

Query  381  ANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVD  440
                  Y E + +AR  L  G SV+LD +W     R  A   A  T + + + RC A   
Sbjct  401  EATEHSYAELIARARQALERGQSVVLDASWSSGLFRELAAETAKATGADLAQVRCVAPAP  460

Query  441  VMADRIVAR--------AGGNSDATAEIAAALAARQADWDTGHRIDTAGPRERSVGQAYH  492
            V   RI +R            SDAT  I AA+A R   W     IDT G  E +V  A  
Sbjct  461  VAVARIESRRSMRARGTGADASDATGVIHAAMADRADLWPAAFEIDTTGSVEETVAAARR  520

Query  493  I  493
            +
Sbjct  521  V  521


>gi|319948330|ref|ZP_08022476.1| gluconate kinase [Dietzia cinnamea P4]
 gi|319438012|gb|EFV92986.1| gluconate kinase [Dietzia cinnamea P4]
Length=509

 Score =  277 bits (709),  Expect = 3e-72, Method: Compositional matrix adjust.
 Identities = 196/489 (41%), Positives = 256/489 (53%), Gaps = 28/489 (5%)

Query  25   ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS  84
            ETH+A++ L GD A K +KPV   F D  T   R     RE ELNSRLA   Y G+  + 
Sbjct  19   ETHSALIFLWGDEAHKVRKPVDLGFLDNTTVGARGEQSRREVELNSRLAPDVYRGVLEVR  78

Query  85   DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDA-----------IAEVLARFHQR  133
             P G   + VV MRR   ++ LAS+V   L  E  L             +A  +AR H  
Sbjct  79   GPDGEVVDHVVWMRRLPARRSLASLVR--LRAESGLGGGDTDIVVGVTEVARQIARLHAA  136

Query  134  AQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAG  193
              R+  ID  G   AVA  W  +L  LR       + +++  IE +  +++ GR  L   
Sbjct  137  GPRSEEIDAAGTPAAVAGLWARSLEHLRRLDVGRDAPEIVDDIESLATDYLRGRGPLLES  196

Query  194  RIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRK  253
            R+ +G IVDGH DLLA D++L+D  P ++DCLEF+D LRY D + D  FLAMDL+  G +
Sbjct  197  RVADGRIVDGHGDLLAADVYLLDDGPRVIDCLEFDDLLRYGDAVLDIGFLAMDLDASGAR  256

Query  254  DLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQ-----GKPEAAADAVR  308
            DL    L  Y   SGD AP SL   YI YRA+VR+KV  +R  Q     G    A  A+ 
Sbjct  257  DLAVVLLGAYREASGDDAPPSLVHHYIGYRALVRSKVTAIRAEQAADGDGGRRDARRALE  316

Query  309  HLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITG  368
               +A   L  A VRL LVGG  G+GKSTLA  +AE +GA+++ +D VR  +       G
Sbjct  317  LADLAVDSLLRARVRLVLVGGVSGSGKSTLAAPLAEALGAELLRSDVVRSNVVRAQAARG  376

Query  369  EPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHS  428
                     YS   V AVY E L +A   L  G SV+LD TW +P+ RA A  +AAD H+
Sbjct  377  R------DRYSEEAVGAVYAEMLDRAAASLALGRSVVLDATWLEPRRRAEAETVAADAHA  430

Query  429  AIVEFRCSATVDVMADRIV--ARAGGN-SDATAEIAAALAARQADWDTGHRIDTAGPRER  485
             +VE  C+A  D +  RI   ARAG + S+AT E+  A  AR A W     +DT G   R
Sbjct  431  ELVEISCTAPRDELVRRITDRARAGSDPSEATIEVLDAQLARPAAWPEAIEVDTVGLDVR  490

Query  486  SVGQAYHIW  494
              G+A   W
Sbjct  491  D-GEAVRRW  498


>gi|86740954|ref|YP_481354.1| hypothetical protein Francci3_2257 [Frankia sp. CcI3]
 gi|86567816|gb|ABD11625.1| conserved hypothetical protein [Frankia sp. CcI3]
Length=584

 Score =  270 bits (689),  Expect = 6e-70, Method: Compositional matrix adjust.
 Identities = 195/548 (36%), Positives = 269/548 (50%), Gaps = 66/548 (12%)

Query  1    MDSPTNDGTCDAHP---------VTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCD  51
            M SPTN  +  + P         +++    +  ETH+A + LA DR +K KKPV   F D
Sbjct  24   MLSPTNPPSPASTPAQPRLRALYLSELETFETYETHSATLHLAADRVYKRKKPVNLGFLD  83

Query  52   FRTAEQRERACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVT  111
            F     RE  C  E  LN RLA   YLG+A L D +G   + +VVMRR    +RL+++V 
Sbjct  84   FTDRRTRESVCRSEVALNRRLAPDVYLGVADLLDDTGEVIDHLVVMRRMPASRRLSTLVR  143

Query  112  AGLPVEGALDAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGD  171
                V  AL  +A  LA FHQR + +  I   G+   +   W E L  +  +   ++   
Sbjct  144  RRSRVGPALRTVARALAVFHQRCETSPEIAVAGQRATLEGLWREGLEGISPYRGTLLDAA  203

Query  172  VIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDEL  231
            V+  I  +   +++GRE L   R++ G I DGH DLLADDI+ +   P +LDC+EF+  L
Sbjct  204  VVDEIGELALRYLAGRETLLGDRVRAGWIRDGHGDLLADDIYCLGDGPRILDCIEFDPRL  263

Query  232  RYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVE  291
            R+ D + D AFLAMDLE LG  +    FL  Y   SG+  P SL+  Y+AYRA VRAKV 
Sbjct  264  RFGDVLGDVAFLAMDLERLGAPEEAAEFLDAYREFSGEVHPRSLQHLYVAYRAFVRAKVT  323

Query  292  CVRFSQGKPEAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELV-GAQV  350
            C+R  QG P+AA +A R L +A +HL+   V+L +VGG PGTGK+TLA  +A +  G  +
Sbjct  324  CIRGGQGDPDAAEEARRLLAVAHRHLRAGRVQLVVVGGLPGTGKTTLAGRLAGVGDGWVL  383

Query  351  ISTDDVRRRL------------------------------------RDCGV---ITGEPG  371
            + +D +R+ L                                    RD G     T +P 
Sbjct  384  LRSDVIRQELTGMPLREGGPAADTTAGGYASALRNASGTATRTGARRDAGTGAAATSDPA  443

Query  372  VLD--------------SGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRA  417
              D              +G Y+     A Y E LR+A   L  G  V+LD +W   + R 
Sbjct  444  TSDPADGDPATSDPRFGTGRYAPEITDATYAEMLRRAEAALARGERVVLDASWSSARHRR  503

Query  418  CARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSD---ATAEIAAALAARQADWDTG  474
             A  LAA   + +VE  C    +V A RI  RA   +D   AT  I  A+AAR   W + 
Sbjct  504  AAAELAASVCADLVELHCVTAPEVAAARIGRRAAAGTDPSEATMAIHRAMAARADPWPSA  563

Query  475  HRIDTAGP  482
              + TA P
Sbjct  564  TVVRTAVP  571


>gi|288922596|ref|ZP_06416775.1| conserved hypothetical protein [Frankia sp. EUN1f]
 gi|288346070|gb|EFC80420.1| conserved hypothetical protein [Frankia sp. EUN1f]
Length=516

 Score =  257 bits (657),  Expect = 3e-66, Method: Compositional matrix adjust.
 Identities = 171/411 (42%), Positives = 224/411 (55%), Gaps = 5/411 (1%)

Query  25   ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS  84
            ETHT+V+V  GDR +K KKP    F DFRT E R  AC  E ELN RLA   YLG+A + 
Sbjct  50   ETHTSVLVFLGDRVYKTKKPADLGFLDFRTREARRDACHSEVELNRRLAPDVYLGVADVV  109

Query  85   DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQG  144
             P G   + +VVMRR    +RL+ +V AG  V G L  +A +LA FH R + +  ID   
Sbjct  110  GPDGELCDHLVVMRRLPADRRLSGLVAAGRDVTGELRTVARLLADFHSRCETSAQIDDAA  169

Query  145  EVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGH  204
               A+ R W E +A ++ +   V+    I  I  +   +++GRE L   R + G I DGH
Sbjct  170  SPAALRRLWEEGMAGVQPYVGTVLDPATIDAIGRLATRYLAGREPLLRQRQRRGLIRDGH  229

Query  205  ADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYA  264
             DLLADDI+ +D  P +LDCLEF+  LR  D + D AFLAMDLE LGR DL  +FL  Y 
Sbjct  230  GDLLADDIYCLDDGPRILDCLEFDQRLRVGDILADIAFLAMDLERLGRPDLAAFFLERYR  289

Query  265  VRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRL  324
              + ++ P SL   Y+AYRA VR KV C R +QG   AAA+A     +A   L+   VRL
Sbjct  290  EYAAESHPRSLEHLYVAYRAFVRCKVACTRHAQGDRSAAAEARTLAALALASLRRGRVRL  349

Query  325  ALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANVV  384
             LVGG P +G+S LA  +AE  G  ++  +         G     P   D      A+  
Sbjct  350  VLVGGPPESGRSQLATALAEAEGWTLLRAETTATADTATGAADSGP---DGATGDDAD--  404

Query  385  AVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRC  435
            A Y E LRKAR+    G +V+LD  W   + R  A  LA  T + +V+ RC
Sbjct  405  AGYDELLRKARIAAERGETVVLDAPWALRRDRHRAAALAQATAADLVQLRC  455


>gi|288919770|ref|ZP_06414096.1| conserved hypothetical protein [Frankia sp. EUN1f]
 gi|288348870|gb|EFC83121.1| conserved hypothetical protein [Frankia sp. EUN1f]
Length=543

 Score =  256 bits (653),  Expect = 8e-66, Method: Compositional matrix adjust.
 Identities = 181/494 (37%), Positives = 247/494 (50%), Gaps = 25/494 (5%)

Query  25   ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS  84
            ETHT+++ L  DR +K +KPV   F  FR+ + R+ AC  E  LN RLA   YLG+A + 
Sbjct  48   ETHTSILFLTDDRVYKVRKPVDLGFVSFRSRQARQAACENEVRLNRRLAPDVYLGVADIR  107

Query  85   DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQG  144
            D +G   + +VVMRR    + LA +V  G  V G + AIA  LA FH+R + +  I   G
Sbjct  108  DEAGQMRDHMVVMRRLPAGRCLAELVRGGADVTGEVRAIARQLAAFHERCETSPEISRAG  167

Query  145  EVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGH  204
             V  +   W E +  +      ++   V+  I  +   ++ GR  L   R + G + DGH
Sbjct  168  GVAELEALWLEAMDGVAPFRGSILDAPVVDEIGRLALRYLIGRVPLLVERQRAGRVRDGH  227

Query  205  ADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYA  264
             DLLA+DI+ +D  P +LDC+ F+  LR  D + D AFLAMDLE LG  +    FL  Y 
Sbjct  228  GDLLAEDIYCLDDGPRVLDCINFDHRLRVGDVLADVAFLAMDLERLGAPEAARTFLDAYR  287

Query  265  VRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRL  324
              SG+T PASL   YIAYRA VR ++ C+R  QG+P AA +A R   IA +HL+ A VRL
Sbjct  288  EFSGETHPASLEHLYIAYRAFVRTRIACIRHHQGEPGAADEARRLAAIALRHLRQARVRL  347

Query  325  ALVGGNPGTGKSTLARGVAELVGAQVI------------STDDVRRRLRDCGVITGEPGV  372
             LV G PGTG STLAR +AE  G  V+             T   R    D G        
Sbjct  348  VLVSGLPGTGTSTLARNLAEGEGEWVLLAREDPAGAPGRGTAAQRADGPDRGAAADWGSA  407

Query  373  LDSGLYSRANVVAVY-----QEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTH  427
             D G  +     A +      E + +AR  L  G SV+LD  W   + +      A +T 
Sbjct  408  ADWGSAADWGSAADWGGPGLAELVEQARRALVRGQSVVLDAPWPSRESQDLVAEAADETG  467

Query  428  SAIVEFRCSATVDVMADRIVAR------AGGNSDATAEIAAAL-AARQAD-WDTGHRIDT  479
            + +V  RC A   V   R+ +R        G   + A++AA L AA   D W   H +DT
Sbjct  468  ADLVRLRCVAPPRVAVARVASRQAVQASGSGAEMSHADVAAYLEAATHFDLWPAAHNLDT  527

Query  480  AGPRERSVGQAYHI  493
                + +V  A  I
Sbjct  528  TATIQETVEAARRI  541


>gi|158318668|ref|YP_001511176.1| hypothetical protein Franean1_6936 [Frankia sp. EAN1pec]
 gi|158114073|gb|ABW16270.1| conserved hypothetical protein [Frankia sp. EAN1pec]
Length=494

 Score =  249 bits (635),  Expect = 1e-63, Method: Compositional matrix adjust.
 Identities = 162/412 (40%), Positives = 221/412 (54%), Gaps = 15/412 (3%)

Query  25   ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAHLS  84
            ETHT+V+V  GDR +K KKP    F DFRT E R+ AC  E +LN RLA   YLG+A + 
Sbjct  39   ETHTSVLVFLGDRVYKTKKPADLGFLDFRTREARQAACHAEVDLNRRLAPDVYLGVADVI  98

Query  85   DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQG  144
             P G   + +VVMRR    +RL+++VTAG  V G L A+A +LA FH R   +  I   G
Sbjct  99   GPDGNACDHMVVMRRLPAARRLSALVTAGGDVTGELRAVARLLADFHTRCDTSARITEAG  158

Query  145  EVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGH  204
                +   W E +  ++ +   V+    +  I  +   ++ GR+ L   R   G I DGH
Sbjct  159  SPATLRGLWEEGIRGVQPYLGSVLDASTVDAIGRLAGRYLDGRQPLLRERQHRGLIRDGH  218

Query  205  ADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAGYA  264
             DLLADDI+ +D  P +LDCLEF+  LR  D + D AFLAMDL+ LGR DL  +FL  Y 
Sbjct  219  GDLLADDIYCLDDGPRVLDCLEFDQRLRVGDVLADVAFLAMDLKRLGRPDLASFFLDRYR  278

Query  265  VRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATVRL  324
              S ++ P SL + Y+AYRA VR KV C R +QG   A A+A     +A  +L+H  VRL
Sbjct  279  EYSAESHPRSLENLYVAYRAFVRCKVACTRHAQGDESAGAEARALASLALANLRHGRVRL  338

Query  325  ALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANVV  384
             LVGG   +G+S LA  +A+  G  ++  +         G      G  D G        
Sbjct  339  VLVGGTRDSGRSELAADLADAEGWTLLRAEPT-----GSGPDGASSGTTDLG--------  385

Query  385  AVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCS  436
              Y E LR+A +    G +V+LD  W   + R  A  LA  T + +V+ RC+
Sbjct  386  --YDELLRRAGIAAERGETVVLDAPWTLRRDRDRAAALADATAADLVQLRCA  435


>gi|288919156|ref|ZP_06413494.1| conserved hypothetical protein [Frankia sp. EUN1f]
 gi|288349403|gb|EFC83642.1| conserved hypothetical protein [Frankia sp. EUN1f]
Length=542

 Score =  241 bits (615),  Expect = 2e-61, Method: Compositional matrix adjust.
 Identities = 144/336 (43%), Positives = 192/336 (58%), Gaps = 2/336 (0%)

Query  2    DSPTNDGTCDA--HPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRE  59
            D+PT  GT  A   P        + ETHT+V++  GDR +K KKP    F DFRT + R 
Sbjct  24   DAPTAPGTPTAPGMPGAVATPAALVETHTSVLIFLGDRVYKVKKPADLGFLDFRTRQARL  83

Query  60   RACIREFELNSRLAAQSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGA  119
             AC  E +LN RLA   YLG+A +  P G   + +VVMRR    +RL+++V AG+ V   
Sbjct  84   AACQAEVDLNRRLAPDVYLGVADVQGPDGALCDHMVVMRRLPADRRLSTLVAAGVDVADD  143

Query  120  LDAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHM  179
            L A A +LA FH R + +  I   G    +   W E+L  +  +   V+ G  I  I  +
Sbjct  144  LRATARLLAAFHTRCETSAEIADAGSSATLGGLWEESLRGVEPYLGTVLDGATIDAIGRL  203

Query  180  VDEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDD  239
               F++GRE L   R + G + DGH DLLA DI+ ++  P +LDCLEF+  LR  D + D
Sbjct  204  AARFLAGREPLLRERQRLGLVRDGHGDLLAGDIYCLEDGPRILDCLEFDQRLRVGDVLGD  263

Query  240  AAFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGK  299
              FLAMDLE LGR DL  + LA Y   + ++ P SL D YIAYRA+VR KV C R++QG 
Sbjct  264  IGFLAMDLESLGRPDLAAFLLAHYRQYAAESHPRSLADLYIAYRALVRCKVACTRYAQGV  323

Query  300  PEAAADAVRHLIIATQHLQHATVRLALVGGNPGTGK  335
              AAA+A     +A  HL+   VRL LVGG PG+G+
Sbjct  324  EPAAAEARALAALALSHLRQGRVRLVLVGGAPGSGR  359


>gi|288923414|ref|ZP_06417540.1| conserved hypothetical protein [Frankia sp. EUN1f]
 gi|288345237|gb|EFC79640.1| conserved hypothetical protein [Frankia sp. EUN1f]
Length=517

 Score =  231 bits (590),  Expect = 2e-58, Method: Compositional matrix adjust.
 Identities = 160/434 (37%), Positives = 218/434 (51%), Gaps = 21/434 (4%)

Query  23   VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH  82
            + ET T+V+V  GDR +K KK     F DFRT E R  AC  E  LN RLA   YLG+A 
Sbjct  28   LSETLTSVLVFLGDRVYKIKKTADLGFLDFRTREARLAACQAEVNLNRRLAPDVYLGVAD  87

Query  83   LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT  142
            +  P G   + +VVMRR   ++RL++++ AG  V G L A+A +LA FH RA  +  I  
Sbjct  88   ILGPDGTALDHMVVMRRLPAERRLSALLAAGSDVTGPLRAVARLLADFHARAATSPEITE  147

Query  143  QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD  202
             G    +   W E L  +      V+    I  I H+ + ++ GR  L   R ++G I D
Sbjct  148  AGSTANLRWLWDEVLESIEPFLGPVLDTTTIDAIRHLANRYLDGRAPLLRERQRDGRIRD  207

Query  203  GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG  262
            GH DLLADDI+ +D  P +LDCLEF+  LR  D + D AFLAMDLE LGR DL  + L  
Sbjct  208  GHGDLLADDIYCLDDGPRILDCLEFDRRLRVGDVLSDIAFLAMDLERLGRPDLSRFLLDQ  267

Query  263  YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHATV  322
            Y   +  T P SL   YIAYRA    ++ C +++QG   AAA+A     +A   L+   +
Sbjct  268  YRAYTAVTHPLSLESLYIAYRAFTMCRIACTQYAQGATAAAAEARALASLALASLRRGRI  327

Query  323  RLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCG------------------  364
            RL LVGG   TG+S +A G+AE  G  ++    V R L                      
Sbjct  328  RLILVGGAADTGRSAVAAGLAESEGWTLLRAASVERELAHLAPAGWTDTPTTAATTTATT  387

Query  365  ---VITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARR  421
                 T       +   + A   AV  E LR+AR  +  G +V++D  W     R  A  
Sbjct  388  TTATATAGTATATTTARTAATATAVRDELLRRARTAVERGETVVIDAPWARRHDREQAAA  447

Query  422  LAADTHSAIVEFRC  435
            LA  T + +++ RC
Sbjct  448  LARATFTDLIQLRC  461


>gi|158315848|ref|YP_001508356.1| hypothetical protein Franean1_4063 [Frankia sp. EAN1pec]
 gi|158111253|gb|ABW13450.1| conserved hypothetical protein [Frankia sp. EAN1pec]
Length=575

 Score =  230 bits (587),  Expect = 4e-58, Method: Compositional matrix adjust.
 Identities = 172/447 (39%), Positives = 233/447 (53%), Gaps = 22/447 (4%)

Query  21   IDVRET---HTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSY  77
            +D RET    TAVV L  DRA+K ++ V   F D+R+   R  AC  E  LN RLA   Y
Sbjct  38   LDSRETVQTPTAVVFLTEDRAYKLRRAVNHGFVDYRSRRARLIACEDEVRLNRRLAPDVY  97

Query  78   LGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRN  137
            LG+A + D +G   + +VVMRR    +RL++++TA   V G L  +A+ +A FH+  +  
Sbjct  98   LGVADIRDETGALRDHMVVMRRLPADRRLSALMTAD--VSGELRELAQRIAAFHEGCETT  155

Query  138  RCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKE  197
              I   G + A+   W E +  L     +++    +  I  +   +++GR  L A R   
Sbjct  156  PEITRTGGLCALEALWLEAMDGLAPFRGRILDAATVDEIGRLALRYLTGRGPLLAERQAA  215

Query  198  GCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGD  257
            G I DGH DLLADDI+ ++  P +L+C+  +  LR  D + DAA LAMDLE LG      
Sbjct  216  GRIRDGHGDLLADDIYCLNDGPRVLNCVNVDPALRAGDVLGDAASLAMDLERLGNATAAR  275

Query  258  YFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHL  317
             FL  Y   SG+T P SL D YIAYRAVVRAK  CVR  QG P AA +A R   +A +HL
Sbjct  276  TFLDAYREFSGETHPTSLEDLYIAYRAVVRAKTACVRDHQGDPAAADEARRLTDLALRHL  335

Query  318  QHATVRLALVGGNPGTGKSTLARGVAELVGAQ----VISTDDVRRRLRDCGVITGEPGVL  373
            +    RL LVGG PGTGKSTLA   + LV  +    ++S+  VR      G    E    
Sbjct  336  RRGRPRLILVGGLPGTGKSTLA---SHLVSGEDDWVLLSSAAVRGEPVGAGATAPESAST  392

Query  374  D----------SGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLA  423
                       +G Y        Y E L +AR  L  G SV++D +W   +MRA A  LA
Sbjct  393  SASDSAGTEPAAGCYGADATEHSYVEVLTRARHALERGRSVVIDASWSSRRMRARAAELA  452

Query  424  ADTHSAIVEFRCSATVDVMADRIVARA  450
            A+  + +++ RC     V   RI  RA
Sbjct  453  AECDADLMQLRCVVPPRVAVARIADRA  479


>gi|342857399|ref|ZP_08714055.1| hypothetical protein MCOL_00935 [Mycobacterium colombiense CECT 
3035]
 gi|342134732|gb|EGT87898.1| hypothetical protein MCOL_00935 [Mycobacterium colombiense CECT 
3035]
Length=533

 Score =  229 bits (584),  Expect = 8e-58, Method: Compositional matrix adjust.
 Identities = 176/507 (35%), Positives = 237/507 (47%), Gaps = 28/507 (5%)

Query  17   DEPFIDVR--ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAA  74
            D P  DVR  ETH++ V LAGD A+K KKPV   F DF + E+R   C  E  LN R + 
Sbjct  27   DPPAADVRLHETHSSWVFLAGDYAYKLKKPVNLGFLDFTSIERRRADCEEELRLNRRFSP  86

Query  75   QSYLGIAHLSDPSG--------GHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEV  126
            Q YLG+  +++  G        G  EP V MRR  ++  L + +  G         I   
Sbjct  87   QMYLGVVEVTEGDGHYRIGGETGSGEPAVWMRRLPEEGMLPAKLARGDVDMRLARRIGRT  146

Query  127  LARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSG  186
            LA+ H RA+    ID  G   +V   W EN  ++     + VS DV   I   VD+FV  
Sbjct  147  LAKLHSRAETGADIDVYGRPSSVIANWRENFDQIGPFVGRTVSSDVNEDIRAYVDQFVHE  206

Query  187  REVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMD  246
            R  L   R+ +G + DGH DL A  I + DG+  L D L+F    R  D   + AFLAMD
Sbjct  207  RASLLERRVADGHVRDGHGDLHAASICIDDGQILLFDSLQFAPRYRCADVASEVAFLAMD  266

Query  247  LEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADA  306
             E+ GR DL   F+  Y   SGD   A L DFYI YRA VR KV  +R +Q +  A+   
Sbjct  267  FEYHGRADLAWGFVESYVRASGDDGLAGLLDFYICYRAYVRGKVRSLRLAQAE-HASGGQ  325

Query  307  VRHLIIATQHL-----QHA----TVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVR  357
             R LI  ++        HA       + +  G P +GK+TLAR +A  +G   +S+D  R
Sbjct  326  TRQLIAESRAYFDLAWAHAGGLPRAPMVVTMGLPASGKTTLARALAGRLGLVHLSSDVAR  385

Query  358  RRLRDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDP----  413
            +R+              SGLY  A   + Y    R A   L  G  V++D T+G+P    
Sbjct  386  KRMAGIEPTRHGNDEFGSGLYDPAMTRSTYAALRRDAARWLRRGRGVVVDATFGNPRERS  445

Query  414  QMRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDT  473
            Q++  A RL AD H  + E    AT+    +R     G  SDA  E+   L A     D 
Sbjct  446  QVQQLAHRLGADLHVVLCEAD-DATLMARLERRATEQGVVSDARIELWPELRAAFTPPDE  504

Query  474  GH---RIDTAGPRERSVGQAYHIWRSA  497
                 R+D     E +V Q   + R+A
Sbjct  505  QPSLLRVDATRDMEETVEQTLTLLRAA  531


>gi|254774818|ref|ZP_05216334.1| hypothetical protein MaviaA2_09125 [Mycobacterium avium subsp. 
avium ATCC 25291]
Length=535

 Score =  228 bits (582),  Expect = 2e-57, Method: Compositional matrix adjust.
 Identities = 173/508 (35%), Positives = 241/508 (48%), Gaps = 32/508 (6%)

Query  17   DEPFIDVR--ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAA  74
            D P  D+R  ETH++ V+LAG  A+K KKPV   F DF + EQR   C  E  LN R + 
Sbjct  27   DPPADDLRLHETHSSWVILAGPYAYKLKKPVNLGFLDFTSIEQRRADCDEELRLNRRFSP  86

Query  75   QSYLGIAHLSDPSG--------GHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEV  126
            Q YLG+  +++ +G        G  EP V MRR  +   L + +  G         I   
Sbjct  87   QVYLGVVDITEQNGHYRVGGEAGSGEPAVWMRRLPEDGMLPAKLAGGDVDTRLARRIGRT  146

Query  127  LARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSG  186
            LA+ H RA+    I+  G   +V   W EN  ++     + +S ++   I   V EFV  
Sbjct  147  LAKLHGRAETGPDIEAYGSPSSVIANWQENFDQMGPFIGRTISPEINNEIRSYVQEFVLR  206

Query  187  REVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMD  246
            +  L   R+ EG + DGH DL A  + + DG+  L D L+F    R  D   + +FLAMD
Sbjct  207  QAALLERRVTEGHVRDGHGDLHAASVCIADGQIVLFDSLQFAPRYRCADLASEVSFLAMD  266

Query  247  LEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQ------GKP  300
             E+ GR DL   F+  Y   SGD    SL DFY+ YRA VR KV  +R +Q      G+ 
Sbjct  267  FEYHGRGDLAWAFVDSYVRASGDDELPSLLDFYMCYRAYVRGKVRSLRLAQTEKVPGGEQ  326

Query  301  EA-AADAVRHLIIATQHLQHATVRLALVG-GNPGTGKSTLARGVAELVGAQVISTDDVRR  358
            EA  A++  +  +A  H       L +V  G P +GK+TLAR +A  +G   +S+D  R+
Sbjct  327  EALIAESRGYFDLAWAHAGGLPRPLMVVTMGLPASGKTTLARALAGRLGLVHLSSDVARK  386

Query  359  RLRDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDP----Q  414
            R+              SGLY+ A     Y    R A   L  GH V++D T+G+P    Q
Sbjct  387  RMAGIPPTRRGSDEFGSGLYNPAMTRNTYAALRRDAARWLRRGHGVVVDATFGNPGERAQ  446

Query  415  MRACARRLAADTHSAIVEFRCSATVDVMADRIVARA---GGNSDATAEIAAALAARQADW  471
            +R  A RL  D H  +    C A  D +  R+  RA   G  SDA  E+   L A     
Sbjct  447  LRQLAHRLGVDLHLVL----CDADDDTLIARLKRRATEQGVVSDARIELWPQLRAAFTPP  502

Query  472  D---TGHRIDTAGPRERSVGQAYHIWRS  496
            D   +  R+D     E +V QA  + R+
Sbjct  503  DEQASVLRVDATRDTEETVEQALGLLRA  530


>gi|218781915|ref|YP_002433233.1| gluconate kinase [Desulfatibacillum alkenivorans AK-01]
 gi|218763299|gb|ACL05765.1| gluconate kinase [Desulfatibacillum alkenivorans AK-01]
Length=534

 Score =  226 bits (577),  Expect = 6e-57, Method: Compositional matrix adjust.
 Identities = 151/470 (33%), Positives = 221/470 (48%), Gaps = 10/470 (2%)

Query  21   IDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGI  80
            ++V  TH ++V LAGD AFK KKP+   F DF T E+R++AC  E  LN RLA + YL +
Sbjct  39   VEVHRTHISLVFLAGDFAFKVKKPLDLGFLDFSTLEKRKKACEDELILNRRLAPEIYLAV  98

Query  81   AHLS---------DPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFH  131
              +           PSG   E  V M+R         ++  G   E A++ +  ++A FH
Sbjct  99   VPIFMDGQGALTLSPSGRPVEYAVKMKRLNQCGMFDVLLEQGKLDEKAMEELGGIMANFH  158

Query  132  QRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLF  191
             RA     ++      A+   W E+LA++R H  +V+  + +  +E     FV     L 
Sbjct  159  ARADARPSVNAYAFPEAILNMWAEDLAQVREHIPRVIPPEPMDLVEAFSKSFVQNNAALL  218

Query  192  AGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLG  251
              RI+E  I D H DL   +I L  G+  + DC+EF ++ R +D   + AFLAMDLE  G
Sbjct  219  LERIRENRIRDCHGDLHLQNICLNKGKVVVFDCIEFNEKFRCMDVASEIAFLAMDLECRG  278

Query  252  RKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLI  311
               L   F A Y   + D     L  FY  YRA+VRAK+ C+R + G+P         ++
Sbjct  279  ATALARAFTASYIEHAQDPNLKKLLHFYKCYRALVRAKIMCIR-ANGEPLGDMANQYAML  337

Query  312  IATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPG  371
             A          L  + G  G+GKS +A+ +A L GA V ++D +R+ +         P 
Sbjct  338  AARYAAPFPRPTLICMAGITGSGKSGVAQEMANLTGAAVFASDVIRKTMFGFEPTEKIPE  397

Query  372  VLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIV  431
                 +Y +     VYQ  L +AR  LG G SVILD T+   Q R  A  LA +  +   
Sbjct  398  PAVKEVYGQGASQKVYQSMLDRARENLGEGKSVILDATFTLSQGRKAAYDLARECGANFF  457

Query  432  EFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTAG  481
               CS   D+  +RI  RA      +    A   A++ +W     I  +G
Sbjct  458  LVVCSLPEDIAKERISGRAKDAQSVSDGTLAVYKAQKKEWQPIEGIPESG  507


>gi|269836440|ref|YP_003318668.1| Uma3 [Sphaerobacter thermophilus DSM 20745]
 gi|269785703|gb|ACZ37846.1| Uma3 [Sphaerobacter thermophilus DSM 20745]
Length=538

 Score =  221 bits (564),  Expect = 2e-55, Method: Compositional matrix adjust.
 Identities = 162/477 (34%), Positives = 232/477 (49%), Gaps = 31/477 (6%)

Query  13   HPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRL  72
            HPVT+   I + ETH ++V LA D  +K KKPV   F DF T E+R   C  E  LN RL
Sbjct  27   HPVTE---IAIEETHASIVFLADDLVYKIKKPVDFGFLDFSTLERRRHFCHEEIRLNRRL  83

Query  73   AAQSYLGIAHLSDPSG--------GHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIA  124
            +   YL +  + +  G           E  V M R  D Q L  ++T G   E    A+A
Sbjct  84   SQGVYLDVVPVVEVGGRLQLFGDGPEVEYAVKMNRLPDNQMLNYLITTGTVDERVFPALA  143

Query  125  EVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFV  184
            + LA F++ A     +D  G   A      EN+ + + +   +++    R I+ M   F 
Sbjct  144  DRLAAFYREAATGPGVDEWGTAEAAHFSIRENVEQTQPYVGTIIAPVQHRLIDEMSARFF  203

Query  185  SGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEP---ALLDCLEFEDELRYLDRIDDAA  241
            +    LF  RI  G I +GH DL    I +    P    ++DC+EF   LR  D   D A
Sbjct  204  AEHAELFQQRIAAGRIREGHGDLHLAHICVQGLRPEELQIIDCVEFNPRLRCGDIAVDIA  263

Query  242  FLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPE  301
            FLAMDL++ GR DL    +   A R GD     L  F+  YRA VR KV C R  +  PE
Sbjct  264  FLAMDLDYHGRPDLSRSLVNMLAERLGDDDLPLLVHFFSVYRAHVRNKVACFRLDEIAPE  323

Query  302  ------AAADAVRHLIIATQHL-QHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTD  354
                    ++A R++ +AT +L +     L LVGG  GTGKS +A  +A  +GA + S+D
Sbjct  324  LPEYVAVKSEAERYIDLATSYLVEPERPTLFLVGGLSGTGKSVIAYRLARALGASLSSSD  383

Query  355  DVRRRLRDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQ  414
             VR+ +    V + EP    +G+Y  +     YQE L +AR+ L +G S +LD T+ DP 
Sbjct  384  VVRKEIAGRPVESHEPVPYGTGIYEPSLTARTYQELLDRARVALTAGRSAVLDATFLDPS  443

Query  415  MRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADW  471
             R  AR +AA+    ++   C A   V+ +R+  R+G  +D +          +ADW
Sbjct  444  WREAARDMAAELGVDLLLIECQAPPAVVEERLARRSGLMADPS----------EADW  490


>gi|158313805|ref|YP_001506313.1| hypothetical protein Franean1_1970 [Frankia sp. EAN1pec]
 gi|158109210|gb|ABW11407.1| conserved hypothetical protein [Frankia sp. EAN1pec]
Length=529

 Score =  217 bits (553),  Expect = 3e-54, Method: Compositional matrix adjust.
 Identities = 179/502 (36%), Positives = 254/502 (51%), Gaps = 36/502 (7%)

Query  23   VRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGIAH  82
            V ET +AV+ L GDR +K KKPV     D R+   R  AC  E ELN  LA+  YLG+A 
Sbjct  25   VVETGSAVLCLHGDRVYKRKKPVEPGLLDLRSRAARLAACRAEVELNRWLASDVYLGVAD  84

Query  83   LSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDT  142
            +    G   +  VV+RR    +RL+++V     V+  L A+A  +A FH+R   +  I  
Sbjct  85   VLGDGGEVCDHAVVLRRMPTGRRLSALVRHADRVDDQLRAVARTVAAFHERCGTSEVIGR  144

Query  143  QGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVD  202
             G+  AVA +W E L  L     KV+  DV+  I  +   +++GR  L A R + G I D
Sbjct  145  SGDAEAVAGQWKETLGGLEPFQGKVIDADVVDEIGRLALRYLAGRGPLLAERRRAGRIRD  204

Query  203  GHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKDLGDYFLAG  262
            GH DL A++I  +D  P +L+ +E +  LR  D + D A L MDLE LG  +  +  +  
Sbjct  205  GHGDLRAENIHCLDDGPRILNRVESDPRLRAGDVLGDVAVLVMDLERLGSPEDAERLMRW  264

Query  263  YAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADA-----------VRHLI  311
            Y   S    P SL  FYIAYRA   A+V C+R+ +   EA A+A            R L 
Sbjct  265  YRDFSAQAHPPSLEHFYIAYRAFTEARVTCLRYRRILAEAGAEAGPGPGAEAGERARRLA  324

Query  312  -IATQHLQHATVRLALVGGNPGTGKSTLARGVAEL-VGAQVISTDDVRRRL-----RDCG  364
             IA +HL+ A VRL LVGG PGTGKSTLAR +A+   G  ++ +D VR  L      D  
Sbjct  325  DIAYRHLRRARVRLVLVGGLPGTGKSTLARRLADADDGRLLLRSDAVRAELAADGHADPD  384

Query  365  VITGEPGVLD----------SGLYSRANVVA--VYQEALRKARLLLGSGHSVILDGTWGD  412
                 P + D          S ++  ++ +    Y   L +AR  L  G +VI+D +W D
Sbjct  385  TPGSGPAIPDRPAVPADLGASFIWPLSSEITARTYTVLLSRARRALERGETVIIDASWSD  444

Query  413  PQMRACARRLAADTHSAIVEFRCSATVDVMADRIVAR--AGGNSDATAEIAAALAARQAD  470
             + RA A RLA +T +  +E RC  + +V A R+  R  A   + AT+ +  A+++    
Sbjct  445  GRHRAAAARLARETAAEFLELRCVTSPEVAATRLTRRDSASDPAGATSAVHRAMSSWAEP  504

Query  471  WDTGHRIDTAGPRERSVGQAYH  492
            W T   I T  P    V + +H
Sbjct  505  WPTARVIQTTVP----VAEVFH  522


>gi|254823381|ref|ZP_05228382.1| hypothetical protein MintA_25859 [Mycobacterium intracellulare 
ATCC 13950]
Length=533

 Score =  212 bits (539),  Expect = 1e-52, Method: Compositional matrix adjust.
 Identities = 174/509 (35%), Positives = 239/509 (47%), Gaps = 32/509 (6%)

Query  17   DEPFIDVR--ETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAA  74
            D P  DVR  ETH++ V+LAG  A+K KKPV   F DF + E+R   C  E  LN R + 
Sbjct  27   DPPAHDVRLHETHSSWVLLAGPYAYKLKKPVDLGFLDFTSIERRRADCDEELRLNRRFSP  86

Query  75   QSYLGIAHLSDPSG--------GHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEV  126
            Q YLG+  +++  G        G  EP V MRR  ++  L + +  G         I   
Sbjct  87   QMYLGVVEVTEQDGRYRIGGKSGSGEPAVWMRRLPEEGMLPAKLARGEVDLRLARRIGRT  146

Query  127  LARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSG  186
            LA+ H R +    ID  G   +VA  W EN  ++     + +S DV   I   VDEF+  
Sbjct  147  LAKLHGRTETGPDIDAYGSPSSVAANWQENFDQISPFVGRTISSDVNDHIRRYVDEFLRT  206

Query  187  REVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMD  246
            R  +   R+ +G + DGH DL A  I + DG+  L D L+F    R  D   + AFLAMD
Sbjct  207  RAPVLERRVADGHVRDGHGDLHAASICIDDGQILLFDSLQFAPRYRCADLASEVAFLAMD  266

Query  247  LEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADA  306
            LE+ GR DL   F+  Y   SGD     L DFY  YRA VR KV  +R +Q +  +  D 
Sbjct  267  LEYHGRADLAWGFVDSYVRASGDDGLLDLLDFYACYRAYVRGKVRSLRLAQTEQASGGDN  326

Query  307  VRHLIIATQHL-----QHA----TVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVR  357
             R LI  ++        HA       + +  G P +GK+TLAR +A  +G   +S+D  R
Sbjct  327  -RELIAESRAYFDLAWAHAGGLPRPPMVVTMGLPASGKTTLARALAGRLGLVHLSSDMAR  385

Query  358  RRLRDCGVITGEPGV--LDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDP--  413
            +R+   G+   + G     SGLY  A   + Y    R A   L  G  V +D T+G+P  
Sbjct  386  KRM--AGIEPTQRGSDEFGSGLYDPAMTRSTYAALRRDAARWLRRGRGVAVDATFGNPRE  443

Query  414  --QMRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADW  471
              QMR  A RL AD    + +    AT+    +R     G  SDA  E+   L A     
Sbjct  444  RAQMRQLAHRLGADLRVVLCDAD-DATLIARLERRATEKGVVSDARIELWPELRAAFTPP  502

Query  472  DTGH---RIDTAGPRERSVGQAYHIWRSA  497
            D      R+D     E +V QA  + R++
Sbjct  503  DEQPSVLRVDATRDSEETVEQALALLRAS  531


>gi|186681115|ref|YP_001864311.1| hypothetical protein Npun_F0614 [Nostoc punctiforme PCC 73102]
 gi|186463567|gb|ACC79368.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length=509

 Score =  210 bits (535),  Expect = 4e-52, Method: Compositional matrix adjust.
 Identities = 152/465 (33%), Positives = 228/465 (50%), Gaps = 20/465 (4%)

Query  13   HPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRL  72
            H VT EP I++ +TH + V+L GD A+K KKPV   F DF T E+R+  C  E  LN R 
Sbjct  21   HAVT-EP-IELIQTHVSYVLLTGDYAYKLKKPVNFGFLDFSTLEKRQHFCHEELRLNQRG  78

Query  73   AAQSYLGIAHLS-----DPSGGHAEPV---VVMRRYRDKQRLASMVTAGLPVEGALDAIA  124
            A + YL +  ++        GG  E V   + MR++  +  L+++   G   E  LD + 
Sbjct  79   AGELYLEVLPITLVGEQYQLGGTVEAVEYVLKMRQFPQESLLSTLFEQGKLNEARLDELG  138

Query  125  EVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFV  184
             V+A++H  AQ N  I + GEV  V   + EN  +  ++     + +     +   D+F 
Sbjct  139  RVVAQYHAEAQTNDYIRSFGEVPKVRAAFDENYQQTENYIGGPQTQEQFTETKQYTDKFF  198

Query  185  SGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLA  244
            + R  LFA RI    I + H DL   +I L + +  L DC+EF +  R++D + D AF  
Sbjct  199  AERPELFASRIHNNYIRECHGDLHLRNIALWNDKILLFDCIEFNEPFRFVDVMYDVAFTV  258

Query  245  MDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQG------  298
            MDLE   RKDLG+ FL  Y  ++GD     +   Y++ +A VRAKV              
Sbjct  259  MDLEARQRKDLGNAFLNAYIEQTGDWEGLQVLPLYLSRQAYVRAKVTSFLLDDPGVPAAV  318

Query  299  KPEAAADAVRHLIIATQHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRR  358
            K EA   A  +   A ++ +     L L+ G  G+GKST AR +A   GA  + +D VR+
Sbjct  319  KEEATKTASEYYKQAWEYTKPKVGELILMSGLSGSGKSTTARHLARQQGAIHLRSDAVRK  378

Query  359  RLRDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRAC  418
             L   G+   E G  D  LY+       Y   L    +L   G SVILD  +    +R  
Sbjct  379  HL--GGIPLWEKGGDD--LYTPEMTEKTYTRLLELGIILAKQGFSVILDAKYDKQHLRQE  434

Query  419  ARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAA  463
            A   A      +   +C+A ++V+ +R+  R G  +DATA++ A+
Sbjct  435  AIAQATKHEIPLQIIQCTAPLEVLKERLNNRTGDIADATADLLAS  479


>gi|86741938|ref|YP_482338.1| hypothetical protein Francci3_3252 [Frankia sp. CcI3]
 gi|86568800|gb|ABD12609.1| conserved hypothetical protein [Frankia sp. CcI3]
Length=534

 Score =  208 bits (530),  Expect = 2e-51, Method: Compositional matrix adjust.
 Identities = 173/476 (37%), Positives = 234/476 (50%), Gaps = 11/476 (2%)

Query  15   VTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAA  74
            V   P   V ET  +V+V  GDR FK KKPV     DFR  + R  AC  E  LN RLA 
Sbjct  45   VASGPPARVVETARSVLVFLGDRVFKVKKPVDLGAVDFRGRQARLAACEAEVRLNRRLAP  104

Query  75   QSYLGIAHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRA  134
              YLG+A +  P G   + +VVMRR  + +RL+++   G  V   + A+  VL  FH R 
Sbjct  105  DVYLGVADVIGPDGEPCDHMVVMRRLPEARRLSTLAEGGTEVRAEIHALTRVLVDFHARC  164

Query  135  QRNRCIDTQGEVGAVARRWHENLAELRHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGR  194
            + +  I   G +  +  RW    A ++      VS  ++  +  +   ++ GR+ L   R
Sbjct  165  ETSSRIAEAGGLDRLRGRWDACFARVQRDHGAAVSASILDHVNRLAVRYLDGRDELLRER  224

Query  195  IKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDAAFLAMDLEFLGRKD  254
             + G I DGH DL A DIF +D  P +LDCLEFE  LR  D + DA  LA DLE+LGR+D
Sbjct  225  REAGRIRDGHGDLSAADIFCLDDGPRVLDCLEFEPGLRAADVLADACALAADLEWLGRRD  284

Query  255  LGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIAT  314
            L   FL  Y   +G+T P SL DFY A  A+ R +  C R + G+  AAA+A     +A 
Sbjct  285  LARLFLDHYREMAGETHPRSLEDFYWALAALGRCQAACQRVAAGE-NAAAEARAFADLAL  343

Query  315  QHLQHATVRLALVGGNPGTGKSTLARGVAELVGAQVI---STDDVRRRLRDCGVITGEPG  371
              L+   VRL LVGG  GTGKSTLA G+A      V+             +   +     
Sbjct  344  ARLRWGRVRLVLVGGQRGTGKSTLAGGLAGTERWTVLRFDDAAADLAASANRHDLAAGGW  403

Query  372  VLDSGLYSRANVVAVYQEALRKARLLLGSGHSVILDGTWGDPQMRACARRLAADTHSAIV  431
                G     +V AV+QE LR+A   L  G SV++D  W     RA A  +A    + +V
Sbjct  404  ADAGGWVPADDVDAVHQELLRQAGTALRRGESVVVDAPWNRHSQRAQAADVARRAFADLV  463

Query  432  EFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQAD-------WDTGHRIDTA  480
            + RC+A  D+ A R   R+   + AT+   +    R AD       W     IDTA
Sbjct  464  QLRCTAPPDLAATRTDRRSPATTAATSATGSVGLGRLADTVSRIEPWPEAKIIDTA  519



Lambda     K      H
   0.322    0.136    0.405 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Effective search space used: 1071489888984


  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011  4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318



Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
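
Note on the statistics above: the bit scores and Expect values in the alignment headers can be related to the gapped Lambda and K and to the effective search space through the standard Karlin-Altschul formulas, S' = (Lambda*S - ln K) / ln 2 and E = (search space) * K * exp(-Lambda*S). The sketch below is illustrative only and is not part of the BLAST output. The function names are hypothetical, and the constants are the rounded values printed in this footer. Because these hits use compositional matrix adjustment, the footer parameters may not reproduce every printed value exactly.

```python
import math

# Gapped Karlin-Altschul parameters and effective search space,
# copied from the footer of this report (rounded as printed).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 1071489888984


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to a bit score S'."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
                 lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * k * math.exp(-lam * raw_score)


if __name__ == "__main__":
    # Raw score 587 is the value in parentheses for the
    # Frankia sp. EAN1pec hit (Franean1_4063), reported as 230 bits.
    print(int(bit_score(587)))          # 230
    print("%.0e" % expect_value(587))   # same order as the reported 4e-58
```

For the hits in this section, truncating the computed bit score to an integer matches the printed value (for example 587 gives 230 and 584 gives 229). This is an observation about this report, not a statement of how BLAST formats scores in general.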