BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1375
Length=439
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608515|ref|NP_215891.1| hypothetical protein Rv1375 [Mycoba... 888 0.0
gi|289745125|ref|ZP_06504503.1| conserved hypothetical protein [... 886 0.0
gi|121637305|ref|YP_977528.1| hypothetical protein BCG_1436 [Myc... 885 0.0
gi|31792569|ref|NP_855062.1| hypothetical protein Mb1410 [Mycoba... 881 0.0
gi|340626389|ref|YP_004744841.1| hypothetical protein MCAN_13911... 874 0.0
gi|323720128|gb|EGB29232.1| hypothetical protein TMMG_02076 [Myc... 844 0.0
gi|167968427|ref|ZP_02550704.1| hypothetical protein MtubH3_1048... 821 0.0
gi|308231824|ref|ZP_07413893.2| hypothetical protein TMAG_03379 ... 820 0.0
gi|326902998|gb|EGE49931.1| UPF0142 protein [Mycobacterium tuber... 749 0.0
gi|289761534|ref|ZP_06520912.1| conserved hypothetical protein [... 743 0.0
gi|183982509|ref|YP_001850800.1| hypothetical protein MMAR_2494 ... 587 2e-165
gi|240170138|ref|ZP_04748797.1| hypothetical protein MkanA1_1254... 573 2e-161
gi|240169071|ref|ZP_04747730.1| hypothetical protein MkanA1_0714... 566 3e-159
gi|240167688|ref|ZP_04746347.1| hypothetical protein MkanA1_0013... 521 1e-145
gi|297157814|gb|ADI07526.1| hypothetical protein SBI_04406 [Stre... 213 4e-53
gi|297155643|gb|ADI05355.1| hypothetical protein SBI_02234 [Stre... 203 6e-50
gi|302531858|ref|ZP_07284200.1| hypothetical protein SSMG_08240 ... 197 4e-48
gi|162455406|ref|YP_001617773.1| hypothetical protein sce7124 [S... 159 1e-36
gi|312602564|ref|YP_004022409.1| hypothetical protein RBRH_00235... 157 3e-36
gi|254465070|ref|ZP_05078481.1| YcaO-like family [Rhodobacterale... 156 9e-36
gi|162457357|ref|YP_001619724.1| hypothetical protein sce9072 [S... 154 2e-35
gi|209967026|ref|YP_002299941.1| hypothetical protein RC1_3786 [... 153 4e-35
gi|162448810|ref|YP_001611177.1| hypothetical protein sce0540 [S... 152 9e-35
gi|288962093|ref|YP_003452388.1| ycaO protein [Azospirillum sp. ... 152 1e-34
gi|116754698|ref|YP_843816.1| hypothetical protein Mthe_1401 [Me... 152 1e-34
gi|254512851|ref|ZP_05124917.1| YcaO-like family protein [Rhodob... 152 2e-34
gi|162457038|ref|YP_001619405.1| hypothetical protein sce8753 [S... 150 5e-34
gi|330468892|ref|YP_004406635.1| hypothetical protein VAB18032_2... 149 1e-33
gi|89054752|ref|YP_510203.1| hypothetical protein Jann_2261 [Jan... 149 1e-33
gi|262199465|ref|YP_003270674.1| hypothetical protein Hoch_6310 ... 147 3e-33
gi|288960709|ref|YP_003451049.1| hypothetical protein AZL_a09740... 147 5e-33
gi|163746794|ref|ZP_02154151.1| hypothetical protein OIHEL45_153... 146 6e-33
gi|336035262|gb|AEH81193.1| protein of unknown function DUF181 [... 143 5e-32
gi|13432020|sp|Q52871.1|YTF3_RHILT RecName: Full=UPF0142 protein... 142 8e-32
gi|150378083|ref|YP_001314678.1| hypothetical protein Smed_6106 ... 141 2e-31
gi|338732822|ref|YP_004671295.1| hypothetical protein SNE_A09270... 139 1e-30
gi|209546417|ref|YP_002278307.1| hypothetical protein Rleg2_6037... 137 5e-30
gi|326795593|ref|YP_004313413.1| YcaO-domain protein [Marinomona... 137 5e-30
gi|307352253|ref|YP_003893304.1| methanogenesis marker protein 1... 135 1e-29
gi|330508549|ref|YP_004384977.1| putative methanogenesis marker ... 134 4e-29
gi|116255806|ref|YP_771639.1| hypothetical protein pRL110605 [Rh... 133 5e-29
gi|300863880|ref|ZP_07108801.1| conserved hypothetical protein [... 133 6e-29
gi|336120687|ref|YP_004575473.1| hypothetical protein MLP_50560 ... 133 6e-29
gi|21227560|ref|NP_633482.1| hypothetical protein MM_1458 [Metha... 133 6e-29
gi|325958165|ref|YP_004289631.1| methanogenesis marker protein 1... 132 2e-28
gi|54292916|ref|YP_122303.1| hypothetical protein plpl0009 [Legi... 131 2e-28
gi|147919709|ref|YP_686545.1| hypothetical protein RCIX2079 [unc... 131 2e-28
gi|88601828|ref|YP_502006.1| hypothetical protein Mhun_0527 [Met... 130 5e-28
gi|116255454|ref|YP_771287.1| hypothetical protein pRL110255 [Rh... 130 6e-28
gi|282163224|ref|YP_003355609.1| hypothetical protein MCP_0554 [... 130 7e-28
>gi|15608515|ref|NP_215891.1| hypothetical protein Rv1375 [Mycobacterium tuberculosis H37Rv]
gi|15840833|ref|NP_335870.1| hypothetical protein MT1419 [Mycobacterium tuberculosis CDC1551]
gi|148661166|ref|YP_001282689.1| hypothetical protein MRA_1384 [Mycobacterium tuberculosis H37Ra]
29 more sequence titles
Length=439
Score = 888 bits (2294), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 439/439 (100%), Positives = 439/439 (100%), Gaps = 0/439 (0%)
Query 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH
Sbjct 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
Query 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS
Sbjct 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
Query 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW
Sbjct 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
Query 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME
Sbjct 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
Query 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS
Sbjct 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
Query 301 ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR 360
ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR
Sbjct 301 ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR 360
Query 361 KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA 420
KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA
Sbjct 361 KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA 420
Query 421 PGLVLSSASPMRTPLQEAE 439
PGLVLSSASPMRTPLQEAE
Sbjct 421 PGLVLSSASPMRTPLQEAE 439
>gi|289745125|ref|ZP_06504503.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289685653|gb|EFD53141.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=439
Score = 886 bits (2289), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/439 (99%), Positives = 438/439 (99%), Gaps = 0/439 (0%)
Query 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH
Sbjct 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
Query 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS
Sbjct 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
Query 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW
Sbjct 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
Query 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMD TGLASGNCYDEATLHALYEVME
Sbjct 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDITGLASGNCYDEATLHALYEVME 240
Query 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS
Sbjct 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
Query 301 ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR 360
ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR
Sbjct 301 ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR 360
Query 361 KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA 420
KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA
Sbjct 361 KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA 420
Query 421 PGLVLSSASPMRTPLQEAE 439
PGLVLSSASPMRTPLQEAE
Sbjct 421 PGLVLSSASPMRTPLQEAE 439
>gi|121637305|ref|YP_977528.1| hypothetical protein BCG_1436 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224989780|ref|YP_002644467.1| hypothetical protein JTY_1411 [Mycobacterium bovis BCG str. Tokyo
172]
gi|289442818|ref|ZP_06432562.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
16 more sequence titles
Length=439
Score = 885 bits (2286), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/439 (99%), Positives = 438/439 (99%), Gaps = 0/439 (0%)
Query 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH
Sbjct 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
Query 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
GTHRITSPDETWLALQPFLAPAGIT VADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS
Sbjct 61 GTHRITSPDETWLALQPFLAPAGITRVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
Query 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW
Sbjct 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
Query 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME
Sbjct 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
Query 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS
Sbjct 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
Query 301 ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR 360
ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR
Sbjct 301 ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR 360
Query 361 KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA 420
KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA
Sbjct 361 KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA 420
Query 421 PGLVLSSASPMRTPLQEAE 439
PGLVLSSASPMRTPLQEAE
Sbjct 421 PGLVLSSASPMRTPLQEAE 439
>gi|31792569|ref|NP_855062.1| hypothetical protein Mb1410 [Mycobacterium bovis AF2122/97]
gi|31618158|emb|CAD94271.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
Length=439
Score = 881 bits (2277), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/439 (99%), Positives = 437/439 (99%), Gaps = 0/439 (0%)
Query 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH
Sbjct 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
Query 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
GTHRITSPDETWLALQPFLAPAGIT VADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS
Sbjct 61 GTHRITSPDETWLALQPFLAPAGITRVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
Query 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW
Sbjct 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
Query 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME
Sbjct 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
Query 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS
Sbjct 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
Query 301 ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR 360
ATLEVTFGGFGLHHDPNVALS AITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR
Sbjct 301 ATLEVTFGGFGLHHDPNVALSPAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR 360
Query 361 KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA 420
KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA
Sbjct 361 KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA 420
Query 421 PGLVLSSASPMRTPLQEAE 439
PGLVLSSASPMRTPLQEAE
Sbjct 421 PGLVLSSASPMRTPLQEAE 439
>gi|340626389|ref|YP_004744841.1| hypothetical protein MCAN_13911 [Mycobacterium canettii CIPT
140010059]
gi|340004579|emb|CCC43723.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=439
Score = 874 bits (2257), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/439 (99%), Positives = 435/439 (99%), Gaps = 0/439 (0%)
Query 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
MTG RLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH
Sbjct 1 MTGHRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
Query 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
GTHRITSPDETWLALQPFLAPAGIT VADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS
Sbjct 61 GTHRITSPDETWLALQPFLAPAGITRVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
Query 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW
Sbjct 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
Query 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME
Sbjct 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
Query 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
RHSVAAAVAGETMFEV TDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS
Sbjct 241 RHSVAAAVAGETMFEVRTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
Query 301 ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR 360
ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDL SAIYHRFGRVHTYAKAR
Sbjct 301 ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLASAIYHRFGRVHTYAKAR 360
Query 361 KTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA 420
KTSLRL+RARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA
Sbjct 361 KTSLRLSRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLA 420
Query 421 PGLVLSSASPMRTPLQEAE 439
PGL+LSSASPMRTPLQEAE
Sbjct 421 PGLMLSSASPMRTPLQEAE 439
>gi|323720128|gb|EGB29232.1| hypothetical protein TMMG_02076 [Mycobacterium tuberculosis CDC1551A]
Length=418
Score = 844 bits (2180), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/418 (99%), Positives = 418/418 (100%), Gaps = 0/418 (0%)
Query 22 VGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAP 81
+GSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAP
Sbjct 1 MGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAP 60
Query 82 AGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENV 141
AGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENV
Sbjct 61 AGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENV 120
Query 142 TADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLV 201
TADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLV
Sbjct 121 TADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLV 180
Query 202 NVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDV 261
NVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDV
Sbjct 181 NVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDV 240
Query 262 AGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALS 321
AGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALS
Sbjct 241 AGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALS 300
Query 322 RAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDS 381
RAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDS
Sbjct 301 RAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDS 360
Query 382 LPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE 439
LPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE
Sbjct 361 LPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE 418
>gi|167968427|ref|ZP_02550704.1| hypothetical protein MtubH3_10481 [Mycobacterium tuberculosis
H37Ra]
gi|308369783|ref|ZP_07419041.2| hypothetical protein TMBG_01203 [Mycobacterium tuberculosis SUMu002]
gi|308370703|ref|ZP_07422426.2| hypothetical protein TMCG_01008 [Mycobacterium tuberculosis SUMu003]
14 more sequence titles
Length=406
Score = 821 bits (2121), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/406 (99%), Positives = 406/406 (100%), Gaps = 0/406 (0%)
Query 34 VLSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWL 93
+LSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWL
Sbjct 1 MLSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWL 60
Query 94 DCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDL 153
DCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDL
Sbjct 61 DCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDL 120
Query 154 EADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPM 213
EADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPM
Sbjct 121 EADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPM 180
Query 214 FEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMI 273
FEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMI
Sbjct 181 FEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMI 240
Query 274 RDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRIT 333
RDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRIT
Sbjct 241 RDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRIT 300
Query 334 AISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAV 393
AISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAV
Sbjct 301 AISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAV 360
Query 394 ANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE 439
ANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE
Sbjct 361 ANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE 406
>gi|308231824|ref|ZP_07413893.2| hypothetical protein TMAG_03379 [Mycobacterium tuberculosis SUMu001]
gi|308378923|ref|ZP_07484326.2| hypothetical protein TMJG_03765 [Mycobacterium tuberculosis SUMu010]
gi|308380060|ref|ZP_07488546.2| hypothetical protein TMKG_01879 [Mycobacterium tuberculosis SUMu011]
gi|308215936|gb|EFO75335.1| hypothetical protein TMAG_03379 [Mycobacterium tuberculosis SUMu001]
gi|308358810|gb|EFP47661.1| hypothetical protein TMJG_03765 [Mycobacterium tuberculosis SUMu010]
gi|308362758|gb|EFP51609.1| hypothetical protein TMKG_01879 [Mycobacterium tuberculosis SUMu011]
Length=406
Score = 820 bits (2119), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/406 (99%), Positives = 405/406 (99%), Gaps = 0/406 (0%)
Query 34 VLSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWL 93
+LSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWL
Sbjct 1 MLSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWL 60
Query 94 DCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDL 153
DCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVS VMESLEGWHAENVTADLWSATARDL
Sbjct 61 DCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSVVMESLEGWHAENVTADLWSATARDL 120
Query 154 EADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPM 213
EADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPM
Sbjct 121 EADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPM 180
Query 214 FEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMI 273
FEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMI
Sbjct 181 FEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMI 240
Query 274 RDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRIT 333
RDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRIT
Sbjct 241 RDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRIT 300
Query 334 AISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAV 393
AISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAV
Sbjct 301 AISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAV 360
Query 394 ANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE 439
ANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE
Sbjct 361 ANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE 406
>gi|326902998|gb|EGE49931.1| UPF0142 protein [Mycobacterium tuberculosis W-148]
Length=437
Score = 749 bits (1935), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/372 (99%), Positives = 370/372 (99%), Gaps = 0/372 (0%)
Query 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH
Sbjct 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
Query 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS
Sbjct 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
Query 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW
Sbjct 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
Query 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME
Sbjct 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
Query 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS
Sbjct 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
Query 301 ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR 360
ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR
Sbjct 301 ATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKAR 360
Query 361 KTSLRLNRARPT 372
KTSLRLNRA T
Sbjct 361 KTSLRLNRAADT 372
>gi|289761534|ref|ZP_06520912.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
gi|289709040|gb|EFD73056.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
Length=442
Score = 743 bits (1919), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/448 (87%), Positives = 392/448 (88%), Gaps = 15/448 (3%)
Query 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH
Sbjct 1 MTGRRLARFPAFRAGVAQDDDVGSTLSQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRH 60
Query 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS
Sbjct 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
Query 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW
Sbjct 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
Query 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME
Sbjct 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
Query 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTS 300
RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAG + C T
Sbjct 241 RHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGGRCGPC------PHRCLGTVTTG 294
Query 301 ATLEVTFGGFG-----LHHDPNVALSR---AITEAAQSRITAISGA-REDLPSAIYHRFG 351
E+T G + P R ITE Q S E+LPSAIYHRFG
Sbjct 295 FAAELTLRDAGGDLRRVRGTPPTLTWRYRGRITECGQVAHRGSSAEPAENLPSAIYHRFG 354
Query 352 RVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADA 411
RVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADA
Sbjct 355 RVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADA 414
Query 412 CVPVVKVLAPGLVLSSASPMRTPLQEAE 439
CVPVVKVLAPGLVLSSASPMRTPLQEAE
Sbjct 415 CVPVVKVLAPGLVLSSASPMRTPLQEAE 442
>gi|183982509|ref|YP_001850800.1| hypothetical protein MMAR_2494 [Mycobacterium marinum M]
gi|183175835|gb|ACC40945.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=420
Score = 587 bits (1512), Expect = 2e-165, Method: Compositional matrix adjust.
Identities = 291/404 (73%), Positives = 325/404 (81%), Gaps = 0/404 (0%)
Query 36 SGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDC 95
+GP+WS+WP RVLG ADPTTI HR GTHR SPD+TW A+QP LA AGIT VAD+TWLD
Sbjct 17 NGPDWSHWPVRVLGHADPTTIGHRAGTHRTISPDQTWQAVQPLLAQAGITRVADLTWLDD 76
Query 96 LGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEA 155
LGIPTVQAVRPASLTLSVSQGKA +YRAAQVSAVMESLE WHAENVT + + ARDL
Sbjct 77 LGIPTVQAVRPASLTLSVSQGKATTYRAAQVSAVMESLENWHAENVTPTMLATPARDLTV 136
Query 156 DLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFE 215
+LTYDPA L GSLYH KLDWMVATTLL+GRRT+VPW + +VNVA D W PPMF
Sbjct 137 ELTYDPADLNRPAGSLYHPSAKLDWMVATTLLSGRRTFVPWLSTVVNVAVNDSWGPPMFG 196
Query 216 MDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRD 275
MDTTGLASGN Y EAT+HALYE+MERH +A A G T+F VP +DVA SD A LVEMI
Sbjct 197 MDTTGLASGNSYHEATVHALYEIMERHGMATAEPGSTLFHVPLEDVARSDCAELVEMIHQ 256
Query 276 AGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAI 335
AG +V +ARID WDG+YCFAAELTS LEV F G GLHHDPNVALSRAITEAAQSR+TAI
Sbjct 257 AGSEVQVARIDTWDGFYCFAAELTSPMLEVPFSGSGLHHDPNVALSRAITEAAQSRLTAI 316
Query 336 SGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAVAN 395
SGAREDLPSAIYHRF RVH+YA ++ + A PT W + +SL EL+A+AATAV
Sbjct 317 SGAREDLPSAIYHRFARVHSYAAVHRSMQSMPDAEPTAWHIDYTNSLGELLATAATAVTK 376
Query 396 RSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEAE 439
RSGTEPLAVVC+FADACVPVVKV+APGL S ASPMRTPLQE +
Sbjct 377 RSGTEPLAVVCEFADACVPVVKVIAPGLSASIASPMRTPLQEHQ 420
>gi|240170138|ref|ZP_04748797.1| hypothetical protein MkanA1_12548 [Mycobacterium kansasii ATCC
12478]
Length=418
Score = 573 bits (1477), Expect = 2e-161, Method: Compositional matrix adjust.
Identities = 287/410 (70%), Positives = 324/410 (80%), Gaps = 0/410 (0%)
Query 28 QGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGV 87
+G+ + +GP+WS+WP RVLG ADPTTI +R GTHRI SPD+TW A+QP L AGIT V
Sbjct 7 RGTQQLMKAGPDWSHWPPRVLGHADPTTIGYRAGTHRIISPDQTWQAVQPALERAGITRV 66
Query 88 ADVTWLDCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWS 147
AD+TWLD LGIPTVQAVRPASLTLSVSQGKA +YRAAQVSAVMESLE WH E++T DL S
Sbjct 67 ADLTWLDDLGIPTVQAVRPASLTLSVSQGKATTYRAAQVSAVMESLENWHVESITPDLLS 126
Query 148 ATARDLEADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRD 207
+ DL +LTYDPA+L GS YH G KLDWM+ATTLLTGRRT+VPW A +VNVA D
Sbjct 127 RSTTDLARELTYDPAELNRPAGSFYHPGAKLDWMIATTLLTGRRTFVPWLATVVNVAVSD 186
Query 208 CWEPPMFEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSA 267
W PPMF MDTTGLASGN Y EATLH LYE+MERH +A A G T+FEVP DD A S+ A
Sbjct 187 SWGPPMFGMDTTGLASGNSYHEATLHGLYEIMERHGMATAAPGSTLFEVPLDDAARSECA 246
Query 268 HLVEMIRDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEA 327
LVEMI AG ++ +ARID WDG+YCFAAE+TS E+ F G GLHHDPNVALSRAITEA
Sbjct 247 ELVEMIHRAGSELSVARIDSWDGFYCFAAEITSPMAEIPFSGSGLHHDPNVALSRAITEA 306
Query 328 AQSRITAISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVA 387
AQSR+TAISGAREDLPSAIYHRF RVHTYA AR++ + A TPW + +SL EL+A
Sbjct 307 AQSRLTAISGAREDLPSAIYHRFARVHTYAPARRSMQPMPAAPATPWHIDYSNSLTELLA 366
Query 388 SAATAVANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQE 437
AATAV RSG EPLAVVCDF DACVPVVKV+APGL S SPMRTPLQE
Sbjct 367 LAATAVTVRSGVEPLAVVCDFDDACVPVVKVIAPGLSASIHSPMRTPLQE 416
>gi|240169071|ref|ZP_04747730.1| hypothetical protein MkanA1_07144 [Mycobacterium kansasii ATCC
12478]
Length=416
Score = 566 bits (1458), Expect = 3e-159, Method: Compositional matrix adjust.
Identities = 287/412 (70%), Positives = 322/412 (79%), Gaps = 0/412 (0%)
Query 27 SQGSTTGVLSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITG 86
S + T GP+WS WP+RVLG A+PT+IAHR GT+RI SP++TW A+QP L AGIT
Sbjct 4 SCAAATAASVGPDWSQWPTRVLGHANPTSIAHRAGTYRIMSPEQTWRAVQPMLELAGITR 63
Query 87 VADVTWLDCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLW 146
VAD+TWLD LGIPTVQAVRPAS+TLSVSQGKAA+YRAAQVSAVMESLE WHAENVT DL+
Sbjct 64 VADLTWLDDLGIPTVQAVRPASVTLSVSQGKAATYRAAQVSAVMESLETWHAENVTPDLF 123
Query 147 SATARDLEADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATR 206
S DL A LTYDPA L S+YH G KLDWM ATTLLTGR+TWVPW AVLVN A
Sbjct 124 SMRTTDLAAALTYDPAHLLLSARSIYHPGAKLDWMTATTLLTGRQTWVPWEAVLVNAAVD 183
Query 207 DCWEPPMFEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDS 266
+ W+PPMF MDTTGLASGN Y EA+LH LYEVMERH++AA G T+FEVP DDVA S
Sbjct 184 NRWDPPMFSMDTTGLASGNSYWEASLHGLYEVMERHAMAAGEPGSTLFEVPVDDVADSGC 243
Query 267 AHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITE 326
A LV+MI AG ++ +AR D WDG+ CF AE+ S L V F GFGLHHDPNVALSRAITE
Sbjct 244 AELVDMIYRAGSELKIARTDTWDGFPCFTAEICSPMLGVPFSGFGLHHDPNVALSRAITE 303
Query 327 AAQSRITAISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELV 386
AAQSR+TAISGAREDL A+YHRF RVH Y R T L A PTPW VP DSL +L+
Sbjct 304 AAQSRLTAISGAREDLSPALYHRFARVHAYGPLRPTMRHLPTAEPTPWHVPGTDSLSDLL 363
Query 387 ASAATAVANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSSASPMRTPLQEA 438
ASAATAVA+RSGTEPLAVVCD A +CVPVVKV+APGL S SPMRTPLQE+
Sbjct 364 ASAATAVADRSGTEPLAVVCDLAGSCVPVVKVIAPGLTASHGSPMRTPLQES 415
>gi|240167688|ref|ZP_04746347.1| hypothetical protein MkanA1_00130 [Mycobacterium kansasii ATCC
12478]
Length=415
Score = 521 bits (1341), Expect = 1e-145, Method: Compositional matrix adjust.
Identities = 274/407 (68%), Positives = 310/407 (77%), Gaps = 3/407 (0%)
Query 32 TGVLSGPNWSYWPSRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVT 91
T V GP+WS+WP+R LGSADP I HR GTHR SP+ETW A+QP L+ AGIT VAD+T
Sbjct 9 TAVNFGPDWSFWPNRFLGSADPAVIGHRMGTHRTISPEETWQAVQPLLSAAGITRVADIT 68
Query 92 WLDCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATAR 151
WLD LGIPTVQAVRPASLT+SVSQGKA SYRAAQVSAVMESLE WHAEN TADL A+ +
Sbjct 69 WLDSLGIPTVQAVRPASLTVSVSQGKATSYRAAQVSAVMESLEYWHAENATADLRFASTK 128
Query 152 DLEADLTYDPAQLRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEP 211
DL+++LTYDP L PGS YH G +LDWM ATTLLTGRRTWVPW+ V V+++ D W P
Sbjct 129 DLDSELTYDPGSLSRPPGSFYHRGARLDWMAATTLLTGRRTWVPWSVVAVDISVNDRWGP 188
Query 212 PMFEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVE 271
PMF M T GLASGN Y EA LH LYE+MERH+V AVAG TM+ V D+ G+D A LV+
Sbjct 189 PMFTMHTQGLASGNSYYEAALHGLYEIMERHAVGTAVAGSTMWAVRPPDLDGADCAGLVD 248
Query 272 MIRDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSR 331
+ AG + +AR+DVW GYYCFAAEL S T V F G GLHHDPNVALSRAITEAAQSR
Sbjct 249 QVHRAGSQLRIARLDVWQGYYCFAAELISPTSSVQFAGSGLHHDPNVALSRAITEAAQSR 308
Query 332 ITAISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAAT 391
+TAISG RED+P+ IY R A + + ARPT WRVPD DSLP LVA+AAT
Sbjct 309 LTAISGTREDIPATIYERLAEAPASAARPPARMPV--ARPTSWRVPDTDSLPALVAAAAT 366
Query 392 AVANRSGTEPLAVVCDFADACVPVVKVLAPGLVLSS-ASPMRTPLQE 437
AVA R+G EP AVVCD ACVPVVKV+APGL LSS ASPMRTPLQE
Sbjct 367 AVARRTGIEPAAVVCDSPGACVPVVKVVAPGLSLSSVASPMRTPLQE 413
>gi|297157814|gb|ADI07526.1| hypothetical protein SBI_04406 [Streptomyces bingchenggensis
BCW-1]
Length=407
Score = 213 bits (543), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 146/385 (38%), Positives = 205/385 (54%), Gaps = 30/385 (7%)
Query 52 DPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTL 111
D T A GTHR+ +P ET +QP GIT +ADVTWLD +GIP QAVRP S T+
Sbjct 33 DDTRKACVSGTHRVLTPTETLRRIQPLFPIVGITRLADVTWLDEIGIPVHQAVRPNSRTV 92
Query 112 SVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSL 171
SVSQGK ++ A+VSA MES+E WHAE + +AT D+E Y +L P
Sbjct 93 SVSQGKGITHDLAKVSAAMESIESWHAERIDPGETTATVADMERACGYRVHELALEPRHH 152
Query 172 YHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEAT 231
G++L+W A+ L G +++P + ++ RD W PP+F ++ GLASGN + EA
Sbjct 153 LWPGMELEWTRASRLDDGTDSFLPTDLLRLDGRVRDTWMPPLFAQNSDGLASGNTFAEAA 212
Query 232 LHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSA--HLVEMIRDAGDDVDLARIDVWD 289
LH +YEV+ER +A A + P D+A D L++++ A +V +
Sbjct 213 LHGIYEVIERDCLARAETDPS----PALDLATVDGPAWELLDLMDAAAVEVRVEVPPSPT 268
Query 290 GYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHR 349
G CF A + S V F G G H D +VALSRA+TEAAQSR T I+GAR+DL + Y R
Sbjct 269 GVACFLATIWSEEFPVLFAGAGAHLDRDVALSRALTEAAQSRATQIAGARDDLTTGAYRR 328
Query 350 FGRVHTYAKARKTSLRLNRARPTPWRVPD-----------VDSLPELVASAATAVANRSG 398
V +++ ARP P D ++L + + + T+V + +G
Sbjct 329 A--VSSWS-----------ARPAPLSKADRLTYDEIASVRNETLADDLHTTVTSVLSLTG 375
Query 399 TEPLAVVCDFADACVPVVKVLAPGL 423
PL +PVV+V+ PGL
Sbjct 376 RSPLIADHTRPHLGIPVVRVVCPGL 400
>gi|297155643|gb|ADI05355.1| hypothetical protein SBI_02234 [Streptomyces bingchenggensis
BCW-1]
Length=399
Score = 203 bits (516), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 125/293 (43%), Positives = 164/293 (56%), Gaps = 4/293 (1%)
Query 58 HRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGK 117
H GTHR+ P+ETW + GIT VADVT LD LG+P V AVRPA+ TL+VSQGK
Sbjct 9 HFDGTHRVRHPEETWTLINGLRDRFGITRVADVTGLDTLGVPVVMAVRPAAKTLTVSQGK 68
Query 118 AASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVK 177
AS A+VSAVMES+E WHAE E +L YD L+ GSL
Sbjct 69 GASLLLARVSAVMESVELWHAEYACPAPELKHTPACELELPYDVCDLQQHHGSLLSERTP 128
Query 178 LDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYE 237
LDW++ ++G +T VP V V+ W+PP+ T GLA GN YDEA HALYE
Sbjct 129 LDWVIGVDAVSGTKTLVPRAYVRVDYQVSRAWQPPLLHGSTNGLAGGNTYDEALAHALYE 188
Query 238 VMERHSVAAAVAGETMFEVPTDDVAGSD---SAHLVEMIRDAGDDVDLARIDVWDGYYCF 294
V+ER A + + E D + D A ++ I DAG V++ + G CF
Sbjct 189 VIER-DCTATIGSLPVAERRHVDPSSVDDPLCATVLGRIADAGAWVEIVEVPNRWGLPCF 247
Query 295 AAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIY 347
+ + S G G+H P VALSRA+TE+AQSR+TAI+G+R+DL + ++
Sbjct 248 VSYIWSEDFPALAVGSGVHGSPAVALSRALTESAQSRLTAIAGSRDDLAAVLF 300
>gi|302531858|ref|ZP_07284200.1| hypothetical protein SSMG_08240 [Streptomyces sp. AA4]
gi|302440753|gb|EFL12569.1| hypothetical protein SSMG_08240 [Streptomyces sp. AA4]
Length=401
Score = 197 bits (500), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 148/379 (40%), Positives = 199/379 (53%), Gaps = 11/379 (2%)
Query 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
GTHR +P++TW ++P L G+T VADVT LDC+G+P AVRPAS TLSV+QGK
Sbjct 10 GTHRARAPEDTWALIEPLLPGYGVTRVADVTGLDCIGVPVFLAVRPASETLSVAQGKGHD 69
Query 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGVKLDW 180
A++SAVME+LE HAE+ + +A ARDL DL YD A L R + + LDW
Sbjct 70 PILAKLSAVMETLEQQHAEHPGNERRTALARDL--DLQYDVANLNARVTADAFDLLVLDW 127
Query 181 MVATTLLTGRRTWVPWTAV-LVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVM 239
L +G TW+P V L +TRD W+P F+ + GLASGN +DEA LH LYEV+
Sbjct 128 YRGVGLRSGTPTWIPCDVVDLAFTSTRD-WQPVPFDASSNGLASGNTHDEAVLHGLYEVI 186
Query 240 ERHSVAAAVAGETMFEVPTD--DVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAE 297
ER V+ V D ++ + + DAG ++LA + A
Sbjct 187 ERDVVSTLKEHAPDHRVFLDPRSISSPFCQDTIRRLDDAGVQLELALVPNPYALPVAVAC 246
Query 298 LTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYA 357
+ S G G H DP VA+SRA+TEA Q+R+T I+G R+D+PS I F V
Sbjct 247 IWSQDYPAVCAGAGAHSDPAVAVSRALTEAVQTRLTEITGTRDDIPSEI-DVFSSVCDEP 305
Query 358 KARKTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVK 417
+ T L + A D SL +A+ A V SG EP+ + VVK
Sbjct 306 RFTVTGLDWDLAVEG-LGFQDT-SLSSELATLARRVEAVSGHEPIVLDLSTRPDVFSVVK 363
Query 418 VLAPGL--VLSSASPMRTP 434
V+ PGL +L + P +P
Sbjct 364 VVGPGLRTMLRNDIPRYSP 382
>gi|162455406|ref|YP_001617773.1| hypothetical protein sce7124 [Sorangium cellulosum 'So ce 56']
gi|161165988|emb|CAN97293.1| hypothetical protein sce7124 [Sorangium cellulosum 'So ce 56']
Length=424
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/296 (39%), Positives = 153/296 (52%), Gaps = 18/296 (6%)
Query 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAAS 120
GTHR SP+ET L+P + GIT VADVT LD LG+P V RP + +LSVSQGK +
Sbjct 27 GTHRAVSPEETMARLRPLMPVMGITRVADVTGLDTLGVPVVMVTRPNARSLSVSQGKGLT 86
Query 121 YRAAQVSAVMESLEGWHAENVTADLWSATARDLE-ADLTYDPAQLRHRPGSLYHAGVKLD 179
AA+ S +ME++E WHAE V L T +L D + L S +H ++L
Sbjct 87 LAAARASGLMEAVEHWHAERVQLPLKLGTVNELRFRHRLVDVSALPRLSISAFHDDLRLH 146
Query 180 WMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVM 239
W++ L+ G TWVP+ V + + F M + GLASGN EA HAL E++
Sbjct 147 WVIGMDLVAGAPTWVPFEVVHTDYSLPLLSASGCFVMSSNGLASGNHPLEAISHALCELI 206
Query 240 ERHSVAA-AVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARID--VWD------- 289
ER + +AGE D++ D ++ D + A I+ VWD
Sbjct 207 ERDAATLWWLAGEEHHRRTRIDLSTVDDPSCRALL----DGYERAGIEVYVWDITSDIGV 262
Query 290 -GYYCFAA--ELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDL 342
+YC E GG+G H VALSRA+TEAAQSR+T I+GAR+D+
Sbjct 263 PAFYCTLVDREPNPHRPIAPMGGYGCHPARGVALSRALTEAAQSRLTVITGARDDV 318
>gi|312602564|ref|YP_004022409.1| hypothetical protein RBRH_00235 [Burkholderia rhizoxinica HKI
454]
gi|312169878|emb|CBW76890.1| Hypothetical cytosolic protein [Burkholderia rhizoxinica HKI
454]
Length=409
Score = 157 bits (398), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 120/313 (39%), Positives = 166/313 (54%), Gaps = 34/313 (10%)
Query 54 TTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSV 113
T+ + GT R+ +P+ET +QP LA GIT V DVT LD +GIPT A+RP + LS+
Sbjct 7 TSHEYAQGTQRVCAPEETLRRIQPVLARCGITRVLDVTQLDRIGIPTYNAIRPNGIILSI 66
Query 114 SQGKAASYRAAQVSAVMESLEGWHAENVTADLW----SATARDLEADLTYDPAQL----- 164
S GK S AA VSA+MES+E H+E W SATA E DP L
Sbjct 67 SNGKGWSSAAAAVSAIMESIEVEHSEYPDTSSWRLATSATALRTEGLDPVDPTTLIRDCL 126
Query 165 --RHRPGSLYHA-GVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTT-G 220
+ G LY+ + LDW+ A L++G + +P + + PP + T+ G
Sbjct 127 WPKDEYGGLYYTPELVLDWVEADELISGNKVMIPASTIYAV--------PPFLQYFTSNG 178
Query 221 LASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPT-------DDVAGSDSAHLVEMI 273
LASGN Y EA LHA+ E++ER ++ A + G T P+ D + G L E+I
Sbjct 179 LASGNTYAEAVLHAICEIVERDAI-AKLMGRTKDSPPSRLRPIRLDSLPGH-LVKLAELI 236
Query 274 RDAGDDVDL----ARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQ 329
G ++ L + ID++ + F A + T GG+G H DP +A SRA+TEAAQ
Sbjct 237 TSGGIELFLLSMPSAIDIYTFWTIFYCPGEPAFILSTSGGYGTHPDPVIAASRALTEAAQ 296
Query 330 SRITAISGAREDL 342
+R+ I GAREDL
Sbjct 297 ARLAHIHGAREDL 309
>gi|254465070|ref|ZP_05078481.1| YcaO-like family [Rhodobacterales bacterium Y4I]
gi|206685978|gb|EDZ46460.1| YcaO-like family [Rhodobacterales bacterium Y4I]
Length=401
Score = 156 bits (394), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 107/295 (37%), Positives = 156/295 (53%), Gaps = 25/295 (8%)
Query 62 THRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAASY 121
THR+ P +T ++P LA GIT +A++T LD +G+PTV RP S +++VS GK +
Sbjct 13 THRLCDPAQTLATVRPHLAGMGITRIANLTGLDRVGLPTVMVARPNSRSVAVSLGKGLTL 72
Query 122 RAAQVSAVMESLEGWHAENVTADLWSATARDLEAD-LTYDPAQLRHRPGSLYHAGVKLDW 180
AAQ S VME++E WHAE +T L +A+ DL + L D +L G ++ ++ W
Sbjct 73 EAAQASGVMEAVETWHAERITRSLRAASYADLRQEVLVADVERLPQVTGGTFNPHGRMLW 132
Query 181 MVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVME 240
+ L++G+ W+P V + R C F T GLASGN EAT HA+ E++E
Sbjct 133 VEGLDLVSGQPHWLPLEMVDTDYTARPCGGQGAFPRTTNGLASGNSLAEATCHAICELIE 192
Query 241 RHSVAA---AVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLA--RIDVWD------ 289
R ++ A AG P D A + R+A D + A R +W+
Sbjct 193 RDAITLWHHAPAG------PRIDAAAIEDPR----CREALDRFEAAGLRAGIWNITSDIG 242
Query 290 --GYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDL 342
++C E + + G G H D +AL RA+TEAAQ+R+T ISGAR+DL
Sbjct 243 VAAFHCMICEDGTRPGHIGIGS-GCHPDRGIALLRALTEAAQTRLTYISGARDDL 296
>gi|162457357|ref|YP_001619724.1| hypothetical protein sce9072 [Sorangium cellulosum 'So ce 56']
gi|161167939|emb|CAN99244.1| conserved hypothetical protein (YcaO-like family) [Sorangium
cellulosum 'So ce 56']
Length=427
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 115/320 (36%), Positives = 158/320 (50%), Gaps = 28/320 (8%)
Query 49 GSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPAS 108
G + P +GTHR SP+ET ++ F+ GIT +A+VT LD +GIP V RP S
Sbjct 12 GLSGPEKKRFMNGTHRTASPEETLDRIKGFMPAMGITRIANVTGLDAIGIPVVVVCRPNS 71
Query 109 LTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDL-EADLTYDPAQLRHR 167
+LSVSQGK + AA+VS +MES+E +H EN+ L ++R+L + D + L
Sbjct 72 RSLSVSQGKGLTLAAAKVSGLMESIEAYHGENIVRPLLLGSSRELRRSHAIADVSALPRT 131
Query 168 PGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCY 227
+ L W L+ G WVP+ V VN P +F T GL+SGN
Sbjct 132 SSVPFDEDTPLLWAEGYDLMRGAPVWVPYELVHVNATATGRVNPGIFCCSTNGLSSGNGL 191
Query 228 DEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHL-VEMIRDAGDDVDLARI- 285
EA + + EV+ER + A ++ TD+ D+ L ++ I D G LA+
Sbjct 192 LEAVSYGICEVVERDATA-------VWGALTDE--ERDARRLDLDSIDDPGCREVLAKFA 242
Query 286 ------DVWD--------GYYCFAAELTSATLEVTF--GGFGLHHDPNVALSRAITEAAQ 329
W+ Y C AE T + GG G H VAL RA+TEAAQ
Sbjct 243 AAGVAVGAWETTSDVGIPSYECLIAERTEDAVRALHGSGGQGCHPSRAVALLRALTEAAQ 302
Query 330 SRITAISGAREDLPSAIYHR 349
+R+T ISGAR+DL A Y R
Sbjct 303 TRLTVISGARDDLLRAEYDR 322
>gi|209967026|ref|YP_002299941.1| hypothetical protein RC1_3786 [Rhodospirillum centenum SW]
gi|209960492|gb|ACJ01129.1| conserved hypothetical protein [Rhodospirillum centenum SW]
Length=414
Score = 153 bits (387), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 137/401 (35%), Positives = 185/401 (47%), Gaps = 31/401 (7%)
Query 50 SADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASL 109
SADP I RI +ET L+ FL GIT VA +T LD +GIP V RP S
Sbjct 8 SADP--ILQADAWRRIVPAEETVARLKRFLPMFGITRVATLTGLDTVGIPVVMVNRPNSR 65
Query 110 TLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDL-EADLTYDPAQLRHRP 168
+L+VSQGK + AA+ S +MES+E WHAE + L + DL + DP +L
Sbjct 66 SLAVSQGKGVTLAAAKASGLMESVEAWHAERIVQPLKIGSFEDLCYSHAMVDPDRLPRLS 125
Query 169 GSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYD 228
S Y ++ W+ +L R WVP+ V N F+ +T GLASGN
Sbjct 126 SSRYTPHTQMLWIEGRSLTRDRSVWVPYEMVHTNYTLPLPSGHGCFQANTNGLASGNHPL 185
Query 229 EATLHALYEVMER------HSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDL 282
EA +H L E++ER H E ++ T VA L+ AG +V +
Sbjct 186 EAVIHGLCELIERDALTLWHQKPEEAQDEDRLDLET--VADPVCRDLIGRFARAGVEVGV 243
Query 283 ARIDVWDGYYCFAAELTSATLEVTFG-----GFGLHHDPNVALSRAITEAAQSRITAISG 337
I G F + A E G G G H +AL+RA+TEAAQSR+T ISG
Sbjct 244 WEITSDIGVPTFLCRIVQAEGEHATGIRPAIGCGTHLVREIALARALTEAAQSRLTFISG 303
Query 338 AREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLP-------ELVASAA 390
AR+D+ Y R A R R+ P + D +++P A
Sbjct 304 ARDDMARVDYERM---LDPALQRTWLARIRHGAP----MRDFNAVPVWGGRSLRNDLDAL 356
Query 391 TAVANRSGT-EPLAVVCDFADACVPVVKVLAPGLVLSSASP 430
A +R+G EP+ V + +PVV+VLAPGL +SP
Sbjct 357 LARLDRAGIEEPVVVDLTRRELGIPVVRVLAPGLEGVDSSP 397
>gi|162448810|ref|YP_001611177.1| hypothetical protein sce0540 [Sorangium cellulosum 'So ce 56']
gi|161159392|emb|CAN90697.1| hypothetical protein sce0540 [Sorangium cellulosum 'So ce 56']
Length=424
Score = 152 bits (385), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 134/395 (34%), Positives = 189/395 (48%), Gaps = 27/395 (6%)
Query 59 RHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKA 118
R GTHR+ P ET L+P L GIT VA+VT LD LGIP V RP + +LSVSQGK
Sbjct 23 RDGTHRLVPPAETVERLRPLLPALGITRVANVTGLDILGIPVVMVCRPNARSLSVSQGKG 82
Query 119 ASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLR---HRPGSLYHAG 175
AA+ S +ME+ E +HAE +T+ L + +L T+ A +R R S +H
Sbjct 83 VDLAAAKASGIMEATELYHAERITSPLKLGSLEELR--FTHRLADVRLLPQRAFSTFHPS 140
Query 176 VKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHAL 235
L W+ A + WVP+ V N F +TGLASGN EA H +
Sbjct 141 APLLWIEALDWMRSEPLWVPFELVHTNYTLPLPTGSGAFLTSSTGLASGNHPLEAVSHGI 200
Query 236 YEVMERHS------VAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDV---DL-ARI 285
E +ER + + T ++ T D AG + L++ AG DV D+ + I
Sbjct 201 CEAVERDAGTLWSLLDGGSRRATRLDLATVDDAGCRT--LLDRCERAGLDVAAWDIRSDI 258
Query 286 DVWDGYYCFAAELTSATLEVTF--GGFGLHHDPNVALSRAITEAAQSRITAISGAREDLP 343
D+ + C AE + L + G G H VALSRA+TEA QSR+T ISG+R+D+
Sbjct 259 DI-AAFRCMIAERSPGGLSSLYPAAGMGCHPAREVALSRALTEAVQSRMTMISGSRDDMS 317
Query 344 SAIYHRFGRVHTYAKA---RKTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTE 400
A Y R + + + R + P R ++ + E + + +G E
Sbjct 318 RADYERRLDPELHRRVLQDMRDGAPGRRFQDVPTR--EITTFEEDIRWELEQLRT-AGIE 374
Query 401 PLAVV-CDFADACVPVVKVLAPGLVLSSASPMRTP 434
+AVV A+ +PVV+V+ PGL P P
Sbjct 375 QVAVVDLTKAEIGIPVVRVVIPGLETIGGLPGYVP 409
>gi|288962093|ref|YP_003452388.1| ycaO protein [Azospirillum sp. B510]
gi|288914359|dbj|BAI75844.1| ycaO protein [Azospirillum sp. B510]
Length=423
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 131/406 (33%), Positives = 184/406 (46%), Gaps = 53/406 (13%)
Query 57 AHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQG 116
AH GTHR+ +P++T + PFL GIT VA+VT LD +GIP V RP S ++SVSQG
Sbjct 18 AHTVGTHRVMAPEQTLARVAPFLPIMGITRVANVTGLDAVGIPVVMVTRPNSRSISVSQG 77
Query 117 KAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLE-ADLTYDPAQLRHRPGSLYHAG 175
K + AA+ S VMES+E +HAE +T L A+ +L + +L +
Sbjct 78 KGVTLAAAKASGVMESIESYHAERITLPLKFASFEELRWTHPVVNVDRLPRLSTGSFDPN 137
Query 176 VKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHAL 235
+ W+ LL+G WVP+ V +N F + GLASGN EA HAL
Sbjct 138 RPILWIEGQDLLSGGPKWVPFEMVHLNFTVPMAPGHGAFLAGSNGLASGNHRVEAISHAL 197
Query 236 YEVMERHSVA----AAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWD-- 289
E++ER + A + + D ++ L++ G + VW+
Sbjct 198 TELVERDATTLWRLKGPASQAATRIDLDSISDPVCRSLIDRFEAVG-----VAVGVWETT 252
Query 290 ------GYYCFAAE---LTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGARE 340
+ C E L ++ G G H +ALSRA+TEAAQSR+T I+GAR+
Sbjct 253 SDVGLPAFLCRIVESEDLPQHSIRPA-TGMGCHVAREIALSRALTEAAQSRLTFIAGARD 311
Query 341 DLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVD-----SLPELVASAATAVAN 395
D+P A Y R L+ A WR VD S SAA +
Sbjct 312 DMPRAEYER---------------HLDPAHHARWRAMIVDGAGRRSFHHCPTSAAATIEG 356
Query 396 ---------RSGTEPLAVVCDFA--DACVPVVKVLAPGLVLSSASP 430
R+ AVV D + +PVV+V+ PGL + SP
Sbjct 357 DLAHQLDRLRAVGIEEAVVVDLTKPEFGIPVVRVVVPGLEGADESP 402
>gi|116754698|ref|YP_843816.1| hypothetical protein Mthe_1401 [Methanosaeta thermophila PT]
gi|116666149|gb|ABK15176.1| uncharacterized domain protein [Methanosaeta thermophila PT]
Length=403
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 140/400 (35%), Positives = 193/400 (49%), Gaps = 51/400 (12%)
Query 58 HRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLT--LSVSQ 115
++ THR P+ET ++ + AGIT VAD+T LD +GIP ++RP + +SV
Sbjct 10 YKKDTHRALPPEETLEIVEKKMPAAGITRVADITNLDRIGIPVFTSIRPTAEKGAISVYN 69
Query 116 GKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAG 175
GK A+ A+VSA+ME +E + AE ADL +A +L + +PA+L P
Sbjct 70 GKGATPTEAKVSAIMEGIERYSAEVRNADLRTARFSELREN-ALNPAEL-ILPRDADPDA 127
Query 176 VKLDWMVATTLLTGRRTWVPWTAV---LVNVATRDCWEPPMFEMDTTGLASGNCYDEATL 232
V + W+ L+ VP AV L + TR +F +TTGLASGN +EA
Sbjct 128 V-IPWVTGYDLMGDEEILVPANAVFHPLPSSYTR------LFRTNTTGLASGNQLEEAIF 180
Query 233 HALYEVMERHSVAAAVAGETM---FEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWD 289
H L EV+ER + + A +M D +AG L+EM + A V + I
Sbjct 181 HGLAEVVERDAWSIAEHARSMGPLLRYNGDGLAG----ELLEMFQRAEVQVYVRDITSDV 236
Query 290 GYYCFAAELTSATLE---VTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAI 346
G FAA L+ + G G H DP VAL RA+TE AQSR+T I GARED SA
Sbjct 237 GVPTFAAVSDDVKLKDPALLTAGMGTHTDPEVALLRALTEVAQSRLTQIHGAREDTVSA- 295
Query 347 YHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPE------------LVASAATAVA 394
F R+ Y + + RLNR R D SL ++ TA
Sbjct 296 --EFRRMMGYDRLK----RLNRHWFEYEREEDFSSLNSYNTDDFLDDIRYMLDRLQTAGF 349
Query 395 NRSGTEPLAVVCDF--ADACVPVVKVLAPGLVLSSASPMR 432
R A+V D ++ VPVV+V+ PGL +S+ P R
Sbjct 350 ER------AIVVDLTASEIMVPVVRVIVPGLEISAVDPER 383
>gi|254512851|ref|ZP_05124917.1| YcaO-like family protein [Rhodobacteraceae bacterium KLH11]
gi|221532850|gb|EEE35845.1| YcaO-like family protein [Rhodobacteraceae bacterium KLH11]
Length=450
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/354 (33%), Positives = 172/354 (49%), Gaps = 24/354 (6%)
Query 6 LARFPAFRAGVAQDDDV-GSTLSQGSTTGVLSGPNWSYWPSR--------VLGSADPTTI 56
L +RAG+A+ D G L+ G ++ +L+ + + P R V+G + +
Sbjct 2 LGHVDKWRAGLAKADHTKGVLLTLGVSSRILTWGGF-FAPVRLAYQIFGKVIGMSGNSQK 60
Query 57 AHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQG 116
+ THR+ P++T ++P+L GIT +A++T LD +G+PTV RP S +++VS G
Sbjct 61 GYVLDTHRLRDPEQTLAIVKPYLKQMGITRIANLTGLDRVGLPTVMVTRPNSRSVAVSLG 120
Query 117 KAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAGV 176
K S AA+ S VME++E WHAE + L A DL D D ++L G +
Sbjct 121 KGLSLSAAKASGVMEAIESWHAERIELPLRLANHVDLAGDHVVDVSRLPRVTGGQFDPHC 180
Query 177 KLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALY 236
+ W+ L + + WVP+ V + T F T GLASGN EA+ HA+
Sbjct 181 AILWVQGRDLPSDQPCWVPYEMVDTDYTTSPAAGQRAFPRTTNGLASGNDVTEASCHAIC 240
Query 237 EVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWD------- 289
E++ER + V V +E I AG D + +W+
Sbjct 241 ELIERDATTLWHHRSDTPRVDPLTVDDPRCRQAIEQIMAAGLD-----LGIWNTTSDVGI 295
Query 290 -GYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDL 342
+ C E AT + G G H D +AL RA+TEAAQ+R+T ISGAR+DL
Sbjct 296 ASFRCAICEAGGATGHIGIGD-GCHPDRAIALLRALTEAAQTRLTYISGARDDL 348
>gi|162457038|ref|YP_001619405.1| hypothetical protein sce8753 [Sorangium cellulosum 'So ce 56']
gi|161167620|emb|CAN98925.1| hypothetical protein sce8753 [Sorangium cellulosum 'So ce 56']
Length=404
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 116/307 (38%), Positives = 150/307 (49%), Gaps = 26/307 (8%)
Query 54 TTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSV 113
T A+ GT R SP +T ++P L GIT +ADVT LD +GIP V RP + ++SV
Sbjct 4 TEKAYWRGTQRRISPADTLARVRPLLRRLGITRIADVTGLDSIGIPVVMVCRPNARSISV 63
Query 114 SQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTY-DPAQLRHRPGSLY 172
SQGK AA+ S VMES+E WHAE++ + TA +L A D A L +
Sbjct 64 SQGKGLDLEAARASGVMESIEQWHAEHILRPMVFGTAAELAATRRLVDLAGLPRLAIGAF 123
Query 173 HAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPP---MFEMDTTGLASGNCYDE 229
KL W+ L G +P V + + PP F +TGLASGN E
Sbjct 124 QPHRKLLWLDGVDLFDGAPRALPLEVVTTDYTSP---RPPGSGCFLSTSTGLASGNDALE 180
Query 230 ATLHALYEVMERHSVAAAVAG----ETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARI 285
ATLH LYEV+ER +VA AG + D V D L+ AG +
Sbjct 181 ATLHGLYEVIERDAVAIWRAGGAEVRRRTRIALDTVDDLDCRALLRRFERAG-----VAV 235
Query 286 DVWD-----GYYCFAAELTSATLE-----VTFGGFGLHHDPNVALSRAITEAAQSRITAI 335
WD G AE+ + GG G H +AL+RA+TEAAQSR+TAI
Sbjct 236 GAWDATSDIGLPVVVAEIADRDPDPCHALCVSGGQGCHRSRAIALARALTEAAQSRLTAI 295
Query 336 SGAREDL 342
SGAR+D+
Sbjct 296 SGARDDI 302
>gi|330468892|ref|YP_004406635.1| hypothetical protein VAB18032_24685 [Verrucosispora maris AB-18-032]
gi|328811863|gb|AEB46035.1| hypothetical protein VAB18032_24685 [Verrucosispora maris AB-18-032]
Length=408
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 128/385 (34%), Positives = 179/385 (47%), Gaps = 21/385 (5%)
Query 54 TTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSV 113
T +R GT R +P ETW + P L GIT VADVT LD +G+P AVRP S L+V
Sbjct 5 TDKTYRDGTDRAIAPAETWQRVLPRLPEMGITRVADVTGLDHIGVPVFMAVRPNSRGLTV 64
Query 114 SQGKAASYRAAQVSAVMESLEGWHAENVTADL----WSATARDLEADLTYDPAQLRHRPG 169
+QGK S AA+VSAVMES+E +HAE + A L W AR D + L G
Sbjct 65 AQGKGLSVDAARVSAVMESIEAYHAERIEAPLLLGSWDELARHRR---LVDTSFLITAAG 121
Query 170 SLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDE 229
+L W+ T L++G W+P+ V + F + + GLASGN E
Sbjct 122 EPLRRDRRLLWIEGTDLMSGEPVWLPFDLVHNDYTGASQAGQSPFAVTSNGLASGNHLLE 181
Query 230 ATLHALYEVMERHSVAAAVAG----ETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARI 285
AT HA+ EV+ER + A +A + V D V ++++ + AG +
Sbjct 182 ATSHAICEVIERDAEALWLATPKQRQDELRVDPDTVDDPACRYVLDTLAAAGVAAACWDM 241
Query 286 DVWDGYYCFAAEL-----TSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGARE 340
G CF ++ S + G G H +AL RA+TEA QSR+T I+G+R+
Sbjct 242 TTDIGLPCFTVDIAEDPRVSISRVAVAQGQGCHPRREIALLRALTEAVQSRLTVIAGSRD 301
Query 341 DLPSAIYHRFGRVHTYAKARKTSLRLNRAR---PTPWRVPDVDSLPELVASAATAVANRS 397
D ++Y R + A +T N R P R D S E + A+
Sbjct 302 DFYRSLYARANDLDNREAAWRTCAAGNAPRHFTDVPTR--DNGSFQEDIEHELAALRQAG 359
Query 398 GTEPLAVVCDFADACVPVVKVLAPG 422
TE + V + VV+V+ PG
Sbjct 360 ITEAIQVPLGGEQLGISVVRVMLPG 384
>gi|89054752|ref|YP_510203.1| hypothetical protein Jann_2261 [Jannaschia sp. CCS1]
gi|88864301|gb|ABD55178.1| protein of unknown function DUF181 [Jannaschia sp. CCS1]
Length=396
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 114/317 (36%), Positives = 150/317 (48%), Gaps = 22/317 (6%)
Query 46 RVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVR 105
R + P +IA + HR P+ T+ L+ GIT VAD+T LD +G+P QAVR
Sbjct 5 RASDAGSPGSIARKTWAHRTCQPEFTYRRLRRVAERVGITRVADITDLDRVGLPVFQAVR 64
Query 106 PASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLR 165
P +LSVSQGK + AA+VSA+ME++E WHAE AT R L DP QL
Sbjct 65 PMGRSLSVSQGKGMTSMAARVSAMMEAVEIWHAEQDLPTTLRATIRSLGTRRAMDPNQLL 124
Query 166 HRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGN 225
+ + W + LL G VP A N+ +PPM TTGLA GN
Sbjct 125 MPGRDKVCEDLPIVWCPSLNLLDGADVLVPRDA--ANLDFTRAPDPPMLARSTTGLAGGN 182
Query 226 CYDEATLHALYEVMER------HSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDD 279
DEA A+ EV+ER + A + + A A +E IR AG
Sbjct 183 TRDEARASAIAEVIERACQREFQRLPPASRAQRRLDPTCLASAHRGLADPIERIRSAG-- 240
Query 280 VDLARIDVWDGYYCFAAELTSATL-EVTFG--------GFGLHHDPNVALSRAITEAAQS 330
+D++D F A + E T G G G H DP A+ RA+TEAAQ+
Sbjct 241 ---LHLDIFDMTNRFDVPAIRAVIYETTAGKPVAWPCLGHGAHLDPVTAVVRALTEAAQA 297
Query 331 RITAISGAREDLPSAIY 347
R+T ISG R+D+ Y
Sbjct 298 RLTGISGNRDDISPGHY 314
>gi|262199465|ref|YP_003270674.1| hypothetical protein Hoch_6310 [Haliangium ochraceum DSM 14365]
gi|262082812|gb|ACY18781.1| protein of unknown function DUF181 [Haliangium ochraceum DSM
14365]
Length=443
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 135/407 (34%), Positives = 194/407 (48%), Gaps = 46/407 (11%)
Query 45 SRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAV 104
S + + D T +HGTHR+ +P+ T ++P +A GIT +A+VT LD +G+P V A
Sbjct 25 SELRAAIDDTRKGFKHGTHRLIAPERTLARVRPHMAAMGITRLAEVTGLDRVGVPVVMAC 84
Query 105 RPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADL-TYDPAQ 163
RP + +L+VSQGK S AAQ S +ME +E +HAE++ A L T +L D
Sbjct 85 RPNARSLAVSQGKGLSAIAAQASGLMECVELYHAEHIVAPLLFTTLAELRGSFAVVDVRA 144
Query 164 LRH---RPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPM------F 214
L RP S + + W+ L+ GR VP+ V + + PM F
Sbjct 145 LPRSSARPLSEHQRSL---WIQGVDLMNGRPRLVPYEIVHAD------YTLPMPPGSGAF 195
Query 215 EMDTTGLASGNCYDEATLHALYEVMER--HSVAAAVAG-ETMFEVPTDDVAGSDSAHLVE 271
T GLASGN EA H L EV+ER H++ + G + D V A ++E
Sbjct 196 VSSTNGLASGNHLFEAVCHGLCEVVERDAHTLWSLTPGARAHTRIAPDSVDDDACAQVLE 255
Query 272 MIRDAGDDVDLARIDVWD--------GYYCFAAELTSATLEVT--FGGFGLHHDPNVALS 321
R + LA + VWD ++C A+ S L G G DP +AL
Sbjct 256 RFRASA----LA-VAVWDITSDVGIPAFHCVIADADSDPLRPLPPASGAGCAPDPAIALL 310
Query 322 RAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVP---- 377
RA+TEAAQSR+T I+G+R+D+ Y R H A LR R P R
Sbjct 311 RALTEAAQSRLTHIAGSRDDMSVLAYR---RAHDQG-AHTHLLRELREAPPTRRFDQVSG 366
Query 378 -DVDSLPELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLAPGL 423
D DS+ E ++ A + + + + +AV +PV +V+ PGL
Sbjct 367 YDSDSVAEDLSWALSRLRSVGIRQVVAVDLTLPAFNIPVARVVIPGL 413
>gi|288960709|ref|YP_003451049.1| hypothetical protein AZL_a09740 [Azospirillum sp. B510]
gi|288913017|dbj|BAI74505.1| hypothetical protein AZL_a09740 [Azospirillum sp. B510]
Length=398
Score = 147 bits (370), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 137/396 (35%), Positives = 191/396 (49%), Gaps = 47/396 (11%)
Query 55 TIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVS 114
T+ H G R+ SP+ET + P L G+T VAD+T LD +GIPT AVRP + + V+
Sbjct 10 TVRHAEGAQRLVSPEETLARVIPHLPTIGVTRVADITGLDRIGIPTFCAVRPLARLVQVT 69
Query 115 QGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEAD-LTYDPAQL--RHRPGSL 171
GK + AA+VSA+ME+LE HAE+ A A+ +L A+ + PAQ + PG
Sbjct 70 NGKGLTPIAARVSAIMEALEHAHAEDPPAAPRRASMAELTAERAAFLPAQALPNYVPGLH 129
Query 172 YHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEAT 231
++L W+ A +L VLV + EP + T GLASGN EAT
Sbjct 130 LDDHLRLPWLEARSLGPADS----GATVLVPACSAVPVEPLHAMVSTNGLASGNHIVEAT 185
Query 232 LHALYEVMERHSVA--------AAVAGETMFEV------PTDDVAGSDSAHLVEMIRDAG 277
LHALYE++ER +V +V G M ++ P ++AG +A VE++
Sbjct 186 LHALYELIERDAVTRFSRAGLRKSVDGACMVDLRRLPPGPVAELAGRVAAAGVELV---- 241
Query 278 DDVDLARI-------DVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQS 330
L R+ +W + A+ + + + G+G H P VA RAITEAAQS
Sbjct 242 ----LIRVASTGPATTMWAVFLDPLADQACSRVNM---GYGCHLSPTVAAVRAITEAAQS 294
Query 331 RITAISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPW-RVPDVDS--LPELVA 387
R+T I GAREDL + Y + T A R R W +PD S L +
Sbjct 295 RLTYIHGAREDLSADSY-----ILTPAHERLARFFTGRRGELAWDELPDRSSGDLGRDLD 349
Query 388 SAATAVANRSGTEPLAVVCDFADACVPVVKVLAPGL 423
+ +A L V A VPVVK++ PGL
Sbjct 350 LVLSGLAGAGFGRVLRVDLTRAAVGVPVVKLIVPGL 385
>gi|163746794|ref|ZP_02154151.1| hypothetical protein OIHEL45_15364 [Oceanibulbus indolifex HEL-45]
gi|161379908|gb|EDQ04320.1| hypothetical protein OIHEL45_15364 [Oceanibulbus indolifex HEL-45]
Length=411
Score = 146 bits (369), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 128/400 (32%), Positives = 190/400 (48%), Gaps = 40/400 (10%)
Query 48 LGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPA 107
G + T R G HRI + +T + P GIT +A+VT LD +G+P V A+RP
Sbjct 3 FGISGGTKKLLRDGLHRICTAQQTLDRILPIKHKFGITRIANVTGLDRVGLPVVLAIRPN 62
Query 108 SLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTY-DPAQLRH 166
+ ++SVSQGK ++ A+VSA+ME++E WHAE+ ++ A DL + D +L
Sbjct 63 ARSISVSQGKGSTLVLAKVSALMEAIEIWHAEHFDRPVFFARFDDLSEQHDFIDLTRLPE 122
Query 167 RPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPM-----FEMDTTGL 221
G ++ +L W+ A L++GR+ VP V + D P F T GL
Sbjct 123 VRGRTRNSAERLHWVYAQELMSGRKVLVP-----VEMVQTDYTHPLFPGTGCFPSSTNGL 177
Query 222 ASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTD-DVAGSDSAHLVEMIR---DAG 277
ASGN EAT HA+ EV+ER ++A G + + D+ D +E +R +AG
Sbjct 178 ASGNSELEATCHAICEVIERDALALWHHGSPDAQKSSQLDLNTVDDPICLEALRKFAEAG 237
Query 278 DDVDLARIDVWD--------GYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQ 329
VW+ + C + S T + G G H D +VAL RA+ EAAQ
Sbjct 238 -----LECFVWNVTSDVAVASFMCVIFDRQSETDHLGLGS-GTHPDRSVALERALNEAAQ 291
Query 330 SRITAISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTP-WRVPDVDSLPELVAS 388
+R+ ISGAREDL Y GR + T + + P P + DV S +
Sbjct 292 TRLNYISGAREDLSFEEYSASGRAQ-----KMTEFAVALSGPMPSLKFCDVPSSSNIDLE 346
Query 389 AATAVANR----SGTEPLAVV-CDFADACVPVVKVLAPGL 423
+ + +G +AVV + + VV+V+ PGL
Sbjct 347 SDLNFLKKCLWSAGINEVAVVGLGREEFRISVVRVIVPGL 386
>gi|336035262|gb|AEH81193.1| protein of unknown function DUF181 [Sinorhizobium meliloti SM11]
Length=405
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 102/307 (34%), Positives = 150/307 (49%), Gaps = 30/307 (9%)
Query 58 HRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGK 117
+ GT R +P+ET + P + GI+ V DVT LD +GIPT AVRP + LSVS GK
Sbjct 7 YSQGTQRTYNPEETLRRIAPAMRTCGISRVLDVTHLDRIGIPTYNAVRPNGMILSVSNGK 66
Query 118 AASYRAAQVSAVMESLEGWHAENVTADLW-----SATARDLEADLTYDPAQLRH------ 166
+ AA VSA+MES+E HAE W + R+ + P +
Sbjct 67 GGTKAAASVSAIMESIEVEHAEYPDTSAWHLAQSAKVLRNRGYSVVDAPTLISECLWPSD 126
Query 167 RPGSLYHA-GVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGN 225
G LY++ ++LDW+ ++ R +P + + V P + + GLASGN
Sbjct 127 TYGGLYYSDDLRLDWVEGREIIESRPVLLPASTIYVRA-------PYVHYFTSNGLASGN 179
Query 226 CYDEATLHALYEVMERHSVA------AAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDD 279
++EATLH + E++ER S A + + + + H E + AG +
Sbjct 180 TWEEATLHGICELIERDSTARLLGRPEGMTTSRLLRIEPKSMP-EHLGHFSEKVAQAGIE 238
Query 280 VDL----ARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAI 335
+ + + ID+ + F + + T GFG H P +A SRA+TEAAQSR+T I
Sbjct 239 LFMFALPSAIDIHTFWAVFHCPGEPSFMLATSAGFGCHTSPQIAASRALTEAAQSRLTYI 298
Query 336 SGAREDL 342
GAREDL
Sbjct 299 HGAREDL 305
>gi|13432020|sp|Q52871.1|YTF3_RHILT RecName: Full=UPF0142 protein in tfuA 3'region; AltName: Full=ORF3
gi|1439553|gb|AAB17514.1| ORF3 [Rhizobium leguminosarum bv. trifolii]
Length=420
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 130/384 (34%), Positives = 177/384 (47%), Gaps = 38/384 (9%)
Query 64 RITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAASYRA 123
R +P +T+ A++P L GIT V +T LD L IP A RP S TLSV QGK A
Sbjct 27 RAVTPAQTFAAIRPHLRDFGITRVGLLTALDVLNIPVAFATRPNSHTLSVFQGKGIDNEA 86
Query 124 AQVSAVMESLEGWHAENVTADLWSATARDLEAD----LTYD------PAQLRHRPGSLYH 173
A SA ME++E AE ADL AT + A+ + D P ++ RP
Sbjct 87 AMTSAAMEAVETRIAEIAPADLTQATVESMRAERAAMIDLDNVARCAPDEIGSRP----- 141
Query 174 AGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLH 233
+ W +L+G +VPW LV + R PP FE + GLASGN EA LH
Sbjct 142 ----IPWCSGLDILSGSSVFVPWW--LVGLDHRG-ERPPGFEQSSDGLASGNTPSEAVLH 194
Query 234 ALYEVMERH--SVAAAVAGETMFEVPTDDVAGSDSAH--LVEMIRDAGDDVDLARIDVWD 289
L E++ER ++ + E + E D + D+ + + I AG + L +
Sbjct 195 GLCELVERDAWALTQLKSPERLKESRIDPASFGDAVIDVMTDRITRAGMKLLLLDMTTDI 254
Query 290 GYYCFAAELTSATL--------EVTFGGFGLHHDPNVALSRAITEAAQSRITAISGARED 341
G F A + L GG G H DP A RAITEAAQSR+TAI+G+R+D
Sbjct 255 GIPAFLAVIMPGNLSDRVDARWSHVCGGCGCHPDPVRAALRAITEAAQSRLTAIAGSRDD 314
Query 342 LPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEP 401
IY R R + + R RP R ++ E + A + +G E
Sbjct 315 FSPRIYQRLDRSAAMQQVVELCEGDGRMRPFQPRHHRKATIQETIGHIADRLVA-TGIEQ 373
Query 402 LAVVCDFADACVP--VVKVLAPGL 423
+ VV F +P VV+V+ PGL
Sbjct 374 IVVV-PFPHPALPVSVVRVIVPGL 396
>gi|150378083|ref|YP_001314678.1| hypothetical protein Smed_6106 [Sinorhizobium medicae WSM419]
gi|150032630|gb|ABR64745.1| protein of unknown function DUF181 [Sinorhizobium medicae WSM419]
Length=405
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 102/305 (34%), Positives = 149/305 (49%), Gaps = 30/305 (9%)
Query 60 HGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAA 119
GT R +P+ET + P + GI+ V DVT LD +GIPT AVRP + LSVS GK
Sbjct 9 QGTQRTYNPEETLRRIAPAMRTCGISRVLDVTHLDRIGIPTYNAVRPNGMILSVSNGKGW 68
Query 120 SYRAAQVSAVMESLEGWHAENVTADLW-----SATARDLEADLTYDPAQLRH------RP 168
+ AA VSA+MES+E HAE W + R+ + P +
Sbjct 69 TKAAASVSAIMESIEVEHAEYPDTSAWHLAQSAKVLRNRGYSVVDAPTLISECLWPSDTY 128
Query 169 GSLYHA-GVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCY 227
G LY++ ++LDW+ ++ R +P + + V P + + GLASGN +
Sbjct 129 GGLYYSDDLRLDWVEGREIIESRPVLLPASTIYVRA-------PYVHYFTSNGLASGNTW 181
Query 228 DEATLHALYEVMERHSVA------AAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVD 281
+EATLH + E++ER S A + + + + H E + AG ++
Sbjct 182 EEATLHGICELIERDSTARLLGRPEGMTTSRLLRIEPKSMP-EHLGHFSEKVAQAGIELF 240
Query 282 L----ARIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISG 337
+ + ID+ + F + + T GFG H P +A SRA+TEAAQSR+T I G
Sbjct 241 MFALPSAIDIHTFWAVFHCPGEPSFMLATSAGFGCHTSPQIAASRALTEAAQSRLTYIHG 300
Query 338 AREDL 342
AREDL
Sbjct 301 AREDL 305
>gi|338732822|ref|YP_004671295.1| hypothetical protein SNE_A09270 [Simkania negevensis Z]
gi|336482205|emb|CCB88804.1| UPF0142 protein in tfuA 3'region [Simkania negevensis Z]
Length=434
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 118/390 (31%), Positives = 175/390 (45%), Gaps = 20/390 (5%)
Query 49 GSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPAS 108
G++ + GTHRI SP+ETW + P + G++ VA+VT LD +GIP +RP +
Sbjct 8 GNSYQAKKGYFKGTHRIVSPEETWEKIAPLTSQIGVSRVANVTGLDRIGIPVTAVIRPEA 67
Query 109 LTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPA-QLRHR 167
LTLS S GK + VS +MESLE AE +L + P +L R
Sbjct 68 LTLSTSSGKGLDLCTSLVSGLMESLELHCAEEADLSYLHLPYHELSKRVKTIPIDRLPLR 127
Query 168 PGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVN--VATRDCWEPPMFEMDTTGLASGN 225
SL+ W + L VP +V+ N + ++ E FEM + GLASGN
Sbjct 128 KNSLFRPDWPERWTIGWDLFNQEEVAVPLLSVIHNYKIVRQEPSELHSFEMTSNGLASGN 187
Query 226 CYDEATLHALYEVMER-----HSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDV 280
+ EA +YE++ER H A + V + + S ++E ++ A +
Sbjct 188 HFLEALAAGIYELIERDAITCHMFAFETVKAALPRVCLETIRFSKVQQVIEKLKWARFQL 247
Query 281 DLARIDVWDGYYCFAAELTSATLEVT--FGGFGLHHDPNVALSRAITEAAQSRITAISGA 338
L + F A L T+ T G+G H DP VA+ RAITEA Q I+G+
Sbjct 248 LLYDCTIDTEVPVFMATLYDETMRHTRLSQGYGAHLDPEVAMIRAITEAVQGSTIGIAGS 307
Query 339 REDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDS-----LPELVASAATAV 393
R+D I+ + + + +T L +P V ++S L E V +
Sbjct 308 RDD----IFFSQLKQGKQSDSEQTITALEN-QPATVDVSQLESVATSTLEEDVTLLMEKI 362
Query 394 ANRSGTEPLAVVCDFADACVPVVKVLAPGL 423
N T+ L D V V++V+APGL
Sbjct 363 RNVGITQLLVFDLSKEDLGVSVLRVIAPGL 392
>gi|209546417|ref|YP_002278307.1| hypothetical protein Rleg2_6037 [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|209539274|gb|ACI59207.1| protein of unknown function DUF181 [Rhizobium leguminosarum bv.
trifolii WSM2304]
Length=420
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 132/395 (34%), Positives = 178/395 (46%), Gaps = 60/395 (15%)
Query 64 RITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAASYRA 123
R +P +T A++P L GIT V +T LD L IP A RP S TLSV QGK A
Sbjct 27 RAVTPAQTLAAIRPHLREFGITRVGLLTALDVLNIPVAFATRPNSHTLSVFQGKGIDNDA 86
Query 124 AQVSAVMESLEGWHAENVTADLWSATARDLEAD----LTYDPAQLRHRPGSLYHAGVKLD 179
A SA ME++E AE ADL AT + A+ + D R P + +
Sbjct 87 AMTSAAMEAIETRIAEIPPADLTEATVAGMRAENAAMIDLDNVA-RCAPDEIGSGPIP-- 143
Query 180 WMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVM 239
W +L+G +VPW LV + R PP FE + GLASGN EA LH L E++
Sbjct 144 WCSGLDILSGSSAFVPWW--LVGLDHRG-ERPPGFEQSSDGLASGNTPSEAVLHGLCELV 200
Query 240 ERHS----------------VAAAVAGETMFEVPTDDVAGSD-SAHLVEMIRDAGDDVDL 282
ER + + A G+ + +V TD +A + L++M D G L
Sbjct 201 ERDAWALTQLKSPERLKESRIDPASFGDAVIDVMTDRIARAGMRLLLLDMTTDIGVPAFL 260
Query 283 A---------RIDVWDGYYCFAAELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRIT 333
A R+D + C GG G H DP A RAITEAAQSR+T
Sbjct 261 AVIMPGNLSDRVDARWAHVC--------------GGCGCHPDPVRAALRAITEAAQSRLT 306
Query 334 AISGAREDLPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAV 393
AI+G+R+D +Y R + + + R R R S P + +
Sbjct 307 AIAGSRDDFSPRVYQRLDQSAAMQQVVELCEGGGRMRAFQPR----QSRPATIQETIGHI 362
Query 394 ANR---SGTEPLAVVCDFADACVP--VVKVLAPGL 423
A+R +G E + VV FA +P VV+V+ PGL
Sbjct 363 ADRLAATGIEQIVVV-PFAHRALPVSVVRVIVPGL 396
>gi|326795593|ref|YP_004313413.1| YcaO-domain protein [Marinomonas mediterranea MMB-1]
gi|326546357|gb|ADZ91577.1| YcaO-domain protein [Marinomonas mediterranea MMB-1]
Length=414
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 118/384 (31%), Positives = 184/384 (48%), Gaps = 23/384 (5%)
Query 57 AHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQG 116
A+ GTHR SP ET + P L GIT +ADVT LD +G+P + A RP + +SVSQG
Sbjct 8 AYTTGTHRTVSPKETLEKITPLLLKMGITRLADVTGLDDIGVPVITACRPNAKAISVSQG 67
Query 117 KAASYRAAQVSAVMESLEGWHAENVTADLWSATARDL-EADLTYDPAQLRHRPGSLYHAG 175
K S AA+ SA ME++E WHAEN+ + L E + D L ++
Sbjct 68 KGVSVDAAKASAAMEAIETWHAENIDLPTRFCSFNALKENHVVVDLDTLPKMDVKPFNPD 127
Query 176 VKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHAL 235
+ W+ A L +VP+ + F++ T GLASGN +EA HAL
Sbjct 128 ERRLWIEAQDLNREHSYYVPYDLAHCDFTLPLPQGSGCFQLSTNGLASGNTVNEAASHAL 187
Query 236 YEVMERHSVA--AAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWD---- 289
E++ER ++ + ++ E + D +D + + + ++ D+A + VWD
Sbjct 188 CELIERDAMTLWSFLSSEEQGKRKVDLSTITDPT--IGGLLNKLEEADVA-VSVWDATSD 244
Query 290 -GYYCFAAELTSATLE-----VTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLP 343
G F + + T + G G H D +VA+ RAITEA Q+R+T ISG+R+D
Sbjct 245 IGIATFVCTIINKTESQYRPLYSMSGSGTHVDKHVAIMRAITEAVQARLTLISGSRDDAS 304
Query 344 SAIYHRFGRVHTYAKARKTSLR----LNRARPTPWRVPDVDSLPELVASAATAVANRSGT 399
IY ++ + RK + ++ + W D++ E + +A +
Sbjct 305 IKIYETRQQMEYQRRIRKELMETPSFVDFNKIDSWI---FDTIEEDLELQIAKLAAQGLP 361
Query 400 EPLAVVCDFADACVPVVKVLAPGL 423
PL + + +PVVKV++PGL
Sbjct 362 CPLFIDLTKTEFDIPVVKVISPGL 385
>gi|307352253|ref|YP_003893304.1| methanogenesis marker protein 1 [Methanoplanus petrolearius DSM
11571]
gi|307155486|gb|ADN34866.1| methanogenesis marker protein 1 [Methanoplanus petrolearius DSM
11571]
Length=396
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 100/311 (33%), Positives = 154/311 (50%), Gaps = 27/311 (8%)
Query 61 GTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASL--TLSVSQGKA 118
GTHR+T+P++T ++P + G+ V D+T LD LGIP A RP + + GK
Sbjct 20 GTHRVTAPEKTLEKIKPLMPEIGVVEVEDITGLDRLGIPVYSASRPGAKPGATRMHAGKG 79
Query 119 ASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQL-RHRPGSLYHAGVK 177
A+VSA+ME++E + AE + + + DPA L RP +G K
Sbjct 80 TRPVHAEVSAMMEAIERYSAEYRGESMIHESFDGMGPATAVDPADLILPRP---LESGEK 136
Query 178 LDWMVATTLLTGRRTWVPWTAVL-----VNVATRDCWEPPMFEMDTTGLASGNCYDEATL 232
L W + ++ +VP AV V +A + +F DT GLASGN +EA L
Sbjct 137 LHWTPSWDMMNEEEIYVPSNAVFHPYDPVGMAQQ------LFRSDTNGLASGNVIEEAIL 190
Query 233 HALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYY 292
HA++EV+ER +++ A +M + D G + L+++ D G + L ID G
Sbjct 191 HAIFEVIERDALSDAENARSMGKKIIVDKEGP-AKELLDIFEDNGVKIHLWLIDAKTGVP 249
Query 293 CFAA---ELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGARED------LP 343
AA + + + G G H +P +A+ RA+TE AQSR +++ G RED +
Sbjct 250 TVAAGGDDTLTKDPSLLVMGSGTHLNPEIAVLRALTEVAQSRGSSLKGGREDPKRRMLIE 309
Query 344 SAIYHRFGRVH 354
A Y R R++
Sbjct 310 KAGYERLKRIN 320
>gi|330508549|ref|YP_004384977.1| putative methanogenesis marker protein 1 [Methanosaeta concilii
GP6]
gi|328929357|gb|AEB69159.1| putative methanogenesis marker protein 1 [Methanosaeta concilii
GP6]
Length=406
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 132/398 (34%), Positives = 190/398 (48%), Gaps = 35/398 (8%)
Query 53 PTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASL--T 110
P ++ THR SP+ET ++ L AGIT VAD+T LD +GIP ++RP +
Sbjct 5 PCIKRYKEDTHRAASPEETEKRIEAKLPAAGITRVADITNLDRIGIPVFSSIRPMADRGA 64
Query 111 LSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRHRPGS 170
+SV GK A+ A+VSA+ME LE + E +L A L+A+ +P L +
Sbjct 65 VSVYNGKGATPVEARVSAMMEGLERYSGEVRDRELTIARYSSLKAE-ALNPVDLILPTEA 123
Query 171 LYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEA 230
+ A ++ W++ ++ VP AV +++ +F +T+GLASGN +EA
Sbjct 124 VADADAEIPWVLGWDIMNDEEIQVPANAVFHPLSSD---YKRLFRTNTSGLASGNMMEEA 180
Query 231 TLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDV--- 287
H L EV+ER + A A M + + DV + L+E R A +VD+ D+
Sbjct 181 IFHGLAEVIERDAWAIVEATRHMGPLIS-DVVDEQAQGLLE--RFAAAEVDVYLRDITSD 237
Query 288 WDGYYCFAA----ELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLP 343
D C AA +L TL T G G H VA+ RA+TE AQSR+T I GARED
Sbjct 238 IDIPTCAAAADDIKLRDPTLLTT--GMGTHTSARVAVLRALTEVAQSRLTQIHGAREDTV 295
Query 344 SAIYHRFGRVHTYAKARKTSLRLNRA---RPTPWRVPDVDSLPE----LVASAATAVANR 396
+A F R Y + + RLNR D+ S L +
Sbjct 296 TA---DFRRQIGYERTK----RLNRYWFDIGEKKSFADIQSFESNDFLLDIKFMISKLEE 348
Query 397 SGTEPLAVVCDFA--DACVPVVKVLAPGLVLSSASPMR 432
+G E AVV D + VPVV+V+ PGL ++ R
Sbjct 349 AGLER-AVVVDLTREEIGVPVVRVIVPGLEIAGVDRER 385
>gi|116255806|ref|YP_771639.1| hypothetical protein pRL110605 [Rhizobium leguminosarum bv. viciae
3841]
gi|115260454|emb|CAK03558.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length=420
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 128/381 (34%), Positives = 174/381 (46%), Gaps = 32/381 (8%)
Query 64 RITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAASYRA 123
R +P +T A++P L GIT V +T LD L IP A RP S TLSV QGK A
Sbjct 27 RAVTPAQTLAAIRPHLREFGITRVGLLTALDVLNIPVAFATRPNSHTLSVFQGKGIDNDA 86
Query 124 AQVSAVMESLEGWHAENVTADLWSATARDLEAD----LTYDPAQLRHRPGSLYHAGVKLD 179
A SA ME++E AE ADL AT + A+ + D R P + + +
Sbjct 87 AMTSAAMEAVETRIAEIAPADLTQATVDSMRAEHAAMIDLDNVA-RCAPDEIGSSPIP-- 143
Query 180 WMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVM 239
W +L+G +VPW LV + R P FE + GLASGN EA LH L E++
Sbjct 144 WCTGLDILSGSSVFVPWW--LVGLDHRG-ERPAGFEQSSDGLASGNTPSEAVLHGLCELV 200
Query 240 ERH--SVAAAVAGETMFEVPTDDVAGSDSAH--LVEMIRDAGDDVDLARIDVWDGYYCFA 295
ER ++ + E + E D + D+ + + I AG + L + G F
Sbjct 201 ERDAWALTQLKSPERLKESRIDPASFGDAVIDVMTDRITRAGMKLLLLDMTTDIGVPAFL 260
Query 296 AELTSATL--------EVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIY 347
A + L GG G H DP A RAITEAAQSR+TAI+G+R+D IY
Sbjct 261 AVIMPGNLSDRVDARWSHVCGGCGCHPDPVRAALRAITEAAQSRLTAIAGSRDDFSPRIY 320
Query 348 HRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAVANR---SGTEPLAV 404
R R + + R R R P + +A+R +G E + V
Sbjct 321 QRLDRSAAMQQVVELCEGDGRMRSFQAR----HRRPATIQDTIGHIADRLTATGIEQIVV 376
Query 405 VCDFADACVP--VVKVLAPGL 423
V F+ +P VV+V+ PGL
Sbjct 377 V-PFSHPALPISVVRVIVPGL 396
>gi|300863880|ref|ZP_07108801.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300338123|emb|CBN53947.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length=385
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 102/307 (34%), Positives = 159/307 (52%), Gaps = 27/307 (8%)
Query 52 DPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTL 111
D T A+ GTHR+ SP++T + P+L AGIT AD+T LD +GIP +++P +
Sbjct 2 DVLTKAYAIGTHRLISPEQTLANIHPYLPAAGITRCADITGLDRIGIPVYCSIKPGGRLV 61
Query 112 SVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLE-ADLTYD-----PAQLR 165
+ GK S AA+VSA+ME++E +HAEN + +S++ D+ +DL+ P L
Sbjct 62 QIHNGKGLSQMAAKVSALMEAIEVFHAENPYCNFYSSSFNDINVSDLSIISPNILPLYLS 121
Query 166 HRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGN 225
H + + + +DW+ A L +P +AV + P ++ + GLASGN
Sbjct 122 H---NFFSKDLIIDWIKAENLQKNESVLLPASAVYLR-------SPSLYGFSSNGLASGN 171
Query 226 CYDEATLHALYEVMERHSVAA-AVAGETMFE----VPTDDVAGSDSAHLVEMIRDAGDDV 280
EATLH LYE++ER ++A ++ G+ + + + V L+ I+ A +
Sbjct 172 HIVEATLHGLYELIERDAIAGVSINGKIDIKSCQIIDLNTVDDELICSLIYRIKSANFKL 231
Query 281 DLARIDVWDGYYCFAAELTSAT-----LEVTFGGFGLHHDPNVALSRAITEAAQSRITAI 335
L + F A + + V F G+G H +VA +RAITEAAQSR+T I
Sbjct 232 VLIWLKSCISVNTFWAIILDKNPLTPAIMVNF-GYGTHLSVSVAAARAITEAAQSRLTFI 290
Query 336 SGAREDL 342
G E+L
Sbjct 291 YGVSEEL 297
>gi|336120687|ref|YP_004575473.1| hypothetical protein MLP_50560 [Microlunatus phosphovorus NM-1]
gi|334688485|dbj|BAK38070.1| hypothetical protein MLP_50560 [Microlunatus phosphovorus NM-1]
Length=436
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 132/400 (33%), Positives = 187/400 (47%), Gaps = 30/400 (7%)
Query 45 SRVLGSADPTTIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAV 104
SR+L HR GTHR T P T + GIT +ADVT LD +G+P
Sbjct 11 SRLLPGPTAVPKTHRSGTHRTTDPAVTVARVWAHRRTMGITRIADVTGLDRVGVPVTMVT 70
Query 105 RPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADL-TYDPAQ 163
RP + +L+V+QGK + AA+ S +ME+ E +HAE+ L ++ R L +L T D +
Sbjct 71 RPNARSLAVNQGKGLTLDAARASGLMEAAETFHAEHPRLPLRLSSWRHLREELETVDCHR 130
Query 164 LRHRPGSLYHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLAS 223
L P + L W L TG +P+ V + T F + GLAS
Sbjct 131 LPRHPYGSFDDDRMLLWASGVDLRTGAPVQLPYELVHTHYTTLALPGAGAFLASSNGLAS 190
Query 224 GNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHL-VEMIRDAGDDVDL 282
GN EA LH LYEV+ER + ++E+ D A D+ + + + D G L
Sbjct 191 GNHPLEAVLHGLYEVVERDAT-------VLWEL--SDTAQQDATAVDLRTVTDPGCRGVL 241
Query 283 AR-------IDVWD-----GYYCFAAELTSAT----LEVTFGGFGLHHDPNVALSRAITE 326
R + W+ G FA E+ + G G HHD VAL+RA+TE
Sbjct 242 DRFAEAGLVVACWEQTSDIGIAVFAVEVIDSADGMDGAPAAAGMGAHHDATVALARALTE 301
Query 327 AAQSRITAISGAREDLPSAIYHRFGRVHTYAKARKTSLRLN-RARPTPWRVPDV--DSLP 383
AAQSR+TAI+G+R+D P + Y T A R R+N +A + + P D+L
Sbjct 302 AAQSRLTAIAGSRDDQPPSAYAVAHDPATLALHRAELQRVNAQATRSFAQAPHAVRDTLD 361
Query 384 ELVASAATAVANRSGTEPLAVVCDFADACVPVVKVLAPGL 423
E +A A+A +AV D + VV+V+ PGL
Sbjct 362 EDLAQLLDALAAAGLDHVVAVDLSRTDLGIDVVRVVVPGL 401
>gi|21227560|ref|NP_633482.1| hypothetical protein MM_1458 [Methanosarcina mazei Go1]
gi|20905941|gb|AAM31154.1| conserved protein [Methanosarcina mazei Go1]
Length=424
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 117/399 (30%), Positives = 190/399 (48%), Gaps = 43/399 (10%)
Query 55 TIAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASL--TLS 112
++++ GT R+ T + + G+T +AD+T LD LG+P ++RP++ +S
Sbjct 9 SLSYIEGTQRVYDEATTLENTKNQIKKIGVTRIADITNLDRLGVPIFSSIRPSAAPGAIS 68
Query 113 VSQGKAASYRAAQVSAVMESLEGWHAE------NVTADLWSATARDLEADLTYDPAQLRH 166
+ GK ++ + A++SA+MES E AE N+ D+ SA A +E+ L +
Sbjct 69 IYSGKGSTEQRARISAIMESFERCLAERPGLNANIAGDI-SAPAL-VESYLNARENYVTL 126
Query 167 RPGSL-----YHAGVKLDWMVATTLLTGRRTWVPWTAVLVNVAT-RDCWEPPMFEMDTTG 220
PGSL Y+ L+W+ A LL +V AV + C + +F +T G
Sbjct 127 DPGSLLLSQPYNPSSLLEWVGAYDLLNKEEVFVSANAVYHPYDSPGQCQK--LFLSNTNG 184
Query 221 LASGNCYDEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEM---IRDAG 277
LASGN +EA LH L EV+ER +++ A + ++ + V + +L E+ +D+G
Sbjct 185 LASGNVLEEAILHGLLEVIERDAISTA---QFTRDLGKEIVLTEEDGYLYEISRKFKDSG 241
Query 278 DDVDLARIDVWDGYYCFAAELTSATLE---VTFGGFGLHHDPNVALSRAITEAAQSRITA 334
D+ + + G A L+ + G G H P +A++RAITEAAQSR+
Sbjct 242 IDLKIWLVPTDTGIPTIIAATDDVKLKDPALLVMGAGSHLKPEIAVARAITEAAQSRVVQ 301
Query 335 ISGARED------LPSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVP--DVDSLPELV 386
I GARED + S Y R R++ + + L+ + R P ++D + E +
Sbjct 302 IQGAREDTDREGFIRSVGYDRMKRLNWFWFEEGEKISLSEVQDISKRSPAENIDVILEKL 361
Query 387 ASAATAVANRSGTEPLAVVCDFADACVPVVKVLAPGLVL 425
V L V + VPVV+V+ PG L
Sbjct 362 KGLTEKV--------LVVDLSREEVAVPVVRVIIPGFEL 392
>gi|325958165|ref|YP_004289631.1| methanogenesis marker protein 1 [Methanobacterium sp. AL-21]
gi|325329597|gb|ADZ08659.1| methanogenesis marker protein 1 [Methanobacterium sp. AL-21]
Length=399
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 112/387 (29%), Positives = 180/387 (47%), Gaps = 23/387 (5%)
Query 62 THRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRP--ASLTLSVSQGKAA 119
THR +P++T ++P L AG+T VA++T LD +GIP A+RP A +S+ GK A
Sbjct 13 THRAVAPEKTIENVEPKLRAAGVTRVAEITHLDRIGIPVYSAIRPGAAEGAVSIYAGKGA 72
Query 120 SYRAAQVSAVMESLEGWHAENVTADLWSATARDL-EADL--TYDPAQLRHRPGSLYHAGV 176
+ A+ SA+MES E + AE D + + E+DL DP +L
Sbjct 73 TKSQAKASAMMESFERFSAEITDLDRKNFVRGNFEESDLHNYLDPDKLILPKLGFNSKTE 132
Query 177 KLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALY 236
L+W+ A + + +VP AV + + + +F+ +T GLASGN +EA H +
Sbjct 133 GLEWVKAVDITNDKTVFVPANAVYHPYDSENISK--LFQSNTNGLASGNLIEEAIFHGMM 190
Query 237 EVMERHSVAAAVAGET-MFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFA 295
EV+ER + + A E+ + + +++ + + AG V L + A
Sbjct 191 EVVERDAWSIFEARHKPKPEINLETIENPLINNILHLFKKAGIHVKLVNLTADVEITTIA 250
Query 296 AELTSATLE---VTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAI------ 346
A L+ + G G H DP VA+ RA+TE AQSR T I G RED A+
Sbjct 251 AVSDDTVLKDPALLTLGVGTHLDPEVAVIRALTEVAQSRATQIHGTREDTVRAVFMRKAG 310
Query 347 YHRFGRVHTY-AKARKTSLRLNRARPTPWRVPDVDSLPELVASAATAVANRSGTEPLAVV 405
Y R R++ + ++ + L R + + E + ++ + + + L V
Sbjct 311 YERMKRINKHWFGESQSEVDLKEIRNYSGK-----TFKEDIETSQKLLGKQGFKDILYVD 365
Query 406 CDFADACVPVVKVLAPGLVLSSASPMR 432
+ +PVV+VL P + L S R
Sbjct 366 LTRQEIQIPVVRVLIPEMELFSVDVNR 392
>gi|54292916|ref|YP_122303.1| hypothetical protein plpl0009 [Legionella pneumophila str. Lens]
gi|53755824|emb|CAH17328.1| hypothetical protein plpl0009 [Legionella pneumophila str. Lens]
Length=353
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 94/299 (32%), Positives = 146/299 (49%), Gaps = 26/299 (8%)
Query 56 IAHRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLTLSVSQ 115
I H + R ET L F AGIT +AD+T LD +P A+RP + +L+ SQ
Sbjct 3 IRHAETSFRARHFSETLNLLNQFKKLAGITRLADLTHLDYTSLPVYTAIRPRAKSLTTSQ 62
Query 116 GKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDL-EADLTYDPAQLRHRPGSLYHA 174
GK + AA+ SA+MES+E + AE + + + + +L +++ + P +
Sbjct 63 GKGLTKEAAKCSALMESIEVYFAEEIIPQVTNKSELELTQSNNLFIPINQLANSVRFTNP 122
Query 175 GVKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHA 234
++W+ A + +G+ VP+ +N + ++ DTTGLA GN Y EA LH
Sbjct 123 SQPINWVYADLVFSGKTILVPFAEYSLNSYLPEVL---IYSPDTTGLAGGNNYKEALLHG 179
Query 235 LYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYY-- 292
+ EV+ER A E F +++LVE + D I + YY
Sbjct 180 ILEVIERQD--AQQITEIAFV----------NSNLVENLSIRFD----CFITYQENYYRV 223
Query 293 -CFAAELTSAT---LEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIY 347
F L S ++ F G G H + +AL+RA+TEA QSR+T I+G+R+DL + Y
Sbjct 224 PSFEVLLKSKNPFENQILFKGSGSHLNKKIALNRALTEAIQSRVTTIAGSRDDLINTKY 282
>gi|147919709|ref|YP_686545.1| hypothetical protein RCIX2079 [uncultured methanogenic archaeon
RC-I]
gi|110621941|emb|CAJ37219.1| conserved hypothetical protein [uncultured methanogenic archaeon
RC-I]
Length=415
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 128/391 (33%), Positives = 175/391 (45%), Gaps = 54/391 (13%)
Query 62 THRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLT--LSVSQGKAA 119
THR+ P+ET ++ L G+T VA+++ LD +GIP A+RP S +SV GK A
Sbjct 16 THRVVPPEETLNRVEKLLPDIGVTRVAEISGLDRIGIPVYSAIRPGSEKGAISVYAGKGA 75
Query 120 SYRAAQVSAVMESLEGW----HAENVTADLWSATARDLEADLTYDPAQLRHRPGSLYHAG 175
+ A+VS +MES+E + H ++ L E DP L PG L G
Sbjct 76 TPVEAKVSVIMESIERYSSEMHKQDKKKVLVGTYEEVSEKHAAVDPQSL-ILPGRLL-PG 133
Query 176 VKLDWMVATTLLTGRRTWVPWTAVL---VNVATRDCWEPPMFEMDTTGLASGNCYDEATL 232
KL+W L+ + +P AV + A R +F +T GLASGN +EA
Sbjct 134 TKLEWFDGYDLIGKKDVKLPCNAVFHPYTSAAVR------LFRSNTNGLASGNTMEEAIF 187
Query 233 HALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYY 292
HAL EV+ER +++ A A + + D + L A DV L + G
Sbjct 188 HALMEVVERDALSLAEATRNTGQAISIDEDDGIAYDLYAKFGKANIDVKLWYLPTDTGIP 247
Query 293 CFAA-----ELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDLP---- 343
A EL L V G G H D +A RA+TE AQSR T I G RED
Sbjct 248 TVLAAADDKELLDPALLVM--GVGTHLDARIATLRALTEVAQSRATQIHGGREDTDRERI 305
Query 344 --SAIYHRFGRV--HTYAKARKT-SLRLNRARPTPWRVPDVDSLPELVASAATAVANRS- 397
S Y R R+ H YA+A +T SL+ SLP+L ++ +S
Sbjct 306 TRSIGYERMKRLNKHWYAEAAETVSLK---------------SLPDLSTTSHKGDIEKSI 350
Query 398 ----GTEPLAVVCDFADAC-VPVVKVLAPGL 423
G +V D + VPVV+V PGL
Sbjct 351 RQLKGIAQGVIVTDLTRSIGVPVVRVTVPGL 381
>gi|88601828|ref|YP_502006.1| hypothetical protein Mhun_0527 [Methanospirillum hungatei JF-1]
gi|88187290|gb|ABD40287.1| protein of unknown function DUF181 [Methanospirillum hungatei
JF-1]
Length=406
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 132/402 (33%), Positives = 185/402 (47%), Gaps = 53/402 (13%)
Query 58 HRHGTHRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRP--ASLTLSVSQ 115
++ THR SP+ET+ A+ PAGIT VAD+T LD +GIP +RP A ++V
Sbjct 10 YQKETHRTRSPEETYEAVHDLTGPAGITRVADITGLDRIGIPVFSCIRPVAAEGAITVYN 69
Query 116 GKAASYRAAQVSAVMESLEGWHAENVTADLWSATARDLEADLTYDPAQLRH---RPGSLY 172
GK A+ AA+VSA+ME LE + AE D T +TYD ++ RP +L
Sbjct 70 GKGATPIAARVSAIMEGLERYSAE--VHDRSPQT-------MTYDQIRMEKNAIRPDTLI 120
Query 173 HAGVK-----LDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPPMFEMDTTGLASGNCY 227
+ W +L WVP AV V +F T G+ASGN Y
Sbjct 121 LPEYAEPEWPIPWWQGYDILRNEEVWVPAHAVFHPVPR---IMGKLFRTSTNGIASGNTY 177
Query 228 DEATLHALYEVMERHSVAAAVAGETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDV 287
+EA H+L E++ER + + A + T DV + L++ ++AG DV L I
Sbjct 178 EEAVFHSLCELIERDAWSLVEASQNAGPAIT-DVTHPVARELLDKFKEAGVDVILRDITS 236
Query 288 WDGYYCFAA-----ELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGAREDL 342
G AA +L TL G G H +A+ RA+TE AQSR T I GARED
Sbjct 237 DLGIPTVAAVSDDLQLRDPTLLCI--GMGSHLCSEIAILRALTEVAQSRATQIHGAREDT 294
Query 343 PSAIYHRFGRVHTYAKARKTSLRLNRARPTPWRVPDVDSLPELVASAAT--------AVA 394
+ H +V Y +A+ RLN+ W + + + + S T V
Sbjct 295 KTT--HFLSKV-GYDRAK----RLNKK----WFTTEAEIAYKDMPSYHTDDFLDDIHIVL 343
Query 395 NRSGTEPL--AVVCDFA--DACVPVVKVLAPGLVLSSASPMR 432
+R L +V D + VPVV+V+ PGL + P R
Sbjct 344 DRLKAAGLDRVIVHDLTRPEIGVPVVRVIVPGLEHYAMDPER 385
>gi|116255454|ref|YP_771287.1| hypothetical protein pRL110255 [Rhizobium leguminosarum bv. viciae
3841]
gi|115260102|emb|CAK03202.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length=385
Score = 130 bits (326), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 113/361 (32%), Positives = 165/361 (46%), Gaps = 41/361 (11%)
Query 83 GITGVADVTWLDCLGIPTVQAVRPASLTLSVSQGKAASYRAAQVSAVMESLEGWHAENVT 142
GIT + +T LD +GIP Q VRP S ++SV+QGK ++ A +SA+MESLEGW +E +
Sbjct 36 GITRLGSITELDRIGIPVAQVVRPLSRSVSVNQGKGLTHGQAAISALMESLEGWSSERIP 95
Query 143 ADLWSATARDLEADLTYDPAQLRHRPGSLYHAGV--------KLDWMVATTLLTGRRTWV 194
+ + A R G Y + + L W+ L + R V
Sbjct 96 TE-------------RVELAGFRSMNGQGYWSHLADYGERDETLAWIEGWDLFSSRAVPV 142
Query 195 PWTAVLVNVATRDCWEPPMFEMDTTGLASGNCYDEATLHALYEVMERHSVAAAVAGETMF 254
P A++ T P +TTGLA+G + A HA +E +ERH+ AA+ F
Sbjct 143 P-LALVDTAYTIPSPHPGWLPRNTTGLAAGTSWRGAIEHACFEALERHARCAAMKIPHFF 201
Query 255 ---EVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWDGYYCFAAELTSATLEVTFG--- 308
+V + V + +V +R AG V + I G + + + L+ F
Sbjct 202 DRYQVDSRSVLAGAAGEIVGRLRSAGCSVGMWSIPTEHGLPVYWCHVMESDLQAPFAPWP 261
Query 309 --GFGLHHDPNVALSRAITEAAQSRITAISGAREDLPSAIYHRFGRVHTYAKARKTSL-R 365
GFG + AL++A+ EA QSR+ IS ARED+ IY Y AR+ S R
Sbjct 262 AEGFGCDRTHDRALAKALLEACQSRLGIISAAREDMAGHIYR-------YQDARELSAWR 314
Query 366 LNRARP-TPWRVPDVDSLPELVASAATAVANRSGTEPLAVVCDFADACVP--VVKVLAPG 422
A P P+ PD L + R+G E + VV F+D +P VV+V+ P
Sbjct 315 RRLAIPGLPYPSPDGADLNTDPSPLPVEALRRAGAEAVIVVALFSDETIPLHVVRVVTPP 374
Query 423 L 423
L
Sbjct 375 L 375
>gi|282163224|ref|YP_003355609.1| hypothetical protein MCP_0554 [Methanocella paludicola SANAE]
gi|282155538|dbj|BAI60626.1| conserved hypothetical protein [Methanocella paludicola SANAE]
Length=402
Score = 130 bits (326), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 110/326 (34%), Positives = 160/326 (50%), Gaps = 39/326 (11%)
Query 62 THRITSPDETWLALQPFLAPAGITGVADVTWLDCLGIPTVQAVRPASLT--LSVSQGKAA 119
THR+ P+ET ++ L G+T VA+++ LD +GIP A+RPAS +SV GK A
Sbjct 16 THRVVPPEETLARVEKLLPGIGVTRVAEISGLDRIGIPVYSAIRPASAKGAISVYAGKGA 75
Query 120 SYRAAQVSAVMESLEGWHAENVTAD---LWSATARDL-EADLTYDPAQLRHRPGSLYHAG 175
+ A+VS +ME++E + +E AD + T D+ + DP +L PG L
Sbjct 76 TPVEAKVSVMMEAIERYSSEFQKADKKRVVMGTFTDVSNGKVAVDPQKL-ILPGQLL-PN 133
Query 176 VKLDWMVATTLLTGRRTWVPWTAVLVNVATRDCWEPP--MFEMDTTGLASGNCYDEATLH 233
V+LDW+ L+ + +P AV + P +F +T GLASGN +EA H
Sbjct 134 VRLDWIDGYDLMNKKEVLLPCNAVF------HPYLAPFKLFRSNTNGLASGNTMEEAIFH 187
Query 234 ALYEVMERHSVAAAVA----GETMFEVPTDDVAGSDSAHLVEMIRDAGDDVDLARIDVWD 289
L EV+ER +++ A A G+ + D +A L AG DV L +
Sbjct 188 GLMEVVERDALSIAEATRDPGKEITITKKDGLA----YELYAKFGKAGIDVKLWYLPTDS 243
Query 290 GYYCFAA-----ELTSATLEVTFGGFGLHHDPNVALSRAITEAAQSRITAISGARED--- 341
G A EL +L V G G H D +++ RA+TE AQSR T I GARED
Sbjct 244 GIPTVLASTDDKELMDPSLLVM--GVGTHMDARISVLRALTEVAQSRATQIQGAREDTDR 301
Query 342 ---LPSAIYHRFGRV--HTYAKARKT 362
+ + Y R R+ H Y + ++T
Sbjct 302 EKVVRTIGYERMKRMNRHWYGEGKET 327
Lambda K H
0.318 0.131 0.403
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 900442926544
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40