BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2307c
Length=281
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609444|ref|NP_216823.1| hypothetical protein Rv2307c [Mycob... 554 6e-156
gi|289443817|ref|ZP_06433561.1| conserved hypothetical protein [... 552 2e-155
gi|298525790|ref|ZP_07013199.1| conserved hypothetical protein [... 551 6e-155
gi|340627314|ref|YP_004745766.1| hypothetical protein MCAN_23341... 550 8e-155
gi|15841799|ref|NP_336836.1| hypothetical protein MT2364 [Mycoba... 549 2e-154
gi|254232448|ref|ZP_04925775.1| conserved hypothetical protein [... 548 2e-154
gi|31793486|ref|NP_855979.1| hypothetical protein Mb2330c [Mycob... 548 3e-154
gi|289754408|ref|ZP_06513786.1| conserved hypothetical protein [... 534 7e-150
gi|308232095|ref|ZP_07663999.1| hypothetical protein TMAG_00494 ... 531 3e-149
gi|323719208|gb|EGB28353.1| hypothetical protein TMMG_01588 [Myc... 529 2e-148
gi|254365086|ref|ZP_04981132.1| conserved hypothetical protein [... 521 5e-146
gi|289570427|ref|ZP_06450654.1| conserved hypothetical protein [... 470 1e-130
gi|254551348|ref|ZP_05141795.1| hypothetical protein Mtube_12945... 434 9e-120
gi|108797685|ref|YP_637882.1| hypothetical protein Mmcs_0705 [My... 330 1e-88
gi|284990955|ref|YP_003409509.1| hypothetical protein Gobs_2467 ... 259 5e-67
gi|134099491|ref|YP_001105152.1| hypothetical protein SACE_2949 ... 249 4e-64
gi|291008503|ref|ZP_06566476.1| hypothetical protein SeryN2_2862... 247 1e-63
gi|284030834|ref|YP_003380765.1| hypothetical protein Kfla_2901 ... 229 4e-58
gi|111020471|ref|YP_703443.1| hypothetical protein RHA1_ro03482 ... 224 8e-57
gi|333922116|ref|YP_004495697.1| hypothetical protein AS9A_4464 ... 220 2e-55
gi|226362689|ref|YP_002780467.1| hypothetical protein ROP_32750 ... 214 1e-53
gi|291301493|ref|YP_003512771.1| hypothetical protein Snas_4026 ... 208 6e-52
gi|333919123|ref|YP_004492704.1| hypothetical protein AS9A_1452 ... 205 6e-51
gi|209964387|ref|YP_002297302.1| hypothetical protein RC1_1069 [... 201 9e-50
gi|289208235|ref|YP_003460301.1| alpha/beta hydrolase fold prote... 194 1e-47
gi|258652534|ref|YP_003201690.1| hypothetical protein Namu_2325 ... 187 1e-45
gi|258593749|emb|CBE70090.1| putative enzyme (3.4.-) [NC10 bacte... 181 1e-43
gi|91787705|ref|YP_548657.1| hypothetical protein Bpro_1826 [Pol... 177 2e-42
gi|218782678|ref|YP_002433996.1| hypothetical protein Dalk_4851 ... 175 7e-42
gi|334336819|ref|YP_004541971.1| hypothetical protein Isova_1309... 173 2e-41
gi|269956104|ref|YP_003325893.1| hypothetical protein Xcel_1304 ... 172 4e-41
gi|328954226|ref|YP_004371560.1| alpha/beta hydrolase fold prote... 169 5e-40
gi|317151894|ref|YP_004119942.1| alpha/beta hydrolase fold prote... 168 9e-40
gi|292493769|ref|YP_003529208.1| hypothetical protein Nhal_3806 ... 167 1e-39
gi|220935197|ref|YP_002514096.1| hypothetical protein Tgr7_2029 ... 167 2e-39
gi|82703211|ref|YP_412777.1| hypothetical protein Nmul_A2092 [Ni... 165 5e-39
gi|149175241|ref|ZP_01853863.1| hypothetical protein PM8797T_206... 164 2e-38
gi|149174556|ref|ZP_01853182.1| hypothetical protein PM8797T_097... 163 3e-38
gi|229819520|ref|YP_002881046.1| hypothetical protein Bcav_1023 ... 162 4e-38
gi|114776756|ref|ZP_01451799.1| hypothetical protein SPV1_11091 ... 161 8e-38
gi|77920018|ref|YP_357833.1| putative enzyme (3.4.-) [Pelobacter... 161 8e-38
gi|302039458|ref|YP_003799780.1| putative peptidase [Candidatus ... 161 9e-38
gi|168699272|ref|ZP_02731549.1| hypothetical protein GobsU_07102... 161 1e-37
gi|302342111|ref|YP_003806640.1| enzyme (3.4.-) [Desulfarculus b... 160 2e-37
gi|148358661|ref|YP_001249868.1| hypothetical protein LPC_0537 [... 160 2e-37
gi|296122668|ref|YP_003630446.1| hypothetical protein Plim_2421 ... 160 2e-37
gi|54298593|ref|YP_124962.1| hypothetical protein lpp2657 [Legio... 159 3e-37
gi|344224157|gb|EGV50565.1| hypothetical protein Rifp1Sym_cv0007... 159 4e-37
gi|345123263|gb|EGW53165.1| hypothetical protein TevJSym_bk00200... 159 5e-37
gi|116748362|ref|YP_845049.1| hypothetical protein Sfum_0918 [Sy... 159 5e-37
>gi|15609444|ref|NP_216823.1| hypothetical protein Rv2307c [Mycobacterium tuberculosis H37Rv]
gi|148662129|ref|YP_001283652.1| hypothetical protein MRA_2323 [Mycobacterium tuberculosis H37Ra]
gi|148823506|ref|YP_001288260.1| hypothetical protein TBFG_12329 [Mycobacterium tuberculosis F11]
24 more sequence titles
Length=281
Score = 554 bits (1427), Expect = 6e-156, Method: Compositional matrix adjust.
Identities = 281/281 (100%), Positives = 281/281 (100%), Gaps = 0/281 (0%)
Query 1 MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ
Sbjct 1 MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
Query 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP
Sbjct 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
Query 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF
Sbjct 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
Query 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA
Sbjct 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
Query 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
>gi|289443817|ref|ZP_06433561.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289447940|ref|ZP_06437684.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289745579|ref|ZP_06504957.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
9 more sequence titles
Length=281
Score = 552 bits (1422), Expect = 2e-155, Method: Compositional matrix adjust.
Identities = 280/281 (99%), Positives = 280/281 (99%), Gaps = 0/281 (0%)
Query 1 MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
MSLKRCRALPVVAIVALVASGVI FIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ
Sbjct 1 MSLKRCRALPVVAIVALVASGVITFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
Query 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP
Sbjct 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
Query 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF
Sbjct 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
Query 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA
Sbjct 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
Query 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
>gi|298525790|ref|ZP_07013199.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298495584|gb|EFI30878.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=281
Score = 551 bits (1419), Expect = 6e-155, Method: Compositional matrix adjust.
Identities = 279/281 (99%), Positives = 280/281 (99%), Gaps = 0/281 (0%)
Query 1 MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
MSL+RCRALPVVAIVALVASGVI FIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ
Sbjct 1 MSLRRCRALPVVAIVALVASGVITFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
Query 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP
Sbjct 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
Query 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF
Sbjct 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
Query 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA
Sbjct 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
Query 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
>gi|340627314|ref|YP_004745766.1| hypothetical protein MCAN_23341 [Mycobacterium canettii CIPT
140010059]
gi|340005504|emb|CCC44665.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=281
Score = 550 bits (1417), Expect = 8e-155, Method: Compositional matrix adjust.
Identities = 279/281 (99%), Positives = 279/281 (99%), Gaps = 0/281 (0%)
Query 1 MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
MSLKRCRALPVVAIVALVASGVI FIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ
Sbjct 1 MSLKRCRALPVVAIVALVASGVITFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
Query 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP
Sbjct 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
Query 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF
Sbjct 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
Query 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPA LSERLVAAA
Sbjct 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPAALSERLVAAA 240
Query 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
>gi|15841799|ref|NP_336836.1| hypothetical protein MT2364 [Mycobacterium tuberculosis CDC1551]
gi|13882061|gb|AAK46650.1| bem46 protein [Mycobacterium tuberculosis CDC1551]
Length=281
Score = 549 bits (1414), Expect = 2e-154, Method: Compositional matrix adjust.
Identities = 279/281 (99%), Positives = 279/281 (99%), Gaps = 0/281 (0%)
Query 1 MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
MSLKRCRALPVVAIVALVASGVI FIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ
Sbjct 1 MSLKRCRALPVVAIVALVASGVITFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
Query 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP
Sbjct 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
Query 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
GRPSEQGLAADARAAQEWLSGQSDVDPARIAY GESLGAAVAVGLAVQRPPAALVLRSPF
Sbjct 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYXGESLGAAVAVGLAVQRPPAALVLRSPF 180
Query 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA
Sbjct 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
Query 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
>gi|254232448|ref|ZP_04925775.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|124601507|gb|EAY60517.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
Length=281
Score = 548 bits (1413), Expect = 2e-154, Method: Compositional matrix adjust.
Identities = 279/281 (99%), Positives = 279/281 (99%), Gaps = 0/281 (0%)
Query 1 MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
MSLKRCRALPVVAIVALVASGVI FIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ
Sbjct 1 MSLKRCRALPVVAIVALVASGVITFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
Query 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGN
Sbjct 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNL 120
Query 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF
Sbjct 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
Query 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA
Sbjct 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
Query 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
>gi|31793486|ref|NP_855979.1| hypothetical protein Mb2330c [Mycobacterium bovis AF2122/97]
gi|121638189|ref|YP_978413.1| hypothetical protein BCG_2324c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224990683|ref|YP_002645370.1| hypothetical protein JTY_2318 [Mycobacterium bovis BCG str. Tokyo
172]
8 more sequence titles
Length=281
Score = 548 bits (1412), Expect = 3e-154, Method: Compositional matrix adjust.
Identities = 279/281 (99%), Positives = 279/281 (99%), Gaps = 0/281 (0%)
Query 1 MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
MSLKRCRALPVVAIVALVASGVI FIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ
Sbjct 1 MSLKRCRALPVVAIVALVASGVITFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
Query 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP
Sbjct 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
Query 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF
Sbjct 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
Query 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSE LVAAA
Sbjct 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSEWLVAAA 240
Query 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
>gi|289754408|ref|ZP_06513786.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289694995|gb|EFD62424.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|339295211|gb|AEJ47322.1| hypothetical protein CCDC5079_2132 [Mycobacterium tuberculosis
CCDC5079]
gi|339298831|gb|AEJ50941.1| hypothetical protein CCDC5180_2104 [Mycobacterium tuberculosis
CCDC5180]
Length=273
Score = 534 bits (1375), Expect = 7e-150, Method: Compositional matrix adjust.
Identities = 271/273 (99%), Positives = 272/273 (99%), Gaps = 0/273 (0%)
Query 9 LPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGW 68
+PVVAIVALVASGVI FIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGW
Sbjct 1 MPVVAIVALVASGVITFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGW 60
Query 69 YFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGL 128
YFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGL
Sbjct 61 YFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGL 120
Query 129 AADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGA 188
AADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGA
Sbjct 121 AADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGA 180
Query 189 VHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVV 248
VHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVV
Sbjct 181 VHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVV 240
Query 249 VPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
VPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 241 VPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 273
>gi|308232095|ref|ZP_07663999.1| hypothetical protein TMAG_00494 [Mycobacterium tuberculosis SUMu001]
gi|308369685|ref|ZP_07666784.1| hypothetical protein TMBG_00851 [Mycobacterium tuberculosis SUMu002]
gi|308370970|ref|ZP_07667060.1| hypothetical protein TMCG_00400 [Mycobacterium tuberculosis SUMu003]
13 more sequence titles
Length=271
Score = 531 bits (1369), Expect = 3e-149, Method: Compositional matrix adjust.
Identities = 270/271 (99%), Positives = 271/271 (100%), Gaps = 0/271 (0%)
Query 11 VVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYF 70
+VAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYF
Sbjct 1 MVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYF 60
Query 71 PHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAA 130
PHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAA
Sbjct 61 PHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAA 120
Query 131 DARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVH 190
DARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVH
Sbjct 121 DARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVH 180
Query 191 YPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVP 250
YPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVP
Sbjct 181 YPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVP 240
Query 251 GVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
GVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 241 GVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 271
>gi|323719208|gb|EGB28353.1| hypothetical protein TMMG_01588 [Mycobacterium tuberculosis CDC1551A]
Length=271
Score = 529 bits (1363), Expect = 2e-148, Method: Compositional matrix adjust.
Identities = 269/271 (99%), Positives = 270/271 (99%), Gaps = 0/271 (0%)
Query 11 VVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYF 70
+VAIVALVASGVI FIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYF
Sbjct 1 MVAIVALVASGVITFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYF 60
Query 71 PHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAA 130
PHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAA
Sbjct 61 PHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAA 120
Query 131 DARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVH 190
DARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVH
Sbjct 121 DARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVH 180
Query 191 YPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVP 250
YPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVP
Sbjct 181 YPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVP 240
Query 251 GVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
GVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 241 GVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 271
>gi|254365086|ref|ZP_04981132.1| conserved hypothetical protein [Mycobacterium tuberculosis str.
Haarlem]
gi|134150600|gb|EBA42645.1| conserved hypothetical protein [Mycobacterium tuberculosis str.
Haarlem]
Length=289
Score = 521 bits (1342), Expect = 5e-146, Method: Compositional matrix adjust.
Identities = 263/265 (99%), Positives = 264/265 (99%), Gaps = 0/265 (0%)
Query 17 LVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGG 76
+VASGVI FIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGG
Sbjct 25 VVASGVITFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGG 84
Query 77 SGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQ 136
SGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQ
Sbjct 85 SGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQ 144
Query 137 EWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPL 196
EWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPL
Sbjct 145 EWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPL 204
Query 197 RRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHND 256
RRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHND
Sbjct 205 RRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHND 264
Query 257 PELLDGRVMLDAIRRFLTETAVLGQ 281
PELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 265 PELLDGRVMLDAIRRFLTETAVLGQ 289
>gi|289570427|ref|ZP_06450654.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289544181|gb|EFD47829.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=243
Score = 470 bits (1209), Expect = 1e-130, Method: Compositional matrix adjust.
Identities = 239/240 (99%), Positives = 239/240 (99%), Gaps = 0/240 (0%)
Query 1 MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
MSLKRCRALPVVAIVALVASGVI FIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ
Sbjct 1 MSLKRCRALPVVAIVALVASGVITFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQ 60
Query 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP
Sbjct 61 DGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNP 120
Query 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF
Sbjct 121 GRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPF 180
Query 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA
Sbjct 181 TSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
>gi|254551348|ref|ZP_05141795.1| hypothetical protein Mtube_12945 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
Length=219
Score = 434 bits (1115), Expect = 9e-120, Method: Compositional matrix adjust.
Identities = 219/219 (100%), Positives = 219/219 (100%), Gaps = 0/219 (0%)
Query 63 MRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGR 122
MRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGR
Sbjct 1 MRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGR 60
Query 123 PSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTS 182
PSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTS
Sbjct 61 PSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTS 120
Query 183 LAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAE 242
LAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAE
Sbjct 121 LAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAE 180
Query 243 PKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 281
PKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
Sbjct 181 PKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ 219
>gi|108797685|ref|YP_637882.1| hypothetical protein Mmcs_0705 [Mycobacterium sp. MCS]
gi|119866773|ref|YP_936725.1| hypothetical protein Mkms_0719 [Mycobacterium sp. KMS]
gi|126433310|ref|YP_001069001.1| hypothetical protein Mjls_0699 [Mycobacterium sp. JLS]
gi|108768104|gb|ABG06826.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119692862|gb|ABL89935.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126233110|gb|ABN96510.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=275
Score = 330 bits (847), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 198/270 (74%), Positives = 224/270 (83%), Gaps = 2/270 (0%)
Query 11 VVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYF 70
+V IV LVA+G + +W+QQRRLIYFP+ GPVPSA++V RDVVV T DG+ LG W+F
Sbjct 7 LVVIVVLVANGALALLWNQQRRLIYFPAPGPVPSATAVWSGARDVVVRTADGVDLGAWFF 66
Query 71 PHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAA 130
+ GPAVLVCNGN GDRSMRA LA+AL +GLSVLLFDYRGYGGNPGRP+E GLAA
Sbjct 67 --AAADRGPAVLVCNGNGGDRSMRAALALALRRMGLSVLLFDYRGYGGNPGRPTEDGLAA 124
Query 131 DARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVH 190
DARAA++WL+ Q +VDP R+AYFGESLG AVAVGLA RPPAALVLRSPFTSLA+VGAVH
Sbjct 125 DARAARDWLAAQPEVDPDRLAYFGESLGGAVAVGLAAARPPAALVLRSPFTSLADVGAVH 184
Query 191 YPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVP 250
YPWLP+RRLLLD YPSIERIA +HAP+LVIAG DDIVPA LS RL AAAEPK +V+VP
Sbjct 185 YPWLPVRRLLLDRYPSIERIAGIHAPLLVIAGDRDDIVPAGLSRRLYDAAAEPKEFVLVP 244
Query 251 GVGHNDPELLDGRVMLDAIRRFLTETAVLG 280
G GHNDPELLDG ML+AI RFL TAVLG
Sbjct 245 GAGHNDPELLDGPQMLEAIERFLRHTAVLG 274
>gi|284990955|ref|YP_003409509.1| hypothetical protein Gobs_2467 [Geodermatophilus obscurus DSM
43160]
gi|284064200|gb|ADB75138.1| conserved hypothetical protein [Geodermatophilus obscurus DSM
43160]
Length=266
Score = 259 bits (661), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 136/250 (55%), Positives = 167/250 (67%), Gaps = 1/250 (0%)
Query 27 WSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVCNG 86
W+ QRRL+Y P+ GPVP+A+ +P GRDV + T DG+ LG W+ P + PAVLV NG
Sbjct 15 WAFQRRLVYLPAGGPVPAAADAVPGGRDVELTTADGLTLGAWFVPGPTA-DAPAVLVANG 73
Query 87 NAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDVD 146
N G R MRA LA AL GL+VLLFDYRGYGGNPG PSE+GLA D RAA+ L ++ V
Sbjct 74 NGGHRGMRAPLARALSAAGLAVLLFDYRGYGGNPGSPSEEGLALDVRAARSHLLEEAGVP 133
Query 147 PARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYPS 206
R+ Y+GESLG AV LAV PPA L+LRSPF LA VG VHYP+LP+R LL D YP
Sbjct 134 EERLVYYGESLGCAVVTELAVDHPPAGLLLRSPFVDLAAVGEVHYPFLPVRSLLRDRYPV 193
Query 207 IERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDGRVML 266
++A V AP V+ G +D IVP S ++ AAA+ R + VPG GHND LLDG ++
Sbjct 194 AAQVAEVRAPTTVVYGTADAIVPPEQSRQVADAAAQLHRRIEVPGAGHNDAVLLDGGALV 253
Query 267 DAIRRFLTET 276
DA+ T T
Sbjct 254 DAVVELATAT 263
>gi|134099491|ref|YP_001105152.1| hypothetical protein SACE_2949 [Saccharopolyspora erythraea NRRL
2338]
gi|133912114|emb|CAM02227.1| hypothetical protein SACE_2949 [Saccharopolyspora erythraea NRRL
2338]
Length=253
Score = 249 bits (636), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 136/251 (55%), Positives = 165/251 (66%), Gaps = 2/251 (0%)
Query 22 VIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAV 81
V+ W+ QRRLIYFP P P A+SV+ R+VV+ T DG+RLG WY P G AV
Sbjct 2 VLGLAWAYQRRLIYFPVGRP-PPAASVIEGAREVVLSTGDGLRLGAWYVPGRGGAGETAV 60
Query 82 LVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSG 141
LV NGNAG+RS+RA LA AL GL+VLLFDYRGYGGNPG PSEQGLA D RAA +L
Sbjct 61 LVANGNAGERSLRAPLADALARRGLAVLLFDYRGYGGNPGTPSEQGLALDVRAAHRYLVE 120
Query 142 QSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLL 201
++ P R+ Y+GESLGAAV LA PP LVLRSPFT LA VG HYP+LP+R LL
Sbjct 121 EAGFGPDRLVYYGESLGAAVVTELAAHSPPRGLVLRSPFTDLAAVGRYHYPYLPVRMLLR 180
Query 202 DHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLD 261
D YP +A V PV+V+ G +D +VPA S R VA + V +PG HND LLD
Sbjct 181 DRYPLTTHLAKVRRPVIVVYGTADSVVPAAQS-RAVAESVPGATAVAIPGADHNDLALLD 239
Query 262 GRVMLDAIRRF 272
G +++A+ +
Sbjct 240 GPEIVEAVVKL 250
>gi|291008503|ref|ZP_06566476.1| hypothetical protein SeryN2_28623 [Saccharopolyspora erythraea
NRRL 2338]
Length=266
Score = 247 bits (631), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 135/246 (55%), Positives = 163/246 (67%), Gaps = 2/246 (0%)
Query 27 WSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVCNG 86
W+ QRRLIYFP P P A+SV+ R+VV+ T DG+RLG WY P G AVLV NG
Sbjct 20 WAYQRRLIYFPVGRP-PPAASVIEGAREVVLSTGDGLRLGAWYVPGRGGAGETAVLVANG 78
Query 87 NAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDVD 146
NAG+RS+RA LA AL GL+VLLFDYRGYGGNPG PSEQGLA D RAA +L ++
Sbjct 79 NAGERSLRAPLADALARRGLAVLLFDYRGYGGNPGTPSEQGLALDVRAAHRYLVEEAGFG 138
Query 147 PARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYPS 206
P R+ Y+GESLGAAV LA PP LVLRSPFT LA VG HYP+LP+R LL D YP
Sbjct 139 PDRLVYYGESLGAAVVTELAAHSPPRGLVLRSPFTDLAAVGRYHYPYLPVRMLLRDRYPL 198
Query 207 IERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDGRVML 266
+A V PV+V+ G +D +VPA S R VA + V +PG HND LLDG ++
Sbjct 199 TTHLAKVRRPVIVVYGTADSVVPAAQS-RAVAESVPGATAVAIPGADHNDLALLDGPEIV 257
Query 267 DAIRRF 272
+A+ +
Sbjct 258 EAVVKL 263
>gi|284030834|ref|YP_003380765.1| hypothetical protein Kfla_2901 [Kribbella flavida DSM 17836]
gi|283810127|gb|ADB31966.1| conserved hypothetical protein [Kribbella flavida DSM 17836]
Length=292
Score = 229 bits (583), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 129/244 (53%), Positives = 160/244 (66%), Gaps = 0/244 (0%)
Query 26 IWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVCN 85
IW QRRLIY P + SA++ LP RDVV++ DG+RLG W P AVLV
Sbjct 38 IWGFQRRLIYLPDSAEPGSAATALPGARDVVLDAGDGVRLGAWLVPAGGPDRSVAVLVAA 97
Query 86 GNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDV 145
GNAG+R+ RA LA AL GL+VLLFDYRGYGG+ GRPSE+GLA D RAAQ +L+ Q+
Sbjct 98 GNAGNRASRAPLARALAAEGLTVLLFDYRGYGGSDGRPSERGLAQDVRAAQRYLAEQAGF 157
Query 146 DPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYP 205
P+R Y+GESLGAAV LA + P LVLRSPF LA VG VHYP+LP+R LL D +P
Sbjct 158 PPSRTLYYGESLGAAVVTELATEIAPGGLVLRSPFVDLASVGKVHYPFLPMRLLLRDKFP 217
Query 206 SIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDGRVM 265
E++A+V PV V+ G D IVP S + AAA + K V V G HNDP+L+ G+ +
Sbjct 218 LAEQLATVKVPVTVVLGSEDSIVPPDQSRAVAAAAPDLKSLVEVTGADHNDPDLVHGKQL 277
Query 266 LDAI 269
A+
Sbjct 278 AAAV 281
>gi|111020471|ref|YP_703443.1| hypothetical protein RHA1_ro03482 [Rhodococcus jostii RHA1]
gi|110820001|gb|ABG95285.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=273
Score = 224 bits (572), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 121/244 (50%), Positives = 148/244 (61%), Gaps = 0/244 (0%)
Query 26 IWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVCN 85
+W QRRLIY+P PVP A ++ D+ + T DG+ LG WY P SGG VLV
Sbjct 19 VWVLQRRLIYYPDNSPVPPADRLIAGAEDITLTTSDGLELGAWYVPPASGGPRMTVLVAA 78
Query 86 GNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDV 145
GNAG+R+ RA LA L G + LLFDYRGYGGNPG P E GLA D RAA +L + V
Sbjct 79 GNAGNRADRALLASDLAAAGFATLLFDYRGYGGNPGHPGEDGLALDVRAAHRYLVDERRV 138
Query 146 DPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYP 205
P R+ YFGESLG V LA PPA L+LRSPF LA VGA HYP+LP+R LL D +P
Sbjct 139 PPERLLYFGESLGTGVVTELATGHPPAGLLLRSPFVDLASVGARHYPFLPVRLLLRDRFP 198
Query 206 SIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDGRVM 265
E +A + P V+ G +D +VP S R+ AA P VV+ G GHND + G +
Sbjct 199 VAEYVARIDVPTTVVYGTADSVVPPDQSARVADAARGPVETVVLQGAGHNDDVMFGGAEI 258
Query 266 LDAI 269
+ AI
Sbjct 259 VRAI 262
>gi|333922116|ref|YP_004495697.1| hypothetical protein AS9A_4464 [Amycolicicoccus subflavus DQS3-9A1]
gi|333484337|gb|AEF42897.1| hypothetical protein AS9A_4464 [Amycolicicoccus subflavus DQS3-9A1]
Length=277
Score = 220 bits (560), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 115/251 (46%), Positives = 154/251 (62%), Gaps = 2/251 (0%)
Query 27 WSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGP--AVLVC 84
W+ RRLIY+P PVP AS+++ DV T DG+ L W P + + VL+
Sbjct 27 WAMHRRLIYYPDDLPVPPASALIHGAEDVQFTTDDGLTLHAWLVPPATDVTSRDITVLMA 86
Query 85 NGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSD 144
+GNAG+R+ RA LA L G++ LL DYRGYGGN G+PSEQGLA DARAA +L
Sbjct 87 HGNAGNRADRAPLAAELARRGIATLLLDYRGYGGNAGQPSEQGLALDARAAYWYLRNNRG 146
Query 145 VDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHY 204
V P R+ YFGESLG V LA++ PP +VLRSPFT L EV +HYP LP + LL D +
Sbjct 147 VAPERMIYFGESLGCGVVAELALRYPPGGVVLRSPFTDLVEVAKLHYPMLPAQLLLRDRF 206
Query 205 PSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDGRV 264
+E + + P +V+ G SD I+PA +S ++ A VV+PGVGHNDP +L G
Sbjct 207 RVLEAVRKITVPTVVVYGASDVIIPAEMSAKVADATRNLNSTVVMPGVGHNDPHMLVGEE 266
Query 265 MLDAIRRFLTE 275
++DA+ + +
Sbjct 267 LIDAVESLIPD 277
>gi|226362689|ref|YP_002780467.1| hypothetical protein ROP_32750 [Rhodococcus opacus B4]
gi|226241174|dbj|BAH51522.1| hypothetical membrane protein [Rhodococcus opacus B4]
Length=273
Score = 214 bits (545), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 117/236 (50%), Positives = 144/236 (62%), Gaps = 0/236 (0%)
Query 27 WSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVCNG 86
W QR+LIY+P PVP+A ++ DV + T DG+ LG WY P G VLV G
Sbjct 20 WVLQRKLIYYPDTRPVPTAGGLIAGAEDVTLTTSDGLELGAWYVPPAVGEPRMTVLVAAG 79
Query 87 NAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDVD 146
NAG+R+ RA LA L G + LLFDYRGYGGNPGRPSE+GLA D RAA+ +L + V
Sbjct 80 NAGNRADRALLASDLAAAGFATLLFDYRGYGGNPGRPSEEGLARDVRAARRYLVDERRVP 139
Query 147 PARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYPS 206
P R+ YFGESLG V LA + PPA L+LRSPF LA VG HYP+LP+ LL D +P
Sbjct 140 PDRLLYFGESLGTGVVTELATEHPPAGLLLRSPFVDLAAVGRHHYPFLPVGLLLRDRFPV 199
Query 207 IERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDG 262
E +A V P V+ G +D +VP S R+ AA VV+ G GHND + G
Sbjct 200 AEHVARVDVPTTVVYGTADVVVPPDQSARVAEAALGDVDTVVLAGAGHNDDVMFGG 255
>gi|291301493|ref|YP_003512771.1| hypothetical protein Snas_4026 [Stackebrandtia nassauensis DSM
44728]
gi|290570713|gb|ADD43678.1| conserved hypothetical protein [Stackebrandtia nassauensis DSM
44728]
Length=278
Score = 208 bits (530), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 117/248 (48%), Positives = 149/248 (61%), Gaps = 11/248 (4%)
Query 27 WSQQRRLIYFP--SAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVC 84
W QR+LIYFP SA PVP +++ +V + T D ++L W F T AVLV
Sbjct 25 WWFQRQLIYFPDTSAPPVPDSAT------EVELRTSDDLKLAAWQFAPTGADRKTAVLVA 78
Query 85 NGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSD 144
NGN G+R R LA AL G +VL+FDYRGYGGNPG P E GL ADA+AA + L+G +
Sbjct 79 NGNGGNRLNRIGLAEALTAKGFTVLVFDYRGYGGNPGSPDEDGLYADAKAALDHLTGPAG 138
Query 145 VDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHY 204
D RI YFGESLG V LA+ PPAA+VLRSPFTSL +VG HYP+LP+R LL + Y
Sbjct 139 FDTDRIVYFGESLGCGVVSKLALDHPPAAMVLRSPFTSLPDVGQRHYPYLPVRLLLTETY 198
Query 205 PSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVV---VPGVGHNDPELLD 261
P + P+LV G D IVP LS+R+ +A V + G HN+ EL+
Sbjct 199 PVESNVTKTGVPLLVAYGTGDSIVPPDLSKRVAESAENSGAEVTKLAIDGADHNELELVG 258
Query 262 GRVMLDAI 269
G ++D +
Sbjct 259 GAEVIDGV 266
>gi|333919123|ref|YP_004492704.1| hypothetical protein AS9A_1452 [Amycolicicoccus subflavus DQS3-9A1]
gi|333481344|gb|AEF39904.1| hypothetical protein AS9A_1452 [Amycolicicoccus subflavus DQS3-9A1]
Length=268
Score = 205 bits (522), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 122/244 (50%), Positives = 151/244 (62%), Gaps = 4/244 (1%)
Query 27 WSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVCNG 86
W QR+LIY P +G VP A+ + RDV + T DG+ L WY P G+ P VLV G
Sbjct 20 WLFQRQLIYLPMSGDVPPAAEAVDGARDVALRTADGLELSAWYIPAAEPGA-PVVLVAPG 78
Query 87 NAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDVD 146
NAG+RS R LA GL VLL +YRGYGGNPG PSE GLAADA AA +L+ ++
Sbjct 79 NAGNRSHRTPLARGFAEDGLGVLLLEYRGYGGNPGSPSETGLAADADAAYAFLTEVENLP 138
Query 147 PARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYPS 206
P ++ YFGESLGA V LA + PAA+VLRSPFTSLA+VGA HYP+LP+R LL D YP
Sbjct 139 PEQLIYFGESLGAGVVTALATRHQPAAMVLRSPFTSLADVGARHYPFLPVRALLKDQYPV 198
Query 207 IERIASVHA-PVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDGRVM 265
+E +A + PV VIAG D IVP S + AA + +P HND L G +
Sbjct 199 LENVAELRGVPVTVIAGSRDSIVPLDQSHTVAEAAG--TTVIEIPDADHNDAILNYGPEV 256
Query 266 LDAI 269
+ A+
Sbjct 257 VSAV 260
>gi|209964387|ref|YP_002297302.1| hypothetical protein RC1_1069 [Rhodospirillum centenum SW]
gi|209957853|gb|ACI98489.1| conserved hypothetical protein [Rhodospirillum centenum SW]
Length=289
Score = 201 bits (511), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 115/243 (48%), Positives = 150/243 (62%), Gaps = 3/243 (1%)
Query 14 IVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHT 73
++ LVA G + +++ QR+L YFPS V A LP V V T DG+ + GWY P
Sbjct 7 LIVLVAGGGLATLYANQRKLQYFPSTAVVVPADWGLPDFSVVTVTTADGVGIDGWYAPAA 66
Query 74 SGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADAR 133
+G P V++ +GNAG +RA+ A L G VLL YRGYGGNPG+P E GL ADAR
Sbjct 67 AGR--PTVVLFHGNAGHLGLRADKARVLRDAGFGVLLAGYRGYGGNPGQPDEPGLMADAR 124
Query 134 AAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPW 193
A ++L Q V R+ +GESLG VAV +A +R LVL +P+TS+ +V A HYP+
Sbjct 125 AQLDFLVEQG-VSGQRVVLYGESLGTGVAVRMATERRVGGLVLEAPYTSMTDVAAAHYPF 183
Query 194 LPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVG 253
LP+R LL D Y S+ RI + AP+LV+ D +VPA LS+ L AA EPK V +PG G
Sbjct 184 LPVRLLLRDRYDSLSRIDRIAAPLLVVVAQRDAVVPAALSDTLFRAAPEPKYIVRLPGAG 243
Query 254 HND 256
HND
Sbjct 244 HND 246
>gi|289208235|ref|YP_003460301.1| alpha/beta hydrolase fold protein [Thioalkalivibrio sp. K90mix]
gi|288943866|gb|ADC71565.1| alpha/beta hydrolase fold protein [Thioalkalivibrio sp. K90mix]
Length=285
Score = 194 bits (493), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 109/265 (42%), Positives = 156/265 (59%), Gaps = 2/265 (0%)
Query 14 IVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHT 73
I+A+ + V+ I+ Q RLIY PS+ V S +++ DV +ET+DG+RL GWY P
Sbjct 10 ILAVGYALVVGLIYLTQDRLIYMPSSNVVGSPANIGLEYEDVALETEDGVRLHGWYLPGP 69
Query 74 SGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADAR 133
+ P +L +GNAG+ R E H LGL+VL+ DYRGYG + GRP E+G DAR
Sbjct 70 ED-NAPVLLFLHGNAGNIGHRLESLEQFHHLGLAVLIIDYRGYGQSQGRPHEEGTYEDAR 128
Query 134 AAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPW 193
AA WL + +P I FG SLGAAVA LA + PAA++L + FTS A++GA YPW
Sbjct 129 AAWNWLREHLEYEPEEIVLFGRSLGAAVAARLAETKSPAAVILEAAFTSAADLGAEVYPW 188
Query 194 LPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVG 253
LP+R L+ Y + R+ ++ AP+L D+IVP +ERL+ A+ + + + G G
Sbjct 189 LPVRALIRHEYDVLGRVGAIEAPLLFAHAREDEIVPFAHAERLLEASGGEAQLMEMDG-G 247
Query 254 HNDPELLDGRVMLDAIRRFLTETAV 278
HND G ++ +R FL + +
Sbjct 248 HNDAFRATGSRYIEGLREFLEDAGL 272
>gi|258652534|ref|YP_003201690.1| hypothetical protein Namu_2325 [Nakamurella multipartita DSM
44233]
gi|258555759|gb|ACV78701.1| conserved hypothetical protein [Nakamurella multipartita DSM
44233]
Length=279
Score = 187 bits (476), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 117/256 (46%), Positives = 152/256 (60%), Gaps = 5/256 (1%)
Query 11 VVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYF 70
V+ +V + +GV + +WS QRRLIY P VP+AS++L DV + T+DG+ L Y
Sbjct 5 VLVLVGALLAGVAV-LWSGQRRLIYQPDTSAVPAASALLDDALDVTLTTEDGVALRALYV 63
Query 71 ----PHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQ 126
P G VLV GN G+R+ R LA AL G VLL DYRGYGGNPGRPSE
Sbjct 64 RAPVPRDPAGCRSTVLVAPGNGGNRAGRLPLARALREAGFGVLLLDYRGYGGNPGRPSED 123
Query 127 GLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEV 186
GLAADARAA +L+G + + + Y GESLG AV LA + PPAAL+LRSPFT LA+V
Sbjct 124 GLAADARAAYAFLTGDAGLSADELIYLGESLGGAVVTRLATEHPPAALLLRSPFTELADV 183
Query 187 GAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRY 246
P LP+R LL D +P ++ ++ P V+ G +D +VP LS + A +A
Sbjct 184 AQRQVPVLPVRWLLRDRFPVVDLTVALPVPTTVVYGTADTLVPPALSLTVAARSAGDPVV 243
Query 247 VVVPGVGHNDPELLDG 262
+ + G HNDP L G
Sbjct 244 IAIEGADHNDPALTHG 259
>gi|258593749|emb|CBE70090.1| putative enzyme (3.4.-) [NC10 bacterium 'Dutch sediment']
Length=275
Score = 181 bits (458), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/246 (40%), Positives = 143/246 (59%), Gaps = 3/246 (1%)
Query 30 QRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVCNGNAG 89
+ LI+FP + ++ A ++ TQDG+RL GW+ P GS +L +GN G
Sbjct 23 ENSLIFFPDKRIEATPHNLDLAYEEISFTTQDGVRLNGWWIP--GAGSPFTLLWFHGNGG 80
Query 90 DRSMRAELAVALHGL-GLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDVDPA 148
+ S R + H L G S+ +FDYRGYG + GR SE+G D AA +L + DVDP
Sbjct 81 NISYRLDNIKRRHDLLGTSIFIFDYRGYGRSEGRTSEEGTYRDGDAAIRYLRSRGDVDPN 140
Query 149 RIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYPSIE 208
+I + GESLG+AVAV +A++ AALVL SPF S+AE+ V +P LP+ + Y ++
Sbjct 141 KIVFLGESLGSAVAVEMAIRHGCAALVLESPFLSIAEMAKVTFPLLPIGSFIQTKYDTLS 200
Query 209 RIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDGRVMLDA 268
+I V P+L++ G SD+IVP +RL +A EPK + + HND ++ G L+
Sbjct 201 KIGQVSVPLLIVHGDSDEIVPFRHGQRLFESANEPKEFYRIKDAHHNDLYVVGGTAYLET 260
Query 269 IRRFLT 274
+ RFL+
Sbjct 261 LNRFLS 266
>gi|91787705|ref|YP_548657.1| hypothetical protein Bpro_1826 [Polaromonas sp. JS666]
gi|91696930|gb|ABE43759.1| conserved hypothetical protein [Polaromonas sp. JS666]
Length=282
Score = 177 bits (448), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 112/274 (41%), Positives = 154/274 (57%), Gaps = 15/274 (5%)
Query 9 LPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVE-------TQD 61
L V+ + V + ++ ++ Q L+YFP AG +L +DV ++ T+D
Sbjct 5 LKVLTVGGAVYAVLLAIVFVLQGNLLYFPDAG-----RQILQTPKDVGLDYEQVWLTTED 59
Query 62 GMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPG 121
G+R+ WY P + AVL+ +GNAG+ S R + A+ H LG S+LL +YRGYG + G
Sbjct 60 GVRIEAWYVPAPAARG--AVLLAHGNAGNISHRLDYALMFHRLGYSLLLLEYRGYGRSEG 117
Query 122 RPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFT 181
+PSE+G ADARAA L Q P RIA GESLG A+ LA P ALVL S F
Sbjct 118 KPSEEGTYADARAAWRHLVAQRGFPPERIALVGESLGGAIVARLATAERPGALVLASTFV 177
Query 182 SLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAA 241
S+ E+ A YPWLP+R L Y ++E +A V +PVL+ DDIVP ERL AAA
Sbjct 178 SVPELAAELYPWLPVRWLARYRYDALEALARVSSPVLIAHSRQDDIVPFRHGERLFAAAK 237
Query 242 EPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTE 275
PK ++ + G GHN+ L +A+ RFL +
Sbjct 238 GPKAFLELAG-GHNEGFLFTREAWREALGRFLAQ 270
>gi|218782678|ref|YP_002433996.1| hypothetical protein Dalk_4851 [Desulfatibacillum alkenivorans
AK-01]
gi|218764062|gb|ACL06528.1| conserved hypothetical protein [Desulfatibacillum alkenivorans
AK-01]
Length=270
Score = 175 bits (444), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 103/272 (38%), Positives = 142/272 (53%), Gaps = 10/272 (3%)
Query 3 LKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDG 62
+KR V+ ++ALV G + L+Y P + + D+ + + +G
Sbjct 1 MKRWVEAAVLILLALVFYGCL-------SSLVYHPDKEISFTPQELGLEHEDLYMASANG 53
Query 63 MRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGR 122
+ W+FP + + VL C+GNAG+ S R A H L LS LLFDY+G+G + GR
Sbjct 54 KMINAWFFPCENARA--VVLFCHGNAGNISDRVSQAWMFHKLELSTLLFDYQGFGQSQGR 111
Query 123 PSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTS 182
PSEQG DARAA ++L + P RI FG+SLG AVA+ LA Q P L + S FTS
Sbjct 112 PSEQGTFDDARAAWDYLVQEKGFPPDRIIVFGKSLGGAVAIELATQVKPGLLFVDSSFTS 171
Query 183 LAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAE 242
+V HYPW P L Y S+ RI +V APV D+++P E L AA E
Sbjct 172 TKDVAKAHYPWAPGFLLYSWKYDSLSRIPNVQAPVCFFHSKQDEVIPFIQGEALFGAAPE 231
Query 243 PKRYVVVPGVGHNDPELLDGRVMLDAIRRFLT 274
PK +V + G HND + GR+ DA+ F+
Sbjct 232 PKAFVEISG-SHNDGFMKSGRLYTDAVDAFIK 262
>gi|334336819|ref|YP_004541971.1| hypothetical protein Isova_1309 [Isoptericola variabilis 225]
gi|334107187|gb|AEG44077.1| hypothetical protein Isova_1309 [Isoptericola variabilis 225]
Length=235
Score = 173 bits (439), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 101/219 (47%), Positives = 125/219 (58%), Gaps = 1/219 (0%)
Query 15 VALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTS 74
+AL A + W Q L+Y P G + V+ G DV + T DG+ L W+ T+
Sbjct 1 MALTAGLAVGAAWLAQDALVYHPDRGSPGPTADVIDGGEDVTLTTDDGLELQAWFVRPTA 60
Query 75 GGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARA 134
AVLV GN G+R RA LA L G +VLL DYRGYGG PGRPSE+GL DA A
Sbjct 61 ADRRAAVLVAPGNGGNRLGRAALAELLAERGFAVLLLDYRGYGGKPGRPSERGLLRDALA 120
Query 135 AQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWL 194
AQ L+ + D R Y GESLG V L Q PPA L+LRSPFTSL + GA HYP+L
Sbjct 121 AQRALADR-DYPADRTIYLGESLGTGVVAALQEQVPPAGLLLRSPFTSLVDAGAHHYPFL 179
Query 195 PLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLS 233
P+R LL D + +E +A PV V+ G D++VP S
Sbjct 180 PVRALLRDRFDVLEHVAVSDVPVTVVHGDRDEVVPPAQS 218
>gi|269956104|ref|YP_003325893.1| hypothetical protein Xcel_1304 [Xylanimonas cellulosilytica DSM
15894]
gi|269304785|gb|ACZ30335.1| conserved hypothetical protein [Xylanimonas cellulosilytica DSM
15894]
Length=285
Score = 172 bits (437), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 113/272 (42%), Positives = 154/272 (57%), Gaps = 7/272 (2%)
Query 6 CRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRL 65
RA V +V + + + + Q +L+Y P+ P+P+A VLP D+ + T DG+RL
Sbjct 2 LRAAVTVGVVLALLAASPFALRAMQHQLVYHPTRSPLPAAEQVLPGAEDLELTTDDGLRL 61
Query 66 GGWYFPHTSGGSG--PAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRP 123
W+ P + S AVL+ +GN G+ + RA LA L G +VLL YRGY GNPG P
Sbjct 62 VSWFVPPSPAASARDEAVLLAHGNGGNLAGRARLAAELADRGFAVLLVGYRGYAGNPGTP 121
Query 124 SEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSL 183
++ GL DA A Q L + AR Y GES+G V VGLA Q PPA LVLRSPFTSL
Sbjct 122 AQDGLVLDALAGQRALESRG-FPAARTIYLGESIGTGVVVGLAAQVPPAGLVLRSPFTSL 180
Query 184 AEVGAVHYPWL-PLRRLLLD--HYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA 240
A+V P P+ R +LD YP E++A+ PV V++G +D++VP S+ + AA
Sbjct 181 ADVAGSVVPLPGPVLRFILDRNEYPLAEQVAASDVPVTVLSGTADEVVPHAQSQAVAQAA 240
Query 241 AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRF 272
+VV+ G HND L G V+ DA+ R
Sbjct 241 THLVEHVVLDGARHNDGVWL-GPVVADAVERL 271
>gi|328954226|ref|YP_004371560.1| alpha/beta hydrolase fold protein [Desulfobacca acetoxidans DSM
11109]
gi|328454550|gb|AEB10379.1| alpha/beta hydrolase fold protein [Desulfobacca acetoxidans DSM
11109]
Length=277
Score = 169 bits (428), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 100/273 (37%), Positives = 149/273 (55%), Gaps = 15/273 (5%)
Query 9 LPVVAIVALVA--SGVI--MFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMR 64
L V+A + L+ G++ MFI+ RL Y PS + +++ T G+R
Sbjct 9 LAVMAFLTLLTMHEGLVERMFIFFPTSRLDYLPSQYGLNC--------QEIFFTTPTGLR 60
Query 65 LGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPS 124
L WY + P +L C+GN G+ S R + A +GL V LFDYRGYG + G PS
Sbjct 61 LHAWY--AEAAPKAPVILYCHGNGGNISHRLGIMAAFRKVGLGVFLFDYRGYGLSQGVPS 118
Query 125 EQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLA 184
E G+ DA AA +L + + P +IA G SLG +AV LA + P AL+L S FT++
Sbjct 119 ENGVYEDAWAAYRYLVTEIGLSPQQIAIAGHSLGGVIAVDLASREPCRALILESTFTNVG 178
Query 185 EVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPK 244
++G ++ WLP RRL D + ++ RI + P L++ G D IVP L ++L A EPK
Sbjct 179 DMGRYYFAWLPTRRLWRDKFNAVRRIQPLKVPKLLVHGECDRIVPCYLGKKLFDLAPEPK 238
Query 245 RYVVVPGVGHNDPELLDGRVMLDAIRRFLTETA 277
+ + G GHN+ +++ G ++RF+ ETA
Sbjct 239 IFYQLAGAGHNNLDVVGGDAYFLFLKRFI-ETA 270
>gi|317151894|ref|YP_004119942.1| alpha/beta hydrolase fold protein [Desulfovibrio aespoeensis
Aspo-2]
gi|316942145|gb|ADU61196.1| alpha/beta hydrolase fold protein [Desulfovibrio aespoeensis
Aspo-2]
Length=295
Score = 168 bits (425), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 107/274 (40%), Positives = 146/274 (54%), Gaps = 13/274 (4%)
Query 9 LPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGW 68
L +VA++A + ++++ QRRL+Y P+ + + + A DV + G L GW
Sbjct 10 LKIVAVLAAAYVCLTVWVYLSQRRLLYQPTRTVTATPADIGLAYEDVRLVNALGTELHGW 69
Query 69 YFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGL 128
+ PH +L C+GN G+ S R H LGLSVL+FDY GYG + G PSE
Sbjct 70 WLPHPQARF--TLLFCHGNGGNVSHRLHSLRLFHDLGLSVLIFDYSGYGRSLGEPSEVAT 127
Query 129 AADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQ---------RPPAALVLRSP 179
ADARAA +WL+ Q +DP + FG SLG AVA LA P A L+L S
Sbjct 128 RADARAAWDWLA-QRGIDPGSVILFGRSLGGAVAARLAADVVADVAAEGTPVAGLILEST 186
Query 180 FTSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAA 239
FTS+ ++GA YPWLP+R L+ D Y S +A + P L I D+IVP L L
Sbjct 187 FTSVPDMGARLYPWLPVRLLVRDRYDSTRALAGLQTPALFIHSPDDEIVPHALGLALYDG 246
Query 240 AAEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFL 273
PK ++ + G GHND LL G+ + + RFL
Sbjct 247 YQGPKSFLALTG-GHNDGFLLSGQDYVAGLVRFL 279
>gi|292493769|ref|YP_003529208.1| hypothetical protein Nhal_3806 [Nitrosococcus halophilus Nc4]
gi|291582364|gb|ADE16821.1| conserved hypothetical protein [Nitrosococcus halophilus Nc4]
Length=280
Score = 167 bits (424), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 104/268 (39%), Positives = 142/268 (53%), Gaps = 15/268 (5%)
Query 15 VALVASG---VIMFIWSQQRRLIYFPSAGPVPS-ASSVLPAG-----RDVVVETQDGMRL 65
V L AS +++ ++ Q RL+YFP +PS A P V + T+DG+ L
Sbjct 10 VLLFASAYGILVLLVYFLQPRLLYFPH---IPSRAVETTPTQVGLNFETVTLTTEDGVTL 66
Query 66 GGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSE 125
GWY P S VL +GNAG+ S R + H LGLS + DYRGYG + GRP+E
Sbjct 67 EGWYLP--SSKERGTVLFFHGNAGNISHRLDSLSLFHHLGLSSFIIDYRGYGRSQGRPTE 124
Query 126 QGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAE 185
G DA+AA +L+ Q + I FG SLG A+A L P AL++ S FTS+ +
Sbjct 125 TGTYLDAQAAWHYLTQQRQIPEEEIVLFGRSLGGAIAAQLTDDTQPGALIVESAFTSIPD 184
Query 186 VGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKR 245
+ A YP+LP R L YP+ + PVL+I D+I+P T + L AA PK+
Sbjct 185 LAAELYPFLPARWLTRFRYPTQNFLQKATCPVLIIHSRDDEIIPFTHGQALFKAAPFPKQ 244
Query 246 YVVVPGVGHNDPELLDGRVMLDAIRRFL 273
++V+ G GHND L+D L I FL
Sbjct 245 FLVLNG-GHNDAFLIDDEKYLSGIEAFL 271
>gi|220935197|ref|YP_002514096.1| hypothetical protein Tgr7_2029 [Thioalkalivibrio sulfidophilus
HL-EbGr7]
gi|219996507|gb|ACL73109.1| conserved hypothetical protein [Thioalkalivibrio sulfidophilus
HL-EbGr7]
Length=276
Score = 167 bits (422), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 109/271 (41%), Positives = 148/271 (55%), Gaps = 4/271 (1%)
Query 6 CRALPVVAIVALVASGVIM-FIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMR 64
+AL + +VA A GV++ ++ +Q LIY P + V + + DV + T DG+R
Sbjct 2 IKALIHLCLVAAGAYGVLVGLVYFKQDGLIYLPLSTLVTTPTEHGMDYEDVYLTTDDGVR 61
Query 65 LGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPS 124
L GW+ P + +L +GNAG+ S R LGLSV + DYRGYG + GRPS
Sbjct 62 LHGWFVP--APEPRGVLLFFHGNAGNISHRMASIRIFRELGLSVFIIDYRGYGQSEGRPS 119
Query 125 EQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLA 184
E GL DARAA WL ++ I FG SLGAAVAV LA + PP AL+L S FTS A
Sbjct 120 EAGLRRDARAAWAWLRETREIPAREIVVFGRSLGAAVAVDLASEHPPGALILESAFTSAA 179
Query 185 EVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPK 244
++GA YPWLP+ RLL + IE + V P L+ D+IV + RL+ A +
Sbjct 180 DLGAEVYPWLPVDRLLRHRHEVIESLPQVRVPTLIAHSRQDEIVSFDHARRLMDVAHDGA 239
Query 245 RYVVVPGVGHNDPELLDGRVMLDAIRRFLTE 275
+ + G GHND L G+ + + FL E
Sbjct 240 VLLEMEG-GHNDGFLRTGQRYVRGLGDFLEE 269
>gi|82703211|ref|YP_412777.1| hypothetical protein Nmul_A2092 [Nitrosospira multiformis ATCC
25196]
gi|82411276|gb|ABB75385.1| conserved hypothetical protein Rv2307c [Nitrosospira multiformis
ATCC 25196]
Length=275
Score = 165 bits (418), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 99/267 (38%), Positives = 147/267 (56%), Gaps = 12/267 (4%)
Query 11 VVAIVALVASGVIMFIWSQQRRLIYFPSAGP----VPSASSVLPAGRDVVVETQDGMRLG 66
+ A++ +V + VI F Q L+Y+P G P S + A V +ET DG RL
Sbjct 10 MAALIYVVFAAVIFF---AQPSLVYYPEIGRGITGTPGESGL--AYESVELETADGERLH 64
Query 67 GWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQ 126
GW+ P + + VL +GNAG+ S R + + LG + +FDYRGYG + G+P+EQ
Sbjct 65 GWFVPASHAKA--TVLFFHGNAGNISQRIDYLSMFYRLGYNTFIFDYRGYGESSGKPTEQ 122
Query 127 GLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEV 186
G DA AA +++ + + PA + FGESLG A+A LA + P LVL S FTS+ ++
Sbjct 123 GTYRDAVAAWRYITEKKAIPPADVVLFGESLGGAIASWLAAREIPGVLVLTSAFTSVPDM 182
Query 187 GAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRY 246
GA YP+LP+RRL Y ++E + V PV + D+IVP + L AA PKR+
Sbjct 183 GAQLYPYLPIRRLSRFKYNTLEHLKDVSCPVFIAHSPQDEIVPFKQGQALYEAARNPKRF 242
Query 247 VVVPGVGHNDPELLDGRVMLDAIRRFL 273
+ + G GHN+ + A+ +F+
Sbjct 243 IELQG-GHNEGFIYTREDWAKALGKFI 268
>gi|149175241|ref|ZP_01853863.1| hypothetical protein PM8797T_20618 [Planctomyces maris DSM 8797]
gi|148845850|gb|EDL60191.1| hypothetical protein PM8797T_20618 [Planctomyces maris DSM 8797]
Length=279
Score = 164 bits (415), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 94/241 (40%), Positives = 131/241 (55%), Gaps = 3/241 (1%)
Query 30 QRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVCNGNAG 89
+R L+Y PS P P + D E +DG RL GW+ H + L C+GNAG
Sbjct 31 ERTLVYQPSPFPEPGSLPENLPFEDAWFEAEDGTRLHGWFLGHPKPRA--VALFCHGNAG 88
Query 90 DRSMRAE-LAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDVDPA 148
+ R E L + GL+++ FDYRGYG + G+PSE+G+ DARAA+ WL+ ++ V+
Sbjct 89 NIVSRGETLKILQERHGLAIMTFDYRGYGKSEGKPSERGILQDARAARAWLASRAGVEET 148
Query 149 RIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYPSIE 208
I G SLG AVAV LA Q LVL S F+SL + A H PW+ + S
Sbjct 149 EIVLMGRSLGGAVAVDLAAQDGARGLVLASTFSSLPDAAAHHMPWMFPNLNMTQRLNSAG 208
Query 209 RIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDGRVMLDA 268
+I + P+L G D ++P L +L AA EPK++ V+PG GHNDP+ + R + D
Sbjct 209 KIGNYSGPLLQSHGDKDLLIPIELGRKLFDAAGEPKQFFVLPGAGHNDPQPEEYRRVFDE 268
Query 269 I 269
Sbjct 269 F 269
>gi|149174556|ref|ZP_01853182.1| hypothetical protein PM8797T_09794 [Planctomyces maris DSM 8797]
gi|148846666|gb|EDL61003.1| hypothetical protein PM8797T_09794 [Planctomyces maris DSM 8797]
Length=337
Score = 163 bits (413), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 103/276 (38%), Positives = 147/276 (54%), Gaps = 29/276 (10%)
Query 30 QRRLIYFP---SAGPVPSASSVLPAGRDVVVETQDGMRLGGWYF-----PHTSGGSG--- 78
QR LIY P S+ + A++ ++ T+DG+ L GW+F T +
Sbjct 56 QRWLIYQPTRVSSLSIDQANAPFGVIHEISTTTEDGLDLKGWHFLAGQVACTDKAACDAE 115
Query 79 -----PAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADAR 133
P V++ +GN G+R R E L L L V FDYRGY NPG PS+ GL DAR
Sbjct 116 LDKGRPVVILLHGNGGNRLHRIEDCRLLASLNLHVFAFDYRGYAENPGSPSQTGLLKDAR 175
Query 134 AAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQ-----RPPAALVLRSPFTSLAEVGA 188
A ++ +DP+ I FGESLG VA LA + PPA L+LRS F+SL + +
Sbjct 176 AIWKYAVRDRKIDPSHIILFGESLGGGVATLLASELCEQNTPPAGLILRSTFSSLVDAAS 235
Query 189 VHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAE------ 242
H+PW+P+ LL D YP+ I ++ P+L++ G +D IVP L E+L AAA E
Sbjct 236 SHFPWIPVSLLLWDRYPNQRLIGNITCPILMVHGTADRIVPFELGEKLFAAAPENSASGI 295
Query 243 PKRYVVVPGVGHNDPELLDGR-VMLDAIRRFLTETA 277
PKR++ + +G ++ L + R M DA F ++ A
Sbjct 296 PKRFLKIE-LGTHNGLLYEARGKMRDAYHEFTSQLA 330
>gi|229819520|ref|YP_002881046.1| hypothetical protein Bcav_1023 [Beutenbergia cavernae DSM 12333]
gi|229565433|gb|ACQ79284.1| conserved hypothetical protein [Beutenbergia cavernae DSM 12333]
Length=274
Score = 162 bits (411), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 107/246 (44%), Positives = 136/246 (56%), Gaps = 9/246 (3%)
Query 27 WSQQRRLIYFPS-AGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVLVCN 85
W QR L++ P P P++S V+ RDV++ T DG+ L W P G LV
Sbjct 20 WVFQRSLVFLPDRTTPPPASSDVVDGARDVLLHTSDGLELTAWEVPADPA-CGVTALVLP 78
Query 86 GNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDV 145
GN G+R+ RA L AL G+ VLL +YRGYGGNPG PSE GL DARAA L+ D
Sbjct 79 GNGGNRADRAGLVRALAERGMGVLLVEYRGYGGNPGSPSESGLRRDARAA---LAHLRDG 135
Query 146 DPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYP 205
+ Y GESLGAAVA LA PP L+LRSPFTSLA+ G Y +P+ LL D +
Sbjct 136 TTGSLLYVGESLGAAVATDLAAGEPPDGLLLRSPFTSLADAGRAAY-GVPVGWLLRDRFD 194
Query 206 SIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYV---VVPGVGHNDPELLDG 262
+ V AP+ V+ G +D IVP S + A V VVPG HND +L G
Sbjct 195 VRGAVVRVDAPLAVVYGDADHIVPPAQSREVADVAGSAGLDVTVSVVPGADHNDADLAQG 254
Query 263 RVMLDA 268
+ +++A
Sbjct 255 QALIEA 260
>gi|114776756|ref|ZP_01451799.1| hypothetical protein SPV1_11091 [Mariprofundus ferrooxydans PV-1]
gi|114552842|gb|EAU55273.1| hypothetical protein SPV1_11091 [Mariprofundus ferrooxydans PV-1]
Length=288
Score = 161 bits (408), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 96/248 (39%), Positives = 135/248 (55%), Gaps = 3/248 (1%)
Query 9 LPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGW 68
L + ++ L G ++++ S + IYFP+ V S + V + RD+ T DG++L GW
Sbjct 5 LRAIFLLLLAIGGTMLWMLSHEDHYIYFPTQEMVQSPAGVGLSFRDIWFTTADGVKLHGW 64
Query 69 YFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGL 128
Y PH +L +GNAG+ S R H +GLSV FDYRGYG + G PSE+GL
Sbjct 65 YIPHAHARF--TLLHLHGNAGNISQRLAQYRRWHAMGLSVFAFDYRGYGASEGTPSEEGL 122
Query 129 AADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGA 188
+DA AA L I G SLG AVA LA + P L L PFTSL ++
Sbjct 123 HSDAVAAWSLLQNPGYAAADNIIIAGRSLGCAVAARLAGEVNPVGLALEVPFTSLPDMAE 182
Query 189 VHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVV 248
YPWLPLR + + + S HAP+L+I+ +D+I+P +++++ AAA PK
Sbjct 183 AAYPWLPLRHFVRSRLDTEAAVRSQHAPLLLISAANDEIIPHEMADQIFAAANPPKLRGN 242
Query 249 VPGVGHND 256
+ G GHND
Sbjct 243 LAG-GHND 249
>gi|77920018|ref|YP_357833.1| putative enzyme (3.4.-) [Pelobacter carbinolicus DSM 2380]
gi|77546101|gb|ABA89663.1| putative enzyme (3.4.-) [Pelobacter carbinolicus DSM 2380]
Length=278
Score = 161 bits (408), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 96/268 (36%), Positives = 141/268 (53%), Gaps = 14/268 (5%)
Query 14 IVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHT 73
I L+ +G + + + R+ I+FP + ++ +V DG+RL GW+ P
Sbjct 11 IAVLILTGSVTPMHAMDRKYIFFPDPTLHANPNAAGLTFEEVYFPAADGVRLHGWFLPGK 70
Query 74 SGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADAR 133
+G P +L +GNAG+ S R + H LGLSV +FDYRGYG + G+ SE G D R
Sbjct 71 TGR--PLLLFAHGNAGNISHRIDNLAHFHRLGLSVFIFDYRGYGQSEGQISEVGSYEDIR 128
Query 134 AAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYP- 192
A WL + P ++ YFG SLGAAVA+ LA++ PPA LVL S FTS+ +G H P
Sbjct 129 GALAWLKSKG-WTPKQMLYFGRSLGAAVALQLALEEPPAGLVLESAFTSVPRMGWHHQPI 187
Query 193 ------WLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRY 246
W L Y ++ +I + P+L+ G D IVP ++++L A EPK
Sbjct 188 TYALLGWWALS----SRYDNLAKIGQLQCPLLMFQGTRDTIVPPKMAQQLFDRAPEPKTL 243
Query 247 VVVPGVGHNDPELLDGRVMLDAIRRFLT 274
++P GHN+ + G+ + R FL
Sbjct 244 YLIPDAGHNNTYDVGGKPYWEQWRSFLN 271
>gi|302039458|ref|YP_003799780.1| putative peptidase [Candidatus Nitrospira defluvii]
gi|300607522|emb|CBK43855.1| putative Peptidase [Candidatus Nitrospira defluvii]
Length=253
Score = 161 bits (408), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 93/220 (43%), Positives = 124/220 (57%), Gaps = 2/220 (0%)
Query 54 DVVVETQDGMRLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDY 113
DV + DG +L GWY ++ + P +L C+GNAG+ R + AL+ LGLSV LFDY
Sbjct 30 DVWFQAPDGTKLFGWYAEQSA--ASPVLLWCHGNAGNMIHRLDNLRALYRLGLSVFLFDY 87
Query 114 RGYGGNPGRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAA 173
RGYG + GRPSE GL DA A ++L+ + P R+ FG SLG AVA LA QRP
Sbjct 88 RGYGRSQGRPSENGLYRDAIGAYDYLTRIRRIRPERLMIFGRSLGGAVAGELATQRPAMG 147
Query 174 LVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLS 233
L+L S F S+ V HY LP+ LL + +R+ + P L + G DDI+P L
Sbjct 148 LLLESCFPSIEAVARHHYMGLPVHWLLEASFRLEDRLPHLSLPKLFVHGDRDDIIPIELG 207
Query 234 ERLVAAAAEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFL 273
+R AAA EPK + +V G HND + GR + F+
Sbjct 208 QRAFAAAKEPKEFYIVRGADHNDVPSVGGRAYFAKLSAFI 247
>gi|168699272|ref|ZP_02731549.1| hypothetical protein GobsU_07102 [Gemmata obscuriglobus UQM 2246]
Length=280
Score = 161 bits (407), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/275 (36%), Positives = 148/275 (54%), Gaps = 7/275 (2%)
Query 4 KRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGM 63
+R R V+ ++ + G+++ W +RRL++ P++ +DV ++ DG
Sbjct 11 RRARRWAVLFLITYL--GIVIVFWFLERRLVFVPTSTQEEWLEPEDRRSQDVSFDSADGN 68
Query 64 RLGGWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALH-GLGLSVLLFDYRGYGGNPGR 122
++ G + P + G AVLV NGN G+ + R LA L G VLLFDY GYG + G
Sbjct 69 KIAGRWIPPETPHHG-AVLVANGNGGNLTHRGGLAADLRLATGAGVLLFDYPGYGKSSGT 127
Query 123 PSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTS 182
PSE G A AA +WL+ + V +RI +GESLG AV LA +R ALVL FTS
Sbjct 128 PSENGCYAAGEAAYKWLTDEQKVATSRIILYGESLGGGTAVELATKREHRALVLIYTFTS 187
Query 183 LAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAE 242
L + +P+LP + L+ + ++ +IA PV + G +D +VP + SE+L AA +
Sbjct 188 LPDAAKNRFPFLPAKTLMRTRFDNLSKIAKCPRPVFFVHGRADTVVPFSHSEQLYVAANQ 247
Query 243 PKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETA 277
PK +V + G+GH L G + L A+ FL A
Sbjct 248 PKEFVRLDGIGHVR---LPGELYLPALVSFLNRHA 279
>gi|302342111|ref|YP_003806640.1| enzyme (3.4.-) [Desulfarculus baarsii DSM 2075]
gi|301638724|gb|ADK84046.1| putative enzyme (3.4.-) [Desulfarculus baarsii DSM 2075]
Length=270
Score = 160 bits (405), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/262 (38%), Positives = 141/262 (54%), Gaps = 8/262 (3%)
Query 12 VAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFP 71
V ++A SG + SQ I++P + + A DV E+ G+RL GW+ P
Sbjct 10 VLLMATWLSGWQRLVESQ----IFYPEKQIHYTPRDMGLAYEDVWFESAGGVRLHGWFVP 65
Query 72 HTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAAD 131
G + +L C+GNAG+ R + + L+ +G+SV +FDYRGYG + GRPSE+GL D
Sbjct 66 AAVGRT--VLLFCHGNAGNVGDRVDNIMRLNRIGISVFIFDYRGYGNSRGRPSEEGLYRD 123
Query 132 ARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHY 191
AA ++ + AR+ FG SLG AV +A + A L+L S FT L + +H+
Sbjct 124 VEAACNVAQARAKQEKARLVIFGRSLGGVAAVHVAARNHCAGLILESTFTHLGAMARIHF 183
Query 192 PWLPL-RRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVP 250
P +PL + L + + ++I++V AP+L G DDIVP L RL AA EPK +V +
Sbjct 184 P-MPLPEQWLSSRFNARKKISAVRAPILFFHGDQDDIVPLALGRRLFMAAPEPKEFVTLE 242
Query 251 GVGHNDPELLDGRVMLDAIRRF 272
G GHND L+ R F
Sbjct 243 GAGHNDTYLIGEDAYFAKFRAF 264
>gi|148358661|ref|YP_001249868.1| hypothetical protein LPC_0537 [Legionella pneumophila str. Corby]
gi|296108249|ref|YP_003619950.1| hypothetical protein lpa_03809 [Legionella pneumophila 2300/99
Alcoy]
gi|148280434|gb|ABQ54522.1| hypothetical protein LPC_0537 [Legionella pneumophila str. Corby]
gi|295650151|gb|ADG25998.1| hypothetical protein lpa_03809 [Legionella pneumophila 2300/99
Alcoy]
Length=265
Score = 160 bits (405), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/250 (39%), Positives = 129/250 (52%), Gaps = 8/250 (3%)
Query 9 LPVVAIVALVASG-VIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVV-VETQDGMRLG 66
L + + LV G VI ++ QR LIYFP+ P + VV + T+D + L
Sbjct 2 LKQIVLTGLVIIGIVITLMYLFQRHLIYFPNRH-TPKLEDYNASDMKVVSLRTKDNLHLK 60
Query 67 GWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQ 126
WY P + P +L +GNAG R L GL V L +YRGYGGNPG+P E+
Sbjct 61 SWYKP--ASKHRPTILYLHGNAGHIGYRMPLVREFIDAGLGVFLLEYRGYGGNPGKPGEK 118
Query 127 GLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEV 186
GL AD A E+L Q V R+ +GES+G VA LA + P A++L+SPFTSL +
Sbjct 119 GLYADGETAIEFLI-QHGVPSKRVILYGESIGTGVATHLATKYPVCAVILQSPFTSLTRL 177
Query 187 GAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRY 246
HYP L+ D Y S+ R+ +H P+LV+ G D IVP + A EPK+
Sbjct 178 AQYHYPLNFLKP--WDQYNSLARMKKIHVPILVLHGKLDQIVPYQEGLNVFNEANEPKKM 235
Query 247 VVVPGVGHND 256
V HND
Sbjct 236 VSFDDKEHND 245
>gi|296122668|ref|YP_003630446.1| hypothetical protein Plim_2421 [Planctomyces limnophilus DSM
3776]
gi|296015008|gb|ADG68247.1| conserved hypothetical protein [Planctomyces limnophilus DSM
3776]
Length=315
Score = 160 bits (405), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 96/253 (38%), Positives = 143/253 (57%), Gaps = 20/253 (7%)
Query 22 VIMFIWSQQRRLIYFPSAGPVPSASSVLPAG----RDVVVETQDGMRLGGWYFPHTSGGS 77
V + + + QR+LI+ P+ P + + AG +DV ++ D + L GWY+ +
Sbjct 35 VHLLLITFQRQLIFQPTKT-APLSGHLAGAGLIDVQDVKIKISDELTLHGWYYERPATAE 93
Query 78 GPA---VLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARA 134
PA ++ GN+G RS R E+ + L LG ++L+FDY+GY N G PSEQ A+DA+A
Sbjct 94 TPARQLLIYFPGNSGTRSDRQEICLDLLRLGYNILIFDYQGYAENQGSPSEQHFASDAQA 153
Query 135 AQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQ-----RPPAALVLRSPFTSLAEVGAV 189
++ + Q P +I FGES+G VA LA + PPAAL+L+S ++S+
Sbjct 154 IWKFATTQLGYSPEKITLFGESMGGGVATRLAAELSEAKSPPAALILKSTYSSIPATARY 213
Query 190 HYPWLPLRRLLL-DHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAE------ 242
HYP+LPL L + D +PSI+RI V +P+L G +D I P +ERL AAA E
Sbjct 214 HYPYLPLLSLFVWDPFPSIDRIGKVTSPILQFHGTADRITPYFEAERLFAAAPERSASQV 273
Query 243 PKRYVVVPGVGHN 255
K++V +P HN
Sbjct 274 AKQFVTIPEGSHN 286
>gi|54298593|ref|YP_124962.1| hypothetical protein lpp2657 [Legionella pneumophila str. Paris]
gi|53752378|emb|CAH13810.1| hypothetical protein lpp2657 [Legionella pneumophila str. Paris]
Length=265
Score = 159 bits (403), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 97/250 (39%), Positives = 131/250 (53%), Gaps = 8/250 (3%)
Query 9 LPVVAIVALVASG-VIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVV-VETQDGMRLG 66
L + + LV G VI ++ QR LIYFP+ P + VV + T+D + L
Sbjct 2 LKQIVLTGLVIIGIVITLMYLFQRHLIYFPNRH-TPKLEDYNASDMKVVSLRTKDNLHLK 60
Query 67 GWYFPHTSGGSGPAVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQ 126
WY P + P +L +GNAG R L GL V L +YRGYGGNPG+PSE+
Sbjct 61 SWYKP--ASKHRPTILYLHGNAGHIGYRMPLVREFIDAGLGVFLLEYRGYGGNPGKPSEK 118
Query 127 GLAADARAAQEWLSGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEV 186
GL AD A E+L Q V R+ +GES+G VA LA + P A++L+SPFTSL +
Sbjct 119 GLYADGETAIEFLI-QHGVPSKRVILYGESIGTGVATHLATKYPVCAVMLQSPFTSLTRL 177
Query 187 GAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRY 246
HYP L+ D Y S+ R+ ++AP+LV+ G D IVP + A EPK+
Sbjct 178 AQYHYPLNFLKP--WDQYNSLARMKKINAPILVLHGKLDQIVPYQEGLNVFNEANEPKKM 235
Query 247 VVVPGVGHND 256
+ HND
Sbjct 236 ISFDDKEHND 245
>gi|344224157|gb|EGV50565.1| hypothetical protein Rifp1Sym_cv00070 [endosymbiont of Riftia
pachyptila (vent Ph05)]
Length=287
Score = 159 bits (403), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 87/257 (34%), Positives = 140/257 (55%), Gaps = 7/257 (2%)
Query 26 IWSQQRRLIYFPSA---GPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGP--- 79
++ Q +I++P+ G V + S+ DV + T DG R+ GW+ P++
Sbjct 23 VYFMQPGMIFYPNIPGRGLVTTPKSIGLDYEDVELITDDGTRIHGWFIPNSKASDTQKQA 82
Query 80 AVLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWL 139
+L +GNAG+ S R + + LGL +L+ DYRGYG + G+P+E G DA AA +L
Sbjct 83 TLLFLHGNAGNISHRLDSIKLFNNLGLDILIIDYRGYGQSTGKPTEAGTYQDAEAAWHYL 142
Query 140 SGQSDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRL 199
+ + +I FG SLG +++ LA Q PAAL++ S F+S +G YP+LP+R L
Sbjct 143 TATRGIKENKIILFGRSLGGSISAWLASQHTPAALIVESSFSSAHSMGQRIYPFLPVRLL 202
Query 200 LLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPEL 259
Y + E + ++H PVLV DDI+P + +A EP+ ++ + G GHND +
Sbjct 203 SRFQYNTKEYVKAIHCPVLVAHSRDDDIIPYEEGRDIFNSAHEPRYFLKMRG-GHNDGFI 261
Query 260 LDGRVMLDAIRRFLTET 276
+ G +DA+ F+ +
Sbjct 262 ISGSSYVDALESFINTS 278
>gi|345123263|gb|EGW53165.1| hypothetical protein TevJSym_bk00200 [endosymbiont of Tevnia
jerichonana (vent Tica)]
Length=258
Score = 159 bits (402), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 86/250 (35%), Positives = 137/250 (55%), Gaps = 7/250 (2%)
Query 33 LIYFPSA---GPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGP---AVLVCNG 86
+I++P+ G V + S+ DV + T DG R+ GW+ P++ +L +G
Sbjct 1 MIFYPNIPGRGLVTTPKSIGLDYEDVELITDDGTRIHGWFIPNSKASDTQKQATLLFLHG 60
Query 87 NAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDVD 146
NAG+ S R + + LGL +L+ DYRGYG + G+P+E G DA AA +L+ +
Sbjct 61 NAGNISHRLDSIKLFNNLGLDILIIDYRGYGQSTGKPTEAGTYQDAEAAWHYLTATRGIK 120
Query 147 PARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYPS 206
+I FG SLG +++ LA Q PAAL++ S F+S +G YP+LP+R L Y +
Sbjct 121 ENKIILFGRSLGGSISAWLASQHTPAALIVESSFSSAHSMGQRIYPFLPVRLLSRFQYNT 180
Query 207 IERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDGRVML 266
E + ++H PVLV DDI+P + +A EP+ ++ + G GHND ++ G +
Sbjct 181 KEYVKAIHCPVLVAHSRDDDIIPYEEGRDIFNSAHEPRYFLKMRG-GHNDGFIISGSSYV 239
Query 267 DAIRRFLTET 276
DA+ F+ +
Sbjct 240 DALESFINTS 249
>gi|116748362|ref|YP_845049.1| hypothetical protein Sfum_0918 [Syntrophobacter fumaroxidans
MPOB]
gi|116697426|gb|ABK16614.1| conserved hypothetical protein [Syntrophobacter fumaroxidans
MPOB]
Length=271
Score = 159 bits (401), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 94/253 (38%), Positives = 137/253 (55%), Gaps = 5/253 (1%)
Query 23 IMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPAVL 82
+MF++ Q L+YFP S V V T+D + + W+ P + S VL
Sbjct 17 LMFVF--QSHLVYFPDKEMTCSPHDVNLPYEAVFFHTRDRIEIAAWFVP--AEQSRGVVL 72
Query 83 VCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQ 142
+C+GN G+ S R L L+ L LS L+FDYRGYG + G+P+E+G DA AA +L
Sbjct 73 ICHGNGGNISHRMPLIRILNDLSLSCLIFDYRGYGNSAGKPTEEGTYRDAEAAWHYLVDT 132
Query 143 SDVDPARIAYFGESLGAAVAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLD 202
+D I G+SLG AVA LA + PAAL+++S FTSL E+G YP+LP+R L
Sbjct 133 RGIDARNIVILGKSLGGAVAARLAREHTPAALIVQSTFTSLTELGQTVYPFLPVRLLSRF 192
Query 203 HYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAAAEPKRYVVVPGVGHNDPELLDG 262
+Y + E + V+ PVL++ D+IVP + L A +PK +V + G HN ++
Sbjct 193 NYGTAEYLRGVNCPVLIMHSRQDEIVPYSHGCELFRVAGQPKEFVEMEG-DHNSGFIVSE 251
Query 263 RVMLDAIRRFLTE 275
+ I FL +
Sbjct 252 SRFREGISGFLRQ 264
Lambda K H
0.321 0.138 0.415
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 445900241072
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40