BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0690c
Length=349
Score E
Sequences producing significant alignments: (Bits) Value
gi|15607830|ref|NP_215204.1| hypothetical protein Rv0690c [Mycob... 699 0.0
gi|340625709|ref|YP_004744161.1| hypothetical protein MCAN_06911... 697 0.0
gi|308373547|ref|ZP_07432778.2| hypothetical protein TMEG_02062 ... 667 0.0
gi|308369983|ref|ZP_07419804.2| hypothetical protein TMBG_03395 ... 652 0.0
gi|240167691|ref|ZP_04746350.1| hypothetical protein MkanA1_0014... 482 5e-134
gi|183981039|ref|YP_001849330.1| hypothetical protein MMAR_1018 ... 474 1e-131
gi|118616554|ref|YP_904886.1| hypothetical protein MUL_0770 [Myc... 466 3e-129
gi|342861779|ref|ZP_08718424.1| hypothetical protein MCOL_22935 ... 454 9e-126
gi|296168544|ref|ZP_06850348.1| conserved hypothetical protein [... 428 6e-118
gi|118466436|ref|YP_883617.1| hypothetical protein MAV_4482 [Myc... 427 2e-117
gi|41410248|ref|NP_963084.1| hypothetical protein MAP4150c [Myco... 426 2e-117
gi|336460679|gb|EGO39570.1| hypothetical protein MAPs_38810 [Myc... 390 2e-106
gi|254822798|ref|ZP_05227799.1| hypothetical protein MintA_22904... 283 2e-74
gi|284032663|ref|YP_003382594.1| hypothetical protein Kfla_4778 ... 276 4e-72
gi|271967512|ref|YP_003341708.1| hypothetical protein Sros_6238 ... 258 1e-66
gi|311896241|dbj|BAJ28649.1| hypothetical protein KSE_28380 [Kit... 212 8e-53
gi|222149763|ref|YP_002550720.1| hypothetical protein Avi_3766 [... 201 2e-49
gi|163794829|ref|ZP_02188799.1| hypothetical protein BAL199_2775... 196 4e-48
gi|150397944|ref|YP_001328411.1| hypothetical protein Smed_2746 ... 193 3e-47
gi|256374961|ref|YP_003098621.1| hypothetical protein Amir_0814 ... 185 1e-44
gi|84683370|ref|ZP_01011273.1| hypothetical protein 109945700026... 180 3e-43
gi|84494637|ref|ZP_00993756.1| hypothetical protein JNB_07564 [J... 178 1e-42
gi|260428030|ref|ZP_05782009.1| conserved hypothetical protein [... 175 9e-42
gi|85373348|ref|YP_457410.1| hypothetical protein ELI_02605 [Ery... 173 3e-41
gi|15966603|ref|NP_386956.1| hypothetical protein SMc03928 [Sino... 172 7e-41
gi|296537372|ref|ZP_06899229.1| conserved hypothetical protein [... 172 1e-40
gi|339502101|ref|YP_004689521.1| hypothetical protein RLO149_c00... 166 4e-39
gi|83943886|ref|ZP_00956343.1| hypothetical protein EE36_09585 [... 164 2e-38
gi|332716792|ref|YP_004444258.1| hypothetical protein AGROH133_1... 163 3e-38
gi|335036021|ref|ZP_08529351.1| hypothetical protein AGRO_3353 [... 162 9e-38
gi|15890901|ref|NP_356573.1| hypothetical protein Atu4071 [Agrob... 162 9e-38
gi|110681039|ref|YP_684046.1| hypothetical protein RD1_3901 [Ros... 160 2e-37
gi|338821649|gb|EGP55618.1| hypothetical protein Agau_L100936 [A... 160 3e-37
gi|149185973|ref|ZP_01864288.1| hypothetical protein ED21_24606 ... 159 6e-37
gi|336118795|ref|YP_004573567.1| hypothetical protein MLP_31500 ... 157 2e-36
gi|114797449|ref|YP_761743.1| hypothetical protein HNE_3067 [Hyp... 157 3e-36
gi|304393390|ref|ZP_07375318.1| conserved hypothetical protein [... 157 3e-36
gi|154251314|ref|YP_001412138.1| hypothetical protein Plav_0858 ... 156 4e-36
gi|254487481|ref|ZP_05100686.1| conserved hypothetical protein [... 155 9e-36
gi|83953526|ref|ZP_00962248.1| hypothetical protein NAS141_14496... 155 1e-35
gi|85707947|ref|ZP_01039013.1| hypothetical protein NAP1_01890 [... 154 1e-35
gi|326386553|ref|ZP_08208175.1| hypothetical protein Y88_2447 [N... 154 2e-35
gi|296284340|ref|ZP_06862338.1| hypothetical protein CbatJ_11976... 151 1e-34
gi|220927192|ref|YP_002502494.1| hypothetical protein Mnod_7454 ... 149 9e-34
gi|332186538|ref|ZP_08388282.1| hypothetical protein SUS17_1623 ... 147 2e-33
gi|126735030|ref|ZP_01750776.1| hypothetical protein RCCS2_14174... 147 3e-33
gi|89069013|ref|ZP_01156394.1| hypothetical protein OG2516_17041... 146 6e-33
gi|89056082|ref|YP_511533.1| hypothetical protein Jann_3591 [Jan... 144 2e-32
gi|77404787|ref|YP_345359.1| hypothetical protein RSP_4153 [Rhod... 144 2e-32
gi|84515239|ref|ZP_01002601.1| hypothetical protein SKA53_01236 ... 142 6e-32
>gi|15607830|ref|NP_215204.1| hypothetical protein Rv0690c [Mycobacterium tuberculosis H37Rv]
gi|15840094|ref|NP_335131.1| hypothetical protein MT0718 [Mycobacterium tuberculosis CDC1551]
gi|31791874|ref|NP_854367.1| hypothetical protein Mb0709c [Mycobacterium bovis AF2122/97]
62 more sequence titles
Length=349
Score = 699 bits (1805), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/349 (99%), Positives = 349/349 (100%), Gaps = 0/349 (0%)
Query 1 VTGTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVP 60
+TGTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVP
Sbjct 1 MTGTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVP 60
Query 61 LRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQT 120
LRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQT
Sbjct 61 LRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQT 120
Query 121 NEVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRI 180
NEVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRI
Sbjct 121 NEVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRI 180
Query 181 DNAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVA 240
DNAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVA
Sbjct 181 DNAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVA 240
Query 241 RNIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAH 300
RNIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAH
Sbjct 241 RNIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAH 300
Query 301 CPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
CPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ
Sbjct 301 CPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
>gi|340625709|ref|YP_004744161.1| hypothetical protein MCAN_06911 [Mycobacterium canettii CIPT
140010059]
gi|340003899|emb|CCC43031.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=349
Score = 697 bits (1800), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/349 (99%), Positives = 348/349 (99%), Gaps = 0/349 (0%)
Query 1 VTGTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVP 60
+TGTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVP
Sbjct 1 MTGTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVP 60
Query 61 LRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQT 120
LRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQT
Sbjct 61 LRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQT 120
Query 121 NEVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRI 180
NEVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYL GEWGLADSPVRI
Sbjct 121 NEVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLSGEWGLADSPVRI 180
Query 181 DNAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVA 240
DNAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVA
Sbjct 181 DNAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVA 240
Query 241 RNIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAH 300
RNIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAH
Sbjct 241 RNIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAH 300
Query 301 CPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
CPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ
Sbjct 301 CPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
>gi|308373547|ref|ZP_07432778.2| hypothetical protein TMEG_02062 [Mycobacterium tuberculosis SUMu005]
gi|308375227|ref|ZP_07443181.2| hypothetical protein TMGG_03709 [Mycobacterium tuberculosis SUMu007]
gi|308376473|ref|ZP_07438971.2| hypothetical protein TMHG_03717 [Mycobacterium tuberculosis SUMu008]
gi|308378698|ref|ZP_07483565.2| hypothetical protein TMJG_02436 [Mycobacterium tuberculosis SUMu010]
gi|308337239|gb|EFP26090.1| hypothetical protein TMEG_02062 [Mycobacterium tuberculosis SUMu005]
gi|308346989|gb|EFP35840.1| hypothetical protein TMGG_03709 [Mycobacterium tuberculosis SUMu007]
gi|308350969|gb|EFP39820.1| hypothetical protein TMHG_03717 [Mycobacterium tuberculosis SUMu008]
gi|308359524|gb|EFP48375.1| hypothetical protein TMJG_02436 [Mycobacterium tuberculosis SUMu010]
Length=333
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 332/333 (99%), Positives = 333/333 (100%), Gaps = 0/333 (0%)
Query 17 VCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLDGRA 76
+CTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLDGRA
Sbjct 1 MCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLDGRA 60
Query 77 PVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGGLLIA 136
PVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGGLLIA
Sbjct 61 PVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGGLLIA 120
Query 137 CLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRI 196
CLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRI
Sbjct 121 CLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRI 180
Query 197 VERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQAAHAAV 256
VERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQAAHAAV
Sbjct 181 VERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQAAHAAV 240
Query 257 AGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLEPAHQRPG 316
AGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLEPAHQRPG
Sbjct 241 AGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLEPAHQRPG 300
Query 317 AQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
AQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ
Sbjct 301 AQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 333
>gi|308369983|ref|ZP_07419804.2| hypothetical protein TMBG_03395 [Mycobacterium tuberculosis SUMu002]
gi|308370476|ref|ZP_07421662.2| hypothetical protein TMCG_03501 [Mycobacterium tuberculosis SUMu003]
gi|308371736|ref|ZP_07426032.2| hypothetical protein TMDG_02414 [Mycobacterium tuberculosis SUMu004]
gi|308325765|gb|EFP14616.1| hypothetical protein TMBG_03395 [Mycobacterium tuberculosis SUMu002]
gi|308331836|gb|EFP20687.1| hypothetical protein TMCG_03501 [Mycobacterium tuberculosis SUMu003]
gi|308335622|gb|EFP24473.1| hypothetical protein TMDG_02414 [Mycobacterium tuberculosis SUMu004]
gi|339293720|gb|AEJ45831.1| hypothetical protein CCDC5079_0641 [Mycobacterium tuberculosis
CCDC5079]
gi|339297359|gb|AEJ49469.1| hypothetical protein CCDC5180_0632 [Mycobacterium tuberculosis
CCDC5180]
Length=325
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/325 (100%), Positives = 325/325 (100%), Gaps = 0/325 (0%)
Query 25 MYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLDGRAPVLRRWYP 84
MYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLDGRAPVLRRWYP
Sbjct 1 MYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLDGRAPVLRRWYP 60
Query 85 STGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGGLLIACLQFDLPI 144
STGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGGLLIACLQFDLPI
Sbjct 61 STGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGGLLIACLQFDLPI 120
Query 145 RLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRIVERHGYDI 204
RLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRIVERHGYDI
Sbjct 121 RLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRIVERHGYDI 180
Query 205 APIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQAAHAAVAGMTLTDD 264
APIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQAAHAAVAGMTLTDD
Sbjct 181 APIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQAAHAAVAGMTLTDD 240
Query 265 ALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLEPAHQRPGAQIKYLVR 324
ALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLEPAHQRPGAQIKYLVR
Sbjct 241 ALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLEPAHQRPGAQIKYLVR 300
Query 325 MRSWPGGHARVLGECHPHGPPVTWQ 349
MRSWPGGHARVLGECHPHGPPVTWQ
Sbjct 301 MRSWPGGHARVLGECHPHGPPVTWQ 325
>gi|240167691|ref|ZP_04746350.1| hypothetical protein MkanA1_00145 [Mycobacterium kansasii ATCC
12478]
Length=352
Score = 482 bits (1240), Expect = 5e-134, Method: Compositional matrix adjust.
Identities = 250/344 (73%), Positives = 278/344 (81%), Gaps = 0/344 (0%)
Query 6 HLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLG 65
HL+HTLRSQGR C SGSPMY ELL+LVAADVE+GG+F SIL+ + P AVPLRLLG
Sbjct 9 HLLHTLRSQGRFCARSGSPMYGELLDLVAADVEAGGLFGSILSGHEDDPSRHAVPLRLLG 68
Query 66 GLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGR 125
GLHR+VLDGRAP LRRWYPSTGG+W A AAWPDI+R A D ++LRAALD+PPQTNEVGR
Sbjct 69 GLHRLVLDGRAPTLRRWYPSTGGSWDAAAAWPDIIRVAADHADALRAALDQPPQTNEVGR 128
Query 126 SAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWL 185
SAALIGGLL +F LPIRLFEIG+SAGLNLR DRYRYRY GG WG A++PV ID+AW
Sbjct 129 SAALIGGLLQVNHEFGLPIRLFEIGASAGLNLRADRYRYRYDGGHWGPAEAPVTIDDAWH 188
Query 186 GELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPA 245
G LPP VRIVERHGYDIAPIDVT DGEL LSY+WPDQ R++RLRGAIAVAR +PA
Sbjct 189 GRLPPAGGVRIVERHGYDIAPIDVTGADGELTVLSYVWPDQHARMKRLRGAIAVARTVPA 248
Query 246 DLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVH 305
LHRQ A AVAG+TL D LTVLWHSITWQYL ADERAAIRA ++ L AQA PF H
Sbjct 249 QLHRQTAAEAVAGLTLADGTLTVLWHSITWQYLSADERAAIRAAVEHLGAQAGPRAPFAH 308
Query 306 LTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
LTLEPA PG+ +K+LVR WPGG RVLGECHPHGPPVTW+
Sbjct 309 LTLEPARDGPGSPLKFLVRAAGWPGGRTRVLGECHPHGPPVTWR 352
>gi|183981039|ref|YP_001849330.1| hypothetical protein MMAR_1018 [Mycobacterium marinum M]
gi|183174365|gb|ACC39475.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=353
Score = 474 bits (1219), Expect = 1e-131, Method: Compositional matrix adjust.
Identities = 235/347 (68%), Positives = 269/347 (78%), Gaps = 1/347 (0%)
Query 3 GTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLR 62
G EHL+HTLRSQGR C SGSPMY EL ELVAADVE+GGVFA ILA + P A PLR
Sbjct 8 GIEHLLHTLRSQGRFCARSGSPMYGELFELVAADVEAGGVFAPILAGHEDDPSRYATPLR 67
Query 63 LLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNE 122
LLGGLHRMVLDGRAP LRRWYPST G+W A++AWP+I A + E+LR ALD+PPQTNE
Sbjct 68 LLGGLHRMVLDGRAPTLRRWYPSTDGSWDAKSAWPEIELVAANHTEALRGALDQPPQTNE 127
Query 123 VGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDN 182
VGRSAALIGGLL +F+ P+RLFEIG+SAGLNLR DRY YRY G WG DSPV I++
Sbjct 128 VGRSAALIGGLLHIRHEFNFPVRLFEIGASAGLNLRADRYHYRYAGMTWGPIDSPVIIED 187
Query 183 AWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARN 242
AW GELPP ++IVERHGYDIAPID+ DGE+ LSY+WPDQ R++RLRGAIAVAR+
Sbjct 188 AWRGELPPALALQIVERHGYDIAPIDICGTDGEMTVLSYVWPDQHARMKRLRGAIAVARD 247
Query 243 IPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCP 302
+PA L R+ A VAG+TL D+ LTVLWHSITWQYL A ERAAIR + L AQA P
Sbjct 248 VPAQLERKTAADGVAGLTLQDETLTVLWHSITWQYLAAQERAAIRDRVAELGAQAGPRSP 307
Query 303 FVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
F HLTLEPA G ++K+LVR+ SWP G ARVLG+CHPHGPPV WQ
Sbjct 308 FAHLTLEPARDE-GGRLKFLVRLASWPSGEARVLGQCHPHGPPVNWQ 353
>gi|118616554|ref|YP_904886.1| hypothetical protein MUL_0770 [Mycobacterium ulcerans Agy99]
gi|118568664|gb|ABL03415.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=353
Score = 466 bits (1199), Expect = 3e-129, Method: Compositional matrix adjust.
Identities = 232/347 (67%), Positives = 266/347 (77%), Gaps = 1/347 (0%)
Query 3 GTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLR 62
G EHL+HTLRSQ R C SGSPMY EL ELVAADVE+GGVFA ILA + P A PL+
Sbjct 8 GIEHLLHTLRSQDRFCARSGSPMYGELFELVAADVEAGGVFAPILAGHEDDPSRYATPLQ 67
Query 63 LLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNE 122
LLGGLHRMVLDGRAP LRRWYPST G+W A++AWP I A + E+LR LD+PPQTNE
Sbjct 68 LLGGLHRMVLDGRAPTLRRWYPSTDGSWDAKSAWPGIELVAANHTEALRGVLDQPPQTNE 127
Query 123 VGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDN 182
VGRSAALIG LL +F+ P+RLFEIG+SAGLNLR DRY YRY G WG DSPV I++
Sbjct 128 VGRSAALIGSLLHIRHEFNCPVRLFEIGASAGLNLRADRYHYRYAGMTWGPIDSPVIIED 187
Query 183 AWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARN 242
AW GELPP ++IVERHGYDIAPID+ DGE+ LSY+WPDQ R++RLRGAIAVAR+
Sbjct 188 AWRGELPPALALQIVERHGYDIAPIDICGTDGEMTVLSYVWPDQHARMKRLRGAIAVARD 247
Query 243 IPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCP 302
+PA L R+ A VAG+TL D+ LTVLWHSITWQYLPA ERAAIR + L AQA P
Sbjct 248 VPAQLERKTAADGVAGLTLQDETLTVLWHSITWQYLPAQERAAIRDRVAELGAQAGPRSP 307
Query 303 FVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
F HLTLEPA G ++K+LVR+ SWP G ARVLG+CHPHGPPV WQ
Sbjct 308 FAHLTLEPARDE-GGRLKFLVRLASWPSGEARVLGQCHPHGPPVNWQ 353
>gi|342861779|ref|ZP_08718424.1| hypothetical protein MCOL_22935 [Mycobacterium colombiense CECT
3035]
gi|342130596|gb|EGT83900.1| hypothetical protein MCOL_22935 [Mycobacterium colombiense CECT
3035]
Length=352
Score = 454 bits (1168), Expect = 9e-126, Method: Compositional matrix adjust.
Identities = 242/344 (71%), Positives = 271/344 (79%), Gaps = 0/344 (0%)
Query 5 EHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLL 64
EHLVHTLRSQGR C SSGSPMY EL ELVA DVE+GGVFASIL+ ++ AP AVPLRLL
Sbjct 4 EHLVHTLRSQGRFCASSGSPMYGELFELVARDVEAGGVFASILSGREDAPSRDAVPLRLL 63
Query 65 GGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVG 124
GGLHR+VLDGRA LRR+YPSTGG W A +AWP+I+ TA ++LRAAL +PPQTNEVG
Sbjct 64 GGLHRLVLDGRAARLRRFYPSTGGGWDARSAWPEILDTAAGHADALRAALGQPPQTNEVG 123
Query 125 RSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAW 184
RSAALIGGLL+ +F LPIRLFEIGSSAGLNLR D YRY + GG WG ADSPV ID+AW
Sbjct 124 RSAALIGGLLLVNREFGLPIRLFEIGSSAGLNLRADHYRYGFAGGGWGPADSPVLIDDAW 183
Query 185 LGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIP 244
G LPP VRIV RHGYDIAPIDV DGEL LSY+WPDQ RL RLRGAI VAR +P
Sbjct 184 RGALPPPGDVRIVARHGYDIAPIDVGRADGELAVLSYVWPDQAARLARLRGAIEVARRVP 243
Query 245 ADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFV 304
A L R+ A AVAG+TL D ALTVLWHSITWQYL DERAA+RA +DA+AA+A PF
Sbjct 244 AALERRTAGDAVAGLTLADGALTVLWHSITWQYLSVDERAAVRAHVDAVAARAGTGSPFA 303
Query 305 HLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
HLT+EPA PGA I+++VR R WP G A+ LGECHPHGPPV W
Sbjct 304 HLTMEPARSGPGAPIRFVVRARVWPDGGAQTLGECHPHGPPVDW 347
>gi|296168544|ref|ZP_06850348.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295896607|gb|EFG76246.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=348
Score = 428 bits (1101), Expect = 6e-118, Method: Compositional matrix adjust.
Identities = 236/345 (69%), Positives = 262/345 (76%), Gaps = 0/345 (0%)
Query 4 TEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRL 63
EHL+HTLR+QG+ C SGSPMY EL ELVA DV +GGVFA+ILA + P AVPLRL
Sbjct 3 VEHLLHTLRAQGQFCARSGSPMYGELFELVATDVAAGGVFATILAGHEDDPSRLAVPLRL 62
Query 64 LGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEV 123
LGGLHR+VLDGRAP LRRWYPSTGG+W A AWP+I A E+LRAAL +PPQTNEV
Sbjct 63 LGGLHRLVLDGRAPQLRRWYPSTGGSWDAGPAWPEIEGVAAAHAEALRAALRQPPQTNEV 122
Query 124 GRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNA 183
GRSAALIG LL + LPIRLFEIGSSAGLNLR D Y YR+ GGEWG DSPV ID+A
Sbjct 123 GRSAALIGALLRVNHESRLPIRLFEIGSSAGLNLRADHYHYRFAGGEWGPGDSPVIIDDA 182
Query 184 WLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNI 243
W G LPP VRIVERHG DIAPIDVT DGEL LSY+WPDQT RLERLRGAI VAR +
Sbjct 183 WRGALPPGGEVRIVERHGCDIAPIDVTGGDGELTVLSYVWPDQTARLERLRGAIEVARRV 242
Query 244 PADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPF 303
PA L R+ A AVAG+TL DALTVLWHSITWQYLP +ER A+R+ + AL AQA PF
Sbjct 243 PARLQRETAAGAVAGLTLAADALTVLWHSITWQYLPDEERDAVRSRVRALGAQAGQRSPF 302
Query 304 VHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
VHLTLEP PG I++LVR R WPGG +L +CHPHGPPV W
Sbjct 303 VHLTLEPFRDGPGGPIRFLVRARRWPGGELEILADCHPHGPPVRW 347
>gi|118466436|ref|YP_883617.1| hypothetical protein MAV_4482 [Mycobacterium avium 104]
gi|254776918|ref|ZP_05218434.1| hypothetical protein MaviaA2_19936 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|118167723|gb|ABK68620.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=347
Score = 427 bits (1097), Expect = 2e-117, Method: Compositional matrix adjust.
Identities = 236/345 (69%), Positives = 267/345 (78%), Gaps = 1/345 (0%)
Query 4 TEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRL 63
EHLVH LR+QG C SSGSPMY +L ELVA+DVE+GGVFA IL+ + AP A+PLRL
Sbjct 3 AEHLVHMLRAQGSFCASSGSPMYGDLFELVASDVEAGGVFADILSGHRDAPSRDAIPLRL 62
Query 64 LGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEV 123
LGGLHR+VLDGRA LRRWYPSTGG+W A AAWP I+ A + +LRAALDRPPQTNEV
Sbjct 63 LGGLHRLVLDGRAGSLRRWYPSTGGSWDAGAAWPPILAAAAEHAAALRAALDRPPQTNEV 122
Query 124 GRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNA 183
GRSAALIGGLL + LP+RLFEIGSSAGLNLR D YRYRY GG WG ADSPV ID+A
Sbjct 123 GRSAALIGGLL-HINESCLPVRLFEIGSSAGLNLRADHYRYRYAGGGWGPADSPVCIDDA 181
Query 184 WLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNI 243
W G LPP VRIVERHG+DIAP+DV + DGEL LSY+WPDQ RL RLRGAI VAR +
Sbjct 182 WRGALPPARGVRIVERHGFDIAPVDVGNSDGELTVLSYVWPDQAARLARLRGAIEVARRV 241
Query 244 PADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPF 303
PA L R+ A AV ++L + ALTVLWHSITWQYL A ERAA+ AG+DAL A+ADA P
Sbjct 242 PATLERRTAADAVGRLSLAEGALTVLWHSITWQYLSAGERAAVCAGVDALGARADASAPL 301
Query 304 VHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
VHLT+EPA PGA I++LVR R WP G RVL +CHPHGPPV W
Sbjct 302 VHLTMEPARDGPGAPIRFLVRARGWPDGGPRVLAQCHPHGPPVDW 346
>gi|41410248|ref|NP_963084.1| hypothetical protein MAP4150c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41399082|gb|AAS06700.1| hypothetical protein MAP_4150c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=347
Score = 426 bits (1096), Expect = 2e-117, Method: Compositional matrix adjust.
Identities = 236/345 (69%), Positives = 267/345 (78%), Gaps = 1/345 (0%)
Query 4 TEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRL 63
EHLVH LR+QG C SSGSPMY +L ELVA+DVE+GGVFA IL+ + AP A+PLRL
Sbjct 3 VEHLVHMLRAQGSFCASSGSPMYGDLFELVASDVEAGGVFADILSGHRDAPSRDAIPLRL 62
Query 64 LGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEV 123
LGGLHR+VLDGRA LRRWYPSTGG+W A AAWP I+ A + +LRAALDRPPQTNEV
Sbjct 63 LGGLHRLVLDGRAGSLRRWYPSTGGSWDAGAAWPPILAAAAEHAAALRAALDRPPQTNEV 122
Query 124 GRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNA 183
GRSAALIGGLL + LP+RLFEIGSSAGLNLR D YRYRY GG WG ADSPV ID+A
Sbjct 123 GRSAALIGGLL-HINESCLPVRLFEIGSSAGLNLRADHYRYRYAGGGWGPADSPVCIDDA 181
Query 184 WLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNI 243
W G LPP VRIVERHG+DIAP+DV +PDGEL LSY+WPDQ RL RLRGAI VAR +
Sbjct 182 WRGALPPARGVRIVERHGFDIAPVDVGNPDGELTVLSYVWPDQAARLARLRGAIEVARRV 241
Query 244 PADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPF 303
PA L R+ A AV ++L + ALTVLWHSITWQYL A ERAA+ AG+DAL A+A A P
Sbjct 242 PATLERRTAADAVGRLSLAEGALTVLWHSITWQYLSAGERAAVCAGVDALGARAGASAPL 301
Query 304 VHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
VHLT+EPA PGA I++LVR R WP G RVL +CHPHGPPV W
Sbjct 302 VHLTMEPARDGPGAPIRFLVRARGWPDGGPRVLAQCHPHGPPVDW 346
>gi|336460679|gb|EGO39570.1| hypothetical protein MAPs_38810 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=328
Score = 390 bits (1002), Expect = 2e-106, Method: Compositional matrix adjust.
Identities = 219/345 (64%), Positives = 250/345 (73%), Gaps = 20/345 (5%)
Query 4 TEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRL 63
EHLVH LR+QG C SSGSPMY +L ELVA+DVE+GGVFA IL+ + AP A+PLRL
Sbjct 3 VEHLVHMLRAQGSFCASSGSPMYGDLFELVASDVEAGGVFADILSGHRDAPSRDAIPLRL 62
Query 64 LGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEV 123
LGGLHR+VLDGRA LRRWYPSTGG+W A AAWP I+ A +
Sbjct 63 LGGLHRLVLDGRAGSLRRWYPSTGGSWDAGAAWPPILAAAAEH----------------- 105
Query 124 GRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNA 183
+AALIGGLL + LP+RLFEIGSSAGLNLR D YRYRY GG WG ADSPV ID+A
Sbjct 106 --AAALIGGLL-HINESCLPVRLFEIGSSAGLNLRADHYRYRYAGGGWGPADSPVCIDDA 162
Query 184 WLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNI 243
W G LPP VRIVERHG+DIAP+DV +PDGEL LSY+WPDQ RL RLRGAI VAR +
Sbjct 163 WRGALPPARGVRIVERHGFDIAPVDVGNPDGELTVLSYVWPDQAARLARLRGAIEVARRV 222
Query 244 PADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPF 303
PA L R+ A AV ++L + ALTVLWHSITWQYL A ERAA+ AG+DAL A+A A P
Sbjct 223 PATLERRTAADAVGRLSLAEGALTVLWHSITWQYLSAGERAAVCAGVDALGARAGASAPL 282
Query 304 VHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
VHLT+EPA PGA I++LVR R WP G RVL +CHPHGPPV W
Sbjct 283 VHLTMEPARDGPGAPIRFLVRARGWPDGGPRVLAQCHPHGPPVDW 327
>gi|254822798|ref|ZP_05227799.1| hypothetical protein MintA_22904 [Mycobacterium intracellulare
ATCC 13950]
Length=229
Score = 283 bits (725), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 159/227 (71%), Positives = 173/227 (77%), Gaps = 1/227 (0%)
Query 4 TEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRL 63
EHL HTLR+QGR C SSGS MY EL ELVAADVE+GGVFA+IL+ + AP AVPLRL
Sbjct 3 VEHLAHTLRAQGRFCASSGSAMYGELFELVAADVEAGGVFAAILSRHRHAPSRDAVPLRL 62
Query 64 LGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEV 123
LGGLHR+VLDGRA LRRWYPSTGG+W A AWP I A ++LRAALD+PPQTNEV
Sbjct 63 LGGLHRLVLDGRAAHLRRWYPSTGGSWNAGPAWPQIRDAAAGHADALRAALDQPPQTNEV 122
Query 124 GRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNA 183
GRSAALIGGLL Q LPIRLFEIGSSAGLNLR D Y YR+ G +WG DSPV ID+A
Sbjct 123 GRSAALIGGLL-HLKQSGLPIRLFEIGSSAGLNLRADHYLYRFAGSQWGPPDSPVAIDDA 181
Query 184 WLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRL 230
W G LPP VRI ER GYDIAPIDV DGEL LSY+WPDQ RL
Sbjct 182 WRGALPPGRDVRIAERCGYDIAPIDVGDTDGELTVLSYVWPDQAARL 228
>gi|284032663|ref|YP_003382594.1| hypothetical protein Kfla_4778 [Kribbella flavida DSM 17836]
gi|283811956|gb|ADB33795.1| conserved hypothetical protein [Kribbella flavida DSM 17836]
Length=347
Score = 276 bits (706), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 154/345 (45%), Positives = 203/345 (59%), Gaps = 2/345 (0%)
Query 7 LVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGG 66
+V + Q C GSP+Y +LL + D E GGV +LA + P A+ LRLLG
Sbjct 3 VVEAFKLQAAACEELGSPLYADLLRRLVDDYELGGVSTEVLAGHEQDPGPSALALRLLGS 62
Query 67 LHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRS 126
+HR+VL P L +YPS GG W W + + LR+ L +PPQTNEVGRS
Sbjct 63 VHRLVLAREVPELGVFYPSVGGEWDPVLGWEAFEQVLQARGPELRSLLSQPPQTNEVGRS 122
Query 127 AALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGG-EWGLADSPVRIDNAWL 185
AL GGLL LP+RLFEIGSS GLNLR D +RY G +G ADSPV +AW
Sbjct 123 TALYGGLLRLAEVVPLPVRLFEIGSSGGLNLRADHFRYDLADGTSFGAADSPVVFADAWS 182
Query 186 GE-LPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIP 244
G + P +RI ER G DI P++ S DG L +SY+WPD T+RL RLRGA+AVAR++P
Sbjct 183 GRPIQPAPALRIAERVGSDINPVNPLSEDGALTLMSYVWPDMTERLARLRGALAVARDVP 242
Query 245 ADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFV 304
AD+ R+ A + + + L + +TV+WHS+ WQYL ++AA A I L +A A P
Sbjct 243 ADVRREDALSFLRNLELAEGHVTVVWHSVMWQYLTQADQAAADAAIAELGERATATAPLA 302
Query 305 HLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
L LEP + P A ++L+ ++ WP G R+LG PHG P W+
Sbjct 303 RLCLEPMRRTPDAPYEFLIVLQVWPTGVPRILGHAAPHGVPAVWE 347
>gi|271967512|ref|YP_003341708.1| hypothetical protein Sros_6238 [Streptosporangium roseum DSM
43021]
gi|270510687|gb|ACZ88965.1| conserved hypothetical protein [Streptosporangium roseum DSM
43021]
Length=358
Score = 258 bits (658), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 159/352 (46%), Positives = 198/352 (57%), Gaps = 14/352 (3%)
Query 5 EHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLL 64
E L + Q R C GSP+Y LL VA DV +GG A LA + AP AV LRLL
Sbjct 4 ERLAVMVEHQARGCAELGSPLYAFLLGRVAQDVRAGGPCAEALAGYEDAPGPDAVALRLL 63
Query 65 GGLHRMVLDGRAPVLRRWYPSTGGTW---QAEAAWPDIVRTATDQPESLRAALDRPPQTN 121
GG+H + L GRAP L YPSTGG + + E W + E +R + RPPQTN
Sbjct 64 GGVHALALTGRAPDLAACYPSTGGAFDPERPEPCWHAFRAAVAGEMEWVRDWMTRPPQTN 123
Query 122 EVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRID 181
EVGR+ LI GLL A LP+RLFE+GSSAGLNLR DR+RY G WG ADSPV ++
Sbjct 124 EVGRANLLITGLLKATQAGPLPVRLFEVGSSAGLNLRADRFRYVSEGFAWGPADSPVLLE 183
Query 182 NAWLGELPP--------TATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERL 233
AW G P + IVER G D+ PID SPDG L +Y+WPDQT R+ RL
Sbjct 184 GAWAGAPPAWLAGATAGQPDLEIVERRGCDLTPIDPLSPDGALALRAYVWPDQTARVARL 243
Query 234 RGAIAVARNIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDAL 293
GA+ VA +PA++ A +AG+ L LTV+WHSI QY+PA E A + A +D L
Sbjct 244 DGALRVAARVPAEVEAAGAADFLAGVRLEPGTLTVVWHSIMRQYVPAAEWARVEAELDRL 303
Query 294 AAQADAHCPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPP 345
AA A F H++ EP + + VR+ + G V+ E PHG P
Sbjct 304 AAAATVEARFAHISFEPRRVGERHRFRLAVRLGTAAG---TVVAEARPHGLP 352
>gi|311896241|dbj|BAJ28649.1| hypothetical protein KSE_28380 [Kitasatospora setae KM-6054]
Length=363
Score = 212 bits (539), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 139/353 (40%), Positives = 184/353 (53%), Gaps = 14/353 (3%)
Query 5 EHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLL 64
+H Q C + GSP+ LL A D+ +GG A +A + AP A+ LRLL
Sbjct 4 DHAAAMFHHQADGCAALGSPLSAALLRRAAEDLLAGGPCAEAVAGHEDAPGPDAIALRLL 63
Query 65 GGLHRMVLDGRAPVLRRWYPSTGGTW---QAEAAWPDIVRTATDQPESLRAALDRPPQTN 121
G +H +VL G AP L YPS GG + + +A WP +R L RPPQTN
Sbjct 64 GAVHALVLSGLAPELAAHYPSVGGRFDPAEPDAPWPAFRAAVAAHLPFVRGWLTRPPQTN 123
Query 122 EVGRSAALIGGLLIACLQFD----LPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSP 177
EVGR+ L L A + LP+RL E+GSSAGLNL DR+R G +G ADSP
Sbjct 124 EVGRANLLFTALAWAQRELSAGTPLPVRLRELGSSAGLNLLADRFRCTSDGFSYGPADSP 183
Query 178 VRIDNAWLGELPP----TATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERL 233
V + +AW GE P R+ +R G D PID S DG L +Y+W DQ R++RL
Sbjct 184 VVLADAWRGEPPAWLRGAPLQRVTDRRGCDPTPIDPRSADGSLALRAYLWADQLPRVQRL 243
Query 234 RGAIAVARNIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDAL 293
GA+A+A PA + A A + G+ LTV+WHSI QY+PADE ++ A + L
Sbjct 244 NGALALAAETPAPVEATGAAAFLRGVETAGGTLTVVWHSIMRQYVPADEWRSVEAELTRL 303
Query 294 AAQADAHCPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPV 346
A + PF+H+ EP +R G ++L+ R G L E PHG P
Sbjct 304 ATASSPSAPFLHVAFEP--RRVGTGHRFLLTARLGAGPRT-TLAEAMPHGLPA 353
>gi|222149763|ref|YP_002550720.1| hypothetical protein Avi_3766 [Agrobacterium vitis S4]
gi|221736745|gb|ACM37708.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length=349
Score = 201 bits (510), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 133/349 (39%), Positives = 175/349 (51%), Gaps = 11/349 (3%)
Query 5 EHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKG--APEGQAVPLR 62
+ L H L Q R C GSP L L A + + L D G G +VPLR
Sbjct 4 DSLRHALTDQARSCDVLGSPFTARLCRLAAERLTPASAIGARLIDWPGDITSAGDSVPLR 63
Query 63 LLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNE 122
L G LH +VL +P L YP T +A W + T D ++A L+ PQTNE
Sbjct 64 LAGTLHALVLSNESPALAAVYPPHDAT--DDALWAAVETTFRDHEAFMQARLNSAPQTNE 121
Query 123 VGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDN 182
V RSAAL+ G L F LP+RL E+G+SAGLNL+ DRY YR WG S V +
Sbjct 122 VRRSAALLPGFLTIASLFGLPLRLSEVGASAGLNLQWDRYAYRLGETSWG-DGSQVLLAP 180
Query 183 AWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARN 242
W G PP+AT+ + ER G D+ P+D +P+ SYIW DQ DRLER + A+A+AR+
Sbjct 181 DWQGPPPPSATITVEERAGCDLNPLDPGTPEDCERLFSYIWADQADRLERTKAALALARS 240
Query 243 IPADLHRQAAHAAVAGMTLTDD--ALTVLWHSITWQYLPADERAAIRAGIDALAAQADAH 300
+ R A + + V++HS+ WQYLP +A A I +A A
Sbjct 241 NNLSVDRMDAIDWLKQRLAPSHPGQMHVVYHSVAWQYLPDTLKAQGEALITQAGQRATAQ 300
Query 301 CPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
PF L +E QR GA + ++ WPGG + +G HG V WQ
Sbjct 301 APFARLQMEADGQRDGASLN----LQIWPGGERQEIGRADFHGRWVKWQ 345
>gi|163794829|ref|ZP_02188799.1| hypothetical protein BAL199_27756 [alpha proteobacterium BAL199]
gi|159180102|gb|EDP64627.1| hypothetical protein BAL199_27756 [alpha proteobacterium BAL199]
Length=359
Score = 196 bits (499), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 126/340 (38%), Positives = 164/340 (49%), Gaps = 9/340 (2%)
Query 7 LVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGG 66
+V R Q C GSP + +L++ +E G F +A+ G P A+ LR G
Sbjct 7 IVDAFRQQADACRDLGSPFNAMVCDLLSDRLEPGSAFGQRIANWPGQPVADALALRACGS 66
Query 67 LHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRS 126
LH ++ GR P L YP T GT +A W I +Q L LD PPQTNEV RS
Sbjct 67 LHGLIRSGRCPALMAAYPPTPGT--PDAVWTAIRTAIAEQDGFLTRYLDSPPQTNEVARS 124
Query 127 AALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLG 186
+ ++GG L F LP+ ++EIGSSAGLNL D Y Y G WG S VRI W G
Sbjct 125 SMILGGCLTIAETFRLPLEIYEIGSSAGLNLGFDHYHYDLGGRSWGSPTSKVRIVTKWEG 184
Query 187 ELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPAD 246
+P + +V R G D P+D S LSYIWPDQ++RL R+ A+ VA + +
Sbjct 185 PVPLDVPLTVVRREGCDRNPLDPGSSADRDRLLSYIWPDQSNRLARIDAALQVAASANQN 244
Query 247 LHRQAAHAAVA---GMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPF 303
+ R A V T VL H+I WQYLPAD + I A + A P
Sbjct 245 VDRADAADWVEQRLARPCTPGRARVLMHTIVWQYLPADTQRRIEAAVYQAGEVASGDAPL 304
Query 304 VHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHG 343
L +EP G +R+ WP G + +LG HG
Sbjct 305 AWLRVEPD----GVPGSAGIRLSLWPSGKSLLLGRADYHG 340
>gi|150397944|ref|YP_001328411.1| hypothetical protein Smed_2746 [Sinorhizobium medicae WSM419]
gi|150029459|gb|ABR61576.1| conserved hypothetical protein [Sinorhizobium medicae WSM419]
Length=355
Score = 193 bits (491), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 134/344 (39%), Positives = 168/344 (49%), Gaps = 12/344 (3%)
Query 11 LRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAP--EGQAVPLRLLGGLH 68
R Q R C GSP L LVA + +G + + G P +G +VPLRL G LH
Sbjct 15 FRDQARSCDELGSPFTARLCRLVADRLATGSKVGTHVLGWHGDPTSKGDSVPLRLAGALH 74
Query 69 RMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAA 128
+VL GR L YP + EA W I R Q + L PQTNEV RSAA
Sbjct 75 ALVLSGRDEELEASYPPN--RYDDEALWQAITRAMEQQAGFILDRLISAPQTNEVRRSAA 132
Query 129 LIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGEL 188
L+ G L F P+ L E+G+SAGLNL DRYRY G WG S V I W G
Sbjct 133 LLPGFLTVAQLFGKPLLLSEVGASAGLNLHWDRYRYALAGNHWGNEASAVAIAPEWSGAR 192
Query 189 PPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLH 248
PP V I++R G D+ PID + + L LSY+W DQ DR++R R A+ +A L
Sbjct 193 PPLRNVEIIDRAGCDLNPIDPSDSEDRLRLLSYVWADQQDRIDRTRQALELA-AFHGSLV 251
Query 249 RQAAHAAVAGMTLT---DDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVH 305
+A M L+ A V++HSI WQYLP R A A I A A A + P
Sbjct 252 ERADAIDWLRMRLSIAHSGAAHVVYHSIAWQYLPQIARNAGEALISAAGAAATSEAPLAR 311
Query 306 LTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
L +E Q PGA + ++ WP G ++G HG V WQ
Sbjct 312 LQMEADGQAPGAALS----LQIWPSGDKHLVGRADFHGRWVAWQ 351
>gi|256374961|ref|YP_003098621.1| hypothetical protein Amir_0814 [Actinosynnema mirum DSM 43827]
gi|255919264|gb|ACU34775.1| conserved hypothetical protein [Actinosynnema mirum DSM 43827]
Length=362
Score = 185 bits (469), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 136/361 (38%), Positives = 186/361 (52%), Gaps = 23/361 (6%)
Query 7 LVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGG 66
L R R C + SP+ LL +AD++SGG ++A+ + A G LR
Sbjct 2 LAELFRQSARDCAGA-SPLTSTLLAAASADLDSGGPTKRVMANAEWARAGDVPALRFAAA 60
Query 67 LHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPP-QTNEVGR 125
+HR+VL+GRAP L YP+ GG+ + A W D + + LRA +D QTNE GR
Sbjct 61 VHRVVLEGRAPALAAHYPTVGGSPELGALWADARGVVEEHADELRALVDTTTVQTNEPGR 120
Query 126 SAALIGGL--------LIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLAD-- 175
S L GGL A + P+RL E+G+S GLNLRP +R YL G+ L D
Sbjct 121 SGPLFGGLHTATALAAAAAGRRTPFPVRLLEVGASGGLNLRP--HRIAYLHGDRVLGDPS 178
Query 176 SPVRIDNAWLGELPPTAT----VRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLE 231
SP+R+D W GE P +R+V R G D P+DV++ DG + LS++WPDQ +R
Sbjct 179 SPLRLDTGWSGE--PEGDLDRPLRLVGRGGCDPNPVDVSTVDGRRHLLSFVWPDQRERWA 236
Query 232 RLRGAIAVARNIPADLHRQAAHAAVAGMTL--TDDALTVLWHSITWQYLPADERAAIRAG 289
RL A+ +A P + R A + D LTV+WHSI WQY A ERAA RA
Sbjct 237 RLGAALDLAAVDPVPVRRAPASEWLGEQLARPERDVLTVVWHSIVWQYASAAERAAGRAV 296
Query 290 IDALAAQADAHCPFVHLTLEPAH-QRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
+ + A +A A P L E P ++ + ++ WP G + LG PHG P TW
Sbjct 297 LASAAERATAAAPLALLVFESRRGHDPALPYEFQLLLKLWPAGRSLRLGAGGPHGTPFTW 356
Query 349 Q 349
+
Sbjct 357 K 357
>gi|84683370|ref|ZP_01011273.1| hypothetical protein 1099457000264_RB2654_18393 [Maritimibacter
alkaliphilus HTCC2654]
gi|84668113|gb|EAQ14580.1| hypothetical protein RB2654_18393 [Rhodobacterales bacterium
HTCC2654]
Length=343
Score = 180 bits (457), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 124/345 (36%), Positives = 165/345 (48%), Gaps = 8/345 (2%)
Query 7 LVHTLRSQGRVCTSSGSPMYRELLELVAADVESGG-VFASILADQKG-APEGQAVPLRLL 64
L R Q + + GSP +L LVA + G V +LA + P GQ+VPLRLL
Sbjct 3 LATAFREQAKSNEALGSPFSARVLRLVADRIAPGSPVMDRMLAFEGDIGPSGQSVPLRLL 62
Query 65 GGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVG 124
GGLH +VL G P L YP T + + ++L L PPQTNEV
Sbjct 63 GGLHALVLSGEDPDLAACYPPNPAT-DDATLGAALDAALATRTDTLLTYLALPPQTNEVR 121
Query 125 RSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAW 184
RSA +I ++ LP L E+G+SAGLNL DRY G G AD V + W
Sbjct 122 RSAVMIAAGHWLADRYGLPFVLTELGASAGLNLMWDRYALDLPCGYRGPADPAVTLSPDW 181
Query 185 LGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIP 244
G PP A + + +R G D+AP+DV P E LSY+W DQ +R+ER R AIAV +
Sbjct 182 TGPCPPEAKIEVTDRRGIDVAPLDVHDPADERRLLSYLWADQPERIERTRAAIAV-YDAQ 240
Query 245 ADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFV 304
D + + + +++H+I WQY P +AA +A A P
Sbjct 241 VDQSDAMSFLPIRVAIRRPGHIHLVFHTIAWQYFPPATKAACEIAFEAAGKAATLDAPIA 300
Query 305 HLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
L++E Q PGA + + +WPGG LG HG V WQ
Sbjct 301 RLSMEADGQGPGAAMT----LTTWPGGEVHNLGRVDFHGRWVDWQ 341
>gi|84494637|ref|ZP_00993756.1| hypothetical protein JNB_07564 [Janibacter sp. HTCC2649]
gi|84384130|gb|EAQ00010.1| hypothetical protein JNB_07564 [Janibacter sp. HTCC2649]
Length=343
Score = 178 bits (452), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 133/326 (41%), Positives = 169/326 (52%), Gaps = 14/326 (4%)
Query 25 MYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLDGRAPVLRRWYP 84
+Y L+ +AAD E GGV ILA ++ AP G V LRLL G+HR+VL G AP L +YP
Sbjct 24 LYGVLMRDLAADWERGGVVREILAGREDAPPGDMVQLRLLAGVHRIVLRGDAPELAAFYP 83
Query 85 STGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGGLLIACLQFDL-P 143
S GGT A WP + LR ALD PQTNEVGRS AL+ GL A + +
Sbjct 84 SVGGTADRYAVWPALEPVLRSHVAELREALDVAPQTNEVGRSIALLAGLSEALRRSGMRK 143
Query 144 IRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRIVERHGYD 203
+RL E G+SAGLNL D++R+ G G D+ + + P +VERHG D
Sbjct 144 VRLLEPGASAGLNLLVDQFRFEGDGWTCGPDDAQLVLAGCEAAGFTPE-PFEVVERHGCD 202
Query 204 IAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQAAHAAVAGMTLT- 262
+ P D T+P+GE SYIWP +R RL A+A R P + R A V +
Sbjct 203 LDPFDATTPEGEAYLRSYIWPHMPERDGRLVAALATLREHPVTIDRAPAADWVRDQLASP 262
Query 263 --DDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLEPAHQRPGAQIK 320
D LTV+WHSIT QY PA E AA+ A ID +A + P V + LE P +
Sbjct 263 APDGVLTVVWHSITRQYWPAAEYAAMLAAID----EARSRLPVVRVALEDPSPLPTSGT- 317
Query 321 YLVRMRSWPGGHARVLGECHPHGPPV 346
R V+G C HGPP+
Sbjct 318 ----WRPQVEVDDDVIGHCTHHGPPL 339
>gi|260428030|ref|ZP_05782009.1| conserved hypothetical protein [Citreicella sp. SE45]
gi|260422522|gb|EEX15773.1| conserved hypothetical protein [Citreicella sp. SE45]
Length=344
Score = 175 bits (444), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 127/347 (37%), Positives = 175/347 (51%), Gaps = 12/347 (3%)
Query 5 EHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKG--APEGQAVPLR 62
+ L Q + C + GSP LL +A D A LA+ G P G +VPLR
Sbjct 2 SRITDALNVQAKSCVALGSPFMGRLLSGLAQDWPDT-PLARRLAEWPGEIGPAGHSVPLR 60
Query 63 LLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPES-LRAALDRPPQTN 121
L GGLH +VL GRA L YP +A A V A + E+ L + PPQTN
Sbjct 61 LAGGLHALVLTGRAEPLAAVYPPNDAPVEALIA---AVHGAMARHETFLDDWMRSPPQTN 117
Query 122 EVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRID 181
E+ RS+ LI L+ +F LP+RL E+G+S GLNL DRY R + G D + +D
Sbjct 118 ELRRSSVLIPAALLLTERFGLPLRLSEMGASGGLNLLFDRYALRIGAEQRGARDPALVLD 177
Query 182 NAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVAR 241
AW G LPP ++++ +R G D+ P+D +P L ++Y+WPDQ+DR++R R A+A+ R
Sbjct 178 PAWTGPLPPAVSLQVADRRGVDLNPLDPANPADALRLVAYLWPDQSDRIDRTRRAMAIGR 237
Query 242 NIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHC 301
P D A + +++ +I WQY P + +A RA I+ A A
Sbjct 238 -APVDRGDAADWIGARMAGNAPGLIQMIYTTIAWQYFPPEAQARARAAIETAGAAATEDA 296
Query 302 PFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
P + LE QRPGA I +R WPG + LG HG V W
Sbjct 297 PVAWVALEDDGQRPGAGIT----LRLWPGDRSFSLGRADFHGRWVNW 339
>gi|85373348|ref|YP_457410.1| hypothetical protein ELI_02605 [Erythrobacter litoralis HTCC2594]
gi|84786431|gb|ABC62613.1| hypothetical protein ELI_02605 [Erythrobacter litoralis HTCC2594]
Length=348
Score = 173 bits (439), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 120/349 (35%), Positives = 179/349 (52%), Gaps = 12/349 (3%)
Query 5 EHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLL 64
+ L + Q + G+P +++ + A + +A+ +G A+PLR+
Sbjct 4 KSLDEAIEWQAQHAEEGGAPGTAKVIRGLLAVSRTETATGRRIANWQGLTLKDAMPLRIN 63
Query 65 GGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPES-LRAALDRPPQTNEV 123
GGLH +VL G L Y GG +AA ++V + ++ L LD PPQTNE
Sbjct 64 GGLHNLVLTGEDTRLGAVY---GGLMTDQAAVDELVCELFESYDARLLPWLDGPPQTNEA 120
Query 124 GRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNA 183
GRSA+L+ GLL + E+GSSAG+N +RY + G G SP+RI
Sbjct 121 GRSASLMAGLLWLAQHVPAQFEMLELGSSAGINTMMERYFFDLGGVTTGPEASPMRIAPD 180
Query 184 WLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNI 243
W G+ PPT +IV G D+APID++ P+ L SY+WP+ +R+ R+ A+ +A
Sbjct 181 WKGDPPPTTAPQIVSIRGCDVAPIDLSDPEAALRLKSYVWPEAFERMGRIDAAVELAGQR 240
Query 244 PADLHRQAAHAAVAGMTL--TDDALT-VLWHSITWQYLPADERAAIRAGIDALAAQADAH 300
P D+ +Q A + VA D +T VL+HSI WQY+P D++ AIR ID A++A
Sbjct 241 PPDVVKQDAGSFVAEALAQPQDKGVTRVLFHSIVWQYIPDDQQQAIRDAIDEAASKATPE 300
Query 301 CPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGEC-HPHGPPVTW 348
P ++LE + ++ + + WPGG L C HPHG V W
Sbjct 301 RPLAWVSLETNRK----TFRHELHVTYWPGGAEPTLLACAHPHGAWVEW 345
>gi|15966603|ref|NP_386956.1| hypothetical protein SMc03928 [Sinorhizobium meliloti 1021]
gi|334317606|ref|YP_004550225.1| hypothetical protein Sinme_2904 [Sinorhizobium meliloti AK83]
gi|15075875|emb|CAC47429.1| Conserved hypothetical protein [Sinorhizobium meliloti 1021]
gi|333812907|gb|AEG05576.1| protein of unknown function UCP012608 [Sinorhizobium meliloti
BL225C]
gi|334096600|gb|AEG54611.1| protein of unknown function UCP012608 [Sinorhizobium meliloti
AK83]
gi|336034329|gb|AEH80261.1| hypothetical protein SM11_chr3017 [Sinorhizobium meliloti SM11]
Length=358
Score = 172 bits (436), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 125/353 (36%), Positives = 162/353 (46%), Gaps = 10/353 (2%)
Query 1 VTGTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAP--EGQA 58
V G + + R Q + C GSP L LVA +++ + +G P +G +
Sbjct 5 VAGETCVRNAFRGQAKSCDELGSPFTARLCRLVADRLDASSAVGERILGWRGDPTSKGDS 64
Query 59 VPLRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPP 118
V LRL G LH +VL GR+ L YP E W I + + L L P
Sbjct 65 VALRLAGALHALVLSGRSESLGASYPPNSA--DDETLWRAIDQAIRQESRFLLDRLTSAP 122
Query 119 QTNEVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPV 178
QTNEV RS AL+ G L F P+ + EIG+SAGLNL DRYRY G WG + V
Sbjct 123 QTNEVRRSGALLPGFLTVAQLFGKPLVISEIGASAGLNLHWDRYRYDLASGRWGDEAAAV 182
Query 179 RIDNAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGA-- 236
I W G PP V I++R G D+ P++ L LSYIW DQ DR++R R A
Sbjct 183 VIAPEWAGGPPPPRPVEIIDRAGCDLHPLNPADGGDRLRLLSYIWADQQDRIDRTRQALK 242
Query 237 IAVARNIPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQ 296
IA +R+ P + + A V++HSI WQYLP R A I A A
Sbjct 243 IAASRSNPVERADAIDWLKTRLARIYPGAAHVVYHSIAWQYLPEAARKEGDALIAAAGAA 302
Query 297 ADAHCPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
A P L +E Q PGA + + WP G +G HG V W+
Sbjct 303 ATQEAPLARLQMEADGQTPGAALSLQI----WPAGETHAVGRADFHGRWVDWK 351
>gi|296537372|ref|ZP_06899229.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957]
gi|296262300|gb|EFH09068.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957]
Length=302
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 129/297 (44%), Positives = 156/297 (53%), Gaps = 15/297 (5%)
Query 58 AVPLRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRP 117
A+ LRL GGLH +VL G+AP L YP A + T Q ESLR L
Sbjct 1 ALALRLAGGLHALVLAGQAPALAACYPPHPAP-ADAAFLTALQATLAAQEESLRGFLASA 59
Query 118 PQTNEVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSP 177
PQTNEVGRSA L+GG L LP+RL EIG+SAGLNL DR+ YR EWG +SP
Sbjct 60 PQTNEVGRSAVLLGGFLKIAAATALPLRLLEIGASAGLNLAWDRFFYRLGAAEWGDPESP 119
Query 178 VRIDNAWLGELPPT-ATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGA 236
V++ W G LPP A + ++ R G D+AP+ V P L +Y+WPDQ +RL RL GA
Sbjct 120 VQLRPEWHGPLPPLGAPLSVMAREGCDLAPVPVRDPAQALRLRAYVWPDQHERLARLDGA 179
Query 237 IAVARNI-----PADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGID 291
I +AR + PAD A + A TVL+HSI WQYLP + I A +
Sbjct 180 IVLARQLGTEVAPAD----ALDWLRPRLRPATGAATVLYHSIMWQYLPEATQQGILALLR 235
Query 292 ALAAQADAHCPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
A AA A P L E PG L R+ WPGG R L HPHG + W
Sbjct 236 AAAAAATPQAPLAWLRFE---MPPGGGPAEL-RLTLWPGGAERRLATAHPHGQRIDW 288
>gi|339502101|ref|YP_004689521.1| hypothetical protein RLO149_c005300 [Roseobacter litoralis Och
149]
gi|338756094|gb|AEI92558.1| hypothetical protein RLO149_c005300 [Roseobacter litoralis Och
149]
Length=345
Score = 166 bits (421), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 122/349 (35%), Positives = 170/349 (49%), Gaps = 16/349 (4%)
Query 7 LVHTLRSQGRVCTSSGSPMYRELLELVA----ADVESGGVFASILADQKGAPEGQAVPLR 62
L R Q CT GSP +LL ++A A+ G FA+ D P G ++PLR
Sbjct 2 LQEAFRDQAISCTRLGSPFMGQLLGILADYWPANSRLGQYFATFSGDI--GPSGASLPLR 59
Query 63 LLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPE-SLRAALDRPPQTN 121
+ GGLH +VL AP L R YP +A D V A E L + PPQTN
Sbjct 60 IAGGLHALVLSDLAPALTRVYPPNQSE---DALLRDTVLEALHTHEVFLLDWVQSPPQTN 116
Query 122 EVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRID 181
EV RSAAL+ G +A FDLP+ L E+G+SAGLNL D + G +G+ + +
Sbjct 117 EVRRSAALMPGAAVAATYFDLPVYLSELGASAGLNLMWDHFDVALPEGSFGVQAPALTLS 176
Query 182 NAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVAR 241
W G +PP RI +R G D+ P+D + P L +++WPDQ +RL + A +VA
Sbjct 177 PDWNGPMPPQRLPRIAQRAGVDLNPLDPSDPADLLRLTAFLWPDQPERLALTKAAASVAC 236
Query 242 NIPADLHRQAAHAAVAG--MTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADA 299
++ R A + D + ++ H++ WQY P+D + RA I+A A+A
Sbjct 237 T---EIERSDAIDWLEHRLTNAPDQHMHLIQHTVAWQYFPSDAQTRGRALIEAAGARATQ 293
Query 300 HCPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
P L+LE GA + + +R WPG LG HG V W
Sbjct 294 TRPLAWLSLETDGDTKGA-LGAALTLRLWPGDKTLHLGRADFHGRWVKW 341
>gi|83943886|ref|ZP_00956343.1| hypothetical protein EE36_09585 [Sulfitobacter sp. EE-36]
gi|83845133|gb|EAP83013.1| hypothetical protein EE36_09585 [Sulfitobacter sp. EE-36]
Length=344
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 125/351 (36%), Positives = 178/351 (51%), Gaps = 22/351 (6%)
Query 7 LVHTLRSQGRVCTSSGSPMYRELLELVA----ADVESGGVFASILADQKGAPEGQAVPLR 62
L Q C + GSP +L+ ++A AD G FA+I D P G ++PLR
Sbjct 3 LQEAFEEQAVHCIALGSPFMGQLMGVLARDWSADTALGRKFAAIKGDI--GPSGASLPLR 60
Query 63 LLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPES-LRAALDRPPQTN 121
+ GGLH +VL +AP L YP + QA + + VR A E+ L D PQTN
Sbjct 61 IAGGLHALVLKRKAPALMAVYPPHKASDQALS---EAVRDAITTHEAFLLDWTDSAPQTN 117
Query 122 EVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRID 181
EV RSAALI G +A F+LPIRL E+G+S GLNL D + G +G S + +
Sbjct 118 EVRRSAALIAGARVAAQHFNLPIRLSELGASGGLNLMWDHFVLEIEGHRFGSNMSTILLS 177
Query 182 NAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVAR 241
W G+LPP ++ +R G D+ P+D T PD L +SYIW DQ +RL R A +V
Sbjct 178 PDWTGKLPPAINPQVEKRRGVDLNPLDPTRPDHLLRLMSYIWADQPERLTLTRTATSV-- 235
Query 242 NIPADLHRQAAHAAVAGMTL---TDDALTVLWHSITWQYLPADERAAIRAGIDALAAQAD 298
+ A + R A +A TL + L ++ H++ WQY P +A +A I+A +A
Sbjct 236 -MTAQVQRGDAIDWLA-RTLPQSPEGCLHLIQHTVAWQYFPKAAQARGKALIEAAGKRAT 293
Query 299 AHCPFVHLTLEPAHQRPGAQIK-YLVRMRSWPGGHARVLGECHPHGPPVTW 348
+ P L++E + G+ +K + +R WPG L HG + W
Sbjct 294 RNRPLAWLSME----QDGSGLKGAALTLRLWPGDITLPLARVDFHGRWIDW 340
>gi|332716792|ref|YP_004444258.1| hypothetical protein AGROH133_12861 [Agrobacterium sp. H13-3]
gi|325063477|gb|ADY67167.1| hypothetical protein AGROH133_12861 [Agrobacterium sp. H13-3]
Length=346
Score = 163 bits (413), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 121/354 (35%), Positives = 168/354 (48%), Gaps = 24/354 (6%)
Query 5 EHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKG--APEGQAVPLR 62
E + +Q R C S GSP L VAA ++ I+ G P G +VPLR
Sbjct 4 EAVRDAFLAQARACDSLGSPFTARLCRAVAARLDRQTDVGEIVLSWPGDVGPSGDSVPLR 63
Query 63 LLGGLHRMVLDGRAPVLRRWYPS-TGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTN 121
L G LH +V++ + L P WQA A+ + L PPQTN
Sbjct 64 LAGALHALVIEDKITPLVDIAPEDENALWQACAS------ALRFHSGFILERLKSPPQTN 117
Query 122 EVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRID 181
EV RSA L+ G L P+ L E+G+SAGLNL+ DRY+YR WG S V +
Sbjct 118 EVRRSAVLLPGFLSIAELLGKPLVLSEVGASAGLNLQFDRYQYRLGDLAWG-RQSEVSMS 176
Query 182 NAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVAR 241
W G+ PP + ++ER G D+ P+D +S + L +SY+W DQTDRLER A+ +A
Sbjct 177 PEWRGDTPPDKRIEVIERAGCDLNPLDPSSAEDRLRLMSYVWADQTDRLERTAAALRIA- 235
Query 242 NIPADLHRQAAHAA------VAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAA 295
+ LH + A A +A A V++HS+ WQYLP + A I +
Sbjct 236 -VENGLHVEKADAIDWLQRRLAAQ--HSGAAHVVYHSVAWQYLPDALKEAGETLIAEAGS 292
Query 296 QADAHCPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
+A P L +E A PG+ + ++ WP G + +G HG V WQ
Sbjct 293 RATPEAPLARLQME-ADTTPGSAA---ITLQIWPTGKKQEIGRADFHGRWVEWQ 342
>gi|335036021|ref|ZP_08529351.1| hypothetical protein AGRO_3353 [Agrobacterium sp. ATCC 31749]
gi|333792585|gb|EGL63952.1| hypothetical protein AGRO_3353 [Agrobacterium sp. ATCC 31749]
Length=346
Score = 162 bits (410), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 118/349 (34%), Positives = 158/349 (46%), Gaps = 14/349 (4%)
Query 5 EHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKG--APEGQAVPLR 62
E + + Q + C S GSP L VAA ++ + G P G +VPLR
Sbjct 4 EAVRNAFLVQAKACDSLGSPFTARLCRAVAARLDRQTEVGETILSWPGDVGPSGDSVPLR 63
Query 63 LLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNE 122
L G LH + + + L P EA W + L PPQTNE
Sbjct 64 LAGALHALAIQEKIAPLVDIPPD-----DEEALWQACASALRFHQVFILERLKSPPQTNE 118
Query 123 VGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDN 182
V RSA L+ G L P+ L E+G+SAGLNL+ DRYRYR WG S V +
Sbjct 119 VRRSAVLLPGFLSIAEHTGKPLVLSEVGASAGLNLQFDRYRYRLGDFAWG-EQSDVFLSP 177
Query 183 AWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARN 242
W G PP + ++ER G D+ P+D +S + L +SYIW DQTDRLER A+ +A
Sbjct 178 EWRGGTPPDGRIEVIERAGCDLNPLDPSSAEDRLRLMSYIWADQTDRLERTAAALRIAVE 237
Query 243 IPADLHRQAAHAAVAGMTLTDD--ALTVLWHSITWQYLPADERAAIRAGIDALAAQADAH 300
+ + A + T A V++HSI WQYLP + A A I A+A
Sbjct 238 NGLQVEKADAVDWLKRRLATQHTGATHVVYHSIAWQYLPDALKQAGEASIAEAGARATPE 297
Query 301 CPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
P L +E GA I ++ WP G + +G HG V W+
Sbjct 298 APLARLQMEADATPGGAAIT----LQIWPTGDKQEIGRADFHGQWVEWR 342
>gi|15890901|ref|NP_356573.1| hypothetical protein Atu4071 [Agrobacterium tumefaciens str.
C58]
gi|15159206|gb|AAK89358.1| conserved hypothetical protein [Agrobacterium tumefaciens str.
C58]
Length=346
Score = 162 bits (409), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 118/349 (34%), Positives = 159/349 (46%), Gaps = 14/349 (4%)
Query 5 EHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKG--APEGQAVPLR 62
E + + Q + C S GSP L VAA ++ + G P G +VPLR
Sbjct 4 EAVRNAFLVQAKACDSLGSPFTARLCRAVAARLDRQTEVGETILSWPGDVGPSGDSVPLR 63
Query 63 LLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNE 122
L G LH + + + L P EA W + L PPQTNE
Sbjct 64 LAGALHALAIQEKIAPLVDIPPD-----DEEALWQACASALRFHQVFILETLKSPPQTNE 118
Query 123 VGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDN 182
V RSA L+ G L + P+ L E+G+SAGLNL+ DRYRYR WG S V +
Sbjct 119 VRRSAVLLPGFLSIAERTGKPLVLSEVGASAGLNLQFDRYRYRLGDFAWG-EQSDVFLSP 177
Query 183 AWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARN 242
W G PP + ++ER G D+ P+D +S + L +SYIW DQTDRLER A+ +A
Sbjct 178 EWRGGTPPDGRIEVIERAGCDLNPLDPSSSEDRLRLMSYIWADQTDRLERTAAALRIAVE 237
Query 243 IPADLHRQAAHAAVAGMTLTDD--ALTVLWHSITWQYLPADERAAIRAGIDALAAQADAH 300
+ + A + T A V++HSI WQYLP + A A I A+A
Sbjct 238 NGLQVEKADAVDWLKRRLATQHTGATHVVYHSIAWQYLPDALKQAGEALIAEAGARATPE 297
Query 301 CPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
P L +E GA I ++ WP G + +G HG V W+
Sbjct 298 APLARLQMEADATPGGAAIT----LQIWPTGDKQEIGRADFHGQWVEWR 342
>gi|110681039|ref|YP_684046.1| hypothetical protein RD1_3901 [Roseobacter denitrificans OCh
114]
gi|109457155|gb|ABG33360.1| conserved hypothetical protein [Roseobacter denitrificans OCh
114]
Length=346
Score = 160 bits (406), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 122/347 (36%), Positives = 170/347 (49%), Gaps = 26/347 (7%)
Query 14 QGRVCTSSGSPMYRELLELVA----ADVESGGVFASILADQKGAPEGQAVPLRLLGGLHR 69
Q C GSP +LL ++A AD G FA+ D P G ++PLR+ GGLH
Sbjct 10 QAISCARLGSPFMGQLLGILADHWPADSRLGRYFANFGGDI--GPAGASLPLRIAGGLHA 67
Query 70 MVLDGRAPVLRRWYP--STGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSA 127
+VL RAP L R YP + T +A +++ L + PPQTNEV RSA
Sbjct 68 LVLSDRAPALTRVYPPHQSEDTLLRDA----VLQALRTHEVFLLDWVQSPPQTNEVRRSA 123
Query 128 ALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGE 187
AL+ G +A FDLP+ L E+G+S GLNL DR+ G +G+ + + W G
Sbjct 124 ALMPGAAVAATYFDLPVYLSELGASGGLNLMWDRFDVALPEGRFGVRAPALTLRPQWDGP 183
Query 188 LPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVA-----RN 242
+PP +I ER G D+ P+D P L +++WPDQ +RL + A +VA R
Sbjct 184 MPPQRLPQIAERAGVDLNPLDPRDPADLLRLTAFLWPDQPERLALTKAAASVACTKMERG 243
Query 243 IPAD-LHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHC 301
D L ++ A A D + ++ H++ WQY P+ +A RA I+A AQA
Sbjct 244 DAIDWLEKRLADA-------PDHHMHLIQHTVAWQYFPSAAQARGRALIEAAGAQATQTR 296
Query 302 PFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
P L+LE GA + + +R WPG LG HG V W
Sbjct 297 PLAWLSLETDGDTKGA-LGAALTLRLWPGDRTLYLGRADFHGRWVKW 342
>gi|338821649|gb|EGP55618.1| hypothetical protein Agau_L100936 [Agrobacterium tumefaciens
F2]
Length=348
Score = 160 bits (405), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 123/356 (35%), Positives = 171/356 (49%), Gaps = 28/356 (7%)
Query 5 EHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKG--APEGQAVPLR 62
E + + +Q R C S GSP L VA ++ + G P G +VPLR
Sbjct 4 EAVRNAFLAQARACGSLGSPFTARLCRAVATRLDRQTEVGERILSWPGDVGPSGDSVPLR 63
Query 63 LLGGLHRMVLDGR-APVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTN 121
L G LH +V++ + AP++ + WQA D +R + L PPQTN
Sbjct 64 LAGALHAIVIEDKIAPLVDIAPENEDALWQACT---DALRF---HAAFILERLKSPPQTN 117
Query 122 EVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRID 181
EV RSA L+ G L P+ L E+G+SAGLNL+ DRY+YR WG S V +
Sbjct 118 EVRRSAVLLPGFLTLAELTGKPLVLSEVGASAGLNLQFDRYQYRLGDLAWG-EQSEVFMS 176
Query 182 NAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVAR 241
W G PP + I+ER G D+ P+D +S + L +SY+W DQTDRLER ++ +A
Sbjct 177 PEWRGNAPPNTPIEIIERAGCDLNPLDPSSTEDRLRLISYVWADQTDRLERTAASLRIA- 235
Query 242 NIPADLHRQAAHA----AVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQA 297
+ LH + A A T A V++HSI WQYLP A++ + L A+A
Sbjct 236 -VEKGLHVEKADAIDWLKRRLATQHPGAAHVVYHSIAWQYLP----DALKQTGETLIAEA 290
Query 298 DAH----CPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
AH P L +E GA I ++ WP G + +G HG V W+
Sbjct 291 GAHATPDAPLARLQMEADATPGGAAIT----LQIWPTGEKQEIGRADFHGRWVEWR 342
>gi|149185973|ref|ZP_01864288.1| hypothetical protein ED21_24606 [Erythrobacter sp. SD-21]
gi|148830534|gb|EDL48970.1| hypothetical protein ED21_24606 [Erythrobacter sp. SD-21]
Length=354
Score = 159 bits (402), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 119/345 (35%), Positives = 169/345 (49%), Gaps = 21/345 (6%)
Query 14 QGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLD 73
Q + ++G+P +++ + A S A + +G A+PLR+ GG+H ++L
Sbjct 19 QAKHAENAGAPGTAQVVRALLALEGSEAATARRIFAWQGLSLRDAMPLRIAGGIHNLLLT 78
Query 74 GRAPVLRRWYPS-TGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGG 132
G P L Y Q +A +IV T Q L LD PPQTNE GRSA+ G
Sbjct 79 GEEPRLEDVYAGRMPAQDQVDALVREIVETHDFQ---LMPWLDGPPQTNEAGRSASFAAG 135
Query 133 LLI-----ACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGE 187
LL C QF+ EIG+SAG+N RY Y G G + + +RI W G
Sbjct 136 LLWLADGRTCPQFEW----LEIGASAGINTMLGRYHYDLGGVSTGPSGNRMRIVPEWRGA 191
Query 188 LPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADL 247
PP + V+ G DIAP+D+T L SY+WP+ T R+ R+ AIA+A +P ++
Sbjct 192 PPPARDIGFVDARGSDIAPVDLTDEAQALRLKSYVWPEATGRMARIDAAIALASRMPPEI 251
Query 248 HRQAAHAAVAGMTL--TDDALT-VLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFV 304
R A V D+ +T VL HSI WQYLP + I A + ++A P
Sbjct 252 ERMDAGDWVEKELAREQDEGVTRVLAHSIMWQYLPEFTQERIEASLQEAGSRATRERPLA 311
Query 305 HLTLEPAHQRPGAQIKYLVRMRSWPGGHARV-LGECHPHGPPVTW 348
HL+LE + + +++R WPGG ++V L HPHG V W
Sbjct 312 HLSLETNRE----TFAHELKVRYWPGGESQVHLANAHPHGAWVEW 352
>gi|336118795|ref|YP_004573567.1| hypothetical protein MLP_31500 [Microlunatus phosphovorus NM-1]
gi|334686579|dbj|BAK36164.1| hypothetical protein MLP_31500 [Microlunatus phosphovorus NM-1]
Length=355
Score = 157 bits (398), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 121/352 (35%), Positives = 167/352 (48%), Gaps = 22/352 (6%)
Query 9 HTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLH 68
HTL ++ R + +Y + +A D+ESGG ++ AP G + LRLL G+
Sbjct 7 HTLAARFRAHAGEQTHLYGYAMRGLADDLESGGPTREVVRGYVDAPAGAVIQLRLLAGIF 66
Query 69 RMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSA- 127
R+VL RAP L +YP GGT A AWP + LR AL PQTNEVGRS
Sbjct 67 RLVLTHRAPELEPYYPCLGGTAPAAEAWPVLREVIAAHIPELRDALAIAPQTNEVGRSVA 126
Query 128 ----ALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNA 183
C +F RL E+G+SAGLN ++R +G WG DSPV++ +A
Sbjct 127 LLAGLADLAEATGCRRF----RLLELGASAGLNQLIAQFRISGVGWVWGPEDSPVQLPDA 182
Query 184 WLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNI 243
G +P + IV G D+ P+D TS +G L S++WP R RL GA+ +A
Sbjct 183 VEGMMPTPDGIEIVAARGCDLDPVDPTSAEGRLRLTSFVWPFDLHRHARLAGALELATTR 242
Query 244 PADLHRQAAHAAVAGM-----TLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQAD 298
P + R +A +A + LTV+WHS+T Y PA E AA+ + LA
Sbjct 243 PPTVDRASAADWLARQLSGEPEMDPMMLTVVWHSVTQLYWPAKELAAVE---EILAGYGR 299
Query 299 AHCPFVHLTLEPAHQRPGAQIKYLVRMRSWPGGHA----RVLGECHPHGPPV 346
H + +E Q + V R W G + + +G H HG PV
Sbjct 300 EHS-LSEVGMEYPSQGGTHAEQPRVSTRYWAGDGSLPRRQTVGIAHDHGIPV 350
>gi|114797449|ref|YP_761743.1| hypothetical protein HNE_3067 [Hyphomonas neptunium ATCC 15444]
gi|114737623|gb|ABI75748.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444]
Length=360
Score = 157 bits (396), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 112/342 (33%), Positives = 157/342 (46%), Gaps = 14/342 (4%)
Query 2 TGTEHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPL 61
+ E L H R Q C + GSP L + D E G ++ G P A+ L
Sbjct 5 SKDEILAH-FREQAEFCRALGSPFMEALCLAMVEDAEQHGPVGRLIKGWAGDPRRDALAL 63
Query 62 RLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTN 121
R+ G LH L +AP L YPS W EA WP + + + PPQTN
Sbjct 64 RIAGYLHYSALGDKAPELTAVYPSANPDWTMEAVWPVAHDWLARHERAAKVFIKSPPQTN 123
Query 122 EVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLA-DSPVRI 180
E R+ AL+ G L F P+ L E+G+SAGLN DR+ Y+ W L +S V I
Sbjct 124 ETRRAIALLPGFLKVASLFPGPMHLLELGASAGLNQNWDRFNYQ--TTRWELTGNSDVVI 181
Query 181 DNAWLGELPP--TATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIA 238
D W G P + + R D +P++++ P SYIWPDQ RL RL AIA
Sbjct 182 DTDWNGPPPDHIDMSFNVATRAACDQSPVNLSKPSAARRLKSYIWPDQPARLARLDAAIA 241
Query 239 VARNIPADLHRQAAHAAVAGMTLT--DDALTVLWHSITWQYLPADERAAIRAGIDALAAQ 296
+AR + + A + + D+ TV++HS+ QY PA+ R A+ + I+ A+
Sbjct 242 LARRTRVRVEKADAADWLKAKLASRPDEGPTVIYHSVFLQYPPAETRRALLSLIEDAGAE 301
Query 297 ADAHCPFVHLTLEPAHQRPG-AQI-----KYLVRMRSWPGGH 332
A P + EPA G Q+ +++ MR WP G
Sbjct 302 ATWDRPLAWVCFEPAAFFQGPTQVGIEPNEFITYMRVWPEGE 343
>gi|304393390|ref|ZP_07375318.1| conserved hypothetical protein [Ahrensia sp. R2A130]
gi|303294397|gb|EFL88769.1| conserved hypothetical protein [Ahrensia sp. R2A130]
Length=343
Score = 157 bits (396), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 117/345 (34%), Positives = 162/345 (47%), Gaps = 24/345 (6%)
Query 14 QGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQA--VPLRLLGGLHRMV 71
Q C GSPM L L +++ S+ + +G P A VPLRL GGLH +V
Sbjct 13 QADHCDKLGSPMTAHLCRLFTTHLDASTKVGSLCLNWQGDPCSGADNVPLRLCGGLHSLV 72
Query 72 LDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIG 131
L G L Y + E + E L + PPQTNEV R+AAL
Sbjct 73 LSGVNTELADAYSLSLSHITPEL----LTAVMRRNDECLHDFMASPPQTNEVARAAALWP 128
Query 132 GLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPT 191
L+ DLP+ L E GSSAGLN DR+ Y G G S +++ W G+ P
Sbjct 129 CLMAIAGDSDLPLHLLEFGSSAGLNQNLDRFGYDLGGVLCGDLSSRLQLKPKWKGQRPQL 188
Query 192 ATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQA 251
A V++ R G D++P D++ P L SY+WPDQ DRL RL AIA+A P ++ R
Sbjct 189 ADVKVSGRRGVDLSPFDLSDPQQRLRLRSYVWPDQPDRLARLDAAIAIADEHPTNVDRDD 248
Query 252 AHAAVAGMTLTD---DALTVLWHSITWQYLPADERAA----IRAGIDALAAQADAHCPFV 304
A + L D +A TV++ +I WQY+P++ R A +R + ++ P V
Sbjct 249 GLAWL-DRKLADRPQNAKTVVFSTIAWQYMPSEMREAGDTMLRKHMRSVGG------PVV 301
Query 305 HLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
L +E Q PGA + + S R+LG HG + W
Sbjct 302 WLRMEADGQEPGAALTVVDEADS----ELRLLGRADFHGRWIEWH 342
>gi|154251314|ref|YP_001412138.1| hypothetical protein Plav_0858 [Parvibaculum lavamentivorans
DS-1]
gi|154155264|gb|ABS62481.1| conserved hypothetical protein [Parvibaculum lavamentivorans
DS-1]
Length=373
Score = 156 bits (395), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 114/340 (34%), Positives = 158/340 (47%), Gaps = 9/340 (2%)
Query 14 QGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLD 73
Q C GSP + +AA + + F + D +G PE A+PLR G L+ +
Sbjct 22 QALACEHLGSPFTARVCRALAAGLTAETRFGQRILDWEGKPESDALPLRAAGALNALARS 81
Query 74 GRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGGL 133
GRAP L YP + A + A D + L LD PQTNEV RS+A++G
Sbjct 82 GRAPELAAVYPPHEADEKTLARAIEKATAAHD--DFLEGFLDSAPQTNEVARSSAILGLA 139
Query 134 LIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPP-TA 192
L + LP+ + EIGSSAGLNL D Y Y WG D+ V I W G LPP A
Sbjct 140 LHVAKRTGLPLSVHEIGSSAGLNLGFDAYAYELETARWGDPDAAVTIAARWEGALPPLDA 199
Query 193 TVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQAA 252
+++ R G D+ P+D + L+YIWPDQT RL R+ A++ A + + A
Sbjct 200 KLKVAARKGCDLNPLDAGNAADRERLLAYIWPDQTARLARIEAALSFAARSGTKVEKADA 259
Query 253 HAAVA---GMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLE 309
V G + +L H+I WQYLP + +A I A + A A P +++E
Sbjct 260 AEWVERHFGGEGKKGEVRLLMHTIVWQYLPKETQARITAAMARAGAHATKDAPVAWISVE 319
Query 310 PAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
+ + +R+R WP G LG HG W
Sbjct 320 ADGKDAASAC---MRLRLWPEGEDVELGRTDFHGRWAKWS 356
>gi|254487481|ref|ZP_05100686.1| conserved hypothetical protein [Roseobacter sp. GAI101]
gi|214044350|gb|EEB84988.1| conserved hypothetical protein [Roseobacter sp. GAI101]
Length=344
Score = 155 bits (392), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 118/349 (34%), Positives = 166/349 (48%), Gaps = 16/349 (4%)
Query 7 LVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKG--APEGQAVPLRLL 64
L Q C + GSP +L+ ++A D A KG P G ++PLR+
Sbjct 3 LKQAFEDQAAHCVALGSPFMGQLMMVLARDWPRDTALGRKFASAKGDVGPMGASLPLRIA 62
Query 65 GGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPES-LRAALDRPPQTNEV 123
GGLH +VL +AP L YP + +A V A E+ L D PQTNEV
Sbjct 63 GGLHALVLKRKAPELVAVYPPHQTS---DADLSAAVLGALQTHEAFLLEWTDHAPQTNEV 119
Query 124 GRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNA 183
RSAALI G +A F+LPI L E+G+S GLNL D + G +G S + +
Sbjct 120 RRSAALIAGARVAAQHFNLPIHLSELGASGGLNLMWDHFALEIDGHHFGPNMSTILLSPD 179
Query 184 WLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNI 243
W G LPP R+ +R G D+ P+D T D L ++Y+W DQ +RL R A +V +
Sbjct 180 WTGALPPKTQPRVEKRRGVDLHPLDPTRHDHLLRLMAYLWADQPERLNLTRSAASV---M 236
Query 244 PADLHRQAAHAAVAGM--TLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHC 301
A + + A +A +L ++ H++ WQY P +A +A I+A A+A A
Sbjct 237 QAKVDQGDAIDWLAQQLPKAPQGSLHLIQHTVAWQYFPKSAQARGKALIEAAGARATAQR 296
Query 302 PFVHLTLE-PAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
P L +E + GA + +R WPG LG HG + WQ
Sbjct 297 PLAWLAMENDGTDKKGAALT----LRLWPGDITLNLGRVDFHGRWIDWQ 341
>gi|83953526|ref|ZP_00962248.1| hypothetical protein NAS141_14496 [Sulfitobacter sp. NAS-14.1]
gi|83842494|gb|EAP81662.1| hypothetical protein NAS141_14496 [Sulfitobacter sp. NAS-14.1]
Length=323
Score = 155 bits (391), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 117/319 (37%), Positives = 165/319 (52%), Gaps = 18/319 (5%)
Query 35 ADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEA 94
AD G FA+I D P G ++PLR+ GGLH +VL +AP L YP + QA +
Sbjct 14 ADTALGRKFAAIEGDI--GPSGASLPLRIAGGLHALVLKRKAPALMAVYPPHKASDQALS 71
Query 95 AWPDIVRTATDQPES-LRAALDRPPQTNEVGRSAALIGGLLIACLQFDLPIRLFEIGSSA 153
+ VR A E+ L D PQTNEV RSAALI G +A F+LPIRL E+G+S
Sbjct 72 ---EAVRDAITTHEAFLLDWTDSAPQTNEVRRSAALIAGARVAAQHFNLPIRLSELGASG 128
Query 154 GLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRIVERHGYDIAPIDVTSPD 213
GLNL D + G +G S + + W G+LPP ++ +R G D+ P+D T PD
Sbjct 129 GLNLMWDHFVLEIEGHRFGSNMSTILLSPDWTGKLPPAINPQVEKRRGVDLNPLDPTRPD 188
Query 214 GELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQAAHAAVAGMTL---TDDALTVLW 270
L +SYIW DQ +RL R A +V + A + R A +A TL + L ++
Sbjct 189 HLLRLMSYIWADQPERLTLTRTAASV---MTAQVQRGDAIDWLA-RTLPQSPEGCLHLIQ 244
Query 271 HSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLEPAHQRPGAQIK-YLVRMRSWP 329
H++ WQY P +A +A I+A +A + P L++E + G+ +K + +R WP
Sbjct 245 HTVAWQYFPKAAQARGKALIEAAGKRATRNRPLAWLSME----QDGSGLKGAALTLRLWP 300
Query 330 GGHARVLGECHPHGPPVTW 348
G L HG + W
Sbjct 301 GDITLPLARVDFHGRWIDW 319
>gi|85707947|ref|ZP_01039013.1| hypothetical protein NAP1_01890 [Erythrobacter sp. NAP1]
gi|85689481|gb|EAQ29484.1| hypothetical protein NAP1_01890 [Erythrobacter sp. NAP1]
Length=370
Score = 154 bits (390), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 116/343 (34%), Positives = 168/343 (49%), Gaps = 26/343 (7%)
Query 18 CTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLDGRAP 77
CT+ + R L ++ A + +G +A G A+PLR+ GGLH +VL G
Sbjct 38 CTAR---VIRSLTKVAAGETATG----RRIAGWHGLTLKDAMPLRIAGGLHHLVLSGEDD 90
Query 78 VLRRWYP-STGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGGLLIA 136
L R Y Q + ++V T + L LD PPQTNE GRSA+++ GLL
Sbjct 91 RLARVYSGQITDQGQVDRLVCELVETYDHR---LLPWLDGPPQTNEAGRSASIMAGLLWL 147
Query 137 CLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGEL--PPTA-- 192
+ L E+G+SAG+N +RYR+R E G ADSP+RI+ W G PP A
Sbjct 148 AQRVAPRFELLELGASAGVNTMLNRYRFRLGDTEVGPADSPMRIEPEWRGGAGSPPNAPD 207
Query 193 TVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQAA 252
+IV G D+API++ L SY+WPD R+ R+ AI +A P + R+ A
Sbjct 208 EFKIVSVRGCDVAPINLADEASALRLKSYVWPDAPARMARIDAAIELASQDPPQIVRKDA 267
Query 253 HAAVAGMTLTDDA---LTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLE 309
V M A ++HSI WQY+PA+ + AI ++ +A A P ++LE
Sbjct 268 GEFVGDMLSEPQAEGTTRAMFHSIMWQYMPAETQEAITQMVEREGTKASAEKPLAWISLE 327
Query 310 PAHQRPGAQIKYLVRMRSWPGGHA----RVLGECHPHGPPVTW 348
A ++ +++R W GG + +L HPHG V W
Sbjct 328 ----TDPATFRHELKVRYWNGGESDGETTLLSHAHPHGAWVEW 366
>gi|326386553|ref|ZP_08208175.1| hypothetical protein Y88_2447 [Novosphingobium nitrogenifigens
DSM 19370]
gi|326208868|gb|EGD59663.1| hypothetical protein Y88_2447 [Novosphingobium nitrogenifigens
DSM 19370]
Length=363
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/296 (38%), Positives = 145/296 (49%), Gaps = 12/296 (4%)
Query 58 AVPLRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIV-RTATDQPESLRAALDR 116
A+ LRL GGLH +VL G RR P G + +V D L LD
Sbjct 72 ALALRLAGGLHHLVLTG---TDRRLAPVYAGEIVDQNEVDALVGAIVADHDARLLPWLDG 128
Query 117 PPQTNEVGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADS 176
PPQTNE GRSA+++ LL + L E+G+SAG+N DR+R+ G G S
Sbjct 129 PPQTNEAGRSASIMAALLWLSERMGPRFELNELGASAGINTMLDRFRFDLGGTTTGPLAS 188
Query 177 PVRIDNAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGA 236
P++I W G PP+A + IV G D AP+D+ P L SY+WP+ T+R+ R+ A
Sbjct 189 PMQIAPEWKGPPPPSARIDIVGIRGCDRAPVDLADPAQALRLKSYVWPEMTERMARIDAA 248
Query 237 IAVARNIPADLHRQAAHAAVAGMTLT---DDALTVLWHSITWQYLPADERAAIRAGIDAL 293
IA+AR L R A V + D V +HSI WQYLP R I GI+A+
Sbjct 249 IALARMQRPRLDRAEACDWVGARLASPQPADTTRVFFHSIVWQYLPEATREQITRGIEAM 308
Query 294 AAQADAHCPFVHLTLEPAHQRPGAQIKYLVRMRSWPG-GHARVLGECHPHGPPVTW 348
QA + LE Q ++ + +R WPG G A VLG H HG V W
Sbjct 309 GVQATTSRRLAWIRLETNRQ----TFRHELSVRFWPGDGEALVLGTAHAHGAWVEW 360
>gi|296284340|ref|ZP_06862338.1| hypothetical protein CbatJ_11976 [Citromicrobium bathyomarinum
JL354]
Length=368
Score = 151 bits (382), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 122/361 (34%), Positives = 173/361 (48%), Gaps = 23/361 (6%)
Query 2 TGTEHLVHTLRSQGRVCTSSGSPMYRELLELVAA--DVESGGVFASILADQKGAPEGQAV 59
G E + +Q C +G+P+ E+ E + D E GG + GAP A+
Sbjct 16 KGFEAVQRAFANQVAYCRDNGAPVTAEICEALLGLLDTERGGAVMRRVRKWAGAPLADAL 75
Query 60 PLRLLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPES-LRAALDRPP 118
PLR+ GGLH + L P L Y Q + D+V A ++ E+ L LD PP
Sbjct 76 PLRIAGGLHALHLGDDDPALSAIYLR-----QRVSNPKDVVADAIERHEAFLMPWLDGPP 130
Query 119 QTNEVGRSAALIGGLL-IACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSP 177
QTNE GRS A +L ++ L EIGSSAG+NL RY + G G +
Sbjct 131 QTNEAGRSWAYAAAMLWLSDKGLPAQFALNEIGSSAGINLMMRRYFFDLGGVTAGPGGAQ 190
Query 178 VRIDNAWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAI 237
+R+ W G PP IV G DIAP+D+T P L +YIWP+ T+R R+ AI
Sbjct 191 MRLVPEWRGSPPPDTAYDIVGARGCDIAPVDLTDPAQALRLKAYIWPEFTERFARMDAAI 250
Query 238 AVARNIPADLHRQAAHAAVAGMTLTDDA----LTVLWHSITWQYLPADERAAIRAGIDAL 293
A A +P ++ R++A V + L + A V+ HSI WQY+P +R + I+A
Sbjct 251 AAANTMPPEIARESADIFVEKV-LAERAKPGVTRVIMHSIVWQYVPEYQREKVTEAIEAA 309
Query 294 AAQADAHCPFVHLTLEPAHQRPGAQIKYLVRMRSWP----GGHA-RVLGECHPHGPPVTW 348
A+A P ++LE ++ + +R WP GG + LG HPHG V W
Sbjct 310 GAKATQDAPLAWISLEANRD----THRHELSVRYWPDSEGGGEGWQRLGVAHPHGAWVEW 365
Query 349 Q 349
+
Sbjct 366 E 366
>gi|220927192|ref|YP_002502494.1| hypothetical protein Mnod_7454 [Methylobacterium nodulans ORS
2060]
gi|219951799|gb|ACL62191.1| conserved hypothetical protein [Methylobacterium nodulans ORS
2060]
Length=347
Score = 149 bits (375), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 122/336 (37%), Positives = 159/336 (48%), Gaps = 12/336 (3%)
Query 18 CTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGLHRMVLDGRAP 77
C GSP L LVA ++ + D G PE A+ LRL GGLH +V GR P
Sbjct 18 CARLGSPFTASLCGLVAEWLDRRSAIGCRILDWPGPPEADALALRLCGGLHALVRRGRLP 77
Query 78 VLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGGLL-IA 136
L YP A W R + L LD PPQTNEV RS L+ GL+ +A
Sbjct 78 ELAILYPPA--PLDPAALWDATARALDEAGADLDPWLDGPPQTNEVARSGVLMPGLMAVA 135
Query 137 CLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRI 196
+ P+ L+EIG+SAGLNL DRY Y G G +PVR+ W G PP A V +
Sbjct 136 AATGERPMILWEIGASAGLNLVLDRYAYDLGGVAAGDPAAPVRLVPDWTGPPPPAARVAV 195
Query 197 VERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQAAHAAV 256
R G D+ P+D+ ++YIWPDQ +RL R+ AIA A P L R A A +
Sbjct 196 AARRGVDLNPLDLREASHRERLVAYIWPDQRERLARMEAAIACAAETPPPLDRGEATAWL 255
Query 257 AGMTL---TDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLEPAHQ 313
A + V+ HSI QY P +A + A + A+A P L A++
Sbjct 256 ADRLAEPPQPGTVRVVQHSIALQYFPPAGQARVGALLAEAGARASPATPLAWL----AYE 311
Query 314 RPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
G+ + + WPGG R+L HPHG + W
Sbjct 312 FDGSACALTLTL--WPGGERRILASAHPHGQWLRWS 345
>gi|332186538|ref|ZP_08388282.1| hypothetical protein SUS17_1623 [Sphingomonas sp. S17]
gi|332013521|gb|EGI55582.1| hypothetical protein SUS17_1623 [Sphingomonas sp. S17]
Length=368
Score = 147 bits (372), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/344 (32%), Positives = 158/344 (46%), Gaps = 12/344 (3%)
Query 8 VHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAPEGQAVPLRLLGGL 67
+ L Q RV + G+P +LL V ++ A ++ A+ +R+ L
Sbjct 31 ISELSRQSRVMRTMGTPFVADLLAAVDRQLDHAPHTARLIRSWGRTAASSAIAMRINAAL 90
Query 68 HRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSA 127
H + GR P+L Y + A + L L + PQTNEVGR+A
Sbjct 91 HALARQGRVPLLSALYAGEHRRFDEAVAL-----ALASHDDLLVDWLHQVPQTNEVGRAA 145
Query 128 ALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGE 187
A L++ LFEIG+SAGLNL RY Y G G A SPV I AW G
Sbjct 146 AFHAALMVLARDHGGVFDLFEIGASAGLNLNLARYAYDLGGVRTGDAHSPVHIAPAWHGS 205
Query 188 LPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADL 247
PP V I E G D+ P+D+ P S+I+ DQ +R RL A+A+AR P +
Sbjct 206 PPPNVPVVIGEARGVDLHPVDIHDPAACERLASFIFADQPERGARLENALALARRHPPHM 265
Query 248 HRQAAH---AAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFV 304
+A AA + T++ V+ HS+ QY+ A+ER AI + + +A F
Sbjct 266 AAGSAADWLAAQFSVPSTEERHRVVLHSMVLQYVGAEERGAIERVLARVGGEACRSRTFA 325
Query 305 HLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
+ E + Q + +R+RSWP G +VL CHP+G + W
Sbjct 326 CIGFEWDER----QERVELRLRSWPDGRDQVLAHCHPYGAWIEW 365
>gi|126735030|ref|ZP_01750776.1| hypothetical protein RCCS2_14174 [Roseobacter sp. CCS2]
gi|126715585|gb|EBA12450.1| hypothetical protein RCCS2_14174 [Roseobacter sp. CCS2]
Length=346
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 111/337 (33%), Positives = 155/337 (46%), Gaps = 11/337 (3%)
Query 14 QGRVCTSSGSPMYRELLELVAA-DVESGGVFASILADQKG-APEGQAVPLRLLGGLHRMV 71
Q + C + GSP L+ L + G V I A Q +P GQ+VPLRL G LH +
Sbjct 13 QSKACANLGSPFMERLMALCGTMEWPEGSVRDRIFAWQGDISPAGQSVPLRLAGALHALH 72
Query 72 LDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIG 131
L G + + + P+ Q W + R + + A LD PQTNEV RSAALI
Sbjct 73 LLGHVGLRQVYPPNIVSDTQL---WNAVSRALVADADHINAWLDSAPQTNEVRRSAALIP 129
Query 132 GLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPT 191
+ +F LP+R E+G+S GLNL D Y + G +D + + W G PP
Sbjct 130 VGHLLADRFGLPLRTSELGASGGLNLHWDAYALQLGDTTRGASDPALTLAPDWTGPYPPD 189
Query 192 ATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHRQA 251
V I R G D+ P++ PD L +Y+WPDQ +RL R AIA A+ P D A
Sbjct 190 TAVTIASRGGVDLNPLNPAHPDQALRLQAYLWPDQPERLTLTRAAIATAQT-PVD-QGDA 247
Query 252 AHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLEPA 311
+T +++ ++ WQY PA ++A A I+ A A P +E
Sbjct 248 IDWIKPRLTHVKGQTHLIYSTVAWQYFPAAKQAEGTALIEEAGKSATADTPLAWFGMEND 307
Query 312 HQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
+ GA + +R WPG LG HG + W
Sbjct 308 NSGHGAALT----LRLWPGNVTLDLGRADFHGRWIAW 340
>gi|89069013|ref|ZP_01156394.1| hypothetical protein OG2516_17041 [Oceanicola granulosus HTCC2516]
gi|89045382|gb|EAR51447.1| hypothetical protein OG2516_17041 [Oceanicola granulosus HTCC2516]
Length=343
Score = 146 bits (368), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 129/346 (38%), Positives = 164/346 (48%), Gaps = 13/346 (3%)
Query 5 EHLVHTLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKG--APEGQAVPLR 62
HL LR Q R C GSP LL L+A + G L +G G +VPLR
Sbjct 2 SHLRAALRHQARSCAMLGSPFMERLLLLLADRLAPGTPVTDRLFGWEGDIGSSGDSVPLR 61
Query 63 LLGGLHRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNE 122
L G LH +VL G A LR YP T EA W + D+ E L LD PPQTNE
Sbjct 62 LAGALHGLVLGGHA-GLRAVYPPEEAT--DEALWAAVEAALRDEAEVLNRWLDSPPQTNE 118
Query 123 VGRSAALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDN 182
V RS AL+ +++LP L+E+G+SAGLNL DRY G G A + +
Sbjct 119 VRRSVALVAAAQWLTARWNLPFDLYELGASAGLNLGFDRYAVETPLGSLGPAAPALTLRP 178
Query 183 AWLGELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARN 242
W G LP R+ R G D+ P+D L L+Y+WPDQ +R AIA A
Sbjct 179 DWTGALPHGPAARVAARRGVDLRPLDPAQ--DRLRLLAYLWPDQPERRTLTEAAIA-AHT 235
Query 243 IPADLHRQAAHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCP 302
D AA A + L +++H+I WQY PA R RA I+A AA A P
Sbjct 236 ATVDAG-DAAGWLEARLAPAPGRLALVYHTIAWQYFPATTRQRARAAIEAAAASATDDAP 294
Query 303 FVHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
V L +E ++PGA + + +PGGH LG HG + W
Sbjct 295 LVWLGMEADGRQPGAALSATL----YPGGHTHELGRIDFHGRWICW 336
>gi|89056082|ref|YP_511533.1| hypothetical protein Jann_3591 [Jannaschia sp. CCS1]
gi|88865631|gb|ABD56508.1| hypothetical protein Jann_3591 [Jannaschia sp. CCS1]
Length=356
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 120/345 (35%), Positives = 160/345 (47%), Gaps = 19/345 (5%)
Query 13 SQGRVCTSSGSPMYRELLELVAADVE-SGGVFASILA-DQKGAPEGQAVPLRLLGGLHRM 70
SQGR GSP L+ L+ +++ S V LA D + GQ+VPLRL G LH +
Sbjct 11 SQGRATAKLGSPFMARLMPLIGQNLDDSTAVGHRCLAWDGDVSAAGQSVPLRLAGALHGL 70
Query 71 VLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALI 130
VLDG L YP T + W ++ + T + LDRPPQTNEV R+AA+I
Sbjct 71 VLDGTDARLTAAYPPN--TVDDDTLWQAVLESLTTHEARIMDWLDRPPQTNEVRRAAAVI 128
Query 131 GGLLIACLQF-DLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELP 189
G+ A Q P+ L E+G+SAGLNL DR+ G S VR+ W G
Sbjct 129 AGIWWALGQVGQTPVILTELGASAGLNLSLDRFALSMGRGLHVAPQSSVRLKPDWTGPFV 188
Query 190 PTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPADLHR 249
+ + R G D++P+D P L L+YIWPDQ +R+ R R AIA+ +D
Sbjct 189 RPHPIHVTTRAGVDLSPLDPKDPTDALRLLAYIWPDQPERMARTRAAIAL-----SDTRV 243
Query 250 QAAHAAV-AGMTLTDD--ALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHL 306
A AA L D L V++ +I QY A I + A A P +HL
Sbjct 244 DADDAAPWLAQRLADPWVGLHVVYTTIAAQYFSAKTVRDIAENLATHGANATPKAPLLHL 303
Query 307 TLEPAHQRPGAQIKYLVRMRSWPGGHARV--LGECHPHGPPVTWQ 349
+E R GA + + W GG V L HG + WQ
Sbjct 304 AMEADDVRRGAALTASL----WAGGPPVVTTLARVDFHGAWIEWQ 344
>gi|77404787|ref|YP_345359.1| hypothetical protein RSP_4153 [Rhodobacter sphaeroides 2.4.1]
gi|77390437|gb|ABA81618.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1]
Length=363
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/346 (32%), Positives = 160/346 (47%), Gaps = 13/346 (3%)
Query 10 TLRSQGRVCTSSGSPMYRELLELVAADVESGGVFASILADQKGAP--EGQAVPLRLLGGL 67
+ Q +C GS LL V + F + + G P A+ LR+ G L
Sbjct 21 SFADQAELCERFGSTFTAALLRSVLRVLNGHTRFGTRILTWDGNPCATADALALRVAGAL 80
Query 68 HRMVLDGRAPVLRRWYPSTGGTWQAEAAWPDIVRTA-TDQPESLRAALDRPPQTNEVGRS 126
H +V + L + YP A+ ++ A + E L + L+ PQTNEVGR+
Sbjct 81 HALVRRRPSCDLAKAYPPNSSV--GPVAFERLLAEAIAENDEFLSSWLEHAPQTNEVGRA 138
Query 127 AALIGGLLIACLQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLG 186
A L G++ + P+ +FEIG+SAGLNL DRY Y G + G SP+ + W+G
Sbjct 139 ALLYAGMMEVAGRTGCPLSVFEIGTSAGLNLILDRYAYVLSGRKAGNPGSPLVLHPDWIG 198
Query 187 ELPPTATVRIVERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPAD 246
P RIV R G D+APIDVT+ G A +YIWPDQ R R+ AI++ + P
Sbjct 199 PSPREPEPRIVSRCGCDLAPIDVTNAVGRERAHAYIWPDQEQRHRRIAQAISLFLDDPVP 258
Query 247 LHRQAAHAAVAG---MTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPF 303
+ + A V + VL+HS+ + YLP+D + AI ++ + A A + P
Sbjct 259 IEQGNASDWVLNRLRLPGIPGVARVLFHSLMFSYLPSDSQVAIAEHMETIGAHATSQSPV 318
Query 304 VHLTLEPAHQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTWQ 349
L+ E + + +R WPGG L PH + W
Sbjct 319 AWLSFELDR-----NAEPHLALRLWPGGGQERLATADPHCRRIIWH 359
>gi|84515239|ref|ZP_01002601.1| hypothetical protein SKA53_01236 [Loktanella vestfoldensis SKA53]
gi|84510522|gb|EAQ06977.1| hypothetical protein SKA53_01236 [Loktanella vestfoldensis SKA53]
Length=347
Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 117/337 (35%), Positives = 155/337 (46%), Gaps = 23/337 (6%)
Query 20 SSGSPMYRELLELVAADVESGGVFASILADQKG--APEGQAVPLRLLGGLHRMVLDGRAP 77
S GSP +L+ L A G ++ + D G P GQ+VPLRL G LH + L G A
Sbjct 17 SLGSPFMAQLMRLCATQDWPAGAVSTRIHDWTGDLGPSGQSVPLRLAGALHALHLQGHAR 76
Query 78 VLRRWYPSTGGTWQAEAAWPDIVRTATDQPESLRAALDRPPQTNEVGRSAALIGGLLIAC 137
+ + P AA D++ ++Q LR LD PPQTNEV RSA LI
Sbjct 77 LAPVYPPQASDDATLWAAVADVL--VSEQAAILRW-LDSPPQTNEVRRSAVLIALGHWLA 133
Query 138 LQFDLPIRLFEIGSSAGLNLRPDRYRYRYLGGEWGLADSPVRIDNAWLGELPPTATVRIV 197
+F LP+R E+G+SAGLNL+ D Y +G A + + W G LPP ++
Sbjct 134 DRFALPLRCSELGASAGLNLQWDDYALALGRQVFGPATPALTLSPDWTGALPPNTRPQVT 193
Query 198 ERHGYDIAPIDVTSPDGELNALSYIWPDQTDRLERLRGAIAVARNIPAD------LHRQA 251
R G D+ P+D +P+ L +Y+WPDQ DR + AI AR I A L Q
Sbjct 194 ARSGVDLTPLDPHAPNDALRLRAYLWPDQPDRQMLTQAAITTARTIVAKGDAIDWLPGQL 253
Query 252 AHAAVAGMTLTDDALTVLWHSITWQYLPADERAAIRAGIDALAAQADAHCPFVHLTLEPA 311
H AG T +++ +I WQY P + A I A A P +EP
Sbjct 254 DHH--AGQT------HLIYTTIAWQYFPTAVQDRGAAMIRAAGQSARDDAPLAWFGMEPD 305
Query 312 HQRPGAQIKYLVRMRSWPGGHARVLGECHPHGPPVTW 348
PGA + +R WPG LG HG V W
Sbjct 306 GTGPGAALT----LRLWPGDLTFALGRADFHGRWVQW 338
Lambda K H
0.320 0.137 0.435
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 645334497412
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40