BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1192
Length=275
Sequences producing significant alignments: Score (Bits) E Value
gi|15608332|ref|NP_215708.1| hypothetical protein Rv1192 [Mycoba... 556 1e-156
gi|340626206|ref|YP_004744658.1| hypothetical protein MCAN_12031... 555 3e-156
gi|308231769|ref|ZP_07413699.2| hypothetical protein TMAG_01824 ... 548 4e-154
gi|289749735|ref|ZP_06509113.1| LOW QUALITY PROTEIN: hypothetica... 490 1e-136
gi|294993376|ref|ZP_06799067.1| hypothetical protein Mtub2_02442... 487 8e-136
gi|254364093|ref|ZP_04980139.1| hypothetical protein TBHG_01175 ... 430 9e-119
gi|240169041|ref|ZP_04747700.1| hypothetical protein MkanA1_0699... 371 5e-101
gi|183984216|ref|YP_001852507.1| hypothetical protein MMAR_4244 ... 367 7e-100
gi|118616704|ref|YP_905036.1| hypothetical protein MUL_0942 [Myc... 367 7e-100
gi|339297823|gb|AEJ49933.1| hypothetical protein CCDC5180_1096 [... 364 7e-99
gi|284991885|ref|YP_003410439.1| PGAP1 family protein [Geodermat... 306 2e-81
gi|227205701|dbj|BAH56667.1| hypothetical protein [Rhodococcus s... 278 5e-73
gi|326381293|ref|ZP_08202987.1| hypothetical protein SCNU_00040 ... 238 9e-61
gi|256375898|ref|YP_003099558.1| hypothetical protein Amir_1764 ... 224 1e-56
gi|229488399|ref|ZP_04382265.1| pgap1 family protein [Rhodococcu... 211 7e-53
gi|226308374|ref|YP_002768334.1| hypothetical protein RER_48870 ... 208 6e-52
gi|304394658|ref|ZP_07376577.1| pgap1 family protein [Ahrensia s... 208 7e-52
gi|338973791|ref|ZP_08629154.1| hypothetical protein CSIRO_2241 ... 206 3e-51
gi|257093207|ref|YP_003166848.1| hypothetical protein CAP2UW1_16... 204 1e-50
gi|239817758|ref|YP_002946668.1| hypothetical protein Vapar_4797... 201 1e-49
gi|343918945|gb|EGV29702.1| hypothetical protein ThidrDRAFT_3146... 200 2e-49
gi|319796087|ref|YP_004157727.1| hypothetical protein Varpa_5461... 198 7e-49
gi|260221017|emb|CBA29162.1| hypothetical protein Csp_A10760 [Cu... 196 4e-48
gi|90423165|ref|YP_531535.1| hypothetical protein RPC_1654 [Rhod... 194 8e-48
gi|85373326|ref|YP_457388.1| hypothetical protein ELI_02495 [Ery... 194 1e-47
gi|154252911|ref|YP_001413735.1| hypothetical protein Plav_2469 ... 193 2e-47
gi|27378000|ref|NP_769529.1| hypothetical protein blr2889 [Brady... 192 5e-47
gi|192292659|ref|YP_001993264.1| PGAP1 family protein [Rhodopseu... 188 8e-46
gi|338974237|ref|ZP_08629599.1| hypothetical protein CSIRO_2690 ... 188 8e-46
gi|39936833|ref|NP_949109.1| hypothetical protein RPA3772 [Rhodo... 188 9e-46
gi|27377990|ref|NP_769519.1| hypothetical protein blr2879 [Brady... 186 3e-45
gi|338973781|ref|ZP_08629144.1| hypothetical protein CSIRO_2231 ... 186 4e-45
gi|152982831|ref|YP_001353694.1| hypothetical protein mma_2004 [... 186 4e-45
gi|316932943|ref|YP_004107925.1| hypothetical protein Rpdx1_1572... 185 6e-45
gi|149185953|ref|ZP_01864268.1| hypothetical protein ED21_24506 ... 185 8e-45
gi|124006675|ref|ZP_01691507.1| conserved hypothetical protein [... 184 2e-44
gi|284989856|ref|YP_003408410.1| hypothetical protein Gobs_1301 ... 182 6e-44
gi|121604610|ref|YP_981939.1| hypothetical protein Pnap_1705 [Po... 181 7e-44
gi|254514148|ref|ZP_05126209.1| pgap1 family protein [gamma prot... 179 3e-43
gi|91788171|ref|YP_549123.1| hypothetical protein Bpro_2302 [Pol... 178 8e-43
gi|86750756|ref|YP_487252.1| hypothetical protein RPB_3646 [Rhod... 178 1e-42
gi|146275779|ref|YP_001165939.1| PGAP1 family protein [Novosphin... 177 2e-42
gi|288940094|ref|YP_003442334.1| hypothetical protein Alvin_0340... 177 2e-42
gi|88703515|ref|ZP_01101231.1| conserved hypothetical protein [C... 177 2e-42
gi|91976296|ref|YP_568955.1| hypothetical protein RPD_1818 [Rhod... 175 8e-42
gi|119478567|ref|ZP_01618510.1| hypothetical protein GP2143_1231... 175 8e-42
gi|89900267|ref|YP_522738.1| hypothetical protein Rfer_1474 [Rho... 174 1e-41
gi|115523702|ref|YP_780613.1| hypothetical protein RPE_1684 [Rho... 170 2e-40
gi|85707926|ref|ZP_01038992.1| hypothetical protein NAP1_01785 [... 169 4e-40
gi|334141171|ref|YP_004534377.1| PGAP1 family protein [Novosphin... 165 7e-39
>gi|15608332|ref|NP_215708.1| hypothetical protein Rv1192 [Mycobacterium tuberculosis H37Rv]
gi|15840635|ref|NP_335672.1| hypothetical protein MT1229 [Mycobacterium tuberculosis CDC1551]
gi|31792385|ref|NP_854878.1| hypothetical protein Mb1224 [Mycobacterium bovis AF2122/97]
70 more sequence titles
Length=275
Score = 556 bits (1433), Expect = 1e-156, Method: Compositional matrix adjust.
Identities = 275/275 (100%), Positives = 275/275 (100%), Gaps = 0/275 (0%)
Query 1 MLLPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLL 60
MLLPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLL
Sbjct 1 MLLPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLL 60
Query 61 AGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWS 120
AGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWS
Sbjct 61 AGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWS 120
Query 121 LGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMES 180
LGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMES
Sbjct 121 LGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMES 180
Query 181 EPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQP 240
EPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQP
Sbjct 181 EPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQP 240
Query 241 QGAWAPFRPPKVLSPLFPRPDTPAEAVSTPQTRPA 275
QGAWAPFRPPKVLSPLFPRPDTPAEAVSTPQTRPA
Sbjct 241 QGAWAPFRPPKVLSPLFPRPDTPAEAVSTPQTRPA 275
>gi|340626206|ref|YP_004744658.1| hypothetical protein MCAN_12031 [Mycobacterium canettii CIPT
140010059]
gi|340004396|emb|CCC43539.1| hypothetical protein MCAN_12031 [Mycobacterium canettii CIPT
140010059]
Length=275
Score = 555 bits (1429), Expect = 3e-156, Method: Compositional matrix adjust.
Identities = 273/275 (99%), Positives = 275/275 (100%), Gaps = 0/275 (0%)
Query 1 MLLPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLL 60
ML+PVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLL
Sbjct 1 MLMPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLL 60
Query 61 AGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWS 120
AGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTP+SLIGWS
Sbjct 61 AGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPLSLIGWS 120
Query 121 LGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMES 180
LGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMES
Sbjct 121 LGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMES 180
Query 181 EPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQP 240
EPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQP
Sbjct 181 EPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQP 240
Query 241 QGAWAPFRPPKVLSPLFPRPDTPAEAVSTPQTRPA 275
QGAWAPFRPPKVLSPLFPRPDTPAEAVSTPQTRPA
Sbjct 241 QGAWAPFRPPKVLSPLFPRPDTPAEAVSTPQTRPA 275
>gi|308231769|ref|ZP_07413699.2| hypothetical protein TMAG_01824 [Mycobacterium tuberculosis SUMu001]
gi|308216163|gb|EFO75562.1| hypothetical protein TMAG_01824 [Mycobacterium tuberculosis SUMu001]
Length=271
Score = 548 bits (1411), Expect = 4e-154, Method: Compositional matrix adjust.
Identities = 270/271 (99%), Positives = 271/271 (100%), Gaps = 0/271 (0%)
Query 5 VLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDG 64
+LEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDG
Sbjct 1 MLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDG 60
Query 65 STWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGI 124
STWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGI
Sbjct 61 STWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGI 120
Query 125 FARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLP 184
FARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLP
Sbjct 121 FARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLP 180
Query 185 VPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAW 244
VPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAW
Sbjct 181 VPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAW 240
Query 245 APFRPPKVLSPLFPRPDTPAEAVSTPQTRPA 275
APFRPPKVLSPLFPRPDTPAEAVSTPQTRPA
Sbjct 241 APFRPPKVLSPLFPRPDTPAEAVSTPQTRPA 271
>gi|289749735|ref|ZP_06509113.1| LOW QUALITY PROTEIN: hypothetical protein TBDG_03174 [Mycobacterium
tuberculosis T92]
gi|289690322|gb|EFD57751.1| LOW QUALITY PROTEIN: hypothetical protein TBDG_03174 [Mycobacterium
tuberculosis T92]
Length=276
Score = 490 bits (1261), Expect = 1e-136, Method: Compositional matrix adjust.
Identities = 241/243 (99%), Positives = 241/243 (99%), Gaps = 0/243 (0%)
Query 1 MLLPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLL 60
MLLPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLL
Sbjct 1 MLLPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLL 60
Query 61 AGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWS 120
AGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWS
Sbjct 61 AGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWS 120
Query 121 LGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMES 180
LGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMES
Sbjct 121 LGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMES 180
Query 181 EPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQP 240
EPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQP
Sbjct 181 EPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQP 240
Query 241 QGA 243
G
Sbjct 241 PGC 243
>gi|294993376|ref|ZP_06799067.1| hypothetical protein Mtub2_02442 [Mycobacterium tuberculosis
210]
Length=271
Score = 487 bits (1253), Expect = 8e-136, Method: Compositional matrix adjust.
Identities = 243/255 (96%), Positives = 245/255 (97%), Gaps = 0/255 (0%)
Query 21 YLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAA 80
YL D+ V + AVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAA
Sbjct 17 YLVDLAFRMVGDIGVAAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAA 76
Query 81 YGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQV 140
YGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQV
Sbjct 77 YGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQV 136
Query 141 ITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAW 200
ITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAW
Sbjct 137 ITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAW 196
Query 201 QTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPRP 260
QTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPRP
Sbjct 197 QTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPRP 256
Query 261 DTPAEAVSTPQTRPA 275
DTPAEAVSTPQTRPA
Sbjct 257 DTPAEAVSTPQTRPA 271
>gi|254364093|ref|ZP_04980139.1| hypothetical protein TBHG_01175 [Mycobacterium tuberculosis str.
Haarlem]
gi|134149607|gb|EBA41652.1| hypothetical protein TBHG_01175 [Mycobacterium tuberculosis str.
Haarlem]
Length=263
Score = 430 bits (1106), Expect = 9e-119, Method: Compositional matrix adjust.
Identities = 211/211 (100%), Positives = 211/211 (100%), Gaps = 0/211 (0%)
Query 65 STWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGI 124
STWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGI
Sbjct 53 STWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGI 112
Query 125 FARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLP 184
FARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLP
Sbjct 113 FARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLP 172
Query 185 VPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAW 244
VPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAW
Sbjct 173 VPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAW 232
Query 245 APFRPPKVLSPLFPRPDTPAEAVSTPQTRPA 275
APFRPPKVLSPLFPRPDTPAEAVSTPQTRPA
Sbjct 233 APFRPPKVLSPLFPRPDTPAEAVSTPQTRPA 263
>gi|240169041|ref|ZP_04747700.1| hypothetical protein MkanA1_06994 [Mycobacterium kansasii ATCC
12478]
Length=266
Score = 371 bits (953), Expect = 5e-101, Method: Compositional matrix adjust.
Identities = 183/258 (71%), Positives = 210/258 (82%), Gaps = 0/258 (0%)
Query 11 RPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILR 70
+P AP LYLTDIPRA EYGQL++VLPL+RMLP GDGHPVLVLPGLLAGDGSTW LR
Sbjct 5 KPVSAPPLALYLTDIPRAVAEYGQLVSVLPLRRMLPVGDGHPVLVLPGLLAGDGSTWTLR 64
Query 71 RILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLA 130
R+L RLGY A+GWGLGRNIGPT +AV GM L++LH+ Y P++LIGWSLGGIFAR LA
Sbjct 65 RLLGRLGYRAHGWGLGRNIGPTPEAVRGMELRLEELHASYDVPLTLIGWSLGGIFARTLA 124
Query 131 RDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAI 190
R HP AVRQVITLGSPF M D ++R+ SF RYAHLH+E+H LPL+ E+EP+PVPTTAI
Sbjct 125 RRHPEAVRQVITLGSPFRMEDEGQSRATPSFKRYAHLHSEQHALPLKSEAEPMPVPTTAI 184
Query 191 YSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPP 250
YSR DGMVAWQTC+N P R+ENIAV +SHIGYGH+P VWAIADRLAQP+G+W PFRPP
Sbjct 185 YSRFDGMVAWQTCINPPGPRSENIAVLASHIGYGHHPATVWAIADRLAQPRGSWTPFRPP 244
Query 251 KVLSPLFPRPDTPAEAVS 268
VL PLFP A A +
Sbjct 245 AVLRPLFPGSSKTAAAAA 262
>gi|183984216|ref|YP_001852507.1| hypothetical protein MMAR_4244 [Mycobacterium marinum M]
gi|183177542|gb|ACC42652.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=274
Score = 367 bits (943), Expect = 7e-100, Method: Compositional matrix adjust.
Identities = 183/260 (71%), Positives = 205/260 (79%), Gaps = 0/260 (0%)
Query 3 LPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAG 62
P P AP LYL+DIPRA EYGQL+++ PLQ+ LP GDGHPVLVLPGLLAG
Sbjct 10 FPTAAPHVVSAGAPKMGLYLSDIPRAVAEYGQLVSLFPLQKALPVGDGHPVLVLPGLLAG 69
Query 63 DGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLG 122
DGSTW LR +L RLGY AYGW LG NIGPT+K V GM L+ LH+RY+TPVSL+GWSLG
Sbjct 70 DGSTWTLRWLLGRLGYRAYGWRLGLNIGPTSKVVDGMSARLEALHTRYNTPVSLVGWSLG 129
Query 123 GIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEP 182
GIFAR LAR HP AVRQVITLGSPF M+D ++R+A F + LH ERHELPL E+EP
Sbjct 130 GIFARTLARRHPEAVRQVITLGSPFRMQDESQSRAARHFRIFQRLHAERHELPLPAEAEP 189
Query 183 LPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQG 242
LPVP+TAIYSR DGMVAWQTC+++PSERAENIAV SSHIGYGH+P VWAIADRLAQP
Sbjct 190 LPVPSTAIYSRYDGMVAWQTCLDTPSERAENIAVLSSHIGYGHHPATVWAIADRLAQPVD 249
Query 243 AWAPFRPPKVLSPLFPRPDT 262
WAPFRPP VL PLFPRP T
Sbjct 250 TWAPFRPPTVLRPLFPRPHT 269
>gi|118616704|ref|YP_905036.1| hypothetical protein MUL_0942 [Mycobacterium ulcerans Agy99]
gi|118568814|gb|ABL03565.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=267
Score = 367 bits (943), Expect = 7e-100, Method: Compositional matrix adjust.
Identities = 184/262 (71%), Positives = 206/262 (79%), Gaps = 0/262 (0%)
Query 1 MLLPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLL 60
M P P AP LYL+DIPRA EYGQL+++ PLQ+ LP GDGHPVLVLPGLL
Sbjct 1 MEFPTAAPHVVSAGAPKMGLYLSDIPRAVAEYGQLVSLFPLQKALPVGDGHPVLVLPGLL 60
Query 61 AGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWS 120
AGDGSTW LR +L RLGY AYGW LG NIGPT+K V GM L+ LH+RY+TPVSL+GWS
Sbjct 61 AGDGSTWTLRWLLGRLGYRAYGWRLGLNIGPTSKVVDGMSARLEALHTRYNTPVSLVGWS 120
Query 121 LGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMES 180
LGGIFAR LAR HP AVRQVITLGSPF M+D ++R+A F + LH ERHELPL E+
Sbjct 121 LGGIFARTLARRHPEAVRQVITLGSPFRMQDESQSRAARHFRIFQRLHAERHELPLPAEA 180
Query 181 EPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQP 240
EPLPVP+TAIYSR DGMVAWQTC+++PSERAENIAV SSHIGYGH+P VWAIADRLAQP
Sbjct 181 EPLPVPSTAIYSRYDGMVAWQTCLDTPSERAENIAVLSSHIGYGHHPATVWAIADRLAQP 240
Query 241 QGAWAPFRPPKVLSPLFPRPDT 262
WAPFRPP VL PLFPRP T
Sbjct 241 VDTWAPFRPPTVLRPLFPRPHT 262
>gi|339297823|gb|AEJ49933.1| hypothetical protein CCDC5180_1096 [Mycobacterium tuberculosis
CCDC5180]
Length=177
Score = 364 bits (935), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 177/177 (100%), Positives = 177/177 (100%), Gaps = 0/177 (0%)
Query 99 MRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSA 158
MRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSA
Sbjct 1 MRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSA 60
Query 159 WSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRS 218
WSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRS
Sbjct 61 WSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRS 120
Query 219 SHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPRPDTPAEAVSTPQTRPA 275
SHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPRPDTPAEAVSTPQTRPA
Sbjct 121 SHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPRPDTPAEAVSTPQTRPA 177
>gi|284991885|ref|YP_003410439.1| PGAP1 family protein [Geodermatophilus obscurus DSM 43160]
gi|284065130|gb|ADB76068.1| PGAP1 family protein [Geodermatophilus obscurus DSM 43160]
Length=264
Score = 306 bits (785), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 153/255 (60%), Positives = 182/255 (72%), Gaps = 0/255 (0%)
Query 14 DAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRIL 73
D P LYLT+ RA ++G LA PL LP GDGHPVLVLPG L D ST +LR L
Sbjct 5 DGPALPLYLTEPGRAVADFGLYLAARPLLPRLPQGDGHPVLVLPGFLTDDTSTRVLRATL 64
Query 74 RRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDH 133
RRLGY +GW LGRNIGPT V+GMRD +D L RY P+SL+GWSLGGIFAR LAR
Sbjct 65 RRLGYRVHGWRLGRNIGPTGACVAGMRDRIDDLSDRYGRPLSLVGWSLGGIFARDLARRT 124
Query 134 PSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSR 193
P +VRQV+TLGSP + ++R++ +F+RYAHLH E LPLE + PLPVPTT+IYS
Sbjct 125 PDSVRQVVTLGSPIRLNRHSQSRASRAFDRYAHLHVEHRSLPLEPDGSPLPVPTTSIYSH 184
Query 194 CDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVL 253
DG+V WQTC+ +P ER ENIAV +SH+G GH+P +WAIADRLAQP+G W PF+PP L
Sbjct 185 YDGIVHWQTCLETPGERCENIAVMASHLGLGHHPAALWAIADRLAQPEGTWRPFKPPVFL 244
Query 254 SPLFPRPDTPAEAVS 268
P FPRPD PA V
Sbjct 245 RPAFPRPDVPAPLVE 259
>gi|227205701|dbj|BAH56667.1| hypothetical protein [Rhodococcus sp. HI-31]
Length=260
Score = 278 bits (712), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 144/253 (57%), Positives = 174/253 (69%), Gaps = 0/253 (0%)
Query 10 DRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWIL 69
R APG LY TD RA V+Y L PL LP GD HPVLVLPGL D ST+ L
Sbjct 6 QRGHTAPGRLLYFTDPARAAVDYALLAYSAPLLAALPRGDKHPVLVLPGLNTSDASTYTL 65
Query 70 RRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGL 129
R +L+ LGY YGW LGRNIGPT+KAV G + LD L +RY PV+LIGWSLGGIFAR L
Sbjct 66 RTVLKGLGYKTYGWQLGRNIGPTSKAVHGTQARLDYLTNRYQQPVTLIGWSLGGIFARKL 125
Query 130 ARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTA 189
AR PSAVRQVITLGSP + ++R+ F+R +H H E +LPLE + PLPVP T+
Sbjct 126 ARRTPSAVRQVITLGSPIRLARHEQSRANRLFHRNSHEHIEPLDLPLERGAGPLPVPATS 185
Query 190 IYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRP 249
IYS+ DG++AW+ C++ PS RAENIAV +SH G NP +WA+ADRLAQP WAPFRP
Sbjct 186 IYSKLDGILAWRACLDEPSPRAENIAVLASHFGITGNPATLWAVADRLAQPPDRWAPFRP 245
Query 250 PKVLSPLFPRPDT 262
P +L +P P++
Sbjct 246 PALLRMAYPAPES 258
>gi|326381293|ref|ZP_08202987.1| hypothetical protein SCNU_00040 [Gordonia neofelifaecis NRRL
B-59395]
gi|326199540|gb|EGD56720.1| hypothetical protein SCNU_00040 [Gordonia neofelifaecis NRRL
B-59395]
Length=263
Score = 238 bits (606), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 118/238 (50%), Positives = 153/238 (65%), Gaps = 1/238 (0%)
Query 23 TDIPRAGVEYGQLLAVLPLQRMLPAG-DGHPVLVLPGLLAGDGSTWILRRILRRLGYAAY 81
TD+ RA E+G P+ P D PVLVLPG D +T LR L+ LGY Y
Sbjct 23 TDLGRAAWEFGAYACTFPVMSTAPVSPDCQPVLVLPGFTTSDRTTTPLRMTLKNLGYPTY 82
Query 82 GWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVI 141
GWGLG N+GP+ + + GMR LD + + PVS+IGWSLGGIFAR LAR P VRQVI
Sbjct 83 GWGLGVNVGPSDRILRGMRRKLDAIERLHGQPVSIIGWSLGGIFARELARQTPEMVRQVI 142
Query 142 TLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQ 201
TLGSPF M+ ++ + +++ LH + PLE ++ PL +P+TA+YSR DG+ AWQ
Sbjct 143 TLGSPFRMQRHAQSNARFAYRLAKPLHARMLDFPLEADAPPLEMPSTALYSRLDGIAAWQ 202
Query 202 TCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPR 259
C + PS+ +ENI V SH+G+GHN P VWA+ADRL+ P G PF PPK+L P FP+
Sbjct 203 VCRDDPSDLSENIEVLCSHLGFGHNLPAVWAVADRLSLPAGTLEPFVPPKMLRPFFPK 260
>gi|256375898|ref|YP_003099558.1| hypothetical protein Amir_1764 [Actinosynnema mirum DSM 43827]
gi|255920201|gb|ACU35712.1| conserved hypothetical protein [Actinosynnema mirum DSM 43827]
Length=284
Score = 224 bits (570), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 125/257 (49%), Positives = 161/257 (63%), Gaps = 2/257 (0%)
Query 3 LPVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAG 62
LP P APG YLT+ RA V+ GQ A L R P+GDGH V+VLPGL
Sbjct 27 LPEALPEPEAPHAPGLLWYLTEPTRAVVDLGQYAAARQLLRAAPSGDGHTVIVLPGLGGA 86
Query 63 DGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLG 122
DGST +LR+ L LG+ GWGLGRN+GP+A V G R LL+++ + VSL+GWSLG
Sbjct 87 DGSTAVLRKFLSGLGHDVRGWGLGRNLGPSAATVDGTRALLERVAAERGK-VSLVGWSLG 145
Query 123 GIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHEL-PLEMESE 181
G+FAR LAR+ P VRQVITLGSP+ +RD TR F + + +L P E E
Sbjct 146 GVFARELARERPELVRQVITLGSPYALRDARCTRVNPVFRLLSVFYEAVSDLPPPESERP 205
Query 182 PLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQ 241
+PVP T++YSR DG+V W+ C+ R E++ V SSH+GY HN V+W +ADRLAQP+
Sbjct 206 VMPVPATSVYSRSDGIVPWRACLEEEGRRRESVPVASSHLGYCHNTSVLWLVADRLAQPR 265
Query 242 GAWAPFRPPKVLSPLFP 258
G W F PP ++ +FP
Sbjct 266 GRWRRFAPPPGMARMFP 282
>gi|229488399|ref|ZP_04382265.1| pgap1 family protein [Rhodococcus erythropolis SK121]
gi|229323903|gb|EEN89658.1| pgap1 family protein [Rhodococcus erythropolis SK121]
Length=282
Score = 211 bits (538), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 119/248 (48%), Positives = 148/248 (60%), Gaps = 8/248 (3%)
Query 16 PGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRR 75
P + L++ R V+ LL P P GDGHPVLVLPGLL D ST LR L
Sbjct 37 PSLAMCLSEPTRGLVDIASLLLAAPWLLRSPRGDGHPVLVLPGLLTSDVSTLALRTYLSF 96
Query 76 LGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPS 135
LGY +GW LG N GPTA V G+ L ++ RY VS+IGWSLGGI+AR LARD P
Sbjct 97 LGYRVHGWNLGLNTGPTATVVDGLPAALAEVADRYEQKVSVIGWSLGGIYARKLARDLPD 156
Query 136 AVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELP----LEMES-EPLPVPTTAI 190
+VRQV+TLGSPFG+ +TR + YA L LP +E E P+ VP T++
Sbjct 157 SVRQVVTLGSPFGLTSLEQTRVG---SLYARLSGNHAILPPVDGIESEQGSPISVPATSV 213
Query 191 YSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPP 250
YSR DG+V WQ C + + +E+IAV+ SH+G HNP +W +ADRLAQ W PF P
Sbjct 214 YSRHDGIVPWQACCETSAGLSESIAVQGSHMGLTHNPSALWTVADRLAQDVDNWQPFAAP 273
Query 251 KVLSPLFP 258
K L +FP
Sbjct 274 KRLRRMFP 281
>gi|226308374|ref|YP_002768334.1| hypothetical protein RER_48870 [Rhodococcus erythropolis PR4]
gi|226187491|dbj|BAH35595.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=334
Score = 208 bits (530), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 118/248 (48%), Positives = 147/248 (60%), Gaps = 8/248 (3%)
Query 16 PGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRR 75
P + L++ R V+ LL P P GDGHPVLVLPGLL D ST LR L
Sbjct 89 PSLAMCLSEPTRGLVDIASLLLAAPWLLRSPRGDGHPVLVLPGLLTSDVSTLALRTYLSF 148
Query 76 LGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPS 135
LGY +GW LG N GPTA V G+ L ++ RY VS+IGWSLGGI+AR LARD P
Sbjct 149 LGYRVHGWNLGLNTGPTATVVDGLPAALAEVADRYEQKVSVIGWSLGGIYARKLARDLPD 208
Query 136 AVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELP----LEMES-EPLPVPTTAI 190
+VRQV+TLGSPF + +TR + YA L LP +E E P+ VP T++
Sbjct 209 SVRQVVTLGSPFALTSLEQTRVG---SLYARLSGNHAILPPVDGIESEQGSPISVPATSV 265
Query 191 YSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPP 250
YSR DG+V WQ C + + +E+IAV+ SH+G HNP +W +ADRLAQ W PF P
Sbjct 266 YSRHDGIVPWQACCETSAGLSESIAVQGSHMGLTHNPSALWTVADRLAQDVDNWQPFAAP 325
Query 251 KVLSPLFP 258
K L +FP
Sbjct 326 KRLRRMFP 333
>gi|304394658|ref|ZP_07376577.1| pgap1 family protein [Ahrensia sp. R2A130]
gi|303293319|gb|EFL87700.1| pgap1 family protein [Ahrensia sp. R2A130]
Length=269
Score = 208 bits (530), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 121/258 (47%), Positives = 152/258 (59%), Gaps = 7/258 (2%)
Query 5 VLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLP-LQRMLPAGDGHPVLVLPGLLAGD 63
VLEP+D P AP L L ++ RA E A +P L P GDG PVLVLPGL+ D
Sbjct 11 VLEPSDHP-KAPSRKLLLMEL-RAIPELAGFAAAVPGLLAATPRGDGQPVLVLPGLVTSD 68
Query 64 GSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGG 123
ST LR L GY+ GW GRN GP G++ L++L ++ VS++GWSLGG
Sbjct 69 RSTLSLRGFLSAKGYSVSGWEQGRNFGPLPGVEDGLKSQLERLAEEHNRKVSIVGWSLGG 128
Query 124 IFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHT--ERHELPLEMESE 181
I+AR +A+ P VRQVITLGSPF + +AW +YA H +R +
Sbjct 129 IYARQMAKMMPDLVRQVITLGSPF--KGDPRATNAWKLYQYASGHKVDDRDNHMGGTIAA 186
Query 182 PLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQ 241
P PVP+TAI+SR DG+ WQ CM PS+ ENI VRSSH G GH+P V+A+ADRLAQP+
Sbjct 187 PAPVPSTAIFSRSDGICHWQNCMEEPSDIHENIRVRSSHCGLGHHPAAVYAVADRLAQPE 246
Query 242 GAWAPFRPPKVLSPLFPR 259
G W PF V FP+
Sbjct 247 GGWKPFDRTGVKGFAFPK 264
>gi|338973791|ref|ZP_08629154.1| hypothetical protein CSIRO_2241 [Bradyrhizobiaceae bacterium
SG-6C]
gi|338233386|gb|EGP08513.1| hypothetical protein CSIRO_2241 [Bradyrhizobiaceae bacterium
SG-6C]
Length=257
Score = 206 bits (525), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 115/224 (52%), Positives = 142/224 (64%), Gaps = 5/224 (2%)
Query 27 RAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLG 86
RA E+G L LPL + P GDGHPVLVLPGL+ D +T LR L+ GYA GWGLG
Sbjct 21 RAINEFGAFLGALPLLSLAPKGDGHPVLVLPGLITSDAATRPLRSFLKGRGYAVSGWGLG 80
Query 87 RNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSP 146
RN GP A MR+L+ L+ + VSL+GWSLGGI+AR LA+ P VR VITLGSP
Sbjct 81 RNFGPRAGVEEAMRNLVKDLNETHGRKVSLVGWSLGGIYARQLAKMMPDRVRSVITLGSP 140
Query 147 FGM--RDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCM 204
FG R T R+ + + + + H L M P PVPTTAI+SR DG+ AWQ+C+
Sbjct 141 FGGHPRATNAWRTYEAVSGQSAEDYDTH-LGGHMSKTP-PVPTTAIFSRTDGICAWQSCI 198
Query 205 NSPSERAENIAVR-SSHIGYGHNPPVVWAIADRLAQPQGAWAPF 247
P AENI V +SH G GH+P +V+A+ADRLAQ +G W PF
Sbjct 199 EQPGTYAENIEVNGASHCGMGHHPAIVYAVADRLAQAEGEWKPF 242
>gi|257093207|ref|YP_003166848.1| hypothetical protein CAP2UW1_1605 [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
gi|257045731|gb|ACV34919.1| conserved hypothetical protein [Candidatus Accumulibacter phosphatis
clade IIA str. UW-1]
Length=263
Score = 204 bits (519), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 118/250 (48%), Positives = 149/250 (60%), Gaps = 3/250 (1%)
Query 15 APGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILR 74
+PGW ++ RAG EYG LA PL + P GDGHPVLV PGL+ GD ST LR L
Sbjct 15 SPGWVRLALEM-RAGWEYGASLAATPLLSLAPRGDGHPVLVFPGLITGDLSTLPLRNYLS 73
Query 75 RLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHP 134
GYA Y WGLG N GP A + + LDKL + +SLIGWSLGG++AR LA+ P
Sbjct 74 SRGYATYPWGLGINRGPRAGVIDACLERLDKLSQEHGRSLSLIGWSLGGLYARELAKARP 133
Query 135 SAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRC 194
VRQVIT+G+PF + +AW +A H E P PVPTT+I+SR
Sbjct 134 DVVRQVITMGTPF--TGHPKATNAWRIYEWATGHKIGAPDIHEPLRSPPPVPTTSIFSRS 191
Query 195 DGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLS 254
DG+VAWQ + S +NI V++SH+G G NP ++A+ADRLAQ +G W PF + S
Sbjct 192 DGVVAWQCSLERESPHTDNIEVQASHLGMGLNPLTLYALADRLAQAEGDWRPFDRSGLRS 251
Query 255 PLFPRPDTPA 264
L+P P PA
Sbjct 252 YLYPDPRRPA 261
>gi|239817758|ref|YP_002946668.1| hypothetical protein Vapar_4797 [Variovorax paradoxus S110]
gi|239804335|gb|ACS21402.1| conserved hypothetical protein [Variovorax paradoxus S110]
Length=257
Score = 201 bits (511), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 116/237 (49%), Positives = 151/237 (64%), Gaps = 8/237 (3%)
Query 31 EYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLGRNIG 90
E G +A+ PL ++ P GDGHPVLVLPGL+AGDGST +LRR L GY A+GWG GRN G
Sbjct 26 ETGAGIAMWPLLQLAPRGDGHPVLVLPGLVAGDGSTLVLRRYLCSRGYDAHGWGQGRNFG 85
Query 91 PTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMR 150
P GM LL L + VS+IGWSLGG++AR LA P+ VR VITLGSPF
Sbjct 86 PREGVEDGMLALLKSLAEKSGQKVSVIGWSLGGVYARLLASAQPALVRNVITLGSPF--- 142
Query 151 DTCETRSAWSFNRYAHLHTERHELPLEME-SEPL-PVPTTAIYSRCDGMVAWQTCMNSPS 208
+ R+ ++ Y + + P M+ +P PVPTT+I+SR DG+VAW+ + P
Sbjct 143 -SGSPRATNAWRVYEGVSGQSSHDPRRMKFVQPTPPVPTTSIFSRTDGVVAWRCSLEKPG 201
Query 209 ERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPL-FPRPDTPA 264
+AENI V +SH+G G +P V++A+ADRLAQP+G W PF +L PL +P P A
Sbjct 202 PQAENIEVVASHLGLGAHPAVLYALADRLAQPEGEWKPFN-RGLLGPLVYPDPSRKA 257
>gi|343918945|gb|EGV29702.1| hypothetical protein ThidrDRAFT_3146 [Thiorhodococcus drewsii
AZ1]
Length=263
Score = 200 bits (509), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 117/235 (50%), Positives = 141/235 (60%), Gaps = 4/235 (1%)
Query 27 RAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLG 86
RA E+G LLA PL M P GDGHPVLVLP LL D ST LR L ++GY A+ W LG
Sbjct 30 RASWEFGALLATQPLLTMAPHGDGHPVLVLPRLLGCDFSTQPLRSFLSQMGYEAHPWELG 89
Query 87 RNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSP 146
N+GP A +S LD L RY VSLIGWSLGG++AR LA+ P VRQVITLGSP
Sbjct 90 VNMGPRAGVMSACLRRLDTLEKRYGRKVSLIGWSLGGLYARELAKLAPDQVRQVITLGSP 149
Query 147 F-GMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMN 205
F G E +A+ H+ ++ PLE P PVPTT+IYSR DG+V W +
Sbjct 150 FAGHPSPTEIWNAYEDLTGDHIGLPKNSGPLET---PPPVPTTSIYSRTDGIVPWNSSQT 206
Query 206 SPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPRP 260
AENI V SSH+G NP V++A+ADRLAQ +G W PF + +P P
Sbjct 207 HQGPAAENIEVESSHLGLAVNPTVLYAVADRLAQSEGDWKPFERSGLRELFYPDP 261
>gi|319796087|ref|YP_004157727.1| hypothetical protein Varpa_5461 [Variovorax paradoxus EPS]
gi|315598550|gb|ADU39616.1| hypothetical protein Varpa_5461 [Variovorax paradoxus EPS]
Length=257
Score = 198 bits (504), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 112/234 (48%), Positives = 151/234 (65%), Gaps = 8/234 (3%)
Query 31 EYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLGRNIG 90
E G +A+ PL ++ P GDGHPVLVLPGL+A D ST +LRR L GY A+GWGLGRN+G
Sbjct 26 ETGAGIAMWPLLQLTPRGDGHPVLVLPGLVASDVSTLLLRRYLASRGYDAHGWGLGRNLG 85
Query 91 PTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMR 150
P GM +LL L+ + VS+IGWSLGG++AR LA H +R VITLGSPF
Sbjct 86 PREGVEDGMVELLKTLNDKSGQKVSVIGWSLGGVYARLLASAHSGLIRNVITLGSPF--- 142
Query 151 DTCETRSAWSFNRYAHLHTERHELPLEME-SEPL-PVPTTAIYSRCDGMVAWQTCMNSPS 208
+ R+ ++ Y + + P M+ +P PVPTT+I+SR DG+VAW+ +
Sbjct 143 -SGSPRATNAWRVYEGVSGQSSHDPRRMKFVQPTPPVPTTSIFSRTDGVVAWRCSIEKTG 201
Query 209 ERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPL-FPRPD 261
++ENI V +SH+G G +P V++A+ADRLAQP+G W PF +L PL +P PD
Sbjct 202 PQSENIEVMASHLGLGAHPAVLYAVADRLAQPEGEWKPFN-RGLLGPLVYPDPD 254
>gi|260221017|emb|CBA29162.1| hypothetical protein Csp_A10760 [Curvibacter putative symbiont
of Hydra magnipapillata]
Length=271
Score = 196 bits (497), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 120/266 (46%), Positives = 151/266 (57%), Gaps = 16/266 (6%)
Query 4 PVLEPADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGD 63
P+ EP P APGW L ++ RA E +L P+ PAGDGHPVLV PGL A D
Sbjct 13 PLSEPTPHPA-APGWHLIALEL-RAPWELWSVLPSWPVLSKAPAGDGHPVLVFPGLTASD 70
Query 64 GSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGG 123
GST LR L+ LGY GW G N GP A + R + +L VSL+GWSLGG
Sbjct 71 GSTLPLRAYLKNLGYDVSGWNQGYNFGPRAGVLETARQQILELAQSTGRKVSLVGWSLGG 130
Query 124 IFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPL 183
I+AR LA++ P VR VITLG+PFG T + +AW T H++ +E L
Sbjct 131 IYARELAKELPDQVRAVITLGTPFGGSHT--STNAWKLYEL----TAGHKITDAIEQFDL 184
Query 184 ----PVPTTAIYSRCDGMVAWQTCMNSPSER---AENIAVRSSHIGYGHNPPVVWAIADR 236
PVPTT++YSR DG+VAWQ + + S + EN+ V +SHIG G NP W +ADR
Sbjct 185 AGAPPVPTTSVYSRSDGVVAWQASLQAKSRKQPHTENVEVFASHIGLGLNPSAWWVVADR 244
Query 237 LAQPQGAWAPFRPPKVLSP-LFPRPD 261
LAQ +G W F+P L+ LFP P
Sbjct 245 LAQAEGKWQAFQPGSSLARLLFPDPQ 270
>gi|90423165|ref|YP_531535.1| hypothetical protein RPC_1654 [Rhodopseudomonas palustris BisB18]
gi|90105179|gb|ABD87216.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18]
Length=262
Score = 194 bits (494), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 115/228 (51%), Positives = 139/228 (61%), Gaps = 11/228 (4%)
Query 40 PLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGM 99
PL P GDGHPVLVLPGLLA D ST +LRR L+ LGY ++ WGLGRNIG V GM
Sbjct 37 PLLMQAPKGDGHPVLVLPGLLASDLSTALLRRFLKHLGYHSFAWGLGRNIG----GVYGM 92
Query 100 RDLLDKLHSRYHT----PVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMRDTCET 155
R LD+ R H VSL+GWSLGG++AR LA P VR VITLGSPF RD T
Sbjct 93 RAKLDERLRRIHDLTGRKVSLVGWSLGGVYARDLALHRPELVRNVITLGSPFA-RDLTAT 151
Query 156 RSAWSFNRYAHLHTER-HELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENI 214
W + R + + L+ + LPVPTT+IYSR DG+V W+T + PS AENI
Sbjct 152 NGRWVYERLSGESLDNVAAADLQALAGALPVPTTSIYSRGDGIVNWRTSVLQPSATAENI 211
Query 215 AV-RSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPRPD 261
V +SHIG N V+WA+ADRLAQP+G + PF + + RP
Sbjct 212 EVCLASHIGLTVNAAVLWAVADRLAQPEGTFRPFERGGPFAIAYARPQ 259
>gi|85373326|ref|YP_457388.1| hypothetical protein ELI_02495 [Erythrobacter litoralis HTCC2594]
gi|84786409|gb|ABC62591.1| hypothetical protein ELI_02495 [Erythrobacter litoralis HTCC2594]
Length=267
Score = 194 bits (493), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 115/257 (45%), Positives = 147/257 (58%), Gaps = 13/257 (5%)
Query 8 PADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTW 67
P RP P L L + RA E A+ P + +LP GDGH VLVLPG +A D ST
Sbjct 14 PEARP---PSRLLALAEPGRAMGELAAFYALTPFRSLLPRGDGHGVLVLPGFMASDYSTR 70
Query 68 ILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFAR 127
LRR+L LGY A GW LGRN+ V M +++LH R VS++GWSLGG+FAR
Sbjct 71 PLRRLLTGLGYDAVGWNLGRNVRVDNSRVEAMAGCVEELHERSGGKVSIVGWSLGGVFAR 130
Query 128 GLARDHPSAVRQVITLGSPFG-MRDTCETRSAWSFNRYAHLHTER----HELPLEMESEP 182
LA+ P VR VI+LGSP R+ R + F L+ E + + +E
Sbjct 131 ELAKMMPEKVRFVISLGSPISDDRNHTNARRLFEF-----LNGESPEPLRQGKFQNLAEA 185
Query 183 LPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQG 242
PVPTT+I ++ DG+V W+ + + SE+ ENI V +SH G G NP V +AIADRLAQ +G
Sbjct 186 PPVPTTSILTKTDGVVHWRGSVQAESEQTENIEVYASHCGMGANPSVAYAIADRLAQAEG 245
Query 243 AWAPFRPPKVLSPLFPR 259
W PFR V S FPR
Sbjct 246 QWKPFRAEGVYSLAFPR 262
>gi|154252911|ref|YP_001413735.1| hypothetical protein Plav_2469 [Parvibaculum lavamentivorans
DS-1]
gi|154156861|gb|ABS64078.1| conserved hypothetical protein [Parvibaculum lavamentivorans
DS-1]
Length=248
Score = 193 bits (491), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 109/238 (46%), Positives = 140/238 (59%), Gaps = 15/238 (6%)
Query 10 DRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWIL 69
D PGW L ++ R E G L+ LP P GDGH VLVLPG+L GD ST+I+
Sbjct 15 DDEMTEPGWLSRLGEL-RIFAELGTLVPALPALLAAPRGDGHAVLVLPGVLTGDESTFII 73
Query 70 RRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGL 129
RR L LGY + W G N GP+ + +R L +L +RY +S++GWSLGGIFAR L
Sbjct 74 RRYLDELGYVTHPWKQGHNWGPSRELHERLRARLQELAARYERRISIVGWSLGGIFAREL 133
Query 130 ARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTA 189
AR+ P+ VRQV+TLGSPFG + + + P PVP T+
Sbjct 134 AREFPALVRQVVTLGSPFGSDYSIDGNRRPDAAARRRI--------------PPPVPCTS 179
Query 190 IYSRCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPF 247
IYSR DG+V+W+ C + ENI V ++HIG G NP V+WAIADRLAQP+G W+PF
Sbjct 180 IYSRSDGIVSWEACREMDAPETENIEVSATHIGMGFNPLVLWAIADRLAQPEGEWSPF 237
>gi|27378000|ref|NP_769529.1| hypothetical protein blr2889 [Bradyrhizobium japonicum USDA 110]
gi|27351146|dbj|BAC48154.1| blr2889 [Bradyrhizobium japonicum USDA 110]
Length=287
Score = 192 bits (488), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 111/237 (47%), Positives = 136/237 (58%), Gaps = 6/237 (2%)
Query 27 RAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLG 86
RA E+G L LPL + P GDGHPVLVLPGL+A D ST LR L GYA GW G
Sbjct 52 RAIHEFGAFLGALPLLSLAPRGDGHPVLVLPGLVASDASTRALRTFLSGKGYAVSGWRQG 111
Query 87 RNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSP 146
RN G M DL+ +L + +SL+GWSLGG++AR LA+ P VRQVITLGSP
Sbjct 112 RNYGLRPGVQHAMVDLVQELSDTHGRKISLVGWSLGGLYARQLAKMMPERVRQVITLGSP 171
Query 147 FGMRDTCETRSAWSFNRYAHLHTERHELPL---EMESEPLPVPTTAIYSRCDGMVAWQTC 203
F + +AW +A P E+ P PVPTTAI+SR DG+ AWQ C
Sbjct 172 FA--GDPRSTNAWRVYEWASGQKADQVDPRFGGELAVPP-PVPTTAIFSRTDGVCAWQGC 228
Query 204 MNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPRP 260
M + E+I + SSH G GH+P V+A+ADRLAQ +G W PF S +P P
Sbjct 229 MEKSGAQTESIEIESSHCGMGHHPAAVYAVADRLAQKEGQWKPFDRSGWRSLAYPDP 285
>gi|192292659|ref|YP_001993264.1| PGAP1 family protein [Rhodopseudomonas palustris TIE-1]
gi|192286408|gb|ACF02789.1| PGAP1 family protein [Rhodopseudomonas palustris TIE-1]
Length=263
Score = 188 bits (478), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 105/204 (52%), Positives = 131/204 (65%), Gaps = 3/204 (1%)
Query 46 PAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDK 105
P GDGHPVLVLPGLLA D ST +RR L++LGY + W LGRN+G + + +RD L
Sbjct 46 PRGDGHPVLVLPGLLASDLSTAPMRRYLKQLGYQVFAWELGRNLGGIYRMRARLRDRLAA 105
Query 106 LHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYA 165
+H VSL+GWSLGG++AR LA P VR +ITLGSPF D T + + + +
Sbjct 106 VHETTGRKVSLVGWSLGGVYARDLALHAPDMVRDIITLGSPF-TGDVTATNAKRIYEKLS 164
Query 166 HLH-TERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENI-AVRSSHIGY 223
TE H LE +PVPTT+IYSR DG+V W+T +PS RAENI V +SHIG
Sbjct 165 GEELTEVHLEDLEPLGGEMPVPTTSIYSRTDGIVNWRTSQLAPSPRAENIEVVLASHIGL 224
Query 224 GHNPPVVWAIADRLAQPQGAWAPF 247
N V+WAIADRLAQP+G + PF
Sbjct 225 IVNAAVLWAIADRLAQPEGVFTPF 248
>gi|338974237|ref|ZP_08629599.1| hypothetical protein CSIRO_2690 [Bradyrhizobiaceae bacterium
SG-6C]
gi|338232964|gb|EGP08092.1| hypothetical protein CSIRO_2690 [Bradyrhizobiaceae bacterium
SG-6C]
Length=253
Score = 188 bits (477), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 102/229 (45%), Positives = 129/229 (57%), Gaps = 1/229 (0%)
Query 31 EYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLGRNIG 90
+ L+A P P G HPV+VLPGL A D ST+ +R L LGY GWG GRNI
Sbjct 23 DIAGLMAAAPFLATAPRGARHPVMVLPGLGANDNSTFAIRGFLGMLGYDVRGWGRGRNIR 82
Query 91 PTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMR 150
+ + L VSLIGWSLGGI AR +AR P VR V+TLGSPF
Sbjct 83 LPQLEAPAVAQTVRDLSRNTGQRVSLIGWSLGGILAREVARRSPDHVRLVVTLGSPFAAP 142
Query 151 DTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMNSPSER 210
+ R+ W T E+ S PLP+P TAIY+R DG+VAWQ C+
Sbjct 143 NANNLRTVWRLLTGQPSSTVTASRIAEL-SRPLPMPATAIYTRSDGIVAWQACLEQEHPT 201
Query 211 AENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFPR 259
EN+ VR++H+G G + P +W IADRLAQP+G W PF+P ++SP FP+
Sbjct 202 TENVEVRTTHLGLGFHAPALWVIADRLAQPEGQWKPFKPSLLVSPFFPQ 250
>gi|39936833|ref|NP_949109.1| hypothetical protein RPA3772 [Rhodopseudomonas palustris CGA009]
gi|39650690|emb|CAE29213.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009]
Length=263
Score = 188 bits (477), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 110/223 (50%), Positives = 139/223 (63%), Gaps = 3/223 (1%)
Query 27 RAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLG 86
R+ E+ L + PL P GDGHPVLVLPGLLA D ST +RR L++LGY + W LG
Sbjct 27 RSFFEFNASLLLSPLLLQAPRGDGHPVLVLPGLLASDLSTAPMRRYLKQLGYQVFAWELG 86
Query 87 RNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSP 146
RN+G + + +RD L +H VSL+GWSLGG++AR LA P VR +ITLGSP
Sbjct 87 RNLGGIYRMRARLRDRLAAVHETTGRKVSLVGWSLGGVYARDLALHAPDMVRDIITLGSP 146
Query 147 FGMRDTCETRSAWSFNRYAHLH-TERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMN 205
F D T + + + + TE H LE +PVPTT+IYSR DG+V W+T
Sbjct 147 F-TGDVTATNAKRIYEKLSGEELTEVHLEDLEPLGGEMPVPTTSIYSRTDGIVNWRTSQL 205
Query 206 SPSERAENI-AVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPF 247
+PS RAENI V +SHIG N V+WAIADRLAQP+G + PF
Sbjct 206 APSPRAENIEVVLASHIGLIVNAAVLWAIADRLAQPEGVFKPF 248
>gi|27377990|ref|NP_769519.1| hypothetical protein blr2879 [Bradyrhizobium japonicum USDA 110]
gi|27351136|dbj|BAC48144.1| blr2879 [Bradyrhizobium japonicum USDA 110]
Length=266
Score = 186 bits (473), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 107/225 (48%), Positives = 134/225 (60%), Gaps = 8/225 (3%)
Query 27 RAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLG 86
R E L + PL P GDGHPVL LPG LA D S +RR L LGY A+ W +G
Sbjct 27 RGLFELNASLLLSPLLMRAPRGDGHPVLTLPGFLASDLSMAPMRRYLSELGYEAHAWRMG 86
Query 87 RNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSP 146
RN+G + +R L ++H+ VSL+GWSLGG++AR LA P VR VITLGSP
Sbjct 87 RNLGGLGRMREALRTRLAEIHAARGRKVSLVGWSLGGVYARDLALQAPDMVRYVITLGSP 146
Query 147 FGMRDTCETRSAWSFNRYAHLHTERHELPLEMESE---PLPVPTTAIYSRCDGMVAWQTC 203
F + R+ + Y L ER E E+ LPVP T+IYSR DG+V W+TC
Sbjct 147 F----ANDVRATNATRLYEALSGERVEDFAELREAIAGDLPVPATSIYSRADGVVNWRTC 202
Query 204 MNSPSERAENIAVR-SSHIGYGHNPPVVWAIADRLAQPQGAWAPF 247
+ PS+ AENI V +SHIG G NP +WA+ADRLAQP+G + PF
Sbjct 203 LLRPSDHAENIEVHLASHIGLGVNPAALWAVADRLAQPEGEFWPF 247
>gi|338973781|ref|ZP_08629144.1| hypothetical protein CSIRO_2231 [Bradyrhizobiaceae bacterium
SG-6C]
gi|338233376|gb|EGP08503.1| hypothetical protein CSIRO_2231 [Bradyrhizobiaceae bacterium
SG-6C]
Length=264
Score = 186 bits (472), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 113/252 (45%), Positives = 145/252 (58%), Gaps = 12/252 (4%)
Query 16 PGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRR 75
P L+L + R E L + P M P GDGHPVLVLPGLLA D ST ILRR L
Sbjct 17 PNLGLFLAE-GRGVFELNATLLMAPALLMAPRGDGHPVLVLPGLLASDVSTLILRRYLDL 75
Query 76 LGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHT----PVSLIGWSLGGIFARGLAR 131
LG++ + WG GRN G V MRD L KL + H VSL+GWSLGG++AR LA
Sbjct 76 LGFSTHPWGFGRNTG----GVYSMRDKLAKLLTSVHNTTGRKVSLVGWSLGGVYARDLAL 131
Query 132 DHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHEL-PLEMESEPLPVPTTAI 190
P VR V+TLGSPF D T + + + +L + + LPVPT+++
Sbjct 132 QMPEMVRYVVTLGSPFA-GDISATNARAIYEMLSGEKIADADLRDIRAIAGDLPVPTSSL 190
Query 191 YSRCDGMVAWQTCMNSPSERAENIAVR-SSHIGYGHNPPVVWAIADRLAQPQGAWAPFRP 249
Y+R DG+V W+TC+N S+ AENI V +SHIG G N +WA+ADRLAQ +G + PF
Sbjct 191 YTRTDGVVNWRTCLNRVSDTAENIEVTLASHIGIGVNAAALWAVADRLAQREGEFQPFDR 250
Query 250 PKVLSPLFPRPD 261
S + RP+
Sbjct 251 AGPFSLAYARPE 262
>gi|152982831|ref|YP_001353694.1| hypothetical protein mma_2004 [Janthinobacterium sp. Marseille]
gi|151282908|gb|ABR91318.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille]
Length=266
Score = 186 bits (471), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 101/221 (46%), Positives = 133/221 (61%), Gaps = 2/221 (0%)
Query 27 RAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLG 86
RA E G L P+ + +PAGDGHPV+VLPGLLAGD T+ LR+ L GY AY W G
Sbjct 29 RAPWELGAALLAAPMLKDVPAGDGHPVMVLPGLLAGDALTFFLRKYLGNCGYEAYAWKQG 88
Query 87 RNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSP 146
N+GP + + +L ++ VSLIGWSLGGI+AR +A+ P VR VITLGSP
Sbjct 89 LNLGPREGLLERCIARVRELSEKHGQKVSLIGWSLGGIYAREIAKALPEHVRCVITLGSP 148
Query 147 FGMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMNS 206
F T +AW + E+ + + PVPTT+I+SR DG+V+WQ C+
Sbjct 149 FTGHPT--ATNAWRLYQLVSGKPAIDEVQIAELKKTPPVPTTSIFSRTDGIVSWQCCVEQ 206
Query 207 PSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPF 247
++ +ENI V SH G NP V++A+ADRLAQP+G W F
Sbjct 207 ETDHSENIEVHGSHTGMVANPTVLYALADRLAQPEGQWQRF 247
>gi|316932943|ref|YP_004107925.1| hypothetical protein Rpdx1_1572 [Rhodopseudomonas palustris DX-1]
gi|315600657|gb|ADU43192.1| hypothetical protein Rpdx1_1572 [Rhodopseudomonas palustris DX-1]
Length=263
Score = 185 bits (470), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 109/207 (53%), Positives = 133/207 (65%), Gaps = 9/207 (4%)
Query 46 PAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDK 105
P GDGHPVLVLPGLLA D ST LRR LR LGY + W LGRN+G + + +R L
Sbjct 46 PRGDGHPVLVLPGLLASDLSTAPLRRYLRLLGYQVFAWELGRNLGGIYRMRARLRSRLAA 105
Query 106 LHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYA 165
+H VSL+GWSLGG++AR LA P VR +ITLGSPF D T + + + +
Sbjct 106 VHEATGRKVSLVGWSLGGVYARDLALHAPGMVRDIITLGSPF-TGDVTATNARRIYEKLS 164
Query 166 HLHTERHELPLEMESEPL----PVPTTAIYSRCDGMVAWQTCMNSPSERAENI-AVRSSH 220
E E+ LE + EPL PVP T+IYSR DG+V W+T +PS RAENI V +SH
Sbjct 165 --GEELSEVQLE-DLEPLGGEMPVPATSIYSRTDGIVNWRTSHLTPSPRAENIEVVLASH 221
Query 221 IGYGHNPPVVWAIADRLAQPQGAWAPF 247
IG NP V+WAIADRLAQP+GA+ PF
Sbjct 222 IGLVVNPAVLWAIADRLAQPEGAFTPF 248
>gi|149185953|ref|ZP_01864268.1| hypothetical protein ED21_24506 [Erythrobacter sp. SD-21]
gi|148830514|gb|EDL48950.1| hypothetical protein ED21_24506 [Erythrobacter sp. SD-21]
Length=245
Score = 185 bits (469), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 110/242 (46%), Positives = 138/242 (58%), Gaps = 3/242 (1%)
Query 20 LYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYA 79
+ L + RA E A+ PL LP GDGH VLVLPG +A D ST LRR+L LGY
Sbjct 1 MTLAEPGRAFGELASFYALRPLLGQLPRGDGHGVLVLPGFMASDYSTSPLRRLLADLGYD 60
Query 80 AYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQ 139
A GW LGRN+ + M ++ LH R P+S++GWSLGG+FAR LA+ P VR
Sbjct 61 AVGWKLGRNVKVDNARIEAMMACVEDLHDRTGRPISIVGWSLGGVFARELAKMAPEKVRL 120
Query 140 VITLGSPFGMRDTCETRSAWSFNRYAHLHTE-RHELPLEMESEPLPVPTTAIYSRCDGMV 198
VI+LGSP D T +A F E + + E PVPTT+I +R DG+V
Sbjct 121 VISLGSPIS-DDRGHTNAARLFEMLNGKEPEPLRDGGFQGLGEAPPVPTTSILTRTDGVV 179
Query 199 AWQTCMN-SPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLF 257
W+ + E ENI V +SH G G NP VV+A+ADRLAQ +GAW PFR + S F
Sbjct 180 HWRGSVQCGDREDCENIEVVASHCGLGVNPAVVYAVADRLAQDEGAWKPFRAQGLASLFF 239
Query 258 PR 259
PR
Sbjct 240 PR 241
>gi|124006675|ref|ZP_01691507.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123987830|gb|EAY27521.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length=267
Score = 184 bits (466), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 105/238 (45%), Positives = 135/238 (57%), Gaps = 6/238 (2%)
Query 16 PGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRR 75
P L LT++ RA G LP + +P GD HPVLVLPG + D +T LR L+
Sbjct 18 PSKLLLLTELGRASFGLGAYFMSLPWLQFMPKGDEHPVLVLPGFMTTDTTTAPLRFYLKS 77
Query 76 LGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPS 135
Y Y W +GRN+ + + D L +L + VS++GWSLGG++AR +AR HP
Sbjct 78 RNYTPYRWKMGRNLANFHEIEEKIYDRLLELKDIHGRKVSIVGWSLGGVYAREIARRHPD 137
Query 136 AVRQVITLGSPFGMRDTCETRSAWSFNRYAHLH-TERHELPLEMES---EPLPVPTTAIY 191
AVRQVITLGSPFG T E W + +E +P E+ + PVPTTAIY
Sbjct 138 AVRQVITLGSPFG-GITGENNIEWIYEMVTGRKVSEVDHIPEEIVQNIPKAPPVPTTAIY 196
Query 192 SRCDGMVAWQTCMNSPSE-RAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFR 248
S+ DG+VAWQ CM EN+ V SHIG GHNP V+ IA+RL Q +G W PF+
Sbjct 197 SKADGVVAWQHCMEKKEGPITENVQVTGSHIGLGHNPAVLACIAERLNQREGEWIPFK 254
>gi|284989856|ref|YP_003408410.1| hypothetical protein Gobs_1301 [Geodermatophilus obscurus DSM
43160]
gi|284063101|gb|ADB74039.1| conserved hypothetical protein [Geodermatophilus obscurus DSM
43160]
Length=254
Score = 182 bits (461), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 111/247 (45%), Positives = 145/247 (59%), Gaps = 3/247 (1%)
Query 15 APGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILR 74
+P LT+ PRAG++ L A PL GDGHPVLVLPGL+ GD +T +LR LR
Sbjct 7 SPSRTALLTEPPRAGLDVAALAAAWPLLAAARRGDGHPVLVLPGLMTGDPATVVLRTALR 66
Query 75 RLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHP 134
LG+ GW LG N GPT + V +R +++LH VSL+GWSLGG++A+ LAR P
Sbjct 67 ALGHDVSGWSLGINRGPTGRVVDTLRARVEQLHRTSGRRVSLVGWSLGGLYAQELARAAP 126
Query 135 SAVRQVITLGSPFGMRDTCETRSAWSF-NRYAHLHTERHELPLE-MESEPLPVPTTAIYS 192
+VR ++TLG+P +R R+A + L LP E L VP T++Y+
Sbjct 127 GSVRGLVTLGTPV-VRSAPWVRTASGIVDGGTRLLRGAAALPRPWAERGSLRVPATSVYT 185
Query 193 RCDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKV 252
R DG+V W +C R EN+ VR SH+G NP V+W +ADRL +G W PFRPP
Sbjct 186 RADGIVHWSSCRYEVRPRRENVEVRGSHLGLACNPAVLWLLADRLGMAEGTWTPFRPPPG 245
Query 253 LSPLFPR 259
LS LFPR
Sbjct 246 LSLLFPR 252
>gi|121604610|ref|YP_981939.1| hypothetical protein Pnap_1705 [Polaromonas naphthalenivorans
CJ2]
gi|120593579|gb|ABM37018.1| conserved hypothetical protein [Polaromonas naphthalenivorans
CJ2]
Length=284
Score = 181 bits (460), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 115/255 (46%), Positives = 145/255 (57%), Gaps = 12/255 (4%)
Query 17 GWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRL 76
W L L RA E+ LL PL P GD HPV+V PGL A D ST LRR L+ L
Sbjct 35 AWLLALEV--RALWEFSALLPAWPLLNRAPRGDNHPVVVFPGLSANDLSTAPLRRYLQLL 92
Query 77 GYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSA 136
++A GW G N GP + +D L + VSLIGWSLGGI+AR LA++ P
Sbjct 93 KHSACGWDQGFNFGPRPGVLDEAKDQLVRTCESTGRKVSLIGWSLGGIYARELAKEVPQM 152
Query 137 VRQVITLGSPFGMRDTCETRSAWSFNRYAHLHT-ERHELPLEMESEPLPVPTTAIYSRCD 195
VR VITLG+PF + ++ AW A + ER ++ + P PVPTT+IYSR D
Sbjct 153 VRSVITLGTPFA--GSHKSTHAWRLYELASGRSVEREAAGYDLPTAP-PVPTTSIYSRTD 209
Query 196 GMVAWQTCMNSPSER---AENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPP-- 250
G+VAWQ + SPS++ ENI V +SH+G G NP WAIADRLA P+G W PF
Sbjct 210 GVVAWQGSIQSPSDKNPWTENIEVVASHVGLGFNPSAWWAIADRLALPEGEWKPFLRETR 269
Query 251 -KVLSPLFPRPDTPA 264
+V ++P P PA
Sbjct 270 GRVHELIYPDPTRPA 284
>gi|254514148|ref|ZP_05126209.1| pgap1 family protein [gamma proteobacterium NOR5-3]
gi|219676391|gb|EED32756.1| pgap1 family protein [gamma proteobacterium NOR5-3]
Length=274
Score = 179 bits (455), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 111/249 (45%), Positives = 135/249 (55%), Gaps = 9/249 (3%)
Query 16 PGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRR 75
P +L LT+ R +E L ++ L L GDGHPV+VLPG L D LRR LR
Sbjct 28 PAAWLALTEPQRVVLEVASLASLRRLLDNLKPGDGHPVMVLPGFLGSDAYNASLRRFLRG 87
Query 76 LGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPS 135
LGY +GWG GRN+GP A+ + L RY P+SL+G SLGGIFAR LAR+ PS
Sbjct 88 LGYKVHGWGQGRNLGPRGNALESLMARAAMLAERYGEPLSLVGHSLGGIFARELAREDPS 147
Query 136 AVRQVITLGSPFGMRDTCETRSAWSFNRYAHLHTERHELPLEMES--EPLPVPTTAIYSR 193
VRQVITLGSPFG + A F +LP+ ++ PVPTTAIYS+
Sbjct 148 LVRQVITLGSPFGRGRHSASYPARLFEAL----NPTDDLPVALDDLHRAPPVPTTAIYSK 203
Query 194 CDGMVAWQTCMNS---PSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPP 250
DG+V W+T + +NI VR SH G NP V + IADRL Q W PF
Sbjct 204 GDGIVNWRTAFQNLDFAHASTQNIQVRGSHCGMTLNPAVWYVIADRLRQSMDRWEPFSVS 263
Query 251 KVLSPLFPR 259
V L PR
Sbjct 264 GVAKVLVPR 272
>gi|91788171|ref|YP_549123.1| hypothetical protein Bpro_2302 [Polaromonas sp. JS666]
gi|91697396|gb|ABE44225.1| conserved hypothetical protein [Polaromonas sp. JS666]
Length=282
Score = 178 bits (451), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 106/225 (48%), Positives = 132/225 (59%), Gaps = 7/225 (3%)
Query 27 RAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLG 86
RA E+G LL PL P GDGH V+V PGL A D ST LR L+ L Y A+GW G
Sbjct 38 RAFWEFGALLPSWPLLARAPKGDGHTVMVFPGLSANDVSTVPLRHYLQSLSYKAWGWEQG 97
Query 87 RNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSP 146
N+GP + R L + VSLIGWSLGG++AR LA++ P VR VITLG+P
Sbjct 98 FNLGPRTGVIDEARARLTRTFETNGRKVSLIGWSLGGVYARELAKELPHMVRCVITLGTP 157
Query 147 FGMRDTCETRSAWSFNRYAH-LHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMN 205
F + ++ +AW A + ER ++ + P PVPT++IYSR DG+VAWQ +
Sbjct 158 FSA--SHKSTNAWRIYELASGRNIEREAENYDLPAAP-PVPTSSIYSRTDGIVAWQGSIQ 214
Query 206 SPSER---AENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPF 247
SP ENI V +SHIG G NP WAIADRLAQ +G W PF
Sbjct 215 SPCTNNPHTENIEVVASHIGLGLNPSAWWAIADRLAQAEGQWHPF 259
>gi|86750756|ref|YP_487252.1| hypothetical protein RPB_3646 [Rhodopseudomonas palustris HaA2]
gi|86573784|gb|ABD08341.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2]
Length=265
Score = 178 bits (451), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 106/223 (48%), Positives = 134/223 (61%), Gaps = 3/223 (1%)
Query 27 RAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLG 86
R+ +E+ + + PL P GDGHPVLVLPGLLA D ST LRR LR LGY + W LG
Sbjct 26 RSLLEFNASILLSPLLLQAPKGDGHPVLVLPGLLASDLSTAPLRRYLRALGYQPFAWELG 85
Query 87 RNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSP 146
RN G + +R L +H VS++GWSLGG++AR LA P +R ++TLGSP
Sbjct 86 RNFGGVYRMRDRLRRRLTTIHEASGRKVSVVGWSLGGVYARDLALHAPQMIRGIVTLGSP 145
Query 147 F-GMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQTCMN 205
F G R + L R + L+ + +PVP T+IYSR DG+V W+T
Sbjct 146 FSGDITATNARRVYEKLSGEDLDEIRPD-DLQALTSDMPVPATSIYSRTDGIVNWRTSRL 204
Query 206 SPSERAENIAV-RSSHIGYGHNPPVVWAIADRLAQPQGAWAPF 247
PS AENI V +SHIG NP V+WAIADRLAQP+GA+APF
Sbjct 205 RPSPTAENIEVLLASHIGLTVNPAVLWAIADRLAQPEGAFAPF 247
>gi|146275779|ref|YP_001165939.1| PGAP1 family protein [Novosphingobium aromaticivorans DSM 12444]
gi|145322470|gb|ABP64413.1| PGAP1 family protein [Novosphingobium aromaticivorans DSM 12444]
Length=259
Score = 177 bits (449), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 99/245 (41%), Positives = 130/245 (54%), Gaps = 5/245 (2%)
Query 17 GWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRL 76
GW + T PR E L P+ P GDGHPV+VLPG D T +LR L RL
Sbjct 17 GWTMLET--PRFLSETALLALAWPMLAKAPQGDGHPVMVLPGFATNDTMTVLLRSFLARL 74
Query 77 GYAAYGWGLGRNIGPTAKAVSG--MRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHP 134
GY + W LG N+ + +G + +D + + VSL+GWSLGG+ AR AR
Sbjct 75 GYQVFPWDLGWNLDQHSAGENGEHLAARIDAIAAETGRKVSLVGWSLGGVIAREAARRDH 134
Query 135 SAVRQVITLGSPF-GMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSR 193
+RQV+TLGSPF G S + +E+ PLPVP+TAI+SR
Sbjct 135 GGLRQVVTLGSPFTGNPRATSLTSLYELLTGNKASSEKSAARYARGHHPLPVPSTAIFSR 194
Query 194 CDGMVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVL 253
DG+ AW+ C++ +R ENI V SH G+ NP V WA+ADRLAQP+G W F P
Sbjct 195 TDGITAWENCVSETDDRTENIEVHCSHFGFVANPGVFWAVADRLAQPEGQWRKFDPKGCF 254
Query 254 SPLFP 258
+ +P
Sbjct 255 AAFYP 259
>gi|288940094|ref|YP_003442334.1| hypothetical protein Alvin_0340 [Allochromatium vinosum DSM 180]
gi|288895466|gb|ADC61302.1| conserved hypothetical protein [Allochromatium vinosum DSM 180]
Length=260
Score = 177 bits (449), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 109/235 (47%), Positives = 133/235 (57%), Gaps = 7/235 (2%)
Query 27 RAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLG 86
R G E+G L A P P GDGHPVLVLP L D ST LR L RLGY A WGLG
Sbjct 25 RVGWEFGALFAAQPWLAQSPRGDGHPVLVLPRFLGCDLSTQPLRDFLDRLGYRAEPWGLG 84
Query 87 RNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSP 146
N+GP A + + L+ LH+ + VSLIGWSLGG++AR LA++ P VR VITLG+P
Sbjct 85 VNLGPRAGVMDACLERLEHLHATHGRRVSLIGWSLGGLYARELAKEAPEQVRLVITLGTP 144
Query 147 FGMRDTCETRSAWSFNRYAHLHTERHELPLE---MESEPLPVPTTAIYSRCDGMVAWQTC 203
F D + W + E LPL ++ P PVPTT+I SR DG+V W
Sbjct 145 FAG-DQSDPSELWRLQE--RMTGESIGLPLRHGPLDQAP-PVPTTSILSRSDGIVHWTDS 200
Query 204 MNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPLFP 258
+ ENI V SSH+G NP + AIADRLAQP+ AW PF + L+P
Sbjct 201 LEREGPITENILVESSHLGLAFNPLSLHAIADRLAQPEDAWRPFERTGARAWLYP 255
>gi|88703515|ref|ZP_01101231.1| conserved hypothetical protein [Congregibacter litoralis KT71]
gi|88702229|gb|EAQ99332.1| conserved hypothetical protein [Congregibacter litoralis KT71]
Length=270
Score = 177 bits (448), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/250 (42%), Positives = 139/250 (56%), Gaps = 11/250 (4%)
Query 16 PGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRR 75
P +L LT+ R +E L AV + L GDGHPV+VLPG L DG LRR L+
Sbjct 23 PAAWLALTEPQRVALEVLSLAAVRRMLNNLAPGDGHPVMVLPGFLGSDGYNATLRRFLKS 82
Query 76 LGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPS 135
L Y YGWG G+N+GP + + + + L RY VS++G SLGGIFAR +AR+ P
Sbjct 83 LDYRVYGWGQGQNLGPRGDTLEKLLERVAMLKDRYGQSVSMVGHSLGGIFAREIAREAPD 142
Query 136 AVRQVITLGSPFGMRDTCETRSAWSF-NRYAHLHTERHELPLEMES--EPLPVPTTAIYS 192
VRQV++LGSPFG R + S+ R +LP+ ++ PVPTTA+YS
Sbjct 143 LVRQVVSLGSPFG-----RGRHSGSYPARLFEALNPTDDLPVALDDLHRAPPVPTTAVYS 197
Query 193 RCDGMVAWQTCMNSPS---ERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRP 249
+ DG+V W+T +P E +NI VR SH G NP V + IADRL Q W PF
Sbjct 198 KGDGIVNWRTAFQNPEFAHESTQNIQVRGSHCGMTVNPTVWYIIADRLRQSVDDWKPFTV 257
Query 250 PKVLSPLFPR 259
+ + + P+
Sbjct 258 SGLATVMVPK 267
>gi|91976296|ref|YP_568955.1| hypothetical protein RPD_1818 [Rhodopseudomonas palustris BisB5]
gi|91682752|gb|ABE39054.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5]
Length=262
Score = 175 bits (443), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 106/223 (48%), Positives = 135/223 (61%), Gaps = 3/223 (1%)
Query 27 RAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLG 86
R+ E+ + + PL P GDGHPVLVLPGLLA D ST LRR LR LGY + W LG
Sbjct 26 RSLFEFNASVLLSPLLLRAPKGDGHPVLVLPGLLASDLSTAPLRRYLRHLGYQTFAWELG 85
Query 87 RNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSP 146
RN G + +R LD +H+ VSL+GWSLGG++AR LA P +R +ITLGSP
Sbjct 86 RNFGGVYRMRDRLRRRLDAVHAASGRKVSLVGWSLGGVYARDLALHAPETIRGIITLGSP 145
Query 147 FGMRDTCETRSAWSFNRYAHLHTERHEL-PLEMESEPLPVPTTAIYSRCDGMVAWQTCMN 205
F D T + + + + + L L + +PVPTT+IYSR DG+V W+T +
Sbjct 146 FS-GDITATNARRVYEKLSGEPLDGVRLDDLRALAGDMPVPTTSIYSRTDGIVNWRTSLL 204
Query 206 SPSERAENIAV-RSSHIGYGHNPPVVWAIADRLAQPQGAWAPF 247
PS AENI V +SHIG N V+WAIADRLAQP+G + PF
Sbjct 205 RPSPNAENIEVLLASHIGLTVNAAVLWAIADRLAQPEGEFQPF 247
>gi|119478567|ref|ZP_01618510.1| hypothetical protein GP2143_12311 [marine gamma proteobacterium
HTCC2143]
gi|119448471|gb|EAW29720.1| hypothetical protein GP2143_12311 [marine gamma proteobacterium
HTCC2143]
Length=246
Score = 175 bits (443), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 102/245 (42%), Positives = 140/245 (58%), Gaps = 8/245 (3%)
Query 14 DAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRIL 73
P L LT+ RA V+ L +P R AGDGHPV+V+PG A ST I+R L
Sbjct 2 QGPSNLLRLTEPLRAAVDLSTLTLAMPWLRFFKAGDGHPVMVIPGFTASGRSTKIIRDFL 61
Query 74 RRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDH 133
GY A W G N+G G D+L+K+H+ VSL+G SLGGI+AR +A+
Sbjct 62 TARGYQASCWEQGTNMGVRGDLYDGAVDILEKIHAETGLKVSLVGQSLGGIYAREIAKRQ 121
Query 134 PSAVRQVITLGSPFGMRDTCETRSAWSFNR-YAHL--HTERHELPLEME-SEPLPVPTTA 189
P VRQVI+LGSPF +T +RS+ + + +A H H +E + SE P+PTTA
Sbjct 122 PHLVRQVISLGSPF---NTIGSRSSKNTEQPFAQTLRHESAHFRAMEWQPSEAPPMPTTA 178
Query 190 IYSRCDGMVAWQTC-MNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFR 248
I+S+ DG+ W+TC ++ ENI V SHIG G NP V++ +A+RL+Q + W PF
Sbjct 179 IFSKADGICHWRTCRQHNGHSSTENIEVLGSHIGMGVNPQVLFVLANRLSQAENNWQPFT 238
Query 249 PPKVL 253
+ L
Sbjct 239 SSRYL 243
>gi|89900267|ref|YP_522738.1| hypothetical protein Rfer_1474 [Rhodoferax ferrireducens T118]
gi|89345004|gb|ABD69207.1| conserved hypothetical protein [Rhodoferax ferrireducens T118]
Length=265
Score = 174 bits (442), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 105/243 (44%), Positives = 133/243 (55%), Gaps = 16/243 (6%)
Query 27 RAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLG 86
RA E G ++ P R P GDGH V+V PGL A D ST +R L LG+ GW G
Sbjct 28 RAFWELGAVIPAWPFLRQAPTGDGHSVIVFPGLSASDASTLPMRSFLENLGHDVSGWNQG 87
Query 87 RNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSP 146
N GP A + R + VSL+GWSLGGI+AR LA++ P VR VITLG+P
Sbjct 88 SNFGPRAGVLQAARRQVIDTCQVTGQKVSLVGWSLGGIYARELAKELPDCVRDVITLGTP 147
Query 147 FGMRDTCETRSAWSF-----NRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDGMVAWQ 201
F + E+ +AW R H E+ +LP+ PVPTT+I+SR DG+VAW
Sbjct 148 FA--GSHESTNAWHLYQLVSGRDIHGEVEQFDLPVAP-----PVPTTSIFSRTDGIVAWP 200
Query 202 TCMNSPSE---RAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPL-F 257
+ +P + ENI V +SH+G G NP WA+ADRLAQ +G W PF L L F
Sbjct 201 ASIQAPCKINRLTENIEVIASHVGLGLNPSAWWAVADRLAQAEGKWQPFAHKGGLHGLIF 260
Query 258 PRP 260
P P
Sbjct 261 PNP 263
>gi|115523702|ref|YP_780613.1| hypothetical protein RPE_1684 [Rhodopseudomonas palustris BisA53]
gi|115517649|gb|ABJ05633.1| PGAP1 family protein [Rhodopseudomonas palustris BisA53]
Length=251
Score = 170 bits (430), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 98/205 (48%), Positives = 128/205 (63%), Gaps = 9/205 (4%)
Query 48 GDGHPVLVLPGLLAGDGSTWILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLH 107
GDGHPVLVLPGLLA D S +RR L+ LGY ++ W LGRN G K + +R+ L ++H
Sbjct 35 GDGHPVLVLPGLLASDLSMAPMRRFLKHLGYHSHAWDLGRNTGGIYKMRAKVRERLRRIH 94
Query 108 SRYHTPVSLIGWSLGGIFARGLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNRYAHL 167
+ VSL+GWSLGGI+AR LA P VR VI+LGSPF T + + + Y L
Sbjct 95 HQAGRKVSLVGWSLGGIYARDLALHAPEMVRSVISLGSPF----TGDLSATNARRAYEML 150
Query 168 HTERHE----LPLEMESEPLPVPTTAIYSRCDGMVAWQTCMNSPSERAENIAVR-SSHIG 222
ER + L + LPVPT++IYS+ DG+V W+T + PS AENI V +SH+G
Sbjct 151 SGERLQDVEVADLVALAGDLPVPTSSIYSKTDGIVNWRTSVLRPSASAENIEVYLASHVG 210
Query 223 YGHNPPVVWAIADRLAQPQGAWAPF 247
N V+WA+ADRLAQ +G + PF
Sbjct 211 LPVNAAVLWAVADRLAQREGTFRPF 235
>gi|85707926|ref|ZP_01038992.1| hypothetical protein NAP1_01785 [Erythrobacter sp. NAP1]
gi|85689460|gb|EAQ29463.1| hypothetical protein NAP1_01785 [Erythrobacter sp. NAP1]
Length=270
Score = 169 bits (428), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 102/258 (40%), Positives = 140/258 (55%), Gaps = 12/258 (4%)
Query 8 PADRPCDAPGWFLYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTW 67
P R P LT+ RA E+ A LP RMLP GDGH V+ LPG +A + ST
Sbjct 15 PQARVAQPPNRLWTLTE-GRAMGEFAAFYAALPAMRMLPRGDGHSVMFLPGFMASNRSTV 73
Query 68 ILRRILRRLGYAAYGWGLGRNIGPTAKAVSGMRDLLDKLHSRYHTPVSLIGWSLGGIFAR 127
+RR+ L Y A+GW GRN+ V M + L +L VSLIGWSLGG+ AR
Sbjct 74 PMRRLFTELNYDAHGWESGRNVRVNEATVMKMENQLTRLFKSSGRKVSLIGWSLGGVLAR 133
Query 128 GLARDHPSAVRQVITLGSPFGMRDTCETRSAWSFNR-YAHLHTERHELPLEMESEPL--- 183
LA+ HP VR V +LGSP R S R + L+ ++ + + + L
Sbjct 134 ELAKLHPEKVRLVASLGSPL-----SNDRGHSSAKRLFELLNGNEPKVIQKGKFDELHIA 188
Query 184 -PVPTTAIYSRCDGMVAWQTCMNSPSER-AENIAVRSSHIGYGHNPPVVWAIADRLAQPQ 241
PVPTT+I ++ DG+V W+ + + +ENI V +SH+G G NP V+ A+ADRL+Q +
Sbjct 189 PPVPTTSILTKTDGVVHWRASVQEEGDHPSENIVVHASHLGLGVNPSVMLALADRLSQDE 248
Query 242 GAWAPFRPPKVLSPLFPR 259
G W PF P + +FP+
Sbjct 249 GGWKPFAPSLIQRWMFPK 266
>gi|334141171|ref|YP_004534377.1| PGAP1 family protein [Novosphingobium sp. PP1Y]
gi|333939201|emb|CCA92559.1| PGAP1 family protein [Novosphingobium sp. PP1Y]
Length=256
Score = 165 bits (417), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 91/241 (38%), Positives = 130/241 (54%), Gaps = 3/241 (1%)
Query 20 LYLTDIPRAGVEYGQLLAVLPLQRMLPAGDGHPVLVLPGLLAGDGSTWILRRILRRLGYA 79
L L + R +E G L+A+ PL + P GDGHPV+VLPG D T +LR L++L Y
Sbjct 15 LALLEPARCLMEAGALVALSPLLSLSPRGDGHPVVVLPGFATNDTMTILLRSFLKQLSYD 74
Query 80 AYGWGLGRNIGPTAKAVSG--MRDLLDKLHSRYHTPVSLIGWSLGGIFARGLARDHPSAV 137
Y LG N +G + + + + S VSL+GWSLGG+ AR AR P +
Sbjct 75 VYPMDLGWNFDQHTVGENGEYIAERIRAIRSDTGRKVSLVGWSLGGVIAREAARRDPDDL 134
Query 138 RQVITLGSPF-GMRDTCETRSAWSFNRYAHLHTERHELPLEMESEPLPVPTTAIYSRCDG 196
RQVI+LGSPF G ++ + F + + + ++ LP+P+TA++SR DG
Sbjct 135 RQVISLGSPFSGNPRATNLQTVYQFATGNDFTSAKMVERYRIGADALPIPSTAVFSRTDG 194
Query 197 MVAWQTCMNSPSERAENIAVRSSHIGYGHNPPVVWAIADRLAQPQGAWAPFRPPKVLSPL 256
+ AW+ C+ E EN+ V SSH G+ NP V IADRL Q +G W F+P +
Sbjct 195 VTAWENCLGDTDEINENVEVVSSHFGFMTNPAVFHVIADRLGQVEGQWQSFQPSAPFASF 254
Query 257 F 257
+
Sbjct 255 Y 255
Lambda K H
0.321 0.139 0.454
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 432410969436
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40