BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1907c
Length=215
Sequences producing significant alignments: Score (Bits) E Value
gi|15609044|ref|NP_216423.1| hypothetical protein Rv1907c [Mycob... 434 5e-120
gi|289745661|ref|ZP_06505039.1| conserved hypothetical protein [... 342 3e-92
gi|289758013|ref|ZP_06517391.1| conserved hypothetical protein [... 341 3e-92
gi|308231979|ref|ZP_07414469.2| hypothetical protein TMAG_02087 ... 334 6e-90
gi|294996819|ref|ZP_06802510.1| hypothetical protein Mtub2_20533... 320 7e-86
gi|339294842|gb|AEJ46953.1| hypothetical protein CCDC5079_1763 [... 313 1e-83
gi|293245|gb|AAA72375.1| hypothetical protein [Mycobacterium tub... 185 4e-45
gi|108802050|ref|YP_642247.1| hypothetical protein Mmcs_5087 [My... 162 4e-38
gi|126438029|ref|YP_001073720.1| hypothetical protein Mjls_5466 ... 160 9e-38
gi|145221111|ref|YP_001131789.1| hypothetical protein Mflv_0508 ... 126 2e-27
gi|302529925|ref|ZP_07282267.1| predicted protein [Streptomyces ... 123 1e-26
gi|300789692|ref|YP_003769983.1| hypothetical protein AMED_7875 ... 123 2e-26
gi|326381691|ref|ZP_08203385.1| hypothetical protein SCNU_02050 ... 69.7 3e-10
gi|302870522|ref|YP_003839159.1| hypothetical protein Micau_6088... 67.8 1e-09
gi|111223184|ref|YP_713978.1| hypothetical protein FRAAL3774 [Fr... 64.7 8e-09
gi|302866278|ref|YP_003834915.1| hypothetical protein Micau_1787... 64.3 1e-08
gi|330465503|ref|YP_004403246.1| hypothetical protein VAB18032_0... 60.1 2e-07
gi|337268413|ref|YP_004612468.1| hypothetical protein Mesop_3936... 58.9 5e-07
gi|257095086|ref|YP_003168727.1| hypothetical protein CAP2UW1_35... 55.8 4e-06
gi|87119729|ref|ZP_01075626.1| hypothetical protein MED121_07310... 54.7 8e-06
gi|312197463|ref|YP_004017524.1| hypothetical protein FraEuI1c_3... 54.7 9e-06
gi|171915661|ref|ZP_02931131.1| hypothetical protein VspiD_30860... 54.7 9e-06
gi|291008209|ref|ZP_06566182.1| hypothetical protein SeryN2_2712... 54.3 1e-05
gi|134098594|ref|YP_001104255.1| hypothetical protein SACE_2020 ... 54.3 1e-05
gi|343927728|ref|ZP_08767196.1| hypothetical protein GOALK_097_0... 53.9 1e-05
gi|108802582|ref|YP_642778.1| hypothetical protein Mmcs_5622 [My... 52.0 6e-05
gi|146300657|ref|YP_001195248.1| hypothetical protein Fjoh_2908 ... 52.0 6e-05
gi|288918036|ref|ZP_06412394.1| hypothetical protein FrEUN1fDRAF... 51.6 7e-05
gi|262203574|ref|YP_003274782.1| hypothetical protein Gbro_3705 ... 51.6 9e-05
gi|254381320|ref|ZP_04996685.1| conserved hypothetical protein [... 51.2 1e-04
gi|319787513|ref|YP_004146988.1| hypothetical protein Psesu_1916... 50.8 1e-04
gi|284989913|ref|YP_003408467.1| hypothetical protein Gobs_1361 ... 48.5 6e-04
gi|330983433|gb|EGH81536.1| hypothetical protein PLA107_00285 [P... 48.5 6e-04
gi|189426806|ref|YP_001949905.1| hypothetical protein RSL1_gp030... 48.1 8e-04
gi|298251298|ref|ZP_06975101.1| conserved hypothetical protein [... 48.1 8e-04
gi|333892736|ref|YP_004466611.1| hypothetical protein ambt_06360... 47.4 0.002
gi|308178589|ref|YP_003917995.1| hypothetical protein AARI_28190... 47.0 0.002
gi|312887878|ref|ZP_07747465.1| conserved hypothetical protein [... 46.2 0.003
gi|323500130|ref|ZP_08105076.1| hypothetical protein VISI1226_09... 44.7 0.008
gi|256424624|ref|YP_003125277.1| hypothetical protein Cpin_5652 ... 44.7 0.010
gi|169630193|ref|YP_001703842.1| hypothetical protein MAB_3111 [... 43.9 0.015
gi|269126019|ref|YP_003299389.1| hypothetical protein Tcur_1777 ... 43.5 0.021
gi|300787176|ref|YP_003767467.1| hypothetical protein AMED_5303 ... 43.5 0.022
gi|220925527|ref|YP_002500829.1| hypothetical protein Mnod_5689 ... 43.1 0.024
gi|121603288|ref|YP_980617.1| hypothetical protein Pnap_0373 [Po... 42.7 0.037
gi|94499459|ref|ZP_01305996.1| hypothetical protein RED65_00460 ... 42.4 0.046
gi|338780937|gb|EGP45334.1| hypothetical protein AXXA_16667 [Ach... 42.0 0.068
gi|302527854|ref|ZP_07280196.1| conserved hypothetical protein [... 41.6 0.072
gi|149186353|ref|ZP_01864666.1| hypothetical protein ED21_22728 ... 41.2 0.10
gi|153005465|ref|YP_001379790.1| hypothetical protein Anae109_26... 40.4 0.16
>gi|15609044|ref|NP_216423.1| hypothetical protein Rv1907c [Mycobacterium tuberculosis H37Rv]
gi|15841379|ref|NP_336416.1| hypothetical protein MT1958 [Mycobacterium tuberculosis CDC1551]
gi|31793100|ref|NP_855593.1| hypothetical protein Mb1942c [Mycobacterium bovis AF2122/97]
45 more sequence titles
Length=215
Score = 434 bits (1116), Expect = 5e-120, Method: Compositional matrix adjust.
Identities = 214/215 (99%), Positives = 215/215 (100%), Gaps = 0/215 (0%)
Query 1 LIGPARRSTTTRRSTPRADRLAGCWCLPGAICQTPRAWWSQARRDGDDETGMRRKGAEMC 60
+IGPARRSTTTRRSTPRADRLAGCWCLPGAICQTPRAWWSQARRDGDDETGMRRKGAEMC
Sbjct 1 MIGPARRSTTTRRSTPRADRLAGCWCLPGAICQTPRAWWSQARRDGDDETGMRRKGAEMC 60
Query 61 WMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLSPR 120
WMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLSPR
Sbjct 61 WMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLSPR 120
Query 121 RGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTALQ 180
RGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTALQ
Sbjct 121 RGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTALQ 180
Query 181 LVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 215
LVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA
Sbjct 181 LVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 215
>gi|289745661|ref|ZP_06505039.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|298525402|ref|ZP_07012811.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|289686189|gb|EFD53677.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|298495196|gb|EFI30490.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=175
Score = 342 bits (876), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 168/168 (100%), Positives = 168/168 (100%), Gaps = 0/168 (0%)
Query 48 DETGMRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRR 107
DETGMRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRR
Sbjct 8 DETGMRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRR 67
Query 108 GLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYC 167
GLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYC
Sbjct 68 GLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYC 127
Query 168 AIAIFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 215
AIAIFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA
Sbjct 128 AIAIFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 175
>gi|289758013|ref|ZP_06517391.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|289713577|gb|EFD77589.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|326903511|gb|EGE50444.1| hypothetical protein TBPG_01387 [Mycobacterium tuberculosis W-148]
Length=207
Score = 341 bits (875), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 168/168 (100%), Positives = 168/168 (100%), Gaps = 0/168 (0%)
Query 48 DETGMRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRR 107
DETGMRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRR
Sbjct 40 DETGMRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRR 99
Query 108 GLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYC 167
GLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYC
Sbjct 100 GLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYC 159
Query 168 AIAIFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 215
AIAIFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA
Sbjct 160 AIAIFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 207
>gi|308231979|ref|ZP_07414469.2| hypothetical protein TMAG_02087 [Mycobacterium tuberculosis SUMu001]
gi|308369557|ref|ZP_07418250.2| hypothetical protein TMBG_00440 [Mycobacterium tuberculosis SUMu002]
gi|308370858|ref|ZP_07422977.2| hypothetical protein TMCG_02949 [Mycobacterium tuberculosis SUMu003]
22 more sequence titles
Length=164
Score = 334 bits (856), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 164/164 (100%), Positives = 164/164 (100%), Gaps = 0/164 (0%)
Query 52 MRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPE 111
MRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPE
Sbjct 1 MRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPE 60
Query 112 LVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAI 171
LVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAI
Sbjct 61 LVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAI 120
Query 172 FGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 215
FGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA
Sbjct 121 FGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 164
>gi|294996819|ref|ZP_06802510.1| hypothetical protein Mtub2_20533 [Mycobacterium tuberculosis
210]
gi|339298467|gb|AEJ50577.1| hypothetical protein CCDC5180_1740 [Mycobacterium tuberculosis
CCDC5180]
Length=157
Score = 320 bits (821), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 157/157 (100%), Positives = 157/157 (100%), Gaps = 0/157 (0%)
Query 59 MCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLS 118
MCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLS
Sbjct 1 MCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLS 60
Query 119 PRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTA 178
PRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTA
Sbjct 61 PRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTA 120
Query 179 LQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 215
LQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA
Sbjct 121 LQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 157
>gi|339294842|gb|AEJ46953.1| hypothetical protein CCDC5079_1763 [Mycobacterium tuberculosis
CCDC5079]
Length=154
Score = 313 bits (801), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 154/154 (100%), Positives = 154/154 (100%), Gaps = 0/154 (0%)
Query 62 MCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLSPRR 121
MCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLSPRR
Sbjct 1 MCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLSPRR 60
Query 122 GQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTALQL 181
GQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTALQL
Sbjct 61 GQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTALQL 120
Query 182 VWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 215
VWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA
Sbjct 121 VWADRRGRWPWAADFDEGRGTQPVLGMRATRRSA 154
>gi|293245|gb|AAA72375.1| hypothetical protein [Mycobacterium tuberculosis]
Length=168
Score = 185 bits (470), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 106/154 (69%), Positives = 111/154 (73%), Gaps = 10/154 (6%)
Query 52 MRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPE 111
MRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPE
Sbjct 1 MRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPE 60
Query 112 LVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVET-VQVTHPDAHLYCAIA 170
LVVTGLSPRRGQRLLNIAARRALVGDLL P+ P T A + C
Sbjct 61 LVVTGLSPRRGQRLLNIAARRALVGDLLNSRYADHPPSRPSCRNGPGYTSGRAFVLCDRH 120
Query 171 IF----GDKVTALQLVWADRRGRWPWAADFDEGR 200
++ G V + V A R AADFDEGR
Sbjct 121 LWRQGDGLAVGVGRPVVAGR-----GAADFDEGR 149
>gi|108802050|ref|YP_642247.1| hypothetical protein Mmcs_5087 [Mycobacterium sp. MCS]
gi|119871202|ref|YP_941154.1| hypothetical protein Mkms_5175 [Mycobacterium sp. KMS]
gi|108772469|gb|ABG11191.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119697291|gb|ABL94364.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=178
Score = 162 bits (409), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 85/159 (54%), Positives = 104/159 (66%), Gaps = 2/159 (1%)
Query 55 KGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVV 114
KG MCW CDHPEAT +YLD VY +L GWAVQ+VE ERRPFAYTVGL GLPEL++
Sbjct 8 KGGAMCWHCDHPEATLNDYLDVVYDKILRKGWAVQYVESERRPFAYTVGLHECGLPELLI 67
Query 115 TGLSPRRGQRLLNIAARRALVGD-LLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFG 173
T + P+R +LN A + D + G +LP L+E V+V+ PDAH+ A+ I+G
Sbjct 68 TAVVPKRALLVLNTVAEYCIGHDGPVLAGDTMSLP-DQLLEFVEVSQPDAHMGVAVGIYG 126
Query 174 DKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATR 212
V ALQLVWAD WPW+A F+ G QPVLG R TR
Sbjct 127 RDVRALQLVWADANHEWPWSARFNPGGLRQPVLGQRETR 165
>gi|126438029|ref|YP_001073720.1| hypothetical protein Mjls_5466 [Mycobacterium sp. JLS]
gi|126237829|gb|ABO01230.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=157
Score = 160 bits (406), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 83/155 (54%), Positives = 105/155 (68%), Gaps = 2/155 (1%)
Query 59 MCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLS 118
MCW CDHPEAT +YLD V G++L +GWAVQ+VE ER PFAYT+GL GLPEL++T +
Sbjct 1 MCWHCDHPEATRSDYLDVVRGLILKNGWAVQYVESERTPFAYTIGLHECGLPELLITAVD 60
Query 119 PRRGQRLLNIAARRALVGD-LLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVT 177
RR +LN A + D ++ G +LP L E V+V+ PDAH+ AI I+G V
Sbjct 61 KRRALLVLNTVANYCIKHDGPVSAGDVMSLPDQQL-EFVEVSQPDAHMGMAIGIYGRDVR 119
Query 178 ALQLVWADRRGRWPWAADFDEGRGTQPVLGMRATR 212
ALQLVWAD + RWPW+A+F+ G QPVLG R TR
Sbjct 120 ALQLVWADEQNRWPWSAEFNPGGVRQPVLGERVTR 154
>gi|145221111|ref|YP_001131789.1| hypothetical protein Mflv_0508 [Mycobacterium gilvum PYR-GCK]
gi|315441926|ref|YP_004074805.1| hypothetical protein Mspyr1_02540 [Mycobacterium sp. Spyr1]
gi|145213597|gb|ABP43001.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315260229|gb|ADT96970.1| hypothetical protein Mspyr1_02540 [Mycobacterium sp. Spyr1]
Length=157
Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 69/156 (45%), Positives = 86/156 (56%), Gaps = 0/156 (0%)
Query 59 MCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLS 118
MCW CDHPEAT +Y D + +L HGWAVQ+V ER PF YT+GL GLPEL+V GL
Sbjct 1 MCWQCDHPEATRADYHDVLRRKILAHGWAVQYVGSERTPFGYTIGLHPAGLPELLVAGLP 60
Query 119 PRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTA 178
P ++LN A + PG L E V V P AH+ + ++G +
Sbjct 61 PETTLKILNTLAGYMVREVEPAPGDTMQLADEWHGEFVAVAEPHAHMGLGLELYGPALRG 120
Query 179 LQLVWADRRGRWPWAADFDEGRGTQPVLGMRATRRS 214
LQ VW DR G PW DF++G QPVLG R+ S
Sbjct 121 LQFVWRDRDGHTPWCPDFNKGGLRQPVLGNRSAALS 156
>gi|302529925|ref|ZP_07282267.1| predicted protein [Streptomyces sp. AA4]
gi|302438820|gb|EFL10636.1| predicted protein [Streptomyces sp. AA4]
Length=155
Score = 123 bits (309), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/153 (48%), Positives = 93/153 (61%), Gaps = 2/153 (1%)
Query 59 MCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHV--ECERRPFAYTVGLTRRGLPELVVTG 116
MC C+ P+ E+YL EV + +GW VQ V R +AYT GLT +GLPELVVTG
Sbjct 1 MCQRCEEPDRPEEQYLIEVLDEIRENGWCVQGVLGTGSRPSWAYTAGLTAQGLPELVVTG 60
Query 117 LSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKV 176
L P + LLN AA ++L PG Q LP P VE VQ++ P AHL A+ +G +
Sbjct 61 LLPHQAVPLLNAAAGQSLHTGPPVPGEQWLLPRLPRVEIVQLSAPAAHLDIAVCCYGTGI 120
Query 177 TALQLVWADRRGRWPWAADFDEGRGTQPVLGMR 209
A QLV+AD G +PW+ ++ GRG QPVLG+R
Sbjct 121 EARQLVYADPAGWFPWSPQYNSGRGGQPVLGVR 153
>gi|300789692|ref|YP_003769983.1| hypothetical protein AMED_7875 [Amycolatopsis mediterranei U32]
gi|299799206|gb|ADJ49581.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340531356|gb|AEK46561.1| hypothetical protein RAM_40470 [Amycolatopsis mediterranei S699]
Length=154
Score = 123 bits (308), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/153 (47%), Positives = 95/153 (63%), Gaps = 4/153 (2%)
Query 59 MCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECE--RRPFAYTVGLTRRGLPELVVTG 116
MC+ C++ + + YL+ + G + GW VQ VE P+AYT+GL+ GLPELVVTG
Sbjct 1 MCFECENRDRSG--YLERLRGGVAARGWLVQGVEGAGPYPPWAYTIGLSGYGLPELVVTG 58
Query 117 LSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKV 176
L LLN A + L G T G + LP GPLVE V++T P HL A A++G ++
Sbjct 59 LPALAAGGLLNNLAAQVLRGSPPTAGERIQLPDGPLVEVVELTEPSVHLVFAAALYGPEI 118
Query 177 TALQLVWADRRGRWPWAADFDEGRGTQPVLGMR 209
ALQLV AD +GR+PW+ D+ +GR QPVLG R
Sbjct 119 RALQLVHADAQGRFPWSPDYRDGRAGQPVLGPR 151
>gi|326381691|ref|ZP_08203385.1| hypothetical protein SCNU_02050 [Gordonia neofelifaecis NRRL
B-59395]
gi|326199938|gb|EGD57118.1| hypothetical protein SCNU_02050 [Gordonia neofelifaecis NRRL
B-59395]
Length=187
Score = 69.7 bits (169), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/162 (34%), Positives = 77/162 (48%), Gaps = 12/162 (7%)
Query 54 RKGAEMCWMCDHPEATAEEYL-DEVYGIMLMHGWAVQHV--ECERRPFAYTVGLTRRGLP 110
R+G MC P + ++L D+ ++ WA+ V + R P YT GLT G P
Sbjct 25 RQGGVMCEF--DPRCSGPDHLVDDALALIADGRWAITGVLGDAARSPMTYTTGLTEHGRP 82
Query 111 ELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPA---GPL-VETVQVTHPDAHLY 166
ELV+TGL P LL AAR + PG + +PA P+ V V +
Sbjct 83 ELVMTGLPPDLAGVLLEHAARSVIADRSFGPG--SDVPARLRRPVRFRAVDVIDSEPMRL 140
Query 167 CAIAIFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGM 208
I ++G + A+QLVW D GR+PW + QP+LG+
Sbjct 141 TRI-VYGRQFDAVQLVWPDDDGRYPWQPGYSIPTQVQPLLGV 181
>gi|302870522|ref|YP_003839159.1| hypothetical protein Micau_6088 [Micromonospora aurantiaca ATCC
27029]
gi|302573381|gb|ADL49583.1| hypothetical protein Micau_6088 [Micromonospora aurantiaca ATCC
27029]
Length=153
Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/138 (35%), Positives = 65/138 (48%), Gaps = 10/138 (7%)
Query 80 IMLMHGWAVQHV------ECERRPFAYTVGLTRRGLPELVVTGLSPRRGQRLLNIAARRA 133
I+ GWAV HV PFAYTVGLT PEL+ GL P LLN ARR
Sbjct 14 IIDTTGWAVTHVLPTDDDPDTTAPFAYTVGLTAYDYPELITAGLPPEVAHSLLNDLARRV 73
Query 134 L-VGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAI--FG-DKVTALQLVWADRRGR 189
+ T G + + + P L A+AI +G D++ Q+VW D+ GR
Sbjct 74 YDKAERFTHGQRISDLIADYDAMIIDGPPTDELLPAMAINRYGRDQIRLQQMVWPDQEGR 133
Query 190 WPWAADFDEGRGTQPVLG 207
+PW ++ R QP++
Sbjct 134 FPWDDGYNFDRHAQPLIA 151
>gi|111223184|ref|YP_713978.1| hypothetical protein FRAAL3774 [Frankia alni ACN14a]
gi|111150716|emb|CAJ62417.1| hypothetical protein FRAAL3774 [Frankia alni ACN14a]
Length=174
Score = 64.7 bits (156), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 52/137 (38%), Positives = 63/137 (46%), Gaps = 15/137 (10%)
Query 84 HGWAVQHVECE----RRPFAYTVGLTRRGLPELVVTGLSPRRGQRLLNIAARRALVGDLL 139
HGWAVQ V E AYT+GLT PEL++ GL P LLN A R GD
Sbjct 24 HGWAVQAVLAEPDTGEPDHAYTIGLTALHHPELLIAGLHPHDAAALLNQLATRIRAGD-- 81
Query 140 TPGMQTTL-----PAGPLVETVQVTHPDAHLYCAIAIF----GDKVTALQLVWADRRGRW 190
P TTL P + T+ D L A A++ G V ALQ++W+D GR
Sbjct 82 PPPADTTLDDLAPPRRHHLLTLDAAASDELLLHANALYQHPDGPPVAALQIIWSDPTGRL 141
Query 191 PWAADFDEGRGTQPVLG 207
PW A QP+ G
Sbjct 142 PWEAGCTGDATHQPLAG 158
>gi|302866278|ref|YP_003834915.1| hypothetical protein Micau_1787 [Micromonospora aurantiaca ATCC
27029]
gi|302569137|gb|ADL45339.1| hypothetical protein Micau_1787 [Micromonospora aurantiaca ATCC
27029]
Length=153
Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/141 (35%), Positives = 65/141 (47%), Gaps = 26/141 (18%)
Query 85 GWAVQHV------ECERRPFAYTVGLTRRGLPELVVTGLSPRRGQRLLNIAARRAL---- 134
GWAV +V PFAYTVGLT PEL+ GL P LLN ARR
Sbjct 19 GWAVTYVLPTDDGTVTTAPFAYTVGLTAHDYPELITAGLPPEVAHSLLNDLARRVYDTAE 78
Query 135 -------VGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFG-DKVTALQLVWADR 186
+ DL+ G + GP + + AI+ +G D+V Q+VW D+
Sbjct 79 RFTHGQRLSDLIA-GYDAIIIDGPPTDELMPG-------LAISRYGRDQVRLQQMVWPDQ 130
Query 187 RGRWPWAADFDEGRGTQPVLG 207
+GR+PW + TQP++G
Sbjct 131 QGRFPWDDGYRFEPRTQPLIG 151
>gi|330465503|ref|YP_004403246.1| hypothetical protein VAB18032_07630 [Verrucosispora maris AB-18-032]
gi|328808474|gb|AEB42646.1| hypothetical protein VAB18032_07630 [Verrucosispora maris AB-18-032]
Length=153
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/147 (33%), Positives = 72/147 (49%), Gaps = 10/147 (6%)
Query 71 EEYLDEVYGIMLMHGWAVQHV------ECERRPFAYTVGLTRRGLPELVVTGLSPRRGQR 124
+++L I+ GWAV HV PFAYTVGLT PEL++ GL P
Sbjct 5 DDFLRNQERIITTRGWAVTHVLPTDDDPDTTAPFAYTVGLTAHDHPELIIAGLPPLVAHT 64
Query 125 LLNIAARRAL-VGDLLTPGMQTT-LPAGPLVETVQVTHPDAHLY-CAIAIFGD-KVTALQ 180
LLN AR+ + + G + + L AG + D L AIA +G ++ Q
Sbjct 65 LLNDLARQVYDKAERFSHGQRISDLIAGYDAVIIDGRPTDDLLPGAAIARYGRLRIRLQQ 124
Query 181 LVWADRRGRWPWAADFDEGRGTQPVLG 207
+VW D++GR+PW + ++ QP++
Sbjct 125 IVWPDQQGRFPWDSGYNFDPHIQPMIA 151
>gi|337268413|ref|YP_004612468.1| hypothetical protein Mesop_3936 [Mesorhizobium opportunistum
WSM2075]
gi|336028723|gb|AEH88374.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075]
Length=161
Score = 58.9 bits (141), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 42/145 (29%), Positives = 63/145 (44%), Gaps = 10/145 (6%)
Query 76 EVYGIMLMHGWAVQHVECERRPFAYTVGLTRR-GLPELVVTGLSPRRGQRLLNIAARRAL 134
E YG +++ E + PF+Y+VG+ PEL+V GL P Q ++N RR
Sbjct 19 EAYGCHILYVLE----EDDNPPFSYSVGIEHNFKAPELIVIGLKPEISQSIINEYCRRVR 74
Query 135 VGDLLTPGMQTTLPAGPL---VETVQVTHPDAHLYCAIAIF-GDKVTALQLVWADRRGRW 190
G++ PG + + TV V H H I + G +QL++ G W
Sbjct 75 SGEIFEPGQRASGFVNGFDCQFGTVHVGHYREHFGWDIWFYDGLDFRVMQLIFPTTEGVW 134
Query 191 PWAAD-FDEGRGTQPVLGMRATRRS 214
PW D D R QP+L + +
Sbjct 135 PWEVDASDWFRARQPLLDTEPSPKD 159
>gi|257095086|ref|YP_003168727.1| hypothetical protein CAP2UW1_3541 [Candidatus Accumulibacter
phosphatis clade IIA str. UW-1]
gi|257047610|gb|ACV36798.1| conserved hypothetical protein [Candidatus Accumulibacter phosphatis
clade IIA str. UW-1]
Length=151
Score = 55.8 bits (133), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/149 (29%), Positives = 68/149 (46%), Gaps = 17/149 (11%)
Query 71 EEYLDEVYGIMLMHGWAVQHVECERR---PFAYTVGLTRRG-LPELVVTGLSPRRGQRLL 126
E Y + + HG +V V + PF+Y++G+ + PEL++ GL + ++
Sbjct 2 EPYEQNILQHIEKHGCSVTSVFDPKEIDPPFSYSIGIAKSSSAPELIIVGLGSKLSHWMV 61
Query 127 NIAARRALVGDLLTPGMQT-------TLPAGPLVETVQVTHPDAHLYCAIAIFG-DKVTA 178
N RR G+ PG+ + GP V H + ++ A + G + A
Sbjct 62 NEYNRRVQSGERFLPGVHYLGFLEDFAVQFGP----VAREHREEYMRSACWLHGGSEFDA 117
Query 179 LQLVWADRRGRWPWAADFDEG-RGTQPVL 206
LQL+W + G WPW A+ E R QP+L
Sbjct 118 LQLIWPNTSGVWPWDAEASEWLRANQPLL 146
>gi|87119729|ref|ZP_01075626.1| hypothetical protein MED121_07310 [Marinomonas sp. MED121]
gi|86165205|gb|EAQ66473.1| hypothetical protein MED121_07310 [Marinomonas sp. MED121]
Length=215
Score = 54.7 bits (130), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 42/146 (29%), Positives = 65/146 (45%), Gaps = 25/146 (17%)
Query 85 GWAVQHV--ECERRPFAYTVG-LTRRGLPELVVTGLSPRRGQRLLNIAARRALVG----- 136
GW H+ E + F++++G + PEL++ GL +LLNIA + +VG
Sbjct 76 GWYNLHIGQEDNQAAFSFSIGHFQQHNHPELILVGLPAEVANQLLNIAVVK-IVGAKERL 134
Query 137 ------DLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGD---KVTALQLVWADRR 187
D T G+ V++ +L A +GD LQ+VW DR
Sbjct 135 EPYKKYDDFTEGLAVAFIP------VELDFYRNYLGYANWYYGDLPKPYPVLQMVWPDRE 188
Query 188 GRWPWAADFDEG-RGTQPVLGMRATR 212
G +PW A+FD + QP+LG +
Sbjct 189 GYFPWDAEFDTSFKQAQPLLGFGPNK 214
>gi|312197463|ref|YP_004017524.1| hypothetical protein FraEuI1c_3647 [Frankia sp. EuI1c]
gi|311228799|gb|ADP81654.1| hypothetical protein FraEuI1c_3647 [Frankia sp. EuI1c]
Length=246
Score = 54.7 bits (130), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 51/157 (33%), Positives = 68/157 (44%), Gaps = 17/157 (10%)
Query 62 MCDHPEATAEEYLDEVYGIMLMHGWAVQH---VECERRPFAYTVGLTRRG-LPELVVTGL 117
+C+ E + +DE + HGWA+Q + C R AYTVGLT PEL++TGL
Sbjct 46 LCEQFETRYDALIDEA---IAAHGWALQAAPALHCRPR-LAYTVGLTAYDRHPELIITGL 101
Query 118 SPRRGQRLLNIAARRALVGDLLTPGMQ-TTLPAGPLVETVQVTHPDAHLYCAIAIF---- 172
R+LN+ G L Q P P + + V PD +A
Sbjct 102 RSHVAARILNVLCDHVRDGQRLGTRQQCADFPGWPRLALLDVD-PDNSGDLLVAANRRYQ 160
Query 173 ---GDKVTALQLVWADRRGRWPWAADFDEGRGTQPVL 206
G V ALQ++W D G PW + R QPVL
Sbjct 161 PTDGPPVDALQVIWCDPAGNLPWEPGWVLPRDAQPVL 197
>gi|171915661|ref|ZP_02931131.1| hypothetical protein VspiD_30860 [Verrucomicrobium spinosum DSM
4136]
Length=167
Score = 54.7 bits (130), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/122 (32%), Positives = 58/122 (48%), Gaps = 8/122 (6%)
Query 84 HGWAVQHV--ECERRPFAYTVGLTRRGL-PELVVTGLSPRRGQRLLNIAARRALVGDLLT 140
HGW + H+ E + FA+++G + L PE++V GL + LLN L G +L+
Sbjct 33 HGWHLMHIGPEGDLPQFAFSIGFYYQFLQPEVLVMGLGVEKSANLLNHIGETLLSGKVLS 92
Query 141 PGMQTTLPAGPLVE--TVQVTHPDAHLYCAIAIF---GDKVTALQLVWADRRGRWPWAAD 195
PG AG VE V + H HL AI + A+Q + D+ G++P
Sbjct 93 PGRDAEYMAGYPVEFRPVHIAHYREHLGYAIWFYRSLPQAFPAMQCLLPDKAGKFPGDEG 152
Query 196 FD 197
+D
Sbjct 153 YD 154
>gi|291008209|ref|ZP_06566182.1| hypothetical protein SeryN2_27121 [Saccharopolyspora erythraea
NRRL 2338]
Length=168
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/133 (37%), Positives = 61/133 (46%), Gaps = 14/133 (10%)
Query 85 GWAVQHVECERR--PFAYTVGLTRR-GLPELVVTGLSPRRGQRLLNIAARRALVGDLLTP 141
G AV HV + P+A++VG RR G PE V GL ++N RRA G+ P
Sbjct 23 GAAVMHVAGDEHGAPYAFSVGAWRRFGKPEAVTIGLPKDVAHSVINTYVRRAAGGERFKP 82
Query 142 GMQTTLPAGPL------VETVQVTHPDAHLYCAIAIFGD-KVTALQLVWADRRGRWPWAA 194
G L G L VE V H L A ++GD A+QL+ A G++PW
Sbjct 83 GQ---LYDGFLDGCWMTVEKVAKQHYPEFLGSAFLVYGDGDFPAVQLIAATPDGKFPWHD 139
Query 195 DFDEGRGT-QPVL 206
D G QPVL
Sbjct 140 DAPGGFAEYQPVL 152
>gi|134098594|ref|YP_001104255.1| hypothetical protein SACE_2020 [Saccharopolyspora erythraea NRRL
2338]
gi|133911217|emb|CAM01330.1| hypothetical protein SACE_2020 [Saccharopolyspora erythraea NRRL
2338]
Length=165
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/133 (37%), Positives = 61/133 (46%), Gaps = 14/133 (10%)
Query 85 GWAVQHVECERR--PFAYTVGLTRR-GLPELVVTGLSPRRGQRLLNIAARRALVGDLLTP 141
G AV HV + P+A++VG RR G PE V GL ++N RRA G+ P
Sbjct 20 GAAVMHVAGDEHGAPYAFSVGAWRRFGKPEAVTIGLPKDVAHSVINTYVRRAAGGERFKP 79
Query 142 GMQTTLPAGPL------VETVQVTHPDAHLYCAIAIFGD-KVTALQLVWADRRGRWPWAA 194
G L G L VE V H L A ++GD A+QL+ A G++PW
Sbjct 80 GQ---LYDGFLDGCWMTVEKVAKQHYPEFLGSAFLVYGDGDFPAVQLIAATPDGKFPWHD 136
Query 195 DFDEGRGT-QPVL 206
D G QPVL
Sbjct 137 DAPGGFAEYQPVL 149
>gi|343927728|ref|ZP_08767196.1| hypothetical protein GOALK_097_01500 [Gordonia alkanivorans NBRC
16433]
gi|343762369|dbj|GAA14122.1| hypothetical protein GOALK_097_01500 [Gordonia alkanivorans NBRC
16433]
Length=199
Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/118 (34%), Positives = 53/118 (45%), Gaps = 12/118 (10%)
Query 98 FAYTVGLTRRGLPELVVTGLSPRRGQRLLNIAAR-------RALVGDLLTPGMQTTLPAG 150
F+YT GL+ +PEL + G+ P +LN R LV D +QT +
Sbjct 53 FSYTAGLSLHSIPELAIYGVDPLTAHHILNELGDLLHREDWRDLVADQSDIRLQTVAVSV 112
Query 151 PLVETVQVTHPDAHLYCAIAIFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLGM 208
L+E V L A +F D T LQ+VW D GR+PW + QPV G+
Sbjct 113 RLIEQVD----KDELILANLLFPDYPT-LQVVWPDEYGRFPWEEGYILLPMHQPVKGI 165
>gi|108802582|ref|YP_642778.1| hypothetical protein Mmcs_5622 [Mycobacterium sp. MCS]
gi|119855193|ref|YP_935796.1| hypothetical protein Mkms_5806 [Mycobacterium sp. KMS]
gi|108773001|gb|ABG11722.1| hypothetical protein Mmcs_5622 [Mycobacterium sp. MCS]
gi|119697910|gb|ABL94981.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=272
Score = 52.0 bits (123), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 42/112 (38%), Positives = 50/112 (45%), Gaps = 2/112 (1%)
Query 98 FAYTVGLTRRGLPELVVTGLSPRRGQRLLNIAARRAL-VGDLLTPGMQTTLPAGPLVETV 156
FAYTVGL+ + LPEL + GL LLN ARR + G L G + V V
Sbjct 47 FAYTVGLSAQSLPELAIYGLPGPVAHSLLNEVARRIVAAGQGLATGDRIEGVLVDDVALV 106
Query 157 QVTHPDA-HLYCAIAIFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVLG 207
V DA L +G A+QLVW D G PW G QP+ G
Sbjct 107 AVEMTDARDLNLVRECYGAVAAAVQLVWPDADGVLPWEQGSRVGGAEQPLRG 158
>gi|146300657|ref|YP_001195248.1| hypothetical protein Fjoh_2908 [Flavobacterium johnsoniae UW101]
gi|146155075|gb|ABQ05929.1| hypothetical protein Fjoh_2908 [Flavobacterium johnsoniae UW101]
Length=256
Score = 52.0 bits (123), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/129 (31%), Positives = 63/129 (49%), Gaps = 11/129 (8%)
Query 76 EVYGIMLMHGWAVQHVECERRPFAYTVGL-TRRGLPELVVTGLSPRRGQRLLNIAARRAL 134
E YG+ ++ A ++ FAY++GL PE++ GLS ++N A
Sbjct 24 EKYGLQVILIEATDYLPS----FAYSIGLWKEYNHPEIICFGLSTSLLHTIINDVAEIIK 79
Query 135 VGDLLTPGMQ-TTLPAGPLVETVQVTHPDAHL-YCAIAI-FGDK--VTALQLVWADRRGR 189
+ + G T + E ++V HP+ L Y AI F ++ + ALQLVW DR +
Sbjct 80 KNETIVEGKNYTNIFKNSRAEFLKV-HPNNILDYFGTAINFYEREDIPALQLVWTDRSNK 138
Query 190 WPWAADFDE 198
+PW +F+E
Sbjct 139 FPWEENFEE 147
>gi|288918036|ref|ZP_06412394.1| hypothetical protein FrEUN1fDRAFT_2090 [Frankia sp. EUN1f]
gi|288350554|gb|EFC84773.1| hypothetical protein FrEUN1fDRAFT_2090 [Frankia sp. EUN1f]
Length=219
Score = 51.6 bits (122), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 50/151 (34%), Positives = 69/151 (46%), Gaps = 19/151 (12%)
Query 58 EMCWMCDHPEATA------EEYLDEVYGIMLMHGWAVQHV---ECERRPFAYTVGL-TRR 107
E C + P ATA + LD+ I+ GWAVQ V + +AYT+GL
Sbjct 16 ETCAADNDPAATAAWIASQDALLDQ---ILRTRGWAVQPVLDDGPDEPAYAYTIGLFAFD 72
Query 108 GLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVT--HPDAHL 165
PELVV+GL + +L++ R + L G + TL VE ++T D L
Sbjct 73 SHPELVVSGLRDDQATSVLDLLGERVRRHERLHDGQRLTLAPLLTVELREITPFASDQLL 132
Query 166 YCAIAIF----GDKVTALQLVWADRRGRWPW 192
A +++ G V LQ VWAD G PW
Sbjct 133 LGANSLYRHPDGPAVPGLQAVWADHTGSLPW 163
>gi|262203574|ref|YP_003274782.1| hypothetical protein Gbro_3705 [Gordonia bronchialis DSM 43247]
gi|262086921|gb|ACY22889.1| hypothetical protein Gbro_3705 [Gordonia bronchialis DSM 43247]
Length=201
Score = 51.6 bits (122), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 43/131 (33%), Positives = 56/131 (43%), Gaps = 14/131 (10%)
Query 87 AVQHVECERR--PFAYTVGLTRRGLPELVVTGLSPRRGQRLLNIAAR-------RALVGD 137
A V C R FAYT GLT G+PEL V GL + LLN A R LV
Sbjct 42 ACSSVGCSRPDCAFAYTAGLTLHGIPELAVYGLPSNTSRALLNELAGLLHQHDWRTLVHS 101
Query 138 LLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTALQLVWADRRGRWPWAADFD 197
+T L+E + + A +F D ALQ+VW D G +PW ++
Sbjct 102 HTEVTSRTMAAPVRLIEAIDTD----DMLMANLLFADS-PALQVVWPDDNGHYPWQDEYT 156
Query 198 EGRGTQPVLGM 208
QP+ G+
Sbjct 157 LLPLHQPLKGI 167
>gi|254381320|ref|ZP_04996685.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194340230|gb|EDX21196.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length=190
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/146 (31%), Positives = 66/146 (46%), Gaps = 6/146 (4%)
Query 76 EVYGIMLMHGWAVQHVECERRP--FAYTVGLTR-RGLPELVVTGLSPRRGQRLLNIAARR 132
V ++ HGW V V + + +AYTVGL +PEL + GL R Q +LN +R
Sbjct 38 SVVDVIRQHGWQVSMVPADGQGPGWAYTVGLWHCHRMPELAMFGLDVRLMQTVLNDLGQR 97
Query 133 ALVGDLLTPGMQ-TTLPAGPLV-ETVQVTHPDAHLYCAIAIF-GDKVTALQLVWADRRGR 189
A+ G L G + + + PLV V A AI+ + LQ+VW +R G
Sbjct 98 AVEGQPLEAGQEWHDVASVPLVLRPVDYRWYKAFFGTAISYYRKPPFPVLQVVWPNRDGA 157
Query 190 WPWAADFDEGRGTQPVLGMRATRRSA 215
+PW ++ QP L + A
Sbjct 158 FPWQPGGEDALSHQPRLDLHPDEHPA 183
>gi|319787513|ref|YP_004146988.1| hypothetical protein Psesu_1916 [Pseudoxanthomonas suwonensis
11-1]
gi|317466025|gb|ADV27757.1| hypothetical protein Psesu_1916 [Pseudoxanthomonas suwonensis
11-1]
Length=152
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/131 (27%), Positives = 63/131 (49%), Gaps = 9/131 (6%)
Query 84 HGWAVQHVECERR---PFAYTVGLTRR-GLPELVVTGLSPRRGQRLLNIAARRALVGDLL 139
+GW HV + F+Y++G + G PE+++ GL + LLN A G ++
Sbjct 19 YGWHCLHVFPAKEGQDKFSYSIGFGKSYGSPEVLIFGLEREKAHALLNECAHLLKGGHII 78
Query 140 TPGMQT-TLPAGPLVETVQVTHPD---AHLYCAIAIFGDK-VTALQLVWADRRGRWPWAA 194
PG++ ++ AG + PD +L A+ + DK +A+ + DR+ R+PW
Sbjct 79 VPGVEDGSVLAGDYKVVFKSVRPDRFGEYLGTAVRYYKDKPFSAVVMFLPDRQHRFPWHQ 138
Query 195 DFDEGRGTQPV 205
+D +P+
Sbjct 139 GYDYIPAGEPL 149
>gi|284989913|ref|YP_003408467.1| hypothetical protein Gobs_1361 [Geodermatophilus obscurus DSM
43160]
gi|284063158|gb|ADB74096.1| hypothetical protein Gobs_1361 [Geodermatophilus obscurus DSM
43160]
Length=206
Score = 48.5 bits (114), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/137 (31%), Positives = 57/137 (42%), Gaps = 9/137 (6%)
Query 80 IMLMHGWAVQHV---ECERRP-FAYTVGLTRRGLPELVVTGLSPRRGQRLLNIAARRALV 135
++ H WAVQ+V E + P F YT+GL G PELV+ GL +L A
Sbjct 21 VVRQHRWAVQYVGSGEEDDEPCFGYTIGLFGLGHPELVLVGLGADTTHGVLQRVAGEVAA 80
Query 136 GDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDK-----VTALQLVWADRRGRW 190
G L PG P V+ + + F + V+A QL W+ G +
Sbjct 81 GRDLVPGELIDRDDRPGRLFVEDSPNPGEVVLGANRFYQRPPEYSVSAFQLAWSHADGHF 140
Query 191 PWAADFDEGRGTQPVLG 207
W A + G G QP G
Sbjct 141 LWEAGYPCGPGCQPRPG 157
>gi|330983433|gb|EGH81536.1| hypothetical protein PLA107_00285 [Pseudomonas syringae pv. lachrymans
str. M301315]
Length=150
Score = 48.5 bits (114), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 47/147 (32%), Positives = 64/147 (44%), Gaps = 24/147 (16%)
Query 76 EVYGIMLMHGWAVQHVECERRPFAYTVGLTRRGLPELVVTGLSPRRGQRLLNIAARRALV 135
E YG+ + + + + R FAYT+G+T G PEL+V GL + N V
Sbjct 13 EKYGLAIQFAFPTEEDQGPR--FAYTIGMTDIGHPELLVIGLPDELAGLVFN------QV 64
Query 136 GDLLTPGMQTTLPAGPLVETV-----QVTHPDAHLYCAIAIFGDKVTAL--------QLV 182
D L G +T A L+E + QV D A I GD+ + QL+
Sbjct 65 HDELRTGQRTG--AELLIEKILSVPLQVHATDPVKSSAYTIQGDEYYRIRGLMPVYSQLI 122
Query 183 WADRRGRWPWAADFDEG-RGTQPVLGM 208
W D G +P FDE R QP LG+
Sbjct 123 WPDPAGVYPHQDGFDEDMREIQPYLGI 149
>gi|189426806|ref|YP_001949905.1| hypothetical protein RSL1_gp030 [Ralstonia phage RSL1]
gi|189233118|dbj|BAG41475.1| hypothetical protein [Ralstonia phage RSL1]
Length=159
Score = 48.1 bits (113), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 40/115 (35%), Positives = 54/115 (47%), Gaps = 6/115 (5%)
Query 97 PFAYTVGLTRRGLPELVVTG-LSPRRGQRLLN-IAARRALVGDLLTPGMQTTLPAGPLVE 154
PF YTVGLT +G PE++ TG LS R Q L + + G G++ L E
Sbjct 41 PFMYTVGLTAKGWPEIIATGNLSVRAMQWCLGAVVSTMEKEGADFRTGIRHDL-FNFKCE 99
Query 155 TVQVTHPDAHLYCAIA---IFGDKVTALQLVWADRRGRWPWAADFDEGRGTQPVL 206
VT + + A+ ++GD V LQLVW D + R P +D R Q V
Sbjct 100 LRWVTSEELRMEYAVHATRLYGDNVRVLQLVWTDDQNRLPDEPGYDAQRFIQQVF 154
>gi|298251298|ref|ZP_06975101.1| conserved hypothetical protein [Ktedonobacter racemifer DSM 44963]
gi|297545890|gb|EFH79758.1| conserved hypothetical protein [Ktedonobacter racemifer DSM 44963]
Length=160
Score = 48.1 bits (113), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 44/136 (33%), Positives = 68/136 (50%), Gaps = 16/136 (11%)
Query 84 HGWAVQHV--ECERRP-FAYTVGL--TRRGLPELVVTGLSPRRGQRLLNIAARRALVGDL 138
HG+++ V E+ P F YT+GL TRR LPE+ + GL + +LLN+ A+ L G
Sbjct 15 HGFSMITVGDPDEQLPMFGYTIGLYHTRR-LPEVFMIGLPQQSLMQLLNLIAQNMLSGTP 73
Query 139 LTPGMQTT------LPAGPLVETVQVTHPDAHLYCAIAIFG-DKVTALQLVWADRRGRWP 191
G TT P TV + D ++ A+ + + LQ VW+D++ R+P
Sbjct 74 YEAGQITTDLIKNGFPC--FFGTVASMYYDEYVGQAMNYYAVESFPLLQCVWSDKQQRFP 131
Query 192 WAADFDE-GRGTQPVL 206
W + + R QP+L
Sbjct 132 WQPEAEAWFRTRQPLL 147
>gi|333892736|ref|YP_004466611.1| hypothetical protein ambt_06360 [Alteromonas sp. SN2]
gi|332992754|gb|AEF02809.1| hypothetical protein ambt_06360 [Alteromonas sp. SN2]
Length=139
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/130 (26%), Positives = 59/130 (46%), Gaps = 11/130 (8%)
Query 84 HGWAVQHVECERRP-FAYTVGLTRR-GLPELVVTGLSPRRGQRLLNIAARRALVGDLLTP 141
HGW V V + P F+Y++G T PE++++GL L+N + G T
Sbjct 15 HGWHVLSVFSKDAPSFSYSIGFTETLDHPEIIMSGLDTSLMHSLINDIGQLIRNGQRFTN 74
Query 142 GM--QTTLPAGPL-VETVQVTHPDAHLYCAIAIFG-DKVTALQLVWADRRGRWPWAADFD 197
+ + P+ + + + +L A++I+ +K ALQ +W D+ G++ +
Sbjct 75 NQLSEEVIKGYPVKFSKISELNKEEYLRAAVSIYSIEKFDALQCIWPDKEGKFQ-----E 129
Query 198 EGRGTQPVLG 207
E Q VL
Sbjct 130 ESNTAQEVLS 139
>gi|308178589|ref|YP_003917995.1| hypothetical protein AARI_28190 [Arthrobacter arilaitensis Re117]
gi|307746052|emb|CBT77024.1| hypothetical protein AARI_28190 [Arthrobacter arilaitensis Re117]
Length=142
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/76 (41%), Positives = 41/76 (54%), Gaps = 14/76 (18%)
Query 59 MCWMCD-----HPEATAEEYLDEVYGIMLMHGWAVQHVECER--RPFAYTVGLTRRGLPE 111
MC MC+ EA A+ + + HG V VE +R +PFAYTVGL+R G PE
Sbjct 1 MCDMCNGMTRKQVEAKADRQIRD-------HGRVVIFVEPDRMSQPFAYTVGLSRIGHPE 53
Query 112 LVVTGLSPRRGQRLLN 127
+V GL+ +LLN
Sbjct 54 FIVRGLNAEDSIQLLN 69
>gi|312887878|ref|ZP_07747465.1| conserved hypothetical protein [Mucilaginibacter paludis DSM
18603]
gi|311299697|gb|EFQ76779.1| conserved hypothetical protein [Mucilaginibacter paludis DSM
18603]
Length=149
Score = 46.2 bits (108), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 66/148 (45%), Gaps = 11/148 (7%)
Query 67 EATAEEYLDEVYGIMLMHGWAVQHV--ECERRPFAYTVGLTRR-GLPELVVTGLSPR-RG 122
+ E+Y ++VY + G+ V E + PFAY+ G+ + +PEL ++GL P G
Sbjct 3 DKKKEDYFNKVYKNIKNKGYHTTAVLEEIDFTPFAYSTGIFKNFKIPELFISGLGPNLSG 62
Query 123 QRLLNIAARRALVGDLLTPGMQTTLPAGPLVETVQVTHPDAHLYCAIAI-F--GDKVTAL 179
+ + N ++ L +Q L V + + + D Y ++ F L
Sbjct 63 ELIENYVSKFKFAEVPLHRKIQ-NLSDRFAVYFISLKNSDVEEYALTSVKFYENSNYEYL 121
Query 180 QLVWADRRGRWPWAADFDEGRGTQPVLG 207
QL++ D G++P ++ Q VLG
Sbjct 122 QLIFPDLNGKFPNEVGYNYD---QKVLG 146
>gi|323500130|ref|ZP_08105076.1| hypothetical protein VISI1226_09114 [Vibrio sinaloensis DSM 21326]
gi|323314799|gb|EGA67864.1| hypothetical protein VISI1226_09114 [Vibrio sinaloensis DSM 21326]
Length=148
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 30/123 (25%), Positives = 54/123 (44%), Gaps = 11/123 (8%)
Query 76 EVYGIMLMHGWAVQHVECERRP-FAYTVGLTR-RGLPELVVTGLSPRRGQRLLNIAARRA 133
E YG ++H +E + P F+Y++G+ + PE+++TGL+ ++N R
Sbjct 13 EQYGCHILHV-----MEEDEYPGFSYSIGIEKTSSQPEIIITGLNQEVAHWIVNEYNNRV 67
Query 134 LVGDLLTPGMQTTLPAGPLVETVQVTHPDAHL-YCAIAIF---GDKVTALQLVWADRRGR 189
G++ P + T + P+ + Y A + G LQ ++ D G
Sbjct 68 KAGEIFKPDEYYSGFLEGFDITFKEVSPEYYAEYFGWANWLYKGKNFKVLQFIYPDTSGV 127
Query 190 WPW 192
WPW
Sbjct 128 WPW 130
>gi|256424624|ref|YP_003125277.1| hypothetical protein Cpin_5652 [Chitinophaga pinensis DSM 2588]
gi|256039532|gb|ACU63076.1| hypothetical protein Cpin_5652 [Chitinophaga pinensis DSM 2588]
Length=257
Score = 44.7 bits (104), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/115 (29%), Positives = 53/115 (47%), Gaps = 6/115 (5%)
Query 98 FAYTVGLTRR-GLPELVVTGLSPRRGQRLLNIAARRALVG-DLLTPGMQTTLPAGPLVET 155
FAYT+GL + G PE++ GL + LLN AA G +T + T ++
Sbjct 39 FAYTIGLYKTFGQPEIICFGLPVKTMAGLLNDAADIIREGGSFVTGKLYATFLVDYYIQF 98
Query 156 VQVTHPDAHLYCAIAIFGD---KVTALQLVWADRRGRWPWAADFD-EGRGTQPVL 206
++V Y A + + LQ VW D++ +PW F+ + + QP+L
Sbjct 99 LEVNKASYRDYVGYAGWFNGNFDFPLLQFVWPDKQHHFPWEESFNPDWQFLQPLL 153
>gi|169630193|ref|YP_001703842.1| hypothetical protein MAB_3111 [Mycobacterium abscessus ATCC 19977]
gi|169242160|emb|CAM63188.1| Hypothetical protein MAB_3111 [Mycobacterium abscessus]
Length=190
Score = 43.9 bits (102), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 72/150 (48%), Gaps = 18/150 (12%)
Query 76 EVYGIMLMHGWAVQHV----ECERRPFAYTVGL--TRRGLPELVVTGLSP-RRGQRLLNI 128
++ G + +GW+ + E PFAYTVGL T R LPEL + G++ QR LN
Sbjct 27 DIIGSVTEYGWSALGIGPTSSEESPPFAYTVGLWHTMR-LPELAIYGVNDITMMQRALNA 85
Query 129 AARRALVGDLLTPGMQ-TTLPAGPLVET--VQVTHPDAHLYCAIAIFG------DKVTAL 179
A++A G +L G + A P V+ V+++ D Y FG + V L
Sbjct 86 VAKQAQEGRVLQVGETFADVLALPDVDDYRVKLSPIDPSWYDNEFGFGLWFNRTNHVRYL 145
Query 180 QLVWADRRGRWPWAADFD-EGRGTQPVLGM 208
Q++W D GR+P + D QP++ M
Sbjct 146 QILWPDGAGRFPGNPELDPHFDDRQPLMWM 175
>gi|269126019|ref|YP_003299389.1| hypothetical protein Tcur_1777 [Thermomonospora curvata DSM 43183]
gi|268310977|gb|ACY97351.1| hypothetical protein Tcur_1777 [Thermomonospora curvata DSM 43183]
Length=177
Score = 43.5 bits (101), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 48/132 (37%), Positives = 58/132 (44%), Gaps = 10/132 (7%)
Query 84 HGWAVQHVEC-ERRP-FAYTVGL--TRRGLPELVVTGLSPRRGQRLLNIAARRALVGDLL 139
+GW+V E RP +A+T GL T R PELVV GL P Q ++N RA G L
Sbjct 33 YGWSVILTSPRENRPGWAFTAGLWHTLRS-PELVVFGLEPYDMQTIVNNLGDRAAAGHPL 91
Query 140 TPGMQ--TTLPAGPLVETVQVTHPDAHLYCAIAIF--GDKVTALQLVWADRRGRWPWAAD 195
G + P+V TH L F + LQ VW D GR+PW A
Sbjct 92 VAGQERRDATDRHPVVLRPVHTHWYERLLSEALRFYRHPPLPFLQAVWPDAAGRYPWQAG 151
Query 196 FDEGRG-TQPVL 206
D G QP L
Sbjct 152 SDPALGRYQPSL 163
>gi|300787176|ref|YP_003767467.1| hypothetical protein AMED_5303 [Amycolatopsis mediterranei U32]
gi|299796690|gb|ADJ47065.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340528675|gb|AEK43880.1| hypothetical protein RAM_27015 [Amycolatopsis mediterranei S699]
Length=180
Score = 43.5 bits (101), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 33/97 (35%), Positives = 44/97 (46%), Gaps = 4/97 (4%)
Query 107 RGLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPG--MQTTLPAGPLV-ETVQVTHPDA 163
+PE VV GL + GQ LL+ RA G++ G P+V E V H
Sbjct 60 HNVPEAVVVGLPGQMGQVLLDAYVDRAANGEIFEVGRRYDDFFDGVPVVLERVNRGHYPE 119
Query 164 HLYCAIAIFGD-KVTALQLVWADRRGRWPWAADFDEG 199
+ A I+ D ALQL+ A G++PW D EG
Sbjct 120 YFGTAFLIYPDGDFPALQLIVATPEGKFPWHPDAPEG 156
>gi|220925527|ref|YP_002500829.1| hypothetical protein Mnod_5689 [Methylobacterium nodulans ORS
2060]
gi|219950134|gb|ACL60526.1| hypothetical protein Mnod_5689 [Methylobacterium nodulans ORS
2060]
Length=271
Score = 43.1 bits (100), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 36/115 (32%), Positives = 49/115 (43%), Gaps = 25/115 (21%)
Query 98 FAYTVGLTRRGLPELVVTGLS--------------PRRGQRLLN-IAARRALVGDLLTPG 142
F YTVG T GLPEL++ G + + G+R +N I R + G +
Sbjct 139 FRYTVGFTELGLPELLIVGQTRKLARHMLEHLLKDHKSGKRPINPIDGFRTVAGGHVC-- 196
Query 143 MQTTLPAGPLVETVQVTHPDAHLYCAIAIFGDKVTALQLVWADRRGRWPWAADFD 197
M LP TV ++ A + V LQ+V D RGR+PW FD
Sbjct 197 MLRQLPKSKANNTV--------VFQARDYYRRHVGVLQVVLPDSRGRYPWDIRFD 243
>gi|121603288|ref|YP_980617.1| hypothetical protein Pnap_0373 [Polaromonas naphthalenivorans
CJ2]
gi|120592257|gb|ABM35696.1| hypothetical protein Pnap_0373 [Polaromonas naphthalenivorans
CJ2]
Length=151
Score = 42.7 bits (99), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 36/135 (27%), Positives = 57/135 (43%), Gaps = 9/135 (6%)
Query 78 YGIMLMHGWAVQHVECERRPFAYTVGLTRR-GLPELVVTGLSPRRGQRLLNIAARRALVG 136
YG +MH V + + FAY++G+ + G PE V GL ++N RR G
Sbjct 16 YGCSVMH---VFDADGDLPSFAYSIGIQQETGAPEAFVIGLKRPMAHSVINEYNRRTREG 72
Query 137 DLLTPGMQTTLPAGPL---VETVQVTHPDAHLYCAIAIF-GDKVTALQLVWADRRGRWPW 192
+ G G + V + D + I + G + +Q+++ +G WPW
Sbjct 73 ERFEIGKYYAGFLGGFEVCIGAVPRSTYDEYFGQNIDFYDGREFDVVQIIYPTTKGVWPW 132
Query 193 AADFDEGR-GTQPVL 206
A D E QP+L
Sbjct 133 APDASEAFIQGQPIL 147
>gi|94499459|ref|ZP_01305996.1| hypothetical protein RED65_00460 [Oceanobacter sp. RED65]
gi|94428213|gb|EAT13186.1| hypothetical protein RED65_00460 [Oceanobacter sp. RED65]
Length=152
Score = 42.4 bits (98), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 50/124 (41%), Gaps = 15/124 (12%)
Query 94 ERRP-FAYTVGLTR-RGLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAGP 151
E+ P F Y++G+ + PEL++ GL ++N RR G+ PG
Sbjct 27 EKDPDFTYSIGIHKVESQPELIILGLRHELSSWIVNEYNRRIKEGERFVPGEYYE----G 82
Query 152 LVETVQVTHPDA--------HLYCAIAIFGDKVTALQLVWADRRGRWPWAADFDEG-RGT 202
+E Q+T + L C +QL++ +G WPW + EG +
Sbjct 83 FIEGFQITFQEVADKYKEEFMLSCNWLYGSINYPVMQLIFPSVKGVWPWEKEASEGFKKL 142
Query 203 QPVL 206
QP
Sbjct 143 QPSF 146
>gi|338780937|gb|EGP45334.1| hypothetical protein AXXA_16667 [Achromobacter xylosoxidans AXX-A]
Length=176
Score = 42.0 bits (97), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 53/120 (45%), Gaps = 7/120 (5%)
Query 79 GIMLMHGWA-VQHVECERRP-FAYTVGL-TRRGLPELVVTGLSPRRGQRLLNIAARRALV 135
G + HGW + E E +P F++T G G PE++V L P+ +L R
Sbjct 27 GQIREHGWFRTEIFESEGQPGFSFTTGFWVGHGFPEIIVFSLPPQVTHDVLWSLYRAVAA 86
Query 136 GDLLTPGMQTTLPAG---PLVETVQVTHPDAHL-YCAIAIFGDKVTALQLVWADRRGRWP 191
G+ G+ T G L+ V +H HL + GD +QL W D+ GR+P
Sbjct 87 GEPPPIGVPTAGIFGGFDALLAPVDKSHYPEHLGWNRWFHGGDDFPCVQLFWPDKSGRFP 146
>gi|302527854|ref|ZP_07280196.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302436749|gb|EFL08565.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=173
Score = 41.6 bits (96), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 33/105 (32%), Positives = 46/105 (44%), Gaps = 5/105 (4%)
Query 107 RGLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPG--MQTTLPAGPLV-ETVQVTHPDA 163
+PE VV GL LL+ R+ G++ G + P+V E V H
Sbjct 53 HNVPEAVVIGLPDHMAPVLLDAYVDRSANGEIFEVGKRYEDFFDGAPVVFERVAKGHYPE 112
Query 164 HLYCAIAIFGD-KVTALQLVWADRRGRWPWAADFDEGRGT-QPVL 206
+ A ++ D ALQ++ A G +PW AD EG QPVL
Sbjct 113 YFGSAFLVYPDGDFPALQMIVATPDGHFPWHADAPEGFAEWQPVL 157
>gi|149186353|ref|ZP_01864666.1| hypothetical protein ED21_22728 [Erythrobacter sp. SD-21]
gi|148829942|gb|EDL48380.1| hypothetical protein ED21_22728 [Erythrobacter sp. SD-21]
Length=167
Score = 41.2 bits (95), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 37/132 (29%), Positives = 52/132 (40%), Gaps = 9/132 (6%)
Query 84 HGWA---VQHVECERRPFAYTVGLTR-RGLPELVVTGLSPRRGQRLLNIAARRALVGDLL 139
HGW V +E + F Y+ G G PE++V L + + R G+
Sbjct 26 HGWFGTRVFDLEKQEPDFTYSTGFFHGLGHPEIIVFSLPKQVSHDIFWDIHRNIREGNFP 85
Query 140 TPGMQTTLPAGP---LVETVQVTHPDAHLYCAIAIF-GDKVTALQLVWADRRGRWPWAAD 195
P + + G + V HL + + D LQLVW DR G +PW D
Sbjct 86 KPETKLSGIFGKHQAVFVPVSRDFYAEHLGWSQWFYRSDNFPCLQLVWPDRAGIFPWQPD 145
Query 196 FDEGRGT-QPVL 206
FD + QP L
Sbjct 146 FDPAFASDQPDL 157
>gi|153005465|ref|YP_001379790.1| hypothetical protein Anae109_2605 [Anaeromyxobacter sp. Fw109-5]
gi|152029038|gb|ABS26806.1| hypothetical protein Anae109_2605 [Anaeromyxobacter sp. Fw109-5]
Length=262
Score = 40.4 bits (93), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 40/128 (32%), Positives = 53/128 (42%), Gaps = 6/128 (4%)
Query 85 GWAVQHVECERRPFAYTVGLTRR-GLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPG- 142
GW V R A+T+GL R PE+V+ G P + L+ R G+ G
Sbjct 129 GWHVVQAVETGRSHAFTIGLFRSFDHPEVVLFGFGPEIREAALDRLGARVRAGERFEDGG 188
Query 143 -MQTTLPAGPL-VETVQVTHPDAHL-YCAIAIFGDKVTALQLVWADRRGRWPWAADFDEG 199
L P+ V H A+L Y G + ALQ VW D GR+PW F
Sbjct 189 VADGILADRPVTFRVVARRHYLAYLGYAGWYHGGPRFPALQAVWPDAEGRFPWERWFSPA 248
Query 200 -RGTQPVL 206
R +P+L
Sbjct 249 LREAEPIL 256
Lambda K H
0.324 0.137 0.458
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 257507162856
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40