BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3749c
Length=169
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610885|ref|NP_218266.1| hypothetical protein Rv3749c [Mycob... 350 5e-95
gi|340628720|ref|YP_004747172.1| hypothetical protein MCAN_37691... 343 4e-93
gi|167970909|ref|ZP_02553186.1| hypothetical protein MtubH3_2383... 238 3e-61
gi|339300157|gb|AEJ52267.1| hypothetical protein CCDC5180_3430 [... 214 3e-54
gi|323717613|gb|EGB26815.1| hypothetical protein TMMG_00240 [Myc... 149 1e-34
gi|240169750|ref|ZP_04748409.1| hypothetical protein MkanA1_1058... 117 7e-25
gi|319950159|ref|ZP_08024094.1| hypothetical protein ES5_11361 [... 99.0 2e-19
gi|336120023|ref|YP_004574801.1| hypothetical protein MLP_43840 ... 88.6 3e-16
gi|269125750|ref|YP_003299120.1| hypothetical protein Tcur_1505 ... 87.8 5e-16
gi|184200164|ref|YP_001854371.1| hypothetical protein KRH_05180 ... 87.0 9e-16
gi|239988862|ref|ZP_04709526.1| hypothetical protein SrosN1_1624... 86.7 1e-15
gi|119512005|ref|ZP_01631101.1| hypothetical protein N9414_12513... 85.9 2e-15
gi|148243795|ref|YP_001220035.1| hypothetical protein Acry_3278 ... 81.3 4e-14
gi|294786190|ref|ZP_06751444.1| toxin-antitoxin system, toxin co... 76.6 1e-12
gi|226363765|ref|YP_002781547.1| hypothetical protein ROP_43550 ... 76.3 1e-12
gi|254410953|ref|ZP_05024731.1| hypothetical protein MC7420_431 ... 76.3 1e-12
gi|315227399|ref|ZP_07869186.1| conserved hypothetical protein [... 75.9 2e-12
gi|315504469|ref|YP_004083356.1| hypothetical protein ML5_3694 [... 75.5 2e-12
gi|117924004|ref|YP_864621.1| hypothetical protein Mmc1_0694 [Ma... 75.5 2e-12
gi|67920656|ref|ZP_00514176.1| conserved hypothetical protein [C... 74.7 4e-12
gi|338531238|ref|YP_004664572.1| hypothetical protein LILAB_0790... 74.3 5e-12
gi|344339851|ref|ZP_08770778.1| hypothetical protein ThimaDRAFT_... 72.8 1e-11
gi|229592490|ref|YP_002874609.1| hypothetical protein PFLU5104 [... 72.0 2e-11
gi|221196568|ref|ZP_03569615.1| conserved hypothetical protein [... 72.0 3e-11
gi|87120221|ref|ZP_01076116.1| hypothetical protein MED121_08518... 72.0 3e-11
gi|227506263|ref|ZP_03936312.1| conserved hypothetical protein [... 71.6 3e-11
gi|339501501|ref|YP_004688875.1| hypothetical protein RLO149_p83... 71.2 5e-11
gi|304311368|ref|YP_003810966.1| hypothetical protein HDN1F_1734... 71.2 5e-11
gi|152982517|ref|YP_001354376.1| hypothetical protein mma_2686 [... 70.9 5e-11
gi|225872993|ref|YP_002754452.1| hypothetical protein ACP_1362 [... 70.9 5e-11
gi|226359560|ref|YP_002777338.1| hypothetical protein ROP_01460 ... 70.9 7e-11
gi|120556291|ref|YP_960642.1| hypothetical protein Maqu_3384 [Ma... 70.5 8e-11
gi|188591385|ref|YP_001795984.1| hypothetical protein pRALTA_007... 70.5 8e-11
gi|111020496|ref|YP_703468.1| hypothetical protein RHA1_ro03507 ... 68.9 2e-10
gi|296448192|ref|ZP_06890090.1| conserved hypothetical protein [... 68.9 3e-10
gi|54022969|ref|YP_117211.1| hypothetical protein nfa10020 [Noca... 68.6 3e-10
gi|197286172|ref|YP_002152044.1| hypothetical protein PMI2326 [P... 68.2 4e-10
gi|339328513|ref|YP_004688205.1| hypothetical protein CNE_BB1p07... 67.8 6e-10
gi|53803201|ref|YP_115042.1| hypothetical protein MCA2638 [Methy... 67.4 7e-10
gi|255020302|ref|ZP_05292370.1| hypothetical protein ACA_2120 [A... 67.0 8e-10
gi|152997973|ref|YP_001342808.1| hypothetical protein Mmwyl1_397... 67.0 9e-10
gi|296123697|ref|YP_003631475.1| hypothetical protein Plim_3463 ... 66.2 1e-09
gi|225874811|ref|YP_002756270.1| hypothetical protein ACP_3270 [... 66.2 2e-09
gi|198282377|ref|YP_002218698.1| hypothetical protein Lferr_0233... 65.9 2e-09
gi|253995596|ref|YP_003047660.1| hypothetical protein Mmol_0223 ... 65.5 2e-09
gi|167585067|ref|ZP_02377455.1| hypothetical protein BuboB_07001... 65.5 3e-09
gi|289704754|ref|ZP_06501176.1| toxin-antitoxin system, toxin co... 65.1 3e-09
gi|126090285|ref|YP_001041740.1| hypothetical protein Sbal_4436 ... 65.1 3e-09
gi|227549096|ref|ZP_03979145.1| conserved hypothetical protein [... 65.1 3e-09
gi|317482997|ref|ZP_07942001.1| hypothetical protein HMPREF0177_... 65.1 3e-09
>gi|15610885|ref|NP_218266.1| hypothetical protein Rv3749c [Mycobacterium tuberculosis H37Rv]
gi|15843369|ref|NP_338406.1| hypothetical protein MT3856 [Mycobacterium tuberculosis CDC1551]
gi|31794919|ref|NP_857412.1| hypothetical protein Mb3775c [Mycobacterium bovis AF2122/97]
48 more sequence titles
Length=169
Score = 350 bits (897), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 169/169 (100%), Positives = 169/169 (100%), Gaps = 0/169 (0%)
Query 1 MPCCGSLTRAPIGLCGRRTSWPRLGEPWSTASTSAPNGLTTAFAFGYNDLIAAMNNHYKD 60
MPCCGSLTRAPIGLCGRRTSWPRLGEPWSTASTSAPNGLTTAFAFGYNDLIAAMNNHYKD
Sbjct 1 MPCCGSLTRAPIGLCGRRTSWPRLGEPWSTASTSAPNGLTTAFAFGYNDLIAAMNNHYKD 60
Query 61 RHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGM 120
RHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGM
Sbjct 61 RHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGM 120
Query 121 VDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFGLGVLLPFDQ 169
VDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFGLGVLLPFDQ
Sbjct 121 VDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFGLGVLLPFDQ 169
>gi|340628720|ref|YP_004747172.1| hypothetical protein MCAN_37691 [Mycobacterium canettii CIPT
140010059]
gi|340006910|emb|CCC46100.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=169
Score = 343 bits (881), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 167/169 (99%), Positives = 167/169 (99%), Gaps = 0/169 (0%)
Query 1 MPCCGSLTRAPIGLCGRRTSWPRLGEPWSTASTSAPNGLTTAFAFGYNDLIAAMNNHYKD 60
MPCCGSLTRAPIGLCGRRTSWPRLGEPWSTASTSAPNGLTTAFAFGYNDLIAAMNNH KD
Sbjct 1 MPCCGSLTRAPIGLCGRRTSWPRLGEPWSTASTSAPNGLTTAFAFGYNDLIAAMNNHDKD 60
Query 61 RHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGM 120
RHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGM
Sbjct 61 RHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGM 120
Query 121 VDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFGLGVLLPFDQ 169
VDAYIDPPFTPHSLLDALGEQVP FAAKARRLFPSGSPFGLGVLLPFDQ
Sbjct 121 VDAYIDPPFTPHSLLDALGEQVPHFAAKARRLFPSGSPFGLGVLLPFDQ 169
>gi|167970909|ref|ZP_02553186.1| hypothetical protein MtubH3_23830 [Mycobacterium tuberculosis
H37Ra]
gi|254552863|ref|ZP_05143310.1| hypothetical protein Mtube_20861 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|294995339|ref|ZP_06801030.1| hypothetical protein Mtub2_12721 [Mycobacterium tuberculosis
210]
gi|339296558|gb|AEJ48669.1| hypothetical protein CCDC5079_3480 [Mycobacterium tuberculosis
CCDC5079]
Length=116
Score = 238 bits (606), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 116/116 (100%), Positives = 116/116 (100%), Gaps = 0/116 (0%)
Query 54 MNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEAT 113
MNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEAT
Sbjct 1 MNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEAT 60
Query 114 KAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFGLGVLLPFDQ 169
KAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFGLGVLLPFDQ
Sbjct 61 KAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFGLGVLLPFDQ 116
>gi|339300157|gb|AEJ52267.1| hypothetical protein CCDC5180_3430 [Mycobacterium tuberculosis
CCDC5180]
Length=107
Score = 214 bits (545), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 106/107 (99%), Positives = 107/107 (100%), Gaps = 0/107 (0%)
Query 63 VLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVD 122
+LAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVD
Sbjct 1 MLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVD 60
Query 123 AYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFGLGVLLPFDQ 169
AYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFGLGVLLPFDQ
Sbjct 61 AYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFGLGVLLPFDQ 107
>gi|323717613|gb|EGB26815.1| hypothetical protein TMMG_00240 [Mycobacterium tuberculosis CDC1551A]
Length=74
Score = 149 bits (377), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/74 (99%), Positives = 74/74 (100%), Gaps = 0/74 (0%)
Query 96 LHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPS 155
+HPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPS
Sbjct 1 MHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPS 60
Query 156 GSPFGLGVLLPFDQ 169
GSPFGLGVLLPFDQ
Sbjct 61 GSPFGLGVLLPFDQ 74
>gi|240169750|ref|ZP_04748409.1| hypothetical protein MkanA1_10582 [Mycobacterium kansasii ATCC
12478]
Length=185
Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 73/110 (67%), Positives = 84/110 (77%), Gaps = 1/110 (0%)
Query 46 GYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQ 105
GY DLI M N+ KD+HVLAAAVRERAEVIVT +L FPDDALK Y I+ALHPDDFLLDQ
Sbjct 76 GYRDLIPVMKNNDKDKHVLAAAVRERAEVIVTFDLGGFPDDALKEYSIRALHPDDFLLDQ 135
Query 106 LDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ-VPQFAAKARRLFP 154
LDLY+EAT I GMVD++ +PPFT +L AL + P FAA+AR FP
Sbjct 136 LDLYQEATITAIRGMVDSWTNPPFTLDEILIALARRGAPNFAAEARLAFP 185
>gi|319950159|ref|ZP_08024094.1| hypothetical protein ES5_11361 [Dietzia cinnamea P4]
gi|319436173|gb|EFV91358.1| hypothetical protein ES5_11361 [Dietzia cinnamea P4]
Length=365
Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/131 (45%), Positives = 75/131 (58%), Gaps = 5/131 (3%)
Query 43 FAFGYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFL 102
F+ GY +LI++M N KDRHVLAAAVR A +IVT NLK FP AL+ + ++A+HPD+FL
Sbjct 42 FSSGYRELISSMVNDPKDRHVLAAAVRSGAGLIVTANLKDFPRSALEKFDVEAIHPDEFL 101
Query 103 LDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLF-----PSGS 157
D LDL+ + +V PP T LL +L QVP+F RR PSG
Sbjct 102 CDLLDLHTGIVFECMEVVVTRNAHPPRTVGELLVSLERQVPRFVDAVRRHLIAPDEPSGE 161
Query 158 PFGLGVLLPFD 168
G L P +
Sbjct 162 SMGAADLAPHE 172
>gi|336120023|ref|YP_004574801.1| hypothetical protein MLP_43840 [Microlunatus phosphovorus NM-1]
gi|334687813|dbj|BAK37398.1| hypothetical protein MLP_43840 [Microlunatus phosphovorus NM-1]
Length=183
Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/124 (43%), Positives = 67/124 (55%), Gaps = 2/124 (1%)
Query 29 STASTSAPNG-LTTAFAFGYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDA 87
+T SA G A GY DLI M N KDRH LAA +R EV+VT NL FP ++
Sbjct 57 TTKRISAMQGAFPDATVTGYEDLIKQMTNDPKDRHALAAGIRANCEVLVTFNLSDFPANS 116
Query 88 LKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ-VPQFA 146
+ Y I+ +HPD FLLDQLDLY V+ +++ Y P LL L + VP FA
Sbjct 117 VDAYDIEIVHPDKFLLDQLDLYPGPAVGVLRQLIEDYSSPVLEMEDLLVRLADAGVPGFA 176
Query 147 AKAR 150
+ R
Sbjct 177 SAVR 180
>gi|269125750|ref|YP_003299120.1| hypothetical protein Tcur_1505 [Thermomonospora curvata DSM 43183]
gi|268310708|gb|ACY97082.1| conserved hypothetical protein [Thermomonospora curvata DSM 43183]
Length=194
Score = 87.8 bits (216), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/120 (41%), Positives = 72/120 (60%), Gaps = 6/120 (5%)
Query 42 AFAFGYNDLIAAMNN-HYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDD 100
A GY LI A+ +DRHVLAAA++ +A+VIVT+NL+ FP D L+P+ ++A PDD
Sbjct 72 ALVEGYEPLIEALKLPDLEDRHVLAAAIKAQAQVIVTSNLRDFPADMLEPWNVEAKSPDD 131
Query 101 FLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALG-----EQVPQFAAKARRLFPS 155
F+LDQ+D + A + +VD+ +PP T +LDAL E V ++ R F +
Sbjct 132 FVLDQIDFDDRVVWACVQQIVDSRNNPPETVEDVLDALERAGLVESVAALRTRSHRGFST 191
>gi|184200164|ref|YP_001854371.1| hypothetical protein KRH_05180 [Kocuria rhizophila DC2201]
gi|183580394|dbj|BAG28865.1| hypothetical protein [Kocuria rhizophila DC2201]
Length=140
Score = 87.0 bits (214), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 48/111 (44%), Positives = 61/111 (55%), Gaps = 0/111 (0%)
Query 29 STASTSAPNGLTTAFAFGYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDAL 88
ST ++ + Y L+ AM N KDRHVLAAAVR RAE+IVT+N K FP AL
Sbjct 11 STTASGHGQAIQATLVENYESLVPAMTNDPKDRHVLAAAVRSRAELIVTSNQKDFPTTAL 70
Query 89 KPYQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALG 139
KPY + PDDFLLDQL+L + +V +P T + L+ L
Sbjct 71 KPYDVAVRSPDDFLLDQLNLDPSTVVRIAEEVVADMRNPTITWNGYLEGLA 121
>gi|239988862|ref|ZP_04709526.1| hypothetical protein SrosN1_16242 [Streptomyces roseosporus NRRL
11379]
gi|291445854|ref|ZP_06585244.1| predicted protein [Streptomyces roseosporus NRRL 15998]
gi|291348801|gb|EFE75705.1| predicted protein [Streptomyces roseosporus NRRL 15998]
Length=180
Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/136 (44%), Positives = 73/136 (54%), Gaps = 2/136 (1%)
Query 18 RTSWPRLGEPWSTASTSAPNG-LTTAFAFGYNDLIAAMNNHYKDRHVLAAAVRERAEVIV 76
R R+G + A G + GY LI AM N KDRHVLAAA+ A +V
Sbjct 43 RNVAERIGGQGAKGLVDAMAGAFPESMVSGYVALIPAMTNDPKDRHVLAAAIGGHAHAVV 102
Query 77 TTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLD 136
T NLK FP A +PY ++ LHPD+FLLD LDL V+ V++Y P PH LLD
Sbjct 103 TLNLKDFPAAAAEPYGVEVLHPDEFLLDMLDLAPVEVGTVLGTQVNSYRREPRDPHGLLD 162
Query 137 ALGE-QVPQFAAKARR 151
+ PQFAA+ RR
Sbjct 163 RISAGGAPQFAAEFRR 178
>gi|119512005|ref|ZP_01631101.1| hypothetical protein N9414_12513 [Nodularia spumigena CCY9414]
gi|119463356|gb|EAW44297.1| hypothetical protein N9414_12513 [Nodularia spumigena CCY9414]
Length=169
Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/101 (43%), Positives = 60/101 (60%), Gaps = 0/101 (0%)
Query 50 LIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLY 109
L+ M NH DRHVLA AV +A++IVT+NLKHF D L P+QI A PD+FL + +LY
Sbjct 65 LVEVMTNHPGDRHVLATAVSAKADIIVTSNLKHFQDHDLAPWQIIAQSPDEFLTELYNLY 124
Query 110 EEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKAR 150
+ V+ PP + +LL+ L +VP FA+ +
Sbjct 125 PQQVIEVLQRQSQGLQKPPMSFENLLELLSREVPTFASNIK 165
>gi|148243795|ref|YP_001220035.1| hypothetical protein Acry_3278 [Acidiphilium cryptum JF-5]
gi|146400358|gb|ABQ28893.1| hypothetical protein Acry_3278 [Acidiphilium cryptum JF-5]
Length=192
Score = 81.3 bits (199), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/111 (39%), Positives = 68/111 (62%), Gaps = 2/111 (1%)
Query 42 AFAFGYNDLIAAMN-NHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDD 100
A GY LI +++ DRHVLAAA+R A+ IVT NL FP +AL+P+ ++A+HPD+
Sbjct 76 ALVTGYESLIGSLDLPDQNDRHVLAAAIRCGAQHIVTQNLNDFPPEALEPFSVEAIHPDE 135
Query 101 FLLDQLDLYEEATKAVILGMVDAYIDPPFTPHS-LLDALGEQVPQFAAKAR 150
FL +LY ++ + +Y +PP+TP +LD + +P+ A++ R
Sbjct 136 FLSRTFELYPTEATEILGKLRRSYQNPPYTPSEFVLDLTAKGLPKLASQIR 186
>gi|294786190|ref|ZP_06751444.1| toxin-antitoxin system, toxin component, PIN family [Parascardovia
denticolens F0305]
gi|294485023|gb|EFG32657.1| toxin-antitoxin system, toxin component, PIN family [Parascardovia
denticolens F0305]
Length=188
Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/142 (36%), Positives = 69/142 (49%), Gaps = 5/142 (3%)
Query 17 RRTSWPRLGEPWSTASTSAPNG-LTTAFAFGYNDLIAAMNNHYKDRHVLAAAVRERAEVI 75
R PR+GE + + G A Y L + M DRH+LAAAV A +
Sbjct 43 ERVLEPRIGEKQVSKRIATMEGCFPDACVTNYESLTSMMTCDPNDRHILAAAVASPAYTL 102
Query 76 VTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLL 135
VT NLK FP+ + +PYQI+ PD FL D LDL+ + ++++Y PP + L
Sbjct 103 VTFNLKDFPESSYRPYQIEIKDPDCFLCDLLDLHHSEVATIAWDLMNSYKKPPVDINEYL 162
Query 136 DALGEQ-VPQFA---AKARRLF 153
L +PQF+ KA R F
Sbjct 163 GILRRSALPQFSQEIGKALRDF 184
>gi|226363765|ref|YP_002781547.1| hypothetical protein ROP_43550 [Rhodococcus opacus B4]
gi|226242254|dbj|BAH52602.1| hypothetical protein [Rhodococcus opacus B4]
Length=395
Score = 76.3 bits (186), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 53/108 (50%), Positives = 65/108 (61%), Gaps = 0/108 (0%)
Query 46 GYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQ 105
GY +LI AM N KDRHVLAAAVR A IVT +LK FP AL P+ ++A+HPD+FL DQ
Sbjct 79 GYRELIPAMTNDSKDRHVLAAAVRSGAAAIVTADLKDFPQAALDPFDVEAIHPDEFLCDQ 138
Query 106 LDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLF 153
LDL EA + +V PP T LL+ LG P+F + L
Sbjct 139 LDLDAEAVFECMHALVARNALPPRTVGELLELLGRLTPRFVDAVQALL 186
>gi|254410953|ref|ZP_05024731.1| hypothetical protein MC7420_431 [Microcoleus chthonoplastes PCC
7420]
gi|196182308|gb|EDX77294.1| hypothetical protein MC7420_431 [Microcoleus chthonoplastes PCC
7420]
Length=211
Score = 76.3 bits (186), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/105 (48%), Positives = 67/105 (64%), Gaps = 7/105 (6%)
Query 48 NDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLD 107
+ L+ M NH KDRHV+AAAV+ +AEVIVT NLK FP +L+P+ ++A HPDDFL L+
Sbjct 26 DQLVNLMTNHPKDRHVVAAAVKAKAEVIVTFNLKDFPQKSLEPFGVEAQHPDDFL---LE 82
Query 108 LYEEATKAVILGMV----DAYIDPPFTPHSLLDALGEQVPQFAAK 148
L E T + +L ++ +A PP T +L L QVP FA K
Sbjct 83 LCENCTASSLLQIIKEQAEALKRPPNTLKDVLGRLSRQVPNFAGK 127
>gi|315227399|ref|ZP_07869186.1| conserved hypothetical protein [Parascardovia denticolens DSM
10105]
gi|315119849|gb|EFT82982.1| conserved hypothetical protein [Parascardovia denticolens DSM
10105]
Length=160
Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/142 (36%), Positives = 69/142 (49%), Gaps = 5/142 (3%)
Query 17 RRTSWPRLGEPWSTASTSAPNG-LTTAFAFGYNDLIAAMNNHYKDRHVLAAAVRERAEVI 75
R PR+GE + + G A Y L + M DRH+LAAAV A +
Sbjct 15 ERVLEPRIGEKQVSKRIATMEGCFPDACVTNYESLTSMMTCDPNDRHILAAAVASPAYTL 74
Query 76 VTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLL 135
VT NLK FP+ + +PYQI+ PD FL D LDL+ + ++++Y PP + L
Sbjct 75 VTFNLKDFPESSYRPYQIEIKDPDCFLCDLLDLHHSEVATIAWDLMNSYKKPPVDINEYL 134
Query 136 DALGEQ-VPQFA---AKARRLF 153
L +PQF+ KA R F
Sbjct 135 GILRRSALPQFSQEIGKALRDF 156
>gi|315504469|ref|YP_004083356.1| hypothetical protein ML5_3694 [Micromonospora sp. L5]
gi|315411088|gb|ADU09205.1| hypothetical protein ML5_3694 [Micromonospora sp. L5]
Length=182
Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/111 (54%), Positives = 70/111 (64%), Gaps = 1/111 (0%)
Query 42 AFAFGYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDF 101
A GY LI M+N KDRHVLAAAVR AEV+VT N+ FP+ +LK + I A+HPDDF
Sbjct 70 AMVSGYESLIDGMSNDPKDRHVLAAAVRANAEVLVTYNVGDFPEPSLKSFDIVAVHPDDF 129
Query 102 LLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ-VPQFAAKARR 151
LLDQLDLY T V+ AY P T LL LG +P+FAA+ RR
Sbjct 130 LLDQLDLYPGQTLDVLHRQAAAYRREPTTVPGLLVLLGRTGLPRFAAEVRR 180
>gi|117924004|ref|YP_864621.1| hypothetical protein Mmc1_0694 [Magnetococcus sp. MC-1]
gi|117607760|gb|ABK43215.1| conserved hypothetical protein [Magnetococcus sp. MC-1]
Length=189
Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/97 (45%), Positives = 56/97 (58%), Gaps = 1/97 (1%)
Query 46 GYNDLIAAMNN-HYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLD 104
GY LI ++ DRHVLAAA+R AEVIVT NLK FP DAL + + A HPDDF+ +
Sbjct 77 GYEKLIPSLTLPDADDRHVLAAAIRSCAEVIVTFNLKDFPSDALNNFGVFAEHPDDFISN 136
Query 105 QLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ 141
LDL ++A + +PP T + +D L Q
Sbjct 137 LLDLNQKAVIRTVKNQRARLKNPPITSQNFIDTLRRQ 173
>gi|67920656|ref|ZP_00514176.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
gi|67858140|gb|EAM53379.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
Length=184
Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 42/99 (43%), Positives = 55/99 (56%), Gaps = 0/99 (0%)
Query 50 LIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLY 109
LI M+NH DRHVLAAA+ +A+VIV+ NLKHFP+ +L ++++A D FL DL
Sbjct 80 LIPCMDNHEGDRHVLAAALIAKAQVIVSDNLKHFPESSLNQFRVEAQSADTFLTFLWDLS 139
Query 110 EEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAK 148
E V+ V P T LLD L EQ P +
Sbjct 140 PETMVDVLRTQVKGKKKPKLTILDLLDRLKEQTPTLVQR 178
>gi|338531238|ref|YP_004664572.1| hypothetical protein LILAB_07905 [Myxococcus fulvus HW-1]
gi|337257334|gb|AEI63494.1| hypothetical protein LILAB_07905 [Myxococcus fulvus HW-1]
Length=192
Score = 74.3 bits (181), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 62/120 (52%), Gaps = 4/120 (3%)
Query 42 AFAFGYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDF 101
A GY L+ AM NH KDRHVLAAAV A+ IVT NL+ F + L P ++A D F
Sbjct 74 AMVSGYERLVPAMTNHPKDRHVLAAAVHIGAQTIVTNNLRDFGAEHL-PRAMQAQDADTF 132
Query 102 LLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLF---PSGSP 158
L + L + ++ + +PP T LLD L + VP F + R L P GSP
Sbjct 133 LQNLLSQSPSSMLELLRTQAEMLKNPPITFSRLLDGLAKSVPNFINEVRELLSPPPLGSP 192
>gi|344339851|ref|ZP_08770778.1| hypothetical protein ThimaDRAFT_2516 [Thiocapsa marina 5811]
gi|343800030|gb|EGV17977.1| hypothetical protein ThimaDRAFT_2516 [Thiocapsa marina 5811]
Length=189
Score = 72.8 bits (177), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/120 (39%), Positives = 62/120 (52%), Gaps = 5/120 (4%)
Query 23 RLGEPWSTASTSAPNGLTTAFAFGYNDLIAAMN-NHYKDRHVLAAAVRERAEVIVTTNLK 81
+L + + + P+ L T GY DLI + DRHVLAAA+R A+VIVTTNL+
Sbjct 58 KLHRTMTLINKAVPDCLVT----GYGDLIDRLFLPDPNDRHVLAAAIRSGAQVIVTTNLR 113
Query 82 HFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ 141
FP AL + I A HPDDF+LD DL A +PP + + +D + Q
Sbjct 114 DFPQKALADFDINARHPDDFILDLADLDPAVVTTAAKLQRAALKNPPLSAETFIDTMRRQ 173
>gi|229592490|ref|YP_002874609.1| hypothetical protein PFLU5104 [Pseudomonas fluorescens SBW25]
gi|229364356|emb|CAY52120.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25]
Length=191
Score = 72.0 bits (175), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/118 (39%), Positives = 59/118 (50%), Gaps = 1/118 (0%)
Query 25 GEPWSTASTSAPNGLTTAFAFGYNDLIAAMN-NHYKDRHVLAAAVRERAEVIVTTNLKHF 83
E + S + A GY DL ++++ DRHVLAAA+R +AEVIVT NLK F
Sbjct 58 AEQVNCTSAMMDKAIPDALVTGYEDLCSSLDLPDPDDRHVLAAAIRSKAEVIVTFNLKDF 117
Query 84 PDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ 141
P L Y I+ +HPDDF+ D DL + A A PP + L+ L Q
Sbjct 118 PPHILAAYDIEPMHPDDFISDLWDLDQAAVLGAAQKQRRALKRPPMAAGNYLEMLMRQ 175
>gi|221196568|ref|ZP_03569615.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
gi|221203237|ref|ZP_03576256.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
gi|221177171|gb|EEE09599.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
gi|221183122|gb|EEE15522.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
Length=160
Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/118 (40%), Positives = 58/118 (50%), Gaps = 1/118 (0%)
Query 25 GEPWSTASTSAPNGLTTAFAFGYNDLIAAMN-NHYKDRHVLAAAVRERAEVIVTTNLKHF 83
E S+ + A GY LIA + DRHVLAAA+R A VIVT N K F
Sbjct 27 AEQLDRTSSLMDRAIPDAVVTGYESLIAGLVLPDADDRHVLAAAIRCNASVIVTFNRKDF 86
Query 84 PDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ 141
P++AL PY I+A HPD+F+ D DL A +A PP T L+ L Q
Sbjct 87 PEEALLPYGIEAQHPDEFVDDLFDLEPAVVVAAARRQREALKQPPLTADRYLEILRRQ 144
>gi|87120221|ref|ZP_01076116.1| hypothetical protein MED121_08518 [Marinomonas sp. MED121]
gi|86164324|gb|EAQ65594.1| hypothetical protein MED121_08518 [Marinomonas sp. MED121]
Length=189
Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/113 (39%), Positives = 59/113 (53%), Gaps = 2/113 (1%)
Query 42 AFAFGYNDLIAAMN-NHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDD 100
A GY LI M DRHVLAAA++ AE I+T NLK FP+DAL Y + A+HPD+
Sbjct 73 ALVNGYESLIDGMELPDENDRHVLAAAIKGNAEGIITFNLKDFPEDALSRYGVSAIHPDE 132
Query 101 FLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ-VPQFAAKARRL 152
FL D +L + +PPF+ L+ L +Q +P F R +
Sbjct 133 FLSDMFELDPAKCLQAAQQQRRSLRNPPFSSEEYLNCLLKQKLPSFVKHLRTM 185
>gi|227506263|ref|ZP_03936312.1| conserved hypothetical protein [Corynebacterium striatum ATCC
6940]
gi|227197145|gb|EEI77193.1| conserved hypothetical protein [Corynebacterium striatum ATCC
6940]
Length=201
Score = 71.6 bits (174), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 41/95 (44%), Positives = 57/95 (60%), Gaps = 1/95 (1%)
Query 60 DRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILG 119
D HVLAAA++ RA+VIVT N+K FP D L+ + I PD+FLL+QLDL + +
Sbjct 103 DVHVLAAAIQGRADVIVTFNVKDFPKDELQRFGIDVQTPDEFLLNQLDLDPDEVLGCLAR 162
Query 120 MVDAYIDPPFTPHSLLDALGE-QVPQFAAKARRLF 153
+A DPP LLD L + ++ F + AR+L
Sbjct 163 QAEALKDPPVQLRELLDGLEKLKIQGFTSVARKLL 197
>gi|339501501|ref|YP_004688875.1| hypothetical protein RLO149_p830180 [Roseobacter litoralis Och
149]
gi|338759987|gb|AEI96449.1| hypothetical protein RLO149_p830180 [Roseobacter litoralis Och
149]
Length=192
Score = 71.2 bits (173), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 47/131 (36%), Positives = 63/131 (49%), Gaps = 2/131 (1%)
Query 22 PRLGEPWSTASTSAPNGLTTAFAFGYNDLIAAMN-NHYKDRHVLAAAVRERAEVIVTTNL 80
P L E + F GY LIA ++ DRHVLAAA+R A+VIVT N
Sbjct 56 PHLEESVRQQEAAIRETFEECFVTGYQPLIAGLDLPDPDDRHVLAAAIRCSAQVIVTENH 115
Query 81 KHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHS-LLDALG 139
+ FP + L Y ++ L DD L + DLY + + Y +PP TP LLD
Sbjct 116 RDFPSEVLDEYGVETLGSDDMLANTYDLYPSGGARALRRVRKRYENPPMTPSEFLLDLTR 175
Query 140 EQVPQFAAKAR 150
+ + AA+AR
Sbjct 176 HGLSKLAAQAR 186
>gi|304311368|ref|YP_003810966.1| hypothetical protein HDN1F_17340 [gamma proteobacterium HdN1]
gi|301797101|emb|CBL45317.1| Conserved hypothetical protein [gamma proteobacterium HdN1]
Length=189
Score = 71.2 bits (173), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 39/82 (48%), Positives = 48/82 (59%), Gaps = 0/82 (0%)
Query 60 DRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILG 119
DRHVLAAA+R AE+IVT NLK FP LKPY++ A HPDDF+LD +DL
Sbjct 92 DRHVLAAALRANAEIIVTANLKDFPQTTLKPYKVIAQHPDDFILDLIDLKPPLVLTCFKE 151
Query 120 MVDAYIDPPFTPHSLLDALGEQ 141
Y +P +T L+ L Q
Sbjct 152 DRTHYRNPQYTVEEYLENLLRQ 173
>gi|152982517|ref|YP_001354376.1| hypothetical protein mma_2686 [Janthinobacterium sp. Marseille]
gi|151282594|gb|ABR91004.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille]
Length=190
Score = 70.9 bits (172), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 38/82 (47%), Positives = 48/82 (59%), Gaps = 0/82 (0%)
Query 60 DRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILG 119
DRHV+AAA+ A +I+T NLK FPD+ALKPY + A HPDDF++D LDL+
Sbjct 93 DRHVVAAAIHSGASLIITFNLKDFPDEALKPYNLAAQHPDDFIVDLLDLHPAGVLEAAAS 152
Query 120 MVDAYIDPPFTPHSLLDALGEQ 141
+ PP T LD L Q
Sbjct 153 HRRSLKSPPKTADEYLDTLLAQ 174
>gi|225872993|ref|YP_002754452.1| hypothetical protein ACP_1362 [Acidobacterium capsulatum ATCC
51196]
gi|225791424|gb|ACO31514.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
51196]
Length=187
Score = 70.9 bits (172), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/127 (41%), Positives = 66/127 (52%), Gaps = 7/127 (5%)
Query 17 RRTSWPRLGEPWSTASTSAPNGLTTAFAF-----GYNDLIAAMNNHYKDRHVLAAAVRER 71
RR RLG A L T FA GY +LI AM N KDRHV+AAA++
Sbjct 45 RRNLIARLGRSEQDADYLV-GELRTHFADAWVADGYKELIPAMRNEPKDRHVVAAAIKCG 103
Query 72 AEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTP 131
AEVI+T N KHFP D L+ I A PD++L+D + E IL A ++PP
Sbjct 104 AEVILTYNTKHFPADWLQAQNIVAKTPDEYLVDLYSINPEIV-VHILHQQGAELNPPRNI 162
Query 132 HSLLDAL 138
S+L +L
Sbjct 163 ESVLKSL 169
>gi|226359560|ref|YP_002777338.1| hypothetical protein ROP_01460 [Rhodococcus opacus B4]
gi|226238045|dbj|BAH48393.1| hypothetical protein [Rhodococcus opacus B4]
Length=183
Score = 70.9 bits (172), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 48/122 (40%), Positives = 65/122 (54%), Gaps = 6/122 (4%)
Query 32 STSAPNGLTTAFAFGYNDLIAAMN-NHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKP 90
+ + P+ L T GY LI A+ DRHVLAAA+R A+VIVT NL+ FPD L
Sbjct 66 NRAVPDCLVT----GYEKLIPALTLPDADDRHVLAAAIRAGAQVIVTANLRDFPDAELAQ 121
Query 91 YQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ-VPQFAAKA 149
+ I+A HPD+F++D L + A+ +PP TP + D L +P AA
Sbjct 122 FGIEAKHPDEFVMDLFHLDGVRVHQAVSATAAAWRNPPGTPADVCDRLAAAGLPISAAAL 181
Query 150 RR 151
RR
Sbjct 182 RR 183
>gi|120556291|ref|YP_960642.1| hypothetical protein Maqu_3384 [Marinobacter aquaeolei VT8]
gi|120326140|gb|ABM20455.1| conserved hypothetical protein [Marinobacter aquaeolei VT8]
Length=188
Score = 70.5 bits (171), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 44/113 (39%), Positives = 61/113 (54%), Gaps = 2/113 (1%)
Query 42 AFAFGYNDLIAAMN-NHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDD 100
A +GY LI ++ DRHVLAAA+R A IVT NLK FP+ +L Y I+ +HPDD
Sbjct 72 AKVYGYEPLIEGLSLPDPDDRHVLAAAIRCNASAIVTFNLKDFPEASLHQYDIEFIHPDD 131
Query 101 FLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDAL-GEQVPQFAAKARRL 152
F+L Q+D+ A +PP + + L L +Q+PQ +K R
Sbjct 132 FILYQIDMAPTVCCRAFQAQRSALKNPPLSVDTFLATLQKQQLPQTVSKLREF 184
>gi|188591385|ref|YP_001795984.1| hypothetical protein pRALTA_0073 [Cupriavidus taiwanensis]
gi|170938780|emb|CAP63773.1| conserved hypothetical protein [Cupriavidus taiwanensis LMG 19424]
Length=192
Score = 70.5 bits (171), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 38/101 (38%), Positives = 57/101 (57%), Gaps = 1/101 (0%)
Query 42 AFAFGYNDLIAAMNN-HYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDD 100
+ FGY LI ++ DRHV+AAA+ +E I+T N+K FP+ ALK + ++A+HPDD
Sbjct 76 SLVFGYEHLIESIELPDPDDRHVVAAAIHSGSEAIITFNMKDFPEAALKEFNVEAIHPDD 135
Query 101 FLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ 141
F++D DL A + + +PP +P L L Q
Sbjct 136 FVVDLFDLNAGKVLAAVADHRASLKNPPKSPSEYLTTLSAQ 176
>gi|111020496|ref|YP_703468.1| hypothetical protein RHA1_ro03507 [Rhodococcus jostii RHA1]
gi|110820026|gb|ABG95310.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=396
Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/108 (46%), Positives = 64/108 (60%), Gaps = 0/108 (0%)
Query 46 GYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQ 105
GY +LI AM N KDRHVLAAAVR A VIVT ++ FP AL P+ ++A+HPD+FL D
Sbjct 79 GYRELIPAMTNDPKDRHVLAAAVRSGAGVIVTATVQDFPPAALVPFDVEAIHPDEFLCDL 138
Query 106 LDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLF 153
LDL +A + +V + P T LL++L P+F R L
Sbjct 139 LDLDPDAVLECMHALVARNMLSPRTVGELLESLERLTPRFVDTVRALL 186
>gi|296448192|ref|ZP_06890090.1| conserved hypothetical protein [Methylosinus trichosporium OB3b]
gi|296254323|gb|EFH01452.1| conserved hypothetical protein [Methylosinus trichosporium OB3b]
Length=124
Score = 68.9 bits (167), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 33/50 (66%), Positives = 38/50 (76%), Gaps = 0/50 (0%)
Query 59 KDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDL 108
DRHVLAAA+ R +VIVT NLKHFP+ AL P I A HPD+FL +QLDL
Sbjct 26 NDRHVLAAAIVGRCDVIVTQNLKHFPETALAPLDIDAQHPDEFLCNQLDL 75
>gi|54022969|ref|YP_117211.1| hypothetical protein nfa10020 [Nocardia farcinica IFM 10152]
gi|54014477|dbj|BAD55847.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=185
Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 40/94 (43%), Positives = 57/94 (61%), Gaps = 1/94 (1%)
Query 46 GYNDLIAAMN-NHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLD 104
GY LIA + DRHVLAAA++ A+VIVT NL FP L+ + I+A PD+F+LD
Sbjct 76 GYEPLIAGLRLPDPDDRHVLAAAIKAGAQVIVTANLADFPAAQLRQWDIEARGPDEFVLD 135
Query 105 QLDLYEEATKAVILGMVDAYIDPPFTPHSLLDAL 138
Q+ L + A + + D+ +PP T +L+AL
Sbjct 136 QVHLDDRVVWACVQQIADSRRNPPETVDDVLEAL 169
>gi|197286172|ref|YP_002152044.1| hypothetical protein PMI2326 [Proteus mirabilis HI4320]
gi|194683659|emb|CAR44600.1| conserved hypothetical protein [Proteus mirabilis HI4320]
Length=189
Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/93 (39%), Positives = 52/93 (56%), Gaps = 1/93 (1%)
Query 59 KDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVIL 118
DRHVLAAA++ +A+ I+T NLK FP L + I A+HPD+FL D +L +
Sbjct 91 NDRHVLAAAIKGQAQAIITFNLKDFPQKYLNSFDISAIHPDEFLSDMFELNPSSCIKAAQ 150
Query 119 GMVDAYIDPPFTPHSLLDALGEQ-VPQFAAKAR 150
+ +PP T + L L +Q +PQF + R
Sbjct 151 QHRQSLKNPPMTVETFLGCLQKQKLPQFISHLR 183
>gi|339328513|ref|YP_004688205.1| hypothetical protein CNE_BB1p07500 [Cupriavidus necator N-1]
gi|338171114|gb|AEI82167.1| hypothetical protein CNE_BB1p07500 [Cupriavidus necator N-1]
Length=190
Score = 67.8 bits (164), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/97 (39%), Positives = 55/97 (57%), Gaps = 1/97 (1%)
Query 46 GYNDLIAAMNN-HYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLD 104
GY LI ++ DRHV+AAA+ +E I+T N+K FP+ ALK + ++A+HPDDF++D
Sbjct 78 GYEHLIESIELPDPDDRHVVAAAIHSGSEAIITFNMKDFPETALKEFNVEAIHPDDFVVD 137
Query 105 QLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ 141
DL A + + +PP +P L L Q
Sbjct 138 LFDLNAGKVLAAVAEHRASLKNPPKSPSEYLTTLSAQ 174
>gi|53803201|ref|YP_115042.1| hypothetical protein MCA2638 [Methylococcus capsulatus str. Bath]
gi|53756962|gb|AAU91253.1| conserved hypothetical protein [Methylococcus capsulatus str.
Bath]
Length=190
Score = 67.4 bits (163), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 37/82 (46%), Positives = 48/82 (59%), Gaps = 0/82 (0%)
Query 60 DRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILG 119
DRHV+AAA+ A +IVT NLK FP +AL+PY + A HPDDF++D LDL+
Sbjct 93 DRHVVAAAIHSGASLIVTFNLKDFPPEALRPYNLAAQHPDDFIVDLLDLHPAGVLEAAAS 152
Query 120 MVDAYIDPPFTPHSLLDALGEQ 141
+ +PP T LD L Q
Sbjct 153 HRRSLKNPPKTADEYLDTLLAQ 174
>gi|255020302|ref|ZP_05292370.1| hypothetical protein ACA_2120 [Acidithiobacillus caldus ATCC
51756]
gi|254970222|gb|EET27716.1| hypothetical protein ACA_2120 [Acidithiobacillus caldus ATCC
51756]
Length=190
Score = 67.0 bits (162), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 37/83 (45%), Positives = 48/83 (58%), Gaps = 0/83 (0%)
Query 59 KDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVIL 118
DRHV+AAA+ A +IVT NLK FP +ALKPY + A HPDDF++D LDL+ +
Sbjct 92 SDRHVVAAAIHAGASLIVTFNLKDFPAEALKPYNLAAQHPDDFIVDLLDLHPAGVLEAVA 151
Query 119 GMVDAYIDPPFTPHSLLDALGEQ 141
+PP + LD L Q
Sbjct 152 RHRRNLKNPPKSADDYLDTLLAQ 174
>gi|152997973|ref|YP_001342808.1| hypothetical protein Mmwyl1_3977 [Marinomonas sp. MWYL1]
gi|150838897|gb|ABR72873.1| conserved hypothetical protein [Marinomonas sp. MWYL1]
Length=143
Score = 67.0 bits (162), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 42/113 (38%), Positives = 61/113 (54%), Gaps = 2/113 (1%)
Query 42 AFAFGYNDLIAAMN-NHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDD 100
A G+ LI ++ DRHVLAAA++ E I+T NLK FP + L+P+ I A+HPD+
Sbjct 27 ALVSGHESLINSLTLPDADDRHVLAAAIQGHTEGIITFNLKDFPIEQLEPFGISAIHPDE 86
Query 101 FLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ-VPQFAAKARRL 152
FL D +L + A +P TP L+ L +Q +P F + +RL
Sbjct 87 FLSDMFELDAASCLLAAQRHRSALKNPTLTPDEYLNCLLKQRLPSFVSALKRL 139
>gi|296123697|ref|YP_003631475.1| hypothetical protein Plim_3463 [Planctomyces limnophilus DSM
3776]
gi|296016037|gb|ADG69276.1| conserved hypothetical protein [Planctomyces limnophilus DSM
3776]
Length=189
Score = 66.2 bits (160), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/88 (46%), Positives = 52/88 (60%), Gaps = 8/88 (9%)
Query 34 SAPNGLTTAFAFGYNDLIAAMN-NHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQ 92
S P+GL T GY LI + DRHVLAAA+ RA I+T N K FP+ +LKPY
Sbjct 69 SVPDGLIT----GYEQLIETVALPDPDDRHVLAAAIFSRANTILTFNFKDFPETSLKPYG 124
Query 93 IKALHPDDFLLDQLDLYEEATKAVILGM 120
I+ALHPD+FL + L+E V + +
Sbjct 125 IEALHPDNFL---ISLFETTPSQVCIAV 149
>gi|225874811|ref|YP_002756270.1| hypothetical protein ACP_3270 [Acidobacterium capsulatum ATCC
51196]
gi|225794361|gb|ACO34451.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
51196]
Length=190
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/75 (48%), Positives = 48/75 (64%), Gaps = 3/75 (4%)
Query 34 SAPNGLTTAFAFGYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQI 93
+AP+ L TA+ L N DRHVLAAA+R RA VIVT NL+ FP +AL+ + I
Sbjct 69 AAPDALVTAYEHLIPGLFLPDEN---DRHVLAAAIRCRASVIVTCNLEDFPREALREFNI 125
Query 94 KALHPDDFLLDQLDL 108
+A HPD+F+ + DL
Sbjct 126 EAQHPDEFIFNLFDL 140
>gi|198282377|ref|YP_002218698.1| hypothetical protein Lferr_0233 [Acidithiobacillus ferrooxidans
ATCC 53993]
gi|198246898|gb|ACH82491.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans
ATCC 53993]
Length=190
Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/82 (46%), Positives = 47/82 (58%), Gaps = 0/82 (0%)
Query 60 DRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILG 119
DRHV+AAA+ A +IVT NLK FP + LKPY + A HPDDF+ D LDL+
Sbjct 93 DRHVVAAAIHGGASLIVTFNLKDFPPERLKPYNLTAQHPDDFIFDLLDLHAARVCEAAAN 152
Query 120 MVDAYIDPPFTPHSLLDALGEQ 141
+ +PP T LD L +Q
Sbjct 153 HRRSLKNPPKTVDEYLDTLLKQ 174
>gi|253995596|ref|YP_003047660.1| hypothetical protein Mmol_0223 [Methylotenera mobilis JLW8]
gi|253982275|gb|ACT47133.1| conserved hypothetical protein [Methylotenera mobilis JLW8]
Length=188
Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/101 (41%), Positives = 56/101 (56%), Gaps = 2/101 (1%)
Query 46 GYNDLIAAMNNHYKD-RHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLD 104
GY LI + KD RHVLAAA++ +A+ IVT N K FP D L+ +QI+ +HPDDFL
Sbjct 76 GYECLIEGLALPDKDDRHVLAAAIKCKADAIVTFNQKDFPADYLQTFQIEIIHPDDFLYY 135
Query 105 QLDLYEEATKAVILGMVDAYIDPPFTPHSLLDAL-GEQVPQ 144
Q+D+ A +PP + L L +Q+PQ
Sbjct 136 QIDMAPAICCKAFKDQRSALKNPPMSVDEFLTCLQKQQLPQ 176
>gi|167585067|ref|ZP_02377455.1| hypothetical protein BuboB_07001 [Burkholderia ubonensis Bu]
Length=193
Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/79 (42%), Positives = 47/79 (60%), Gaps = 0/79 (0%)
Query 60 DRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILG 119
D+HVLAAA+ A+ IVT NL+ FP + + PY I+ +HPD F+++Q DL A + G
Sbjct 96 DQHVLAAAIAGHADCIVTANLRDFPAEFVSPYGIEVIHPDRFIINQWDLDPIAAMSAFKG 155
Query 120 MVDAYIDPPFTPHSLLDAL 138
M + +P TP AL
Sbjct 156 MRARWRNPKATPEDFAQAL 174
>gi|289704754|ref|ZP_06501176.1| toxin-antitoxin system, toxin component, PIN family [Micrococcus
luteus SK58]
gi|289558514|gb|EFD51783.1| toxin-antitoxin system, toxin component, PIN family [Micrococcus
luteus SK58]
Length=212
Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/107 (37%), Positives = 62/107 (58%), Gaps = 2/107 (1%)
Query 46 GYNDLIAAMNN-HYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLD 104
G+ +L++A++ DRHV+AAA++ RA++IVT NLK FP + L + ++A HPD+F ++
Sbjct 78 GWEELLSAISLPDENDRHVVAAAIQGRADLIVTANLKDFPTEVLGRFNLEAQHPDEFFVN 137
Query 105 QLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGE-QVPQFAAKAR 150
QLDL + A +P T +L +L +FA AR
Sbjct 138 QLDLDPAEVMRSLQVQASATRNPALTVADILASLERCDAWRFAEAAR 184
>gi|126090285|ref|YP_001041740.1| hypothetical protein Sbal_4436 [Shewanella baltica OS155]
gi|125999916|gb|ABN63985.1| conserved hypothetical protein [Shewanella baltica OS155]
gi|334865973|gb|AEH16426.1| hypothetical protein Sbal117_4795 [Shewanella baltica OS117]
Length=189
Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/107 (37%), Positives = 56/107 (53%), Gaps = 2/107 (1%)
Query 46 GYNDLIAAMNN-HYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLD 104
GY LI + DRHV+AAA++ AE I+T NLK FP L ++ A+HPD+FL D
Sbjct 77 GYEPLIKGLTLPDMNDRHVVAAALKGHAEAIITFNLKDFPASELNILELSAIHPDEFLCD 136
Query 105 QLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQ-VPQFAAKAR 150
+L + + +PP T L+ L +Q +P F +K R
Sbjct 137 MFELDLSSCIKAAQQQRSSLKNPPMTTSEYLNCLQKQRLPVFVSKLR 183
>gi|227549096|ref|ZP_03979145.1| conserved hypothetical protein [Corynebacterium lipophiloflavum
DSM 44291]
gi|227078825|gb|EEI16788.1| conserved hypothetical protein [Corynebacterium lipophiloflavum
DSM 44291]
Length=125
Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/93 (42%), Positives = 52/93 (56%), Gaps = 1/93 (1%)
Query 60 DRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILG 119
D HV+AAA+ RA+VIVT NLK PD L + A PD+FLL+QLDL + +
Sbjct 27 DNHVVAAALLGRADVIVTNNLKDSPDRELGRFNSSAQRPDEFLLNQLDLAPDRVMRALAQ 86
Query 120 MVDAYIDPPFTPHSLLDALGE-QVPQFAAKARR 151
DPP T +LD+L + FAA+ R+
Sbjct 87 QAADMQDPPRTVRDVLDSLERAEANSFAAECRK 119
>gi|317482997|ref|ZP_07942001.1| hypothetical protein HMPREF0177_01396 [Bifidobacterium sp. 12_1_47BFAA]
gi|316915604|gb|EFV37022.1| hypothetical protein HMPREF0177_01396 [Bifidobacterium sp. 12_1_47BFAA]
Length=152
Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/96 (44%), Positives = 51/96 (54%), Gaps = 7/96 (7%)
Query 17 RRTSWPRLGEPWSTASTSAPNGLTTAF----AFGYNDLIAAMNNHYKDRHVLAAAVRERA 72
RR R+GE S N +T AF GY LI M KDRHVLAAA A
Sbjct 43 RRDLQLRIGEE---RSAQRINAMTAAFPKALVTGYESLIKNMTCDEKDRHVLAAADYLPA 99
Query 73 EVIVTTNLKHFPDDALKPYQIKALHPDDFLLDQLDL 108
+ +VT N K FP+ + +P +I+ PDDF LD LDL
Sbjct 100 QTLVTFNTKDFPETSTQPLRIEVKRPDDFFLDVLDL 135
Lambda K H
0.322 0.138 0.433
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130209894640
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40