BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3073c
Length=118
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610210|ref|NP_217589.1| hypothetical protein Rv3073c [Mycob... 241 4e-62
gi|289553136|ref|ZP_06442346.1| conserved hypothetical protein [... 237 5e-61
gi|118618755|ref|YP_907087.1| hypothetical protein MUL_3449 [Myc... 181 3e-44
gi|342860970|ref|ZP_08717619.1| hypothetical protein MCOL_18902 ... 171 3e-41
gi|254776395|ref|ZP_05217911.1| hypothetical protein MaviaA2_172... 163 1e-38
gi|240170661|ref|ZP_04749320.1| hypothetical protein MkanA1_1520... 162 2e-38
gi|296169252|ref|ZP_06850903.1| protein of hypothetical function... 162 2e-38
gi|41409248|ref|NP_962084.1| hypothetical protein MAP3150c [Myco... 161 3e-38
gi|118464803|ref|YP_883135.1| hypothetical protein MAV_3979 [Myc... 158 2e-37
gi|336459197|gb|EGO38142.1| hypothetical protein MAPs_04980 [Myc... 157 6e-37
gi|254820721|ref|ZP_05225722.1| hypothetical protein MintA_12381... 155 2e-36
gi|333991455|ref|YP_004524069.1| hypothetical protein JDM601_281... 141 3e-32
gi|118469417|ref|YP_886485.1| hypothetical protein MSMEG_2127 [M... 125 2e-27
gi|258653798|ref|YP_003202954.1| hypothetical protein Namu_3653 ... 122 1e-26
gi|21225882|ref|NP_631661.1| hypothetical protein SCO7620 [Strep... 117 5e-25
gi|298291301|ref|YP_003693240.1| hypothetical protein Snov_1311 ... 115 2e-24
gi|257056175|ref|YP_003134007.1| hypothetical protein Svir_21720... 114 4e-24
gi|329937385|ref|ZP_08286943.1| hypothetical protein SGM_2435 [S... 114 5e-24
gi|294634153|ref|ZP_06712706.1| uroporphyrin-III c-methyltransfe... 114 6e-24
gi|119714171|ref|YP_919313.1| hypothetical protein Noca_4868 [No... 113 9e-24
gi|255513915|gb|EET90180.1| protein of unknown function DUF488 [... 113 1e-23
gi|318060569|ref|ZP_07979292.1| hypothetical protein SSA3_21683 ... 112 2e-23
gi|266631111|emb|CBH32111.1| hypothetical protein [Streptomyces ... 111 4e-23
gi|297155885|gb|ADI05597.1| hypothetical protein SBI_02476 [Stre... 110 5e-23
gi|296271324|ref|YP_003653956.1| hypothetical protein Tbis_3373 ... 110 8e-23
gi|334365029|ref|ZP_08513997.1| conserved hypothetical protein [... 109 1e-22
gi|262204109|ref|YP_003275317.1| hypothetical protein Gbro_4278 ... 109 1e-22
gi|336320306|ref|YP_004600274.1| hypothetical protein Celgi_1187... 109 2e-22
gi|312197339|ref|YP_004017400.1| hypothetical protein FraEuI1c_3... 108 2e-22
gi|289208916|ref|YP_003460982.1| hypothetical protein TK90_1755 ... 108 2e-22
gi|54025115|ref|YP_119357.1| hypothetical protein nfa31460 [Noca... 108 3e-22
gi|260906975|ref|ZP_05915297.1| hypothetical protein BlinB_16702... 105 2e-21
gi|302518037|ref|ZP_07270379.1| conserved hypothetical protein [... 105 3e-21
gi|300786417|ref|YP_003766708.1| hypothetical protein AMED_4536 ... 105 3e-21
gi|319784778|ref|YP_004144254.1| hypothetical protein Mesci_5104... 105 3e-21
gi|302528367|ref|ZP_07280709.1| conserved hypothetical protein [... 104 4e-21
gi|291515027|emb|CBK64237.1| Uncharacterized conserved protein [... 104 4e-21
gi|340357394|ref|ZP_08680011.1| MarR family transcriptional regu... 104 5e-21
gi|302541237|ref|ZP_07293579.1| putative uroporphyrin-III c-meth... 104 5e-21
gi|291450504|ref|ZP_06589894.1| conserved hypothetical protein [... 103 1e-20
gi|256831489|ref|YP_003160216.1| hypothetical protein Jden_0239 ... 102 2e-20
gi|304312379|ref|YP_003811977.1| hypothetical protein HDN1F_2751... 102 2e-20
gi|148262413|ref|YP_001229119.1| hypothetical protein Gura_0330 ... 102 3e-20
gi|159186149|ref|NP_356259.2| hypothetical protein Atu4395 [Agro... 101 3e-20
gi|39998531|ref|NP_954482.1| hypothetical protein GSU3442 [Geoba... 101 4e-20
gi|184199725|ref|YP_001853932.1| hypothetical protein KRH_00790 ... 101 4e-20
gi|119718863|ref|YP_925828.1| hypothetical protein Noca_4646 [No... 101 4e-20
gi|325123337|gb|ADY82860.1| putative uroporphyrin-III C-methyltr... 101 4e-20
gi|258645758|ref|ZP_05733227.1| transcriptional regulator, MarR ... 101 4e-20
gi|298292289|ref|YP_003694228.1| hypothetical protein Snov_2314 ... 100 5e-20
>gi|15610210|ref|NP_217589.1| hypothetical protein Rv3073c [Mycobacterium tuberculosis H37Rv]
gi|15842643|ref|NP_337680.1| hypothetical protein MT3158 [Mycobacterium tuberculosis CDC1551]
gi|31794252|ref|NP_856745.1| hypothetical protein Mb3100c [Mycobacterium bovis AF2122/97]
78 more sequence titles
Length=118
Score = 241 bits (614), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 118/118 (100%), Positives = 118/118 (100%), Gaps = 0/118 (0%)
Query 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH
Sbjct 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
Query 61 QPERFDEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
QPERFDEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR
Sbjct 61 QPERFDEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
>gi|289553136|ref|ZP_06442346.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
gi|289437768|gb|EFD20261.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
Length=117
Score = 237 bits (604), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 116/117 (99%), Positives = 117/117 (100%), Gaps = 0/117 (0%)
Query 2 VRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQ 61
+RETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQ
Sbjct 1 MRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQ 60
Query 62 PERFDEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
PERFDEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR
Sbjct 61 PERFDEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 117
>gi|118618755|ref|YP_907087.1| hypothetical protein MUL_3449 [Mycobacterium ulcerans Agy99]
gi|183981601|ref|YP_001849892.1| hypothetical protein MMAR_1585 [Mycobacterium marinum M]
gi|118570865|gb|ABL05616.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
gi|183174927|gb|ACC40037.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=118
Score = 181 bits (459), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 88/118 (75%), Positives = 96/118 (82%), Gaps = 0/118 (0%)
Query 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
M RE R+RVAR+Y+D++PDDGQRVLVDRIWP GIRKDD RVGIWCK VAPS ELR+WYHH
Sbjct 1 MSREARIRVARIYDDVEPDDGQRVLVDRIWPRGIRKDDPRVGIWCKGVAPSTELRQWYHH 60
Query 61 QPERFDEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
QPERFDEFA+RY+ EL SAALAEL L VVT VTATR VA SH AVLA LL R
Sbjct 61 QPERFDEFAARYEAELSGSAALAELHTLAQGGVVTLVTATREVAGSHVAVLANLLKAR 118
>gi|342860970|ref|ZP_08717619.1| hypothetical protein MCOL_18902 [Mycobacterium colombiense CECT
3035]
gi|342131414|gb|EGT84684.1| hypothetical protein MCOL_18902 [Mycobacterium colombiense CECT
3035]
Length=118
Score = 171 bits (433), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 83/118 (71%), Positives = 94/118 (80%), Gaps = 0/118 (0%)
Query 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
M + R+RVARVY+D+ PD+GQRVLVDR+WP G+RKDD RVGIWCK+VAPSK+LREWY H
Sbjct 1 MSAKGRIRVARVYDDVGPDEGQRVLVDRVWPRGVRKDDPRVGIWCKNVAPSKDLREWYQH 60
Query 61 QPERFDEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
PERFDEFASRY+ EL DS AL ELRKL R VT VTATR + S AAVLA LL R
Sbjct 61 DPERFDEFASRYKSELRDSGALDELRKLAKRGPVTLVTATRDLDISQAAVLATLLKSR 118
>gi|254776395|ref|ZP_05217911.1| hypothetical protein MaviaA2_17247 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=118
Score = 163 bits (412), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 78/111 (71%), Positives = 90/111 (82%), Gaps = 0/111 (0%)
Query 6 RVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERF 65
R++VARVY+ + PD+GQRVLVDRIWP G+RKDD RVGIWCKDVAPSK+LREWYHH+PERF
Sbjct 6 RIQVARVYDKVGPDEGQRVLVDRIWPRGVRKDDPRVGIWCKDVAPSKQLREWYHHEPERF 65
Query 66 DEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
DEF SRY+ EL + AL ELR L R VT VTATR + S AAVLA+LL
Sbjct 66 DEFTSRYKSELRGNPALDELRALAKRGSVTLVTATRDLDISQAAVLAELLK 116
>gi|240170661|ref|ZP_04749320.1| hypothetical protein MkanA1_15205 [Mycobacterium kansasii ATCC
12478]
Length=118
Score = 162 bits (410), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 80/116 (69%), Positives = 91/116 (79%), Gaps = 0/116 (0%)
Query 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
MV + R+RV R+YE+ PDDGQR+LVD IWP GIRKDD RVGIW KDVAPSKELREWYHH
Sbjct 1 MVTKRRIRVTRIYEEPGPDDGQRILVDGIWPRGIRKDDPRVGIWRKDVAPSKELREWYHH 60
Query 61 QPERFDEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
+ ERFDEF +RY+ EL S ALAELRKL +T VTATR + SHAAVLA+LL
Sbjct 61 RRERFDEFEARYEAELSGSPALAELRKLAASGPITLVTATREIDGSHAAVLAKLLK 116
>gi|296169252|ref|ZP_06850903.1| protein of hypothetical function DUF488 [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295896076|gb|EFG75744.1| protein of hypothetical function DUF488 [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=132
Score = 162 bits (410), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 81/113 (72%), Positives = 91/113 (81%), Gaps = 0/113 (0%)
Query 6 RVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERF 65
R+RVARVY+D+ PDDGQRVLVDR+WP GIRKDD RVGIW KDVAPSK+LREWY H+PE F
Sbjct 20 RIRVARVYDDVTPDDGQRVLVDRVWPRGIRKDDPRVGIWRKDVAPSKDLREWYQHRPELF 79
Query 66 DEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
DEFA+RYQ EL S AL E+ +L R VVT VTATR V S AAVLA+LL R
Sbjct 80 DEFAARYQAELRGSPALEEVCQLAKRGVVTLVTATRAVDISQAAVLAKLLKTR 132
>gi|41409248|ref|NP_962084.1| hypothetical protein MAP3150c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41398068|gb|AAS05698.1| hypothetical protein MAP_3150c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=118
Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 77/111 (70%), Positives = 89/111 (81%), Gaps = 0/111 (0%)
Query 6 RVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERF 65
R++VARVY+ + PD+GQRVLVDRIWP G+RKDD RVGIWCKDVAPSK+LREWYHH+PERF
Sbjct 6 RIQVARVYDKVGPDEGQRVLVDRIWPRGVRKDDPRVGIWCKDVAPSKQLREWYHHEPERF 65
Query 66 DEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
DEF SRY+ EL + AL ELR L R VT VTATR + S A VLA+LL
Sbjct 66 DEFTSRYKSELRGNPALDELRTLAKRGSVTLVTATRDLDISQAVVLAELLK 116
>gi|118464803|ref|YP_883135.1| hypothetical protein MAV_3979 [Mycobacterium avium 104]
gi|118166090|gb|ABK66987.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=110
Score = 158 bits (400), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 76/108 (71%), Positives = 87/108 (81%), Gaps = 0/108 (0%)
Query 9 VARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFDEF 68
+ARVY+ + PD+GQRVLVDRIWP G+RKDD RVGIWCKDVAPSK+LREWYHH+PERFDEF
Sbjct 1 MARVYDKVGPDEGQRVLVDRIWPRGVRKDDPRVGIWCKDVAPSKQLREWYHHEPERFDEF 60
Query 69 ASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
SRY+ EL + AL ELR L R VT VTATR + S AAVLA+LL
Sbjct 61 TSRYKSELRGNPALDELRALAKRGSVTLVTATRDLDISQAAVLAELLK 108
>gi|336459197|gb|EGO38142.1| hypothetical protein MAPs_04980 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=110
Score = 157 bits (397), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 75/108 (70%), Positives = 86/108 (80%), Gaps = 0/108 (0%)
Query 9 VARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFDEF 68
+ARVY+ + PD+GQRVLVDRIWP G+RKDD RVGIWCKDVAPSK+LREWYHH+PERFDEF
Sbjct 1 MARVYDKVGPDEGQRVLVDRIWPRGVRKDDPRVGIWCKDVAPSKQLREWYHHEPERFDEF 60
Query 69 ASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
SRY+ EL + AL ELR L R VT VTATR + S A VLA+LL
Sbjct 61 TSRYKSELRGNPALDELRTLAKRGSVTLVTATRDLDISQAVVLAELLK 108
>gi|254820721|ref|ZP_05225722.1| hypothetical protein MintA_12381 [Mycobacterium intracellulare
ATCC 13950]
Length=109
Score = 155 bits (392), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 75/107 (71%), Positives = 86/107 (81%), Gaps = 0/107 (0%)
Query 11 RVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFDEFAS 70
RVY+D+ D+GQRVLVDR+WP G+RKDD RVGIWCK+VAPSK+LREWY H+ ERFDEF S
Sbjct 2 RVYDDVSADEGQRVLVDRVWPRGMRKDDPRVGIWCKEVAPSKDLREWYQHRAERFDEFTS 61
Query 71 RYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
RY+ EL DSAALAELRKL R VT VTATR V S A VLA++L
Sbjct 62 RYEAELRDSAALAELRKLAKRGPVTLVTATREVDISQAVVLAKVLGA 108
>gi|333991455|ref|YP_004524069.1| hypothetical protein JDM601_2815 [Mycobacterium sp. JDM601]
gi|333487423|gb|AEF36815.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=122
Score = 141 bits (356), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 74/122 (61%), Positives = 90/122 (74%), Gaps = 4/122 (3%)
Query 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
M +T +RVARVY++ +DGQR+LVDR+WP G RK+D RVG W KD APS+ELR WY H
Sbjct 1 MAAKTTIRVARVYDEPRAEDGQRILVDRVWPRGFRKNDPRVGRWFKDAAPSRELRSWYDH 60
Query 61 QPERFDEFASRYQEEL---HDSAALAELRKLT-GRSVVTPVTATRHVARSHAAVLAQLLN 116
+PERFDEF +RY++EL +AAL ELR+LT G VT VTATR V S AAVLA+LL
Sbjct 61 RPERFDEFVARYKDELASPQGAAALEELRELTRGAKTVTLVTATRDVDGSQAAVLAELLA 120
Query 117 GR 118
R
Sbjct 121 SR 122
>gi|118469417|ref|YP_886485.1| hypothetical protein MSMEG_2127 [Mycobacterium smegmatis str.
MC2 155]
gi|118170704|gb|ABK71600.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=119
Score = 125 bits (313), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 64/118 (55%), Positives = 79/118 (67%), Gaps = 3/118 (2%)
Query 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
M + VR+ARVYE+ +P +G RVLVDR+WP G+RKDD R+G W +VAPS ELR WY H
Sbjct 1 MASKHEVRLARVYEEPEPGEGHRVLVDRLWPRGLRKDDPRIGQWLPEVAPSTELRRWYGH 60
Query 61 QPERFDEFASRYQEELHD---SAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLL 115
PER+DEF RY EL +AAL ELR + V VTAT+ + SH +VLA LL
Sbjct 61 APERYDEFVERYTAELETPAAAAALDELRGIVRAGPVVLVTATKEIELSHLSVLAGLL 118
>gi|258653798|ref|YP_003202954.1| hypothetical protein Namu_3653 [Nakamurella multipartita DSM
44233]
gi|258557023|gb|ACV79965.1| protein of unknown function DUF488 [Nakamurella multipartita
DSM 44233]
Length=125
Score = 122 bits (307), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 62/120 (52%), Positives = 80/120 (67%), Gaps = 3/120 (2%)
Query 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
M+ + V+V R+Y++ DDG RVLVDR+WP G+ K R+ WC+ VAPS ELR WYHH
Sbjct 1 MIDQHAVQVRRIYDEPRTDDGTRVLVDRLWPRGMSKARARLNEWCRTVAPSTELRTWYHH 60
Query 61 QPERFDEFASRYQEEL---HDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
+PERF+EF RY +EL +AALA LR L + +T +TAT+ A S AAVLA LL
Sbjct 61 EPERFEEFTRRYLDELATPEQAAALAHLRDLANHNSLTLLTATKSTALSEAAVLATLLTS 120
>gi|21225882|ref|NP_631661.1| hypothetical protein SCO7620 [Streptomyces coelicolor A3(2)]
gi|289766960|ref|ZP_06526338.1| conserved hypothetical protein [Streptomyces lividans TK24]
gi|11322409|emb|CAC16722.1| conserved hypothetical protein [Streptomyces coelicolor A3(2)]
gi|289697159|gb|EFD64588.1| conserved hypothetical protein [Streptomyces lividans TK24]
Length=119
Score = 117 bits (294), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 61/116 (53%), Positives = 75/116 (65%), Gaps = 3/116 (2%)
Query 5 TRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPER 64
RV R+YE P DG+RVLVDR+WP G+ K+ + W +DVAPS +LR WYHH P R
Sbjct 3 ARVGYRRIYESATPQDGKRVLVDRLWPRGVSKEKAGLDAWLRDVAPSADLRRWYHHDPRR 62
Query 65 FDEFASRYQEELHDS---AALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
+DEF RY EL DS AAL LR L R+ VT +TAT+ V S AA LA+ L+G
Sbjct 63 YDEFRRRYFAELEDSDHEAALERLRDLAARAKVTLLTATKDVDHSEAAALAEWLDG 118
>gi|298291301|ref|YP_003693240.1| hypothetical protein Snov_1311 [Starkeya novella DSM 506]
gi|296927812|gb|ADH88621.1| protein of unknown function DUF488 [Starkeya novella DSM 506]
Length=124
Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 59/116 (51%), Positives = 73/116 (63%), Gaps = 1/116 (0%)
Query 4 ETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPE 63
E + + R YE PDDG RVLVDR+WP G+RKDD ++ +W KD+APS ELREW+ H P
Sbjct 5 EPDLLMKRAYEPARPDDGARVLVDRLWPRGVRKDDLKLTLWLKDIAPSTELREWFGHDPA 64
Query 64 RFDEFASRYQEELH-DSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
RF EFA RY+EEL + ALA L +L G T + A R + A VLA L R
Sbjct 65 RFKEFARRYREELRANEGALAPLEQLLGSGRATLIYAARDADHNEAVVLADYLRER 120
>gi|257056175|ref|YP_003134007.1| hypothetical protein Svir_21720 [Saccharomonospora viridis DSM
43017]
gi|256586047|gb|ACU97180.1| uncharacterized conserved protein [Saccharomonospora viridis
DSM 43017]
Length=121
Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 57/112 (51%), Positives = 74/112 (67%), Gaps = 0/112 (0%)
Query 6 RVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERF 65
RV R+Y+D P DG RVLVDRIWP G++K++ W ++VAPS ELR WY H P++F
Sbjct 7 RVTYRRIYDDPTPSDGTRVLVDRIWPRGLKKENAHYDEWLREVAPSTELRTWYGHDPKKF 66
Query 66 DEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
EF RY EL DS A LR++ R VT +TA+R + S AAVLA+ L+G
Sbjct 67 AEFRRRYLTELGDSEHAARLREIAERGNVTLLTASRDIEHSQAAVLAEWLSG 118
>gi|329937385|ref|ZP_08286943.1| hypothetical protein SGM_2435 [Streptomyces griseoaurantiacus
M045]
gi|329303261|gb|EGG47148.1| hypothetical protein SGM_2435 [Streptomyces griseoaurantiacus
M045]
Length=120
Score = 114 bits (285), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 64/120 (54%), Positives = 77/120 (65%), Gaps = 3/120 (2%)
Query 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
M R V V RVYE+ DDG RVLVDRIWP G+ KD + WCK VAPS ELR+WY H
Sbjct 1 MARTKAVHVRRVYEEPARDDGTRVLVDRIWPRGMSKDRAHLHEWCKQVAPSTELRKWYGH 60
Query 61 QPERFDEFASRYQEELHD---SAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
P+RF+EF RY+EEL + + ALA LR+L G +T +TAT+ S A VLA LL
Sbjct 61 VPDRFEEFGRRYREELKEPERADALAHLRELAGDRPLTLLTATKDPGISAARVLAGLLRA 120
>gi|294634153|ref|ZP_06712706.1| uroporphyrin-III c-methyltransferase [Streptomyces sp. e14]
gi|292829837|gb|EFF88193.1| uroporphyrin-III c-methyltransferase [Streptomyces sp. e14]
Length=124
Score = 114 bits (284), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 58/109 (54%), Positives = 73/109 (67%), Gaps = 3/109 (2%)
Query 11 RVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFDEFAS 70
RVYED P+DG+RVLVDR+WP G+RK+D + W +DVAPS +LR WY H+P RFDEF
Sbjct 9 RVYEDTSPEDGKRVLVDRVWPRGMRKEDAHLDEWLRDVAPSTDLRHWYGHEPSRFDEFRR 68
Query 71 RYQEELHDSA---ALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
RY EL D+ A+ LR L +T +TAT+ V S AAVLA+ L
Sbjct 69 RYLAELRDAGHREAVGHLRDLADHDRLTLLTATKDVDHSQAAVLAEWLT 117
>gi|119714171|ref|YP_919313.1| hypothetical protein Noca_4868 [Nocardioides sp. JS614]
gi|119526080|gb|ABL79450.1| protein of unknown function DUF488 [Nocardioides sp. JS614]
Length=120
Score = 113 bits (283), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 61/118 (52%), Positives = 76/118 (65%), Gaps = 3/118 (2%)
Query 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
M R VR+ R+Y+D DDG RVLVDR WP G+ K + WC V+PS LR+WY H
Sbjct 1 MTRRPDVRLRRIYDDPTDDDGMRVLVDRRWPRGVSKVRADLDEWCSAVSPSDALRKWYGH 60
Query 61 QPERFDEFASRYQEELHD---SAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLL 115
PERFDEF +RY +EL D ++ALA LR + R +T +TATR + S AAVLA LL
Sbjct 61 VPERFDEFEARYLDELEDPQRASALAHLRAMAARGRLTLLTATRDLEVSQAAVLAGLL 118
>gi|255513915|gb|EET90180.1| protein of unknown function DUF488 [Candidatus Micrarchaeum acidiphilum
ARMAN-2]
Length=157
Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 49/116 (43%), Positives = 79/116 (69%), Gaps = 1/116 (0%)
Query 2 VRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQ 61
++ +++ RVY+ +D DG R+LVDR+WP GIR+ Q + +W KDVAPS ELR+W+ HQ
Sbjct 26 IKNRMIKIKRVYDKVDIHDGLRILVDRLWPRGIRRSSQNIDLWLKDVAPSTELRKWFMHQ 85
Query 62 PERFDEFASRYQEELHDSAALAELRKLTGRS-VVTPVTATRHVARSHAAVLAQLLN 116
P R++ F +Y+ EL DS ALA+L + + +T + +T+ + ++A VL ++LN
Sbjct 86 PGRWESFKKKYEAELKDSKALAKLVDIALTTDTITMLYSTKDMEHNNALVLLKVLN 141
>gi|318060569|ref|ZP_07979292.1| hypothetical protein SSA3_21683 [Streptomyces sp. SA3_actG]
gi|318076163|ref|ZP_07983495.1| hypothetical protein SSA3_05458 [Streptomyces sp. SA3_actF]
Length=127
Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/122 (52%), Positives = 77/122 (64%), Gaps = 5/122 (4%)
Query 1 MVRE--TRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWY 58
MVRE T VRV R Y+ + DG RVLVDR+WP G+ K+ V W K VAPS LREWY
Sbjct 1 MVREAATEVRVRRAYDPPEAADGHRVLVDRLWPRGLSKERAHVDEWLKSVAPSAALREWY 60
Query 59 HHQPERFDEFASRYQEELHDSA---ALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLL 115
H P +F EFA RY+ EL D A ALA LR L+ R +T VTAT+ + +H+ VLA+ L
Sbjct 61 GHDPAKFGEFADRYRAELADPARAEALARLRTLSARGPLTLVTATKDLGHAHSRVLAREL 120
Query 116 NG 117
Sbjct 121 GA 122
>gi|266631111|emb|CBH32111.1| hypothetical protein [Streptomyces albaduncus]
Length=120
Score = 111 bits (277), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 57/111 (52%), Positives = 73/111 (66%), Gaps = 3/111 (2%)
Query 11 RVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFDEFAS 70
RVYE+ P +G+RVLVDR+WP G+RK+D R+ W +DVAPS ELR+WY H+P RF EF
Sbjct 9 RVYEETSPQEGKRVLVDRVWPRGMRKEDARLDEWLRDVAPSSELRKWYGHEPSRFTEFRR 68
Query 71 RYQEELHDSA---ALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
RY EEL D+ A L L + +TAT+ V S AAVLA+ L G+
Sbjct 69 RYLEELRDAGHREAAGHLHDLAEHDGLVLLTATKDVDHSQAAVLAEWLAGK 119
>gi|297155885|gb|ADI05597.1| hypothetical protein SBI_02476 [Streptomyces bingchenggensis
BCW-1]
Length=122
Score = 110 bits (276), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 58/113 (52%), Positives = 72/113 (64%), Gaps = 3/113 (2%)
Query 7 VRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFD 66
+R+ RVYE DPDDG RVLVDRIWP G+ KD R+ W K APS ELR WY H PER
Sbjct 9 IRIRRVYETPDPDDGTRVLVDRIWPRGLAKDAARLDEWLKAAAPSTELRRWYGHAPERSA 68
Query 67 EFASRYQEELHDS---AALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
EF RY+ EL D AA+ LR+L +T +TAT+ A+SH +LA+ +
Sbjct 69 EFGRRYRAELADPERRAAVDHLRELAATGPLTLLTATKDPAQSHLPLLAEAVG 121
>gi|296271324|ref|YP_003653956.1| hypothetical protein Tbis_3373 [Thermobispora bispora DSM 43833]
gi|296094111|gb|ADG90063.1| protein of unknown function DUF488 [Thermobispora bispora DSM
43833]
Length=117
Score = 110 bits (275), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 60/109 (56%), Positives = 75/109 (69%), Gaps = 3/109 (2%)
Query 11 RVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFDEFAS 70
RVY++ DDG RVLVDRIWP G+ K+ R+ W KDVAPS ELR WY H PERF EF S
Sbjct 7 RVYDEPTSDDGVRVLVDRIWPRGLTKEAARLDEWIKDVAPSAELRVWYGHVPERFPEFRS 66
Query 71 RYQEELHD---SAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
RY EL + AAL LR L+ + +T +TAT+ V S+AAVLA++L+
Sbjct 67 RYLAELAEPRRQAALQHLRDLSKQGRLTLLTATKDVEHSNAAVLAEVLS 115
>gi|334365029|ref|ZP_08513997.1| conserved hypothetical protein [Alistipes sp. HGB5]
gi|313158726|gb|EFR58113.1| conserved hypothetical protein [Alistipes sp. HGB5]
Length=124
Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 55/113 (49%), Positives = 74/113 (66%), Gaps = 2/113 (1%)
Query 5 TRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPE- 63
TR+R+ RVYE PDDG RVLVD++WP G+RKD +W K++ PS ELR WYH P+
Sbjct 2 TRIRIKRVYEPAAPDDGCRVLVDKLWPRGVRKDALHYDMWAKEITPSPELRAWYHADPQT 61
Query 64 RFDEFASRYQEELHDSAALAEL-RKLTGRSVVTPVTATRHVARSHAAVLAQLL 115
R+ EF RY EEL S A+ E R++ G VT + A+++ A +HA VL + L
Sbjct 62 RWPEFRRRYLEELRGSQAVREFVRRIAGNETVTLLYASKNAAENHALVLQEFL 114
>gi|262204109|ref|YP_003275317.1| hypothetical protein Gbro_4278 [Gordonia bronchialis DSM 43247]
gi|262087456|gb|ACY23424.1| protein of unknown function DUF488 [Gordonia bronchialis DSM
43247]
Length=124
Score = 109 bits (272), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/111 (51%), Positives = 73/111 (66%), Gaps = 3/111 (2%)
Query 9 VARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFDEF 68
+ R+Y+D DDGQRVLVDR+WP G+ K + WCK VAPS +LR+WY H P RFDEF
Sbjct 8 IKRIYDDPSDDDGQRVLVDRLWPRGMSKQRAAIDEWCKTVAPSTDLRKWYAHDPVRFDEF 67
Query 69 ASRYQEELHD---SAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
+RY +EL D ++A A LR L +T +TA++ + S A VLA LLN
Sbjct 68 RARYADELADGDQASAFAHLRDLHQHGRLTLLTASKALDISEARVLADLLN 118
>gi|336320306|ref|YP_004600274.1| hypothetical protein Celgi_1187 [Cellvibrio gilvus ATCC 13127]
gi|336103887|gb|AEI11706.1| protein of unknown function DUF488 [Cellvibrio gilvus ATCC 13127]
Length=123
Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/115 (53%), Positives = 72/115 (63%), Gaps = 3/115 (2%)
Query 7 VRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFD 66
V +ARVY+++ RVLVDR+WP G+R+DD RVG W +VAPS ELR WY H+ E +D
Sbjct 6 VALARVYDEVTDAGALRVLVDRLWPRGVRRDDPRVGQWLPEVAPSNELRRWYGHRAEEYD 65
Query 67 EFASRYQEELHDS---AALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
EFA RY +EL D A L L VT VTATR V SH VLA LL+ R
Sbjct 66 EFARRYAQELADGDARIAFHRLAALAHAEPVTLVTATREVDLSHLTVLAGLLDAR 120
>gi|312197339|ref|YP_004017400.1| hypothetical protein FraEuI1c_3521 [Frankia sp. EuI1c]
gi|311228675|gb|ADP81530.1| protein of unknown function DUF488 [Frankia sp. EuI1c]
Length=120
Score = 108 bits (270), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/118 (50%), Positives = 74/118 (63%), Gaps = 3/118 (2%)
Query 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
M +V+V R YE DG RVLVDR+WP G+ + + + WCK VAPS ELR WY H
Sbjct 1 MGAAAKVQVRRAYEAPATTDGTRVLVDRLWPRGLARAEASLDEWCKQVAPSTELRTWYAH 60
Query 61 QPERFDEFASRYQEELHD---SAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLL 115
PERF+EF RY++EL + ALA LR+L + +T +TATR S A VLA+LL
Sbjct 61 DPERFEEFGRRYRQELAQPERAEALAHLRELARSATLTLLTATRQPEISEAQVLAELL 118
>gi|289208916|ref|YP_003460982.1| hypothetical protein TK90_1755 [Thioalkalivibrio sp. K90mix]
gi|288944547|gb|ADC72246.1| protein of unknown function DUF488 [Thioalkalivibrio sp. K90mix]
Length=142
Score = 108 bits (270), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/113 (51%), Positives = 71/113 (63%), Gaps = 1/113 (0%)
Query 6 RVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERF 65
RVR+ RVY+ +DPDDG RVLVDR+WP G++K + W K VAPS +LR W+ H P R+
Sbjct 3 RVRIRRVYDPVDPDDGARVLVDRLWPRGVKKSALVLDAWEKSVAPSPDLRRWFAHDPSRW 62
Query 66 DEFASRYQEELH-DSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
+F+ RYQ EL S A L R +T +TATR V SH AVL LLN
Sbjct 63 RDFSRRYQAELDAGSEAFGRLLGYARRGCLTLLTATRDVEHSHVAVLRDLLNA 115
>gi|54025115|ref|YP_119357.1| hypothetical protein nfa31460 [Nocardia farcinica IFM 10152]
gi|54016623|dbj|BAD57993.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=131
Score = 108 bits (269), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/111 (55%), Positives = 70/111 (64%), Gaps = 3/111 (2%)
Query 11 RVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFDEFAS 70
RVY+ P+DG RVLVDR+WP G+RKDD W KDVAPS ELR WY H PERF EF
Sbjct 9 RVYDPHTPEDGARVLVDRVWPRGVRKDDLAPVEWLKDVAPSTELRRWYGHDPERFAEFRR 68
Query 71 RYQEELHDSA---ALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
RY EL S A+ LR+ VT +TATR + S AAVLA++L R
Sbjct 69 RYLAELRTSPAREAVEHLRRAVAAGPVTLLTATRDLEHSQAAVLAEVLRRR 119
>gi|260906975|ref|ZP_05915297.1| hypothetical protein BlinB_16702 [Brevibacterium linens BL2]
Length=124
Score = 105 bits (262), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 49/121 (41%), Positives = 80/121 (67%), Gaps = 3/121 (2%)
Query 1 MVRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHH 60
M ++ +V+V R Y+D +DG R+LVDRIWP G+ KD + W K++APS +LR+WY H
Sbjct 1 MTQKHQVQVRRAYDDPAQNDGTRILVDRIWPRGVSKDKAELDDWLKELAPSSDLRKWYSH 60
Query 61 QPERFDEFASRYQEELHD---SAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
P++F+EF+ RY++EL D + A +L++ + +T +TA++ S A VL +L+G
Sbjct 61 DPDKFEEFSRRYRDELKDDDHAEAFKQLKEYAKKGKLTLITASKRDDISDATVLKHVLDG 120
Query 118 R 118
+
Sbjct 121 K 121
>gi|302518037|ref|ZP_07270379.1| conserved hypothetical protein [Streptomyces sp. SPB78]
gi|302426932|gb|EFK98747.1| conserved hypothetical protein [Streptomyces sp. SPB78]
Length=119
Score = 105 bits (262), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/114 (50%), Positives = 71/114 (63%), Gaps = 3/114 (2%)
Query 7 VRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFD 66
+RV R Y+ + DG RVLVDR+WP G+ K+ V W K VAPS LR+WY H P +F
Sbjct 1 MRVRRAYDPPEAADGHRVLVDRLWPRGLSKERAHVDEWLKSVAPSAALRQWYGHDPAKFG 60
Query 67 EFASRYQEELHDSA---ALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
EFA RY+ EL D A ALA LR L+ R +T VTAT+ + +H VLA+ L
Sbjct 61 EFADRYRAELADPARAEALARLRTLSARGPLTLVTATKDLGHAHTRVLARELGA 114
>gi|300786417|ref|YP_003766708.1| hypothetical protein AMED_4536 [Amycolatopsis mediterranei U32]
gi|299795931|gb|ADJ46306.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340527895|gb|AEK43100.1| hypothetical protein RAM_23100 [Amycolatopsis mediterranei S699]
Length=120
Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/119 (49%), Positives = 76/119 (64%), Gaps = 3/119 (2%)
Query 2 VRETRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQ 61
++ T VRVAR+ + P DG RVLV+R+WP G + + W + +APS +LR WY H
Sbjct 1 MQPTVVRVARLSDPAGPADGVRVLVERLWPRGTARTAVVLDGWYRGLAPSDDLRTWYGHD 60
Query 62 PERFDEFASRYQEELHD---SAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
PERF EFA RY+ EL + +AALA LR+L G VT +TA+ A S AAVLA +L G
Sbjct 61 PERFAEFAERYRAELREPDRAAALARLRELAGDGPVTLLTASGSPAISQAAVLAAVLTG 119
>gi|319784778|ref|YP_004144254.1| hypothetical protein Mesci_5104 [Mesorhizobium ciceri biovar
biserrulae WSM1271]
gi|317170666|gb|ADV14204.1| protein of unknown function DUF488 [Mesorhizobium ciceri biovar
biserrulae WSM1271]
Length=117
Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/114 (47%), Positives = 71/114 (63%), Gaps = 1/114 (0%)
Query 6 RVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERF 65
++ V R+YE DDGQRVLVDRIWP G+ K D + +W KD+APS ELR+W+ H+ R+
Sbjct 4 KIAVKRIYEPPAADDGQRVLVDRIWPRGVSKQDAALTLWLKDIAPSDELRKWFGHEAVRW 63
Query 66 DEFASRYQEEL-HDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
EF RY EL + AA+ +LR L G VT + A ++A LA L+GR
Sbjct 64 AEFQERYGAELDGNGAAVTQLRGLLGNGKVTLLYGAHDEAHNNAVALAGYLHGR 117
>gi|302528367|ref|ZP_07280709.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302437262|gb|EFL09078.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=119
Score = 104 bits (260), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 55/115 (48%), Positives = 75/115 (66%), Gaps = 3/115 (2%)
Query 6 RVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERF 65
+V+V RVY+ + DG RVLVDR+WP G+ KD + W K ++PS ELR+WY H P+R+
Sbjct 5 QVQVRRVYDAPESFDGARVLVDRLWPRGLSKDRAHLDDWLKQISPSTELRKWYAHDPDRY 64
Query 66 DEFASRYQEELHD---SAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
DEFA RY+ EL + + AL +LR L + +T +TAT+ S AAVLA LL
Sbjct 65 DEFAERYRSELEEPERAEALDQLRALAKKGPLTLLTATKRSDISEAAVLADLLRS 119
>gi|291515027|emb|CBK64237.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
Length=124
Score = 104 bits (260), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 53/114 (47%), Positives = 74/114 (65%), Gaps = 2/114 (1%)
Query 5 TRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQP-E 63
TR+R+ RVYE PDDG RVLVDR+WP G+R+D +W +D+APS LR WYH P +
Sbjct 2 TRIRIKRVYEAPAPDDGCRVLVDRLWPRGVRRDALHCDVWARDLAPSAGLRSWYHADPGD 61
Query 64 RFDEFASRYQEELHDSAALAEL-RKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
R++EF RY +EL S A+ E R + G VT + A+++ A +HA +L + L
Sbjct 62 RWEEFRRRYTDELRASQAVREFVRGIEGVDTVTLLYASKNAAENHALILQEYLQ 115
>gi|340357394|ref|ZP_08680011.1| MarR family transcriptional regulator [Sporosarcina newyorkensis
2681]
gi|339617744|gb|EGQ22362.1| MarR family transcriptional regulator [Sporosarcina newyorkensis
2681]
Length=125
Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 52/116 (45%), Positives = 74/116 (64%), Gaps = 6/116 (5%)
Query 7 VRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFD 66
V++ RVY+ + DG RVLVDR+WP G+ K+D ++ W K+V PSKELREW+ H PE+FD
Sbjct 8 VQIKRVYDKAEKADGIRVLVDRLWPRGVSKEDAKLDDWLKEVGPSKELREWFGHDPEKFD 67
Query 67 EFASRYQEEL-HDSAALAELRKLTGRSV-----VTPVTATRHVARSHAAVLAQLLN 116
EF +Y+EEL ++ EL KL RS +T V ++ + A VL +LL+
Sbjct 68 EFKRKYKEELKNNKVQTEELNKLINRSKEHDEQLTLVFGSKDEKHNQAIVLKELLD 123
>gi|302541237|ref|ZP_07293579.1| putative uroporphyrin-III c-methyltransferase [Streptomyces hygroscopicus
ATCC 53653]
gi|302458855|gb|EFL21948.1| putative uroporphyrin-III c-methyltransferase [Streptomyces himastatinicus
ATCC 53653]
Length=111
Score = 104 bits (259), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 52/109 (48%), Positives = 70/109 (65%), Gaps = 3/109 (2%)
Query 12 VYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFDEFASR 71
+Y+ +PDDG RVLVDRIWP G+ K D R+ W K APS ELR WY H+PER+ EFA R
Sbjct 1 MYDAPEPDDGTRVLVDRIWPRGLAKADARLDEWAKAAAPSTELRRWYGHEPERYAEFARR 60
Query 72 YQEELHDS---AALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
Y+ EL + A+ LR+L G +T +TA + ++ HA +LA + G
Sbjct 61 YRAELAEPEHREAVDHLRELAGHGTLTLLTAVKDLSHGHAGLLADAVRG 109
>gi|291450504|ref|ZP_06589894.1| conserved hypothetical protein [Streptomyces albus J1074]
gi|291353453|gb|EFE80355.1| conserved hypothetical protein [Streptomyces albus J1074]
Length=132
Score = 103 bits (256), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/113 (47%), Positives = 71/113 (63%), Gaps = 4/113 (3%)
Query 8 RVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQP-ERFD 66
RV RVYE +P DG RVLVDR+WP G+ K+ V +W K++ PS ELR+WYH P ER+D
Sbjct 7 RVRRVYEPAEPSDGTRVLVDRLWPRGVSKERAAVDVWLKEITPSTELRDWYHQNPEERYD 66
Query 67 EFASRYQEELHDS---AALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
F RY+ EL D AA+ +L L + VT +TA + + RS VL + L+
Sbjct 67 GFTERYRTELADPAHRAAVGQLLDLMDQGTVTLITAVKEIDRSQIPVLVEHLH 119
>gi|256831489|ref|YP_003160216.1| hypothetical protein Jden_0239 [Jonesia denitrificans DSM 20603]
gi|256685020|gb|ACV07913.1| protein of unknown function DUF488 [Jonesia denitrificans DSM
20603]
Length=120
Score = 102 bits (254), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/111 (46%), Positives = 71/111 (64%), Gaps = 1/111 (0%)
Query 8 RVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFDE 67
R+ RVYE DP+DG RVLVDR+WP G+ K D + +W KD+APS +LR+W+H +P+ FDE
Sbjct 6 RLKRVYEAPDPNDGFRVLVDRLWPRGLSKSDAHIDMWPKDIAPSTQLRQWFHDEPQDFDE 65
Query 68 FASRYQEELHDSAALAELR-KLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
FA RY+ EL +AA+ E+ L VT + A + +HA ++ L
Sbjct 66 FAHRYRRELATNAAVHEVTAALRVYDTVTLLYAAKDTEHNHAVIVRDYLTS 116
>gi|304312379|ref|YP_003811977.1| hypothetical protein HDN1F_27510 [gamma proteobacterium HdN1]
gi|301798112|emb|CBL46334.1| Conserved hypothetical protein [gamma proteobacterium HdN1]
Length=112
Score = 102 bits (253), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/110 (44%), Positives = 72/110 (66%), Gaps = 0/110 (0%)
Query 9 VARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFDEF 68
V R YE + DG+R+LVDR+WP G++K++ + +W KDVAPS ELR+W+ H PE++ EF
Sbjct 2 VKRAYEAPEKTDGERILVDRLWPRGLKKENADIDLWLKDVAPSTELRKWFGHDPEKWAEF 61
Query 69 ASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
RY+ EL + AL ELR+ + + +T V + ++A VL QLL +
Sbjct 62 QKRYKAELKGNTALDELREHSHKGHITLVFSAHDEEHNNAIVLQQLLEKK 111
>gi|148262413|ref|YP_001229119.1| hypothetical protein Gura_0330 [Geobacter uraniireducens Rf4]
gi|146395913|gb|ABQ24546.1| protein of unknown function DUF488 [Geobacter uraniireducens
Rf4]
Length=112
Score = 102 bits (253), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/110 (44%), Positives = 72/110 (66%), Gaps = 1/110 (0%)
Query 7 VRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFD 66
VR+ R+Y+ +P DG+R+L DR+WP G+ K+ R+ W K++APS ELR W+ H P R+D
Sbjct 2 VRIKRMYDPPEPGDGRRLLTDRLWPRGMTKEAARIDDWIKELAPSTELRTWFGHDPARWD 61
Query 67 EFASRYQEELHDSA-ALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLL 115
EF +RY+EEL A LA+LR + +T + A R ++A VL ++L
Sbjct 62 EFKARYKEELQAKADLLAKLRAEAKKGTITLLFAARDTEHNNAVVLKEML 111
>gi|159186149|ref|NP_356259.2| hypothetical protein Atu4395 [Agrobacterium tumefaciens str.
C58]
gi|159141283|gb|AAK89044.2| conserved hypothetical protein [Agrobacterium tumefaciens str.
C58]
Length=116
Score = 101 bits (252), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 47/112 (42%), Positives = 71/112 (64%), Gaps = 1/112 (0%)
Query 7 VRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFD 66
+R+ R+YE DDG RVLVDR+WP G+RK+D ++ +W K++APS ELR+ + H P RF
Sbjct 4 IRIKRIYEPASDDDGTRVLVDRLWPRGMRKEDAKLSLWMKEIAPSNELRKQFSHMPSRFA 63
Query 67 EFASRYQEELH-DSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNG 117
+F Y +EL + A+A +++L + VT + A +HA VLA L+
Sbjct 64 DFTGHYHQELSANPDAVARMKELMAKGRVTLLYAAHDTEHNHALVLADYLHS 115
>gi|39998531|ref|NP_954482.1| hypothetical protein GSU3442 [Geobacter sulfurreducens PCA]
gi|39985478|gb|AAR36832.1| conserved hypothetical protein [Geobacter sulfurreducens PCA]
gi|298507475|gb|ADI86198.1| protein of unknown function DUF488 [Geobacter sulfurreducens
KN400]
Length=114
Score = 101 bits (252), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 52/110 (48%), Positives = 70/110 (64%), Gaps = 1/110 (0%)
Query 7 VRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFD 66
VRV R+Y++ +DG RVLVDR+WP GI KD R+ W K++APS ELR+W+ H P R+D
Sbjct 2 VRVKRIYDEPATEDGTRVLVDRLWPRGIAKDKARIDEWLKEIAPSDELRQWFGHDPARWD 61
Query 67 EFASRYQEELHDSAALAE-LRKLTGRSVVTPVTATRHVARSHAAVLAQLL 115
EF RY+ EL A L + LRKL VT + A + ++A VL +L
Sbjct 62 EFRERYRRELDAKAELLDGLRKLAAGGTVTLLFAAKDEQHNNAVVLKDIL 111
>gi|184199725|ref|YP_001853932.1| hypothetical protein KRH_00790 [Kocuria rhizophila DC2201]
gi|183579955|dbj|BAG28426.1| hypothetical protein [Kocuria rhizophila DC2201]
Length=116
Score = 101 bits (251), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 52/113 (47%), Positives = 73/113 (65%), Gaps = 3/113 (2%)
Query 7 VRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFD 66
V V R+Y++ +DG RVLVDR+WP GIRK D + W K+V+PS ELR+WY H PE+F+
Sbjct 3 VVVKRIYDEPAQNDGTRVLVDRVWPRGIRKADAHLDDWNKEVSPSTELRKWYGHDPEKFE 62
Query 67 EFASRYQEELHDS---AALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLN 116
EF+SRY++EL L +LR +T +TA++ V S A VL +L+
Sbjct 63 EFSSRYRKELESGEGKEGLQKLRDSVKGKRLTLLTASKAVDISQATVLKTILS 115
>gi|119718863|ref|YP_925828.1| hypothetical protein Noca_4646 [Nocardioides sp. JS614]
gi|119539524|gb|ABL84141.1| protein of unknown function DUF488 [Nocardioides sp. JS614]
Length=111
Score = 101 bits (251), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 56/111 (51%), Positives = 65/111 (59%), Gaps = 1/111 (0%)
Query 7 VRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFD 66
+R+ RVYED DG R+LVDRIWP G+ K D V W KDV PS ELR W+ H P RF
Sbjct 1 MRIKRVYEDPAKADGHRILVDRIWPRGLTKADAAVDEWLKDVGPSTELRRWFGHDPARFG 60
Query 67 EFASRYQEELHDSAALAELRKLTGR-SVVTPVTATRHVARSHAAVLAQLLN 116
EFA RY+ EL S A A LR + VVT V + R + A VL LL
Sbjct 61 EFARRYRAELDGSDAFARLRAVRDEHRVVTLVYSARDTEHNQAVVLRDLLG 111
>gi|325123337|gb|ADY82860.1| putative uroporphyrin-III C-methyltransferase [Acinetobacter
calcoaceticus PHEA-2]
Length=118
Score = 101 bits (251), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 41/110 (38%), Positives = 75/110 (69%), Gaps = 1/110 (0%)
Query 7 VRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYH-HQPERF 65
+ + R+YE+ +P DG+R+LVDR+W GI K++ + +W K++APS ELR+W+H P+ +
Sbjct 3 IHIKRIYENAEPSDGKRILVDRLWSRGISKENAHLDLWLKEIAPSTELRKWFHAATPDHW 62
Query 66 DEFASRYQEELHDSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLL 115
EF RY +EL ++A+ EL+++ ++ +T + + + V +HA +L + L
Sbjct 63 QEFKQRYLKELETNSAVQELQQIALKNTITLLYSAKDVENNHAIILKEYL 112
>gi|258645758|ref|ZP_05733227.1| transcriptional regulator, MarR family [Dialister invisus DSM
15470]
gi|260403128|gb|EEW96675.1| transcriptional regulator, MarR family [Dialister invisus DSM
15470]
Length=120
Score = 101 bits (251), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 48/118 (41%), Positives = 73/118 (62%), Gaps = 4/118 (3%)
Query 5 TRVRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPER 64
R+ + RVYE PDDG+R+LVDR+WP G+ K+ + IW KDVAPS LR+W+ H P++
Sbjct 1 MRICIKRVYEAASPDDGERILVDRLWPRGLSKEKAAIDIWEKDVAPSAALRKWFGHDPDK 60
Query 65 FDEFASRYQEELHDSAALAELRK----LTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
FD+F ++Y++EL D+ A+ L L VT + + + AAVL + L+ +
Sbjct 61 FDDFRNKYRKELEDNPAIKRLEDMIHHLGKDKKVTLLFGAKDETHNQAAVLKEYLDSK 118
>gi|298292289|ref|YP_003694228.1| hypothetical protein Snov_2314 [Starkeya novella DSM 506]
gi|296928800|gb|ADH89609.1| protein of unknown function DUF488 [Starkeya novella DSM 506]
Length=127
Score = 100 bits (250), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 50/113 (45%), Positives = 72/113 (64%), Gaps = 1/113 (0%)
Query 7 VRVARVYEDIDPDDGQRVLVDRIWPHGIRKDDQRVGIWCKDVAPSKELREWYHHQPERFD 66
+R+ R Y+ + DDG RVLVDR+WP G+RKDD ++ +W K++APS ELR+W+ H+P RF+
Sbjct 10 LRLKRAYDPVADDDGARVLVDRLWPRGVRKDDLKLTLWLKEIAPSAELRKWFGHEPARFE 69
Query 67 EFASRYQEELH-DSAALAELRKLTGRSVVTPVTATRHVARSHAAVLAQLLNGR 118
EF+ RY+ EL + A+A L +L VT + A + A VL L R
Sbjct 70 EFSRRYRAELDANGDAVARLEELLKHGRVTLLYAAHDATHNDAVVLDAYLRAR 122
Lambda K H
0.321 0.135 0.416
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 129536132814
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40