BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0345
Length=136
Score E
Sequences producing significant alignments: (Bits) Value
gi|15839731|ref|NP_334768.1| hypothetical protein MT0360 [Mycoba... 263 9e-69
gi|308231527|ref|ZP_07412776.2| hypothetical protein TMAG_01604 ... 262 1e-68
gi|15607486|ref|NP_214859.1| hypothetical protein Rv0345 [Mycoba... 262 2e-68
gi|298523822|ref|ZP_07011231.1| conserved hypothetical protein [... 259 1e-67
gi|31791523|ref|NP_854016.1| hypothetical protein Mb0353 [Mycoba... 173 6e-42
gi|296167805|ref|ZP_06849991.1| conserved hypothetical protein [... 155 2e-36
gi|183980650|ref|YP_001848941.1| hypothetical protein MMAR_0624 ... 148 3e-34
gi|118616408|ref|YP_904740.1| hypothetical protein MUL_0589 [Myc... 146 1e-33
gi|240171637|ref|ZP_04750296.1| hypothetical protein MkanA1_2014... 137 5e-31
gi|325675820|ref|ZP_08155504.1| hypothetical protein HMPREF0724_... 108 3e-22
gi|312140559|ref|YP_004007895.1| hypothetical protein REQ_32150 ... 105 2e-21
gi|296140656|ref|YP_003647899.1| hypothetical protein Tpau_2964 ... 98.6 3e-19
gi|326332101|ref|ZP_08198385.1| 4-diphosphocytidyl-2C-methyl-D-e... 94.0 6e-18
gi|315445712|ref|YP_004078591.1| MobA-like protein [Mycobacteriu... 92.8 1e-17
gi|120402668|ref|YP_952497.1| hypothetical protein Mvan_1662 [My... 91.3 4e-17
gi|145225365|ref|YP_001136043.1| hypothetical protein Mflv_4787 ... 90.5 7e-17
gi|118471927|ref|YP_886177.1| hypothetical protein MSMEG_1806 [M... 87.4 6e-16
gi|54027219|ref|YP_121461.1| hypothetical protein nfa52450 [Noca... 86.7 1e-15
gi|333921852|ref|YP_004495433.1| hypothetical protein AS9A_4200 ... 85.5 2e-15
gi|284033811|ref|YP_003383742.1| 4-diphosphocytidyl-2C-methyl-D-... 82.4 2e-14
gi|343926696|ref|ZP_08766194.1| hypothetical protein GOALK_067_0... 80.9 6e-14
gi|229488684|ref|ZP_04382550.1| molybdenum cofactor cytidylyltra... 80.9 7e-14
gi|262201768|ref|YP_003272976.1| hypothetical protein Gbro_1827 ... 79.7 1e-13
gi|108798254|ref|YP_638451.1| hypothetical protein Mmcs_1283 [My... 79.3 2e-13
gi|226308797|ref|YP_002768757.1| hypothetical protein RER_53100 ... 77.0 8e-13
gi|256379133|ref|YP_003102793.1| hypothetical protein Amir_5126 ... 74.7 4e-12
gi|169630020|ref|YP_001703669.1| hypothetical protein MAB_2937c ... 74.3 6e-12
gi|226363831|ref|YP_002781613.1| hypothetical protein ROP_44210 ... 72.4 2e-11
gi|291300030|ref|YP_003511308.1| 4-diphosphocytidyl-2C-methyl-D-... 72.0 3e-11
gi|296130633|ref|YP_003637883.1| hypothetical protein Cfla_2799 ... 70.9 6e-11
gi|126433914|ref|YP_001069605.1| hypothetical protein Mjls_1312 ... 68.9 2e-10
gi|257055349|ref|YP_003133181.1| MobA-like protein [Saccharomono... 68.2 4e-10
gi|171473737|gb|ACB47043.1| hypothetical protein [Micromonospora... 66.6 1e-09
gi|311743477|ref|ZP_07717283.1| conserved hypothetical protein [... 65.9 2e-09
gi|319949563|ref|ZP_08023610.1| purine catabolism protein PucB [... 65.9 2e-09
gi|111021471|ref|YP_704443.1| hypothetical protein RHA1_ro04499 ... 64.7 4e-09
gi|119718195|ref|YP_925160.1| hypothetical protein Noca_3976 [No... 63.9 7e-09
gi|134097846|ref|YP_001103507.1| hypothetical protein SACE_1257 ... 62.8 2e-08
gi|229820181|ref|YP_002881707.1| hypothetical protein Bcav_1689 ... 62.0 3e-08
gi|331699472|ref|YP_004335711.1| hypothetical protein Psed_5731 ... 61.2 5e-08
gi|320007639|gb|ADW02489.1| 4-diphosphocytidyl-2C-methyl-D-eryth... 60.5 8e-08
gi|297559841|ref|YP_003678815.1| 4-diphosphocytidyl-2C-methyl-D-... 58.5 3e-07
gi|317123278|ref|YP_004097390.1| hypothetical protein Intca_0101... 58.2 4e-07
gi|182435089|ref|YP_001822808.1| hypothetical protein SGR_1296 [... 58.2 5e-07
gi|344943388|ref|ZP_08782675.1| 4-diphosphocytidyl-2C-methyl-D-e... 57.8 6e-07
gi|84494872|ref|ZP_00993991.1| hypothetical protein JNB_08739 [J... 57.0 9e-07
gi|116671957|ref|YP_832890.1| molybdenum cofactor cytidylyltrans... 57.0 1e-06
gi|302525099|ref|ZP_07277441.1| YgfJ family molybdenum hydroxyla... 55.8 2e-06
gi|300783786|ref|YP_003764077.1| hypothetical protein AMED_1865 ... 55.5 3e-06
gi|302557443|ref|ZP_07309785.1| 4-diphosphocytidyl-2C-methyl-D-e... 55.1 4e-06
>gi|15839731|ref|NP_334768.1| hypothetical protein MT0360 [Mycobacterium tuberculosis CDC1551]
gi|167968484|ref|ZP_02550761.1| hypothetical protein MtubH3_10796 [Mycobacterium tuberculosis
H37Ra]
gi|253797271|ref|YP_003030272.1| hypothetical protein TBMG_00349 [Mycobacterium tuberculosis KZN
1435]
9 more sequence titles
Length=138
Score = 263 bits (671), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 136/136 (100%), Positives = 136/136 (100%), Gaps = 0/136 (0%)
Query 1 LLPSTVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV 60
LLPSTVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV
Sbjct 3 LLPSTVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV 62
Query 61 TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL 120
TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL
Sbjct 63 TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL 122
Query 121 AGRGRIPAHSARRRGC 136
AGRGRIPAHSARRRGC
Sbjct 123 AGRGRIPAHSARRRGC 138
>gi|308231527|ref|ZP_07412776.2| hypothetical protein TMAG_01604 [Mycobacterium tuberculosis SUMu001]
gi|308369369|ref|ZP_07417522.2| hypothetical protein TMBG_03575 [Mycobacterium tuberculosis SUMu002]
gi|308370380|ref|ZP_07421294.2| hypothetical protein TMCG_03029 [Mycobacterium tuberculosis SUMu003]
20 more sequence titles
Length=150
Score = 262 bits (670), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 136/136 (100%), Positives = 136/136 (100%), Gaps = 0/136 (0%)
Query 1 LLPSTVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV 60
LLPSTVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV
Sbjct 15 LLPSTVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV 74
Query 61 TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL 120
TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL
Sbjct 75 TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL 134
Query 121 AGRGRIPAHSARRRGC 136
AGRGRIPAHSARRRGC
Sbjct 135 AGRGRIPAHSARRRGC 150
>gi|15607486|ref|NP_214859.1| hypothetical protein Rv0345 [Mycobacterium tuberculosis H37Rv]
gi|121636259|ref|YP_976482.1| hypothetical protein BCG_0384 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|148660111|ref|YP_001281634.1| hypothetical protein MRA_0354 [Mycobacterium tuberculosis H37Ra]
41 more sequence titles
Length=136
Score = 262 bits (669), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 135/136 (99%), Positives = 136/136 (100%), Gaps = 0/136 (0%)
Query 1 LLPSTVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV 60
+LPSTVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV
Sbjct 1 MLPSTVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV 60
Query 61 TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL 120
TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL
Sbjct 61 TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL 120
Query 121 AGRGRIPAHSARRRGC 136
AGRGRIPAHSARRRGC
Sbjct 121 AGRGRIPAHSARRRGC 136
>gi|298523822|ref|ZP_07011231.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298493616|gb|EFI28910.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=136
Score = 259 bits (661), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 134/136 (99%), Positives = 135/136 (99%), Gaps = 0/136 (0%)
Query 1 LLPSTVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV 60
+LPSTVVGVLLAAGAGRWYGKPKVLV GWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV
Sbjct 1 MLPSTVVGVLLAAGAGRWYGKPKVLVGGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV 60
Query 61 TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL 120
TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL
Sbjct 61 TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL 120
Query 121 AGRGRIPAHSARRRGC 136
AGRGRIPAHSARRRGC
Sbjct 121 AGRGRIPAHSARRRGC 136
>gi|31791523|ref|NP_854016.1| hypothetical protein Mb0353 [Mycobacterium bovis AF2122/97]
gi|31617109|emb|CAD93216.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
Length=90
Score = 173 bits (439), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 89/90 (99%), Positives = 90/90 (100%), Gaps = 0/90 (0%)
Query 47 LVLGAVEVSAPAGVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVV 106
+VLGAVEVSAPAGVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVV
Sbjct 1 MVLGAVEVSAPAGVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVV 60
Query 107 ARVLGRALVSRSGLAGRGRIPAHSARRRGC 136
ARVLGRALVSRSGLAGRGRIPAHSARRRGC
Sbjct 61 ARVLGRALVSRSGLAGRGRIPAHSARRRGC 90
>gi|296167805|ref|ZP_06849991.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897035|gb|EFG76655.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=186
Score = 155 bits (391), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 90/132 (69%), Positives = 97/132 (74%), Gaps = 5/132 (3%)
Query 7 VGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAITAP 66
VG+LLAAGAGR YGKPKVLVDGWLD AV ALR GGC DV++VLGA V+AP G T + P
Sbjct 13 VGILLAAGAGRRYGKPKVLVDGWLDIAVDALRAGGCADVVVVLGAARVAAPPGATTVMEP 72
Query 67 DWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA----- 121
W+ GLSASVRAGL QADR ADYA LHVIDTPDV VVARVL RA+ S SGLA
Sbjct 73 RWRDGLSASVRAGLRQADRLRADYAALHVIDTPDVGPAVVARVLDRAIASPSGLARASFG 132
Query 122 GRGRIPAHSARR 133
GR P ARR
Sbjct 133 GRPGHPVVVARR 144
>gi|183980650|ref|YP_001848941.1| hypothetical protein MMAR_0624 [Mycobacterium marinum M]
gi|183173976|gb|ACC39086.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=182
Score = 148 bits (373), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 91/118 (78%), Positives = 102/118 (87%), Gaps = 0/118 (0%)
Query 4 STVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAI 63
+ VV VLLAAGAGR YGKPKVLV+GWL+TA+ ALR GGC DV++VLGA + P+GVTA+
Sbjct 5 TRVVAVLLAAGAGRRYGKPKVLVEGWLETALAALRGGGCADVVVVLGAAPAAVPSGVTAV 64
Query 64 TAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA 121
TAPDWQQGLSASVRAGL QADR ADYAVLHVIDTPDV+A VVARV+GRAL S SGLA
Sbjct 65 TAPDWQQGLSASVRAGLVQADRMKADYAVLHVIDTPDVDAAVVARVVGRALGSSSGLA 122
>gi|118616408|ref|YP_904740.1| hypothetical protein MUL_0589 [Mycobacterium ulcerans Agy99]
gi|118568518|gb|ABL03269.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=176
Score = 146 bits (368), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/118 (77%), Positives = 100/118 (85%), Gaps = 0/118 (0%)
Query 4 STVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAI 63
+ VV VLLA GAGR YGKPKVLV+GWL+TAV ALR GGC DV++VLGA + P+GVTA+
Sbjct 8 TRVVAVLLAPGAGRRYGKPKVLVEGWLETAVAALRGGGCADVVVVLGAAPAAVPSGVTAV 67
Query 64 TAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA 121
TAPDWQQGLSASVRAGL QADR ADY VLHVIDTPDV+A VVARV+GRAL S SGLA
Sbjct 68 TAPDWQQGLSASVRAGLVQADRMKADYVVLHVIDTPDVDAAVVARVVGRALGSSSGLA 125
>gi|240171637|ref|ZP_04750296.1| hypothetical protein MkanA1_20143 [Mycobacterium kansasii ATCC
12478]
Length=156
Score = 137 bits (345), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 73/106 (69%), Positives = 86/106 (82%), Gaps = 2/106 (1%)
Query 26 VDGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAITAPDWQQGLSASVRAGLAQADR 85
++GWL+ A+ AL GGC +V++VLGA +V+ P GVTAITA DWQQGLSASVRAGLAQADR
Sbjct 1 MEGWLEAALDALAGGGCTEVVVVLGAAQVTVPPGVTAITAADWQQGLSASVRAGLAQADR 60
Query 86 EHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL--AGRGRIPAH 129
+ADYAVLHV+DTPDV+A VVARV+ RAL SRSGL A G P H
Sbjct 61 MYADYAVLHVVDTPDVDASVVARVVNRALASRSGLARAYFGERPGH 106
>gi|325675820|ref|ZP_08155504.1| hypothetical protein HMPREF0724_13286 [Rhodococcus equi ATCC
33707]
gi|325553791|gb|EGD23469.1| hypothetical protein HMPREF0724_13286 [Rhodococcus equi ATCC
33707]
Length=207
Score = 108 bits (269), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/119 (53%), Positives = 79/119 (67%), Gaps = 3/119 (2%)
Query 6 VVGVLLAAGAGRWYGKPKVLVD--GWLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAI 63
VVG+LLAAGAGR YG PKVL GWL AV ALR GGC V++VLGA + P+G A+
Sbjct 25 VVGLLLAAGAGRRYGYPKVLAHEGGWLRAAVHALRSGGCGRVLVVLGAARAALPSGAEAV 84
Query 64 TAPDWQQGLSASVRAGL-AQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA 121
AP W +G+ S+RAGL A A+ ADY +H++D PDV VVAR + A + SG+A
Sbjct 85 YAPRWAEGMGESLRAGLDAVAEDSDADYVAVHLVDLPDVGPDVVARTIDAATSTPSGMA 143
>gi|312140559|ref|YP_004007895.1| hypothetical protein REQ_32150 [Rhodococcus equi 103S]
gi|311889898|emb|CBH49215.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=207
Score = 105 bits (262), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/119 (53%), Positives = 78/119 (66%), Gaps = 3/119 (2%)
Query 6 VVGVLLAAGAGRWYGKPKVLVD--GWLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAI 63
VVG+LLAAGAGR YG PKVL WL AV ALR GGC V++VLGA + P+G A+
Sbjct 25 VVGLLLAAGAGRRYGYPKVLAHEGQWLRAAVHALRSGGCGRVLVVLGAARAALPSGAEAV 84
Query 64 TAPDWQQGLSASVRAGL-AQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA 121
AP W +G+ S+RAGL A A+ ADY +H++D PDV VVAR + A + SG+A
Sbjct 85 YAPRWAEGMGESLRAGLDAVAEDSDADYVAVHLVDLPDVGPDVVARTIDAATSTPSGMA 143
>gi|296140656|ref|YP_003647899.1| hypothetical protein Tpau_2964 [Tsukamurella paurometabola DSM
20162]
gi|296028790|gb|ADG79560.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=196
Score = 98.6 bits (244), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/117 (53%), Positives = 74/117 (64%), Gaps = 6/117 (5%)
Query 7 VGVLLAAGAGRWYGKPKVLVDG--WLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAIT 64
GVLLAAGAG +G PKV DG W D+AV ALR GGC V++VLGA + AG +
Sbjct 15 TGVLLAAGAGSRFGMPKVRADGGRWADSAVRALRAGGCGSVLVVLGADPSARIAGARTVL 74
Query 65 APDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA 121
APDWQ GLS SV AGL +AD VL +DTPDVNA+ V RV+ + SG+A
Sbjct 75 APDWQVGLSRSVAAGLTRAD----GAVVLMPVDTPDVNAECVRRVIAAGCCASSGIA 127
>gi|326332101|ref|ZP_08198385.1| 4-diphosphocytidyl-2C-methyl-D-erythritol synthase [Nocardioidaceae
bacterium Broad-1]
gi|325950072|gb|EGD42128.1| 4-diphosphocytidyl-2C-methyl-D-erythritol synthase [Nocardioidaceae
bacterium Broad-1]
Length=182
Score = 94.0 bits (232), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 56/110 (51%), Positives = 72/110 (66%), Gaps = 5/110 (4%)
Query 5 TVVGVLLAAGAGRWYGKPKVLVDGWLDTAVGALRDGGCNDVILVLGAVEVSA----PAGV 60
+ G+LLAAGAGR GKPK LVD WL ++G LR+GGC+DV++VLGA A PA
Sbjct 1 MISGLLLAAGAGRRMGKPKALVDDWLVRSIGVLREGGCDDVLVVLGASAEEARALLPADQ 60
Query 61 TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVL 110
+ A DW +G+ AS+R GL A AV+H++D PDV A VVARV+
Sbjct 61 RVVVAEDWDEGMGASLRVGL-DALGPDVGAAVVHLVDLPDVGADVVARVV 109
>gi|315445712|ref|YP_004078591.1| MobA-like protein [Mycobacterium sp. Spyr1]
gi|315264015|gb|ADU00757.1| uncharacterized MobA-like protein [Mycobacterium sp. Spyr1]
Length=181
Score = 92.8 bits (229), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/136 (52%), Positives = 84/136 (62%), Gaps = 9/136 (6%)
Query 5 TVVGVLLAAGAGRWYGKPKVLVDG--WLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTA 62
+ +GV+LAAGAG +G PKVL D WL +V AL DGGC+DV++VLGA V P+ A
Sbjct 6 STIGVVLAAGAGTRFGMPKVLADAGEWLRLSVAALSDGGCDDVVVVLGAAVVDVPSPARA 65
Query 63 ITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA- 121
+ A DW GLSASVRAG+ A AD+ V+ +DTPDV A V RVL A S SGLA
Sbjct 66 VVADDWDAGLSASVRAGIQAA--SEADFVVVTTVDTPDVGAAAVQRVLSAAQKSASGLAR 123
Query 122 ----GRGRIPAHSARR 133
GR P ARR
Sbjct 124 ALYGGRPGHPVVIARR 139
>gi|120402668|ref|YP_952497.1| hypothetical protein Mvan_1662 [Mycobacterium vanbaalenii PYR-1]
gi|119955486|gb|ABM12491.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=181
Score = 91.3 bits (225), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 72/134 (54%), Positives = 83/134 (62%), Gaps = 8/134 (5%)
Query 8 GVLLAAGAGRWYGKPKVLV-DG-WLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAITA 65
GV+LAAGAG +G PKVL DG WL +V AL DGGC DV++VLGA V PA A+
Sbjct 8 GVVLAAGAGTRFGMPKVLGGDGDWLRRSVAALHDGGCGDVVVVLGAAIVDVPAPARAVVT 67
Query 66 PDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA---- 121
DW GLSASVRAG+ +A + AD+ VL +DTPDV A V RVL A S SGLA
Sbjct 68 ADWADGLSASVRAGV-RAAGDAADFVVLTTVDTPDVGAAAVRRVLAAARASTSGLARAYY 126
Query 122 -GRGRIPAHSARRR 134
GR P ARR
Sbjct 127 DGRPGHPVVIARRH 140
>gi|145225365|ref|YP_001136043.1| hypothetical protein Mflv_4787 [Mycobacterium gilvum PYR-GCK]
gi|145217851|gb|ABP47255.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=175
Score = 90.5 bits (223), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 65/117 (56%), Positives = 78/117 (67%), Gaps = 4/117 (3%)
Query 6 VVGVLLAAGAGRWYGKPKVLVDG--WLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAI 63
+ GV+LAAGAG +G PKVL DG WL +V A+ DGGC+DV++VLGA V PA A+
Sbjct 1 MTGVVLAAGAGTRFGMPKVLADGGEWLRRSVAAVSDGGCDDVVVVLGAAVVDVPAPARAV 60
Query 64 TAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGL 120
A DW+ GLSASVRAG+ A AD+ VL +DTPDV A V RVL A S SGL
Sbjct 61 VADDWRDGLSASVRAGVRAA--SGADFVVLTTVDTPDVGAAAVRRVLTAARESASGL 115
>gi|118471927|ref|YP_886177.1| hypothetical protein MSMEG_1806 [Mycobacterium smegmatis str.
MC2 155]
gi|118173214|gb|ABK74110.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=182
Score = 87.4 bits (215), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 64/137 (47%), Positives = 81/137 (60%), Gaps = 5/137 (3%)
Query 3 PSTVVGVLLAAGAGRWYGKPKVLV--DGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGV 60
P V GV+LAAGAG +G PKVL WL TAV +L +GGC V++VLGA V P
Sbjct 4 PPVVAGVVLAAGAGTRFGMPKVLAAEGDWLKTAVRSLVEGGCAHVVVVLGAAVVDVPQPA 63
Query 61 TAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARV---LGRALVSR 117
A+ A DW GLSAS+RAG+ A R AD V ++DTPD+ V+ARV G + ++R
Sbjct 64 QAVVATDWSDGLSASLRAGVGAAARTGADLIVFRLVDTPDIGGDVIARVAAAAGESGLAR 123
Query 118 SGLAGRGRIPAHSARRR 134
+ GR P ARR
Sbjct 124 ATYDGRPGHPVVIARRH 140
>gi|54027219|ref|YP_121461.1| hypothetical protein nfa52450 [Nocardia farcinica IFM 10152]
gi|54018727|dbj|BAD60097.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=188
Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/122 (53%), Positives = 80/122 (66%), Gaps = 9/122 (7%)
Query 7 VGVLLAAGAGRWYGKPKVLVDG--WLDTAVGALRDGGCNDVILVLGAV-----EVSAPAG 59
VG++LAAGAG YG+PK L +G WL +A+ AL GGC V++VLGA + P
Sbjct 6 VGIVLAAGAGTRYGRPKALAEGGAWLRSAIAALHGGGCERVVVVLGATGPHPHALDLPPE 65
Query 60 VTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSG 119
T + APDW +GLSAS+RAGL A DYAV+ +DTPDV A VVARV+ AL + SG
Sbjct 66 ATPVWAPDWARGLSASLRAGLRAA--AGGDYAVIMPVDTPDVGAAVVARVVEAALHAPSG 123
Query 120 LA 121
LA
Sbjct 124 LA 125
>gi|333921852|ref|YP_004495433.1| hypothetical protein AS9A_4200 [Amycolicicoccus subflavus DQS3-9A1]
gi|333484073|gb|AEF42633.1| hypothetical protein AS9A_4200 [Amycolicicoccus subflavus DQS3-9A1]
Length=206
Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/131 (46%), Positives = 75/131 (58%), Gaps = 10/131 (7%)
Query 6 VVGVLLAAGAGRWYGKPKVLVDG--WLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAI 63
VVG++LAAG+GR YG+PK+L G WL +AV AL DGGC V + GA PA +
Sbjct 27 VVGIVLAAGSGRRYGRPKILAAGGRWLKSAVNALADGGCAAVWVTTGAARPPIPAPALEL 86
Query 64 TAPDWQQGLSASVRAGLAQADREHADYA-VLHVIDTPDVNAKVVARVL----GRALVSRS 118
P W GLS SVRA + A + VLH++DTPDV A VV R+L G ++R
Sbjct 87 YVPKWFHGLSESVRAAVDAASTAGTCHTLVLHIVDTPDVGADVVTRLLAGHSGDHTLTRV 146
Query 119 GLAGRGRIPAH 129
AG +P H
Sbjct 147 SFAG---VPGH 154
>gi|284033811|ref|YP_003383742.1| 4-diphosphocytidyl-2C-methyl-D-erythritolsynthase [Kribbella
flavida DSM 17836]
gi|283813104|gb|ADB34943.1| 4-diphosphocytidyl-2C-methyl-D-erythritolsynthase [Kribbella
flavida DSM 17836]
Length=191
Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/150 (42%), Positives = 81/150 (54%), Gaps = 21/150 (14%)
Query 6 VVGVLLAAGAGRWYGKPKVLV---DG--WLDTAVGALRDGGCNDVILVLGA-----VEVS 55
+VGVLLAAGAG GKP LV DG W+ +AVG LR GC + +V+GA V +
Sbjct 4 LVGVLLAAGAGTRLGKPGGLVRAADGTPWVVSAVGVLRAAGCGPIGVVVGAAAPEVVALL 63
Query 56 APAGVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRA-- 113
A VT + +P W GLS SVRA A A+ +A AV H++D P+V V+ RV+ A
Sbjct 64 ADEDVTIVPSPSWSDGLSHSVRAAFAWAEETNAPAAVFHLVDHPEVGIAVLQRVIATAYQ 123
Query 114 ---------LVSRSGLAGRGRIPAHSARRR 134
L++R+G GR P RR
Sbjct 124 NGFPDDPASLLARAGFDGRPGYPVLIGRRH 153
>gi|343926696|ref|ZP_08766194.1| hypothetical protein GOALK_067_00790 [Gordonia alkanivorans NBRC
16433]
gi|343763448|dbj|GAA13120.1| hypothetical protein GOALK_067_00790 [Gordonia alkanivorans NBRC
16433]
Length=190
Score = 80.9 bits (198), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 55/108 (51%), Positives = 68/108 (63%), Gaps = 3/108 (2%)
Query 6 VVGVLLAAGAGRWYGKPKVLV-DG-WLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAI 63
V+GV+LAAGAG YG PK+L DG WL + ALR GGC D+ + +GA V+ P GVTA+
Sbjct 13 VLGVVLAAGAGSRYGMPKILAHDGVWLKASTNALRAGGCADIAVAMGAAIVAPPVGVTAL 72
Query 64 TAPDWQQGLSASVRAGLAQADRE-HADYAVLHVIDTPDVNAKVVARVL 110
DW GL ASV A L A R VL V+DTPDV +VV R++
Sbjct 73 VVDDWADGLGASVSAALGWASRRPGVGGVVLTVVDTPDVGPEVVRRIV 120
>gi|229488684|ref|ZP_04382550.1| molybdenum cofactor cytidylyltransferase [Rhodococcus erythropolis
SK121]
gi|229324188|gb|EEN89943.1| molybdenum cofactor cytidylyltransferase [Rhodococcus erythropolis
SK121]
Length=199
Score = 80.9 bits (198), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/126 (43%), Positives = 70/126 (56%), Gaps = 10/126 (7%)
Query 5 TVVGVLLAAGAGRWYGKPKVLV-----DGWLDTAVGALRDGGCNDVILVLGAVEVSA--- 56
+ G+LLAAGAG GKPK LV WL V L+ G + VI+VLGA A
Sbjct 13 STCGILLAAGAGSRMGKPKALVTNTEGQSWLHHGVTTLQSAGLSPVIVVLGAQAEDALEL 72
Query 57 -PAGVTAITAPDWQQGLSASVRAGLAQADR-EHADYAVLHVIDTPDVNAKVVARVLGRAL 114
P + DWQ G+SAS+R GL A R + AD A + ++D PD+NA V R++G +
Sbjct 73 LPTTDVIVVISDWQHGMSASLRCGLEAAMRIDDADAAAISLVDVPDLNADTVTRIVGHSS 132
Query 115 VSRSGL 120
SR+ L
Sbjct 133 PSRNIL 138
>gi|262201768|ref|YP_003272976.1| hypothetical protein Gbro_1827 [Gordonia bronchialis DSM 43247]
gi|262085115|gb|ACY21083.1| conserved hypothetical protein [Gordonia bronchialis DSM 43247]
Length=195
Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/108 (50%), Positives = 71/108 (66%), Gaps = 3/108 (2%)
Query 6 VVGVLLAAGAGRWYGKPKVLVD--GWLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAI 63
V+GV+LAAGAG +G PK+L WL +AV AL DGGC++V++ +GA V+ P G +A+
Sbjct 18 VLGVVLAAGAGSRFGMPKILAHQGDWLRSAVAALHDGGCDEVVVAMGAAAVAVPRGASAL 77
Query 64 TAPDWQQGLSASVRAGLAQA-DREHADYAVLHVIDTPDVNAKVVARVL 110
+W GLSASV A + A R + +L V+D PDV A VVARVL
Sbjct 78 VVENWADGLSASVSAAIGAARQRAVSANVLLQVVDMPDVGADVVARVL 125
>gi|108798254|ref|YP_638451.1| hypothetical protein Mmcs_1283 [Mycobacterium sp. MCS]
gi|119867350|ref|YP_937302.1| hypothetical protein Mkms_1300 [Mycobacterium sp. KMS]
gi|108768673|gb|ABG07395.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119693439|gb|ABL90512.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=181
Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/119 (53%), Positives = 77/119 (65%), Gaps = 4/119 (3%)
Query 5 TVVGVLLAAGAGRWYGKPKVLVDG--WLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTA 62
+V GVLLAAGAG +G PKVL WL AV AL GGC DV++VLGA V+ P A
Sbjct 6 SVTGVLLAAGAGTRFGMPKVLAHQGEWLRLAVDALIRGGCGDVVVVLGAAVVAVPPPARA 65
Query 63 ITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA 121
+ A DW GLSASVR G++ A A+ VLH +DTPDV A+VV+RV+ A + GLA
Sbjct 66 VVAQDWSDGLSASVRTGISAAGA--AEAVVLHTVDTPDVGAEVVSRVIEAARTADGGLA 122
>gi|226308797|ref|YP_002768757.1| hypothetical protein RER_53100 [Rhodococcus erythropolis PR4]
gi|226187914|dbj|BAH36018.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=188
Score = 77.0 bits (188), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 51/126 (41%), Positives = 70/126 (56%), Gaps = 10/126 (7%)
Query 5 TVVGVLLAAGAGRWYGKPKVLVDG-----WLDTAVGALRDGGCNDVILVLGAVEVSA--- 56
+ G+LLAAGAG GKPK LV G WL V L+ G + VI+VLGA A
Sbjct 2 STCGILLAAGAGSRMGKPKALVTGTDGQPWLRHGVATLQSAGLSPVIVVLGAQAEDAVEL 61
Query 57 -PAGVTAITAPDWQQGLSASVRAGLAQADR-EHADYAVLHVIDTPDVNAKVVARVLGRAL 114
P + DW++G+SAS+R GL A E A+ A + ++D PD+N V R+ G+ +
Sbjct 62 LPTTDVIVVKSDWERGMSASLRCGLEAAMHIEGAEAAAISLVDVPDLNTDTVTRIAGQDI 121
Query 115 VSRSGL 120
SR+ L
Sbjct 122 PSRNIL 127
>gi|256379133|ref|YP_003102793.1| hypothetical protein Amir_5126 [Actinosynnema mirum DSM 43827]
gi|255923436|gb|ACU38947.1| conserved hypothetical protein [Actinosynnema mirum DSM 43827]
Length=184
Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 61/136 (45%), Positives = 79/136 (59%), Gaps = 17/136 (12%)
Query 6 VVGVLLAAGAGRWYGKPKVLVDG----WLDTAVGALRDGGCNDVILVLGA----VEVSAP 57
V G+LLAAGAGR YG+PK LV W++TA G LR GC+ V++VLGA V +A
Sbjct 3 VTGLLLAAGAGRRYGRPKALVSQGGALWVETACGVLRAAGCDRVVVVLGASSARVRATAS 62
Query 58 AG-VTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVL---GRA 113
G + DW G+ +S+R GLA A A AVL V DTP V A VAR+L A
Sbjct 63 LGDAVVVDNADWSTGVGSSLRVGLAAAGDRDA-VAVLPV-DTPGVTADAVARLLVLASPA 120
Query 114 LVSRSGLAGRGRIPAH 129
+++R+ AG +P H
Sbjct 121 VLARACYAG---VPGH 133
>gi|169630020|ref|YP_001703669.1| hypothetical protein MAB_2937c [Mycobacterium abscessus ATCC
19977]
gi|169241987|emb|CAM63015.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=188
Score = 74.3 bits (181), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 53/132 (41%), Positives = 70/132 (54%), Gaps = 8/132 (6%)
Query 6 VVGVLLAAGAGRWYGKPKVLVDG--WLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTAI 63
VVG +LAAGAG YG PK+L WL+ AV AL GGC +V + GAV V P+ I
Sbjct 10 VVGAVLAAGAGNRYGMPKILAANGRWLELAVTALDRGGCEEVYVTQGAVSVEMPSPAHGI 69
Query 64 TAPDWQQGLSASVRAGLAQAD-REHADYAVLHVIDTPDVNAKVVARVL-----GRALVSR 117
W+ G+S SVRA L R ++H++D P V +VV VL R+ + R
Sbjct 70 EVARWRDGVSESVRAVLELVHARTDVVGVLIHLVDLPSVGPEVVRMVLRASGGRRSALVR 129
Query 118 SGLAGRGRIPAH 129
+ AGR P +
Sbjct 130 ATFAGRPGHPVY 141
>gi|226363831|ref|YP_002781613.1| hypothetical protein ROP_44210 [Rhodococcus opacus B4]
gi|226242320|dbj|BAH52668.1| hypothetical protein [Rhodococcus opacus B4]
Length=193
Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/117 (47%), Positives = 65/117 (56%), Gaps = 12/117 (10%)
Query 5 TVVGVLLAAGAGRWYGKPKVLVDG-----WLDTAVGALRDGGCNDVILVLGA----VEVS 55
T+ GVLLAAGAG G PK LV G WL+ V L D GC ++VLGA EV
Sbjct 2 TIGGVLLAAGAGSRMGMPKALVVGADGQPWLERGVRVLADAGCEPTVVVLGARADEAEVL 61
Query 56 APAG--VTAITAPDWQQGLSASVRAGLA-QADREHADYAVLHVIDTPDVNAKVVARV 109
P G VT A DWQ GLSAS+R GL A + + V+ ++D PD+ A V RV
Sbjct 62 LPDGVPVTVGIAEDWQSGLSASLRRGLEVAATFDDVEAVVITLVDLPDLGADAVRRV 118
>gi|291300030|ref|YP_003511308.1| 4-diphosphocytidyl-2C-methyl-D-erythritol synthase [Stackebrandtia
nassauensis DSM 44728]
gi|290569250|gb|ADD42215.1| 4-diphosphocytidyl-2C-methyl-D-erythritol synthase [Stackebrandtia
nassauensis DSM 44728]
Length=193
Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/113 (43%), Positives = 64/113 (57%), Gaps = 9/113 (7%)
Query 5 TVVGVLLAAGAGRWYGKPKVLVD----GWLDTAVGALRDGGCNDVILVLG----AVEVSA 56
+V G++LAAGAG +G+PK LV+ LD AV LR+GGC V VLG A +
Sbjct 2 SVAGLVLAAGAGTRFGRPKALVEYRGATLLDRAVTILREGGCETVYGVLGISAYAARARS 61
Query 57 PAGVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARV 109
T + P W G+ +S+RAGLA DREH VL ++D P ++ V RV
Sbjct 62 ATAFTPVYNPRWHTGMGSSLRAGLAALDREHEGVVVL-LVDQPGISPVSVRRV 113
>gi|296130633|ref|YP_003637883.1| hypothetical protein Cfla_2799 [Cellulomonas flavigena DSM 20109]
gi|296022448|gb|ADG75684.1| conserved hypothetical protein [Cellulomonas flavigena DSM 20109]
Length=189
Score = 70.9 bits (172), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 47/117 (41%), Positives = 63/117 (54%), Gaps = 9/117 (7%)
Query 2 LPSTVVGVLLAAGAGRWYGKPKVLVDG-----WLDTAVGALRDGGCNDVILVLGAVEVSA 56
+P ++G++LAAGAG G+PK L WL A AL GGC V +VLGA A
Sbjct 1 MPERLLGIVLAAGAGTRMGRPKALCSTPAGVPWLVRAYDALTGGGCGQVRVVLGAAADEA 60
Query 57 ----PAGVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARV 109
P G + A DW G+ AS+ AGLA V+ ++D PD++A+ VARV
Sbjct 61 RALVPPGAVTVVADDWANGMGASLAAGLADPVDPRTVAVVVTLVDLPDLDARAVARV 117
>gi|126433914|ref|YP_001069605.1| hypothetical protein Mjls_1312 [Mycobacterium sp. JLS]
gi|126233714|gb|ABN97114.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=181
Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/136 (50%), Positives = 82/136 (61%), Gaps = 9/136 (6%)
Query 5 TVVGVLLAAGAGRWYGKPKVLVDG--WLDTAVGALRDGGCNDVILVLGAVEVSAPAGVTA 62
+V GVLLAAGAG +G PKVL WL AV AL GGC DV++VLGA V+ P A
Sbjct 6 SVTGVLLAAGAGTRFGMPKVLAHQGEWLRLAVDALIRGGCGDVVVVLGAAVVAVPPPARA 65
Query 63 ITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA- 121
+ A DW GLSASVR G++ A A+ LH +DTPDV A+VV+RV+ A + G+A
Sbjct 66 VVAQDWSDGLSASVRTGVSAA--GAAEAVALHTVDTPDVGAEVVSRVIEAARTADGGVAR 123
Query 122 ----GRGRIPAHSARR 133
GR P ARR
Sbjct 124 AVYHGRPGHPVVVARR 139
>gi|257055349|ref|YP_003133181.1| MobA-like protein [Saccharomonospora viridis DSM 43017]
gi|256585221|gb|ACU96354.1| uncharacterized MobA-like protein [Saccharomonospora viridis
DSM 43017]
Length=194
Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/130 (38%), Positives = 70/130 (54%), Gaps = 13/130 (10%)
Query 17 RWYGKPKVLVD----GWLDTAVGALRDGGCNDVILVLGA----VEVSAPAGVTAITAPDW 68
R +G+PK LV+ + AV L GGC+ +++VLGA V P V + APDW
Sbjct 14 RRFGRPKALVEVGGEALVTRAVRVLAHGGCDPIVVVLGARAEDVRTLLPGRVITVYAPDW 73
Query 69 QQGLSASVRAGLAQAD--REHADYAVLHVIDTPDVNAKVVARVLGRA---LVSRSGLAGR 123
++G+ AS+RAGL D ++H++D P V A VVAR+L +V+R+G GR
Sbjct 74 EEGMGASLRAGLRTLSDLTPLPDAVLVHLVDLPSVGADVVARLLELTTPDVVARAGYGGR 133
Query 124 GRIPAHSARR 133
P RR
Sbjct 134 MGHPVLFGRR 143
>gi|171473737|gb|ACB47043.1| hypothetical protein [Micromonospora chersina]
Length=195
Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/112 (45%), Positives = 61/112 (55%), Gaps = 10/112 (8%)
Query 8 GVLLAAGAGRWYGKPKVLV---DGWL--DTAVGALRDGGCNDVILVLGAVEVSAPA---- 58
GVLLAAGAGR YG PK L DG L + A G L GGC ++VLGA A
Sbjct 7 GVLLAAGAGRRYGGPKALARHADGRLLVERAAGMLSGGGCAPTVVVLGAAAAEVRARADL 66
Query 59 -GVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARV 109
G + P+W G+ +S+RAGLA R A AV+ ++D P V A V RV
Sbjct 67 PGALLVDNPEWAGGMGSSLRAGLAALARTDAPAAVVLLVDLPGVTAAAVRRV 118
>gi|311743477|ref|ZP_07717283.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
gi|311312607|gb|EFQ82518.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
Length=194
Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/104 (45%), Positives = 61/104 (59%), Gaps = 9/104 (8%)
Query 19 YGKPKVLVDG-----WLDTAVGALRDGGCNDVILVLGAV--EVSAPAGVTA--ITAPDWQ 69
G PK LV WL +++ ALRDGGC DV++VLGA E G +A + A W
Sbjct 16 MGVPKALVTDEVRGPWLTSSILALRDGGCEDVVVVLGAAADEGRRLVGESARVVVADGWA 75
Query 70 QGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRA 113
G++ S+ AGL A+ A+ AV+H++D PDV VVARVL A
Sbjct 76 GGMAVSLAAGLRVAEDTGAESAVVHLVDLPDVGHDVVARVLAHA 119
>gi|319949563|ref|ZP_08023610.1| purine catabolism protein PucB [Dietzia cinnamea P4]
gi|319436781|gb|EFV91854.1| purine catabolism protein PucB [Dietzia cinnamea P4]
Length=177
Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/79 (49%), Positives = 55/79 (70%), Gaps = 5/79 (6%)
Query 6 VVGVLLAAGAGRWYGKPK--VLVDG--WLDTAVGALRDGGCNDVILVLGAVEVSAPAGVT 61
+ G++LAAGAGR +G+PK V DG +D AV LR+GGC+ V++V GAVE++ P G
Sbjct 1 MTGLVLAAGAGRRFGRPKAPVEFDGTRLVDRAVALLREGGCDRVVVVSGAVELAVP-GAE 59
Query 62 AITAPDWQQGLSASVRAGL 80
+ P W+ G+ +S+RAGL
Sbjct 60 VVPNPLWETGMGSSLRAGL 78
>gi|111021471|ref|YP_704443.1| hypothetical protein RHA1_ro04499 [Rhodococcus jostii RHA1]
gi|110821001|gb|ABG96285.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=193
Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 57/143 (40%), Positives = 70/143 (49%), Gaps = 21/143 (14%)
Query 5 TVVGVLLAAGAGRWYGKPKVLVDG-----WLDTAVGALRDGGCNDVILVLGAVEVSAP-- 57
T+ GVLLAAGAG G PK LV G WL V L GC ++VLGA A
Sbjct 2 TIGGVLLAAGAGSRMGMPKSLVVGADGEPWLARGVRVLSAAGCEPTVVVLGARAEEAEDL 61
Query 58 ----AGVTAITAPDWQQGLSASVRAGLA-QADREHADYAVLHVIDTPDVNAKVVARVLG- 111
A VT A DWQ GLSAS+R GL A + D V+ ++D PD+ + RV
Sbjct 62 LPDGAPVTVGIAEDWQSGLSASLRRGLEVAATFDDVDAVVITLVDLPDLGVDAIRRVASV 121
Query 112 -----RALVSRSGLAGRGRIPAH 129
RA + ++ AGR P H
Sbjct 122 PPGDARAALRQAHYAGR---PGH 141
>gi|119718195|ref|YP_925160.1| hypothetical protein Noca_3976 [Nocardioides sp. JS614]
gi|119538856|gb|ABL83473.1| conserved hypothetical protein [Nocardioides sp. JS614]
Length=185
Score = 63.9 bits (154), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 56/113 (50%), Positives = 69/113 (62%), Gaps = 10/113 (8%)
Query 8 GVLLAAGAGRWYGKPKVLVDG-----WLDTAVGALRDGGCNDVILVLGAV-EVSAP---- 57
G+LLAAGAG G+PK LV G WL V AL +GGC V +VLGA E + P
Sbjct 3 GLLLAAGAGTRMGRPKALVRGADGEPWLVRGVRALAEGGCTQVTVVLGAAAEDALPLLEG 62
Query 58 AGVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVL 110
G TA+ A DW G+SAS+ AGLA A AD V+ ++D PDV A+VV R+L
Sbjct 63 TGATAVVAADWADGMSASLHAGLAAALATDADAVVVTLVDLPDVGAEVVRRLL 115
>gi|134097846|ref|YP_001103507.1| hypothetical protein SACE_1257 [Saccharopolyspora erythraea NRRL
2338]
gi|133910469|emb|CAM00582.1| hypothetical protein SACE_1257 [Saccharopolyspora erythraea NRRL
2338]
Length=195
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/125 (39%), Positives = 67/125 (54%), Gaps = 13/125 (10%)
Query 6 VVGVLLAAGAGRWYGKPKVLVDG----WLDTAVGALRDGGCNDVILVLGAV-----EVSA 56
V GV+LAAGAGR +G PK LV+ +++ A L +GGC V++VLGA E +
Sbjct 5 VAGVVLAAGAGRRFGMPKALVEHRGELFVERAARVLAEGGCAPVVVVLGAAADTVQERAD 64
Query 57 PAGVTAITAPDWQQGLSASVRAGL-AQADREHADY---AVLHVIDTPDVNAKVVARVLGR 112
GVT + PDW G+ +S+R L A A AD A++ +D P + V RV
Sbjct 65 LTGVTVVVNPDWSTGMGSSLRVALDALARTTSADSVSAALITPVDMPGIGPSAVRRVAAH 124
Query 113 ALVSR 117
A +R
Sbjct 125 ASHAR 129
>gi|229820181|ref|YP_002881707.1| hypothetical protein Bcav_1689 [Beutenbergia cavernae DSM 12333]
gi|229566094|gb|ACQ79945.1| conserved hypothetical protein [Beutenbergia cavernae DSM 12333]
Length=225
Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/93 (42%), Positives = 55/93 (60%), Gaps = 9/93 (9%)
Query 6 VVGVLLAAGAGRWYGKPKVLV-----DGWLDTAVGALRDGGCNDVILVLGAVEVSAP--- 57
+ G++LAAGAGR G PK L+ + W+ A LRD GC+ V++ +GA
Sbjct 9 LAGLVLAAGAGRRMGGPKALLTTASGETWVRRAARMLRDAGCDPVLVTVGAAADDVVAAL 68
Query 58 -AGVTAITAPDWQQGLSASVRAGLAQADREHAD 89
+ VT + PDW++GL A VRAGLA+ D E +D
Sbjct 69 DSDVTVVRVPDWEEGLGAGVRAGLARIDPEASD 101
>gi|331699472|ref|YP_004335711.1| hypothetical protein Psed_5731 [Pseudonocardia dioxanivorans
CB1190]
gi|326954161|gb|AEA27858.1| hypothetical protein Psed_5731 [Pseudonocardia dioxanivorans
CB1190]
Length=198
Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 49/123 (40%), Positives = 68/123 (56%), Gaps = 12/123 (9%)
Query 17 RWYGKPKVLV--DG--WLDTAVGALRDGGCNDVILVLGA----VEVSAPAGVTAITAPDW 68
R G PK LV DG + AV LR GC +++V+GA V + PA VTA+ A DW
Sbjct 13 RRMGGPKALVRLDGEPLVLRAVEVLRAAGCAPLVVVVGAAADEVRLLLPADVTAVEAVDW 72
Query 69 QQGLSASVRAGLAQADRE-HADYAVLHVIDTPDVNAKVVAR---VLGRALVSRSGLAGRG 124
+G+ AS+RAGLA E D AV+H++D P V A + R + G +++R+ GR
Sbjct 73 AEGMGASLRAGLATLREEPDVDAAVVHLVDLPGVTAAAIGRLSALAGPDVLARASYGGRA 132
Query 125 RIP 127
P
Sbjct 133 GHP 135
>gi|320007639|gb|ADW02489.1| 4-diphosphocytidyl-2C-methyl-D-erythritol synthase [Streptomyces
flavogriseus ATCC 33331]
Length=193
Score = 60.5 bits (145), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 42/109 (39%), Positives = 62/109 (57%), Gaps = 10/109 (9%)
Query 22 PKVLVDG----WLDTAVGALRDGGCNDVILVLGAV-----EVSAPAGVTAITAPDWQQGL 72
PK L++ ++ AV ALR+GGC V +VLGA E++ + P W++G+
Sbjct 19 PKALLEHRGRPLVEHAVRALRNGGCGPVHVVLGAAAEEVGELAELSACEVTVNPSWEEGM 78
Query 73 SASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA 121
+S+RAGLA AD A++ ++D P + A+ VARV A SRS LA
Sbjct 79 GSSLRAGLASLSASDADAALVLLVDQPGIGAEAVARVRS-AYRSRSSLA 126
>gi|297559841|ref|YP_003678815.1| 4-diphosphocytidyl-2C-methyl-D-erythritol synthase [Nocardiopsis
dassonvillei subsp. dassonvillei DSM 43111]
gi|296844289|gb|ADH66309.1| 4-diphosphocytidyl-2C-methyl-D-erythritol synthase [Nocardiopsis
dassonvillei subsp. dassonvillei DSM 43111]
Length=210
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/109 (38%), Positives = 61/109 (56%), Gaps = 6/109 (5%)
Query 6 VVGVLLAAGAGRWYGKPKVLV----DGWLDTAVGALRDGGCNDVILVLGAVEVSAPAGVT 61
V G+LLAAG+G G+PK LV + +D V L GGC V++VLGA + G
Sbjct 16 VAGLLLAAGSGSRLGRPKALVEVGGERLVDRGVRTLTAGGCAPVMVVLGAADTPV-RGAH 74
Query 62 AITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVL 110
+ PDW+ G+ +SVRAG+ A + D ++ + D P V + V R++
Sbjct 75 TVHNPDWRTGMGSSVRAGI-DALPDTVDAVLIALADQPLVTPEAVRRLV 122
>gi|317123278|ref|YP_004097390.1| hypothetical protein Intca_0101 [Intrasporangium calvum DSM 43043]
gi|315587366|gb|ADU46663.1| hypothetical protein Intca_0101 [Intrasporangium calvum DSM 43043]
Length=227
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/122 (39%), Positives = 66/122 (55%), Gaps = 19/122 (15%)
Query 6 VVGVLLAAGAGRWYGKPKVLV---DG---WLDTAVGALRDGGCNDVILVLGAVEVSAPAG 59
V G+LLAAG+GR G+PK L+ +G + AV L +GGC+ V +V+GA A
Sbjct 9 VRGLLLAAGSGRRMGRPKALLRPAEGGRTLAERAVAVLLEGGCDGVTVVVGAASDEVTAA 68
Query 60 VTA----------ITAPDWQQGLSASVRAGLAQADREHADYA--VLHVIDTPDVNAKVVA 107
V A + DW +G+ AS+RAGL A R HA ++ ++D PD+ A V
Sbjct 69 VRASFPEEDVVDVVRCADWSEGMGASLRAGL-TAMRPHAQVQAVLVSLVDLPDLPAGAVH 127
Query 108 RV 109
RV
Sbjct 128 RV 129
>gi|182435089|ref|YP_001822808.1| hypothetical protein SGR_1296 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178463605|dbj|BAG18125.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
NBRC 13350]
Length=201
Score = 58.2 bits (139), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 44/109 (41%), Positives = 64/109 (59%), Gaps = 10/109 (9%)
Query 22 PKVLVDG----WLDTAVGALRDGGCNDVILVLGAV--EVSAPAGVTAITA---PDWQQGL 72
PK L++ ++ AV +LRDGGC + +VLGA EV A A +T T PDW++G+
Sbjct 27 PKALLEHRGRPLVEHAVRSLRDGGCGPLHVVLGAAADEVRARADLTGCTVAVNPDWEEGM 86
Query 73 SASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLA 121
+S+R GLA D A++ ++D P + A+ VARV A SR+ LA
Sbjct 87 GSSLRLGLAALGAGDTDAALVLLVDQPGIGAEAVARVR-LAYRSRASLA 134
>gi|344943388|ref|ZP_08782675.1| 4-diphosphocytidyl-2C-methyl-D-erythritol synthase [Methylobacter
tundripaludum SV96]
gi|344260675|gb|EGW20947.1| 4-diphosphocytidyl-2C-methyl-D-erythritol synthase [Methylobacter
tundripaludum SV96]
Length=202
Score = 57.8 bits (138), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/117 (36%), Positives = 64/117 (55%), Gaps = 12/117 (10%)
Query 5 TVVGVLLAAGAGRWYGKPKVLVDGW-----LDTAVGALRDGGCNDVILVLGAVEVSAPA- 58
V ++LAAGA G PK L++ W L+ V R+ VI+VLGA S
Sbjct 7 NVYAIILAAGASSRMGNPKQLLE-WRNRPLLEHTVANAREILNERVIVVLGAHAESIQTT 65
Query 59 ----GVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLG 111
GV+++ PDWQ+G+++S+RAG+ QA E A A++ + D P +NA + +L
Sbjct 66 IDLGGVSSVVNPDWQEGMASSIRAGV-QALPESASAALILLCDQPLINAAHMQNLLN 121
>gi|84494872|ref|ZP_00993991.1| hypothetical protein JNB_08739 [Janibacter sp. HTCC2649]
gi|84384365|gb|EAQ00245.1| hypothetical protein JNB_08739 [Janibacter sp. HTCC2649]
Length=194
Score = 57.0 bits (136), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 43/119 (37%), Positives = 61/119 (52%), Gaps = 17/119 (14%)
Query 6 VVGVLLAAGAGRWYGKPKVLV------DGWLDTAVGALRDGGCNDVILVLGAVEVSAPA- 58
V G++LAAGAGR G PK L+ +++ V L D G +V +V+G SAP+
Sbjct 6 VAGLVLAAGAGRRMGGPKALLRLSPTGPSLVESTVSRLHDAGVTEVHVVVGH---SAPSV 62
Query 59 -------GVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVL 110
G A DW +G+ AS+R G+ D A +L ++D PDV A V R+L
Sbjct 63 RLRAERVGGLVAEAEDWDEGMGASLRRGIDALDATQARAVLLMLVDLPDVGASVHTRLL 121
>gi|116671957|ref|YP_832890.1| molybdenum cofactor cytidylyltransferase [Arthrobacter sp. FB24]
gi|116612066|gb|ABK04790.1| molybdenum cofactor cytidylyltransferase [Arthrobacter sp. FB24]
Length=191
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/112 (39%), Positives = 62/112 (56%), Gaps = 12/112 (10%)
Query 9 VLLAAGAGRWYGK-PKVLVD----GWLDTAVGALRDGGCNDVILVLGA-----VEVSAPA 58
+LLAAGAG G+ PK L+ +D G L DGGC DV++VLGA E + A
Sbjct 1 MLLAAGAGTRLGRGPKALLPFRGRTLVDVIAGVLLDGGCRDVVIVLGADAVRVRETADLA 60
Query 59 GVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVL 110
T + PDWQ G+ S R G+A A D+ ++ ++D P + + V+R+L
Sbjct 61 PYTVVHNPDWQSGMGGSFRLGVAAA--AADDHVLVALVDQPGLTPETVSRLL 110
>gi|302525099|ref|ZP_07277441.1| YgfJ family molybdenum hydroxylase accessory protein [Streptomyces
sp. AA4]
gi|302433994|gb|EFL05810.1| YgfJ family molybdenum hydroxylase accessory protein [Streptomyces
sp. AA4]
Length=196
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/120 (43%), Positives = 70/120 (59%), Gaps = 8/120 (6%)
Query 2 LPSTVVGVLLAAGAGRWYGKPKVLV--DG--WLDTAVGALRDGGCNDVILVLGA----VE 53
+P V G+LLAAGAGR +G PK LV DG + AV L + GC V +V+GA V
Sbjct 1 MPEPVAGLLLAAGAGRRFGGPKALVEYDGEPLVQRAVRNLAEAGCASVRVVVGAAAEQVR 60
Query 54 VSAPAGVTAITAPDWQQGLSASVRAGLAQADREHADYAVLHVIDTPDVNAKVVARVLGRA 113
P VTA+ A WQ G+ S++AGL E A ++H++D P V A+ +AR++G A
Sbjct 61 ELLPPDVTAVPAARWQDGMGESLKAGLESLAGESAVAVLVHLVDLPWVPAEALARIVGEA 120
>gi|300783786|ref|YP_003764077.1| hypothetical protein AMED_1865 [Amycolatopsis mediterranei U32]
gi|299793300|gb|ADJ43675.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340525178|gb|AEK40383.1| hypothetical protein RAM_09465 [Amycolatopsis mediterranei S699]
Length=193
Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/122 (38%), Positives = 63/122 (52%), Gaps = 9/122 (7%)
Query 17 RWYGKPKVLVD----GWLDTAVGALRDGGCNDVILVLGAV--EVSA--PAGVTAITAPDW 68
R +G PK LV+ + A+ L GC V +V+GA EV A P A+ A DW
Sbjct 16 RRFGGPKALVEVDGEPLVLRALRTLTAAGCAPVRVVVGASADEVRALLPDPALAVEAEDW 75
Query 69 QQGLSASVRAGLAQADR-EHADYAVLHVIDTPDVNAKVVARVLGRALVSRSGLAGRGRIP 127
G+ AS+RAGLA D EH+ A++H++D P V ++ARV RA A +P
Sbjct 76 ATGMGASLRAGLAALDSTEHSVAALVHLVDLPWVGPDILARVAARASAETVARAAYDGVP 135
Query 128 AH 129
H
Sbjct 136 GH 137
>gi|302557443|ref|ZP_07309785.1| 4-diphosphocytidyl-2C-methyl-D-erythritol synthase [Streptomyces
griseoflavus Tu4000]
gi|302475061|gb|EFL38154.1| 4-diphosphocytidyl-2C-methyl-D-erythritol synthase [Streptomyces
griseoflavus Tu4000]
Length=205
Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/87 (43%), Positives = 50/87 (58%), Gaps = 5/87 (5%)
Query 30 LDTAVGALRDGGCNDVILVLGAV--EVSAPA---GVTAITAPDWQQGLSASVRAGLAQAD 84
++ AV LR GGC V +VLGA EV A A G + P W+QG+ S+RAGL
Sbjct 43 VEHAVAVLRAGGCTRVHVVLGAAADEVRARAALPGCVLVDNPAWEQGMGTSLRAGLGSLA 102
Query 85 REHADYAVLHVIDTPDVNAKVVARVLG 111
A A++ ++D P + A+ VARVLG
Sbjct 103 GTGARAALVSLVDQPGIGAEAVARVLG 129
Lambda K H
0.320 0.136 0.415
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128858389450
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40