BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0004
Length=187
Score E
Sequences producing significant alignments: (Bits) Value
gi|15607146|ref|NP_214518.1| hypothetical protein Rv0004 [Mycoba... 364 4e-99
gi|31791181|ref|NP_853674.1| hypothetical protein Mb0004 [Mycoba... 361 3e-98
gi|167969467|ref|ZP_02551744.1| hypothetical protein MtubH3_1614... 360 4e-98
gi|1321907|emb|CAA63260.1| orf187 [Mycobacterium tuberculosis H3... 343 5e-93
gi|308232615|ref|ZP_07416629.2| hypothetical protein TMAG_00675 ... 331 3e-89
gi|254548932|ref|ZP_05139379.1| hypothetical protein Mtube_00445... 320 5e-86
gi|342862356|ref|ZP_08718997.1| hypothetical protein MCOL_25818 ... 253 6e-66
gi|118464189|ref|YP_879307.1| hypothetical protein MAV_0004 [Myc... 252 2e-65
gi|254773057|ref|ZP_05214573.1| hypothetical protein MaviaA2_000... 251 3e-65
gi|41406102|ref|NP_958938.1| hypothetical protein MAP0004 [Mycob... 251 4e-65
gi|240172094|ref|ZP_04750753.1| hypothetical protein MkanA1_2245... 249 2e-64
gi|118615923|ref|YP_904255.1| hypothetical protein MUL_0004 [Myc... 248 2e-64
gi|296167140|ref|ZP_06849547.1| in RecF-GyrB intergenic region [... 245 2e-63
gi|183980039|ref|YP_001848330.1| hypothetical protein MMAR_0004 ... 244 4e-63
gi|169627113|ref|YP_001700762.1| hypothetical protein MAB_0005 [... 237 5e-61
gi|108796986|ref|YP_637183.1| hypothetical protein Mmcs_0005 [My... 236 8e-61
gi|15826869|ref|NP_301132.1| hypothetical protein ML0004 [Mycoba... 235 2e-60
gi|1262355|emb|CAA94711.1| hypothetical protein [Mycobacterium l... 230 7e-59
gi|254820904|ref|ZP_05225905.1| hypothetical protein MintA_13300... 223 1e-56
gi|118470893|ref|YP_884427.1| hypothetical protein MSMEG_0004 [M... 222 2e-56
gi|333988644|ref|YP_004521258.1| hypothetical protein JDM601_000... 220 9e-56
gi|152112355|sp|P0C564.1|Y004_MYCSM RecName: Full=UPF0232 protei... 218 4e-55
gi|120401033|ref|YP_950862.1| hypothetical protein Mvan_0005 [My... 213 1e-53
gi|226303494|ref|YP_002763452.1| hypothetical protein RER_00050 ... 210 7e-53
gi|145221417|ref|YP_001132095.1| hypothetical protein Mflv_0823 ... 204 5e-51
gi|315441701|ref|YP_004074580.1| RNA-binding protein containing ... 204 7e-51
gi|312137519|ref|YP_004004855.1| hypothetical protein REQ_00050 ... 196 1e-48
gi|325677516|ref|ZP_08157180.1| hypothetical protein HMPREF0724_... 196 2e-48
gi|226362899|ref|YP_002780679.1| hypothetical protein ROP_34870 ... 189 1e-46
gi|111020659|ref|YP_703631.1| hypothetical protein RHA1_ro03670 ... 189 2e-46
gi|1213061|emb|CAA63916.1| orf192 [Mycobacterium smegmatis str. ... 187 6e-46
gi|54021968|ref|YP_116210.1| hypothetical protein nfa40 [Nocardi... 185 2e-45
gi|333917683|ref|YP_004491264.1| hypothetical protein AS9A_0004 ... 179 2e-43
gi|343928738|ref|ZP_08768183.1| hypothetical protein GOALK_120_0... 175 2e-42
gi|134096625|ref|YP_001102286.1| hypothetical protein SACE_0006 ... 174 4e-42
gi|324999886|ref|ZP_08120998.1| hypothetical protein PseP1_14006... 172 2e-41
gi|326383913|ref|ZP_08205597.1| hypothetical protein SCNU_13308 ... 172 2e-41
gi|296392444|ref|YP_003657328.1| hypothetical protein Srot_0004 ... 170 7e-41
gi|300781942|ref|YP_003762233.1| hypothetical protein AMED_0005 ... 170 9e-41
gi|331693903|ref|YP_004330142.1| hypothetical protein Psed_0004 ... 169 1e-40
gi|302531360|ref|ZP_07283702.1| UPF0232 protein [Streptomyces sp... 167 5e-40
gi|296137758|ref|YP_003645001.1| hypothetical protein Tpau_0008 ... 166 2e-39
gi|317509430|ref|ZP_07967048.1| hypothetical protein HMPREF9336_... 161 3e-38
gi|257054094|ref|YP_003131926.1| putative RNA-binding protein co... 156 1e-36
gi|284988634|ref|YP_003407188.1| hypothetical protein Gobs_0005 ... 153 1e-35
gi|262200050|ref|YP_003271258.1| hypothetical protein Gbro_0004 ... 152 3e-35
gi|319949428|ref|ZP_08023489.1| hypothetical protein ES5_08306 [... 151 5e-35
gi|256374165|ref|YP_003097825.1| hypothetical protein Amir_0005 ... 149 1e-34
gi|309811365|ref|ZP_07705152.1| conserved hypothetical protein [... 141 5e-32
gi|302864513|ref|YP_003833150.1| hypothetical protein Micau_0005... 133 1e-29
>gi|15607146|ref|NP_214518.1| hypothetical protein Rv0004 [Mycobacterium tuberculosis H37Rv]
gi|15839376|ref|NP_334413.1| hypothetical protein MT0004 [Mycobacterium tuberculosis CDC1551]
gi|121635887|ref|YP_976110.1| hypothetical protein BCG_0004 [Mycobacterium bovis BCG str. Pasteur
1173P2]
65 more sequence titles
Length=187
Score = 364 bits (934), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 187/187 (100%), Positives = 187/187 (100%), Gaps = 0/187 (0%)
Query 1 MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR 60
MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR
Sbjct 1 MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR 60
Query 61 RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN 120
RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN
Sbjct 61 RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN 120
Query 121 DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR 180
DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR
Sbjct 121 DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR 180
Query 181 GPRDTYG 187
GPRDTYG
Sbjct 181 GPRDTYG 187
>gi|31791181|ref|NP_853674.1| hypothetical protein Mb0004 [Mycobacterium bovis AF2122/97]
gi|38605569|sp|Q7U313.1|Y004_MYCBO RecName: Full=UPF0232 protein Mb0004
gi|31616766|emb|CAD92866.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
Length=187
Score = 361 bits (927), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 186/187 (99%), Positives = 186/187 (99%), Gaps = 0/187 (0%)
Query 1 MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR 60
MTGSVDRPDQNRGER MKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR
Sbjct 1 MTGSVDRPDQNRGERLMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR 60
Query 61 RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN 120
RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN
Sbjct 61 RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN 120
Query 121 DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR 180
DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR
Sbjct 121 DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR 180
Query 181 GPRDTYG 187
GPRDTYG
Sbjct 181 GPRDTYG 187
>gi|167969467|ref|ZP_02551744.1| hypothetical protein MtubH3_16147 [Mycobacterium tuberculosis
H37Ra]
Length=187
Score = 360 bits (925), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 186/187 (99%), Positives = 186/187 (99%), Gaps = 0/187 (0%)
Query 1 MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR 60
MTGSVDRPDQNRGERSMKSP LDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR
Sbjct 1 MTGSVDRPDQNRGERSMKSPVLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR 60
Query 61 RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN 120
RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN
Sbjct 61 RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN 120
Query 121 DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR 180
DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR
Sbjct 121 DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR 180
Query 181 GPRDTYG 187
GPRDTYG
Sbjct 181 GPRDTYG 187
>gi|1321907|emb|CAA63260.1| orf187 [Mycobacterium tuberculosis H37Rv]
Length=187
Score = 343 bits (881), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 183/187 (98%), Positives = 184/187 (99%), Gaps = 0/187 (0%)
Query 1 MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR 60
MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR
Sbjct 1 MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR 60
Query 61 RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN 120
RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPT +N
Sbjct 61 RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTVIN 120
Query 121 DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR 180
DGVLSVI EST WATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR
Sbjct 121 DGVLSVIEESTVWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR 180
Query 181 GPRDTYG 187
GPRDTYG
Sbjct 181 GPRDTYG 187
>gi|308232615|ref|ZP_07416629.2| hypothetical protein TMAG_00675 [Mycobacterium tuberculosis SUMu001]
gi|308371556|ref|ZP_07425300.2| hypothetical protein TMDG_01888 [Mycobacterium tuberculosis SUMu004]
gi|308372786|ref|ZP_07429836.2| hypothetical protein TMEG_00428 [Mycobacterium tuberculosis SUMu005]
12 more sequence titles
Length=171
Score = 331 bits (849), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 171/171 (100%), Positives = 171/171 (100%), Gaps = 0/171 (0%)
Query 17 MKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPL 76
MKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPL
Sbjct 1 MKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPL 60
Query 77 GKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQ 136
GKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQ
Sbjct 61 GKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQ 120
Query 137 LRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
LRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG
Sbjct 121 LRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 171
>gi|254548932|ref|ZP_05139379.1| hypothetical protein Mtube_00445 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|297632472|ref|ZP_06950252.1| hypothetical protein MtubK4_00020 [Mycobacterium tuberculosis
KZN 4207]
Length=166
Score = 320 bits (821), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 165/166 (99%), Positives = 166/166 (100%), Gaps = 0/166 (0%)
Query 22 LDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAAR 81
+DLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAAR
Sbjct 1 MDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAAR 60
Query 82 ELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQ 141
ELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQ
Sbjct 61 ELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQ 120
Query 142 AQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
AQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG
Sbjct 121 AQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 166
>gi|342862356|ref|ZP_08718997.1| hypothetical protein MCOL_25818 [Mycobacterium colombiense CECT
3035]
gi|342130213|gb|EGT83541.1| hypothetical protein MCOL_25818 [Mycobacterium colombiense CECT
3035]
Length=183
Score = 253 bits (647), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 138/167 (83%), Positives = 149/167 (90%), Gaps = 0/167 (0%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAA 80
G+DLVRRTL+EARAAARA+G+DAGRGR + RVAG+RRSWSGPGPD RDPQPLG A
Sbjct 17 GIDLVRRTLEEARAAARAQGKDAGRGRSVAPTPRRVAGQRRSWSGPGPDARDPQPLGSLA 76
Query 81 RELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIM 140
R+LAKKRGWS +VAEG VLG W VVGHQIA+HA PTALNDGVLSV AESTAWATQLR++
Sbjct 77 RDLAKKRGWSAQVAEGTVLGNWVTVVGHQIADHATPTALNDGVLSVAAESTAWATQLRMI 136
Query 141 QAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
QAQLLAKIAAAVGN VV SLKITGPAAPSWRKGPRHIAGRGPRDTYG
Sbjct 137 QAQLLAKIAAAVGNGVVTSLKITGPAAPSWRKGPRHIAGRGPRDTYG 183
>gi|118464189|ref|YP_879307.1| hypothetical protein MAV_0004 [Mycobacterium avium 104]
gi|29611907|sp|Q9L7L4.2|Y004_MYCPA RecName: Full=UPF0232 protein MAP_0004
gi|118165476|gb|ABK66373.1| conserved hypothetical protein [Mycobacterium avium 104]
gi|336459819|gb|EGO38733.1| putative RNA-binding protein containing Zn ribbon [Mycobacterium
avium subsp. paratuberculosis S397]
Length=181
Score = 252 bits (643), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 137/166 (83%), Positives = 150/166 (91%), Gaps = 0/166 (0%)
Query 22 LDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAAR 81
+DLVRRTL+EARAAARA+G+DAGRGR A+ RVAG+RRSWSGPGPD RDPQPLG+ AR
Sbjct 16 MDLVRRTLEEARAAARAQGKDAGRGRAAAPTPRRVAGQRRSWSGPGPDARDPQPLGRLAR 75
Query 82 ELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQ 141
+LA+KRGWS +VAEG VLG W+AVVGHQIA+HA PT L DGVLSV AESTAWATQLR+MQ
Sbjct 76 DLARKRGWSAQVAEGTVLGNWTAVVGHQIADHAVPTGLRDGVLSVSAESTAWATQLRMMQ 135
Query 142 AQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
AQLLAKIAAAVGN VV SLKITGPAAPSWRKGPRHIAGRGPRDTYG
Sbjct 136 AQLLAKIAAAVGNGVVTSLKITGPAAPSWRKGPRHIAGRGPRDTYG 181
>gi|254773057|ref|ZP_05214573.1| hypothetical protein MaviaA2_00020 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=166
Score = 251 bits (642), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 137/166 (83%), Positives = 151/166 (91%), Gaps = 0/166 (0%)
Query 22 LDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAAR 81
+DLVRRTL+EARAAARA+G+DAGRGR A+ RVAG+RRSWSGPGPD RDPQPLG+ AR
Sbjct 1 MDLVRRTLEEARAAARAQGKDAGRGRAAAPTPRRVAGQRRSWSGPGPDARDPQPLGRLAR 60
Query 82 ELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQ 141
+LA+KRGWS +VAEG VLG W+AVVGHQIA+HA PT+L DGVLSV AESTAWATQLR+MQ
Sbjct 61 DLARKRGWSAQVAEGTVLGNWTAVVGHQIADHAVPTSLRDGVLSVSAESTAWATQLRMMQ 120
Query 142 AQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
AQLLAKIAAAVGN VV SLKITGPAAPSWRKGPRHIAGRGPRDTYG
Sbjct 121 AQLLAKIAAAVGNGVVTSLKITGPAAPSWRKGPRHIAGRGPRDTYG 166
>gi|41406102|ref|NP_958938.1| hypothetical protein MAP0004 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|6969275|gb|AAF33696.1| unknown [Mycobacterium avium subsp. paratuberculosis]
gi|41394450|gb|AAS02321.1| hypothetical protein MAP_0004 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=166
Score = 251 bits (640), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 137/166 (83%), Positives = 150/166 (91%), Gaps = 0/166 (0%)
Query 22 LDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAAR 81
+DLVRRTL+EARAAARA+G+DAGRGR A+ RVAG+RRSWSGPGPD RDPQPLG+ AR
Sbjct 1 MDLVRRTLEEARAAARAQGKDAGRGRAAAPTPRRVAGQRRSWSGPGPDARDPQPLGRLAR 60
Query 82 ELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQ 141
+LA+KRGWS +VAEG VLG W+AVVGHQIA+HA PT L DGVLSV AESTAWATQLR+MQ
Sbjct 61 DLARKRGWSAQVAEGTVLGNWTAVVGHQIADHAVPTGLRDGVLSVSAESTAWATQLRMMQ 120
Query 142 AQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
AQLLAKIAAAVGN VV SLKITGPAAPSWRKGPRHIAGRGPRDTYG
Sbjct 121 AQLLAKIAAAVGNGVVTSLKITGPAAPSWRKGPRHIAGRGPRDTYG 166
>gi|240172094|ref|ZP_04750753.1| hypothetical protein MkanA1_22450 [Mycobacterium kansasii ATCC
12478]
Length=184
Score = 249 bits (635), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 141/167 (85%), Positives = 152/167 (92%), Gaps = 1/167 (0%)
Query 22 LDLVRRTLDEARAAARARGQDAGRGRVAS-VASGRVAGRRRSWSGPGPDIRDPQPLGKAA 80
+DLVRRTL EA+AAARARG+ GRG VA V+S RVAG+RRSWSGPGPD RDPQPLGK A
Sbjct 18 IDLVRRTLAEAQAAARARGRGLGRGPVAQPVSSRRVAGQRRSWSGPGPDARDPQPLGKLA 77
Query 81 RELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIM 140
RELAKKRGWS RVAEG VLGQW++VVGHQIA+HA PT+L+DGVLSV AESTAWATQLRIM
Sbjct 78 RELAKKRGWSGRVAEGTVLGQWASVVGHQIADHATPTSLDDGVLSVTAESTAWATQLRIM 137
Query 141 QAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
QAQLLAKIAAAVGN VV +LKITGPAAPSWRKGPRHIAGRGPRDTYG
Sbjct 138 QAQLLAKIAAAVGNGVVTTLKITGPAAPSWRKGPRHIAGRGPRDTYG 184
>gi|118615923|ref|YP_904255.1| hypothetical protein MUL_0004 [Mycobacterium ulcerans Agy99]
gi|166227753|sp|A0PKB5.1|Y004_MYCUA RecName: Full=UPF0232 protein MUL_0004
gi|118568033|gb|ABL02784.1| conserved protein [Mycobacterium ulcerans Agy99]
Length=187
Score = 248 bits (634), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/187 (78%), Positives = 154/187 (83%), Gaps = 0/187 (0%)
Query 1 MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR 60
M G ++P G + P +DLVRRTL EARAAARARGQD GRG A A RVAGRR
Sbjct 1 MNGDGEQPGPGDGAARDELPSMDLVRRTLAEARAAARARGQDPGRGFAAGPAPRRVAGRR 60
Query 61 RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN 120
RSWSGPGPD RDPQPLGK R+LAKKRGWS VAEG VLGQWS VVG QIA+HA PTALN
Sbjct 61 RSWSGPGPDTRDPQPLGKLTRDLAKKRGWSGHVAEGTVLGQWSQVVGAQIADHATPTALN 120
Query 121 DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR 180
+GVLSV AESTAWATQLRIMQ+QLLAKIAAAVGN VV SLKITGPA+PSWRKGPRHIAGR
Sbjct 121 EGVLSVTAESTAWATQLRIMQSQLLAKIAAAVGNGVVTSLKITGPASPSWRKGPRHIAGR 180
Query 181 GPRDTYG 187
GPRDTYG
Sbjct 181 GPRDTYG 187
>gi|296167140|ref|ZP_06849547.1| in RecF-GyrB intergenic region [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897462|gb|EFG77061.1| in RecF-GyrB intergenic region [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=185
Score = 245 bits (625), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 137/187 (74%), Positives = 158/187 (85%), Gaps = 2/187 (1%)
Query 1 MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR 60
MTGS D+ +K G+DLVRRTL+EARAAARA+G+DAGRGR + RVAG+R
Sbjct 1 MTGSDDQDAAGVEPGLLK--GIDLVRRTLEEARAAARAQGKDAGRGRSVPPSPRRVAGQR 58
Query 61 RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN 120
RSWSGPGPD RDPQPLG+ AR+LAKKRGW+ +VAEG VLG W++VVG QIA+HA PTAL+
Sbjct 59 RSWSGPGPDARDPQPLGRLARDLAKKRGWTAQVAEGTVLGNWASVVGQQIADHATPTALS 118
Query 121 DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR 180
DGVLSV AESTAWATQLR++Q+Q+LAKIAAAVGN VV +LKITGP APSWRKGPRHIAGR
Sbjct 119 DGVLSVTAESTAWATQLRMIQSQVLAKIAAAVGNGVVTALKITGPTAPSWRKGPRHIAGR 178
Query 181 GPRDTYG 187
GPRDTYG
Sbjct 179 GPRDTYG 185
>gi|183980039|ref|YP_001848330.1| hypothetical protein MMAR_0004 [Mycobacterium marinum M]
gi|226734001|sp|B2HI49.1|Y004_MYCMM RecName: Full=UPF0232 protein MMAR_0004
gi|183173365|gb|ACC38475.1| conserved protein [Mycobacterium marinum M]
Length=187
Score = 244 bits (624), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 144/187 (78%), Positives = 154/187 (83%), Gaps = 0/187 (0%)
Query 1 MTGSVDRPDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRR 60
M+ ++P G + G+DLVRRTL EARAAARARGQD GRG A A RVAGRR
Sbjct 1 MSDDGEQPGPGDGAARDELSGMDLVRRTLAEARAAARARGQDPGRGFAAGPAPRRVAGRR 60
Query 61 RSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALN 120
RSWSGPGPD RDPQPLGK R+LAKKRGWS VAEG VLGQWS VVG QIA+HA PTALN
Sbjct 61 RSWSGPGPDTRDPQPLGKLTRDLAKKRGWSGHVAEGTVLGQWSRVVGAQIADHATPTALN 120
Query 121 DGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGR 180
+GVLSV AESTAWATQLRIMQ+QLLAKIAAAVGN VV SLKITGPA+PSWRKGPRHIAGR
Sbjct 121 EGVLSVTAESTAWATQLRIMQSQLLAKIAAAVGNGVVTSLKITGPASPSWRKGPRHIAGR 180
Query 181 GPRDTYG 187
GPRDTYG
Sbjct 181 GPRDTYG 187
>gi|169627113|ref|YP_001700762.1| hypothetical protein MAB_0005 [Mycobacterium abscessus ATCC 19977]
gi|169239080|emb|CAM60108.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=183
Score = 237 bits (605), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 122/167 (74%), Positives = 141/167 (85%), Gaps = 0/167 (0%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAA 80
G+DLVRR L+EAR AA+ +G+D GRG + RVAG RR+WSGPGPD RDPQ LG+AA
Sbjct 17 GMDLVRRVLEEARGAAKQQGKDIGRGGRSPEQRRRVAGGRRTWSGPGPDARDPQLLGRAA 76
Query 81 RELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIM 140
+LAK+RGWS RV+EG V G+W AVVG QIA HA PTALN+GVL+V AESTAWATQLR++
Sbjct 77 GDLAKRRGWSSRVSEGAVFGRWEAVVGEQIAAHATPTALNEGVLTVAAESTAWATQLRLV 136
Query 141 QAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
QAQLLAKIAAA+G+ VV SLKI+GP APSWRKGPRHIAGRGPRDTYG
Sbjct 137 QAQLLAKIAAAIGDGVVTSLKISGPTAPSWRKGPRHIAGRGPRDTYG 183
>gi|108796986|ref|YP_637183.1| hypothetical protein Mmcs_0005 [Mycobacterium sp. MCS]
gi|119866070|ref|YP_936022.1| hypothetical protein Mkms_0013 [Mycobacterium sp. KMS]
gi|126432618|ref|YP_001068309.1| hypothetical protein Mjls_0005 [Mycobacterium sp. JLS]
gi|108767405|gb|ABG06127.1| protein of unknown function DUF721 [Mycobacterium sp. MCS]
gi|119692159|gb|ABL89232.1| protein of unknown function DUF721 [Mycobacterium sp. KMS]
gi|126232418|gb|ABN95818.1| protein of unknown function DUF721 [Mycobacterium sp. JLS]
Length=190
Score = 236 bits (603), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 121/167 (73%), Positives = 138/167 (83%), Gaps = 0/167 (0%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAA 80
G+DLVRRTL+EAR AAR++G+D GRGR + GRRRSWSGPGPD RDPQ LG A
Sbjct 24 GMDLVRRTLEEARGAARSQGKDVGRGRTSPARRVAGTGRRRSWSGPGPDSRDPQTLGAAT 83
Query 81 RELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIM 140
R+LA+ RGWS +VAEG V GQWS VVG QIAEHA P++L +GVL+V AESTAWATQLR++
Sbjct 84 RDLARTRGWSPKVAEGAVFGQWSTVVGEQIAEHATPSSLREGVLTVAAESTAWATQLRMV 143
Query 141 QAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
Q+QLLAKIAAAVG+ VV SLKITGP APSWRKG HIAGRGPRDTYG
Sbjct 144 QSQLLAKIAAAVGDGVVTSLKITGPTAPSWRKGRYHIAGRGPRDTYG 190
>gi|15826869|ref|NP_301132.1| hypothetical protein ML0004 [Mycobacterium leprae TN]
gi|221229347|ref|YP_002502763.1| hypothetical protein MLBr_00004 [Mycobacterium leprae Br4923]
gi|29611903|sp|Q9CDF4.1|Y004_MYCLE RecName: Full=UPF0232 protein ML0004
gi|254799448|sp|B8ZTP1.1|Y004_MYCLB RecName: Full=UPF0232 protein MLBr00004
gi|13092416|emb|CAC29512.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219932454|emb|CAR70097.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=189
Score = 235 bits (600), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 129/167 (78%), Positives = 141/167 (85%), Gaps = 0/167 (0%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAA 80
G DLVRR L+EARAAA A+G+DAGRG V RV RRR+WSGPGPD+RDPQPLGK A
Sbjct 23 GFDLVRRALEEARAAACAQGKDAGRGHVVPPVPFRVTDRRRNWSGPGPDVRDPQPLGKVA 82
Query 81 RELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIM 140
+LAKKRGWS +VAEG V GQW+++VG QIA+HA P LN+GVLSV AESTAWATQLRIM
Sbjct 83 HDLAKKRGWSAQVAEGRVFGQWASMVGGQIADHAFPVGLNNGVLSVTAESTAWATQLRIM 142
Query 141 QAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
QAQLLAKIAAAVGN VV SLKITGP APSWRKGP HIAGRGPRDTYG
Sbjct 143 QAQLLAKIAAAVGNGVVTSLKITGPTAPSWRKGPWHIAGRGPRDTYG 189
>gi|1262355|emb|CAA94711.1| hypothetical protein [Mycobacterium leprae]
Length=199
Score = 230 bits (587), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 127/165 (77%), Positives = 139/165 (85%), Gaps = 0/165 (0%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAA 80
G DLVRR L+EARAAA A+G+DAGRG V RV RRR+WSGPGPD+RDPQPLGK A
Sbjct 23 GFDLVRRALEEARAAACAQGKDAGRGHVVPPVPFRVTDRRRNWSGPGPDVRDPQPLGKVA 82
Query 81 RELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIM 140
+LAKKRGWS +VAEG V GQW+++VG QIA+HA P LN+GVLSV AESTAWATQLRIM
Sbjct 83 HDLAKKRGWSAQVAEGRVFGQWASMVGGQIADHAFPVGLNNGVLSVTAESTAWATQLRIM 142
Query 141 QAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDT 185
QAQLLAKIAAAVGN VV SLKITGP APSWRKGP HIAGRGPRDT
Sbjct 143 QAQLLAKIAAAVGNGVVTSLKITGPTAPSWRKGPWHIAGRGPRDT 187
>gi|254820904|ref|ZP_05225905.1| hypothetical protein MintA_13300 [Mycobacterium intracellulare
ATCC 13950]
Length=130
Score = 223 bits (567), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 111/130 (86%), Positives = 120/130 (93%), Gaps = 0/130 (0%)
Query 58 GRRRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPT 117
G+RRSWSGPGPD RDPQPLG+ AR+LAKKRGWS +VAEG VLG W++VVGHQIA+HA PT
Sbjct 1 GQRRSWSGPGPDGRDPQPLGRLARDLAKKRGWSAQVAEGTVLGNWTSVVGHQIADHAVPT 60
Query 118 ALNDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHI 177
AL DGVLSV AESTAWATQLR++QAQLLAKIAAAVGN VV SLKITGPAAPSWRKGPRHI
Sbjct 61 ALKDGVLSVSAESTAWATQLRMIQAQLLAKIAAAVGNGVVTSLKITGPAAPSWRKGPRHI 120
Query 178 AGRGPRDTYG 187
AGRGPRDTYG
Sbjct 121 AGRGPRDTYG 130
>gi|118470893|ref|YP_884427.1| hypothetical protein MSMEG_0004 [Mycobacterium smegmatis str.
MC2 155]
gi|152112354|sp|A0QND9.1|Y004_MYCS2 RecName: Full=UPF0232 protein MSMEG_0004
gi|118172180|gb|ABK73076.1| hypothetical protein MSMEG_0004 [Mycobacterium smegmatis str.
MC2 155]
Length=194
Score = 222 bits (565), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/180 (67%), Positives = 139/180 (78%), Gaps = 5/180 (2%)
Query 8 PDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPG 67
PD G R G+DLVRRTL+EAR AAR++G+D GRGR RRR+WSGPG
Sbjct 20 PDHLAGLR-----GIDLVRRTLEEARGAARSQGKDVGRGRSGPARRVGGNRRRRTWSGPG 74
Query 68 PDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVI 127
PD RDPQ LG ++LAK RGWS RVAEG V+G+W AVVG QIA+HA PTALN+GVL+V
Sbjct 75 PDARDPQLLGAVTQDLAKSRGWSARVAEGSVIGRWRAVVGDQIADHATPTALNEGVLTVT 134
Query 128 AESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
AESTAWATQLR++Q+QLLAKIAA VG+ VV +LKI GPA PSWRKG H++GRGPRDTYG
Sbjct 135 AESTAWATQLRMVQSQLLAKIAAVVGDGVVTTLKIVGPAGPSWRKGRYHVSGRGPRDTYG 194
>gi|333988644|ref|YP_004521258.1| hypothetical protein JDM601_0004 [Mycobacterium sp. JDM601]
gi|333484612|gb|AEF34004.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=188
Score = 220 bits (560), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 117/169 (70%), Positives = 135/169 (80%), Gaps = 2/169 (1%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVASGRV--AGRRRSWSGPGPDIRDPQPLGK 78
G+DLVRRTL EAR AAR++G+D G+GR A + AG RR WSGPGPD RDPQ LG
Sbjct 20 GMDLVRRTLAEAREAARSQGKDVGQGRRAPLRRRAPGGAGGRRRWSGPGPDARDPQTLGA 79
Query 79 AARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLR 138
A R+LA+ RGWS +VAEG VLG+W +VVG IA HA PT L+ GVLSV AESTAWATQLR
Sbjct 80 ATRDLAQSRGWSAQVAEGTVLGRWRSVVGEDIASHATPTRLSQGVLSVSAESTAWATQLR 139
Query 139 IMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
++Q+QLLAKIAAAVG VV +LKITGP APSWRKGP H++GRGPRDTYG
Sbjct 140 LVQSQLLAKIAAAVGEGVVTTLKITGPTAPSWRKGPLHVSGRGPRDTYG 188
>gi|152112355|sp|P0C564.1|Y004_MYCSM RecName: Full=UPF0232 protein in recF-gyrB intergenic region
gi|1321897|emb|CAA63252.1| orf194 [Mycobacterium smegmatis]
Length=194
Score = 218 bits (555), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 118/180 (66%), Positives = 138/180 (77%), Gaps = 5/180 (2%)
Query 8 PDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPG 67
PD G R G+DLVRRTL+EAR AAR++G+D GRGR RRR+WSGPG
Sbjct 20 PDHLAGLR-----GIDLVRRTLEEARGAARSQGKDVGRGRSGPARRVGGNRRRRTWSGPG 74
Query 68 PDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVI 127
PD RDPQ LG ++LAK RGWS RVAEG V+G+W AVVG QIA+HA PTALN+GVL+V
Sbjct 75 PDARDPQLLGAVTQDLAKSRGWSARVAEGSVIGRWRAVVGDQIADHATPTALNEGVLTVT 134
Query 128 AESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
AESTA ATQLR++Q+QLLAKIAA VG+ VV +LKI GPA PSWRKG H++GRGPRDTYG
Sbjct 135 AESTASATQLRMVQSQLLAKIAAVVGDGVVTTLKIVGPAGPSWRKGRYHVSGRGPRDTYG 194
>gi|120401033|ref|YP_950862.1| hypothetical protein Mvan_0005 [Mycobacterium vanbaalenii PYR-1]
gi|119953851|gb|ABM10856.1| protein of unknown function DUF721 [Mycobacterium vanbaalenii
PYR-1]
Length=185
Score = 213 bits (541), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 113/168 (68%), Positives = 133/168 (80%), Gaps = 2/168 (1%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGR-RRSWSGPGPDIRDPQPLGKA 79
G+DLVRRTL+EAR AAR +G++ G GR + A RVAG RR WSGPGPD RDPQ LG
Sbjct 19 GMDLVRRTLEEARGAARQQGKNVGLGRYSPTAR-RVAGSGRRRWSGPGPDSRDPQLLGAV 77
Query 80 ARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRI 139
++A+ RGWS +VAEG V G+W AVVG QIA HA PTAL++GVL+V AESTAWATQLR+
Sbjct 78 TGDVARTRGWSAKVAEGAVFGRWRAVVGDQIAAHAAPTALHEGVLTVSAESTAWATQLRM 137
Query 140 MQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
+Q+Q+LAKIAAAVG+ VV SLKI GP PSWRKGP + GRGPRDTYG
Sbjct 138 VQSQILAKIAAAVGDGVVTSLKIVGPVGPSWRKGPYTVPGRGPRDTYG 185
>gi|226303494|ref|YP_002763452.1| hypothetical protein RER_00050 [Rhodococcus erythropolis PR4]
gi|229491134|ref|ZP_04384962.1| protein in RecF-gyrB intergenic region [Rhodococcus erythropolis
SK121]
gi|226182609|dbj|BAH30713.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
gi|229321872|gb|EEN87665.1| protein in RecF-gyrB intergenic region [Rhodococcus erythropolis
SK121]
Length=181
Score = 210 bits (535), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 108/169 (64%), Positives = 132/169 (79%), Gaps = 4/169 (2%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVA-SVASGRVAGRRRS-WSGPGPDIRDPQPLGK 78
G+DL RR L+EARA A+A G+ G+GR + SGR RRRS WSGPGPD RDPQP G
Sbjct 15 GIDLARRALEEARATAKANGKAVGQGRSSPKYGSGRP--RRRSGWSGPGPDARDPQPFGA 72
Query 79 AARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLR 138
++K RGWS +V+EG VLG+W VVG I+ HA P +L +GVLS+ AESTAWATQLR
Sbjct 73 LTSAISKSRGWSPKVSEGTVLGRWPQVVGEDISAHAEPISLKEGVLSISAESTAWATQLR 132
Query 139 IMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
+MQ+Q+LAKIAAAVG+ VV++L+ITGP+APSWRKG RH+ GRGPRDTYG
Sbjct 133 MMQSQILAKIAAAVGDGVVKTLRITGPSAPSWRKGERHVKGRGPRDTYG 181
>gi|145221417|ref|YP_001132095.1| hypothetical protein Mflv_0823 [Mycobacterium gilvum PYR-GCK]
gi|145213903|gb|ABP43307.1| protein of unknown function DUF721 [Mycobacterium gilvum PYR-GCK]
Length=188
Score = 204 bits (519), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 110/169 (66%), Positives = 130/169 (77%), Gaps = 2/169 (1%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVAS--GRVAGRRRSWSGPGPDIRDPQPLGK 78
G+DLVRR L+EAR AAR +G++ G+GR A S A RR WSGPGPD RDPQ LG
Sbjct 20 GMDLVRRALEEARGAARQQGKNVGQGRTAPSGSPRRGTARSRRRWSGPGPDNRDPQLLGS 79
Query 79 AARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLR 138
+LA+ RGWS RVA+G V G+W AVVG QIA+HA PT L +GVL+V AESTAWATQLR
Sbjct 80 LTGDLARARGWSGRVAQGAVFGRWRAVVGDQIADHASPTTLTEGVLTVSAESTAWATQLR 139
Query 139 IMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
++Q+Q+LAKIAAAVG+ VV SLKI GP PSWRKGP ++ GRGPRDTYG
Sbjct 140 MVQSQILAKIAAAVGDGVVTSLKIVGPVGPSWRKGPYNVRGRGPRDTYG 188
>gi|315441701|ref|YP_004074580.1| RNA-binding protein containing Zn ribbon [Mycobacterium sp. Spyr1]
gi|315260004|gb|ADT96745.1| predicted RNA-binding protein containing Zn ribbon [Mycobacterium
sp. Spyr1]
Length=188
Score = 204 bits (518), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 110/169 (66%), Positives = 129/169 (77%), Gaps = 2/169 (1%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVAS--GRVAGRRRSWSGPGPDIRDPQPLGK 78
G+DLVRR L+EAR AAR +G++ G GR A S A RR WSGPGPD RDPQ LG
Sbjct 20 GMDLVRRALEEARGAARQQGKNVGHGRTAPSGSPRRGTARSRRRWSGPGPDNRDPQLLGS 79
Query 79 AARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLR 138
+LA+ RGWS RVA+G V G+W AVVG QIA+HA PT L +GVL+V AESTAWATQLR
Sbjct 80 LTGDLARARGWSGRVAQGAVFGRWRAVVGDQIADHASPTTLTEGVLTVSAESTAWATQLR 139
Query 139 IMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
++Q+Q+LAKIAAAVG+ VV SLKI GP PSWRKGP ++ GRGPRDTYG
Sbjct 140 MVQSQILAKIAAAVGDGVVTSLKIVGPVGPSWRKGPYNVRGRGPRDTYG 188
>gi|312137519|ref|YP_004004855.1| hypothetical protein REQ_00050 [Rhodococcus equi 103S]
gi|311886858|emb|CBH46166.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=183
Score = 196 bits (499), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 117/181 (65%), Positives = 138/181 (77%), Gaps = 6/181 (3%)
Query 13 GERSMKSP-----GLDLVRRTLDEARAAARARGQDAGRGRVASVASGR-VAGRRRSWSGP 66
GE+S P G+DL RR L+EARAAA+A G+ G+GR + R + RRRSWSG
Sbjct 3 GEQSESQPEPEIKGVDLARRALEEARAAAKANGKAVGQGRKSPRGGVRALRSRRRSWSGA 62
Query 67 GPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSV 126
GPD RDPQP G ++K+RGWS +V+EG VLG+W+ VVG IA HA PT L DGVLSV
Sbjct 63 GPDDRDPQPFGALVSAVSKQRGWSTQVSEGTVLGRWADVVGPDIASHAEPTGLRDGVLSV 122
Query 127 IAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTY 186
AESTAWATQLR+MQAQ+LAKIAAAVG+ VV+SL+ITGP APSWRKG RHI+GRGPRDTY
Sbjct 123 SAESTAWATQLRMMQAQILAKIAAAVGHGVVKSLRITGPTAPSWRKGERHISGRGPRDTY 182
Query 187 G 187
G
Sbjct 183 G 183
>gi|325677516|ref|ZP_08157180.1| hypothetical protein HMPREF0724_14963 [Rhodococcus equi ATCC
33707]
gi|325551763|gb|EGD21461.1| hypothetical protein HMPREF0724_14963 [Rhodococcus equi ATCC
33707]
Length=183
Score = 196 bits (497), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 117/181 (65%), Positives = 138/181 (77%), Gaps = 6/181 (3%)
Query 13 GERSMKSP-----GLDLVRRTLDEARAAARARGQDAGRGRVASVASGR-VAGRRRSWSGP 66
GE+S P G+DL RR L+EARAAA+A G+ G+GR + R + RRRSWSG
Sbjct 3 GEQSEPQPEPELKGVDLARRALEEARAAAKANGKAVGQGRKSPRGGVRALRSRRRSWSGA 62
Query 67 GPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSV 126
GPD RDPQP G ++K+RGWS +V+EG VLG+W+ VVG IA HA PT L DGVLSV
Sbjct 63 GPDDRDPQPFGALVSAVSKQRGWSTQVSEGTVLGRWADVVGPDIASHAEPTGLRDGVLSV 122
Query 127 IAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTY 186
AESTAWATQLR+MQAQ+LAKIAAAVG+ VV+SL+ITGP APSWRKG RHI+GRGPRDTY
Sbjct 123 SAESTAWATQLRMMQAQILAKIAAAVGHGVVKSLRITGPTAPSWRKGERHISGRGPRDTY 182
Query 187 G 187
G
Sbjct 183 G 183
>gi|226362899|ref|YP_002780679.1| hypothetical protein ROP_34870 [Rhodococcus opacus B4]
gi|226241386|dbj|BAH51734.1| hypothetical protein [Rhodococcus opacus B4]
Length=187
Score = 189 bits (481), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 113/168 (68%), Positives = 130/168 (78%), Gaps = 1/168 (0%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASV-ASGRVAGRRRSWSGPGPDIRDPQPLGKA 79
G+DL RR L+EARAAA+A G+ G+GR + A RRR WSGPGPD RDPQP G
Sbjct 20 GIDLARRALEEARAAAKASGKSVGQGRRSGTGVRALRARRRRGWSGPGPDDRDPQPFGAL 79
Query 80 ARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRI 139
LAK+RGWS +V+EG VLG+W VVG IA HA PT L DG+LSV AESTAWATQLR+
Sbjct 80 TSALAKQRGWSPKVSEGTVLGRWVQVVGEDIAAHAEPTGLRDGILSVSAESTAWATQLRM 139
Query 140 MQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
MQ+Q+LAKIAAAVG+ VV+SL+ITGP APSWRKG RHI GRGPRDTYG
Sbjct 140 MQSQILAKIAAAVGDGVVKSLRITGPTAPSWRKGERHIRGRGPRDTYG 187
>gi|111020659|ref|YP_703631.1| hypothetical protein RHA1_ro03670 [Rhodococcus jostii RHA1]
gi|123340327|sp|Q0SAG3.1|Y3670_RHOSR RecName: Full=UPF0232 protein RHA1_ro03670
gi|110820189|gb|ABG95473.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=188
Score = 189 bits (479), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 112/168 (67%), Positives = 130/168 (78%), Gaps = 1/168 (0%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVA-SGRVAGRRRSWSGPGPDIRDPQPLGKA 79
G+DL RR L+EARAAA+A G+ G+GR + A RRR WSGPGPD RDPQP G
Sbjct 21 GIDLARRALEEARAAAKASGKSVGQGRRSGTGVRALRARRRRGWSGPGPDDRDPQPFGAL 80
Query 80 ARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRI 139
+AK+RGWS +V+EG VLG+W VVG IA HA PT L DG+LSV AESTAWATQLR+
Sbjct 81 TNAIAKQRGWSPKVSEGTVLGRWVQVVGEDIAAHAEPTGLRDGILSVSAESTAWATQLRM 140
Query 140 MQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
MQ+Q+LAKIAAAVG+ VV+SL+ITGP APSWRKG RHI GRGPRDTYG
Sbjct 141 MQSQILAKIAAAVGDGVVKSLRITGPTAPSWRKGERHIRGRGPRDTYG 188
>gi|1213061|emb|CAA63916.1| orf192 [Mycobacterium smegmatis str. MC2 155]
Length=192
Score = 187 bits (475), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 108/180 (60%), Positives = 128/180 (72%), Gaps = 7/180 (3%)
Query 8 PDQNRGERSMKSPGLDLVRRTLDEARAAARARGQDAGRGRVASVASGRVAGRRRSWSGPG 67
PD G R G+DLVRRTL+EAR + GQ R R S R + ++ G G
Sbjct 20 PDHLAGLR-----GIDLVRRTLEEARGRTQP-GQGCPRRRSGPAPSWREP-QAQNLVGAG 72
Query 68 PDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVI 127
RDPQ LG ++LAK RGWS RVAEG V+G+W AVVG QIA+HA PTALN+GVL+V
Sbjct 73 TRCRDPQLLGAVTQDLAKSRGWSARVAEGSVIGRWRAVVGDQIADHATPTALNEGVLTVT 132
Query 128 AESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
AESTAWATQLR++Q+QLLAKIAA VG+ VV +LKI GPA PSWRKG H++GRGPRDTYG
Sbjct 133 AESTAWATQLRMVQSQLLAKIAAVVGDGVVTTLKIVGPAGPSWRKGRYHVSGRGPRDTYG 192
>gi|54021968|ref|YP_116210.1| hypothetical protein nfa40 [Nocardia farcinica IFM 10152]
gi|54013476|dbj|BAD54846.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=189
Score = 185 bits (470), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 98/149 (66%), Positives = 110/149 (74%), Gaps = 1/149 (0%)
Query 40 GQDAGRGRVASVASGRVAGRRRS-WSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMV 98
G+ G+GR + V R GRRRS WSG PD RDPQ L + A +AK RGW +VAEG V
Sbjct 41 GKSVGQGRASPVRKLRAGGRRRSGWSGARPDDRDPQLLSQLATRIAKSRGWDGKVAEGTV 100
Query 99 LGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVR 158
G+W+ VVG IA HA P L DGVLS+ AESTAWATQLR++Q Q+LAKI AAVG VVR
Sbjct 101 FGRWAGVVGEDIAAHATPVTLKDGVLSIAAESTAWATQLRLLQPQILAKINAAVGQGVVR 160
Query 159 SLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
LKITGPAAPSWRKG RHI GRGPRDTYG
Sbjct 161 QLKITGPAAPSWRKGERHIKGRGPRDTYG 189
>gi|333917683|ref|YP_004491264.1| hypothetical protein AS9A_0004 [Amycolicicoccus subflavus DQS3-9A1]
gi|333479904|gb|AEF38464.1| hypothetical protein AS9A_0004 [Amycolicicoccus subflavus DQS3-9A1]
Length=165
Score = 179 bits (453), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/165 (59%), Positives = 118/165 (72%), Gaps = 2/165 (1%)
Query 25 VRRTLDEARAAARARGQDAGRGRVASVASGRVAGR--RRSWSGPGPDIRDPQPLGKAARE 82
+R+ LD+AR+ A R G+G S + GR RRSWSG PD RDPQ LG+ A
Sbjct 1 MRKVLDDARSRAGTRASVTGQGPTPSRSERGTKGRSLRRSWSGARPDDRDPQLLGQLAGS 60
Query 83 LAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQA 142
+AK+RGW+ +VA G VLG+W VVG IA HA P +L G+L+V AESTAWATQLR MQ+
Sbjct 61 IAKRRGWTDKVAAGAVLGRWETVVGSDIACHAEPRSLEHGILTVQAESTAWATQLRYMQS 120
Query 143 QLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
Q++A+IAAAVGN VV L+I GPAAPSWRKG H+ GRGPRDTYG
Sbjct 121 QIIARIAAAVGNGVVTKLRILGPAAPSWRKGELHVRGRGPRDTYG 165
>gi|343928738|ref|ZP_08768183.1| hypothetical protein GOALK_120_01650 [Gordonia alkanivorans NBRC
16433]
gi|343761487|dbj|GAA15109.1| hypothetical protein GOALK_120_01650 [Gordonia alkanivorans NBRC
16433]
Length=195
Score = 175 bits (444), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/168 (60%), Positives = 121/168 (73%), Gaps = 1/168 (0%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVA-SGRVAGRRRSWSGPGPDIRDPQPLGKA 79
G + R+ L+EARAAARA G+ GRGR + V + R A R+ WSG GPD RDPQP G+
Sbjct 28 GYERARKALEEARAAARAAGKSVGRGRASPVRRTPRGAQTRKRWSGSGPDARDPQPFGRL 87
Query 80 ARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRI 139
LAK RGW ++ EG + G W +VG IA HA+P L D VL V AESTAWATQLR
Sbjct 88 VGGLAKDRGWQEKIGEGTLFGMWDQIVGADIAAHAKPIELRDNVLHVQAESTAWATQLRY 147
Query 140 MQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
+Q+Q+LAKIAAAVG+ VV+SL+I+GP PSWRKG RH+ GRGPRDTYG
Sbjct 148 VQSQILAKIAAAVGDGVVKSLRISGPKGPSWRKGERHVRGRGPRDTYG 195
>gi|134096625|ref|YP_001102286.1| hypothetical protein SACE_0006 [Saccharopolyspora erythraea NRRL
2338]
gi|291005721|ref|ZP_06563694.1| hypothetical protein SeryN2_14469 [Saccharopolyspora erythraea
NRRL 2338]
gi|133909248|emb|CAL99360.1| hypothetical protein SACE_0006 [Saccharopolyspora erythraea NRRL
2338]
Length=173
Score = 174 bits (442), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 84/126 (67%), Positives = 99/126 (79%), Gaps = 0/126 (0%)
Query 62 SWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALND 121
SWSGPG D RDPQPLG+ A +A +RGW+ R++ G V G+WS +VG IAEH +P AL D
Sbjct 48 SWSGPGADDRDPQPLGRLASRIAAERGWADRLSGGRVFGEWSTLVGGDIAEHTKPVALKD 107
Query 122 GVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRG 181
G LSV AESTAWATQLR++Q Q+L +IA VG DVVR +K+ GPAAPSWR GPRHI GRG
Sbjct 108 GELSVQAESTAWATQLRLLQRQILKRIADGVGKDVVRRIKVQGPAAPSWRHGPRHIPGRG 167
Query 182 PRDTYG 187
PRDTYG
Sbjct 168 PRDTYG 173
>gi|324999886|ref|ZP_08120998.1| hypothetical protein PseP1_14006 [Pseudonocardia sp. P1]
Length=233
Score = 172 bits (437), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 90/170 (53%), Positives = 116/170 (69%), Gaps = 6/170 (3%)
Query 21 GLDLVRRTLDEARAAARARGQ---DAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLG 77
G DL R L AR + + + D R ++ V SG+ GRRR WSG GPD RDPQP G
Sbjct 67 GADLARDALRAARETSARKAEERADEARPKL-RVVSGK--GRRRRWSGSGPDDRDPQPFG 123
Query 78 KAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQL 137
+ ++ RGWS R+ + VLG+WS +VG +A+H P +L DG L++ AESTAWATQL
Sbjct 124 RVVSRVSMDRGWSSRLTDATVLGRWSQLVGSDVADHCTPVSLRDGELTLQAESTAWATQL 183
Query 138 RIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
R +Q QLL ++AAAVG DVVR +++ GP+ PSWR GPRH+ GRGPRDTYG
Sbjct 184 RTLQRQLLTRLAAAVGPDVVRRIRVVGPSGPSWRHGPRHVRGRGPRDTYG 233
>gi|326383913|ref|ZP_08205597.1| hypothetical protein SCNU_13308 [Gordonia neofelifaecis NRRL
B-59395]
gi|326197372|gb|EGD54562.1| hypothetical protein SCNU_13308 [Gordonia neofelifaecis NRRL
B-59395]
Length=183
Score = 172 bits (436), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 97/169 (58%), Positives = 120/169 (72%), Gaps = 2/169 (1%)
Query 21 GLDLVRRTLDEARAAARARGQDAGRGRVASVAS--GRVAGRRRSWSGPGPDIRDPQPLGK 78
G DL R L+EARA A+A+G+ G GR A + + RR WSG GPD RDPQPLG+
Sbjct 15 GYDLARAALEEARALAKAQGKSVGMGRSAPIRTKRRTGDRSRRRWSGSGPDSRDPQPLGR 74
Query 79 AARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLR 138
++A++ GW R++EG + G W +VG IA HA PT L VL V AESTAWATQLR
Sbjct 75 MVGKVAQQHGWESRISEGTLFGMWPQIVGEDIATHADPTRLEGTVLHVRAESTAWATQLR 134
Query 139 IMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
MQ+Q++AKIA +G+ +V SL+ITGP APSWRKGPRHI+GRGPRDTYG
Sbjct 135 YMQSQIIAKIAKVIGHGMVTSLRITGPQAPSWRKGPRHISGRGPRDTYG 183
>gi|296392444|ref|YP_003657328.1| hypothetical protein Srot_0004 [Segniliparus rotundus DSM 44985]
gi|296179591|gb|ADG96497.1| protein of unknown function DUF721 [Segniliparus rotundus DSM
44985]
Length=170
Score = 170 bits (431), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 78/128 (61%), Positives = 94/128 (74%), Gaps = 0/128 (0%)
Query 60 RRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTAL 119
R WSGPGPD+RDP+P + +L KK WS ++AEG + W +VG QIA HA+P L
Sbjct 43 RFRWSGPGPDVRDPKPFSELCDQLQKKDTWSAKLAEGKIFSLWPMIVGDQIASHAKPLHL 102
Query 120 NDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAG 179
DG+L V AESTAWATQLR+MQ QLL K + +G VVR+LKITGP APSW+KG RH+ G
Sbjct 103 TDGLLHVQAESTAWATQLRLMQNQLLEKFSHHMGTRVVRALKITGPKAPSWKKGERHVRG 162
Query 180 RGPRDTYG 187
RGPRDTYG
Sbjct 163 RGPRDTYG 170
>gi|300781942|ref|YP_003762233.1| hypothetical protein AMED_0005 [Amycolatopsis mediterranei U32]
gi|299791456|gb|ADJ41831.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340523295|gb|AEK38500.1| hypothetical protein RAM_00025 [Amycolatopsis mediterranei S699]
Length=167
Score = 170 bits (431), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 88/149 (60%), Positives = 104/149 (70%), Gaps = 2/149 (1%)
Query 39 RGQDAGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMV 98
RG GR R A+ G + RRR WSGPG D RDPQPLG+ L RGW+ V V
Sbjct 21 RGTSPGRRRPAT--GGGQSPRRRRWSGPGADARDPQPLGRLVSRLMSDRGWNESVTSARV 78
Query 99 LGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVR 158
QW+ +VG +AEHA+P AL DG L+V A STAWATQLR++Q +LL KIAA VGN VV+
Sbjct 79 FAQWARLVGEDVAEHAQPIALKDGELTVRASSTAWATQLRLLQGKLLHKIAAGVGNGVVK 138
Query 159 SLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
++I GP APSWRKGPRH+ GRGPRDTYG
Sbjct 139 RMRIQGPTAPSWRKGPRHVPGRGPRDTYG 167
>gi|331693903|ref|YP_004330142.1| hypothetical protein Psed_0004 [Pseudonocardia dioxanivorans
CB1190]
gi|326948592|gb|AEA22289.1| UPF0232 protein [Pseudonocardia dioxanivorans CB1190]
Length=198
Score = 169 bits (429), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 90/168 (54%), Positives = 112/168 (67%), Gaps = 1/168 (0%)
Query 21 GLDLVRRTLDEARAAARARGQD-AGRGRVASVASGRVAGRRRSWSGPGPDIRDPQPLGKA 79
G DL R L AR A+ R + AG+ G RR WSGPGPD RDPQP G+
Sbjct 31 GPDLAREALRAAREASAQRAAERAGKDDPRRRRGAGRRGSRRRWSGPGPDERDPQPFGRL 90
Query 80 ARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRI 139
++ RGWS R+ + VLG+W +VG IA+H P +L DG L++ AESTAWATQLR
Sbjct 91 VARVSMDRGWSPRLTDATVLGRWPQLVGPDIADHCTPVSLRDGELTLQAESTAWATQLRT 150
Query 140 MQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
+Q QLLA++A AVGNDVVR +++ GP+ PSWR GPRH+ GRGPRDTYG
Sbjct 151 LQRQLLARLAVAVGNDVVRRIRVVGPSGPSWRHGPRHVRGRGPRDTYG 198
>gi|302531360|ref|ZP_07283702.1| UPF0232 protein [Streptomyces sp. AA4]
gi|302440255|gb|EFL12071.1| UPF0232 protein [Streptomyces sp. AA4]
Length=211
Score = 167 bits (424), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 80/128 (63%), Positives = 94/128 (74%), Gaps = 0/128 (0%)
Query 60 RRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTAL 119
RR WSGPG D RDPQPLG+ L GW + V GQW+ +VG +AEHA+P AL
Sbjct 84 RRRWSGPGADPRDPQPLGRLVSRLISDSGWQDTMTNARVFGQWARLVGEDVAEHAQPVAL 143
Query 120 NDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAG 179
DG L+V A STAWATQLR++Q +LLAKIAA VGN VV+ ++I GP APSWRKGPRH+ G
Sbjct 144 KDGELTVRASSTAWATQLRLLQGKLLAKIAAGVGNGVVKRMRIQGPTAPSWRKGPRHVPG 203
Query 180 RGPRDTYG 187
RGPRDTYG
Sbjct 204 RGPRDTYG 211
>gi|296137758|ref|YP_003645001.1| hypothetical protein Tpau_0008 [Tsukamurella paurometabola DSM
20162]
gi|296025892|gb|ADG76662.1| protein of unknown function DUF721 [Tsukamurella paurometabola
DSM 20162]
Length=180
Score = 166 bits (420), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/150 (60%), Positives = 103/150 (69%), Gaps = 2/150 (1%)
Query 40 GQDAGRGRVASVASG--RVAGRRRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGM 97
G+ GRG A + G R+ + WSGP PD RDPQ G +AK RGW +V+EG
Sbjct 31 GKSVGRGNSAPMTGGVRRLRQGSKRWSGPAPDGRDPQRFGALIGGIAKARGWDKKVSEGT 90
Query 98 VLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVV 157
VLG W VVG +A HA+ +L + VL V AESTAWATQLR+MQ QLLAKI AAVG VV
Sbjct 91 VLGCWDTVVGADVAAHAQAVSLREKVLYVSAESTAWATQLRLMQPQLLAKINAAVGQGVV 150
Query 158 RSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
SL ITGP+APSWRKGP H+ GRGPRDTYG
Sbjct 151 TSLTITGPSAPSWRKGPLHVPGRGPRDTYG 180
>gi|317509430|ref|ZP_07967048.1| hypothetical protein HMPREF9336_03420 [Segniliparus rugosus ATCC
BAA-974]
gi|316252259|gb|EFV11711.1| hypothetical protein HMPREF9336_03420 [Segniliparus rugosus ATCC
BAA-974]
Length=165
Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/176 (51%), Positives = 105/176 (60%), Gaps = 14/176 (7%)
Query 13 GERSMKSPGLDLVRRTLDEARA-AARARGQDAGRGRVASVASGRVAGRRRSWSGPGPDIR 71
GE +SP DL RR + E A RA D R WSGPGPD R
Sbjct 3 GEDETRSPAEDLARRLIGEFGTRAPRAPKPDQ-------------RPERTRWSGPGPDAR 49
Query 72 DPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIAEST 131
DP+ + +L KK WS ++AEG + +W +++G Q A + P L DGVL V EST
Sbjct 50 DPKTFSEVFEQLRKKDTWSQKLAEGKIFSEWGSIMGEQNAAKSTPQQLVDGVLHVQTEST 109
Query 132 AWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
AWATQLR+MQ Q+L KIA VG VV SLKITGP APSWRKG RH+ GRGPRDTYG
Sbjct 110 AWATQLRLMQKQILEKIAGEVGKGVVFSLKITGPKAPSWRKGERHVRGRGPRDTYG 165
>gi|257054094|ref|YP_003131926.1| putative RNA-binding protein containing Zn ribbon [Saccharomonospora
viridis DSM 43017]
gi|256583966|gb|ACU95099.1| predicted RNA-binding protein containing Zn ribbon [Saccharomonospora
viridis DSM 43017]
Length=217
Score = 156 bits (395), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 84/129 (66%), Positives = 101/129 (79%), Gaps = 0/129 (0%)
Query 59 RRRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTA 118
RRR WSGPG D RDPQP G+ +A + GWS R+A G V GQWS +VG +IAEHA+P +
Sbjct 89 RRRRWSGPGFDERDPQPFGRLLSNMATQLGWSARLANGRVFGQWSTLVGAEIAEHAQPMS 148
Query 119 LNDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIA 178
LN+G L+V A STAWATQLR++Q QLLA+IAA VG+ VV ++I GP APSWRKGP+HI
Sbjct 149 LNNGELTVRASSTAWATQLRLLQRQLLARIAAGVGHGVVTRMRIQGPTAPSWRKGPKHIP 208
Query 179 GRGPRDTYG 187
GRGPRDTYG
Sbjct 209 GRGPRDTYG 217
>gi|284988634|ref|YP_003407188.1| hypothetical protein Gobs_0005 [Geodermatophilus obscurus DSM
43160]
gi|284061879|gb|ADB72817.1| protein of unknown function DUF721 [Geodermatophilus obscurus
DSM 43160]
Length=166
Score = 153 bits (386), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 73/133 (55%), Positives = 93/133 (70%), Gaps = 0/133 (0%)
Query 55 RVAGRRRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHA 114
R+AG +R+WSGP P DPQPL + L + + W+ G V G+WSA+VG +IA H
Sbjct 34 RIAGPKRTWSGPRPGDDDPQPLARLVDSLVETQDWTEHTKVGAVFGRWSALVGPEIAAHC 93
Query 115 RPTALNDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGP 174
P L +G L V+AESTAWATQLR++ +LAK+ A VG DVVR L++ GP APSW+KGP
Sbjct 94 APQTLTEGELLVVAESTAWATQLRLLAPTILAKLHATVGGDVVRRLRVVGPTAPSWKKGP 153
Query 175 RHIAGRGPRDTYG 187
R + GRGPRDTYG
Sbjct 154 RSVRGRGPRDTYG 166
>gi|262200050|ref|YP_003271258.1| hypothetical protein Gbro_0004 [Gordonia bronchialis DSM 43247]
gi|262083397|gb|ACY19365.1| protein of unknown function DUF721 [Gordonia bronchialis DSM
43247]
Length=184
Score = 152 bits (383), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/151 (59%), Positives = 110/151 (73%), Gaps = 5/151 (3%)
Query 40 GQDAGRGRVASV---ASGRVAGRRRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEG 96
G+ G GR + V ASG +RR WSG GPD RDPQPLG+ A +A++RGW ++ EG
Sbjct 36 GKSVGHGRASPVRRPASGNK--KRRRWSGAGPDSRDPQPLGRLAGGVARERGWQAKIGEG 93
Query 97 MVLGQWSAVVGHQIAEHARPTALNDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDV 156
+ G W +VG IA HA+P +L D VL V AESTAWATQLR +QAQ++AKIAAA+G+ +
Sbjct 94 TLFGMWDQIVGADIAAHAQPISLRDKVLHVQAESTAWATQLRYVQAQIIAKIAAALGDGM 153
Query 157 VRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
V SL+ITGP PSWRKG RH+ GRGPRDTYG
Sbjct 154 VTSLRITGPKGPSWRKGERHVRGRGPRDTYG 184
>gi|319949428|ref|ZP_08023489.1| hypothetical protein ES5_08306 [Dietzia cinnamea P4]
gi|319436890|gb|EFV91949.1| hypothetical protein ES5_08306 [Dietzia cinnamea P4]
Length=130
Score = 151 bits (381), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 73/129 (57%), Positives = 92/129 (72%), Gaps = 0/129 (0%)
Query 59 RRRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTA 118
R+R W+G G D DPQPLG+ ++AKKRGW +VA G + +W +VG ++ HA P
Sbjct 2 RKRGWTGAGADPWDPQPLGRLVGQVAKKRGWDDKVATGRLFAEWGRIVGEDVSSHATPER 61
Query 119 LNDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIA 178
L +G+L V A STAWATQLR+M A +L KIAAA+G VR LK+ GP PSWRKGP H++
Sbjct 62 LEEGILHVRASSTAWATQLRLMSADILRKIAAAMGPGHVRRLKVEGPEKPSWRKGPLHVS 121
Query 179 GRGPRDTYG 187
GRGPRDTYG
Sbjct 122 GRGPRDTYG 130
>gi|256374165|ref|YP_003097825.1| hypothetical protein Amir_0005 [Actinosynnema mirum DSM 43827]
gi|255918468|gb|ACU33979.1| protein of unknown function DUF721 [Actinosynnema mirum DSM 43827]
Length=143
Score = 149 bits (377), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 77/125 (62%), Positives = 98/125 (79%), Gaps = 0/125 (0%)
Query 63 WSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDG 122
WSGPGPD RDPQPLG+ A +A RGW+ ++ G V+ QW +VG +AEHA+P + DG
Sbjct 19 WSGPGPDDRDPQPLGRLASRIAADRGWAEKLRGGQVIAQWPKLVGEDVAEHAQPVSFEDG 78
Query 123 VLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGP 182
L+V A+STAWATQLR++Q +LL KIAA +G +VV+ LK+ GPAAPSWR GPRH++GRGP
Sbjct 79 ELTVQADSTAWATQLRLLQRELLKKIAAGLGPNVVKRLKVLGPAAPSWRYGPRHVSGRGP 138
Query 183 RDTYG 187
RDTYG
Sbjct 139 RDTYG 143
>gi|309811365|ref|ZP_07705152.1| conserved hypothetical protein [Dermacoccus sp. Ellin185]
gi|308434672|gb|EFP58517.1| conserved hypothetical protein [Dermacoccus sp. Ellin185]
Length=202
Score = 141 bits (355), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 68/119 (58%), Positives = 86/119 (73%), Gaps = 0/119 (0%)
Query 69 DIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTALNDGVLSVIA 128
D RDPQ + + L +RGW+V VA G V+ +W+ +VG +AEHARP DGVL+V A
Sbjct 84 DGRDPQLIDSTMKRLLLERGWNVDVAAGAVMSRWADLVGAGVAEHARPLTFEDGVLTVRA 143
Query 129 ESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIAGRGPRDTYG 187
ESTAWATQL+++ A LLA IA VG VV L++ GP+APSW +GPR +AGRGPRDTYG
Sbjct 144 ESTAWATQLQLLTASLLASIADGVGEGVVNELRVVGPSAPSWVRGPRRVAGRGPRDTYG 202
>gi|302864513|ref|YP_003833150.1| hypothetical protein Micau_0005 [Micromonospora aurantiaca ATCC
27029]
gi|315500823|ref|YP_004079710.1| hypothetical protein ML5_0005 [Micromonospora sp. L5]
gi|302567372|gb|ADL43574.1| protein of unknown function DUF721 [Micromonospora aurantiaca
ATCC 27029]
gi|315407442|gb|ADU05559.1| protein of unknown function DUF721 [Micromonospora sp. L5]
Length=201
Score = 133 bits (335), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/129 (60%), Positives = 91/129 (71%), Gaps = 0/129 (0%)
Query 59 RRRSWSGPGPDIRDPQPLGKAARELAKKRGWSVRVAEGMVLGQWSAVVGHQIAEHARPTA 118
R R +SGPGPD RDPQPLG +L K RGW AE V G W VVG ++A+H+RP
Sbjct 73 RLRGYSGPGPDPRDPQPLGAVLDKLMKARGWQQPAAEATVFGAWEKVVGPEVAQHSRPVK 132
Query 119 LNDGVLSVIAESTAWATQLRIMQAQLLAKIAAAVGNDVVRSLKITGPAAPSWRKGPRHIA 178
L DG L+V A STAWATQLR++ LL +IA VG++VVR L I GPAAPSW +GPR +
Sbjct 133 LEDGELTVEARSTAWATQLRLLAGSLLQQIAREVGHNVVRKLHIHGPAAPSWSRGPRRVR 192
Query 179 GRGPRDTYG 187
GRGPRDTYG
Sbjct 193 GRGPRDTYG 201
Lambda K H
0.316 0.131 0.394
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 180588168880
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40