BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1265
Length=226
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608405|ref|NP_215781.1| hypothetical protein Rv1265 [Mycoba... 462 2e-128
gi|340626278|ref|YP_004744730.1| hypothetical protein MCAN_12791... 460 8e-128
gi|297730800|ref|ZP_06959918.1| hypothetical protein MtubKR_0689... 384 7e-105
gi|289554946|ref|ZP_06444156.1| hypothetical protein TBXG_02696 ... 377 5e-103
gi|308231798|ref|ZP_07413770.2| hypothetical protein TMAG_01897 ... 371 3e-101
gi|254820139|ref|ZP_05225140.1| hypothetical protein MintA_09441... 326 2e-87
gi|296170174|ref|ZP_06851769.1| conserved hypothetical protein [... 324 6e-87
gi|342862144|ref|ZP_08718787.1| hypothetical protein MCOL_24756 ... 318 4e-85
gi|41408604|ref|NP_961440.1| hypothetical protein MAP2506c [Myco... 315 2e-84
gi|118464316|ref|YP_880658.1| hypothetical protein MAV_1416 [Myc... 313 9e-84
gi|240172635|ref|ZP_04751294.1| hypothetical protein MkanA1_2519... 310 1e-82
gi|108800907|ref|YP_641104.1| hypothetical protein Mmcs_3943 [My... 299 2e-79
gi|183984143|ref|YP_001852434.1| hypothetical protein MMAR_4172 ... 284 6e-75
gi|118619582|ref|YP_907914.1| hypothetical protein MUL_4469 [Myc... 272 3e-71
gi|325677319|ref|ZP_08156985.1| hypothetical protein HMPREF0724_... 263 1e-68
gi|312141467|ref|YP_004008803.1| hypothetical protein REQ_41570 ... 256 2e-66
gi|226364617|ref|YP_002782399.1| hypothetical protein ROP_52070 ... 255 3e-66
gi|111022121|ref|YP_705093.1| hypothetical protein RHA1_ro05154 ... 255 4e-66
gi|120405396|ref|YP_955225.1| hypothetical protein Mvan_4443 [My... 254 6e-66
gi|118469659|ref|YP_889261.1| hypothetical protein MSMEG_5010 [M... 253 1e-65
gi|169628504|ref|YP_001702153.1| hypothetical protein MAB_1413 [... 248 4e-64
gi|229494415|ref|ZP_04388178.1| conserved hypothetical protein [... 248 5e-64
gi|226304554|ref|YP_002764512.1| hypothetical protein RER_10650 ... 248 5e-64
gi|54027472|ref|YP_121714.1| hypothetical protein nfa54980 [Noca... 241 4e-62
gi|296139931|ref|YP_003647174.1| hypothetical protein Tpau_2226 ... 224 1e-56
gi|336116164|ref|YP_004570930.1| hypothetical protein MLP_05130 ... 219 3e-55
gi|84495896|ref|ZP_00994750.1| hypothetical protein JNB_00215 [J... 216 2e-54
gi|302530107|ref|ZP_07282449.1| Smu12A [Streptomyces sp. AA4] >g... 215 3e-54
gi|325002196|ref|ZP_08123308.1| hypothetical protein PseP1_25691... 212 4e-53
gi|300789898|ref|YP_003770189.1| hypothetical protein AMED_8084 ... 205 4e-51
gi|336176844|ref|YP_004582219.1| hypothetical protein FsymDg_078... 203 2e-50
gi|331694813|ref|YP_004331052.1| hypothetical protein Psed_0947 ... 201 5e-50
gi|291298257|ref|YP_003509535.1| hypothetical protein Snas_0730 ... 201 9e-50
gi|258654535|ref|YP_003203691.1| hypothetical protein Namu_4415 ... 196 3e-48
gi|86742578|ref|YP_482978.1| hypothetical protein Francci3_3899 ... 193 1e-47
gi|302556024|ref|ZP_07308366.1| conserved hypothetical protein [... 189 2e-46
gi|297204297|ref|ZP_06921694.1| conserved hypothetical protein [... 189 4e-46
gi|256389919|ref|YP_003111483.1| hypothetical protein Caci_0707 ... 188 6e-46
gi|134097519|ref|YP_001103180.1| hypothetical protein SACE_0921 ... 187 1e-45
gi|302562328|ref|ZP_07314670.1| conserved hypothetical protein [... 186 2e-45
gi|158312687|ref|YP_001505195.1| hypothetical protein Franean1_0... 184 6e-45
gi|284989524|ref|YP_003408078.1| hypothetical protein Gobs_0946 ... 184 1e-44
gi|111225551|ref|YP_716345.1| hypothetical protein FRAAL6207 [Fr... 182 3e-44
gi|291435451|ref|ZP_06574841.1| conserved hypothetical protein [... 179 3e-43
gi|343926262|ref|ZP_08765771.1| hypothetical protein GOALK_056_0... 177 8e-43
gi|284029165|ref|YP_003379096.1| hypothetical protein Kfla_1194 ... 177 8e-43
gi|326331376|ref|ZP_08197666.1| hypothetical protein NBCG_02814 ... 177 1e-42
gi|254383553|ref|ZP_04998903.1| conserved hypothetical protein [... 177 1e-42
gi|271969720|ref|YP_003343916.1| hypothetical protein Sros_8531 ... 176 2e-42
gi|221635719|ref|YP_002523595.1| hypothetical protein trd_A0313 ... 176 2e-42
>gi|15608405|ref|NP_215781.1| hypothetical protein Rv1265 [Mycobacterium tuberculosis H37Rv]
gi|15840711|ref|NP_335748.1| hypothetical protein MT1303 [Mycobacterium tuberculosis CDC1551]
gi|31792457|ref|NP_854950.1| hypothetical protein Mb1296 [Mycobacterium bovis AF2122/97]
51 more sequence titles
Length=226
Score = 462 bits (1189), Expect = 2e-128, Method: Compositional matrix adjust.
Identities = 226/226 (100%), Positives = 226/226 (100%), Gaps = 0/226 (0%)
Query 1 MVLARPDAVFAPARNRCHVSLPVNAMSLKMKVCNHVIMRHHHMHGRRYGRPGGWQQAQQP 60
MVLARPDAVFAPARNRCHVSLPVNAMSLKMKVCNHVIMRHHHMHGRRYGRPGGWQQAQQP
Sbjct 1 MVLARPDAVFAPARNRCHVSLPVNAMSLKMKVCNHVIMRHHHMHGRRYGRPGGWQQAQQP 60
Query 61 DASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRD 120
DASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRD
Sbjct 61 DASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRD 120
Query 121 ETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAG 180
ETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAG
Sbjct 121 ETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAG 180
Query 181 VARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDLPA 226
VARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDLPA
Sbjct 181 VARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDLPA 226
>gi|340626278|ref|YP_004744730.1| hypothetical protein MCAN_12791 [Mycobacterium canettii CIPT
140010059]
gi|340004468|emb|CCC43611.1| hypothetical protein MCAN_12791 [Mycobacterium canettii CIPT
140010059]
Length=226
Score = 460 bits (1183), Expect = 8e-128, Method: Compositional matrix adjust.
Identities = 224/226 (99%), Positives = 226/226 (100%), Gaps = 0/226 (0%)
Query 1 MVLARPDAVFAPARNRCHVSLPVNAMSLKMKVCNHVIMRHHHMHGRRYGRPGGWQQAQQP 60
MVLARPDAVFAPARNRCHVSLPVNAMSL+MKVCNHVIM+HHHMHGRRYGRPGGWQQAQQP
Sbjct 1 MVLARPDAVFAPARNRCHVSLPVNAMSLQMKVCNHVIMKHHHMHGRRYGRPGGWQQAQQP 60
Query 61 DASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRD 120
DASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRD
Sbjct 61 DASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRD 120
Query 121 ETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAG 180
ETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAG
Sbjct 121 ETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAG 180
Query 181 VARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDLPA 226
VARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDLPA
Sbjct 181 VARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDLPA 226
>gi|297730800|ref|ZP_06959918.1| hypothetical protein MtubKR_06899 [Mycobacterium tuberculosis
KZN R506]
gi|313658132|ref|ZP_07815012.1| hypothetical protein MtubKV_06914 [Mycobacterium tuberculosis
KZN V2475]
Length=189
Score = 384 bits (985), Expect = 7e-105, Method: Compositional matrix adjust.
Identities = 189/189 (100%), Positives = 189/189 (100%), Gaps = 0/189 (0%)
Query 38 MRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKL 97
MRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKL
Sbjct 1 MRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKL 60
Query 98 PGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHI 157
PGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHI
Sbjct 61 PGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHI 120
Query 158 AVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDL 217
AVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDL
Sbjct 121 AVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDL 180
Query 218 RAQGPDLPA 226
RAQGPDLPA
Sbjct 181 RAQGPDLPA 189
>gi|289554946|ref|ZP_06444156.1| hypothetical protein TBXG_02696 [Mycobacterium tuberculosis KZN
605]
gi|289439578|gb|EFD22071.1| hypothetical protein TBXG_02696 [Mycobacterium tuberculosis KZN
605]
Length=228
Score = 377 bits (969), Expect = 5e-103, Method: Compositional matrix adjust.
Identities = 183/184 (99%), Positives = 184/184 (100%), Gaps = 0/184 (0%)
Query 23 VNAMSLKMKVCNHVIMRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDP 82
+NAMSLKMKVCNHVIMRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDP
Sbjct 1 MNAMSLKMKVCNHVIMRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDP 60
Query 83 TVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVS 142
TVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVS
Sbjct 61 TVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVS 120
Query 143 WGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEE 202
WGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEE
Sbjct 121 WGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEE 180
Query 203 WLAK 206
WLAK
Sbjct 181 WLAK 184
>gi|308231798|ref|ZP_07413770.2| hypothetical protein TMAG_01897 [Mycobacterium tuberculosis SUMu001]
gi|308370011|ref|ZP_07419991.2| hypothetical protein TMBG_01342 [Mycobacterium tuberculosis SUMu002]
gi|308370676|ref|ZP_07422307.2| hypothetical protein TMCG_00895 [Mycobacterium tuberculosis SUMu003]
24 more sequence titles
Length=184
Score = 371 bits (953), Expect = 3e-101, Method: Compositional matrix adjust.
Identities = 184/184 (100%), Positives = 184/184 (100%), Gaps = 0/184 (0%)
Query 43 MHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLES 102
MHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLES
Sbjct 1 MHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLES 60
Query 103 PEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVM 162
PEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVM
Sbjct 61 PEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVM 120
Query 163 TRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGP 222
TRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGP
Sbjct 121 TRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGP 180
Query 223 DLPA 226
DLPA
Sbjct 181 DLPA 184
>gi|254820139|ref|ZP_05225140.1| hypothetical protein MintA_09441 [Mycobacterium intracellulare
ATCC 13950]
Length=190
Score = 326 bits (835), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 162/188 (87%), Positives = 170/188 (91%), Gaps = 1/188 (0%)
Query 38 MRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKL 97
M+ HH HGRRYGRPGGWQQAQQPDA AAEWFAGRLPE WFDGDPTVIVDREEITV+G+L
Sbjct 1 MKAHHTHGRRYGRPGGWQQAQQPDAGDAAEWFAGRLPEKWFDGDPTVIVDREEITVMGRL 60
Query 98 PGLESPEE-ESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTH 156
P S E ES ARA+GR SRFR+ETR ERM IADEAQ+RYGRKVSWGVEVG ERILFTH
Sbjct 61 PDTASAESGESEARAAGRASRFREETRSERMHIADEAQDRYGRKVSWGVEVGSERILFTH 120
Query 157 IAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDD 216
IAVPVMTRLKQPERQVLDTLVDAGVARSR+DALAWSVKLVGEH EEWLAKLR AM+AVDD
Sbjct 121 IAVPVMTRLKQPERQVLDTLVDAGVARSRADALAWSVKLVGEHAEEWLAKLRDAMTAVDD 180
Query 217 LRAQGPDL 224
LRAQGPDL
Sbjct 181 LRAQGPDL 188
>gi|296170174|ref|ZP_06851769.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295895166|gb|EFG74882.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=193
Score = 324 bits (831), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 163/192 (85%), Positives = 172/192 (90%), Gaps = 5/192 (2%)
Query 38 MRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKL 97
M+ H HGRRYGR GGWQQAQQPDAS AAEWFAGRLPEDWFDGDPTVIVDREEITVIG+L
Sbjct 1 MKSQHTHGRRYGRTGGWQQAQQPDASDAAEWFAGRLPEDWFDGDPTVIVDREEITVIGRL 60
Query 98 PGLESPE-EESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVG----GERI 152
P L E EES ARA+GR SRFR+ETR ERM IADEAQ+RYGRKVSWGVEVG ERI
Sbjct 61 PELAGSENEESEARAAGRASRFREETRSERMHIADEAQDRYGRKVSWGVEVGSKTGAERI 120
Query 153 LFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMS 212
LFTHIAVPVMTRLKQPERQVLDTLVDAGVARSR+DALAWSV+LVGEH EEWLAKLR+AMS
Sbjct 121 LFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRADALAWSVRLVGEHAEEWLAKLRSAMS 180
Query 213 AVDDLRAQGPDL 224
AVDDLRAQGPD+
Sbjct 181 AVDDLRAQGPDI 192
>gi|342862144|ref|ZP_08718787.1| hypothetical protein MCOL_24756 [Mycobacterium colombiense CECT
3035]
gi|342130448|gb|EGT83763.1| hypothetical protein MCOL_24756 [Mycobacterium colombiense CECT
3035]
Length=190
Score = 318 bits (814), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 163/191 (86%), Positives = 170/191 (90%), Gaps = 7/191 (3%)
Query 38 MRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKL 97
M+ HH H RRYGRPGGWQQAQQPDAS AAEWFAGRLPEDWFDGDPTV+VDREEITVIGKL
Sbjct 1 MKAHHTH-RRYGRPGGWQQAQQPDASDAAEWFAGRLPEDWFDGDPTVVVDREEITVIGKL 59
Query 98 PGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEV----GGERIL 153
P E E S ARA+GR SRFR++TR ERM IADEAQ+RYGRKVSWGVEV G ERIL
Sbjct 60 PDAEG--EASEARAAGRASRFREDTRSERMHIADEAQDRYGRKVSWGVEVASTAGAERIL 117
Query 154 FTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSA 213
FTHIAVPVMTRLKQPERQVLDTLVDAGVARSR+DALAWSVKLVGEH EEWLAKLR AMSA
Sbjct 118 FTHIAVPVMTRLKQPERQVLDTLVDAGVARSRADALAWSVKLVGEHAEEWLAKLRGAMSA 177
Query 214 VDDLRAQGPDL 224
VDDLRAQGPDL
Sbjct 178 VDDLRAQGPDL 188
>gi|41408604|ref|NP_961440.1| hypothetical protein MAP2506c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|254774294|ref|ZP_05215810.1| hypothetical protein MaviaA2_06455 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|41396962|gb|AAS04823.1| hypothetical protein MAP_2506c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336458609|gb|EGO37576.1| hypothetical protein MAPs_11990 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=191
Score = 315 bits (808), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 163/192 (85%), Positives = 170/192 (89%), Gaps = 6/192 (3%)
Query 38 MRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKL 97
M+ HH H RRYGRPGGWQQAQQPDAS AAEWFAGRLPE WFDGDPTVIVDREEITVIGKL
Sbjct 1 MKAHHAH-RRYGRPGGWQQAQQPDASDAAEWFAGRLPEQWFDGDPTVIVDREEITVIGKL 59
Query 98 PGLESPEEE-SAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGG----ERI 152
P E+E S ARA+GR SRFR+ETR ERM IADEAQ+RYGRKVSWGVEVG ERI
Sbjct 60 PEPADAEKEASEARAAGRASRFREETRSERMRIADEAQDRYGRKVSWGVEVGSKTGTERI 119
Query 153 LFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMS 212
LFTHIAVPVMTRLKQPERQVLDTLVDAGVARSR+DALAWSVKLVGEH EEWLA+LR AMS
Sbjct 120 LFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRADALAWSVKLVGEHAEEWLAQLRDAMS 179
Query 213 AVDDLRAQGPDL 224
AVDDLRAQGPDL
Sbjct 180 AVDDLRAQGPDL 191
>gi|118464316|ref|YP_880658.1| hypothetical protein MAV_1416 [Mycobacterium avium 104]
gi|118165603|gb|ABK66500.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=191
Score = 313 bits (803), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 162/192 (85%), Positives = 169/192 (89%), Gaps = 6/192 (3%)
Query 38 MRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKL 97
M+ HH H RRYGRPGGWQQAQQPDAS AAEWFAGRLPE WFDGDPTVIVDREEITVIGKL
Sbjct 1 MKAHHAH-RRYGRPGGWQQAQQPDASDAAEWFAGRLPEQWFDGDPTVIVDREEITVIGKL 59
Query 98 PGLESPEEE-SAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEV----GGERI 152
P E+E S AR +GR SRFR+ETR ERM IADEAQ+RYGRKVSWGVEV G ERI
Sbjct 60 PEPADAEKEASEARTAGRASRFREETRSERMRIADEAQDRYGRKVSWGVEVRSKTGTERI 119
Query 153 LFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMS 212
LFTHIAVPVMTRLKQPERQVLDTLVDAGVARSR+DALAWSVKLVGEH EEWLA+LR AMS
Sbjct 120 LFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRADALAWSVKLVGEHAEEWLAQLRDAMS 179
Query 213 AVDDLRAQGPDL 224
AVDDLRAQGPDL
Sbjct 180 AVDDLRAQGPDL 191
>gi|240172635|ref|ZP_04751294.1| hypothetical protein MkanA1_25195 [Mycobacterium kansasii ATCC
12478]
Length=189
Score = 310 bits (794), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 159/187 (86%), Positives = 168/187 (90%), Gaps = 5/187 (2%)
Query 43 MHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLE- 101
MHG R+GRPGGWQQAQQPDAS AAEWFAGRLPEDWFDGDPTVIVDREEITVIGKL E
Sbjct 1 MHGHRHGRPGGWQQAQQPDASDAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLAAPED 60
Query 102 SPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEV----GGERILFTHI 157
S E+S AR GR SRFR+ETR ERM IADEAQ+RYGRKVSWGV+V G ERILFTHI
Sbjct 61 SGTEQSPARGKGRASRFREETRSERMRIADEAQDRYGRKVSWGVDVVSGTGTERILFTHI 120
Query 158 AVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDL 217
AVPVMTRL+QPERQVLDTLVDAGVARSR+DALAWSV+LVGEHTEEWLAKLRTAM+AVDDL
Sbjct 121 AVPVMTRLRQPERQVLDTLVDAGVARSRADALAWSVRLVGEHTEEWLAKLRTAMAAVDDL 180
Query 218 RAQGPDL 224
RAQGPDL
Sbjct 181 RAQGPDL 187
>gi|108800907|ref|YP_641104.1| hypothetical protein Mmcs_3943 [Mycobacterium sp. MCS]
gi|119870047|ref|YP_939999.1| hypothetical protein Mkms_4017 [Mycobacterium sp. KMS]
gi|126436532|ref|YP_001072223.1| hypothetical protein Mjls_3957 [Mycobacterium sp. JLS]
gi|108771326|gb|ABG10048.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119696136|gb|ABL93209.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126236332|gb|ABN99732.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=203
Score = 299 bits (766), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 152/206 (74%), Positives = 174/206 (85%), Gaps = 8/206 (3%)
Query 23 VNAMSLKMKVCNHVIMRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDP 82
VN ++ CNHVIM+ HH RR GR GGWQQA+QPDAS AA+WFAGRLP+ WF GDP
Sbjct 2 VNTPPPAVQRCNHVIMKTHHH--RRPGRAGGWQQAEQPDASDAADWFAGRLPDGWFAGDP 59
Query 83 TVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVS 142
TV+VDREEITVIG+LP E E+ES ARASGR +RFR++TRPERM IADEA+ RYGRKV+
Sbjct 60 TVVVDREEITVIGRLP--EPAEQESEARASGRAARFREQTRPERMQIADEAEARYGRKVA 117
Query 143 WGVEVG----GERILFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGE 198
WGVE+G ERILFTHIAVPVMTRL+QPERQVLDTLVDAGVARSRSDALAWSV+LVG+
Sbjct 118 WGVEIGSGADAERILFTHIAVPVMTRLRQPERQVLDTLVDAGVARSRSDALAWSVRLVGQ 177
Query 199 HTEEWLAKLRTAMSAVDDLRAQGPDL 224
H +EWLA+LR AM+ VDDLRA+GP L
Sbjct 178 HADEWLAQLRDAMAKVDDLRAEGPQL 203
>gi|183984143|ref|YP_001852434.1| hypothetical protein MMAR_4172 [Mycobacterium marinum M]
gi|183177469|gb|ACC42579.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=195
Score = 284 bits (727), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 147/180 (82%), Positives = 155/180 (87%), Gaps = 5/180 (2%)
Query 52 GGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLE-SPEEESAAR 110
GGWQQAQQPDAS AA+WFAGRLPEDWF+G P V+VDREEITVIG L E S E+S A
Sbjct 16 GGWQQAQQPDASDAADWFAGRLPEDWFEGAPAVVVDREEITVIGTLGAPENSGSEQSKAH 75
Query 111 ASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEV----GGERILFTHIAVPVMTRLK 166
+ GR SRFR+ETR ERM IADEAQ RY RKVSWGV+V G ERILFTHIAVPVMTRLK
Sbjct 76 SEGRASRFREETRAERMNIADEAQERYARKVSWGVDVVSDAGMERILFTHIAVPVMTRLK 135
Query 167 QPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDLPA 226
QPERQVLDTLVDAGVARSR+DALAWSVKLVGEHTEEWL KLRTAMSAVDDLRAQGPDL A
Sbjct 136 QPERQVLDTLVDAGVARSRADALAWSVKLVGEHTEEWLDKLRTAMSAVDDLRAQGPDLQA 195
>gi|118619582|ref|YP_907914.1| hypothetical protein MUL_4469 [Mycobacterium ulcerans Agy99]
gi|118571692|gb|ABL06443.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=174
Score = 272 bits (695), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 139/174 (80%), Positives = 148/174 (86%), Gaps = 5/174 (2%)
Query 43 MHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLE- 101
MHGRR+GR GGWQQAQQPDAS A+WFAGRLPEDWF+G P V+VDREEITVIG L E
Sbjct 1 MHGRRHGRSGGWQQAQQPDASDTADWFAGRLPEDWFEGAPAVVVDREEITVIGTLGAPEN 60
Query 102 SPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEV----GGERILFTHI 157
S E+S A + GR SRFR+ETR ERM IADEAQ RY RKVSWGV+V G ERILFTHI
Sbjct 61 SGSEQSKAHSEGRASRFREETRAERMNIADEAQERYARKVSWGVDVVSDAGTERILFTHI 120
Query 158 AVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAM 211
AVPVMTRLKQPERQVLDTLVDAGVARSR+DALAWSVKLVGEHTEEWL KLRTAM
Sbjct 121 AVPVMTRLKQPERQVLDTLVDAGVARSRADALAWSVKLVGEHTEEWLDKLRTAM 174
>gi|325677319|ref|ZP_08156985.1| hypothetical protein HMPREF0724_14768 [Rhodococcus equi ATCC
33707]
gi|325552016|gb|EGD21712.1| hypothetical protein HMPREF0724_14768 [Rhodococcus equi ATCC
33707]
Length=191
Score = 263 bits (672), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 130/180 (73%), Positives = 149/180 (83%), Gaps = 2/180 (1%)
Query 45 GRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPE 104
G R+GRPGGWQQA PDAS A+EWFAGRLP+DWF G TV VDREEI V G+LPG ++
Sbjct 14 GHRFGRPGGWQQADAPDASDASEWFAGRLPDDWFTGPATVEVDREEIVVFGELPGEDT-- 71
Query 105 EESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTR 164
E SAA GRV+RFR+ TRP RM IADEAQ RYGRKV+WGV +G RILFTH+AVPVMTR
Sbjct 72 ENSAAAEEGRVARFRESTRPARMQIADEAQARYGRKVAWGVTIGDRRILFTHLAVPVMTR 131
Query 165 LKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
L+QPER+VLDTLVDAGVARSR+DALAW+VKL G+H EEWLA+LR AM VDDLR+ GP +
Sbjct 132 LRQPERKVLDTLVDAGVARSRADALAWTVKLAGQHAEEWLAELREAMRKVDDLRSTGPQI 191
>gi|312141467|ref|YP_004008803.1| hypothetical protein REQ_41570 [Rhodococcus equi 103S]
gi|311890806|emb|CBH50125.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=191
Score = 256 bits (654), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 127/175 (73%), Positives = 145/175 (83%), Gaps = 2/175 (1%)
Query 50 RPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAA 109
RPGGWQQA PDAS A+EWFAGRLP+DWF G TV VDREEI V G+LPG ++ E SAA
Sbjct 19 RPGGWQQADAPDASDASEWFAGRLPDDWFTGPATVEVDREEIVVFGELPGEDT--ENSAA 76
Query 110 RASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPE 169
GRV+RFR+ TRP RM IADEAQ RYGRKV+WGV +G RILFTH+AVPVMTRL+QPE
Sbjct 77 AEEGRVARFRESTRPARMQIADEAQARYGRKVAWGVTIGDRRILFTHLAVPVMTRLRQPE 136
Query 170 RQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
R+VLDTLVDAGVARSR+DALAW+VKL G+H EEWLA+LR AM VDDLR+ GP +
Sbjct 137 RKVLDTLVDAGVARSRADALAWTVKLAGQHAEEWLAELREAMRKVDDLRSTGPQI 191
>gi|226364617|ref|YP_002782399.1| hypothetical protein ROP_52070 [Rhodococcus opacus B4]
gi|226243106|dbj|BAH53454.1| hypothetical protein [Rhodococcus opacus B4]
Length=188
Score = 255 bits (651), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 127/188 (68%), Positives = 152/188 (81%), Gaps = 1/188 (0%)
Query 38 MRHHHMHG-RRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGK 96
MR+ G RR+GRPGGWQQA PDAS AAEWFAGRLP+DWF G TV VDREEI V+G+
Sbjct 1 MRNAQGPGPRRFGRPGGWQQADVPDASDAAEWFAGRLPDDWFTGPATVEVDREEIVVVGE 60
Query 97 LPGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTH 156
LP +ES E S A GR+SRFR+ TR +RM IADEA+ RYGRKV+WGV + + LFTH
Sbjct 61 LPPVESAESGSEAAIDGRISRFRETTRGDRMRIADEAERRYGRKVAWGVSLDDHKTLFTH 120
Query 157 IAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDD 216
+AVPVMTRL+QPER+VLDTLVDAGVARSRSDAL W+V+L G+H+E+WLA+LR AM+ VD+
Sbjct 121 LAVPVMTRLRQPERKVLDTLVDAGVARSRSDALMWTVRLAGQHSEQWLAELREAMAKVDE 180
Query 217 LRAQGPDL 224
LRA GP +
Sbjct 181 LRADGPQI 188
>gi|111022121|ref|YP_705093.1| hypothetical protein RHA1_ro05154 [Rhodococcus jostii RHA1]
gi|110821651|gb|ABG96935.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=188
Score = 255 bits (651), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 126/188 (68%), Positives = 152/188 (81%), Gaps = 1/188 (0%)
Query 38 MRHHHMHG-RRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGK 96
MR+ G RR+GRPGGWQQA PDAS AAEWF GRLP+DWF G TV VDREEI V+G+
Sbjct 1 MRNTQGPGPRRFGRPGGWQQADVPDASDAAEWFTGRLPDDWFTGPATVEVDREEIVVVGE 60
Query 97 LPGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTH 156
LP +ES + S A GR+SRFR+ TR +RM IADEA+ RYGRKV+WGV + G + LFTH
Sbjct 61 LPPVESADSGSEAAIDGRISRFRETTRGDRMRIADEAERRYGRKVAWGVSLDGHKTLFTH 120
Query 157 IAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDD 216
+AVPVMTRL+QPER+VLDTLVDAGVARSRSDAL W+V+L G+H+E+WLA+LR AM+ VD+
Sbjct 121 LAVPVMTRLRQPERKVLDTLVDAGVARSRSDALMWTVRLAGQHSEQWLAELREAMAKVDE 180
Query 217 LRAQGPDL 224
LRA GP +
Sbjct 181 LRADGPQI 188
>gi|120405396|ref|YP_955225.1| hypothetical protein Mvan_4443 [Mycobacterium vanbaalenii PYR-1]
gi|119958214|gb|ABM15219.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=188
Score = 254 bits (649), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 140/188 (75%), Positives = 156/188 (83%), Gaps = 6/188 (3%)
Query 41 HHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGL 100
HH HGRR RPGGWQQA QPDA+ AA+WFAGRLPE WF GDP VIVDREEITVIG+LP
Sbjct 3 HHPHGRRSSRPGGWQQADQPDAADAADWFAGRLPEGWFAGDPEVIVDREEITVIGRLPEA 62
Query 101 ESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGG----ERILFTH 156
E+PE E ARASGR +RFR++TR ERM IADEA+ RY RKV+WGVE+ ERILFTH
Sbjct 63 ENPESE--ARASGRAARFREQTRSERMQIADEAEARYRRKVAWGVEIHSDGETERILFTH 120
Query 157 IAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDD 216
+AVPVMTRL+QPER+VLDTLVDAGVARSRSDAL WSV+LVGEH +EWL KLR AM VDD
Sbjct 121 LAVPVMTRLRQPERRVLDTLVDAGVARSRSDALVWSVRLVGEHADEWLGKLREAMREVDD 180
Query 217 LRAQGPDL 224
LR+ GP L
Sbjct 181 LRSAGPQL 188
>gi|118469659|ref|YP_889261.1| hypothetical protein MSMEG_5010 [Mycobacterium smegmatis str.
MC2 155]
gi|118170946|gb|ABK71842.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=173
Score = 253 bits (647), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 132/172 (77%), Positives = 145/172 (85%), Gaps = 7/172 (4%)
Query 55 QQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASGR 114
Q + DAS AAEWFAGRLP+ WF GDPTVIVDREEITVIG LP +E ES ARASGR
Sbjct 3 HQHHRRDASEAAEWFAGRLPDGWFTGDPTVIVDREEITVIGTLPDVEG---ESEARASGR 59
Query 115 VSRFRDETRPERMTIADEAQNRYGRKVSWGVEV----GGERILFTHIAVPVMTRLKQPER 170
++FR+ETR ERM IADEAQ+R+GRKVSWGVEV ERI+FTHIAVPVMTRLKQPER
Sbjct 60 AAKFREETRGERMRIADEAQDRFGRKVSWGVEVRSGDKTERIMFTHIAVPVMTRLKQPER 119
Query 171 QVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGP 222
QVLDTLVDAGVARSRSDALAWSV+LVG+H +EWLAKLR AM VDDLRA+GP
Sbjct 120 QVLDTLVDAGVARSRSDALAWSVRLVGQHADEWLAKLREAMKTVDDLRAEGP 171
>gi|169628504|ref|YP_001702153.1| hypothetical protein MAB_1413 [Mycobacterium abscessus ATCC 19977]
gi|169240471|emb|CAM61499.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=172
Score = 248 bits (633), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 130/169 (77%), Positives = 142/169 (85%), Gaps = 3/169 (1%)
Query 58 QQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASGRVSR 117
QQ DAS AAEWFAGRLP+ WF G P VIVDREEITVIG+L E E+ES A A+GR SR
Sbjct 5 QQADASDAAEWFAGRLPDGWFSGAPEVIVDREEITVIGRLTAAEG-EKESEAHAAGRASR 63
Query 118 FRDETRPERMTIADEAQNRYGRKVSWGVEVGGE--RILFTHIAVPVMTRLKQPERQVLDT 175
FR+ETR RM IADEAQ RYGRKVSWGV+VG + RILFTHIAVPVMTRL+QPERQVLDT
Sbjct 64 FREETRAHRMRIADEAQARYGRKVSWGVDVGEDPDRILFTHIAVPVMTRLRQPERQVLDT 123
Query 176 LVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
LV+AGVARSRSDALAW+V+LVGEHTEEWL KLR AM+ V DLR QGP L
Sbjct 124 LVEAGVARSRSDALAWAVRLVGEHTEEWLDKLRAAMTEVRDLRTQGPRL 172
>gi|229494415|ref|ZP_04388178.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229318777|gb|EEN84635.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=184
Score = 248 bits (633), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 121/188 (65%), Positives = 151/188 (81%), Gaps = 5/188 (2%)
Query 38 MRHHHMHG-RRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGK 96
M++ G RR+GRPGGWQQA PDAS A++WFAGRLP+DWF G V VDREEI VIG+
Sbjct 1 MKNSQNSGDRRFGRPGGWQQASVPDASDASDWFAGRLPDDWFTGTADVQVDREEIVVIGE 60
Query 97 LPGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTH 156
LP ++ E A A GR+SRFR+ TR +RM IADEA+ RYGRKV+WGV + + +LFTH
Sbjct 61 LPAVDGGE----AAADGRISRFRETTRADRMRIADEAEARYGRKVAWGVRLDEKTVLFTH 116
Query 157 IAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDD 216
+AVPVMTRL+QPER+VLDTLVDAGVARSR+DAL W+V+L G+H+E+WL +LR AMS VDD
Sbjct 117 LAVPVMTRLRQPERKVLDTLVDAGVARSRADALMWTVRLAGKHSEQWLTELREAMSKVDD 176
Query 217 LRAQGPDL 224
LR++GP +
Sbjct 177 LRSEGPKI 184
>gi|226304554|ref|YP_002764512.1| hypothetical protein RER_10650 [Rhodococcus erythropolis PR4]
gi|226183669|dbj|BAH31773.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=194
Score = 248 bits (633), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 121/195 (63%), Positives = 153/195 (79%), Gaps = 5/195 (2%)
Query 31 KVCNHVIMRHHHMHG-RRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDRE 89
++ M++ G RR+GRPGGWQQA PDAS A++WFAGRLP+DWF G V VDRE
Sbjct 4 QLSEENAMKNSQSSGDRRFGRPGGWQQASVPDASDASDWFAGRLPDDWFTGTADVQVDRE 63
Query 90 EITVIGKLPGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGG 149
EI VIG+LP ++ E A A GR+SRFR+ TR +RM IADEA+ RYGRKV+WGV +
Sbjct 64 EIVVIGELPAVDGGE----AAADGRISRFREITRADRMRIADEAEARYGRKVAWGVRLDE 119
Query 150 ERILFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRT 209
+ +LFTH+AVPVMTRL+QPER+VLDTLVDAGVARSR+DAL W+V+L G+H+E+WL +LR
Sbjct 120 KTVLFTHLAVPVMTRLRQPERKVLDTLVDAGVARSRADALMWTVRLAGKHSEQWLTELRE 179
Query 210 AMSAVDDLRAQGPDL 224
AMS VDDLR++GP +
Sbjct 180 AMSKVDDLRSEGPKI 194
>gi|54027472|ref|YP_121714.1| hypothetical protein nfa54980 [Nocardia farcinica IFM 10152]
gi|54018980|dbj|BAD60350.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=198
Score = 241 bits (616), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 120/190 (64%), Positives = 149/190 (79%), Gaps = 11/190 (5%)
Query 44 HGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLP----- 98
HGR +GRPGGWQQA PD + A +WFAGRLP DWF G P + +DR+EI V+G+LP
Sbjct 6 HGRGFGRPGGWQQADLPDPADAPDWFAGRLPSDWFTGPPEIEIDRDEIVVVGELPLPRPE 65
Query 99 --GLESPEEESAA----RASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERI 152
G E+ E +AA G V+RFR+ TRP RM IA+EAQ+RYGR+V+WGV V GERI
Sbjct 66 SSGDEAGEATTAAVPEATKEGAVARFRESTRPARMQIANEAQHRYGRRVAWGVTVEGERI 125
Query 153 LFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMS 212
+FT ++VPVMTRL+QPER+VLDTLVDAGVARSRSDALAW+V+L G+H EEWLA+LR+AM
Sbjct 126 MFTQLSVPVMTRLRQPERKVLDTLVDAGVARSRSDALAWTVRLAGKHAEEWLAELRSAMR 185
Query 213 AVDDLRAQGP 222
V+DLR++GP
Sbjct 186 KVEDLRSEGP 195
>gi|296139931|ref|YP_003647174.1| hypothetical protein Tpau_2226 [Tsukamurella paurometabola DSM
20162]
gi|296028065|gb|ADG78835.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=176
Score = 224 bits (570), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 114/176 (65%), Positives = 136/176 (78%), Gaps = 3/176 (1%)
Query 49 GRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESA 108
GR G Q+ P A A +WF+GRLPE WFDG PTV VDR+EI V+G LP +ESA
Sbjct 4 GRGPGRQRGTTPGADDAGDWFSGRLPEAWFDGPPTVTVDRDEIVVVGDLP---VAADESA 60
Query 109 ARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQP 168
A+ GR +RFR++TR +RM IADEAQ RY R+VSWGV G IL+TH AVPVMTRL+QP
Sbjct 61 AQFEGRAARFREDTREQRMQIADEAQGRYQRRVSWGVRGAGATILYTHQAVPVMTRLRQP 120
Query 169 ERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
ER VLDTLVD+GVARSR+DALAW V+LVGEHT+EWLA+LR AM VDD+R +GP+L
Sbjct 121 ERLVLDTLVDSGVARSRADALAWCVRLVGEHTDEWLAQLREAMKNVDDVRERGPEL 176
>gi|336116164|ref|YP_004570930.1| hypothetical protein MLP_05130 [Microlunatus phosphovorus NM-1]
gi|334683942|dbj|BAK33527.1| hypothetical protein MLP_05130 [Microlunatus phosphovorus NM-1]
Length=249
Score = 219 bits (557), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/171 (67%), Positives = 128/171 (75%), Gaps = 5/171 (2%)
Query 54 WQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASG 113
WQ A P A A WF GRLPEDWF + +V DREEI V+G L G E +A+A G
Sbjct 84 WQNADLPAADDAQSWFEGRLPEDWF-SEVSVSTDREEIVVVGTLTG----ETTDSAQAEG 138
Query 114 RVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVL 173
R+SRFR+ETR RM IADEA+ RYGRKVSWG G LFTH++VPVMTRL+QPERQVL
Sbjct 139 RISRFREETRGTRMQIADEAEARYGRKVSWGARSGEVTALFTHLSVPVMTRLRQPERQVL 198
Query 174 DTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
DTLVDAGVARSRS+ALAW+V LVGEHTE WLA LR AM+ VD LRAQGP L
Sbjct 199 DTLVDAGVARSRSEALAWAVTLVGEHTESWLAGLRDAMAEVDKLRAQGPQL 249
>gi|84495896|ref|ZP_00994750.1| hypothetical protein JNB_00215 [Janibacter sp. HTCC2649]
gi|84382664|gb|EAP98545.1| hypothetical protein JNB_00215 [Janibacter sp. HTCC2649]
Length=177
Score = 216 bits (550), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 113/177 (64%), Positives = 133/177 (76%), Gaps = 2/177 (1%)
Query 49 GRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLP-GLESPEEES 107
GRPGGWQQA P+A A +WF GRLP+DWF V VDR+EITVIG L G E+
Sbjct 2 GRPGGWQQADVPNADDAQDWFNGRLPDDWFTA-VDVNVDRDEITVIGTLAEGNTDGGAEN 60
Query 108 AARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQ 167
+A GRVSR+R +T+ R+ IA EA+ RY RKVSWGV + LFTH+A PVMTRL+Q
Sbjct 61 SAVLEGRVSRWRSDTKERRIEIAREAEARYQRKVSWGVRLADGTALFTHLAAPVMTRLRQ 120
Query 168 PERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
PERQVLDTLVDAGVARSRSDALAWSV+LVG+H +EWL +LR AMS VD LR+QGP +
Sbjct 121 PERQVLDTLVDAGVARSRSDALAWSVRLVGQHADEWLGQLREAMSEVDKLRSQGPTI 177
>gi|302530107|ref|ZP_07282449.1| Smu12A [Streptomyces sp. AA4]
gi|302439002|gb|EFL10818.1| Smu12A [Streptomyces sp. AA4]
Length=188
Score = 215 bits (548), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 108/175 (62%), Positives = 135/175 (78%), Gaps = 3/175 (1%)
Query 53 GWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESP---EEESAA 109
GWQQA+ P A AA WF GRLP+ WF G+P + VDREEI V+G+LP L + AA
Sbjct 13 GWQQAEAPSADDAAAWFGGRLPDGWFTGEPEITVDREEILVVGELPALTGEYADDAARAA 72
Query 110 RASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPE 169
GR+SRFR+ETR ER+ IA +A++RY RKV+WG ++GG FT ++VPVMTRL+QPE
Sbjct 73 AEDGRISRFREETRDERIEIARQAEHRYQRKVAWGAKLGGTTAHFTTLSVPVMTRLRQPE 132
Query 170 RQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
R VLDTLVDAGVARSRS+ALAW+V+LVGEH + WL +LR AM+ VDDLR++GPDL
Sbjct 133 RLVLDTLVDAGVARSRSEALAWAVRLVGEHADAWLTELREAMTKVDDLRSKGPDL 187
>gi|325002196|ref|ZP_08123308.1| hypothetical protein PseP1_25691 [Pseudonocardia sp. P1]
Length=188
Score = 212 bits (539), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 107/176 (61%), Positives = 135/176 (77%), Gaps = 7/176 (3%)
Query 54 WQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLP-------GLESPEEE 106
WQQA P A AA WF+GRLP+ WF G P V DR+EI V+G+LP G + +
Sbjct 12 WQQADLPPADDAAAWFSGRLPDGWFTGAPEVTCDRDEIVVVGELPPLDGEQAGTDEGRAD 71
Query 107 SAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLK 166
+AA +GR++RFR+ETR ER+ IA +A++RY RKV+WG +G R LFT I+VPVMTRL+
Sbjct 72 TAAAEAGRIARFREETRDERIEIARQAESRYRRKVAWGARLGETRELFTTISVPVMTRLR 131
Query 167 QPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGP 222
QPER VLDTLVDAGVARSRSDALAWSV+LVG + +EWL++LR+AM+ VDDLRA+GP
Sbjct 132 QPERVVLDTLVDAGVARSRSDALAWSVRLVGRNADEWLSELRSAMARVDDLRAEGP 187
>gi|300789898|ref|YP_003770189.1| hypothetical protein AMED_8084 [Amycolatopsis mediterranei U32]
gi|299799412|gb|ADJ49787.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340531568|gb|AEK46773.1| hypothetical protein RAM_41530 [Amycolatopsis mediterranei S699]
Length=185
Score = 205 bits (521), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 108/173 (63%), Positives = 132/173 (77%), Gaps = 3/173 (1%)
Query 55 QQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARA--- 111
QQA+ P A AA WF GRLP+ WF G P V VDREEI V+G+LP L + AARA
Sbjct 12 QQAEVPPADDAAAWFGGRLPDGWFTGAPEVTVDREEIIVVGELPPLTEEHADDAARAAAE 71
Query 112 SGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQ 171
GR+SR+R+ETR ER+ IA +A++RY RKV+WG +GG LFT + PVMTRL+QPER
Sbjct 72 EGRISRYREETRDERIEIARQAEHRYQRKVAWGARLGGTTALFTTHSAPVMTRLRQPERL 131
Query 172 VLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
VLDTLVDAGVARSRSDALAW+V+LVG+H + WL +LR AM+ VDDLR++GPDL
Sbjct 132 VLDTLVDAGVARSRSDALAWAVRLVGQHADSWLGELREAMTKVDDLRSKGPDL 184
>gi|336176844|ref|YP_004582219.1| hypothetical protein FsymDg_0783 [Frankia symbiont of Datisca
glomerata]
gi|334857824|gb|AEH08298.1| hypothetical protein FsymDg_0783 [Frankia symbiont of Datisca
glomerata]
Length=186
Score = 203 bits (516), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 106/184 (58%), Positives = 134/184 (73%), Gaps = 5/184 (2%)
Query 44 HGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESP 103
GR +GR G ++ P A A WFAGRLP+ WF G TV VDR+EI V+G +P E
Sbjct 5 RGRGFGRWG--EREPVPPADDAPAWFAGRLPDTWFTGAATVTVDRDEIVVVGTIPAAELE 62
Query 104 EEESAARA---SGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVP 160
++ AA A SGR+ RFR++TR R+ IA EA+ RYGRKV+WG GG LFT ++VP
Sbjct 63 TDDPAAVAAAESGRIRRFREQTRDARIAIAQEAERRYGRKVAWGATAGGTTELFTTLSVP 122
Query 161 VMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQ 220
VMTRL+QPERQVLDTLVDAGVARSRSDALAW V+LVG++ + WLA+LR A+ V+ +RA+
Sbjct 123 VMTRLRQPERQVLDTLVDAGVARSRSDALAWCVRLVGDNAQPWLARLREALEKVEQVRAE 182
Query 221 GPDL 224
GP+L
Sbjct 183 GPEL 186
>gi|331694813|ref|YP_004331052.1| hypothetical protein Psed_0947 [Pseudonocardia dioxanivorans
CB1190]
gi|326949502|gb|AEA23199.1| hypothetical protein Psed_0947 [Pseudonocardia dioxanivorans
CB1190]
Length=203
Score = 201 bits (512), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 104/175 (60%), Positives = 129/175 (74%), Gaps = 7/175 (4%)
Query 55 QQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARA--- 111
QQA P AA W +GRLP+ WF G+P V VDREEI ++G+LP L+ ++ A
Sbjct 24 QQADLPSTDDAAAWLSGRLPDGWFVGNPDVTVDREEIVIVGELPPLDGEFADTEAGRAER 83
Query 112 ----SGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQ 167
SGR+SRFR++TR ER+ IA +A++RY RKV+WG +G LFT +VPVMTRL+Q
Sbjct 84 AAAISGRISRFREQTRDERIDIARQAEHRYQRKVAWGARIGDVVELFTTASVPVMTRLRQ 143
Query 168 PERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGP 222
PER VLDTLVD+GVARSRSDALAW+V+LVGEH EEWL LR AM+ VD+LRAQGP
Sbjct 144 PERIVLDTLVDSGVARSRSDALAWAVRLVGEHAEEWLGDLRDAMAKVDELRAQGP 198
>gi|291298257|ref|YP_003509535.1| hypothetical protein Snas_0730 [Stackebrandtia nassauensis DSM
44728]
gi|290567477|gb|ADD40442.1| hypothetical protein Snas_0730 [Stackebrandtia nassauensis DSM
44728]
Length=192
Score = 201 bits (510), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 100/180 (56%), Positives = 128/180 (72%), Gaps = 10/180 (5%)
Query 52 GGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEES---- 107
W+ A P A AA W AGR+P+DWF P V DR+E+ +IG L E PE ES
Sbjct 15 ASWRYADVPQADDAASWIAGRVPDDWFVSLPEVSADRDELIIIGHL---EEPEHESDATE 71
Query 108 ---AARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTR 164
AA GR+SRFR+ TR +R+ I+ + Q+RY R+ SWG G RI+FTH +PVM+R
Sbjct 72 ADRAAAEEGRISRFREATRDQRVQISQQIQHRYQRRASWGAACGSTRIVFTHQTIPVMSR 131
Query 165 LKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
L+QPER+VLDTLVD+GVARSRSDALAW V+LVGEH + WL +LR+AMS+VD+LR +GPD+
Sbjct 132 LRQPERRVLDTLVDSGVARSRSDALAWCVRLVGEHADSWLMELRSAMSSVDELRRKGPDI 191
>gi|258654535|ref|YP_003203691.1| hypothetical protein Namu_4415 [Nakamurella multipartita DSM
44233]
gi|258557760|gb|ACV80702.1| hypothetical protein Namu_4415 [Nakamurella multipartita DSM
44233]
Length=183
Score = 196 bits (497), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 100/168 (60%), Positives = 126/168 (75%), Gaps = 3/168 (1%)
Query 60 PDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESP---EEESAARASGRVS 116
PDAS A WFAGRLP DWF P V +DR+EI V+G+LP L + AA +GR+S
Sbjct 15 PDASDALGWFAGRLPGDWFTAAPQVRIDRDEIIVVGQLPDLTEEFADDAARAAAEAGRIS 74
Query 117 RFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTL 176
RFR++TR +R+ IA +AQ+RYGR++SWG +G LFT ++ PVMTRL+QPERQVLDTL
Sbjct 75 RFREDTREQRIEIARQAQHRYGREISWGARLGDTEELFTTLSAPVMTRLRQPERQVLDTL 134
Query 177 VDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
VDAGVARSRSDALAW+V+LVG++ + WL +LR AM V+ LR GPDL
Sbjct 135 VDAGVARSRSDALAWAVRLVGQNADNWLGELRQAMEQVNRLREAGPDL 182
>gi|86742578|ref|YP_482978.1| hypothetical protein Francci3_3899 [Frankia sp. CcI3]
gi|86569440|gb|ABD13249.1| conserved hypothetical protein [Frankia sp. CcI3]
Length=183
Score = 193 bits (491), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 100/181 (56%), Positives = 127/181 (71%), Gaps = 2/181 (1%)
Query 44 HGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESP 103
GR YGRPG ++A+ A AA WFA +P WF G PTV+VDR+EITVIG L
Sbjct 5 RGRGYGRPG--ERAEVVPADDAAAWFAEHIPAGWFAGMPTVVVDRDEITVIGVLAAGGVA 62
Query 104 EEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMT 163
+ R+ RFR++TR ER+ IA +A++RY RKV+WG VG LFT ++VPVMT
Sbjct 63 PGGGPSATGARIRRFREQTREERIAIATKAEHRYNRKVAWGATVGDTTELFTALSVPVMT 122
Query 164 RLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPD 223
RL+QPERQVLDTLVD+GVARSRS+ALAW V+LVGE+ + WL +LR AM V+ +R+ GP
Sbjct 123 RLRQPERQVLDTLVDSGVARSRSEALAWCVRLVGENADSWLRQLREAMEQVEKIRSVGPR 182
Query 224 L 224
L
Sbjct 183 L 183
>gi|302556024|ref|ZP_07308366.1| conserved hypothetical protein [Streptomyces viridochromogenes
DSM 40736]
gi|302473642|gb|EFL36735.1| conserved hypothetical protein [Streptomyces viridochromogenes
DSM 40736]
Length=185
Score = 189 bits (481), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 98/160 (62%), Positives = 120/160 (75%), Gaps = 4/160 (2%)
Query 68 WFAGRLPEDWFDGDPTVIVDREEITVIGKL--PGL--ESPEEESAARASGRVSRFRDETR 123
WFAGRLP+D F+ V VDREEITVIG++ P L ++P E A RV FR+ TR
Sbjct 13 WFAGRLPDDLFEELTEVTVDREEITVIGRIAEPRLTEDAPAAEREAALESRVQEFRERTR 72
Query 124 PERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVAR 183
+RM +A EA++R+ RKVSWGVE GG R LFTH+A PVMTRL+QPERQVLDTL+ GVAR
Sbjct 73 EDRMAVAREAEHRFRRKVSWGVECGGRRALFTHVAAPVMTRLRQPERQVLDTLIAGGVAR 132
Query 184 SRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPD 223
SRSDALAW V+LV HT++WLA+LR ++ V +RAQGPD
Sbjct 133 SRSDALAWCVRLVQRHTDDWLAELRDSLEHVQRVRAQGPD 172
>gi|297204297|ref|ZP_06921694.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|297148630|gb|EDY59882.2| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length=187
Score = 189 bits (479), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 94/160 (59%), Positives = 120/160 (75%), Gaps = 4/160 (2%)
Query 68 WFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAAR----ASGRVSRFRDETR 123
WF GRLP+D F+ V+VDREEITVIG++PG E+ SAA GR+ FR+ TR
Sbjct 13 WFTGRLPDDLFEALVEVVVDREEITVIGRIPGPRLTEDVSAAEREAAVQGRIQEFRERTR 72
Query 124 PERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVAR 183
+R+ +A EA++++ RKVSWGVE GER LFTH+A PVMTRL+QPERQVLDTL+ GVAR
Sbjct 73 EDRVEVAREAEHKFRRKVSWGVECDGERALFTHVAAPVMTRLRQPERQVLDTLIAGGVAR 132
Query 184 SRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPD 223
SRSDALAW V+LV HT++WLA+LR ++ V +R +GPD
Sbjct 133 SRSDALAWCVRLVQRHTDDWLAELRESLEHVQRVRERGPD 172
>gi|256389919|ref|YP_003111483.1| hypothetical protein Caci_0707 [Catenulispora acidiphila DSM
44928]
gi|256356145|gb|ACU69642.1| conserved hypothetical protein [Catenulispora acidiphila DSM
44928]
Length=187
Score = 188 bits (477), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 94/160 (59%), Positives = 117/160 (74%), Gaps = 4/160 (2%)
Query 68 WFAGRLPEDWFDGDPTVIVDREEITVIGKLP----GLESPEEESAARASGRVSRFRDETR 123
WF GRLPED FDG P V VDREEITVIG++P + E E AA GR+ FR+ TR
Sbjct 14 WFTGRLPEDLFDGAPEVTVDREEITVIGRIPEPAYAEGASEAEKAATIEGRIKEFRERTR 73
Query 124 PERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVAR 183
RM +A +A++++GRKVSWGV G + LFTH++VPVMTRLKQPER +LDTLVD GVAR
Sbjct 74 DARMAVARDAEHKFGRKVSWGVRCGDQGRLFTHLSVPVMTRLKQPERALLDTLVDGGVAR 133
Query 184 SRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPD 223
SRS+ALAW V+LV +H +W+ LR AM V+++R GPD
Sbjct 134 SRSEALAWCVRLVEKHAGDWVGDLRDAMRKVEEVREAGPD 173
>gi|134097519|ref|YP_001103180.1| hypothetical protein SACE_0921 [Saccharopolyspora erythraea NRRL
2338]
gi|291009340|ref|ZP_06567313.1| hypothetical protein SeryN2_32875 [Saccharopolyspora erythraea
NRRL 2338]
gi|133910142|emb|CAM00255.1| hypothetical protein SACE_0921 [Saccharopolyspora erythraea NRRL
2338]
Length=204
Score = 187 bits (474), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 116/192 (61%), Positives = 140/192 (73%), Gaps = 11/192 (5%)
Query 44 HGRRYGRPGGWQQA--------QQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIG 95
GRR+G W + P A A WF GRLPE WF P V VDREEI V+G
Sbjct 12 RGRRWGGQPPWTGGGRGGRRPPEAPSADDAKAWFTGRLPEGWFTTAPEVSVDREEILVVG 71
Query 96 KLPGLESPEEESAARA---SGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERI 152
+LP L+ + AARA SGR++RFR+ETR ER+ IA +A++RYGRKVSWG +GG
Sbjct 72 ELPALDEDFADDAARAAAESGRIARFREETREERIEIARQAEHRYGRKVSWGARLGGAEE 131
Query 153 LFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMS 212
LFT ++ PVMTRL+QPER VLDTLVDAG+ARSRS+ALAW+V+LVGEHTEEWLA+LR A++
Sbjct 132 LFTTLSAPVMTRLRQPERIVLDTLVDAGIARSRSEALAWAVRLVGEHTEEWLAELREALA 191
Query 213 AVDDLRAQGPDL 224
VDDLRAQGPDL
Sbjct 192 KVDDLRAQGPDL 203
>gi|302562328|ref|ZP_07314670.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
gi|302479946|gb|EFL43039.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
Length=175
Score = 186 bits (472), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 96/161 (60%), Positives = 119/161 (74%), Gaps = 6/161 (3%)
Query 68 WFAGRLPEDWFDGDPTVIVDREEITVIGKLPGL-----ESPEEESAARASGRVSRFRDET 122
WF GRLP+D F+ V VDREEITVIG++PG SP E AA GRV FR+ T
Sbjct 4 WFTGRLPDDLFEELAEVTVDREEITVIGRIPGPLPVADASPAEREAA-VEGRVQEFRERT 62
Query 123 RPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVA 182
R +R+ +A +A++R+GRKVSWGVE G R LFTH+A PVMTRL+QPERQVLDTL+ GVA
Sbjct 63 RDDRIAVARDAEHRFGRKVSWGVECDGRRALFTHVAAPVMTRLRQPERQVLDTLLAGGVA 122
Query 183 RSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPD 223
RSRS+ALAW V+LV HT++WL +LR ++ V +RAQGPD
Sbjct 123 RSRSEALAWCVRLVQRHTDDWLTELRESLQHVQRVRAQGPD 163
>gi|158312687|ref|YP_001505195.1| hypothetical protein Franean1_0831 [Frankia sp. EAN1pec]
gi|158108092|gb|ABW10289.1| conserved hypothetical protein [Frankia sp. EAN1pec]
Length=176
Score = 184 bits (468), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 95/163 (59%), Positives = 119/163 (74%), Gaps = 7/163 (4%)
Query 62 ASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRDE 121
A A WFA R+P+ WF G P V VDR+EITV+G LP S A GR+ FR+E
Sbjct 21 ADDVAAWFARRVPDGWFTGPPKVTVDRDEITVVGALP-------PSDADRVGRIRGFREE 73
Query 122 TRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGV 181
TR R+ +A EA+ RYGR VSWGV +GG LFT ++VPVMTRL+QPERQVLDTLVD+GV
Sbjct 74 TRDARIGMAREAELRYGRSVSWGVSLGGSTELFTTVSVPVMTRLRQPERQVLDTLVDSGV 133
Query 182 ARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
ARSRS+ALAW V+LVGE+ + WLA+LR AM V+ +R++GP +
Sbjct 134 ARSRSEALAWCVRLVGENADTWLARLREAMEKVEQIRSEGPGI 176
>gi|284989524|ref|YP_003408078.1| hypothetical protein Gobs_0946 [Geodermatophilus obscurus DSM
43160]
gi|284062769|gb|ADB73707.1| conserved hypothetical protein [Geodermatophilus obscurus DSM
43160]
Length=207
Score = 184 bits (466), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 96/159 (61%), Positives = 129/159 (82%), Gaps = 0/159 (0%)
Query 66 AEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRDETRPE 125
A WFAGRLP+DWF G + VDR+EITV+G L E+ E ++AA +GR++RFR+ETR +
Sbjct 48 AGWFAGRLPDDWFTGPVELTVDRDEITVVGTLAEPEAGEGDAAAARAGRIARFREETREQ 107
Query 126 RMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVARSR 185
RM +AD AQ RYGR V+WG G R +FT+++VPVMTRL+QPER VLDTLVDAGVARSR
Sbjct 108 RMAVADAAQARYGRSVAWGAACGDVREVFTNLSVPVMTRLRQPERLVLDTLVDAGVARSR 167
Query 186 SDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
S+ALAW+V+LV +HT++WL +LR AM++V+++RA+GP +
Sbjct 168 SEALAWAVRLVAQHTDDWLGELRAAMASVEEVRARGPQV 206
>gi|111225551|ref|YP_716345.1| hypothetical protein FRAAL6207 [Frankia alni ACN14a]
gi|111153083|emb|CAJ64830.1| Conserved hypothetical protein [Frankia alni ACN14a]
Length=202
Score = 182 bits (463), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 99/198 (50%), Positives = 131/198 (67%), Gaps = 21/198 (10%)
Query 44 HGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLP----G 99
GR YGRPG +A+ A A +WFA +P WF G P V+VDR+EITV+G+LP
Sbjct 5 RGRGYGRPG--DRAESVPAQDAPDWFAAHVPAGWFTGPPAVVVDRDEITVVGELPFEQPP 62
Query 100 LESPEEESAA---------------RASGRVSRFRDETRPERMTIADEAQNRYGRKVSWG 144
+E P + AA A+ + FR+ TR ER+ IA EA+ RYGRKV+WG
Sbjct 63 VEKPRADQAAASHPGGVAPSNGPDLEAAALIRGFRERTRDERIAIAAEAERRYGRKVAWG 122
Query 145 VEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWL 204
G LFT ++VPVMTRL+QPERQVLDTLV++GVARSRS+ALAW V+LVG++++ WL
Sbjct 123 ATAGAASELFTVLSVPVMTRLRQPERQVLDTLVESGVARSRSEALAWCVRLVGDNSDTWL 182
Query 205 AKLRTAMSAVDDLRAQGP 222
+LR A+ V+ +RA+GP
Sbjct 183 RQLREALEQVEKIRAEGP 200
>gi|291435451|ref|ZP_06574841.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC
14672]
gi|291338346|gb|EFE65302.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC
14672]
Length=188
Score = 179 bits (454), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 92/160 (58%), Positives = 117/160 (74%), Gaps = 6/160 (3%)
Query 68 WFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLE-----SPEEESAARASGRVSRFRDET 122
WFAGRLP+D F+ V VDREEITVIG++ G SP E AA GR+ FR+ T
Sbjct 13 WFAGRLPDDLFEELVEVTVDREEITVIGRITGPRPAEDASPAEREAA-VEGRIQEFRERT 71
Query 123 RPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVA 182
R R+++A +A++R+GRKVSWGVE G R LFTH+A PVMTRL+QPERQVLDTL+ GVA
Sbjct 72 RDARISVARDAEHRFGRKVSWGVECDGRRALFTHVAAPVMTRLRQPERQVLDTLIAGGVA 131
Query 183 RSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGP 222
RSRS+ALAW V+LV HT++WL +LR ++ V +R +GP
Sbjct 132 RSRSEALAWCVRLVQRHTDDWLTELRESLEHVQRVRDRGP 171
>gi|343926262|ref|ZP_08765771.1| hypothetical protein GOALK_056_01300 [Gordonia alkanivorans NBRC
16433]
gi|343763891|dbj|GAA12697.1| hypothetical protein GOALK_056_01300 [Gordonia alkanivorans NBRC
16433]
Length=133
Score = 177 bits (450), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 87/107 (82%), Positives = 96/107 (90%), Gaps = 0/107 (0%)
Query 118 FRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLV 177
FR++TR ERM+IADEAQ RY RKVSWGV VG ERILFTH+AVPVMTRL+QPER+VLDTLV
Sbjct 27 FREDTRAERMSIADEAQARYARKVSWGVAVGDERILFTHLAVPVMTRLRQPERRVLDTLV 86
Query 178 DAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
DAGVARSR+DALAWSVKLVG HTEEWL KLR AM VD+LRA+GP L
Sbjct 87 DAGVARSRADALAWSVKLVGAHTEEWLGKLRAAMDEVDNLRAEGPGL 133
>gi|284029165|ref|YP_003379096.1| hypothetical protein Kfla_1194 [Kribbella flavida DSM 17836]
gi|283808458|gb|ADB30297.1| hypothetical protein Kfla_1194 [Kribbella flavida DSM 17836]
Length=175
Score = 177 bits (450), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 89/163 (55%), Positives = 117/163 (72%), Gaps = 10/163 (6%)
Query 68 WFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEES-------AARASGRVSRFRD 120
W AGRLP++WFDG+P V +DR+EI V+G +P +P++E +A GR+ +FR+
Sbjct 13 WLAGRLPQEWFDGEPEVSIDRDEILVVGTIP---APQQEGEVSAAARSAAEEGRIKQFRE 69
Query 121 ETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAG 180
+TR R+ IA E ++ RKV+WGV G R +FT ++ PVMTRL+QPERQVLDTLVDAG
Sbjct 70 DTRERRIEIARELEHATRRKVAWGVRCGETRTVFTSLSAPVMTRLRQPERQVLDTLVDAG 129
Query 181 VARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPD 223
VARSRSDAL W VKLV +H+E WL LR AM V+++R GPD
Sbjct 130 VARSRSDALGWCVKLVAQHSETWLKDLREAMEKVENVRRAGPD 172
>gi|326331376|ref|ZP_08197666.1| hypothetical protein NBCG_02814 [Nocardioidaceae bacterium Broad-1]
gi|325950632|gb|EGD42682.1| hypothetical protein NBCG_02814 [Nocardioidaceae bacterium Broad-1]
Length=173
Score = 177 bits (449), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 89/159 (56%), Positives = 118/159 (75%), Gaps = 4/159 (2%)
Query 68 WFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPE----EESAARASGRVSRFRDETR 123
WF+GRLP+ +D V+VDREEITV+G++ ++ E EE AA GR S FR+ TR
Sbjct 9 WFSGRLPDGVYDEIVDVVVDREEITVVGRIKEPKTAEGASDEERAAANEGRASEFRERTR 68
Query 124 PERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVAR 183
+RM +A +A+ R+ RKVSWGVEVGGER +FT ++ PVMTRL+QPER+VLD LV GVAR
Sbjct 69 EDRMAVARQAERRFDRKVSWGVEVGGERHMFTTVSAPVMTRLRQPERKVLDLLVSGGVAR 128
Query 184 SRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGP 222
SRSDAL W VKLV +H+ EWL +L+ ++ V+ +RA+GP
Sbjct 129 SRSDALGWCVKLVQQHSSEWLDELQESLVNVERVRAKGP 167
>gi|254383553|ref|ZP_04998903.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194342448|gb|EDX23414.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length=170
Score = 177 bits (448), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 90/157 (58%), Positives = 112/157 (72%), Gaps = 12/157 (7%)
Query 68 WFAGRLPEDWFDGDPTVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRDETRPERM 127
WFA RLP D F+ +V VDREEITV+G +P ES +E FR+ TR +R+
Sbjct 13 WFAERLPVDVFESLVSVTVDREEITVVGAIPATESVKE------------FRERTREQRI 60
Query 128 TIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSD 187
+A EA+ Y RKV+WGV+ G ER LFTH+AVPVMTRL+Q ERQVLDTLV GVARSR+D
Sbjct 61 EVAREAEELYRRKVAWGVQAGTERHLFTHLAVPVMTRLRQSERQVLDTLVAGGVARSRAD 120
Query 188 ALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
ALAW V+LVG +T+EWL LR ++ V +RAQGPD+
Sbjct 121 ALAWCVRLVGSNTDEWLGDLRESLDKVQQVRAQGPDV 157
>gi|271969720|ref|YP_003343916.1| hypothetical protein Sros_8531 [Streptosporangium roseum DSM
43021]
gi|270512895|gb|ACZ91173.1| hypothetical protein Sros_8531 [Streptosporangium roseum DSM
43021]
Length=175
Score = 176 bits (446), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 97/163 (60%), Positives = 124/163 (77%), Gaps = 4/163 (2%)
Query 68 WFAGRLPEDWFDGDPTVIVDREEITVIGKLP----GLESPEEESAARASGRVSRFRDETR 123
WF+GRLPE+WF+G P +++DREEI V+G+L G + E E AA G V RFR+ETR
Sbjct 13 WFSGRLPEEWFEGPPEIVLDREEIAVVGRLQAPALGDDVSEVERAAAVEGGVQRFREETR 72
Query 124 PERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQVLDTLVDAGVAR 183
R+ IA EA++R+ RKVSWGV VG E ++FT ++VPVMTRL+Q ER+VLDTLV AGVAR
Sbjct 73 ERRIEIALEAEHRFRRKVSWGVAVGDETVMFTTLSVPVMTRLRQSERRVLDTLVAAGVAR 132
Query 184 SRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDLPA 226
SRSDALAW V+LVG++T+ WL LR A+ VD +RA GPDL +
Sbjct 133 SRSDALAWCVRLVGKNTDTWLTDLRDALQHVDRVRASGPDLSS 175
>gi|221635719|ref|YP_002523595.1| hypothetical protein trd_A0313 [Thermomicrobium roseum DSM 5159]
gi|221157938|gb|ACM07056.1| conserved hypothetical protein [Thermomicrobium roseum DSM 5159]
Length=174
Score = 176 bits (446), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 93/172 (55%), Positives = 119/172 (70%), Gaps = 4/172 (2%)
Query 57 AQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKL--PGLE--SPEEESAARAS 112
++ P A+ WF GRLPE WF P + DR+EI V+G+L P L + EEE A +
Sbjct 2 SRDPFAAELHAWFLGRLPEGWFVEPPEIAYDRDEILVVGRLAEPSLPEGASEEERQAACA 61
Query 113 GRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVPVMTRLKQPERQV 172
R+ RFR+ETR +RM IA EA+ R+GRKVSWG G R FT ++VPVMTRL+ ERQV
Sbjct 62 ARIQRFREETRAQRMRIASEAEYRFGRKVSWGAACGPIRQSFTVLSVPVMTRLRLAERQV 121
Query 173 LDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDL 224
LDTLV+AG+ARSRS+ALAW V+LVG H +WL +LR A+ V +LR GPD+
Sbjct 122 LDTLVEAGIARSRSEALAWCVRLVGRHQADWLEELRRALVRVQELRQAGPDV 173
Lambda K H
0.319 0.133 0.412
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 287912302678
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40