BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3912
Length=254
Sequences producing significant alignments: Score (Bits) E Value
gi|15611048|ref|NP_218429.1| hypothetical protein Rv3912 [Mycoba... 481 4e-134
gi|289445513|ref|ZP_06435257.1| hypothetical alanine rich protei... 480 7e-134
gi|306778276|ref|ZP_07416613.1| hypothetical alanine rich protei... 478 3e-133
gi|289747756|ref|ZP_06507134.1| hypothetical alanine rich protei... 446 1e-123
gi|340628882|ref|YP_004747334.1| hypothetical protein MCAN_39321... 416 1e-114
gi|240168392|ref|ZP_04747051.1| hypothetical protein MkanA1_0371... 294 1e-77
gi|342862340|ref|ZP_08718981.1| hypothetical protein MCOL_25738 ... 263 2e-68
gi|183985446|ref|YP_001853737.1| hypothetical protein MMAR_5476 ... 248 8e-64
gi|118620067|ref|YP_908399.1| hypothetical protein MUL_5065 [Myc... 247 9e-64
gi|296167155|ref|ZP_06849562.1| conserved hypothetical protein [... 225 6e-57
gi|254777649|ref|ZP_05219165.1| hypothetical protein MaviaA2_236... 211 1e-52
gi|118463922|ref|YP_884410.1| hypothetical protein MAV_5300 [Myc... 211 1e-52
gi|41410436|ref|NP_963272.1| hypothetical protein MAP4338 [Mycob... 202 5e-50
gi|254818788|ref|ZP_05223789.1| hypothetical protein MintA_02629... 178 5e-43
gi|333992976|ref|YP_004525590.1| hypothetical protein JDM601_433... 108 8e-22
gi|120406995|ref|YP_956824.1| hypothetical protein Mvan_6066 [My... 107 2e-21
gi|169632012|ref|YP_001705661.1| hypothetical protein MAB_4939 [... 105 7e-21
gi|145221434|ref|YP_001132112.1| hypothetical protein Mflv_0840 ... 103 3e-20
gi|118470542|ref|YP_891126.1| hypothetical protein MSMEG_6932 [M... 100 2e-19
gi|315446814|ref|YP_004079693.1| hypothetical protein Mspyr1_533... 95.5 7e-18
gi|111020635|ref|YP_703607.1| hypothetical protein RHA1_ro03646 ... 74.3 2e-11
gi|333922224|ref|YP_004495805.1| hypothetical protein AS9A_4573 ... 68.9 6e-10
gi|312142011|ref|YP_004009347.1| membrane protein [Rhodococcus e... 66.6 4e-09
gi|325677541|ref|ZP_08157205.1| hypothetical protein HMPREF0724_... 66.6 4e-09
gi|262204650|ref|YP_003275858.1| hypothetical protein Gbro_4852 ... 63.5 3e-08
gi|229491169|ref|ZP_04384997.1| conserved hypothetical protein [... 63.2 3e-08
gi|254823060|ref|ZP_05228061.1| hypothetical protein MintA_24240... 60.1 3e-07
gi|226309502|ref|YP_002769464.1| hypothetical protein RER_60170 ... 58.2 1e-06
gi|326383896|ref|ZP_08205580.1| hypothetical protein SCNU_13223 ... 52.8 5e-05
gi|226362878|ref|YP_002780658.1| hypothetical protein ROP_34660 ... 52.0 8e-05
gi|300791151|ref|YP_003771442.1| hypothetical protein AMED_9352 ... 50.1 3e-04
gi|302531343|ref|ZP_07283685.1| predicted protein [Streptomyces ... 49.7 4e-04
gi|257057902|ref|YP_003135734.1| hypothetical protein Svir_39660... 47.8 0.002
gi|54027635|ref|YP_121877.1| hypothetical protein nfa56610 [Noca... 45.1 0.011
gi|134103803|ref|YP_001109464.1| hypothetical protein SACE_7383 ... 44.7 0.012
gi|256381061|ref|YP_003104721.1| hypothetical protein Amir_7085 ... 43.5 0.028
gi|331700379|ref|YP_004336618.1| hypothetical protein Psed_6677 ... 41.6 0.13
gi|317509438|ref|ZP_07967056.1| hypothetical protein HMPREF9336_... 37.7 1.6
gi|296392453|ref|YP_003657337.1| hypothetical protein Srot_0013 ... 35.4 8.1
>gi|15611048|ref|NP_218429.1| hypothetical protein Rv3912 [Mycobacterium tuberculosis H37Rv]
gi|15843545|ref|NP_338582.1| hypothetical protein MT4031 [Mycobacterium tuberculosis CDC1551]
gi|31795085|ref|NP_857578.1| hypothetical protein Mb3943 [Mycobacterium bovis AF2122/97]
(70 more sequence titles not shown)
Length=254
Score = 481 bits (1238), Expect = 4e-134, Method: Compositional matrix adjust.
Identities = 254/254 (100%), Positives = 254/254 (100%), Gaps = 0/254 (0%)
Query 1 MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRV 60
MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRV
Sbjct 1 MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRV 60
Query 61 RRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL 120
RRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL
Sbjct 61 RRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL 120
Query 121 CAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGG 180
CAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGG
Sbjct 121 CAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGG 180
Query 181 PLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSA 240
PLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSA
Sbjct 181 PLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSA 240
Query 241 ADTGLLASTVVPRA 254
ADTGLLASTVVPRA
Sbjct 241 ADTGLLASTVVPRA 254
>gi|289445513|ref|ZP_06435257.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
CPHL_A]
gi|289418471|gb|EFD15672.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
CPHL_A]
Length=254
Score = 480 bits (1236), Expect = 7e-134, Method: Compositional matrix adjust.
Identities = 253/254 (99%), Positives = 254/254 (100%), Gaps = 0/254 (0%)
Query 1 MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRV 60
MSAADKDPDKHSADADPPLTVELLA+LQAGLLDDATAARIRSRVRSDPQAQQILRALNRV
Sbjct 1 MSAADKDPDKHSADADPPLTVELLAELQAGLLDDATAARIRSRVRSDPQAQQILRALNRV 60
Query 61 RRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL 120
RRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL
Sbjct 61 RRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL 120
Query 121 CAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGG 180
CAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGG
Sbjct 121 CAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGG 180
Query 181 PLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSA 240
PLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSA
Sbjct 181 PLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSA 240
Query 241 ADTGLLASTVVPRA 254
ADTGLLASTVVPRA
Sbjct 241 ADTGLLASTVVPRA 254
>gi|306778276|ref|ZP_07416613.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
SUMu001]
gi|306974395|ref|ZP_07487056.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
SUMu010]
gi|307082103|ref|ZP_07491273.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
SUMu011]
gi|308213426|gb|EFO72825.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
SUMu001]
gi|308356290|gb|EFP45141.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
SUMu010]
gi|308360177|gb|EFP49028.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
SUMu011]
Length=254
Score = 478 bits (1230), Expect = 3e-133, Method: Compositional matrix adjust.
Identities = 253/254 (99%), Positives = 253/254 (99%), Gaps = 0/254 (0%)
Query 1 MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRV 60
MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRV
Sbjct 1 MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRV 60
Query 61 RRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL 120
RRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL
Sbjct 61 RRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL 120
Query 121 CAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGG 180
CAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGP GG
Sbjct 121 CAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPRGG 180
Query 181 PLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSA 240
PLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSA
Sbjct 181 PLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSA 240
Query 241 ADTGLLASTVVPRA 254
ADTGLLASTVVPRA
Sbjct 241 ADTGLLASTVVPRA 254
>gi|289747756|ref|ZP_06507134.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
02_1987]
gi|289688284|gb|EFD55772.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
02_1987]
Length=236
Score = 446 bits (1147), Expect = 1e-123, Method: Compositional matrix adjust.
Identities = 235/236 (99%), Positives = 236/236 (100%), Gaps = 0/236 (0%)
Query 19 LTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGADPAWGPAAR 78
+TVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGADPAWGPAAR
Sbjct 1 MTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGADPAWGPAAR 60
Query 79 PAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGLCAVATAIGVGAVVDAPPP 138
PAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGLCAVATAIGVGAVVDAPPP
Sbjct 61 PAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGLCAVATAIGVGAVVDAPPP 120
Query 139 APSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSRRTSCLSGLGYP 198
APSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSRRTSCLSGLGYP
Sbjct 121 APSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSRRTSCLSGLGYP 180
Query 199 ASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLASTVVPRA 254
ASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLASTVVPRA
Sbjct 181 ASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLASTVVPRA 236
>gi|340628882|ref|YP_004747334.1| hypothetical protein MCAN_39321 [Mycobacterium canettii CIPT
140010059]
gi|340007072|emb|CCC46263.1| hypothetical alanine rich protein [Mycobacterium canettii CIPT
140010059]
Length=256
Score = 416 bits (1070), Expect = 1e-114, Method: Compositional matrix adjust.
Identities = 252/256 (99%), Positives = 253/256 (99%), Gaps = 2/256 (0%)
Query 1 MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRV 60
MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRV
Sbjct 1 MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRV 60
Query 61 RRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL 120
RRDVAAMGADPAWGPAARPAVVD ISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL
Sbjct 61 RRDVAAMGADPAWGPAARPAVVDRISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGL 120
Query 121 CAVATAIGVGAV--VDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPP 178
CAVATAIGVGAV +DAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPP
Sbjct 121 CAVATAIGVGAVALIDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPP 180
Query 179 GGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHC 238
GGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHC
Sbjct 181 GGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHC 240
Query 239 SAADTGLLASTVVPRA 254
SAADTGLLASTVVPRA
Sbjct 241 SAADTGLLASTVVPRA 256
>gi|240168392|ref|ZP_04747051.1| hypothetical protein MkanA1_03717 [Mycobacterium kansasii ATCC
12478]
Length=253
Score = 294 bits (752), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 165/247 (67%), Positives = 184/247 (75%), Gaps = 3/247 (1%)
Query 10 KHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGA 69
+H ADA+PPLTVELLADLQAGLLDD AA IR RVR+DPQA ILRAL RVRRD+A GA
Sbjct 5 EHEADANPPLTVELLADLQAGLLDDEAAAGIRRRVRTDPQAAAILRALQRVRRDLADAGA 64
Query 70 DPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGLCAVATAIGV 129
DPA P V IS LRS S AAHAA+P + P +++AG AG CA AIGV
Sbjct 65 DPASAPDPPADVTARISGTLRSTASGES-SAAHAAQPRIRPGKILAGIAGFCAAVAAIGV 123
Query 130 G--AVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSR 187
G A+V AP PAPSAPTTA HITVS P PVIPLS Q+L+LL TPDYGP L DPSR
Sbjct 124 GTAALVTAPSPAPSAPTTAMHITVSTPPPVIPLSHEQILELLQRTPDYGPVDAALADPSR 183
Query 188 RTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLA 247
R SCLSGLGYPAST +LGAQPI I+ARP +LLV+P D PD +AV+AVA +CSAADTGLLA
Sbjct 184 RASCLSGLGYPASTQILGAQPIGINARPGLLLVVPGDGPDTVAVYAVALNCSAADTGLLA 243
Query 248 STVVPRA 254
ST V RA
Sbjct 244 STTVARA 250
>gi|342862340|ref|ZP_08718981.1| hypothetical protein MCOL_25738 [Mycobacterium colombiense CECT
3035]
gi|342130197|gb|EGT83525.1| hypothetical protein MCOL_25738 [Mycobacterium colombiense CECT
3035]
Length=256
Score = 263 bits (671), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 161/259 (63%), Positives = 191/259 (74%), Gaps = 10/259 (3%)
Query 1 MSAADKDP-DKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNR 59
M+AA+ D SA +DPPLTVELLADLQAG+LDD AAR+RS+VR+DP A +L ALNR
Sbjct 1 MNAAENGAADDRSAGSDPPLTVELLADLQAGVLDDEAAARVRSQVRADPHAADVLGALNR 60
Query 60 VRRDVAAMGADPAWGPAARPAVVDSISAALRSARP--NSSPG-AAHAARPHVHPVRMIAG 116
VRRDVAA+GADP P P V ISAALRSA P N + G A H+ARPH+ P R +A
Sbjct 61 VRRDVAAVGADPGAPPDPPPQVSGRISAALRSAEPVSNHATGPADHSARPHLRPARTVAA 120
Query 117 AAGLCAVATAIGVG--AVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPD 174
AAG+CAV AIG G A++ AP PAP P HITVS P IPLS+P +L LL TPD
Sbjct 121 AAGMCAVLAAIGFGTVALLHAPQPAPDTPGDVAHITVSTPPMEIPLSQPDILGLLDRTPD 180
Query 175 YGPPGGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAV 234
+G L DP+RR SCLSGLGYPAST VLGA+P++I+ARP ++LVIP D+P LAVFAV
Sbjct 181 FGA----LSDPARRASCLSGLGYPASTQVLGARPVEINARPGIVLVIPGDSPHILAVFAV 236
Query 235 APHCSAADTGLLASTVVPR 253
+P+CSAADTGLLA+T VPR
Sbjct 237 SPNCSAADTGLLANTQVPR 255
>gi|183985446|ref|YP_001853737.1| hypothetical protein MMAR_5476 [Mycobacterium marinum M]
gi|183178772|gb|ACC43882.1| conserved hypothetical alanine rich membrane protein [Mycobacterium
marinum M]
Length=249
Score = 248 bits (632), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 152/246 (62%), Positives = 175/246 (72%), Gaps = 7/246 (2%)
Query 10 KHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGA 69
+ D DP LTVELLADLQAGLLDD TAA++R R+R+DPQA L AL RVRRDVA G
Sbjct 5 EDETDTDPALTVELLADLQAGLLDDETAAQVRRRIRTDPQAAATLEALQRVRRDVAQAGT 64
Query 70 DPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGLCAVATAIGV 129
D + P + I+ A+RSA + P AAHAARP +++AG AGL A+ AIG+
Sbjct 65 DTSGLGDPPPHLPTRITDAVRSAT-SGGPTAAHAARPRPSSNKILAGIAGLIALIAAIGL 123
Query 130 G--AVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSR 187
G A++ AP P PS P TA HITVS P PVIPLS QVLDLL PDYGP DPSR
Sbjct 124 GTAALITAPGPTPSGPPTAMHITVSTPPPVIPLSHDQVLDLLQRAPDYGP----FADPSR 179
Query 188 RTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLA 247
R SCLSGLGYPASTP+LGAQPIDI+ARP VLLV+ D P LAV+AVA +CSAADTGLLA
Sbjct 180 RASCLSGLGYPASTPILGAQPIDINARPGVLLVLAGDAPADLAVYAVALNCSAADTGLLA 239
Query 248 STVVPR 253
ST +PR
Sbjct 240 STTLPR 245
>gi|118620067|ref|YP_908399.1| hypothetical protein MUL_5065 [Mycobacterium ulcerans Agy99]
gi|118572177|gb|ABL06928.1| conserved hypothetical alanine rich membrane protein [Mycobacterium
ulcerans Agy99]
Length=249
Score = 247 bits (631), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 149/246 (61%), Positives = 176/246 (72%), Gaps = 7/246 (2%)
Query 10 KHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGA 69
+ D DP LTVELLADLQAGLLDD TAA++R R+R+DP+A L AL RVRRDVA G
Sbjct 5 EDETDTDPALTVELLADLQAGLLDDETAAQVRRRIRTDPEAAATLEALQRVRRDVAQAGT 64
Query 70 DPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGLCAVATAIGV 129
D + P + I+ A+RSA + P AAHAARP + +++AG AGL A+ AIG+
Sbjct 65 DTSGLGDPPPHLPTRITDAVRSAT-SGGPTAAHAARPRPNSNKILAGIAGLIALIAAIGL 123
Query 130 G--AVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSR 187
G A++ AP P P+ P TA HITVS P PVIPLS QVLDLL PDYGP DPSR
Sbjct 124 GTAALITAPGPTPNGPPTAMHITVSTPPPVIPLSHDQVLDLLQRAPDYGP----FADPSR 179
Query 188 RTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLA 247
RTSCLSGLGYPASTP+LGAQP+DI+ RP VLLV+ D P LAV+AVA +CSAADTGLLA
Sbjct 180 RTSCLSGLGYPASTPILGAQPVDINVRPGVLLVLAGDAPADLAVYAVALNCSAADTGLLA 239
Query 248 STVVPR 253
ST +PR
Sbjct 240 STTLPR 245
>gi|296167155|ref|ZP_06849562.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897477|gb|EFG77076.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=242
Score = 225 bits (573), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 144/250 (58%), Positives = 178/250 (72%), Gaps = 12/250 (4%)
Query 6 KDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVA 65
++PD + +A PLTVELLADLQAGLLDD +AAR+R RVR DP+A+QILRALN+VR DVA
Sbjct 2 REPDNEADEA--PLTVELLADLQAGLLDDESAARVRRRVRDDPEAEQILRALNQVRCDVA 59
Query 66 AMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGLCAVAT 125
+G PA P P V + A A + P HAARP + P R++A G+ AV
Sbjct 60 TLGGGPA--PEVPPDV--TARVAAALASAETGPPVDHAARPRLRPTRVLAMVIGVGAVLA 115
Query 126 AIGVG--AVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLG 183
AIGVG A+V AP PAPSA TA+HITVS P VIPLS +++ LL P+YGP L
Sbjct 116 AIGVGTAALVTAPEPAPSAQVTAEHITVSTPPMVIPLSSDEIVGLLSQRPEYGP----LA 171
Query 184 DPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADT 243
DP RR SCL+GLGYPAST VLG + ++I+ARP ++LV+PADTP LAV+AVA +C+AADT
Sbjct 172 DPVRRASCLTGLGYPASTQVLGGRRVEINARPGIVLVVPADTPHNLAVYAVALNCNAADT 231
Query 244 GLLASTVVPR 253
GLLA+T VPR
Sbjct 232 GLLATTQVPR 241
>gi|254777649|ref|ZP_05219165.1| hypothetical protein MaviaA2_23671 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=254
Score = 211 bits (536), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 153/255 (60%), Positives = 187/255 (74%), Gaps = 7/255 (2%)
Query 2 SAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVR 61
+A + D +A++DPPLT ELLADLQAG+LDDA AAR+R RVR+DP A +L ALNRVR
Sbjct 3 TAGNDAADHRNAESDPPLTAELLADLQAGVLDDAAAARVRRRVRADPHAADVLDALNRVR 62
Query 62 RDVAAMGADPAWGPAARPAVVDSISAALRSARP-NSSPGAAHAARPHVHPVRMIAGAAGL 120
R+VAA+GADPA P P V ++ ALRSA P ++P AAH+ARP + P R IA AGL
Sbjct 63 REVAALGADPASPPDPPPQVTARVAEALRSAEPVGATPRAAHSARPPLRPARAIAAVAGL 122
Query 121 CAVATAIGVG--AVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPP 178
A AIGVG A++ P PAPS PT +HITVS P IPLSR ++L LL PD+GP
Sbjct 123 GAALAAIGVGTVALLRTPAPAPSTPTDIEHITVSTPPMQIPLSRAEILGLLDRGPDFGP- 181
Query 179 GGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHC 238
L DP+RR SCL+GLGYPASTPVLGA+P+ I+ARP V+LVIP D+P L V+AV+ +C
Sbjct 182 ---LSDPARRASCLTGLGYPASTPVLGARPVAINARPGVVLVIPGDSPHVLTVYAVSANC 238
Query 239 SAADTGLLASTVVPR 253
SAADTGLLA+T VPR
Sbjct 239 SAADTGLLANTEVPR 253
>gi|118463922|ref|YP_884410.1| hypothetical protein MAV_5300 [Mycobacterium avium 104]
gi|118165209|gb|ABK66106.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=254
Score = 211 bits (536), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 153/255 (60%), Positives = 188/255 (74%), Gaps = 7/255 (2%)
Query 2 SAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVR 61
+A + D +A++DPPLT ELLADLQAG+LDDA AAR+R RVR+DP A +L ALNRVR
Sbjct 3 TAGNDAADHRNAESDPPLTAELLADLQAGVLDDAAAARVRRRVRADPHAADVLDALNRVR 62
Query 62 RDVAAMGADPAWGPAARPAVVDSISAALRSARP-NSSPGAAHAARPHVHPVRMIAGAAGL 120
R+VAA+GADPA P P V ++AALRSA P ++P AAH+ARP + R IA AGL
Sbjct 63 REVAALGADPASPPDPPPQVTARVAAALRSAEPVGATPRAAHSARPPLRTARAIAAVAGL 122
Query 121 CAVATAIGVG--AVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPP 178
A AIGVG A++ P PAPS PT +HITVS P IPLSR ++L LL +PD+GP
Sbjct 123 GAALAAIGVGTVALLRTPAPAPSTPTDIEHITVSTPPMQIPLSRAEILGLLDRSPDFGP- 181
Query 179 GGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHC 238
L DP+RR SCL+GLGYPASTPVLGA+P+ I+ARP V+LVIP D+P L V+AV+ +C
Sbjct 182 ---LSDPARRASCLTGLGYPASTPVLGARPVAINARPGVVLVIPGDSPHVLTVYAVSANC 238
Query 239 SAADTGLLASTVVPR 253
SAADTGLLA+T VPR
Sbjct 239 SAADTGLLANTEVPR 253
>gi|41410436|ref|NP_963272.1| hypothetical protein MAP4338 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41399270|gb|AAS06888.1| hypothetical protein MAP_4338 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336459803|gb|EGO38717.1| hypothetical protein MAPs_46940 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=232
Score = 202 bits (513), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 140/255 (55%), Positives = 172/255 (68%), Gaps = 29/255 (11%)
Query 2 SAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVR 61
+A + D +A++DPPLT ELLADLQAG+LD ALNRVR
Sbjct 3 TAGNDAADHRNAESDPPLTAELLADLQAGVLD----------------------ALNRVR 40
Query 62 RDVAAMGADPAWGPAARPAVVDSISAALRSARP-NSSPGAAHAARPHVHPVRMIAGAAGL 120
R+VAA+GADPA P P V ++AALRSA P ++P AAH+ARP + R IA AGL
Sbjct 41 REVAALGADPASPPDPPPQVTARVAAALRSAEPVGATPRAAHSARPPLRTARAIAAVAGL 100
Query 121 CAVATAIGVG--AVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPP 178
A AIGVG A++ P PAPS PT +HITVS P IPLSR ++L LL +PD+GP
Sbjct 101 GAALAAIGVGTVALLRTPAPAPSTPTDIEHITVSTPPMQIPLSRAEILGLLDRSPDFGP- 159
Query 179 GGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHC 238
L DP+RR SCL+GLGYPASTPVLGA+P+ I+ARP V+LVIP D+P L V+AV+ +C
Sbjct 160 ---LSDPARRASCLTGLGYPASTPVLGARPVAINARPGVVLVIPGDSPHVLTVYAVSANC 216
Query 239 SAADTGLLASTVVPR 253
SAADTGLLA+T VPR
Sbjct 217 SAADTGLLANTEVPR 231
>gi|254818788|ref|ZP_05223789.1| hypothetical protein MintA_02629 [Mycobacterium intracellulare
ATCC 13950]
Length=160
Score = 178 bits (452), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 94/156 (61%), Positives = 117/156 (75%), Gaps = 2/156 (1%)
Query 100 AAHAARPHVHPVRMIAGAAGLCAVATAIGVG--AVVDAPPPAPSAPTTAQHITVSKPAPV 157
AAH+ARP + P R+IA AGLCA AIG G A++ AP PAPS P QHITVS P
Sbjct 4 AAHSARPRLRPARLIAAVAGLCAALAAIGFGTVALLHAPEPAPSTPGDVQHITVSTPPME 63
Query 158 IPLSRPQVLDLLHHTPDYGPPGGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAV 217
+PLS ++L LL PD+G GG L DP+RR SCL+GLGYPA+T VLGA+P++++ARP V
Sbjct 64 VPLSADEILGLLDRAPDFGSSGGTLSDPARRASCLTGLGYPAATQVLGARPVEVNARPGV 123
Query 218 LLVIPADTPDKLAVFAVAPHCSAADTGLLASTVVPR 253
+LVIP D+P LAV+ V+P+CSAADTGLLA+T VPR
Sbjct 124 VLVIPGDSPHVLAVYVVSPNCSAADTGLLANTQVPR 159
>gi|333992976|ref|YP_004525590.1| hypothetical protein JDM601_4336 [Mycobacterium sp. JDM601]
gi|333488944|gb|AEF38336.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=146
Score = 108 bits (270), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 58/122 (48%), Positives = 81/122 (67%), Gaps = 5/122 (4%)
Query 133 VDAPPPAPSAPTTAQHITV-SKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSRRTSC 191
+ P PS P + ITV S+P PV+PL+ ++L+LL +P+YG L DP RR C
Sbjct 28 IRFPGDTPSGPRSFDAITVASEPTPVLPLTEAEILELLDRSPEYGA----LADPGRRAGC 83
Query 192 LSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLASTVV 251
L+ LGYPA+T VLGA+ + ++ RPA++L++P P + AVAP+CS+ADTGLLA T V
Sbjct 84 LAALGYPAATRVLGARELAVNGRPAIVLLLPGAAPGTVIALAVAPNCSSADTGLLADTSV 143
Query 252 PR 253
R
Sbjct 144 RR 145
>gi|120406995|ref|YP_956824.1| hypothetical protein Mvan_6066 [Mycobacterium vanbaalenii PYR-1]
gi|119959813|gb|ABM16818.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=210
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 81/200 (41%), Positives = 110/200 (55%), Gaps = 11/200 (5%)
Query 57 LNRVRRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARP---HVHPVRM 113
++ VRR++A +G D P V +SAALR+A P PG+ RP +H +
Sbjct 18 IDDVRRELARLGTDARNAPEVPAEVTARVSAALRAAPP---PGSHAVIRPKLTRLHRAGL 74
Query 114 IAGAAGLCAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTP 173
+ GA + A A + D P P+ PT +Q ITV+ PA PLS P++L L P
Sbjct 75 LIGACAVAAAAVVGAITLTRDPAPVFPAGPTASQ-ITVTDPAQPFPLSGPELLTALDAAP 133
Query 174 DYGPPGGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFA 233
D GP L D RR SCL+GLGY + VLG + ID+ RP VLL++P + ++ A
Sbjct 134 DLGP----LTDAPRRASCLAGLGYAPTLEVLGGRQIDVAGRPGVLLLLPGASAGQIVAVA 189
Query 234 VAPHCSAADTGLLASTVVPR 253
V+P CS A TGLLA T+V R
Sbjct 190 VSPTCSTAHTGLLAETLVNR 209
>gi|169632012|ref|YP_001705661.1| hypothetical protein MAB_4939 [Mycobacterium abscessus ATCC 19977]
gi|169243979|emb|CAM65007.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=299
Score = 105 bits (261), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 96/280 (35%), Positives = 130/280 (47%), Gaps = 50/280 (17%)
Query 15 ADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGADPAWG 74
A+ PL++E+LADL AG+ D + +R R DP A+Q L AL++VR ++ A PA
Sbjct 21 AEVPLSLEVLADLHAGVYDTGDSNVLRQRADQDPDARQTLAALDQVRAELVAWMDSPA-- 78
Query 75 PAARPAVVDSISAALR--SARPNSSPGAAHAARPHVHPVRMIAGAAGL------------ 120
P +VVD I AALR SA+ AA A RP + ++ G L
Sbjct 79 PEVPESVVDDIVAALRAESAKSTLPAIAADAVRPADVSLNLVTGPVPLDHRRRPAAKRRW 138
Query 121 ---------------CAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPV-------- 157
A+ A + P+ +Q I APV
Sbjct 139 LAYSGAGLAAAACVAVAITVVAQDNQRASQTTTAVAGPSLSQQIAPKTAAPVAPELPARG 198
Query 158 ------IPLSRPQVLDLLHHTPDYGPPGGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDI 211
PLS ++ L+ P +G L D +RR SCL+GLG ASTPVLGAQ +DI
Sbjct 199 AVGQTAFPLSGNEITALVGRAPAFGE----LDDAARRASCLTGLGLSASTPVLGAQTLDI 254
Query 212 DARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLASTVV 251
D PAVL+V+PA+ P +L AV P CS D +A T +
Sbjct 255 DG-PAVLMVLPAERPGELLAVAVRPGCSQTDPQRVAQTRI 293
>gi|145221434|ref|YP_001132112.1| hypothetical protein Mflv_0840 [Mycobacterium gilvum PYR-GCK]
gi|145213920|gb|ABP43324.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=200
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 73/201 (37%), Positives = 103/201 (52%), Gaps = 14/201 (6%)
Query 55 RALNRVRRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMI 114
+ R+RR++A +G+D A P PAV I+AALR A GA RP + +
Sbjct 12 ETVTRLRRELAYLGSDTASAPEVPPAVTARITAALRDAS-----GAHAVDRPVLSGSQRA 66
Query 115 AGAAGLCAVATAIGVGAVVDAPPPAPSAPT--TAQHITVSKPAPVIPLSRPQVLDLLHHT 172
G A TA+ + + PAP P TA ITV AP PLS+ + +++
Sbjct 67 GLVLGAGAAITAVVLAVLTLGGDPAPQFPAGPTASQITV---APSFPLSKQDLWEVVAAA 123
Query 173 PDYGPPGGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVF 232
PD GP L DP+R SCL+ LG+P + V+G + + + RPA+LL + D PD++
Sbjct 124 PDLGP----LADPARLASCLAALGHPTTVEVVGGRQLQVSGRPAILLALTGDDPDRVHAV 179
Query 233 AVAPHCSAADTGLLASTVVPR 253
AV C D+ LLA T V R
Sbjct 180 AVGTGCGGGDSDLLAETTVRR 200
>gi|118470542|ref|YP_891126.1| hypothetical protein MSMEG_6932 [Mycobacterium smegmatis str.
MC2 155]
gi|118171829|gb|ABK72725.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=245
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/229 (43%), Positives = 121/229 (53%), Gaps = 19/229 (8%)
Query 16 DPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGA--DPAW 73
+ PLT ELLADLQAGLLDDATAAR+R RVR DP+A +L AL RVRRD+A +GA D
Sbjct 11 EEPLTPELLADLQAGLLDDATAARVRRRVRDDPEAADMLAALERVRRDLAGLGADLDAES 70
Query 74 GPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGLCAVATAIGVGAVV 133
P P V + AALR A R R+ A G AV ++
Sbjct 71 APPVPPEVSAKLIAALR----------AERPRHARRWRRIGAIIGGCAAVVAVAIGAIML 120
Query 134 DAPPP--APSAPTTAQHITVSKPAPV-IPLSRPQVLDLLHHTPDYGPPGGPLGDPSRRTS 190
P P P APT + P P + L+ QVL +L PD+ GPL DP+RRT
Sbjct 121 QHPAPLSKPPAPTGQFGRITTTPVPSDLGLTEAQVLAVLTTPPDF----GPLADPARRTG 176
Query 191 CLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCS 239
CL+ LGYP VLGA+P+D+ R VL+++ D P L V C+
Sbjct 177 CLAALGYPPGIRVLGARPLDVAGRRGVLILLADDRPATLTGLVVPADCA 225
>gi|315446814|ref|YP_004079693.1| hypothetical protein Mspyr1_53340 [Mycobacterium sp. Spyr1]
gi|315265117|gb|ADU01859.1| hypothetical protein Mspyr1_53340 [Mycobacterium sp. Spyr1]
Length=200
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 72/199 (37%), Positives = 103/199 (52%), Gaps = 10/199 (5%)
Query 55 RALNRVRRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMI 114
+ R+RR++A +G+D A P PAV I+AALR A + +R + +
Sbjct 12 ETVTRLRRELAYLGSDTASAPEVPPAVTARITAALRDASGTHAVDRPVLSRSQRTGLVLG 71
Query 115 AGAAGLCAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPD 174
A A V + +G D P P+ PT +Q ITV AP PLS + +++ PD
Sbjct 72 AAAVVAAVVLAVLTLGG--DPEPQFPAGPTASQ-ITV---APSFPLSEQDLWEVVAAAPD 125
Query 175 YGPPGGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAV 234
GP L DP+R SCL+ LGYP + V+G + + + RPA+LL + D PD++ AV
Sbjct 126 LGP----LADPARLASCLAALGYPPTVEVVGGRQLQVSGRPAILLALTGDDPDRVHAVAV 181
Query 235 APHCSAADTGLLASTVVPR 253
C A D+ LLA T V R
Sbjct 182 GTGCGAGDSDLLAETTVRR 200
>gi|111020635|ref|YP_703607.1| hypothetical protein RHA1_ro03646 [Rhodococcus jostii RHA1]
gi|110820165|gb|ABG95449.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=249
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/250 (32%), Positives = 116/250 (47%), Gaps = 5/250 (2%)
Query 4 ADKDPDKHSADA-DPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRR 62
AD++PD A PP + +LLADL AG+L ++ + R+ VR+DPQA ++ AL+RV
Sbjct 2 ADREPDPSDATLPAPPYSEDLLADLHAGVLPESVSDRLWPLVRNDPQAMSVIDALDRVTD 61
Query 63 DVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGLCA 122
+ A+G D + V D I+ AL + R +P A A A A A
Sbjct 62 QLGALGRDHSVSTPIPADVADRINRALAAER--DTPAEATAVPLARRRKWAAAAAGTFAA 119
Query 123 VATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPG-GP 181
A + AVV P P A + + PAP + L LD G GP
Sbjct 120 AAAVVVAVAVVTPDRQEPETPIVALPSSETSPAPAV-LDLGTDLDSGRLLTVIGSRQLGP 178
Query 182 LGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAA 241
L DP++ + CL G S P+LG+ + +D P VLL++ A P ++ V C A
Sbjct 179 LEDPAQLSECLRANGIETSRPLLGSGEVRLDGVPGVLLLVAAPRPPQITALVVGRECGAG 238
Query 242 DTGLLASTVV 251
D +A T +
Sbjct 239 DPATIAVTEI 248
>gi|333922224|ref|YP_004495805.1| hypothetical protein AS9A_4573 [Amycolicicoccus subflavus DQS3-9A1]
gi|333484445|gb|AEF43005.1| hypothetical protein AS9A_4573 [Amycolicicoccus subflavus DQS3-9A1]
Length=245
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 71/244 (30%), Positives = 113/244 (47%), Gaps = 25/244 (10%)
Query 16 DPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGADPAWGP 75
+PP + E+LADL +G+ DA A+ +R+ +++DP A +L AL VR +++ + +P
Sbjct 17 EPPFSREVLADLDSGVYPDAVASHMRAHIQADPYAAPVLSALRTVRSELSQL-RNPTGAC 75
Query 76 AARPAVVDS-ISAALRSARPNSSPGAAH---AARPHVHPVRMIAGAAGLCAVATAIGVG- 130
PA + + ++AA+ A SSP R PV +A AG A A+ G
Sbjct 76 EDIPADISARLTAAIERATAESSPENVAPLLRTRRWTRPV--VAVLAGTAAAGIAVFAGA 133
Query 131 AVVDAPP---PAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSR 187
AV+DAPP PA SA + Q + P+ + LL S
Sbjct 134 AVLDAPPSESPAFSAQPSHQGTDAASTGPLSAIGSRAETGLL--------------SGSA 179
Query 188 RTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLA 247
+CL G+ TP+LG+ + AV L++P ++ V P C A + L++
Sbjct 180 LAACLGEHGFSGDTPLLGSSVEIVSGEQAVRLLLPETHSARVIALTVRPSCGAGNPALIS 239
Query 248 STVV 251
TV+
Sbjct 240 RTVL 243
>gi|312142011|ref|YP_004009347.1| membrane protein [Rhodococcus equi 103S]
gi|311891350|emb|CBH50671.1| putative membrane protein [Rhodococcus equi 103S]
Length=255
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 76/245 (32%), Positives = 111/245 (46%), Gaps = 12/245 (4%)
Query 17 PPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGADPAWGPA 76
PP + +LLADL AG+L DA + ++ VR DP+A +L AL+ V +A +G D +
Sbjct 12 PPFSTDLLADLHAGVLPDAVSDKLWPLVRQDPEAVAVLDALDAVSARLAEVGRDHSVETP 71
Query 77 ARPAVVDSISAAL----RSARPNSSPGA-AHAARPHVHPVRMIAGAAGLCAVATAIGVGA 131
V I++AL + R + P A A A R + + + A + G
Sbjct 72 IPHDVAARINSALGLNVSAPRSDVVPLADATAKRRRMAWLGVAAASMAAAVAVVFALTG- 130
Query 132 VVD-----APPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPS 186
VD P SA TTA + ++ L R Q+L L+ T G L P
Sbjct 131 -VDRSGSTGPEAVASATTTAPDVAPARVELSGELDRGQLLALVGDTESAADGVGALARPE 189
Query 187 RRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLL 246
R++CLS +G A+ PVLG + + AVLL++ TP L V C A L+
Sbjct 190 VRSACLSAVGVGATRPVLGMRAVRFQDTDAVLLLVAGPTPPTLLALVVGTGCDATHPDLI 249
Query 247 ASTVV 251
ST +
Sbjct 250 DSTEI 254
>gi|325677541|ref|ZP_08157205.1| hypothetical protein HMPREF0724_14988 [Rhodococcus equi ATCC
33707]
gi|325551788|gb|EGD21486.1| hypothetical protein HMPREF0724_14988 [Rhodococcus equi ATCC
33707]
Length=255
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 76/245 (32%), Positives = 111/245 (46%), Gaps = 12/245 (4%)
Query 17 PPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGADPAWGPA 76
PP + +LLADL AG+L DA + ++ VR DP+A +L AL+ V +A +G D +
Sbjct 12 PPFSTDLLADLHAGVLPDAVSDKLWPLVRQDPEAVAVLDALDAVSARLAEVGRDHSVETP 71
Query 77 ARPAVVDSISAAL----RSARPNSSPGA-AHAARPHVHPVRMIAGAAGLCAVATAIGVGA 131
V I++AL + R + P A A A R + + + A + G
Sbjct 72 IPHDVAARINSALGLNVSAPRSDVVPLADATAKRRRMAWLGVAAASTAAAVAVVFALTG- 130
Query 132 VVD-----APPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPS 186
VD P SA TTA + ++ L R Q+L L+ T G L P
Sbjct 131 -VDRSGSTGPEAVASATTTAPDVAPARVELSGELDRGQLLALVGDTESAADGVGALARPE 189
Query 187 RRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLL 246
R++CLS +G A+ PVLG + + AVLL++ TP L V C A L+
Sbjct 190 VRSACLSAVGVGATRPVLGMRAVRFQDTDAVLLLVAGPTPPTLLALVVGTGCDATHPDLI 249
Query 247 ASTVV 251
ST +
Sbjct 250 DSTEI 254
>gi|262204650|ref|YP_003275858.1| hypothetical protein Gbro_4852 [Gordonia bronchialis DSM 43247]
gi|262087997|gb|ACY23965.1| hypothetical protein Gbro_4852 [Gordonia bronchialis DSM 43247]
Length=249
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 70/244 (29%), Positives = 106/244 (44%), Gaps = 15/244 (6%)
Query 9 DKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMG 68
D+ DPP EL+ADL AG+L D +A + +R+ DP AQ+ILRAL R +
Sbjct 17 DRGQGFPDPPYPPELIADLHAGVLTDELSAHLYARIADDPAAQRILRALEDTRDQLHNAP 76
Query 69 ADPAWGPAARPAVVDSISAALRS-ARPNSSPGAAHAARPHVHPVRMIAGAAGLCAVATAI 127
DP P A V + A LR+ + P S P A R V + A AV+ AI
Sbjct 77 VDPVAPPRDVEASVAATLAGLRTPSTPASDPTRAVRRRTLVTVLAAAAVVIAALAVSIAI 136
Query 128 GVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSR 187
+ ++ PP + PTT + ++ + L +L T G P +
Sbjct 137 LRPSDDESTPPV-AEPTTTHTLDGAE--------QVSALSVLGRT-----DGAPFVSINA 182
Query 188 RTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLA 247
C S G PA T +G+ PI + RPA ++++ + V C + ++
Sbjct 183 LRRCTSANGVPAQTATVGSGPITVGGRPAAVILLSTGVAGRFEALVVGLDCDTNNPATVS 242
Query 248 STVV 251
TV+
Sbjct 243 RTVI 246
>gi|229491169|ref|ZP_04384997.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229321907|gb|EEN87700.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=238
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 72/237 (31%), Positives = 103/237 (44%), Gaps = 20/237 (8%)
Query 17 PPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGADPAWGPA 76
PP + +LLADL AG+LD T+A + VR+DP A + AL+ V +A + + A
Sbjct 11 PPFSEDLLADLHAGVLDSETSAALWPVVRADPDAHAFVTALDGVTASLATLNSSKATHER 70
Query 77 ARPAVVDSISAALRSARPNSSPGAAHAARPH-VHPVRMIAGAAGLCAVATAIGVGAVVDA 135
++ + I++AL P A A P P + GAA VA I V D+
Sbjct 71 IPNSLAERINSALDLQSEQLEPPAEAAVVPFRRRPQAWVLGAAAAVLVAFVIVVVGTRDS 130
Query 136 PPPA-----PSAPTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSRRTS 190
P PS P T+ V P LS +L GPL DPS+
Sbjct 131 EPAESVVAQPSTPATS---AVDDPDSASLLSLVGSKNL-----------GPLDDPSKLAG 176
Query 191 CLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLA 247
CL G P+LG+ I ID PA++L+ P ++ V +CS + L+
Sbjct 177 CLRANGIDEGRPLLGSGEIRIDGAPAIVLLFTGSQPRQITALTVGINCSDGNPNTLS 233
>gi|254823060|ref|ZP_05228061.1| hypothetical protein MintA_24240 [Mycobacterium intracellulare
ATCC 13950]
Length=48
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/48 (69%), Positives = 38/48 (80%), Gaps = 1/48 (2%)
Query 1 MSAADKD-PDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSD 47
M AAD D D SA++DPPLTVELLADLQAG LDD AAR+R +VR+D
Sbjct 1 MDAADNDGADHRSAESDPPLTVELLADLQAGALDDEAAARVRRQVRAD 48
>gi|226309502|ref|YP_002769464.1| hypothetical protein RER_60170 [Rhodococcus erythropolis PR4]
gi|226188621|dbj|BAH36725.1| hypothetical protein RER_60170 [Rhodococcus erythropolis PR4]
Length=238
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 62/234 (27%), Positives = 95/234 (41%), Gaps = 14/234 (5%)
Query 17 PPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGADPAWGPA 76
PP + +LLADL AG+LD T+A + V +D A + L+ V +A + + A
Sbjct 11 PPFSEDLLADLHAGVLDPETSAALWPLVHADADAHAFVTTLDSVTASLATLNSREASHER 70
Query 77 ARPAVVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGLCAVATAIGVGAVVDAP 136
++ + I++AL P A P + A + + V
Sbjct 71 IPNSLAERINSALDLQYGQVEPSAETTVVPFRRRPQAWLLGAAAAVLVAFVIVVIGTRDS 130
Query 137 PPAPSA---PTTAQHITVSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSRRTSCLS 193
PA S P+T+ V P LS +L GPL DPS+ CLS
Sbjct 131 EPAESVVAQPSTSVTSAVDDPDSASLLSLIGSKNL-----------GPLDDPSKLAGCLS 179
Query 194 GLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLA 247
G PVLG+ + ID PA++L+ P ++ V +CS + L+
Sbjct 180 ANGIDEGRPVLGSGEVRIDGAPAIVLLFTGSQPRQITALTVGINCSDGNPNTLS 233
>gi|326383896|ref|ZP_08205580.1| hypothetical protein SCNU_13223 [Gordonia neofelifaecis NRRL
B-59395]
gi|326197355|gb|EGD54545.1| hypothetical protein SCNU_13223 [Gordonia neofelifaecis NRRL
B-59395]
Length=97
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 33/83 (40%), Positives = 50/83 (61%), Gaps = 3/83 (3%)
Query 10 KHSADA---DPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAA 66
KH D +PP + ELLADL A LD A+ +RSR+ +DP+A+++L AL+RV++D+
Sbjct 6 KHPTDFTPPEPPFSTELLADLHADALDPELASHVRSRLPADPRAEEVLDALDRVQQDLRG 65
Query 67 MGADPAWGPAARPAVVDSISAAL 89
+ P A A +DS+ L
Sbjct 66 LRTPAPPMPEAVAARLDSVIDGL 88
>gi|226362878|ref|YP_002780658.1| hypothetical protein ROP_34660 [Rhodococcus opacus B4]
gi|226241365|dbj|BAH51713.1| hypothetical membrane protein [Rhodococcus opacus B4]
Length=248
Score = 52.0 bits (123), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 35/91 (39%), Positives = 53/91 (59%), Gaps = 1/91 (1%)
Query 4 ADKDPDKHSADA-DPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRR 62
AD++P A +PP + +LLADL AG+L ++ + R+ VRSDPQA ++ AL+RV
Sbjct 2 ADREPGPSGAALPEPPYSEDLLADLHAGVLPESVSDRLWPLVRSDPQAMAVIDALDRVTD 61
Query 63 DVAAMGADPAWGPAARPAVVDSISAALRSAR 93
+ A+G D + + D IS AL + R
Sbjct 62 QLGALGRDHSVSTPIPAEIADRISRALAAER 92
>gi|300791151|ref|YP_003771442.1| hypothetical protein AMED_9352 [Amycolatopsis mediterranei U32]
gi|299800665|gb|ADJ51040.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340532851|gb|AEK48056.1| hypothetical protein RAM_47955 [Amycolatopsis mediterranei S699]
Length=269
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/56 (47%), Positives = 39/56 (70%), Gaps = 0/56 (0%)
Query 17 PPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGADPA 72
PP +V++LADL AG+LDD AA + V +DP+A+ IL AL+ + D+A++ PA
Sbjct 14 PPWSVDVLADLHAGVLDDTRAAELWPLVNADPEARAILDALDATQADLASLAEAPA 69
>gi|302531343|ref|ZP_07283685.1| predicted protein [Streptomyces sp. AA4]
gi|302440238|gb|EFL12054.1| predicted protein [Streptomyces sp. AA4]
Length=265
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/51 (48%), Positives = 37/51 (73%), Gaps = 0/51 (0%)
Query 17 PPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAM 67
PP +V+LLADL AG+LD+ AA++ RV +DP+A+ I+ AL D++A+
Sbjct 14 PPWSVDLLADLHAGVLDEHEAAQLWPRVNADPEARAIIEALEATTADLSAL 64
>gi|257057902|ref|YP_003135734.1| hypothetical protein Svir_39660 [Saccharomonospora viridis DSM
43017]
gi|256587774|gb|ACU98907.1| hypothetical protein Svir_39660 [Saccharomonospora viridis DSM
43017]
Length=278
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/55 (44%), Positives = 36/55 (66%), Gaps = 0/55 (0%)
Query 15 ADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGA 69
A PP +V++LADL AG+LD+ AA + RV +DP A+ I+ AL + D++ A
Sbjct 12 AGPPWSVDVLADLHAGVLDEQEAAELWPRVNADPGARAIIEALESTKADLSGFAA 66
>gi|54027635|ref|YP_121877.1| hypothetical protein nfa56610 [Nocardia farcinica IFM 10152]
gi|54019143|dbj|BAD60513.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=299
Score = 45.1 bits (105), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/97 (36%), Positives = 47/97 (49%), Gaps = 7/97 (7%)
Query 17 PPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGAD------ 70
PP + ELLADL A +D A++R VR+D A + L L+ V V A+G D
Sbjct 18 PPFSAELLADLHADNIDPELGAQLRPVVRADASASRYLHDLDEVSARVRALGTDDRIIHP 77
Query 71 -PAWGPAARPAVVDSISAALRSARPNSSPGAAHAARP 106
PA A VD++ + SA N +P A+ P
Sbjct 78 MPADVAERLAAFVDALDSGAESAASNGAPTNANGWGP 114
>gi|134103803|ref|YP_001109464.1| hypothetical protein SACE_7383 [Saccharopolyspora erythraea NRRL
2338]
gi|291005739|ref|ZP_06563712.1| hypothetical protein SeryN2_14559 [Saccharopolyspora erythraea
NRRL 2338]
gi|133916426|emb|CAM06539.1| hypothetical protein SACE_7383 [Saccharopolyspora erythraea NRRL
2338]
Length=244
Score = 44.7 bits (104), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 21/47 (45%), Positives = 32/47 (69%), Gaps = 0/47 (0%)
Query 21 VELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAM 67
++LLAD AG+LD TA R+R +V DP+A++IL AL+ D+ +
Sbjct 1 MDLLADFHAGVLDQETADRVRVQVEEDPEAREILAALDATTADLGDL 47
>gi|256381061|ref|YP_003104721.1| hypothetical protein Amir_7085 [Actinosynnema mirum DSM 43827]
gi|255925364|gb|ACU40875.1| hypothetical protein Amir_7085 [Actinosynnema mirum DSM 43827]
Length=287
Score = 43.5 bits (101), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 22/51 (44%), Positives = 31/51 (61%), Gaps = 0/51 (0%)
Query 17 PPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAM 67
PP +V+LLADL AG L +R R+ D +AQ+IL AL+ D+ A+
Sbjct 10 PPWSVDLLADLHAGALTAQEENELRERIADDAEAQEILAALDATLSDLGAL 60
>gi|331700379|ref|YP_004336618.1| hypothetical protein Psed_6677 [Pseudonocardia dioxanivorans
CB1190]
gi|326955068|gb|AEA28765.1| hypothetical protein Psed_6677 [Pseudonocardia dioxanivorans
CB1190]
Length=552
Score = 41.6 bits (96), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 30/90 (34%), Positives = 43/90 (48%), Gaps = 14/90 (15%)
Query 178 PGGPLGDPSRRTSCLSGLGY--------PAST------PVLGAQPIDIDARPAVLLVIPA 223
P GPL DP+R+ +CL+ +G AST P L + + +D P VLLV+P
Sbjct 463 PEGPLSDPARQAACLAAVGVATPPVRAPSASTASTPPAPALATRRVLVDGAPGVLLVLPT 522
Query 224 DTPDKLAVFAVAPHCSAADTGLLASTVVPR 253
+ + V P C + +LA VV R
Sbjct 523 GDLGRFRLLVVDPECGSGGGHVLADQVVAR 552
>gi|317509438|ref|ZP_07967056.1| hypothetical protein HMPREF9336_03428 [Segniliparus rugosus ATCC
BAA-974]
gi|316252267|gb|EFV11719.1| hypothetical protein HMPREF9336_03428 [Segniliparus rugosus ATCC
BAA-974]
Length=271
Score = 37.7 bits (86), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 23/81 (29%), Positives = 37/81 (46%), Gaps = 9/81 (11%)
Query 180 GPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDK-------LAVF 232
GPL D + R CL G+P++ ++ A I D + + ++IP TP + + V
Sbjct 192 GPLSDDATRADCLVANGFPSNQQLVAAGRIKRDDKEGIFMMIP--TPSRMQGGAPEMTVL 249
Query 233 AVAPHCSAADTGLLASTVVPR 253
V C A L+ V+ R
Sbjct 250 VVGVECRAGVPATLSKQVLSR 270
>gi|296392453|ref|YP_003657337.1| hypothetical protein Srot_0013 [Segniliparus rotundus DSM 44985]
gi|296179600|gb|ADG96506.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=274
Score = 35.4 bits (80), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 34/134 (26%), Positives = 53/134 (40%), Gaps = 17/134 (12%)
Query 134 DAPPP-APSAPTTAQHITVSKPAPVI-------PLSRPQVLDLLHHTPDYGPPGGPLGDP 185
D+P P A P+ AQ + P++ + R LL + G PL D
Sbjct 143 DSPQPQAQPEPSAAQETSAEATEPLLASVDNRFTIERKDFSTLLRSGGSFKDLG-PLSDE 201
Query 186 SRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDK------LAVFAVAPHCS 239
+ R CL+ G+PA ++ A I + + ++IP TP + V V C
Sbjct 202 AIRGDCLAANGFPADQQLVAAGRIKRGDKEGIFMMIP--TPAMRGGAPGMTVLVVGSECR 259
Query 240 AADTGLLASTVVPR 253
A L+ V+ R
Sbjct 260 AGIPATLSKQVLSR 273
Lambda K H
0.317 0.132 0.395
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 371539772520
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40