BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0007
Length=304
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value
gi|308232617|ref|ZP_07416632.2| conserved membrane protein [Myco... 595 4e-168
gi|15607149|ref|NP_214521.1| hypothetical protein Rv0007 [Mycoba... 595 4e-168
gi|31791184|ref|NP_853677.1| hypothetical protein Mb0007 [Mycoba... 594 7e-168
gi|289756066|ref|ZP_06515444.1| conserved hypothetical protein [... 593 1e-167
gi|340625040|ref|YP_004743492.1| hypothetical protein MCAN_00061... 570 1e-160
gi|289441374|ref|ZP_06431118.1| conserved hypothetical protein [... 434 6e-120
gi|254233412|ref|ZP_04926738.1| hypothetical protein TBCG_00007 ... 427 8e-118
gi|339293087|gb|AEJ45198.1| hypothetical protein CCDC5079_0008 [... 413 1e-113
gi|118615926|ref|YP_904258.1| hypothetical protein MUL_0007 [Myc... 352 4e-95
gi|118462372|ref|YP_879310.1| hypothetical protein MAV_0007 [Myc... 352 4e-95
gi|41406105|ref|NP_958941.1| hypothetical protein MAP0007 [Mycob... 342 4e-92
gi|240172101|ref|ZP_04750760.1| hypothetical protein MkanA1_2248... 331 9e-89
gi|183980042|ref|YP_001848333.1| hypothetical protein MMAR_0007 ... 331 1e-88
gi|254773060|ref|ZP_05214576.1| hypothetical protein MaviaA2_000... 325 5e-87
gi|342862359|ref|ZP_08719000.1| hypothetical protein MCOL_25833 ... 322 3e-86
gi|296167136|ref|ZP_06849544.1| conserved hypothetical protein [... 307 1e-81
gi|254821355|ref|ZP_05226356.1| hypothetical protein MintA_15563... 293 3e-77
gi|15826872|ref|NP_301135.1| hypothetical protein ML0007 [Mycoba... 288 6e-76
gi|120401036|ref|YP_950865.1| hypothetical protein Mvan_0008 [My... 268 5e-70
gi|108796989|ref|YP_637186.1| hypothetical protein Mmcs_0008 [My... 266 3e-69
gi|145221414|ref|YP_001132092.1| hypothetical protein Mflv_0820 ... 262 5e-68
gi|315441704|ref|YP_004074583.1| hypothetical protein Mspyr1_000... 262 5e-68
gi|118470742|ref|YP_884430.1| hypothetical protein MSMEG_0007 [M... 248 6e-64
gi|169627128|ref|YP_001700777.1| hypothetical protein MAB_0020 [... 236 5e-60
gi|339296736|gb|AEJ48846.1| hypothetical protein CCDC5180_0009 [... 228 7e-58
gi|333988647|ref|YP_004521261.1| hypothetical protein JDM601_000... 224 1e-56
gi|325677510|ref|ZP_08157174.1| putative proline rich protein [R... 169 4e-40
gi|111020670|ref|YP_703642.1| proline rich protein [Rhodococcus ... 168 1e-39
gi|229491180|ref|ZP_04385008.1| putative proline rich protein [R... 168 1e-39
gi|134096628|ref|YP_001102289.1| hypothetical protein SACE_0010 ... 165 9e-39
gi|226303500|ref|YP_002763458.1| hypothetical protein RER_00110 ... 164 2e-38
gi|226362910|ref|YP_002780690.1| hypothetical protein ROP_34980 ... 164 2e-38
gi|54021972|ref|YP_116214.1| hypothetical protein nfa80 [Nocardi... 162 6e-38
gi|312137525|ref|YP_004004861.1| integral membrane protein [Rhod... 159 7e-37
gi|296137761|ref|YP_003645004.1| hypothetical protein Tpau_0011 ... 153 4e-35
gi|256374168|ref|YP_003097828.1| hypothetical protein Amir_0008 ... 147 2e-33
gi|326383916|ref|ZP_08205600.1| hypothetical protein SCNU_13323 ... 144 1e-32
gi|284988641|ref|YP_003407195.1| hypothetical protein Gobs_0012 ... 141 1e-31
gi|331693906|ref|YP_004330145.1| hypothetical protein Psed_0007 ... 141 1e-31
gi|330464885|ref|YP_004402628.1| hypothetical protein VAB18032_0... 139 4e-31
gi|262200053|ref|YP_003271261.1| hypothetical protein Gbro_0007 ... 138 9e-31
gi|333917687|ref|YP_004491268.1| hypothetical protein AS9A_0008 ... 136 4e-30
gi|300781945|ref|YP_003762236.1| hypothetical protein AMED_0008 ... 136 5e-30
gi|296392548|ref|YP_003657432.1| hypothetical protein Srot_0110 ... 135 8e-30
gi|343928741|ref|ZP_08768186.1| hypothetical protein GOALK_120_0... 135 9e-30
gi|302531363|ref|ZP_07283705.1| predicted protein [Streptomyces ... 131 2e-28
gi|159035683|ref|YP_001534936.1| hypothetical protein Sare_0009 ... 127 3e-27
gi|257054097|ref|YP_003131929.1| hypothetical protein Svir_00080... 126 5e-27
gi|324999889|ref|ZP_08121001.1| hypothetical protein PseP1_14021... 125 6e-27
gi|145592576|ref|YP_001156873.1| hypothetical protein Strop_0010... 125 9e-27
>gi|308232617|ref|ZP_07416632.2| conserved membrane protein [Mycobacterium tuberculosis SUMu001]
gi|308369282|ref|ZP_07417162.2| conserved membrane protein [Mycobacterium tuberculosis SUMu002]
gi|308370292|ref|ZP_07420935.2| conserved membrane protein [Mycobacterium tuberculosis SUMu003]
21 more sequence titles
Length=309
Score = 595 bits (1533), Expect = 4e-168, Method: Compositional matrix adjust.
Identities = 304/304 (100%), Positives = 304/304 (100%), Gaps = 0/304 (0%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ
Sbjct 6 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 65
Query 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE
Sbjct 66 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 125
Query 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA
Sbjct 126 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 185
Query 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV 240
RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV
Sbjct 186 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV 245
Query 241 GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL 300
GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL
Sbjct 246 GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL 305
Query 301 ADRD 304
ADRD
Sbjct 306 ADRD 309
>gi|15607149|ref|NP_214521.1| hypothetical protein Rv0007 [Mycobacterium tuberculosis H37Rv]
gi|15839379|ref|NP_334416.1| hypothetical protein MT0007 [Mycobacterium tuberculosis CDC1551]
gi|121635890|ref|YP_976113.1| hypothetical protein BCG_0007 [Mycobacterium bovis BCG str. Pasteur
1173P2]
47 more sequence titles
Length=304
Score = 595 bits (1533), Expect = 4e-168, Method: Compositional matrix adjust.
Identities = 303/304 (99%), Positives = 304/304 (100%), Gaps = 0/304 (0%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
+TAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ
Sbjct 1 MTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
Query 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE
Sbjct 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
Query 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA
Sbjct 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
Query 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV 240
RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV
Sbjct 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV 240
Query 241 GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL 300
GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL
Sbjct 241 GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL 300
Query 301 ADRD 304
ADRD
Sbjct 301 ADRD 304
>gi|31791184|ref|NP_853677.1| hypothetical protein Mb0007 [Mycobacterium bovis AF2122/97]
gi|31616769|emb|CAD92869.1| POSSIBLE CONSERVED MEMBRANE PROTEIN [Mycobacterium bovis AF2122/97]
Length=304
Score = 594 bits (1531), Expect = 7e-168, Method: Compositional matrix adjust.
Identities = 302/304 (99%), Positives = 304/304 (100%), Gaps = 0/304 (0%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
+TAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ
Sbjct 1 MTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
Query 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE
Sbjct 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
Query 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA
Sbjct 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
Query 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV 240
RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV
Sbjct 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV 240
Query 241 GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL 300
GDLLNNASGSSAELVSSGTIFGGAFLIGLVN+VLMTALATIGAFVYNLITDLIGGIEVTL
Sbjct 241 GDLLNNASGSSAELVSSGTIFGGAFLIGLVNVVLMTALATIGAFVYNLITDLIGGIEVTL 300
Query 301 ADRD 304
ADRD
Sbjct 301 ADRD 304
>gi|289756066|ref|ZP_06515444.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289696653|gb|EFD64082.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
Length=304
Score = 593 bits (1528), Expect = 1e-167, Method: Compositional matrix adjust.
Identities = 302/304 (99%), Positives = 303/304 (99%), Gaps = 0/304 (0%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
+TAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGH Q
Sbjct 1 MTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHHQ 60
Query 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE
Sbjct 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
Query 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA
Sbjct 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
Query 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV 240
RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV
Sbjct 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV 240
Query 241 GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL 300
GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL
Sbjct 241 GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL 300
Query 301 ADRD 304
ADRD
Sbjct 301 ADRD 304
>gi|340625040|ref|YP_004743492.1| hypothetical protein MCAN_00061 [Mycobacterium canettii CIPT
140010059]
gi|340003230|emb|CCC42347.1| putative conserved membrane protein [Mycobacterium canettii CIPT
140010059]
Length=304
Score = 570 bits (1469), Expect = 1e-160, Method: Compositional matrix adjust.
Identities = 292/304 (97%), Positives = 295/304 (98%), Gaps = 0/304 (0%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
+TAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQA RQ
Sbjct 1 MTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAAQRQ 60
Query 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
PPPVSH EGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQP+PDASLG GDG PAE
Sbjct 61 PPPVSHLEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPEPDASLGSGDGPPAE 120
Query 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
YASELPDLSGP PRAPQRNPAPARPAE GAGSRGDSAAGSSGGRSITAE R+ARVQLSA
Sbjct 121 TYASELPDLSGPAPRAPQRNPAPARPAEAGAGSRGDSAAGSSGGRSITAEGREARVQLSA 180
Query 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV 240
RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV
Sbjct 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV 240
Query 241 GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL 300
GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL
Sbjct 241 GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL 300
Query 301 ADRD 304
ADRD
Sbjct 301 ADRD 304
>gi|289441374|ref|ZP_06431118.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289414293|gb|EFD11533.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
Length=223
Score = 434 bits (1117), Expect = 6e-120, Method: Compositional matrix adjust.
Identities = 222/223 (99%), Positives = 223/223 (100%), Gaps = 0/223 (0%)
Query 82 LNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNP 141
+NRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNP
Sbjct 1 MNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNP 60
Query 142 APARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWST 201
APARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWST
Sbjct 61 APARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWST 120
Query 202 LKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIF 261
LKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIF
Sbjct 121 LKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIF 180
Query 262 GGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
GGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD
Sbjct 181 GGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 223
>gi|254233412|ref|ZP_04926738.1| hypothetical protein TBCG_00007 [Mycobacterium tuberculosis C]
gi|124603205|gb|EAY61480.1| hypothetical protein TBCG_00007 [Mycobacterium tuberculosis C]
Length=221
Score = 427 bits (1099), Expect = 8e-118, Method: Compositional matrix adjust.
Identities = 219/220 (99%), Positives = 220/220 (100%), Gaps = 0/220 (0%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
+TAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ
Sbjct 1 MTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
Query 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE
Sbjct 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
Query 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA
Sbjct 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
Query 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITV 220
RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITV
Sbjct 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITV 220
>gi|339293087|gb|AEJ45198.1| hypothetical protein CCDC5079_0008 [Mycobacterium tuberculosis
CCDC5079]
Length=212
Score = 413 bits (1062), Expect = 1e-113, Method: Compositional matrix adjust.
Identities = 211/212 (99%), Positives = 212/212 (100%), Gaps = 0/212 (0%)
Query 93 VTGPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAG 152
+TGPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAG
Sbjct 1 MTGPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAG 60
Query 153 SRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVAL 212
SRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVAL
Sbjct 61 SRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVAL 120
Query 213 FFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNI 272
FFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNI
Sbjct 121 FFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNI 180
Query 273 VLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
VLMTALATIGAFVYNLITDLIGGIEVTLADRD
Sbjct 181 VLMTALATIGAFVYNLITDLIGGIEVTLADRD 212
>gi|118615926|ref|YP_904258.1| hypothetical protein MUL_0007 [Mycobacterium ulcerans Agy99]
gi|118568036|gb|ABL02787.1| conserved hypothetical membrane protein [Mycobacterium ulcerans
Agy99]
Length=332
Score = 352 bits (903), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 208/306 (68%), Positives = 229/306 (75%), Gaps = 8/306 (2%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
+T+PN+PGA + GD N D VDR G HRA GPGR+PD D PPWQR A R G R
Sbjct 1 MTSPNDPGAPNMGDNFNGDAAVDRSGVHRAGPGPGRLPDPADSPPWQRGAARPGNPGPRP 60
Query 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
P EGR P +AADARL+RFISG SAP GPA V+ P P+ +A G+ PAE
Sbjct 61 PE--QSGEGRVPGPTSAADARLSRFISGTSAPAGGPAP-VKAPPPEAEAPPVRGENLPAE 117
Query 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSS-GGRSITAESRDARVQLS 179
YASELPDLSGP PR QR PAP R G S D AG++ G + ESR++R Q+S
Sbjct 118 VYASELPDLSGPAPRPVQRKPAPDR---GADNSGADPMAGAARSGAAEARESRESRFQVS 174
Query 180 ARRS-RGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNS 238
+RR+ RGPVRASMQIRRIDPWSTLKVS LLSVALFF+WMI VAFLYLVLGGMGVWAKLNS
Sbjct 175 SRRAARGPVRASMQIRRIDPWSTLKVSALLSVALFFIWMIAVAFLYLVLGGMGVWAKLNS 234
Query 239 NVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEV 298
NVGDLLNN SGSSAELVSSGTIFGGA LIGLVNIVLMTA+ATI AF YNL TDLIGGIEV
Sbjct 235 NVGDLLNNTSGSSAELVSSGTIFGGAVLIGLVNIVLMTAMATIAAFTYNLATDLIGGIEV 294
Query 299 TLADRD 304
TLADRD
Sbjct 295 TLADRD 300
>gi|118462372|ref|YP_879310.1| hypothetical protein MAV_0007 [Mycobacterium avium 104]
gi|118163659|gb|ABK64556.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=287
Score = 352 bits (903), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 201/312 (65%), Positives = 226/312 (73%), Gaps = 33/312 (10%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGP---GRIPDAGDPPPWQRAATRQSQAG 57
+++PNEPGA KG+ PN DG +R G RA T P GR P+AGD PPWQR +TR
Sbjct 1 MSSPNEPGAPKKGETPNGDGSAERAGVRRA-TPPAPGGRAPEAGDGPPWQRGSTRP---- 55
Query 58 HRQPPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGS 117
+QPPP + P A+ARLNRFISG +AP G A+ + P+ G+G
Sbjct 56 -QQPPPRQNEPPTEKRPGGGAEARLNRFISGTAAP--GTASHAKEPE--------RGEGP 104
Query 118 PAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDAR-- 175
PAEAYASELPDLSGP PR P R PA R AE + G GGR ++ ESR+ R
Sbjct 105 PAEAYASELPDLSGPVPRGPHRKPAAERGAE--------TTGGQGGGRPVSGESREGRDN 156
Query 176 ---VQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGV 232
VQ+S RR+RGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMI VAFLYLVLGGMGV
Sbjct 157 RDRVQVS-RRTRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMIAVAFLYLVLGGMGV 215
Query 233 WAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDL 292
W+KLNSNVGDLLNN SGSS ELVSSGTIFGGA LIGLVNIVL+TA+ATI AF+YNL TDL
Sbjct 216 WSKLNSNVGDLLNNTSGSSGELVSSGTIFGGAVLIGLVNIVLLTAMATIAAFIYNLATDL 275
Query 293 IGGIEVTLADRD 304
IGGIEVTLADRD
Sbjct 276 IGGIEVTLADRD 287
>gi|41406105|ref|NP_958941.1| hypothetical protein MAP0007 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41394453|gb|AAS02324.1| hypothetical protein MAP_0007 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336459822|gb|EGO38736.1| Transmembrane domain of unknown function (DUF3566) [Mycobacterium
avium subsp. paratuberculosis S397]
Length=283
Score = 342 bits (878), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 199/314 (64%), Positives = 222/314 (71%), Gaps = 41/314 (13%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGP---GRIPDAGDPPPWQRAATRQSQAG 57
+++PNEPGA KG+ PN DG +R G RA T P GR P+AGD PPWQR +TR Q
Sbjct 1 MSSPNEPGAPKKGETPNGDGSAERAGVRRA-TPPAPGGRAPEAGDGPPWQRGSTRPQQPP 59
Query 58 HRQ--PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGD 115
RQ PP +G A+ARLNRFISG A+ + P+ G+
Sbjct 60 PRQNEPPTEKRADG------GGAEARLNRFISGT-------ASHAKEPE--------RGE 98
Query 116 GSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDAR 175
G PAEAYASELPDLSGP PR P R PA R AE + G GGR ++ ESR+ R
Sbjct 99 GPPAEAYASELPDLSGPVPRGPHRKPAAERGAE--------TTGGQGGGRPVSGESREGR 150
Query 176 -----VQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGM 230
VQ+S RR+RGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMI VAFLYLVLGGM
Sbjct 151 DNRDRVQVS-RRTRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMIAVAFLYLVLGGM 209
Query 231 GVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLIT 290
GVW+KLNSNVGDLLNN SGSS ELVSSGTIFGGA LIGLVNIVL+TA+ATI AF+YNL T
Sbjct 210 GVWSKLNSNVGDLLNNTSGSSGELVSSGTIFGGAVLIGLVNIVLLTAMATIAAFIYNLAT 269
Query 291 DLIGGIEVTLADRD 304
DLIGGIEVTLADRD
Sbjct 270 DLIGGIEVTLADRD 283
>gi|240172101|ref|ZP_04750760.1| hypothetical protein MkanA1_22485 [Mycobacterium kansasii ATCC
12478]
Length=228
Score = 331 bits (848), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 175/235 (75%), Positives = 191/235 (82%), Gaps = 8/235 (3%)
Query 70 RPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDL 129
RP P +A DARLNRFI+G SAP P+ V+T P+PD DG P EAYASELPDL
Sbjct 2 RPPGPSSAVDARLNRFITGTSAPSGAPSGPVKTSAPEPDDVPARSDGPPVEAYASELPDL 61
Query 130 SGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRA 189
SGP PRA QR AP RP + A + ++ R++T ESR++RVQ+S RR RGPVRA
Sbjct 62 SGPIPRA-QRRSAPDRPVDAPATT-------TTASRAVTTESRESRVQVSPRRGRGPVRA 113
Query 190 SMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASG 249
SMQIRRIDPWSTLKVSLLLSVALFFVWMI+VAFLYLVLGGMGVWAKLNSNVGDLLNN SG
Sbjct 114 SMQIRRIDPWSTLKVSLLLSVALFFVWMISVAFLYLVLGGMGVWAKLNSNVGDLLNNTSG 173
Query 250 SSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
SSAELVSSGTIFGGA LIGLVNIVLMTA+ATIGAFVYNL TDLIGGIEVTLADRD
Sbjct 174 SSAELVSSGTIFGGAVLIGLVNIVLMTAMATIGAFVYNLTTDLIGGIEVTLADRD 228
>gi|183980042|ref|YP_001848333.1| hypothetical protein MMAR_0007 [Mycobacterium marinum M]
gi|183173368|gb|ACC38478.1| conserved hypothetical membrane protein [Mycobacterium marinum
M]
Length=300
Score = 331 bits (848), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 202/305 (67%), Positives = 222/305 (73%), Gaps = 6/305 (1%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
+T+PN+PGA + GD N D VDR G HRA TGPGR+PD D PPWQR A R G R
Sbjct 1 MTSPNDPGAPNMGDNFNGDAAVDRSGVHRAGTGPGRLPDPADSPPWQRGAARPGNPGPRP 60
Query 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAE 120
P EGR P +AADARL+RFISG SAP PA P + G+ PAE
Sbjct 61 PE--QSGEGRVPGPTSAADARLSRFISGTSAPAGAPAPVKAPPPEAEAPPV-RGENLPAE 117
Query 121 AYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
AYASELPDLSGP PR QR PAP R A+ A + + ESR++R Q+S+
Sbjct 118 AYASELPDLSGPAPRPVQRKPAPDRGADNSGAH--PVAGAARSAAAEARESRESRFQVSS 175
Query 181 RRS-RGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSN 239
RR+ RGPVRASMQIRRIDPWSTLKVS LLSVALFF+WMI VAFLYLVLGGMGVWAKLNSN
Sbjct 176 RRAARGPVRASMQIRRIDPWSTLKVSALLSVALFFIWMIAVAFLYLVLGGMGVWAKLNSN 235
Query 240 VGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVT 299
VGDLLNN SGSSAELVSSGTIFGGA LIGLVNIVLMTA+ATI AF+YNL TDLIGGIEVT
Sbjct 236 VGDLLNNTSGSSAELVSSGTIFGGAVLIGLVNIVLMTAMATIAAFIYNLATDLIGGIEVT 295
Query 300 LADRD 304
LADRD
Sbjct 296 LADRD 300
>gi|254773060|ref|ZP_05214576.1| hypothetical protein MaviaA2_00035 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=261
Score = 325 bits (834), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 182/275 (67%), Positives = 203/275 (74%), Gaps = 29/275 (10%)
Query 35 GRIPDAGDPPPWQRAATRQSQAGHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAPVT 94
GR P+AGD PPWQR +TR +QPPP + P A+ARLNRFISG +AP
Sbjct 11 GRAPEAGDGPPWQRGSTRP-----QQPPPRQNEPPTEKRPGGGAEARLNRFISGTAAP-- 63
Query 95 GPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSR 154
G A+ + P+ G+G PAEAYASELPDLSGP PR P R PA R AE
Sbjct 64 GTASHAKEPE--------RGEGPPAEAYASELPDLSGPVPRGPHRKPAAERGAE------ 109
Query 155 GDSAAGSSGGRSITAESRDAR-----VQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLS 209
+ G GGR ++ ESR+ R VQ+S RR+RGPVRASMQIRRIDPWSTLKVSLLLS
Sbjct 110 --TTGGQGGGRPVSGESREGRDNRDRVQVS-RRTRGPVRASMQIRRIDPWSTLKVSLLLS 166
Query 210 VALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGL 269
VALFFVWMI VAFLYLVLGGMGVW+KLNSNVGDLLNN SGSS ELVSSGTIFGGA LIG+
Sbjct 167 VALFFVWMIAVAFLYLVLGGMGVWSKLNSNVGDLLNNTSGSSGELVSSGTIFGGAVLIGV 226
Query 270 VNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
VNIVL+TA+ATI AF+YNL TDLIGGIEVTLADRD
Sbjct 227 VNIVLLTAMATIAAFIYNLATDLIGGIEVTLADRD 261
>gi|342862359|ref|ZP_08719000.1| hypothetical protein MCOL_25833 [Mycobacterium colombiense CECT
3035]
gi|342130216|gb|EGT83544.1| hypothetical protein MCOL_25833 [Mycobacterium colombiense CECT
3035]
Length=267
Score = 322 bits (826), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 181/272 (67%), Positives = 199/272 (74%), Gaps = 17/272 (6%)
Query 35 GRIPDAGDPPPWQRAATRQSQAGHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAPVT 94
GR + GD PPWQR R Q G RQ P + E RP+ +ARLNRFISG +AP
Sbjct 11 GRSQEGGDGPPWQRGTARPQQPGSRQGEPPT--EKRPSGQTGGVEARLNRFISGTAAP-- 66
Query 95 GPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSR 154
G + + P+P G+ SP +AYASELPDLSGP PR P R P P RPAE + S
Sbjct 67 GAPSHAKEPEP------ARGEASPTDAYASELPDLSGPLPRGPHRKPTPERPAESSSSSG 120
Query 155 GDS--AAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVAL 212
A + GR E RD RVQ+S RR+RGPVRASMQIRRIDPWSTLKVSLLLSVAL
Sbjct 121 AGRSATAETRDGR----EGRDNRVQVS-RRTRGPVRASMQIRRIDPWSTLKVSLLLSVAL 175
Query 213 FFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNI 272
FFVWMI VAFLYLVLGGMGVW+KLNSNVGDLLNN SGSS ELVSSGTIFGGA LIGLVNI
Sbjct 176 FFVWMIAVAFLYLVLGGMGVWSKLNSNVGDLLNNTSGSSGELVSSGTIFGGAVLIGLVNI 235
Query 273 VLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
VL+TA+ATI AFVYNL TDLIGG+EVTLADRD
Sbjct 236 VLLTAMATIAAFVYNLTTDLIGGVEVTLADRD 267
>gi|296167136|ref|ZP_06849544.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897519|gb|EFG77117.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=231
Score = 307 bits (787), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 170/237 (72%), Positives = 184/237 (78%), Gaps = 9/237 (3%)
Query 70 RPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDL 129
R PP+ +ARLNRFISG +AP G + P D DA G GDG PAEAYASELPDL
Sbjct 2 RQPEPPSGVEARLNRFISGTAAP--GAPHQAKNPPHDTDAPPGRGDGPPAEAYASELPDL 59
Query 130 SGPTPRAPQRNPAPARPAEGG--AGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPV 187
SGP PR PQR P R +E AGS SA + GR ++R+ R Q++ RR RGPV
Sbjct 60 SGPLPRGPQRKPPADRTSEASTTAGSARSSAPENREGR----DTRENRTQVT-RRPRGPV 114
Query 188 RASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNA 247
RASMQIRRIDPWSTLKVSLLLSVALFFVWMI VAFLYLVLGGMGVW+KLNSNVGDLLNN
Sbjct 115 RASMQIRRIDPWSTLKVSLLLSVALFFVWMIAVAFLYLVLGGMGVWSKLNSNVGDLLNNT 174
Query 248 SGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
SGSS ELVSSGTIFGGA LIGLVNIVL+TA+ATI AFVYNL TDLIGGIEVTLADRD
Sbjct 175 SGSSGELVSSGTIFGGAVLIGLVNIVLLTAMATIAAFVYNLTTDLIGGIEVTLADRD 231
>gi|254821355|ref|ZP_05226356.1| hypothetical protein MintA_15563 [Mycobacterium intracellulare
ATCC 13950]
Length=233
Score = 293 bits (749), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 167/243 (69%), Positives = 180/243 (75%), Gaps = 22/243 (9%)
Query 68 EGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAEAYASELP 127
E RP+ P +ARLNRFISG T A + +GSPAEAYASELP
Sbjct 7 EKRPSGPAGGVEARLNRFISG-----TAAPGAGAAAHAKDADAAPAREGSPAEAYASELP 61
Query 128 DLSGPTPRAPQRNPAPA-RPAEGGAGSRGDSAAGSSGGRSITAESRDAR-----VQLSAR 181
DLSGP PR P R PAP R E +G+ GR T+ESR+AR VQ+S R
Sbjct 62 DLSGPLPRGPHRKPAPEQRTPESPSGA----------GRPSTSESREARENRDRVQVS-R 110
Query 182 RSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVG 241
R+RGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMI VAFLYLVLGGMGVW+KLNSNVG
Sbjct 111 RTRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMIAVAFLYLVLGGMGVWSKLNSNVG 170
Query 242 DLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLA 301
DLLNN SGSS ELVSSGTIFGGA LIGLVNIVL+TA+ATI AFVYNL TDLIGGIEVTLA
Sbjct 171 DLLNNTSGSSGELVSSGTIFGGAVLIGLVNIVLLTAMATIAAFVYNLTTDLIGGIEVTLA 230
Query 302 DRD 304
DRD
Sbjct 231 DRD 233
>gi|15826872|ref|NP_301135.1| hypothetical protein ML0007 [Mycobacterium leprae TN]
gi|221229350|ref|YP_002502766.1| hypothetical protein MLBr_00007 [Mycobacterium leprae Br4923]
gi|17367947|sp|O32870.1|Y007_MYCLE RecName: Full=Uncharacterized protein ML0007
gi|2344820|emb|CAA94717.1| hypothetical protein [Mycobacterium leprae]
gi|13092419|emb|CAC29515.1| putative membrane protein [Mycobacterium leprae]
gi|219932457|emb|CAR70100.1| putative membrane protein [Mycobacterium leprae Br4923]
Length=303
Score = 288 bits (738), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 170/317 (54%), Positives = 202/317 (64%), Gaps = 27/317 (8%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPG---------RIPDAGDPPPWQRAAT 51
+T+PNE A + D DG V+R G HRA + PG P+ D PPWQR +
Sbjct 1 MTSPNESRAFNAADDLIGDGSVERAGLHRATSVPGESSEGLQRGHSPEPNDSPPWQRGSA 60
Query 52 RQSQAGHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAP-VTGPAAAVRTPQPDPDAS 110
R SQ+G+R P++ R +NP A+ R NRFISG +AP ++G +
Sbjct 61 RASQSGYRPSDPLT--TTRQSNPAPGANVRSNRFISGMTAPALSGQLPKKNNSTQALEPV 118
Query 111 LGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAE 170
L + E+YASELPDLSGP R P+P R G + R GR +
Sbjct 119 LMSNEVPFTESYASELPDLSGPVQRTVPCKPSPDR---GSSTPRM--------GRLEITK 167
Query 171 SR---DARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVL 227
R + R Q+S RRS GPVRASMQIRRIDPWS LKVSLLLSVALFFVWMI VAFLYL+L
Sbjct 168 VRGTGEIRSQIS-RRSHGPVRASMQIRRIDPWSMLKVSLLLSVALFFVWMIAVAFLYLLL 226
Query 228 GGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYN 287
GGMGVWAKLNSNVGDLLNN G+S ELVS+ TIFG A L+GLVNIVLMT +A I AFVYN
Sbjct 227 GGMGVWAKLNSNVGDLLNNTGGNSGELVSNSTIFGCAVLVGLVNIVLMTTMAAIAAFVYN 286
Query 288 LITDLIGGIEVTLADRD 304
L +DL+GG+EVTLAD D
Sbjct 287 LSSDLVGGVEVTLADLD 303
>gi|120401036|ref|YP_950865.1| hypothetical protein Mvan_0008 [Mycobacterium vanbaalenii PYR-1]
gi|119953854|gb|ABM10859.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=301
Score = 268 bits (686), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 173/321 (54%), Positives = 195/321 (61%), Gaps = 37/321 (11%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQ 60
+++P EPG GD N G H P + D PPWQR +A Q
Sbjct 1 MSSPQEPGYPRAGDAANGPAAGPAGSGHDGGPRPAHTAEGADVPPWQRGPA--GRARQHQ 58
Query 61 PPPVSHPEGRPTNPPAAADARLNRFISGASAPV----TGPAAA--------VRT----PQ 104
P + EG N P DARLNRF++G SAP T PA A VRT P+
Sbjct 59 APEGAQGEGPRPNAPGGLDARLNRFMAGGSAPAGSQETEPAPAPRNARTEVVRTEGNRPE 118
Query 105 PDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGG 164
P P G AYASE+PDLSGP P PQ+ RP A +G
Sbjct 119 PGPRPDQGA-------AYASEIPDLSGPRP--PQQRKPVDRPVPEQQPKPTPKPAPPAGN 169
Query 165 RSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLY 224
R+ VQ++ R GPVRASMQIRR+DPWSTLKVSLLLSV LFFVWMI VAFLY
Sbjct 170 RA---------VQVATRAHTGPVRASMQIRRVDPWSTLKVSLLLSVVLFFVWMIAVAFLY 220
Query 225 LVLGGMGVWAKLNSNVGDLLNNASGSS-AELVSSGTIFGGAFLIGLVNIVLMTALATIGA 283
LVLGGMGVW+KLNSNVGDLL +ASGSS ELVSSGTIFGGA LIGLVNIVL+TA ATIGA
Sbjct 221 LVLGGMGVWSKLNSNVGDLLTSASGSSGGELVSSGTIFGGAALIGLVNIVLLTAAATIGA 280
Query 284 FVYNLITDLIGGIEVTLADRD 304
F+YNL TDL+GG+EVTLADRD
Sbjct 281 FIYNLTTDLVGGVEVTLADRD 301
>gi|108796989|ref|YP_637186.1| hypothetical protein Mmcs_0008 [Mycobacterium sp. MCS]
gi|119866073|ref|YP_936025.1| hypothetical protein Mkms_0016 [Mycobacterium sp. KMS]
gi|126432621|ref|YP_001068312.1| hypothetical protein Mjls_0008 [Mycobacterium sp. JLS]
gi|108767408|gb|ABG06130.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119692162|gb|ABL89235.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126232421|gb|ABN95821.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=290
Score = 266 bits (679), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 159/274 (59%), Positives = 180/274 (66%), Gaps = 35/274 (12%)
Query 39 DAGDPPPWQRAA-TRQSQAGHRQPPPVSHPE--GRPTNPPAAADARLNRFISGASAPV-- 93
+ GD PPWQR R Q+ + P P G PT+ P A +RF SAP
Sbjct 44 EGGDVPPWQRGRPNRPPQSRAQDAPTRQEPSRGGSPTHAPGAEARNTSRFPVSGSAPTEE 103
Query 94 -TGPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAG 152
PAA VRTP+ + EAYASELPDLSGP PR PQR PA RPA
Sbjct 104 TKAPAAPVRTPE--------RTEAPRPEAYASELPDLSGPVPRPPQRKPATDRPA----- 150
Query 153 SRGDSAAGSSGGRSITAESRDA-RVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVA 211
A +R A R Q+ R +RGPVRASMQIRR+DPWS LKVSL+LSVA
Sbjct 151 --------------TEAPARPAARTQVGQRPARGPVRASMQIRRVDPWSALKVSLVLSVA 196
Query 212 LFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSS-AELVSSGTIFGGAFLIGLV 270
LFFVWMI VAFLYLVLGGMGVW+KLNSNVGDLL +ASGSS ELVSS TIFGGA L+GLV
Sbjct 197 LFFVWMIAVAFLYLVLGGMGVWSKLNSNVGDLLTSASGSSGGELVSSSTIFGGAALVGLV 256
Query 271 NIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
NIV++TA+AT GAF+YNL TDL+GG+EVTLADRD
Sbjct 257 NIVILTAMATAGAFIYNLTTDLVGGVEVTLADRD 290
>gi|145221414|ref|YP_001132092.1| hypothetical protein Mflv_0820 [Mycobacterium gilvum PYR-GCK]
gi|145213900|gb|ABP43304.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=314
Score = 262 bits (669), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 175/343 (52%), Positives = 201/343 (59%), Gaps = 68/343 (19%)
Query 1 VTAPNEPGALSKGDGPN-ADGLVDRGGA--HRAATGPGRIPDAGDPPPWQRA-ATRQSQA 56
+++P EPG GDG A+G + A H A P + D + PPWQR ATR Q
Sbjct 1 MSSPKEPGPAHAGDGAAPAEGPGNGSAAPGHDAGARPAHVADTTELPPWQRGPATRARQ- 59
Query 57 GHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRT-------------- 102
Q P P+G A DARLNRF++G SAP P A R
Sbjct 60 -QPQAPESPRPDGPRGGGQAGLDARLNRFMAGGSAPSGAPDAPPRNEPPRNDPPRNEPAP 118
Query 103 ---------PQPDPDASLGCGDGSPAEAYASELPDLSGPT-----------PRAPQRNPA 142
P+P P G +G+ AYASELPDLSGP P + PA
Sbjct 119 RNDRTDVVRPEPKPRPEGGRPEGA---AYASELPDLSGPRPPQPPRKPVERPSTESKPPA 175
Query 143 PARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTL 202
P+ G RVQ+++R+ RGPVRASMQIRR+DPWSTL
Sbjct 176 KGSPSPAG------------------------RVQVASRQHRGPVRASMQIRRVDPWSTL 211
Query 203 KVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSS-AELVSSGTIF 261
KVSLLLSV LFFVWMI VAFLYLVLGGMGVW+KLNSNVGDLL +ASGSS ELVSSGTIF
Sbjct 212 KVSLLLSVVLFFVWMIAVAFLYLVLGGMGVWSKLNSNVGDLLTSASGSSGGELVSSGTIF 271
Query 262 GGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
GGA LIGLVNIVL+TA ATIGAF+YNL TDL+GG+EVTLADRD
Sbjct 272 GGAALIGLVNIVLLTAGATIGAFIYNLTTDLVGGVEVTLADRD 314
>gi|315441704|ref|YP_004074583.1| hypothetical protein Mspyr1_00080 [Mycobacterium sp. Spyr1]
gi|315260007|gb|ADT96748.1| hypothetical protein Mspyr1_00080 [Mycobacterium sp. Spyr1]
Length=314
Score = 262 bits (669), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 174/343 (51%), Positives = 200/343 (59%), Gaps = 68/343 (19%)
Query 1 VTAPNEPGALSKGDGPN-ADGLVDRGGA--HRAATGPGRIPDAGDPPPWQRA-ATRQSQA 56
+++P EPG GDG A+G + A H A P + D + PPWQR ATR Q
Sbjct 1 MSSPKEPGPAHAGDGAAPAEGPGNGSAAPGHDAGARPAHVADTTELPPWQRGPATRARQ- 59
Query 57 GHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRT-------------- 102
Q P P+G DARLNRF++G SAP P A R
Sbjct 60 -QSQAPEAPRPDGPRGGNQGGLDARLNRFMAGGSAPSGAPDAPPRNEPPRNDPPRNEPAP 118
Query 103 ---------PQPDPDASLGCGDGSPAEAYASELPDLSGPT-----------PRAPQRNPA 142
P+P P G +G+ AYASELPDLSGP P + PA
Sbjct 119 RNDRTDVVRPEPKPRPEGGRPEGA---AYASELPDLSGPRPPQPPRKPVERPSTESKPPA 175
Query 143 PARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTL 202
P+ G RVQ+++R+ RGPVRASMQIRR+DPWSTL
Sbjct 176 KGSPSPAG------------------------RVQVASRQHRGPVRASMQIRRVDPWSTL 211
Query 203 KVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSS-AELVSSGTIF 261
KVSLLLSV LFFVWMI VAFLYLVLGGMGVW+KLNSNVGDLL +ASGSS ELVSSGTIF
Sbjct 212 KVSLLLSVVLFFVWMIAVAFLYLVLGGMGVWSKLNSNVGDLLTSASGSSGGELVSSGTIF 271
Query 262 GGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
GGA LIGLVNIVL+TA ATIGAF+YNL TDL+GG+EVTLADRD
Sbjct 272 GGAALIGLVNIVLLTAGATIGAFIYNLTTDLVGGVEVTLADRD 314
>gi|118470742|ref|YP_884430.1| hypothetical protein MSMEG_0007 [Mycobacterium smegmatis str.
MC2 155]
gi|118172029|gb|ABK72925.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=260
Score = 248 bits (634), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 155/312 (50%), Positives = 188/312 (61%), Gaps = 60/312 (19%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGG-----AHRAATGPGRIPDAGDPPPWQRAATRQSQ 55
+++PNEPG GD P A GG + + G I D+GD PPWQR +R +Q
Sbjct 1 MSSPNEPGYPRAGDRPGATNGTGPGGDSGALSSNSTRATGHITDSGDVPPWQRGVSRTAQ 60
Query 56 AGHRQP-PPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAVRTPQPDPDASLGCG 114
QP P+ E +P P + +P P+
Sbjct 61 ----QPGQPLGDTEQQPRPAPRPEPRETRQ-------------------EPRPEHR---- 93
Query 115 DGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDA 174
EAYASELPDLSGP PR+PQR +G + A +
Sbjct 94 ----TEAYASELPDLSGPVPRSPQRK---------------------TGADAPRASAPTT 128
Query 175 RVQLSAR-RSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVW 233
R+Q++ R + GPVRASMQIRR+DPW+ LKVSL+LSV LFFVWMI VAFLYLVLGGMGVW
Sbjct 129 RIQVANRPQPSGPVRASMQIRRVDPWTVLKVSLVLSVVLFFVWMIAVAFLYLVLGGMGVW 188
Query 234 AKLNSNVGDLLNNASGSS-AELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDL 292
+KLNSNVGDLL +ASGSS ELVSSGTIFGGA LIGLVNIV+++A+AT+GAF+YNL TDL
Sbjct 189 SKLNSNVGDLLTSASGSSGGELVSSGTIFGGAALIGLVNIVVLSAMATVGAFIYNLTTDL 248
Query 293 IGGIEVTLADRD 304
+GGIEVTLADRD
Sbjct 249 VGGIEVTLADRD 260
>gi|169627128|ref|YP_001700777.1| hypothetical protein MAB_0020 [Mycobacterium abscessus ATCC 19977]
gi|169239095|emb|CAM60123.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=313
Score = 236 bits (601), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 161/347 (47%), Positives = 192/347 (56%), Gaps = 77/347 (22%)
Query 1 VTAPNEPGA-------LSKGDGPNADGLVDRGGAHRAATGPGRIPD----AGDPPPWQRA 49
+++PN+PG + + D P R G+H AATGP RIPD G P
Sbjct 1 MSSPNDPGESGANERPVDQADLPPWQRAERRRGSHAAATGPQRIPDREPRPGGAPTEIIP 60
Query 50 ATRQSQAGHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAP---VTGPAAAVRTPQPD 106
+ A P P GRP+ A DARL+RFISG +AP T PA A P+P+
Sbjct 61 IVDDASAPPTAPTSAPRPSGRPS---AGLDARLSRFISGTAAPAGFTTPPARAEADPEPE 117
Query 107 PDASL-----------------------GCGDGSPAEAYAS-----ELPDLSGPTPRAPQ 138
P + G P A S +LPDLSGP PR P+
Sbjct 118 PYDEVPERPERSERADRPEPPRAGTPWGESGAQEPVRASKSALKNEDLPDLSGPVPRPPR 177
Query 139 RNPAPARPAEGGAGSRGDSAAGSSGGRSITA-ESRDARVQLSARRSRGPVRASMQIRRID 197
R+ A + GG ++T SR RGP+RA+MQIRRID
Sbjct 178 RSDA------------------AQGGSAVTVGPSR-----------RGPLRAAMQIRRID 208
Query 198 PWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSS 257
PW+TLKVSL+LSV FFVWMI VA LYLVLG MGVW+KLN NVG+L+ N+ G ELVSS
Sbjct 209 PWATLKVSLVLSVVFFFVWMIAVAMLYLVLGAMGVWSKLNENVGELITNSGG--GELVSS 266
Query 258 GTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
GTIFG A LIGLVNIVLMTA ATIGAF+YNL TDL+GGIE+TLADRD
Sbjct 267 GTIFGTALLIGLVNIVLMTAAATIGAFIYNLTTDLVGGIEITLADRD 313
>gi|339296736|gb|AEJ48846.1| hypothetical protein CCDC5180_0009 [Mycobacterium tuberculosis
CCDC5180]
Length=114
Score = 228 bits (582), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 114/114 (100%), Positives = 114/114 (100%), Gaps = 0/114 (0%)
Query 191 MQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGS 250
MQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGS
Sbjct 1 MQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGS 60
Query 251 SAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
SAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD
Sbjct 61 SAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 114
>gi|333988647|ref|YP_004521261.1| hypothetical protein JDM601_0007 [Mycobacterium sp. JDM601]
gi|333484615|gb|AEF34007.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=282
Score = 224 bits (572), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 153/321 (48%), Positives = 177/321 (56%), Gaps = 56/321 (17%)
Query 1 VTAPNEPGALSKGDGPNADGLVDRGGA----HRAATGPGRIPDAGDPPPWQRAATRQSQA 56
++APNEPG G GP G A R PGR+ D G+PPPWQR + A
Sbjct 1 MSAPNEPG--HPGSGPGKAESAGSGSADVAGQRPVARPGRVADTGEPPPWQRGGQSRPPA 58
Query 57 GHRQPPPVSHPEGRPTNPPAAA------DARLNRFISGASAP-------VTGPAAAVRTP 103
R P E + P AA D RL RF+SG +AP PA P
Sbjct 59 AARPAEPARPAEPARSAEPNAAGHSPGVDERLRRFVSGTAAPGAPQPAKPAKPAKQTAKP 118
Query 104 QPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSG 163
P + + Y SELPDLS P R P PAR
Sbjct 119 VQPPAKPAPAAHAATDDTYGSELPDLSEPAQRRP-----PAR------------------ 155
Query 164 GRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFL 223
R Q+SA RGPVRASMQIRRIDPWS LKVSLLLS ALFFVWMI VA L
Sbjct 156 -----------RTQVSAG-PRGPVRASMQIRRIDPWSALKVSLLLSTALFFVWMIAVAVL 203
Query 224 YLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGA 283
Y++LG MGVW KLNSNVGDLL N S+AELVS +IFGGA LIGLVN+V++TA+AT+G
Sbjct 204 YVMLGAMGVWNKLNSNVGDLLTN--NSAAELVSGSSIFGGAALIGLVNVVVLTAMATLGV 261
Query 284 FVYNLITDLIGGIEVTLADRD 304
+YNL TD++GG+EVTLADRD
Sbjct 262 VIYNLSTDVVGGVEVTLADRD 282
>gi|325677510|ref|ZP_08157174.1| putative proline rich protein [Rhodococcus equi ATCC 33707]
gi|325551757|gb|EGD21455.1| putative proline rich protein [Rhodococcus equi ATCC 33707]
Length=354
Score = 169 bits (429), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 122/304 (41%), Positives = 157/304 (52%), Gaps = 69/304 (22%)
Query 28 HRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQPPPVSHPEGR-----PTNPPAAADARL 82
R A G P PPWQR G +Q PP P G P PPA A
Sbjct 93 ERGADGAKPDPKPVQTPPWQR--------GQQQKPPAQGPNGSARPEAPGKPPAKAQP-A 143
Query 83 NRFISGASAP-------------VTGPAAAVRTPQPD----PDASLGCGDGSPAEA---- 121
++G +AP VTG AA + P D A DG P +
Sbjct 144 RPVVTGTAAPKPPQGDGRPARPVVTG-TAAPKPPAADRRDVAKAKAAAIDG-PTRSIART 201
Query 122 -YASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSA 180
A ++PDLS PR RPA G A +AA + G
Sbjct 202 DLAKDMPDLSA-VPRQ--------RPASGSARKAALTAAVTEDGE--------------- 237
Query 181 RRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV 240
P+RA++Q+RRIDPWSTLKVSL++SVALFFVWM+ V LYLVL GMGVW +LN+
Sbjct 238 -----PLRATVQLRRIDPWSTLKVSLVISVALFFVWMVAVGLLYLVLDGMGVWDRLNNAF 292
Query 241 GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTL 300
+ ++SG LV+SG +FG + L+G++N+VL TALATIG+F+YNL +DL+GG++VTL
Sbjct 293 TEFTTDSSGGG--LVTSGQVFGYSALVGVMNVVLFTALATIGSFIYNLCSDLVGGVQVTL 350
Query 301 ADRD 304
AD D
Sbjct 351 ADPD 354
>gi|111020670|ref|YP_703642.1| proline rich protein [Rhodococcus jostii RHA1]
gi|110820200|gb|ABG95484.1| possible proline rich protein [Rhodococcus jostii RHA1]
Length=467
Score = 168 bits (425), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 73/118 (62%), Positives = 97/118 (83%), Gaps = 2/118 (1%)
Query 187 VRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNN 246
+RA++Q+R+IDPWSTLK+S ++SV+LFFVWM+ V LY+VL GMGVW +LN+ D++
Sbjct 352 LRATVQVRKIDPWSTLKISSVISVSLFFVWMVAVGLLYVVLDGMGVWDRLNNAFTDIV-- 409
Query 247 ASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
A SS LV++G +FG A +IGL N+VL TALATIGAF+YNL +DL+GG+EVTLADRD
Sbjct 410 AESSSDGLVTAGQVFGYAAVIGLANMVLFTALATIGAFIYNLCSDLVGGVEVTLADRD 467
>gi|229491180|ref|ZP_04385008.1| putative proline rich protein [Rhodococcus erythropolis SK121]
gi|229321918|gb|EEN87711.1| putative proline rich protein [Rhodococcus erythropolis SK121]
Length=464
Score = 168 bits (425), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 73/119 (62%), Positives = 97/119 (82%), Gaps = 2/119 (1%)
Query 186 PVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLN 245
P+RA++QIRRIDPWSTLK++ ++SV+LFFVWM+ V LY+VL GMGVW +LN+ D++
Sbjct 348 PLRATVQIRRIDPWSTLKITSVISVSLFFVWMVAVGLLYVVLDGMGVWDRLNNAFTDIV- 406
Query 246 NASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
A G S LV++G +FG A LIG+ N+VL TAL TIG+F+YNL +DL+GG+EVTLADRD
Sbjct 407 -ADGGSDGLVTAGQVFGYAALIGIANMVLFTALVTIGSFIYNLCSDLVGGVEVTLADRD 464
>gi|134096628|ref|YP_001102289.1| hypothetical protein SACE_0010 [Saccharopolyspora erythraea NRRL
2338]
gi|291005718|ref|ZP_06563691.1| hypothetical protein SeryN2_14454 [Saccharopolyspora erythraea
NRRL 2338]
gi|133909251|emb|CAL99363.1| hypothetical protein Rv0007 [Saccharopolyspora erythraea NRRL
2338]
Length=271
Score = 165 bits (417), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 84/164 (52%), Positives = 112/164 (69%), Gaps = 4/164 (2%)
Query 145 RPAEGGAGSRGDSAAGSSGGRSITAESRDARVQL---SARR-SRGPVRASMQIRRIDPWS 200
RPA+ +R + SSGGR+ R SARR SRGP RAS+Q++R+DPWS
Sbjct 108 RPAQEQETARNEVPQSSSGGRTAVTFGSAGRATTAAGSARRPSRGPRRASLQVKRVDPWS 167
Query 201 TLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTI 260
LK++L+LSVALFFVWMI VA LY VL GMGVW +LN +L ++ L+S+G +
Sbjct 168 VLKLALVLSVALFFVWMIAVAVLYGVLDGMGVWDQLNGTFTELTQPDDAAAEPLISAGRV 227
Query 261 FGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
FG A +IG +NIVL+TALAT+ AF+YN+ D GG+EVTL++R+
Sbjct 228 FGVASIIGAINIVLITALATVAAFIYNVAADFAGGVEVTLSERE 271
>gi|226303500|ref|YP_002763458.1| hypothetical protein RER_00110 [Rhodococcus erythropolis PR4]
gi|226182615|dbj|BAH30719.1| hypothetical protein RER_00110 [Rhodococcus erythropolis PR4]
Length=377
Score = 164 bits (415), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 73/119 (62%), Positives = 97/119 (82%), Gaps = 2/119 (1%)
Query 186 PVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLN 245
P+RA++QIRRIDPWSTLK++ ++SV+LFFVWM+ V LY+VL GMGVW +LN+ D++
Sbjct 261 PLRATVQIRRIDPWSTLKITSVISVSLFFVWMVAVGLLYVVLDGMGVWDRLNNAFTDIV- 319
Query 246 NASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
A G S LV++G +FG A LIG+ N+VL TAL TIG+F+YNL +DL+GG+EVTLADRD
Sbjct 320 -ADGGSDGLVTAGQVFGYAALIGIANMVLFTALVTIGSFIYNLCSDLVGGVEVTLADRD 377
>gi|226362910|ref|YP_002780690.1| hypothetical protein ROP_34980 [Rhodococcus opacus B4]
gi|226241397|dbj|BAH51745.1| hypothetical membrane protein [Rhodococcus opacus B4]
Length=359
Score = 164 bits (415), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 73/118 (62%), Positives = 97/118 (83%), Gaps = 2/118 (1%)
Query 187 VRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNN 246
+RA++Q+R+IDPWSTLK+S ++SV+LFFVWM+ V LY+VL GMGVW +LN+ D++
Sbjct 244 LRATVQVRKIDPWSTLKISSVISVSLFFVWMVAVGLLYVVLDGMGVWDRLNNAFTDIV-- 301
Query 247 ASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
A SS LV++G +FG A +IGL N+VL TALATIGAF+YNL +DL+GG+EVTLADRD
Sbjct 302 AESSSDGLVTAGQVFGYAAVIGLANMVLFTALATIGAFIYNLCSDLVGGVEVTLADRD 359
>gi|54021972|ref|YP_116214.1| hypothetical protein nfa80 [Nocardia farcinica IFM 10152]
gi|54013480|dbj|BAD54850.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=389
Score = 162 bits (410), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 74/119 (63%), Positives = 97/119 (82%), Gaps = 1/119 (0%)
Query 186 PVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLN 245
P+RA++QIRRIDPWSTLK+SL++SVALFFVWM+ V LY+VL GMGVW +LN+ D+++
Sbjct 272 PLRATVQIRRIDPWSTLKISLVISVALFFVWMLAVGLLYIVLEGMGVWERLNNTFTDMVS 331
Query 246 NASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
SG SA L+ +GT+FG A +IGL+N+VL TAL T+G F+YN DL+GGI+VTLAD D
Sbjct 332 QDSG-SAGLIDAGTVFGYAGVIGLINVVLFTALGTVGTFIYNQCCDLVGGIQVTLADPD 389
>gi|312137525|ref|YP_004004861.1| integral membrane protein [Rhodococcus equi 103S]
gi|311886864|emb|CBH46172.1| putative integral membrane protein [Rhodococcus equi 103S]
Length=189
Score = 159 bits (401), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 88/182 (49%), Positives = 117/182 (65%), Gaps = 31/182 (17%)
Query 123 ASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSARR 182
A ++PDLS PR RPA G A +AA + G
Sbjct 39 AKDMPDLSA-VPRQ--------RPASGSARKAALTAAVTEDGE----------------- 72
Query 183 SRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGD 242
P+RA++Q+RRIDPWSTLKVSL++SVALFFVWM+ V LYLVL GMGVW +LN+ +
Sbjct 73 ---PLRATVQLRRIDPWSTLKVSLVISVALFFVWMVAVGLLYLVLDGMGVWDRLNNAFTE 129
Query 243 LLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLAD 302
++SG LV+SG +FG + L+G++N+VL TALATIG+F+YNL +DL+GG++VTLAD
Sbjct 130 FTTDSSGGG--LVTSGQVFGYSALVGVMNVVLFTALATIGSFIYNLCSDLVGGVQVTLAD 187
Query 303 RD 304
D
Sbjct 188 PD 189
>gi|296137761|ref|YP_003645004.1| hypothetical protein Tpau_0011 [Tsukamurella paurometabola DSM
20162]
gi|296025895|gb|ADG76665.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=232
Score = 153 bits (386), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 70/127 (56%), Positives = 100/127 (79%), Gaps = 2/127 (1%)
Query 178 LSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLN 237
+ + + GPVRA+MQ+R IDPW+T K++ ++SV LFFVWMI V LY++L GMGVW+K+N
Sbjct 108 VKTQAATGPVRAAMQLRSIDPWTTFKLAGVVSVVLFFVWMIAVGALYVILNGMGVWSKIN 167
Query 238 SNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIE 297
+ L+N +GSS ++++G +F + L+GL N+VL+TALAT+GAF+YNL DL+GG+E
Sbjct 168 DSFQTLVNEQAGSS--ILTAGDVFLYSGLVGLANVVLLTALATVGAFIYNLCADLVGGVE 225
Query 298 VTLADRD 304
VTLADRD
Sbjct 226 VTLADRD 232
>gi|256374168|ref|YP_003097828.1| hypothetical protein Amir_0008 [Actinosynnema mirum DSM 43827]
gi|255918471|gb|ACU33982.1| hypothetical protein Amir_0008 [Actinosynnema mirum DSM 43827]
Length=288
Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 66/115 (58%), Positives = 90/115 (79%), Gaps = 0/115 (0%)
Query 190 SMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASG 249
S+Q++R+DPWS LK++L+LSVALFFVW++ V LY VL GMGVW K+N+ DLL +
Sbjct 174 SLQVKRVDPWSVLKLALVLSVALFFVWLVAVGVLYGVLNGMGVWDKINNTANDLLQSEEA 233
Query 250 SSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
S L+S+G +FG + ++G VNIVL TALAT+GAFVYN+ DL GG+EVTL++R+
Sbjct 234 SGDPLISAGRVFGVSAIVGAVNIVLFTALATVGAFVYNVSADLAGGLEVTLSERE 288
>gi|326383916|ref|ZP_08205600.1| hypothetical protein SCNU_13323 [Gordonia neofelifaecis NRRL
B-59395]
gi|326197375|gb|EGD54565.1| hypothetical protein SCNU_13323 [Gordonia neofelifaecis NRRL
B-59395]
Length=288
Score = 144 bits (364), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 68/137 (50%), Positives = 100/137 (73%), Gaps = 1/137 (0%)
Query 169 AESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLG 228
A +R + + AR P+RA++QIRR+DPWS KVS +LSVA FF+WMI VA LY ++G
Sbjct 152 ALARASSQTIPARSVGTPLRAAVQIRRVDPWSVFKVSGVLSVAGFFIWMIAVAILYGIMG 211
Query 229 GMGVWAKLNSNVGDLL-NNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYN 287
GMG+W ++NS+ G L+ ++ S S +L+S G +F + L G+V +L+TAL+TI A++YN
Sbjct 212 GMGIWDQINSSFGTLVSSDGSTSGQDLISGGQVFMFSALFGIVAAILLTALSTISAYIYN 271
Query 288 LITDLIGGIEVTLADRD 304
+ D++GG+EVTLAD D
Sbjct 272 VCADMVGGVEVTLADLD 288
>gi|284988641|ref|YP_003407195.1| hypothetical protein Gobs_0012 [Geodermatophilus obscurus DSM
43160]
gi|284061886|gb|ADB72824.1| conserved hypothetical protein [Geodermatophilus obscurus DSM
43160]
Length=237
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 65/127 (52%), Positives = 93/127 (74%), Gaps = 1/127 (0%)
Query 179 SARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNS 238
S + RGP RA +Q+R ID +S LK+SL+LS+A+FF+WM+ V LY VL G+GV+ LN
Sbjct 111 SGKAPRGPRRARLQLRHIDTFSALKISLVLSIAMFFIWMVAVGVLYGVLSGLGVFETLND 170
Query 239 NVGDL-LNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIE 297
G L + +E+++ G +FGGA +IG +NIVLMTAL T+ AF+YN+ +DL+GG+E
Sbjct 171 LFGQLGSASGGDGGSEVITPGIVFGGAAVIGAINIVLMTALCTVAAFIYNMCSDLVGGLE 230
Query 298 VTLADRD 304
VTL++RD
Sbjct 231 VTLSERD 237
>gi|331693906|ref|YP_004330145.1| hypothetical protein Psed_0007 [Pseudonocardia dioxanivorans
CB1190]
gi|326948595|gb|AEA22292.1| hypothetical protein Psed_0007 [Pseudonocardia dioxanivorans
CB1190]
Length=367
Score = 141 bits (356), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 64/125 (52%), Positives = 94/125 (76%), Gaps = 3/125 (2%)
Query 182 RSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVG 241
R R P +A++Q++R+DPWS LK++L+L+V +FF+WM+ + LY VL GMGVW +LN
Sbjct 244 RPRRPRQAALQLKRLDPWSVLKLALVLAVVIFFIWMVAIGVLYGVLDGMGVWDRLNGTYN 303
Query 242 DLLN--NASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVT 299
DL++ +ASG SA L+S+G +FG A ++G +N +L TIGAFVYN+ DL+GG+EVT
Sbjct 304 DLVSGESASGGSA-LISAGRVFGLAAVVGAINSLLFAVAMTIGAFVYNVSADLVGGVEVT 362
Query 300 LADRD 304
L++RD
Sbjct 363 LSERD 367
>gi|330464885|ref|YP_004402628.1| hypothetical protein VAB18032_04510 [Verrucosispora maris AB-18-032]
gi|328807856|gb|AEB42028.1| hypothetical protein VAB18032_04510 [Verrucosispora maris AB-18-032]
Length=301
Score = 139 bits (351), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 75/163 (47%), Positives = 103/163 (64%), Gaps = 5/163 (3%)
Query 145 RPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKV 204
RPA GG G A R + R AR +S+ SRGP RA + ++RIDPWS +K
Sbjct 141 RPANGGGLPPGVGTAAVGAAR-VGEAVRAARTSVSSAASRGPRRARLNLKRIDPWSVMKF 199
Query 205 SLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASG---SSAELVSSGTIF 261
+ +SV LF V ++ + LYL L MGV+ +N ++ DL+N G S ++ ++G IF
Sbjct 200 AFAVSVVLFIVIVVATSVLYLALDAMGVFTSVNDSLSDLVNAGGGQGTSGFQITATGVIF 259
Query 262 GGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
+ LIG VN+VL TALAT+GAFVYN+ DL+GGIE+TLA+RD
Sbjct 260 -TSMLIGAVNVVLFTALATLGAFVYNVCADLVGGIELTLAERD 301
>gi|262200053|ref|YP_003271261.1| hypothetical protein Gbro_0007 [Gordonia bronchialis DSM 43247]
gi|262083400|gb|ACY19368.1| hypothetical protein Gbro_0007 [Gordonia bronchialis DSM 43247]
Length=316
Score = 138 bits (348), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 66/126 (53%), Positives = 94/126 (75%), Gaps = 4/126 (3%)
Query 182 RSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVG 241
R+ P+RA++QIRRIDPW+T K++ +L+V F +WMI +A LYLVL GMGV ++N++
Sbjct 192 RANTPLRAAIQIRRIDPWATFKITAVLAVIGFIIWMIAIAVLYLVLDGMGVREQVNTSFA 251
Query 242 DLLNNASGSSAE---LVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEV 298
+ A GSSA+ + S+ T+FG A L+G +N +L+TALATIG+++YN+ DLIGG EV
Sbjct 252 -TVATADGSSAQSDDIFSATTVFGAAALLGAINAILITALATIGSYIYNICADLIGGAEV 310
Query 299 TLADRD 304
TLAD D
Sbjct 311 TLADLD 316
>gi|333917687|ref|YP_004491268.1| hypothetical protein AS9A_0008 [Amycolicicoccus subflavus DQS3-9A1]
gi|333479908|gb|AEF38468.1| hypothetical protein AS9A_0008 [Amycolicicoccus subflavus DQS3-9A1]
Length=127
Score = 136 bits (342), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 67/127 (53%), Positives = 95/127 (75%), Gaps = 2/127 (1%)
Query 178 LSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLN 237
++A+ ++ +RAS+QIR IDPWSTLK+S ++S ALF VWMI V LYLVL M VW +LN
Sbjct 3 VTAKPTQAGLRASVQIREIDPWSTLKISAIISAALFLVWMIAVGTLYLVLEFMNVWDRLN 62
Query 238 SNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIE 297
S ++++ G++ E++S+ IFG IG +N++L TAL T+ +F+YNL +DL+GGIE
Sbjct 63 SAFLEIVDE--GAAGEIISASQIFGWTAAIGFINMILFTALLTLASFIYNLASDLVGGIE 120
Query 298 VTLADRD 304
VTLADRD
Sbjct 121 VTLADRD 127
>gi|300781945|ref|YP_003762236.1| hypothetical protein AMED_0008 [Amycolatopsis mediterranei U32]
gi|299791459|gb|ADJ41834.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340523298|gb|AEK38503.1| hypothetical protein RAM_00040 [Amycolatopsis mediterranei S699]
Length=221
Score = 136 bits (342), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 63/118 (54%), Positives = 89/118 (76%), Gaps = 2/118 (1%)
Query 189 ASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLL--NN 246
AS+Q++R DPWS LK+SL+L VA+FFVW++ V LY VL GMGVW KLN L+
Sbjct 104 ASLQVKRFDPWSVLKLSLVLGVAMFFVWLVAVGVLYTVLDGMGVWDKLNGTYSSLVGGEG 163
Query 247 ASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
A+ S+ L+S+G +FG A ++G +NIVL++ALAT+ AF+YN+ DL GG+EVTL++R+
Sbjct 164 ANASAEPLISAGRVFGIAAILGAINIVLVSALATVSAFIYNVSADLAGGLEVTLSERE 221
>gi|296392548|ref|YP_003657432.1| hypothetical protein Srot_0110 [Segniliparus rotundus DSM 44985]
gi|296179695|gb|ADG96601.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=199
Score = 135 bits (340), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 62/123 (51%), Positives = 90/123 (74%), Gaps = 0/123 (0%)
Query 180 ARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSN 239
+++ GP+RA +Q+R IDPW+ LKV+ + V LF WMI VA LY +L +GV K+NS
Sbjct 75 SQQPSGPLRAQVQVRWIDPWTVLKVTAAVMVVLFVAWMIGVAVLYALLEAIGVMGKINSG 134
Query 240 VGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVT 299
+GD A+ ++V+ G +FG + L+G+VNIVL+TALAT+GAF++NL DL+GG+E+T
Sbjct 135 LGDFSTAAADQGGDIVTPGMVFGFSALVGIVNIVLVTALATVGAFIFNLCVDLVGGVEIT 194
Query 300 LAD 302
LAD
Sbjct 195 LAD 197
>gi|343928741|ref|ZP_08768186.1| hypothetical protein GOALK_120_01680 [Gordonia alkanivorans NBRC
16433]
gi|343761490|dbj|GAA15112.1| hypothetical protein GOALK_120_01680 [Gordonia alkanivorans NBRC
16433]
Length=305
Score = 135 bits (340), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 100/298 (34%), Positives = 145/298 (49%), Gaps = 62/298 (20%)
Query 44 PPWQRAATRQS------QAGHRQ---------------------PPPVSH----PEGRPT 72
PPWQR +S +A R+ PPP S P G T
Sbjct 33 PPWQRGPVEESSGSAPTEAFGREDDSRPNAQDNPGGLPPESRGGPPPESRGGPPPRGIVT 92
Query 73 NPPAAADARLNRFISGASAPVTG------PAAAVRTPQPDPDASLGCGDGSPAEAYASEL 126
N AA + +SG APVT P +AV T +P G + + L
Sbjct 93 NSGTAAAS-----MSGQQAPVTKLDSPRRPGSAVATEEP------GFVESPTSTIDRENL 141
Query 127 PDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGP 186
P P RP E + A S R + A S
Sbjct 142 PGHDLPDLDQIHHTADLKRPPEAPPAAAPSKVAPRSAPRQVGAGSA-------------- 187
Query 187 VRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNN 246
+RA++Q+RRIDPW+T K++ +LS FF+WMI VA LYL+ GMG+W ++N++ G L+ +
Sbjct 188 LRAAVQLRRIDPWATFKIAAVLSFVGFFIWMIAVAVLYLIFDGMGIWDQVNNSFGTLVAD 247
Query 247 ASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
S ++ +++ +GT+FG A L+G VN +L+TA+AT+G+++YN+ DL+GG EVTLAD D
Sbjct 248 ESSTAGDVIGAGTVFGVAALLGAVNAILLTAIATVGSYIYNICADLVGGAEVTLADLD 305
>gi|302531363|ref|ZP_07283705.1| predicted protein [Streptomyces sp. AA4]
gi|302440258|gb|EFL12074.1| predicted protein [Streptomyces sp. AA4]
Length=218
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 63/118 (54%), Positives = 88/118 (75%), Gaps = 2/118 (1%)
Query 189 ASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLL--NN 246
AS+QI+R DPWS LK++L+L VA+FFVW++ V LY VL GMGVW KLN L+
Sbjct 101 ASLQIKRFDPWSVLKLALVLGVAMFFVWLVAVGVLYTVLDGMGVWDKLNGTYSSLVGGEG 160
Query 247 ASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
A+ S L+S+G +FG A ++G +NIVL++ALAT+ AF+YN+ DL GG+EVTL++R+
Sbjct 161 ANASPDPLISAGRVFGIAAILGAINIVLVSALATVSAFIYNVSADLAGGLEVTLSERE 218
>gi|159035683|ref|YP_001534936.1| hypothetical protein Sare_0009 [Salinispora arenicola CNS-205]
gi|157914518|gb|ABV95945.1| conserved hypothetical protein [Salinispora arenicola CNS-205]
Length=302
Score = 127 bits (318), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 70/136 (52%), Positives = 95/136 (70%), Gaps = 4/136 (2%)
Query 172 RDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMG 231
R AR +S+ SRGP RA + +RRIDPWS +K + +SV LF V ++ + LYL L MG
Sbjct 168 RAARTAVSSAASRGPRRARLNLRRIDPWSVMKFAFAVSVVLFIVVVVATSVLYLALDAMG 227
Query 232 VWAKLNSNVGDLLNNASGSSA---ELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNL 288
V+A +N ++ DL+N G SA ++ + G I A LIGLVN+VL TALAT+GAFVYN+
Sbjct 228 VFASVNDSLSDLVNAGGGQSADGFQITARGVILSSA-LIGLVNVVLFTALATLGAFVYNV 286
Query 289 ITDLIGGIEVTLADRD 304
DL+GG+E+TLA+RD
Sbjct 287 CADLVGGVELTLAERD 302
>gi|257054097|ref|YP_003131929.1| hypothetical protein Svir_00080 [Saccharomonospora viridis DSM
43017]
gi|256583969|gb|ACU95102.1| hypothetical protein Svir_00080 [Saccharomonospora viridis DSM
43017]
Length=248
Score = 126 bits (316), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 92/264 (35%), Positives = 133/264 (51%), Gaps = 35/264 (13%)
Query 41 GDPPPWQRAATRQSQAGHRQPPPVSHPEGRPTNPPAAADARLNRFISGASAPVTGPAAAV 100
G+ PPWQR A + AG ++ A L +S ++ V A
Sbjct 20 GEVPPWQRVA-KDGSAG-------------TSDGDEGATQWLTSPVSAGASTVPPAPGAG 65
Query 101 RTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAG 160
P G G A A+ L + Q A+P AG G
Sbjct 66 GGGAQPPQQPGGEGSAFGQGAVANRL------FGQGEQSATRTAQPQGSDAGKSG----- 114
Query 161 SSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITV 220
SG +S++A R R A ++Q+RR+DPWS LK+SL+L VALFF+W++ V
Sbjct 115 -SGTQSLSAFRRPGRGPRRA---------NLQVRRVDPWSVLKLSLVLGVALFFIWLVAV 164
Query 221 AFLYLVLGGMGVWAKLNSNVGDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALAT 280
LY VL GMGVW +N L+ N + L+++GT+FG A ++G VNIVL++ALAT
Sbjct 165 GVLYTVLDGMGVWDSINGTYDSLVANDAVDGDVLITAGTVFGAAAIVGAVNIVLISALAT 224
Query 281 IGAFVYNLITDLIGGIEVTLADRD 304
+GAF+YN+ L GG+E+TL++R+
Sbjct 225 VGAFIYNVSAGLSGGLELTLSERE 248
>gi|324999889|ref|ZP_08121001.1| hypothetical protein PseP1_14021 [Pseudonocardia sp. P1]
Length=171
Score = 125 bits (315), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 61/123 (50%), Positives = 85/123 (70%), Gaps = 0/123 (0%)
Query 182 RSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVG 241
R+R P +A +Q++R+DPWS LK++L L+V L+ VWM+ LY VLGGMGVW +LN
Sbjct 49 RNRPPRQALLQLKRLDPWSVLKMALALAVVLWLVWMVAAGVLYGVLGGMGVWDRLNGTYA 108
Query 242 DLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLA 301
DL+ + L+S+G +FG A ++G VN +L TI AFVYN+ DL+GGIEVTL+
Sbjct 109 DLVTAQPETGGALISAGRVFGLAAVVGAVNSLLFAVAITIVAFVYNVAADLVGGIEVTLS 168
Query 302 DRD 304
+RD
Sbjct 169 ERD 171
>gi|145592576|ref|YP_001156873.1| hypothetical protein Strop_0010 [Salinispora tropica CNB-440]
gi|145301913|gb|ABP52495.1| hypothetical protein Strop_0010 [Salinispora tropica CNB-440]
Length=308
Score = 125 bits (314), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 79/171 (47%), Positives = 109/171 (64%), Gaps = 6/171 (3%)
Query 137 PQRNPAPARPAEGGAGSRGDSAAGSSGGRSITAESRDARVQLSARRSRGPVRASMQIRRI 196
PQ N A RP +GG+ G S + G + R AR +S+ SRGP RA + +RRI
Sbjct 141 PQPNSA-GRP-QGGSLPPGISGVAAVGAARVGEAVRAARTAVSSAASRGPRRARLNLRRI 198
Query 197 DPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNVGDLLNNASGSSA---E 253
DPWS +K + +SV LF V ++ + LYL L MGV+A +N ++ DL+N G + +
Sbjct 199 DPWSVMKFAFAVSVVLFIVVVVATSVLYLALDAMGVFASVNDSLSDLVNAGGGQNTNGFQ 258
Query 254 LVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD 304
+ + G I A LIGLVN+VL TALAT+GAFVYN+ DL+GG+E+TLA+RD
Sbjct 259 ITARGVILSSA-LIGLVNVVLFTALATLGAFVYNVCADLVGGVELTLAERD 308
Lambda K H
0.315 0.133 0.400
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 511987440690
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40