BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3839
Length=258
Sequences producing significant alignments: Score (Bits) E Value
gi|15610975|ref|NP_218356.1| hypothetical protein Rv3839 [Mycoba... 510 9e-143
gi|167969955|ref|ZP_02552232.1| hypothetical protein MtubH3_1876... 509 2e-142
gi|31795013|ref|NP_857506.1| hypothetical protein Mb3869 [Mycoba... 508 4e-142
gi|289445440|ref|ZP_06435184.1| conserved hypothetical protein [... 506 2e-141
gi|307086638|ref|ZP_07495751.1| hypothetical protein TMLG_00332 ... 469 2e-130
gi|289764025|ref|ZP_06523403.1| conserved hypothetical protein [... 461 4e-128
gi|183985359|ref|YP_001853650.1| hypothetical protein MMAR_5391 ... 394 8e-108
gi|342860093|ref|ZP_08716745.1| hypothetical protein MCOL_14480 ... 387 6e-106
gi|118620026|ref|YP_908358.1| hypothetical protein MUL_5012 [Myc... 383 1e-104
gi|296166969|ref|ZP_06849384.1| conserved hypothetical protein [... 375 3e-102
gi|240168316|ref|ZP_04746975.1| hypothetical protein MkanA1_0332... 374 6e-102
gi|41406290|ref|NP_959126.1| hypothetical protein MAP0192c [Myco... 372 2e-101
gi|118463004|ref|YP_879481.1| hypothetical protein MAV_0187 [Myc... 370 8e-101
gi|336461876|gb|EGO40731.1| hypothetical protein MAPs_26240 [Myc... 370 1e-100
gi|254773244|ref|ZP_05214760.1| hypothetical protein MaviaA2_009... 370 1e-100
gi|118469515|ref|YP_890632.1| hypothetical protein MSMEG_6419 [M... 325 6e-87
gi|120406605|ref|YP_956434.1| hypothetical protein Mvan_5663 [My... 320 2e-85
gi|108801998|ref|YP_642195.1| hypothetical protein Mmcs_5035 [My... 317 9e-85
gi|126437979|ref|YP_001073670.1| hypothetical protein Mjls_5416 ... 317 1e-84
gi|315446528|ref|YP_004079407.1| hypothetical protein Mspyr1_504... 299 3e-79
gi|145221736|ref|YP_001132414.1| hypothetical protein Mflv_1144 ... 297 1e-78
gi|169627236|ref|YP_001700885.1| hypothetical protein MAB_0131c ... 246 2e-63
gi|226363319|ref|YP_002781101.1| hypothetical protein ROP_39090 ... 241 7e-62
gi|111021014|ref|YP_703986.1| hypothetical protein RHA1_ro04031 ... 236 3e-60
gi|333992789|ref|YP_004525403.1| hypothetical protein JDM601_414... 236 3e-60
gi|312137673|ref|YP_004005009.1| hypothetical protein REQ_01700 ... 225 6e-57
gi|325677584|ref|ZP_08157246.1| hypothetical protein HMPREF0724_... 224 9e-57
gi|229494085|ref|ZP_04387852.1| conserved hypothetical protein [... 222 3e-56
gi|226303674|ref|YP_002763632.1| hypothetical protein RER_01850 ... 222 4e-56
gi|54022095|ref|YP_116337.1| hypothetical protein nfa1310 [Nocar... 216 2e-54
gi|254822679|ref|ZP_05227680.1| hypothetical protein MintA_22309... 209 4e-52
gi|333917762|ref|YP_004491343.1| hypothetical protein AS9A_0083 ... 197 2e-48
gi|343926274|ref|ZP_08765783.1| hypothetical protein GOALK_056_0... 194 9e-48
gi|262200207|ref|YP_003271415.1| hypothetical protein Gbro_0174 ... 177 2e-42
gi|296137892|ref|YP_003645135.1| hypothetical protein Tpau_0142 ... 161 9e-38
gi|134096752|ref|YP_001102413.1| hypothetical protein SACE_0134 ... 154 2e-35
gi|326384321|ref|ZP_08206002.1| hypothetical protein SCNU_15359 ... 145 4e-33
gi|256374246|ref|YP_003097906.1| hypothetical protein Amir_0086 ... 145 7e-33
gi|302531511|ref|ZP_07283853.1| conserved hypothetical protein [... 144 1e-32
gi|300782086|ref|YP_003762377.1| hypothetical protein AMED_0151 ... 142 5e-32
gi|331694054|ref|YP_004330293.1| hypothetical protein Psed_0164 ... 129 3e-28
gi|257054185|ref|YP_003132017.1| hypothetical protein Svir_01020... 127 2e-27
gi|324998836|ref|ZP_08119948.1| hypothetical protein PseP1_08734... 123 3e-26
gi|329936444|ref|ZP_08286209.1| hypothetical protein SGM_1701 [S... 97.1 2e-18
gi|294633390|ref|ZP_06711949.1| conserved hypothetical protein [... 92.0 8e-17
gi|254383599|ref|ZP_04998949.1| conserved hypothetical protein [... 87.8 1e-15
gi|290955498|ref|YP_003486680.1| hypothetical protein SCAB_9301 ... 87.8 2e-15
gi|297204232|ref|ZP_06921629.1| conserved hypothetical protein [... 86.7 3e-15
gi|117165013|emb|CAJ88565.1| conserved hypothetical protein [Str... 86.3 4e-15
gi|297564388|ref|YP_003683361.1| hypothetical protein Ndas_5476 ... 85.1 9e-15
>gi|15610975|ref|NP_218356.1| hypothetical protein Rv3839 [Mycobacterium tuberculosis H37Rv]
gi|15843463|ref|NP_338500.1| hypothetical protein MT3947 [Mycobacterium tuberculosis CDC1551]
gi|148663707|ref|YP_001285230.1| hypothetical protein MRA_3879 [Mycobacterium tuberculosis H37Ra]
42 more sequence titles
Length=258
Score = 510 bits (1313), Expect = 9e-143, Method: Compositional matrix adjust.
Identities = 258/258 (100%), Positives = 258/258 (100%), Gaps = 0/258 (0%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS 60
MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS
Sbjct 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS 60
Query 61 GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP 120
GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP
Sbjct 61 GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP 120
Query 121 RPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATA 180
RPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATA
Sbjct 121 RPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATA 180
Query 181 HDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ 240
HDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ
Sbjct 181 HDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ 240
Query 241 AIRVLMGCPFRNGLRARR 258
AIRVLMGCPFRNGLRARR
Sbjct 241 AIRVLMGCPFRNGLRARR 258
>gi|167969955|ref|ZP_02552232.1| hypothetical protein MtubH3_18768 [Mycobacterium tuberculosis
H37Ra]
Length=258
Score = 509 bits (1310), Expect = 2e-142, Method: Compositional matrix adjust.
Identities = 257/258 (99%), Positives = 258/258 (100%), Gaps = 0/258 (0%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS 60
MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS
Sbjct 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS 60
Query 61 GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP 120
GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP
Sbjct 61 GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP 120
Query 121 RPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATA 180
RPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWH+ATA
Sbjct 121 RPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHVATA 180
Query 181 HDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ 240
HDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ
Sbjct 181 HDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ 240
Query 241 AIRVLMGCPFRNGLRARR 258
AIRVLMGCPFRNGLRARR
Sbjct 241 AIRVLMGCPFRNGLRARR 258
>gi|31795013|ref|NP_857506.1| hypothetical protein Mb3869 [Mycobacterium bovis AF2122/97]
gi|121639757|ref|YP_979981.1| hypothetical protein BCG_3902 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224992252|ref|YP_002646942.1| hypothetical protein JTY_3904 [Mycobacterium bovis BCG str. Tokyo
172]
28 more sequence titles
Length=258
Score = 508 bits (1308), Expect = 4e-142, Method: Compositional matrix adjust.
Identities = 257/258 (99%), Positives = 257/258 (99%), Gaps = 0/258 (0%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS 60
MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS
Sbjct 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS 60
Query 61 GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP 120
GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP
Sbjct 61 GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP 120
Query 121 RPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATA 180
R GPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATA
Sbjct 121 RSGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATA 180
Query 181 HDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ 240
HDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ
Sbjct 181 HDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ 240
Query 241 AIRVLMGCPFRNGLRARR 258
AIRVLMGCPFRNGLRARR
Sbjct 241 AIRVLMGCPFRNGLRARR 258
>gi|289445440|ref|ZP_06435184.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289418398|gb|EFD15599.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=258
Score = 506 bits (1302), Expect = 2e-141, Method: Compositional matrix adjust.
Identities = 256/258 (99%), Positives = 256/258 (99%), Gaps = 0/258 (0%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS 60
MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS
Sbjct 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS 60
Query 61 GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP 120
GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP
Sbjct 61 GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP 120
Query 121 RPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATA 180
R GPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATA
Sbjct 121 RSGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATA 180
Query 181 HDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ 240
HDDVVARLVSRLPAPLR GQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ
Sbjct 181 HDDVVARLVSRLPAPLRHGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ 240
Query 241 AIRVLMGCPFRNGLRARR 258
AIRVLMGCPFRNGLRARR
Sbjct 241 AIRVLMGCPFRNGLRARR 258
>gi|307086638|ref|ZP_07495751.1| hypothetical protein TMLG_00332 [Mycobacterium tuberculosis SUMu012]
gi|308363981|gb|EFP52832.1| hypothetical protein TMLG_00332 [Mycobacterium tuberculosis SUMu012]
Length=259
Score = 469 bits (1206), Expect = 2e-130, Method: Compositional matrix adjust.
Identities = 240/246 (98%), Positives = 240/246 (98%), Gaps = 1/246 (0%)
Query 14 RIRSACARAG-GALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVSGSQALLELTDYA 72
R S RAG GALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVSGSQALLELTDYA
Sbjct 14 RANSQRLRAGRGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVSGSQALLELTDYA 73
Query 73 PLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPGPADAAETRY 132
PLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPGPADAAETRY
Sbjct 74 PLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPGPADAAETRY 133
Query 133 TMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVARLVSRL 192
TMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVARLVSRL
Sbjct 134 TMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVARLVSRL 193
Query 193 PAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRVLMGCPFRN 252
PAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRVLMGCPFRN
Sbjct 194 PAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRVLMGCPFRN 253
Query 253 GLRARR 258
GLRARR
Sbjct 254 GLRARR 259
>gi|289764025|ref|ZP_06523403.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
gi|289711531|gb|EFD75547.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
Length=233
Score = 461 bits (1187), Expect = 4e-128, Method: Compositional matrix adjust.
Identities = 232/233 (99%), Positives = 233/233 (100%), Gaps = 0/233 (0%)
Query 26 LLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVSGSQALLELTDYAPLPVREPVRSLVW 85
+LVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVSGSQALLELTDYAPLPVREPVRSLVW
Sbjct 1 MLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVSGSQALLELTDYAPLPVREPVRSLVW 60
Query 86 IRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPGPADAAETRYTMQRLEIESVVVT 145
IRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPGPADAAETRYTMQRLEIESVVVT
Sbjct 61 IRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPGPADAAETRYTMQRLEIESVVVT 120
Query 146 DATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVARLVSRLPAPLRRGQIRPLG 205
DATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVARLVSRLPAPLRRGQIRPLG
Sbjct 121 DATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVARLVSRLPAPLRRGQIRPLG 180
Query 206 LDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRVLMGCPFRNGLRARR 258
LDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRVLMGCPFRNGLRARR
Sbjct 181 LDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRVLMGCPFRNGLRARR 233
>gi|183985359|ref|YP_001853650.1| hypothetical protein MMAR_5391 [Mycobacterium marinum M]
gi|183178685|gb|ACC43795.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=270
Score = 394 bits (1011), Expect = 8e-108, Method: Compositional matrix adjust.
Identities = 206/265 (78%), Positives = 225/265 (85%), Gaps = 7/265 (2%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDR-GEV 59
M PLT+ APTTAERIRSACARAGGALL VER+ PV PIHHL+ DGSFAVAVP++ G
Sbjct 6 MLPLTNPAPTTAERIRSACARAGGALLAVERDGPVATPIHHLMPDGSFAVAVPIEHPGAG 65
Query 60 SG------SQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPA 113
+G SQALLELTDYAPLPVREPVRSLVWIRG LH +P + TLDLIAT+NPNPA
Sbjct 66 TGDPGQPPSQALLELTDYAPLPVREPVRSLVWIRGRLHPVPADMIGATLDLIATENPNPA 125
Query 114 LLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTL 173
LLQV+TPR PA AE RY + RLEIESVVVTDATGAEP++V DLLAARPDPFCEIES+L
Sbjct 126 LLQVQTPRSAPAHGAEIRYALLRLEIESVVVTDATGAEPISVVDLLAARPDPFCEIESSL 185
Query 174 LWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVD 233
L HL T H DVVARLVSRLPAPLRRG++RPLGLDRYGVRFRIE+ DGDRDIRLPFH+PVD
Sbjct 186 LRHLTTDHQDVVARLVSRLPAPLRRGEVRPLGLDRYGVRFRIESNDGDRDIRLPFHRPVD 245
Query 234 DMTGLSQAIRVLMGCPFRNGLRARR 258
DM GL QAIRVL+GCPF NGLRARR
Sbjct 246 DMHGLRQAIRVLLGCPFGNGLRARR 270
>gi|342860093|ref|ZP_08716745.1| hypothetical protein MCOL_14480 [Mycobacterium colombiense CECT
3035]
gi|342132471|gb|EGT85700.1| hypothetical protein MCOL_14480 [Mycobacterium colombiense CECT
3035]
Length=264
Score = 387 bits (995), Expect = 6e-106, Method: Compositional matrix adjust.
Identities = 195/260 (75%), Positives = 221/260 (85%), Gaps = 5/260 (1%)
Query 3 PLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGE---- 58
PL+ APTTAERIRSAC RAGGALL +E +DPVP P+HHL+ DGS A+AVPV+R
Sbjct 4 PLSCPAPTTAERIRSACVRAGGALLAIEHDDPVPTPLHHLMDDGSVALAVPVERAGGLAR 63
Query 59 -VSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQV 117
+SGSQALLELTDYAPLP+REPVRSLVW+RG L Q+PP+E ++TLDLIA + PNPALL V
Sbjct 64 PISGSQALLELTDYAPLPLREPVRSLVWVRGHLQQVPPSETLDTLDLIAAECPNPALLGV 123
Query 118 ETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHL 177
+TPR P D E RYT+ RLEI SVVVTDATGAEPV+V DLL ARPDPFC +ES+LLWHL
Sbjct 124 DTPRCAPTDGEEPRYTLLRLEIASVVVTDATGAEPVSVGDLLEARPDPFCALESSLLWHL 183
Query 178 ATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTG 237
TAH DV+ARLVSRLPAPLRRG +RPLGLDRYGVRFR+E+ D D D+RLPFH+PVDDMTG
Sbjct 184 DTAHSDVLARLVSRLPAPLRRGHVRPLGLDRYGVRFRVESDDRDHDVRLPFHRPVDDMTG 243
Query 238 LSQAIRVLMGCPFRNGLRAR 257
LSQAIRVLMGCPF NGLRAR
Sbjct 244 LSQAIRVLMGCPFINGLRAR 263
>gi|118620026|ref|YP_908358.1| hypothetical protein MUL_5012 [Mycobacterium ulcerans Agy99]
gi|118572136|gb|ABL06887.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=266
Score = 383 bits (983), Expect = 1e-104, Method: Compositional matrix adjust.
Identities = 200/265 (76%), Positives = 220/265 (84%), Gaps = 7/265 (2%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDR-GEV 59
M PLTS APTTAERIRSACARAGGALL VER+ PV PIHHL+ DG+FAVAVP++ G
Sbjct 2 MLPLTSPAPTTAERIRSACARAGGALLAVERDGPVATPIHHLMPDGTFAVAVPIEHPGAG 61
Query 60 SG------SQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPA 113
+G SQA LELTDY PLPVREPVRSLVWIRG L +P + LDLIAT+NPNPA
Sbjct 62 AGDPGQPPSQAFLELTDYTPLPVREPVRSLVWIRGRLRPVPADMIGAALDLIATENPNPA 121
Query 114 LLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTL 173
LLQV+TPR PA AE RY + RLEIESVVVTD+TGAEP++V DLLAARPDPFCEIES+L
Sbjct 122 LLQVQTPRSAPAHGAEIRYALMRLEIESVVVTDSTGAEPISVVDLLAARPDPFCEIESSL 181
Query 174 LWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVD 233
L HL T H DVVARLVSRLPAPLRRG++RPLGLDRYGVRFRIE+ DGDRDIRLPFH+PVD
Sbjct 182 LRHLTTDHQDVVARLVSRLPAPLRRGEVRPLGLDRYGVRFRIESNDGDRDIRLPFHRPVD 241
Query 234 DMTGLSQAIRVLMGCPFRNGLRARR 258
DM GL QAIRVL+GCPF NGLRA R
Sbjct 242 DMHGLRQAIRVLLGCPFGNGLRAGR 266
>gi|296166969|ref|ZP_06849384.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897680|gb|EFG77271.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=264
Score = 375 bits (963), Expect = 3e-102, Method: Compositional matrix adjust.
Identities = 197/263 (75%), Positives = 218/263 (83%), Gaps = 6/263 (2%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVE----REDPVPVPIHHLLYDGSFAVAVPVDR 56
M PLT APTTAERIRSAC RAGGALL +E R+DPV P+HHLL DGSFA+A+P D
Sbjct 1 MLPLTCPAPTTAERIRSACVRAGGALLALETVHPRQDPVVTPVHHLLPDGSFALALPADH 60
Query 57 GE--VSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPAL 114
V G+QA+LELTDYAPLP+REPVRSLVW RG L + PAE+ +DLIA + P+PAL
Sbjct 61 DPHPVDGAQAVLELTDYAPLPLREPVRSLVWARGRLREFHPAEVAGAVDLIAAEWPHPAL 120
Query 115 LQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLL 174
LQV+TPR P+D E RYT+ RLEI SVVVTDATGAEPV+V DLLAARPDPFCEIES LL
Sbjct 121 LQVDTPRCAPSDGDELRYTLFRLEIASVVVTDATGAEPVSVEDLLAARPDPFCEIESNLL 180
Query 175 WHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDD 234
WHL TAH DVVARLVSRLPAPLRRG++RPLGLDRYGVRFR+E RD D D+RLPFHKPVDD
Sbjct 181 WHLDTAHSDVVARLVSRLPAPLRRGRVRPLGLDRYGVRFRVEGRDRDHDVRLPFHKPVDD 240
Query 235 MTGLSQAIRVLMGCPFRNGLRAR 257
MTGL QAIRVLMGCPF NGLRAR
Sbjct 241 MTGLRQAIRVLMGCPFMNGLRAR 263
>gi|240168316|ref|ZP_04746975.1| hypothetical protein MkanA1_03327 [Mycobacterium kansasii ATCC
12478]
Length=268
Score = 374 bits (961), Expect = 6e-102, Method: Compositional matrix adjust.
Identities = 192/264 (73%), Positives = 212/264 (81%), Gaps = 6/264 (2%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDR---- 56
M P+TS PTTAERIR+ CARAG LL VE + P+ P+HHL+ DGSFAVA+P DR
Sbjct 1 MFPITSPTPTTAERIRTICARAGAGLLAVEPDQPIAAPLHHLMSDGSFAVAIPADRAAGA 60
Query 57 --GEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPAL 114
G GSQALLELTDYAPLPVREPVRSLVWIRG L ++P + LD IAT+NP+PAL
Sbjct 61 GYGPGCGSQALLELTDYAPLPVREPVRSLVWIRGRLQRVPWGAVSSVLDTIATENPHPAL 120
Query 115 LQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLL 174
LQVETPR GPA + RY + RLE ESVVVTD TGA PV+VADLL A PDPFCEIESTLL
Sbjct 121 LQVETPRSGPALRQQNRYALLRLETESVVVTDTTGASPVSVADLLTALPDPFCEIESTLL 180
Query 175 WHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDD 234
WHLAT H DVVARLVSRLPA RRG IRPLGLDRYGV+FR+E DGDRD+RLPFH+PVDD
Sbjct 181 WHLATVHGDVVARLVSRLPAQWRRGPIRPLGLDRYGVQFRVEDDDGDRDVRLPFHRPVDD 240
Query 235 MTGLSQAIRVLMGCPFRNGLRARR 258
M GL+QAIRVLMGCPF NGLRARR
Sbjct 241 MNGLAQAIRVLMGCPFVNGLRARR 264
>gi|41406290|ref|NP_959126.1| hypothetical protein MAP0192c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41394638|gb|AAS02509.1| hypothetical protein MAP_0192c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=264
Score = 372 bits (956), Expect = 2e-101, Method: Compositional matrix adjust.
Identities = 194/265 (74%), Positives = 221/265 (84%), Gaps = 10/265 (3%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLL----YDGSFAVAVPVDR 56
M P T AP+TAERIRSAC RAGGALL +E +DPVP P+HHL+ + GSFA+A+PV R
Sbjct 1 MLPQTCPAPSTAERIRSACVRAGGALLAIEHDDPVPTPVHHLIGTGPFAGSFALALPVAR 60
Query 57 GE----VSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNP 112
V+G+ ALLELTDYAPLP+REPVRSLVW+RG LH++ PA+++ETLD+IA + PNP
Sbjct 61 EHRLRPVAGAPALLELTDYAPLPLREPVRSLVWVRGRLHEVDPAQILETLDVIAAECPNP 120
Query 113 ALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIEST 172
ALL V+TPR AD E RY ++RLEI SVVVTDATGAEPV VADLLAARPDPFC +ES
Sbjct 121 ALLGVDTPRR--ADGTEPRYVLRRLEIASVVVTDATGAEPVDVADLLAARPDPFCALESD 178
Query 173 LLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPV 232
LLWHL TAH DVV+RLVSRLPAPLRRGQ+RPLGLDRYGVRFR+E D D D+RLPFHKPV
Sbjct 179 LLWHLDTAHGDVVSRLVSRLPAPLRRGQVRPLGLDRYGVRFRVEGNDRDHDVRLPFHKPV 238
Query 233 DDMTGLSQAIRVLMGCPFRNGLRAR 257
DDMTGLSQAIRVLMGCPF NGLRAR
Sbjct 239 DDMTGLSQAIRVLMGCPFINGLRAR 263
>gi|118463004|ref|YP_879481.1| hypothetical protein MAV_0187 [Mycobacterium avium 104]
gi|118164291|gb|ABK65188.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=264
Score = 370 bits (951), Expect = 8e-101, Method: Compositional matrix adjust.
Identities = 192/265 (73%), Positives = 222/265 (84%), Gaps = 10/265 (3%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLL----YDGSFAVAVPVDR 56
M P T AP+TAERIRSAC RAGGALL +E +DPVP P+HHL+ + GSF++A+PV+R
Sbjct 1 MLPQTCPAPSTAERIRSACVRAGGALLAIEHDDPVPTPVHHLIGAGPFAGSFSLALPVER 60
Query 57 GE----VSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNP 112
V+G+ ALLELTDYAPLP+REPVRSLVW+RG LH++ PA+++ETLD+IA + P+P
Sbjct 61 EHRLRPVAGAPALLELTDYAPLPLREPVRSLVWVRGRLHEVDPAQILETLDVIAAECPHP 120
Query 113 ALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIEST 172
ALL V+TPR AD E RY ++RLEI SVVVTDATGAEPV VADLLAARPDPFC +ES
Sbjct 121 ALLGVDTPRR--ADGTEPRYVLRRLEIASVVVTDATGAEPVDVADLLAARPDPFCALESD 178
Query 173 LLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPV 232
LLWHL TAH DVV+RLVSRLPAPLRRGQ+RPLGLDRYGVRFR+E D D D+RLPFHKPV
Sbjct 179 LLWHLDTAHGDVVSRLVSRLPAPLRRGQVRPLGLDRYGVRFRVEGNDRDHDVRLPFHKPV 238
Query 233 DDMTGLSQAIRVLMGCPFRNGLRAR 257
DDMTGLSQAIRVLMGCPF NGLRAR
Sbjct 239 DDMTGLSQAIRVLMGCPFINGLRAR 263
>gi|336461876|gb|EGO40731.1| hypothetical protein MAPs_26240 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=264
Score = 370 bits (950), Expect = 1e-100, Method: Compositional matrix adjust.
Identities = 192/261 (74%), Positives = 219/261 (84%), Gaps = 10/261 (3%)
Query 5 TSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLL----YDGSFAVAVPVDRGE-- 58
T AP+TAERIRSAC RAGGALL +E +DPVP P+HHL+ + GSFA+A+PV R
Sbjct 5 TCPAPSTAERIRSACVRAGGALLAIEHDDPVPTPVHHLIGTGPFAGSFALALPVAREHRL 64
Query 59 --VSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQ 116
V+G+ ALLELTDYAPLP+REPVRSLVW+RG LH++ PA+++ETLD+IA + PNPALL
Sbjct 65 RPVAGAPALLELTDYAPLPLREPVRSLVWVRGRLHEVDPAQILETLDVIAAECPNPALLG 124
Query 117 VETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWH 176
V+TPR AD E RY ++RLEI SVVVTDATGAEPV VADLLAARPDPFC +ES LLWH
Sbjct 125 VDTPRR--ADGTEPRYVLRRLEIASVVVTDATGAEPVDVADLLAARPDPFCALESDLLWH 182
Query 177 LATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMT 236
L TAH DVV+RLVSRLPAPLRRGQ+RPLGLDRYGVRFR+E D D D+RLPFHKPVDDMT
Sbjct 183 LDTAHGDVVSRLVSRLPAPLRRGQVRPLGLDRYGVRFRVEGNDRDHDVRLPFHKPVDDMT 242
Query 237 GLSQAIRVLMGCPFRNGLRAR 257
GLSQAIRVLMGCPF NGLRAR
Sbjct 243 GLSQAIRVLMGCPFINGLRAR 263
>gi|254773244|ref|ZP_05214760.1| hypothetical protein MaviaA2_00981 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=264
Score = 370 bits (950), Expect = 1e-100, Method: Compositional matrix adjust.
Identities = 193/265 (73%), Positives = 220/265 (84%), Gaps = 10/265 (3%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLL----YDGSFAVAVPVDR 56
M P T AP+TAERIRSAC RAGGALL +E +DPVP P+HHL+ + GSFA+A+PV R
Sbjct 1 MLPQTCPAPSTAERIRSACVRAGGALLAIEHDDPVPTPVHHLIGAGPFAGSFALALPVAR 60
Query 57 GE----VSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNP 112
V+G+ ALLELTDYAPLP+REPVRSLVW+RG L ++ PA+++ETLD+IA + PNP
Sbjct 61 EHRLRPVAGAPALLELTDYAPLPLREPVRSLVWVRGRLREVDPAQILETLDVIAAECPNP 120
Query 113 ALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIEST 172
ALL V+TPR AD E RY ++RLEI SVVVTDATGAEPV VADLLAARPDPFC +ES
Sbjct 121 ALLNVDTPRR--ADGTEPRYVLRRLEIASVVVTDATGAEPVDVADLLAARPDPFCALESD 178
Query 173 LLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPV 232
LLWHL TAH DVV+RLVSRLPAPLRRGQ+RPLGLDRYGVRFR+E D D D+RLPFHKPV
Sbjct 179 LLWHLDTAHGDVVSRLVSRLPAPLRRGQVRPLGLDRYGVRFRVEGNDRDHDVRLPFHKPV 238
Query 233 DDMTGLSQAIRVLMGCPFRNGLRAR 257
DDMTGLSQAIRVLMGCPF NGLRAR
Sbjct 239 DDMTGLSQAIRVLMGCPFINGLRAR 263
>gi|118469515|ref|YP_890632.1| hypothetical protein MSMEG_6419 [Mycobacterium smegmatis str.
MC2 155]
gi|118170802|gb|ABK71698.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=261
Score = 325 bits (832), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 165/259 (64%), Positives = 198/259 (77%), Gaps = 11/259 (4%)
Query 8 APTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVP--------VDRGEV 59
APTTAERIRSACAR GGA+L VE +P+ P+HHLL DGSFA+ VP V
Sbjct 5 APTTAERIRSACARGGGAMLAVEGIEPLATPVHHLLQDGSFAITVPENGPLVGTVVSSGS 64
Query 60 SGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVET 119
+G QA+LE+TDYAPLP+REPVRSLVWIRG L +P E+ + LDL+AT+NPNPALLQV +
Sbjct 65 AGIQAVLEMTDYAPLPLREPVRSLVWIRGRLQHVPNGEVADLLDLVATENPNPALLQVNS 124
Query 120 PRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLAT 179
P D A+ Y + RLEIES+VV D+TGAE VTV LLAARPDPFC +ES+ L H+ +
Sbjct 125 ---SPIDDADDTYALLRLEIESIVVADSTGAESVTVGALLAARPDPFCAMESSWLQHMES 181
Query 180 AHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLS 239
AH DVV RL +RLP LR+G++RPLGLDRYGV+ R+E +GD D+RLPF PVDD+ GLS
Sbjct 182 AHRDVVDRLATRLPVALRQGRVRPLGLDRYGVQLRVENENGDHDVRLPFPAPVDDVPGLS 241
Query 240 QAIRVLMGCPFRNGLRARR 258
+AIRVLMGCPF NGLRARR
Sbjct 242 KAIRVLMGCPFLNGLRARR 260
>gi|120406605|ref|YP_956434.1| hypothetical protein Mvan_5663 [Mycobacterium vanbaalenii PYR-1]
gi|119959423|gb|ABM16428.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=266
Score = 320 bits (819), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 167/263 (64%), Positives = 195/263 (75%), Gaps = 10/263 (3%)
Query 3 PLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS-- 60
P + APTTAERIRSACAR+GGA+L VE DP P+HHLL DGSFA+ VP D V+
Sbjct 4 PAPTSAPTTAERIRSACARSGGAMLAVEGLDPTTTPVHHLLDDGSFAITVPCDGSLVATV 63
Query 61 ------GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPAL 114
G QA+LE+TDYAPLP+REPVRSLVWI+G L +P E+ LDLIA+ PNPAL
Sbjct 64 VSAGNAGVQAVLEMTDYAPLPLREPVRSLVWIQGRLRDVPIGEVPALLDLIASTEPNPAL 123
Query 115 LQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLL 174
LQV + +TR + RLEIESVVV DATGAE V ++ LL ARPDPFC +ES L
Sbjct 124 LQVNSGSS--QGEGDTRLALMRLEIESVVVADATGAESVALSALLGARPDPFCGMESCWL 181
Query 175 WHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDD 234
H+ +AH DVV RL +RLPA LRRG++RPLGLDRYGV+ R+E DGD D+RLPF PVDD
Sbjct 182 QHMESAHRDVVDRLAARLPASLRRGRVRPLGLDRYGVQLRVEGADGDHDVRLPFAHPVDD 241
Query 235 MTGLSQAIRVLMGCPFRNGLRAR 257
+TGLSQAIRVLMGCPF NGLRAR
Sbjct 242 VTGLSQAIRVLMGCPFLNGLRAR 264
>gi|108801998|ref|YP_642195.1| hypothetical protein Mmcs_5035 [Mycobacterium sp. MCS]
gi|119871150|ref|YP_941102.1| hypothetical protein Mkms_5123 [Mycobacterium sp. KMS]
gi|108772417|gb|ABG11139.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119697239|gb|ABL94312.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=273
Score = 317 bits (813), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 162/260 (63%), Positives = 195/260 (75%), Gaps = 15/260 (5%)
Query 7 LAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVD--------RGE 58
+ P+TAERIRSACAR GGA+L E DPV P+HHLL DGSFA+ VPV+
Sbjct 15 IGPSTAERIRSACARGGGAMLAAEGVDPVSTPVHHLLDDGSFAITVPVEVPLSTMVASAG 74
Query 59 VSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVE 118
SG QA+LE+TD+APLP+REPVRSLVWI G + +P A++ LDLIA+ +PNPALLQV
Sbjct 75 TSGVQAVLEMTDHAPLPLREPVRSLVWIGGRVQAVPSADVSALLDLIASADPNPALLQVN 134
Query 119 TPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLA 178
+ +RY + RLEIESVVV D+TGAE V + LLAARPDPFC +ES L H+
Sbjct 135 S-------GGHSRYALMRLEIESVVVADSTGAESVGLGALLAARPDPFCAMESCWLQHME 187
Query 179 TAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGL 238
+AH DVV RL SRLPA +RRG++RPLGLDRYGV+ R+E DGD D+RLPF +PVDD+TGL
Sbjct 188 SAHRDVVDRLASRLPAAMRRGRVRPLGLDRYGVQLRVEDPDGDHDVRLPFPRPVDDVTGL 247
Query 239 SQAIRVLMGCPFRNGLRARR 258
SQAIRVLMGCPF NGL+ARR
Sbjct 248 SQAIRVLMGCPFLNGLQARR 267
>gi|126437979|ref|YP_001073670.1| hypothetical protein Mjls_5416 [Mycobacterium sp. JLS]
gi|126237779|gb|ABO01180.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=273
Score = 317 bits (813), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 163/259 (63%), Positives = 194/259 (75%), Gaps = 15/259 (5%)
Query 8 APTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVD--------RGEV 59
P+TAERIRSACAR GGA+L E DPV P+HHLL DGSFA+ VPV+
Sbjct 16 GPSTAERIRSACARGGGAMLAAEGVDPVSTPVHHLLDDGSFAITVPVEVPLSTMVASAGT 75
Query 60 SGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVET 119
SG QA+LE+TD+APLP+REPVRSLVWI G + +P A++ LDLIA+ +PNPALLQV +
Sbjct 76 SGVQAVLEMTDHAPLPLREPVRSLVWIGGRVQAVPSADVSALLDLIASADPNPALLQVNS 135
Query 120 PRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLAT 179
+RY + RLEIESVVV D+TGAE V + LLAARPDPFC +ES L H+ +
Sbjct 136 -------GDHSRYALMRLEIESVVVADSTGAESVGLGALLAARPDPFCAMESCWLQHMES 188
Query 180 AHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLS 239
AH DVV RL SRLPA +RRG++RPLGLDRYGV+ R+E DGD D+RLPF KPVDD+TGLS
Sbjct 189 AHRDVVDRLASRLPAAMRRGRVRPLGLDRYGVQLRVEDPDGDHDVRLPFPKPVDDVTGLS 248
Query 240 QAIRVLMGCPFRNGLRARR 258
QAIRVLMGCPF NGL+ARR
Sbjct 249 QAIRVLMGCPFLNGLQARR 267
>gi|315446528|ref|YP_004079407.1| hypothetical protein Mspyr1_50430 [Mycobacterium sp. Spyr1]
gi|315264831|gb|ADU01573.1| hypothetical protein Mspyr1_50430 [Mycobacterium sp. Spyr1]
Length=262
Score = 299 bits (765), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 157/253 (63%), Positives = 186/253 (74%), Gaps = 14/253 (5%)
Query 13 ERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVD--------RGEVSGSQA 64
ERIRSAC R GGA++ VE DP +HHLL DGS A+ VPVD +G QA
Sbjct 14 ERIRSACVRPGGAMIAVEGLDPSTTSVHHLLGDGSVAITVPVDGPLAASVVSAGNAGIQA 73
Query 65 LLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPGP 124
+LE+TDYAPLP+REPVRSLVWI+G L +P AE+ LDLIA +PNPALLQV PGP
Sbjct 74 VLEMTDYAPLPLREPVRSLVWIQGVLRDVPTAEVPALLDLIAAADPNPALLQVN---PGP 130
Query 125 ADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDV 184
ET + + RLEIESVVV D+TGAE V + +LL ARPDPFC +ES L H+ +AH DV
Sbjct 131 E---ETPHALMRLEIESVVVADSTGAESVALGELLGARPDPFCAMESCWLQHMESAHRDV 187
Query 185 VARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRV 244
V RL +RLPA LR+G++RPL LDRYGV+ R+E DGD D+RLPF KPVDD+T LS+AIRV
Sbjct 188 VDRLAARLPASLRQGRVRPLALDRYGVQLRVEGADGDHDVRLPFGKPVDDVTSLSRAIRV 247
Query 245 LMGCPFRNGLRAR 257
LMGCPF NGLRAR
Sbjct 248 LMGCPFLNGLRAR 260
>gi|145221736|ref|YP_001132414.1| hypothetical protein Mflv_1144 [Mycobacterium gilvum PYR-GCK]
gi|145214222|gb|ABP43626.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=262
Score = 297 bits (761), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 157/253 (63%), Positives = 186/253 (74%), Gaps = 14/253 (5%)
Query 13 ERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS--------GSQA 64
ERIRSAC R GGA++ E DP +HHLL DGS A+ VPVD V+ G QA
Sbjct 14 ERIRSACVRPGGAMIAAEGLDPSTTSVHHLLGDGSVAITVPVDGPLVASVASAGNAGIQA 73
Query 65 LLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPGP 124
+LE+TDYAPLP+REPVRSLVWI+G L +P AE+ LDLIA +PNPALLQV PGP
Sbjct 74 VLEMTDYAPLPLREPVRSLVWIQGVLRDVPTAEVPALLDLIAAADPNPALLQVN---PGP 130
Query 125 ADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDV 184
ET + + RLEIESVVV D+TGAE V + +LL ARPDPFC +ES L H+ +AH DV
Sbjct 131 E---ETPHALMRLEIESVVVADSTGAESVALGELLGARPDPFCAMESCWLKHMESAHRDV 187
Query 185 VARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRV 244
V RL +RLPA LR+G++RPL LDRYGV+ R+E DGD D+RLPF KPVDD+T LS+AIRV
Sbjct 188 VDRLAARLPASLRQGRVRPLALDRYGVQLRVEGADGDHDVRLPFGKPVDDVTSLSRAIRV 247
Query 245 LMGCPFRNGLRAR 257
LMGCPF NGLRAR
Sbjct 248 LMGCPFLNGLRAR 260
>gi|169627236|ref|YP_001700885.1| hypothetical protein MAB_0131c [Mycobacterium abscessus ATCC
19977]
gi|169239203|emb|CAM60231.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=261
Score = 246 bits (628), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 135/268 (51%), Positives = 174/268 (65%), Gaps = 21/268 (7%)
Query 4 LTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRG------ 57
+T+ PTTAER+RSACARA + L V D V +HHL DG+FAVAVP D
Sbjct 1 MTAAPPTTAERVRSACARAASSTLAVAGADVVGTSLHHLFDDGTFAVAVPSDSAIAATVV 60
Query 58 --EVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALL 115
+G ALLELTD APLP+REPVRSLVW+RG + E +D+IA+ P+PALL
Sbjct 61 AAGSAGMPALLELTDQAPLPLREPVRSLVWVRGNVVAASDREARGIVDVIASRIPDPALL 120
Query 116 QVET-----PRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIE 170
+ T PG + L +ESVVV D+TGAE V V+ LL+ARPDPFC +E
Sbjct 121 DIRTDMRLRTEPGS--------VLLCLTVESVVVADSTGAESVDVSALLSARPDPFCALE 172
Query 171 STLLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHK 230
+ L H+ H D+V RL RLP L+ G++R LG+DRYG++ R+E + D D+RLPF++
Sbjct 173 AGWLSHIDHDHRDLVERLARRLPLNLQHGEVRLLGIDRYGIQLRVEGAESDHDVRLPFNE 232
Query 231 PVDDMTGLSQAIRVLMGCPFRNGLRARR 258
PV+D GLSQA+R+L GCPF NGLRAR+
Sbjct 233 PVNDTAGLSQALRILAGCPFLNGLRARK 260
>gi|226363319|ref|YP_002781101.1| hypothetical protein ROP_39090 [Rhodococcus opacus B4]
gi|226241808|dbj|BAH52156.1| hypothetical protein [Rhodococcus opacus B4]
Length=254
Score = 241 bits (615), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 132/263 (51%), Positives = 170/263 (65%), Gaps = 19/263 (7%)
Query 3 PLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS-- 60
P+T+ P+TAER+RS+CAR A+L VE P +HHL G VAVP D +
Sbjct 2 PVTTTGPSTAERVRSSCARTQDAVLAVEGSAPTVTSVHHLRSSGDVVVAVPTDSAAATLS 61
Query 61 ------GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPAL 114
G A+LELTD +PLP+REPVRSLVW+RG LH + A D +A++ P+P L
Sbjct 62 WLAGGGGIPAVLELTDNSPLPLREPVRSLVWLRGTLHALCEAHTRTLADEVASEFPHPGL 121
Query 115 LQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLL 174
L V DA T+ +L +ES VV D++GAEPV + DLLAARPDPF E+E+ L
Sbjct 122 LDVGH------DA-----TLLQLHLESAVVADSSGAEPVALDDLLAARPDPFWEMETAWL 170
Query 175 WHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDD 234
HL H D++ L +LP +RRG++RPLG+DRYG+R R+E GDRDI +PF PVDD
Sbjct 171 QHLDEDHRDLIDMLSRKLPPHMRRGRVRPLGIDRYGLRLRVENDHGDRDIWMPFSAPVDD 230
Query 235 MTGLSQAIRVLMGCPFRNGLRAR 257
LS+AIR+L+GCPF NGLRAR
Sbjct 231 AVALSRAIRLLVGCPFVNGLRAR 253
>gi|111021014|ref|YP_703986.1| hypothetical protein RHA1_ro04031 [Rhodococcus jostii RHA1]
gi|110820544|gb|ABG95828.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=254
Score = 236 bits (602), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 129/263 (50%), Positives = 169/263 (65%), Gaps = 19/263 (7%)
Query 3 PLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS-- 60
P+T+ P+TAER+RS+CAR A+L VE P +HHL G VAVP + +
Sbjct 2 PVTTTGPSTAERVRSSCARTQDAVLAVEGSAPTVTSVHHLRSSGDVVVAVPTESAAATLS 61
Query 61 ------GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPAL 114
G A+LELTD +PL +REPVRSLVW+RG LH + A D +A++ P+P L
Sbjct 62 WLAGGGGIPAVLELTDNSPLALREPVRSLVWLRGNLHALCEAHTRTLADEVASEYPHPGL 121
Query 115 LQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLL 174
L + DA T+ +L +ES VV D++GAEPV + DLLAARPDPF E+E+ L
Sbjct 122 LDIGH------DA-----TLLQLRLESAVVADSSGAEPVALDDLLAARPDPFWEMETAWL 170
Query 175 WHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDD 234
HL H D++ L +LP +RRG++RPLG+DRYG+R R+E GDRDI +PF PVDD
Sbjct 171 QHLDEDHRDLIDMLSRKLPPHMRRGRVRPLGIDRYGLRLRVENDHGDRDIWMPFSAPVDD 230
Query 235 MTGLSQAIRVLMGCPFRNGLRAR 257
LS+AIR+L+GCPF NGLRAR
Sbjct 231 AVALSRAIRLLVGCPFVNGLRAR 253
>gi|333992789|ref|YP_004525403.1| hypothetical protein JDM601_4149 [Mycobacterium sp. JDM601]
gi|333488757|gb|AEF38149.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=272
Score = 236 bits (601), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 135/260 (52%), Positives = 173/260 (67%), Gaps = 12/260 (4%)
Query 11 TAERIRSACARAGGALLVVEREDPVPV--PIHHLLYDGSFAVAVPVD----RGEVSGSQA 64
+AERIRSAC R L + + D PV P+ LL DGS VAVPV +G A
Sbjct 12 SAERIRSACVRGQALLAIADSTDAAPVNAPVCQLLPDGSMVVAVPVGDPVAEAAGTGVAA 71
Query 65 LLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPGP 124
+LELTD+APL +RE VR+L WIRG L +P +E+ LD IA NPNPALLQV +PR
Sbjct 72 MLELTDHAPLRLRERVRALAWIRGRLLTVPESEIPALLDRIAAVNPNPALLQVISPRSTA 131
Query 125 ADAA-----ETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLAT 179
+A + Y + RL +S V+ DATGAE V V +LLAARPDPFC IE+ L HL +
Sbjct 132 RPSAVAAPTDVSYALLRLTPDSAVLADATGAESVAVDELLAARPDPFCAIEAHWLQHLDS 191
Query 180 AHDDVVARL-VSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGL 238
AH +++ARL ++LP LRRG+ RPL +DRYG+ R+EA DGD D+RL F +PV+D+ L
Sbjct 192 AHPELLARLATTKLPPQLRRGRPRPLAVDRYGMWLRVEAADGDHDVRLSFPRPVEDVLSL 251
Query 239 SQAIRVLMGCPFRNGLRARR 258
++A+R LMGCPF NGL+ RR
Sbjct 252 NRAVRALMGCPFLNGLQPRR 271
>gi|312137673|ref|YP_004005009.1| hypothetical protein REQ_01700 [Rhodococcus equi 103S]
gi|311887012|emb|CBH46321.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=253
Score = 225 bits (573), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 128/260 (50%), Positives = 163/260 (63%), Gaps = 19/260 (7%)
Query 6 SLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS----- 60
+ P+TAER+RSA ARA A+L V DPV +HHL DG+ V P D +
Sbjct 4 TTGPSTAERVRSASARATDAVLAVAGTDPVVTSLHHLRGDGTVVVVAPSDAAVTALAWQY 63
Query 61 ---GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQV 117
G A+LELTDYAPL +REPVRSLVW+RG L +P + D +A ++P+PALL +
Sbjct 64 GPGGLPAVLELTDYAPLALREPVRSLVWLRGNLVALPDERARQLADAVAAEHPDPALLDL 123
Query 118 ETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHL 177
G A T L +ES VV D++GAE V + LLAA PDPF ++E+ L HL
Sbjct 124 -----GHGAALLT------LRLESAVVADSSGAESVAIDALLAAAPDPFQDVETVWLQHL 172
Query 178 ATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTG 237
H D+V L RLP+ LR G++RPLG+DRYGVR RIE GD D+RL F +PV D
Sbjct 173 EKDHADLVEMLARRLPSTLRTGRVRPLGIDRYGVRLRIEGTSGDHDVRLDFTEPVGDAMA 232
Query 238 LSQAIRVLMGCPFRNGLRAR 257
LS+A+R+L+GCPF NGLRAR
Sbjct 233 LSRALRILVGCPFVNGLRAR 252
>gi|325677584|ref|ZP_08157246.1| hypothetical protein HMPREF0724_15029 [Rhodococcus equi ATCC
33707]
gi|325551611|gb|EGD21311.1| hypothetical protein HMPREF0724_15029 [Rhodococcus equi ATCC
33707]
Length=253
Score = 224 bits (571), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 128/260 (50%), Positives = 163/260 (63%), Gaps = 19/260 (7%)
Query 6 SLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS----- 60
+ P+TAER+RSA ARA A+L V DPV +HHL DG+ V P D +
Sbjct 4 TTGPSTAERVRSASARATDAVLAVAGTDPVVTSLHHLRGDGTVVVVAPSDAAVTALAWQY 63
Query 61 ---GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQV 117
G A+LELTDYAPL +REPVRSLVW+RG L +P + D +A ++P+PALL +
Sbjct 64 GPGGLPAVLELTDYAPLALREPVRSLVWLRGNLVALPDERARQLADAVAAEHPDPALLDL 123
Query 118 ETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHL 177
G A T L +ES VV D++GAE V + LLAA PDPF ++E+ L HL
Sbjct 124 -----GHGAALLT------LRLESAVVADSSGAESVAIDALLAAAPDPFQDVETVWLQHL 172
Query 178 ATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTG 237
H D+V L RLP+ LR G++RPLG+DRYGVR RIE GD D+RL F +PV D
Sbjct 173 EEDHADLVEMLARRLPSTLRTGRVRPLGIDRYGVRLRIEGASGDHDVRLDFTEPVGDAMA 232
Query 238 LSQAIRVLMGCPFRNGLRAR 257
LS+A+R+L+GCPF NGLRAR
Sbjct 233 LSRALRILVGCPFVNGLRAR 252
>gi|229494085|ref|ZP_04387852.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229319018|gb|EEN84872.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=259
Score = 222 bits (566), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 126/264 (48%), Positives = 164/264 (63%), Gaps = 19/264 (7%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVD----- 55
M TS P+TAER+RSAC RA A+L +E DP +HHL +G VAVP
Sbjct 1 MATKTSTGPSTAERVRSACVRAQDAVLAIEGSDPTVTSVHHLRSNGDVVVAVPHASAAAA 60
Query 56 ---RGEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNP 112
G A+LE+TD+APL +REPVRSLVW+RG LH +P E D +A+++P+P
Sbjct 61 LAWNSGGGGLPAVLEITDHAPLRLREPVRSLVWLRGSLHAVPDYEARVLADDVASEHPHP 120
Query 113 ALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIEST 172
LL + + RL + S VV D+TGAEPV V +LL A PDPF E+E+
Sbjct 121 GLLDIGHTS-----------VLLRLSLASAVVADSTGAEPVAVDELLNASPDPFWEMETA 169
Query 173 LLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPV 232
L HL H D+V LV +LP +R G++RPLG+DRYG+R R+E GDRD+ +PF PV
Sbjct 170 WLQHLDEDHRDLVDLLVRKLPPYMRTGRVRPLGIDRYGLRLRVEDHTGDRDVWMPFANPV 229
Query 233 DDMTGLSQAIRVLMGCPFRNGLRA 256
DD LS+AIR+L+GCPF NGLR+
Sbjct 230 DDAPALSRAIRMLVGCPFLNGLRS 253
>gi|226303674|ref|YP_002763632.1| hypothetical protein RER_01850 [Rhodococcus erythropolis PR4]
gi|226182789|dbj|BAH30893.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=259
Score = 222 bits (566), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 126/264 (48%), Positives = 164/264 (63%), Gaps = 19/264 (7%)
Query 1 MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVD----- 55
M TS P+TAER+RSAC RA A+L +E DP +HHL +G VAVP
Sbjct 1 MATKTSTGPSTAERVRSACVRAQDAVLAIEGSDPTVTSVHHLRSNGDVVVAVPHASAAAA 60
Query 56 ---RGEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNP 112
G A+LE+TD+APL +REPVRSLVW+RG LH +P E D +A+++P+P
Sbjct 61 LAWNSGGGGLPAVLEITDHAPLRLREPVRSLVWLRGSLHAVPDYEARVLADDVASEHPHP 120
Query 113 ALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIEST 172
LL + + RL + S VV D+TGAEPV V +LL A PDPF E+E+
Sbjct 121 GLLDIGHTS-----------VLLRLSLASAVVADSTGAEPVAVDELLNASPDPFWEMETA 169
Query 173 LLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPV 232
L HL H D+V LV +LP +R G++RPLG+DRYG+R R+E GDRD+ +PF PV
Sbjct 170 WLQHLDEDHRDLVDLLVRKLPPYMRTGRVRPLGIDRYGLRLRVEDHTGDRDVWMPFANPV 229
Query 233 DDMTGLSQAIRVLMGCPFRNGLRA 256
DD LS+AIR+L+GCPF NGLR+
Sbjct 230 DDAPALSRAIRMLVGCPFLNGLRS 253
>gi|54022095|ref|YP_116337.1| hypothetical protein nfa1310 [Nocardia farcinica IFM 10152]
gi|54013603|dbj|BAD54973.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=253
Score = 216 bits (550), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 120/259 (47%), Positives = 159/259 (62%), Gaps = 23/259 (8%)
Query 7 LAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS------ 60
L P+TAER+RSACA A A+L + DP P +HHL G VAVP G V+
Sbjct 6 LVPSTAERVRSACAHAEQAVLALPGIDPTPTSVHHLRQCGDVVVAVPA--GSVAAVLTAN 63
Query 61 ----GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQ 116
G+ A+LELTD+APLP+REPVR+LVW+RG + +P + +A + P+P LL
Sbjct 64 SGPGGAAAVLELTDHAPLPLREPVRALVWLRGAVRTVPQSAQRALAGEVAKEFPHPELLD 123
Query 117 VETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWH 176
V T+ RL I++ V+ DATGAE V V +L +A+PDPFC++ES L H
Sbjct 124 VGH-----------GATLLRLVIDTAVMADATGAESVRVEELRSAQPDPFCQMESAWLQH 172
Query 177 LATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMT 236
L H D++ +L LP L+ G + PL +DRYG+ R+E DGD D+RLPF PVDD+
Sbjct 173 LDADHPDILEQLARHLPPRLQTGAVHPLAIDRYGLTLRVEGHDGDHDVRLPFTAPVDDVE 232
Query 237 GLSQAIRVLMGCPFRNGLR 255
LS+A+R L GCPF NGLR
Sbjct 233 ALSRAVRALAGCPFLNGLR 251
>gi|254822679|ref|ZP_05227680.1| hypothetical protein MintA_22309 [Mycobacterium intracellulare
ATCC 13950]
Length=160
Score = 209 bits (531), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 109/156 (70%), Positives = 129/156 (83%), Gaps = 3/156 (1%)
Query 3 PLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVD--RGE-V 59
PL+ APTTAERIRSAC RAGGALL +E +DPVP P+HHL+ DGSFA+A+PV+ RG +
Sbjct 4 PLSCHAPTTAERIRSACVRAGGALLAIEHDDPVPTPVHHLMDDGSFALALPVEQQRGRPI 63
Query 60 SGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVET 119
+GSQALLELTDYAPLP+REPVRSLVW+RG L Q+PPAE++ TLD+IA + P+PALL V+T
Sbjct 64 AGSQALLELTDYAPLPLREPVRSLVWVRGRLQQVPPAEILSTLDVIAAECPDPALLGVDT 123
Query 120 PRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTV 155
PR P E RYT+ RLEI SVVVTDATGAEPV V
Sbjct 124 PRCAPPGGQEQRYTLLRLEIASVVVTDATGAEPVAV 159
>gi|333917762|ref|YP_004491343.1| hypothetical protein AS9A_0083 [Amycolicicoccus subflavus DQS3-9A1]
gi|333479983|gb|AEF38543.1| hypothetical protein AS9A_0083 [Amycolicicoccus subflavus DQS3-9A1]
Length=262
Score = 197 bits (500), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 122/262 (47%), Positives = 155/262 (60%), Gaps = 23/262 (8%)
Query 8 APTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVD--------RGEV 59
P AER+RSA +RA A+L ++ DPV +HH G +AVP D +
Sbjct 10 GPNAAERVRSAFSRARTAVLALDGTDPVATSVHHFDGIGGMIIAVPEDCAATALTWQACT 69
Query 60 SGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVET 119
+G A+LELTD AP+ +REPVR+LVWIRG L I P L T IA +P+ LL V
Sbjct 70 AGMPAVLELTDEAPVELREPVRALVWIRGQLFPIDPDALAATAARIAATSPHVELLDVGH 129
Query 120 PRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLAT 179
T+ RLE+ S VV D TGAE V V L AA PDPFC +E+ L HL +
Sbjct 130 -----------GMTLLRLEMTSTVVADHTGAEAVPVDALAAAEPDPFCYVETCWLRHLDS 178
Query 180 AHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIE----ARDGDRDIRLPFHKPVDDM 235
H D++A + +LPA R G IRPLGLDRYG+R R+E + + D+DIR+ F PV
Sbjct 179 DHTDMLAMITRKLPASARAGTIRPLGLDRYGLRLRVEHSGASDNADQDIRIAFPAPVSTP 238
Query 236 TGLSQAIRVLMGCPFRNGLRAR 257
LSQA+R+LMGCPF NG+RAR
Sbjct 239 DQLSQALRILMGCPFLNGMRAR 260
>gi|343926274|ref|ZP_08765783.1| hypothetical protein GOALK_056_01420 [Gordonia alkanivorans NBRC
16433]
gi|343763903|dbj|GAA12709.1| hypothetical protein GOALK_056_01420 [Gordonia alkanivorans NBRC
16433]
Length=237
Score = 194 bits (494), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 105/247 (43%), Positives = 153/247 (62%), Gaps = 18/247 (7%)
Query 15 IRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVS------GSQALLEL 68
I++AC R G A+L +E D P+ + HL +F V VP D V+ G+ A+LE+
Sbjct 2 IQTACRRVGSAILAIEGADTTPIGVVHLFESQAF-VLVPTDGDAVAAVDGAEGTPAMLEV 60
Query 69 TDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPGPADAA 128
TD+AP+ +RE VRS++W+ G LH++P + IA ++P+ LL +
Sbjct 61 TDWAPIDLRERVRSVIWLNGTLHEVPRDLERDLAIEIAGEHPDDGLLDIGHG-------- 112
Query 129 ETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVARL 188
+M RL+++S V+ ++GA V+ A+L A PDPF E E+ L HL H D+V +L
Sbjct 113 ---ASMLRLQVDSAVLASSSGATSVSAAELADATPDPFWECEAGWLEHLDADHADLVGQL 169
Query 189 VSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRVLMGC 248
+LP LR+G++RPLGLDR+G+RFRIE DGD D+RLPF +PV D+ LS+A+R L GC
Sbjct 170 ARKLPTDLRQGRVRPLGLDRFGIRFRIEGADGDSDVRLPFPRPVSDVFELSRALRNLAGC 229
Query 249 PFRNGLR 255
PF N +R
Sbjct 230 PFMNSMR 236
>gi|262200207|ref|YP_003271415.1| hypothetical protein Gbro_0174 [Gordonia bronchialis DSM 43247]
gi|262083554|gb|ACY19522.1| hypothetical protein Gbro_0174 [Gordonia bronchialis DSM 43247]
Length=263
Score = 177 bits (448), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 114/266 (43%), Positives = 158/266 (60%), Gaps = 33/266 (12%)
Query 9 PTTAERIRSACARAGGALLVVEREDPVPVPIH------------HLLYDGSFAVAVPVDR 56
PT AE I++AC R A+L VE DP P HL +F V VP
Sbjct 9 PTDAEMIQTACRRVSDAILAVEPTDPAVAPAEQSPVDPVTVDVVHLFESQAF-VLVPTAG 67
Query 57 GEVS-------GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDL-IATD 108
++ G A+LE+TD AP+ +RE VRSL+W++G LHQ+P A+L L + IA +
Sbjct 68 ATLAAVSAAPDGVAAMLEITDCAPIDLRERVRSLIWLKGDLHQVP-ADLERDLAIEIAAE 126
Query 109 NPNPALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCE 168
+P+ LL + R +M RL+I+S V+ ++GA V+ ++L A PDPF E
Sbjct 127 HPDGGLLDIGHGR-----------SMLRLQIDSAVIASSSGAASVSASELAGASPDPFWE 175
Query 169 IESTLLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPF 228
E + HL + H DVV +L+ RLP LR+G++RPLGLDR+G+RFRIE+ +GD D+RLPF
Sbjct 176 YEHGWISHLDSDHADVVGQLIRRLPRHLRKGRVRPLGLDRFGIRFRIESAEGDSDVRLPF 235
Query 229 HKPVDDMTGLSQAIRVLMGCPFRNGL 254
+PV D+ LS A+R L GCPF N L
Sbjct 236 GRPVSDVYELSHALRSLAGCPFMNSL 261
>gi|296137892|ref|YP_003645135.1| hypothetical protein Tpau_0142 [Tsukamurella paurometabola DSM
20162]
gi|296026026|gb|ADG76796.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=245
Score = 161 bits (407), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 93/249 (38%), Positives = 141/249 (57%), Gaps = 15/249 (6%)
Query 9 PTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVSGSQALLEL 68
PT AER+R+ A+LV + +P+ + +HH+L D +AV D + G++A++E+
Sbjct 9 PTPAERVRTVAVVPHAAVLVADGHEPITIALHHVLGD-RLVIAVADDTPWLDGARAMVEI 67
Query 69 TDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPGPADAA 128
D +PL +RE RSLVW+ G L ++ A+ +A +NP +LL V
Sbjct 68 NDISPLALRERTRSLVWLSGNLSEV--ADGAALAARVAIENPLESLLDVGNG-------- 117
Query 129 ETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVARL 188
+ + +E+ V+ D GA V+ +L AA PDPF E+ L H+ H ++V +L
Sbjct 118 ---SRLLTMPVETAVLADTAGASSVSGDELAAADPDPFAGYEAAWLAHVENDHPEMVGQL 174
Query 189 VSRLPAPLRRGQIRPLGLDRYGVRFRIEARD-GDRDIRLPFHKPVDDMTGLSQAIRVLMG 247
R+P LR +IR LG+DR+G+R R E + D D+RL F +P DM L + IR+L+G
Sbjct 175 ARRIPGKLRNHRIRLLGIDRFGIRLRAEHHELSDVDVRLNFAQPATDMAALQRGIRILLG 234
Query 248 CPFRNGLRA 256
CPF NGLRA
Sbjct 235 CPFLNGLRA 243
>gi|134096752|ref|YP_001102413.1| hypothetical protein SACE_0134 [Saccharopolyspora erythraea NRRL
2338]
gi|291008438|ref|ZP_06566411.1| hypothetical protein SeryN2_28286 [Saccharopolyspora erythraea
NRRL 2338]
gi|133909375|emb|CAL99487.1| hypothetical protein SACE_0134 [Saccharopolyspora erythraea NRRL
2338]
Length=259
Score = 154 bits (388), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 99/255 (39%), Positives = 134/255 (53%), Gaps = 27/255 (10%)
Query 9 PTTAERIRSACARAGGALLVVEREDPVPVP--IHHLLYDGS-----------FAVAVPVD 55
P+ AER R+ R G +L+ E+ + +HH+ DG+ A A
Sbjct 12 PSPAERARTIARRGGKGVLMPSGENSARIAPLLHHVHPDGAATVLLADEHPLIASAWQAP 71
Query 56 RGEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALL 115
RGE++ A+LE+ D P+ +REPVR L+W+ G L + P + + +A P+ LL
Sbjct 72 RGELT---AMLEVADPTPVRLREPVRGLLWLTGWLRVLEPEQARAEVVRVAEQRPDSRLL 128
Query 116 QVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLW 175
DA + RLE S+V+ DA G V D LAA PDPFC +E L
Sbjct 129 ----------DAGHGASVL-RLESASMVLADAEGTASVQPEDFLAASPDPFCLMEDGWLR 177
Query 176 HLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDM 235
HL +H DVV L LP LR G IRPLGLD+YG+R R+E+ DGD D+RL F + V D
Sbjct 178 HLELSHRDVVGLLARHLPERLRGGHIRPLGLDKYGLRLRVESADGDHDVRLAFSRTVTDA 237
Query 236 TGLSQAIRVLMGCPF 250
L +R L+GCPF
Sbjct 238 EHLGVELRRLVGCPF 252
>gi|326384321|ref|ZP_08206002.1| hypothetical protein SCNU_15359 [Gordonia neofelifaecis NRRL
B-59395]
gi|326196919|gb|EGD54112.1| hypothetical protein SCNU_15359 [Gordonia neofelifaecis NRRL
B-59395]
Length=254
Score = 145 bits (367), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 97/254 (39%), Positives = 135/254 (54%), Gaps = 24/254 (9%)
Query 5 TSLAPTTAERIRSACARAGGALLVVEREDPVPVP-------IHHLLYDGSFAV------A 51
T LAP AE I++AC RA L V D P + HL + +F + A
Sbjct 3 TILAPCDAEMIQTACRRAHSGTLSVSALDAAAEPHGCDTVAMVHLFDNDAFLLVSDESPA 62
Query 52 VPVDRGEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPN 111
+ R + A++E+ D AP+ +RE VRSL+W+ G LH++P E IA D+P+
Sbjct 63 LQTLRRQAPDLTAMVEVIDVAPVQMRERVRSLIWLSGTLHEVPARLERELAVEIAGDHPD 122
Query 112 PALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIES 171
LL V R T+ RL +ES V+ +TGA V V + AA PD F E E+
Sbjct 123 EQLLDVGHGR-----------TLIRLALESAVIATSTGAGGVEVDAIAAAEPDLFWEYET 171
Query 172 TLLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKP 231
L HL H D++ +L RLP +R G+ RPLGLDR+G+ FR+E + D D+R+PF +P
Sbjct 172 DWLVHLDCDHQDLIHQLAQRLPEHVRGGRARPLGLDRFGITFRVEIGNRDEDVRMPFARP 231
Query 232 VDDMTGLSQAIRVL 245
V + LS AI L
Sbjct 232 VSRIPELSAAIHSL 245
>gi|256374246|ref|YP_003097906.1| hypothetical protein Amir_0086 [Actinosynnema mirum DSM 43827]
gi|255918549|gb|ACU34060.1| hypothetical protein Amir_0086 [Actinosynnema mirum DSM 43827]
Length=252
Score = 145 bits (365), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 97/250 (39%), Positives = 128/250 (52%), Gaps = 18/250 (7%)
Query 8 APTTAERIRSACARAGGALLV--VEREDPVPVPIHHLLYDGSFAVAVP-----VDRGEVS 60
AP AER R+ R G A L+ ED V +HH+ G + +P V R +
Sbjct 9 APHPAERARTIATRGGRAALLPPDGAEDRVVPELHHVHACGEATLLLPDEHPLVARAALG 68
Query 61 GSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETP 120
++LE+ D+A +P+REPVR L+WI G L + + E +A + P+P LL V
Sbjct 69 EVTSMLEIADHAAVPLREPVRGLLWITGWLRALDGPDAREACVEVAEERPDPRLLDVGHG 128
Query 121 RPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATA 180
T RL S+VV DA + A PDPFC E L HL +
Sbjct 129 -----------LTALRLVPASLVVADAETTTSLRPEVFAQAEPDPFCAHEDHWLRHLELS 177
Query 181 HDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ 240
H DVV LV LP LR G +RPLGLDR+G+R R+E D D D+R+ F +PV + LS
Sbjct 178 HRDVVGLLVQHLPEGLRGGHVRPLGLDRFGLRLRVELEDTDHDVRIAFSRPVATPSELSS 237
Query 241 AIRVLMGCPF 250
+R LMGCPF
Sbjct 238 ELRRLMGCPF 247
>gi|302531511|ref|ZP_07283853.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302440406|gb|EFL12222.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=259
Score = 144 bits (364), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 97/260 (38%), Positives = 135/260 (52%), Gaps = 31/260 (11%)
Query 8 APTTAERIRSACARAGGALLV---------VEREDPVPVPIHHLLYDGSFAVAVPVDRGE 58
AP AER ++ R G A ++ R +PV +HH+ + GS ++ +P +
Sbjct 6 APNPAERAKTIATRGGPATIMPTVDSAGCEAARVEPV---LHHVHHSGSVSILLPDEHPM 62
Query 59 VSGSQ--------ALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNP 110
VS S+ ++EL D+AP+ +REPVR L+WI G L + IA P
Sbjct 63 VSASRQAQRGELAVMVELADHAPVALREPVRGLLWITGWLRPLTDVSARARAVSIAEQRP 122
Query 111 NPALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIE 170
+ LL V T+ RL S+V+ DA G + AA PDPF + E
Sbjct 123 DHRLLDVGHG-----------LTLLRLTPASLVLADAEGTHSLRPHMFSAAPPDPFHDYE 171
Query 171 STLLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHK 230
+ L HL + H DVV +L LPA LR G+IRPLGLDRYG+R R+E+ GD D+RL F +
Sbjct 172 AEWLRHLESDHPDVVEQLARHLPADLRGGRIRPLGLDRYGLRLRVESTAGDHDVRLAFSR 231
Query 231 PVDDMTGLSQAIRVLMGCPF 250
VD L+ +R L+GCPF
Sbjct 232 TVDSPPQLAAELRRLLGCPF 251
>gi|300782086|ref|YP_003762377.1| hypothetical protein AMED_0151 [Amycolatopsis mediterranei U32]
gi|299791600|gb|ADJ41975.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340523444|gb|AEK38649.1| hypothetical protein RAM_00770 [Amycolatopsis mediterranei S699]
Length=264
Score = 142 bits (358), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 98/264 (38%), Positives = 135/264 (52%), Gaps = 33/264 (12%)
Query 8 APTTAERIRSACARAGGALLV-------VEREDPVPVPIHHLLYDGSFAV---------- 50
AP AER ++ R G A L+ + E VPV +HH+ GS +V
Sbjct 13 APNPAERAKTIATRNGPASLLPTCDRADLNGERVVPV-LHHVHRSGSVSVLLADDHPMVR 71
Query 51 -AVPVDRGEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDN 109
A RGE++ ++E+ D AP+ +REP+R L+WI G L + P IA
Sbjct 72 AAKQAQRGELA---VMVEVADQAPVDLREPIRGLLWITGWLRPLSPVSARARAVAIAETR 128
Query 110 PNPALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEI 169
P+ LL V T+ RL S+V+ DA G + AA PDPF +
Sbjct 129 PDERLLDVGHG-----------VTLLRLTPASLVLADAEGTHSLRPHMFSAAPPDPFHDY 177
Query 170 ESTLLWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFH 229
E+ L HL + H DVV +L LP+ LR G+IRPLGLDR+G+R R+E+ GD D+RL F
Sbjct 178 EAGWLRHLESDHSDVVEQLAKHLPSALRGGRIRPLGLDRFGLRLRVESDTGDHDVRLAFS 237
Query 230 KPVDDMTGLSQAIRVLMGCPFRNG 253
K V+ L+ +R L+GCPF G
Sbjct 238 KSVESPAQLASELRRLVGCPFLRG 261
>gi|331694054|ref|YP_004330293.1| hypothetical protein Psed_0164 [Pseudonocardia dioxanivorans
CB1190]
gi|326948743|gb|AEA22440.1| hypothetical protein Psed_0164 [Pseudonocardia dioxanivorans
CB1190]
Length=275
Score = 129 bits (325), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 100/265 (38%), Positives = 133/265 (51%), Gaps = 32/265 (12%)
Query 9 PTTAERIRSACARAGGALLV-VEREDPVPVPIHHLLYDGSFAVAVPVD------------ 55
PT AER RS R G A +V +P+ HH+L DGS AV + D
Sbjct 12 PTDAERARSITTRGGRAAIVGTGGPEPIVPAFHHVLADGS-AVLLLDDHCALVEQARLDP 70
Query 56 RGEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALL 115
RGEV+ +LEL D+A + +REPVR+L+WI G L AE A P+ LL
Sbjct 71 RGEVA---VMLELADHAAVDLREPVRALLWITGWLRVPDDAEARRAAVHAAQARPDHRLL 127
Query 116 QVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLW 175
+ T+ RLE SVV+ DA G ++ DL AARPDPFC +E L
Sbjct 128 DLGHG-----------ATLVRLEPGSVVLADAEGTASLSPVDLAAARPDPFCILEDRWLG 176
Query 176 HLATAHDDVVARLVSRLPAPLRR---GQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPV 232
HL + H +V L LPAPLR ++RPLG+DR G+R R+E GD D+R+ +
Sbjct 177 HLESTHPEVFVALARHLPAPLRATSGARVRPLGVDRCGIRLRVETPGGDHDVRIAWPAEA 236
Query 233 DDMTGLSQAIRVLMGCPFRNGLRAR 257
+ L + L+GC + G AR
Sbjct 237 TTVHELRSRLTALVGCGY-GGRHAR 260
>gi|257054185|ref|YP_003132017.1| hypothetical protein Svir_01020 [Saccharomonospora viridis DSM
43017]
gi|256584057|gb|ACU95190.1| hypothetical protein Svir_01020 [Saccharomonospora viridis DSM
43017]
Length=265
Score = 127 bits (318), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/264 (35%), Positives = 128/264 (49%), Gaps = 32/264 (12%)
Query 12 AERIRSACARAGGALLV------VEREDPVPVPIHHLLYDGSFAVAVPVDRGEVSGSQA- 64
AER +S R G A L+ + + PV + HL +V +P D +G Q
Sbjct 15 AERAKSIAVRGGPASLLPALGHRTQSDRTTPV-LWHLHAGDDLSVVLPTDDPLAAGVQGN 73
Query 65 -------LLELTDYAPLPVREPVRSLVWIRGCLHQIPP----AELVETLDLIATDNPNPA 113
LE+ D AP+P+R+PVR L+W+ G L + A + D + P+P
Sbjct 74 ATAELAVTLEIIDEAPVPLRQPVRGLLWLTGWLRGLDDRTARARALTIADSVT--EPDPR 131
Query 114 LLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTL 173
LL + +M R SVV++D+ G ++ AA DP CE E
Sbjct 132 LLDLGHG-----------LSMLRFTPASVVLSDSEGTHSLSPVTFDAAVADPLCEYEGRW 180
Query 174 LWHLATAHDDVVARLVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVD 233
L HL H DVV +L LP L G+IRPLG+DR G+R R+E D D D+RL F +PV+
Sbjct 181 LQHLEHKHTDVVEQLSRHLPEELCGGRIRPLGVDRCGIRLRVETADDDHDVRLAFSRPVE 240
Query 234 DMTGLSQAIRVLMGCPFRNGLRAR 257
+ L+ +R L+GCPF N R
Sbjct 241 NPPQLAVELRKLVGCPFLNRFDER 264
>gi|324998836|ref|ZP_08119948.1| hypothetical protein PseP1_08734 [Pseudonocardia sp. P1]
Length=259
Score = 123 bits (308), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 93/249 (38%), Positives = 124/249 (50%), Gaps = 22/249 (8%)
Query 12 AERIRSACARAGGALLVVERE-DPVPVPIHHLLYDGSFAVAVPVD-------RGEVSGSQ 63
AER RS R G A LV E +P +HH DG+ + +P D R G
Sbjct 15 AERARSLVMRGGTAALVGTGEPEPCAPLMHHTWPDGTTDLLLPDDHRVREQARLSADGLP 74
Query 64 ALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPG 123
+LELT P+P+ EPVR L+W+ G LH+ P E +A P+P LL
Sbjct 75 VMLELTGRTPVPLPEPVRELLWLLGRLHEPDPRTGRERALRLAEKAPHPNLLD------- 127
Query 124 PADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDD 183
A T+ RL S V +DA G V+ A+L AA PDPFC++E L HL AH +
Sbjct 128 ----AGRGATLLRLHPSSGVYSDAEGCASVSPAELAAADPDPFCQVEQPWLEHLDQAHPE 183
Query 184 VVARLVSRLPAPLRR--GQIRPLGLDRYGVRFRIEA-RDGDRDIRLPFHKPVDDMTGLSQ 240
++ L LP LR G+IRP+G+DR G+R R+ A G +D+RL F L +
Sbjct 184 MLCALRRHLPHGLRGVDGRIRPIGVDRCGLRVRVPAPGGGTQDVRLSFSSEATTPAELQK 243
Query 241 AIRVLMGCP 249
L+GCP
Sbjct 244 RFAELVGCP 252
>gi|329936444|ref|ZP_08286209.1| hypothetical protein SGM_1701 [Streptomyces griseoaurantiacus
M045]
gi|329304240|gb|EGG48121.1| hypothetical protein SGM_1701 [Streptomyces griseoaurantiacus
M045]
Length=240
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/251 (32%), Positives = 112/251 (45%), Gaps = 48/251 (19%)
Query 6 SLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVSGS--- 62
+ APT AE RS A A + E + H + DG + VP D G + +
Sbjct 8 TAAPTAAEHARSVTAAAWSCAVTSEGGKEEFLGAHRVEADGGILLTVPEDSGLRTAALCA 67
Query 63 -----QALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQV 117
A+LE D AP+PVR +R+ +W+ G T L
Sbjct 68 PRGEPAAVLEFADVAPVPVRARIRARLWLAGWF----------------TAREEHLLF-- 109
Query 118 ETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHL 177
RP TR ++R A+GA V + + AA PDP E E+ LL HL
Sbjct 110 ---RP-------TRMVLRR----------ASGAVLVDLDEYAAAAPDPLAEAEARLLTHL 149
Query 178 ATAHDDVVARLVSRLPAPLRRG--QIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDM 235
A H D V RL + RG ++RPL +DR+G+ R+E G D+RL FH+P DD+
Sbjct 150 ADCHPDAVERLSRLMEHGHLRGAVRVRPLAVDRHGLTLRVERTRGHGDVRLTFHRPADDV 209
Query 236 TGLSQAIRVLM 246
L++ I VL+
Sbjct 210 AQLTERIHVLL 220
>gi|294633390|ref|ZP_06711949.1| conserved hypothetical protein [Streptomyces sp. e14]
gi|292831171|gb|EFF89521.1| conserved hypothetical protein [Streptomyces sp. e14]
Length=240
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 83/262 (32%), Positives = 114/262 (44%), Gaps = 47/262 (17%)
Query 10 TTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAV--------AVPVDRGEVSG 61
T AE +RS ARA L + +D + +H + +G + A V G
Sbjct 13 TAAEYVRSVLARAVSLSLSTDGQDYDLIGMHSVDAEGRVILQSQPGCPLAAQVAAAPDGG 72
Query 62 SQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETPR 121
A +E TD AP +R+ VR+ V + G ++ T R
Sbjct 73 LSARMEFTDIAPTALRDRVRAQVTVSG---------------------------RLVTGR 105
Query 122 PGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATAH 181
P D R + E V + TG V VADL ARPDP E+ LL HLA AH
Sbjct 106 PEHGDG-------LRFDAEQVTLRTVTGCHDVDVADLTRARPDPLAVEEAALLTHLADAH 158
Query 182 DDVVARLVSRLPAPLRRGQIR--PLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLS 239
+D+VA L + + L RG +R PL +DRYG+ R E R+G D+RL F D
Sbjct 159 EDMVAALAALAGSRLPRGVVRVMPLAVDRYGIALRCEYREGHCDVRLLFPVVARDAAEAG 218
Query 240 QAIRVLM---GCPFRNGLRARR 258
+ +R+L+ GC ARR
Sbjct 219 ERVRLLLTASGCAHHPHPSARR 240
>gi|254383599|ref|ZP_04998949.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194342494|gb|EDX23460.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length=241
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/185 (36%), Positives = 87/185 (48%), Gaps = 11/185 (5%)
Query 64 ALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQVETPRPG 123
A++E+TD AP+ V +R W+ G L ++ + L+A +P LL V
Sbjct 37 AVIEITDVAPVSVPHRIRGRAWLAGWLTRVRGDDRAVCAALMAERHPVGELLGVH----- 91
Query 124 PADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDD 183
Y + RLE+ + V D G + V DL AA PDP E+ +L HLA+AH D
Sbjct 92 ----GGASYVLLRLEVGEISVDDLWGTDHVDPEDLAAAEPDPLVHHETEVLQHLASAHAD 147
Query 184 VVARLVSRLPAPLRRGQIR-PLGLDRYGVRFRIEARD-GDRDIRLPFHKPVDDMTGLSQA 241
VA L L + G PL LDR+G+R R D R F PV D GL QA
Sbjct 148 RVADLCGLLGSREADGMAAVPLALDRFGLRVRFTGGGVSSFDARFDFPDPVADACGLRQA 207
Query 242 IRVLM 246
+R L
Sbjct 208 MRRLF 212
>gi|290955498|ref|YP_003486680.1| hypothetical protein SCAB_9301 [Streptomyces scabiei 87.22]
gi|260645024|emb|CBG68110.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length=241
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/214 (33%), Positives = 96/214 (45%), Gaps = 48/214 (22%)
Query 37 VPIHHLLYDGSFAVAVPVDRGEVSGS--------QALLELTDYAPLPVREPVRSLVWIRG 88
V H + DG + VP D V+ + A+LE D AP+PVR +R+ +WI G
Sbjct 39 VGAHGVTEDGRVTLRVPEDSTLVAAAICAPRGEPSAVLEFADVAPVPVRGRIRARLWIAG 98
Query 89 CLHQIPPAELVETLDLIATDNPNPALLQVETPRPGPADAAETRYTMQRLEIESVVVTDAT 148
TP G + TR +V+ + +
Sbjct 99 WF----------------------------TPVDGDLEFRPTR----------IVLREPS 120
Query 149 GAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVARLVSRLPAPLRRG--QIRPLGL 206
GA V + AA PDP E+ LL HLA AH D V RL +P G ++RPL +
Sbjct 121 GAVLVDPDEFAAAEPDPLVTAEARLLAHLADAHPDAVERLTRLVPHDSLHGAVRVRPLAV 180
Query 207 DRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQ 240
DR+G+ R+E GD D+RL FHKP DD+ L++
Sbjct 181 DRHGLTLRVERARGDGDVRLTFHKPADDLAQLTE 214
>gi|297204232|ref|ZP_06921629.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|197714076|gb|EDY58110.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length=241
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/253 (30%), Positives = 107/253 (43%), Gaps = 52/253 (20%)
Query 6 SLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVPVDRGEVSGS--- 62
+ AP+ AE RS A + + E V H + DG + VP V+ +
Sbjct 8 TAAPSAAEHARSVLAASWSCAVTAEGGREEFVGAHTVAEDGRVTLEVPEGSTLVAAAICA 67
Query 63 -----QALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIATDNPNPALLQV 117
A+LE D AP+P+R +R+ +WI G
Sbjct 68 PRGEPSAVLEFADVAPVPLRSRIRARLWIAGWF--------------------------- 100
Query 118 ETPRPGPADAAETRYTMQRLEIES--VVVTDATGAEPVTVADLLAARPDPFCEIESTLLW 175
R G RLE E+ VV+ +GA V + + AA PDP E+ LL
Sbjct 101 -AARDG------------RLEFEATRVVLRRPSGAVVVDLDEFAAAAPDPLATAEARLLT 147
Query 176 HLATAHDDVVARLVSRLPAPLRRGQIR--PLGLDRYGVRFRIEARDGDRDIRLPFHKPVD 233
HLA H D + RL + G +R PL +DR+G+ RIE D+RLPFH P D
Sbjct 148 HLADCHPDAIERLTRLVDPDSLHGAVRVQPLAVDRHGLTLRIERARAHGDVRLPFHAPAD 207
Query 234 DMTGLSQAIRVLM 246
D+ L++ + VL+
Sbjct 208 DVARLTERMHVLL 220
>gi|117165013|emb|CAJ88565.1| conserved hypothetical protein [Streptomyces ambofaciens ATCC
23877]
Length=242
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 67/201 (34%), Positives = 96/201 (48%), Gaps = 43/201 (21%)
Query 48 FAVAVPVDRGEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVETLDLIAT 107
A A+ RGE S A+LE D AP+PVR +R+ +W+ G A
Sbjct 62 LAAAICAPRGEPS---AVLEFADVAPVPVRGRIRARLWLAG---------------WFAP 103
Query 108 DNPNPALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFC 167
D+ + A RP TR ++R +GA V + + + ARPDP
Sbjct 104 DDGHLAF------RP-------TRVVLRR----------PSGAVVVGLDEFVTARPDPLA 140
Query 168 EIESTLLWHLATAHDDVVARLVSRLPAPLRRG--QIRPLGLDRYGVRFRIEARDGDRDIR 225
E+ LL HLA H D V RL + A G ++RPL +DR+G+ R+E D+R
Sbjct 141 LAEARLLTHLADCHGDAVERLTRLVHADSLHGAVRVRPLAVDRHGLTLRVERVSAHGDVR 200
Query 226 LPFHKPVDDMTGLSQAIRVLM 246
LPFH P DD+ L++ + VL+
Sbjct 201 LPFHAPADDVAQLTERVHVLL 221
>gi|297564388|ref|YP_003683361.1| hypothetical protein Ndas_5476 [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
gi|296848837|gb|ADH70855.1| conserved hypothetical protein [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
Length=254
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 76/246 (31%), Positives = 114/246 (47%), Gaps = 26/246 (10%)
Query 15 IRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAVAVP----VDRGEVSGSQALLELTD 70
+ +A R G A V E P H+ DG+ ++ +P + G + E TD
Sbjct 24 VAAASDRFGEAGTVYLEEAPC-----HVHADGAVSLLLPDGHPLTDPRPRGPVVMAEFTD 78
Query 71 YAPLPVREPVRSLVWIRGCLHQIPPAEL---VETLDLIATDNPNPALLQVETPRPGPADA 127
+P+ +R+ VR+++W+ G + + A L ATD+ LL++
Sbjct 79 LSPVLMRDRVRAVLWVSGAVEALEHAHARVRAARLPEAATDD---RLLEIGY-------- 127
Query 128 AETRYTMQRLEIESVVVTDATGAEPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVAR 187
TM L VV +D GA + A+L AARPDPFC E+ L HL H D+V
Sbjct 128 ---GLTMVVLRTTLVVQSDHDGAHVLDPAELAAARPDPFCLWEAPWLRHLDEDHPDLVGD 184
Query 188 LVSRLPAPLRRGQIRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRVLMG 247
L++ R G+ RPLG+DR G+R R+E G D+ LPF +P ++ + L G
Sbjct 185 LLAAASGAARGGRPRPLGVDRLGLRLRVETARGHHDVHLPFTRPARTPDDVAVQVHRLAG 244
Query 248 CPFRNG 253
P G
Sbjct 245 HPVPQG 250
Lambda      K        H
0.322       0.139    0.420
Gapped
Lambda      K        H
0.267       0.0410   0.140
Effective search space used: 384134341080
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
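
Note on the statistics above: the gapped Lambda and K values and the effective search space are the quantities that relate each raw score (the number in parentheses on a "Score =" line) to its bit score and Expect value, through the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'). The following is a minimal sketch of that arithmetic, not part of the BLAST output. The function names are hypothetical, and it assumes the single search-space figure printed in the footer applies to every hit, which is only approximately true when compositional matrix adjustment is used, so computed values should be read as close estimates.

```python
import math

# Values copied from the report footer (gapped parameters).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 384134341080  # "Effective search space used"


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a bit score: (lam*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE,
                 lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Estimate the E-value as search_space * 2**(-bit_score)."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    # Raw score 240 is the Streptomyces griseoaurantiacus hit in this report.
    raw = 240
    print(f"raw={raw} bits={bit_score(raw):.1f} E={expect_value(raw):.0e}")
```

For the raw score of 240 this comes out near 97.1 bits with an E-value on the order of 2e-18, in line with the values reported for that hit. Bit scores of 100 or more appear in the report as integers, so small differences in the last digit are expected there.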