BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0738
Length=182
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value
gi|15607878|ref|NP_215252.1| hypothetical protein Rv0738 [Mycoba... 361 3e-98
gi|340625759|ref|YP_004744211.1| hypothetical protein MCAN_07431... 359 8e-98
gi|308231623|ref|ZP_07413187.2| hypothetical protein TMAG_02622 ... 355 2e-96
gi|240167757|ref|ZP_04746416.1| hypothetical protein MkanA1_0047... 256 9e-67
gi|183981096|ref|YP_001849387.1| hypothetical protein MMAR_1076 ... 246 8e-64
gi|118616611|ref|YP_904943.1| hypothetical protein MUL_0834 [Myc... 245 2e-63
gi|289749247|ref|ZP_06508625.1| LOW QUALITY PROTEIN: conserved h... 241 3e-62
gi|300784522|ref|YP_003764813.1| hypothetical protein AMED_2616 ... 125 2e-27
gi|302554300|ref|ZP_07306642.1| conserved hypothetical protein [... 103 1e-20
gi|29829303|ref|NP_823937.1| hypothetical protein SAV_2761 [Stre... 102 2e-20
gi|256391714|ref|YP_003113278.1| hypothetical protein Caci_2519 ... 97.8 6e-19
gi|297159040|gb|ADI08752.1| hypothetical protein SBI_05632 [Stre... 97.4 8e-19
gi|169631005|ref|YP_001704654.1| hypothetical protein MAB_3926 [... 95.5 3e-18
gi|290954992|ref|YP_003486174.1| MerR family transcriptional reg... 87.0 1e-15
gi|302530399|ref|ZP_07282741.1| predicted protein [Streptomyces ... 85.9 2e-15
gi|337764322|emb|CCB73031.1| conserved protein of unknown functi... 84.7 5e-15
gi|111021396|ref|YP_704368.1| hypothetical protein RHA1_ro04424 ... 81.6 5e-14
gi|291299774|ref|YP_003511052.1| hypothetical protein Snas_2270 ... 80.5 9e-14
gi|324997761|ref|ZP_08118873.1| hypothetical protein PseP1_03295... 79.7 2e-13
gi|290956238|ref|YP_003487420.1| hypothetical protein SCAB_17241... 79.7 2e-13
gi|296270489|ref|YP_003653121.1| hypothetical protein Tbis_2526 ... 79.7 2e-13
gi|312194755|ref|YP_004014816.1| hypothetical protein FraEuI1c_0... 78.6 3e-13
gi|302556180|ref|ZP_07308522.1| conserved hypothetical protein [... 78.6 4e-13
gi|258651068|ref|YP_003200224.1| hypothetical protein Namu_0824 ... 78.2 4e-13
gi|111224919|ref|YP_715713.1| hypothetical protein FRAAL5554 [Fr... 78.2 5e-13
gi|302525213|ref|ZP_07277555.1| predicted protein [Streptomyces ... 76.6 1e-12
gi|159037176|ref|YP_001536429.1| hypothetical protein Sare_1542 ... 76.6 1e-12
gi|331697737|ref|YP_004333976.1| hypothetical protein Psed_3957 ... 75.5 3e-12
gi|328880684|emb|CCA53923.1| hypothetical protein SVEN_0636 [Str... 75.1 4e-12
gi|226363750|ref|YP_002781532.1| hypothetical protein ROP_43400 ... 74.3 7e-12
gi|291297734|ref|YP_003509012.1| hypothetical protein Snas_0200 ... 73.9 9e-12
gi|86741006|ref|YP_481406.1| hypothetical protein Francci3_2309 ... 73.9 1e-11
gi|134100366|ref|YP_001106027.1| hypothetical protein SACE_3831 ... 73.2 2e-11
gi|297560803|ref|YP_003679777.1| hypothetical protein Ndas_1843 ... 73.2 2e-11
gi|297156190|gb|ADI05902.1| hypothetical protein SBI_02781 [Stre... 72.4 3e-11
gi|297156958|gb|ADI06670.1| hypothetical protein SBI_03549 [Stre... 72.0 4e-11
gi|271968539|ref|YP_003342735.1| hypothetical protein Sros_7305 ... 71.6 5e-11
gi|254822475|ref|ZP_05227476.1| hypothetical protein MintA_21251... 71.2 6e-11
gi|158318360|ref|YP_001510868.1| hypothetical protein Franean1_6... 71.2 6e-11
gi|297202958|ref|ZP_06920355.1| conserved hypothetical protein [... 70.9 8e-11
gi|302548167|ref|ZP_07300509.1| basic proline-rich protein [Stre... 69.7 2e-10
gi|342858908|ref|ZP_08715562.1| hypothetical protein MCOL_08528 ... 69.3 2e-10
gi|343928249|ref|ZP_08767703.1| hypothetical protein GOALK_111_0... 67.8 6e-10
gi|108743438|dbj|BAE95541.1| conserved hypothetical protein [Str... 67.8 7e-10
gi|294630269|ref|ZP_06708829.1| conserved hypothetical protein [... 67.4 8e-10
gi|117927359|ref|YP_871910.1| hypothetical protein Acel_0149 [Ac... 67.4 9e-10
gi|296166899|ref|ZP_06849316.1| conserved hypothetical protein [... 67.0 1e-09
gi|229818834|ref|YP_002880360.1| hypothetical protein Bcav_0334 ... 66.6 1e-09
gi|169631520|ref|YP_001705169.1| hypothetical protein MAB_4446 [... 66.6 2e-09
gi|271968450|ref|YP_003342646.1| hypothetical protein Sros_7213 ... 66.2 2e-09
>gi|15607878|ref|NP_215252.1| hypothetical protein Rv0738 [Mycobacterium tuberculosis H37Rv]
gi|15840147|ref|NP_335184.1| hypothetical protein MT0763 [Mycobacterium tuberculosis CDC1551]
gi|31791924|ref|NP_854417.1| hypothetical protein Mb0759 [Mycobacterium bovis AF2122/97]
71 more sequence titles
Length=182
Score = 361 bits (926), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 181/182 (99%), Positives = 182/182 (100%), Gaps = 0/182 (0%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
+DPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE
Sbjct 1 MDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
Query 61 PPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDL 120
PPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDL
Sbjct 61 PPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDL 120
Query 121 AAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT 180
AAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT
Sbjct 121 AAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT 180
Query 181 VR 182
VR
Sbjct 181 VR 182
>gi|340625759|ref|YP_004744211.1| hypothetical protein MCAN_07431 [Mycobacterium canettii CIPT
140010059]
gi|340003949|emb|CCC43083.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=182
Score = 359 bits (922), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 180/182 (99%), Positives = 181/182 (99%), Gaps = 0/182 (0%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
+DPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE
Sbjct 1 MDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
Query 61 PPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDL 120
PPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLP GEVPGQVFIGLRTTDVLTHAWDL
Sbjct 61 PPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPFGEVPGQVFIGLRTTDVLTHAWDL 120
Query 121 AAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT 180
AAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT
Sbjct 121 AAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT 180
Query 181 VR 182
VR
Sbjct 181 VR 182
>gi|308231623|ref|ZP_07413187.2| hypothetical protein TMAG_02622 [Mycobacterium tuberculosis SUMu001]
gi|308377505|ref|ZP_07479424.2| hypothetical protein TMIG_01647 [Mycobacterium tuberculosis SUMu009]
gi|308216739|gb|EFO76138.1| hypothetical protein TMAG_02622 [Mycobacterium tuberculosis SUMu001]
gi|308355614|gb|EFP44465.1| hypothetical protein TMIG_01647 [Mycobacterium tuberculosis SUMu009]
Length=178
Score = 355 bits (910), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 178/178 (100%), Positives = 178/178 (100%), Gaps = 0/178 (0%)
Query 5 MAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPAR 64
MAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPAR
Sbjct 1 MAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPAR 60
Query 65 PDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAAT 124
PDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAAT
Sbjct 61 PDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAAT 120
Query 125 GQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRTVR 182
GQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRTVR
Sbjct 121 GQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRTVR 178
>gi|240167757|ref|ZP_04746416.1| hypothetical protein MkanA1_00475 [Mycobacterium kansasii ATCC
12478]
Length=182
Score = 256 bits (655), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 135/182 (75%), Positives = 157/182 (87%), Gaps = 0/182 (0%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
+DPL+AH+RAQDAFA +LANV +Q G TPCSEWT+ DLIEHV+ GNE VG+WA P+E
Sbjct 1 MDPLVAHRRAQDAFAGVLANVSPEQHGAATPCSEWTVRDLIEHVISGNEHVGQWAQHPVE 60
Query 61 PPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDL 120
PPARPD ++AAH+ AAA AHE+FAAP GMS TFKLP GE+PGQVF+G+RT+DVLTHAWDL
Sbjct 61 PPARPDDMLAAHRTAAAAAHEVFAAPDGMSTTFKLPFGELPGQVFVGIRTSDVLTHAWDL 120
Query 121 AAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT 180
AAATGQ TDLDPELA E+LAA RA +GPQFRGPGKPFA+E+PC ER PADQLAAFLGR
Sbjct 121 AAATGQPTDLDPELATEQLAAVRAFMGPQFRGPGKPFAEEQPCSPERAPADQLAAFLGRE 180
Query 181 VR 182
V+
Sbjct 181 VQ 182
>gi|183981096|ref|YP_001849387.1| hypothetical protein MMAR_1076 [Mycobacterium marinum M]
gi|183174422|gb|ACC39532.1| conserved protein [Mycobacterium marinum M]
Length=182
Score = 246 bits (629), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 128/182 (71%), Positives = 149/182 (82%), Gaps = 0/182 (0%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
+DPL AHQRAQDAF ++LANV ADQLG TPCSEWT++DLIEHV+GGNE VG W+
Sbjct 1 MDPLTAHQRAQDAFGSVLANVSADQLGAATPCSEWTVSDLIEHVIGGNEHVGIWSGGADR 60
Query 61 PPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDL 120
P ARPD +VAAH+A AA A ++FAAP GM+ FKLP GE+PGQVFIG+RT+DVLTHAWDL
Sbjct 61 PAARPDDMVAAHRATAAAAQQVFAAPDGMATVFKLPFGEIPGQVFIGMRTSDVLTHAWDL 120
Query 121 AAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT 180
A ATGQ +DLDP+LA ++LAA RA VGPQFRGPGKPF E+PC E PADQLAAFLGR
Sbjct 121 AVATGQPSDLDPDLATQQLAAVRAFVGPQFRGPGKPFGQEQPCSAELSPADQLAAFLGRK 180
Query 181 VR 182
V+
Sbjct 181 VQ 182
>gi|118616611|ref|YP_904943.1| hypothetical protein MUL_0834 [Mycobacterium ulcerans Agy99]
gi|118568721|gb|ABL03472.1| conserved protein [Mycobacterium ulcerans Agy99]
Length=182
Score = 245 bits (626), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 127/182 (70%), Positives = 148/182 (82%), Gaps = 0/182 (0%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
+DPL AHQRAQDAF ++LANV ADQLG TPCSEWT++DLIEHV+GGNE VG W+
Sbjct 1 MDPLTAHQRAQDAFGSVLANVSADQLGAATPCSEWTVSDLIEHVIGGNEHVGIWSGGADR 60
Query 61 PPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDL 120
P ARPD +VAAH+A AA A ++FAAP GM+ FKLP GE+PGQVFIG+RT+DVLTHAWDL
Sbjct 61 PAARPDDMVAAHRATAAAAQQVFAAPDGMATVFKLPFGEIPGQVFIGMRTSDVLTHAWDL 120
Query 121 AAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT 180
A ATGQ +DLDP+LA ++LAA R VGPQFRGPGKPF E+PC E PADQLAAFLGR
Sbjct 121 AVATGQPSDLDPDLATQQLAAVRVFVGPQFRGPGKPFGQEQPCSAELSPADQLAAFLGRK 180
Query 181 VR 182
V+
Sbjct 181 VQ 182
>gi|289749247|ref|ZP_06508625.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T92]
gi|289689834|gb|EFD57263.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T92]
Length=141
Score = 241 bits (615), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 135/138 (98%), Positives = 135/138 (98%), Gaps = 0/138 (0%)
Query 45 VGGNEQVGRWAASPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQV 104
VG EQVGRWA SPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQV
Sbjct 4 VGVTEQVGRWAPSPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQV 63
Query 105 FIGLRTTDVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCP 164
FIGLRTTDVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCP
Sbjct 64 FIGLRTTDVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCP 123
Query 165 RERPPADQLAAFLGRTVR 182
RERPPADQLAAFLGRTVR
Sbjct 124 RERPPADQLAAFLGRTVR 141
>gi|300784522|ref|YP_003764813.1| hypothetical protein AMED_2616 [Amycolatopsis mediterranei U32]
gi|299794036|gb|ADJ44411.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340525943|gb|AEK41148.1| hypothetical protein RAM_13290 [Amycolatopsis mediterranei S699]
Length=187
Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/185 (43%), Positives = 99/185 (54%), Gaps = 9/185 (4%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
+ PL A AL++ VRADQ PT C++W + +I H+ GN +V WA +
Sbjct 1 MTPLDEFDLAASTVRALVSAVRADQWALPTACADWDVRAVINHLAHGNAKVAFWAGT--G 58
Query 61 PPARPDGL------VAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVL 114
PPA PDG V A A+ A + AAPG S PLGEVPG + +R + L
Sbjct 59 PPA-PDGDYLGSAPVEAFAASVTAARAVLAAPGLFSRQVTTPLGEVPGVFLVHMRVNEYL 117
Query 115 THAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLA 174
H WD+A ATG+ TDL PELA L R+ R PG PF E P PR+ AD+LA
Sbjct 118 AHGWDIADATGRPTDLAPELAARALEQWRSRFAATPRQPGGPFGPELPPPRDATAADELA 177
Query 175 AFLGR 179
AFLGR
Sbjct 178 AFLGR 182
>gi|302554300|ref|ZP_07306642.1| conserved hypothetical protein [Streptomyces viridochromogenes
DSM 40736]
gi|302471918|gb|EFL35011.1| conserved hypothetical protein [Streptomyces viridochromogenes
DSM 40736]
Length=194
Score = 103 bits (256), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/187 (36%), Positives = 94/187 (51%), Gaps = 7/187 (3%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVG-------R 53
DP RA + AAL+ VRA++L GPTPCSE+ + L+ H+ GG ++
Sbjct 3 TDPRPLFARATEQAAALIQAVRAERLDGPTPCSEFDVRTLLSHLTGGARRIAIAGEGGDA 62
Query 54 WAASPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDV 113
AA P DG A+ A A + +A + A +LP GE+PG+ + +
Sbjct 63 VAAQPFAEGVPDDGWAVAYDEARIRAVKAWAGDDRLEAVVRLPFGEMPGRTALSAYVMET 122
Query 114 LTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQL 173
+TH WDL+ A G+ LDPE A LA A ++ + R PF +P P D+L
Sbjct 123 VTHTWDLSEALGRPLALDPEPAEFALAVAHRMLPDEQRDERTPFGSARPAPEGADTYDRL 182
Query 174 AAFLGRT 180
AA+LGRT
Sbjct 183 AAWLGRT 189
>gi|29829303|ref|NP_823937.1| hypothetical protein SAV_2761 [Streptomyces avermitilis MA-4680]
gi|29606410|dbj|BAC70472.1| hypothetical protein [Streptomyces avermitilis MA-4680]
Length=194
Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/186 (35%), Positives = 93/186 (50%), Gaps = 7/186 (3%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
DP + RA + AAL+ VR +QL GPTPC E+ + L+ H+ GG ++ A
Sbjct 3 TDPRPLYARAAEQIAALIRTVRPEQLAGPTPCGEFDVRTLLSHMAGGTRRIAVVGAGGDG 62
Query 61 PPARP-------DGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDV 113
RP DG VAA+ A + +A + A +P GE PG++ + +
Sbjct 63 LAVRPFVDGVPDDGWVAAYDEVRAEVEQSWADDARLDALVHVPWGEAPGRIALSGYVMEA 122
Query 114 LTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQL 173
+TH WDL+ A G+ LDPELA LA A ++ + RG PF P P ++L
Sbjct 123 VTHTWDLSEALGRPLGLDPELAEFALAIAHRVLPDEQRGDDVPFDSAAPAPEGADAYERL 182
Query 174 AAFLGR 179
AA+LGR
Sbjct 183 AAWLGR 188
>gi|256391714|ref|YP_003113278.1| hypothetical protein Caci_2519 [Catenulispora acidiphila DSM
44928]
gi|256357940|gb|ACU71437.1| conserved hypothetical protein [Catenulispora acidiphila DSM
44928]
Length=186
Score = 97.8 bits (242), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 67/184 (37%), Positives = 82/184 (45%), Gaps = 16/184 (8%)
Query 10 AQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPI------EPPA 63
A D+ A+L V D LG PTPC+ W + L+ H +G RW AS + E P
Sbjct 7 AFDSTMAILQKVGRDDLGTPTPCASWDVRGLVNHFIGS----ARWWASMVSGDHGLEAPE 62
Query 64 RPD----GLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWD 119
D VAA++ + V F A G +P G+ G D TH WD
Sbjct 63 GADYAAGDFVAAYEESIRVTLGAFTAEGAADRMVSVPFGDFTGSALRAFAALDQFTHGWD 122
Query 120 LAAATGQSTDLDPELAVERLAAARALVGPQFRGPG--KPFADEKPCPRERPPADQLAAFL 177
LA A G TDL PELA LA A V RG PF + P AD+LAA+L
Sbjct 123 LARALGYDTDLAPELASTLLAMAEVAVDDSLRGADGEAPFEAARQAPEGSCAADRLAAYL 182
Query 178 GRTV 181
GR V
Sbjct 183 GRQV 186
>gi|297159040|gb|ADI08752.1| hypothetical protein SBI_05632 [Streptomyces bingchenggensis
BCW-1]
Length=188
Score = 97.4 bits (241), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/183 (36%), Positives = 88/183 (49%), Gaps = 5/183 (2%)
Query 3 PLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGR---WAASPI 59
P+ A D A L+ V ++ PTPC++W + L++H+V G
Sbjct 5 PVTAFAGVIDTIAHLVEAVEEERWSAPTPCTDWNVQQLVDHLVAGQHTFAVAMGAQPPLP 64
Query 60 EPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWD 119
P P+ L + +AA F PG + T + P+GEVPG V + L+T + L H WD
Sbjct 65 APDPAPEALKKTFRTSAAALVAAFEGPGALERTVRAPIGEVPGAVALHLQTIEHLMHGWD 124
Query 120 LAAATGQSTDLDPELAVERLAA-ARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLG 178
LA A GQ D E VER AR L GPG PFA + P + P D+LAA LG
Sbjct 125 LARAIGQKALFD-EATVERETEFARGLTAQLPSGPGAPFAPSRTAPEDAPALDRLAALLG 183
Query 179 RTV 181
R +
Sbjct 184 RDI 186
>gi|169631005|ref|YP_001704654.1| hypothetical protein MAB_3926 [Mycobacterium abscessus ATCC 19977]
gi|169242972|emb|CAM64000.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=219
Score = 95.5 bits (236), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 67/174 (39%), Positives = 86/174 (50%), Gaps = 6/174 (3%)
Query 9 RAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQV-GRWAASPIEPPARPDG 67
RA DA ALLA VR DQ TPC EW + L +H+V N + GR+ A P
Sbjct 51 RASDAIEALLAAVRPDQWDAATPCEEWNLRQLADHLVEVNYSLAGRFGGLSSGTAADP-- 108
Query 68 LVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAATGQS 127
VAA++ +A + A PG + T+ P G + +R D+LTH WDLA ATG S
Sbjct 109 -VAAYRLSAQALRDALALPGVLDQTYPGPFAHTTGANQLQVRMADLLTHGWDLARATGAS 167
Query 128 TDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRTV 181
DL +L L + L G F GK F +P + P D+LAA GR V
Sbjct 168 ADLPVDLTENALGFVQKLAGA-FARSGK-FGAPQPVAEDAPALDRLAAMTGRVV 219
>gi|290954992|ref|YP_003486174.1| MerR family transcriptional regulator [Streptomyces scabiei 87.22]
gi|260644518|emb|CBG67603.1| putative MerR-family transcriptional regulator [Streptomyces
scabiei 87.22]
Length=540
Score = 87.0 bits (214), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/185 (37%), Positives = 88/185 (48%), Gaps = 13/185 (7%)
Query 4 LMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPA 63
L A R QD L+ G PTPC +WT+ DL++H+V + G A PPA
Sbjct 355 LDAFARVQDTVGTLVHATTPGHFGLPTPCEDWTVRDLLDHLVWEHLIWGGLAQGA--PPA 412
Query 64 -------RPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTH 116
D VAA AAA A + F PG + +F G PG+ + ++L H
Sbjct 413 VGHTEDHLGDDHVAAFGTAAAGARDAFRQPGLLERSF----GPAPGRRVVEQLLIELLVH 468
Query 117 AWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAF 176
WDLA A G+ DL+P +A L R + G R G FA +P P P D++AAF
Sbjct 469 GWDLATALGRDRDLEPHIARAALPVVRDIYGTLPRTAGGSFAQARPVPEHAPALDRVAAF 528
Query 177 LGRTV 181
LGR V
Sbjct 529 LGRDV 533
>gi|302530399|ref|ZP_07282741.1| predicted protein [Streptomyces sp. AA4]
gi|302439294|gb|EFL11110.1| predicted protein [Streptomyces sp. AA4]
Length=214
Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 65/179 (37%), Positives = 87/179 (49%), Gaps = 14/179 (7%)
Query 10 AQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGN---------EQVGRWAASPIE 60
A D+ +AL+A V + PTPC EWT+ DL+ H+V G+ E+ G + +P
Sbjct 38 ALDSTSALVAGV--SRWDAPTPCPEWTVRDLVNHLVLGHRLFTAVLRGEEGG--SLNPRS 93
Query 61 PPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDL 120
A D VAA++ A A F PG + ++P G VPG + LR + L H WDL
Sbjct 94 SDALGDDPVAAYREAVAGLLAAFRQPGVLEQVVEVPAGTVPGIAAVHLRIVEELVHGWDL 153
Query 121 AAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGR 179
A ATGQ D L +ER A A +PFA + PP D+L A LGR
Sbjct 154 ARATGQEAKFDDAL-IEREIAFSAAKLADLPADRRPFAPPVSVAADAPPLDRLVALLGR 211
>gi|337764322|emb|CCB73031.1| conserved protein of unknown function [Streptomyces cattleya
NRRL 8057]
Length=205
Score = 84.7 bits (208), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 70/193 (37%), Positives = 92/193 (48%), Gaps = 17/193 (8%)
Query 2 DPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHV---------VGGNEQVG 52
DP+ RA D AA+L VR DQLG PTPC W + L +HV V E+
Sbjct 16 DPVRLLARALDRMAAVLDGVRPDQLGLPTPCLTWDVGTLADHVVHDLAPFTAVARGERPD 75
Query 53 RWAASPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLP-LGEVPGQVFIGLRTT 111
A P P R + AA + A G ++ T +LP +G VP + + + T
Sbjct 76 WTAPVPATGPDR----APVFRTGAARLLAAWRAAGDLTGTVRLPVVGTVPARFPVDQQIT 131
Query 112 DVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRG---PGKPFADEKPCPRERP 168
+ HAWDL AT + LD E+A L AR + +FRG GK F E+P P
Sbjct 132 EFTVHAWDLRRATDGTAPLDDEVAEAALRWARTALRDEFRGREVEGKAFGPEQPAPPGAS 191
Query 169 PADQLAAFLGRTV 181
+D+LAAF GR V
Sbjct 192 ASDRLAAFTGRRV 204
>gi|111021396|ref|YP_704368.1| hypothetical protein RHA1_ro04424 [Rhodococcus jostii RHA1]
gi|110820926|gb|ABG96210.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=202
Score = 81.6 bits (200), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 63/188 (34%), Positives = 90/188 (48%), Gaps = 11/188 (5%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
DP ++ A AL+ VR DQL TPC+++ + L+ H+V E+ R +
Sbjct 14 TDPRPLYREALAWTTALVEKVRDDQLTAATPCADFDVRTLLGHLVATVER-ARVIGEGGD 72
Query 61 PPARP--------DGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTD 112
P P DG +++A ++A + AT P G VPG+ I +
Sbjct 73 PGTVPLVVTDIPDDGYADTYRSATDRMWPVWADDSRLDATVTAPWGTVPGRAAIWGYINE 132
Query 113 VLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFAD-EKPCPRERPPAD 171
L H WDLA ATGQ ++ PELA LA AR + + RG PFAD +P P P +
Sbjct 133 TLVHGWDLAVATGQPSETRPELAEAMLAVARHAIPAETRGGHVPFADVVEPHPTAG-PTE 191
Query 172 QLAAFLGR 179
+LA + GR
Sbjct 192 RLANWSGR 199
>gi|291299774|ref|YP_003511052.1| hypothetical protein Snas_2270 [Stackebrandtia nassauensis DSM
44728]
gi|290568994|gb|ADD41959.1| hypothetical protein Snas_2270 [Stackebrandtia nassauensis DSM
44728]
Length=198
Score = 80.5 bits (197), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 61/164 (38%), Positives = 77/164 (47%), Gaps = 9/164 (5%)
Query 25 QLGGPTPCSEWTINDLIEHVVGGNEQVG----RWAASPIEPPARPDGLVAAHQAAAAVAH 80
+ PTPC EW + L+ H NE+ R P E PD AA +A A
Sbjct 37 RYDNPTPCREWNVGQLLCHFAFINERYAIVAERETVPPFEQRTYPDS-SAAFVKWSARAR 95
Query 81 EIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAATGQSTDLDPEL---AVE 137
F PG ++ P+GE PG V I +++ H+WDLA A G+STDL P+L A
Sbjct 96 AAFRRPGFLTEVMPTPIGEQPGAVVIQHVLNELIAHSWDLARALGESTDLVPDLAEAATR 155
Query 138 RLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRTV 181
A A G R P KP P PAD+LAA+LGR V
Sbjct 156 SWKTAFAEFGEPARTPSI-IDTVKPAPANASPADRLAAWLGREV 198
>gi|324997761|ref|ZP_08118873.1| hypothetical protein PseP1_03295 [Pseudonocardia sp. P1]
Length=199
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/189 (36%), Positives = 91/189 (49%), Gaps = 13/189 (6%)
Query 2 DPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEP 61
DP AH A D + L A V D++ GPTPC E+ + L+ H+V + AA +P
Sbjct 11 DPRAAHLAALDWVSGLAAAVPEDRMAGPTPCDEFDVRTLLAHLVTTVRRPAAIAAG-TDP 69
Query 62 PARP-------DGLVAAHQAAAAVAHEIFAAPGG---MSATFKLPLGEVPGQVFIGLRTT 111
A P D A+ A AA H ++ P + T ++P GEVP +V + +
Sbjct 70 LAAPLVSEDVLDAPADAYVAEAAALHGAWSGPDAVELLDRTVRMPFGEVPVRVALWVYVN 129
Query 112 DVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFAD-EKPCPRERPPA 170
+ L H WDLA ATGQ + DP LA L AR + + RG PF P P P
Sbjct 130 ETLVHGWDLAVATGQPVEADPALATTALEVARRFLPAEPRGGPVPFGPVVTPAPGAG-PT 188
Query 171 DQLAAFLGR 179
+QLA + GR
Sbjct 189 EQLANWAGR 197
>gi|290956238|ref|YP_003487420.1| hypothetical protein SCAB_17241 [Streptomyces scabiei 87.22]
gi|260645764|emb|CBG68855.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length=195
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/153 (35%), Positives = 74/153 (49%), Gaps = 20/153 (13%)
Query 4 LMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPA 63
L H +AQD F A + VR DQ G TPC+EW++ DL+ H+V +EQ+ W +
Sbjct 11 LARHTQAQDLFGARVHAVRDDQWGADTPCAEWSVRDLVNHLV--SEQL--WVPCLVRDGC 66
Query 64 RPDGL-------------VAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRT 110
+ + A+ AA A E FAAPG + T L G+ P + G
Sbjct 67 MIEEVGDTFGGDLLGTDPAASWDTAAHSAREAFAAPGALDRTVHLSYGDTPAVAYCGQMV 126
Query 111 TDVLTHAWDLAAATGQSTDLDPEL---AVERLA 140
D++ HAWDL+ A G L EL AV+ +A
Sbjct 127 ADLVVHAWDLSRAIGADERLPGELVRFAVDEIA 159
>gi|296270489|ref|YP_003653121.1| hypothetical protein Tbis_2526 [Thermobispora bispora DSM 43833]
gi|296093276|gb|ADG89228.1| hypothetical protein Tbis_2526 [Thermobispora bispora DSM 43833]
Length=202
Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/142 (37%), Positives = 68/142 (48%), Gaps = 8/142 (5%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQV-----GRWA 55
+D A++RA F L VR DQ PTPC +W + +L+ H+V N GR
Sbjct 2 IDIRDAYRRALHDFGERLHLVRDDQWELPTPCVDWDVRELVNHLVNENLLAPELLAGRRI 61
Query 56 ---ASPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTD 112
A E D + A + +A A E A G ++ LP G+VPG+ +I D
Sbjct 62 TDIAGMYEEDVLGDDPIKAFEVSAQNAVEAVYAEGALTRVAHLPFGDVPGREYISELFAD 121
Query 113 VLTHAWDLAAATGQSTDLDPEL 134
L H WDLA A G S LDPEL
Sbjct 122 ALIHTWDLAHAIGASERLDPEL 143
>gi|312194755|ref|YP_004014816.1| hypothetical protein FraEuI1c_0868 [Frankia sp. EuI1c]
gi|311226091|gb|ADP78946.1| hypothetical protein FraEuI1c_0868 [Frankia sp. EuI1c]
Length=190
Score = 78.6 bits (192), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/182 (35%), Positives = 82/182 (46%), Gaps = 15/182 (8%)
Query 10 AQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEP--PARPDG 67
A + A ++ VR DQL TPC++W + L+ H+VG +G + P P P G
Sbjct 10 AVTSTAGIIKTVRPDQLDATTPCTQWDVRTLLNHLVG-TLWLGEALFTDSAPRHPMPPGG 68
Query 68 L----------VAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHA 117
L A+ A+A ++ PLG++PG GL T D+L H
Sbjct 69 LPGTDLVGDDPATAYATASAALLAAARVGDTLTRLHTTPLGDMPGPALAGLTTLDILVHG 128
Query 118 WDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFL 177
WDLA ATGQ T LD +LA LA A + FR G P P D+L FL
Sbjct 129 WDLATATGQPTVLDEDLASHVLAFAGQAITDDFR--GTAIGPALPVAATAPVTDRLVGFL 186
Query 178 GR 179
GR
Sbjct 187 GR 188
>gi|302556180|ref|ZP_07308522.1| conserved hypothetical protein [Streptomyces viridochromogenes
DSM 40736]
gi|302473798|gb|EFL36891.1| conserved hypothetical protein [Streptomyces viridochromogenes
DSM 40736]
Length=197
Score = 78.6 bits (192), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/172 (35%), Positives = 81/172 (48%), Gaps = 11/172 (6%)
Query 18 LANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPA------RPDGLVAA 71
++ V+ D LG TPC++WT+ L+ H+V N A EP A D AA
Sbjct 18 VSRVKTDHLGRATPCADWTLYGLLRHLVSQNRGFAASARGAGEPWAVWHGGDLGDDPAAA 77
Query 72 HQAAAAVAHEIFAAPGGMSATFKLP-LGE---VPGQVFIGLRTTDVLTHAWDLAAATGQS 127
++ +A FA G + F LP +GE VPG++ IG D + HAWD+A G
Sbjct 78 YETSADELTAAFAEDGVLERKFALPEIGEGFTVPGRIAIGFHMLDYVAHAWDVAVTIGAP 137
Query 128 TDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGR 179
+ + EL L A A V + RG G F P + PP +L A LGR
Sbjct 138 WEPNAELTTAALRVA-AQVPDEGRGAGAAFRRRTAVPDDAPPGHRLLALLGR 188
>gi|258651068|ref|YP_003200224.1| hypothetical protein Namu_0824 [Nakamurella multipartita DSM
44233]
gi|258554293|gb|ACV77235.1| hypothetical protein Namu_0824 [Nakamurella multipartita DSM
44233]
Length=198
Score = 78.2 bits (191), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 65/176 (37%), Positives = 78/176 (45%), Gaps = 11/176 (6%)
Query 17 LLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPARP----------- 65
L+ VR Q G PTPCSEW L+ HVV GN PP
Sbjct 22 LVDGVRPAQWGAPTPCSEWDARALLNHVVFGNRSFTSILHGDPAPPQEQIRTMRDRDYLG 81
Query 66 DGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAATG 125
D AA + +A F P + F+ PLG +PG LR T+ L H WDLA ATG
Sbjct 82 DDPAAAWRDSADGLLAAFTGPEVLGREFRSPLGPLPGAGLARLRITETLVHGWDLARATG 141
Query 126 QSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRTV 181
QS E+ L+ R + PFA E+P + PP DQLAA LGR V
Sbjct 142 QSAPFPQEIVEATLSFTRRQLSDGSVRSALPFAAEQPAAADAPPLDQLAALLGRAV 197
>gi|111224919|ref|YP_715713.1| hypothetical protein FRAAL5554 [Frankia alni ACN14a]
gi|111152451|emb|CAJ64187.1| Conserved hypothetical protein [Frankia alni ACN14a]
Length=192
Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 66/188 (36%), Positives = 89/188 (48%), Gaps = 10/188 (5%)
Query 2 DPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQV---GRWAASP 58
DP RA D L+A V DQ+ TPCSE+ + L+ H+ ++V GR P
Sbjct 6 DPRPLLDRALDQAGRLVAAVEPDQIALSTPCSEFDVATLVGHLFTVVDRVAVAGR-GGDP 64
Query 59 IEPPARP-----DGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDV 113
E P DG + AAA +++ + +LP +PG+V T ++
Sbjct 65 RELPLVTTGVPFDGWAERYAKAAAELRAVWSDDALLDRPLRLPWAVLPGRVAAAAYTQEL 124
Query 114 LTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQL 173
THAWDLA ATG++ LDPELAV L AR V + R PF P + +L
Sbjct 125 TTHAWDLAVATGRTGGLDPELAVISLEIARRAVPVEGR-EEMPFGPVVEVPADADAYRRL 183
Query 174 AAFLGRTV 181
A LGRTV
Sbjct 184 AGHLGRTV 191
>gi|302525213|ref|ZP_07277555.1| predicted protein [Streptomyces sp. AA4]
gi|302434108|gb|EFL05924.1| predicted protein [Streptomyces sp. AA4]
Length=213
Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/133 (37%), Positives = 63/133 (48%), Gaps = 8/133 (6%)
Query 10 AQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGN-EQVGRWAASPIEP------- 61
A F L++VR +Q PTPC+EW + L+ H+V GN V A E
Sbjct 23 ASSEFDRRLSSVRPEQWTAPTPCAEWNVRQLVNHMVRGNLNYVDLLAGGTREQFLHMRDA 82
Query 62 PARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLA 121
A D AA+ A+ + + F PG + PLG+V G + +R TD HAWDLA
Sbjct 83 DALGDDPFAAYPASVRLVADAFGRPGALEQVLDYPLGKVTGHQALAVRATDSAVHAWDLA 142
Query 122 AATGQSTDLDPEL 134
A G LDP L
Sbjct 143 QALGVDDRLDPAL 155
>gi|159037176|ref|YP_001536429.1| hypothetical protein Sare_1542 [Salinispora arenicola CNS-205]
gi|157916011|gb|ABV97438.1| conserved hypothetical protein [Salinispora arenicola CNS-205]
Length=184
Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/173 (33%), Positives = 83/173 (48%), Gaps = 16/173 (9%)
Query 15 AALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQV--GRWAASPIEPPAR--PDGLVA 70
+ ++A +R + L PTPC++WT+ D+++H+VGG + P +P +R D
Sbjct 18 STIMAGIRPEHLAEPTPCAKWTVQDIVDHLVGGTGYLLAAATGGQPGDPASRATADRFTT 77
Query 71 AHQAAAAVAHEIFAAPGGMSATFKLPLG-EVPGQVFIGLRTTDVLTHAWDLAAATGQSTD 129
H A + A PG M PLG E + + DVL H+WDLAAATGQ T
Sbjct 78 GHAAVL----DAVAQPGAMERRCMSPLGFEWSVREAVAATFMDVLVHSWDLAAATGQDTR 133
Query 130 LDPELAVERLAAARALVGPQFRGPGKP---FADEKPCPRERPPADQLAAFLGR 179
LDP+L + A + P+ G+ E P + P D+L +GR
Sbjct 134 LDPDL----VQACWEMFVPEMPARGRETGLVGPEVAVPADAPLQDRLLGAMGR 182
>gi|331697737|ref|YP_004333976.1| hypothetical protein Psed_3957 [Pseudonocardia dioxanivorans
CB1190]
gi|326952426|gb|AEA26123.1| Conserved hypothetical protein CHP03086 [Pseudonocardia dioxanivorans
CB1190]
Length=201
Score = 75.5 bits (184), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/188 (36%), Positives = 83/188 (45%), Gaps = 16/188 (8%)
Query 4 LMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQV-----GRWAASP 58
L + RA D F LA V A+ L GP+ CSEWTI D++ HVV G + + GR
Sbjct 11 LDEYARALDGFDDALARVPAEALDGPSACSEWTIRDVVGHVVWGQDLLAALAQGRPHHDR 70
Query 59 IEPPARPD-GLVAAHQA------AAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTT 111
P P G++ A A A A A P P GE+P F+ L T
Sbjct 71 TGAPGAPAPGVLVAGDAVTGWRRARARADTTLDEPTLGRVVTVPPFGEIPLAGFVTLLVT 130
Query 112 DVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPAD 171
D+L H+WD+A G LDP L L +R G RGPG E P P +
Sbjct 131 DLLAHSWDVAHGAGVGIRLDPTLLDGALGWSR---GHIRRGPGA-IGPEVPVPADADLQA 186
Query 172 QLAAFLGR 179
+ FLGR
Sbjct 187 RFLGFLGR 194
>gi|328880684|emb|CCA53923.1| hypothetical protein SVEN_0636 [Streptomyces venezuelae ATCC
10712]
Length=203
Score = 75.1 bits (183), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/171 (35%), Positives = 79/171 (47%), Gaps = 12/171 (7%)
Query 18 LANVRADQLGGPTPCSEWTINDLIEHVVGGNEQV---GRWAASPIEPPARPD----GLVA 70
+A VR +Q GPTPC+E+T+ L H+V ++ GR P D
Sbjct 34 VAAVRPEQFDGPTPCTEFTVRRLTGHLVAVLRRIALAGRGGDVTTLPTVDDDLADTAWRE 93
Query 71 AHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAATGQSTDL 130
A AA E +A P + T LP G +PG + T++ H WD+A ATGQ D
Sbjct 94 AWDAAVREVEEAWADPSILGRTLILPFGNLPGAAAAAVWTSEFTVHTWDMATATGQLPDW 153
Query 131 DPE-LAVERLAAARAL-VGPQFRGPGKPFADEKPCPRERPPADQLAAFLGR 179
DPE +AV A R L GP+ G PF + P D+L A+ GR
Sbjct 154 DPEVVAVSYAAMRRGLPAGPR---DGAPFGAAVEVDPDAPAIDRLVAWCGR 201
>gi|226363750|ref|YP_002781532.1| hypothetical protein ROP_43400 [Rhodococcus opacus B4]
gi|226242239|dbj|BAH52587.1| hypothetical protein [Rhodococcus opacus B4]
Length=193
Score = 74.3 bits (181), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 63/188 (34%), Positives = 91/188 (49%), Gaps = 11/188 (5%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
DP ++ A L+ NVR DQL TPC+++ + ++ H+V E+ R +
Sbjct 5 TDPRPLYREALGWTTRLIDNVRQDQLTASTPCADFDVRTMLGHLVATVER-ARVIGEGGD 63
Query 61 PPARP--------DGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTD 112
P P D AA+++AA ++ G + AT P G VPG+ I +
Sbjct 64 PRTVPLVVTGIPDDSYAAAYRSAADRMWPVWTDDGRLDATVTAPWGTVPGRAAIWGYINE 123
Query 113 VLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFAD-EKPCPRERPPAD 171
L H WDLA ATGQ ++ PELA LA A+ + + RG PFAD P P P +
Sbjct 124 TLVHGWDLAVATGQPSETRPELAEAMLAVAQRAIPAEPRGGHVPFADVVDPLPTAG-PTE 182
Query 172 QLAAFLGR 179
+LA + GR
Sbjct 183 RLANWSGR 190
>gi|291297734|ref|YP_003509012.1| hypothetical protein Snas_0200 [Stackebrandtia nassauensis DSM
44728]
gi|290566954|gb|ADD39919.1| hypothetical protein Snas_0200 [Stackebrandtia nassauensis DSM
44728]
Length=196
Score = 73.9 bits (180), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 64/203 (32%), Positives = 94/203 (47%), Gaps = 39/203 (19%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAA---- 56
++ + A++RAQD F ++A V +Q P+ C+EWTI D+ HV+ G Q+ WA
Sbjct 1 METMTAYRRAQDGFDQVMAAVGDEQWDRPSTCAEWTIRDVAGHVIWGQRQLRAWAVGEEY 60
Query 57 -SPIEPP--ARPDGLVA----------AHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQ 103
SP P ++P L A A A+ E A T P+G++P
Sbjct 61 ESPTGFPGSSKPGELAADDPLATWRTARAAADEALTDETLA----RVVTIGGPVGDIPVI 116
Query 104 VFIGLRTTDVLTHAWDLAAATGQSTDLDPEL-------AVERLAAARALVGPQFRGPGKP 156
L TTD+L H+WD+ A GQ LD EL + + ++ + AL GP+ P
Sbjct 117 GVAELLTTDLLGHSWDIGHAAGQDVRLDAELLPGSMEWSRKYVSRSAALFGPEV----TP 172
Query 157 FADEKPCPRERPPADQLAAFLGR 179
AD D+L A+LGR
Sbjct 173 EADAD-------DQDRLLAYLGR 188
>gi|86741006|ref|YP_481406.1| hypothetical protein Francci3_2309 [Frankia sp. CcI3]
gi|86567868|gb|ABD11677.1| conserved hypothetical protein [Frankia sp. CcI3]
Length=193
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/186 (32%), Positives = 80/186 (44%), Gaps = 13/186 (6%)
Query 4 LMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQV-----GRWAASP 58
L ++RA D F ++ V AD+ P+ C WT L HV+ G +Q+ G P
Sbjct 5 LQCYRRALDTFTTIVTRVPADRWDAPSLCPVWTGRQLTGHVIDGQQQIVSLLTGHGPRPP 64
Query 59 IEPPARPDGLV-----AAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDV 113
+ PA L A+ Q AA + PLG + + +
Sbjct 65 VTDPALLTALAGPDPGASWQRTHQNTERTLAALDPATV-VDTPLGARSVDEVLTVAVIEP 123
Query 114 LTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQL 173
L HAWDLA GQ+ LDP+ L A AL G Q G +A +P P + PP D+L
Sbjct 124 LVHAWDLATTIGQTVQLDPDTVTATLPAVEAL-GGQLAATGM-YAAAQPAPADSPPQDRL 181
Query 174 AAFLGR 179
A LGR
Sbjct 182 LAALGR 187
>gi|134100366|ref|YP_001106027.1| hypothetical protein SACE_3831 [Saccharopolyspora erythraea NRRL
2338]
gi|291007663|ref|ZP_06565636.1| hypothetical protein SeryN2_24319 [Saccharopolyspora erythraea
NRRL 2338]
gi|133912989|emb|CAM03102.1| hypothetical protein SACE_3831 [Saccharopolyspora erythraea NRRL
2338]
Length=195
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/193 (32%), Positives = 87/193 (46%), Gaps = 22/193 (11%)
Query 4 LMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWA------AS 57
L AH+RA F + + DQ TPC++WT+ DL++H+V +EQ+ WA A+
Sbjct 4 LHAHRRAMTEFDTRVRAIGDDQWDNGTPCAQWTVRDLVQHLV--SEQL--WAPRLLDGAT 59
Query 58 PIEPPARPDGLV------AAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTT 111
E R DG V A A+A A + + PG + + G +P + + T
Sbjct 60 LEEVGDRFDGDVLGADPKGAWTEASAQARQAWDRPGAATGEVHVTGGVIPAEDYGWQMTL 119
Query 112 DVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKP--FADEKPCPRERPP 169
D+ HAWDLA T LDP+L +A R + PQ F P P +
Sbjct 120 DLTVHAWDLACGIRSDTSLDPDL----VAVVRTVFEPQVASWQDMGIFDPPLPVPDDADE 175
Query 170 ADQLAAFLGRTVR 182
+L A LGR R
Sbjct 176 QTRLLAMLGRDAR 188
>gi|297560803|ref|YP_003679777.1| hypothetical protein Ndas_1843 [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
gi|296845251|gb|ADH67271.1| conserved hypothetical protein [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
Length=186
Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/186 (31%), Positives = 81/186 (44%), Gaps = 20/186 (10%)
Query 7 HQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWA------ASPIE 60
H A F + V+ PTPC++W ++DL+ H+ EQ+ W A E
Sbjct 8 HGTAMGEFDRRVREVKLTDWALPTPCADWDVHDLVNHLT--TEQL--WVPLLLGGARVEE 63
Query 61 PPARPDGL------VAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVL 114
R DG + + A+ A + AP + +T L G+ P ++++ T D+
Sbjct 64 VGDRLDGDNLGEEPITTWEVASREARTAWLAPSSLESTVHLSFGDAPAELYLWQMTFDLT 123
Query 115 THAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLA 174
HAWDLA A G LDP+L E A + Q GPG F + P D+L
Sbjct 124 VHAWDLARALGTDERLDPDLVKE----VHAWLSDQDLGPGPMFGAPVEVGPDASPQDRLI 179
Query 175 AFLGRT 180
A GRT
Sbjct 180 ARTGRT 185
>gi|297156190|gb|ADI05902.1| hypothetical protein SBI_02781 [Streptomyces bingchenggensis
BCW-1]
Length=196
Score = 72.4 bits (176), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 53/134 (40%), Positives = 64/134 (48%), Gaps = 10/134 (7%)
Query 2 DPLMA-HQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQV------GRW 54
+PL+A H A D F + +R DQ PTPCSEWT+ DL+ H+ V GR
Sbjct 9 NPLLARHGEALDLFTERVHAIRPDQWDEPTPCSEWTVRDLVNHLAVEQMWVPPLVREGRT 68
Query 55 AAS---PIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTT 111
A +E D VAA AA A E F APG + T +L GE P + T
Sbjct 69 IAEQGDSLEGDLLGDDPVAAWDEAATAAREAFTAPGALERTVELSFGETPAAEYCAEITI 128
Query 112 DVLTHAWDLAAATG 125
D HAWDLA A G
Sbjct 129 DAAVHAWDLARAIG 142
>gi|297156958|gb|ADI06670.1| hypothetical protein SBI_03549 [Streptomyces bingchenggensis
BCW-1]
Length=202
Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 61/188 (33%), Positives = 86/188 (46%), Gaps = 21/188 (11%)
Query 8 QRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVV-----------GGNEQVGRWAA 56
+R+ ++A V+ DQL PTPC +WT++ LI H+V GG E + W
Sbjct 13 RRSLALLGDVVAQVKDDQLRLPTPCPDWTLHGLIRHLVSQNEGFAASARGGGEALSDWRG 72
Query 57 SPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPL----GEVPGQVFIGLRTTD 112
+ A AA +A+AA+ ++ FA G + F LP G P + I D
Sbjct 73 GDLGADA-----RAAFEASAALVNDAFAQDGVLDRAFALPEVRNGGAFPASLAISFHFVD 127
Query 113 VLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQ 172
+ HAWD+AA G + D EL L A V + RGPG F P P + P +
Sbjct 128 CVVHAWDVAATIGVPWEPDDELTAAALRVAEQ-VPDKGRGPGAAFEQRVPPPTDATPHHR 186
Query 173 LAAFLGRT 180
L + LGR
Sbjct 187 LLSLLGRV 194
>gi|271968539|ref|YP_003342735.1| hypothetical protein Sros_7305 [Streptosporangium roseum DSM
43021]
gi|270511714|gb|ACZ89992.1| hypothetical protein Sros_7305 [Streptosporangium roseum DSM
43021]
Length=189
Score = 71.6 bits (174), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 57/191 (30%), Positives = 81/191 (43%), Gaps = 18/191 (9%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWA----- 55
+D A++R D F AL+ +R +Q TPC +W + L+ HVVG N RWA
Sbjct 3 IDIREAYRRTLDDFGALVHRIRPEQWENKTPCVDWDVRALVNHVVGEN----RWAPELLA 58
Query 56 -------ASPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGL 108
++ D + A +A A + ++ L G+V G+ +I
Sbjct 59 GRNVADLGDALDGDLLGDDPLKAFDTSAVAAAQAAGDERSLTCVVHLSFGDVRGEEYITE 118
Query 109 RTTDVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERP 168
D L H WDLA A G LDPEL VE AA A +R G +++P
Sbjct 119 LFADALIHTWDLARAIGADERLDPEL-VEACAAWFARAEEGYRQAG-VIGEQQPVASGTD 176
Query 169 PADQLAAFLGR 179
+L A GR
Sbjct 177 SQTRLLASWGR 187
>gi|254822475|ref|ZP_05227476.1| hypothetical protein MintA_21251 [Mycobacterium intracellulare
ATCC 13950]
Length=194
Score = 71.2 bits (173), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 53/176 (31%), Positives = 81/176 (47%), Gaps = 5/176 (2%)
Query 8 QRAQDAFAAL---LANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPAR 64
A+D L L + AD L PTPC+++ + L H++ + +G + + PA
Sbjct 18 HSAEDTLGVLQRVLHTIAADDLSRPTPCADFDVAQLTGHLLNSIKALGGMVDADVPEPAE 77
Query 65 PDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAAT 124
D + AAA A + + G+ T GE+P + + + + L HAWD AAAT
Sbjct 78 GDSVERQVVAAARPALDAWHR-HGLGGTVPFGKGEMPAKSACAVLSIEFLVHAWDYAAAT 136
Query 125 GQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT 180
+ D L+ L AR ++ P+ RG G F D P + +QL AF GR
Sbjct 137 KREVDAPEPLSEYVLGLARHIIRPELRG-GAGFDDPVDVPEDAGALEQLVAFTGRN 191
>gi|158318360|ref|YP_001510868.1| hypothetical protein Franean1_6625 [Frankia sp. EAN1pec]
gi|158113765|gb|ABW15962.1| conserved hypothetical protein [Frankia sp. EAN1pec]
Length=189
Score = 71.2 bits (173), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 58/179 (33%), Positives = 77/179 (44%), Gaps = 6/179 (3%)
Query 8 QRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQV-GRWAASPIEPPARPD 66
RA D AA++ + DQL PTPC +W + + H+VGG + D
Sbjct 9 DRALDMTAAIVKGITDDQLAAPTPCPKWDVRTELNHLVGGMRIFAAELTTTDAGADHDAD 68
Query 67 GLVAAHQAAAAVAHEIFAAP----GGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAA 122
L QAA A A ++ A + T +L G VPG + + T+VL H DLA
Sbjct 69 WLGTGPQAAFATAADLDRAAWHRRNALDTTVRLGFGAVPGPMAALIHLTEVLVHGADLAI 128
Query 123 ATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRTV 181
ATGQ +D E L + FR PG F + P QL AFLGR +
Sbjct 129 ATGQEHLVDECACGELLTTTHGMDFDVFRRPGM-FGPAVSVSADAPAHRQLLAFLGRAL 186
>gi|297202958|ref|ZP_06920355.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|197711951|gb|EDY55985.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length=202
Score = 70.9 bits (172), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 57/186 (31%), Positives = 80/186 (44%), Gaps = 7/186 (3%)
Query 2 DPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQV------GRWA 55
DP +A D +L VR DQ TPC ++++ L H+V +V G++
Sbjct 16 DPRNGLLKAVDLAGDVLGAVRPDQYDSITPCPDYSVRQLSNHLVSVLRRVAVIGAGGQFF 75
Query 56 ASPIEPPARPDGLVAAHQA-AAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVL 114
+ P DG A A ++ P + LP G VPG V + T + +
Sbjct 76 SVPHFAEDVADGAWAEAWADGTKELKSVWTDPAVLGREIGLPWGPVPGAVAAVIYTNEFV 135
Query 115 THAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLA 174
H WDLA ATGQS + D + LAA V + RG PF P + P D+L
Sbjct 136 LHIWDLAKATGQSPEWDETVLAGPLAAMHRAVPREPRGGQVPFGPVVDVPEDAPAIDRLV 195
Query 175 AFLGRT 180
+ GRT
Sbjct 196 GWYGRT 201
>gi|302548167|ref|ZP_07300509.1| basic proline-rich protein [Streptomyces hygroscopicus ATCC 53653]
gi|302465785|gb|EFL28878.1| basic proline-rich protein [Streptomyces himastatinicus ATCC
53653]
Length=237
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/129 (35%), Positives = 59/129 (46%), Gaps = 8/129 (6%)
Query 14 FAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQ----VGRWAASPIEPPARPDGL- 68
FA L VR+DQ PTPC+EW + L+ H+ GN + +A+ D L
Sbjct 57 FARRLRTVRSDQWTAPTPCAEWDVRHLVNHMTRGNLNYIALLDGGSAADFLRLRDEDALG 116
Query 69 ---VAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAATG 125
V A+ + E F PG + PLG V G + +RTTD L H WDLA A
Sbjct 117 GDPVGAYTRSVRDCAEAFRRPGALQQILDYPLGPVTGDQALAVRTTDSLIHTWDLARALD 176
Query 126 QSTDLDPEL 134
L+P L
Sbjct 177 APEGLEPGL 185
>gi|342858908|ref|ZP_08715562.1| hypothetical protein MCOL_08528 [Mycobacterium colombiense CECT
3035]
gi|342133149|gb|EGT86352.1| hypothetical protein MCOL_08528 [Mycobacterium colombiense CECT
3035]
Length=240
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/180 (32%), Positives = 87/180 (49%), Gaps = 13/180 (7%)
Query 9 RAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGN-EQVGRWAASPIEPPA---R 64
RA A LLA++ + PTPC+ W++ D+ +H+V N + R + + PA
Sbjct 58 RAAQAVDDLLAHLAEEDWMAPTPCTGWSVADVAQHLVEVNLDFADRMLPAGFQTPAGTTT 117
Query 65 PDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVF--IGLRTTDVLTHAWDLAA 122
P + +++ + +E A G SA +G P Q+ + LR D+LTH+WD+A+
Sbjct 118 PGDFLGSYRHSVEALNEALATQIGDSA-----VGIPPPQLSSRLALRVADLLTHSWDIAS 172
Query 123 ATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRTVR 182
ATG L P+L E L A++ R FA +P P D+LAA GR V
Sbjct 173 ATGTPLHLPPDLCAEALTFAQSRSAALQR--SGQFAPPQPIHEHAPAIDRLAALSGRQVH 230
>gi|343928249|ref|ZP_08767703.1| hypothetical protein GOALK_111_00180 [Gordonia alkanivorans NBRC
16433]
gi|343761843|dbj|GAA14629.1| hypothetical protein GOALK_111_00180 [Gordonia alkanivorans NBRC
16433]
Length=201
Score = 67.8 bits (164), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 58/185 (32%), Positives = 83/185 (45%), Gaps = 10/185 (5%)
Query 2 DPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEP 61
DP A A LL+ V A+QL PTPC E+ + L H++ + R AA P
Sbjct 9 DPRPAFAAATTWVTGLLSEVTAEQLAAPTPCDEFDVRTLGAHLLATAQ---RAAALPEGV 65
Query 62 PARPDGLVA----AHQAAAAVAHEI--FAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLT 115
R +A A + A VA + ++ ++ ++P GEVPG + + +
Sbjct 66 DVRAMPFIADRFDAQEYATVVARAVGLWSDDAVLARMVQVPWGEVPGAGALWGYVNETIV 125
Query 116 HAWDLAAATGQSTDLDPELAVERLAAARALVGPQFR-GPGKPFADEKPCPRERPPADQLA 174
H WDLA ATGQ ++ PE A LA R + P+ R P PF P + LA
Sbjct 126 HGWDLAVATGQPSEAVPEAATATLAIVRRFIRPEIRQDPNVPFGVVVEPRDGAGPVETLA 185
Query 175 AFLGR 179
+ GR
Sbjct 186 NWSGR 190
>gi|108743438|dbj|BAE95541.1| conserved hypothetical protein [Streptomyces kanamyceticus]
Length=213
Score = 67.8 bits (164), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 45/133 (34%), Positives = 62/133 (47%), Gaps = 17/133 (12%)
Query 7 HQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPARPD 66
H A D F + VRAD PTPC++WT+ DL+ H+ G EQ+ W S + A
Sbjct 32 HAAALDLFTDRVHAVRADLWDAPTPCTDWTVRDLVAHLTG--EQL--WVPSLVRDGATTA 87
Query 67 GL-------------VAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDV 113
+ VA+ AAA + F PG + T L G+ + G TTD+
Sbjct 88 SVGDAFDGDVLGPDPVASWDTAAAASRAAFREPGALDRTVHLSFGDTSAAFYCGQMTTDL 147
Query 114 LTHAWDLAAATGQ 126
+ HAWDL+ A G
Sbjct 148 VVHAWDLSRAIGS 160
>gi|294630269|ref|ZP_06708829.1| conserved hypothetical protein [Streptomyces sp. e14]
gi|292833602|gb|EFF91951.1| conserved hypothetical protein [Streptomyces sp. e14]
Length=192
Score = 67.4 bits (163), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 52/183 (29%), Positives = 85/183 (47%), Gaps = 15/183 (8%)
Query 10 AQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQV------GRW-----AASP 58
A D L+ + +L TPC+E+ + L+ H VG ++ GR AA
Sbjct 14 ALDQLERLVGRLDTARLDRETPCAEYDLRALLGHTVGAVHRIAYVGEGGRGLDVAAAAGR 73
Query 59 IEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAW 118
I + AH+ AA +A + ++P G VPG++ + +V+TH W
Sbjct 74 IADTDWGGAVCRAHRRLAAA----WADEAKLDREVEVPWGLVPGRIALSGYVMEVVTHTW 129
Query 119 DLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLG 178
D+A + +LD L+ L A+ ++ P+ RG PF + +P P + +LA +LG
Sbjct 130 DIAQVIDPAAELDERLSQAALDIAQKVLPPEPRGGEVPFGEVRPVPDDADVHTRLAGWLG 189
Query 179 RTV 181
RTV
Sbjct 190 RTV 192
>gi|117927359|ref|YP_871910.1| hypothetical protein Acel_0149 [Acidothermus cellulolyticus 11B]
gi|117647822|gb|ABK51924.1| conserved hypothetical protein [Acidothermus cellulolyticus 11B]
Length=194
Score = 67.4 bits (163), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/132 (32%), Positives = 62/132 (47%), Gaps = 8/132 (6%)
Query 10 AQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRW--------AASPIEP 61
A D ++A +R DQ TPC+EW ++ + H+V G+ R + P P
Sbjct 11 ALDTTERIIAAIRPDQWHNATPCAEWDVHAVASHLVLGHRLFVRALHGEEFAAGSRPSGP 70
Query 62 PARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLA 121
P + + A++++A F G + +P G VPGQ + LR + + H WDLA
Sbjct 71 PQITEDVRTAYRSSADELLAAFREAGALERLIVVPAGRVPGQAALYLRLVEAVVHGWDLA 130
Query 122 AATGQSTDLDPE 133
ATGQ D E
Sbjct 131 RATGQPIDFPEE 142
>gi|296166899|ref|ZP_06849316.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897776|gb|EFG77365.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=197
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/177 (30%), Positives = 78/177 (45%), Gaps = 7/177 (3%)
Query 8 QRAQDAFAAL---LANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPAR 64
A+D L L + AD L PTPC+E+ + L +H++ +G + I P R
Sbjct 21 HSAEDTLGVLQRVLHPIAADDLSRPTPCAEFDVAQLTDHLLKSITALGGMVGAQI--PER 78
Query 65 PDGLVAAHQAAAAVAHEIFA-APGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAA 123
G Q A + A G+ + GE+P + + + + L HAWD A A
Sbjct 79 DAGDSVEAQVVTAARPALDAWHRHGLDGSVPFGKGEMPAKGACAVLSIEFLVHAWDYATA 138
Query 124 TGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT 180
G + L+ L AR ++ P+FRG G FAD P + +QL AF GR
Sbjct 139 VGHEINAPVPLSEYVLGLARQVIRPEFRG-GAGFADPVDVPEDAGALEQLVAFSGRN 194
>gi|229818834|ref|YP_002880360.1| hypothetical protein Bcav_0334 [Beutenbergia cavernae DSM 12333]
gi|229564747|gb|ACQ78598.1| conserved hypothetical protein [Beutenbergia cavernae DSM 12333]
Length=230
Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/148 (32%), Positives = 68/148 (46%), Gaps = 20/148 (13%)
Query 7 HQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVV-----------GGNEQVGRWA 55
H+ A +++ VRAD L PTPC +WT+ DL+ H+ G G W
Sbjct 41 HRTAVTISVDIVSRVRADDLDRPTPCGDWTLRDLLAHMTVQHLGFAAAARGHGGDPGLWD 100
Query 56 ASPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKL----PLGEVPGQVFIGLRTT 111
A+P EP V A+ AAA + FAA ++A +L P+ PG IG
Sbjct 101 ANPDEPDP-----VGAYATAAADVLDAFAADDVLTAELELPEFAPVTRYPGAQAIGFHFI 155
Query 112 DVLTHAWDLAAATGQSTDLDPELAVERL 139
D + H WD+AA G ++ ++A L
Sbjct 156 DYVAHGWDVAATLGVPYEIPDDVAAAVL 183
>gi|169631520|ref|YP_001705169.1| hypothetical protein MAB_4446 [Mycobacterium abscessus ATCC 19977]
gi|169243487|emb|CAM64515.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=188
Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/180 (29%), Positives = 85/180 (48%), Gaps = 3/180 (1%)
Query 1 VDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIE 60
++PL A+ A +++ + TP +++T+ L +H+ + +G A+ ++
Sbjct 5 LNPLETVANARAALHEVVSRLTEADNDKQTPNAKFTVAQLTDHLQNSIKLLG--GAAGVD 62
Query 61 PPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDL 120
+G VA + A G+ T LP+GE P +V + + ++ L HAWD
Sbjct 63 IALTTEGSVADRLLPQSQAVVDAWQRRGIDGTVTLPIGEYPAEVAVRILGSEFLVHAWDY 122
Query 121 AAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGRT 180
A ATGQ + L L + R ++ P+ R G FADE P P + P +L AF GR
Sbjct 123 AVATGQEFEPMDALTDGVLESVRMIIQPE-RRDGDFFADEVPVPDDSPNLVKLIAFTGRN 181
>gi|271968450|ref|YP_003342646.1| hypothetical protein Sros_7213 [Streptosporangium roseum DSM
43021]
gi|270511625|gb|ACZ89903.1| hypothetical protein Sros_7213 [Streptosporangium roseum DSM
43021]
Length=180
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/169 (34%), Positives = 77/169 (46%), Gaps = 14/169 (8%)
Query 15 AALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPARPDGLVAAHQA 74
AA++ +R DQLG PTPC+++ + L+ H+ E A PP D +A
Sbjct 16 AAVVREIREDQLGLPTPCADFDVRGLLGHLSRAAEMFDALARKEEVPPEDGDHTAFESRA 75
Query 75 AAAVAH----EIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAATGQSTDL 130
AA VA E F GMS T +P+ V L DV+ H WDLA ATGQ +
Sbjct 76 AAMVAAWSRPEAFE---GMSPTLGMPMTTV-----FQLGLGDVVIHGWDLARATGQDYGV 127
Query 131 DPELAVERLAAARALVGPQFRGPGKPFADEKPCPRERPPADQLAAFLGR 179
D E E +AA + PQ R G F + P + P ++ GR
Sbjct 128 DAETG-EAVAAFMDRMAPQGRRMGA-FREAHAVPEDASPFERALGLSGR 174
Lambda K H
0.319 0.135 0.417
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 164464225230
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
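
Note on the statistics above: the bit scores and Expect values reported for each hit can be approximately reproduced from the gapped Lambda and K and the effective search space listed in this trailer. The sketch below assumes the standard Karlin-Altschul relations, bits = (Lambda*S - ln K) / ln 2 and E = search_space * 2^(-bits). The function and constant names are illustrative and are not part of BLAST. Because this search used compositional matrix adjustment, per-hit values may not always match this simple calculation exactly.

```python
import math

# Values copied from the report trailer (gapped parameters).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 164464225230


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits: search_space * 2**(-bits)."""
    return search_space * 2.0 ** (-bits)


# Raw score 184 is the value in parentheses on the "Score = 75.5 bits (184)" line.
bits = bit_score(184)
print(f"{bits:.1f} bits, E = {expect_value(bits):.0e}")
```

Passing the raw score in parentheses from any other "Score =" line gives a quick consistency check of that hit against the trailer parameters.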