BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3258c
Length=163
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610394|ref|NP_217775.1| hypothetical protein Rv3258c [Mycob... 318 1e-85
gi|289763447|ref|ZP_06522825.1| conserved hypothetical protein [... 316 9e-85
gi|308378248|ref|ZP_07482015.2| hypothetical protein TMIG_02775 ... 313 6e-84
gi|289751952|ref|ZP_06511330.1| conserved hypothetical protein [... 282 2e-74
gi|339296085|gb|AEJ48196.1| hypothetical protein CCDC5079_3006 [... 251 2e-65
gi|41409468|ref|NP_962304.1| hypothetical protein MAP3370c [Myco... 238 3e-61
gi|118618073|ref|YP_906405.1| hypothetical protein MUL_2603 [Myc... 235 2e-60
gi|15827328|ref|NP_301591.1| hypothetical protein ML0762 [Mycoba... 232 2e-59
gi|118471977|ref|YP_886203.1| hypothetical protein MSMEG_1833 [M... 199 9e-50
gi|145225305|ref|YP_001135983.1| hypothetical protein Mflv_4727 ... 183 8e-45
gi|119867394|ref|YP_937346.1| hypothetical protein Mkms_1344 [My... 183 9e-45
gi|108798298|ref|YP_638495.1| hypothetical protein Mmcs_1327 [My... 178 2e-43
gi|240172598|ref|ZP_04751257.1| hypothetical protein MkanA1_2501... 177 3e-43
gi|118465957|ref|YP_883362.1| hypothetical protein MAV_4221 [Myc... 175 2e-42
gi|254776657|ref|ZP_05218173.1| hypothetical protein MaviaA2_185... 174 4e-42
gi|296168947|ref|ZP_06850616.1| conserved hypothetical protein [... 174 4e-42
gi|254821370|ref|ZP_05226371.1| hypothetical protein MintA_15642... 173 9e-42
gi|126433965|ref|YP_001069656.1| hypothetical protein Mjls_1363 ... 171 2e-41
gi|120402736|ref|YP_952565.1| hypothetical protein Mvan_1736 [My... 169 9e-41
gi|315445603|ref|YP_004078482.1| hypothetical protein Mspyr1_406... 166 9e-40
gi|342861461|ref|ZP_08718108.1| hypothetical protein MCOL_21351 ... 162 2e-38
gi|333991647|ref|YP_004524261.1| hypothetical protein JDM601_300... 148 3e-34
gi|226365786|ref|YP_002783569.1| hypothetical protein ROP_63770 ... 145 2e-33
gi|54026597|ref|YP_120839.1| hypothetical protein nfa46240 [Noca... 144 6e-33
gi|312140687|ref|YP_004008023.1| hypothetical protein REQ_33480 ... 143 7e-33
gi|226305643|ref|YP_002765603.1| hypothetical protein RER_21560 ... 142 1e-32
gi|229489531|ref|ZP_04383394.1| conserved hypothetical protein [... 141 3e-32
gi|169630684|ref|YP_001704333.1| hypothetical protein MAB_3604c ... 141 4e-32
gi|111023278|ref|YP_706250.1| hypothetical protein RHA1_ro06315 ... 140 6e-32
gi|317506112|ref|ZP_07963937.1| hypothetical protein HMPREF9336_... 127 5e-28
gi|296393058|ref|YP_003657942.1| hypothetical protein Srot_0629 ... 126 9e-28
gi|319948498|ref|ZP_08022631.1| hypothetical protein ES5_03971 [... 126 9e-28
gi|296138913|ref|YP_003646156.1| hypothetical protein Tpau_1186 ... 125 3e-27
gi|213964882|ref|ZP_03393081.1| conserved hypothetical protein [... 120 5e-26
gi|331698953|ref|YP_004335192.1| hypothetical protein Psed_5202 ... 119 2e-25
gi|256380316|ref|YP_003103976.1| hypothetical protein Amir_6328 ... 118 3e-25
gi|325675683|ref|ZP_08155367.1| hypothetical protein HMPREF0724_... 118 3e-25
gi|284992678|ref|YP_003411232.1| hypothetical protein Gobs_4299 ... 117 5e-25
gi|257057073|ref|YP_003134905.1| hypothetical protein Svir_31030... 117 5e-25
gi|325000743|ref|ZP_08121855.1| hypothetical protein PseP1_18337... 117 5e-25
gi|116669757|ref|YP_830690.1| hypothetical protein Arth_1196 [Ar... 115 1e-24
gi|117927672|ref|YP_872223.1| hypothetical protein Acel_0463 [Ac... 115 2e-24
gi|302534802|ref|ZP_07287144.1| conserved hypothetical protein [... 115 2e-24
gi|145593507|ref|YP_001157804.1| hypothetical protein Strop_0949... 115 2e-24
gi|302530044|ref|ZP_07282386.1| conserved hypothetical protein [... 114 5e-24
gi|25027317|ref|NP_737371.1| hypothetical protein CE0761 [Coryne... 114 5e-24
gi|325962631|ref|YP_004240537.1| hypothetical protein Asphe3_122... 114 6e-24
gi|159036546|ref|YP_001535799.1| hypothetical protein Sare_0891 ... 114 6e-24
gi|336119564|ref|YP_004574341.1| hypothetical protein MLP_39240 ... 114 7e-24
gi|296130181|ref|YP_003637431.1| hypothetical protein Cfla_2342 ... 114 7e-24
>gi|15610394|ref|NP_217775.1| hypothetical protein Rv3258c [Mycobacterium tuberculosis H37Rv]
gi|15842847|ref|NP_337884.1| hypothetical protein MT3356 [Mycobacterium tuberculosis CDC1551]
gi|31794438|ref|NP_856931.1| hypothetical protein Mb3286c [Mycobacterium bovis AF2122/97]
68 more sequence titles
Length=163
Score = 318 bits (816), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 163/163 (100%), Positives = 163/163 (100%), Gaps = 0/163 (0%)
Query 1 MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW
Sbjct 1 MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
Query 61 DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP 120
DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP
Sbjct 61 DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP 120
Query 121 LHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD 163
LHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD
Sbjct 121 LHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD 163
>gi|289763447|ref|ZP_06522825.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
gi|289710953|gb|EFD74969.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
Length=163
Score = 316 bits (809), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 162/163 (99%), Positives = 162/163 (99%), Gaps = 0/163 (0%)
Query 1 MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW
Sbjct 1 MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
Query 61 DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP 120
DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP
Sbjct 61 DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP 120
Query 121 LHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD 163
LHGFDDFPAAATGAPTGGG LAPPEPGAGRRRGHLRVLPDPAD
Sbjct 121 LHGFDDFPAAATGAPTGGGGLAPPEPGAGRRRGHLRVLPDPAD 163
>gi|308378248|ref|ZP_07482015.2| hypothetical protein TMIG_02775 [Mycobacterium tuberculosis SUMu009]
gi|308379466|ref|ZP_07486367.2| hypothetical protein TMJG_03442 [Mycobacterium tuberculosis SUMu010]
gi|308406101|ref|ZP_07495132.2| hypothetical protein TMLG_02030 [Mycobacterium tuberculosis SUMu012]
gi|308353206|gb|EFP42057.1| hypothetical protein TMIG_02775 [Mycobacterium tuberculosis SUMu009]
gi|308356946|gb|EFP45797.1| hypothetical protein TMJG_03442 [Mycobacterium tuberculosis SUMu010]
gi|308364486|gb|EFP53337.1| hypothetical protein TMLG_02030 [Mycobacterium tuberculosis SUMu012]
gi|339299696|gb|AEJ51806.1| hypothetical protein CCDC5180_2969 [Mycobacterium tuberculosis
CCDC5180]
Length=161
Score = 313 bits (802), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 160/161 (99%), Positives = 161/161 (100%), Gaps = 0/161 (0%)
Query 3 VSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDL 62
+SGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDL
Sbjct 1 MSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDL 60
Query 63 CVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLH 122
CVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLH
Sbjct 61 CVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLH 120
Query 123 GFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD 163
GFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD
Sbjct 121 GFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD 161
>gi|289751952|ref|ZP_06511330.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289692539|gb|EFD59968.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=162
Score = 282 bits (721), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 149/163 (92%), Positives = 149/163 (92%), Gaps = 1/163 (0%)
Query 1 MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW
Sbjct 1 MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
Query 61 DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP 120
DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP
Sbjct 61 DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP 120
Query 121 LHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD 163
LHGFDDFPAAATGAPTG L P G G R VLPDPAD
Sbjct 121 LHGFDDFPAAATGAPTGVACLRRPSLGPGAARTST-VLPDPAD 162
>gi|339296085|gb|AEJ48196.1| hypothetical protein CCDC5079_3006 [Mycobacterium tuberculosis
CCDC5079]
Length=130
Score = 251 bits (642), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 129/130 (99%), Positives = 130/130 (100%), Gaps = 0/130 (0%)
Query 34 VATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDD 93
+ATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDD
Sbjct 1 MATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDD 60
Query 94 LVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRG 153
LVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRG
Sbjct 61 LVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRG 120
Query 154 HLRVLPDPAD 163
HLRVLPDPAD
Sbjct 121 HLRVLPDPAD 130
>gi|41409468|ref|NP_962304.1| hypothetical protein MAP3370c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41398299|gb|AAS05920.1| hypothetical protein MAP_3370c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=165
Score = 238 bits (607), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 126/166 (76%), Positives = 134/166 (81%), Gaps = 4/166 (2%)
Query 1 MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
MRVSGASAA HDSLS VNVPRRCCRPGCPHYAVATLTFVYSDSTAV+GPLATAREPHSW
Sbjct 1 MRVSGASAAFAHDSLSSVNVPRRCCRPGCPHYAVATLTFVYSDSTAVVGPLATAREPHSW 60
Query 61 DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP 120
DLCV HAGRITAPRGW+LVRHAGPLP+HPDEDDLVALADAVREGG SA R G P
Sbjct 61 DLCVNHAGRITAPRGWDLVRHAGPLPTHPDEDDLVALADAVREGG-SAERTLPYAGAAVP 119
Query 121 LHGFDD---FPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD 163
+GF D P A ++APPE +GRRRGHLRVLPDP+D
Sbjct 120 RNGFGDPHLHPGGAQATAPSSSLIAPPEQRSGRRRGHLRVLPDPSD 165
>gi|118618073|ref|YP_906405.1| hypothetical protein MUL_2603 [Mycobacterium ulcerans Agy99]
gi|183981306|ref|YP_001849597.1| hypothetical protein MMAR_1284 [Mycobacterium marinum M]
gi|118570183|gb|ABL04934.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
gi|183174632|gb|ACC39742.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=165
Score = 235 bits (599), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 128/166 (78%), Positives = 134/166 (81%), Gaps = 4/166 (2%)
Query 1 MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
MRVS SAA HDSL +VNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW
Sbjct 1 MRVSAPSAAFPHDSLCLVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
Query 61 DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP 120
DLCVGHAGRITAPRGWELVRHAGPLP+HPDEDDLVALADAVRE GP+A G GA
Sbjct 61 DLCVGHAGRITAPRGWELVRHAGPLPTHPDEDDLVALADAVREQGPTADAS-CAGATGAA 119
Query 121 LHGFDDFPAAATGAPT---GGGVLAPPEPGAGRRRGHLRVLPDPAD 163
+GF D GA GGVLA PE +GRRRGHLRVLPDP+D
Sbjct 120 HNGFSDQVMRHAGAHATAPSGGVLASPEHRSGRRRGHLRVLPDPSD 165
>gi|15827328|ref|NP_301591.1| hypothetical protein ML0762 [Mycobacterium leprae TN]
gi|221229806|ref|YP_002503222.1| hypothetical protein MLBr_00762 [Mycobacterium leprae Br4923]
gi|13092877|emb|CAC30271.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219932913|emb|CAR70856.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=165
Score = 232 bits (591), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 128/168 (77%), Positives = 134/168 (80%), Gaps = 8/168 (4%)
Query 1 MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
MRVSGASA HDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAV+GPLAT REPHSW
Sbjct 1 MRVSGASATFSHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVVGPLATVREPHSW 60
Query 61 DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNG-- 118
DLCV HA RITAPRGWELVRHAGPLPS+PDEDDLVALADAVREG G H GNG
Sbjct 61 DLCVDHAARITAPRGWELVRHAGPLPSNPDEDDLVALADAVREG---PGGEHGSYGNGAR 117
Query 119 APLHGFDDFPAAATGAPT---GGGVLAPPEPGAGRRRGHLRVLPDPAD 163
A L GF D + GA GG+LAP E +GRRRGHLRVLPDP+D
Sbjct 118 ASLGGFADPQLQSAGAHATVPSGGLLAPSELRSGRRRGHLRVLPDPSD 165
>gi|118471977|ref|YP_886203.1| hypothetical protein MSMEG_1833 [Mycobacterium smegmatis str.
MC2 155]
gi|118173264|gb|ABK74160.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=139
Score = 199 bits (507), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 111/150 (74%), Positives = 118/150 (79%), Gaps = 15/150 (10%)
Query 18 VNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWE 77
+NVPRRCCRPGCPHYAVATLTFVYSDSTAV+GPLAT EPHSWDLCVGHA RITAP+GWE
Sbjct 1 MNVPRRCCRPGCPHYAVATLTFVYSDSTAVVGPLATVSEPHSWDLCVGHASRITAPKGWE 60
Query 78 LVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTG 137
LVRHAGPLP+HPDEDDLVALADAVREG R PG + GF D PA TG+
Sbjct 61 LVRHAGPLPTHPDEDDLVALADAVREG------RTGPGPVNGVVAGFSD-PATGTGS--- 110
Query 138 GGVLAP----PEPGAGRRRGHLRVLPDPAD 163
G V+AP PEP GRRRGHLRVLPDP D
Sbjct 111 GAVIAPPVRQPEPN-GRRRGHLRVLPDPTD 139
>gi|145225305|ref|YP_001135983.1| hypothetical protein Mflv_4727 [Mycobacterium gilvum PYR-GCK]
gi|145217791|gb|ABP47195.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=181
Score = 183 bits (464), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 113/166 (69%), Positives = 126/166 (76%), Gaps = 7/166 (4%)
Query 1 MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
+RVSG A+VH +L +VNVPRRCCRPGCPHYAVATLTFVY+DSTAV+GPLAT EPHSW
Sbjct 20 LRVSGVMQAIVHANLLLVNVPRRCCRPGCPHYAVATLTFVYADSTAVVGPLATVSEPHSW 79
Query 61 DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP 120
DLCV HAGRITAPRGWELVRHAGPLP+HPD+DDLVALADAVREG G G +
Sbjct 80 DLCVMHAGRITAPRGWELVRHAGPLPTHPDDDDLVALADAVREGREVPGAGVTAGFSAGL 139
Query 121 LHGFDDFPAAATGAPTGGGVLAPPE---PGAGRRRGHLRVLPDPAD 163
GF D + A GG ++APP GRRRGHLRVLPDPA+
Sbjct 140 NTGFTDPVSGA----HGGALMAPPARRPETNGRRRGHLRVLPDPAE 181
>gi|119867394|ref|YP_937346.1| hypothetical protein Mkms_1344 [Mycobacterium sp. KMS]
gi|119693483|gb|ABL90556.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=155
Score = 183 bits (464), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 117/165 (71%), Positives = 124/165 (76%), Gaps = 14/165 (8%)
Query 3 VSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDL 62
+SG + L DSL VNVPRRCCRPGCPHYAVATLTFVYSDSTAV+GPLAT EPHSWDL
Sbjct 1 MSGTTRRLSRDSLPFVNVPRRCCRPGCPHYAVATLTFVYSDSTAVVGPLATVSEPHSWDL 60
Query 63 CVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLH 122
CVGHAGRITAPRGWELVRHAGPLPSH D+DDLVALADAVREG S G GA +
Sbjct 61 CVGHAGRITAPRGWELVRHAGPLPSHTDDDDLVALADAVREGRDSTG-----PTAGAVVP 115
Query 123 GFDDFPAAATGAPTGGGVLAP----PEPGAGRRRGHLRVLPDPAD 163
GF D T G +LAP PEP GRRRGHLRVLPDPA+
Sbjct 116 GFSD----PTSGAQSGALLAPPVRRPEP-TGRRRGHLRVLPDPAE 155
>gi|108798298|ref|YP_638495.1| hypothetical protein Mmcs_1327 [Mycobacterium sp. MCS]
gi|108768717|gb|ABG07439.1| conserved hypothetical protein [Mycobacterium sp. MCS]
Length=148
Score = 178 bits (452), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 114/158 (73%), Positives = 120/158 (76%), Gaps = 14/158 (8%)
Query 10 LVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGR 69
+ DSL VNVPRRCCRPGCPHYAVATLTFVYSDSTAV+GPLAT EPHSWDLCVGHAGR
Sbjct 1 MSRDSLPFVNVPRRCCRPGCPHYAVATLTFVYSDSTAVVGPLATVSEPHSWDLCVGHAGR 60
Query 70 ITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPA 129
ITAPRGWELVRHAGPLPSH D+DDLVALADAVREG S G GA + GF D
Sbjct 61 ITAPRGWELVRHAGPLPSHTDDDDLVALADAVREGRDSTG-----PTAGAVVPGFSD--- 112
Query 130 AATGAPTGGGVLAP----PEPGAGRRRGHLRVLPDPAD 163
T G +LAP PEP GRRRGHLRVLPDPA+
Sbjct 113 -PTSGAQSGALLAPPVRRPEP-TGRRRGHLRVLPDPAE 148
>gi|240172598|ref|ZP_04751257.1| hypothetical protein MkanA1_25010 [Mycobacterium kansasii ATCC
12478]
Length=140
Score = 177 bits (450), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/140 (73%), Positives = 107/140 (77%), Gaps = 10/140 (7%)
Query 34 VATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDD 93
+ATLTFVYSDSTAV+GPLATAREPHSWDLCV HAGRITAPRGWELVRHAGPLP++PDEDD
Sbjct 1 MATLTFVYSDSTAVVGPLATAREPHSWDLCVSHAGRITAPRGWELVRHAGPLPTNPDEDD 60
Query 94 LVALADAVREGGPSAGRRH---HPGGNGAPLHGFDDFPAAATGAPTG-------GGVLAP 143
LVALADAVREGGP A + HPG NG+ GF D G G GGVLAP
Sbjct 61 LVALADAVREGGPVAAGAYSVAHPGRNGSSHDGFPDPVVHHAGVHAGVHAKAPSGGVLAP 120
Query 144 PEPGAGRRRGHLRVLPDPAD 163
PE GRRRGHLRVLPDP D
Sbjct 121 PEHRNGRRRGHLRVLPDPPD 140
>gi|118465957|ref|YP_883362.1| hypothetical protein MAV_4221 [Mycobacterium avium 104]
gi|118167244|gb|ABK68141.1| conserved hypothetical protein [Mycobacterium avium 104]
gi|336459666|gb|EGO38601.1| Protein of unknown function (DUF3499) [Mycobacterium avium subsp.
paratuberculosis S397]
Length=132
Score = 175 bits (444), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/133 (72%), Positives = 104/133 (79%), Gaps = 4/133 (3%)
Query 34 VATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDD 93
+ATLTFVYSDSTAV+GPLATAREPHSWDLCV HAGRITAPRGW+LVRHAGPLP+HPDEDD
Sbjct 1 MATLTFVYSDSTAVVGPLATAREPHSWDLCVNHAGRITAPRGWDLVRHAGPLPTHPDEDD 60
Query 94 LVALADAVREGGPSAGRRHHPGGNGAPLHGFDD---FPAAATGAPTGGGVLAPPEPGAGR 150
LVALADAVREGG SA R G P +GF D P A ++APPE +GR
Sbjct 61 LVALADAVREGG-SAERTLPYAGAAVPRNGFGDPHLHPGGAQATAPSSSLIAPPEQRSGR 119
Query 151 RRGHLRVLPDPAD 163
RRGHLRVLPDP+D
Sbjct 120 RRGHLRVLPDPSD 132
>gi|254776657|ref|ZP_05218173.1| hypothetical protein MaviaA2_18591 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=132
Score = 174 bits (441), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 94/133 (71%), Positives = 103/133 (78%), Gaps = 4/133 (3%)
Query 34 VATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDD 93
+ATLTFVYSDSTAV+GPLATAREPHSWDLCV HAGRITAPRGW+LVRHAGPLP+HPDEDD
Sbjct 1 MATLTFVYSDSTAVVGPLATAREPHSWDLCVNHAGRITAPRGWDLVRHAGPLPTHPDEDD 60
Query 94 LVALADAVREGGPSAGRRHHPGGNGAPLHGFDD---FPAAATGAPTGGGVLAPPEPGAGR 150
LVALADAVREGG SA R G P +GF D P ++APPE +GR
Sbjct 61 LVALADAVREGG-SAERTLPYAGAAVPRNGFGDPHLHPGGTQATAPSSSLIAPPEQRSGR 119
Query 151 RRGHLRVLPDPAD 163
RRGHLRVLPDP+D
Sbjct 120 RRGHLRVLPDPSD 132
>gi|296168947|ref|ZP_06850616.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295896416|gb|EFG76069.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=133
Score = 174 bits (441), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 100/136 (74%), Positives = 110/136 (81%), Gaps = 9/136 (6%)
Query 34 VATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPL---PSHPD 90
+ATLTFVYSDSTAV+GPLATAREPHSWDLCVGHAGRITAPRGW+LVRHAGPL P+HPD
Sbjct 1 MATLTFVYSDSTAVVGPLATAREPHSWDLCVGHAGRITAPRGWDLVRHAGPLFSEPTHPD 60
Query 91 EDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDD--FPAAATGAPT-GGGVLAPPEPG 147
EDDLVALADAVREG P G R P G GAP++GF D P + + A VLAPPE
Sbjct 61 EDDLVALADAVREGAP--GERAMPYG-GAPINGFADPHIPHSGSQATAPSSSVLAPPEHR 117
Query 148 AGRRRGHLRVLPDPAD 163
+GRRRGHLRVLPDP+D
Sbjct 118 SGRRRGHLRVLPDPSD 133
>gi|254821370|ref|ZP_05226371.1| hypothetical protein MintA_15642 [Mycobacterium intracellulare
ATCC 13950]
Length=132
Score = 173 bits (438), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 99/135 (74%), Positives = 108/135 (80%), Gaps = 8/135 (5%)
Query 34 VATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDD 93
+ATLTFVYSDSTAV+GPLATAREPHSWDLCV HAGRITAPRGWELVRHAGPLP+HPDEDD
Sbjct 1 MATLTFVYSDSTAVVGPLATAREPHSWDLCVNHAGRITAPRGWELVRHAGPLPTHPDEDD 60
Query 94 LVALADAVREGGPSAGRRHHP-GGNGAPLHGFDD----FPAAATGAPTGGGVLAPPEPGA 148
LVALADAVREGG +G R P GG P++GF D AP+ VLAPPE +
Sbjct 61 LVALADAVREGG--SGDRGAPYGGAPTPVNGFADPHLHHGGTQATAPS-SSVLAPPEHRS 117
Query 149 GRRRGHLRVLPDPAD 163
GRRRGHLRVLPDP+D
Sbjct 118 GRRRGHLRVLPDPSD 132
>gi|126433965|ref|YP_001069656.1| hypothetical protein Mjls_1363 [Mycobacterium sp. JLS]
gi|126233765|gb|ABN97165.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=140
Score = 171 bits (434), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 110/150 (74%), Positives = 116/150 (78%), Gaps = 14/150 (9%)
Query 18 VNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWE 77
+NVPRRCCRPGCPHYAVATLTFVYSDSTAV+GPLAT EPHSWDLCVGHAGRITAPRGWE
Sbjct 1 MNVPRRCCRPGCPHYAVATLTFVYSDSTAVVGPLATVSEPHSWDLCVGHAGRITAPRGWE 60
Query 78 LVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTG 137
LVRHAGPLPSH D+DDLVALADAVREG S G GA + GF D T
Sbjct 61 LVRHAGPLPSHTDDDDLVALADAVREGRDSTG-----PTAGAVVPGFSD----PTSGAQS 111
Query 138 GGVLAP----PEPGAGRRRGHLRVLPDPAD 163
G +LAP PEP GRRRGHLRVLPDPA+
Sbjct 112 GALLAPPVRRPEP-TGRRRGHLRVLPDPAE 140
>gi|120402736|ref|YP_952565.1| hypothetical protein Mvan_1736 [Mycobacterium vanbaalenii PYR-1]
gi|119955554|gb|ABM12559.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=138
Score = 169 bits (429), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 108/150 (72%), Positives = 115/150 (77%), Gaps = 16/150 (10%)
Query 18 VNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWE 77
+NVPRRCCRPGCPHYAVATLTFVYSDSTAV+GPLAT EPHSWDLCV HAGRITAPRGWE
Sbjct 1 MNVPRRCCRPGCPHYAVATLTFVYSDSTAVVGPLATVSEPHSWDLCVMHAGRITAPRGWE 60
Query 78 LVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFD-DFPAAATGAPT 136
LVRHAGPLPSHPD+DDLVALADAVRE G + AP+ GF F TGA
Sbjct 61 LVRHAGPLPSHPDDDDLVALADAVRE-----------GRDAAPVAGFTVGFSDPVTGA-H 108
Query 137 GGGVLAPPE---PGAGRRRGHLRVLPDPAD 163
GG ++APP GRRRGHLRVLPDP D
Sbjct 109 GGALMAPPARRPETNGRRRGHLRVLPDPTD 138
>gi|315445603|ref|YP_004078482.1| hypothetical protein Mspyr1_40610 [Mycobacterium sp. Spyr1]
gi|315263906|gb|ADU00648.1| hypothetical protein Mspyr1_40610 [Mycobacterium sp. Spyr1]
Length=145
Score = 166 bits (421), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 104/149 (70%), Positives = 113/149 (76%), Gaps = 7/149 (4%)
Query 18 VNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWE 77
+NVPRRCCRPGCPHYAVATLTFVY+DSTAV+GPLAT EPHSWDLCV HAGRITAPRGWE
Sbjct 1 MNVPRRCCRPGCPHYAVATLTFVYADSTAVVGPLATVSEPHSWDLCVMHAGRITAPRGWE 60
Query 78 LVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTG 137
LVRHAGPLP+HPD+DDLVALADAVREG G G GF D + A G
Sbjct 61 LVRHAGPLPTHPDDDDLVALADAVREGREVPGAGVTAGFTAGLNTGFTDPVSGA----HG 116
Query 138 GGVLAPPE---PGAGRRRGHLRVLPDPAD 163
G ++APP GRRRGHLRVLPDPA+
Sbjct 117 GALMAPPARRPETNGRRRGHLRVLPDPAE 145
>gi|342861461|ref|ZP_08718108.1| hypothetical protein MCOL_21351 [Mycobacterium colombiense CECT
3035]
gi|342130950|gb|EGT84239.1| hypothetical protein MCOL_21351 [Mycobacterium colombiense CECT
3035]
Length=131
Score = 162 bits (410), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/136 (68%), Positives = 101/136 (75%), Gaps = 11/136 (8%)
Query 34 VATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPL---PSHPD 90
+ATLTFVYSDSTAV+GPLATAREPHSWDLCV HAGRITAPRGWELVRHAGPL P+HPD
Sbjct 1 MATLTFVYSDSTAVVGPLATAREPHSWDLCVNHAGRITAPRGWELVRHAGPLLSEPAHPD 60
Query 91 EDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPT---GGGVLAPPEPG 147
EDDLVALADAVREG A P++GF D G+ +LAPPE
Sbjct 61 EDDLVALADAVREGDDRAAPY-----AATPVNGFADAHIHHGGSQATAPSSSLLAPPEHR 115
Query 148 AGRRRGHLRVLPDPAD 163
+GRRRGHLRVLPDP+D
Sbjct 116 SGRRRGHLRVLPDPSD 131
>gi|333991647|ref|YP_004524261.1| hypothetical protein JDM601_3007 [Mycobacterium sp. JDM601]
gi|333487615|gb|AEF37007.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=130
Score = 148 bits (373), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 88/140 (63%), Positives = 97/140 (70%), Gaps = 20/140 (14%)
Query 34 VATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDD 93
+ATLTFVYSDSTAV+GPLATA EPHSWDLC HAGRITAPRGWELVRH GP S P+EDD
Sbjct 1 MATLTFVYSDSTAVVGPLATAAEPHSWDLCFSHAGRITAPRGWELVRHPGPWVS-PEEDD 59
Query 94 LVALADAVREGGPSAGRRHHPGGNGAPLHGF-------DDFPAAATGAPTGGGVLAPPEP 146
L+ALA+AVREG G AP +G+ D A+ GGGVLAPP P
Sbjct 60 LIALAEAVREG---------QSGQAAPANGWYPQAGPADTGSRGASSGTPGGGVLAPPGP 110
Query 147 ---GAGRRRGHLRVLPDPAD 163
GRRRGHLRVLPDP+D
Sbjct 111 PGKSNGRRRGHLRVLPDPSD 130
>gi|226365786|ref|YP_002783569.1| hypothetical protein ROP_63770 [Rhodococcus opacus B4]
gi|226244276|dbj|BAH54624.1| hypothetical protein [Rhodococcus opacus B4]
Length=138
Score = 145 bits (366), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/147 (58%), Positives = 91/147 (62%), Gaps = 17/147 (11%)
Query 18 VNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWE 77
V RRCCRPGC + AVATLT+VYSDSTAV+GPLAT EPHSWDLC HA RITAP+GWE
Sbjct 8 VRSLRRCCRPGCKNPAVATLTYVYSDSTAVVGPLATVDEPHSWDLCETHASRITAPKGWE 67
Query 78 LVRHAGPLPSH-PDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPT 136
LVR+ G S PDEDDL ALA+AVRE G G G A TG
Sbjct 68 LVRYEGGFSSSTPDEDDLTALAEAVREAG--------LGDRGRSERALSTEERAETG--- 116
Query 137 GGGVLAPPEPGAGRRRGHLRVLPDPAD 163
P P RRGHLRVLPDPA+
Sbjct 117 -----TQPGPARTGRRGHLRVLPDPAN 138
>gi|54026597|ref|YP_120839.1| hypothetical protein nfa46240 [Nocardia farcinica IFM 10152]
gi|54018105|dbj|BAD59475.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=125
Score = 144 bits (362), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 82/142 (58%), Positives = 93/142 (66%), Gaps = 22/142 (15%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
RRCCRPGC + AVATLT+VYSDSTAV+GPLAT EPHSWDLC HA RITAP+GWELVRH
Sbjct 2 RRCCRPGCKNPAVATLTYVYSDSTAVVGPLATVAEPHSWDLCETHASRITAPKGWELVRH 61
Query 82 AGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVL 141
G + PD+DDL ALA+AVRE G RR P + G+ ++
Sbjct 62 EGGFSTSPDDDDLTALAEAVREAG---LRRRPPEADQ---RGYREY-------------- 101
Query 142 APPEPGAGR--RRGHLRVLPDP 161
APP R RRGHLRVLPDP
Sbjct 102 APPPQRTTRTGRRGHLRVLPDP 123
>gi|312140687|ref|YP_004008023.1| hypothetical protein REQ_33480 [Rhodococcus equi 103S]
gi|311890026|emb|CBH49344.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=132
Score = 143 bits (361), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 85/143 (60%), Positives = 94/143 (66%), Gaps = 16/143 (11%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
RRCCRPGC + AVATLT+VYSDSTAV+GPLAT EPHSWDLC HA RITAP+GWELVR+
Sbjct 5 RRCCRPGCKNPAVATLTYVYSDSTAVVGPLATVAEPHSWDLCETHASRITAPKGWELVRY 64
Query 82 AGPLPSH-PDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGV 140
G S PDEDDL ALA+AVRE G R G P + +GA
Sbjct 65 EGGFSSSTPDEDDLTALAEAVREAGLGERPRTDDGS-----------PESRSGASVPS-- 111
Query 141 LAPPEPGAGRRRGHLRVLPDPAD 163
APP G RRGHLRVLPDPA+
Sbjct 112 -APPTVRTG-RRGHLRVLPDPAN 132
>gi|226305643|ref|YP_002765603.1| hypothetical protein RER_21560 [Rhodococcus erythropolis PR4]
gi|226184760|dbj|BAH32864.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=144
Score = 142 bits (359), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 85/144 (60%), Positives = 96/144 (67%), Gaps = 22/144 (15%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
RRCCRPGC + AVATLT+VYSDSTAV+GPLAT EPHSWDLC H RITAP+GWELVRH
Sbjct 21 RRCCRPGCKNPAVATLTYVYSDSTAVVGPLATVAEPHSWDLCDTHGSRITAPKGWELVRH 80
Query 82 AGPLPSH-PDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDF-PAAATGAPTGGG 139
G S PDEDDL ALA+AVRE G G R N + + +D PAA + A TG
Sbjct 81 EGGFASSTPDEDDLTALAEAVREAG--LGDR-----NKSNVDADEDIRPAAPSTARTG-- 131
Query 140 VLAPPEPGAGRRRGHLRVLPDPAD 163
RRGHLRVLPDP++
Sbjct 132 -----------RRGHLRVLPDPSN 144
>gi|229489531|ref|ZP_04383394.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229323628|gb|EEN89386.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=169
Score = 141 bits (356), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/143 (59%), Positives = 93/143 (66%), Gaps = 20/143 (13%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
RRCCRPGC + AVATLT+VYSDSTAV+GPLAT EPHSWDLC H RITAP+GWELVRH
Sbjct 46 RRCCRPGCKNPAVATLTYVYSDSTAVVGPLATVAEPHSWDLCDTHGSRITAPKGWELVRH 105
Query 82 AGPLPSH-PDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGV 140
G S PDEDDL ALA+AVRE G G R + D PAA + A TG
Sbjct 106 EGGFASSTPDEDDLTALAEAVREAG--LGDRSKSNVDADE----DIRPAAPSTARTG--- 156
Query 141 LAPPEPGAGRRRGHLRVLPDPAD 163
RRGHLRVLPDP++
Sbjct 157 ----------RRGHLRVLPDPSN 169
>gi|169630684|ref|YP_001704333.1| hypothetical protein MAB_3604c [Mycobacterium abscessus ATCC
19977]
gi|169242651|emb|CAM63679.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=121
Score = 141 bits (355), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 80/131 (62%), Positives = 91/131 (70%), Gaps = 11/131 (8%)
Query 34 VATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDD 93
+ATLTFVY+DSTAV+GPLAT+ EPHSWDLC HA RITAPRGWELVR+ GPLPS+P++DD
Sbjct 1 MATLTFVYADSTAVVGPLATSSEPHSWDLCAQHASRITAPRGWELVRYNGPLPSNPEDDD 60
Query 94 LVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVLAPPEPG-AGRRR 152
LVALADAVRE GG GF + PA AP P P GRRR
Sbjct 61 LVALADAVRETT---------GGVRVAAAGFSE-PALDVSAPAPNVAPRPVHPAPVGRRR 110
Query 153 GHLRVLPDPAD 163
GHLRVLPDP++
Sbjct 111 GHLRVLPDPSE 121
>gi|111023278|ref|YP_706250.1| hypothetical protein RHA1_ro06315 [Rhodococcus jostii RHA1]
gi|110822808|gb|ABG98092.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=162
Score = 140 bits (353), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 84/151 (56%), Positives = 95/151 (63%), Gaps = 18/151 (11%)
Query 14 SLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAP 73
SL+V ++ RRCCRPGC + AVATLT+VYSDSTAV+GPLAT EPHSWDLC HA RITAP
Sbjct 29 SLAVRSL-RRCCRPGCKNPAVATLTYVYSDSTAVVGPLATVDEPHSWDLCETHASRITAP 87
Query 74 RGWELVRHAGPLPSH-PDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAAT 132
+GWELVR+ G S PDEDDL ALA+AVRE G G A T
Sbjct 88 KGWELVRYEGGFSSSTPDEDDLTALAEAVREAG--------LGDRWRSERTVSTEDRAET 139
Query 133 GAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD 163
G P P RRGHLRVLPDP++
Sbjct 140 G--------TQPGPARTGRRGHLRVLPDPSN 162
>gi|317506112|ref|ZP_07963937.1| hypothetical protein HMPREF9336_00306 [Segniliparus rugosus ATCC
BAA-974]
gi|316255611|gb|EFV14856.1| hypothetical protein HMPREF9336_00306 [Segniliparus rugosus ATCC
BAA-974]
Length=121
Score = 127 bits (319), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 72/147 (49%), Positives = 88/147 (60%), Gaps = 29/147 (19%)
Query 18 VNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWE 77
+ +PRRC RPGC AVATLT+VY++STAV+GPLAT EPH+WDLC HA RITAP+GW
Sbjct 1 MRIPRRCSRPGCKMPAVATLTYVYAESTAVVGPLATNAEPHAWDLCEIHAQRITAPKGWA 60
Query 78 LVRHAGPL-PSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPT 136
++R G P H ++DDL ALA+AVRE G R+ P
Sbjct 61 MMRCEGSFTPVHAEDDDLTALAEAVREAG-RGERKSRP---------------------- 97
Query 137 GGGVLAPPEPGAGRRRGHLRVLPDPAD 163
APP + RRGHLRV+PDP D
Sbjct 98 -----APPAASSTGRRGHLRVVPDPVD 119
>gi|296393058|ref|YP_003657942.1| hypothetical protein Srot_0629 [Segniliparus rotundus DSM 44985]
gi|296180205|gb|ADG97111.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=128
Score = 126 bits (317), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 75/147 (52%), Positives = 89/147 (61%), Gaps = 27/147 (18%)
Query 18 VNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWE 77
+ +PRRC RPGC AVATLT+VY++STAV+GPLAT EPH+WDLC HA RITAP+GW
Sbjct 1 MRIPRRCSRPGCKMPAVATLTYVYAESTAVVGPLATNAEPHAWDLCEIHAQRITAPKGWA 60
Query 78 LVRHAGPL-PSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPT 136
++R G P H ++DDL ALA+AVRE AGR G PA A T
Sbjct 61 MMRCEGSFTPVHAEDDDLTALAEAVRE----AGRGERKGAR---------LPAVAGAGAT 107
Query 137 GGGVLAPPEPGAGRRRGHLRVLPDPAD 163
G RRGHLRV+PD AD
Sbjct 108 G-------------RRGHLRVVPDLAD 121
>gi|319948498|ref|ZP_08022631.1| hypothetical protein ES5_03971 [Dietzia cinnamea P4]
gi|319437835|gb|EFV92822.1| hypothetical protein ES5_03971 [Dietzia cinnamea P4]
Length=168
Score = 126 bits (317), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 80/147 (55%), Positives = 90/147 (62%), Gaps = 15/147 (10%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
RRCCRPGCP+ AVATLT+VY+DSTAVIGPL +EPH+WDLC HA RITAPRGW+LVRH
Sbjct 5 RRCCRPGCPNRAVATLTYVYADSTAVIGPLPAVQEPHAWDLCAVHALRITAPRGWDLVRH 64
Query 82 AGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPL--------HGFDDFPAAATG 133
L + ++ DL AL DAV GGP GRR GA L G D A
Sbjct 65 PD-LDTSAEDSDLTALLDAV-TGGPVGGRR-----TGAALVDADRLRRLGVSDPEPALDR 117
Query 134 APTGGGVLAPPEPGAGRRRGHLRVLPD 160
A G L P P G R HLRV+PD
Sbjct 118 ALAGSDPLPAPGPSTGSGRPHLRVVPD 144
>gi|296138913|ref|YP_003646156.1| hypothetical protein Tpau_1186 [Tsukamurella paurometabola DSM
20162]
gi|296027047|gb|ADG77817.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=180
Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 85/177 (49%), Positives = 101/177 (58%), Gaps = 35/177 (19%)
Query 14 SLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAP 73
+L +VN R+CCRPGC ++AVATLTF Y S AV+GPL T EPHSWDLC HA R+TAP
Sbjct 12 NLLLVNPVRQCCRPGCRNHAVATLTFDYRQSIAVLGPLGTTSEPHSWDLCDFHASRMTAP 71
Query 74 RGWELVRHAGPLPSHPD--------EDDLVALADAVREGGPSAGRRHH-------PGGN- 117
RGWE++R+ LP++ +DDL ALAD VREG P R PGG+
Sbjct 72 RGWEMLRN---LPAYSAASVGGAALDDDLTALADTVREGAPGLARERGPRPVVDVPGGDV 128
Query 118 --------GA--PLH-GFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD 163
GA P+H G P A G + G PG RRGHLRVLPDP D
Sbjct 129 PARSLPPIGAMRPVHDGMSQVPPAGQG--SAGAPKHAARPG---RRGHLRVLPDPVD 180
>gi|213964882|ref|ZP_03393081.1| conserved hypothetical protein [Corynebacterium amycolatum SK46]
gi|213952418|gb|EEB63801.1| conserved hypothetical protein [Corynebacterium amycolatum SK46]
Length=135
Score = 120 bits (302), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 73/147 (50%), Positives = 88/147 (60%), Gaps = 16/147 (10%)
Query 18 VNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWE 77
++V RRCCRPGC AVATLT+ Y++STAV+GPLA A EPHSWDLC HA ITAP GWE
Sbjct 1 MSVFRRCCRPGCGKPAVATLTYAYAESTAVVGPLAAASEPHSWDLCEKHARSITAPLGWE 60
Query 78 LVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTG 137
LVR+ P P+ D+DDL ALA+AVRE G +A G L D P
Sbjct 61 LVRYDVPTPAQ-DDDDLTALAEAVREAGRNA--------TGLVLRDEVD------NRPQT 105
Query 138 GGVLAPPEPGAGRR-RGHLRVLPDPAD 163
+ + P + RGHL V+ DP D
Sbjct 106 YKIDSSRHPSTRKSARGHLHVVRDPED 132
>gi|331698953|ref|YP_004335192.1| hypothetical protein Psed_5202 [Pseudonocardia dioxanivorans
CB1190]
gi|326953642|gb|AEA27339.1| hypothetical protein Psed_5202 [Pseudonocardia dioxanivorans
CB1190]
Length=120
Score = 119 bits (297), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/139 (54%), Positives = 81/139 (59%), Gaps = 28/139 (20%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
RRC R GC AVATLT+VY+DSTAV+GPLAT EPHS+DLC HA R+TAPRGWE+VR
Sbjct 5 RRCSRTGCTELAVATLTYVYADSTAVVGPLATQAEPHSYDLCTAHAHRLTAPRGWEVVRF 64
Query 82 AGPL-PSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGV 140
G P P DDL ALA+AVRE G A R PL PA TG
Sbjct 65 EGEFAPPQPSADDLTALAEAVREAG-RADR---------PLDPPPSVPAQGTG------- 107
Query 141 LAPPEPGAGRRRGHLRVLP 159
RRGHLRVLP
Sbjct 108 ----------RRGHLRVLP 116
>gi|256380316|ref|YP_003103976.1| hypothetical protein Amir_6328 [Actinosynnema mirum DSM 43827]
gi|255924619|gb|ACU40130.1| hypothetical protein Amir_6328 [Actinosynnema mirum DSM 43827]
Length=116
Score = 118 bits (295), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 73/144 (51%), Positives = 84/144 (59%), Gaps = 32/144 (22%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
RRC R GC + AVATLT+ Y+DSTAV+GPLAT+ EPHS+DLC HA R+TAPRGWE+VR+
Sbjct 2 RRCSRTGCANPAVATLTYAYADSTAVVGPLATSSEPHSYDLCEEHALRLTAPRGWEVVRY 61
Query 82 AGPL-PSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGV 140
G P P DDL ALA+AVRE G D P
Sbjct 62 QGEFAPPEPTVDDLTALAEAVREAG-----------------RVDRSP------------ 92
Query 141 LAPPE-PGAGRRRGHLRVLPDPAD 163
PPE P RRGHLRVLPDP +
Sbjct 93 -QPPEVPLGTIRRGHLRVLPDPRE 115
>gi|325675683|ref|ZP_08155367.1| hypothetical protein HMPREF0724_13149 [Rhodococcus equi ATCC
33707]
gi|325553654|gb|EGD23332.1| hypothetical protein HMPREF0724_13149 [Rhodococcus equi ATCC
33707]
Length=113
Score = 118 bits (295), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/128 (57%), Positives = 81/128 (64%), Gaps = 16/128 (12%)
Query 37 LTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSH-PDEDDLV 95
+T+VYSDSTAV+GPLAT EPHSWDLC HA RITAP+GWELVR+ G S PDEDDL
Sbjct 1 MTYVYSDSTAVVGPLATVAEPHSWDLCETHASRITAPKGWELVRYEGGFSSSTPDEDDLT 60
Query 96 ALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHL 155
ALA+AVRE G R G P + +GA APP G RRGHL
Sbjct 61 ALAEAVREAGLGERPRTDDGS-----------PESRSGASVPS---APPTVRTG-RRGHL 105
Query 156 RVLPDPAD 163
RVLPDPA+
Sbjct 106 RVLPDPAN 113
>gi|284992678|ref|YP_003411232.1| hypothetical protein Gobs_4299 [Geodermatophilus obscurus DSM
43160]
gi|284065923|gb|ADB76861.1| conserved hypothetical protein [Geodermatophilus obscurus DSM
43160]
Length=117
Score = 117 bits (294), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 72/145 (50%), Positives = 84/145 (58%), Gaps = 31/145 (21%)
Query 17 VVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGW 76
+V RRC R GC A ATLT+VY++STAV+GPLAT EPHS+DLC HAGR+T PRGW
Sbjct 1 MVRQSRRCSRSGCAQPAAATLTYVYAESTAVVGPLATFSEPHSYDLCEFHAGRLTVPRGW 60
Query 77 ELVRHAGPLP-SHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAP 135
E+VRH + S P DDL+ALADAVRE R P N
Sbjct 61 EVVRHEIDVEDSGPTGDDLLALADAVREAA-----RPEPPRN------------------ 97
Query 136 TGGGVLAPPEPGAGRRRGHLRVLPD 160
P + G+G RRGHLRVLPD
Sbjct 98 -------PADDGSGTRRGHLRVLPD 115
>gi|257057073|ref|YP_003134905.1| hypothetical protein Svir_31030 [Saccharomonospora viridis DSM
43017]
gi|256586945|gb|ACU98078.1| hypothetical protein Svir_31030 [Saccharomonospora viridis DSM
43017]
Length=122
Score = 117 bits (294), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 72/140 (52%), Positives = 85/140 (61%), Gaps = 27/140 (19%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
R+C R GC AVATLT+ YSDSTAV+GPLATA EPHS+DLC HA R+TAP+GWE+VRH
Sbjct 5 RKCSRTGCLEPAVATLTYAYSDSTAVVGPLATASEPHSYDLCEAHALRLTAPKGWEVVRH 64
Query 82 AGPLPS-HPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGV 140
G P D+L ALA+AVRE G R+ D PA + P
Sbjct 65 EGEFAVPEPSSDELTALAEAVREAG-----RY-------------DRPAPQSSEP----- 101
Query 141 LAPPEPGAGR-RRGHLRVLP 159
P +P G+ RRGHLRVLP
Sbjct 102 --PEQPRRGQGRRGHLRVLP 119
>gi|325000743|ref|ZP_08121855.1| hypothetical protein PseP1_18337 [Pseudonocardia sp. P1]
Length=117
Score = 117 bits (294), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 74/144 (52%), Positives = 80/144 (56%), Gaps = 32/144 (22%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
RRC R GC AVATLT+VY+DSTAV+GPLAT EPHS+DLC GHA +TAPRGWE+VR
Sbjct 2 RRCSRTGCTELAVATLTYVYADSTAVVGPLATQAEPHSYDLCTGHAHNLTAPRGWEVVRF 61
Query 82 AGPL--PSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGG 139
G P H E DL ALADAVRE AGR P
Sbjct 62 EGEFAPPQHTGE-DLTALADAVRE----AGRLDRP------------------------- 91
Query 140 VLAPPEPGAGRRRGHLRVLPDPAD 163
V PG RRGHLR LP P D
Sbjct 92 VEVVARPGGTGRRGHLRALPTPGD 115
>gi|116669757|ref|YP_830690.1| hypothetical protein Arth_1196 [Arthrobacter sp. FB24]
gi|116609866|gb|ABK02590.1| conserved hypothetical protein [Arthrobacter sp. FB24]
Length=129
Score = 115 bits (289), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/144 (49%), Positives = 87/144 (61%), Gaps = 22/144 (15%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
R+C R C + AVATLT+VY+DSTAV+GPLAT EPH +DLC HAG +T PRGWE++R
Sbjct 5 RQCSRSACRNSAVATLTYVYADSTAVLGPLATYAEPHCYDLCEQHAGSLTVPRGWEVLRL 64
Query 82 AGP-LPSHPDEDDLVALADAVREGG--PSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGG 138
A P P P DDL+ALA+AVRE PS+ G+ AP G +
Sbjct 65 AMPATPPQPGPDDLLALANAVREAALRPSS-------GDSAP------------GQRSAH 105
Query 139 GVLAPPEPGAGRRRGHLRVLPDPA 162
L P P G RRGHLR+L +P+
Sbjct 106 AALEAPPPAEGARRGHLRILREPS 129
>gi|117927672|ref|YP_872223.1| hypothetical protein Acel_0463 [Acidothermus cellulolyticus 11B]
gi|117648135|gb|ABK52237.1| conserved hypothetical protein [Acidothermus cellulolyticus 11B]
Length=117
Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/140 (50%), Positives = 80/140 (58%), Gaps = 29/140 (20%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
RRCCR C AVATLT+VY+DSTAV+GPLAT EPHS+DLC H+ R+TAP GWE++R
Sbjct 5 RRCCRAACGQPAVATLTYVYADSTAVLGPLATYAEPHSYDLCSKHSARLTAPLGWEIIRL 64
Query 82 AGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVL 141
P P DDLVALADA+RE G P G G DD + + G
Sbjct 65 EISEPPAPGPDDLVALADAIREAG------RQPAG------GVDDDQSRSQG-------- 104
Query 142 APPEPGAGRRRGHLRVLPDP 161
RRGHLRVL P
Sbjct 105 ---------RRGHLRVLRSP 115
>gi|302534802|ref|ZP_07287144.1| conserved hypothetical protein [Streptomyces sp. C]
gi|302443697|gb|EFL15513.1| conserved hypothetical protein [Streptomyces sp. C]
Length=154
Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/147 (50%), Positives = 86/147 (59%), Gaps = 24/147 (16%)
Query 16 SVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRG 75
+VV++ RRC R C AVATLT+VY+DSTAV+GPLAT EPH +DLC H+ R+TAPRG
Sbjct 29 NVVSLVRRCSRTACGRPAVATLTYVYADSTAVLGPLATYAEPHCYDLCAEHSERLTAPRG 88
Query 76 WELVRHA-GPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGA 134
WE+VR + G PS P DDL ALA+AVRE R GG+G GA
Sbjct 89 WEVVRLSDGSGPSRPSGDDLEALANAVREAARPPERAAEAGGSG-------------PGA 135
Query 135 PTGGGVLAPPEPGAGRRRGHLRVLPDP 161
P G RRGHLRVL P
Sbjct 136 PVAGET----------RRGHLRVLRSP 152
>gi|145593507|ref|YP_001157804.1| hypothetical protein Strop_0949 [Salinispora tropica CNB-440]
gi|145302844|gb|ABP53426.1| hypothetical protein Strop_0949 [Salinispora tropica CNB-440]
Length=135
Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/145 (51%), Positives = 87/145 (60%), Gaps = 24/145 (16%)
Query 16 SVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRG 75
+ V PRRC R GCP AVATLT+VY++STAV+GPLA EPH++DLC HA +TAPRG
Sbjct 11 AAVRSPRRCTRNGCPRQAVATLTYVYNESTAVVGPLAAFAEPHTYDLCEPHARSLTAPRG 70
Query 76 WELVRHAGPL-PSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGA 134
W++VRH G P P DDLVALA+AVRE P P G D PA +T +
Sbjct 71 WDVVRHEGEFEPPPPTTDDLVALAEAVREAA-------RPVAPRPPEDGHD--PATSTSS 121
Query 135 PTGGGVLAPPEPGAGRRRGHLRVLP 159
TG RRGHLRV+P
Sbjct 122 -TG-------------RRGHLRVIP 132
>gi|302530044|ref|ZP_07282386.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302438939|gb|EFL10755.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=132
Score = 114 bits (285), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 71/148 (48%), Positives = 82/148 (56%), Gaps = 32/148 (21%)
Query 14 SLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAP 73
S +V R+C R GC AVATLT+ Y DSTAV+GPLATA EPHS+DLC HA R+T P
Sbjct 12 SFRIVRSVRKCSRTGCLEPAVATLTYAYQDSTAVVGPLATASEPHSYDLCEAHALRLTVP 71
Query 74 RGWELVRHAGPLPS-HPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAAT 132
+GWE+VRH G + D+L ALA+AVRE G S D P A
Sbjct 72 KGWEVVRHEGAFAAPEQSADELTALAEAVREAGRS------------------DVPPAQ- 112
Query 133 GAPTGGGVLAPPEP-GAGRRRGHLRVLP 159
PEP G RRGHLRVLP
Sbjct 113 -----------PEPEGPSGRRGHLRVLP 129
>gi|25027317|ref|NP_737371.1| hypothetical protein CE0761 [Corynebacterium efficiens YS-314]
gi|259506545|ref|ZP_05749447.1| conserved hypothetical protein [Corynebacterium efficiens YS-314]
gi|23492598|dbj|BAC17571.1| conserved hypothetical protein [Corynebacterium efficiens YS-314]
gi|259165965|gb|EEW50519.1| conserved hypothetical protein [Corynebacterium efficiens YS-314]
Length=151
Score = 114 bits (285), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 74/163 (46%), Positives = 92/163 (57%), Gaps = 19/163 (11%)
Query 1 MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSW 60
M + ++ L D +SV RRC RPGC AVATLT+ YS+STAV+GPLA A EPHSW
Sbjct 1 MTIQTSTKGL--DFVSVSQF-RRCSRPGCGKPAVATLTYAYSESTAVVGPLAPAAEPHSW 57
Query 61 DLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAP 120
DLC HA RITAP GWE++R P D++DL ALA+AVRE G R+ G +
Sbjct 58 DLCEHHAERITAPLGWEMLRVNDIFPE--DDEDLTALAEAVREAG-----RNASGLVSSE 110
Query 121 LHGFDDFPAAATGAPTGGGVLAPPEPGAGRRRGHLRVLPDPAD 163
++ P + A RRGHL V+PDP D
Sbjct 111 EEVGENHPVNRS---------ARRAEYRAHRRGHLYVVPDPED 144
>gi|325962631|ref|YP_004240537.1| hypothetical protein Asphe3_12240 [Arthrobacter phenanthrenivorans
Sphe3]
gi|323468718|gb|ADX72403.1| hypothetical protein Asphe3_12240 [Arthrobacter phenanthrenivorans
Sphe3]
Length=128
Score = 114 bits (284), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 67/142 (48%), Positives = 82/142 (58%), Gaps = 19/142 (13%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
R+C R C AVATLT+VY+DSTAV+GPLAT EPH +DLC HA +T PRGWE++R
Sbjct 5 RQCSRSACRQSAVATLTYVYADSTAVLGPLATYAEPHCYDLCEQHADSLTVPRGWEVLRL 64
Query 82 AGP-LPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGV 140
A P P P DDL+ALA+AVR+ + P G P +A AP G
Sbjct 65 AMPSTPQQPGPDDLLALANAVRDAAALPAQPQQPAQRG---------PHSALEAPAG--- 112
Query 141 LAPPEPGAGRRRGHLRVLPDPA 162
G RRGHLRVL +P+
Sbjct 113 ------TEGTRRGHLRVLREPS 128
>gi|159036546|ref|YP_001535799.1| hypothetical protein Sare_0891 [Salinispora arenicola CNS-205]
gi|157915381|gb|ABV96808.1| conserved hypothetical protein [Salinispora arenicola CNS-205]
Length=123
Score = 114 bits (284), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 70/143 (49%), Positives = 83/143 (59%), Gaps = 24/143 (16%)
Query 18 VNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWE 77
+ PRRC R GCP AVATLT+VY++STAV+GPLA EPH++DLC HA +TAPRGW+
Sbjct 1 MRSPRRCTRNGCPRQAVATLTYVYNESTAVVGPLAAFAEPHTYDLCEPHARSLTAPRGWD 60
Query 78 LVRHAGPL-PSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPT 136
+VRH G P P DDLVALA+AVRE P P D AT P+
Sbjct 61 VVRHEGEFEPPPPTTDDLVALAEAVREAA-------RPAVPRPPEDDHD----PATSTPS 109
Query 137 GGGVLAPPEPGAGRRRGHLRVLP 159
G RRGHLRV+P
Sbjct 110 TG------------RRGHLRVIP 120
>gi|336119564|ref|YP_004574341.1| hypothetical protein MLP_39240 [Microlunatus phosphovorus NM-1]
gi|334687353|dbj|BAK36938.1| hypothetical protein MLP_39240 [Microlunatus phosphovorus NM-1]
Length=128
Score = 114 bits (284), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 72/142 (51%), Positives = 84/142 (60%), Gaps = 18/142 (12%)
Query 20 VPRRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELV 79
V RRC R GCP AVATLT+VYSDSTAV+GPLA EPH +DLC HA + APRGWE++
Sbjct 2 VSRRCSRAGCPGAAVATLTYVYSDSTAVLGPLAARNEPHGYDLCHTHAENLRAPRGWEVI 61
Query 80 RHAGPLPSHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGG 139
R P P DDL+ALA+AVRE G S + P+H D PA G
Sbjct 62 RKEIPTEPEPSSDDLLALANAVREVGFSYDQ---------PVH---DRPAEVRPREQRPG 109
Query 140 VLAPPEPGAGRRRGHLRVLPDP 161
++ E G RRGHL VL DP
Sbjct 110 IV---ELG---RRGHLTVLADP 125
>gi|296130181|ref|YP_003637431.1| hypothetical protein Cfla_2342 [Cellulomonas flavigena DSM 20109]
gi|296021996|gb|ADG75232.1| conserved hypothetical protein [Cellulomonas flavigena DSM 20109]
Length=129
Score = 114 bits (284), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 73/139 (53%), Positives = 84/139 (61%), Gaps = 23/139 (16%)
Query 22 RRCCRPGCPHYAVATLTFVYSDSTAVIGPLATAREPHSWDLCVGHAGRITAPRGWELVRH 81
R+C R CPH AVATLT+VY+DSTAV+GPLA EPHS+DLCV HA R+TAPRGWE+VR
Sbjct 5 RQCSRTACPHAAVATLTYVYADSTAVLGPLAQLAEPHSYDLCVEHADRLTAPRGWEVVRL 64
Query 82 AGPLP-SHPDEDDLVALADAVREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGV 140
L + P DDLVALA+AVRE AGRR
Sbjct 65 LPDLQAAAPSHDDLVALAEAVRE----AGRRR------------------VPEPAPALAP 102
Query 141 LAPPEPGAGRRRGHLRVLP 159
+APP P RRRGHLRV+P
Sbjct 103 VAPPAPDTSRRRGHLRVVP 121
Lambda K H
0.319 0.139 0.455
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 129924364284
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40