BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1682
Length=305
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608820|ref|NP_216198.1| coiled-coil structural protein [Myc... 603 1e-170
gi|289554544|ref|ZP_06443754.1| conserved hypothetical protein [... 601 4e-170
gi|289443139|ref|ZP_06432883.1| conserved hypothetical protein [... 600 9e-170
gi|289750238|ref|ZP_06509616.1| conserved hypothetical protein [... 600 1e-169
gi|15841139|ref|NP_336176.1| AbpS-related protein [Mycobacterium... 550 1e-154
gi|289447294|ref|ZP_06437038.1| coiled-coil structural protein [... 463 2e-128
gi|240170128|ref|ZP_04748787.1| AbpS-related protein [Mycobacter... 359 3e-97
gi|169628622|ref|YP_001702271.1| hypothetical protein MAB_1531c ... 297 2e-78
gi|342862051|ref|ZP_08718694.1| hypothetical protein MCOL_24291 ... 132 9e-29
gi|41408720|ref|NP_961556.1| hypothetical protein MAP2622 [Mycob... 126 4e-27
gi|254821238|ref|ZP_05226239.1| hypothetical protein MintA_14972... 124 2e-26
gi|254774178|ref|ZP_05215694.1| hypothetical protein MaviaA2_058... 124 2e-26
gi|118466474|ref|YP_882827.1| hypothetical protein MAV_3650 [Myc... 124 3e-26
gi|41408965|ref|NP_961801.1| hypothetical protein MAP2867c [Myco... 123 3e-26
gi|336458979|gb|EGO37933.1| DivIVA protein [Mycobacterium avium ... 123 3e-26
gi|336458643|gb|EGO37605.1| DivIVA protein [Mycobacterium avium ... 123 4e-26
gi|118465991|ref|YP_880545.1| hypothetical protein MAV_1300 [Myc... 106 4e-21
gi|111018461|ref|YP_701433.1| myosin [Rhodococcus jostii RHA1] >... 100 2e-19
gi|226360583|ref|YP_002778361.1| hypothetical protein ROP_11690 ... 99.4 6e-19
gi|296171674|ref|ZP_06852888.1| conserved hypothetical protein [... 99.0 7e-19
gi|342858376|ref|ZP_08715031.1| hypothetical protein MCOL_05861 ... 95.9 6e-18
gi|226307383|ref|YP_002767343.1| hypothetical protein RER_38960 ... 94.0 3e-17
gi|229493826|ref|ZP_04387603.1| M protein [Rhodococcus erythropo... 90.9 2e-16
gi|54023040|ref|YP_117282.1| hypothetical protein nfa10730 [Noca... 87.0 3e-15
gi|333920904|ref|YP_004494485.1| hypothetical protein AS9A_3240 ... 87.0 3e-15
gi|312139186|ref|YP_004006522.1| hypothetical protein REQ_17680 ... 86.3 6e-15
gi|296138990|ref|YP_003646233.1| myosin [Tsukamurella paurometab... 70.5 3e-10
gi|300783497|ref|YP_003763788.1| chromosome segregation-related ... 67.0 4e-09
gi|257056833|ref|YP_003134665.1| hypothetical protein Svir_28570... 65.1 1e-08
gi|256380099|ref|YP_003103759.1| hypothetical protein Amir_6106 ... 64.7 2e-08
gi|302524849|ref|ZP_07277191.1| predicted protein [Streptomyces ... 62.8 6e-08
gi|134102678|ref|YP_001108339.1| M protein [Saccharopolyspora er... 60.8 3e-07
gi|256390452|ref|YP_003112016.1| cellulose-binding protein [Cate... 55.5 1e-05
gi|302530141|ref|ZP_07282483.1| predicted protein [Streptomyces ... 53.1 5e-05
gi|134101871|ref|YP_001107532.1| cellulose-binding protein [Sacc... 49.3 7e-04
gi|72162795|ref|YP_290452.1| cellulose-binding protein [Thermobi... 47.0 0.004
gi|297559335|ref|YP_003678309.1| cellulose-binding protein [Noca... 46.6 0.004
gi|258653271|ref|YP_003202427.1| hypothetical protein Namu_3105 ... 46.6 0.005
gi|290957327|ref|YP_003488509.1| cellulose-binding protein [Stre... 45.8 0.008
gi|300789928|ref|YP_003770219.1| hypothetical protein AMED_8114 ... 44.7 0.020
gi|329939858|ref|ZP_08289159.1| cellulose-binding protein [Strep... 43.1 0.048
gi|302542457|ref|ZP_07294799.1| putative cellulose-binding prote... 43.1 0.052
gi|294631434|ref|ZP_06709994.1| cellulose-binding protein [Strep... 43.1 0.055
gi|21223756|ref|NP_629535.1| cellulose-binding protein [Streptom... 43.1 0.056
gi|333990778|ref|YP_004523392.1| hypothetical protein JDM601_213... 42.7 0.068
gi|297202425|ref|ZP_06919822.1| cellulose-binding protein [Strep... 42.7 0.072
gi|345009364|ref|YP_004811718.1| putative cellulose-binding prot... 42.4 0.082
gi|297157165|gb|ADI06877.1| putative cellulose-binding protein [... 42.4 0.094
gi|311898405|dbj|BAJ30813.1| putative filament-forming protein [... 42.4 0.10
gi|29829401|ref|NP_824035.1| cellulose-binding protein [Streptom... 42.0 0.11
>gi|15608820|ref|NP_216198.1| coiled-coil structural protein [Mycobacterium tuberculosis H37Rv]
gi|31792868|ref|NP_855361.1| coiled-coil structural protein [Mycobacterium bovis AF2122/97]
gi|121637589|ref|YP_977812.1| putative coiled-coil structural protein [Mycobacterium bovis
BCG str. Pasteur 1173P2]
59 more sequence titles
Length=305
Score = 603 bits (1554), Expect = 1e-170, Method: Compositional matrix adjust.
Identities = 304/305 (99%), Positives = 305/305 (100%), Gaps = 0/305 (0%)
Query 1 VLPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA 60
+LPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA
Sbjct 1 MLPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA 60
Query 61 QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT 120
QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT
Sbjct 61 QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT 120
Query 121 LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR 180
LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR
Sbjct 121 LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR 180
Query 181 VIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADE 240
VIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADE
Sbjct 181 VIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADE 240
Query 241 QIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVA 300
QIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVA
Sbjct 241 QIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVA 300
Query 301 ELRAI 305
ELRAI
Sbjct 301 ELRAI 305
>gi|289554544|ref|ZP_06443754.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
gi|289439176|gb|EFD21669.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
Length=304
Score = 601 bits (1550), Expect = 4e-170, Method: Compositional matrix adjust.
Identities = 303/304 (99%), Positives = 304/304 (100%), Gaps = 0/304 (0%)
Query 2 LPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQAQ 61
+PQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQAQ
Sbjct 1 MPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQAQ 60
Query 62 HCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATL 121
HCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATL
Sbjct 61 HCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATL 120
Query 122 VSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRV 181
VSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRV
Sbjct 121 VSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRV 180
Query 182 IDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQ 241
IDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQ
Sbjct 181 IDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQ 240
Query 242 IEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVAE 301
IEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVAE
Sbjct 241 IEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVAE 300
Query 302 LRAI 305
LRAI
Sbjct 301 LRAI 304
>gi|289443139|ref|ZP_06432883.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289569729|ref|ZP_06449956.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289753773|ref|ZP_06513151.1| coiled-coil structural protein [Mycobacterium tuberculosis EAS054]
gi|289416058|gb|EFD13298.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289543483|gb|EFD47131.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289694360|gb|EFD61789.1| coiled-coil structural protein [Mycobacterium tuberculosis EAS054]
Length=305
Score = 600 bits (1547), Expect = 9e-170, Method: Compositional matrix adjust.
Identities = 303/305 (99%), Positives = 304/305 (99%), Gaps = 0/305 (0%)
Query 1 VLPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA 60
+LPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA
Sbjct 1 MLPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA 60
Query 61 QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT 120
QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT
Sbjct 61 QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT 120
Query 121 LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR 180
LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR
Sbjct 121 LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR 180
Query 181 VIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADE 240
VIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADE
Sbjct 181 VIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADE 240
Query 241 QIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVA 300
QIEHGLSEARRTLEEISQ RVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVA
Sbjct 241 QIEHGLSEARRTLEEISQWRVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVA 300
Query 301 ELRAI 305
ELRAI
Sbjct 301 ELRAI 305
>gi|289750238|ref|ZP_06509616.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289690825|gb|EFD58254.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=305
Score = 600 bits (1546), Expect = 1e-169, Method: Compositional matrix adjust.
Identities = 302/305 (99%), Positives = 304/305 (99%), Gaps = 0/305 (0%)
Query 1 VLPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA 60
+LPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA
Sbjct 1 MLPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA 60
Query 61 QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT 120
QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT
Sbjct 61 QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT 120
Query 121 LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR 180
LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR
Sbjct 121 LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR 180
Query 181 VIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADE 240
VIDARRREKAEQELDAEIIRLRTDAQFQ+DDQLQATQQECEKRLGEAKIEADRRLHVADE
Sbjct 181 VIDARRREKAEQELDAEIIRLRTDAQFQVDDQLQATQQECEKRLGEAKIEADRRLHVADE 240
Query 241 QIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVA 300
QIEHGLSEARRTLEEISQ RVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVA
Sbjct 241 QIEHGLSEARRTLEEISQWRVGILEQLARIHAQLENIPALLESARHSETEPLQSINGAVA 300
Query 301 ELRAI 305
ELRAI
Sbjct 301 ELRAI 305
>gi|15841139|ref|NP_336176.1| AbpS-related protein [Mycobacterium tuberculosis CDC1551]
gi|254231877|ref|ZP_04925204.1| hypothetical protein TBCG_01635 [Mycobacterium tuberculosis C]
gi|308401197|ref|ZP_07493378.2| hypothetical protein TMLG_00664 [Mycobacterium tuberculosis SUMu012]
6 more sequence titles
Length=282
Score = 550 bits (1417), Expect = 1e-154, Method: Compositional matrix adjust.
Identities = 281/282 (99%), Positives = 282/282 (100%), Gaps = 0/282 (0%)
Query 24 VTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASLKDENAALK 83
+TTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASLKDENAALK
Sbjct 1 MTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASLKDENAALK 60
Query 84 DTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADM 143
DTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADM
Sbjct 61 DTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADM 120
Query 144 NARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRT 203
NARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRT
Sbjct 121 NARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRT 180
Query 204 DAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGI 263
DAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGI
Sbjct 181 DAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGI 240
Query 264 LEQLARIHAQLENIPALLESARHSETEPLQSINGAVAELRAI 305
LEQLARIHAQLENIPALLESARHSETEPLQSINGAVAELRAI
Sbjct 241 LEQLARIHAQLENIPALLESARHSETEPLQSINGAVAELRAI 282
>gi|289447294|ref|ZP_06437038.1| coiled-coil structural protein [Mycobacterium tuberculosis CPHL_A]
gi|289420252|gb|EFD17453.1| coiled-coil structural protein [Mycobacterium tuberculosis CPHL_A]
Length=238
Score = 463 bits (1191), Expect = 2e-128, Method: Compositional matrix adjust.
Identities = 232/232 (100%), Positives = 232/232 (100%), Gaps = 0/232 (0%)
Query 1 VLPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA 60
VLPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA
Sbjct 7 VLPQRPNCTKLFRPRRGVSERYRVTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQA 66
Query 61 QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT 120
QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT
Sbjct 67 QHCEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAAT 126
Query 121 LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR 180
LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR
Sbjct 127 LVSAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMR 186
Query 181 VIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEAD 232
VIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEAD
Sbjct 187 VIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEAD 238
>gi|240170128|ref|ZP_04748787.1| AbpS-related protein [Mycobacterium kansasii ATCC 12478]
Length=282
Score = 359 bits (921), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 192/282 (69%), Positives = 244/282 (87%), Gaps = 0/282 (0%)
Query 24 VTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASLKDENAALK 83
+TT N AP+F+RTR+GYDP AVN YIAEL++++QAQH E E L+ EIA+L+DENA L+
Sbjct 1 MTTFVNDPAPKFERTRNGYDPAAVNQYIAELMVQRQAQHYETERLRVEIAALRDENAVLR 60
Query 84 DTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADM 143
DTSPS AVTDR+AKMLR AVDEVFQMQSEARAEA+ L++AA++EAEA R KR+MLA +
Sbjct 61 DTSPSPPAVTDRLAKMLRFAVDEVFQMQSEARAEASALIAAAKNEAEAARVSKRDMLAQI 120
Query 144 NARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRT 203
ARQRAL+ EHA+ +RRAREEAE+L AQA E+++MR+ DARRREKAEQ+LDAE+IRLR+
Sbjct 121 EARQRALDLEHAEAVRRAREEAEELRAQAVREIDQMRIADARRREKAEQQLDAELIRLRS 180
Query 204 DAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGI 263
DAQFQIDDQL+ TQQEC+KRL +AK EA+RR+ +ADEQ+E ++EARR L++++QRR+G+
Sbjct 181 DAQFQIDDQLRTTQQECDKRLNDAKTEAERRVRLADEQVEQRMAEARRMLDDVNQRRLGV 240
Query 264 LEQLARIHAQLENIPALLESARHSETEPLQSINGAVAELRAI 305
LEQLARIHA+LE+IPA+LESAR +ET LQS+ GA +ELRAI
Sbjct 241 LEQLARIHAKLEDIPAILESARCAETNMLQSMGGAFSELRAI 282
>gi|169628622|ref|YP_001702271.1| hypothetical protein MAB_1531c [Mycobacterium abscessus ATCC
19977]
gi|169240589|emb|CAM61617.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=295
Score = 297 bits (760), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 155/268 (58%), Positives = 204/268 (77%), Gaps = 7/268 (2%)
Query 24 VTTAHNGSAPRFQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKA-------EIASLK 76
TT N S F RTR+GYDP+ VN YI+ L + Q+ ++ETLK+ +I+SLK
Sbjct 2 TTTTLNESTRNFTRTRNGYDPIEVNEYISRLTIMHQSGLNDVETLKSRLEDATKQISSLK 61
Query 77 DENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQK 136
DE L DTSPSA A+TDRMAKMLR+AVDEV +MQ+EAR E+A L+++A+ +AEA+RT+
Sbjct 62 DEVTTLSDTSPSAHAMTDRMAKMLRIAVDEVAEMQNEARTESAALIASAKADAEAMRTKH 121
Query 137 REMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDA 196
+E+LADM ARQ ALE+E+ +VM RAREEA Q+VAQA +E ER+R +A+RREKAE ELD
Sbjct 122 KELLADMAARQSALETEYTEVMARAREEANQIVAQAVSESERLRAAEAKRREKAETELDE 181
Query 197 EIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEI 256
E+ +LRTD Q +D+Q ++ Q ECEKRL +AK EADRRL +ADEQIE L +ARR++EEI
Sbjct 182 ELTKLRTDTQSAVDEQRRSIQGECEKRLADAKDEADRRLRLADEQIERRLDQARRSVEEI 241
Query 257 SQRRVGILEQLARIHAQLENIPALLESA 284
Q+R+ ILEQL +H +LE+IP+ LESA
Sbjct 242 GQQRISILEQLMGVHGRLESIPSALESA 269
>gi|342862051|ref|ZP_08718694.1| hypothetical protein MCOL_24291 [Mycobacterium colombiense CECT
3035]
gi|342130355|gb|EGT83670.1| hypothetical protein MCOL_24291 [Mycobacterium colombiense CECT
3035]
Length=257
Score = 132 bits (331), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 100/274 (37%), Positives = 143/274 (53%), Gaps = 58/274 (21%)
Query 32 APRFQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKA--------------EIASLKD 77
AP F RT GYD AV+ YI L+ + ++ E+ L+A E+A LKD
Sbjct 6 APTFTRTFRGYDRAAVDTYIQVLLTKYKSLSDEVHNLRAQRNESNDEAVALRIEVACLKD 65
Query 78 ENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKR 137
E A L DTSPS A+ RMAK+LR A+DE+ QMQ E+RAE L++ A EA+A + + R
Sbjct 66 EIAVLSDTSPSPYAMQHRMAKLLRRAIDEISQMQDESRAEVEALIAQAEAEADAAQREHR 125
Query 138 EMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAE 197
E+LAD+ A+++A+E+E A + +E DAE
Sbjct 126 ELLADIAAQRKAVEAECA---------------------------------ASRKEFDAE 152
Query 198 IIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEIS 257
+ RLR +AQ ID Q Q+E ++ L +AK E+ H +ARR L E +
Sbjct 153 LARLRAEAQSAIDAAWQEAQRERDQLLTDAK-----------ERARHSDEQARRALAEAN 201
Query 258 QRRVGILEQLARIHAQLENIPALLESARHSETEP 291
QRR+ ILE+L + L+ +PA+LESA + P
Sbjct 202 QRRIMILEELMGVAHDLQRVPAILESAYQERSNP 235
>gi|41408720|ref|NP_961556.1| hypothetical protein MAP2622 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41397078|gb|AAS04939.1| hypothetical protein MAP_2622 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=314
Score = 126 bits (316), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 108/315 (35%), Positives = 159/315 (51%), Gaps = 69/315 (21%)
Query 2 LPQRPNCTKLFRPRRGVSERYRVTTAH----NGS-------APRFQRTRSGYDPVAVNHY 50
P R FR RG+ + R NG A F R GYDP AV+ Y
Sbjct 22 FPCRTTTLLRFRGDRGILDGGRCGLCQALLQNGGWHMITEPATAFSRVFRGYDPAAVDAY 81
Query 51 IAELVLRQQAQHCEIETLKA--------------EIASLKDENAALKDTSPSAQAVTDRM 96
+ L+ +Q+ E++ L+A E+A LKDE A L DTSP+ A+ RM
Sbjct 82 VEALLAKQKLLIDEVQNLRAQRNESGDQLAALRIEVACLKDEVAHLSDTSPTPYAIQHRM 141
Query 97 AKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRALESEHAD 156
KMLR ++E+ +MQ+EA+AEA ++++AA + EA + +E+LADM A++ ALE+E
Sbjct 142 TKMLRRTLEEISRMQAEAQAEAESMIAAAAAQVEASHREHQELLADMAAQREALETE--- 198
Query 157 VMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQAT 216
RE A++EL+AE+ +R +AQ ID+ Q
Sbjct 199 ------------------------------REDAKKELEAELATMRAEAQAAIDEARQEA 228
Query 217 QQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLEN 276
++EC + L EAK +AD DE+ ARRT+EE +Q+R+ ILE+L + L+
Sbjct 229 ERECARLLAEAKQKADD----LDER-------ARRTVEEANQQRIMILEELMDVARDLQG 277
Query 277 IPALLESARHSETEP 291
+PA L+SA T P
Sbjct 278 LPASLQSAYQERTSP 292
>gi|254821238|ref|ZP_05226239.1| hypothetical protein MintA_14972 [Mycobacterium intracellulare
ATCC 13950]
Length=244
Score = 124 bits (311), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 103/273 (38%), Positives = 152/273 (56%), Gaps = 53/273 (19%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEI-------ASLKDENAALKDTSP 87
F R GYD AV+ +I L +QQ ++E+L+A + A+L+ E AAL DTSP
Sbjct 9 FSRKFMGYDASAVDAHIEMLTTKQQLLLDDVESLRARLQESGEQTAALRKEVAALTDTSP 68
Query 88 SAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQ 147
S A+ RMAKML+ AVDEV +MQ+EARAEA L++AA EAEA ++ E+LADM A+Q
Sbjct 69 SPHAMQKRMAKMLQRAVDEVAEMQAEARAEADALIAAAEAEAEAAERKRDEVLADMAAQQ 128
Query 148 RALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQF 207
+ALE+E+ E+ +++LDAE+ +R + Q
Sbjct 129 KALEAEY---------------------------------EQTKEKLDAELAEMRAETQA 155
Query 208 QIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQL 267
ID+ + + ECE+ LG+AK AD H +ARR ++E + +R+ ILEQL
Sbjct 156 AIDEARRNAELECERLLGDAKQGAD-----------HYRDKARRAVDEATAQRIEILEQL 204
Query 268 ARIHAQLENIPALLESARHSETEPLQSINGAVA 300
++ L+ +PA LE+A + P S +GA+A
Sbjct 205 MGVYRDLDGVPAALEAAYQEQKNP--SEDGALA 235
>gi|254774178|ref|ZP_05215694.1| hypothetical protein MaviaA2_05825 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=257
Score = 124 bits (310), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 98/272 (37%), Positives = 147/272 (55%), Gaps = 60/272 (22%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKA--------------EIASLKDENA 80
F R GYDP AV+ Y+ L+ +Q+ E++ L+A E+A LKDE A
Sbjct 9 FSRVFRGYDPAAVDAYVEALLAKQKLLIDEVQNLRAQRNESGDQLAALRIEVACLKDEVA 68
Query 81 ALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREML 140
L DTSP+ A+ RM KMLR ++E+ +MQ+EA+AEA ++++AA + EA + +E+L
Sbjct 69 HLSDTSPTPYAIQHRMTKMLRRTLEEISRMQAEAQAEAESMIAAAAAQVEASHREHQELL 128
Query 141 ADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIR 200
ADM A++ ALE+E RE+A+++L+AE+
Sbjct 129 ADMAAQREALETE---------------------------------REEAKKDLEAELAT 155
Query 201 LRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSE-ARRTLEEISQR 259
+R +AQ ID+ Q ++EC + L EAK +AD L E ARRT+EE +Q+
Sbjct 156 MRAEAQAAIDEARQEAERECARLLAEAKQKAD------------DLDERARRTVEEANQQ 203
Query 260 RVGILEQLARIHAQLENIPALLESARHSETEP 291
R+ ILE+L + L+ +PA LESA T P
Sbjct 204 RIMILEELMDVARDLQGLPASLESAYQERTSP 235
>gi|118466474|ref|YP_882827.1| hypothetical protein MAV_3650 [Mycobacterium avium 104]
gi|254776086|ref|ZP_05217602.1| hypothetical protein MaviaA2_15640 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|118167761|gb|ABK68658.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=244
Score = 124 bits (310), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 106/277 (39%), Positives = 153/277 (56%), Gaps = 56/277 (20%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEI-------ASLKDENAALKDTSP 87
F R GYD AV+ +I L +QQ ++E+L+A + A+L+ E AAL DTSP
Sbjct 9 FSRKFMGYDTTAVDAHIEMLTAKQQLLIDDVESLRARLQESNEQTAALRKEVAALTDTSP 68
Query 88 SAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQ 147
+ AV RMAKML AVDEV +MQ+EARAEA L++AA EA+A ++ E+LADM A+Q
Sbjct 69 APHAVQQRMAKMLARAVDEVAEMQAEARAEADALIAAAEAEAQAAERKRDEVLADMAAQQ 128
Query 148 RALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQF 207
ALE+E+ RE E+ LDAE+ +R +AQ
Sbjct 129 EALEAEY--------------------------------RETKEK-LDAELADMRAEAQA 155
Query 208 QIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQL 267
ID+ +A ++ECE+ LG+AK AD H ARR +EE S +R+ ILEQL
Sbjct 156 AIDEARRAAERECEQLLGDAKQGAD-----------HYRDTARRAVEEASAQRIKILEQL 204
Query 268 ARIHAQLENIPALLESAR-----HSETEPLQSINGAV 299
++ +LE++PA L++A ++ L +++G V
Sbjct 205 MSVYRELEDVPAALQAAYDEQQGSADDGVLAAVDGKV 241
>gi|41408965|ref|NP_961801.1| hypothetical protein MAP2867c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41397324|gb|AAS05184.1| hypothetical protein MAP_2867c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=246
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 103/257 (41%), Positives = 145/257 (57%), Gaps = 51/257 (19%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEI-------ASLKDENAALKDTSP 87
F R GYD AV+ +I L +QQ ++E+L+A + A+L+ E AAL DTSP
Sbjct 11 FSRKFMGYDTTAVDAHIEMLTAKQQLLIDDVESLRARLQESNEQTAALRKEVAALTDTSP 70
Query 88 SAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQ 147
+ AV RMAKML AVDEV +MQ+EARAEA L++AA EA+A ++ E+LADM A+Q
Sbjct 71 APHAVQQRMAKMLARAVDEVAEMQAEARAEADALIAAAEAEAQAAERKRDEVLADMAAQQ 130
Query 148 RALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQF 207
ALE+E+ RE E+ LDAE+ +R +AQ
Sbjct 131 EALEAEY--------------------------------RETKEK-LDAELADMRAEAQA 157
Query 208 QIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQL 267
ID+ +A ++ECE+ LG+AK AD H ARR +EE S +R+ ILEQL
Sbjct 158 AIDEARRAAERECEQLLGDAKQGAD-----------HYRDTARRAVEEASAQRIKILEQL 206
Query 268 ARIHAQLENIPALLESA 284
++ +LE++PA L++A
Sbjct 207 MSVYRELEDVPAALQAA 223
>gi|336458979|gb|EGO37933.1| DivIVA protein [Mycobacterium avium subsp. paratuberculosis S397]
Length=244
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 103/257 (41%), Positives = 145/257 (57%), Gaps = 51/257 (19%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEI-------ASLKDENAALKDTSP 87
F R GYD AV+ +I L +QQ ++E+L+A + A+L+ E AAL DTSP
Sbjct 9 FSRKFMGYDTTAVDAHIEMLTAKQQLLIDDVESLRARLQESNEQTAALRKEVAALTDTSP 68
Query 88 SAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQ 147
+ AV RMAKML AVDEV +MQ+EARAEA L++AA EA+A ++ E+LADM A+Q
Sbjct 69 APHAVQQRMAKMLARAVDEVAEMQAEARAEADALIAAAEAEAQAAERKRDEVLADMAAQQ 128
Query 148 RALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQF 207
ALE+E+ RE E+ LDAE+ +R +AQ
Sbjct 129 EALEAEY--------------------------------RETKEK-LDAELADMRAEAQA 155
Query 208 QIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQL 267
ID+ +A ++ECE+ LG+AK AD H ARR +EE S +R+ ILEQL
Sbjct 156 AIDEARRAAERECEQLLGDAKQGAD-----------HYRDTARRAVEEASVQRIKILEQL 204
Query 268 ARIHAQLENIPALLESA 284
++ +LE++PA L++A
Sbjct 205 MSVYRELEDVPAALQAA 221
>gi|336458643|gb|EGO37605.1| DivIVA protein [Mycobacterium avium subsp. paratuberculosis S397]
Length=257
Score = 123 bits (309), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 98/272 (37%), Positives = 146/272 (54%), Gaps = 60/272 (22%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKA--------------EIASLKDENA 80
F R GYDP AV+ Y+ L+ +Q+ E++ L+A E+A LKDE A
Sbjct 9 FSRVFRGYDPAAVDAYVEALLAKQKLLIDEVQNLRAQRNESGDQLAALRIEVACLKDEVA 68
Query 81 ALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREML 140
L DTSP+ A+ RM KMLR ++E+ +MQ+EA+AEA ++++AA + EA + +E+L
Sbjct 69 HLSDTSPTPYAIQHRMTKMLRRTLEEISRMQAEAQAEAESMIAAAAAQVEASHREHQELL 128
Query 141 ADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIR 200
ADM A++ ALE+E RE A++EL+AE+
Sbjct 129 ADMAAQREALETE---------------------------------REDAKKELEAELAT 155
Query 201 LRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSE-ARRTLEEISQR 259
+R +AQ ID+ Q ++EC + L EAK +AD L E ARRT+EE +Q+
Sbjct 156 MRAEAQAAIDEARQEAERECARLLAEAKQKAD------------DLDERARRTVEEANQQ 203
Query 260 RVGILEQLARIHAQLENIPALLESARHSETEP 291
R+ ILE+L + L+ +PA L+SA T P
Sbjct 204 RIMILEELMDVARDLQGLPASLQSAYQERTSP 235
>gi|118465991|ref|YP_880545.1| hypothetical protein MAV_1300 [Mycobacterium avium 104]
gi|118167278|gb|ABK68175.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=257
Score = 106 bits (265), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 97/272 (36%), Positives = 147/272 (55%), Gaps = 60/272 (22%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKA--------------EIASLKDENA 80
F R GYDP AV+ Y+ L+ +Q+ E++ L+A E+A LKDE A
Sbjct 9 FSRVFRGYDPAAVDAYVEALLAKQKLLIDEVQNLRAQRNESGDQLAALRIEVACLKDEVA 68
Query 81 ALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREML 140
L DTSP+ A+ RM KMLR ++E+ +MQ+EA+AEA ++++AA + EA + +E+L
Sbjct 69 HLSDTSPTPYAIQHRMTKMLRRTLEEISRMQAEAQAEAESMIAAAAAQVEASHREHQELL 128
Query 141 ADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIR 200
ADM A++ ALE+E RE+A++EL+AE+
Sbjct 129 ADMAAQREALETE---------------------------------REEAKKELEAELAT 155
Query 201 LRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSE-ARRTLEEISQR 259
+R +A+ ID+ + ++EC + L EAK +AD L E ARRT+EE +Q+
Sbjct 156 MRAEARASIDEARREAERECARLLAEAKQKAD------------DLDERARRTVEEANQQ 203
Query 260 RVGILEQLARIHAQLENIPALLESARHSETEP 291
R+ ILE+L + L+ +PA LESA T P
Sbjct 204 RIMILEELMDVARDLQGLPASLESAYQERTSP 235
>gi|111018461|ref|YP_701433.1| myosin [Rhodococcus jostii RHA1]
gi|110817991|gb|ABG93275.1| probable myosin [Rhodococcus jostii RHA1]
Length=361
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/275 (31%), Positives = 145/275 (53%), Gaps = 16/275 (5%)
Query 30 GSAP-RFQRTRSGYDPVAVNHYI----AELVLRQQAQHC----------EIETLKAEIAS 74
GS P F R G+D V +Y AEL + + +++ + EI
Sbjct 8 GSIPLPFSTVRKGFDRDEVRNYFERFDAELRVTAADRDAAAAQARDLAKQLDDARDEIDD 67
Query 75 LKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRT 134
L+ + L +A+ ++DR+++MLRLA DE +++++A A+AA ++S A E +R
Sbjct 68 LRKDVDRLSVPPTTAEGMSDRISRMLRLASDEASEVRAKAEADAAEMISVAEQEGAQLRG 127
Query 135 QKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQEL 194
Q ++A++ R+ ALE +H M +AR EA ++V A +E +R+ RR + +++
Sbjct 128 QYESLVAELEDRRSALEIQHEQTMAKARTEAAKIVEAAQSERDRLAAEAELRRSQVQEDF 187
Query 195 DAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLE 254
D + R +++ ++ E ++RL +A EA+RRL A +Q E ++ A+ E
Sbjct 188 DLAMSERRQKTITAVNELESTSKAEAQRRLADATHEAERRLQAATDQSERKIAHAKELAE 247
Query 255 EISQRRVGILEQLARIHAQLENIPALLESA-RHSE 288
E+ R IL QL I QL+++PA+L S R SE
Sbjct 248 ELRVLRGRILAQLLGIRGQLDSVPAMLASVNRESE 282
>gi|226360583|ref|YP_002778361.1| hypothetical protein ROP_11690 [Rhodococcus opacus B4]
gi|226239068|dbj|BAH49416.1| hypothetical protein [Rhodococcus opacus B4]
Length=362
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 87/276 (32%), Positives = 146/276 (53%), Gaps = 18/276 (6%)
Query 30 GSAP-RFQRTRSGYDPVAVNHYI----AELVLRQQAQHC----------EIETLKAEIAS 74
GS P F R G+D V +Y AEL + + +++ + EI
Sbjct 8 GSIPLPFSTVRKGFDRDEVRNYFERFDAELRVTAADRDAAAAQARDLAKQLDDARDEIDD 67
Query 75 LKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRT 134
L+ + L +A+ ++DR+++MLRLA DE +++++A A+AA +S A E +R
Sbjct 68 LRKDVDRLSVPPTTAEGMSDRISRMLRLASDEASEVRAKAEADAAETISVAEQEGAQLRG 127
Query 135 QKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQEL 194
Q ++A++ R+ ALE +H M +AR EA ++V A +E +R+ RR + +++
Sbjct 128 QYESLVAELEDRRSALEIQHEQTMAKARTEAAKIVEAAQSERDRLAAEAELRRSQVQEDF 187
Query 195 DAEIIRLRTDAQFQIDDQLQATQQ-ECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTL 253
D + R + ++L+AT + E +RL +A EA+RRL A +Q E ++ A+
Sbjct 188 DLSMSERRQKTITAV-NELEATSKAEAHRRLADATHEAERRLQAATDQSERKIAHAKELA 246
Query 254 EEISQRRVGILEQLARIHAQLENIPALLESA-RHSE 288
EE+ R IL QL I QL+++PA+L S R SE
Sbjct 247 EELRVLRGRILAQLLGIRGQLDSVPAMLASVNRESE 282
>gi|296171674|ref|ZP_06852888.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295894035|gb|EFG73798.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=233
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 87/260 (34%), Positives = 134/260 (52%), Gaps = 62/260 (23%)
Query 32 APRFQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEI-------ASLKDENAALKD 84
A +F R GYDP AV+ +I L +QQ ++E L++ + A+L+ E AAL D
Sbjct 6 AKQFTRMFRGYDPAAVDAHIEMLTTKQQLLLDDVENLRSRLKEAGDQTAALRKEVAALTD 65
Query 85 TSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMN 144
TSPS A+ RMAKML+ AV+EV +MQ+EAR+EA L++AA E EA R + E LA+M
Sbjct 66 TSPSPHAMQRRMAKMLQRAVEEVSEMQAEARSEADALIAAAEAEVEAARRKHEEELAEMA 125
Query 145 ARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTD 204
+++A+E+++ E+A+++L+AE+ +R D
Sbjct 126 EQRKAMETDY---------------------------------EEAKKQLEAELAGMRAD 152
Query 205 AQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGIL 264
+ ID + Q+E +H +ARR +E Q+R+ +L
Sbjct 153 TEAAIDKAWRDAQREA----------------------DHYREQARRAADEAIQQRINVL 190
Query 265 EQLARIHAQLENIPALLESA 284
EQL ++ LE +PA LESA
Sbjct 191 EQLMGVYRDLEAVPAALESA 210
>gi|342858376|ref|ZP_08715031.1| hypothetical protein MCOL_05861 [Mycobacterium colombiense CECT
3035]
gi|342134080|gb|EGT87260.1| hypothetical protein MCOL_05861 [Mycobacterium colombiense CECT
3035]
Length=236
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 90/257 (36%), Positives = 133/257 (52%), Gaps = 62/257 (24%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEI-------ASLKDENAALKDTSP 87
F R GYD AV+ +I L +QQ ++++L+A + A+L+ E A L DTSP
Sbjct 9 FSRKFMGYDAAAVDAHIEMLTTKQQLLLDDVDSLRARLQESGDQTAALRKEVALLTDTSP 68
Query 88 SAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQ 147
S AV RMAKML AVDEV +M +EARAEA +++AA EAE + ++ +MLADM A++
Sbjct 69 SPHAVQQRMAKMLSRAVDEVAEMHAEARAEADAVIAAAEAEAEETQRKREQMLADMAAQR 128
Query 148 RALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQF 207
A+E+E+ + R L+AE+ LR DA
Sbjct 129 EAMEAEYQETKER---------------------------------LEAELASLRDDA-- 153
Query 208 QIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQL 267
+Q E+ L EA+ +ADR EARR ++E S++R+ ILEQL
Sbjct 154 ---------EQARERLLAEAREKADR-----------DRDEARRAVDEASRQRIRILEQL 193
Query 268 ARIHAQLENIPALLESA 284
++ L+++P L+SA
Sbjct 194 MGVYRDLKSVPDALDSA 210
>gi|226307383|ref|YP_002767343.1| hypothetical protein RER_38960 [Rhodococcus erythropolis PR4]
gi|226186500|dbj|BAH34604.1| hypothetical protein RER_38960 [Rhodococcus erythropolis PR4]
Length=347
Score = 94.0 bits (232), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 67/202 (34%), Positives = 116/202 (58%), Gaps = 1/202 (0%)
Query 88 SAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQ 147
+A+ ++DR+++MLRLA DE +++++A A+AA + S A + +R ++A++ R+
Sbjct 81 TAEGMSDRISRMLRLASDEASEVRAKAEADAAEMTSIAEQDGARLRGHYESLIAELEDRR 140
Query 148 RALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQF 207
A+E+ H M +AR EA ++V A AE +R+ +R + ++ D + R
Sbjct 141 NAMETMHEQTMAQARTEAARIVEAAQAERDRLAAESEAKRTQIREDFDISMSERRAKVVA 200
Query 208 QIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQL 267
++D +++ E E+RL +A EA RRL A +Q E ++ A+ EE+ R IL QL
Sbjct 201 AVNDLESSSKAEAERRLSDASHEAQRRLQAATDQSERKIAHAKELTEELRVLRGRILAQL 260
Query 268 ARIHAQLENIPALLESA-RHSE 288
I QL+++PA+L S R SE
Sbjct 261 LGIRGQLDSVPAMLASVNRESE 282
>gi|229493826|ref|ZP_04387603.1| M protein [Rhodococcus erythropolis SK121]
gi|229319263|gb|EEN85107.1| M protein [Rhodococcus erythropolis SK121]
Length=352
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 66/202 (33%), Positives = 116/202 (58%), Gaps = 1/202 (0%)
Query 88 SAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQ 147
+A+ ++DR+++MLRLA DE +++++A A+AA + S A + +R ++A++ R+
Sbjct 81 TAEGMSDRISRMLRLASDEASEVRAKAEADAAEMTSIAEQDGARLRGHYESLIAELEDRR 140
Query 148 RALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQF 207
A+E+ H M +AR EA ++V A AE +R+ +R + ++ D + R
Sbjct 141 NAMETMHEQTMAQARTEAARIVEAAQAERDRLAAESEAKRTQIREDFDISMSERRAKVVA 200
Query 208 QIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQL 267
+++ +++ E E+RL +A EA RRL A +Q E ++ A+ EE+ R IL QL
Sbjct 201 AVNELESSSKAEAERRLADASHEAQRRLQAATDQSERKIAHAKELTEELRVLRGRILAQL 260
Query 268 ARIHAQLENIPALLESA-RHSE 288
I QL+++PA+L S R SE
Sbjct 261 LGIRGQLDSVPAMLASVNRESE 282
>gi|54023040|ref|YP_117282.1| hypothetical protein nfa10730 [Nocardia farcinica IFM 10152]
gi|54014548|dbj|BAD55918.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=316
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 71/232 (31%), Positives = 132/232 (57%), Gaps = 1/232 (0%)
Query 64 EIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVS 123
++E + EI L+ E L +A+ ++DR+++MLRLA DE ++++ A+AEAA +VS
Sbjct 41 QLEDARDEIDELRKEVDRLSVPPTTAEGMSDRISRMLRLASDEASEVRALAQAEAAEMVS 100
Query 124 AARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVID 183
A +A +R + +LA+ ++ ALE E+ + AR E+ +++ A AE ER+
Sbjct 101 IAEQQAAEMRGKYESLLAETKEKREALEIEYEQTLANARTESAKIIEAAQAEAERILKEA 160
Query 184 ARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIE 243
+R+ A+Q+ + + RT Q+++ ++ E +R+ +A EA+R + A + E
Sbjct 161 EAKRKAAQQDFEVTMSERRTKLTRQMEELEATSRAEAAQRIKDATDEANRLITSATQTSE 220
Query 244 HGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLESARHSETEPLQSI 295
++ A+ EE+ R +L QL I QL+++PA+L +A + E+E L +
Sbjct 221 RKIAHAKELAEEMRVLRGRVLAQLLGIRGQLDSVPAML-AAVNRESELLDGV 271
>gi|333920904|ref|YP_004494485.1| hypothetical protein AS9A_3240 [Amycolicicoccus subflavus DQS3-9A1]
gi|333483125|gb|AEF41685.1| hypothetical protein AS9A_3240 [Amycolicicoccus subflavus DQS3-9A1]
Length=342
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/218 (30%), Positives = 126/218 (58%), Gaps = 0/218 (0%)
Query 64 EIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVS 123
+++ +A+I +L++E L T SA V+DR+A+MLRLA DE + +++A+AEAA +VS
Sbjct 77 QLDEARADIQALREEVDQLVTTPTSAADVSDRIARMLRLAADEAAETRADAQAEAAEIVS 136
Query 124 AARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVID 183
A A + T++ + R+ LE+E + + RAR EAE++V A A+ ++
Sbjct 137 VAEQNANELETRQNREYEEWTQRRARLEAEQKETLARARTEAERIVEGAEAKRRQIEAET 196
Query 184 ARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIE 243
A ++ + + + + + RT+A +I++ +++ E+ + EA+ +AD R+ A +Q
Sbjct 197 AAQQRQEQDDFNTTMAEKRTNALREIEELRDRARKDAERIVKEAQADADNRVRTASQQAN 256
Query 244 HGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALL 281
+S A + E++ R +L Q+ ++ Q ++ P+LL
Sbjct 257 DRISYASKLAEDLGAARDHLLSQIIKVREQFDSFPSLL 294
>gi|312139186|ref|YP_004006522.1| hypothetical protein REQ_17680 [Rhodococcus equi 103S]
gi|325673509|ref|ZP_08153200.1| myosin [Rhodococcus equi ATCC 33707]
gi|311888525|emb|CBH47837.1| conserved hypothetical protein [Rhodococcus equi 103S]
gi|325555530|gb|EGD25201.1| myosin [Rhodococcus equi ATCC 33707]
Length=312
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 88/285 (31%), Positives = 144/285 (51%), Gaps = 16/285 (5%)
Query 27 AHNGSAPRFQRTRSGYDPVAVNHYI----AELVLRQQAQHC----------EIETLKAEI 72
A++ P F R GYD V Y EL L + ++E + EI
Sbjct 2 AYDRGMP-FTIVRKGYDKDEVRRYFDRFDTELRLMATDRDAAAAQARELANQLEDARDEI 60
Query 73 ASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAV 132
L+ + L +A+ +++R+++MLRLA DE +++++A AEAA ++S A E+E +
Sbjct 61 DELRRDVDRLSVPPTTAEGMSERISRMLRLASDEASEVRAKAEAEAAEMLSVAEQESERL 120
Query 133 RTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQ 192
R ++ R+ ALE EH M+ AR EA ++V A AE +R+ +R + +Q
Sbjct 121 RKLYETKTEELEKRREALEREHEQTMQAARTEATRIVQAAQAESDRLAAESEAKRVQVQQ 180
Query 193 ELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRT 252
+ + + R I++ ++ E +RL EA EA RR+ VA EQ E ++ A+
Sbjct 181 DFEITMAERRAKTIAAIEELETTSKAEAHRRLTEASEEAARRVQVATEQSERKIAHAKEL 240
Query 253 LEEISQRRVGILEQLARIHAQLENIPALLESARHSETEPLQSING 297
EE+ R IL QL I QL+++PA+L S + E E L ++
Sbjct 241 AEELRVLRGRILAQLLGIRGQLDSVPAMLASV-NREAELLDNVGA 284
>gi|296138990|ref|YP_003646233.1| myosin [Tsukamurella paurometabola DSM 20162]
gi|296027124|gb|ADG77894.1| myosin [Tsukamurella paurometabola DSM 20162]
Length=348
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 71/222 (32%), Positives = 120/222 (55%), Gaps = 11/222 (4%)
Query 71 EIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAE 130
EI +L+ E L +A+ ++DR+A+M+RLA DE ++++ A EAA + S AR EAE
Sbjct 66 EIDNLRREVDRLSVPPTTAEGMSDRIARMMRLASDEASEIRAAAANEAAEVRSLARQEAE 125
Query 131 AVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDARRREKA 190
A R + + +DM R++A+E EH + M+RA +EA +++A+A AE +++ RE+
Sbjct 126 ATRREADRIRSDMTNRRQAMEDEHKETMQRAADEAARIIAEAKAEAAKLQRTAEENRERV 185
Query 191 EQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSE-- 248
+ + D + R A I + A++ E +R+ A A++ L+ A+ + L E
Sbjct 186 QNDFDLAMSMRRDKAIRTITELEDASKNEARERVDSANERAEQTLNTANATAQQKLDETN 245
Query 249 ---------ARRTLEEISQRRVGILEQLARIHAQLENIPALL 281
ARR E++ R IL+QL + QL +P L
Sbjct 246 TRISSQVANARRITEDMQALRSSILDQLETVRRQLSLVPEAL 287
>gi|300783497|ref|YP_003763788.1| chromosome segregation-related protein [Amycolatopsis mediterranei
U32]
gi|299793011|gb|ADJ43386.1| chromosome segregation-related protein [Amycolatopsis mediterranei
U32]
gi|340524883|gb|AEK40088.1| chromosome segregation-related protein [Amycolatopsis mediterranei
S699]
Length=402
Score = 67.0 bits (162), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 55/179 (31%), Positives = 104/179 (59%), Gaps = 1/179 (0%)
Query 64 EIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVS 123
++E + EIA L+ + L S + +++R+ +MLRLA DE ++ A AEA + +
Sbjct 50 QLEIARGEIADLRGQVDRLAQPPTSVEGLSERLQRMLRLAQDESADTRARAEAEAGHIRA 109
Query 124 AARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVID 183
A +A A+R + ++L +++ R++ +E+EH V+ AR EA+++ +A AE +R+
Sbjct 110 KAETDASAMRARYEQLLTELDLRRKEMEAEHRKVLEDARAEAKKITDEAEAERKRLDTES 169
Query 184 ARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIE-ADRRLHVADEQ 241
+ RR K E++ + + RT+A + +Q A++ E +R+ EA + AD R V +E+
Sbjct 170 SDRRTKVEEDFEIAMASRRTEAMRVLAEQEAASKAEAARRVQEATQDAADIRAKVLEEE 228
>gi|257056833|ref|YP_003134665.1| hypothetical protein Svir_28570 [Saccharomonospora viridis DSM
43017]
gi|256586705|gb|ACU97838.1| hypothetical protein Svir_28570 [Saccharomonospora viridis DSM
43017]
Length=395
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/179 (31%), Positives = 106/179 (60%), Gaps = 1/179 (0%)
Query 64 EIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVS 123
++E + EI L+ + L + + +++R+ +MLRLA +E ++ A AEA+ + +
Sbjct 59 QLEAARIEIDDLRGQVERLAQPPTTIEGLSERLQRMLRLAQEEASDTKARAEAEASHIRA 118
Query 124 AARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVID 183
A +A A+R + +ML ++ R++ +E+EHA V+ AR EA++++ +A E +R+
Sbjct 119 KAESDASAMRARYEQMLTELAERRKQMEAEHAKVLEDARAEAQRILTEAAEERDRLDREA 178
Query 184 ARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADR-RLHVADEQ 241
+RR + E++ + + + RT+A + +Q A++ E E+R+ EA EA R VADE+
Sbjct 179 EQRRTQIEEDFEIAMSQRRTEAMRVLAEQEAASKAEAERRVREASEEAAAIRAQVADEK 237
>gi|256380099|ref|YP_003103759.1| hypothetical protein Amir_6106 [Actinosynnema mirum DSM 43827]
gi|255924402|gb|ACU39913.1| hypothetical protein Amir_6106 [Actinosynnema mirum DSM 43827]
Length=435
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/170 (29%), Positives = 98/170 (58%), Gaps = 0/170 (0%)
Query 64 EIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVS 123
++E ++E+A L+++ L + + +++R+ +MLRLA DE ++++ A AE + +
Sbjct 59 QLEAARSEMADLRNQIDRLSMPPTTLEGLSERLQRMLRLAQDEANEIKARAEAEGGHIRA 118
Query 124 AARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVID 183
A +A A+R++ ++ +++ R+ +E+EH V+ AR EAE++VA A + R+
Sbjct 119 KAESDAGALRSRYDALIGELDQRRADMENEHRGVLEGARAEAEKIVADAREQAGRIDQEA 178
Query 184 ARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADR 233
+RR E++ + + R ++ + +Q ++ E E+RL EA EA R
Sbjct 179 QQRRTTVEEDFEIAMAARRVESMKTLAEQEAGSKNEAERRLREATDEAAR 228
>gi|302524849|ref|ZP_07277191.1| predicted protein [Streptomyces sp. AA4]
gi|302433744|gb|EFL05560.1| predicted protein [Streptomyces sp. AA4]
Length=429
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 46/161 (29%), Positives = 93/161 (58%), Gaps = 0/161 (0%)
Query 64 EIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVS 123
++E + EIA L+ + L S + +++R+ +MLRLA DE ++ A AEA + +
Sbjct 59 QLEVARGEIADLRGQVDRLAQPPTSVEGLSERLQRMLRLAQDEAADTRARAEAEAGHIRA 118
Query 124 AARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVID 183
A +A A+R + ++L +++ R++ +E+EH V+ AR EA+++ +A AE +++
Sbjct 119 KAETDASAMRARYEQLLTELDLRRKEMEAEHRKVLEDARAEAKRITDEAEAERKKLDAES 178
Query 184 ARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRL 224
RR + E++ + + R +A + +Q A++ E E+R+
Sbjct 179 EARRTQVEEDFEIAMATRRAEAMRVLAEQEAASKAEAERRV 219
>gi|134102678|ref|YP_001108339.1| M protein [Saccharopolyspora erythraea NRRL 2338]
gi|291004636|ref|ZP_06562609.1| M protein [Saccharopolyspora erythraea NRRL 2338]
gi|133915301|emb|CAM05414.1| M protein [Saccharopolyspora erythraea NRRL 2338]
Length=589
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 57/196 (30%), Positives = 111/196 (57%), Gaps = 11/196 (5%)
Query 64 EIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLVS 123
++E+ ++EI +L+ + L + + +++R+ +MLRLA DE ++ ARAE +
Sbjct 60 QLESARSEIENLRGQVERLSKPPTTLEGLSERLQRMLRLAQDEANDIR--ARAEKDAAET 117
Query 124 AARDEAEA--VRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRV 181
AR EAEA +RT+ +++A+++ R+ +E+EH +M +A+ EAE++ +A +R
Sbjct 118 RARSEAEASNLRTRYEKLIAEVDQRRGEMETEHRALMDKAKAEAERITREAEENRKRADE 177
Query 182 IDARRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQ 241
RR + E++ + + RT++ + +Q ++ + E+R+ EA EA+RRL
Sbjct 178 ESEARRVQVEEDFEIAMAARRTESMRTLAEQEATSKADAERRVREATEEANRRLR----- 232
Query 242 IEHGLSEARRTLEEIS 257
G +EA R + E +
Sbjct 233 --EGTNEAHRRVREAT 246
Score = 40.0 bits (92), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 53/175 (31%), Positives = 87/175 (50%), Gaps = 6/175 (3%)
Query 93 TDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRALES 152
T+ + +R A DE ++ S+A A+A L R+ EA ++RE R++ +
Sbjct 235 TNEAHRRVREATDEANRLVSDATAKAEKLT---RESTEAAEKRERESTDAATRREQKSQQ 291
Query 153 EHADVMRRAREEAEQLVAQATAEVERMRVIDARRREK-AEQELDAEIIRLRTDAQFQIDD 211
++R A +++E +V + T E R+ + DARR K E E RL +A +
Sbjct 292 VAERMVREATDKSELMVKECTEEATRL-IDDARRESKRLVDEATQEHKRLLDEADRESRR 350
Query 212 QLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQ 266
+ +++ E+RLG E RRL ADEQ+ H L+E R T+ + G+L Q
Sbjct 351 LVDEARKDAERRLGATTEEVQRRLTKADEQV-HSLTELRDTVAAKLRDAAGLLGQ 404
>gi|256390452|ref|YP_003112016.1| cellulose-binding protein [Catenulispora acidiphila DSM 44928]
gi|256356678|gb|ACU70175.1| putative cellulose-binding protein [Catenulispora acidiphila
DSM 44928]
Length=317
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/292 (28%), Positives = 123/292 (43%), Gaps = 64/292 (21%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASLKDEN----AALKDTSPSAQ 90
F++ R GYDPV VN I +L+ + A H +++ I L EN A L D+ PS
Sbjct 10 FEQVRRGYDPVQVNDRITKLIAERDAAHSRSTSMEKRIEELHLENQTVQAQLADSEPSYA 69
Query 91 AVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQ--- 147
+ R+ K+LRLA +E ++ EAR A A A VRT + D A+
Sbjct 70 GLGARVEKILRLAEEEAKDLREEARRAADAHRELADQAAMKVRTDAEQYAKDRKAKADEE 129
Query 148 --RALESEHADVMR----------RAREEAEQLVAQATAEVERMRVID-----ARRREKA 190
R +E D + REEAE L + T D A+RRE++
Sbjct 130 AVRIVEKAKDDANQVRATAAKDAATKREEAEALF-EETRAKAAAAATDFETKLAKRREQS 188
Query 191 EQELDA---------------------EIIRLRTDAQFQIDDQLQATQQECEKRLGEAKI 229
E++L A E ++RTDA+ + ++ Q++ + + +A+
Sbjct 189 ERDLAARQAKAEKRLAEIENRAEQLRLEAEKMRTDAERRARSTIETAQRQADDIIADARS 248
Query 230 EADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALL 281
+ DR SEA R L ++ RR I+AQL N+ +L
Sbjct 249 KTDR-----------IRSEAERELAALTHRRDS-------INAQLHNVREML 282
>gi|302530141|ref|ZP_07282483.1| predicted protein [Streptomyces sp. AA4]
gi|302439036|gb|EFL10852.1| predicted protein [Streptomyces sp. AA4]
Length=383
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 70/241 (30%), Positives = 117/241 (49%), Gaps = 41/241 (17%)
Query 41 GYDPVAVNHYIAEL-------VLRQQAQHCEIETLKAEIASLKDENAALKDTSPSAQA-V 92
G+D V Y+ L + + A ++ TL E+ + + E A L QA V
Sbjct 24 GFDRTEVRQYLDHLEAQLHRLITDRDAAATQVATLSRELEAARGETAKL-------QARV 76
Query 93 TDRMAKMLRLA-VDEVFQ---MQSEARAEAATL---VSAARDEAEA------VRTQKREM 139
+ M RL +DE Q M ++ARAE T V+A + A + +R + R +
Sbjct 77 EELMKPPERLEDLDERMQRTVMLAQARAEEITKRAEVAAEKHWASSTEASTKLRERYRRL 136
Query 140 LADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVIDA---RRREKAEQELDA 196
+ +++ + AL SEH ++ R E ++L +A +R ++D R+R K E++ DA
Sbjct 137 VEELDKQAEALHSEHEAALKETRAEVQRLTVEA---AQRRELLDNEAERKRRKIERDFDA 193
Query 197 EIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEI 256
+I RT + I DQ A++ + E+R+ EA EA RR +E +EA+R L+E
Sbjct 194 KITAERTAHEKMIADQRTASKNQAERRIAEATAEATRR-------VEEATAEAKRRLDEA 246
Query 257 S 257
+
Sbjct 247 T 247
>gi|134101871|ref|YP_001107532.1| cellulose-binding protein [Saccharopolyspora erythraea NRRL 2338]
gi|291008599|ref|ZP_06566572.1| cellulose-binding protein [Saccharopolyspora erythraea NRRL 2338]
gi|133914494|emb|CAM04607.1| cellulose-binding protein [Saccharopolyspora erythraea NRRL 2338]
Length=329
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 55/186 (30%), Positives = 102/186 (55%), Gaps = 4/186 (2%)
Query 63 CEIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLAVDEVFQMQSEARAEAATLV 122
++E +AEI SL+ + + T + ++DRM +M+RLA DE ++ + A+A A
Sbjct 63 AQLEQSRAEIESLRAQLDRVSRTPIDEEGLSDRMRRMVRLANDEAAEVVASAQAAAEHEW 122
Query 123 SAARDEAEAVRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAEVERMRVI 182
+ A A +RT+ ++ + + +R E + + +RR RE+ +++ +A + R
Sbjct 123 ARAEQSAAELRTRYENLVNEADQWRRQAEEQRNESLRRTREDIQRMAREAE-QYRRKLDT 181
Query 183 DARRREKAEQELDAEI-IRLRTDAQFQID-DQLQATQQECEKRLGEAKIEADRRLHVADE 240
DA R + + E D EI + R + Q ++ ++ A++QE ++R+ EA EA+RR+ ADE
Sbjct 182 DAEER-RTQIERDFEISMAARREEQMRVQAEREHASRQEAQRRVREATAEAERRVRRADE 240
Query 241 QIEHGL 246
E L
Sbjct 241 YAETML 246
>gi|72162795|ref|YP_290452.1| cellulose-binding protein [Thermobifida fusca YX]
gi|71916527|gb|AAZ56429.1| putative cellulose-binding protein [Thermobifida fusca YX]
Length=427
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 66/225 (30%), Positives = 109/225 (49%), Gaps = 13/225 (5%)
Query 41 GYDPVAVNHYIAELVLRQQAQHC--EIETLKAEIASLKDENAALKDTS-PSAQAVTDRMA 97
GYD VN Y+A+L RQ+ + E E L+ E+A K + L++ P+ + R+
Sbjct 28 GYDRTQVNEYLAQL--RQEGRQAREEAEKLRKELAEAKRQ---LQEQERPTYSGLGARIE 82
Query 98 KMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRALESEHADV 157
++LRLA ++ ++ +ARA+A + AA+ EA +R ++ R AD
Sbjct 83 QLLRLAEEQATELVQDARAKANDIKVAAKVEAAEMRAAAENEATEL----RTTAQREADE 138
Query 158 MRRARE-EAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQAT 216
+R++ E EAE++ A E + + R +K +D EI R + +I T
Sbjct 139 LRQSAEREAEEIRTSAQREADELTAATEREVQKKRAAVDHEIAEKRATFEGEIAKLRTTT 198
Query 217 QQECEKRLGEAKIEADRRLHVADEQIEHGLSEARRTLEEISQRRV 261
++EC + AK E D + A Q + S+A+R LEE RR
Sbjct 199 ERECAQARAAAKRERDEMIQAAKRQADEMRSQAQRALEESEARRA 243
>gi|297559335|ref|YP_003678309.1| cellulose-binding protein [Nocardiopsis dassonvillei subsp. dassonvillei
DSM 43111]
gi|296843783|gb|ADH65803.1| putative cellulose-binding protein [Nocardiopsis dassonvillei
subsp. dassonvillei DSM 43111]
Length=431
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 75/237 (32%), Positives = 108/237 (46%), Gaps = 46/237 (19%)
Query 41 GYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASLKDENAALKDTS-PSAQAVTDRMAKM 99
GYD V+ Y A L + ++E KAE L +N L++ PS + R+ ++
Sbjct 29 GYDRTQVDDYAASLRNELKQYSKQVEKFKAE---LNAKNRQLQERERPSYSGLGSRIEEL 85
Query 100 LRLA---VDEVFQ--------MQSEARAEAATLVSAARDEAEAVRTQKREMLADMNA--- 145
LRLA +E+ Q ++S A+ EAA + +AA EA VR LA A
Sbjct 86 LRLAEEQANELVQSAQIDANDIRSAAKIEAADMRAAAESEATEVRA-----LAQREADET 140
Query 146 RQRALESEHADVMRRAREEAEQLVAQATAEVERMR-VIDARRREKAEQELDAEIIRLRTD 204
RQ A ESE ++ AR EA++L + EV++ R +D EK + EI +LRT
Sbjct 141 RQTA-ESEAEEISTTARREADELTSTTEREVQKKRSAVDHEIAEK-RATFEGEIAKLRT- 197
Query 205 AQFQIDDQLQATQQECEKRLGEAKIEADRRLHVADEQIE---------HGLSEARRT 252
T++EC + AK E D + A Q E + SEARRT
Sbjct 198 ----------TTERECAQARAAAKRERDETIQSAKSQAEELRKNAERAYAESEARRT 244
>gi|258653271|ref|YP_003202427.1| hypothetical protein Namu_3105 [Nakamurella multipartita DSM
44233]
gi|258556496|gb|ACV79438.1| hypothetical protein Namu_3105 [Nakamurella multipartita DSM
44233]
Length=510
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 58/205 (29%), Positives = 104/205 (51%), Gaps = 29/205 (14%)
Query 100 LRLAVDEVFQMQSEARAEAATLVSAARDEA----EAVRTQKREMLADMNARQRALESEHA 155
L A E ++ SEA AE+ ++AARDE+ A RT+ + L + AR +E A
Sbjct 172 LSSATAEAERLVSEASAESRATLTAARDESLATLTAARTESEQTL--LAAR-----TEAA 224
Query 156 DVMRRAREEAEQLVAQATAEVERMRVIDARRREKAEQELDAEIIRLRTDAQFQIDDQLQA 215
+ AR+EAE+L A A AE ER+ + RR E++ + I R +A ++ ++ +
Sbjct 225 QTLGAARDEAERLTATAKAERERLDAESSARRAAVEEDFEIAISSRRAEAHQRLTEREEL 284
Query 216 TQQECEKRLGEAKIEADRRL-----HVAD------EQIEHGLSEARRTLEEISQRRVGIL 264
+ + ++R+ +A E++R + H A+ Q ++EA + + +++ R +L
Sbjct 285 SVADAKQRIADATAESERLVAEATAHAAELVRRAAAQSHQRVAEADQAVGDLTSLREHVL 344
Query 265 EQLARIHAQLENI-------PALLE 282
+QLA + L + PALL+
Sbjct 345 DQLASLREHLSQVDKLAASGPALLD 369
>gi|290957327|ref|YP_003488509.1| cellulose-binding protein [Streptomyces scabiei 87.22]
gi|260646853|emb|CBG69950.1| putative cellulose-binding protein [Streptomyces scabiei 87.22]
Length=312
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 74/294 (26%), Positives = 123/294 (42%), Gaps = 68/294 (23%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASL----KDENAALKDTSPSAQ 90
F+ R GYD V+ I++LV + + I L+ I L ++ AA+ D PS
Sbjct 9 FELVRRGYDRAQVDERISKLVSDRDSALARITALEKRIEELHLETQNAQAAVTDAEPSYA 68
Query 91 AVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRAL 150
+ R+ K+LRLA +E ++ EAR A A A+ VR A+ A+
Sbjct 69 GLGARVEKILRLAEEEAKDLREEARRAAEQHRELAESAAQQVRNDAESFSAERKAK---A 125
Query 151 ESEHADVMRRAREEAEQLVAQATAEVERMR----------------------VIDARRRE 188
E E ++ +A+ +A QL A+A + + R A+RRE
Sbjct 126 EDEGVRIVEKAKSDASQLRAEAQKDAQSKREEADALFEETRAKAAQAAADFETNLAKRRE 185
Query 189 KAEQELDA---------------------EIIRLRTDAQFQIDDQLQATQQECEKRLGEA 227
++E++L + E +LRTDA+ + ++ Q++ E + +A
Sbjct 186 QSERDLASRQQKAEKRLAEIEHRAEQLRLEAEKLRTDAERRARQTVETAQRQAEDIVADA 245
Query 228 KIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALL 281
+ADR SE+ R L ++ RR I+AQL N+ +L
Sbjct 246 NAKADR-----------IRSESERELAALTNRRDS-------INAQLTNVREML 281
>gi|300789928|ref|YP_003770219.1| hypothetical protein AMED_8114 [Amycolatopsis mediterranei U32]
gi|299799442|gb|ADJ49817.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340531599|gb|AEK46804.1| hypothetical protein RAM_41685 [Amycolatopsis mediterranei S699]
Length=389
Score = 44.7 bits (104), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 69/257 (27%), Positives = 120/257 (47%), Gaps = 33/257 (12%)
Query 18 VSERYRVTTAHNGSAP---RFQRTRSGYDPVAVNHYI----AEL--VLRQQAQHC----- 63
V ER TA NG P + + G+D V Y+ A+L VL ++
Sbjct 2 VPERDPAATALNGLLPLRREYTQAWHGFDRNEVRQYLDHVEAQLRRVLSERDAAAAQAAA 61
Query 64 ---EIETLKAEIASLKDENAALKDTSPSAQAVTDRMAKMLRLA---VDEVFQMQSEARAE 117
E+E+++ ++A+L+ LK + + +RM + + LA DE+ + RAE
Sbjct 62 ATREVESVRQQVAALEARVEELKKPPERLEDLDERMQRTVTLAQARADEIIK-----RAE 116
Query 118 -AATLVSAARDEAEA-VRTQKREMLADMNARQRALESEHADVMRRAREEAEQLVAQATAE 175
AA A+ EA +R + ++ +++ + L +EH + R E ++L +A
Sbjct 117 VAAEKTWASSTEASTKLRERYTRLVEELDKQADLLHTEHESALAETRAEVQRLTVEA--- 173
Query 176 VERMRVIDA---RRREKAEQELDAEIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIEAD 232
+R ++D R+R K E+E +A + + + DQ A++ + E+RL EA EA
Sbjct 174 AQRRELLDNEAERKRRKLEREFEASQAAQKAALEKHVADQRTASKNQAERRLAEATAEAK 233
Query 233 RRLHVADEQIEHGLSEA 249
RRL A + + L EA
Sbjct 234 RRLDEATAEAKRRLDEA 250
>gi|329939858|ref|ZP_08289159.1| cellulose-binding protein [Streptomyces griseoaurantiacus M045]
gi|329301428|gb|EGG45323.1| cellulose-binding protein [Streptomyces griseoaurantiacus M045]
Length=311
Score = 43.1 bits (100), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 77/312 (25%), Positives = 129/312 (42%), Gaps = 62/312 (19%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASL----KDENAALKDTSPSAQ 90
F+ R GYD V+ I++LV + + I L+ I L ++ A + D PS
Sbjct 9 FELVRRGYDRAQVDERISKLVSDRDSALARITALEKRIEELHLETQNAQAQVTDAEPSYA 68
Query 91 AVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRAL 150
+ R+ K+LRLA +E ++ EAR A A A+ VR A+ R+
Sbjct 69 GLGARVEKILRLAEEEAKDLREEARRAAEQHRELAESAAQQVRNDAESFAAE---RKSKA 125
Query 151 ESEHADVMRRAREEAEQLVAQATAEVERMR----------------------VIDARRRE 188
E E ++ +A+ +A QL A+A + + R A+RRE
Sbjct 126 EDEGVRIVEKAKSDASQLRAEAQKDAQSKREEADALFEETRAKAAQAAADFETNLAKRRE 185
Query 189 KAEQELDA---------------------EIIRLRTDAQFQIDDQLQATQQECEKRLGEA 227
++E++L + E +LRTDA+ + ++ Q++ E + +A
Sbjct 186 QSERDLASRQAKAEKRLAEIEHRAEQLRLEAEKLRTDAERRARQTVETAQRQAEDIVADA 245
Query 228 KIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLESARHS 287
+ADR SE+ R L ++ RR I QL + L + +A S
Sbjct 246 NAKADR-----------IRSESERELAALTNRRDSINAQLTNVREMLATLTGAAVAAAGS 294
Query 288 ETEPLQSINGAV 299
+E +SI+ V
Sbjct 295 PSED-ESISRGV 305
>gi|302542457|ref|ZP_07294799.1| putative cellulose-binding protein [Streptomyces hygroscopicus
ATCC 53653]
gi|302460075|gb|EFL23168.1| putative cellulose-binding protein [Streptomyces himastatinicus
ATCC 53653]
Length=312
Score = 43.1 bits (100), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 73/294 (25%), Positives = 121/294 (42%), Gaps = 68/294 (23%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASL----KDENAALKDTSPSAQ 90
F+ R GYD V+ I +L+ + + I +L+ I L ++ A + D PS
Sbjct 10 FELVRRGYDRGQVDDRITKLIADRDSALSRITSLEKRIEELHLETQNAQAQINDAEPSYA 69
Query 91 AVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRAL 150
+ R+ K+LRLA +E ++ EAR A A A+ VR AD A+
Sbjct 70 GLGARVEKILRLAEEEAKDLREEARRAAEQHRELAESSAQQVRNDAEAFAADRKAK---A 126
Query 151 ESEHADVMRRAREEAEQLVAQATAEVERMR----------------------VIDARRRE 188
E E A ++ +A+ EA L +A + + R A+RRE
Sbjct 127 EDEGARIVEKAKGEATTLRQEAQKDAQSKREEADSLFEETRAKAAQAAADFETNLAKRRE 186
Query 189 KAEQELDA---------------------EIIRLRTDAQFQIDDQLQATQQECEKRLGEA 227
++E++L + E +LRTDA+ + ++ Q++ E + +A
Sbjct 187 QSERDLASRQAKAEKRLAEIEHRAEQLRLEAEKLRTDAERRARQTVETAQRQAEDIVADA 246
Query 228 KIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALL 281
+ADR SE+ R L ++ RR I+AQL N+ +L
Sbjct 247 NAKADR-----------IRSESERELAALTNRRDS-------INAQLTNVREML 282
>gi|294631434|ref|ZP_06709994.1| cellulose-binding protein [Streptomyces sp. e14]
gi|292834767|gb|EFF93116.1| cellulose-binding protein [Streptomyces sp. e14]
Length=311
Score = 43.1 bits (100), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 74/303 (25%), Positives = 124/303 (41%), Gaps = 61/303 (20%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASL----KDENAALKDTSPSAQ 90
F+ R GYD V+ I++LV + + I L+ I L ++ A + D PS
Sbjct 9 FELVRRGYDRAQVDERISKLVSDRDSALSRITALEKRIEELHLETQNAQAQISDAEPSYA 68
Query 91 AVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRAL 150
+ R+ K+LRLA +E ++ EAR A A A+ VR A+ A+
Sbjct 69 GLGARVEKILRLAEEEAKDLREEARRAAEQHRELAESAAQQVRNDAESFAAERKAK---A 125
Query 151 ESEHADVMRRAREEAEQLVAQATAEVERMR----------------------VIDARRRE 188
E E ++ +A+ +A QL ++A + + R A+RRE
Sbjct 126 EDEGVRIVEKAKADAAQLRSEAQKDAQSKREEADALFEETRAKAAQAAADFETNLAKRRE 185
Query 189 KAEQELDA---------------------EIIRLRTDAQFQIDDQLQATQQECEKRLGEA 227
++E++L + E +LRTDA+ + ++ Q++ E + +A
Sbjct 186 QSERDLASRQAKAEKRLAEIEHRAEQLRLEAEKLRTDAERRARQTVETAQRQAEDIVADA 245
Query 228 KIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLESARHS 287
+ADR SE+ R L ++ RR I QL + L + +A S
Sbjct 246 NAKADR-----------IRSESERELAALTNRRDSINAQLTNVREMLATLTGAAVAAAGS 294
Query 288 ETE 290
TE
Sbjct 295 PTE 297
>gi|21223756|ref|NP_629535.1| cellulose-binding protein [Streptomyces coelicolor A3(2)]
gi|289769043|ref|ZP_06528421.1| cellulose-binding protein [Streptomyces lividans TK24]
gi|8568818|emb|CAB94567.1| putative cellulose-binding protein [Streptomyces coelicolor A3(2)]
gi|289699242|gb|EFD66671.1| cellulose-binding protein [Streptomyces lividans TK24]
Length=310
Score = 43.1 bits (100), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 73/296 (25%), Positives = 123/296 (42%), Gaps = 68/296 (22%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASL----KDENAALKDTSPSAQ 90
F+ R GYD V+ I++LV + + I L+ I L ++ A + D PS
Sbjct 9 FELVRRGYDRAQVDERISKLVSDRDSALARITALEKRIEELHLETQNAQAQVNDAEPSYA 68
Query 91 AVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRAL 150
+ R+ K+LRLA +E ++ EAR A A A+ VR A+ A+
Sbjct 69 GLGARVEKILRLAEEEAKDLREEARRAAEQHRELAESSAQQVRNDAESYAAERKAK---A 125
Query 151 ESEHADVMRRAREEAEQLVAQATAEVERMR----------------------VIDARRRE 188
E E ++ +A+ +A QL ++A + + R A+RRE
Sbjct 126 EDEGVRIVEKAKGDASQLRSEAQKDAQSKRDEADALFEETRAKAAQAAADFETNLAKRRE 185
Query 189 KAEQELDA---------------------EIIRLRTDAQFQIDDQLQATQQECEKRLGEA 227
++E++L + E +LRTDA+ + ++ Q++ E + +A
Sbjct 186 QSERDLASRQAKAEKRLAEIEHRAEQLRLEAEKLRTDAERRARQTVETAQRQSEDIVADA 245
Query 228 KIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALLES 283
+ADR SE+ R L ++ RR I+AQL N+ +L S
Sbjct 246 NAKADR-----------IRSESERELAALTNRRDS-------INAQLTNVREMLAS 283
>gi|333990778|ref|YP_004523392.1| hypothetical protein JDM601_2138 [Mycobacterium sp. JDM601]
gi|333486746|gb|AEF36138.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=329
Score = 42.7 bits (99), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 68/253 (27%), Positives = 121/253 (48%), Gaps = 17/253 (6%)
Query 34 RFQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASLKDENAALKDTSPSAQAVT 93
RF R + + + H I + +A+ E L AE+ L+ + + S + ++
Sbjct 49 RFDREQLAAEVKRLQHDIVRMTADAEAERQE---LTAELRRLQSQFEQVSAPVDSVEGMS 105
Query 94 DRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRALESE 153
DR+++M+RLA +E + + A+ +A +L + R++ EA R + + Q + S
Sbjct 106 DRISRMMRLASEEARRTKELAQQDAESLTAELREQLEAARLDRAAAGEALAELQASANSR 165
Query 154 HADVMRRAREEAEQLVAQATAEVERMRV---IDA--RRREKAEQELDAEIIRLRTDAQFQ 208
A ++ A EAE+++ A ER R+ I+A R R + Q L + R R AQ
Sbjct 166 RAKILEVATAEAEEIL--RLAHRERARITQEIEAAERSRREVYQRLAEDDERNRRAAQEA 223
Query 209 IDDQLQATQQECEKRLGEAKIEADRRLHVADEQIEHGL----SEAR---RTLEEISQRRV 261
+D+Q++ +E E EA+ D+++ A EQ E + +EAR TL ++R
Sbjct 224 LDEQVKLAWEEDEHNRREARRRLDQKMQAAWEQTEASIARLDNEARLEASTLIATARREA 283
Query 262 GILEQLARIHAQL 274
+L AR +L
Sbjct 284 TVLSDRARTAVEL 296
>gi|297202425|ref|ZP_06919822.1| cellulose-binding protein [Streptomyces sviceus ATCC 29083]
gi|197710054|gb|EDY54088.1| cellulose-binding protein [Streptomyces sviceus ATCC 29083]
Length=311
Score = 42.7 bits (99), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 56/201 (28%), Positives = 92/201 (46%), Gaps = 14/201 (6%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASL----KDENAALKDTSPSAQ 90
F+ R GYD V+ I++LV + + I L+ I L ++ A + D PS
Sbjct 9 FELVRRGYDRAQVDERISKLVSDRDSALSRITALEKRIEELHLETQNAQAQMSDAEPSYA 68
Query 91 AVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRAL 150
+ R+ K+LRLA +E ++ EAR A A A+ VR A+ R+
Sbjct 69 GLGARVEKILRLAEEEAKDLREEARRAAEQHRELAESAAQQVRNDAESFAAE---RKSKA 125
Query 151 ESEHADVMRRAREEAEQLVAQATAEVERMR-----VIDARRREKAEQELDAEIIRLRTDA 205
E E ++ +A+ +A QL ++A + + R + + R + A+ D E +
Sbjct 126 EDEGVRIVEKAKSDASQLRSEAQKDAQSKREEADALFEETRAKAAQAAADFETNLAKRRE 185
Query 206 QFQIDDQLQATQQECEKRLGE 226
Q + D L + QQ+ EKRL E
Sbjct 186 QSERD--LASRQQKAEKRLAE 204
>gi|345009364|ref|YP_004811718.1| putative cellulose-binding protein [Streptomyces violaceusniger
Tu 4113]
gi|344035713|gb|AEM81438.1| putative cellulose-binding protein [Streptomyces violaceusniger
Tu 4113]
Length=312
Score = 42.4 bits (98), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 73/294 (25%), Positives = 123/294 (42%), Gaps = 68/294 (23%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASL----KDENAALKDTSPSAQ 90
F+ R GYD V+ I++LV + + I +L+ I L ++ A + D+ PS
Sbjct 10 FELVRRGYDRGQVDDRISKLVADRDSALARITSLEKRIEELHLETQNAQAQINDSEPSYA 69
Query 91 AVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRAL 150
+ R+ K+LRLA +E ++ EAR A A A+ VR A+ A+
Sbjct 70 GLGARVEKILRLAEEEAKDLREEARRAAEQHRELADSAAQQVRNDAESFAAERKAK---A 126
Query 151 ESEHADVMRRAREEAEQLVAQATAEVERMR----------------------VIDARRRE 188
E E A ++ +A+ EA L +A + + R A+RRE
Sbjct 127 EDEGARIVEKAKGEATTLRQEAQKDAQSKREEADSLFEETRAKAAQAAADFETNLAKRRE 186
Query 189 KAEQELDA---------------------EIIRLRTDAQFQIDDQLQATQQECEKRLGEA 227
++E++L + E +LRTDA+ + ++ Q++ E + +A
Sbjct 187 QSERDLASRQAKAEKRLAEIEHRAEQLRLEAEKLRTDAERRARQTVETAQRQAEDIVADA 246
Query 228 KIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALL 281
+ADR SE+ R L ++ RR I+AQL N+ +L
Sbjct 247 NAKADR-----------IRSESERELAALTNRRDS-------INAQLTNVREML 282
>gi|297157165|gb|ADI06877.1| putative cellulose-binding protein [Streptomyces bingchenggensis
BCW-1]
Length=312
Score = 42.4 bits (98), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 76/304 (25%), Positives = 132/304 (44%), Gaps = 57/304 (18%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASL----KDENAALKDTSPSAQ 90
F+ R GYD V+ I +LV + + I +L+ I L ++ A + D PS
Sbjct 10 FELVRRGYDRGQVDDRITKLVADRDSALARITSLEKRIEELHLETQNAQAQINDAEPSYA 69
Query 91 AVTDRMAKMLRLAVDEVFQMQSEAR--AE-----AATLVSAARDEAEAVRTQKREMLADM 143
+ R+ K+LRLA +E ++ EAR AE A + R++AE+ +++ D
Sbjct 70 GLGARVEKILRLAEEEAKDLREEARRAAEQHRELAESAAQQVRNDAESFAAERKAKAEDE 129
Query 144 NARQRALESEHADVMRR-AREEAEQLVAQATAEVERMRVID-----------ARRREKAE 191
AR A +R+ A+++A+Q +A + E R A+RRE++E
Sbjct 130 GARIVEKAKGDATTLRQDAQKDAQQKREEADSLFEETRAKAAQAAADFETNLAKRREQSE 189
Query 192 QELDA---------------------EIIRLRTDAQFQIDDQLQATQQECEKRLGEAKIE 230
++L + E +LRTDA+ + ++ Q++ E + +A +
Sbjct 190 RDLASRQAKAEKRLAEIEHRAEQLRLEAEKLRTDAERRARQTVETAQRQAEDIVADANAK 249
Query 231 ADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIP--ALLESARHSE 288
ADR SE+ R L ++ RR I QL + L + A+ + +E
Sbjct 250 ADR-----------IRSESERELAALTNRRDSINAQLTNVREMLATLTGAAVAAAGSPTE 298
Query 289 TEPL 292
EP+
Sbjct 299 DEPI 302
>gi|311898405|dbj|BAJ30813.1| putative filament-forming protein [Kitasatospora setae KM-6054]
Length=309
Score = 42.4 bits (98), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 75/294 (26%), Positives = 121/294 (42%), Gaps = 68/294 (23%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASLKDEN----AALKDTSPSAQ 90
F R GY+ V+ IA+LV + + I L+ I L E AA+ + PS
Sbjct 10 FDLVRRGYERAQVDERIAKLVADRDSALSRISALEKRIEELHLETQTAQAAVTEAEPSYA 69
Query 91 AVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRAL 150
+ R+ K+LRLA +E ++ EA A A A+ VRT+ D A+
Sbjct 70 GLGARVEKILRLAEEEAKDLRDEAHRAAEQHRELAEAAAQQVRTEAENYAKDRKAK---A 126
Query 151 ESEHADVMRRAREEAEQLVAQATAEVERMR----------------------VIDARRRE 188
E E ++ +A+ +A QL A+A + + R A+RRE
Sbjct 127 EDEGLRIVDKAKSDAAQLRAEANKDAQNKREEADALFEETRTKAAQAALEFETNLAKRRE 186
Query 189 KAEQELDA---------------------EIIRLRTDAQFQIDDQLQATQQECEKRLGEA 227
++E++L A E +LRTDA+ + ++ Q++ E + +A
Sbjct 187 QSERDLAARQAKAEKRLAEIEHRAEQLRLEAEKLRTDAERRSRQTIETAQRQAEDIVADA 246
Query 228 KIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALL 281
+ADR SE+ R L ++ RR I+AQL N+ +L
Sbjct 247 NAKADR-----------VRSESERELAALTNRRDS-------INAQLTNVREML 282
>gi|29829401|ref|NP_824035.1| cellulose-binding protein [Streptomyces avermitilis MA-4680]
gi|29606508|dbj|BAC70570.1| putative cellulose-binding protein [Streptomyces avermitilis
MA-4680]
Length=312
Score = 42.0 bits (97), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 72/294 (25%), Positives = 121/294 (42%), Gaps = 68/294 (23%)
Query 35 FQRTRSGYDPVAVNHYIAELVLRQQAQHCEIETLKAEIASL----KDENAALKDTSPSAQ 90
F+ R GYD V+ I++LV + + I L+ I L ++ A + D PS
Sbjct 9 FELVRRGYDRAQVDERISKLVSDRDSALARITALEKRIEELHLETQNAQAQVTDAEPSYA 68
Query 91 AVTDRMAKMLRLAVDEVFQMQSEARAEAATLVSAARDEAEAVRTQKREMLADMNARQRAL 150
+ R+ K+LRLA +E ++ EAR + A A+ VR AD R+
Sbjct 69 GLGARVEKILRLAEEEAKDLREEARRASEQHRELAESAAQQVRNDAESFAAD---RKSKA 125
Query 151 ESEHADVMRRAREEAEQLVAQATAEVERMR----------------------VIDARRRE 188
E E ++ +A+ +A QL +A + + R A+RRE
Sbjct 126 EDEGVRIVEKAKSDASQLRQEAQKDAQSKREEADALFEETRAKAAQAAADFETNLAKRRE 185
Query 189 KAEQELDA---------------------EIIRLRTDAQFQIDDQLQATQQECEKRLGEA 227
++E++L + E +LRTDA+ + ++ Q++ E + +A
Sbjct 186 QSERDLASRQAKAEKRLAEIEHRAEQLRLEAEKLRTDAERRARQTVETAQRQAEDIVADA 245
Query 228 KIEADRRLHVADEQIEHGLSEARRTLEEISQRRVGILEQLARIHAQLENIPALL 281
+ADR SE+ R L ++ RR I+AQL N+ +L
Sbjct 246 NAKADR-----------IRSESERELAALTNRRDS-------INAQLTNVREML 281
Lambda K H
0.316 0.127 0.333
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 515090394876
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40