BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1836c
Length=677
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608973|ref|NP_216352.1| hypothetical protein Rv1836c [Mycob... 1346 0.0
gi|289443311|ref|ZP_06433055.1| conserved hypothetical protein [... 1343 0.0
gi|289753929|ref|ZP_06513307.1| conserved hypothetical protein [... 1342 0.0
gi|31793026|ref|NP_855519.1| hypothetical protein Mb1867c [Mycob... 1341 0.0
gi|340626844|ref|YP_004745296.1| hypothetical protein MCAN_18511... 1333 0.0
gi|254232015|ref|ZP_04925342.1| conserved hypothetical protein [... 1169 0.0
gi|240171759|ref|ZP_04750418.1| hypothetical protein MkanA1_2076... 932 0.0
gi|296164820|ref|ZP_06847379.1| conserved hypothetical protein [... 929 0.0
gi|254820855|ref|ZP_05225856.1| hypothetical protein MintA_13055... 890 0.0
gi|118618442|ref|YP_906774.1| hypothetical protein MUL_3054 [Myc... 890 0.0
gi|254775357|ref|ZP_05216873.1| hypothetical protein MaviaA2_118... 887 0.0
gi|183982719|ref|YP_001851010.1| hypothetical protein MMAR_2712 ... 885 0.0
gi|118463060|ref|YP_882067.1| hypothetical protein MAV_2881 [Myc... 884 0.0
gi|41407646|ref|NP_960482.1| hypothetical protein MAP1548c [Myco... 882 0.0
gi|342861261|ref|ZP_08717909.1| hypothetical protein MCOL_20356 ... 881 0.0
gi|336457597|gb|EGO36602.1| hypothetical protein MAPs_21550 [Myc... 880 0.0
gi|15828117|ref|NP_302380.1| hypothetical protein ML2070 [Mycoba... 872 0.0
gi|2578378|emb|CAA15460.1| hypothetical protein MLCB1788.28 [Myc... 872 0.0
gi|333990568|ref|YP_004523182.1| hypothetical protein JDM601_192... 753 0.0
gi|118471824|ref|YP_887944.1| hypothetical protein MSMEG_3641 [M... 679 0.0
gi|289750409|ref|ZP_06509787.1| conserved hypothetical protein [... 677 0.0
gi|126435425|ref|YP_001071116.1| hypothetical protein Mjls_2845 ... 645 0.0
gi|108799784|ref|YP_639981.1| hypothetical protein Mmcs_2818 [My... 642 0.0
gi|315444286|ref|YP_004077165.1| hypothetical protein Mspyr1_269... 628 1e-177
gi|145223954|ref|YP_001134632.1| von Willebrand factor, type A [... 613 2e-173
gi|120404077|ref|YP_953906.1| von Willebrand factor, type A [Myc... 575 7e-162
gi|169629494|ref|YP_001703143.1| hypothetical protein MAB_2408c ... 471 2e-130
gi|111017918|ref|YP_700890.1| hypothetical protein RHA1_ro00900 ... 228 2e-57
gi|226360049|ref|YP_002777827.1| hypothetical protein ROP_06350 ... 223 6e-56
gi|23821225|emb|CAD52984.1| hypothetical protein [Rhodococcus fa... 219 1e-54
gi|312139824|ref|YP_004007160.1| hypothetical protein REQ_24380 ... 212 2e-52
gi|54024463|ref|YP_118705.1| hypothetical protein nfa24940 [Noca... 210 5e-52
gi|325674365|ref|ZP_08154054.1| hypothetical protein HMPREF0724_... 210 6e-52
gi|343928469|ref|ZP_08767917.1| hypothetical protein GOALK_117_0... 191 5e-46
gi|229494861|ref|ZP_04388614.1| von Willebrand factor, type A [R... 190 7e-46
gi|226306705|ref|YP_002766665.1| hypothetical protein RER_32180 ... 184 4e-44
gi|326384959|ref|ZP_08206633.1| hypothetical protein SCNU_18532 ... 181 3e-43
gi|256379353|ref|YP_003103013.1| von Willebrand factor type A [A... 159 2e-36
gi|262202671|ref|YP_003273879.1| hypothetical protein Gbro_2768 ... 152 2e-34
gi|296140033|ref|YP_003647276.1| hypothetical protein Tpau_2330 ... 105 4e-20
gi|271962919|ref|YP_003337115.1| hypothetical protein Sros_1377 ... 102 2e-19
gi|296270634|ref|YP_003653266.1| family 1 extracellular solute-b... 100 1e-18
gi|29833533|ref|NP_828167.1| hypothetical protein SAV_6991 [Stre... 97.4 6e-18
gi|297198202|ref|ZP_06915599.1| von Willebrand factor [Streptomy... 95.9 2e-17
gi|296268733|ref|YP_003651365.1| hypothetical protein Tbis_0747 ... 93.6 1e-16
gi|271968871|ref|YP_003343067.1| hypothetical protein Sros_7651 ... 90.9 7e-16
gi|296268803|ref|YP_003651435.1| von Willebrand factor type A [T... 88.2 4e-15
gi|297162153|gb|ADI11865.1| hypothetical protein SBI_08747 [Stre... 87.4 9e-15
gi|291443250|ref|ZP_06582640.1| von Willebrand factor [Streptomy... 87.0 9e-15
gi|239986306|ref|ZP_04706970.1| hypothetical protein SrosN1_0326... 87.0 1e-14
>gi|15608973|ref|NP_216352.1| hypothetical protein Rv1836c [Mycobacterium tuberculosis H37Rv]
gi|15841304|ref|NP_336341.1| hypothetical protein MT1884 [Mycobacterium tuberculosis CDC1551]
gi|148661642|ref|YP_001283165.1| hypothetical protein MRA_1847 [Mycobacterium tuberculosis H37Ra]
60 more sequence titles
Length=677
Score = 1346 bits (3483), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 677/677 (100%), Positives = 677/677 (100%), Gaps = 0/677 (0%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS
Sbjct 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
Query 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI
Sbjct 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
Query 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY
Sbjct 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
Query 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS
Sbjct 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
Query 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP
Sbjct 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
Query 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD
Sbjct 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
Query 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS
Sbjct 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
Query 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM
Sbjct 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
Query 481 VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE
Sbjct 481 VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
Query 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV 600
VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV
Sbjct 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV 600
Query 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL
Sbjct 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
Query 661 ETSASPDLATAVNIFLS 677
ETSASPDLATAVNIFLS
Sbjct 661 ETSASPDLATAVNIFLS 677
>gi|289443311|ref|ZP_06433055.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289569910|ref|ZP_06450137.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289416230|gb|EFD13470.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289543664|gb|EFD47312.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=677
Score = 1343 bits (3477), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 676/677 (99%), Positives = 676/677 (99%), Gaps = 0/677 (0%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS
Sbjct 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
Query 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI
Sbjct 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
Query 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY
Sbjct 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
Query 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS
Sbjct 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
Query 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP
Sbjct 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
Query 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD
Sbjct 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
Query 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS
Sbjct 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
Query 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM
Sbjct 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
Query 481 VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
VTASAGVAATIMLDQSMPNDEGGN RLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE
Sbjct 481 VTASAGVAATIMLDQSMPNDEGGNGRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
Query 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV 600
VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV
Sbjct 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV 600
Query 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL
Sbjct 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
Query 661 ETSASPDLATAVNIFLS 677
ETSASPDLATAVNIFLS
Sbjct 661 ETSASPDLATAVNIFLS 677
>gi|289753929|ref|ZP_06513307.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289694516|gb|EFD61945.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
Length=677
Score = 1342 bits (3474), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 675/677 (99%), Positives = 676/677 (99%), Gaps = 0/677 (0%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS
Sbjct 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
Query 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI
Sbjct 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
Query 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY
Sbjct 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
Query 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS
Sbjct 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
Query 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP
Sbjct 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
Query 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPK+ADDSLTAAMDTLLKPGD
Sbjct 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKVADDSLTAAMDTLLKPGD 360
Query 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS
Sbjct 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
Query 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM
Sbjct 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
Query 481 VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
VTASAGVAATIMLDQSMPNDEGGN RLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE
Sbjct 481 VTASAGVAATIMLDQSMPNDEGGNGRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
Query 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV 600
VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV
Sbjct 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV 600
Query 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL
Sbjct 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
Query 661 ETSASPDLATAVNIFLS 677
ETSASPDLATAVNIFLS
Sbjct 661 ETSASPDLATAVNIFLS 677
>gi|31793026|ref|NP_855519.1| hypothetical protein Mb1867c [Mycobacterium bovis AF2122/97]
gi|121637739|ref|YP_977962.1| hypothetical protein BCG_1871c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224990223|ref|YP_002644910.1| hypothetical protein JTY_1855 [Mycobacterium bovis BCG str. Tokyo
172]
gi|31618617|emb|CAD94570.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
gi|121493386|emb|CAL71858.1| Conserved hypothetical protein [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224773336|dbj|BAH26142.1| hypothetical protein JTY_1855 [Mycobacterium bovis BCG str. Tokyo
172]
gi|341601766|emb|CCC64440.1| conserved hypothetical protein [Mycobacterium bovis BCG str.
Moreau RDJ]
Length=677
Score = 1341 bits (3470), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 675/677 (99%), Positives = 675/677 (99%), Gaps = 0/677 (0%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS
Sbjct 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
Query 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI
Sbjct 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
Query 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
VALVAVVVMVAGVILW FFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY
Sbjct 121 VALVAVVVMVAGVILWCFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
Query 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS
Sbjct 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
Query 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP
Sbjct 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
Query 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD
Sbjct 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
Query 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS
Sbjct 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
Query 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM
Sbjct 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
Query 481 VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE
Sbjct 481 VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
Query 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV 600
VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANY VGQANSVLV
Sbjct 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYHVGQANSVLV 600
Query 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL
Sbjct 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
Query 661 ETSASPDLATAVNIFLS 677
ETSASPDLATAVNIFLS
Sbjct 661 ETSASPDLATAVNIFLS 677
>gi|340626844|ref|YP_004745296.1| hypothetical protein MCAN_18511 [Mycobacterium canettii CIPT
140010059]
gi|340005034|emb|CCC44183.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=677
Score = 1333 bits (3451), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 669/677 (99%), Positives = 672/677 (99%), Gaps = 0/677 (0%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
MGRHSKPDPEDS D+LSDGHAAEQQHWED SGSYDYPGVDQPDDGPLSSEGHYSAVGGYS
Sbjct 1 MGRHSKPDPEDSADELSDGHAAEQQHWEDTSGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
Query 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
ASGSEDYPDIPPRPDWEPTG EPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI
Sbjct 61 ASGSEDYPDIPPRPDWEPTGTEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
Query 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY
Sbjct 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
Query 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS
Sbjct 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
Query 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP
Sbjct 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
Query 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD
Sbjct 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
Query 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS
Sbjct 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
Query 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
AASAFARY+HKPEQLAKLA+AGFRVSDVKPPSSPVTSFPAL STLSVGDDSMRATLADTM
Sbjct 421 AASAFARYMHKPEQLAKLAKAGFRVSDVKPPSSPVTSFPALSSTLSVGDDSMRATLADTM 480
Query 481 VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE
Sbjct 481 VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
Query 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV 600
VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANY VGQANSVLV
Sbjct 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYHVGQANSVLV 600
Query 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL
Sbjct 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
Query 661 ETSASPDLATAVNIFLS 677
ETSASPDLATAVNIFLS
Sbjct 661 ETSASPDLATAVNIFLS 677
>gi|254232015|ref|ZP_04925342.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|124601074|gb|EAY60084.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
Length=679
Score = 1169 bits (3025), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 648/679 (96%), Positives = 652/679 (97%), Gaps = 2/679 (0%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS
Sbjct 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
Query 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI
Sbjct 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
Query 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY
Sbjct 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
Query 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS
Sbjct 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
Query 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP
Sbjct 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
Query 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD
Sbjct 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
Query 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS
Sbjct 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
Query 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM
Sbjct 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
Query 481 VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE
Sbjct 481 VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
Query 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV 600
VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV
Sbjct 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV 600
Query 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGA-DPDRATWEAVAQL-SGGSYQ 658
ITAG + G ++FIRKSADPAKPIA PDR TWEAVA +GGSYQ
Sbjct 601 ITAGRIRTKPSTGRACRNFIRKSADPAKPIAGQHPSTSVLIPDRPTWEAVAPAPAGGSYQ 660
Query 659 NLETSASPDLATAVNIFLS 677
NLETS SP ATAVNIFLS
Sbjct 661 NLETSPSPRPATAVNIFLS 679
>gi|240171759|ref|ZP_04750418.1| hypothetical protein MkanA1_20765 [Mycobacterium kansasii ATCC
12478]
Length=708
Score = 932 bits (2409), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 516/708 (73%), Positives = 578/708 (82%), Gaps = 31/708 (4%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDI---------SGSYDYPGVDQPD-------- 43
MGRHS PDPEDS D+ D +AAEQQ W D YPG +P
Sbjct 1 MGRHSLPDPEDSADEPPDEYAAEQQDWADQIADQPGGGRHSEVGYPGSAEPSAVEPPSGR 60
Query 44 ---------DGPLSSEGHYSAVGG----YSASGSEDYPDIPPRPDWEPTGAEPIAAAPPP 90
+G LS HY+ G YSA G+++YPD P E + AAAPPP
Sbjct 61 GYADRAYWSEGDLSDGAHYAGAGDHAADYSADGADEYPDFGSGPAGEEPPSPESAAAPPP 120
Query 91 LFRF-GHRGPGDWQAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHT 149
FR GHRG G+WQ GHRSADG RGVSIGVIVALVAVVV+VAGVI+WRFFG+AL +RS T
Sbjct 121 PFRTAGHRGLGNWQGGHRSADGWRGVSIGVIVALVAVVVVVAGVIVWRFFGEALYHRSRT 180
Query 150 AAARCVGGKDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFI 209
AAARCVGGKDTVAVIADP+IA++V E ADSYNASAGPVGD+CV VAV +A SDAVI GFI
Sbjct 181 AAARCVGGKDTVAVIADPTIAERVNEFADSYNASAGPVGDKCVTVAVKAADSDAVIAGFI 240
Query 210 GKWPTELGGQPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQ 269
GKWP+ELGGQPGLWIP SS+SAARL+ A G Q ISDSRSL SPVLLA+RPELQQ L+NQ
Sbjct 241 GKWPSELGGQPGLWIPGSSVSAARLSAATGKQTISDSRSLATSPVLLAIRPELQQPLSNQ 300
Query 270 NWAALPGLQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAG 329
NWAALP LQ NPNS++ L+LP+WGSLRLAMP +GNGDAA+LAGEA+A ASAP GAP TAG
Sbjct 301 NWAALPQLQANPNSMAALNLPSWGSLRLAMPVAGNGDAAFLAGEAIAVASAPPGAPPTAG 360
Query 330 IGAVRTLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENT 389
GAVRTLM A+PKLAD+SLT AM+TL+K GD A APVHAVVTTEQQLFQR QSLSDA+
Sbjct 361 SGAVRTLMAAQPKLADESLTEAMNTLVKSGDAAAAPVHAVVTTEQQLFQRAQSLSDAKKV 420
Query 390 LGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVK 449
L SWLPPGP A+ADYP VLL+G+WLSQEQT+AASAFARYL KP+QLAKLA+AGFRV+ VK
Sbjct 421 LSSWLPPGPVAIADYPAVLLNGSWLSQEQTTAASAFARYLQKPDQLAKLAKAGFRVNGVK 480
Query 450 PPSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSN 509
PSSPVTSF ALP+ LS+GDD MRATLADTM T S GVAATIMLDQSMP D+GG +RL+N
Sbjct 481 SPSSPVTSFAALPAPLSIGDDGMRATLADTMATPSIGVAATIMLDQSMPTDDGGKTRLAN 540
Query 510 VVAALENRIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSS 569
VVAAL++R+K +PPS+V+GLWTFDG EGR+EVP GPLADPVNGQPR AAL AAL KQYSS
Sbjct 541 VVAALQSRLKTLPPSAVIGLWTFDGHEGRSEVPTGPLADPVNGQPRSAALIAALDKQYSS 600
Query 570 GGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKP 629
GGGAVSFTTLR+IYQE+ AN+R GQANS+LVIT GPHTDQTLDGPGLQDFIR SADPAKP
Sbjct 601 GGGAVSFTTLRMIYQEVQANFRAGQANSILVITGGPHTDQTLDGPGLQDFIRTSADPAKP 660
Query 630 IAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
IAVNIIDFGADPDR+TWEAVAQLSGGSYQNL TSA PDLATA++IFLS
Sbjct 661 IAVNIIDFGADPDRSTWEAVAQLSGGSYQNLATSAGPDLATALSIFLS 708
>gi|296164820|ref|ZP_06847379.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295899834|gb|EFG79281.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=693
Score = 929 bits (2401), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 490/700 (70%), Positives = 562/700 (81%), Gaps = 30/700 (4%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDIS-----------------------GSYDYP 37
MGRHS PDPEDSV + E+ + ++ + D
Sbjct 1 MGRHSLPDPEDSVGGPHESGETERDRRDAVTEDHADHADDGDCPPDDDRYADDDYADDDR 60
Query 38 GVDQPDDGPLSSEGHYSAVGGYSASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHR 97
VD+ D E Y+ ++ ++YP+ PPR P+ AEP AA+P LF GHR
Sbjct 61 YVDEYAD-----EEPYADEDAFADGAGDEYPEFPPRRS-GPSSAEPPAASPS-LFARGHR 113
Query 98 GPGDWQAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGG 157
G G+W+ GHRS GRRGVS+GVIVALVAV+V+V VILW FFGDALSNRSH AA RCVGG
Sbjct 114 GLGEWRGGHRSEGGRRGVSVGVIVALVAVIVVVGTVILWSFFGDALSNRSHRAAGRCVGG 173
Query 158 KDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELG 217
K+TVAVIADPSIAD V++ A+SYN+SAGPVGD C+ V+V AGSDAV+NGFIGKWP EL
Sbjct 174 KETVAVIADPSIADAVRQFAESYNSSAGPVGDHCMEVSVKPAGSDAVLNGFIGKWPAELS 233
Query 218 GQPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGL 277
GQP LWIP SS+SAARL GA + I+DSRSLV SPV+LAVRPELQ ALA QNWAALPGL
Sbjct 234 GQPALWIPGSSVSAARLAGAMAQKTITDSRSLVTSPVVLAVRPELQPALAGQNWAALPGL 293
Query 278 QTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLM 337
QTNPN L+GL+LP WGSLRLA+P GNGDAA+LAGEAVAAAS PAGAPAT G GAVR L+
Sbjct 294 QTNPNGLAGLNLPGWGSLRLALPMKGNGDAAFLAGEAVAAASVPAGAPATQGTGAVRALL 353
Query 338 GARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPG 397
+PKLAD+SLT AM+ LLKPGD ATAPVHAVVTTEQQLFQRGQSL DA++ L SWLPPG
Sbjct 354 SGQPKLADNSLTEAMNALLKPGDAATAPVHAVVTTEQQLFQRGQSLPDAKSALASWLPPG 413
Query 398 PAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTS 457
P VADYPTVLLSG+WL+QEQTSAAS FAR++HKP QL KLA+AGFRV+ V PPSSPVT+
Sbjct 414 PVPVADYPTVLLSGSWLTQEQTSAASEFARFMHKPHQLDKLAKAGFRVNGVTPPSSPVTT 473
Query 458 FPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENR 517
FPALP+TLSVGD++MRATLA+ M T S+G+AATIMLDQSMP EGG +RL+NV+AAL+++
Sbjct 474 FPALPATLSVGDEAMRATLAEAMATPSSGLAATIMLDQSMPGQEGGKTRLANVIAALQDK 533
Query 518 IKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFT 577
IKA+PP+SVVGLWTFDG EGR+EVP GPL+DPVNGQPR AALTAAL KQYSS GGAVSFT
Sbjct 534 IKALPPTSVVGLWTFDGHEGRSEVPGGPLSDPVNGQPRSAALTAALDKQYSSPGGAVSFT 593
Query 578 TLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDF 637
TLR+IYQ++ ANYR GQ NS+LVITAGPHTDQTLDG GLQDFIRKSADPAKPIAVN+IDF
Sbjct 594 TLRMIYQDLQANYRAGQINSILVITAGPHTDQTLDGAGLQDFIRKSADPAKPIAVNVIDF 653
Query 638 GADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
GADPDRATWEAVAQLSGG YQNL TSASP+LA A+N FLS
Sbjct 654 GADPDRATWEAVAQLSGGGYQNLTTSASPELAAALNAFLS 693
>gi|254820855|ref|ZP_05225856.1| hypothetical protein MintA_13055 [Mycobacterium intracellulare
ATCC 13950]
Length=687
Score = 890 bits (2301), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/637 (74%), Positives = 535/637 (84%), Gaps = 5/637 (0%)
Query 43 DDGPLSSEGHYSAVGGYSASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDW 102
DD P + + Y+A ++ S +++YP+ PPR P +EP A +P LFR GHRG DW
Sbjct 52 DDEPYADDEPYAAGDAFADSTADEYPEFPPR-QGGPASSEPPAESPS-LFRGGHRGLADW 109
Query 103 QAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVA 162
+ GHRS GRRGVSIGVIVALVAVVV+V VILW FFGD LS RSH AA RCVGG++TVA
Sbjct 110 RGGHRSEGGRRGVSIGVIVALVAVVVVVGSVILWSFFGDVLSKRSHKAAGRCVGGQETVA 169
Query 163 VIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGL 222
V+ADPSIA V+E A+SYN SAGP+GDRC+ V V A SDAV+NGFIGKWP ELGGQP L
Sbjct 170 VVADPSIATSVQELAESYNKSAGPIGDRCMVVNVKPADSDAVLNGFIGKWPAELGGQPAL 229
Query 223 WIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPN 282
WIP SSISAARL GAA + IS+S SLV SPV+LAVRPEL ALA QNWAALPGLQTNPN
Sbjct 230 WIPGSSISAARLAGAATQKTISESHSLVSSPVVLAVRPELAPALAKQNWAALPGLQTNPN 289
Query 283 SLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPK 342
+L+GL+LPAWGSLRLA+P GNGDA++LAGEAVAAAS P GAP G AVR+L+ A+PK
Sbjct 290 ALAGLNLPAWGSLRLALPMGGNGDASFLAGEAVAAASVPPGAPVPQGTAAVRSLLSAQPK 349
Query 343 LADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVA 402
LAD+SLT AM+TLLKPGD ATAPVHAV+TTEQQLFQRGQSL DA++ L SWLPPGPA VA
Sbjct 350 LADNSLTEAMNTLLKPGDPATAPVHAVITTEQQLFQRGQSLPDAKSALASWLPPGPAPVA 409
Query 403 DYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALP 462
DYPTVLLSG+WL++EQ +AAS F+R++HKP+QLAKLA+AGFRV+ VK PSSPVT+FP LP
Sbjct 410 DYPTVLLSGSWLTREQATAASEFSRFMHKPDQLAKLAKAGFRVNGVKTPSSPVTTFPTLP 469
Query 463 STLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMP 522
STL+VGDD MRATLA+ M S G A TIMLDQSMP EG SRL+NV+AAL++RIKA+P
Sbjct 470 STLTVGDDPMRATLAEAMAAPSTGQATTIMLDQSMPGQEGAKSRLANVIAALQDRIKALP 529
Query 523 PSSVVGLWTFDGREGRTEVPAGPLADPV---NGQPRPAALTAALGKQYSSGGGAVSFTTL 579
S+VVGLWTFDG EGR+EV +GPLADPV +GQPR AAL AAL KQYSSGGGAVSFTTL
Sbjct 530 ASAVVGLWTFDGHEGRSEVASGPLADPVGGSSGQPRSAALLAALDKQYSSGGGAVSFTTL 589
Query 580 RLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGA 639
R+IYQ+M ANY GQANS+LVIT+GPHTDQTLDGPGLQDFIRKSADPAKPIAVN+IDFGA
Sbjct 590 RMIYQDMQANYHAGQANSLLVITSGPHTDQTLDGPGLQDFIRKSADPAKPIAVNVIDFGA 649
Query 640 DPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFL 676
DPDR+TWEAVAQLSGGSYQN+ TSASP+LATAVN FL
Sbjct 650 DPDRSTWEAVAQLSGGSYQNIATSASPELATAVNAFL 686
>gi|118618442|ref|YP_906774.1| hypothetical protein MUL_3054 [Mycobacterium ulcerans Agy99]
gi|118570552|gb|ABL05303.1| conserved membrane protein [Mycobacterium ulcerans Agy99]
Length=735
Score = 890 bits (2300), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 503/735 (69%), Positives = 570/735 (78%), Gaps = 58/735 (7%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWED---------------------------ISGS 33
MGRHS PDPEDS D+ SD H AE Q W+D +G
Sbjct 1 MGRHSLPDPEDSADEPSDDHDAENQDWDDELTGQPGGGADSAAADPGAFAHPQTADSAGG 60
Query 34 YDYPGVDQP------------DDGPLSSEGH-----------------YSAVGGYSASGS 64
Y YPG +QP D+ +G+ Y A + G+
Sbjct 61 YPYPGWEQPGDTVGHFGDQEADEDSADEDGYWADEQVFDESQYLEQDPYGADDRHPELGA 120
Query 65 EDYPDIPPRPDW-EPTGAEPIAAAPPPLFRF-GHRGPGDWQAGHRSADGRRGVSIGVIVA 122
E+YPD PD EP+ +P A PP LFR GHRG WQ GHRSADGRRGVS+GVIVA
Sbjct 121 EEYPDFGTHPDGPEPSDPKPAATPPPSLFRVAGHRGLRGWQGGHRSADGRRGVSVGVIVA 180
Query 123 LVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSYNA 182
LVAVVV+V GVI WRFFGD LSNRS TAAARCVGG DTVAVIADPSIADQV + ADSYNA
Sbjct 181 LVAVVVVVVGVIGWRFFGDVLSNRSQTAAARCVGGNDTVAVIADPSIADQVNDFADSYNA 240
Query 183 SAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGSQA 242
S+GP+GDRCV+VAV +A +DAVI GFIGKWP+ELG QPGLWIP SS+SAARL AAG +A
Sbjct 241 SSGPIGDRCVSVAVNAADADAVITGFIGKWPSELGAQPGLWIPGSSVSAARLVQAAGKEA 300
Query 243 ISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMPSS 302
ISDSRSLV SPVLLA+RPELQQAL NQNWAA+PGLQ++PNS++GL LP+WGSLRLA+P
Sbjct 301 ISDSRSLVTSPVLLAIRPELQQALGNQNWAAVPGLQSDPNSMAGLKLPSWGSLRLALPVG 360
Query 303 GNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGDVA 362
GNGDA +LAGEAVAAASAPA AP TAGIGAVRTLM +PKLAD SL+ AMD LLK DVA
Sbjct 361 GNGDATFLAGEAVAAASAPADAPPTAGIGAVRTLMATQPKLADGSLSEAMDALLKADDVA 420
Query 363 TAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAA 422
APVHAV+TTEQQLF R QSLSDA+ L SWLPPGP AVADYP VLL+G+WLSQEQT+AA
Sbjct 421 AAPVHAVITTEQQLFLRAQSLSDAKKKLSSWLPPGPVAVADYPAVLLNGSWLSQEQTTAA 480
Query 423 SAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTMVT 482
SAFARY+HKPEQLAKLA+AGFRV DVKPPSSPVTSFPALP+ LSVGD+ +RATLAD +
Sbjct 481 SAFARYVHKPEQLAKLAKAGFRVDDVKPPSSPVTSFPALPAPLSVGDEGIRATLADAVAA 540
Query 483 ASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTEVP 542
S GVAATIMLDQS+ D+GG +RL+N+VAAL+NR+K + P+S VGLWTFDGREGRTEVP
Sbjct 541 PSMGVAATIMLDQSLSTDDGGKTRLTNIVAALQNRVKTLLPTSAVGLWTFDGREGRTEVP 600
Query 543 AGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVIT 602
GPLADPVNGQPR AAL AAL KQYSS GGAVSFTTLR+IY+++ A++R QANS+LVIT
Sbjct 601 TGPLADPVNGQPRSAALNAALDKQYSSNGGAVSFTTLRMIYEDVQAHFRADQANSILVIT 660
Query 603 AGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLET 662
GPHTDQ+LDGPGL++FIR SADPAKPIAVN+IDFGADPDR TWEAVAQLSGGSYQNL T
Sbjct 661 GGPHTDQSLDGPGLENFIRTSADPAKPIAVNVIDFGADPDRKTWEAVAQLSGGSYQNLAT 720
Query 663 SASPDLATAVNIFLS 677
S P+LA AV+ FLS
Sbjct 721 STGPNLAAAVDTFLS 735
>gi|254775357|ref|ZP_05216873.1| hypothetical protein MaviaA2_11896 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=683
Score = 887 bits (2293), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 481/695 (70%), Positives = 556/695 (80%), Gaps = 30/695 (4%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYS-----A 55
MGRHS PDP+D +D+ S H +++ D + ++D G PD+G E Y A
Sbjct 1 MGRHSAPDPDDFLDEPSPDHPVDER---DDAYAFDAQGA--PDEGYYPDERRYPDADFVA 55
Query 56 VGGYS----ASGSE-------DYPDIPPRPDWEPTGAEPIAAAPPPLF--RFGHRGPGDW 102
Y+ A G + DYP+ P R P+G++ A+ P L R DW
Sbjct 56 DDDYAPEEFAPGEDLVDEDPDDYPEFPSRRP-APSGSQESPASAPSLRARRL------DW 108
Query 103 QAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVA 162
+ GHRS GRRGVSIGVIVALVAVVV+V VILWRFFGDALS RSHTAA RCVGG++ V
Sbjct 109 RGGHRSEGGRRGVSIGVIVALVAVVVVVGSVILWRFFGDALSKRSHTAAGRCVGGQEQVP 168
Query 163 VIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGL 222
V+ADPSIAD + + A+S+N SAGP+GD C+ V+V AGSDAV+NGFIGKWP ELGGQP L
Sbjct 169 VVADPSIADAIGQFAESFNKSAGPIGDHCMVVSVKPAGSDAVLNGFIGKWPAELGGQPAL 228
Query 223 WIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPN 282
WIP SS+SAARL GA + I++S SL SPV+LAVRPEL AL+ QNWAALPGLQTNPN
Sbjct 229 WIPGSSVSAARLAGATAQKTITESHSLASSPVVLAVRPELLPALSGQNWAALPGLQTNPN 288
Query 283 SLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPK 342
+L+GL+LPAWGSLRLA+P +GNGDAA+LAGEAVAAAS P GAP T G GAVRTL+ A+PK
Sbjct 289 ALAGLNLPAWGSLRLALPMTGNGDAAFLAGEAVAAASVPPGAPVTQGTGAVRTLLSAQPK 348
Query 343 LADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVA 402
LAD+SLT AM+TLLKPGD A+APVHAVVTTEQQLFQRGQSL D + L SWLPPG AAVA
Sbjct 349 LADNSLTEAMNTLLKPGDPASAPVHAVVTTEQQLFQRGQSLPDTKGALASWLPPGAAAVA 408
Query 403 DYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALP 462
DYPTVLLSG+WL++EQ SAAS F+R++HK +QLAKLA+AGFRV+ VKPPSSPVT+FPALP
Sbjct 409 DYPTVLLSGSWLTREQASAASEFSRFMHKSDQLAKLAKAGFRVNGVKPPSSPVTTFPALP 468
Query 463 STLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMP 522
STLSVGDD+MRATLA+ M + S G A TIMLDQSMP EGG SRL+NV+ AL+++IKA+P
Sbjct 469 STLSVGDDAMRATLAEAMASPSTGQATTIMLDQSMPGQEGGKSRLANVIGALQDKIKALP 528
Query 523 PSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLI 582
S+VVGLWTFDG EGR+EV +GPLADPVNGQPR AAL+AAL KQYSS GGAVSFTTLR+I
Sbjct 529 ASAVVGLWTFDGHEGRSEVTSGPLADPVNGQPRSAALSAALDKQYSSSGGAVSFTTLRMI 588
Query 583 YQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPD 642
YQ+M +NY GQ NS+LVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPD
Sbjct 589 YQDMQSNYHAGQTNSILVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPD 648
Query 643 RATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
R TWEAVAQLSGGSYQNL TSASPDLATAVN FLS
Sbjct 649 RTTWEAVAQLSGGSYQNLATSASPDLATAVNAFLS 683
>gi|183982719|ref|YP_001851010.1| hypothetical protein MMAR_2712 [Mycobacterium marinum M]
gi|183176045|gb|ACC41155.1| conserved membrane protein [Mycobacterium marinum M]
Length=735
Score = 885 bits (2287), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 507/735 (69%), Positives = 572/735 (78%), Gaps = 58/735 (7%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWED---------------------------ISGS 33
MGRHS PDPEDS D+ SD H AE Q W+D +G
Sbjct 1 MGRHSLPDPEDSADEPSDDHDAENQDWDDELTGQPGGGADSAAADPGAFAHPQTADSAGG 60
Query 34 YDYPGVDQP------------DDGPLSSEGH-----------------YSAVGGYSASGS 64
Y YPG +Q D+ +G+ Y A + G+
Sbjct 61 YPYPGWEQSGDTVGHFGDQEADEDSADEDGYWADEQVFDESQYLEQDPYGADDRHPELGA 120
Query 65 EDYPDIPPRPDW-EPTGAEPIAAAPPPLFRF-GHRGPGDWQAGHRSADGRRGVSIGVIVA 122
E+YPD PD EP+ +P A PPPLFR GHRG WQ GHRSADGRRGVS+GVIVA
Sbjct 121 EEYPDFGTHPDGPEPSDPKPAATPPPPLFRVAGHRGLRGWQGGHRSADGRRGVSVGVIVA 180
Query 123 LVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSYNA 182
LVAVVV+V GVI WRFFGD LSNRS TAAARCVGG DTVAVIADPSIADQV + ADSYNA
Sbjct 181 LVAVVVVVVGVIGWRFFGDVLSNRSQTAAARCVGGNDTVAVIADPSIADQVNDFADSYNA 240
Query 183 SAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGSQA 242
S+GP+GDRCV+VAV +A +DAVI GFIGKWP+ELG QPGLWIP SS+SAARL AAG +A
Sbjct 241 SSGPIGDRCVSVAVKAADADAVITGFIGKWPSELGAQPGLWIPGSSVSAARLVQAAGKEA 300
Query 243 ISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMPSS 302
ISDSRSLV SPVLLA+RPELQQAL NQNWAALPGLQ++PNS++GL LP+WGSLRLA+P
Sbjct 301 ISDSRSLVTSPVLLAIRPELQQALGNQNWAALPGLQSDPNSMAGLKLPSWGSLRLALPVG 360
Query 303 GNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGDVA 362
GNGDA +LAGEAVAAASAPA AP TAGIGAVRTLM +PKLAD SL+ AMD LLK DVA
Sbjct 361 GNGDATFLAGEAVAAASAPADAPPTAGIGAVRTLMATQPKLADGSLSEAMDALLKADDVA 420
Query 363 TAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAA 422
APVHAV+TTEQQLF R QSLSDA+ L SWLPPGP AVADYP VLL+G+WLSQEQT+AA
Sbjct 421 AAPVHAVITTEQQLFLRAQSLSDAKKKLSSWLPPGPVAVADYPAVLLNGSWLSQEQTTAA 480
Query 423 SAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTMVT 482
SAFARY+HKPEQLAKLA+AGFRV DVKPPSSPVTSFPALP+ LSVGD+ +RATLAD +
Sbjct 481 SAFARYVHKPEQLAKLAKAGFRVDDVKPPSSPVTSFPALPAPLSVGDEGIRATLADAVAA 540
Query 483 ASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTEVP 542
S GVAATIMLDQS+ D+GG +RL+N+VAAL+NRIK +PP+S VGLWTFDGREGRTEVP
Sbjct 541 PSMGVAATIMLDQSLSTDDGGKTRLTNIVAALQNRIKTLPPTSAVGLWTFDGREGRTEVP 600
Query 543 AGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVIT 602
GPLADPVNGQPR AAL AALGKQYSS GGAVSFTTLR+IY+++ A++R QANS+LVIT
Sbjct 601 TGPLADPVNGQPRSAALNAALGKQYSSNGGAVSFTTLRMIYEDVQAHFRADQANSILVIT 660
Query 603 AGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLET 662
GPHTDQ+LDGPGL++FIR SADPAKPIAVN+IDFGADPDR TWEAVAQLSGGSYQNL T
Sbjct 661 GGPHTDQSLDGPGLENFIRTSADPAKPIAVNVIDFGADPDRKTWEAVAQLSGGSYQNLAT 720
Query 663 SASPDLATAVNIFLS 677
S P+LA AV+ FLS
Sbjct 721 STGPNLAAAVDTFLS 735
>gi|118463060|ref|YP_882067.1| hypothetical protein MAV_2881 [Mycobacterium avium 104]
gi|118164347|gb|ABK65244.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=683
Score = 884 bits (2285), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/695 (70%), Positives = 555/695 (80%), Gaps = 30/695 (4%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYS-----A 55
MGRHS PDP+D +D+ S H +++ D + ++D G PD+G E Y A
Sbjct 1 MGRHSAPDPDDFLDEPSPDHPVDER---DDAYAFDAQGA--PDEGYYPDERRYPDADFVA 55
Query 56 VGGYS----ASGSE-------DYPDIPPRPDWEPTGAEPIAAAPPPLF--RFGHRGPGDW 102
Y+ A G + DYP+ P R P+G++ A+ P L R DW
Sbjct 56 DDDYTPEEFAPGEDLVDEDPDDYPEFPSRRP-APSGSQESPASAPSLRARRL------DW 108
Query 103 QAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVA 162
+ GHRS GRRGVSIGVIVALVAVVV+V VILWRFFGDALS RSHTAA RCVGG++ V
Sbjct 109 RGGHRSEGGRRGVSIGVIVALVAVVVVVGSVILWRFFGDALSKRSHTAAGRCVGGQEQVP 168
Query 163 VIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGL 222
V+ADPSIAD + + A+S+N SAGP+GD C+ V+V AGSDAV+NGFIGKWP ELGGQP L
Sbjct 169 VVADPSIADAIGQFAESFNKSAGPIGDHCMVVSVKPAGSDAVLNGFIGKWPAELGGQPAL 228
Query 223 WIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPN 282
WIP SS+SAARL GA + I++S SL SPV+LAVRPEL AL+ QNWAALPGLQTNPN
Sbjct 229 WIPGSSVSAARLAGATAQKTITESHSLASSPVVLAVRPELLPALSGQNWAALPGLQTNPN 288
Query 283 SLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPK 342
+L+GL+LPAWGSLRLA+P +GNGDAA+LAGEAVAAAS P GAP T G GAVRTL+ A+PK
Sbjct 289 ALAGLNLPAWGSLRLALPMTGNGDAAFLAGEAVAAASVPPGAPVTQGTGAVRTLLSAQPK 348
Query 343 LADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVA 402
LAD+SLT AM+TLLKPGD A+APVH VVTTEQQLFQRGQSL DA+ L SWLPPG AAVA
Sbjct 349 LADNSLTEAMNTLLKPGDPASAPVHGVVTTEQQLFQRGQSLPDAKGALASWLPPGAAAVA 408
Query 403 DYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALP 462
DYPTVLLSG+WL++EQ SAAS F+R++HK +QLAKLA+AGFRV+ KPPSSPVT+FPALP
Sbjct 409 DYPTVLLSGSWLTREQASAASEFSRFMHKSDQLAKLAKAGFRVNGGKPPSSPVTTFPALP 468
Query 463 STLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMP 522
STLSVGDD+MRATLA+ M + S G A TIMLDQSMP EGG SRL+NV+ AL+++IKA+P
Sbjct 469 STLSVGDDAMRATLAEAMASPSTGQATTIMLDQSMPGQEGGKSRLANVIGALQDKIKALP 528
Query 523 PSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLI 582
S+VVGLWTFDG EGR+EV +GPLADPVNGQPR AAL+AAL KQYSS GGAVSFTTLR+I
Sbjct 529 ASAVVGLWTFDGHEGRSEVTSGPLADPVNGQPRSAALSAALDKQYSSSGGAVSFTTLRMI 588
Query 583 YQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPD 642
YQ+M +NY GQ NS+LVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPD
Sbjct 589 YQDMQSNYHAGQTNSILVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPD 648
Query 643 RATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
R TWEAVAQLSGGSYQNL TSASPDLATAVN FLS
Sbjct 649 RTTWEAVAQLSGGSYQNLATSASPDLATAVNAFLS 683
>gi|41407646|ref|NP_960482.1| hypothetical protein MAP1548c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41395999|gb|AAS03865.1| hypothetical protein MAP_1548c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=683
Score = 882 bits (2279), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/693 (69%), Positives = 551/693 (80%), Gaps = 26/693 (3%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYS-----A 55
MGRHS PDP+D +D+ S H +++ D + ++D G PD+G E Y A
Sbjct 1 MGRHSAPDPDDFLDEPSPDHPVDER---DDAYAFDAQGA--PDEGYYPDERRYPDADFVA 55
Query 56 VGGYS----ASGSE-------DYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQA 104
Y+ A G + DYP+ P R E A+AP R DW+
Sbjct 56 DDDYAPEEFAPGEDLVDEDPDDYPEFPSRRPATSGPQESPASAPSLRARRL-----DWRG 110
Query 105 GHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVI 164
GHRS GRRGVSIGVIVALVAVVV+V VILWRFFGDALS RSHTAA RCVGG++ V V+
Sbjct 111 GHRSEGGRRGVSIGVIVALVAVVVVVGSVILWRFFGDALSKRSHTAAGRCVGGQEQVPVV 170
Query 165 ADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWI 224
ADPSIAD + + A+S+N SAGP+GD C+ V+V AGSDAV+NGFIGKWP ELGGQP LWI
Sbjct 171 ADPSIADAIGQFAESFNKSAGPIGDHCMVVSVKPAGSDAVLNGFIGKWPAELGGQPALWI 230
Query 225 PSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSL 284
P SS+SAARL GA + I++S SL SPV+LAVRPEL AL+ QNWAALPGLQTNPN+L
Sbjct 231 PGSSVSAARLAGATAQKTITESHSLASSPVVLAVRPELLPALSGQNWAALPGLQTNPNAL 290
Query 285 SGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLA 344
+GL+LPAWGSLRLA+P +GNGDAA+LAGEAVAAAS P GAP T G GAVRTL+ A+PKLA
Sbjct 291 AGLNLPAWGSLRLALPMTGNGDAAFLAGEAVAAASVPPGAPVTQGTGAVRTLLSAQPKLA 350
Query 345 DDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADY 404
D+SLT AM+TLLKPGD A+APVHAVVTTEQQLFQRGQSL DA+ L SWLPPG AAVADY
Sbjct 351 DNSLTEAMNTLLKPGDSASAPVHAVVTTEQQLFQRGQSLPDAKGALASWLPPGAAAVADY 410
Query 405 PTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPST 464
PTVLLSG+WL++EQ SAAS F+R++HK +QLAKLA+AGFRV+ KPPSSPVT+FPALPST
Sbjct 411 PTVLLSGSWLTREQASAASEFSRFMHKSDQLAKLAKAGFRVNGGKPPSSPVTTFPALPST 470
Query 465 LSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPS 524
LSVGDD+MRATLA+ M + S G A TIMLDQSMP EGG SRL+NV+ AL+++IKA+P S
Sbjct 471 LSVGDDAMRATLAEAMASPSTGQATTIMLDQSMPGQEGGKSRLANVIGALQDKIKALPAS 530
Query 525 SVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQ 584
+VVGLWTFDG EGR+EV +GPLADPVNGQPR AAL+AAL KQYSS GGAVSFTTLR+IYQ
Sbjct 531 AVVGLWTFDGHEGRSEVTSGPLADPVNGQPRSAALSAALDKQYSSSGGAVSFTTLRMIYQ 590
Query 585 EMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRA 644
+M +NY GQ NS+LVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVN+IDFGADPDR
Sbjct 591 DMQSNYHAGQTNSILVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNVIDFGADPDRT 650
Query 645 TWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
TWEAVAQLSGG YQNL TSASPDLATAVN FLS
Sbjct 651 TWEAVAQLSGGGYQNLATSASPDLATAVNAFLS 683
>gi|342861261|ref|ZP_08717909.1| hypothetical protein MCOL_20356 [Mycobacterium colombiense CECT
3035]
gi|342131161|gb|EGT84442.1| hypothetical protein MCOL_20356 [Mycobacterium colombiense CECT
3035]
Length=688
Score = 881 bits (2276), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 468/693 (68%), Positives = 551/693 (80%), Gaps = 21/693 (3%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPL------------- 47
MGRHS PDP+DS+D+ S ++ D +G + Y ++ L
Sbjct 1 MGRHSAPDPDDSLDEPSRDDVVDEPSRGDEAG-HRYRDAGDEEEADLYSDEDDYSDDDDH 59
Query 48 SSEGHYSAVGGY---SASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQA 104
+ EG+YS + ++DYP+ P R P EP A+ P LFR GHRG D
Sbjct 60 ADEGYYSDERRHPDDEDFAADDYPEFPSRAASSP---EPPASTPS-LFRGGHRGLADRLG 115
Query 105 GHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVI 164
GHRS GRRGVSIGVIVALVAVVV+V VILW FFGDALS RSHTAA RC GG++TVAV+
Sbjct 116 GHRSEAGRRGVSIGVIVALVAVVVVVGSVILWSFFGDALSKRSHTAAGRCSGGQETVAVV 175
Query 165 ADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWI 224
ADPSIAD V++ A+SYN SAGP+GD C+ V+V A SDAV+NGFIGKWP ELGGQP LWI
Sbjct 176 ADPSIADSVQQLAESYNKSAGPIGDHCMVVSVKPANSDAVLNGFIGKWPAELGGQPALWI 235
Query 225 PSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSL 284
P SSISAARL GA + I++S SLV SPV+LA+RP+L AL+NQNWAALPGLQTNPN+L
Sbjct 236 PGSSISAARLAGATAQKTITESHSLVTSPVVLAIRPQLAPALSNQNWAALPGLQTNPNAL 295
Query 285 SGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLA 344
+GL+LPAWG+LRLA+P +GNGDA++LAGEAVAAAS P GAP T G GAVR+L+ A+PKLA
Sbjct 296 AGLNLPAWGALRLALPMNGNGDASFLAGEAVAAASVPPGAPVTQGTGAVRSLLNAQPKLA 355
Query 345 DDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADY 404
D+SL AM++LLKPGD ATAPVHAVVTTEQQLFQRGQSL DA+ LG WLPPG AAVADY
Sbjct 356 DNSLNEAMNSLLKPGDPATAPVHAVVTTEQQLFQRGQSLPDAKGALGFWLPPGSAAVADY 415
Query 405 PTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPST 464
PTVLLSG+WLS+EQ SAAS F+RY+HK +QLAKLA+AGFRV+ VKPP SPVT+FPALP+
Sbjct 416 PTVLLSGSWLSREQASAASEFSRYMHKSDQLAKLAKAGFRVNGVKPPGSPVTNFPALPAA 475
Query 465 LSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPS 524
LSVGD+ +RATLA+ M + S+G A TIMLDQSMP EGG SRL+NV+ AL+++IK +P +
Sbjct 476 LSVGDEPLRATLAEAMASPSSGQATTIMLDQSMPGQEGGKSRLANVIGALQDKIKGLPGT 535
Query 525 SVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQ 584
+VVGLWTFDG EGR+EV +GPL+D VNGQPR AAL AAL KQYSSGGGAVSFTTLR++YQ
Sbjct 536 AVVGLWTFDGHEGRSEVASGPLSDAVNGQPRSAALAAALDKQYSSGGGAVSFTTLRMLYQ 595
Query 585 EMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRA 644
+M NY GQ NS+L+ITAGPHTDQTLDG GLQDF+RKSADPAKPIAVN+IDFGADPDRA
Sbjct 596 DMQTNYHAGQTNSILLITAGPHTDQTLDGSGLQDFVRKSADPAKPIAVNVIDFGADPDRA 655
Query 645 TWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
TWEAVAQLSGG YQNL TSASPDLA+A+N FLS
Sbjct 656 TWEAVAQLSGGGYQNLATSASPDLASAINAFLS 688
>gi|336457597|gb|EGO36602.1| hypothetical protein MAPs_21550 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=683
Score = 880 bits (2275), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/693 (69%), Positives = 550/693 (80%), Gaps = 26/693 (3%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYS-----A 55
MGRHS PDP+D +D+ S H +++ D + ++D G PD+G E Y A
Sbjct 1 MGRHSAPDPDDFLDEPSPDHPVDER---DDAYAFDAQGA--PDEGYYPDERRYPDADFVA 55
Query 56 VGGYS----ASGSE-------DYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQA 104
Y+ A G + DYP+ P R E A+AP R DW+
Sbjct 56 DDDYAPEEFAPGEDLVDEDPDDYPEFPSRRPATSGPQESPASAPSLRARRL-----DWRG 110
Query 105 GHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVI 164
GHRS GRRG SIGVIVALVAVVV+V VILWRFFGDALS RSHTAA RCVGG++ V V+
Sbjct 111 GHRSEGGRRGFSIGVIVALVAVVVVVGSVILWRFFGDALSKRSHTAAGRCVGGQEQVPVV 170
Query 165 ADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWI 224
ADPSIAD + + A+S+N SAGP+GD C+ V+V AGSDAV+NGFIGKWP ELGGQP LWI
Sbjct 171 ADPSIADAIGQFAESFNKSAGPIGDHCMVVSVKPAGSDAVLNGFIGKWPAELGGQPALWI 230
Query 225 PSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSL 284
P SS+SAARL GA + I++S SL SPV+LAVRPEL AL+ QNWAALPGLQTNPN+L
Sbjct 231 PGSSVSAARLAGATAQKTITESHSLASSPVVLAVRPELLPALSGQNWAALPGLQTNPNAL 290
Query 285 SGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLA 344
+GL+LPAWGSLRLA+P +GNGDAA+LAGEAVAAAS P GAP T G GAVRTL+ A+PKLA
Sbjct 291 AGLNLPAWGSLRLALPMTGNGDAAFLAGEAVAAASVPPGAPVTQGTGAVRTLLSAQPKLA 350
Query 345 DDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADY 404
D+SLT AM+TLLKPGD A+APVHAVVTTEQQLFQRGQSL DA+ L SWLPPG AAVADY
Sbjct 351 DNSLTEAMNTLLKPGDSASAPVHAVVTTEQQLFQRGQSLPDAKGALASWLPPGAAAVADY 410
Query 405 PTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPST 464
PTVLLSG+WL++EQ SAAS F+R++HK +QLAKLA+AGFRV+ KPPSSPVT+FPALPST
Sbjct 411 PTVLLSGSWLTREQASAASEFSRFMHKSDQLAKLAKAGFRVNGGKPPSSPVTTFPALPST 470
Query 465 LSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPS 524
LSVGDD+MRATLA+ M + S G A TIMLDQSMP EGG SRL+NV+ AL+++IKA+P S
Sbjct 471 LSVGDDAMRATLAEAMASPSTGQATTIMLDQSMPGQEGGKSRLANVIGALQDKIKALPAS 530
Query 525 SVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQ 584
+VVGLWTFDG EGR+EV +GPLADPVNGQPR AAL+AAL KQYSS GGAVSFTTLR+IYQ
Sbjct 531 AVVGLWTFDGHEGRSEVTSGPLADPVNGQPRSAALSAALDKQYSSSGGAVSFTTLRMIYQ 590
Query 585 EMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRA 644
+M +NY GQ NS+LVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVN+IDFGADPDR
Sbjct 591 DMQSNYHAGQTNSILVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNVIDFGADPDRT 650
Query 645 TWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
TWEAVAQLSGG YQNL TSASPDLATAVN FLS
Sbjct 651 TWEAVAQLSGGGYQNLATSASPDLATAVNAFLS 683
>gi|15828117|ref|NP_302380.1| hypothetical protein ML2070 [Mycobacterium leprae TN]
gi|221230594|ref|YP_002504010.1| hypothetical protein MLBr_02070 [Mycobacterium leprae Br4923]
gi|13093671|emb|CAC31025.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219933701|emb|CAR72167.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=733
Score = 872 bits (2254), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/714 (66%), Positives = 533/714 (75%), Gaps = 41/714 (5%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAE-QQHWEDISGSYD----------------------YP 37
MGRHS PDPEDS+D S+ AA ++I Y YP
Sbjct 24 MGRHSMPDPEDSIDQPSNQFAASGPDQSDEIDHGYQSRMGYPEPVFEPAATGSPSYRSYP 83
Query 38 -GVDQPDDGPLSSEGHYSAVGGYSAS----------GSEDYPDIPPRPDWEPTGAEPIAA 86
G + P D + Y A ++D+PD PPRP G+ +
Sbjct 84 HGAEHPADSTPEALDETIDYQSYWAEDRNEDLFVDGAADDHPDFPPRP----AGSSTSSQ 139
Query 87 APPPL---FRFGHRGPGDWQAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDAL 143
AP L F+ HR G WQ GHRS GRRGVSIGVI LVAVVV+V VI+W F G L
Sbjct 140 APTSLSHLFKASHRSVGKWQGGHRSDGGRRGVSIGVIATLVAVVVLVGAVIMWSFLGHIL 199
Query 144 SNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDA 203
+NR H AAARCVGG TVAV+ADPSIAD ++E A SYNASA PVGD C+ V V GS+A
Sbjct 200 NNRKHQAAARCVGGHQTVAVVADPSIADYLQEFAQSYNASARPVGDHCMMVTVKPVGSEA 259
Query 204 VINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQ 263
+ GF WP LG +P LWIP SSISAARL A + IS+S SLV SPVLLAVRPE +
Sbjct 260 ALTGFNDSWPANLGDKPALWIPGSSISAARLAVTADQKTISESHSLVTSPVLLAVRPEFE 319
Query 264 QALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAG 323
QALAN+ WAALPGLQTNPNSL+ L+LPAWGSLRLA+P +GN DA +LAGEAVA AS PAG
Sbjct 320 QALANKGWAALPGLQTNPNSLADLNLPAWGSLRLALPMNGNSDATFLAGEAVATASVPAG 379
Query 324 APATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSL 383
APA G+GAVRTLM A+PKLAD + AM TLLKPGDVATAPVHAV+TTEQQLFQRGQSL
Sbjct 380 APAIQGVGAVRTLMSAQPKLADSTWAEAMSTLLKPGDVATAPVHAVITTEQQLFQRGQSL 439
Query 384 SDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGF 443
SDA++ LGSWLP GPA VADYP VLL+G+WL+QEQ +AAS FAR++ KP+QLAKLA+AGF
Sbjct 440 SDAKSALGSWLPHGPAPVADYPAVLLNGSWLTQEQAAAASEFARFVQKPDQLAKLAKAGF 499
Query 444 RVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGG 503
RV+ V PPSS VTSF A+PST+SVGDD MRATL + M+ S+GVAATIMLDQSMP DEGG
Sbjct 500 RVNGVTPPSSSVTSFAAVPSTVSVGDDGMRATLVEEMIQPSSGVAATIMLDQSMPTDEGG 559
Query 504 NSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAAL 563
+RL+NVVAAL+++I AMPP+SV+GLWTFDG +G+TEV G LADPVNGQPR AALTAAL
Sbjct 560 KTRLANVVAALDDKINAMPPTSVMGLWTFDGHKGQTEVTTGQLADPVNGQPRSAALTAAL 619
Query 564 GKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKS 623
KQYSS GGAVSFTTLR+IYQEMLANY VGQ NSVLVITAGPHTDQTLDG LQDFIR S
Sbjct 620 DKQYSSNGGAVSFTTLRMIYQEMLANYHVGQTNSVLVITAGPHTDQTLDGARLQDFIRTS 679
Query 624 ADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
ADPAKPIAVN+IDFG DPD+ATW+AVAQ+SGGSYQNL TSAS DLATA+N FLS
Sbjct 680 ADPAKPIAVNVIDFGTDPDQATWKAVAQISGGSYQNLSTSASLDLATAINTFLS 733
>gi|2578378|emb|CAA15460.1| hypothetical protein MLCB1788.28 [Mycobacterium leprae]
Length=710
Score = 872 bits (2254), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/714 (66%), Positives = 533/714 (75%), Gaps = 41/714 (5%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAE-QQHWEDISGSYD----------------------YP 37
MGRHS PDPEDS+D S+ AA ++I Y YP
Sbjct 1 MGRHSMPDPEDSIDQPSNQFAASGPDQSDEIDHGYQSRMGYPEPVFEPAATGSPSYRSYP 60
Query 38 -GVDQPDDGPLSSEGHYSAVGGYSAS----------GSEDYPDIPPRPDWEPTGAEPIAA 86
G + P D + Y A ++D+PD PPRP G+ +
Sbjct 61 HGAEHPADSTPEALDETIDYQSYWAEDRNEDLFVDGAADDHPDFPPRP----AGSSTSSQ 116
Query 87 APPPL---FRFGHRGPGDWQAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDAL 143
AP L F+ HR G WQ GHRS GRRGVSIGVI LVAVVV+V VI+W F G L
Sbjct 117 APTSLSHLFKASHRSVGKWQGGHRSDGGRRGVSIGVIATLVAVVVLVGAVIMWSFLGHIL 176
Query 144 SNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDA 203
+NR H AAARCVGG TVAV+ADPSIAD ++E A SYNASA PVGD C+ V V GS+A
Sbjct 177 NNRKHQAAARCVGGHQTVAVVADPSIADYLQEFAQSYNASARPVGDHCMMVTVKPVGSEA 236
Query 204 VINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQ 263
+ GF WP LG +P LWIP SSISAARL A + IS+S SLV SPVLLAVRPE +
Sbjct 237 ALTGFNDSWPANLGDKPALWIPGSSISAARLAVTADQKTISESHSLVTSPVLLAVRPEFE 296
Query 264 QALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAG 323
QALAN+ WAALPGLQTNPNSL+ L+LPAWGSLRLA+P +GN DA +LAGEAVA AS PAG
Sbjct 297 QALANKGWAALPGLQTNPNSLADLNLPAWGSLRLALPMNGNSDATFLAGEAVATASVPAG 356
Query 324 APATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSL 383
APA G+GAVRTLM A+PKLAD + AM TLLKPGDVATAPVHAV+TTEQQLFQRGQSL
Sbjct 357 APAIQGVGAVRTLMSAQPKLADSTWAEAMSTLLKPGDVATAPVHAVITTEQQLFQRGQSL 416
Query 384 SDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGF 443
SDA++ LGSWLP GPA VADYP VLL+G+WL+QEQ +AAS FAR++ KP+QLAKLA+AGF
Sbjct 417 SDAKSALGSWLPHGPAPVADYPAVLLNGSWLTQEQAAAASEFARFVQKPDQLAKLAKAGF 476
Query 444 RVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGG 503
RV+ V PPSS VTSF A+PST+SVGDD MRATL + M+ S+GVAATIMLDQSMP DEGG
Sbjct 477 RVNGVTPPSSSVTSFAAVPSTVSVGDDGMRATLVEEMIQPSSGVAATIMLDQSMPTDEGG 536
Query 504 NSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAAL 563
+RL+NVVAAL+++I AMPP+SV+GLWTFDG +G+TEV G LADPVNGQPR AALTAAL
Sbjct 537 KTRLANVVAALDDKINAMPPTSVMGLWTFDGHKGQTEVTTGQLADPVNGQPRSAALTAAL 596
Query 564 GKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKS 623
KQYSS GGAVSFTTLR+IYQEMLANY VGQ NSVLVITAGPHTDQTLDG LQDFIR S
Sbjct 597 DKQYSSNGGAVSFTTLRMIYQEMLANYHVGQTNSVLVITAGPHTDQTLDGARLQDFIRTS 656
Query 624 ADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
ADPAKPIAVN+IDFG DPD+ATW+AVAQ+SGGSYQNL TSAS DLATA+N FLS
Sbjct 657 ADPAKPIAVNVIDFGTDPDQATWKAVAQISGGSYQNLSTSASLDLATAINTFLS 710
>gi|333990568|ref|YP_004523182.1| hypothetical protein JDM601_1928 [Mycobacterium sp. JDM601]
gi|333486536|gb|AEF35928.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=691
Score = 753 bits (1944), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/698 (60%), Positives = 486/698 (70%), Gaps = 30/698 (4%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWE--------------------DISGSYDYPGVD 40
MGRHS P P+D D+ D Q W+ YD P
Sbjct 1 MGRHSFPGPDDFDDEPLD----PDQDWDAAAPDPFGFGDPDDDEYVDDYQDAFYDEP--I 54
Query 41 QPDDGPLSSEGHYSAVGGYSASGSEDYPDIPP--RPDWEPTGAEPIAAAPPPLFRFGHRG 98
D G + Y G S S E P P P+G+ P A R G R
Sbjct 55 GGDAGYDADPAQYMRRGSGSRSEEETRYRTGPFGAPGALPSGSYPDREAERDQPRRGRRE 114
Query 99 PGDWQAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGK 158
W+ HR+ GRRGVS+GVI AL+AV+V+V VILWRFFG++LS RS AA C G
Sbjct 115 LERWRR-HRNDAGRRGVSVGVIAALIAVIVLVGTVILWRFFGNSLSQRSAIAAGNCAHGD 173
Query 159 DTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGG 218
TVAV+ADPSIAD V+ AD +N +A PVGDRCV+V V SDAV++GFIG WP +LG
Sbjct 174 LTVAVVADPSIADHVQGFADRFNKTAKPVGDRCVSVQVKPVDSDAVVSGFIGDWPAQLGQ 233
Query 219 QPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQ 278
+P LWIP SSISAARL +AG + +SDSRSLV SPVLLAVRP+L+ AL +QNWA LP LQ
Sbjct 234 RPALWIPGSSISAARLQASAGQETVSDSRSLVTSPVLLAVRPQLESALQHQNWANLPDLQ 293
Query 279 TNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMG 338
T+P+ L L L WG LRLA+P GNGDAA+LAGEAVA+ +AP GAPAT G GAV L G
Sbjct 294 TDPDGLGRLGLAGWGQLRLALPIGGNGDAAFLAGEAVASGAAPKGAPATDGTGAVHRLAG 353
Query 339 ARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGP 398
A+P LAD+SL AM+ LL+ GD A APVHAVVTTEQQLF RGQSLSD +TLGSWLPPGP
Sbjct 354 AQPHLADNSLAEAMNVLLRQGDSAAAPVHAVVTTEQQLFTRGQSLSDPASTLGSWLPPGP 413
Query 399 AAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSF 458
A VADYPTVLL G+WLSQEQ SAAS FAR+ KP+QLA LA+AGFRV V PPSS VT F
Sbjct 414 APVADYPTVLLVGSWLSQEQVSAASEFARFARKPDQLADLAKAGFRVEGVAPPSSDVTGF 473
Query 459 PALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRI 518
PALP TLSVGDD+MRATLA+ + T A TIMLD+SM DEGG +RL++VVAAL+ RI
Sbjct 474 PALPDTLSVGDDAMRATLANALTTLPGASAVTIMLDESMTTDEGGKTRLAHVVAALDQRI 533
Query 519 KAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTT 578
KA+PPSSVVGLWTFDG EG + + +GPL +PVNG R LT L S+ GGAVSFTT
Sbjct 534 KALPPSSVVGLWTFDGVEGHSVLTSGPLDEPVNGGTRAETLTRELDALSSTSGGAVSFTT 593
Query 579 LRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFG 638
LRL+Y ++LANY GQ NSVLVITAGPHTD+TLDGPGLQDFIR + DP +P+AVN+IDFG
Sbjct 594 LRLVYNQVLANYHPGQTNSVLVITAGPHTDRTLDGPGLQDFIRANTDPERPVAVNVIDFG 653
Query 639 ADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFL 676
D DRA W+AVAQLSGG+YQNL + +P+LA +N L
Sbjct 654 -DADRAVWQAVAQLSGGTYQNLRGANTPELAGTLNTLL 690
>gi|118471824|ref|YP_887944.1| hypothetical protein MSMEG_3641 [Mycobacterium smegmatis str.
MC2 155]
gi|118173111|gb|ABK74007.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=762
Score = 679 bits (1752), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/580 (62%), Positives = 430/580 (75%), Gaps = 1/580 (0%)
Query 98 GPGDWQAGHRSADGRR-GVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVG 156
G +W HR+ + RR GVS+GVIVALV+VVV+VA VI+W+F GDALS+RS AAARCV
Sbjct 182 GDSEWTGSHRAVESRRRGVSVGVIVALVSVVVLVAAVIVWKFVGDALSDRSDAAAARCVA 241
Query 157 GKDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTEL 216
G+ V VIADP+I+ ++ A+ YN SA PVGD+CV V V SA SD V++GF WP+EL
Sbjct 242 GEIGVPVIADPTISTHIESLANKYNQSASPVGDKCVKVRVQSAESDRVVSGFANSWPSEL 301
Query 217 GGQPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPG 276
G +P LWIPSSSI +ARL AGS+ +SDSRSLV SPV+LA PEL+ AL QNW LP
Sbjct 302 GDRPALWIPSSSIGSARLEATAGSETVSDSRSLVTSPVVLATSPELKTALGQQNWQKLPE 361
Query 277 LQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTL 336
LQ++P ++ GL LP WG+L+LA+P NGD AYL EAVA SAP+GAP TAG+GAV TL
Sbjct 362 LQSSPTAMDGLRLPNWGTLKLALPKLDNGDTAYLVAEAVAVTSAPSGAPPTAGMGAVSTL 421
Query 337 MGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPP 396
+ +PKL D L+ A D +L P D A APVHAV TTEQQLFQR +L DA + L WLP
Sbjct 422 LNGQPKLDDAELSTAFDAMLDPSDSAAAPVHAVATTEQQLFQRATTLDDAGSKLAGWLPQ 481
Query 397 GPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVT 456
GPAAVADYPTVLL+G+WL QEQ +AAS FARYL KPEQLA+LA+AGFR D P S VT
Sbjct 482 GPAAVADYPTVLLAGSWLEQEQVTAASEFARYLRKPEQLAELAKAGFRAEDATSPDSDVT 541
Query 457 SFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALEN 516
F + + +S+ D+S R TLA+ + A TIMLD+SMP DEGG SRL NVV AL N
Sbjct 542 DFGPIANPVSIADESTRVTLANATAAPVSSPAVTIMLDRSMPTDEGGRSRLQNVVEALTN 601
Query 517 RIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSF 576
R+KA+P +S VGLWTFDG EGR+EV GP+A+PV+G+ R L + L Q ++GGGAVSF
Sbjct 602 RLKALPVTSEVGLWTFDGTEGRSEVSMGPMAEPVDGRARSEVLNSTLEDQSAAGGGAVSF 661
Query 577 TTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIID 636
TTLRL+Y E AN+ G+ NSVLVIT GPHTD++LDGPGLQ+FIR + DPA+PIAVN+ID
Sbjct 662 TTLRLVYNEAKANFVEGRGNSVLVITTGPHTDRSLDGPGLQEFIRSNFDPARPIAVNVID 721
Query 637 FGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFL 676
FG D DR TWEAVAQ SGG Y NL TS +P+L T+V L
Sbjct 722 FGDDSDRETWEAVAQASGGDYVNLPTSTAPELVTSVATML 761
>gi|289750409|ref|ZP_06509787.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289690996|gb|EFD58425.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=341
Score = 677 bits (1746), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/341 (99%), Positives = 340/341 (99%), Gaps = 0/341 (0%)
Query 337 MGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPP 396
MGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPP
Sbjct 1 MGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPP 60
Query 397 GPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVT 456
GPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVT
Sbjct 61 GPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVT 120
Query 457 SFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALEN 516
SFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGN RLSNVVAALEN
Sbjct 121 SFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNGRLSNVVAALEN 180
Query 517 RIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSF 576
RIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSF
Sbjct 181 RIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSF 240
Query 577 TTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIID 636
TTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIID
Sbjct 241 TTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIID 300
Query 637 FGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
FGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS
Sbjct 301 FGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 341
>gi|126435425|ref|YP_001071116.1| hypothetical protein Mjls_2845 [Mycobacterium sp. JLS]
gi|126235225|gb|ABN98625.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=688
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/690 (56%), Positives = 468/690 (68%), Gaps = 17/690 (2%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
MGRHS PDPEDS DD + + + GS D P + P EG + G Y+
Sbjct 1 MGRHSIPDPEDSDDDAG---VPDDRIDDGGYGSDDGPSGRHSGEFPAQPEGADARQGDYT 57
Query 61 ASGSE-DYPDIPPRPDW------EPTGAEPIAA-APPPLFRFG--HRGP---GDWQAGHR 107
++ DY D ++ E P+ A A PP G H G GDW HR
Sbjct 58 DEYADGDYADSEYADEYADEYADEYADDHPVTAGAQPPAEPSGPAHGGTWDGGDWTGSHR 117
Query 108 SAD-GRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIAD 166
+ GRRG+SIGVI ALV VVV+V GVILWRFFGDALS RS A+ARCV G VAV+AD
Sbjct 118 AVTPGRRGLSIGVIAALVTVVVVVGGVILWRFFGDALSERSDAASARCVDGNLDVAVLAD 177
Query 167 PSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPS 226
PSIA+ + AD YN +A PVGDRCV V V AGS+ VI GF WP +LG +P LWIP+
Sbjct 178 PSIAETIGGLADQYNENAAPVGDRCVKVGVKPAGSEQVIKGFGDTWPGDLGERPALWIPA 237
Query 227 SSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSG 286
S +SAARL A + +SDSR+LV +PV+LAVRPEL+ ALA QNW LPGLQTNP +L G
Sbjct 238 SGVSAARLEAATDQKTVSDSRTLVSTPVVLAVRPELKPALAQQNWGTLPGLQTNPTALDG 297
Query 287 LDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADD 346
L LP WG+L+LA+P SGN DA+YLA EAVAAA++P GAPAT GI A+ TL P+L D
Sbjct 298 LGLPGWGALKLALPRSGNADASYLAAEAVAAAASPDGAPATDGISAINTLSAGAPELPAD 357
Query 347 SLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPT 406
+ AAM LL GD A APVHAV TTEQQ+ R S DA++ L SWLPPGP A ADYPT
Sbjct 358 TADAAMKALLTSGDPAKAPVHAVATTEQQVVARAASSPDAKSELASWLPPGPVATADYPT 417
Query 407 VLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLS 466
VLLSG WLS+EQ +AAS FAR++ +P+++ +LA+AGFR PP S VT FP L + LS
Sbjct 418 VLLSGDWLSREQVTAASQFARFMREPDRMNELAKAGFRTQGGTPPPSDVTDFPKLAAPLS 477
Query 467 VGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSV 526
VGDD+ R LA+ + + + TIMLD SMP EG N+R+ NVV AL R+ A+PP++
Sbjct 478 VGDDAARVKLAEALTSPAQASTTTIMLDLSMPGAEGDNTRMGNVVNALIPRVDALPPTTA 537
Query 527 VGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEM 586
+GLWTFD G +++ GPL++PV+GQPR AALT L S+ GGAVSFTTLRL+Y E
Sbjct 538 LGLWTFDAAAGNSQITTGPLSEPVDGQPRSAALTTTLDTLSSTSGGAVSFTTLRLVYNEA 597
Query 587 LANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATW 646
+AN+R GQ NSVLVIT GPHTD+TLDG GL+ FIR + DPA+P+AVN+IDFG DPDR TW
Sbjct 598 MANFRAGQPNSVLVITQGPHTDRTLDGAGLEAFIRDAFDPARPVAVNVIDFGDDPDRGTW 657
Query 647 EAVAQLSGGSYQNLETSASPDLATAVNIFL 676
E VA+ +GG YQNL TS SP+L A+ L
Sbjct 658 ETVARTTGGQYQNLTTSDSPELTAAITTLL 687
>gi|108799784|ref|YP_639981.1| hypothetical protein Mmcs_2818 [Mycobacterium sp. MCS]
gi|119868894|ref|YP_938846.1| hypothetical protein Mkms_2862 [Mycobacterium sp. KMS]
gi|108770203|gb|ABG08925.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119694983|gb|ABL92056.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=685
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/687 (56%), Positives = 465/687 (68%), Gaps = 14/687 (2%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
MGRHS PDPEDS DD + + + GS D P + P EG + G Y+
Sbjct 1 MGRHSIPDPEDSDDDAG---VPDDRIDDGGYGSDDGPSGRHSGEFPAQPEGADARQGDYT 57
Query 61 AS--GSEDYPDIPPRPDW--EPTGAEPIAA-APPPLFRFG--HRGP---GDWQAGHRSAD 110
DY D ++ E P+ A A PP G H G G+W HR+
Sbjct 58 TDEYADGDYADSEYADEYADEYADDHPVTAGAQPPAEPSGPAHGGTWDGGEWTGSHRAVT 117
Query 111 -GRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSI 169
GRRG+SIGVI ALV VVV+V GVILWRFFGDALS RS A+ARCV G VAV+ADPSI
Sbjct 118 PGRRGLSIGVIAALVTVVVVVGGVILWRFFGDALSERSDAASARCVDGNLDVAVLADPSI 177
Query 170 ADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSI 229
A+ + AD YN +A PVGDRCV V V AGS+ VINGF WP +LG +P LWIP+S +
Sbjct 178 AETIGGLADQYNENAAPVGDRCVKVGVKPAGSEQVINGFGDTWPGDLGERPALWIPASGV 237
Query 230 SAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDL 289
SAARL A + +SDSR+LV +PV+LAVRPEL+ ALA QNW LP LQTNP +L GL L
Sbjct 238 SAARLEAATDQKTVSDSRTLVSTPVVLAVRPELKPALAQQNWGTLPDLQTNPTALDGLGL 297
Query 290 PAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLT 349
P WG+L+LA+P SGN DA+YLA EAVAAA++P GAP T GI A+ TL P+L D+
Sbjct 298 PGWGALKLALPRSGNADASYLAAEAVAAAASPDGAPVTDGISAINTLSAGAPELPADTAD 357
Query 350 AAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLL 409
AAM LL GD A APVHAV TTEQQ+ R S DA++ L SWLPPGP A ADYPTVLL
Sbjct 358 AAMKALLTSGDPAKAPVHAVATTEQQVVARAASSPDAKSELASWLPPGPVATADYPTVLL 417
Query 410 SGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGD 469
SG WLS+EQ +AAS FAR++ +P+++ +LA+AGFR PP S VT FP L + LSVGD
Sbjct 418 SGDWLSREQVTAASQFARFMREPDRMNELAKAGFRTQGGTPPPSDVTDFPKLAAPLSVGD 477
Query 470 DSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGL 529
D+ R LA+ + + + TIMLD SMP EG N+R+ NVV AL R+ A+PP++ +GL
Sbjct 478 DAARVKLAEALTSPAQASTTTIMLDLSMPGAEGDNTRMGNVVNALIPRVDALPPTTALGL 537
Query 530 WTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLAN 589
WTFD G +++ GPL++PV+GQPR AALT L S+ GGAVSFTTLRL+Y E +AN
Sbjct 538 WTFDAAAGNSQITTGPLSEPVDGQPRSAALTTTLDTLSSTSGGAVSFTTLRLVYNEAMAN 597
Query 590 YRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAV 649
+R GQ NSVLVIT GPHTD+TLDG GL+ FIR + DPA+P+AVN+IDFG DPDR TWE V
Sbjct 598 FRAGQPNSVLVITQGPHTDRTLDGAGLEAFIRDAFDPARPVAVNVIDFGDDPDRGTWETV 657
Query 650 AQLSGGSYQNLETSASPDLATAVNIFL 676
A+ +GG YQNL TS SP+L A+ L
Sbjct 658 ARTTGGQYQNLATSDSPELTAAITTLL 684
>gi|315444286|ref|YP_004077165.1| hypothetical protein Mspyr1_26990 [Mycobacterium sp. Spyr1]
gi|315262589|gb|ADT99330.1| hypothetical protein Mspyr1_26990 [Mycobacterium sp. Spyr1]
Length=636
Score = 628 bits (1619), Expect = 1e-177, Method: Compositional matrix adjust.
Identities = 349/578 (61%), Positives = 432/578 (75%), Gaps = 1/578 (0%)
Query 100 GDWQAGHRSAD-GRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGK 158
G+W HR+ G R VS+GVIVALV+VVV+VA VILWRF GD LS+RS AAARCV G+
Sbjct 58 GEWTGSHRAVTPGPRKVSVGVIVALVSVVVVVAAVILWRFVGDTLSDRSDIAAARCVEGE 117
Query 159 DTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGG 218
VAVIADP+IAD V A YN +A PVGDRCV V VT A S V+NGF +WP +LG
Sbjct 118 VAVAVIADPAIADPVAALAQRYNETADPVGDRCVKVGVTPADSGEVVNGFGEQWPGDLGE 177
Query 219 QPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQ 278
+P LWIP+SS+S ARL + G + ISDSRSLV SPVLLAV EL+ AL ++W +LP LQ
Sbjct 178 RPALWIPASSVSEARLEASTGPETISDSRSLVTSPVLLAVAAELKDALGERDWGSLPDLQ 237
Query 279 TNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMG 338
+NPNSL GL L WGSLRLAMP + DA++LA EAVAAA+APAG PATAG+GAV TL+
Sbjct 238 SNPNSLDGLGLRGWGSLRLAMPLGDDSDASFLAAEAVAAATAPAGEPATAGLGAVSTLLS 297
Query 339 ARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGP 398
P+L+D A+D L+ D A APVHAVVTTEQ++FQR + DA++ +WLP GP
Sbjct 298 RAPELSDSDAGTALDALVDASDNAAAPVHAVVTTEQRVFQRASAAPDADSRPAAWLPSGP 357
Query 399 AAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSF 458
AA+AD+PTVLLSG WLSQEQ + AS FAR+L KPEQL +LA+AGFRV V+PP+S V F
Sbjct 358 AAIADFPTVLLSGDWLSQEQVTGASEFARFLRKPEQLGELAKAGFRVEGVEPPASDVIDF 417
Query 459 PALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRI 518
L + L+VGD+ +R T+ADT+ T+MLDQSMP DEGG +RL+NVV AL+ RI
Sbjct 418 APLSAPLAVGDNQVRTTIADTLTMPVETSTVTVMLDQSMPVDEGGATRLANVVDALQARI 477
Query 519 KAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTT 578
+ +PP S VGLWTFDG R+EV AGPL++PV+G PR ALTAAL +Q +SGGGAVSFTT
Sbjct 478 QVLPPDSGVGLWTFDGVGSRSEVGAGPLSEPVDGTPRSEALTAALDRQTASGGGAVSFTT 537
Query 579 LRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFG 638
LRL+Y + A +R GQ NSVLVIT GPHTD++L GLQD+IR + P +P+AVN+IDFG
Sbjct 538 LRLVYGDATARFREGQKNSVLVITTGPHTDRSLGAQGLQDYIRGAFTPERPVAVNVIDFG 597
Query 639 ADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFL 676
D DR TWE+VA+++GGSY+N+ SASPD A+A++ L
Sbjct 598 DDTDRPTWESVAEITGGSYRNVADSASPDTASAISEML 635
>gi|145223954|ref|YP_001134632.1| von Willebrand factor, type A [Mycobacterium gilvum PYR-GCK]
gi|145216440|gb|ABP45844.1| von Willebrand factor, type A [Mycobacterium gilvum PYR-GCK]
Length=636
Score = 613 bits (1582), Expect = 2e-173, Method: Compositional matrix adjust.
Identities = 348/578 (61%), Positives = 428/578 (75%), Gaps = 1/578 (0%)
Query 100 GDWQAGHRSAD-GRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGK 158
G+W HR+ G R VS+GVIVALV+VVV+VA VILWRF GD LS+RS AAARCV G+
Sbjct 58 GEWTGSHRAVTPGPRKVSVGVIVALVSVVVVVAAVILWRFVGDTLSDRSDIAAARCVEGE 117
Query 159 DTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGG 218
VAVIADP+IAD V A YN +A PVGDRCV V VT A S V+NGF +WP +LG
Sbjct 118 VAVAVIADPAIADPVAALAQRYNETADPVGDRCVKVGVTPADSGRVVNGFGEQWPGDLGE 177
Query 219 QPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQ 278
+P LWIP+SS+S ARL + G + ISDSRSLV SPVLLAV EL+ AL ++W +LP LQ
Sbjct 178 RPALWIPASSVSEARLEASTGPETISDSRSLVTSPVLLAVAAELKDALGERDWGSLPDLQ 237
Query 279 TNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMG 338
+NPNSL GL L WGSLRLAMP + DA++LA EAVAAA+APAG PATAG+GAV TL+
Sbjct 238 SNPNSLDGLGLRGWGSLRLAMPLGDDSDASFLAAEAVAAAAAPAGEPATAGLGAVSTLLS 297
Query 339 ARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGP 398
P+L+D A+D L D A APVHAVVTTEQ++FQR + DA++ +WLP GP
Sbjct 298 RAPELSDADAGTALDALADASDNAAAPVHAVVTTEQRVFQRASTAPDADSKPAAWLPSGP 357
Query 399 AAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSF 458
A +AD+PTVLLSG WLSQEQ + AS FAR+L KPEQL +LA+AGFRV V+PP+S V F
Sbjct 358 AVLADFPTVLLSGDWLSQEQVTGASEFARFLRKPEQLGELAKAGFRVEGVEPPASDVVDF 417
Query 459 PALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRI 518
L + L+VGD+ +R T+ADT+ T+MLDQSMP DEGG +RL+NVV AL+ RI
Sbjct 418 APLSAPLAVGDNQVRTTIADTLTMPVETSTVTVMLDQSMPVDEGGATRLANVVDALKARI 477
Query 519 KAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTT 578
+PP S VGLWTFDG GR+EV GPLADPV+G PR LTAAL +Q +SGGGAVSFTT
Sbjct 478 PVLPPDSGVGLWTFDGVAGRSEVAVGPLADPVDGTPRSEVLTAALDRQTASGGGAVSFTT 537
Query 579 LRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFG 638
LRL+Y + A +R GQ NSVLVIT GPHTD++L GLQD+IR + P +P+AVN+IDFG
Sbjct 538 LRLVYGDATARFREGQKNSVLVITTGPHTDRSLGAQGLQDYIRGAFTPDRPVAVNVIDFG 597
Query 639 ADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFL 676
D DR TWE+VA+++GGSY+N+ S SPDL++A++ L
Sbjct 598 DDADRPTWESVAEITGGSYRNMADSTSPDLSSAISEML 635
>gi|120404077|ref|YP_953906.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1]
gi|119956895|gb|ABM13900.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1]
Length=640
Score = 575 bits (1483), Expect = 7e-162, Method: Compositional matrix adjust.
Identities = 371/677 (55%), Positives = 464/677 (69%), Gaps = 37/677 (5%)
Query 1 MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYS 60
MGRHS PDP++S D SGS P D G + G + GG+
Sbjct 1 MGRHSLPDPDES----------------DQSGS---PARGFGDFGESADSGEF---GGFR 38
Query 61 ASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVI 120
AS D P P +G + + HR GRR VS+GVI
Sbjct 39 AS------DTPGSPTAPRSGPQHSGGWEGGEWTGSHRA---------VTPGRRKVSLGVI 83
Query 121 VALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESADSY 180
VALVAVVV+VA VI+WRF GDALS RS AAARCV G+ VAV+ADP+IA+ V A+ Y
Sbjct 84 VALVAVVVVVATVIVWRFVGDALSGRSDVAAARCVEGEVAVAVVADPAIAEPVAALAERY 143
Query 181 NASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS 240
N +A PVGDRCV V V SA SD V+NGF G+WP +LG +P LWIP+SS+S ARL A G+
Sbjct 144 NETAAPVGDRCVKVGVKSADSDQVLNGFSGQWPGDLGERPALWIPASSVSGARLEAATGA 203
Query 241 QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP 300
+ +SDSRSLV SPV+LAV P L+ AL QNW LP LQT+P +L GL L WG LRLA+P
Sbjct 204 ETVSDSRSLVTSPVVLAVAPALKDALGQQNWGTLPRLQTDPAALDGLGLQGWGGLRLALP 263
Query 301 SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGD 360
+ DA+YLA EA+AAA+AP+GAPA+AG+GAV T+M P+LAD + A+D L+ D
Sbjct 264 LGDDSDASYLAAEAIAAAAAPSGAPASAGLGAVSTVMSGAPELADPNAGTAIDALVGAAD 323
Query 361 VATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTS 420
A APVHAVVTTEQ++FQR SL D+++ L +W+PPGP A AD+PTVLL+G WLSQEQ +
Sbjct 324 QAAAPVHAVVTTEQRVFQRASSLPDSKDKLAAWIPPGPTATADFPTVLLAGDWLSQEQVT 383
Query 421 AASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM 480
AAS FAR++ KPEQL +LA+AGFRV PP+S V F + + L VGD+++R+T+A+T+
Sbjct 384 AASEFARFMRKPEQLGELAKAGFRVEGTAPPASDVVDFAPVSAPLEVGDNALRSTIAETL 443
Query 481 VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTE 540
T T+MLDQSMP +EGG SRL NV+ AL+ RI +P S VGLWTFDG +GR+
Sbjct 444 ATPVGSPTVTVMLDQSMPVEEGGVSRLQNVIDALKARIAVLPADSGVGLWTFDGVQGRSA 503
Query 541 VPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV 600
V GPL++PV+G PR ALTAAL Q SGGGAVSFTTLRL+Y + YR GQ NSVLV
Sbjct 504 VSVGPLSEPVDGAPRKEALTAALDSQSPSGGGAVSFTTLRLVYTDASTKYREGQKNSVLV 563
Query 601 ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
IT GPHTDQ+L GLQD+IR + + +P+AVN+IDFG D DRATWE+VAQ++GG+YQNL
Sbjct 564 ITTGPHTDQSLGAAGLQDYIRGAFNRDRPVAVNVIDFGDDSDRATWESVAQITGGNYQNL 623
Query 661 ETSASPDLATAVNIFLS 677
TSASP+LA A++ LS
Sbjct 624 GTSASPELAAAISSMLS 640
>gi|169629494|ref|YP_001703143.1| hypothetical protein MAB_2408c [Mycobacterium abscessus ATCC
19977]
gi|169241461|emb|CAM62489.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=611
Score = 471 bits (1211), Expect = 2e-130, Method: Compositional matrix adjust.
Identities = 293/623 (48%), Positives = 384/623 (62%), Gaps = 13/623 (2%)
Query 56 VGGYSASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGV 115
+G +SASGS D WE A + F G WQ HRS GV
Sbjct 1 MGRHSASGSGSPNDPEDNDGWEADSAPGSESGSGSEFDTGS-----WQRSHRSGGSNWGV 55
Query 116 SIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKE 175
S G+I A+ AV+V+ + LW +F S+ AAA CV G + +AV+ADPSIAD++ E
Sbjct 56 SKGLIGAVAAVLVVAVSIGLWWYFDRRTSDNQAEAAATCVHGNNAIAVVADPSIADRIGE 115
Query 176 SADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLT 235
++ +N +GD C V+V A S VI G G+WP ELG QP LWIP SSIS+ARL
Sbjct 116 LSERFNQKHEVIGDYCFTVSVRPADSANVIKGLTGQWPAELGEQPALWIPGSSISSARLK 175
Query 236 GAAGSQAISDSRSLVISPVLLAVRPELQQALAN-QNWAALPGLQTNPNSLSGLDLPAWGS 294
A+ + +SDSRSLV +PV++AV P+L+QA+ N ++WA +P LQ PNSL G+ LP WGS
Sbjct 176 AASKTNIVSDSRSLVSTPVVIAVTPKLRQAIPNDKSWADVPALQNVPNSLDGVGLPGWGS 235
Query 295 LRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTAAMDT 354
LRLA+PSSGN DAA LA EAVAAAS G G GA +L PKL +++ A+
Sbjct 236 LRLALPSSGNADAAQLAAEAVAAASVRPGDSPELGAGAAGSLAATAPKLPANNVADAIGA 295
Query 355 LLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWL 414
LL G+ A VHAVVTTEQQL+ R ++ DA+ + W P G +ADYPTV L GAWL
Sbjct 296 LLDGGEQPGAAVHAVVTTEQQLYARTRNNGDAKKVIAQWQPAGATPIADYPTVQLDGAWL 355
Query 415 SQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRA 474
S+EQ +AAS FAR+L +Q+ LA AGFR P+S V SF + LS+ +D +R
Sbjct 356 SEEQHTAASQFARFLGDKDQIKDLAAAGFRAEGTDLPTSDVVSFAKIDKPLSI-EDKVRV 414
Query 475 TLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDG 534
LAD T S TIML S D ++LS++ L NR++A+ P S +GLW +DG
Sbjct 415 ALADGTSTGSG--TTTIMLASSPAPD----AKLSDITGPLANRVRALAPGSGIGLWVYDG 468
Query 535 REGRTEVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQ 594
+EG T V G D V G PR ++ AL +G GAV++TTLR +YQ+ +A +R Q
Sbjct 469 KEGNTVVRLGGAGDDVEGMPRSQSVADALTALQPTGNGAVAYTTLRALYQDAVAGFRPNQ 528
Query 595 ANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSG 654
NSVLV+ HTDQTLDGPGL D I + DPAKP+ +N++DFGAD D+ TW+ +AQ SG
Sbjct 529 VNSVLVVAGRSHTDQTLDGPGLIDTINRLKDPAKPVRINVLDFGADSDQQTWQTIAQQSG 588
Query 655 GSYQNLETSASPDLATAVNIFLS 677
G+YQN+ S SP+LA A+ F+S
Sbjct 589 GAYQNVSASNSPELAAAIARFIS 611
>gi|111017918|ref|YP_700890.1| hypothetical protein RHA1_ro00900 [Rhodococcus jostii RHA1]
gi|110817448|gb|ABG92732.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=595
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 214/605 (36%), Positives = 314/605 (52%), Gaps = 46/605 (7%)
Query 106 HRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIA 165
HR RG+S G ++ L VVV+ AGV+ W D ++++ AA CV G+ +AV A
Sbjct 4 HRGESRARGISRGPLIVLGLVVVIAAGVVGWFQLRDRITDQGVAAAGACVEGESVLAVAA 63
Query 166 DPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFI----GKW-PTELGGQP 220
DP IA Q++ AD + +A + D+CV+V VT+ SD V + G W LG +P
Sbjct 64 DPDIAPQLQTLADHFTETAPVIRDQCVSVTVTAVASDTVRDALSAGPDGPWDAAALGPRP 123
Query 221 GLWIPSSSISAARL--TGAAGSQAISDSRSLVISPVLLAVRPELQQALANQ--NWAALPG 276
LWIPSSS S +L TG +A R + SPV+LAVR A +W LP
Sbjct 124 ALWIPSSSHSVKQLSATGVISGEA----RPVATSPVVLAVRTAFANAPGTPAIDWKDLPS 179
Query 277 LQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAAS--APAG---------AP 325
LQT +SL+ L LP WGSL +A+P ++ A EAVAAA +P G AP
Sbjct 180 LQTGRDSLATLGLPGWGSLGMALPVGPGAESTETAVEAVAAAVTGSPTGPVTEEQARSAP 239
Query 326 ATAGIGAV----RTLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQ 381
T+ + + G +P ++LTA L + GD AT+ +HAV TEQQ++ Q
Sbjct 240 VTSALTGLALGYEASSGTKPATTREALTA----LAEQGDPATSGIHAVAATEQQVY---Q 292
Query 382 SLSDAENT-LGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLAR 440
+L DA + + +P GP VAD+P +L+G + + Q+ AA+ FA ++ +PEQ LA
Sbjct 293 ALRDAPGADITASMPKGPTPVADHPAAVLAGPSVDETQSRAAAQFAEFVRRPEQAQDLAD 352
Query 441 AGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLD--QSMP 498
AGFRV + P +FP STL D + A L + +T++LD SM
Sbjct 353 AGFRVEGLARPDDTALAFPGFESTLVPADAAAAAELMQVIRNPITPRTSTVLLDVSSSMG 412
Query 499 NDEGGNSRLSNVVAALENRIKAMPPSSVVGLW----TFDG-REGRTEVPAGPL-ADPVNG 552
EG +RL+N AAL + P SS +GLW T DG R T V GPL A
Sbjct 413 EREGTATRLANTTAALAAHVDRSPDSSNLGLWEYSTTLDGSRPYTTVVATGPLSAGGFTE 472
Query 553 QPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLD 612
R AL A L + + G+ ++T+L Y+ + Y G+ NSVL++T G + D ++
Sbjct 473 GTRRQALDARLA-EATPATGSSTYTSLEAAYKSAVDGYSPGRTNSVLLVTDGAN-DDSVA 530
Query 613 GPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAV 672
L I ++ +KP+ ++++ G + D T +A+A +GGS + + +S L TA+
Sbjct 531 RADLLSAIAAASSTSKPVRIDVVTIGENSDLNTLQALADRTGGSLEKVASSDGAALPTAI 590
Query 673 NIFLS 677
+ LS
Sbjct 591 SKLLS 595
>gi|226360049|ref|YP_002777827.1| hypothetical protein ROP_06350 [Rhodococcus opacus B4]
gi|226238534|dbj|BAH48882.1| hypothetical protein [Rhodococcus opacus B4]
Length=595
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 216/605 (36%), Positives = 313/605 (52%), Gaps = 46/605 (7%)
Query 106 HRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIA 165
HR RG+S G ++ L VVV+V GV+ W D ++++ AA CV G+ +AV A
Sbjct 4 HRGESRARGISKGPLIVLGLVVVLVLGVLGWFQLRDRINDQGAAAAGACVEGESVLAVAA 63
Query 166 DPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFI----GKW-PTELGGQP 220
DP IA Q++ AD Y +A + D+CV+V VT+ SD V + G W G +P
Sbjct 64 DPDIAPQLQTLADHYAETAPVIRDQCVSVTVTAVASDTVRDALAAGPDGPWDAAAFGPRP 123
Query 221 GLWIPSSSISAARL--TGAAGSQAISDSRSLVISPVLLAVRPELQQA--LANQNWAALPG 276
LWIPSSS S +L TG +A R L SPV+LAVR A A +W LP
Sbjct 124 ALWIPSSSHSVKQLSATGVISGEA----RPLASSPVVLAVRTAFANAPGTAALDWKDLPS 179
Query 277 LQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVA---------------AASAP 321
LQ+ ++L+ L LP WG L LA+P ++ +A EAVA A S P
Sbjct 180 LQSGRDALATLGLPGWGGLGLALPVGPGAESTEMAVEAVAAAVTGSSTGPVTEEQARSVP 239
Query 322 AGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQ 381
+ T GA+P ++LTA L + GD AT+ +HAV TTEQQ++ Q
Sbjct 240 VTSALTDLALGYEASTGAKPATTREALTA----LAEQGDPATSAIHAVATTEQQVY---Q 292
Query 382 SLSDAENT-LGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLAR 440
+L DA + + +P GP VAD+P +L+G + + Q+ AA+ FA ++ +PEQ LA
Sbjct 293 ALRDAPGADITTSMPKGPTPVADHPAAVLAGPAVDETQSRAAAQFAEFVRRPEQAQVLAD 352
Query 441 AGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLD--QSMP 498
AGFRV + P +FP + S L D + A L + + +TI+LD SM
Sbjct 353 AGFRVEGLARPDDTTLAFPGVESALVPADAAAAAELMQVIRNPISPRTSTILLDVSSSMG 412
Query 499 NDEGGNSRLSNVVAALENRIKAMPPSSVVGLW----TFDG-REGRTEVPAGPL-ADPVNG 552
EG ++RL+N AL + P SS +GLW T DG R T V GPL A
Sbjct 413 EREGTSTRLANTTMALAAHVDQSPDSSNLGLWEYSTTLDGSRPYTTVVATGPLSAGGFTE 472
Query 553 QPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLD 612
R AL A L + S+ G + ++T+L Y+ + Y G+ NSVL++T G + D ++
Sbjct 473 GTRRQALDARLARATSATGSS-TYTSLEAAYKSAVDGYTPGRTNSVLLVTDGAN-DDSVS 530
Query 613 GPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAV 672
L I S+ +KP+ ++++ G +PD T +A+A +GGS + + TS L TA+
Sbjct 531 RAELLSAIAASSSMSKPVRIDVVTIGENPDLNTLQALADRTGGSLEKVTTSDGAALPTAI 590
Query 673 NIFLS 677
+ LS
Sbjct 591 SKLLS 595
>gi|23821225|emb|CAD52984.1| hypothetical protein [Rhodococcus fascians D188]
Length=571
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 194/605 (33%), Positives = 291/605 (49%), Gaps = 88/605 (14%)
Query 106 HRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIA 165
HR++ R ++ G ++ ++ + V+VA V W D +S++ AA CV G + V A
Sbjct 4 HRNSGRGRSIAAGPVIVVLTIAVLVAAVFGWFALRDRISDQGIEAADTCVEGPAVLTVAA 63
Query 166 DPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAV---INGFIGKWPT-ELGGQPG 221
DP I+ +++ A ++A+A + D CV V V + SDAV +N W LG +PG
Sbjct 64 DPDISAAIEQLATRFDATAPVIRDHCVTVEVRAIASDAVRAALNSGADNWDVGALGARPG 123
Query 222 LWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRP---------ELQQALANQNWA 272
LWIP SS A +A+ D ++ +P +A P A NWA
Sbjct 124 LWIPQSS---------ADVEAVVDRGAIDGTPRPVASSPIVLAAPVAVADAVVAAGSNWA 174
Query 273 ALPGLQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGI-- 330
L LQ +P +L LP WG LRLA+PS + A+ LA A+AA G P TA
Sbjct 175 DLIRLQRDPQALG---LPEWGGLRLAVPSGSDTGASTLAVAAIAAGV--RGDPTTALTVD 229
Query 331 -GAVRTLMGARPKLADDSLT-------------AAMDTLLKPGDVATAPVHAVVTTEQQL 376
+ L+ A +LA AA++ P A VHAV TEQQL
Sbjct 230 ETSSTQLVTAMSELAVTDTGAAATTASTTYDALAALENAAGPD----AAVHAVPVTEQQL 285
Query 377 FQRGQSLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLA 436
S TL + P G VAD+P V+LSG + T AA+AF ++ +P+
Sbjct 286 ASTDSS------TLTAVRPQGATPVADHPAVVLSG---DETSTRAAAAFVDFVRQPDGTQ 336
Query 437 KLARAGFRVSD------VKPPSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAAT 490
L AGF V + V PPS PV D++ A L + ++ AT
Sbjct 337 TLLDAGFSVDEPQDAGIVAPPSGPVA-------------DALLAVLRNPVLPRR----AT 379
Query 491 IMLD--QSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTF-----DGREGRTEVPA 543
++LD +SM EGG +RL N V AL + + +P ++ +GLW+F + R +VP
Sbjct 380 VLLDVSESMRTTEGGATRLQNTVRALSEQFRRVPDATELGLWSFSEDLNNSLPFRVDVPT 439
Query 544 GPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITA 603
GP+ PV PR +AL A + + G+ ++ ++ + Y + +A Y G+ NSV++IT
Sbjct 440 GPMTVPVGTTPRRSALDAT-AEALTPATGSFTYASVLVAYLDAVAGYVPGRVNSVVLITD 498
Query 604 GPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETS 663
GP D L L + ++DPA+P+AVN++ G + +A +GG+ + TS
Sbjct 499 GPD-DSPLSADELLTELTSASDPARPVAVNVVRIGDGSPAPVFTDIAARTGGTVDTVPTS 557
Query 664 ASPDL 668
SPDL
Sbjct 558 DSPDL 562
>gi|312139824|ref|YP_004007160.1| hypothetical protein REQ_24380 [Rhodococcus equi 103S]
gi|311889163|emb|CBH48477.1| putative secreted protein [Rhodococcus equi 103S]
Length=581
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 197/596 (34%), Positives = 292/596 (49%), Gaps = 42/596 (7%)
Query 106 HRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIA 165
HRS RGVS G ++ALV+VVV+V GV+ W D S++ AA CV G+ + V A
Sbjct 4 HRSDRRTRGVSKGPVIALVSVVVIVLGVVGWFQLRDRASSQGTAAAGACVEGEVRLDVAA 63
Query 166 DPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFI----GKWPTELGGQPG 221
DPSIA V++ A + + V D CV+V V A + AV G W +LG P
Sbjct 64 DPSIAAPVRDLAARFTDTLPVVRDHCVSVTVYDAPTAAVTEALAAAPDGPWQEDLGPAPA 123
Query 222 LWIPSSSISAARLTGAAGSQAISDS--RSLVISPVLLAVRPELQQAL--ANQNWAALPGL 277
LWIP+S + RL GA + D + L SPV++ +L AL + W LP L
Sbjct 124 LWIPASGTAIDRLAGA----GVVDGSPKPLASSPVVVVAPEDLAAALTASGTGWQNLPAL 179
Query 278 QTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLM 337
Q++ +SL G+ L WG L+LA+P+ D+A AA + P A ++
Sbjct 180 QSDKDSLDGIGLRGWGGLKLALPA--GPDSAAALDAVAAATANAGTGPLDETQAASPQVV 237
Query 338 GARPKLADDS---------LTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAEN 388
A LA+ S A++ L D A+APVHAV TEQQL+ G D
Sbjct 238 AAVGALANGSKAIDAAHATTADAVELLAGRSDPASAPVHAVPATEQQLYAAG----DDAR 293
Query 389 TLGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDV 448
L ++ P G V D+P +L+ W+ + + AA+ F ++ +PE + + AGFRV D
Sbjct 294 GLVAYAPAGATPVLDHPATILATPWVDETRGRAAAQFVDFMRQPESVQQFVDAGFRVGDR 353
Query 449 KPPSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQ--SMPNDEGGNSR 506
P ++ T P L L+ LA T + A TI+LD SM +G +R
Sbjct 354 TPAATDRTPMPELGQVLTPATGPAATRLAQTFANPAVPQATTILLDVSGSMGYTDGDGTR 413
Query 507 LSNVVAALENRIKAMPPSSVVGLWTFD-GREG----RTEVPAGPLADPVNGQPRPAALTA 561
LSN V AL RI A+P SS VGLW + G +G +VP GPL+D Q AAL
Sbjct 414 LSNTVDALSARIAALPTSSDVGLWVYSRGLDGAKPYLVKVPTGPLSDGDRRQRIEAAL-- 471
Query 562 ALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIR 621
+ ++ ++ + + + G+ NSVL++T GP+ D ++ G Q ++
Sbjct 472 ---RSLRPATATSTYASVIAAHDSAVDGFVDGRPNSVLLVTDGPNDDTSV---GTQKLMQ 525
Query 622 KSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
A P+ V+++ G + D+AT ++A +GG+ + ++ P L A LS
Sbjct 526 SLTGAAHPVRVDVVSIGENSDQATLRSMADRTGGTLIAVPSTQGPALGDAFAKTLS 581
>gi|54024463|ref|YP_118705.1| hypothetical protein nfa24940 [Nocardia farcinica IFM 10152]
gi|54015971|dbj|BAD57341.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=554
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 211/594 (36%), Positives = 287/594 (49%), Gaps = 77/594 (12%)
Query 106 HRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIA 165
HRS RGVS G ++A+VAV+++VA V W F D + R AAA CV G TV+V
Sbjct 4 HRSGTRSRGVSKGPVIAVVAVLLLVAAVFAWFQFRDRAAERDSAAAADCVEGSATVSVTV 63
Query 166 DPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFI--GKWPTELGGQPGLW 223
DP I V+ A+ YNA+ V D CV V VT+ S AV++ F G W + LG QP LW
Sbjct 64 DPGIEAPVRAIAEKYNATDPQVRDHCVTVTVTAQPSAAVVDAFRAGGPWDSTLGPQPALW 123
Query 224 IP--SSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQN--WAALPGLQT 279
IP S S+ A R+ G + + +PV LAV L+ ALA N WA LP LQ
Sbjct 124 IPDSSRSVEAMRVPGLVAGE----PSPIAATPVALAVPEPLRAALAQANVAWADLPRLQQ 179
Query 280 NPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAP---AGAPATAGIGAVRTL 336
SL L L WG LRLA+P A + A A P +GA + + AV L
Sbjct 180 --GSLDELGLSGWGGLRLALPEGDAALAVATSVAAAVAGEEPLTESGAASGQAVAAVSGL 237
Query 337 MGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPP 396
P D + A APVHAV TEQQ+ Q L ++ P
Sbjct 238 AVGAPDAGDTAAALAAAG------GGNAPVHAVAATEQQIAAHPQ--------LTAFRPA 283
Query 397 GPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVT 456
G VAD+P LLSG W+ Q Q AA F YL P+Q A AGF
Sbjct 284 GTTPVADHPAALLSGPWVDQTQNLAAGMFVDYLRHPDQAAFFTTAGFT------------ 331
Query 457 SFPALPSTLSVGDD-----SMRATLADTMVTASAGVAATIMLD--QSMPNDEGGNSRLSN 509
A P+ G D ++RATL + ++ GV T+++D SM EG +RL+N
Sbjct 332 ---ATPAPTGAGADRAALETVRATLDNPVL----GVHTTVLVDVSASMATTEGSTTRLAN 384
Query 510 VVAALENRIKAMPPSSVVGLWTF----DG-REGRTEVPAGPLADPVNGQPRPAALTAALG 564
+ AL + + MPP +G+WTF DG R + P G L D A AA+
Sbjct 385 TLGALRSTMTVMPPDFGLGVWTFGKNLDGNRPYEVQAPTGLLTD---------AQRAAVD 435
Query 565 KQYSSGGGA-----VSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDF 619
+ SS ++ TL Y++ + N++ G+ N+VL++T GP D + GP L
Sbjct 436 QALSSVRATDTRPDQAYPTLLAAYRQAVQNHQAGRTNTVLLVTDGPDDDSAVTGPQLLAD 495
Query 620 IRKSADPAKPIAVNIIDF-GADPDRATWEAVAQLSGGSYQNLETSASPDLATAV 672
+ +ADPA+P+ +++I GA D T + A+ +GGSY + TS TA+
Sbjct 496 LAAAADPARPVRIDVIVVGGAGTD--TLQTAAERTGGSYTTVPTSNDLAFGTAM 547
>gi|325674365|ref|ZP_08154054.1| hypothetical protein HMPREF0724_11836 [Rhodococcus equi ATCC
33707]
gi|325555045|gb|EGD24718.1| hypothetical protein HMPREF0724_11836 [Rhodococcus equi ATCC
33707]
Length=609
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 199/596 (34%), Positives = 294/596 (50%), Gaps = 42/596 (7%)
Query 106 HRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIA 165
HRS RGVS G ++ALV+VVV+V GV+ W D S++ AA CV G + V A
Sbjct 32 HRSDRRTRGVSKGPVIALVSVVVIVLGVVGWFQLRDRASSQGTAAAGACVEGDIRLDVAA 91
Query 166 DPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFI----GKWPTELGGQPG 221
DPSIA V++ A + + V D CV+V V A + AV G W +LG P
Sbjct 92 DPSIAAPVRDLAARFADTLPVVRDHCVSVTVYDAPTAAVTEALAAAPDGPWQEDLGPAPA 151
Query 222 LWIPSSSISAARLTGAAGSQAISDS--RSLVISPVLLAVRPELQQAL--ANQNWAALPGL 277
LWIP+S + RL GA + D + L SPV++ +L AL + W LP L
Sbjct 152 LWIPASGTAIDRLAGA----GVVDGSPKPLASSPVVVVAPEDLAAALTASGTGWQNLPAL 207
Query 278 QTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAA--------SAPAGAPATAG 329
Q+ +SL G+ L WG L+LA+P+ + AA A A A + A A
Sbjct 208 QSGKDSLDGIGLRGWGGLKLALPAGPDSAAALDAVAAATANAGTGPLDETQAASPQVVAA 267
Query 330 IGAVRTLMGARPKLADDSLTA-AMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAEN 388
+GA+ G++ A + TA A++ L D A+APVHAV TEQQL+ G D
Sbjct 268 VGALAN--GSKAIDAAPTTTADAVELLAGRSDPASAPVHAVPATEQQLYAAG----DDAR 321
Query 389 TLGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDV 448
L ++ P G V D+P +L+ W+ + + AA+ F ++ +PE + + AGFRV D
Sbjct 322 GLVAYAPTGATPVLDHPATILATPWVDETRGRAAAQFVDFMRQPESVQQFVDAGFRVGDR 381
Query 449 KPPSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQ--SMPNDEGGNSR 506
P ++ T P L L+ LA T + A TI+LD SM +G +R
Sbjct 382 TPAATDRTPMPELGQVLTPATGPAATRLAQTFANPAVPQATTILLDVSGSMGYTDGDGTR 441
Query 507 LSNVVAALENRIKAMPPSSVVGLWTFD-GREG----RTEVPAGPLADPVNGQPRPAALTA 561
LSN V AL RI A+P SS VGLW + G +G +VP GPL+D Q AAL
Sbjct 442 LSNTVDALSARIAALPTSSDVGLWVYSRGLDGAKPYLVKVPTGPLSDGDRSQRIEAAL-- 499
Query 562 ALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIR 621
+ ++ ++ + + + G+ NSVL++T GP+ D ++ G Q ++
Sbjct 500 ---RSLRPATATSTYASVIAAHDSAVDGFVDGRPNSVLLVTDGPNDDTSV---GTQKLMQ 553
Query 622 KSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
A P+ V+++ G + D+ T ++A +GG+ + ++ P L A LS
Sbjct 554 SLTGAAHPVRVDVVSIGENSDQETLRSMADRTGGTLIAVPSTQGPALGDAFAKTLS 609
>gi|343928469|ref|ZP_08767917.1| hypothetical protein GOALK_117_00750 [Gordonia alkanivorans NBRC
16433]
gi|343761654|dbj|GAA14843.1| hypothetical protein GOALK_117_00750 [Gordonia alkanivorans NBRC
16433]
Length=410
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 130/356 (37%), Positives = 188/356 (53%), Gaps = 11/356 (3%)
Query 113 RGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQ 172
RGVS GV+ AL++++++ A V+ WR GD ++ ++ AAA+CV G +V +IADP IA
Sbjct 14 RGVSRGVVFALLSILLVAAIVVTWRDLGDRINRQADDAAAQCVEGATSVPIIADPDIAPG 73
Query 173 VKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTE-LGGQPGLWIPSSSISA 231
+ A S+ + V D CV +AV + ++G GKW E +G P W+P SS+ +
Sbjct 74 LAAIATSFTNTKPVVRDHCVTIAVRPGDAKITLDGLTGKWDAESMGAYPAAWVPQSSVWS 133
Query 232 ARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQ-NWAALPGLQTNPNSLSGLDLP 290
A L A +SRSLV +PV+LAV PEL +A +Q +W+ +P LQ SL+ L
Sbjct 134 ADLATAKPDLIEGNSRSLVSTPVVLAVSPELAKAAGDQLDWSQIPLLQQRDASLTEFGLQ 193
Query 291 AWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAG-------IGAVRTLMGARPKL 343
WGSLR+AMP DA+ LA +AVA P T +V+ ++G P
Sbjct 194 GWGSLRMAMPIGAQSDASALAAQAVATRVTRTTGPLTTADAESPRVTSSVKAMLGGAPLS 253
Query 344 ADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVAD 403
D + A + D A A +HAV TEQ+L+Q + +D L +P GP +AD
Sbjct 254 PDGTPQGAATAIANAADPAKAEIHAVPITEQRLYQITK--TDQPARLSEVIPSGPTPIAD 311
Query 404 YPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFP 459
YP + LSG + A + F Y +PEQL L GFR P ++ +FP
Sbjct 312 YPIIRLSGPEVGDVAADAVADFISYASQPEQLKLLTELGFRGDAPMPSATATVTFP 367
>gi|229494861|ref|ZP_04388614.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121]
gi|229318219|gb|EEN84087.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121]
Length=564
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 186/592 (32%), Positives = 278/592 (47%), Gaps = 51/592 (8%)
Query 106 HRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIA 165
HR G RG+S G I+ +V +V++VAG W+ + + ++ AA CV G T+ V A
Sbjct 4 HRGGSGARGISKGPILVVVLIVLIVAGFFGWKALSNRIDDQGQQAAGTCVEGNKTLDVTA 63
Query 166 DPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVIN----GFIGKW-PTELGGQP 220
DPSIA QV+E A Y ++ V D C+AV V A S +V G W LG +P
Sbjct 64 DPSIAPQVEELAKRYTQTSPVVRDHCIAVVVHGAPSASVSTALEAGPAAPWDDAALGPRP 123
Query 221 GLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPEL--QQALANQNWAALPGLQ 278
LWIP+SS A L+G A D RSL SP++LA PE A A +W +LP
Sbjct 124 SLWIPTSSFELASLSGKAVING--DPRSLASSPIVLAAGPETAAALAGAASSWKSLP--- 178
Query 279 TNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPAT---AGIGAVRT 335
L +A+P G+ + + A A + P T A V
Sbjct 179 --------------SDLTVALP-VGSTETSMAAQAIAADVADAGAGPVTMDQAKSAQVNA 223
Query 336 LMGARPKLADDSLTAAMDTLLKPGDVATAP---VHAVVTTEQQLFQRGQSLSDAENTLGS 392
+ AR T T G ++ V AV TEQ + Q A + +
Sbjct 224 ALSARALQFQSLPTPPSSTAEALGALSAGTPDSVKAVPATEQSIAQ------SANAAMTT 277
Query 393 WLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPS 452
+ P G VADYP V++SG+ + + + AA+ FA ++ +P Q AGFRV P
Sbjct 278 YSPVGATPVADYPAVIVSGSGIDETASRAAAQFADFMREPNQSQLFVGAGFRVEGQDLPD 337
Query 453 SPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQ--SMPNDEGGNSRLSNV 510
S + STL A L D + + +ATI++D SM D+GG +RL+NV
Sbjct 338 LGAVSPAKISSTLKPASAETAAALGDIVANPVSPRSATILMDTSASMGTDDGGTTRLANV 397
Query 511 VAALENRIKAMPPSSVVGLWTF----DGR-EGRTEVPAGPLADPVNGQPRPAALTAALGK 565
+A+ ++ P +S +GL F DG+ R VP G L++P R A +T L
Sbjct 398 ASAVNTQLGRSPDASDIGLREFSTGTDGKPSERILVPGGSLSEP----NRRATITDFL-N 452
Query 566 QYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSAD 625
+GG + L Y+ + + G+ NSVL+IT+ + T L I + +
Sbjct 453 GLRAGGKTSKYPALASSYKSAVDGFDAGRVNSVLLITSSTPDESTTTRAELLSAIAAAGN 512
Query 626 PAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
P++P+ V++I GA D +T + V+ +GG+ +++++ P LA AV LS
Sbjct 513 PSRPVQVDVIVVGAGDDVSTLQDVSDRTGGTLVRVDSTSDPALAAAVTKMLS 564
>gi|226306705|ref|YP_002766665.1| hypothetical protein RER_32180 [Rhodococcus erythropolis PR4]
gi|226185822|dbj|BAH33926.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=564
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 186/588 (32%), Positives = 272/588 (47%), Gaps = 43/588 (7%)
Query 106 HRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIA 165
HR G RG+S G I+ +V +V++VAG W+ + + ++ AA CV G T+ V A
Sbjct 4 HRGGSGARGISKGPILVVVLIVLIVAGFFGWKALSNRIDDQGQQAAGTCVEGNKTLDVTA 63
Query 166 DPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVIN----GFIGKW-PTELGGQP 220
DPSIA Q++E A Y ++ V D C+ V V A S +V G W LG +P
Sbjct 64 DPSIAPQIEELAKRYTQTSPVVRDHCITVVVHGAPSASVSTALEAGPAASWDDAALGPRP 123
Query 221 GLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPEL--QQALANQNWAALPGLQ 278
LWIP+SS A L G A D RSL SP++LA PE A A +W +LP
Sbjct 124 SLWIPTSSFELAPLAGKAVING--DPRSLASSPIVLATGPETAAALAGAASSWKSLPSDL 181
Query 279 TNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMG 338
T + LP GS +M + AG A A + A R L
Sbjct 182 T-------VALPV-GSTETSMAAQAIAADVADAGAGPVTTDQVKSAQVNAALSA-RALQF 232
Query 339 ARPKLADDSLTAAMDTLL--KPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPP 396
S A+ L P V AV TEQ + Q A + ++ P
Sbjct 233 QSLPTPPTSTAEALGALSAGTPDSV-----KAVPATEQSIAQ------SANAAMTTYSPA 281
Query 397 GPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVT 456
G VADYP V++SG+ + + + AA+ FA ++ +P Q AGFRV P
Sbjct 282 GATPVADYPAVIVSGSGIDETASRAAAQFADFMREPNQSQLFVGAGFRVEGQDLPDLGAV 341
Query 457 SFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQ--SMPNDEGGNSRLSNVVAAL 514
S + STL A L D + + +ATI++D SM D+GG +RL+NV AA+
Sbjct 342 SPAKISSTLKPASAETAAALGDIVANPVSPRSATILMDTSASMGTDDGGTTRLANVAAAV 401
Query 515 ENRIKAMPPSSVVGLWTF----DGR-EGRTEVPAGPLADPVNGQPRPAALTAALGKQYSS 569
++ P +S +GL F DG+ R VP G L++P R A +T L +
Sbjct 402 NTQLGRSPDASDIGLREFSTGTDGKPSERILVPGGSLSEP----NRRATITDFL-NGLRA 456
Query 570 GGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKP 629
GG + L Y+ + + G+ NSVL+IT+ + T L I + +P+ P
Sbjct 457 GGKTSKYPALASSYKAAVDGFDAGRVNSVLLITSSTPDESTTTRAELLSAIAAAGNPSHP 516
Query 630 IAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
+ V++I GA D +T + V+ +GG+ +++++ P LA V LS
Sbjct 517 VQVDVIVVGAGDDVSTLQDVSDRTGGTLVRVDSTSDPALAATVTKMLS 564
>gi|326384959|ref|ZP_08206633.1| hypothetical protein SCNU_18532 [Gordonia neofelifaecis NRRL
B-59395]
gi|326196349|gb|EGD53549.1| hypothetical protein SCNU_18532 [Gordonia neofelifaecis NRRL
B-59395]
Length=392
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 132/352 (38%), Positives = 188/352 (54%), Gaps = 14/352 (3%)
Query 104 AGHRSADGRRGVSIGVIVALVAVVVMVAGVIL-WRFFGDALSNRSHTAAARCVGGKDTVA 162
A H S + R +VA +++VAG++ WR GD + + AA+ C+ G V+
Sbjct 2 AKHNSGERSRHYVSRPLVAFALALILVAGIVTAWRQLGDQIDDEQPVAASECLDGPAKVS 61
Query 163 VIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGK-WPTELGGQ-P 220
V+ADP+IA +++ A+S++A+ V D C+ V V A + A + G K W + G+ P
Sbjct 62 VLADPAIAPGLQKIAESFDATKPIVRDHCITVEVRPADARATLEGLTAKDWDAQTYGEFP 121
Query 221 GLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQ-NWAALPGLQT 279
WIP SSI +A L A SLV SP+ LA+ PE+ +A A+Q WA LP QT
Sbjct 122 AAWIPESSIWSAALQTAKPDALQGQPESLVSSPIRLAMEPEIAKAGADQIAWAELPD-QT 180
Query 280 NPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAP-------AGAPATAGIGA 332
SL+ +WGS+R+AMP+ DA L +AVAAA+ P A A + +GA
Sbjct 181 KARSLAQYGRASWGSMRIAMPTGPQSDATALGAQAVAAATVPTQQSLTLAQAQSPPVVGA 240
Query 333 VRTLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGS 392
+ LM A PK+ D S+ AA+ ++ D A APV AV TEQ L+ + D + +
Sbjct 241 LDQLMSAPPKVGDGSIDAAVRSIADTTDPADAPVRAVSVTEQHLYVLTK--DDQTARVAA 298
Query 393 WLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFR 444
P GP VADYP V L+G + + A S F + KP Q+ L RAGFR
Sbjct 299 VAPKGPTPVADYPVVKLAGPLVPAHVSDAISQFITFARKPPQMEILTRAGFR 350
>gi|256379353|ref|YP_003103013.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827]
gi|255923656|gb|ACU39167.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827]
Length=596
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 159/520 (31%), Positives = 234/520 (45%), Gaps = 36/520 (6%)
Query 112 RRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIAD 171
RRG++ I + V ++V G WR+ GD + R+ A C G T+ V A PS+AD
Sbjct 12 RRGIAGWPITIIGVVALLVLGWFGWRWIGDVVDQRAAVQAGDCNEGPATLKVAATPSVAD 71
Query 172 QVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTE-LGGQPGLWIPSSSIS 230
V++ A +++A V D C+ V V ++ S+ V+ G W E LG +P W+ S++
Sbjct 72 AVRQVAQAWSAQRPVVYDHCIGVEVLASDSEVVLEGLTNTWDEEKLGSRPHAWVTDSAVW 131
Query 231 AARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALA---NQNWAALPGLQTNPNSLSGL 287
A RL S S S+ SPV+LA+ E A+ W L + ++
Sbjct 132 ANRLAAQRQSMIGSPPESIATSPVVLAMPQEAADAVQAGPGFRWTDLTAMTSSATGWDRF 191
Query 288 DLPAWGSLRLAM--PSSGNGDAAYLAGEAVAAASAPAGAPATAGI-------GAVRTLMG 338
WG+ ++AM P+ G A L A + P G P TA + A+ L+
Sbjct 192 GKAGWGAFKVAMPDPAVNPGTAMALEAALAGAGADPTG-PVTADLLAQEPVKQAMAKLVA 250
Query 339 ARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAE----NTLGSWL 394
ARP+ S AM L V + AV E L++ D L
Sbjct 251 ARPEQTTTSTWQAMAVLAANPAVGSVGFSAVPALEVDLYRHNTGAEDNRPAPATPLAGVA 310
Query 395 PPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDV--KPPS 452
G VAD+P LSG W+++ Q AA AF +L PEQ A LA AG RV V +P
Sbjct 311 AQGVTPVADFPFTALSGEWVNEAQARAAQAFRTFLKAPEQRATLAAAGLRVEGVTERPSP 370
Query 453 SPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLD--QSMPNDEG-GNSRLSN 509
+P ++ + L D + +A TA G T+++D ++M D G G +RL
Sbjct 371 APGIAWAEVTEQLKPADAAATQQVAGAWATADNGQVVTVLVDTSKTMGEDGGDGRTRLEW 430
Query 510 VVAALENRIKAMPPSSVVGLWTF----DGREGRTE-VPAGPLADPVNGQPRPAALTAALG 564
V AL + S +GLW F DG + E VP G + G R + L A
Sbjct 431 VREALTGQANRAVSGS-LGLWEFATGADGDKAYRELVPTGSV-----GAQRQSLLDAV-- 482
Query 565 KQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAG 604
+ G FT L Y+++LA++R G+ N ++VIT G
Sbjct 483 GRLKPRGDDRPFTALIAAYEDVLADHRDGKRNRIVVITDG 522
>gi|262202671|ref|YP_003273879.1| hypothetical protein Gbro_2768 [Gordonia bronchialis DSM 43247]
gi|262086018|gb|ACY21986.1| hypothetical protein Gbro_2768 [Gordonia bronchialis DSM 43247]
Length=394
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 115/355 (33%), Positives = 175/355 (50%), Gaps = 13/355 (3%)
Query 119 VIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESAD 178
+I A+++++++ A ++ W+ GD + ++ AA CV G+ V + AD +A + A+
Sbjct 22 LIFAMLSIILVGAVIVTWQHLGDLIDRKADEDAAACVEGRQDVTIRADADLAAGLTAIAE 81
Query 179 SYNASAGPVGDRCVAVAVT-SAGSDAVINGFIGKW-PTELGGQPGLWIPSSSISAARLTG 236
++ ++ V D CV++ + A + + G W +G P WIP SS+ AA L
Sbjct 82 NFAKTSPVVRDHCVSITIRPDADAKITADALAGTWDDASMGTYPAAWIPQSSVWAAELAT 141
Query 237 AAGSQAISDSRSLVISPVLLAVRPELQQAL-ANQNWAALPGLQTNPNSLSGLDLPAWGSL 295
RSLV SPV+LAV PE Q L N +W+ LP LQ SL+ + L WGSL
Sbjct 142 RKPDAVEGSPRSLVTSPVVLAVSPEFNQTLGGNLDWSQLPTLQRRDASLADVGLSGWGSL 201
Query 296 RLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAG-------IGAVRTLMGARPKLADDSL 348
R+AMP+ DA+ LA +AVAA T +V L+ P+ D +
Sbjct 202 RMAMPTGQQADASALAAQAVAAQVMRTTGVLTTQDASSQRVTSSVEALLQGAPQPPDGTP 261
Query 349 TAAMDTLLK-PGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAAVADYPTV 407
A + D A+ +H+V TEQ+LFQ + D + LP GP +ADYP V
Sbjct 262 AGAAKVIADGADDAASTSIHSVPITEQKLFQITR--QDTTARVVELLPTGPTPIADYPVV 319
Query 408 LLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALP 462
L+G +S T + F + +P+QL L GFR PP++ +FP P
Sbjct 320 RLAGDRVSDVATDTVAEFVAFAAQPDQLRLLTELGFRGDAPMPPATASVTFPRTP 374
>gi|296140033|ref|YP_003647276.1| hypothetical protein Tpau_2330 [Tsukamurella paurometabola DSM
20162]
gi|296028167|gb|ADG78937.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=368
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 121/354 (35%), Positives = 169/354 (48%), Gaps = 39/354 (11%)
Query 106 HRSADGRRGVS--IGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAV 163
HRS G RG++ + +A V VV+ VAG +LW G + AAA CV G + V
Sbjct 4 HRSGSGSRGLARWVIAAIAAVLVVIAVAGAMLW-LLGRS-EQEGRDAAATCVEGDLQLKV 61
Query 164 IADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLW 223
A P++ + ++ AD +N+S D C VT S ++ G W +LG P +W
Sbjct 62 AAAPALVESLRRVADGFNSSGTVSNDYCPRAEVTGVDSPVALSALAGTWDPKLGPAPAVW 121
Query 224 IPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNS 283
IP SSI ARL A ++ S+ SP +LAVR +QA W +P Q +
Sbjct 122 IPESSIWTARLAAAKPAELSGQPTSIASSPGVLAVRGSARQAFDGVRWVDVPARQAD--- 178
Query 284 LSGLDLPAWGSLRLAMPSSGNG-DAAYLAGEAVAAASAPAGAPATAG--------IGAVR 334
L +++P++G+G D YLA ++VAAA A G A G +
Sbjct 179 -----------LGISLPTAGSGADGTYLAAQSVAAAVARTGGAAIDEEAARGPLVTGTLN 227
Query 335 TLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQL--FQRGQSLSDAENTLGS 392
A PK A+ TAA++ L+ P D + AV TEQQL F RG+ E +
Sbjct 228 RWASAAPKTAN--ATAALEGLMVPSD----SLRAVPVTEQQLYAFARGR----GETAPVA 277
Query 393 WLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVS 446
P GP A A YP +L +++ Q AAS F Y+ K E LA AGFRV+
Sbjct 278 VYPAGPTAAATYPAAVLDREGVTEAQRRAASDFVAYIGKGENAKPLAEAGFRVA 331
>gi|271962919|ref|YP_003337115.1| hypothetical protein Sros_1377 [Streptosporangium roseum DSM
43021]
gi|270506094|gb|ACZ84372.1| hypothetical protein Sros_1377 [Streptosporangium roseum DSM
43021]
Length=584
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 151/554 (28%), Positives = 230/554 (42%), Gaps = 79/554 (14%)
Query 161 VAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQP 220
V V A IA V E+A +N S V RCV V V V+ IG L +P
Sbjct 58 VGVAAAVDIAPTVMEAAGRFNRSGTGVDGRCVLVQVMEQPPATVLRTLIGGTAGVLSERP 117
Query 221 GLWIPSSS--ISAARLTGA---AGSQAISDSRSLVISPVLLAVRPELQQALA----NQNW 271
WI SS I AR GA AG++ + + SP++ A R L Q A + NW
Sbjct 118 DGWITDSSAWIRLARKQGAGNLAGTETV-----MATSPLVFATRKSLAQRFAVGKTDMNW 172
Query 272 AALPGLQTNPNSLSGLDLPAWGSLRLAMPS-SGNGDAAYLAGEAVAAASAPAGAPATAGI 330
+ T D P +R+ PS +G G A A V A A TA +
Sbjct 173 RMVFPATTRGRIRPNADEP--DVVRVPDPSLAGAGIATVAAARDVVGTGAEADRSLTAFV 230
Query 331 GAVRTLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTL 390
+ G+ P S+ AA+D D + V+ EQ ++ +
Sbjct 231 RWAQA--GSAPDY--RSMLAAVD------DRSFWQRPVVIVPEQSVWTHNR--------- 271
Query 391 GSWLPPGPAAVA----------DYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLAR 440
LP G VA DYP V+ S + S + AFA +L PE + R
Sbjct 272 ---LPSGDPVVALHPREGTINLDYPYVVTSA---DSTKASGSRAFATWLRSPETQDAVRR 325
Query 441 AGFRVSD-VKPPSSPVTSFPA-----LPSTL-SVGDDSMRA--TLA---DTMVTASAGVA 488
AGFR +D + P SP P P+ L ++ D+++ A LA + +V A G
Sbjct 326 AGFRSADGTQGPYSPGPEIPTEAPRTRPAILPAMIDEALEAWSRLAPPTNILVLADTGKH 385
Query 489 ATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTF---DGREGRTEVPAGP 545
+ +E G ++L+ + A ++ P S+ +G+W F G + R V GP
Sbjct 386 MARPI-----KEEKGRTKLTVALEAARLGLQLFPNSTHMGMWEFAAAKGGDHRERVRIGP 440
Query 546 LADPVNGQPRPAALTAALGKQYSSGGGAVS--FTTLRLIYQEMLANYRVGQANSVLVITA 603
+ +P GQ + L + + S + ++ ++E+ +Y N++LVITA
Sbjct 441 ILEPDGGQVIRRSRLEELTRTLRADPKLSSSLYDSVLAGFREVTDSYDETMNNTLLVITA 500
Query 604 GPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETS 663
G + L L + +R DP P+ + ++ FG D DRA VA ++ GS L +
Sbjct 501 GRDDGKGLSSGELAERLRDEWDPEHPVQIVVLAFGDDLDRAALGQVASITNGS---LHIA 557
Query 664 ASPDLATAVNIFLS 677
P+ + +FLS
Sbjct 558 QEPN--EIIEVFLS 569
>gi|296270634|ref|YP_003653266.1| family 1 extracellular solute-binding protein [Thermobispora
bispora DSM 43833]
gi|296093421|gb|ADG89373.1| extracellular solute-binding protein family 1 [Thermobispora
bispora DSM 43833]
Length=599
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 129/523 (25%), Positives = 212/523 (41%), Gaps = 49/523 (9%)
Query 161 VAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQP 220
++V A P I V++ AD + V +CV+V+V + GS V N G PT P
Sbjct 66 LSVAASPDIHPAVQKVADRFAKEPKDVDGKCVSVSVKAVGSADVANAIAGTGPTRAKIDP 125
Query 221 GLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRP----ELQQALANQNWAALPG 276
+WIP S I ARL G A + S SP+++ +L+ +W AL
Sbjct 126 DVWIPDSRIWLARL-AKQGVPAPKPAGSAAYSPIVMTASKAGAEQLKSVFNPASWTALMS 184
Query 277 LQT--NPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVR 334
NP+ LS L L + G A +AG +V A+ G
Sbjct 185 AANAANPDGLSR----KIRVLGLDPTQNAAGLGALIAGASVLKANN----------GGDD 230
Query 335 TLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWL 394
+G +L D+++ D++ A+A V V +EQ ++ S + + +
Sbjct 231 LFVGVLRRLVDNTVPTP-DSMFATLTKASARVPVGVASEQAVWAHNMKTSPSNPAVALY- 288
Query 395 PPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVK----- 449
P + DYP ++ + + AA F L + GFR D K
Sbjct 289 PAEGTIILDYPIIVRTK---DRNLRKAAELFTAELTSDAGRKLVQEHGFRTPDGKGGSLL 345
Query 450 PPSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQS----MPNDEGGNS 505
P + V++ P+ + + D++ +A G +LD S +P D G S
Sbjct 346 KPENGVSA--KKPAEMPLPDNASINKVAQAWNQLRMGTRLLALLDISGTMLLPADRTGVS 403
Query 506 RLSNVVAALENRIKAMPPSSVVGLWTF------DGREGRTEVPAGPLADPVNGQPRPAAL 559
R+ + ++ P + +G W F G + + VP GPL + G R +
Sbjct 404 RMDAIKNITREGLRLFPDKAEIGTWVFSDNLRGQGVDWKEVVPMGPLGAQIGGMTRREYI 463
Query 560 TAALGKQYSSGGGAVSFT-TLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQD 618
L + + G TL YQ+ML Y + +++L+ T G D G ++
Sbjct 464 EKTLREVKAIPTGNTGLNDTLWAAYQKMLKEYTPDKVSTILLFTDGVGNDDPNGGISNEE 523
Query 619 FIRK---SADPAKPIAVNIIDFGA--DPDRATWEAVAQLSGGS 656
+RK + DP +P+++ II D DRA A+A+ +GG+
Sbjct 524 ILRKLRQAYDPKRPVSILIISVNTTKDEDRAQMTAIAKATGGA 566
>gi|29833533|ref|NP_828167.1| hypothetical protein SAV_6991 [Streptomyces avermitilis MA-4680]
gi|29610656|dbj|BAC74702.1| hypothetical protein [Streptomyces avermitilis MA-4680]
Length=623
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 127/540 (24%), Positives = 224/540 (42%), Gaps = 53/540 (9%)
Query 154 CVGGKDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWP 213
C + + A P +A ++ +AD C+ + VT+ + V +
Sbjct 91 CQDHAVRLKIAASPDVAPALRAAADEARRKNITSDGHCLDIHVTARDAYQVTESLLSGRK 150
Query 214 TELGGQPGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQAL----ANQ 269
+++ W+P + + R+T A + ++ + ++ SPV +AV P ++L
Sbjct 151 SDIQA----WVPDADLWVRRVTADARATQVTQAGNIASSPVGMAVVPTAAKSLGWPDKTY 206
Query 270 NWAALPGLQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAG 329
W L G +LR P G D + A + A + GA A
Sbjct 207 TWTELAG----------------ATLREDRPKLGTADPSRSA-TGLLALTRLTGATAKVK 249
Query 330 IGAVRT--LMGARPKLADDSLTAAMDTLLKPGDVATAPV------HAVVTTEQQLFQRGQ 381
G RT + A + DS + ++TL P D + A++ +EQ F
Sbjct 250 EGDTRTAAMAKALSQRTADSDSQVLETL--PRDSSGTEQGDPKRNQALILSEQAAFTHNT 307
Query 382 SLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARA 441
S +D++ L + P + DYP L+ LS +++ AA F L +P+ L +
Sbjct 308 S-ADSDLKLDLFYPKDGSPRLDYPFTLVDQPRLSTDESRAALRFMTLLEQPKGTRILQKH 366
Query 442 GFRVSDVKPPSSPVTSFPAL-------PSTLSVGDDSMRATLAD-TMVTASAGVAATIML 493
GFR+ D ++ VT+ P+ + +++ +L T+ SA + + +
Sbjct 367 GFRIDDEDVSATVVTAAGGRSPQPYEEPAPEPASEKTLQESLGTWTITVQSARITTVVDI 426
Query 494 DQSMPNDEGGNSR--LSNVVAALENRIKAMPPSSVVGLWTF----DG-REGRTEVPAGPL 546
SM G+SR + A+L + P +GLW F DG ++ R VP G L
Sbjct 427 SASMSEAVPGSSRSRMDVTKASLLQTLTTFTPDDEIGLWNFSAKLDGDKDYRVLVPTGRL 486
Query 547 ADPVNGQPRPAALTAALGKQYSSGGGAVS-FTTLRLIYQEMLANYRVGQANSVLVITAGP 605
D + L+AA GGA + T Y+ A+Y G+ N+++++T G
Sbjct 487 GDRGGRDTQRDRLSAAFSALEPVRGGATGLYDTTLAAYKAATASYVKGKFNALVILTDGV 546
Query 606 HTDQ-TLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSA 664
+ D ++ L +RK ADP P+ + +I G + R E +A +GGS +++ A
Sbjct 547 NEDPGSISRSTLLTQLRKLADPRHPVPLIMIAVGPEAHRQEAERIAGATGGSGHQVDSPA 606
>gi|297198202|ref|ZP_06915599.1| von Willebrand factor [Streptomyces sviceus ATCC 29083]
gi|197714651|gb|EDY58685.1| von Willebrand factor [Streptomyces sviceus ATCC 29083]
Length=592
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 137/537 (26%), Positives = 227/537 (43%), Gaps = 68/537 (12%)
Query 161 VAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGF-IGKWPTELGGQ 219
+ V A P +A ++ +A+ + RC+ ++VT+ S V + GK P G Q
Sbjct 62 IEVAASPDVAPVLRAAAERAHDENLTSDGRCLDISVTARESYKVRDTLGAGKDP---GAQ 118
Query 220 PGLWIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQALA----NQNWAALP 275
+W+P S + +L+ AG+ ++ ++ SPV +A+ P ++L W L
Sbjct 119 --VWVPDSDVWLEQLSADAGATKVARVGNVASSPVGMAMVPAAAKSLGWPQKTYGWLELA 176
Query 276 GLQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRT 335
G +SL L A P+ L + AA GA A
Sbjct 177 GATLRDDSLK---------LGAADPARSASGLLALTRLSSAAGQVKGGATQAAA------ 221
Query 336 LMGARPKLADDSLTAAMDTLLKPGDVATAPV------HAVVTTEQQLFQRGQSLSDAENT 389
+M + + DS ++TL P D + A+V +EQ F S +++ +
Sbjct 222 MMKSLSQRISDSDGQLVETL--PRDSSGTEQGNPKRNQALVVSEQAAFAHNSS-AESGDD 278
Query 390 LGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVK 449
L + P + DYP L+ LS +++ AA F YL +PEQ L GFR SD +
Sbjct 279 LDFFYPKDGSPRLDYPYALVDETRLSTDESRAAIRFMTYLRRPEQEQLLTDRGFRTSDDQ 338
Query 450 PPSSPVTS------------------FPALPSTLSVGDDSMRATLADTMVTASAGVAATI 491
+S V AL L ++++ T+V ASA
Sbjct 339 VSASLVAKAGGRAPQPYAAAAGEPASATALQEALGTWTITVQSARITTVVDASAS----- 393
Query 492 MLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTF----DG-REGRTEVPAGPL 546
+ +++P G SR+ A+L + +GLW F DG ++ + VP L
Sbjct 394 -MSEAVPGT--GRSRMDVTRASLLQALATFTQEDEIGLWEFSTELDGDKDYKILVPTDRL 450
Query 547 ADPV-NGQPRPAALTAALGKQYSSGGGAVS-FTTLRLIYQEMLANYRVGQANSVLVITAG 604
D G + L+AA G GGA + T Y+ ++Y G+ N+++V+T G
Sbjct 451 GDSTAAGTTQRERLSAAFGGLEPVPGGATGLYDTTLAAYKAATSSYAKGKFNALVVLTDG 510
Query 605 PHTD-QTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNL 660
+ D ++ L + K + PA+P+ + +I G D DRA E +A+ +GGS Q +
Sbjct 511 VNQDPGSISRGALISELEKLSSPARPVPLIVIAVGPDADRAEAEQLAEATGGSGQQV 567
>gi|296268733|ref|YP_003651365.1| hypothetical protein Tbis_0747 [Thermobispora bispora DSM 43833]
gi|296091520|gb|ADG87472.1| hypothetical protein Tbis_0747 [Thermobispora bispora DSM 43833]
Length=587
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 148/589 (26%), Positives = 234/589 (40%), Gaps = 75/589 (12%)
Query 120 IVALVAVVVMVAGVILW-RFFGDALSNRSHTAAARCVGGKDTVAVIADPSIADQVKESAD 178
+ AL VV+ G +L+ R G A S R V V A IA V E+AD
Sbjct 29 VAALTVPVVIAGGAVLYVRGTGGACSPRDPL----------IVRVAAAVDIAPPVMEAAD 78
Query 179 SYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAA 238
+NA+ V RCV V V V+ IG L +P WI SS+ RL
Sbjct 79 RFNATDTGVDGRCVKVQVVEQPPAPVLRTLIGDRVGVLPERPDGWITDSSVWV-RLARKQ 137
Query 239 GSQAI-SDSRSLVISPVLLAVRPELQQALA----NQNWAALPGLQTNPNSLSGLDLPAWG 293
G++ + +D + SP++ A R L + A +W + P + G P
Sbjct 138 GARNLGADETVVATSPLVFATRRSLAERFAAGKTEMSWDMV-----FPATARGRLRPTES 192
Query 294 S---LRLAMPS-SGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLT 349
+R+ PS SG G A A + + A TA + + GA P
Sbjct 193 EPDVVRVPDPSVSGAGIATVAAARDLVGTGSEANKALTAFVRMAQA--GAMP-----DYR 245
Query 350 AAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQ--------SLSDAENTLGSWLPPGPAAV 401
++ + G + PV V+ EQ ++ + +L E T+
Sbjct 246 TMLEAVYARG-FWSRPV--VIVPEQSVWAHNRGPVTEPVVALQPKEGTIH---------- 292
Query 402 ADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSD-VKPPSSPVTSFP- 459
DYP V+ S + A FAR+L L RAGFR +D + P P P
Sbjct 293 LDYPYVVTSD---DPAKAKGAELFARWLRSAPVQDLLRRAGFRSADGSQAPFEPGGEIPT 349
Query 460 ----ALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALE 515
LPS D + V A + + + P G +RL V A +
Sbjct 350 RAPKVLPSITPQLIDEALEAWGKLAPPSRILVLADVSEEGARPIGPDGQTRLGVAVKAAK 409
Query 516 NRIKAMPPSSVVGLWTF-----DGREGRTEVPAGPLADPVNGQP--RPAALTAALGKQYS 568
++ P + +GLW F G++ R + G +++P +GQ R L A +
Sbjct 410 LGLELFPNETHMGLWEFARGIAKGKDHRELISVGSISEPAHGQEIRRTEMLRVADSVRPL 469
Query 569 SGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAK 628
+G A + T+ ++ + A Y +N++LV+T G + + L + +RK +P +
Sbjct 470 AGKSASLYDTILAGFRSLSAGYEPMMSNALLVLTYGQDDGRGISRQELAEALRKEWNPDR 529
Query 629 PIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS 677
P+ + ++ FGA DRA E A ++ G E + +++FLS
Sbjct 530 PVQIVVVMFGAGRDRAALEEAAAITNG-----EVYVARQPGEIIDVFLS 573
>gi|271968871|ref|YP_003343067.1| hypothetical protein Sros_7651 [Streptosporangium roseum DSM
43021]
gi|270512046|gb|ACZ90324.1| hypothetical protein Sros_7651 [Streptosporangium roseum DSM
43021]
Length=560
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 135/553 (25%), Positives = 222/553 (41%), Gaps = 62/553 (11%)
Query 154 CVGGKDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWP 213
C G + + V A P I V + A+ +N +A V C V V+ V +G
Sbjct 30 CAGDEIALRVTASPDIRPAVSQIAERFNKAAHEVEGGCATVTVSEGVPATVASGLA---- 85
Query 214 TELGGQPG---LWIPSSSISAARLTGAAGSQAISDSRSLVISPVLL----AVRPELQQAL 266
GG+ G +WIP S + A L A QA S+ SP+++ +V P+L+++
Sbjct 86 ---GGKTGAMDVWIPDSGLWVANLR-AKNPQAPEAGASVAHSPIVMVASGSVVPKLRKSF 141
Query 267 ANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPA 326
+W + N +++ ++ P LA+ S N A A+A +G
Sbjct 142 GEASWGGM----INAANVANVEGPGRKVRVLALDPSFNAAGLGALLAASGVATA-SGVGQ 196
Query 327 TAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDA 386
+GA++TL G S D LL V V +EQ ++ + ++A
Sbjct 197 EQLVGALKTLSG--------SAVRDQDALLSSLGVKGTRAPLGVASEQGVW----AFNNA 244
Query 387 ENTLGSWLPPGPAAVA---DYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGF 443
+ +P PA DYP V+ + + AA AF + L L GF
Sbjct 245 KKPEVPAVPLYPAEGTLNLDYPVVITTK---DAKVRKAAEAFGKELGTESARKTLQDQGF 301
Query 444 RVSDVK--PPSSPVTSFPAL-PSTLSVGDDSMRATLADTMVTASAGVAATIMLDQS---- 496
R D K P + F A P L D A ++ + + G +LD S
Sbjct 302 RTPDGKGGKPVADSGGFQAKAPQALKTPDVKSVARMSQSWSRLNLGTRLLALLDVSGTMA 361
Query 497 MPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTF------DGREGRTEVPAGPLADPV 550
P G R+ + ++ P S +G+W + G + R VP GPLA +
Sbjct 362 TPVPGTGADRMRMISKIAIEGMQLFPAKSEIGVWEYSTHLAGQGVDFRKTVPVGPLAGSI 421
Query 551 NGQPRPAALTAALGKQYSSGGGAVSFT-TLRLIYQEMLANYRVGQANSVLVITAGPHTDQ 609
+G R L L + G TL+ Y +M Y+ + N+VL++T G D
Sbjct 422 DGVLRKDLLVQKLSTIQAKPTGDTGLNDTLKAAYGQMTREYQGDKINTVLILTDGAGNDD 481
Query 610 ---TLDGPGLQDFIRKSADPAKPIAVNIIDFG--ADPDRATWEAVAQLSGGSYQNLETSA 664
+ + +++K+ +P KP+++ +I FG A + +A+A+ +GG E
Sbjct 482 PDGGVSNEEMLQYLKKTYNPEKPVSILLIAFGPEAAAGKKQMDALAKATGG-----EAFI 536
Query 665 SPDLATAVNIFLS 677
+ D+ FL
Sbjct 537 ARDILQVRKFFLK 549
>gi|296268803|ref|YP_003651435.1| von Willebrand factor type A [Thermobispora bispora DSM 43833]
gi|296091590|gb|ADG87542.1| von Willebrand factor type A [Thermobispora bispora DSM 43833]
Length=607
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 150/616 (25%), Positives = 241/616 (40%), Gaps = 79/616 (12%)
Query 112 RRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAA-ARCVGGKDTVAVIADPSIA 170
RRG + ++ ++A ++VA L G +RS A + C G T+ + A
Sbjct 12 RRGFAPFIVAIIIAGALIVALRTLVGGGGGHGGDRSPEARRSACPEGAITLNITVSSEKA 71
Query 171 DQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGG-QPGLWIPSSSI 229
+ ++ A++Y S V RC V V S + + W G +P +W P+SS
Sbjct 72 ELLRTMAEAY--SGREVNGRCAEVVVNPKASGSAMLALARGWDERRDGPKPDVWTPASSG 129
Query 230 SAARLTGAAG-----SQAISDSRSLVISPVLLAVRPELQQAL----ANQNWAALPGLQTN 280
A L A S +D+ S+ SP+++A+ + +AL W+ + L +
Sbjct 130 WIAMLQRRAADNDRASLVSADNPSIATSPLVIAMPKPMAEALGWPDKKIGWSDILSLAND 189
Query 281 PNSLSGLDLPAWGSLRLAMPS---SGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRT-- 335
P + P WG RL + S +G A + AA+ +G A + RT
Sbjct 190 PEGWAKYGHPEWGRFRLGKTNPHFSTSGLNATIG--TYFAATGLSGDLGEANLADRRTRD 247
Query 336 -LMGARPKLAD--DSLTAAMDTLLKPGD--VATAPVHAVVTTEQQLFQRGQSLSDAE-NT 389
+ G + D+ + L + D VA + V AV E+ ++ Q + T
Sbjct 248 FVRGVERSIVHYGDTTLTFLSNLQEADDAGVALSYVSAVAVEEKSVWDYNQGNPTGDPRT 307
Query 390 LGSWLPPGPAAVADYPT--VLLSG------AWLSQEQTSAASAFARYLHKPEQLAKLARA 441
LG P VA YP LLS +W+ E+ A F YL PEQ A
Sbjct 308 LGKHPKPKVPLVAIYPKEGTLLSDNPYAVLSWIDPEKKPVAEDFLNYLRAPEQQRLFAEH 367
Query 442 GFRVSDVKP-----------PSSPVTSF----PALPSTLSVGDDSMRATLADTMVTASAG 486
FR D KP P P + P + + D +R MV +G
Sbjct 368 AFRSHDGKPGELITAENGLNPKEPAKTLSVPAPRVLDRILRSWDELRKPAHVLMVIDVSG 427
Query 487 VAATIMLDQSMPNDE--GGNSRLSNVVAALENRIKAMPPSSVVGLWTFD-----GREGRT 539
SM D G ++L A N + + P+ VGLW F G++ R
Sbjct 428 ---------SMGADVPGTGQTKLELAKQAAINALPQLGPNDQVGLWMFSTNQDGGKDYRE 478
Query 540 EVPAGPLADPVNGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVL 599
VP G N + L + GGG + T R Y+ +L + N+V+
Sbjct 479 LVPMGR-----NNRD----LLKKRIQGLIPGGGTGLYDTTRAAYRTVLERHSNDVINAVV 529
Query 600 VITAGPHTDQTLDGPGLQDFI--RKSADPAKPIAVNIIDFGADPDRATWEAVAQLS-GGS 656
V+T G + D + L+D + ++ + + V I +G D D ++Q++ +
Sbjct 530 VLTDGKNEDD--NSISLEDLLAELRTETGQETVRVFTIAYGNDADLEVLRQISQVTDAAA 587
Query 657 YQNLETSASPDLATAV 672
Y + E + + TAV
Sbjct 588 YDSREPGSIDQVFTAV 603
>gi|297162153|gb|ADI11865.1| hypothetical protein SBI_08747 [Streptomyces bingchenggensis
BCW-1]
Length=610
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 159/602 (27%), Positives = 244/602 (41%), Gaps = 72/602 (11%)
Query 105 GHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDTVAVI 164
G RS+ RR V+I V L A+ A V+ G S C + V
Sbjct 23 GSRSSARRRAVAISTAVVL-ALATGAALVLRSELLGPMKS---------CSNDAVRLGVA 72
Query 165 ADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWI 224
A P IA ++E A+ ++ RC+ V VTS V + +G + G Q +W+
Sbjct 73 ASPDIAPALREVAERARSTHVRSDGRCLDVKVTSRVPYEVADA-LGDDSRDPGFQ--VWL 129
Query 225 PSSSISAARLTGA-AGSQAISDSRSLVISPVLLAVRPELQQAL----ANQNWAALPGLQT 279
P SS+ R T + A S + + SPV +A P + L +WA L G T
Sbjct 130 PDSSVWVDRATSSPAKSVPLDTLGGVASSPVAVAATPSAAKRLGWPQKKYSWARLTGAAT 189
Query 280 NPNSLS-GLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMG 338
L G PA + L + N A +A E+ A A L+
Sbjct 190 GDEDLRLGAADPARSATGLLALARVN---ASIAKESGGPGKGGG---ADTRAAAAAKLLS 243
Query 339 ARPKLADDSLTAAMDTLLKPGDVATAPV------HAVVTTEQQLFQRGQSLSDAENTLGS 392
R DD + + P D + A A+ +EQ ++ + A L
Sbjct 244 QRVSDGDDQVLTTL-----PRDDSGAEAGNPRRNQALFLSEQAAYRHNAAAGGAPR-LQL 297
Query 393 WLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFR------VS 446
+ P A DYP +L+ L+ + AA+ F +L LAR GFR V
Sbjct 298 FYPEDGTAELDYPYTVLNDDALTTVRARAATRFMTFLSDTRNRRILARHGFRPAGGKAVE 357
Query 447 DV------KPPSSPVTSFPA-------LPSTLSVGDDSMRATLADTMVTASAGVAATIML 493
+V K P P PA L + L + ++++ +T+V ASA +AA +
Sbjct 358 EVTRTAGGKAPQ-PYAVVPASGPSGAELETALGMWTITVQSARLNTVVDASASMAAPV-- 414
Query 494 DQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTF----DG-REGRTEVPAGPLA- 547
P G SR++ A+L + P +GLW F DG R+ R VP L
Sbjct 415 ----PG-RSGESRMAVTKASLLRALAQFTPDDEIGLWEFSRQLDGARDYRELVPTRRLGL 469
Query 548 DPVNGQPRPAALTAALGKQYSSGGGAVS-FTTLRLIYQEMLANYRVGQANSVLVITAGPH 606
+G + LTAA G GGA + T Y++ Y G+ N+V+++T G +
Sbjct 470 RDADGSTQRDRLTAAFGALEPVPGGATGLYDTALAAYRKARDGYAQGKFNAVVLLTDGSN 529
Query 607 TDQ-TLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGGSYQNLETSAS 665
D+ ++ L + + + DP +P+ + I G D D + E +A+ +GGS Q + A
Sbjct 530 QDEGSISRKALVEELGRLTDPNRPVPLIAIAVGPDADLSACEDIAEATGGSAQRVADPAQ 589
Query 666 PD 667
D
Sbjct 590 ID 591
>gi|291443250|ref|ZP_06582640.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998]
gi|291346197|gb|EFE73101.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998]
Length=597
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 136/540 (26%), Positives = 211/540 (40%), Gaps = 68/540 (12%)
Query 154 CVGGKDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWP 213
C ++V+A P IA V+ A+ A RC+ V V + + V
Sbjct 57 CEDSAVHLSVVASPDIAPAVRSIAEQARADELKADGRCLVVEVLARDAHKVAEALAAG-- 114
Query 214 TELGGQPGL--WIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQAL----A 267
+P W+P S + R G +S S S+ SPV LAV P +AL
Sbjct 115 ---DAEPDFQVWLPDSDLWLERAKGLGEGIPVSPSDSVASSPVALAVVPSASRALGWPRK 171
Query 268 NQNWAALPGLQTNPNSLSGLDLPAWGS--LRLAMPSSGNGDAAYLAGEAVAAASAPAGAP 325
WA L ++G A GS +RL LA + A+SA G
Sbjct 172 TYTWAEL---------VAG----ALGSDGVRLGAADPARSATGLLALAGIGASSARQGGD 218
Query 326 ATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGDVAT----APVHAVVTTEQQLFQRGQ 381
+ + ++ R D + ++TL + A AV+ +EQ F
Sbjct 219 SDTRVAQTAKVLAERMSDGDAQV---LETLARSTSGAEEGNPKRNQAVLISEQAAFTHNA 275
Query 382 SLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARA 441
+ A L + P + DYP L++ LS ++ AA F L E A A
Sbjct 276 EATGA-GKLDLFYPEDGTPLLDYPYTLVNEPQLSTTESRAALRFMNLLGDREARATFAEH 334
Query 442 GFRVSDVKPPSSPVTS------------------FPALPSTLSVGDDSMRATLADTMVTA 483
GFR D S V + AL TL + ++++ T+V A
Sbjct 335 GFRAGDGSAEDSLVAAAGGRKPQPYAEPAAEAPSAEALQETLGMWTITVQSARLTTVVDA 394
Query 484 SAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTF----DG-REGR 538
S G AT++ ++ SR+ +L + P+ +GLW F DG ++ R
Sbjct 395 S-GSMATLVPGRN-------QSRMDVTKESLIQALDQFTPNDEIGLWEFATTLDGEKDYR 446
Query 539 TEVPAGPLADP-VNGQPRPAALTAAL-GKQYSSGGGAVSFTTLRLIYQEMLANYRVGQAN 596
+ L DP G LTAA G Q GG + T Y+E + Y G+ N
Sbjct 447 RLMETKRLGDPAAGGGTHREKLTAAFAGLQPVPGGATGLYDTTLASYKEARSTYVKGKFN 506
Query 597 SVLVITAGPHTDQT-LDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGG 655
+++++T G + D + GL +++ DP +P+ V I G D DR +A+++GG
Sbjct 507 ALVILTDGSNQDTNGISRSGLITELKELVDPERPVPVIAIAVGPDADRDEVAEIARITGG 566
>gi|239986306|ref|ZP_04706970.1| hypothetical protein SrosN1_03262 [Streptomyces roseosporus NRRL
11379]
Length=592
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 136/540 (26%), Positives = 211/540 (40%), Gaps = 68/540 (12%)
Query 154 CVGGKDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWP 213
C ++V+A P IA V+ A+ A RC+ V V + + V
Sbjct 52 CEDSAVHLSVVASPDIAPAVRSIAEQARADELKADGRCLVVEVLARDAHKVAEALAAG-- 109
Query 214 TELGGQPGL--WIPSSSISAARLTGAAGSQAISDSRSLVISPVLLAVRPELQQAL----A 267
+P W+P S + R G +S S S+ SPV LAV P +AL
Sbjct 110 ---DAEPDFQVWLPDSDLWLERAKGLGEGIPVSPSDSVASSPVALAVVPSASRALGWPRK 166
Query 268 NQNWAALPGLQTNPNSLSGLDLPAWGS--LRLAMPSSGNGDAAYLAGEAVAAASAPAGAP 325
WA L ++G A GS +RL LA + A+SA G
Sbjct 167 TYTWAEL---------VAG----ALGSDGVRLGAADPARSATGLLALAGIGASSARQGGD 213
Query 326 ATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGDVAT----APVHAVVTTEQQLFQRGQ 381
+ + ++ R D + ++TL + A AV+ +EQ F
Sbjct 214 SDTRVAQTAKVLAERMSDGDAQV---LETLARSTSGAEEGNPKRNQAVLISEQAAFTHNA 270
Query 382 SLSDAENTLGSWLPPGPAAVADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARA 441
+ A L + P + DYP L++ LS ++ AA F L E A A
Sbjct 271 EATGA-GKLDLFYPEDGTPLLDYPYTLVNEPQLSTTESRAALRFMNLLGDREARATFAEH 329
Query 442 GFRVSDVKPPSSPVTS------------------FPALPSTLSVGDDSMRATLADTMVTA 483
GFR D S V + AL TL + ++++ T+V A
Sbjct 330 GFRAGDGSAEDSLVAAAGGRKPQPYAEPAAEAPSAEALQETLGMWTITVQSARLTTVVDA 389
Query 484 SAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTF----DG-REGR 538
S G AT++ ++ SR+ +L + P+ +GLW F DG ++ R
Sbjct 390 S-GSMATLVPGRN-------QSRMDVTKESLIQALDQFTPNDEIGLWEFATTLDGEKDYR 441
Query 539 TEVPAGPLADP-VNGQPRPAALTAAL-GKQYSSGGGAVSFTTLRLIYQEMLANYRVGQAN 596
+ L DP G LTAA G Q GG + T Y+E + Y G+ N
Sbjct 442 RLMETKRLGDPAAGGGTHREKLTAAFAGLQPVPGGATGLYDTTLASYKEARSTYVKGKFN 501
Query 597 SVLVITAGPHTDQT-LDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVAQLSGG 655
+++++T G + D + GL +++ DP +P+ V I G D DR +A+++GG
Sbjct 502 ALVILTDGSNQDTNGISRSGLITELKELVDPERPVPVIAIAVGPDADRDEVAEIARITGG 561
Lambda K H
0.313 0.130 0.384
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 1579993410260
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40