BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3178
Length=119
Score E
Sequences producing significant alignments: (Bits) Value
gi|15842755|ref|NP_337792.1| hypothetical protein MT3267 [Mycoba... 242 2e-62
gi|15610314|ref|NP_217694.1| hypothetical protein Rv3178 [Mycoba... 241 2e-62
gi|289575891|ref|ZP_06456118.1| conserved hypothetical protein [... 240 5e-62
gi|340628154|ref|YP_004746606.1| hypothetical protein MCAN_31931... 237 5e-61
gi|126432655|ref|YP_001068346.1| hypothetical protein Mjls_0042 ... 168 2e-40
gi|108797033|ref|YP_637230.1| hypothetical protein Mmcs_0052 [My... 167 6e-40
gi|312195411|ref|YP_004015472.1| hypothetical protein FraEuI1c_1... 140 6e-32
gi|302526918|ref|ZP_07279260.1| conserved hypothetical protein [... 137 5e-31
gi|300784413|ref|YP_003764704.1| hypothetical protein AMED_2507 ... 132 2e-29
gi|340525834|gb|AEK41039.1| hypothetical protein RAM_12745 [Amyc... 132 2e-29
gi|312138188|ref|YP_004005524.1| hypothetical protein REQ_07220 ... 118 4e-25
gi|325674907|ref|ZP_08154594.1| hypothetical protein HMPREF0724_... 117 6e-25
gi|326383127|ref|ZP_08204816.1| hypothetical protein SCNU_09321 ... 117 6e-25
gi|226364186|ref|YP_002781968.1| hypothetical protein ROP_47760 ... 115 3e-24
gi|111021649|ref|YP_704621.1| hypothetical protein RHA1_ro04677 ... 114 4e-24
gi|262203831|ref|YP_003275039.1| hypothetical protein Gbro_3972 ... 114 6e-24
gi|326331633|ref|ZP_08197921.1| hypothetical protein NBCG_03072 ... 111 3e-23
gi|226304413|ref|YP_002764371.1| hypothetical protein RER_09240 ... 111 4e-23
gi|169627712|ref|YP_001701361.1| hypothetical protein MAB_0609c ... 110 5e-23
gi|229494560|ref|ZP_04388323.1| conserved hypothetical protein [... 110 5e-23
gi|343928246|ref|ZP_08767700.1| hypothetical protein GOALK_111_0... 108 3e-22
gi|111026879|ref|YP_708857.1| hypothetical protein RHA1_ro11052 ... 105 3e-21
gi|269127186|ref|YP_003300556.1| hypothetical protein Tcur_2976 ... 99.8 1e-19
gi|183985003|ref|YP_001853294.1| hypothetical protein MMAR_5035 ... 99.0 2e-19
gi|126437629|ref|YP_001073320.1| hypothetical protein Mjls_5065 ... 97.4 5e-19
gi|108801643|ref|YP_641840.1| hypothetical protein Mmcs_4680 [My... 97.1 9e-19
gi|300784815|ref|YP_003765106.1| hypothetical protein AMED_2910 ... 95.1 3e-18
gi|240172838|ref|ZP_04751497.1| hypothetical protein MkanA1_2623... 93.2 1e-17
gi|289445139|ref|ZP_06434883.1| conserved hypothetical protein [... 92.4 2e-17
gi|15610683|ref|NP_218064.1| hypothetical protein Rv3547 [Mycoba... 92.0 2e-17
gi|41406617|ref|NP_959453.1| hypothetical protein MAP0519c [Myco... 91.7 4e-17
gi|288920589|ref|ZP_06414894.1| hypothetical protein FrEUN1fDRAF... 91.3 4e-17
gi|118467215|ref|YP_879893.1| AclJ protein [Mycobacterium avium ... 91.3 4e-17
gi|336458407|gb|EGO37381.1| deazaflavin-dependent nitroreductase... 91.3 4e-17
gi|330466024|ref|YP_004403767.1| hypothetical protein VAB18032_1... 91.3 5e-17
gi|288920357|ref|ZP_06414668.1| hypothetical protein FrEUN1fDRAF... 89.7 1e-16
gi|269126740|ref|YP_003300110.1| hypothetical protein Tcur_2511 ... 89.0 2e-16
gi|296166578|ref|ZP_06849008.1| hypothetical protein HMPREF0591_... 88.2 3e-16
gi|342862374|ref|ZP_08719014.1| hypothetical protein MCOL_25903 ... 88.2 3e-16
gi|254822629|ref|ZP_05227630.1| hypothetical protein MintA_22059... 88.2 4e-16
gi|16945722|dbj|BAB72053.1| AclJ [Streptomyces galilaeus] 87.8 5e-16
gi|302529146|ref|ZP_07281488.1| proline iminopeptidase [Streptom... 87.4 7e-16
gi|302530438|ref|ZP_07282780.1| conserved hypothetical protein [... 86.7 1e-15
gi|271969075|ref|YP_003343271.1| hypothetical protein Sros_7866 ... 84.7 5e-15
gi|227121349|gb|ACP19381.1| hypothetical protein [Micromonospora... 84.3 5e-15
gi|254823479|ref|ZP_05228480.1| hypothetical protein MintA_26353... 83.6 9e-15
gi|333922113|ref|YP_004495694.1| hypothetical protein AS9A_4461 ... 83.2 1e-14
gi|227121348|gb|ACP19380.1| hypothetical protein [Micromonospora... 83.2 1e-14
gi|84498004|ref|ZP_00996801.1| hypothetical protein JNB_17993 [J... 82.0 2e-14
gi|333992336|ref|YP_004524950.1| hypothetical protein JDM601_369... 82.0 3e-14
>gi|15842755|ref|NP_337792.1| hypothetical protein MT3267 [Mycobacterium tuberculosis CDC1551]
gi|254233791|ref|ZP_04927116.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|308232361|ref|ZP_07664068.1| hypothetical protein TMAG_03640 [Mycobacterium tuberculosis SUMu001]
19 more sequence titles
Length=171
Score = 242 bits (617), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 119/119 (100%), Positives = 119/119 (100%), Gaps = 0/119 (0%)
Query 1 LRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP 60
LRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP
Sbjct 53 LRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP 112
Query 61 PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR
Sbjct 113 PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 171
>gi|15610314|ref|NP_217694.1| hypothetical protein Rv3178 [Mycobacterium tuberculosis H37Rv]
gi|31794356|ref|NP_856849.1| hypothetical protein Mb3204 [Mycobacterium bovis AF2122/97]
gi|121639063|ref|YP_979287.1| hypothetical protein BCG_3203 [Mycobacterium bovis BCG str. Pasteur
1173P2]
53 more sequence titles
Length=119
Score = 241 bits (616), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 118/119 (99%), Positives = 119/119 (100%), Gaps = 0/119 (0%)
Query 1 LRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP 60
+RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP
Sbjct 1 MRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP 60
Query 61 PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR
Sbjct 61 PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
>gi|289575891|ref|ZP_06456118.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339633186|ref|YP_004724828.1| hypothetical protein MAF_31870 [Mycobacterium africanum GM041182]
gi|289540322|gb|EFD44900.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339332542|emb|CCC28257.1| conserved hypothetical protein [Mycobacterium africanum GM041182]
Length=119
Score = 240 bits (612), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 117/119 (99%), Positives = 118/119 (99%), Gaps = 0/119 (0%)
Query 1 LRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP 60
+RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP
Sbjct 1 MRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP 60
Query 61 PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
PNPDTHIQIGSDRRPVRAVV SSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR
Sbjct 61 PNPDTHIQIGSDRRPVRAVVTSSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
>gi|340628154|ref|YP_004746606.1| hypothetical protein MCAN_31931 [Mycobacterium canettii CIPT
140010059]
gi|340006344|emb|CCC45524.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=119
Score = 237 bits (604), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 116/119 (98%), Positives = 118/119 (99%), Gaps = 0/119 (0%)
Query 1 LRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP 60
+RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNL
Sbjct 1 MRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLL 60
Query 61 PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIIL+PR
Sbjct 61 PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILQPR 119
>gi|126432655|ref|YP_001068346.1| hypothetical protein Mjls_0042 [Mycobacterium sp. JLS]
gi|126232455|gb|ABN95855.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=159
Score = 168 bits (426), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 80/118 (68%), Positives = 97/118 (83%), Gaps = 0/118 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+GAGFRKPVPT+LLEHR RKSGK FV PLLY+ D +V++VAS G+ E+PQWYRNL
Sbjct 42 RVGAGFRKPVPTMLLEHRGRKSGKVFVTPLLYLRDGADVVIVASQGGRKEHPQWYRNLIA 101
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
PD +IGS+RRPV+AVVA +++RARLWP V+ YADFD+ QSWTER IPVIIL+PR
Sbjct 102 GPDVFAEIGSERRPVKAVVADAEQRARLWPALVELYADFDTYQSWTEREIPVIILQPR 159
>gi|108797033|ref|YP_637230.1| hypothetical protein Mmcs_0052 [Mycobacterium sp. MCS]
gi|119866118|ref|YP_936070.1| hypothetical protein Mkms_0061 [Mycobacterium sp. KMS]
gi|108767452|gb|ABG06174.1| hypothetical protein Mmcs_0052 [Mycobacterium sp. MCS]
gi|119692207|gb|ABL89280.1| Mycobacterium tuberculosis paralogous family 11 [Mycobacterium
sp. KMS]
Length=159
Score = 167 bits (422), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 78/118 (67%), Positives = 97/118 (83%), Gaps = 0/118 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+GAGFRKPVPT+LLEHR RK+GK FV PLLY+ D +V++VAS G+ E+PQWYRNL
Sbjct 42 RVGAGFRKPVPTMLLEHRGRKTGKVFVTPLLYLRDGADVVIVASQGGRKEHPQWYRNLIA 101
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
PD +IGS+RRPV+AVVA +++RARLWP V+ YADFD+ QSWTER IPV+IL+PR
Sbjct 102 GPDVFAEIGSERRPVKAVVADAEQRARLWPALVELYADFDTYQSWTEREIPVVILQPR 159
>gi|312195411|ref|YP_004015472.1| hypothetical protein FraEuI1c_1540 [Frankia sp. EuI1c]
gi|311226747|gb|ADP79602.1| hypothetical protein FraEuI1c_1540 [Frankia sp. EuI1c]
Length=167
Score = 140 bits (353), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 71/119 (60%), Positives = 85/119 (72%), Gaps = 1/119 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+GAGFR+PVPTLLLEH R SG+ F PLLY+ D N++VVAS G +NPQWY NL
Sbjct 46 RVGAGFRRPVPTLLLEHVGRTSGRRFTTPLLYLDDAPNLVVVASQGGLPKNPQWYANLLA 105
Query 62 NPDTHIQIGSDR-RPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
+PDT + + R RPV A VA+ DERA LWPR V+ YADF +WTER IPV+IL PR
Sbjct 106 HPDTLVHLRKQRGRPVHARVATPDERASLWPRLVELYADFAKYAAWTEREIPVVILEPR 164
>gi|302526918|ref|ZP_07279260.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302435813|gb|EFL07629.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=164
Score = 137 bits (345), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 68/119 (58%), Positives = 85/119 (72%), Gaps = 1/119 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+GAGFRKPVPTLLLEHR RKSG+ F PLLY+ +++++VAS G ++PQWY NL
Sbjct 46 RVGAGFRKPVPTLLLEHRGRKSGRLFTTPLLYLRSGDDLVIVASQGGLPKHPQWYFNLRA 105
Query 62 NPDTHIQI-GSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
P+T + + G PV A VAS+DER LWPR V+ YADF Q WTER IPV++L PR
Sbjct 106 EPETRVHLRGRRNVPVTARVASADERRELWPRLVELYADFAKYQKWTEREIPVVVLTPR 164
>gi|300784413|ref|YP_003764704.1| hypothetical protein AMED_2507 [Amycolatopsis mediterranei U32]
gi|299793927|gb|ADJ44302.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
Length=166
Score = 132 bits (331), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/119 (57%), Positives = 81/119 (69%), Gaps = 1/119 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+GAGFRKPVPTLLLEHR RKSG+ F PLLY+ + ++V S G +PQWY NL
Sbjct 47 RIGAGFRKPVPTLLLEHRGRKSGRLFTTPLLYLRSGADFVIVGSQGGLPRHPQWYFNLLA 106
Query 62 NPDTHIQIGSDRR-PVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
+P+T I + R PV A VA ERA LWPR V+ YADF Q+WTER IPV++L PR
Sbjct 107 HPETRIDLKGRRGVPVTARVAEPPERAELWPRLVELYADFAKYQAWTEREIPVVVLSPR 165
>gi|340525834|gb|AEK41039.1| hypothetical protein RAM_12745 [Amycolatopsis mediterranei S699]
Length=165
Score = 132 bits (331), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/119 (57%), Positives = 81/119 (69%), Gaps = 1/119 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+GAGFRKPVPTLLLEHR RKSG+ F PLLY+ + ++V S G +PQWY NL
Sbjct 46 RIGAGFRKPVPTLLLEHRGRKSGRLFTTPLLYLRSGADFVIVGSQGGLPRHPQWYFNLLA 105
Query 62 NPDTHIQIGSDRR-PVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
+P+T I + R PV A VA ERA LWPR V+ YADF Q+WTER IPV++L PR
Sbjct 106 HPETRIDLKGRRGVPVTARVAEPPERAELWPRLVELYADFAKYQAWTEREIPVVVLSPR 164
>gi|312138188|ref|YP_004005524.1| hypothetical protein REQ_07220 [Rhodococcus equi 103S]
gi|311887527|emb|CBH46839.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=156
Score = 118 bits (295), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 60/114 (53%), Positives = 78/114 (69%), Gaps = 0/114 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+G+ F +P LL RKSG+ V+PLL++ D + VI+VAS G +NP WY N+
Sbjct 41 RVGSAFPWGIPVCLLTTTGRKSGQPRVSPLLFLEDGDRVILVASQGGLPKNPMWYLNIRA 100
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVII 115
NP+ IQI S R +RA VA+ +ERA LWPR VD YADFD+ QSWT+R IPV+I
Sbjct 101 NPEVTIQIKSRIRTMRARVATDEERAELWPRLVDMYADFDNYQSWTDRQIPVVI 154
>gi|325674907|ref|ZP_08154594.1| hypothetical protein HMPREF0724_12376 [Rhodococcus equi ATCC
33707]
gi|325554493|gb|EGD24168.1| hypothetical protein HMPREF0724_12376 [Rhodococcus equi ATCC
33707]
Length=156
Score = 117 bits (293), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 60/114 (53%), Positives = 77/114 (68%), Gaps = 0/114 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+G+ F +P LL RKSG+ V+PLL++ D VI+VAS G +NP WY N+
Sbjct 41 RVGSAFPWGIPVCLLTTTGRKSGQPRVSPLLFLEDGGRVILVASQGGLPKNPMWYLNIRA 100
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVII 115
NP+ IQI S R +RA VA+ +ERA LWPR VD YADFD+ QSWT+R IPV+I
Sbjct 101 NPEVTIQIKSRIRTMRARVATDEERAELWPRLVDMYADFDNYQSWTDRQIPVVI 154
>gi|326383127|ref|ZP_08204816.1| hypothetical protein SCNU_09321 [Gordonia neofelifaecis NRRL
B-59395]
gi|326198263|gb|EGD55448.1| hypothetical protein SCNU_09321 [Gordonia neofelifaecis NRRL
B-59395]
Length=159
Score = 117 bits (293), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 57/117 (49%), Positives = 71/117 (61%), Gaps = 0/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+GA RKP P LL RKSG+ PLLY+ D +++VAS G +NP WY N+
Sbjct 42 RVGAALRKPTPVCLLTTVGRKSGQKRTVPLLYLRDGQAIVLVASQGGLPKNPSWYYNVRD 101
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
P IQIG D R A +A ERA LWPR VD YADFD+ +WT+R IPV++ P
Sbjct 102 EPAVSIQIGGDDRAYVARIAGDAERAELWPRLVDLYADFDTYAAWTDRAIPVVVCEP 158
>gi|226364186|ref|YP_002781968.1| hypothetical protein ROP_47760 [Rhodococcus opacus B4]
gi|226242675|dbj|BAH53023.1| hypothetical protein [Rhodococcus opacus B4]
Length=169
Score = 115 bits (287), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 56/118 (48%), Positives = 78/118 (67%), Gaps = 0/118 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+G+ F + +P LL RKSG ++PLL++ D + +++VAS G ++P WY NL
Sbjct 52 RVGSAFPRGLPVCLLTTTGRKSGDPRISPLLFLEDGDRIVLVASQGGLPKHPMWYLNLRA 111
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
NP +Q+ S RP+ A VA ERARLWPR VD YADFD+ Q+WT+R IPV++ PR
Sbjct 112 NPQVTVQVKSRVRPMTAQVADPGERARLWPRLVDMYADFDNYQAWTDRTIPVVVCTPR 169
>gi|111021649|ref|YP_704621.1| hypothetical protein RHA1_ro04677 [Rhodococcus jostii RHA1]
gi|110821179|gb|ABG96463.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=161
Score = 114 bits (285), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 56/118 (48%), Positives = 79/118 (67%), Gaps = 0/118 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+G+ F + +P LL RKSG+ ++PLL++ D + +I+VAS G ++P WY NL
Sbjct 44 RVGSAFPRGLPVCLLTTTGRKSGEPRISPLLFLEDGDRIILVASQGGLPKHPMWYLNLRA 103
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
NPD +Q+ S RP+ A VA +ERARLWPR V Y DFD+ Q+WT+R IPV++ PR
Sbjct 104 NPDVTVQVKSRVRPMTAHVADPEERARLWPRLVAMYPDFDNYQAWTDRTIPVVVCTPR 161
>gi|262203831|ref|YP_003275039.1| hypothetical protein Gbro_3972 [Gordonia bronchialis DSM 43247]
gi|262087178|gb|ACY23146.1| hypothetical protein Gbro_3972 [Gordonia bronchialis DSM 43247]
Length=169
Score = 114 bits (284), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 60/123 (49%), Positives = 76/123 (62%), Gaps = 5/123 (4%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+GA R PVP LL RKSGK APLLY+ ++ +VVAS G A +P WY NL
Sbjct 47 RVGAAARNPVPVALLTTIGRKSGKPRTAPLLYLRRGDDFVVVASQGGSATHPAWYLNLRD 106
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERG-----IPVIIL 116
NPD IQ+G D +RA A+ ERA++WP VD Y DFD+ +WT++G IPV+I
Sbjct 107 NPDVSIQVGKDTYELRARTATDSERAQIWPELVDHYPDFDTYAAWTDQGPTKRIIPVVIC 166
Query 117 RPR 119
PR
Sbjct 167 EPR 169
>gi|326331633|ref|ZP_08197921.1| hypothetical protein NBCG_03072 [Nocardioidaceae bacterium Broad-1]
gi|325950432|gb|EGD42484.1| hypothetical protein NBCG_03072 [Nocardioidaceae bacterium Broad-1]
Length=161
Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 58/119 (49%), Positives = 79/119 (67%), Gaps = 1/119 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+ AG+RKP P LLLEH R SG+ F PLL++ +++VV S G ++PQW NL
Sbjct 43 RVMAGWRKPAPVLLLEHIGRTSGRTFSTPLLFMRSGPDLVVVGSQGGLPKDPQWVANLRA 102
Query 62 NPDTHIQI-GSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
PD +++ S R VRA VA ERA LWP+ ++ YADF++ Q+WT+R IPVI+L PR
Sbjct 103 RPDVAVRLPRSGRSLVRARVADPAERAALWPQLLETYADFETYQTWTDREIPVIVLSPR 161
>gi|226304413|ref|YP_002764371.1| hypothetical protein RER_09240 [Rhodococcus erythropolis PR4]
gi|226183528|dbj|BAH31632.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=162
Score = 111 bits (277), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 54/117 (47%), Positives = 76/117 (65%), Gaps = 0/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+G+ F + +P LL RKSG+ ++PLL++ D NVI+VAS G +NP WY N+
Sbjct 43 RVGSAFPRGLPICLLTTTGRKSGEERISPLLFLEDGENVILVASQGGLPKNPMWYLNIKT 102
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
+P +QI S R ++A V++ DERARLWP Y DFD+ Q+WT+R IPVI+ P
Sbjct 103 DPVVTVQIKSRIRSMKARVSTEDERARLWPMLTAMYPDFDNYQAWTDRIIPVIVCEP 159
>gi|169627712|ref|YP_001701361.1| hypothetical protein MAB_0609c [Mycobacterium abscessus ATCC
19977]
gi|169239679|emb|CAM60707.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=162
Score = 110 bits (276), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 55/117 (48%), Positives = 74/117 (64%), Gaps = 0/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+GA F +P LLL RK+G+ ++ LLY+ D + V++VAS G+A NP WY NL
Sbjct 46 RVGAAFPWGIPVLLLTTTGRKTGQQRLSALLYLPDGDKVVLVASQGGRASNPAWYLNLKA 105
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
N + +QI D R + A A+ ERA WP+ V+ YADFD QS+T R IPV+IL P
Sbjct 106 NSEVTVQIKGDIRTMTARTATDAERAYYWPKLVELYADFDKYQSYTTRQIPVVILEP 162
>gi|229494560|ref|ZP_04388323.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229318922|gb|EEN84780.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=162
Score = 110 bits (276), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 55/117 (48%), Positives = 76/117 (65%), Gaps = 0/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+G+ F + +P LL RKSG+ ++PLL++ D NVI+VAS G +NP WY N+
Sbjct 43 RVGSAFPRGLPICLLTTTGRKSGEERISPLLFLEDGENVILVASQGGLPKNPMWYLNIKS 102
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
+P +QI S R ++A VA+ DERARLWP Y DFD+ Q+WT+R IPVI+ P
Sbjct 103 DPVVTVQIKSRIRSMKARVAAEDERARLWPMLRAMYPDFDNYQAWTDRVIPVIVCEP 159
>gi|343928246|ref|ZP_08767700.1| hypothetical protein GOALK_111_00150 [Gordonia alkanivorans NBRC
16433]
gi|343761840|dbj|GAA14626.1| hypothetical protein GOALK_111_00150 [Gordonia alkanivorans NBRC
16433]
Length=164
Score = 108 bits (269), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 52/117 (45%), Positives = 72/117 (62%), Gaps = 0/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+GAG +KP P LL RKSG+ PL+Y+ + +VVAS G A++P WY NL
Sbjct 45 RVGAGLKKPAPVCLLTTIGRKSGEPRTVPLVYLRRGDQFVVVASQGGSAKHPAWYLNLRD 104
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
NP +Q+G + + A A+ ERA +WP V+ YADFD+ +WT+R IPV+I P
Sbjct 105 NPSVTVQLGKETFDLTARTATEAERAEVWPELVEVYADFDTYVAWTDRTIPVVICEP 161
>gi|111026879|ref|YP_708857.1| hypothetical protein RHA1_ro11052 [Rhodococcus jostii RHA1]
gi|110825418|gb|ABH00699.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=155
Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 51/118 (44%), Positives = 73/118 (62%), Gaps = 0/118 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
R+G F TLL++H +SG + + L+Y D N+I+VAS G NP W+ NL
Sbjct 37 RIGHRFLHLKQTLLIDHVGARSGTHRTSGLVYAKDGANLILVASKGGHPRNPDWFHNLRA 96
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
NPDT +Q+G++RR V A VA DER RLW +D + F + Q+ T+R IP+++L PR
Sbjct 97 NPDTTVQVGTERRTVHARVAQQDERERLWALAIDVHPGFAAYQARTDREIPLVVLEPR 154
>gi|269127186|ref|YP_003300556.1| hypothetical protein Tcur_2976 [Thermomonospora curvata DSM 43183]
gi|268312144|gb|ACY98518.1| hypothetical protein Tcur_2976 [Thermomonospora curvata DSM 43183]
Length=163
Score = 99.8 bits (247), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/119 (45%), Positives = 70/119 (59%), Gaps = 1/119 (0%)
Query 1 LRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP 60
+G+ + + VP L RKSG PLLY+ D ++VVAS G +PQWY NL
Sbjct 44 FHVGSAWLRGVPVCLFTTIGRKSGLPRTMPLLYMRDGERIVVVASQGGLPGHPQWYLNLR 103
Query 61 PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWT-ERGIPVIILRP 118
NP+ +QIG +R +RA A ERA+LWPR + YADFD+ Q W +R IPV+I P
Sbjct 104 ANPEVVVQIGRKKRRMRARTADEAERAQLWPRLLKIYADFDNYQRWAGDRVIPVVICEP 162
>gi|183985003|ref|YP_001853294.1| hypothetical protein MMAR_5035 [Mycobacterium marinum M]
gi|183178329|gb|ACC43439.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=151
Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/116 (44%), Positives = 70/116 (61%), Gaps = 1/116 (0%)
Query 3 LGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPN 62
LG F+K +P LL RK+G+ V+PL + D + VIV AS G A+NP WY NL N
Sbjct 37 LGGTFQK-IPVALLTTTGRKTGEPRVSPLYFHRDGDRVIVAASKGGSAKNPMWYLNLKAN 95
Query 63 PDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
P +QI + + A A+ +ERAR WP+ VD Y ++ QSWT+R IP+++ P
Sbjct 96 PKVGVQIKKEVLDLTARDATDEERARYWPKLVDMYPSYEDYQSWTDRTIPIVVCEP 151
>gi|126437629|ref|YP_001073320.1| hypothetical protein Mjls_5065 [Mycobacterium sp. JLS]
gi|126237429|gb|ABO00830.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=150
Score = 97.4 bits (241), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 51/117 (44%), Positives = 69/117 (59%), Gaps = 1/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
RLG F K P LL RK+G+ V+PL+++ D VI+VAS G ++P WY NL
Sbjct 35 RLGGSFGK-APVALLTTIGRKTGEPRVSPLIFLRDGERVILVASRGGSDKHPMWYLNLRA 93
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
NP +QI D + A +A+ DE A WP+ Y DFDS +SWT+R IP++I P
Sbjct 94 NPTVQVQIRDDVLTLTARLATEDEHAHYWPKLTAMYPDFDSYRSWTDREIPIVICEP 150
>gi|108801643|ref|YP_641840.1| hypothetical protein Mmcs_4680 [Mycobacterium sp. MCS]
gi|119870794|ref|YP_940746.1| hypothetical protein Mkms_4766 [Mycobacterium sp. KMS]
gi|108772062|gb|ABG10784.1| hypothetical protein Mmcs_4680 [Mycobacterium sp. MCS]
gi|119696883|gb|ABL93956.1| Mycobacterium tuberculosis paralogous family 11 [Mycobacterium
sp. KMS]
Length=150
Score = 97.1 bits (240), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 50/117 (43%), Positives = 69/117 (59%), Gaps = 1/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
RLG F K P LL RK+G+ V+PL+++ D VI+VAS G ++P WY NL
Sbjct 35 RLGGSFGK-APVALLTTIGRKTGEPRVSPLIFLRDGERVILVASRGGSDKHPMWYLNLRA 93
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
NP +QI D + A +A+ DE A WP+ Y DFD+ +SWT+R IP++I P
Sbjct 94 NPTVQVQIKDDVLTLTARLATEDEHAHYWPKLTAMYPDFDNYRSWTDREIPIVICEP 150
>gi|300784815|ref|YP_003765106.1| hypothetical protein AMED_2910 [Amycolatopsis mediterranei U32]
gi|299794329|gb|ADJ44704.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340526241|gb|AEK41446.1| hypothetical protein RAM_14790 [Amycolatopsis mediterranei S699]
Length=156
Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/118 (41%), Positives = 70/118 (60%), Gaps = 1/118 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
++G + P LLL H RKSG+ ++PL+Y+ D ++VVAS G +P W+ NL
Sbjct 32 KVGGSYFGGAPVLLLHHVGRKSGEARISPLIYLDDAPRLVVVASKGGVDAHPAWFHNLTA 91
Query 62 NPDTHIQI-GSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
T +++ G +RR VR VA ERARLW R V Y + + ++TER IPV++L P
Sbjct 92 MATTEVELPGGERRRVRPHVAEGAERARLWDRAVAIYKTYAAYATYTERTIPVVVLEP 149
>gi|240172838|ref|ZP_04751497.1| hypothetical protein MkanA1_26232 [Mycobacterium kansasii ATCC
12478]
Length=151
Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/116 (41%), Positives = 69/116 (60%), Gaps = 1/116 (0%)
Query 3 LGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPN 62
LG F+K +P LL RK+G+ V+PL ++ D + VIV AS G +NP WY NL N
Sbjct 37 LGGTFQK-IPVALLTTTGRKTGQPRVSPLYFLRDGDRVIVAASKGGADKNPMWYLNLKAN 95
Query 63 PDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
P +QI + + A A+ +ERA+ W + VD Y ++ QSWT+R IP+++ P
Sbjct 96 PKVSVQIKKEILDLTARDATDEERAKYWRQLVDMYPTYEDYQSWTDRTIPIVVCEP 151
>gi|289445139|ref|ZP_06434883.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289418097|gb|EFD15298.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=151
Score = 92.4 bits (228), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/116 (41%), Positives = 65/116 (57%), Gaps = 1/116 (0%)
Query 3 LGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPN 62
LG F+K +P LL RK+G+ V PL ++ D VIV AS G +NP WY NL N
Sbjct 37 LGGTFQK-IPVALLTTTGRKTGQPRVNPLYFLRDGGRVIVAASKGGAEKNPMWYLNLKAN 95
Query 63 PDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
P +QI + + A A+ +ERA WP+ V Y + QSWT+R IP+++ P
Sbjct 96 PKVQVQIKKEVLDLTARNATDEERAEYWPQLVTMYPSYQDYQSWTDRTIPIVVCEP 151
>gi|15610683|ref|NP_218064.1| hypothetical protein Rv3547 [Mycobacterium tuberculosis H37Rv]
gi|15843159|ref|NP_338196.1| hypothetical protein MT3651 [Mycobacterium tuberculosis CDC1551]
gi|31794723|ref|NP_857216.1| hypothetical protein Mb3577 [Mycobacterium bovis AF2122/97]
79 more sequence titles
Length=151
Score = 92.0 bits (227), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/116 (41%), Positives = 65/116 (57%), Gaps = 1/116 (0%)
Query 3 LGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPN 62
LG F+K +P LL RK+G+ V PL ++ D VIV AS G +NP WY NL N
Sbjct 37 LGGTFQK-IPVALLTTTGRKTGQPRVNPLYFLRDGGRVIVAASKGGAEKNPMWYLNLKAN 95
Query 63 PDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
P +QI + + A A+ +ERA WP+ V Y + QSWT+R IP+++ P
Sbjct 96 PKVQVQIKKEVLDLTARDATDEERAEYWPQLVTMYPSYQDYQSWTDRTIPIVVCEP 151
>gi|41406617|ref|NP_959453.1| hypothetical protein MAP0519c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|254773570|ref|ZP_05215086.1| hypothetical protein MaviaA2_02685 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|41394966|gb|AAS02836.1| hypothetical protein MAP_0519c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=148
Score = 91.7 bits (226), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 49/117 (42%), Positives = 71/117 (61%), Gaps = 1/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
+ G F+K P LL RK+G+ V+PLLY+ + + VI+VAS G+ ++P WY NL
Sbjct 33 KWGGTFQKR-PVALLTTTGRKTGQPRVSPLLYLREGDRVILVASQGGRDKHPLWYLNLKA 91
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
NP +QI + ++A A+ +ER + WP+ V Y FD QSWT+R IPV+I P
Sbjct 92 NPKVSVQIKDEVLQLQARDATPEEREQYWPKLVAMYPSFDDYQSWTDRVIPVVICDP 148
>gi|288920589|ref|ZP_06414894.1| hypothetical protein FrEUN1fDRAFT_4592 [Frankia sp. EUN1f]
gi|288348010|gb|EFC82282.1| hypothetical protein FrEUN1fDRAFT_4592 [Frankia sp. EUN1f]
Length=144
Score = 91.3 bits (225), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 47/108 (44%), Positives = 68/108 (63%), Gaps = 0/108 (0%)
Query 11 VPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPNPDTHIQIG 70
V TLLL R+SG+ + L++ D ++ +VVAS G E+P+WYRNL NP IQ+
Sbjct 36 VHTLLLTTTGRRSGQPRTSALIFARDGDDYLVVASKGGAPEHPEWYRNLLANPAAEIQVQ 95
Query 71 SDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
++R VRA AS E+ARLW + ++D+ Q+ TER IPV++L P
Sbjct 96 AERFTVRAATASDAEKARLWEIVTKVWPNYDAYQTRTERPIPVVVLSP 143
>gi|118467215|ref|YP_879893.1| AclJ protein [Mycobacterium avium 104]
gi|118168502|gb|ABK69399.1| AclJ protein [Mycobacterium avium 104]
Length=142
Score = 91.3 bits (225), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 49/117 (42%), Positives = 71/117 (61%), Gaps = 1/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
+ G F+K P LL RK+G+ V+PLLY+ + + VI+VAS G+ ++P WY NL
Sbjct 27 KWGGTFQKR-PVALLTTTGRKTGQPRVSPLLYLREGDRVILVASQGGRDKHPLWYLNLKA 85
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
NP +QI + ++A A+ +ER + WP+ V Y FD QSWT+R IPV+I P
Sbjct 86 NPKVSVQIKDEVLQLQARDATPEEREQYWPKLVAMYPSFDDYQSWTDRVIPVVICDP 142
>gi|336458407|gb|EGO37381.1| deazaflavin-dependent nitroreductase family protein [Mycobacterium
avium subsp. paratuberculosis S397]
Length=138
Score = 91.3 bits (225), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 49/117 (42%), Positives = 71/117 (61%), Gaps = 1/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
+ G F+K P LL RK+G+ V+PLLY+ + + VI+VAS G+ ++P WY NL
Sbjct 23 KWGGTFQKR-PVALLTTTGRKTGQPRVSPLLYLREGDRVILVASQGGRDKHPLWYLNLKA 81
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
NP +QI + ++A A+ +ER + WP+ V Y FD QSWT+R IPV+I P
Sbjct 82 NPKVSVQIKDEVLQLQARDATPEEREQYWPKLVAMYPSFDDYQSWTDRVIPVVICDP 138
>gi|330466024|ref|YP_004403767.1| hypothetical protein VAB18032_10245 [Verrucosispora maris AB-18-032]
gi|328808995|gb|AEB43167.1| hypothetical protein VAB18032_10245 [Verrucosispora maris AB-18-032]
Length=145
Score = 91.3 bits (225), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 48/113 (43%), Positives = 66/113 (59%), Gaps = 0/113 (0%)
Query 4 GAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPNP 63
G +R V TLLL R R+SGK L+Y D ++VAS G +P WY NL +P
Sbjct 30 GHEWRPGVFTLLLTTRGRRSGKLRRTALIYGRDGEAYVIVASQGGDPRHPAWYLNLEADP 89
Query 64 DTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIIL 116
+Q+G+DR RA A+ +E+ARLWP D + +D Q+ TER IPV++L
Sbjct 90 TAQVQVGADRFTARARTATPEEKARLWPMMTDIWPAYDDYQAKTERDIPVVVL 142
>gi|288920357|ref|ZP_06414668.1| hypothetical protein FrEUN1fDRAFT_4366 [Frankia sp. EUN1f]
gi|288348234|gb|EFC82500.1| hypothetical protein FrEUN1fDRAFT_4366 [Frankia sp. EUN1f]
Length=151
Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 69/115 (60%), Gaps = 0/115 (0%)
Query 3 LGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPN 62
+ F + PTLLL R R+SGK PL+Y D N+ ++VAS G E+P WY NL +
Sbjct 35 IDGSFTQGAPTLLLTTRGRRSGKPRRTPLIYGRDGNDYVIVASKAGFPEHPLWYLNLLAH 94
Query 63 PDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILR 117
P+ +Q+ ++R P A VA E+ARLW + ++D+ QS T+R IP+++L
Sbjct 95 PEAELQVEAERFPANARVAEPAEKARLWLVMTALWPEYDAYQSRTDRDIPIVLLE 149
>gi|269126740|ref|YP_003300110.1| hypothetical protein Tcur_2511 [Thermomonospora curvata DSM 43183]
gi|268311698|gb|ACY98072.1| hypothetical protein Tcur_2511 [Thermomonospora curvata DSM 43183]
Length=134
Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/114 (37%), Positives = 64/114 (57%), Gaps = 0/114 (0%)
Query 3 LGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPN 62
+G + K P L+L + RKSG+ PL+Y + N +VVAS G ++P WY NL +
Sbjct 18 VGHDWEKGAPVLILTTKGRKSGQERKCPLIYQEHQGNYVVVASKGGAPDHPHWYLNLQAD 77
Query 63 PDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIIL 116
P+ +Q+ +D+ RA A ER LW + + D+D Q T+R IPV++L
Sbjct 78 PEVRVQVKADKFTARARTAEGAERDELWAKMAAVWPDYDEYQKKTDRQIPVVVL 131
>gi|296166578|ref|ZP_06849008.1| hypothetical protein HMPREF0591_2449 [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898064|gb|EFG77640.1| hypothetical protein HMPREF0591_2449 [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=150
Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/117 (42%), Positives = 66/117 (57%), Gaps = 1/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
RLG F P LL RK+G+ V+PLLY+ + N V++VAS G +NP WY NL
Sbjct 35 RLGGTFGN-APVALLTTTGRKTGEPRVSPLLYLREGNRVVLVASKGGSDKNPLWYLNLKA 93
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
NP+ +QI + +RA A+ ER WP+ Y F +SWT+R IP++I P
Sbjct 94 NPEVSVQIKDEVLRLRARDATEAEREEYWPKLDKMYPSFGDYRSWTDRVIPIVICDP 150
>gi|342862374|ref|ZP_08719014.1| hypothetical protein MCOL_25903 [Mycobacterium colombiense CECT
3035]
gi|342130045|gb|EGT83379.1| hypothetical protein MCOL_25903 [Mycobacterium colombiense CECT
3035]
Length=150
Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/117 (40%), Positives = 68/117 (59%), Gaps = 1/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
+ G F+K P LL RK+G+ V+PLLY+ + N V++VAS G ++P WY NL
Sbjct 35 KFGGTFQK-APVALLTTTGRKTGEPRVSPLLYLREGNRVVLVASKGGSDKHPLWYLNLKA 93
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
+P +QI + +RA A+ ERA WP+ Y FD ++WT+R IP++I P
Sbjct 94 DPKVSVQIKDEVLQLRARDATEAERAEYWPKLDAMYPSFDDYRAWTDRVIPIVICDP 150
>gi|254822629|ref|ZP_05227630.1| hypothetical protein MintA_22059 [Mycobacterium intracellulare
ATCC 13950]
Length=150
Score = 88.2 bits (217), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 47/117 (41%), Positives = 68/117 (59%), Gaps = 1/117 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
+ G F+K P LL RK+G+ V+PLLY+ + + V++VAS G ++P WY NL
Sbjct 35 KFGGTFQK-APVALLTTTGRKTGEPRVSPLLYLREGDRVVLVASKGGSDKHPLWYLNLKA 93
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
NP +QI + ++A A+ ERA WP+ Y FD +SWT+R IPV+I P
Sbjct 94 NPKVSVQIKDEVLHLQARDATEAERAEYWPKLDAMYPSFDDYRSWTDRVIPVVICDP 150
>gi|16945722|dbj|BAB72053.1| AclJ [Streptomyces galilaeus]
Length=156
Score = 87.8 bits (216), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 45/106 (43%), Positives = 63/106 (60%), Gaps = 0/106 (0%)
Query 13 TLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPNPDTHIQIGSD 72
TLLL R RKSG+ ++Y D N+IVV S G+ E+P WY NL +P+ H+QIG +
Sbjct 48 TLLLTTRGRKSGRLRRTAVIYGRDGGNIIVVGSNGGKPEHPLWYLNLVASPEAHVQIGEE 107
Query 73 RRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
VRA A+++E+ LW + + S Q T+R IPV+IL P
Sbjct 108 HLDVRARTATAEEKPELWKLMTGLFPQYKSYQKKTDREIPVVILEP 153
>gi|302529146|ref|ZP_07281488.1| proline iminopeptidase [Streptomyces sp. AA4]
gi|302438041|gb|EFL09857.1| proline iminopeptidase [Streptomyces sp. AA4]
Length=472
Score = 87.4 bits (215), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 42/116 (37%), Positives = 69/116 (60%), Gaps = 0/116 (0%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPP 61
++G +++ VPTL+L + RK+G++ L+Y D N ++VAS G +P WY NL
Sbjct 355 KVGHDWQEGVPTLVLTTKGRKTGQDRKFALIYQEDNGNPVIVASKGGAPNHPGWYLNLQE 414
Query 62 NPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILR 117
NP+ +Q+ D+ RA A+ +ERARLW + + ++ Q T+R IPV++L
Sbjct 415 NPEVRVQVWGDKFTARARTATGEERARLWDKLAAVWPAYNDYQKKTDREIPVVVLE 470
>gi|302530438|ref|ZP_07282780.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302439333|gb|EFL11149.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=134
Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/114 (40%), Positives = 63/114 (56%), Gaps = 1/114 (0%)
Query 5 AGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPNPD 64
G + +LL H KSGK V+PL+Y TD + +++VAS G NP WY NL NP
Sbjct 20 GGMFEGKNMVLLTHTGAKSGKQRVSPLVYTTDGDRIVIVASKGGADTNPDWYHNLVANPS 79
Query 65 THIQIGSDRRPVRA-VVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILR 117
++IG+D+ P A +V ER RL+ + V+ F + TER IPV +L
Sbjct 80 VTVEIGTDKFPATAKLVTDRAERDRLYAKMVEHAEGFAEYEKKTERLIPVFVLE 133
>gi|271969075|ref|YP_003343271.1| hypothetical protein Sros_7866 [Streptosporangium roseum DSM
43021]
gi|270512250|gb|ACZ90528.1| hypothetical protein Sros_7866 [Streptosporangium roseum DSM
43021]
Length=145
Score = 84.7 bits (208), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/109 (41%), Positives = 62/109 (57%), Gaps = 0/109 (0%)
Query 8 RKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPNPDTHI 67
R V TLLL R R+SG L+Y TD ++VV S G +P WY NL P H+
Sbjct 29 RWGVTTLLLITRGRRSGLPRRTALIYGTDGERLVVVGSNGGADAHPAWYLNLLAEPSAHV 88
Query 68 QIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIIL 116
Q+G+++ V AV A +ER RLW +AD++ ++ T R IPV++L
Sbjct 89 QVGAEQFAVTAVEAEGEERERLWEMMAGLWADYERYRTGTSRAIPVVVL 137
>gi|227121349|gb|ACP19381.1| hypothetical protein [Micromonospora sp. Tu 6368]
Length=349
Score = 84.3 bits (207), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 38/107 (36%), Positives = 63/107 (59%), Gaps = 0/107 (0%)
Query 12 PTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPNPDTHIQIGS 71
PTLLL R R++G+ PL+Y D + ++++AS +NP WY+NL +P+ +Q+ +
Sbjct 240 PTLLLTTRGRRTGRQIRTPLIYGRDGDRIVLIASNGASPKNPHWYQNLVADPEVRVQVRA 299
Query 72 DRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
DR +A+++ER RLW + +D + T R IPV++L P
Sbjct 300 DRFVATGRIATAEERPRLWELMAKIFPKYDEYATETTRDIPVVVLEP 346
>gi|254823479|ref|ZP_05228480.1| hypothetical protein MintA_26353 [Mycobacterium intracellulare
ATCC 13950]
Length=161
Score = 83.6 bits (205), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/114 (40%), Positives = 62/114 (55%), Gaps = 0/114 (0%)
Query 5 AGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPNPD 64
AG + LL RKSG + PL+++ D +++VAS G +P W+ NL NPD
Sbjct 47 AGHELGLALCLLTTTGRKSGLSRTTPLIFMADDARLVLVASQGGLPTHPMWFLNLRANPD 106
Query 65 THIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
+QI + R +RA VA ER +LWPR + QSWT+R IPV+I P
Sbjct 107 VRVQIRAQTRSMRARVADETERTQLWPRLTAHNPRWARYQSWTDRVIPVVICEP 160
>gi|333922113|ref|YP_004495694.1| hypothetical protein AS9A_4461 [Amycolicicoccus subflavus DQS3-9A1]
gi|333484334|gb|AEF42894.1| hypothetical protein AS9A_4461 [Amycolicicoccus subflavus DQS3-9A1]
Length=146
Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/113 (44%), Positives = 63/113 (56%), Gaps = 7/113 (6%)
Query 11 VPTLLLEHRSRKSGKNFVAPLLYIT----DRNNVIVVASALGQAENPQWYRNLPPNPDTH 66
+P + L RKSG + Y+T D N V++VAS G NP WYRNL NPD
Sbjct 36 MPVVELHTVGRKSG---LPRSCYLTSPVHDSNRVVLVASKGGDDRNPDWYRNLQSNPDAE 92
Query 67 IQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRPR 119
+ I RR V A AS+DE+A LWP+ V AY + + Q T+R IPV+I R
Sbjct 93 LVIHGSRRKVHARTASADEKAELWPQIVAAYKGYANYQQRTDRDIPVVICEFR 145
>gi|227121348|gb|ACP19380.1| hypothetical protein [Micromonospora sp. Tu 6368]
Length=151
Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 38/107 (36%), Positives = 63/107 (59%), Gaps = 0/107 (0%)
Query 12 PTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLPPNPDTHIQIGS 71
PTLLL R R++G+ PL+Y D + ++++AS +NP WY+NL +P+ +Q+ +
Sbjct 42 PTLLLTTRGRRTGRQIRTPLIYGRDGDRIVLIASNGASPKNPHWYQNLVADPEVRVQVRA 101
Query 72 DRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
DR +A+++ER RLW + +D + T R IPV++L P
Sbjct 102 DRFVATGRIATAEERPRLWELMAKIFPKYDEYATETTRDIPVVVLEP 148
>gi|84498004|ref|ZP_00996801.1| hypothetical protein JNB_17993 [Janibacter sp. HTCC2649]
gi|84381504|gb|EAP97387.1| hypothetical protein JNB_17993 [Janibacter sp. HTCC2649]
Length=136
Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/117 (39%), Positives = 64/117 (55%), Gaps = 2/117 (1%)
Query 2 RLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYI-TDRNNVIVVASALGQAENPQWYRNLP 60
++G F P +L+ H K+G V PL+Y D ++I+VAS G +NP W+ N+
Sbjct 18 KVGGPFEG-APMILIHHLGAKTGTERVTPLMYFPQDDGSMIIVASKAGAPDNPDWFHNVR 76
Query 61 PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILR 117
NP +++G+ PV A V DER LWPR V+ F Q T+R IPV+ LR
Sbjct 77 ANPRFDVEVGTQTFPVEAEVLPRDERDALWPRLVEKAPGFGGYQENTDRIIPVVRLR 133
>gi|333992336|ref|YP_004524950.1| hypothetical protein JDM601_3696 [Mycobacterium sp. JDM601]
gi|333488304|gb|AEF37696.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=159
Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 46/118 (39%), Positives = 68/118 (58%), Gaps = 1/118 (0%)
Query 2 RLGAGFRKPVPTL-LLEHRSRKSGKNFVAPLLYITDRNNVIVVASALGQAENPQWYRNLP 60
RLG VP + +L RK+G+ +PLL++ + + VI+VAS G+A NP WY NL
Sbjct 42 RLGTKHFGEVPEVGILTTTGRKTGEPRESPLLFLREGDRVILVASQGGRATNPMWYGNLK 101
Query 61 PNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYADFDSCQSWTERGIPVIILRP 118
NP +I ++ +RA A ERA WP+ Y DF + ++WT+R IP++I P
Sbjct 102 ANPQVSFRIRNEVLRLRARDADESERAAYWPKLDAMYPDFVNYRAWTDREIPIVICEP 159
Lambda K H
0.321 0.138 0.434
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 129033565320
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40