BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3527
Length=149
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610663|ref|NP_218044.1| hypothetical protein Rv3527 [Mycoba... 307 4e-82
gi|298527005|ref|ZP_07014414.1| conserved hypothetical protein [... 305 1e-81
gi|340628491|ref|YP_004746943.1| hypothetical protein MCAN_35381... 298 2e-79
gi|183984984|ref|YP_001853275.1| hypothetical protein MMAR_5016 ... 229 1e-58
gi|118619275|ref|YP_907607.1| hypothetical protein MUL_4090 [Myc... 228 3e-58
gi|254822611|ref|ZP_05227612.1| hypothetical protein MintA_21959... 228 3e-58
gi|240172362|ref|ZP_04751021.1| hypothetical protein MkanA1_2380... 216 1e-54
gi|118464889|ref|YP_879912.1| hypothetical protein MAV_0632 [Myc... 214 5e-54
gi|254773589|ref|ZP_05215105.1| hypothetical protein MaviaA2_027... 213 7e-54
gi|296166558|ref|ZP_06848988.1| conserved hypothetical protein [... 209 8e-53
gi|41406636|ref|NP_959472.1| hypothetical protein MAP0538c [Myco... 209 8e-53
gi|118473815|ref|YP_890152.1| hypothetical protein MSMEG_5927 [M... 185 2e-45
gi|120406174|ref|YP_956003.1| hypothetical protein Mvan_5226 [My... 181 3e-44
gi|342862261|ref|ZP_08718903.1| hypothetical protein MCOL_25346 ... 180 6e-44
gi|315442563|ref|YP_004075442.1| hypothetical protein Mspyr1_091... 177 4e-43
gi|145222124|ref|YP_001132802.1| hypothetical protein Mflv_1532 ... 177 5e-43
gi|126437587|ref|YP_001073278.1| hypothetical protein Mjls_5023 ... 163 6e-39
gi|108801604|ref|YP_641801.1| hypothetical protein Mmcs_4641 [My... 163 7e-39
gi|169631252|ref|YP_001704901.1| hypothetical protein MAB_4174 [... 150 6e-35
gi|336458425|gb|EGO37399.1| hypothetical protein MAPs_13200 [Myc... 142 2e-32
gi|333992309|ref|YP_004524923.1| hypothetical protein JDM601_366... 133 8e-30
gi|326384578|ref|ZP_08206257.1| hypothetical protein SCNU_16638 ... 108 3e-22
gi|262200915|ref|YP_003272123.1| hypothetical protein Gbro_0918 ... 107 6e-22
gi|343925869|ref|ZP_08765384.1| hypothetical protein GOALK_050_0... 94.7 4e-18
gi|312138144|ref|YP_004005480.1| hypothetical protein REQ_06780 ... 86.7 1e-15
gi|111021509|ref|YP_704481.1| hypothetical protein RHA1_ro04537 ... 85.5 2e-15
gi|226363862|ref|YP_002781644.1| hypothetical protein ROP_44520 ... 84.3 5e-15
gi|229494182|ref|ZP_04387945.1| conserved hypothetical protein [... 83.2 1e-14
gi|54022440|ref|YP_116682.1| hypothetical protein nfa4760 [Nocar... 83.2 1e-14
gi|226304242|ref|YP_002764200.1| hypothetical protein RER_07530 ... 82.0 2e-14
gi|319948617|ref|ZP_08022741.1| hypothetical protein ES5_04523 [... 74.7 4e-12
gi|302525660|ref|ZP_07278002.1| conserved hypothetical protein [... 69.3 2e-10
gi|324106152|gb|ADY18319.1| putative ferredoxin [Rhodococcus rho... 69.3 2e-10
gi|333918592|ref|YP_004492173.1| hypothetical protein AS9A_0921 ... 68.2 4e-10
gi|300784424|ref|YP_003764715.1| hypothetical protein AMED_2518 ... 61.6 4e-08
gi|325674651|ref|ZP_08154338.1| hypothetical protein HMPREF0724_... 59.3 2e-07
gi|312138357|ref|YP_004005693.1| hypothetical protein REQ_08970 ... 57.8 6e-07
gi|145595198|ref|YP_001159495.1| hypothetical protein Strop_2673... 54.7 5e-06
gi|296141521|ref|YP_003648764.1| hypothetical protein Tpau_3850 ... 53.5 1e-05
gi|159038445|ref|YP_001537698.1| hypothetical protein Sare_2881 ... 52.0 3e-05
gi|311744216|ref|ZP_07718020.1| conserved hypothetical protein [... 52.0 3e-05
gi|54024211|ref|YP_118453.1| hypothetical protein nfa22420 [Noca... 50.1 1e-04
gi|254818664|ref|ZP_05223665.1| hypothetical protein MintA_02009... 48.9 2e-04
gi|342860667|ref|ZP_08717317.1| hypothetical protein MCOL_17388 ... 47.4 7e-04
gi|343924797|ref|ZP_08764336.1| hypothetical protein GOALK_026_0... 47.0 0.001
gi|324106142|gb|ADY18311.1| putative ferredoxin [Rhodococcus rho... 46.6 0.001
gi|326331640|ref|ZP_08197928.1| hypothetical protein NBCG_03079 ... 46.6 0.001
gi|119716745|ref|YP_923710.1| hypothetical protein Noca_2519 [No... 46.6 0.001
gi|145225143|ref|YP_001135821.1| hypothetical protein Mflv_4565 ... 46.2 0.002
gi|226361623|ref|YP_002779401.1| hypothetical protein ROP_22090 ... 45.8 0.002
>gi|15610663|ref|NP_218044.1| hypothetical protein Rv3527 [Mycobacterium tuberculosis H37Rv]
gi|15843137|ref|NP_338174.1| hypothetical protein MT3627 [Mycobacterium tuberculosis CDC1551]
gi|31794703|ref|NP_857196.1| hypothetical protein Mb3557 [Mycobacterium bovis AF2122/97]
76 more sequence titles
Length=149
Score = 307 bits (786), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 149/149 (100%), Positives = 149/149 (100%), Gaps = 0/149 (0%)
Query 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREA 60
MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREA
Sbjct 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREA 60
Query 61 SRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDS 120
SRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDS
Sbjct 61 SRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDS 120
Query 121 ARTRSCPRLTDSIEHAVAEGYLEHHDPNR 149
ARTRSCPRLTDSIEHAVAEGYLEHHDPNR
Sbjct 121 ARTRSCPRLTDSIEHAVAEGYLEHHDPNR 149
>gi|298527005|ref|ZP_07014414.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298496799|gb|EFI32093.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=149
Score = 305 bits (782), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 148/149 (99%), Positives = 149/149 (100%), Gaps = 0/149 (0%)
Query 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREA 60
MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREA
Sbjct 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREA 60
Query 61 SRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDS 120
SRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDS
Sbjct 61 SRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDS 120
Query 121 ARTRSCPRLTDSIEHAVAEGYLEHHDPNR 149
ARTRSCPRLTDSIEHAVAEGYL+HHDPNR
Sbjct 121 ARTRSCPRLTDSIEHAVAEGYLKHHDPNR 149
>gi|340628491|ref|YP_004746943.1| hypothetical protein MCAN_35381 [Mycobacterium canettii CIPT
140010059]
gi|340006681|emb|CCC45869.1| hypothetical protein MCAN_35381 [Mycobacterium canettii CIPT
140010059]
Length=149
Score = 298 bits (762), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 145/149 (98%), Positives = 146/149 (98%), Gaps = 0/149 (0%)
Query 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREA 60
MPDDQPAVPDVDRLAR MLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREA
Sbjct 1 MPDDQPAVPDVDRLARLMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREA 60
Query 61 SRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDS 120
SRAERDRYLTSGLQPVDCR CHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDS
Sbjct 61 SRAERDRYLTSGLQPVDCRCCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDS 120
Query 121 ARTRSCPRLTDSIEHAVAEGYLEHHDPNR 149
ARTR+CPRL DSIEHAVAEGYLEHHDPNR
Sbjct 121 ARTRACPRLADSIEHAVAEGYLEHHDPNR 149
>gi|183984984|ref|YP_001853275.1| hypothetical protein MMAR_5016 [Mycobacterium marinum M]
gi|183178310|gb|ACC43420.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=149
Score = 229 bits (583), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 108/147 (74%), Positives = 128/147 (88%), Gaps = 1/147 (0%)
Query 2 PDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREAS 61
PDDQ VPDVD+LARSMLLL GDHHDH +S EQH WSKSRDF +DPQR AAVR+AS
Sbjct 4 PDDQ-EVPDVDQLARSMLLLLGDHHDHQESSEQHHGPQPWSKSRDFTNDPQRGAAVRQAS 62
Query 62 RAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSA 121
A+R+RYLT+GLQPVDCRFCHVTVTVK+LGPGHTAVQWNTEA++RCA+F E+R GGD+A
Sbjct 63 LADRERYLTAGLQPVDCRFCHVTVTVKQLGPGHTAVQWNTEAAQRCAHFAEVRRSGGDTA 122
Query 122 RTRSCPRLTDSIEHAVAEGYLEHHDPN 148
RT++CP++TDSI+HA+AEG L HDP+
Sbjct 123 RTKACPKMTDSIQHAIAEGVLVQHDPD 149
>gi|118619275|ref|YP_907607.1| hypothetical protein MUL_4090 [Mycobacterium ulcerans Agy99]
gi|118571385|gb|ABL06136.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=149
Score = 228 bits (580), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 107/147 (73%), Positives = 128/147 (88%), Gaps = 1/147 (0%)
Query 2 PDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREAS 61
PDDQ VPDVD+LARSMLLL GDHHDH +S EQH WSKSRDF +DPQR AAVR+AS
Sbjct 4 PDDQ-EVPDVDQLARSMLLLLGDHHDHQESSEQHHGRQPWSKSRDFTNDPQRGAAVRQAS 62
Query 62 RAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSA 121
A+R+RYLT+GLQPVDCRFCHVTVTVK+LGPGHTAVQWNTEA++RCA+F E+R GGD+A
Sbjct 63 LADRERYLTAGLQPVDCRFCHVTVTVKQLGPGHTAVQWNTEAAQRCAHFAEVRRSGGDTA 122
Query 122 RTRSCPRLTDSIEHAVAEGYLEHHDPN 148
RT++CP++TDSI+HA+AEG + HDP+
Sbjct 123 RTKACPKMTDSIQHAIAEGVVVQHDPD 149
>gi|254822611|ref|ZP_05227612.1| hypothetical protein MintA_21959 [Mycobacterium intracellulare
ATCC 13950]
Length=150
Score = 228 bits (580), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 112/144 (78%), Positives = 124/144 (87%), Gaps = 1/144 (0%)
Query 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCG-SWSKSRDFADDPQRAAAVRE 59
M DDQP VPDVDRLARSML LHGDHHDH + P + SWSKSRDFA+DPQRAA V E
Sbjct 1 MADDQPQVPDVDRLARSMLQLHGDHHDHEEKPAGNGGGSRSWSKSRDFANDPQRAATVAE 60
Query 60 ASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGD 119
ASRA+R+RYL SGL PVDCRFCHV VTV+RLGPGHTAVQWNTEA +RCA+FTE+R GGD
Sbjct 61 ASRADRERYLNSGLLPVDCRFCHVRVTVRRLGPGHTAVQWNTEAWQRCAHFTEVRESGGD 120
Query 120 SARTRSCPRLTDSIEHAVAEGYLE 143
+ARTRSCP+L+DSIEHAVAEGYLE
Sbjct 121 TARTRSCPKLSDSIEHAVAEGYLE 144
>gi|240172362|ref|ZP_04751021.1| hypothetical protein MkanA1_23808 [Mycobacterium kansasii ATCC
12478]
Length=149
Score = 216 bits (549), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 108/147 (74%), Positives = 126/147 (86%), Gaps = 3/147 (2%)
Query 2 PDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREAS 61
PDDQ VPDVDRLARSMLLL G HHDH + ++H S SK+ DFA+DPQRAAAVREAS
Sbjct 4 PDDQQ-VPDVDRLARSMLLLRGGHHDHRVASDEHAARRSSSKAWDFANDPQRAAAVREAS 62
Query 62 RAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSA 121
RA+R+RYLT GLQPVDCRFCHVTVTVKR GPGHTAVQWNTEA +RCA+F E+RA GGD+A
Sbjct 63 RADRERYLTGGLQPVDCRFCHVTVTVKRHGPGHTAVQWNTEAVQRCAHFAEVRASGGDTA 122
Query 122 RTRSCPRLTDSIEHAVAEGYL--EHHD 146
RT++CP+++DSI+HA+AEGYL EH D
Sbjct 123 RTKACPKMSDSIDHAIAEGYLTEEHQD 149
>gi|118464889|ref|YP_879912.1| hypothetical protein MAV_0632 [Mycobacterium avium 104]
gi|118166176|gb|ABK67073.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=145
Score = 214 bits (544), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 113/144 (79%), Positives = 124/144 (87%), Gaps = 1/144 (0%)
Query 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQH-RTCGSWSKSRDFADDPQRAAAVRE 59
M DDQPAVPDVDRLARSMLLLHGDHHDH++ P GSWSKSRDF DDPQRAAAV E
Sbjct 1 MTDDQPAVPDVDRLARSMLLLHGDHHDHDERPAASPANGGSWSKSRDFTDDPQRAAAVAE 60
Query 60 ASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGD 119
ASRA+R+RYL SGL PVDCRFCH V V+RLGPGHTAVQWNTEA +RCAYFTE+R GGD
Sbjct 61 ASRADRERYLNSGLLPVDCRFCHARVAVRRLGPGHTAVQWNTEAWQRCAYFTEVRQAGGD 120
Query 120 SARTRSCPRLTDSIEHAVAEGYLE 143
+ART+SCP+L DSIEHAVAEGYL+
Sbjct 121 TARTKSCPKLGDSIEHAVAEGYLD 144
>gi|254773589|ref|ZP_05215105.1| hypothetical protein MaviaA2_02780 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=145
Score = 213 bits (542), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 113/144 (79%), Positives = 124/144 (87%), Gaps = 1/144 (0%)
Query 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQH-RTCGSWSKSRDFADDPQRAAAVRE 59
M DDQPAVPDVDRLARSMLLLHGDHHDH++ P GSWSKSRDF DDPQRAAAV E
Sbjct 1 MTDDQPAVPDVDRLARSMLLLHGDHHDHDERPAASPANGGSWSKSRDFTDDPQRAAAVAE 60
Query 60 ASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGD 119
ASRA+R+RYL SGL PVDCRFCH V V+RLGPGHTAVQWNTEA +RCAYFTE+R GGD
Sbjct 61 ASRADRERYLNSGLLPVDCRFCHARVAVRRLGPGHTAVQWNTEAWQRCAYFTEVRRAGGD 120
Query 120 SARTRSCPRLTDSIEHAVAEGYLE 143
+ART+SCP+L DSIEHAVAEGYL+
Sbjct 121 TARTKSCPKLGDSIEHAVAEGYLD 144
>gi|296166558|ref|ZP_06848988.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898044|gb|EFG77620.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=155
Score = 209 bits (533), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 113/147 (77%), Positives = 126/147 (86%), Gaps = 5/147 (3%)
Query 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSW-----SKSRDFADDPQRAA 55
MPDDQPAVPDVDRLARSMLLLHGD HDH++ P G SKSRDF++DPQRAA
Sbjct 1 MPDDQPAVPDVDRLARSMLLLHGDGHDHDERPSSSDGNGGARAGLASKSRDFSNDPQRAA 60
Query 56 AVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRA 115
AV EASRA+R+RYL+SGL PVDCRFCHVTVTV+RLGPGHTAVQWNTEA +RCA+FTE+R
Sbjct 61 AVAEASRADRERYLSSGLLPVDCRFCHVTVTVRRLGPGHTAVQWNTEAWQRCAHFTEVRN 120
Query 116 RGGDSARTRSCPRLTDSIEHAVAEGYL 142
GGD+AR RSCP+LTDSIEHAVAEGYL
Sbjct 121 SGGDTARIRSCPKLTDSIEHAVAEGYL 147
>gi|41406636|ref|NP_959472.1| hypothetical protein MAP0538c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41394985|gb|AAS02855.1| hypothetical protein MAP_0538c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=145
Score = 209 bits (533), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 111/144 (78%), Positives = 124/144 (87%), Gaps = 1/144 (0%)
Query 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQH-RTCGSWSKSRDFADDPQRAAAVRE 59
M DDQPAVPDVDRLARSMLLLHGDHHDH++ P GSWSKSRDF DDPQRAAAV E
Sbjct 1 MTDDQPAVPDVDRLARSMLLLHGDHHDHDERPAASPANGGSWSKSRDFTDDPQRAAAVAE 60
Query 60 ASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGD 119
ASRA+R+RYL SGL PVDCRFCH V V+RLGPGHTAVQW+TEA +RCAYFTE+R GG+
Sbjct 61 ASRADRERYLNSGLLPVDCRFCHARVAVRRLGPGHTAVQWSTEAWQRCAYFTEVRRAGGN 120
Query 120 SARTRSCPRLTDSIEHAVAEGYLE 143
+ART+SCP+L DSIEHAVAEGYL+
Sbjct 121 TARTKSCPKLGDSIEHAVAEGYLD 144
>gi|118473815|ref|YP_890152.1| hypothetical protein MSMEG_5927 [Mycobacterium smegmatis str.
MC2 155]
gi|118175102|gb|ABK75998.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=158
Score = 185 bits (470), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 93/136 (69%), Positives = 105/136 (78%), Gaps = 0/136 (0%)
Query 8 VPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREASRAERDR 67
PDV LARSMLLLHG H D + GSWSK+ +FA DPQRAAAVREAS +R+R
Sbjct 12 TPDVGDLARSMLLLHGIHDDDDHPAGNQPGTGSWSKAPNFASDPQRAAAVREASLRDRER 71
Query 68 YLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSARTRSCP 127
YLTSGL VDCR CHV+V VK+LGP HTAVQWN+EA++RCA F E+RA GGD ART SCP
Sbjct 72 YLTSGLVSVDCRHCHVSVQVKKLGPTHTAVQWNSEATQRCAVFNEIRAAGGDPARTPSCP 131
Query 128 RLTDSIEHAVAEGYLE 143
RL+DSI HAVAEG LE
Sbjct 132 RLSDSIRHAVAEGVLE 147
>gi|120406174|ref|YP_956003.1| hypothetical protein Mvan_5226 [Mycobacterium vanbaalenii PYR-1]
gi|119958992|gb|ABM15997.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=152
Score = 181 bits (459), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 91/141 (65%), Positives = 109/141 (78%), Gaps = 4/141 (2%)
Query 8 VPDVDRLARSMLLLHG---DHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREASRAE 64
PDVDRLARSMLLLHG + H+H +P GSW K+ DFA DP RAAAVRE +R +
Sbjct 4 APDVDRLARSMLLLHGHDDEEHEHGHTPAGDGP-GSWRKAPDFASDPDRAAAVREGTRRD 62
Query 65 RDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSARTR 124
R+RYLT+GL VDCRFCHV+V VK+LGP HT+VQW ++A++RCA+F E+R GGD AR R
Sbjct 63 RERYLTAGLASVDCRFCHVSVQVKKLGPAHTSVQWTSDATQRCAHFAEIREAGGDPARER 122
Query 125 SCPRLTDSIEHAVAEGYLEHH 145
SCPRLTDSI HAVAEG LE +
Sbjct 123 SCPRLTDSIRHAVAEGCLEEY 143
>gi|342862261|ref|ZP_08718903.1| hypothetical protein MCOL_25346 [Mycobacterium colombiense CECT
3035]
gi|342130339|gb|EGT83659.1| hypothetical protein MCOL_25346 [Mycobacterium colombiense CECT
3035]
Length=149
Score = 180 bits (457), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 115/145 (80%), Positives = 128/145 (89%), Gaps = 0/145 (0%)
Query 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREA 60
M DDQP VPDVD+LARSMLLLHGDHHDH+D P GSWSKSRDFA+DPQRAAAV EA
Sbjct 1 MADDQPKVPDVDQLARSMLLLHGDHHDHDDKPVAGGGSGSWSKSRDFANDPQRAAAVAEA 60
Query 61 SRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDS 120
SRA+R+RYL+SGL PVDCRFCHV VTV+RLGPGHTAVQWNTEA +RCAYFTE+R GGD+
Sbjct 61 SRADRERYLSSGLLPVDCRFCHVRVTVRRLGPGHTAVQWNTEAWQRCAYFTEVRESGGDT 120
Query 121 ARTRSCPRLTDSIEHAVAEGYLEHH 145
ART+SCP+L DSIEHAVAEGYL+H
Sbjct 121 ARTKSCPKLGDSIEHAVAEGYLDHQ 145
>gi|315442563|ref|YP_004075442.1| hypothetical protein Mspyr1_09160 [Mycobacterium sp. Spyr1]
gi|315260866|gb|ADT97607.1| hypothetical protein Mspyr1_09160 [Mycobacterium sp. Spyr1]
Length=160
Score = 177 bits (449), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 90/142 (64%), Positives = 107/142 (76%), Gaps = 6/142 (4%)
Query 9 PDVDRLARSMLLLHGD-----HHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREASRA 63
PDVDRLARSMLLLHG H H +P + R GSW K+ DF DP RAAAVR +S+A
Sbjct 5 PDVDRLARSMLLLHGHDDEGHEHGHTPAPAESRP-GSWRKAPDFTSDPDRAAAVRASSQA 63
Query 64 ERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSART 123
+R+RYL SGL VDCRFCHV+V VK+LGP HT+VQW +A++RCA+FTE+R GGD AR+
Sbjct 64 DRERYLNSGLASVDCRFCHVSVKVKKLGPAHTSVQWTRDATQRCAHFTEIRDAGGDPARS 123
Query 124 RSCPRLTDSIEHAVAEGYLEHH 145
SCPRL DSI HAVAEG LE +
Sbjct 124 PSCPRLADSIRHAVAEGCLEEY 145
>gi|145222124|ref|YP_001132802.1| hypothetical protein Mflv_1532 [Mycobacterium gilvum PYR-GCK]
gi|145214610|gb|ABP44014.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=160
Score = 177 bits (449), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 90/142 (64%), Positives = 107/142 (76%), Gaps = 6/142 (4%)
Query 9 PDVDRLARSMLLLHGD-----HHDHNDSPEQHRTCGSWSKSRDFADDPQRAAAVREASRA 63
PDVDRLARSMLLLHG H H +P + R GSW K+ DF DP RAAAVR +S+A
Sbjct 5 PDVDRLARSMLLLHGHDDEGHEHGHTPAPTESRP-GSWRKAPDFTSDPDRAAAVRASSQA 63
Query 64 ERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSART 123
+R+RYL SGL VDCRFCHV+V VK+LGP HT+VQW +A++RCA+FTE+R GGD AR+
Sbjct 64 DRERYLNSGLASVDCRFCHVSVKVKKLGPAHTSVQWTRDATQRCAHFTEIRDAGGDPARS 123
Query 124 RSCPRLTDSIEHAVAEGYLEHH 145
SCPRL DSI HAVAEG LE +
Sbjct 124 PSCPRLADSIRHAVAEGCLEEY 145
>gi|126437587|ref|YP_001073278.1| hypothetical protein Mjls_5023 [Mycobacterium sp. JLS]
gi|126237387|gb|ABO00788.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=156
Score = 163 bits (413), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 93/145 (65%), Positives = 112/145 (78%), Gaps = 4/145 (2%)
Query 2 PDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCG--SWSKSRDFADDPQRAAAVRE 59
P++Q A +VDRLARSMLLLHG H D T G SWSK+ FADD +RAAAVRE
Sbjct 4 PEEQDA--EVDRLARSMLLLHGGHDDDGHDHPAPSTGGRKSWSKAPSFADDEERAAAVRE 61
Query 60 ASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGD 119
A+R +R+RYL SGL VDCRFCHV+V VK+LG GHT+VQW++EAS+RCAYF+E+R GG
Sbjct 62 ATRRDRERYLNSGLVAVDCRFCHVSVKVKKLGRGHTSVQWSSEASQRCAYFSEVREAGGQ 121
Query 120 SARTRSCPRLTDSIEHAVAEGYLEH 144
ART++CPRL DSI+HAVAEG LE
Sbjct 122 PARTKACPRLADSIKHAVAEGCLEE 146
>gi|108801604|ref|YP_641801.1| hypothetical protein Mmcs_4641 [Mycobacterium sp. MCS]
gi|119870758|ref|YP_940710.1| hypothetical protein Mkms_4729 [Mycobacterium sp. KMS]
gi|108772023|gb|ABG10745.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119696847|gb|ABL93920.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=155
Score = 163 bits (413), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 95/145 (66%), Positives = 115/145 (80%), Gaps = 5/145 (3%)
Query 2 PDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCG--SWSKSRDFADDPQRAAAVRE 59
P++Q A +VDRLARSMLLLHG H D +D P T G SWSK+ FADD +RAAAVRE
Sbjct 4 PEEQDA--EVDRLARSMLLLHGGHDDDHDHPAPS-TGGRKSWSKAPSFADDEERAAAVRE 60
Query 60 ASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGD 119
A+R +R+RYL SGL VDCRFCHV+V VK+LG GHT+VQW++EAS+RCAYF+E+R GG
Sbjct 61 ATRRDRERYLNSGLVAVDCRFCHVSVKVKKLGRGHTSVQWSSEASQRCAYFSEVREAGGQ 120
Query 120 SARTRSCPRLTDSIEHAVAEGYLEH 144
ART++CPRL DSI+HAVAEG LE
Sbjct 121 PARTKACPRLADSIKHAVAEGCLEE 145
>gi|169631252|ref|YP_001704901.1| hypothetical protein MAB_4174 [Mycobacterium abscessus ATCC 19977]
gi|169243219|emb|CAM64247.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=171
Score = 150 bits (379), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 73/137 (54%), Positives = 96/137 (71%), Gaps = 2/137 (1%)
Query 9 PDVDRLARSMLLLHGDHHDHN--DSPEQHRTCGSWSKSRDFADDPQRAAAVREASRAERD 66
P V+ LA+SML LHG H + ++ R G WSK+ F DP+RAA + A+ ++
Sbjct 24 PTVEELAQSMLSLHGAHGEDTAHEANAPGRPNGHWSKAPAFEHDPERAARLHAATEGDKQ 83
Query 67 RYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSARTRSC 126
RYLTSGLQ VDCRFCH +V VK+LG HTAVQWNT A+ +CA+F ++R G ++AR R C
Sbjct 84 RYLTSGLQEVDCRFCHGSVMVKKLGVNHTAVQWNTAATDKCAFFAQMREEGTNTARIRGC 143
Query 127 PRLTDSIEHAVAEGYLE 143
PRL+DSI HA+AEG L+
Sbjct 144 PRLSDSIAHAIAEGCLD 160
>gi|336458425|gb|EGO37399.1| hypothetical protein MAPs_13200 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=95
Score = 142 bits (357), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 67/90 (75%), Positives = 78/90 (87%), Gaps = 0/90 (0%)
Query 54 AAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTEL 113
A V EASRA+R+RYL SGL PVDCRFCH V V+RLGPGHTAVQW+TEA +RCAYFTE+
Sbjct 5 ALPVAEASRADRERYLNSGLLPVDCRFCHARVAVRRLGPGHTAVQWSTEAWQRCAYFTEV 64
Query 114 RARGGDSARTRSCPRLTDSIEHAVAEGYLE 143
R GG++ART+SCP+L DSIEHAVAEGYL+
Sbjct 65 RRAGGNTARTKSCPKLGDSIEHAVAEGYLD 94
>gi|333992309|ref|YP_004524923.1| hypothetical protein JDM601_3669 [Mycobacterium sp. JDM601]
gi|333488277|gb|AEF37669.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=158
Score = 133 bits (335), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 70/145 (49%), Positives = 94/145 (65%), Gaps = 2/145 (1%)
Query 1 MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRD--FADDPQRAAAVR 58
M ++ VP V+ LARSML L+G H DH+D P+ + SR F DP R A+R
Sbjct 1 MSPEERGVPAVEDLARSMLALYGPHDDHHDEPDTDDSGPDTRYSRQPLFPVDPARYEAIR 60
Query 59 EASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGG 118
A+ + +RYL SGL PV+CR C +V VK+LGPG+T+VQW++E+ +CA+F R G
Sbjct 61 AATERDSERYLHSGLVPVECRHCRASVEVKKLGPGYTSVQWSSESLTQCAFFAAEREAGH 120
Query 119 DSARTRSCPRLTDSIEHAVAEGYLE 143
S+R+R CP L SI HAVAEG+LE
Sbjct 121 ASSRSRGCPNLAKSIRHAVAEGHLE 145
>gi|326384578|ref|ZP_08206257.1| hypothetical protein SCNU_16638 [Gordonia neofelifaecis NRRL
B-59395]
gi|326196712|gb|EGD53907.1| hypothetical protein SCNU_16638 [Gordonia neofelifaecis NRRL
B-59395]
Length=127
Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 50/116 (44%), Positives = 75/116 (65%), Gaps = 0/116 (0%)
Query 33 EQHRTCGSWSKSRDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGP 92
++ T W+K+ +ADDP R AA+ +A+ +R YLT G+ ++CR CH V VK+
Sbjct 5 DERDTASGWAKAPLWADDPDRVAAIADATARDRRHYLTGGMSEIECRTCHAHVLVKKTSA 64
Query 93 GHTAVQWNTEASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYLEHHDPN 148
HT+VQWN +A RC++ E+RA GG++A +CPRL+ SI+H VAEG + P+
Sbjct 65 HHTSVQWNDDALARCSHIGEIRAAGGNAALLPTCPRLSASIDHGVAEGIIPSESPD 120
>gi|262200915|ref|YP_003272123.1| hypothetical protein Gbro_0918 [Gordonia bronchialis DSM 43247]
gi|262084262|gb|ACY20230.1| hypothetical protein Gbro_0918 [Gordonia bronchialis DSM 43247]
Length=135
Score = 107 bits (267), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 48/108 (45%), Positives = 68/108 (63%), Gaps = 0/108 (0%)
Query 41 WSKSRDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWN 100
W+K+ DF DDP R AAV +A+ +R+ YL G+ ++CR CH V VK+ P HT+VQWN
Sbjct 21 WAKAPDFGDDPGRLAAVHQATARDREHYLRGGMTEIECRSCHACVMVKKTSPFHTSVQWN 80
Query 101 TEASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYLEHHDPN 148
+A RC +R GG++A +CPRL+ SI+H V+EG + P
Sbjct 81 AQARARCDELERIRREGGNTAMMPTCPRLSASIDHGVSEGIIPSESPE 128
>gi|343925869|ref|ZP_08765384.1| hypothetical protein GOALK_050_01640 [Gordonia alkanivorans NBRC
16433]
gi|343764220|dbj|GAA12310.1| hypothetical protein GOALK_050_01640 [Gordonia alkanivorans NBRC
16433]
Length=131
Score = 94.7 bits (234), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 48/114 (43%), Positives = 67/114 (59%), Gaps = 7/114 (6%)
Query 41 WSKSRDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWN 100
W+K+ DF DDP RAA V A++ +R+ YL G++ ++CR CH V VK+ HT+VQWN
Sbjct 10 WAKAPDFGDDPDRAARVHAATQRDREHYLQGGMREIECRACHACVLVKKTSTHHTSVQWN 69
Query 101 TEASRRCAYFTELRARGGD-------SARTRSCPRLTDSIEHAVAEGYLEHHDP 147
+A RC ++RA G D A +C RL+ SI+H VAEG + P
Sbjct 70 ADARSRCHGLEQMRAGGDDGNGPLLPGAMMPTCVRLSASIDHGVAEGIIPPESP 123
>gi|312138144|ref|YP_004005480.1| hypothetical protein REQ_06780 [Rhodococcus equi 103S]
gi|325674951|ref|ZP_08154638.1| hypothetical protein HMPREF0724_12420 [Rhodococcus equi ATCC
33707]
gi|311887483|emb|CBH46795.1| hypothetical protein REQ_06780 [Rhodococcus equi 103S]
gi|325554537|gb|EGD24212.1| hypothetical protein HMPREF0724_12420 [Rhodococcus equi ATCC
33707]
Length=117
Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/104 (42%), Positives = 59/104 (57%), Gaps = 3/104 (2%)
Query 40 SWSKSRDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQW 99
+W+K+ +AD P R AVR + ++ YL GLQ V CR C V V++ HT++QW
Sbjct 6 TWAKAPSYADSPARVEAVRAQTAVDQHNYLEQGLQEVQCRACGTCVLVQKNSFKHTSIQW 65
Query 100 NTEASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYLE 143
+ C FT A GG + ++CPRL SI+HAV EG LE
Sbjct 66 QGNPAEVCPEFT---AAGGRAGSRQTCPRLLSSIQHAVMEGILE 106
>gi|111021509|ref|YP_704481.1| hypothetical protein RHA1_ro04537 [Rhodococcus jostii RHA1]
gi|110821039|gb|ABG96323.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=115
Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/106 (38%), Positives = 59/106 (56%), Gaps = 0/106 (0%)
Query 41 WSKSRDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWN 100
W+K+ DFA P R A+ + A++ YL G+ V+CR C V V++ HT+VQW
Sbjct 7 WAKAPDFAGQPARLEAIHAQTLADKANYLDDGMNEVECRTCGTCVLVRKNSLKHTSVQWT 66
Query 101 TEASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYLEHHD 146
+ ++ C F + G +A CPRL DSI HAV EG+++ D
Sbjct 67 DDPAKTCPTFRDAVGEGQSTALREGCPRLWDSINHAVMEGFIDVRD 112
>gi|226363862|ref|YP_002781644.1| hypothetical protein ROP_44520 [Rhodococcus opacus B4]
gi|226242351|dbj|BAH52699.1| hypothetical protein [Rhodococcus opacus B4]
Length=115
Score = 84.3 bits (207), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 40/106 (38%), Positives = 59/106 (56%), Gaps = 0/106 (0%)
Query 41 WSKSRDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWN 100
W+K+ DFA P R A+ + A++ YL G+ V+CR C V V++ HT+VQW
Sbjct 7 WAKAPDFAGQPARLEAIHAQTVADKATYLDDGMNEVECRTCGTCVLVRKNSLEHTSVQWT 66
Query 101 TEASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYLEHHD 146
+ ++ C F + G +A CPRL DSI HAV EG+++ D
Sbjct 67 DDPAKTCPTFRDAVGEGQSTALREGCPRLWDSINHAVMEGFIDVRD 112
>gi|229494182|ref|ZP_04387945.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229318544|gb|EEN84402.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=123
Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/108 (40%), Positives = 60/108 (56%), Gaps = 0/108 (0%)
Query 41 WSKSRDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWN 100
W+K+ DFA +R A+ + ++ YL SGLQ VDC+ C V V++ HT+VQW
Sbjct 14 WAKAPDFAGQTERLEAIAAQTIEDKHEYLDSGLQEVDCKACGACVLVRKNTYKHTSVQWT 73
Query 101 TEASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYLEHHDPN 148
++ + RC F E A G SA CP+L DSI AV +G + DP
Sbjct 74 SDPALRCHTFKEAIASGQSSALKDGCPKLHDSIADAVRDGTITVRDPE 121
>gi|54022440|ref|YP_116682.1| hypothetical protein nfa4760 [Nocardia farcinica IFM 10152]
gi|54013948|dbj|BAD55318.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=110
Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/104 (39%), Positives = 57/104 (55%), Gaps = 0/104 (0%)
Query 39 GSWSKSRDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQ 98
SW+K+ DFAD P R AAV+ + ++ RYL G+ P+ C+ CH V V++ T+V+
Sbjct 3 ASWAKAPDFADQPARRAAVQAQTALDKQRYLEDGMTPLHCQGCHSRVLVRKYSAHQTSVE 62
Query 99 WNTEASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYL 142
W + RC F L G R +CP L +I+ AV EG L
Sbjct 63 WTEPPATRCPVFARLAESGAAPGRPDTCPSLEKTIKWAVDEGVL 106
>gi|226304242|ref|YP_002764200.1| hypothetical protein RER_07530 [Rhodococcus erythropolis PR4]
gi|226183357|dbj|BAH31461.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=123
Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/107 (40%), Positives = 60/107 (57%), Gaps = 0/107 (0%)
Query 41 WSKSRDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWN 100
W+K+ DFA +R A+ + ++ YL SGLQ VDC+ C V V++ HT+VQW
Sbjct 14 WAKAPDFAGQTERLEAIAAQTIEDKHEYLDSGLQEVDCKACGACVLVRKNTYKHTSVQWT 73
Query 101 TEASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYLEHHDP 147
++ + RC F E A G +A CP+L DSI AV +G + DP
Sbjct 74 SDPALRCHNFKEAIASGQSTALKDGCPKLHDSIADAVRDGTITVRDP 120
>gi|319948617|ref|ZP_08022741.1| hypothetical protein ES5_04523 [Dietzia cinnamea P4]
gi|319437698|gb|EFV92694.1| hypothetical protein ES5_04523 [Dietzia cinnamea P4]
Length=131
Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/93 (39%), Positives = 53/93 (57%), Gaps = 0/93 (0%)
Query 50 DPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAY 109
DP RAA + + +R+ Y +G QPV+CR C V+V++ HT++QW + R+C
Sbjct 17 DPARAARLHSYTDLDRETYTRAGFQPVECRSCGTCVSVRKNSEKHTSIQWTEGSDRKCPV 76
Query 110 FTELRARGGDSARTRSCPRLTDSIEHAVAEGYL 142
T+ RA G +CPR+ SI +A AEG L
Sbjct 77 LTDWRAGGERPEGEDTCPRMLASIRYAYAEGLL 109
>gi|302525660|ref|ZP_07278002.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302434555|gb|EFL06371.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=88
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/78 (44%), Positives = 47/78 (61%), Gaps = 0/78 (0%)
Query 63 AERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSAR 122
AE +LTSGL+P +CR C V VK+ HT++QW T+A+ C F A GG +A
Sbjct 3 AEATEFLTSGLRPHECRSCGTCVLVKKNSLKHTSIQWTTDAATSCPVFAAHAASGGTTAL 62
Query 123 TRSCPRLTDSIEHAVAEG 140
+C +L DSI+ AV +G
Sbjct 63 LDTCEKLADSIDSAVRDG 80
>gi|324106152|gb|ADY18319.1| putative ferredoxin [Rhodococcus rhodochrous]
Length=125
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/107 (39%), Positives = 55/107 (52%), Gaps = 1/107 (0%)
Query 41 WSKSRDFADDPQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWN 100
W+KS DF+ P+RA AV + ++ YL SG V C C V V++ HT+VQW
Sbjct 10 WAKSPDFSQWPERAEAVAAQTAIDKINYLESGQHEVQCHECGTCVLVRKNSLSHTSVQWQ 69
Query 101 TEASRRCAYFTELRARGGDSARTR-SCPRLTDSIEHAVAEGYLEHHD 146
+ C F E + G S R CPRL +SI+ AV +G L D
Sbjct 70 DDPDTVCPTFRETASLRGSSITPREGCPRLRESIDRAVLDGTLPVPD 116
>gi|333918592|ref|YP_004492173.1| hypothetical protein AS9A_0921 [Amycolicicoccus subflavus DQS3-9A1]
gi|333480813|gb|AEF39373.1| hypothetical protein AS9A_0921 [Amycolicicoccus subflavus DQS3-9A1]
Length=123
Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 35/90 (39%), Positives = 48/90 (54%), Gaps = 0/90 (0%)
Query 54 AAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTEL 113
A A R + +R +LT GL P+ C C V VK+ P HT++QW T + C + E
Sbjct 27 APAGRPETLTDRAEFLTGGLVPLQCSSCGTGVLVKKHSPKHTSIQWTTSPAESCPIYAER 86
Query 114 RARGGDSARTRSCPRLTDSIEHAVAEGYLE 143
A GG +A +C RL SI AV +G L+
Sbjct 87 LAGGGRTALFDTCERLAASIAQAVTDGKLK 116
>gi|300784424|ref|YP_003764715.1| hypothetical protein AMED_2518 [Amycolatopsis mediterranei U32]
gi|299793938|gb|ADJ44313.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340525845|gb|AEK41050.1| hypothetical protein RAM_12800 [Amycolatopsis mediterranei S699]
Length=85
Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/86 (39%), Positives = 46/86 (54%), Gaps = 3/86 (3%)
Query 61 SRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDS 120
+ AE+ +L+ GL+P CR C V VK+ HT++QW + + C F + G S
Sbjct 2 TTAEQTEFLSGGLRPHQCRSCGTCVLVKKNSIQHTSIQWTSRPADTCPVFAD---TDGPS 58
Query 121 ARTRSCPRLTDSIEHAVAEGYLEHHD 146
A +CP+L DSI AV EG L D
Sbjct 59 ALLDTCPKLADSISDAVREGTLAVAD 84
>gi|325674651|ref|ZP_08154338.1| hypothetical protein HMPREF0724_12120 [Rhodococcus equi ATCC
33707]
gi|325554237|gb|EGD23912.1| hypothetical protein HMPREF0724_12120 [Rhodococcus equi ATCC
33707]
Length=90
Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/70 (45%), Positives = 40/70 (58%), Gaps = 0/70 (0%)
Query 73 LQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSARTRSCPRLTDS 132
+QPV+C C V V++ HT+ QW +A CA F + A G S R+C LT+S
Sbjct 1 MQPVECGACGNRVLVEKYSSTHTSTQWLEDAETACAEFRDAAANGTHSMYVRTCGALTES 60
Query 133 IEHAVAEGYL 142
IE AVAEG L
Sbjct 61 IEEAVAEGRL 70
>gi|312138357|ref|YP_004005693.1| hypothetical protein REQ_08970 [Rhodococcus equi 103S]
gi|311887696|emb|CBH47008.1| hypothetical protein REQ_08970 [Rhodococcus equi 103S]
Length=90
Score = 57.8 bits (138), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/70 (43%), Positives = 40/70 (58%), Gaps = 0/70 (0%)
Query 73 LQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSARTRSCPRLTDS 132
+QPV+C C V V++ HT+ QW +A CA F + A G S R+C LT+S
Sbjct 1 MQPVECGACGNRVLVEKYSSTHTSTQWLEDAETACAEFRDAAANGTHSMYVRTCGALTES 60
Query 133 IEHAVAEGYL 142
I+ AVAEG L
Sbjct 61 IDEAVAEGRL 70
>gi|145595198|ref|YP_001159495.1| hypothetical protein Strop_2673 [Salinispora tropica CNB-440]
gi|145304535|gb|ABP55117.1| hypothetical protein Strop_2673 [Salinispora tropica CNB-440]
Length=87
Score = 54.7 bits (130), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/82 (40%), Positives = 41/82 (50%), Gaps = 7/82 (8%)
Query 64 ERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSART 123
ER YL G + C C V V++ P T+VQW+T A R+C T G + T
Sbjct 9 ERQTYLIGGFAELSCARCAALVRVRKSSPQQTSVQWSTRAVRQCPVLT-----GNEQPHT 63
Query 124 --RSCPRLTDSIEHAVAEGYLE 143
+CP L SI+HAV G LE
Sbjct 64 PVPTCPDLRTSIQHAVRTGRLE 85
>gi|296141521|ref|YP_003648764.1| hypothetical protein Tpau_3850 [Tsukamurella paurometabola DSM
20162]
gi|296029655|gb|ADG80425.1| hypothetical protein Tpau_3850 [Tsukamurella paurometabola DSM
20162]
Length=116
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/88 (37%), Positives = 45/88 (52%), Gaps = 4/88 (4%)
Query 60 ASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGD 119
A+ A+R +LT G P++C C V V V++ T++QW A R CA F + G
Sbjct 16 ATVADRQGFLTDGFTPLNCGTCGVAVQVRKNSREQTSIQWCGSA-RDCAVFA---SDGAG 71
Query 120 SARTRSCPRLTDSIEHAVAEGYLEHHDP 147
SA +CPRL +SI A G + P
Sbjct 72 SALHATCPRLMESITRAAKLGAVPIPTP 99
>gi|159038445|ref|YP_001537698.1| hypothetical protein Sare_2881 [Salinispora arenicola CNS-205]
gi|157917280|gb|ABV98707.1| conserved hypothetical protein [Salinispora arenicola CNS-205]
Length=87
Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/80 (38%), Positives = 39/80 (49%), Gaps = 3/80 (3%)
Query 64 ERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSART 123
ER YLT G + C C V V++ P T+VQW T ++R+C T A
Sbjct 9 ERQEYLTGGFAELSCARCAALVRVRKSSPQQTSVQWTTRSARQCPTLT---ADARPHPLV 65
Query 124 RSCPRLTDSIEHAVAEGYLE 143
+CP L SI+ AV G LE
Sbjct 66 PTCPDLRTSIQWAVRTGRLE 85
>gi|311744216|ref|ZP_07718020.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
gi|311312389|gb|EFQ82302.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
Length=99
Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/85 (37%), Positives = 41/85 (49%), Gaps = 0/85 (0%)
Query 64 ERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSART 123
+R Y + L V C C V V++ HT++QW+ EA CA F L A+ G
Sbjct 14 DRALYTRARLAEVACLDCTARVLVRKNSEHHTSIQWSREAVAACATFDRLAAQAGGRPVH 73
Query 124 RSCPRLTDSIEHAVAEGYLEHHDPN 148
CPRL SIE A +G L P+
Sbjct 74 AECPRLAASIEAAARDGALPVGSPD 98
>gi|54024211|ref|YP_118453.1| hypothetical protein nfa22420 [Nocardia farcinica IFM 10152]
gi|54015719|dbj|BAD57089.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=85
Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/70 (39%), Positives = 35/70 (50%), Gaps = 0/70 (0%)
Query 73 LQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSARTRSCPRLTDS 132
+Q + C C V V++ P HT+VQW EA C F A G S +C L DS
Sbjct 1 MQEIRCATCANRVLVEKFSPSHTSVQWLDEAEAACPEFARRAALGESSKWIPTCGALRDS 60
Query 133 IEHAVAEGYL 142
I+ AV +G L
Sbjct 61 IDRAVLDGVL 70
>gi|254818664|ref|ZP_05223665.1| hypothetical protein MintA_02009 [Mycobacterium intracellulare
ATCC 13950]
Length=95
Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/83 (37%), Positives = 41/83 (50%), Gaps = 1/83 (1%)
Query 66 DRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTEL-RARGGDSARTR 124
+R + + + PV CR C V ++ T+VQWN +A++RC E R G
Sbjct 11 NRLVDTPMAPVTCRECGARVLARKSSWNQTSVQWNADAAKRCIERREAERLAGHHRGPFL 70
Query 125 SCPRLTDSIEHAVAEGYLEHHDP 147
C L DSI+ AVA G L DP
Sbjct 71 VCAALRDSIQAAVATGELAVVDP 93
>gi|342860667|ref|ZP_08717317.1| hypothetical protein MCOL_17388 [Mycobacterium colombiense CECT
3035]
gi|342131691|gb|EGT84948.1| hypothetical protein MCOL_17388 [Mycobacterium colombiense CECT
3035]
Length=94
Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/90 (37%), Positives = 43/90 (48%), Gaps = 1/90 (1%)
Query 59 EASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTEL-RARG 117
EAS E +R + + PV CR C V ++ T+VQW+ +S RC E R G
Sbjct 3 EASVREDNRLEAAPMVPVTCRECRARVLARKSSWNQTSVQWDAASSARCVERREAERLAG 62
Query 118 GDSARTRSCPRLTDSIEHAVAEGYLEHHDP 147
+C L DSI+ AVA G L DP
Sbjct 63 HHRGPFLACEALRDSIQVAVATGELAIVDP 92
>gi|343924797|ref|ZP_08764336.1| hypothetical protein GOALK_026_00680 [Gordonia alkanivorans NBRC
16433]
gi|343765304|dbj|GAA11262.1| hypothetical protein GOALK_026_00680 [Gordonia alkanivorans NBRC
16433]
Length=99
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/69 (34%), Positives = 34/69 (50%), Gaps = 1/69 (1%)
Query 73 LQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARG-GDSARTRSCPRLTD 131
+QP+ C+ C V+V + P HT++QW +A + C A G + C L
Sbjct 1 MQPLVCQTCDARVSVAKYSPAHTSIQWTADAKQSCHEMASASAVGNANCGYVLRCAALDR 60
Query 132 SIEHAVAEG 140
SI+HAV G
Sbjct 61 SIDHAVTRG 69
>gi|324106142|gb|ADY18311.1| putative ferredoxin [Rhodococcus rhodochrous]
Length=88
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/74 (34%), Positives = 37/74 (50%), Gaps = 0/74 (0%)
Query 73 LQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSARTRSCPRLTDS 132
++P+ C C V V++ HT++QWNT+A RRC + G A SC L ++
Sbjct 5 MEPLRCGECAAQVLVRKNSWEHTSIQWNTDARRRCRIEQDGGREGRPGAPRMSCSYLDET 64
Query 133 IEHAVAEGYLEHHD 146
I A G +E D
Sbjct 65 ISRAATSGVIEVFD 78
>gi|326331640|ref|ZP_08197928.1| hypothetical protein NBCG_03079 [Nocardioidaceae bacterium Broad-1]
gi|325950439|gb|EGD42491.1| hypothetical protein NBCG_03079 [Nocardioidaceae bacterium Broad-1]
Length=101
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/78 (39%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
Query 68 YLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSART--RS 125
Y + L V+C C V VK+ T+VQW EA +C +A D AR
Sbjct 18 YTQARLVEVECLDCLARVGVKKNSEHQTSVQWTAEAQAQCPDLVRRKA-AADGARLVHAR 76
Query 126 CPRLTDSIEHAVAEGYLE 143
CPRL SIE AVA+G ++
Sbjct 77 CPRLAASIEAAVADGRIQ 94
>gi|119716745|ref|YP_923710.1| hypothetical protein Noca_2519 [Nocardioides sp. JS614]
gi|119537406|gb|ABL82023.1| conserved hypothetical protein [Nocardioides sp. JS614]
Length=100
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/83 (33%), Positives = 41/83 (50%), Gaps = 0/83 (0%)
Query 60 ASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGD 119
A+ ++ Y + L V C C +V VK+ HT++QW+ +A C F + + G
Sbjct 10 ATLEDQHLYTRARLVEVACLDCLASVGVKKNSEHHTSIQWSDQALAHCQEFARMATQPGG 69
Query 120 SARTRSCPRLTDSIEHAVAEGYL 142
CPRL SI+ AV EG +
Sbjct 70 RPVHAGCPRLAASIDAAVREGRI 92
>gi|145225143|ref|YP_001135821.1| hypothetical protein Mflv_4565 [Mycobacterium gilvum PYR-GCK]
gi|315445510|ref|YP_004078389.1| hypothetical protein Mspyr1_39650 [Mycobacterium sp. Spyr1]
gi|145217629|gb|ABP47033.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315263813|gb|ADU00555.1| hypothetical protein Mspyr1_39650 [Mycobacterium sp. Spyr1]
Length=89
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/82 (35%), Positives = 38/82 (47%), Gaps = 1/82 (1%)
Query 66 DRYLTSG-LQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSARTR 124
D L G + PV C C TV V++ T+VQW+ A+ RC + R G+
Sbjct 6 DNRLADGPMAPVQCNRCAATVLVRKSSWHQTSVQWDATATARCEERHDARQLAGEGGLFL 65
Query 125 SCPRLTDSIEHAVAEGYLEHHD 146
+C L DS+ AV G L D
Sbjct 66 TCSALGDSVADAVVRGTLHTVD 87
>gi|226361623|ref|YP_002779401.1| hypothetical protein ROP_22090 [Rhodococcus opacus B4]
gi|226240108|dbj|BAH50456.1| hypothetical protein [Rhodococcus opacus B4]
Length=84
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/68 (37%), Positives = 34/68 (50%), Gaps = 0/68 (0%)
Query 73 LQPVDCRFCHVTVTVKRLGPGHTAVQWNTEASRRCAYFTELRARGGDSARTRSCPRLTDS 132
+ PV+CR C V V++ HT+VQW +A R C F G S +C L S
Sbjct 1 MLPVECRRCGNAVLVEKYSEAHTSVQWLDDAERTCPEFASRAEAGEHSMFVPTCGALRGS 60
Query 133 IEHAVAEG 140
I+ AV +G
Sbjct 61 IDDAVEDG 68
Lambda K H
0.319 0.132 0.421
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130023393564
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40