BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1615
Length=146
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608753|ref|NP_216131.1| hypothetical protein Rv1615 [Mycoba... 286 7e-76
gi|253799347|ref|YP_003032348.1| hypothetical protein TBMG_02379... 285 2e-75
gi|289569660|ref|ZP_06449887.1| hypothetical membrane protein [M... 258 2e-67
gi|308377777|ref|ZP_07668592.1| hypothetical membrane protein [M... 249 8e-65
gi|118617239|ref|YP_905571.1| hypothetical protein MUL_1595 [Myc... 153 8e-36
gi|183982431|ref|YP_001850722.1| hypothetical protein MMAR_2417 ... 153 9e-36
gi|118473125|ref|YP_887537.1| TM2 domain-containing protein [Myc... 150 5e-35
gi|240167991|ref|ZP_04746650.1| hypothetical protein MkanA1_0165... 147 6e-34
gi|145224179|ref|YP_001134857.1| TM2 domain-containing protein [... 134 3e-30
gi|108800012|ref|YP_640209.1| TM2 domain-containing protein [Myc... 114 4e-24
gi|120403804|ref|YP_953633.1| TM2 domain-containing protein [Myc... 114 6e-24
gi|226360150|ref|YP_002777928.1| hypothetical protein ROP_07360 ... 110 7e-23
gi|111018023|ref|YP_700995.1| hypothetical protein RHA1_ro01010 ... 106 9e-22
gi|312139918|ref|YP_004007254.1| integral membrane protein [Rhod... 99.8 1e-19
gi|325674201|ref|ZP_08153890.1| TM2 domain protein [Rhodococcus ... 99.4 2e-19
gi|158316667|ref|YP_001509175.1| TM2 domain-containing protein [... 97.8 4e-19
gi|288922300|ref|ZP_06416495.1| TM2 domain containing protein [F... 97.4 6e-19
gi|54023832|ref|YP_118074.1| hypothetical protein nfa18640 [Noca... 95.9 2e-18
gi|336176481|ref|YP_004581856.1| hypothetical protein FsymDg_037... 95.5 3e-18
gi|226306837|ref|YP_002766797.1| hypothetical protein RER_33500 ... 94.0 6e-18
gi|325002508|ref|ZP_08123620.1| hypothetical protein PseP1_27267... 93.2 1e-17
gi|183985259|ref|YP_001853550.1| hypothetical protein MMAR_5291 ... 91.7 3e-17
gi|333990769|ref|YP_004523383.1| hypothetical protein JDM601_212... 90.9 6e-17
gi|119716131|ref|YP_923096.1| TM2 domain-containing protein [Noc... 89.7 1e-16
gi|111223197|ref|YP_713991.1| hypothetical protein FRAAL3787 [Fr... 88.6 3e-16
gi|229492744|ref|ZP_04386545.1| TM2 domain protein [Rhodococcus ... 88.2 3e-16
gi|118619505|ref|YP_907837.1| hypothetical protein MUL_4364 [Myc... 88.2 3e-16
gi|226306836|ref|YP_002766796.1| hypothetical protein RER_33490 ... 87.4 7e-16
gi|262203537|ref|YP_003274745.1| TM2 domain-containing protein [... 86.3 1e-15
gi|169629726|ref|YP_001703375.1| hypothetical protein MAB_2641c ... 85.5 3e-15
gi|312195115|ref|YP_004015176.1| hypothetical protein FraEuI1c_1... 85.1 3e-15
gi|302866901|ref|YP_003835538.1| hypothetical protein Micau_2418... 79.3 2e-13
gi|342859989|ref|ZP_08716641.1| hypothetical protein MCOL_13955 ... 78.2 3e-13
gi|329938077|ref|ZP_08287528.1| hypothetical protein SGM_3020 [S... 78.2 4e-13
gi|256379437|ref|YP_003103097.1| hypothetical protein Amir_5432 ... 76.3 2e-12
gi|29829944|ref|NP_824578.1| hypothetical protein SAV_3401 [Stre... 75.1 3e-12
gi|159037536|ref|YP_001536789.1| TM2 domain-containing protein [... 74.7 4e-12
gi|343927691|ref|ZP_08767159.1| hypothetical protein GOALK_097_0... 74.3 6e-12
gi|290957993|ref|YP_003489175.1| hypothetical protein SCAB_35331... 73.6 1e-11
gi|297201905|ref|ZP_06919302.1| TM2 domain-containing protein [S... 72.8 2e-11
gi|330467054|ref|YP_004404797.1| hypothetical protein VAB18032_1... 72.0 3e-11
gi|145594465|ref|YP_001158762.1| TM2 domain-containing protein [... 71.2 4e-11
gi|226360772|ref|YP_002778550.1| hypothetical protein ROP_13580 ... 71.2 5e-11
gi|309811422|ref|ZP_07705209.1| TM2 domain protein [Dermacoccus ... 69.7 1e-10
gi|296140018|ref|YP_003647261.1| hypothetical protein Tpau_2315 ... 69.3 2e-10
gi|329938075|ref|ZP_08287526.1| hypothetical protein SGM_3018 [S... 68.9 2e-10
gi|290957991|ref|YP_003489173.1| hypothetical protein SCAB_35311... 68.9 2e-10
gi|254822440|ref|ZP_05227441.1| hypothetical protein MintA_21076... 65.9 2e-09
gi|29829942|ref|NP_824576.1| hypothetical protein SAV_3399 [Stre... 64.7 4e-09
gi|183602145|ref|ZP_02963513.1| hypothetical protein BIFLAC_0782... 63.5 9e-09
>gi|15608753|ref|NP_216131.1| hypothetical protein Rv1615 [Mycobacterium tuberculosis H37Rv]
gi|15841070|ref|NP_336107.1| hypothetical protein MT1651 [Mycobacterium tuberculosis CDC1551]
gi|31792801|ref|NP_855294.1| hypothetical protein Mb1641 [Mycobacterium bovis AF2122/97]
42 more sequence titles
Length=146
Score = 286 bits (732), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 146/146 (100%), Positives = 146/146 (100%), Gaps = 0/146 (0%)
Query 1 MGLRPARVVRPARSGMLKGVTDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPY 60
MGLRPARVVRPARSGMLKGVTDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPY
Sbjct 1 MGLRPARVVRPARSGMLKGVTDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPY 60
Query 61 GRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVI 120
GRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVI
Sbjct 61 GRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVI 120
Query 121 WGVIDALLILTDKVGDPWGRPLRDGS 146
WGVIDALLILTDKVGDPWGRPLRDGS
Sbjct 121 WGVIDALLILTDKVGDPWGRPLRDGS 146
>gi|253799347|ref|YP_003032348.1| hypothetical protein TBMG_02379 [Mycobacterium tuberculosis KZN
1435]
gi|289554611|ref|ZP_06443821.1| hypothetical membrane protein [Mycobacterium tuberculosis KZN
605]
gi|297634168|ref|ZP_06951948.1| hypothetical protein MtubK4_08602 [Mycobacterium tuberculosis
KZN 4207]
6 more sequence titles
Length=146
Score = 285 bits (728), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 145/146 (99%), Positives = 145/146 (99%), Gaps = 0/146 (0%)
Query 1 MGLRPARVVRPARSGMLKGVTDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPY 60
MGLRPARVVRPARSGMLKGVTDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPY
Sbjct 1 MGLRPARVVRPARSGMLKGVTDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPY 60
Query 61 GRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVI 120
GRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLG VI
Sbjct 61 GRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGTVI 120
Query 121 WGVIDALLILTDKVGDPWGRPLRDGS 146
WGVIDALLILTDKVGDPWGRPLRDGS
Sbjct 121 WGVIDALLILTDKVGDPWGRPLRDGS 146
>gi|289569660|ref|ZP_06449887.1| hypothetical membrane protein [Mycobacterium tuberculosis T17]
gi|308231887|ref|ZP_07414138.2| hypothetical membrane protein [Mycobacterium tuberculosis SUMu001]
gi|308369484|ref|ZP_07417949.2| hypothetical membrane protein [Mycobacterium tuberculosis SUMu002]
22 more sequence titles
Length=131
Score = 258 bits (658), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 131/131 (100%), Positives = 131/131 (100%), Gaps = 0/131 (0%)
Query 16 MLKGVTDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPYGRHPVTGQPFSDKSK 75
MLKGVTDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPYGRHPVTGQPFSDKSK
Sbjct 1 MLKGVTDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPYGRHPVTGQPFSDKSK 60
Query 76 TVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVG 135
TVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVG
Sbjct 61 TVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVG 120
Query 136 DPWGRPLRDGS 146
DPWGRPLRDGS
Sbjct 121 DPWGRPLRDGS 131
>gi|308377777|ref|ZP_07668592.1| hypothetical membrane protein [Mycobacterium tuberculosis SUMu009]
gi|308400906|ref|ZP_07493308.2| hypothetical membrane protein [Mycobacterium tuberculosis SUMu012]
gi|308354639|gb|EFP43490.1| hypothetical membrane protein [Mycobacterium tuberculosis SUMu009]
gi|308366204|gb|EFP55055.1| hypothetical membrane protein [Mycobacterium tuberculosis SUMu012]
Length=127
Score = 249 bits (636), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 126/127 (99%), Positives = 127/127 (100%), Gaps = 0/127 (0%)
Query 20 VTDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAG 79
+TDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAG
Sbjct 1 MTDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAG 60
Query 80 LLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVGDPWG 139
LLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVGDPWG
Sbjct 61 LLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVGDPWG 120
Query 140 RPLRDGS 146
RPLRDGS
Sbjct 121 RPLRDGS 127
>gi|118617239|ref|YP_905571.1| hypothetical protein MUL_1595 [Mycobacterium ulcerans Agy99]
gi|118569349|gb|ABL04100.1| conserved hypothetical membrane protein [Mycobacterium ulcerans
Agy99]
Length=130
Score = 153 bits (386), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 84/95 (89%), Positives = 93/95 (98%), Gaps = 0/95 (0%)
Query 52 SYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGW 111
SY+DP AP+GRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIY+GHTG+GIAQLLVGW
Sbjct 36 SYYDPAAPFGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYMGHTGMGIAQLLVGW 95
Query 112 VTCGLGAVIWGVIDALLILTDKVGDPWGRPLRDGS 146
+TCGLGA++WGVIDA+LILTDKV DPWGRPLRDG+
Sbjct 96 LTCGLGALVWGVIDAVLILTDKVSDPWGRPLRDGT 130
>gi|183982431|ref|YP_001850722.1| hypothetical protein MMAR_2417 [Mycobacterium marinum M]
gi|183175757|gb|ACC40867.1| conserved hypothetical membrane protein [Mycobacterium marinum
M]
Length=131
Score = 153 bits (386), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 84/95 (89%), Positives = 93/95 (98%), Gaps = 0/95 (0%)
Query 52 SYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGW 111
SY+DP AP+GRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIY+GHTG+GIAQLLVGW
Sbjct 37 SYYDPAAPFGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYMGHTGMGIAQLLVGW 96
Query 112 VTCGLGAVIWGVIDALLILTDKVGDPWGRPLRDGS 146
+TCGLGA++WGVIDA+LILTDKV DPWGRPLRDG+
Sbjct 97 LTCGLGALVWGVIDAVLILTDKVSDPWGRPLRDGT 131
>gi|118473125|ref|YP_887537.1| TM2 domain-containing protein [Mycobacterium smegmatis str. MC2
155]
gi|118174412|gb|ABK75308.1| TM2 domain protein [Mycobacterium smegmatis str. MC2 155]
Length=95
Score = 150 bits (380), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 74/94 (79%), Positives = 83/94 (89%), Gaps = 0/94 (0%)
Query 53 YFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWV 112
Y DP APYGRHPVTG+PFSDKSK VAGLLQL+GL G G GRIYLG TGLGIAQL+VG V
Sbjct 2 YGDPTAPYGRHPVTGEPFSDKSKIVAGLLQLIGLVGFVGFGRIYLGQTGLGIAQLIVGLV 61
Query 113 TCGLGAVIWGVIDALLILTDKVGDPWGRPLRDGS 146
TCG+GA IWG++DA+LILTDKV DP+GRPLRDG+
Sbjct 62 TCGIGAFIWGIVDAILILTDKVRDPYGRPLRDGT 95
>gi|240167991|ref|ZP_04746650.1| hypothetical protein MkanA1_01655 [Mycobacterium kansasii ATCC
12478]
Length=134
Score = 147 bits (370), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 85/100 (85%), Positives = 93/100 (93%), Gaps = 2/100 (2%)
Query 47 YPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQ 106
Y GP + DP AP+GRHPVTGQP SDKSKT+AGLLQLLGLFGIAGIGRIYLGHTG+GIAQ
Sbjct 37 YYGPAA--DPSAPFGRHPVTGQPLSDKSKTIAGLLQLLGLFGIAGIGRIYLGHTGMGIAQ 94
Query 107 LLVGWVTCGLGAVIWGVIDALLILTDKVGDPWGRPLRDGS 146
LLVGW+TCG+GAVIWGVIDA+LILTDKV DPWGRPLRDG+
Sbjct 95 LLVGWLTCGVGAVIWGVIDAVLILTDKVSDPWGRPLRDGT 134
>gi|145224179|ref|YP_001134857.1| TM2 domain-containing protein [Mycobacterium gilvum PYR-GCK]
gi|315444514|ref|YP_004077393.1| TM2 domain-containing protein [Mycobacterium sp. Spyr1]
gi|145216665|gb|ABP46069.1| TM2 domain containing protein+B7201 [Mycobacterium gilvum PYR-GCK]
gi|315262817|gb|ADT99558.1| TM2 domain-containing protein [Mycobacterium sp. Spyr1]
Length=161
Score = 134 bits (338), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 68/102 (67%), Positives = 82/102 (81%), Gaps = 7/102 (6%)
Query 52 SYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQ----L 107
SY DP APYGRHP+TG+P SDKSKTVAGLLQLLG+FG GIGR+Y+G G G+AQ L
Sbjct 60 SYVDPMAPYGRHPLTGEPLSDKSKTVAGLLQLLGIFGFLGIGRMYIGQVGFGVAQLVGCL 119
Query 108 LVGWVTCGLGA---VIWGVIDALLILTDKVGDPWGRPLRDGS 146
++G VTCG+G VIWG++DA+L+ TDKV DP GRPLRDG+
Sbjct 120 VLGAVTCGVGFIVPVIWGIVDAVLMFTDKVRDPQGRPLRDGT 161
>gi|108800012|ref|YP_640209.1| TM2 domain-containing protein [Mycobacterium sp. MCS]
gi|119869137|ref|YP_939089.1| TM2 domain-containing protein [Mycobacterium sp. KMS]
gi|126435641|ref|YP_001071332.1| TM2 domain-containing protein [Mycobacterium sp. JLS]
gi|108770431|gb|ABG09153.1| TM2 domain containing protein [Mycobacterium sp. MCS]
gi|119695226|gb|ABL92299.1| TM2 domain containing protein+B7201 [Mycobacterium sp. KMS]
gi|126235441|gb|ABN98841.1| TM2 domain containing protein+B7201 [Mycobacterium sp. JLS]
Length=127
Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 64/86 (75%), Positives = 76/86 (89%), Gaps = 0/86 (0%)
Query 61 GRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVI 120
GRHP+TG+P+SDKSK VAGLLQL+GL GI GIGR+YLG GLG+AQL+VG VTCG+GAVI
Sbjct 42 GRHPMTGEPYSDKSKVVAGLLQLVGLLGIVGIGRMYLGQVGLGVAQLVVGLVTCGIGAVI 101
Query 121 WGVIDALLILTDKVGDPWGRPLRDGS 146
WG++DA+LI TDKV DP G PLRDG+
Sbjct 102 WGIVDAVLIFTDKVRDPNGLPLRDGT 127
>gi|120403804|ref|YP_953633.1| TM2 domain-containing protein [Mycobacterium vanbaalenii PYR-1]
gi|119956622|gb|ABM13627.1| TM2 domain containing protein+B7201 [Mycobacterium vanbaalenii
PYR-1]
Length=143
Score = 114 bits (284), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 69/92 (75%), Positives = 83/92 (91%), Gaps = 0/92 (0%)
Query 55 DPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTC 114
DP AP+GRHPVTG+PFS+KSK V GLLQLLGLFG+ GIGRIYLG+T LG+ QL+VG +TC
Sbjct 52 DPAAPWGRHPVTGEPFSEKSKVVGGLLQLLGLFGLVGIGRIYLGYTSLGVIQLVVGLITC 111
Query 115 GLGAVIWGVIDALLILTDKVGDPWGRPLRDGS 146
G+GA+IWG++DA+LILTDKV DP GRPLRDG+
Sbjct 112 GIGAIIWGIVDAILILTDKVRDPEGRPLRDGT 143
>gi|226360150|ref|YP_002777928.1| hypothetical protein ROP_07360 [Rhodococcus opacus B4]
gi|226238635|dbj|BAH48983.1| hypothetical membrane protein [Rhodococcus opacus B4]
Length=279
Score = 110 bits (275), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 56/108 (52%), Positives = 74/108 (69%), Gaps = 9/108 (8%)
Query 37 PPGYPPPYPQYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIY 96
PPGY P Y +Y DP AP+GRHP+TG+P+SDKSK AGLLQ+ G G+GR Y
Sbjct 180 PPGYAPGY------NAYADPSAPFGRHPLTGEPYSDKSKLTAGLLQI--FLGAFGVGRFY 231
Query 97 LGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVGDPWGRPLRD 144
L G+ +AQ+ V W+TCG+G IW ++D +++LT V D +GRPLRD
Sbjct 232 LNQPGIAVAQIAVTWLTCGIGG-IWPLVDGIMMLTGSVKDQYGRPLRD 278
>gi|111018023|ref|YP_700995.1| hypothetical protein RHA1_ro01010 [Rhodococcus jostii RHA1]
gi|110817553|gb|ABG92837.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=280
Score = 106 bits (265), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 55/108 (51%), Positives = 74/108 (69%), Gaps = 8/108 (7%)
Query 37 PPGYPPPYPQYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIY 96
PPGY P G +Y DP AP+GRHP+TG+P+SDKSK AGLLQ+ G G+GR Y
Sbjct 180 PPGYAPA-----GYNAYADPSAPFGRHPLTGEPYSDKSKLTAGLLQI--FLGGFGVGRFY 232
Query 97 LGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVGDPWGRPLRD 144
L G+ +AQ+ V W+TCG+G IW ++D +++LT V D +GRPLR+
Sbjct 233 LNQPGIAVAQIAVTWLTCGIGG-IWPLVDGIMMLTGSVKDKYGRPLRE 279
>gi|312139918|ref|YP_004007254.1| integral membrane protein [Rhodococcus equi 103S]
gi|311889257|emb|CBH48571.1| putative integral membrane protein [Rhodococcus equi 103S]
Length=248
Score = 99.8 bits (247), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/120 (51%), Positives = 78/120 (65%), Gaps = 15/120 (12%)
Query 38 PGYPPPYPQYPGP--GSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQ-LLGLFGIAGIGR 94
PGYPP YPG G+Y DP AP+GR PVTG+P+SDKS+ AGLLQ LLG + GIGR
Sbjct 129 PGYPPAS-GYPGMYGGAYDDPSAPFGRDPVTGEPYSDKSRATAGLLQILLGFISLPGIGR 187
Query 95 IYLGHTGLGIAQLLVGW-----VTCGLGAV------IWGVIDALLILTDKVGDPWGRPLR 143
+Y+G G+G+ QLLV W + G+G V IW +D +L+L+ V DP GRPL+
Sbjct 188 LYIGSIGIGLTQLLVFWLGLITIIFGIGFVIFPVIWIWAFVDGILMLSGAVRDPQGRPLK 247
>gi|325674201|ref|ZP_08153890.1| TM2 domain protein [Rhodococcus equi ATCC 33707]
gi|325554881|gb|EGD24554.1| TM2 domain protein [Rhodococcus equi ATCC 33707]
Length=247
Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/120 (51%), Positives = 78/120 (65%), Gaps = 15/120 (12%)
Query 38 PGYPPPYPQYPGP--GSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQ-LLGLFGIAGIGR 94
PGYPP YPG G+Y DP AP+GR PVTG+P+SDKS+ AGLLQ LLG + GIGR
Sbjct 128 PGYPPAS-GYPGMYGGAYGDPSAPFGRDPVTGEPYSDKSRATAGLLQILLGFISLPGIGR 186
Query 95 IYLGHTGLGIAQLLVGW-----VTCGLGAV------IWGVIDALLILTDKVGDPWGRPLR 143
+Y+G G+G+ QLLV W + G+G V IW +D +L+L+ V DP GRPL+
Sbjct 187 LYIGSIGIGLTQLLVFWFGLITIIFGIGFVIFPVIWIWAFVDGILMLSGAVRDPQGRPLK 246
>gi|158316667|ref|YP_001509175.1| TM2 domain-containing protein [Frankia sp. EAN1pec]
gi|158112072|gb|ABW14269.1| TM2 domain containing protein [Frankia sp. EAN1pec]
Length=176
Score = 97.8 bits (242), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/91 (57%), Positives = 65/91 (72%), Gaps = 7/91 (7%)
Query 54 FDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVT 113
FD AP+GRHP+TG+P+SDK K VAGLLQ+ G G+GR YLG TG+ +AQLL T
Sbjct 93 FDQSAPFGRHPITGEPYSDKQKIVAGLLQIFA--GCFGVGRFYLGSTGIAVAQLL----T 146
Query 114 CGLGAVIWGVIDALLILTDKVGDPWGRPLRD 144
CG G +W ++D ++ILT V D +GRPLRD
Sbjct 147 CG-GLGVWALVDGVMILTGNVRDQYGRPLRD 176
>gi|288922300|ref|ZP_06416495.1| TM2 domain containing protein [Frankia sp. EUN1f]
gi|288346365|gb|EFC80699.1| TM2 domain containing protein [Frankia sp. EUN1f]
Length=156
Score = 97.4 bits (241), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 55/92 (60%), Positives = 64/92 (70%), Gaps = 7/92 (7%)
Query 53 YFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWV 112
+FDP APYGRHPVTG+P SDK K VAGLLQ+ G G GR Y+G G+ IAQLL
Sbjct 72 WFDPTAPYGRHPVTGEPLSDKQKLVAGLLQI--FLGGVGAGRWYMGSNGIAIAQLL---- 125
Query 113 TCGLGAVIWGVIDALLILTDKVGDPWGRPLRD 144
TCG G IW +ID ++ILT V D +GRPLRD
Sbjct 126 TCG-GLGIWALIDGIIILTSDVRDQYGRPLRD 156
>gi|54023832|ref|YP_118074.1| hypothetical protein nfa18640 [Nocardia farcinica IFM 10152]
gi|54015340|dbj|BAD56710.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=181
Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 52/90 (58%), Positives = 64/90 (72%), Gaps = 4/90 (4%)
Query 55 DPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTC 114
DP APYGR P G PFSDK K AGLLQ+ G G+GR Y G+TG+ IAQ+ V W+TC
Sbjct 96 DPEAPYGRDPF-GVPFSDKQKLTAGLLQIF--LGGFGVGRFYTGYTGIAIAQIAVTWLTC 152
Query 115 GLGAVIWGVIDALLILTDKVGDPWGRPLRD 144
G+GA IW ++D +++LT KV D GRPLRD
Sbjct 153 GIGA-IWPLVDGIMMLTGKVPDAQGRPLRD 181
>gi|336176481|ref|YP_004581856.1| hypothetical protein FsymDg_0373 [Frankia symbiont of Datisca
glomerata]
gi|334857461|gb|AEH07935.1| hypothetical protein FsymDg_0373 [Frankia symbiont of Datisca
glomerata]
Length=195
Score = 95.5 bits (236), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 70/139 (51%), Positives = 84/139 (61%), Gaps = 14/139 (10%)
Query 11 PARSGMLKGVTDPLQHGAFE-PGWQSAPPGYPPPYPQYPGPGSY----FDPFAPYGRHPV 65
P SG +G P G + PG+Q+ GYPP P YP P Y +DP AP+GRHPV
Sbjct 66 PQGSGYPQGSGYPQGSGYPQGPGYQAGA-GYPPA-PGYPIPQGYGQGPYDPAAPFGRHPV 123
Query 66 TGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVIWGVID 125
TG+P+SDK K VAGLLQ+ G G GR YLG TG+ +AQL TCG G IW +ID
Sbjct 124 TGEPYSDKQKLVAGLLQI--FLGGFGAGRWYLGDTGIALAQLF----TCG-GLGIWALID 176
Query 126 ALLILTDKVGDPWGRPLRD 144
++ILT V D GR LRD
Sbjct 177 GIMILTGNVRDSQGRLLRD 195
>gi|226306837|ref|YP_002766797.1| hypothetical protein RER_33500 [Rhodococcus erythropolis PR4]
gi|226185954|dbj|BAH34058.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=141
Score = 94.0 bits (232), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 63/141 (45%), Positives = 79/141 (57%), Gaps = 18/141 (12%)
Query 20 VTDPLQHGAFEPGWQSAPPGYPPPY-PQYPGPGS----YFDPFAPYGRHPVTGQPFSDKS 74
+T P G F P Q G P PY Q P P + DP+APYGRHPVTG+P+SDKS
Sbjct 1 MTGPYSQGGF-PEDQGPVYGAPNPYGYQQPLPAQSMYGFGDPYAPYGRHPVTGEPYSDKS 59
Query 75 KTVAGLLQ-LLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAV-----------IWG 122
K AGLLQ LLG FGI G+GR+Y+G G+G+ QLL + + A IW
Sbjct 60 KATAGLLQILLGFFGICGVGRLYIGSVGIGLCQLLGMFFALFMSAFLIGIPFAIGIWIWA 119
Query 123 VIDALLILTDKVGDPWGRPLR 143
+D ++IL+ D RPLR
Sbjct 120 FVDGIVILSGNARDGVRRPLR 140
>gi|325002508|ref|ZP_08123620.1| hypothetical protein PseP1_27267 [Pseudonocardia sp. P1]
Length=332
Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 54/90 (60%), Positives = 61/90 (68%), Gaps = 4/90 (4%)
Query 54 FDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVT 113
+DP AP+GR +G+P+SDK K VAGLLQL F GIGR Y GHTGL IAQLLV T
Sbjct 246 YDPAAPFGREAWSGRPYSDKQKIVAGLLQLFLPF---GIGRFYSGHTGLAIAQLLVMIFT 302
Query 114 CGLGAVIWGVIDALLILTDKVGDPWGRPLR 143
GLGA IW ID ++IL DP GRPLR
Sbjct 303 FGLGA-IWSFIDGIVILAGSPTDPDGRPLR 331
>gi|183985259|ref|YP_001853550.1| hypothetical protein MMAR_5291 [Mycobacterium marinum M]
gi|183178585|gb|ACC43695.1| conserved hypothetical membrane protein [Mycobacterium marinum
M]
Length=161
Score = 91.7 bits (226), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 58/129 (45%), Positives = 73/129 (57%), Gaps = 19/129 (14%)
Query 32 GWQSAPPGYPPPYP-----QYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGL 86
G+ + P G P PYP YP DP AP+GR P++G P SDKS AGLLQL
Sbjct 35 GYPAQPGGQPQPYPGNPSYPYPNFAQAPDPQAPFGRDPISGVPLSDKSAATAGLLQL--F 92
Query 87 FGIAGIGRIYLGHTGLGIAQLLVGW------VTCGLG------AVIWGVIDALLILTDKV 134
FG AGIGR Y+G + AQL +G V C +G AV+WG++DA++I T V
Sbjct 93 FGFAGIGRFYVGSNSVATAQLCLGLGGLAFTVFCLIGFPFLIAAVLWGIVDAIMIFTGNV 152
Query 135 GDPWGRPLR 143
D +GR LR
Sbjct 153 ADNYGRKLR 161
>gi|333990769|ref|YP_004523383.1| hypothetical protein JDM601_2129 [Mycobacterium sp. JDM601]
gi|333486737|gb|AEF36129.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=135
Score = 90.9 bits (224), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 62/99 (63%), Positives = 73/99 (74%), Gaps = 7/99 (7%)
Query 55 DPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQ----LLVG 110
D AP+GRHPVTGQP+SDKSK +A LLQLLGLFG G GRIYLG T LGIAQ LL
Sbjct 37 DALAPHGRHPVTGQPYSDKSKVIAALLQLLGLFGFLGFGRIYLGQTALGIAQFLGCLLFT 96
Query 111 WVTCGLGA---VIWGVIDALLILTDKVGDPWGRPLRDGS 146
T G+GA +IWG++DA+L+L +V D GRPLRDG+
Sbjct 97 AATWGIGAAVPIIWGIVDAILMLAGRVSDKQGRPLRDGT 135
>gi|119716131|ref|YP_923096.1| TM2 domain-containing protein [Nocardioides sp. JS614]
gi|119536792|gb|ABL81409.1| TM2 domain containing protein+B7201 [Nocardioides sp. JS614]
Length=155
Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/90 (59%), Positives = 63/90 (70%), Gaps = 5/90 (5%)
Query 55 DPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTC 114
P APYG HPVTG P+SDK+K VAGLLQ+L GI GR YLG T G+ QL+V +TC
Sbjct 68 SPAAPYGVHPVTGIPYSDKTKLVAGLLQILIPLGI---GRFYLGDTKTGVWQLVVTVLTC 124
Query 115 GLGAVIWGVIDALLIL-TDKVGDPWGRPLR 143
G+GA +W ID ++IL TD V D GRPLR
Sbjct 125 GIGA-LWPFIDGIIILATDNVTDAQGRPLR 153
>gi|111223197|ref|YP_713991.1| hypothetical protein FRAAL3787 [Frankia alni ACN14a]
gi|111150729|emb|CAJ62430.1| conserved hypothetical protein [Frankia alni ACN14a]
Length=216
Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/90 (55%), Positives = 62/90 (69%), Gaps = 7/90 (7%)
Query 55 DPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTC 114
DP AP+GRHP+TG+ +SDK K AGLLQ+ L G G GR YLG+TG+ +AQL TC
Sbjct 134 DPSAPFGRHPLTGEAYSDKQKLTAGLLQI--LLGGVGAGRWYLGNTGIALAQLF----TC 187
Query 115 GLGAVIWGVIDALLILTDKVGDPWGRPLRD 144
G G +W +ID +++LT V D GRPLRD
Sbjct 188 G-GLGVWALIDGVMMLTGNVRDRHGRPLRD 216
>gi|229492744|ref|ZP_04386545.1| TM2 domain protein [Rhodococcus erythropolis SK121]
gi|229320403|gb|EEN86223.1| TM2 domain protein [Rhodococcus erythropolis SK121]
Length=275
Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/120 (51%), Positives = 77/120 (65%), Gaps = 6/120 (5%)
Query 25 QHGAFEPGWQSAPPGYPPPYPQYPGPG-SYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQL 83
Q G +PG+ P GYP P YP PG DP APYGR P+TG+P+SDKSKT AGLL +
Sbjct 160 QPGYSQPGY-GQPQGYPQ-APGYPQPGYGLVDPSAPYGRDPMTGEPYSDKSKTTAGLLGI 217
Query 84 LGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVGDPWGRPLR 143
L G GR YL G+ +AQ+ V W+TCG+G IW +ID +++LT V D + RPLR
Sbjct 218 LLG--GFGAGRFYLNQPGMAVAQIAVTWLTCGIGG-IWPLIDGIMMLTGSVRDQYNRPLR 274
>gi|118619505|ref|YP_907837.1| hypothetical protein MUL_4364 [Mycobacterium ulcerans Agy99]
gi|118571615|gb|ABL06366.1| conserved hypothetical membrane protein [Mycobacterium ulcerans
Agy99]
Length=161
Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/129 (44%), Positives = 72/129 (56%), Gaps = 19/129 (14%)
Query 32 GWQSAPPGYPPPYP-----QYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGL 86
G+ + P G P PYP YP +P AP+GR P++G P SDKS AGLLQL
Sbjct 35 GYPAQPGGQPQPYPGNPSYPYPNFAQAPEPQAPFGRDPISGVPLSDKSAATAGLLQL--F 92
Query 87 FGIAGIGRIYLGHTGLGIAQLLVGW------VTCGLG------AVIWGVIDALLILTDKV 134
FG AGIGR Y+G + AQL +G V C G AV+WG++D+++I T V
Sbjct 93 FGFAGIGRFYVGSNSVATAQLCLGLGGLAFTVFCLNGFPFLIAAVLWGIVDSIMIFTGNV 152
Query 135 GDPWGRPLR 143
D +GR LR
Sbjct 153 ADSYGRKLR 161
>gi|226306836|ref|YP_002766796.1| hypothetical protein RER_33490 [Rhodococcus erythropolis PR4]
gi|226185953|dbj|BAH34057.1| hypothetical protein RER_33490 [Rhodococcus erythropolis PR4]
Length=295
Score = 87.4 bits (215), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 55/104 (53%), Positives = 69/104 (67%), Gaps = 7/104 (6%)
Query 44 YPQ---YPGPG-SYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGH 99
YPQ YP PG DP APYGR P+TG+P+SDKSKT AGLL +L G GR YL
Sbjct 194 YPQGQGYPQPGFGLVDPSAPYGRDPMTGEPYSDKSKTTAGLLGILLG--GFGAGRFYLNQ 251
Query 100 TGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVGDPWGRPLR 143
G+ +AQ+ V W+TCG+G IW +ID +++LT V D + RPLR
Sbjct 252 PGMAVAQIAVTWLTCGIGG-IWPLIDGIMMLTGSVRDQYNRPLR 294
>gi|262203537|ref|YP_003274745.1| TM2 domain-containing protein [Gordonia bronchialis DSM 43247]
gi|262086884|gb|ACY22852.1| TM2 domain containing protein [Gordonia bronchialis DSM 43247]
Length=160
Score = 86.3 bits (212), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/133 (44%), Positives = 73/133 (55%), Gaps = 18/133 (13%)
Query 23 PLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQ 82
P +G +P + AP PP Y YPG DP AP+GR P+TG+P SDKSK AGLLQ
Sbjct 35 PSVYGVPQPAYFVAP--VPPVYGLYPGAA---DPLAPFGRDPLTGEPLSDKSKVAAGLLQ 89
Query 83 LLGLFGIAGIGRIYLGHTGLGIAQL---LVGWVTCG--------LGAVIWGVIDALLILT 131
L G G+GR YLGH G+G AQL +VGW+ IW ++DA+++ T
Sbjct 90 L--FLGGFGVGRFYLGHGGVGAAQLCLTIVGWLLAIFFVGFILLFAVSIWALVDAVMMFT 147
Query 132 DKVGDPWGRPLRD 144
V D G LR
Sbjct 148 GSVRDSRGYKLRS 160
>gi|169629726|ref|YP_001703375.1| hypothetical protein MAB_2641c [Mycobacterium abscessus ATCC
19977]
gi|169241693|emb|CAM62721.1| Hypothetical protein MAB_2641c [Mycobacterium abscessus]
Length=118
Score = 85.5 bits (210), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 58/87 (67%), Positives = 66/87 (76%), Gaps = 0/87 (0%)
Query 58 APYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLG 117
APYGR P TG P+SDKSK VAGLLQLLGL G G GRIYLG T LG+ QLL+G+VT G+
Sbjct 32 APYGRDPATGAPYSDKSKLVAGLLQLLGLVGFLGFGRIYLGQTLLGVVQLLIGFVTFGIV 91
Query 118 AVIWGVIDALLILTDKVGDPWGRPLRD 144
AVIW ++DA+LI KV D GR LRD
Sbjct 92 AVIWAIVDAVLIFGGKVTDKVGRQLRD 118
>gi|312195115|ref|YP_004015176.1| hypothetical protein FraEuI1c_1234 [Frankia sp. EuI1c]
gi|311226451|gb|ADP79306.1| hypothetical protein FraEuI1c_1234 [Frankia sp. EuI1c]
Length=221
Score = 85.1 bits (209), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/110 (51%), Positives = 66/110 (60%), Gaps = 11/110 (10%)
Query 39 GYPPPYPQYPGPGS----YFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGR 94
GYP P Q PG G DP AP+GRH +TG+P SDK K AGLL++ G G GR
Sbjct 119 GYPQPGYQQPGYGVPAVWGADPAAPFGRHQLTGEPLSDKDKLTAGLLEI--FLGWCGAGR 176
Query 95 IYLGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVGDPWGRPLRD 144
YLG G+ AQLL TCG G IW +ID +++LT V D GRPLRD
Sbjct 177 WYLGDYGIATAQLL----TCG-GLHIWSLIDGIMMLTGSVRDKQGRPLRD 221
>gi|302866901|ref|YP_003835538.1| hypothetical protein Micau_2418 [Micromonospora aurantiaca ATCC
27029]
gi|315506757|ref|YP_004085644.1| hypothetical protein ML5_6049 [Micromonospora sp. L5]
gi|302569760|gb|ADL45962.1| hypothetical protein Micau_2418 [Micromonospora aurantiaca ATCC
27029]
gi|315413376|gb|ADU11493.1| hypothetical protein ML5_6049 [Micromonospora sp. L5]
Length=117
Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/87 (55%), Positives = 58/87 (67%), Gaps = 6/87 (6%)
Query 58 APYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLG 117
PY ++P Q SDKSK VAG+LQ+L G GIGR Y+G T G+ QL+V VTCG G
Sbjct 37 TPYQQYP---QGVSDKSKVVAGVLQIL--LGGFGIGRFYMGDTKTGVIQLIVTLVTCGFG 91
Query 118 AVIWGVIDALLILTDKVGDPWGRPLRD 144
A IWG++D +LIL + D GRPLRD
Sbjct 92 A-IWGLVDGILILVNGGVDGQGRPLRD 117
>gi|342859989|ref|ZP_08716641.1| hypothetical protein MCOL_13955 [Mycobacterium colombiense CECT
3035]
gi|342132367|gb|EGT85596.1| hypothetical protein MCOL_13955 [Mycobacterium colombiense CECT
3035]
Length=122
Score = 78.2 bits (191), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/126 (48%), Positives = 71/126 (57%), Gaps = 25/126 (19%)
Query 38 PGYPPP--------YPQYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGI 89
PGYPPP YP YPG Y DP APYGR PVTG P SDKS T AGLLQL +
Sbjct 2 PGYPPPQAYPGNVGYPGYPG---YVDPHAPYGRDPVTGLPLSDKSATTAGLLQLFLG--L 56
Query 90 AGIGRIYLGHTGLGIAQLLVGW------VTCG------LGAVIWGVIDALLILTDKVGDP 137
GIGR Y+ T + +AQL +G + C LG ++W ++DA+LI T V D
Sbjct 57 FGIGRFYIDSTQIAVAQLCLGLFGVLFSLFCFLGFPVLLGGIVWAIVDAILIFTGSVKDK 116
Query 138 WGRPLR 143
GR LR
Sbjct 117 DGRKLR 122
>gi|329938077|ref|ZP_08287528.1| hypothetical protein SGM_3020 [Streptomyces griseoaurantiacus
M045]
gi|329302566|gb|EGG46456.1| hypothetical protein SGM_3020 [Streptomyces griseoaurantiacus
M045]
Length=155
Score = 78.2 bits (191), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 50/90 (56%), Positives = 59/90 (66%), Gaps = 9/90 (10%)
Query 55 DPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTC 114
DP APYG P G+P+SDKSK VAG+L + G GIGR YLGH GLGIAQLL TC
Sbjct 73 DPNAPYGYDPY-GRPYSDKSKIVAGVLSIF--LGYLGIGRFYLGHVGLGIAQLL----TC 125
Query 115 GLGAVIWGVIDALLILT-DKVGDPWGRPLR 143
G G IW +ID +++LT + D GR LR
Sbjct 126 G-GLGIWSLIDGIVLLTSNNTTDSNGRILR 154
>gi|256379437|ref|YP_003103097.1| hypothetical protein Amir_5432 [Actinosynnema mirum DSM 43827]
gi|255923740|gb|ACU39251.1| protein of unknown function DUF1707 [Actinosynnema mirum DSM
43827]
Length=194
Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/88 (53%), Positives = 56/88 (64%), Gaps = 7/88 (7%)
Query 57 FAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGL 116
+APYG PV+G P SDKSK VAG+LQ+ L G G+GR Y G G+ IAQLL TCG
Sbjct 114 YAPYGVDPVSGAPLSDKSKMVAGVLQV--LLGYFGVGRFYTGDYGIAIAQLL----TCG- 166
Query 117 GAVIWGVIDALLILTDKVGDPWGRPLRD 144
GA +W ID +++L D GR LRD
Sbjct 167 GAGVWSFIDGIILLIGGGTDGNGRKLRD 194
>gi|29829944|ref|NP_824578.1| hypothetical protein SAV_3401 [Streptomyces avermitilis MA-4680]
gi|29607054|dbj|BAC71113.1| hypothetical protein [Streptomyces avermitilis MA-4680]
Length=166
Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/96 (53%), Positives = 62/96 (65%), Gaps = 10/96 (10%)
Query 50 PGSYF-DPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLL 108
PG+Y DP APYG P G+P+SDKSK VAGLLQL G G+GR Y+G G+GIAQL
Sbjct 78 PGTYTGDPNAPYGYDPY-GRPYSDKSKIVAGLLQL--FLGSFGVGRFYMGSVGIGIAQLF 134
Query 109 VGWVTCGLGAVIWGVIDALLILT-DKVGDPWGRPLR 143
TCG G IW +ID +++L ++ D GR LR
Sbjct 135 ----TCG-GLGIWALIDGVMLLAGNEAKDQHGRILR 165
>gi|159037536|ref|YP_001536789.1| TM2 domain-containing protein [Salinispora arenicola CNS-205]
gi|157916371|gb|ABV97798.1| TM2 domain containing protein [Salinispora arenicola CNS-205]
Length=131
Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 53/115 (47%), Positives = 65/115 (57%), Gaps = 12/115 (10%)
Query 36 APPGYPPPYPQYP--GPGSYFDPF--APYGR--HPVTGQPFSDKSKTVAGLLQLLGLFGI 89
PP P +PQ G GS P + HP+ SDKSK VAG+LQ+L G
Sbjct 23 CPPSRPTYWPQVELAGCGSLLCRLMTTPLNQPGHPMG---VSDKSKVVAGVLQIL--LGA 77
Query 90 AGIGRIYLGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLILTDKVGDPWGRPLRD 144
G+GR Y+G +G+ QL+V VTCG+G IWGVID +LIL D GRPLRD
Sbjct 78 FGVGRFYMGDIKIGVIQLIVTLVTCGVGG-IWGVIDGILILVKGGVDGQGRPLRD 131
>gi|343927691|ref|ZP_08767159.1| hypothetical protein GOALK_097_01130 [Gordonia alkanivorans NBRC
16433]
gi|343762332|dbj|GAA14085.1| hypothetical protein GOALK_097_01130 [Gordonia alkanivorans NBRC
16433]
Length=205
Score = 74.3 bits (181), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 43/101 (43%), Positives = 58/101 (58%), Gaps = 13/101 (12%)
Query 55 DPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQL---LVGW 111
DP APYGR P TG+P SDKSK VAGLLQ+ G G+GR Y+G +G QL ++G+
Sbjct 107 DPSAPYGRDPATGEPLSDKSKLVAGLLQI--FLGTLGVGRFYIGDNTIGGIQLGLTIIGY 164
Query 112 VTC--------GLGAVIWGVIDALLILTDKVGDPWGRPLRD 144
+T G IW ++D +++LT V D G L +
Sbjct 165 ITAIFLIGFVLIFGVAIWALVDGIMMLTGSVRDKNGLKLGN 205
>gi|290957993|ref|YP_003489175.1| hypothetical protein SCAB_35331 [Streptomyces scabiei 87.22]
gi|260647519|emb|CBG70624.1| putative membrane protein [Streptomyces scabiei 87.22]
Length=168
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/100 (49%), Positives = 60/100 (60%), Gaps = 9/100 (9%)
Query 45 PQYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGI 104
P PG DP APYG P G+P+SDKSK VAG+L L G G+GR Y+GH GLG+
Sbjct 76 PPTPGGAYTGDPNAPYGYDPY-GRPYSDKSKIVAGILSLF--LGSFGVGRFYIGHVGLGL 132
Query 105 AQLLVGWVTCGLGAVIWGVIDALLILT-DKVGDPWGRPLR 143
AQL TCG G IW ++D +++LT D GR LR
Sbjct 133 AQLF----TCG-GLGIWALVDGIILLTGSNTTDSNGRVLR 167
>gi|297201905|ref|ZP_06919302.1| TM2 domain-containing protein [Streptomyces sviceus ATCC 29083]
gi|197712722|gb|EDY56756.1| TM2 domain-containing protein [Streptomyces sviceus ATCC 29083]
Length=154
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/90 (52%), Positives = 59/90 (66%), Gaps = 9/90 (10%)
Query 55 DPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTC 114
DP APYG P G+P+SDKSK +AG+LQL G G+GR YLG+ G+G+AQL TC
Sbjct 72 DPNAPYGYDPY-GRPYSDKSKVIAGILQL--TLGGFGVGRFYLGNVGMGLAQLF----TC 124
Query 115 GLGAVIWGVIDALLILT-DKVGDPWGRPLR 143
G G +W +ID +L+LT + D GR LR
Sbjct 125 G-GLGVWSLIDGILLLTGNDHTDEHGRILR 153
>gi|330467054|ref|YP_004404797.1| hypothetical protein VAB18032_15425 [Verrucosispora maris AB-18-032]
gi|328810025|gb|AEB44197.1| hypothetical protein VAB18032_15425 [Verrucosispora maris AB-18-032]
Length=84
Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/77 (52%), Positives = 53/77 (69%), Gaps = 3/77 (3%)
Query 68 QPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVIWGVIDAL 127
Q SDKSK VAG+LQ+L G G GR Y+G T + Q++V +VTCG+G+ +WG+ID +
Sbjct 11 QGVSDKSKVVAGILQIL--LGGFGAGRFYIGDTKTAVIQIIVTFVTCGVGS-LWGLIDGI 67
Query 128 LILTDKVGDPWGRPLRD 144
LIL + D GRPLRD
Sbjct 68 LILVNGGTDAQGRPLRD 84
>gi|145594465|ref|YP_001158762.1| TM2 domain-containing protein [Salinispora tropica CNB-440]
gi|145303802|gb|ABP54384.1| TM2 domain containing protein+B7201 [Salinispora tropica CNB-440]
Length=137
Score = 71.2 bits (173), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 40/73 (55%), Positives = 51/73 (70%), Gaps = 3/73 (4%)
Query 71 SDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAVIWGVIDALLIL 130
SD+SK VAG+LQ+L G GIGR Y+G T G+ QL+V +TCG GA +WG+ID +LIL
Sbjct 67 SDRSKVVAGVLQIL--LGGFGIGRFYMGDTRTGVIQLIVSLITCGFGA-LWGLIDGILIL 123
Query 131 TDKVGDPWGRPLR 143
+ D GRPLR
Sbjct 124 VNGGVDGQGRPLR 136
>gi|226360772|ref|YP_002778550.1| hypothetical protein ROP_13580 [Rhodococcus opacus B4]
gi|226239257|dbj|BAH49605.1| hypothetical membrane protein [Rhodococcus opacus B4]
Length=123
Score = 71.2 bits (173), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 46/93 (50%), Positives = 54/93 (59%), Gaps = 9/93 (9%)
Query 58 APYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWV--TCG 115
A YG HPVTG P+S K+K AGLL + FG G GR Y GHTGLG A L+V V
Sbjct 33 AAYGYHPVTGLPYSHKTKLAAGLLSI--FFGYFGAGRFYAGHTGLGFAMLIVNIVLTVVS 90
Query 116 LG-----AVIWGVIDALLILTDKVGDPWGRPLR 143
LG A IW VID +++L + D G PLR
Sbjct 91 LGMWLFVAWIWPVIDGIMLLAGEPKDKSGLPLR 123
>gi|309811422|ref|ZP_07705209.1| TM2 domain protein [Dermacoccus sp. Ellin185]
gi|308434729|gb|EFP58574.1| TM2 domain protein [Dermacoccus sp. Ellin185]
Length=251
Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 42/91 (47%), Positives = 52/91 (58%), Gaps = 13/91 (14%)
Query 64 PVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCG-------- 115
P+TG+P SDKSK AGLLQL FG G+GR Y+G T G QL +G +
Sbjct 163 PMTGEPLSDKSKIAAGLLQLF--FGYFGVGRFYIGDTRTGGIQLALGLIGLFGTLILVGF 220
Query 116 ---LGAVIWGVIDALLILTDKVGDPWGRPLR 143
+G IW IDA+L+LT V DP+GR L
Sbjct 221 PILVGVSIWAFIDAILMLTGSVRDPYGRKLN 251
>gi|296140018|ref|YP_003647261.1| hypothetical protein Tpau_2315 [Tsukamurella paurometabola DSM
20162]
gi|296028152|gb|ADG78922.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=146
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/153 (39%), Positives = 74/153 (49%), Gaps = 35/153 (22%)
Query 20 VTDPLQHGAFEPGWQ--SAPPGYPPPYPQYPGPGSYFDP------FAPYGRHPVTGQPFS 71
+TDP H +PG + P Y P Y PGP Y P F YG+ ++ P S
Sbjct 1 MTDPYFH---QPGDRLPQLPQAYQPGYGPVPGPQGYLQPHPGLPPFPGYGQ--LSAVPVS 55
Query 72 DKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLL----------VGWVTCGLGAVI- 120
DKSK AGLLQL G G GR Y+G + + QLL VG+VT G+G+V+
Sbjct 56 DKSKVAAGLLQLF--LGGYGAGRFYIGSVAIAVVQLLCNLAGWFFFLVGFVTFGIGSVVA 113
Query 121 ---------WGVIDALLILTDKVGDPWGRPLRD 144
W IDA+L+ T V D GRPLR
Sbjct 114 FFIWFVLGLWTFIDAILMFTGSVKDGQGRPLRS 146
>gi|329938075|ref|ZP_08287526.1| hypothetical protein SGM_3018 [Streptomyces griseoaurantiacus
M045]
gi|329302564|gb|EGG46454.1| hypothetical protein SGM_3018 [Streptomyces griseoaurantiacus
M045]
Length=87
Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/89 (52%), Positives = 56/89 (63%), Gaps = 9/89 (10%)
Query 56 PFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCG 115
P APYG P G+P+SDKSK VAG+LQL G GIGR Y+G G+G+AQLL TCG
Sbjct 6 PDAPYGFDP-QGRPYSDKSKIVAGILQLF--LGTLGIGRFYVGSVGVGVAQLL----TCG 58
Query 116 LGAVIWGVIDALLILT-DKVGDPWGRPLR 143
G W +ID ++ LT + D GR LR
Sbjct 59 -GLGFWALIDGIIFLTSNDRTDSQGRVLR 86
>gi|290957991|ref|YP_003489173.1| hypothetical protein SCAB_35311 [Streptomyces scabiei 87.22]
gi|260647517|emb|CBG70622.1| putative membrane protein [Streptomyces scabiei 87.22]
Length=87
Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/89 (52%), Positives = 56/89 (63%), Gaps = 9/89 (10%)
Query 56 PFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCG 115
P AP+G P G+P+SDKSK VAG+LQL G GIGR Y+G G+G+AQLL TCG
Sbjct 6 PDAPFGHDP-QGRPYSDKSKIVAGVLQL--FLGTLGIGRFYVGSVGVGVAQLL----TCG 58
Query 116 LGAVIWGVIDALLILT-DKVGDPWGRPLR 143
G W +ID +L LT + D GR LR
Sbjct 59 -GLGFWSLIDGILFLTSNDRTDAQGRVLR 86
>gi|254822440|ref|ZP_05227441.1| hypothetical protein MintA_21076 [Mycobacterium intracellulare
ATCC 13950]
Length=161
Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/101 (48%), Positives = 59/101 (59%), Gaps = 14/101 (13%)
Query 55 DPFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGW--- 111
DP AP+GR PVTG P SDKS T AGLLQL + G+GR Y+ T + +AQL VG
Sbjct 63 DPQAPFGRDPVTGLPLSDKSATTAGLLQLFLG--LFGVGRFYIDSTQIAVAQLCVGLFGI 120
Query 112 ---VTCG------LGAVIWGVIDALLILTDKVGDPWGRPLR 143
+ C LG +IW ++DA+LI T V D GR LR
Sbjct 121 VFSLFCFLGFPVLLGGIIWAIVDAILIFTGSVKDNHGRKLR 161
>gi|29829942|ref|NP_824576.1| hypothetical protein SAV_3399 [Streptomyces avermitilis MA-4680]
gi|29607052|dbj|BAC71111.1| hypothetical protein [Streptomyces avermitilis MA-4680]
Length=87
Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 43/89 (49%), Positives = 54/89 (61%), Gaps = 9/89 (10%)
Query 56 PFAPYGRHPVTGQPFSDKSKTVAGLLQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCG 115
P AP+G P G+P+SDKSK VAG+LQL G G GR Y+G TG+ + QL TCG
Sbjct 6 PEAPFGVDP-QGRPYSDKSKVVAGVLQL--FLGTFGAGRFYVGSTGVALGQLF----TCG 58
Query 116 LGAVIWGVIDALLILT-DKVGDPWGRPLR 143
G +W +IDA+L T + D GR LR
Sbjct 59 -GLGLWALIDAILFFTSNDRTDKQGRVLR 86
>gi|183602145|ref|ZP_02963513.1| hypothetical protein BIFLAC_07822 [Bifidobacterium animalis subsp.
lactis HN019]
gi|219683055|ref|YP_002469438.1| TM2 domain-containing protein [Bifidobacterium animalis subsp.
lactis AD011]
gi|241191377|ref|YP_002968771.1| hypothetical protein Balac_1360 [Bifidobacterium animalis subsp.
lactis Bl-04]
9 more sequence titles
Length=154
Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 44/126 (35%), Positives = 64/126 (51%), Gaps = 9/126 (7%)
Query 21 TDPLQHGAFEPGWQSAPPGYPPPYPQYPGPGSYFDPFAPYGRHPVTGQPFSDKSKTVAGL 80
T P Q+ A P + + GY QY P +Y Y + P + ++K VA
Sbjct 29 TQPAQNYA-APQYTQSQGGYNG---QYQDPNAYAQQNYTYAQPPAAYPTYLPRNKYVAAA 84
Query 81 LQLLGLFGIAGIGRIYLGHTGLGIAQLLVGWVTCGLGAV---IWGVIDALLILTDKVGDP 137
L L FGI G+ YLG+TG G+AQ+L+ +T G+GA+ IW +++A+ ILT G
Sbjct 85 LALF--FGIFGLHNFYLGYTGKGVAQVLISVLTFGIGAIAVMIWAIVEAIQILTADYGTE 142
Query 138 WGRPLR 143
R R
Sbjct 143 HHRDAR 148
Lambda K H
0.322 0.147 0.493
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 127605758688
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40