BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1171
Length=146
Score E
Sequences producing significant alignments: (Bits) Value
gi|15840614|ref|NP_335651.1| hypothetical protein MT1208 [Mycoba... 275 2e-72
gi|31792365|ref|NP_854858.1| hypothetical protein Mb1204 [Mycoba... 274 4e-72
gi|308370644|ref|ZP_07422212.2| hypothetical protein TMCG_00800 ... 222 2e-56
gi|339294162|gb|AEJ46273.1| hypothetical protein CCDC5079_1083 [... 202 1e-50
gi|296170060|ref|ZP_06851663.1| conserved hypothetical protein [... 170 8e-41
gi|183984254|ref|YP_001852545.1| hypothetical protein MMAR_4281 ... 169 2e-40
gi|118616765|ref|YP_905097.1| hypothetical protein MUL_1018 [Myc... 166 1e-39
gi|254818774|ref|ZP_05223775.1| hypothetical protein MintA_02559... 158 3e-37
gi|41408710|ref|NP_961546.1| hypothetical protein MAP2612c [Myco... 156 8e-37
gi|118463602|ref|YP_880556.1| hypothetical protein MAV_1311 [Myc... 155 2e-36
gi|342862061|ref|ZP_08718704.1| hypothetical protein MCOL_24341 ... 151 2e-35
gi|240171693|ref|ZP_04750352.1| hypothetical protein MkanA1_2043... 150 5e-35
gi|333989800|ref|YP_004522414.1| hypothetical protein JDM601_116... 145 2e-33
gi|108800997|ref|YP_641194.1| hypothetical protein Mmcs_4033 [My... 132 1e-29
gi|118472485|ref|YP_889374.1| hypothetical protein MSMEG_5128 [M... 125 3e-27
gi|15827786|ref|NP_302049.1| hypothetical protein ML1494 [Mycoba... 124 3e-27
gi|145222754|ref|YP_001133432.1| hypothetical protein Mflv_2166 ... 119 1e-25
gi|254774189|ref|ZP_05215705.1| hypothetical protein MaviaA2_058... 118 2e-25
gi|315443222|ref|YP_004076101.1| hypothetical protein Mspyr1_159... 96.3 1e-18
gi|120405486|ref|YP_955315.1| hypothetical protein Mvan_4534 [My... 93.2 1e-17
gi|325676454|ref|ZP_08156132.1| hypothetical protein HMPREF0724_... 79.0 2e-13
gi|312138844|ref|YP_004006180.1| integral membrane protein [Rhod... 78.6 3e-13
gi|169628410|ref|YP_001702059.1| hypothetical protein MAB_1317 [... 70.9 7e-11
gi|333921030|ref|YP_004494611.1| hypothetical protein AS9A_3370 ... 66.2 2e-09
gi|134097605|ref|YP_001103266.1| hypothetical protein SACE_1009 ... 62.4 2e-08
gi|98856|pir||PH0210 hypothetical protein 133 (fdxA 5' region) -... 62.4 2e-08
gi|229493872|ref|ZP_04387644.1| conserved hypothetical protein [... 58.5 3e-07
gi|226307675|ref|YP_002767635.1| hypothetical protein RER_41880 ... 57.8 5e-07
gi|226365407|ref|YP_002783190.1| hypothetical protein ROP_59980 ... 53.5 1e-05
gi|257057079|ref|YP_003134911.1| hypothetical protein Svir_31090... 52.8 2e-05
gi|111022900|ref|YP_705872.1| hypothetical protein RHA1_ro05937 ... 52.8 2e-05
gi|54026729|ref|YP_120971.1| hypothetical protein nfa47550 [Noca... 51.2 4e-05
gi|331694871|ref|YP_004331110.1| hypothetical protein Psed_1006 ... 51.2 5e-05
gi|296138874|ref|YP_003646117.1| hypothetical protein Tpau_1147 ... 50.4 7e-05
gi|319950846|ref|ZP_08024730.1| hypothetical protein ES5_14603 [... 48.9 2e-04
gi|343926942|ref|ZP_08766435.1| hypothetical protein GOALK_072_0... 45.4 0.003
gi|337290442|ref|YP_004629463.1| hypothetical protein CULC22_008... 43.5 0.010
gi|334696555|gb|AEG81352.1| putative membrane protein [Corynebac... 43.5 0.011
gi|317506363|ref|ZP_07964174.1| hypothetical protein HMPREF9336_... 43.1 0.013
gi|300858188|ref|YP_003783171.1| hypothetical protein cpfrc_0077... 42.7 0.018
gi|326381444|ref|ZP_08203138.1| hypothetical protein SCNU_00795 ... 42.0 0.032
gi|182436197|ref|YP_001823916.1| putative integral membrane prot... 37.7 0.56
gi|326776823|ref|ZP_08236088.1| putative integral membrane prote... 37.7 0.57
gi|262203217|ref|YP_003274425.1| facilitated glucose transporter... 37.4 0.79
gi|305681878|ref|ZP_07404682.1| conserved hypothetical protein [... 35.4 3.1
gi|225022908|ref|ZP_03712100.1| hypothetical protein CORMATOL_02... 35.0 3.2
gi|310765704|gb|ADP10654.1| putative membrane protein [Erwinia s... 35.0 4.0
gi|38233567|ref|NP_939334.1| hypothetical protein DIP0972 [Coryn... 34.7 5.0
gi|159039672|ref|YP_001538925.1| hypothetical protein Sare_4144 ... 33.9 7.5
>gi|15840614|ref|NP_335651.1| hypothetical protein MT1208 [Mycobacterium tuberculosis CDC1551]
gi|13880796|gb|AAK45465.1| hypothetical protein MT1208 [Mycobacterium tuberculosis CDC1551]
Length=155
Score = 275 bits (702), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 146/146 (100%), Positives = 146/146 (100%), Gaps = 0/146 (0%)
Query 1 VGHRVDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPIS 60
VGHRVDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPIS
Sbjct 10 VGHRVDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPIS 69
Query 61 ALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGA 120
ALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGA
Sbjct 70 ALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGA 129
Query 121 LVFVVAGAVPPAWVLWRRRVQADGSG 146
LVFVVAGAVPPAWVLWRRRVQADGSG
Sbjct 130 LVFVVAGAVPPAWVLWRRRVQADGSG 155
>gi|31792365|ref|NP_854858.1| hypothetical protein Mb1204 [Mycobacterium bovis AF2122/97]
gi|57116838|ref|NP_215687.2| hypothetical protein Rv1171 [Mycobacterium tuberculosis H37Rv]
gi|121637103|ref|YP_977326.1| hypothetical protein BCG_1234 [Mycobacterium bovis BCG str. Pasteur
1173P2]
66 more sequence titles
Length=146
Score = 274 bits (700), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 145/146 (99%), Positives = 146/146 (100%), Gaps = 0/146 (0%)
Query 1 VGHRVDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPIS 60
+GHRVDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPIS
Sbjct 1 MGHRVDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPIS 60
Query 61 ALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGA 120
ALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGA
Sbjct 61 ALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGA 120
Query 121 LVFVVAGAVPPAWVLWRRRVQADGSG 146
LVFVVAGAVPPAWVLWRRRVQADGSG
Sbjct 121 LVFVVAGAVPPAWVLWRRRVQADGSG 146
>gi|308370644|ref|ZP_07422212.2| hypothetical protein TMCG_00800 [Mycobacterium tuberculosis SUMu003]
gi|308373060|ref|ZP_07430883.2| hypothetical protein TMEG_01067 [Mycobacterium tuberculosis SUMu005]
gi|308374241|ref|ZP_07435289.2| hypothetical protein TMFG_02361 [Mycobacterium tuberculosis SUMu006]
6 more sequence titles
Length=120
Score = 222 bits (565), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/120 (99%), Positives = 120/120 (100%), Gaps = 0/120 (0%)
Query 27 LVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAA 86
+VVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAA
Sbjct 1 MVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAA 60
Query 87 LPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGSG 146
LPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGSG
Sbjct 61 LPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGSG 120
>gi|339294162|gb|AEJ46273.1| hypothetical protein CCDC5079_1083 [Mycobacterium tuberculosis
CCDC5079]
gi|339297803|gb|AEJ49913.1| hypothetical protein CCDC5180_1076 [Mycobacterium tuberculosis
CCDC5180]
Length=108
Score = 202 bits (514), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 107/108 (99%), Positives = 108/108 (100%), Gaps = 0/108 (0%)
Query 39 VSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAA 98
+SALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAA
Sbjct 1 MSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAA 60
Query 99 MSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGSG 146
MSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGSG
Sbjct 61 MSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGSG 108
>gi|296170060|ref|ZP_06851663.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295895270|gb|EFG74980.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=153
Score = 170 bits (430), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 98/149 (66%), Positives = 117/149 (79%), Gaps = 11/149 (7%)
Query 1 VGHRVDTLSDRQRANL-----------TTGATDRAIRLVVLALLTVDGVVSALAGALLMP 49
+G+R+DT+ RQR++L +GATD AIR VVL+LL VDGV+SALAGALL+P
Sbjct 1 MGYRLDTVQHRQRSDLIETEPRAASLEDSGATDPAIRFVVLSLLAVDGVLSALAGALLLP 60
Query 50 WYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVI 109
YIG+ PFPIS LISGLVNAALVWAA RWTTS+R+AALPLW WLL VA +S GGP DDVI
Sbjct 61 SYIGATPFPISGLISGLVNAALVWAAGRWTTSTRLAALPLWTWLLAVAMISMGGPADDVI 120
Query 110 LGGQGLLVYGALVFVVAGAVPPAWVLWRR 138
LGG+G++ YGAL+ +V G PP WVLWRR
Sbjct 121 LGGRGVMAYGALLLIVLGVAPPFWVLWRR 149
>gi|183984254|ref|YP_001852545.1| hypothetical protein MMAR_4281 [Mycobacterium marinum M]
gi|183177580|gb|ACC42690.1| conserved hypothetical membrane protein [Mycobacterium marinum
M]
Length=119
Score = 169 bits (427), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 86/118 (73%), Positives = 101/118 (86%), Gaps = 0/118 (0%)
Query 29 VLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALP 88
+L LL VDGV+SAL GALL+P Y+GS PFPISALISGLVNAALVWAA RWTTS RVAALP
Sbjct 1 MLTLLAVDGVLSALMGALLLPTYLGSVPFPISALISGLVNAALVWAAERWTTSHRVAALP 60
Query 89 LWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGSG 146
LW WL TVA +SFGGPGDDV+LGG+G++ YGAL+ +V GA+PP WVLWRR+V+ +G
Sbjct 61 LWTWLATVAVLSFGGPGDDVVLGGRGVMAYGALILIVLGALPPGWVLWRRQVRDRPNG 118
>gi|118616765|ref|YP_905097.1| hypothetical protein MUL_1018 [Mycobacterium ulcerans Agy99]
gi|118568875|gb|ABL03626.1| conserved hypothetical membrane protein [Mycobacterium ulcerans
Agy99]
Length=119
Score = 166 bits (419), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 85/118 (73%), Positives = 100/118 (85%), Gaps = 0/118 (0%)
Query 29 VLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALP 88
+L LL VDGV+SAL GALL+P Y+GS PFPISALISGLVNAALVWAA RWTTS RVAALP
Sbjct 1 MLTLLAVDGVLSALMGALLLPTYLGSVPFPISALISGLVNAALVWAAERWTTSHRVAALP 60
Query 89 LWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGSG 146
LW WL TVA +SFGGPGDDV+LGG+G++ YGAL+ +V GA+PP WVLWR +V+ +G
Sbjct 61 LWTWLATVAVLSFGGPGDDVVLGGRGVMAYGALILIVLGALPPGWVLWRCQVRDRPNG 118
>gi|254818774|ref|ZP_05223775.1| hypothetical protein MintA_02559 [Mycobacterium intracellulare
ATCC 13950]
Length=153
Score = 158 bits (399), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 101/149 (68%), Positives = 118/149 (80%), Gaps = 11/149 (7%)
Query 1 VGHRVDTLSDRQRANLTT-----------GATDRAIRLVVLALLTVDGVVSALAGALLMP 49
+G+R+DTL RQR++LT GATD AIR VVL+LL VDGV+SALAGALL+P
Sbjct 1 MGYRLDTLQHRQRSDLTDTQPRVASVEDGGATDPAIRFVVLSLLAVDGVLSALAGALLLP 60
Query 50 WYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVI 109
+YIG+ PFP+S LISGLVNAALVWAA RWT S R+AALPLW WLLTVA +S GGP DDVI
Sbjct 61 FYIGTVPFPLSGLISGLVNAALVWAAGRWTRSPRLAALPLWTWLLTVAVISMGGPADDVI 120
Query 110 LGGQGLLVYGALVFVVAGAVPPAWVLWRR 138
LGG+GL+ YGAL+ +V G VPP WVLWRR
Sbjct 121 LGGRGLMAYGALLLLVLGVVPPVWVLWRR 149
>gi|41408710|ref|NP_961546.1| hypothetical protein MAP2612c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41397068|gb|AAS04929.1| hypothetical protein MAP_2612c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=162
Score = 156 bits (395), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 100/149 (68%), Positives = 116/149 (78%), Gaps = 11/149 (7%)
Query 1 VGHRVDTLSDRQRANLT-----------TGATDRAIRLVVLALLTVDGVVSALAGALLMP 49
VG+R+DTL RQR++LT +GATD AIR VVLALL VDGV+SALAGALL+P
Sbjct 10 VGYRLDTLQHRQRSDLTETKPRVASGEDSGATDPAIRFVVLALLAVDGVLSALAGALLLP 69
Query 50 WYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVI 109
YIG+ PFPIS LISGL+NAALVWAA RWT S R+ ALPLW WLLTVA +S GGP DDVI
Sbjct 70 LYIGTVPFPISGLISGLLNAALVWAAGRWTRSPRLVALPLWTWLLTVAVISMGGPADDVI 129
Query 110 LGGQGLLVYGALVFVVAGAVPPAWVLWRR 138
LGG+G++ YGAL+ +V G PP WVLWRR
Sbjct 130 LGGRGVMAYGALLLLVLGVAPPVWVLWRR 158
>gi|118463602|ref|YP_880556.1| hypothetical protein MAV_1311 [Mycobacterium avium 104]
gi|118164889|gb|ABK65786.1| conserved hypothetical protein [Mycobacterium avium 104]
gi|336458653|gb|EGO37615.1| hypothetical protein MAPs_10860 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=153
Score = 155 bits (392), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 99/149 (67%), Positives = 116/149 (78%), Gaps = 11/149 (7%)
Query 1 VGHRVDTLSDRQRANLT-----------TGATDRAIRLVVLALLTVDGVVSALAGALLMP 49
+G+R+DTL RQR++LT +GATD AIR VVLALL VDGV+SALAGALL+P
Sbjct 1 MGYRLDTLQHRQRSDLTETKPRVASGEDSGATDPAIRFVVLALLAVDGVLSALAGALLLP 60
Query 50 WYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVI 109
YIG+ PFPIS LISGL+NAALVWAA RWT S R+ ALPLW WLLTVA +S GGP DDVI
Sbjct 61 LYIGTVPFPISGLISGLLNAALVWAAGRWTRSPRLVALPLWTWLLTVAVISMGGPADDVI 120
Query 110 LGGQGLLVYGALVFVVAGAVPPAWVLWRR 138
LGG+G++ YGAL+ +V G PP WVLWRR
Sbjct 121 LGGRGVMAYGALLLLVLGVAPPVWVLWRR 149
>gi|342862061|ref|ZP_08718704.1| hypothetical protein MCOL_24341 [Mycobacterium colombiense CECT
3035]
gi|342130365|gb|EGT83680.1| hypothetical protein MCOL_24341 [Mycobacterium colombiense CECT
3035]
Length=153
Score = 151 bits (382), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/149 (66%), Positives = 113/149 (76%), Gaps = 11/149 (7%)
Query 1 VGHRVDTLSDRQRANLTT-----------GATDRAIRLVVLALLTVDGVVSALAGALLMP 49
+G+R+ + RQR +LT GATD AIR VVLALL VDGV+SALAGALL+P
Sbjct 1 MGYRLHHVQHRQRPDLTETKPRVASLDDGGATDPAIRFVVLALLAVDGVLSALAGALLLP 60
Query 50 WYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVI 109
YIG+ PFPIS L+SGLVNAALVWAA RWT S R+AALPLW WLLTVA +S GGP DDVI
Sbjct 61 LYIGTVPFPISGLVSGLVNAALVWAAGRWTRSPRLAALPLWTWLLTVAVISMGGPADDVI 120
Query 110 LGGQGLLVYGALVFVVAGAVPPAWVLWRR 138
LGG+GL+ YGAL+ +V G PP WVLWRR
Sbjct 121 LGGRGLMAYGALLLLVLGVAPPVWVLWRR 149
>gi|240171693|ref|ZP_04750352.1| hypothetical protein MkanA1_20435 [Mycobacterium kansasii ATCC
12478]
Length=113
Score = 150 bits (380), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 86/112 (77%), Positives = 98/112 (88%), Gaps = 0/112 (0%)
Query 30 LALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPL 89
+ALL VDGV+SALAGALL+P YIGS PFPISA+ISGLVNAALVWAAARWTTS RVAALPL
Sbjct 1 MALLAVDGVLSALAGALLLPSYIGSVPFPISAVISGLVNAALVWAAARWTTSPRVAALPL 60
Query 90 WAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQ 141
W WLLTVA +S GGPGDDV+LGG+GL+ +GAL+ + AG PPAWVLWR R+
Sbjct 61 WTWLLTVAVLSLGGPGDDVVLGGRGLMAFGALILIGAGVAPPAWVLWRLRIH 112
>gi|333989800|ref|YP_004522414.1| hypothetical protein JDM601_1160 [Mycobacterium sp. JDM601]
gi|333485768|gb|AEF35160.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=147
Score = 145 bits (366), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 71/126 (57%), Positives = 95/126 (76%), Gaps = 0/126 (0%)
Query 16 LTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAA 75
+ T RA+ VLALL VDGV+ A++ ALL+P +IG+ PFP+SAL+ GLVNA L+W A
Sbjct 12 MNTDPPGRAVNYPVLALLAVDGVLCAVSTALLLPAHIGAVPFPLSALVGGLVNAGLLWVA 71
Query 76 ARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVL 135
R T+S R+AALPLW WLLT+A M+ GGPGDD+I +G++ YG L+ +V G+ PP W+L
Sbjct 72 LRCTSSLRLAALPLWTWLLTIALMTLGGPGDDLIFADRGVMAYGVLLMIVLGSAPPGWLL 131
Query 136 WRRRVQ 141
WRRR+Q
Sbjct 132 WRRRLQ 137
>gi|108800997|ref|YP_641194.1| hypothetical protein Mmcs_4033 [Mycobacterium sp. MCS]
gi|119870138|ref|YP_940090.1| hypothetical protein Mkms_4108 [Mycobacterium sp. KMS]
gi|126436834|ref|YP_001072525.1| hypothetical protein Mjls_4263 [Mycobacterium sp. JLS]
gi|108771416|gb|ABG10138.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119696227|gb|ABL93300.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126236634|gb|ABO00035.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=148
Score = 132 bits (333), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/143 (52%), Positives = 99/143 (70%), Gaps = 5/143 (3%)
Query 1 VGHRVDTLSDRQRANLTTG--ATDRA---IRLVVLALLTVDGVVSALAGALLMPWYIGSA 55
+G R + RQ +LT + RA +R V+LALLT+DGV++A+ GA +P IG A
Sbjct 1 MGGRFVRGAARQHPDLTAPPRGSRRAAGLLRGVILALLTLDGVLTAVLGAFFLPLRIGVA 60
Query 56 PFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGL 115
PFP+SA + G VNAALVWAA +WT S R+AALPLWAWL T+A ++ GGPG DV+ GG+ +
Sbjct 61 PFPLSAFVVGAVNAALVWAAFQWTDSGRLAALPLWAWLATLAVLTVGGPGGDVVFGGRDV 120
Query 116 LVYGALVFVVAGAVPPAWVLWRR 138
+ +G L++ G P AW+LWRR
Sbjct 121 MAFGVLIYTAVGVAPSAWLLWRR 143
>gi|118472485|ref|YP_889374.1| hypothetical protein MSMEG_5128 [Mycobacterium smegmatis str.
MC2 155]
gi|118173772|gb|ABK74668.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=134
Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 67/138 (49%), Positives = 91/138 (66%), Gaps = 6/138 (4%)
Query 1 VGHRVDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPIS 60
+G R+ + RQ +LT+ A +VL LL +DGV+SA+ GAL +P YIG+ PFPIS
Sbjct 1 MGRRLAGGAARQHPDLTSRAG------IVLTLLAIDGVISAVVGALFLPIYIGTIPFPIS 54
Query 61 ALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGA 120
A+++G VN ALVWAAA W S +A LPLW +L V M+ GGPG D+I G GL +GA
Sbjct 55 AVLTGAVNLALVWAAAHWADSLLLAGLPLWTFLAMVVVMTLGGPGGDLIFAGTGLYAFGA 114
Query 121 LVFVVAGAVPPAWVLWRR 138
++F+ G++P W L RR
Sbjct 115 MIFLAVGSLPAVWFLRRR 132
>gi|15827786|ref|NP_302049.1| hypothetical protein ML1494 [Mycobacterium leprae TN]
gi|221230263|ref|YP_002503679.1| hypothetical protein MLBr_01494 [Mycobacterium leprae Br4923]
gi|13093338|emb|CAC30444.1| conserved membrane protein [Mycobacterium leprae]
gi|219933370|emb|CAR71588.1| conserved membrane protein [Mycobacterium leprae Br4923]
Length=117
Score = 124 bits (312), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 70/113 (62%), Positives = 82/113 (73%), Gaps = 1/113 (0%)
Query 25 IRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRV 84
+R+VVL L DGV+SA+ GA+LMP YIGS PFPIS LISGLVNA LVWAA WT S +
Sbjct 1 MRIVVLVLFAADGVLSAVVGAMLMPLYIGSVPFPISGLISGLVNAVLVWAARLWTRSPWL 60
Query 85 AALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWR 137
ALPLW WLLTV +S GGPG DV+ GQ ++ GAL ++ G PPA VLWR
Sbjct 61 VALPLWVWLLTVGLLSLGGPGVDVV-SGQSVMASGALWLILLGVSPPACVLWR 112
>gi|145222754|ref|YP_001133432.1| hypothetical protein Mflv_2166 [Mycobacterium gilvum PYR-GCK]
gi|145215240|gb|ABP44644.1| hypothetical protein Mflv_2166 [Mycobacterium gilvum PYR-GCK]
Length=156
Score = 119 bits (298), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/139 (46%), Positives = 92/139 (67%), Gaps = 2/139 (1%)
Query 1 VGHRVDTLSDRQRANLTTGATDRA--IRLVVLALLTVDGVVSALAGALLMPWYIGSAPFP 58
+G R+ L+ RQ +LTT +R+ +L LL +DGV+SAL +P +G PFP
Sbjct 1 MGRRLPRLAARQHPDLTTAVRPAPGRLRVPLLILLGLDGVLSALLAVFFLPLRLGPVPFP 60
Query 59 ISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVY 118
ISAL+SG++NAALVWAA + T++ R+ ALPLW WL T+ + PGDD++ GG GL+ +
Sbjct 61 ISALLSGILNAALVWAALQLTSTPRLGALPLWTWLATLLVLMTITPGDDIVFGGSGLMEF 120
Query 119 GALVFVVAGAVPPAWVLWR 137
G ++ V GAVP AW++ +
Sbjct 121 GPVLLVALGAVPAAWLVMK 139
>gi|254774189|ref|ZP_05215705.1| hypothetical protein MaviaA2_05880 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=110
Score = 118 bits (296), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 76/106 (72%), Positives = 88/106 (84%), Gaps = 0/106 (0%)
Query 33 LTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAW 92
+ VDGV+SALAGALL+P YIG+ PFPIS LISGL+NAALVWAA RWT S R+ ALPLW W
Sbjct 1 MAVDGVLSALAGALLLPLYIGTVPFPISGLISGLLNAALVWAAGRWTRSPRLVALPLWTW 60
Query 93 LLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRR 138
LLTVA +S GGP DDVILGG+G++ YGAL+ +V G PP WVLWRR
Sbjct 61 LLTVAVISMGGPADDVILGGRGVMAYGALLLLVLGVAPPVWVLWRR 106
>gi|315443222|ref|YP_004076101.1| hypothetical protein Mspyr1_15980 [Mycobacterium sp. Spyr1]
gi|315261525|gb|ADT98266.1| hypothetical protein Mspyr1_15980 [Mycobacterium sp. Spyr1]
Length=128
Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 66/98 (68%), Gaps = 0/98 (0%)
Query 40 SALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAM 99
SAL +P +G PFPISAL+SG +NAALVWAA + T + R+ ALPLW WL T+ +
Sbjct 14 SALMAVFFLPMRLGPVPFPISALLSGALNAALVWAALQLTATPRLGALPLWTWLATLLVL 73
Query 100 SFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWR 137
PGDD++ GG GL+ +G ++ V GAVP AW++ R
Sbjct 74 MTITPGDDIVFGGSGLMEFGPVLLVALGAVPAAWLVMR 111
>gi|120405486|ref|YP_955315.1| hypothetical protein Mvan_4534 [Mycobacterium vanbaalenii PYR-1]
gi|119958304|gb|ABM15309.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=131
Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/99 (48%), Positives = 64/99 (65%), Gaps = 0/99 (0%)
Query 40 SALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAM 99
SAL +P +G P PISAL+SG +N +LVWA +WT+ R+AALPLW WL TV
Sbjct 20 SALMAVFFLPLRLGPVPLPISALLSGALNLSLVWAGLQWTSVPRLAALPLWTWLATVLLC 79
Query 100 SFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRR 138
PGDD++ GG G++ +G ++ V GA+P AW+L R
Sbjct 80 MVFKPGDDIVFGGVGIMEFGPVLLVALGALPAAWLLMRH 118
>gi|325676454|ref|ZP_08156132.1| hypothetical protein HMPREF0724_13915 [Rhodococcus equi ATCC
33707]
gi|325552632|gb|EGD22316.1| hypothetical protein HMPREF0724_13915 [Rhodococcus equi ATCC
33707]
Length=136
Score = 79.0 bits (193), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/113 (41%), Positives = 66/113 (59%), Gaps = 4/113 (3%)
Query 30 LALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPL 89
LALLT DG + A+ L +P YIG+ PFPI+ LIS +VN LV A ++T + VAALPL
Sbjct 21 LALLTFDGFLCAVLSVLFLPTYIGTTPFPITVLISAVVNLLLVLGARKFTDRALVAALPL 80
Query 90 WAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQA 142
+ WL + GGPG DV++ + L+ VA +P L++ R+ +
Sbjct 81 FGWLFGFGLCAIGGPGGDVLVFED----WRTLLLFVAALLPAGLYLFQVRLSS 129
>gi|312138844|ref|YP_004006180.1| integral membrane protein [Rhodococcus equi 103S]
gi|311888183|emb|CBH47495.1| putative integral membrane protein [Rhodococcus equi 103S]
Length=136
Score = 78.6 bits (192), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 46/113 (41%), Positives = 66/113 (59%), Gaps = 4/113 (3%)
Query 30 LALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPL 89
LALLT DG + A+ L +P YIG+ PFPI+ LIS +VN LV A ++T + VAALPL
Sbjct 21 LALLTFDGFLCAVLSVLFLPAYIGTTPFPITVLISAVVNLLLVLGARKFTDRALVAALPL 80
Query 90 WAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQA 142
+ WL + GGPG DV++ + L+ VA +P L++ R+ +
Sbjct 81 FGWLFGFGLCAIGGPGGDVLVFED----WRTLLLFVAALLPAGLYLFQVRLSS 129
>gi|169628410|ref|YP_001702059.1| hypothetical protein MAB_1317 [Mycobacterium abscessus ATCC 19977]
gi|169240377|emb|CAM61405.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=133
Score = 70.9 bits (172), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 49/141 (35%), Positives = 78/141 (56%), Gaps = 8/141 (5%)
Query 1 VGHRVDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPIS 60
+G R+ S R +L GA + +R V+A L DGV+SA+ GA L+ G P+
Sbjct 1 MGRRLAGSSARFDPDL--GALNVFVRGFVMAALIFDGVISAILGAALLNTRFGGVLVPLG 58
Query 61 ALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGA 120
+I+ ++N LVW+A +W + R LWA++ T + FGGPG DV+ G +
Sbjct 59 LIIAAVLNVLLVWSALQWAPTPRWVGASLWAFVATAMVLLFGGPGGDVVFTG-----FWP 113
Query 121 LVFVVAGAVPPAWVLWRRRVQ 141
++ +V G +P A++L RRR+
Sbjct 114 VLLIVIGVLPAAYLL-RRRID 133
>gi|333921030|ref|YP_004494611.1| hypothetical protein AS9A_3370 [Amycolicicoccus subflavus DQS3-9A1]
gi|333483251|gb|AEF41811.1| Hypothetical membrane protein [Amycolicicoccus subflavus DQS3-9A1]
Length=152
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/122 (37%), Positives = 63/122 (52%), Gaps = 9/122 (7%)
Query 25 IRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRV 84
I ++L LL VDG V L +P Y+G PFP S ++ + N LV AA R +
Sbjct 24 IDYIILGLLCVDGFVLGLVSVFYLPSYLGGVPFPFSVFLAAIGNILLVEAARRLGFRRSL 83
Query 85 AALPLWAWLLTVAAMSFGGPGDDVIL---GGQGLLVYGALVFVVAGAVPPAWVLWRRRVQ 141
AALP+ W++ V GGPG D+++ +GL + VV G VP + L+R VQ
Sbjct 84 AALPVVGWVVAVLLAFAGGPGGDIVVPAADARGLFL------VVGGLVPVGFWLFRDVVQ 137
Query 142 AD 143
D
Sbjct 138 RD 139
>gi|134097605|ref|YP_001103266.1| hypothetical protein SACE_1009 [Saccharopolyspora erythraea NRRL
2338]
gi|291008476|ref|ZP_06566449.1| hypothetical protein SeryN2_28478 [Saccharopolyspora erythraea
NRRL 2338]
gi|133910228|emb|CAM00341.1| hypothetical protein SACE_1009 [Saccharopolyspora erythraea NRRL
2338]
Length=126
Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/127 (39%), Positives = 65/127 (52%), Gaps = 12/127 (9%)
Query 20 ATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWT 79
A DR V+ L +D +V A+ L +P +IG FPI A ++ + LV AA R +
Sbjct 3 AADR----VLFGFLLLDALVLAIVELLFLPLWIGPVQFPIMAAVAAVTTPLLVSAAGRLS 58
Query 80 TSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAW----VL 135
VAA PL W L V GPG DV+L G + AL+F+ AGA+P A VL
Sbjct 59 QHRGVAAAPLVVWFLAVFVFGLFGPGGDVVLVGD----WRALLFIAAGALPSAMMLGIVL 114
Query 136 WRRRVQA 142
R V+A
Sbjct 115 ARNAVEA 121
>gi|98856|pir||PH0210 hypothetical protein 133 (fdxA 5' region) - Saccharopolyspora
erythraea (fragment)
Length=133
Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/127 (39%), Positives = 65/127 (52%), Gaps = 12/127 (9%)
Query 20 ATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWT 79
A DR V+ L +D +V A+ L +P +IG FPI A ++ + LV AA R +
Sbjct 10 AADR----VLFGFLLLDALVLAIVELLFLPLWIGPVQFPIMAAVAAVTTPLLVSAAGRLS 65
Query 80 TSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAW----VL 135
VAA PL W L V GPG DV+L G + AL+F+ AGA+P A VL
Sbjct 66 QHRGVAAAPLVVWFLAVFVFGLFGPGGDVVLVGD----WRALLFIAAGALPSAMMLGIVL 121
Query 136 WRRRVQA 142
R V+A
Sbjct 122 ARNAVEA 128
>gi|229493872|ref|ZP_04387644.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229319149|gb|EEN84998.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=138
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/129 (34%), Positives = 66/129 (52%), Gaps = 8/129 (6%)
Query 11 RQRANLTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAA 70
+ +TG + + V+L +L DG + A+ L +P ++G+ FP+S L++ N
Sbjct 5 ENNTDASTGTSLSWLDPVILGVLIFDGFLCAVLSVLFLPTHLGTTAFPLSILVAATANLV 64
Query 71 LVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVF-VVAGAV 129
LV AA TT R AALPL W++ GGPG DV L++ L F ++ GA+
Sbjct 65 LVIAARTVTTPGR-AALPLAGWIVGFLVCMVGGPGGDV------LVLSSPLTFLLLVGAI 117
Query 130 PPAWVLWRR 138
PA V +
Sbjct 118 APAAVFLFK 126
>gi|226307675|ref|YP_002767635.1| hypothetical protein RER_41880 [Rhodococcus erythropolis PR4]
gi|226186792|dbj|BAH34896.1| hypothetical membrane protein [Rhodococcus erythropolis PR4]
Length=138
Score = 57.8 bits (138), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/112 (36%), Positives = 61/112 (55%), Gaps = 8/112 (7%)
Query 28 VVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAAL 87
++L +L DG + A+ L +P ++G+ FPIS L++ N LV AAR T+ +AAL
Sbjct 22 IILGVLIFDGFLCAVLSVLFLPTHLGTTAFPISILVAATANLVLVI-AARTVTTPGMAAL 80
Query 88 PLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVF-VVAGAVPPAWVLWRR 138
PL W++ GGPG DV L++ L F ++ GA+ PA V +
Sbjct 81 PLAGWIVGFLVCMVGGPGGDV------LVLSSPLTFLLLVGAIAPAAVFLFK 126
>gi|226365407|ref|YP_002783190.1| hypothetical protein ROP_59980 [Rhodococcus opacus B4]
gi|226243897|dbj|BAH54245.1| hypothetical membrane protein [Rhodococcus opacus B4]
Length=142
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/87 (44%), Positives = 48/87 (56%), Gaps = 0/87 (0%)
Query 24 AIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSR 83
A+ V LALL DG + A+ L +P Y+G+ PFPIS L + + N ALV AA T R
Sbjct 14 ALGRVTLALLVFDGFLCAVLSVLFLPSYLGATPFPISILAAAVANLALVIAARSTTGRVR 73
Query 84 VAALPLWAWLLTVAAMSFGGPGDDVIL 110
LPL W FGGPG DV++
Sbjct 74 SGVLPLAGWGFGFLLCMFGGPGGDVLM 100
>gi|257057079|ref|YP_003134911.1| hypothetical protein Svir_31090 [Saccharomonospora viridis DSM
43017]
gi|256586951|gb|ACU98084.1| hypothetical protein Svir_31090 [Saccharomonospora viridis DSM
43017]
Length=151
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/110 (36%), Positives = 60/110 (55%), Gaps = 13/110 (11%)
Query 29 VLALLTVDGVVSALAGALLMPW--------YIGSAPFPISALISGLVNAALVWAAARWTT 80
++ALL+ D V+ A +P +G APFPIS L++ + LV AA R
Sbjct 30 MIALLSFDAVLLATLELFYLPLRFDGLVLPELGDAPFPISVLVAVVTTPLLVVAAQR-VA 88
Query 81 SSRVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVP 130
S R+A +PL AWL T+ + GPG D +L ++ + A++ +V GA+P
Sbjct 89 SRRLAMIPLLAWLSTLLVLGVAGPGGDTVL----MVDWRAMLLLVGGALP 134
>gi|111022900|ref|YP_705872.1| hypothetical protein RHA1_ro05937 [Rhodococcus jostii RHA1]
gi|110822430|gb|ABG97714.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=142
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/118 (39%), Positives = 59/118 (50%), Gaps = 10/118 (8%)
Query 28 VVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAAL 87
V L LL DG + A+ L +P Y+G+ PFPIS L + + N ALV AA T L
Sbjct 18 VTLTLLVFDGFLCAVLSVLFLPTYLGATPFPISILAAAVANLALVIAARSTTGRVGTGVL 77
Query 88 PLWAWLLTVAAMSFGGPGDDVILGGQG---LLVYGALVFVVAGAVPPAWVLWRRRVQA 142
PL W FGGPG DV++ LL++G+L VP A LW+ +A
Sbjct 78 PLAGWGFGFLLCMFGGPGGDVLMLASVWTLLLLFGSL-------VPAAVYLWKVSAKA 128
>gi|54026729|ref|YP_120971.1| hypothetical protein nfa47550 [Nocardia farcinica IFM 10152]
gi|54018237|dbj|BAD59607.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=160
Score = 51.2 bits (121), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/86 (38%), Positives = 49/86 (57%), Gaps = 4/86 (4%)
Query 53 GSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDDVILGG 112
G+ P PI+A+++ +VN ALV A T S RVA LP+ AW L +F GPG D++L
Sbjct 72 GAVPVPITAVLAAVVNIALVAAMRTLTDSVRVAMLPVMAWTLGFLVCTFPGPGGDLVLMS 131
Query 113 QGLLVYGALVFVVAGAVPPAWVLWRR 138
+ + L+ +V G P +++R
Sbjct 132 E----WPTLLLLVCGVGAPLIYVFQR 153
>gi|331694871|ref|YP_004331110.1| hypothetical protein Psed_1006 [Pseudonocardia dioxanivorans
CB1190]
gi|326949560|gb|AEA23257.1| hypothetical protein Psed_1006 [Pseudonocardia dioxanivorans
CB1190]
Length=119
Score = 51.2 bits (121), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/86 (40%), Positives = 50/86 (59%), Gaps = 0/86 (0%)
Query 25 IRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRV 84
+R + L ++ V V+ A +L P Y GS P P+ +++ + LV AA ++S+ V
Sbjct 1 MRTLTLVVIVVTSVLLAAFELMLQPLYAGSVPAPVGTVVALVTMPWLVRGAAGESSSTAV 60
Query 85 AALPLWAWLLTVAAMSFGGPGDDVIL 110
A LPL AWLLTV + GPG DV+L
Sbjct 61 AGLPLVAWLLTVGVLGALGPGGDVLL 86
>gi|296138874|ref|YP_003646117.1| hypothetical protein Tpau_1147 [Tsukamurella paurometabola DSM
20162]
gi|296027008|gb|ADG77778.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=117
Score = 50.4 bits (119), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/73 (37%), Positives = 41/73 (57%), Gaps = 0/73 (0%)
Query 37 GVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTV 96
G V A+ + + YIGS P PISA+ G + A + A R +R AA+P+ A+L+ V
Sbjct 17 GAVCAVGATMYLHLYIGSVPMPISAIGFGALLAGISVACGRLGGEARFAAIPVIAFLIVV 76
Query 97 AAMSFGGPGDDVI 109
GGPG+ ++
Sbjct 77 VVFLLGGPGNSIM 89
>gi|319950846|ref|ZP_08024730.1| hypothetical protein ES5_14603 [Dietzia cinnamea P4]
gi|319435501|gb|EFV90737.1| hypothetical protein ES5_14603 [Dietzia cinnamea P4]
Length=136
Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/121 (33%), Positives = 60/121 (50%), Gaps = 6/121 (4%)
Query 5 VDTLSDRQRANLTTGATDRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALIS 64
+ T++D R A + + LV+LA + S + + + Y+G P P+S L +
Sbjct 1 MPTVTDPVRPARPASAVEFGVALVLLA---IGAAASCVLASAWLTMYLGPVPMPVSVLAA 57
Query 65 GLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAMSFGGPGD---DVILGGQGLLVYGAL 121
G + LV A+ W+ VAA P W+LT+ A+ G GD V L G LLV+G L
Sbjct 58 GAWSLFLVRVASTWSDRRIVAAFPALVWILTLVALELGPGGDMPVPVGLRGLALLVFGGL 117
Query 122 V 122
+
Sbjct 118 I 118
>gi|343926942|ref|ZP_08766435.1| hypothetical protein GOALK_072_01640 [Gordonia alkanivorans NBRC
16433]
gi|343763302|dbj|GAA13361.1| hypothetical protein GOALK_072_01640 [Gordonia alkanivorans NBRC
16433]
Length=129
Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 57/110 (52%), Gaps = 3/110 (2%)
Query 36 DGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAW-LL 94
DG V L G P++ALI GL+NA L+W AA +T+S A PL AW L+
Sbjct 20 DGFVVGLLSVAFAYLRFGGVALPVAALIGGLLNAVLLWLAAGYTSSPLRYA-PLAAWGLV 78
Query 95 TVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRV-QAD 143
V A GPG DVIL G + L +V G P A++ W R+ +AD
Sbjct 79 VVIAGGIPGPGGDVILSTSGNYLVQTLFLLVLGVGPAAFLGWTHRLPEAD 128
>gi|337290442|ref|YP_004629463.1| hypothetical protein CULC22_00831 [Corynebacterium ulcerans BR-AD22]
gi|334698748|gb|AEG83544.1| putative membrane protein [Corynebacterium ulcerans BR-AD22]
Length=133
Score = 43.5 bits (101), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 34/108 (32%), Positives = 53/108 (50%), Gaps = 5/108 (4%)
Query 33 LTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAW 92
L++ GV+S L + + ++G P P + I+ L N L A WT S +A +P++AW
Sbjct 30 LSLGGVISVLLEIVYLNTWVGPVPVPYTVAIAFLFNMVLTKTAKLWTKKSAIALVPMYAW 89
Query 93 LL-TVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRR 139
L M GG D +LG L+ ++AG V +W L R +
Sbjct 90 ALGFFGLMVAGGVTGDQLLGSS----LRPLLLMIAGFVGGSWPLLRGK 133
>gi|334696555|gb|AEG81352.1| putative membrane protein [Corynebacterium ulcerans 809]
Length=133
Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/108 (32%), Positives = 53/108 (50%), Gaps = 5/108 (4%)
Query 33 LTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAW 92
L++ GV+S L + + ++G P P + I+ L N L A WT S +A +P++AW
Sbjct 30 LSLGGVISVLLEIVYLNTWVGPVPVPYTVAIAFLFNMVLTKTAKLWTKKSAIALVPMYAW 89
Query 93 LL-TVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRR 139
L M GG D +LG L+ ++AG V +W L R +
Sbjct 90 ALGFFGLMVAGGVTGDQLLGSS----LRPLLLMIAGFVGGSWPLLRGK 133
>gi|317506363|ref|ZP_07964174.1| hypothetical protein HMPREF9336_00543 [Segniliparus rugosus ATCC
BAA-974]
gi|316255326|gb|EFV14585.1| hypothetical protein HMPREF9336_00543 [Segniliparus rugosus ATCC
BAA-974]
Length=128
Score = 43.1 bits (100), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 35/91 (39%), Positives = 47/91 (52%), Gaps = 3/91 (3%)
Query 40 SALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWLLTVAAM 99
AL GA +P + PFP++AL+ G +N A+ AA W S R LPL WLL
Sbjct 30 EALVGAFYLPLWWDGRPFPVTALVCGALNVAIA-LAATW-ASPRAGLLPLAVWLLVCVVC 87
Query 100 SFGGPGDDVILGGQGLLVYGALVFVVAGAVP 130
GPG IL + + A +F+ AGA+P
Sbjct 88 LVPGPGGSRIL-LFAIADWRAPLFLAAGAIP 117
>gi|300858188|ref|YP_003783171.1| hypothetical protein cpfrc_00770 [Corynebacterium pseudotuberculosis
FRC41]
gi|300685642|gb|ADK28564.1| putative membrane protein [Corynebacterium pseudotuberculosis
FRC41]
gi|302205910|gb|ADL10252.1| Hypothetical protein CpC231_0770 [Corynebacterium pseudotuberculosis
C231]
gi|302330466|gb|ADL20660.1| Hypothetical protein Cp1002_0770 [Corynebacterium pseudotuberculosis
1002]
gi|308276145|gb|ADO26044.1| putative membrane protein [Corynebacterium pseudotuberculosis
I19]
gi|341824580|gb|AEK92101.1| Hypothetical protein CpPAT10_0769 [Corynebacterium pseudotuberculosis
PAT10]
Length=133
Score = 42.7 bits (99), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 33/108 (31%), Positives = 52/108 (49%), Gaps = 5/108 (4%)
Query 33 LTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAW 92
L++ GV+S L + + ++G P P + I+ L N L A WT S +A +P++AW
Sbjct 30 LSLGGVISVLLEIVYLNTWVGPVPAPYTVAIAFLFNMVLTKTALLWTKKSAIALIPMYAW 89
Query 93 LL-TVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRR 139
L M GG D + G + L+ ++AG V W L R +
Sbjct 90 ALGFFGLMVAGGMTGDQLFGAS----FRPLLLMIAGFVGGGWPLLRGK 133
>gi|326381444|ref|ZP_08203138.1| hypothetical protein SCNU_00795 [Gordonia neofelifaecis NRRL
B-59395]
gi|326199691|gb|EGD56871.1| hypothetical protein SCNU_00795 [Gordonia neofelifaecis NRRL
B-59395]
Length=119
Score = 42.0 bits (97), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 29/80 (37%), Positives = 45/80 (57%), Gaps = 1/80 (1%)
Query 28 VVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSS-RVAA 86
V L +LTV G + + A + IGS PFPI+ALI+G+VN ++ A +T S+ + A
Sbjct 7 VELGVLTVSGFLLGILTAAFLLVRIGSFPFPITALIAGVVNVLILKIAETYTKSAWQYAP 66
Query 87 LPLWAWLLTVAAMSFGGPGD 106
L W ++ +A + G G
Sbjct 67 LIAWTFVTAMAMLPVFGNGS 86
>gi|182436197|ref|YP_001823916.1| putative integral membrane protein [Streptomyces griseus subsp.
griseus NBRC 13350]
gi|178464713|dbj|BAG19233.1| putative integral membrane protein [Streptomyces griseus subsp.
griseus NBRC 13350]
Length=157
Score = 37.7 bits (86), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 34/102 (34%), Positives = 52/102 (51%), Gaps = 10/102 (9%)
Query 20 ATDRAIRLVVLALLTVDGVVSALAGALLMP-WYIGSAPFPISALISGLVNAALVWAAARW 78
A +R+ A L V GV+ A+AGAL+ P W+ G LI L AA ++ R
Sbjct 34 ARPNPVRIAAYAGLAVLGVLVAMAGALIQPAWFPGG-------LIIALGAAAGLFYGGRI 86
Query 79 TTSSRVAAL-PLWAWLLTVAAMSFGGPGDDVILGGQ-GLLVY 118
T +++ AL P W +TV + G P D + G + GL+++
Sbjct 87 LTRTQIGALAPAAGWFITVFLLLSGRPEGDYVFGDELGLVLF 128
>gi|326776823|ref|ZP_08236088.1| putative integral membrane protein [Streptomyces cf. griseus
XylebKG-1]
gi|326657156|gb|EGE42002.1| putative integral membrane protein [Streptomyces griseus XylebKG-1]
Length=157
Score = 37.7 bits (86), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 34/102 (34%), Positives = 52/102 (51%), Gaps = 10/102 (9%)
Query 20 ATDRAIRLVVLALLTVDGVVSALAGALLMP-WYIGSAPFPISALISGLVNAALVWAAARW 78
A +R+ A L V GV+ A+AGAL+ P W+ G LI L AA ++ R
Sbjct 34 ARPNPVRIAAYAGLAVLGVLVAMAGALIQPAWFPGG-------LIIALGAAAGLFYGGRI 86
Query 79 TTSSRVAAL-PLWAWLLTVAAMSFGGPGDDVILGGQ-GLLVY 118
T +++ AL P W +TV + G P D + G + GL+++
Sbjct 87 LTRTQIGALAPAAGWFITVFLLLSGRPEGDYVFGDELGLVLF 128
>gi|262203217|ref|YP_003274425.1| facilitated glucose transporter [Gordonia bronchialis DSM 43247]
gi|262086564|gb|ACY22532.1| facilitated glucose transporter [Gordonia bronchialis DSM 43247]
Length=129
Score = 37.4 bits (85), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 43/121 (36%), Positives = 64/121 (53%), Gaps = 3/121 (2%)
Query 23 RAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSS 82
R I ++LALL VDGVV + G P++A+++GL N L+W AA +T S+
Sbjct 2 RPIDRIMLALLVVDGVVIGVLSLSFAYLRFGGVAVPVAAVLAGLANCVLLWLAAGYTASA 61
Query 83 -RVAALPLWAWLLTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRRVQ 141
R A PL W+L V + GPG DV+L L ++ ++ G PA ++W R+
Sbjct 62 WRFA--PLAGWVLGVFVGAVPGPGGDVLLASDTGLAIPTMLLLLLGGGLPAALIWSGRLP 119
Query 142 A 142
A
Sbjct 120 A 120
>gi|305681878|ref|ZP_07404682.1| conserved hypothetical protein [Corynebacterium matruchotii ATCC
14266]
gi|305658351|gb|EFM47854.1| conserved hypothetical protein [Corynebacterium matruchotii ATCC
14266]
Length=156
Score = 35.4 bits (80), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 28/92 (31%), Positives = 47/92 (52%), Gaps = 8/92 (8%)
Query 50 WYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAWL--LTVAAMSFGGPGDD 107
W G A P + I+ N L A WT + +A +PLW WL + + A +F GD
Sbjct 71 WMFGVA-VPYTIPIAFCFNMVLTRTALLWTKNPYLALIPLWVWLGGIILIAFTFSFTGDQ 129
Query 108 VILGGQGLLVYGALVFVVAGAVPPAWVLWRRR 139
+I G + +++ +VAGA+ +W L+ ++
Sbjct 130 LI--GSNI---RSILLMVAGALGGSWPLFVKK 156
>gi|225022908|ref|ZP_03712100.1| hypothetical protein CORMATOL_02954 [Corynebacterium matruchotii
ATCC 33806]
gi|224944131|gb|EEG25340.1| hypothetical protein CORMATOL_02954 [Corynebacterium matruchotii
ATCC 33806]
Length=123
Score = 35.0 bits (79), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 28/95 (30%), Positives = 49/95 (52%), Gaps = 11/95 (11%)
Query 51 YIGSAPF----PISALISGLVNAALVWAAARWTTSSRVAALPLWAWL--LTVAAMSFGGP 104
Y+G+ F P + I+ N L A WT + +A +PLW WL + + A +F
Sbjct 34 YLGTWMFGVAVPYTIPIAFCFNMVLTRTALLWTKNPYLALIPLWVWLGGIILIAFTFSFT 93
Query 105 GDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRR 139
GD +I G + +++ +VAGA+ +W L+ ++
Sbjct 94 GDQLI--GSNI---RSILLMVAGALGGSWPLFVKK 123
>gi|310765704|gb|ADP10654.1| putative membrane protein [Erwinia sp. Ejp617]
Length=208
Score = 35.0 bits (79), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 30/95 (32%), Positives = 44/95 (47%), Gaps = 15/95 (15%)
Query 56 PFPISALISGLVNAALVW---AAARWTTSSRVAALPLWAWLLTVAAMSFGGPGDD--VIL 110
P P++ LI G + A VW RW S+ L WL T F PGDD ++
Sbjct 105 PIPLTLLIIGALVVATVWNRLEELRWPISTLTGMALLMVWLST--EQYFFRPGDDSFTLM 162
Query 111 GGQGLLVYGALVFVVAGAVPPAWVLWRRRVQADGS 145
G LL+ G +V++++ +RRR AD +
Sbjct 163 AGADLLLLGNIVWLISH--------FRRRFSADSA 189
>gi|38233567|ref|NP_939334.1| hypothetical protein DIP0972 [Corynebacterium diphtheriae NCTC
13129]
gi|38199827|emb|CAE49490.1| Putative membrane protein [Corynebacterium diphtheriae]
Length=126
Score = 34.7 bits (78), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 29/109 (27%), Positives = 53/109 (49%), Gaps = 7/109 (6%)
Query 33 LTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTSSRVAALPLWAW 92
+T+ + S L + + I P P + +++ L N L + W+ + +A +P++AW
Sbjct 23 ITLGALTSLLLEVVYLDTRIDGFPVPYTIVVAFLFNRVLTKTSLLWSRTPAIALIPIFAW 82
Query 93 L--LTVAAMSFGGPGDDVILGGQGLLVYGALVFVVAGAVPPAWVLWRRR 139
+ V M+ G GD ++ GQ L A++ + AG +W L RR
Sbjct 83 IAGFVVLTMTPGITGDQIV--GQNL---RAILLLGAGVAGGSWPLLSRR 126
>gi|159039672|ref|YP_001538925.1| hypothetical protein Sare_4144 [Salinispora arenicola CNS-205]
gi|157918507|gb|ABV99934.1| conserved hypothetical protein [Salinispora arenicola CNS-205]
Length=145
Score = 33.9 bits (76), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 38/110 (35%), Positives = 49/110 (45%), Gaps = 4/110 (3%)
Query 22 DRAIRLVVLALLTVDGVVSALAGALLMPWYIGSAPFPISALISGLVNAALVWAAARWTTS 81
DRA+R L V GV+SA+ LL P +G +S L++ +N AL W A
Sbjct 23 DRALRFTGGVLAVVGGVLSAVLELLLAPLRVGDHLVGVSVLVAVGMNVALSWFAHATVGR 82
Query 82 SRVAALPLWAWLLTVAAMSFGGPGDDVILGGQ----GLLVYGALVFVVAG 127
ALP W VA DV+L G L+V GA+ F V G
Sbjct 83 RWAVALPAVPWFFLVAVAGIRTTEGDVLLTGTWVDLLLVVAGAMTFAVLG 132
Lambda K H
0.324 0.137 0.436
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 127605758688
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40