BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2570
Length=129
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609707|ref|NP_217086.1| hypothetical protein Rv2570 [Mycoba... 258 3e-67
gi|289575278|ref|ZP_06455505.1| conserved hypothetical protein [... 255 2e-66
gi|289570740|ref|ZP_06450967.1| conserved hypothetical protein [... 252 1e-65
gi|254551621|ref|ZP_05142068.1| hypothetical protein Mtube_14384... 227 4e-58
gi|254821828|ref|ZP_05226829.1| hypothetical protein MintA_17982... 209 1e-52
gi|342858615|ref|ZP_08715270.1| hypothetical protein MCOL_07056 ... 208 2e-52
gi|118467090|ref|YP_882622.1| hypothetical protein MAV_3440 [Myc... 198 2e-49
gi|336461504|gb|EGO40372.1| hypothetical protein MAPs_29920 [Myc... 196 7e-49
gi|254775887|ref|ZP_05217403.1| hypothetical protein MaviaA2_146... 196 9e-49
gi|41407165|ref|NP_960001.1| hypothetical protein MAP1067c [Myco... 195 2e-48
gi|296170796|ref|ZP_06852367.1| conserved hypothetical protein [... 194 4e-48
gi|183982174|ref|YP_001850465.1| hypothetical protein MMAR_2161 ... 190 6e-47
gi|118617365|ref|YP_905697.1| hypothetical protein MUL_1745 [Myc... 181 4e-44
gi|118473391|ref|YP_887332.1| hypothetical protein MSMEG_3014 [M... 157 4e-37
gi|158318140|ref|YP_001510648.1| hypothetical protein Franean1_6... 142 2e-32
gi|302867940|ref|YP_003836577.1| hypothetical protein Micau_3474... 134 5e-30
gi|291302027|ref|YP_003513305.1| hypothetical protein Snas_4568 ... 131 4e-29
gi|332670274|ref|YP_004453282.1| hypothetical protein Celf_1763 ... 116 9e-25
gi|229819920|ref|YP_002881446.1| hypothetical protein Bcav_1426 ... 116 1e-24
gi|46206191|ref|ZP_00210234.1| COG3801: Uncharacterized protein ... 113 8e-24
gi|331698166|ref|YP_004334405.1| hypothetical protein Psed_4395 ... 112 2e-23
gi|336120944|ref|YP_004575730.1| hypothetical protein MLP_53130 ... 101 3e-20
gi|334336597|ref|YP_004541749.1| protein of unknown function DUF... 94.7 4e-18
gi|289444107|ref|ZP_06433851.1| LOW QUALITY PROTEIN: conserved h... 87.0 8e-16
gi|238059859|ref|ZP_04604568.1| hypothetical protein MCAG_00825 ... 77.8 5e-13
gi|288916740|ref|ZP_06411114.1| protein of unknown function DUF6... 75.1 3e-12
gi|302867207|ref|YP_003835844.1| hypothetical protein Micau_2732... 74.7 4e-12
gi|284044546|ref|YP_003394886.1| hypothetical protein Cwoe_3092 ... 74.7 4e-12
gi|229821650|ref|YP_002883176.1| hypothetical protein Bcav_3170 ... 74.3 5e-12
gi|330467737|ref|YP_004405480.1| hypothetical protein VAB18032_1... 73.6 1e-11
gi|158317986|ref|YP_001510494.1| hypothetical protein Franean1_6... 72.8 1e-11
gi|315506387|ref|YP_004085274.1| hypothetical protein ML5_5664 [... 72.8 2e-11
gi|226228940|ref|YP_002763046.1| hypothetical protein GAU_3534 [... 71.2 4e-11
gi|240168531|ref|ZP_04747190.1| hypothetical protein MkanA1_0441... 68.2 4e-10
gi|120406550|ref|YP_956379.1| hypothetical protein Mvan_5608 [My... 66.2 1e-09
gi|300790989|ref|YP_003771280.1| hypothetical protein AMED_9189 ... 66.2 2e-09
gi|342860020|ref|ZP_08716672.1| hypothetical protein MCOL_14110 ... 65.5 2e-09
gi|333920065|ref|YP_004493646.1| hypothetical protein AS9A_2399 ... 64.3 5e-09
gi|326332585|ref|ZP_08198853.1| hypothetical protein NBCG_04029 ... 63.5 1e-08
gi|183985288|ref|YP_001853579.1| hypothetical protein MMAR_5320 ... 62.8 1e-08
gi|324998228|ref|ZP_08119340.1| hypothetical protein PseP1_05646... 62.8 2e-08
gi|220913708|ref|YP_002489017.1| hypothetical protein Achl_2967 ... 62.4 2e-08
gi|331694787|ref|YP_004331026.1| hypothetical protein Psed_0921 ... 62.0 3e-08
gi|324998072|ref|ZP_08119184.1| hypothetical protein PseP1_04866... 58.2 4e-07
gi|284029487|ref|YP_003379418.1| hypothetical protein Kfla_1521 ... 58.2 4e-07
gi|311744773|ref|ZP_07718569.1| conserved hypothetical protein [... 58.2 4e-07
gi|226366356|ref|YP_002784139.1| hypothetical protein ROP_69470 ... 56.6 1e-06
gi|119716818|ref|YP_923783.1| hypothetical protein Noca_2592 [No... 55.8 2e-06
gi|336120032|ref|YP_004574810.1| hypothetical protein MLP_43930 ... 55.8 2e-06
gi|13472806|ref|NP_104373.1| hypothetical protein mlr3218 [Mesor... 55.1 3e-06
>gi|15609707|ref|NP_217086.1| hypothetical protein Rv2570 [Mycobacterium tuberculosis H37Rv]
gi|15842108|ref|NP_337145.1| hypothetical protein MT2646 [Mycobacterium tuberculosis CDC1551]
gi|31793753|ref|NP_856246.1| hypothetical protein Mb2600 [Mycobacterium bovis AF2122/97]
69 more sequence titles
Length=129
Score = 258 bits (658), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 128/129 (99%), Positives = 129/129 (100%), Gaps = 0/129 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG
Sbjct 1 MATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL
Sbjct 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
Query 121 VQAFLANSG 129
VQAFLANSG
Sbjct 121 VQAFLANSG 129
>gi|289575278|ref|ZP_06455505.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|289539709|gb|EFD44287.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
Length=129
Score = 255 bits (651), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 127/129 (99%), Positives = 128/129 (99%), Gaps = 0/129 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPS DIVG
Sbjct 1 MATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSDDIVG 60
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL
Sbjct 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
Query 121 VQAFLANSG 129
VQAFLANSG
Sbjct 121 VQAFLANSG 129
>gi|289570740|ref|ZP_06450967.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289751193|ref|ZP_06510571.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289754688|ref|ZP_06514066.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289544494|gb|EFD48142.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289691780|gb|EFD59209.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289695275|gb|EFD62704.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
Length=129
Score = 252 bits (644), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 126/129 (98%), Positives = 127/129 (99%), Gaps = 0/129 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATWDDVARIVGGLPLTAEQAPHDWRV RKLLAWERPLRKSDR ALTRAGSEPPSGDIVG
Sbjct 1 MATWDDVARIVGGLPLTAEQAPHDWRVDRKLLAWERPLRKSDRGALTRAGSEPPSGDIVG 60
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL
Sbjct 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
Query 121 VQAFLANSG 129
VQAFLANSG
Sbjct 121 VQAFLANSG 129
>gi|254551621|ref|ZP_05142068.1| hypothetical protein Mtube_14384 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|289762741|ref|ZP_06522119.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
gi|289710247|gb|EFD74263.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
Length=114
Score = 227 bits (579), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 113/114 (99%), Positives = 114/114 (100%), Gaps = 0/114 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG
Sbjct 1 MATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLM 114
VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLM
Sbjct 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLM 114
>gi|254821828|ref|ZP_05226829.1| hypothetical protein MintA_17982 [Mycobacterium intracellulare
ATCC 13950]
Length=129
Score = 209 bits (531), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 102/128 (80%), Positives = 113/128 (89%), Gaps = 0/128 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATWD+VARIVG L LT+E +PHDWRVG+KLLAWERPLR S+REAL R G EPP GDI+G
Sbjct 1 MATWDEVARIVGELALTSEPSPHDWRVGKKLLAWERPLRPSEREALARNGPEPPRGDILG 60
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRVSDEGVKFALI DEP YFTTPHFDGYPAVLV LAEI VRDL+ELITEAWL QAP++L
Sbjct 61 VRVSDEGVKFALIDDEPQTYFTTPHFDGYPAVLVNLAEISVRDLQELITEAWLTQAPRKL 120
Query 121 VQAFLANS 128
VQ FLA+S
Sbjct 121 VQEFLADS 128
>gi|342858615|ref|ZP_08715270.1| hypothetical protein MCOL_07056 [Mycobacterium colombiense CECT
3035]
gi|342134319|gb|EGT87499.1| hypothetical protein MCOL_07056 [Mycobacterium colombiense CECT
3035]
Length=129
Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 102/128 (80%), Positives = 113/128 (89%), Gaps = 0/128 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATW +VARIVG L LT+E +PHDWRVG+KLLAWERPLR S+REAL R G PP GDI+G
Sbjct 1 MATWGEVARIVGELALTSEPSPHDWRVGKKLLAWERPLRPSEREALERNGPRPPQGDILG 60
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRVSDEGVKFALI DEPG YFTTPHFDGYPAVLV LAEI VRDLEELITEAWL+QAP++L
Sbjct 61 VRVSDEGVKFALIDDEPGTYFTTPHFDGYPAVLVNLAEISVRDLEELITEAWLIQAPRKL 120
Query 121 VQAFLANS 128
VQ FLA+S
Sbjct 121 VQEFLADS 128
>gi|118467090|ref|YP_882622.1| hypothetical protein MAV_3440 [Mycobacterium avium 104]
gi|118168377|gb|ABK69274.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=129
Score = 198 bits (504), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 97/128 (76%), Positives = 110/128 (86%), Gaps = 0/128 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATW DVARIVG L LT+E +PHDWRVG+KLLAWERPLR S+REAL R G+EP GD++G
Sbjct 1 MATWYDVARIVGELALTSEPSPHDWRVGKKLLAWERPLRPSEREALARTGAEPAPGDVLG 60
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRV+DEGVKFALI D P +FTTPHFDGYPAVLV LA I VRDLEELITEAWL QAP++L
Sbjct 61 VRVADEGVKFALIDDAPQTFFTTPHFDGYPAVLVNLAAISVRDLEELITEAWLTQAPRKL 120
Query 121 VQAFLANS 128
VQ FLA+S
Sbjct 121 VQEFLADS 128
>gi|336461504|gb|EGO40372.1| hypothetical protein MAPs_29920 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=149
Score = 196 bits (499), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 96/128 (75%), Positives = 109/128 (86%), Gaps = 0/128 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
VATW DVARIVG L LT+E +PHDWRVG+KLLAWERPLR S+REAL R G+EP G+++G
Sbjct 21 VATWYDVARIVGELALTSEPSPHDWRVGKKLLAWERPLRPSEREALARTGAEPAPGNVLG 80
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRV+DEGVKFALI D P +FTTPHFDGYPAVLV L I VRDLEELITEAWL QAP++L
Sbjct 81 VRVADEGVKFALIDDAPQTFFTTPHFDGYPAVLVNLDAISVRDLEELITEAWLTQAPRKL 140
Query 121 VQAFLANS 128
VQ FLA+S
Sbjct 141 VQEFLADS 148
>gi|254775887|ref|ZP_05217403.1| hypothetical protein MaviaA2_14625 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=129
Score = 196 bits (498), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 96/128 (75%), Positives = 109/128 (86%), Gaps = 0/128 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATW DVARIVG L LT+E +PHDWRVG+KLLAWERPLR S+REAL R G+EP GD++G
Sbjct 1 MATWYDVARIVGELALTSEPSPHDWRVGKKLLAWERPLRPSEREALARTGAEPTPGDVLG 60
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRV+DEGVKFALI D P +FTTPHFDGYPAVLV L I VRDLEELITEAWL QAP++L
Sbjct 61 VRVADEGVKFALIDDAPQTFFTTPHFDGYPAVLVNLDAISVRDLEELITEAWLTQAPRKL 120
Query 121 VQAFLANS 128
VQ FLA+S
Sbjct 121 VQEFLADS 128
>gi|41407165|ref|NP_960001.1| hypothetical protein MAP1067c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41395516|gb|AAS03384.1| hypothetical protein MAP_1067c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=129
Score = 195 bits (495), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 95/128 (75%), Positives = 109/128 (86%), Gaps = 0/128 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATW DVARIVG L LT+E +PHDWRVG+KLLAWERPLR S+REAL R G+EP G+++G
Sbjct 1 MATWYDVARIVGELALTSEPSPHDWRVGKKLLAWERPLRPSEREALARTGAEPAPGNVLG 60
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRV+DEGVKFALI D P +FTTPHFDGYPAVLV L I VRDLEELITEAWL QAP++L
Sbjct 61 VRVADEGVKFALIDDAPQTFFTTPHFDGYPAVLVNLDAISVRDLEELITEAWLTQAPRKL 120
Query 121 VQAFLANS 128
VQ FLA+S
Sbjct 121 VQEFLADS 128
>gi|296170796|ref|ZP_06852367.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295894532|gb|EFG74270.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=129
Score = 194 bits (493), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 97/128 (76%), Positives = 107/128 (84%), Gaps = 0/128 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATW DVARIV L LT+E +PHDWRVG+KLLAWERPLR S+REAL R G P GDI+G
Sbjct 1 MATWYDVARIVRELALTSEPSPHDWRVGKKLLAWERPLRPSEREALDRDGMGSPEGDILG 60
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRVSDEGVKF LIADEP +YFTTPHFDGYP VLVRL I VR LEELITEAWL QAP++L
Sbjct 61 VRVSDEGVKFGLIADEPDIYFTTPHFDGYPVVLVRLGAISVRGLEELITEAWLTQAPRKL 120
Query 121 VQAFLANS 128
VQ FLA+S
Sbjct 121 VQEFLADS 128
>gi|183982174|ref|YP_001850465.1| hypothetical protein MMAR_2161 [Mycobacterium marinum M]
gi|183175500|gb|ACC40610.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=132
Score = 190 bits (482), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 92/129 (72%), Positives = 106/129 (83%), Gaps = 0/129 (0%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATW DVARIV LPLT E++ H+WRVG+K LAWERPLRKSD AL +G +PP GDI+G
Sbjct 1 MATWSDVARIVSELPLTEERSAHNWRVGKKPLAWERPLRKSDLAALAASGRQPPDGDILG 60
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRV+DEGVKFAL+ADEP VYFTTPHFDGYPAVLV L+EIE LEEL+TEAWL QAPK+L
Sbjct 61 VRVADEGVKFALVADEPTVYFTTPHFDGYPAVLVVLSEIEAIGLEELLTEAWLTQAPKKL 120
Query 121 VQAFLANSG 129
+ FL S
Sbjct 121 AKEFLTGSA 129
>gi|118617365|ref|YP_905697.1| hypothetical protein MUL_1745 [Mycobacterium ulcerans Agy99]
gi|118569475|gb|ABL04226.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=135
Score = 181 bits (458), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 92/132 (70%), Positives = 106/132 (81%), Gaps = 3/132 (2%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTR---AGSEPPSGD 57
+ATW DVARIV LPLT E++ H+WRVG+K LAWERPLRKSD AL +G +PP GD
Sbjct 1 MATWSDVARIVSELPLTEERSAHNWRVGKKPLAWERPLRKSDLAALAALAASGRQPPDGD 60
Query 58 IVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAP 117
I+GVRV+DEGVKFAL+ADEP VYFTTPHFDGYPAVLV L+EIE LEEL+TEAWL QAP
Sbjct 61 ILGVRVADEGVKFALVADEPTVYFTTPHFDGYPAVLVVLSEIEAIGLEELLTEAWLTQAP 120
Query 118 KQLVQAFLANSG 129
K+L + FL S
Sbjct 121 KRLAKEFLTGSA 132
>gi|118473391|ref|YP_887332.1| hypothetical protein MSMEG_3014 [Mycobacterium smegmatis str.
MC2 155]
gi|118174678|gb|ABK75574.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=126
Score = 157 bits (397), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 82/127 (65%), Positives = 97/127 (77%), Gaps = 3/127 (2%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+A W DVARI LP T EQ P WRV +K +AWERPLRK+ L G++ P+GDI+G
Sbjct 1 MADWYDVARIAAALPETDEQTPRTWRVRKKRIAWERPLRKA---DLAALGADAPTGDILG 57
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
VRVSDEGVK AL+AD+P VYFTTPHFDGYP VL+RLA IE +LEEL+TEAWL QAP L
Sbjct 58 VRVSDEGVKLALVADDPAVYFTTPHFDGYPIVLIRLAAIEPDELEELVTEAWLTQAPATL 117
Query 121 VQAFLAN 127
V+ FLA+
Sbjct 118 VEKFLAD 124
>gi|158318140|ref|YP_001510648.1| hypothetical protein Franean1_6405 [Frankia sp. EAN1pec]
gi|158113545|gb|ABW15742.1| protein of unknown function DUF661 [Frankia sp. EAN1pec]
Length=131
Score = 142 bits (358), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/128 (57%), Positives = 89/128 (70%), Gaps = 3/128 (2%)
Query 1 VATWDDVARIVGGLPLTAEQAPH---DWRVGRKLLAWERPLRKSDREALTRAGSEPPSGD 57
+ATWDDV R+ LP T E + WRV K WERPLRK+D A G + P G
Sbjct 1 MATWDDVRRLALALPETNESPSYGSASWRVRDKGFVWERPLRKTDLAAFAARGEKAPDGP 60
Query 58 IVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAP 117
++GVRV+DEGVK ALIAD P V+FT PHFDGYPAVLVRL I V +L+EL+ EAWL++AP
Sbjct 61 VLGVRVADEGVKNALIADSPTVFFTVPHFDGYPAVLVRLDRISVEELDELVVEAWLLRAP 120
Query 118 KQLVQAFL 125
K+ +A+L
Sbjct 121 KRAARAYL 128
>gi|302867940|ref|YP_003836577.1| hypothetical protein Micau_3474 [Micromonospora aurantiaca ATCC
27029]
gi|315505656|ref|YP_004084543.1| hypothetical protein ML5_4917 [Micromonospora sp. L5]
gi|302570799|gb|ADL47001.1| hypothetical protein Micau_3474 [Micromonospora aurantiaca ATCC
27029]
gi|315412275|gb|ADU10392.1| hypothetical protein ML5_4917 [Micromonospora sp. L5]
Length=130
Score = 134 bits (336), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 74/133 (56%), Positives = 88/133 (67%), Gaps = 7/133 (5%)
Query 1 VATWDDVARIVGGLPLTAEQAPHD----WRVGRKLLAWERPLRKSDREALTRAGSEPPSG 56
+ATW+DV RI GLP T E+ +D WRV K WERPLR+ + +AL G P G
Sbjct 1 MATWEDVRRIALGLPETTERPTYDEAPAWRVRDKSFVWERPLRRGELDAL---GDAAPDG 57
Query 57 DIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQA 116
I+G RV D G K ALIAD+P VYFTTPHFDGYPAVLVRL I V +L EL+TEAW +A
Sbjct 58 PILGARVPDLGAKEALIADDPAVYFTTPHFDGYPAVLVRLDRIGVDELTELVTEAWYARA 117
Query 117 PKQLVQAFLANSG 129
PK+L A A +
Sbjct 118 PKRLATAHRAENA 130
>gi|291302027|ref|YP_003513305.1| hypothetical protein Snas_4568 [Stackebrandtia nassauensis DSM
44728]
gi|290571247|gb|ADD44212.1| protein of unknown function DUF661 [Stackebrandtia nassauensis
DSM 44728]
Length=134
Score = 131 bits (329), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 67/133 (51%), Positives = 86/133 (65%), Gaps = 7/133 (5%)
Query 1 VATWDDVARIVGGLPLTAEQAPHD----WRVGRKLLAWERPLRKSDREALTRAGSEPPSG 56
+ATWDDV RI LP T E++ +D WRV KL WERPLR +DREAL G P G
Sbjct 1 MATWDDVRRIAMALPETTERSSYDGTAAWRVKDKLFVWERPLRGTDREAL---GESAPDG 57
Query 57 DIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQA 116
I+ RV+D G + ALI +P VYFT PHFDGYPA+L+ L I+V +L E++T+AW +A
Sbjct 58 PILAARVADLGEREALIEQDPKVYFTIPHFDGYPAILIHLDNIDVAELTEVVTDAWFTRA 117
Query 117 PKQLVQAFLANSG 129
PK++ F A
Sbjct 118 PKRVASQFRAREA 130
>gi|332670274|ref|YP_004453282.1| hypothetical protein Celf_1763 [Cellulomonas fimi ATCC 484]
gi|332339312|gb|AEE45895.1| hypothetical protein Celf_1763 [Cellulomonas fimi ATCC 484]
Length=129
Score = 116 bits (291), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 62/125 (50%), Positives = 78/125 (63%), Gaps = 3/125 (2%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATW DV V LP T E P WR + L WERPLR+SD EAL G++ GD++G
Sbjct 1 MATWQDVRAAVAALPDTTEPDPRRWRAHGRSLVWERPLRRSDHEAL---GADAWPGDVLG 57
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
+R D K L+ P VYFTTPHFDGYPAVLVRL + +L E++TE WL PK+
Sbjct 58 LRTPDLEAKDVLLGSAPDVYFTTPHFDGYPAVLVRLERLAPDELAEVVTETWLALVPKRT 117
Query 121 VQAFL 125
+A+L
Sbjct 118 ARAWL 122
>gi|229819920|ref|YP_002881446.1| hypothetical protein Bcav_1426 [Beutenbergia cavernae DSM 12333]
gi|229565833|gb|ACQ79684.1| protein of unknown function DUF661 [Beutenbergia cavernae DSM
12333]
Length=137
Score = 116 bits (291), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/127 (49%), Positives = 79/127 (63%), Gaps = 3/127 (2%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ATWDDV RI GLP AE P +WR R L+ WERPLRK+D E L G + P+G I+G
Sbjct 1 MATWDDVRRIALGLPDVAETRPGEWRAPRGLVVWERPLRKADLEFL---GDDAPTGPILG 57
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
R D + AL+ +EP V+F TPHF GYP VLV L I+ LEEL+ EAW + P ++
Sbjct 58 ARTPDVETQQALVTEEPEVFFVTPHFHGYPGVLVLLDAIDADRLEELVVEAWSTRVPAKV 117
Query 121 VQAFLAN 127
+A
Sbjct 118 AGPVVAE 124
>gi|46206191|ref|ZP_00210234.1| COG3801: Uncharacterized protein conserved in bacteria [Magnetospirillum
magnetotacticum MS-1]
Length=126
Score = 113 bits (283), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 63/128 (50%), Positives = 83/128 (65%), Gaps = 3/128 (2%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+A+W+DV R+V LP TA + W V KL WERPLR D + L G+ P+ V
Sbjct 1 MASWEDVGRVVAVLPGTALKDARRWTVHGKLFVWERPLRPRDLDEL---GAAAPAEPPVA 57
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
+RV+DEG K ALI D+PG +FTT HFDGYP VL RL + V +LEEL+ +AWL +AP +L
Sbjct 58 LRVADEGEKAALIQDDPGTFFTTSHFDGYPIVLARLDRVPVPELEELVQDAWLARAPHRL 117
Query 121 VQAFLANS 128
Q +L +
Sbjct 118 AQEYLRRT 125
>gi|331698166|ref|YP_004334405.1| hypothetical protein Psed_4395 [Pseudonocardia dioxanivorans
CB1190]
gi|326952855|gb|AEA26552.1| hypothetical protein Psed_4395 [Pseudonocardia dioxanivorans
CB1190]
Length=128
Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/127 (55%), Positives = 86/127 (68%), Gaps = 5/127 (3%)
Query 1 VATWDDVARIVGGLPLTAEQAPH--DWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDI 58
+ATWDDV R+ LP E A W V +K AWERPLR+ D EAL G P+G +
Sbjct 1 MATWDDVRRVATALPEVTEDAGEKLSWLVRKKAFAWERPLRRGDLEAL---GDAAPTGPV 57
Query 59 VGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPK 118
+ R +D GVK AL+AD+P VYFTTPHF+GYPAVLVRL I + +L EL+ EAWL QAPK
Sbjct 58 LCARTADVGVKEALVADDPAVYFTTPHFNGYPAVLVRLDLIALDELAELLEEAWLAQAPK 117
Query 119 QLVQAFL 125
++ A+L
Sbjct 118 RVAAAYL 124
>gi|336120944|ref|YP_004575730.1| hypothetical protein MLP_53130 [Microlunatus phosphovorus NM-1]
gi|334688742|dbj|BAK38327.1| hypothetical protein MLP_53130 [Microlunatus phosphovorus NM-1]
Length=141
Score = 101 bits (252), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 63/136 (47%), Positives = 82/136 (61%), Gaps = 10/136 (7%)
Query 1 VATWDDVARIVGGLPLTAEQAP---HDWRV---GR---KLLAWERPLRKSDREALTRAGS 51
++T++DVAR+ LP T E W V G+ K WERPL K D+ LT G+
Sbjct 1 MSTFEDVARLARALPETEETTSWGNLTWAVRSGGKAKPKGFVWERPLSKKDQAFLTAEGA 60
Query 52 E-PPSGDIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITE 110
E PP+ I+GVRV K A++A P FTTPHF+GYPAVLVRL ++ L E++T+
Sbjct 61 EVPPNEVILGVRVDGLAEKEAVLAANPDFMFTTPHFNGYPAVLVRLDRVDEARLREVVTD 120
Query 111 AWLMQAPKQLVQAFLA 126
AWL APK+L FLA
Sbjct 121 AWLAVAPKKLADQFLA 136
>gi|334336597|ref|YP_004541749.1| protein of unknown function DUF661 [Isoptericola variabilis 225]
gi|334106965|gb|AEG43855.1| protein of unknown function DUF661 [Isoptericola variabilis 225]
Length=129
Score = 94.7 bits (234), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 55/129 (43%), Positives = 77/129 (60%), Gaps = 8/129 (6%)
Query 1 VATWDDVARIVGGLPLTAEQAPH----DWRVGRKLLAWERPLRKSDREALTRAG-SEPPS 55
+AT+DDVAR+ LP + + + W V K+ AWERP K+D L R G ++PP+
Sbjct 1 MATYDDVARLASALPEVEDGSRYRGHCTWAVRGKVFAWERPFSKAD---LRRFGDAQPPA 57
Query 56 GDIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQ 115
G I+ + V D K A++A PG +FT HFDGY AVLVRL + +L E + +AWL +
Sbjct 58 GPILALAVEDLDEKEAVLAARPGSFFTIEHFDGYAAVLVRLDAVGDDELAEALEDAWLAK 117
Query 116 APKQLVQAF 124
AP L + F
Sbjct 118 APADLAETF 126
>gi|289444107|ref|ZP_06433851.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T46]
gi|289417026|gb|EFD14266.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T46]
Length=129
Score = 87.0 bits (214), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 44/53 (84%), Positives = 48/53 (91%), Gaps = 1/53 (1%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDR-EALTRAGSE 52
+ATWDDVARIVGGLPLTAEQAPHDWRV RKLLAWERPLRKSDR +AL +G+
Sbjct 1 MATWDDVARIVGGLPLTAEQAPHDWRVDRKLLAWERPLRKSDRGKALDLSGAS 53
>gi|238059859|ref|ZP_04604568.1| hypothetical protein MCAG_00825 [Micromonospora sp. ATCC 39149]
gi|237881670|gb|EEP70498.1| hypothetical protein MCAG_00825 [Micromonospora sp. ATCC 39149]
Length=122
Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/126 (39%), Positives = 67/126 (54%), Gaps = 9/126 (7%)
Query 1 VATWDDVARIVGGLPLTAE--QAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDI 58
+A DDV R+ LP E D+RV K W P R + + R DI
Sbjct 1 MADADDVRRLALALPHVVEIDSDGFDFRVENKGFVWSYPERLPGKPRVIRT-------DI 53
Query 59 VGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPK 118
+ V DEG K AL+ EP ++FTT +DG P V++RL E+ V L ELIT+AW M+AP
Sbjct 54 AVLYVGDEGEKQALLLGEPDLFFTTAGYDGLPLVMLRLTEVSVERLTELITDAWRMRAPA 113
Query 119 QLVQAF 124
+L ++
Sbjct 114 ELQESL 119
>gi|288916740|ref|ZP_06411114.1| protein of unknown function DUF661 [Frankia sp. EUN1f]
gi|288351814|gb|EFC86017.1| protein of unknown function DUF661 [Frankia sp. EUN1f]
Length=130
Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/121 (39%), Positives = 64/121 (53%), Gaps = 9/121 (7%)
Query 1 VATWDDVARIVGGLPLTAE--QAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDI 58
+A DDV R+ LP E D+RV K W P R R + R DI
Sbjct 1 MADADDVRRLAMSLPHVVEIDSDGFDFRVADKGFVWSYPERVPGRPRVIRT-------DI 53
Query 59 VGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPK 118
+ V DE K AL+ EP ++FT P +D +P V+VRL E+ V L EL+T+AW M+AP+
Sbjct 54 AVLFVGDEAEKQALLLGEPDLFFTAPGYDAWPVVMVRLGEVTVERLAELVTDAWRMRAPE 113
Query 119 Q 119
+
Sbjct 114 E 114
>gi|302867207|ref|YP_003835844.1| hypothetical protein Micau_2732 [Micromonospora aurantiaca ATCC
27029]
gi|302570066|gb|ADL46268.1| hypothetical protein Micau_2732 [Micromonospora aurantiaca ATCC
27029]
Length=128
Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/118 (40%), Positives = 60/118 (51%), Gaps = 9/118 (7%)
Query 5 DDVARIVGGLPLTAE--QAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVGVR 62
DDV R+ LP E D+RV K W P R R R DI +
Sbjct 8 DDVRRVALSLPHVVEIDSDGFDFRVAGKGFVWSYPERTPGRPRRIRT-------DIAVLY 60
Query 63 VSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
V DE K AL+ EP ++FTTP +DG P V++RLA + V L EL+T+AW M+AP
Sbjct 61 VGDEAEKQALLLGEPDLFFTTPAYDGSPLVMLRLAHVGVERLTELVTDAWRMRAPDSF 118
>gi|284044546|ref|YP_003394886.1| hypothetical protein Cwoe_3092 [Conexibacter woesei DSM 14684]
gi|283948767|gb|ADB51511.1| protein of unknown function DUF661 [Conexibacter woesei DSM 14684]
Length=114
Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 48/126 (39%), Positives = 68/126 (54%), Gaps = 21/126 (16%)
Query 1 VATWDDVARIVGGLPLTAEQAPHD---WRVGRKLLAWERPLRKSDREALTRAGSEPPSGD 57
+ T DDV RI LP T E+ + +RV +L A R
Sbjct 1 MTTEDDVRRIALALPETTERPSYGTPGFRVKDRLFARMR------------------EEG 42
Query 58 IVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAP 117
++ V D K ALIA EP +FTTPH+DGYP VLVRL E++ ++L EL+ +AW ++AP
Sbjct 43 VLVVWCDDVADKEALIASEPRKFFTTPHYDGYPMVLVRLPEVDAQELRELLLDAWRIRAP 102
Query 118 KQLVQA 123
K+++ A
Sbjct 103 KRVLAA 108
>gi|229821650|ref|YP_002883176.1| hypothetical protein Bcav_3170 [Beutenbergia cavernae DSM 12333]
gi|229567563|gb|ACQ81414.1| protein of unknown function DUF661 [Beutenbergia cavernae DSM
12333]
Length=132
Score = 74.3 bits (181), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 45/124 (37%), Positives = 67/124 (55%), Gaps = 7/124 (5%)
Query 5 DDVARIVGGLPLTAE---QAPHDWRVGRKLLAWERPLRKSDREALTRAGSEP-PSGDIVG 60
++VA + LP +E + W V AW RP K+D + R G P P+G IV
Sbjct 5 EEVAELATSLPEVSEGTSRGNRSWEVAGTGFAWVRPFSKAD---IRRFGEHPVPAGPIVA 61
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
V +D K A++A+ +FT HFDGY AVL+ + + R + E I +AWL +AP++L
Sbjct 62 VMTADLAEKEAILAEGRTGFFTIEHFDGYAAVLIDVTTAKQRWVREAIVDAWLAKAPREL 121
Query 121 VQAF 124
A+
Sbjct 122 ADAY 125
>gi|330467737|ref|YP_004405480.1| hypothetical protein VAB18032_18890 [Verrucosispora maris AB-18-032]
gi|328810708|gb|AEB44880.1| hypothetical protein VAB18032_18890 [Verrucosispora maris AB-18-032]
Length=128
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/122 (37%), Positives = 63/122 (52%), Gaps = 9/122 (7%)
Query 1 VATWDDVARIVGGLPLTAE--QAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDI 58
+A DDV R+ LP E D+RV K W P R+ + R DI
Sbjct 5 MADADDVRRLALALPQVVEIDSDGFDFRVADKGFVWSYPERRPGKPRTIRT-------DI 57
Query 59 VGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPK 118
+ V DE K AL+ EP +FTTP +DG P V++RL ++ L EL+T+AW M+AP+
Sbjct 58 AVLYVGDEAEKQALLLGEPETFFTTPGYDGLPLVMLRLTRVDAERLAELVTDAWRMRAPE 117
Query 119 QL 120
+
Sbjct 118 SV 119
>gi|158317986|ref|YP_001510494.1| hypothetical protein Franean1_6246 [Frankia sp. EAN1pec]
gi|158113391|gb|ABW15588.1| protein of unknown function DUF661 [Frankia sp. EAN1pec]
Length=129
Score = 72.8 bits (177), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/126 (38%), Positives = 63/126 (50%), Gaps = 9/126 (7%)
Query 1 VATWDDVARIVGGLP--LTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDI 58
+A DDV R+ LP + E D+RV K W P R + L R D
Sbjct 1 MADADDVRRLALALPHVVEIESDGFDFRVAGKGFVWSYPERTPGKPRLIRT-------DT 53
Query 59 VGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPK 118
+ V DE K AL+ EP ++FTTP ++G P V+VRLA + V L EL+T+AW M+ P
Sbjct 54 AVLFVGDEAEKQALLLGEPDIFFTTPAYNGLPLVMVRLAAVTVERLRELVTDAWRMRDPD 113
Query 119 QLVQAF 124
V
Sbjct 114 AHVSGL 119
>gi|315506387|ref|YP_004085274.1| hypothetical protein ML5_5664 [Micromonospora sp. L5]
gi|315413006|gb|ADU11123.1| hypothetical protein ML5_5664 [Micromonospora sp. L5]
Length=128
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/118 (39%), Positives = 59/118 (50%), Gaps = 9/118 (7%)
Query 5 DDVARIVGGLPLTAE--QAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVGVR 62
DDV R+ LP E D+RV K W P R R R DI +
Sbjct 8 DDVRRVALSLPHVVEIDSDGFDFRVAGKGFVWSYPERTPGRPRRIRT-------DIAVLY 60
Query 63 VSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
V DE K AL+ EP ++FTTP +DG P V++RLA + V L EL+ +AW M+AP
Sbjct 61 VGDEAEKQALLLGEPDLFFTTPAYDGSPLVMLRLAHVGVERLTELVIDAWRMRAPDSF 118
>gi|226228940|ref|YP_002763046.1| hypothetical protein GAU_3534 [Gemmatimonas aurantiaca T-27]
gi|226092131|dbj|BAH40576.1| hypothetical protein [Gemmatimonas aurantiaca T-27]
Length=124
Score = 71.2 bits (173), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/122 (40%), Positives = 69/122 (57%), Gaps = 5/122 (4%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERP--LRKSDREALTRAGSEPPSGDI 58
+AT DDV RI LP AE+A D R ++L ++P E + + P+
Sbjct 1 MATQDDVRRIALSLP-GAEEA--DDRFAFQVLVKDKPKGFTWVWLERIDPKKARIPNPRF 57
Query 59 VGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPK 118
+GVR S + +IA EP ++FT PH++GYPAVLVRL EI + LE ++ E W AP+
Sbjct 58 LGVRTSSVAERDLMIAAEPRIFFTEPHYNGYPAVLVRLEEIPMAQLEVILVEGWRHVAPR 117
Query 119 QL 120
+L
Sbjct 118 EL 119
>gi|240168531|ref|ZP_04747190.1| hypothetical protein MkanA1_04412 [Mycobacterium kansasii ATCC
12478]
Length=137
Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/136 (37%), Positives = 69/136 (51%), Gaps = 19/136 (13%)
Query 2 ATWDDVARIVGGLP----LTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSG- 56
AT DDV RI +P + Q ++VG K + R R ++P +G
Sbjct 6 ATVDDVHRIAASMPHVNRVVGPQGNPIYQVGGKSFVYFRTPRPD--------ATDPDTGE 57
Query 57 ---DIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLV---RLAEIEVRDLEELITE 110
D++ + V E K AL D +FTTPHFDG+ +VLV RLAEI +L ELI +
Sbjct 58 RYTDVIILWVESEADKLALTQDPASPFFTTPHFDGHLSVLVRASRLAEISTTELAELIQD 117
Query 111 AWLMQAPKQLVQAFLA 126
AWL +A K+ +LA
Sbjct 118 AWLSRASKRRAAQWLA 133
>gi|120406550|ref|YP_956379.1| hypothetical protein Mvan_5608 [Mycobacterium vanbaalenii PYR-1]
gi|119959368|gb|ABM16373.1| protein of unknown function DUF661 [Mycobacterium vanbaalenii
PYR-1]
Length=138
Score = 66.2 bits (160), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/136 (39%), Positives = 71/136 (53%), Gaps = 12/136 (8%)
Query 2 ATWDDVARIVGGLP-LTAEQAPHD----WRVGRKLLAWERPLRKSDREALTRAGSEPPSG 56
AT DDV I +P +T + P ++VG K + R R + T G P
Sbjct 6 ATVDDVHEIATAMPHVTRVEGPKAGNPIYQVGGKSFVFFRTPRPDALDPDT--GERYP-- 61
Query 57 DIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLV---RLAEIEVRDLEELITEAWL 113
D++ + V E K AL D +FTT HFDG+P+VLV RL E+ V +L ELI +AWL
Sbjct 62 DVIVIWVESEDDKLALTQDPDSPFFTTDHFDGHPSVLVRASRLGEVGVTELRELIQDAWL 121
Query 114 MQAPKQLVQAFLANSG 129
+A K+ Q +LA G
Sbjct 122 SRASKRRAQQWLAERG 137
>gi|300790989|ref|YP_003771280.1| hypothetical protein AMED_9189 [Amycolatopsis mediterranei U32]
gi|299800503|gb|ADJ50878.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340532685|gb|AEK47890.1| hypothetical protein RAM_47125 [Amycolatopsis mediterranei S699]
Length=108
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/122 (33%), Positives = 61/122 (50%), Gaps = 14/122 (11%)
Query 1 VATWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVG 60
+ TW+DV R+ GLP E W + P K + R +E G +V
Sbjct 1 MTTWEDVVRLASGLP---EVEASTW--------YRTPALKVAGKGFARLRTEAEGGLVV- 48
Query 61 VRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQL 120
+ K AL+ +FTTPH+DGY +++V L ++V L EL+ EAW ++APK+L
Sbjct 49 --LCGHDEKAALLESGDAAFFTTPHYDGYGSIIVDLDRVDVDQLRELLEEAWRLKAPKRL 106
Query 121 VQ 122
+
Sbjct 107 TK 108
>gi|342860020|ref|ZP_08716672.1| hypothetical protein MCOL_14110 [Mycobacterium colombiense CECT
3035]
gi|342132398|gb|EGT85627.1| hypothetical protein MCOL_14110 [Mycobacterium colombiense CECT
3035]
Length=136
Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/132 (36%), Positives = 68/132 (52%), Gaps = 19/132 (14%)
Query 6 DVARIVGGLP----LTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSG----D 57
DV I G+P L + ++VG K + R R ++P +G D
Sbjct 10 DVHEIAAGMPHVQRLEGPKGNAVYQVGGKSFVFFRTPRPD--------ATDPDTGERYAD 61
Query 58 IVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLV---RLAEIEVRDLEELITEAWLM 114
++ + V E K AL++D +FTT HFDG+P+VLV RLAEI +L ELI +AWL
Sbjct 62 VIMIWVESESDKLALVSDPTSPFFTTDHFDGHPSVLVRASRLAEISRTELAELIQDAWLS 121
Query 115 QAPKQLVQAFLA 126
+A K+ +LA
Sbjct 122 RASKKRAATWLA 133
>gi|333920065|ref|YP_004493646.1| hypothetical protein AS9A_2399 [Amycolicicoccus subflavus DQS3-9A1]
gi|333482286|gb|AEF40846.1| hypothetical protein AS9A_2399 [Amycolicicoccus subflavus DQS3-9A1]
Length=141
Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/138 (38%), Positives = 73/138 (53%), Gaps = 18/138 (13%)
Query 2 ATWDDVARIVGGLP-LTAEQAPHD--WRVGRKLLAWERPLRKSDREALTRAGSEPPSG-- 56
AT DV I GG+P + E+A + ++VG K + R R +P +G
Sbjct 10 ATVSDVHDIAGGMPYVKVEKAGTNPVYQVGGKSFVFFRNPRPD--------AFDPDTGER 61
Query 57 --DIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLV---RLAEIEVRDLEELITEA 111
D++ V V+ E K +LI D YFTT HFDG+ +VLV RL EI +L ELI +A
Sbjct 62 YDDVIVVWVASESEKKSLIQDRQRPYFTTAHFDGHLSVLVRASRLHEIPYGELVELIQDA 121
Query 112 WLMQAPKQLVQAFLANSG 129
WL +A ++ A+LA G
Sbjct 122 WLCRASRRRSSAWLAEHG 139
>gi|326332585|ref|ZP_08198853.1| hypothetical protein NBCG_04029 [Nocardioidaceae bacterium Broad-1]
gi|325949586|gb|EGD41658.1| hypothetical protein NBCG_04029 [Nocardioidaceae bacterium Broad-1]
Length=127
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/114 (38%), Positives = 61/114 (54%), Gaps = 7/114 (6%)
Query 1 VATWDDVARIVGGLPLTAEQAPH---DWRVGRKLLAWERPLRKSDREALTRAGSE-PPSG 56
+AT +D+ARI+G LP E H W V K +AW R K+D + R G + PPS
Sbjct 1 MATLEDLARIIGELPEVTEGERHGHPTWSVRGKSIAWLRQFSKAD---IKRFGDQTPPSP 57
Query 57 DIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITE 110
I+ V +D K ++A FT HF+ YPAVL+ L+ I DL +L+ +
Sbjct 58 PILAVNTADLHEKEGVLAAGIDGVFTIEHFNNYPAVLIELSVIASGDLRDLLVD 111
>gi|183985288|ref|YP_001853579.1| hypothetical protein MMAR_5320 [Mycobacterium marinum M]
gi|183178614|gb|ACC43724.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=140
Score = 62.8 bits (151), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/133 (36%), Positives = 69/133 (52%), Gaps = 19/133 (14%)
Query 5 DDVARIVGGLP----LTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSG---- 56
DDV RI +P L + ++VG K + R + ++P SG
Sbjct 9 DDVHRIAASMPHVKRLEGPKGNPIYQVGGKSFVFFRTPQPD--------ATDPDSGERYT 60
Query 57 DIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLV---RLAEIEVRDLEELITEAWL 113
D++ + V E K ALI D +F+T HFDG+ +VLV RLAEI +L ELI +AWL
Sbjct 61 DVIMLWVESESDKLALIQDPASPFFSTAHFDGHLSVLVRASRLAEIGTTELAELIQDAWL 120
Query 114 MQAPKQLVQAFLA 126
+A K+ +++LA
Sbjct 121 SRASKKRAESWLA 133
>gi|324998228|ref|ZP_08119340.1| hypothetical protein PseP1_05646 [Pseudonocardia sp. P1]
Length=118
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/121 (36%), Positives = 58/121 (48%), Gaps = 13/121 (10%)
Query 3 TWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWER-PLRKSDREALTRAGSEPPSGDIVGV 61
WDDV RI GLP E W R P K + R +E + D V
Sbjct 6 NWDDVVRIGSGLPEVEEST------------WYRTPSLKVRGKGFARLRTEDSAPDTGLV 53
Query 62 RVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQLV 121
+ K AL+A +FTTPH+DGY A+LV L ++ + L EL+ EAW +AP +V
Sbjct 54 LMCSLEEKEALLASGDPAFFTTPHYDGYGAILVDLDRVDPQQLAELVEEAWRRKAPATVV 113
Query 122 Q 122
+
Sbjct 114 R 114
>gi|220913708|ref|YP_002489017.1| hypothetical protein Achl_2967 [Arthrobacter chlorophenolicus
A6]
gi|219860586|gb|ACL40928.1| protein of unknown function DUF661 [Arthrobacter chlorophenolicus
A6]
Length=140
Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/74 (46%), Positives = 45/74 (61%), Gaps = 3/74 (4%)
Query 57 DIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLV---RLAEIEVRDLEELITEAWL 113
D++ + V D+ K AL+ D +FT PHFDGY AVLV RL E+ +L E+I EAW
Sbjct 66 DLLVIVVPDDAAKAALVEDPSVPFFTIPHFDGYNAVLVQESRLGEMGRDELAEIIVEAWA 125
Query 114 MQAPKQLVQAFLAN 127
+APK+L F A
Sbjct 126 ARAPKKLAAEFFAG 139
>gi|331694787|ref|YP_004331026.1| hypothetical protein Psed_0921 [Pseudonocardia dioxanivorans
CB1190]
gi|326949476|gb|AEA23173.1| hypothetical protein Psed_0921 [Pseudonocardia dioxanivorans
CB1190]
Length=116
Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 35/93 (38%), Positives = 50/93 (54%), Gaps = 7/93 (7%)
Query 23 HDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVGVRVSDEGVKFALIADEPGVYFT 82
D+RVG K W P R R + R D+ + V DE K AL+ EP ++FT
Sbjct 25 FDFRVGGKGFVWSYPERVPGRRRVIRT-------DVAVLYVGDEAEKQALLLGEPDLFFT 77
Query 83 TPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQ 115
P +DG+P V+VRL ++ L EL+ +AW M+
Sbjct 78 APGYDGFPLVMVRLEALDEARLAELVGDAWAMR 110
>gi|324998072|ref|ZP_08119184.1| hypothetical protein PseP1_04866 [Pseudonocardia sp. P1]
Length=117
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/117 (36%), Positives = 56/117 (48%), Gaps = 9/117 (7%)
Query 1 VATWDDVARIVGGLPLTAEQAP--HDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDI 58
+A DDV RI GL E D+RVG K W P R R + R DI
Sbjct 1 MADSDDVRRIALGLDGAVENPSDGFDFRVGGKGFVWSYPQRVPGRRRVLRT-------DI 53
Query 59 VGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQ 115
+ V DE K AL+ EP ++ P + +P VL+ L ++V L EL+ +AW M+
Sbjct 54 AVLYVGDEAEKQALLLGEPELFSAEPAYRTFPLVLLHLERVDVHRLAELVGDAWRMR 110
>gi|284029487|ref|YP_003379418.1| hypothetical protein Kfla_1521 [Kribbella flavida DSM 17836]
gi|283808780|gb|ADB30619.1| conserved hypothetical protein [Kribbella flavida DSM 17836]
Length=131
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/132 (33%), Positives = 69/132 (53%), Gaps = 19/132 (14%)
Query 6 DVARIVGGLP-LTAEQAPHD---WRVGRKLLAWERPLRKSDREALTRAGSEPPSG----D 57
DV + G+P +T E+ ++VG K + R R +P +G D
Sbjct 3 DVHEVAAGMPHVTVERGGAGNPVYQVGGKSFVFFRTPRPD--------AFDPDTGERYDD 54
Query 58 IVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLV---RLAEIEVRDLEELITEAWLM 114
++ + V E K AL++DE +FTTPHFDG+ +VLV R+ E+ ++L E+I +AWL
Sbjct 55 VIVIWVPSEDDKLALVSDESTPFFTTPHFDGHLSVLVRAGRIGELSHQELTEVIQDAWLS 114
Query 115 QAPKQLVQAFLA 126
+A + +LA
Sbjct 115 RASNRRATTWLA 126
>gi|311744773|ref|ZP_07718569.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
gi|311311890|gb|EFQ81811.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272]
Length=119
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 29/59 (50%), Positives = 41/59 (70%), Gaps = 0/59 (0%)
Query 69 KFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQLVQAFLAN 127
K AL++ +FTTPH+DGY VLV L ++ R+L EL+TEAWL+ AP ++ QA+ A
Sbjct 59 KEALVSGADPAFFTTPHYDGYDYVLVDLDRVDPRELLELVTEAWLLVAPVRVRQAWEAQ 117
>gi|226366356|ref|YP_002784139.1| hypothetical protein ROP_69470 [Rhodococcus opacus B4]
gi|226244846|dbj|BAH55194.1| hypothetical protein [Rhodococcus opacus B4]
Length=112
Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/122 (32%), Positives = 62/122 (51%), Gaps = 15/122 (12%)
Query 3 TWDDVARIVGGLPLTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGDIVGVR 62
TW DV I LP E ++ P K + L+R +E G +V
Sbjct 3 TWTDVVAIGASLPEVQEST-----------SYNTPALKVAGKLLSRLRTESDGGLVVMCG 51
Query 63 VSDEGVKFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQLVQ 122
+ + K AL+A + ++TTPH+DG+ ++LV L ++V L EL+ +AW ++AP +L +
Sbjct 52 LDE---KAALLA-QGAPFYTTPHYDGHGSILVDLENVDVPQLTELLRDAWRIKAPAKLRK 107
Query 123 AF 124
F
Sbjct 108 QF 109
>gi|119716818|ref|YP_923783.1| hypothetical protein Noca_2592 [Nocardioides sp. JS614]
gi|119537479|gb|ABL82096.1| conserved hypothetical protein [Nocardioides sp. JS614]
Length=140
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/60 (47%), Positives = 41/60 (69%), Gaps = 3/60 (5%)
Query 57 DIVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLV---RLAEIEVRDLEELITEAWL 113
D++ V+DEG K AL+ D+ +FTTPHFDG+ +VL+ RL EI +L E++ +AWL
Sbjct 63 DVIVFWVADEGDKLALVQDDSSPFFTTPHFDGHLSVLLRAGRLGEITYDELAEVVQDAWL 122
>gi|336120032|ref|YP_004574810.1| hypothetical protein MLP_43930 [Microlunatus phosphovorus NM-1]
gi|334687822|dbj|BAK37407.1| hypothetical protein MLP_43930 [Microlunatus phosphovorus NM-1]
Length=141
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/133 (31%), Positives = 65/133 (49%), Gaps = 11/133 (8%)
Query 2 ATWDDVARIVGGLP----LTAEQAPHDWRVGRKLLAWERPLRKSDREALTRAGSEPPSGD 57
A DDV + +P + + ++VG K + R R + T E D
Sbjct 6 AVVDDVHDLAAAMPYVRLIHGPKGNPVYQVGGKSFVFFRTPRPDAVDPDTGERYE----D 61
Query 58 IVGVRVSDEGVKFALIADEPGVYFTTPHFDGYPAVLV---RLAEIEVRDLEELITEAWLM 114
++ + V K AL+ D +F+T HFDG+P+VLV RL E+ +L E+I +AWL
Sbjct 62 VIMIWVGSPADKLALVEDPDSPFFSTDHFDGHPSVLVRAARLGEVCYVELTEIIQDAWLA 121
Query 115 QAPKQLVQAFLAN 127
QA + Q +L++
Sbjct 122 QASNRRAQQWLSS 134
>gi|13472806|ref|NP_104373.1| hypothetical protein mlr3218 [Mesorhizobium loti MAFF303099]
gi|14023553|dbj|BAB50159.1| mlr3218 [Mesorhizobium loti MAFF303099]
Length=133
Score = 55.1 bits (131), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/61 (40%), Positives = 37/61 (61%), Gaps = 0/61 (0%)
Query 69 KFALIADEPGVYFTTPHFDGYPAVLVRLAEIEVRDLEELITEAWLMQAPKQLVQAFLANS 128
K L+ P +YF T H+ G+PA+LVRL++I +L + W+ QAPK+L+ A +
Sbjct 58 KEMLLEAAPAIYFETDHYKGWPAILVRLSQIPPEELRHRLERTWIRQAPKKLMDALAGSE 117
Query 129 G 129
G
Sbjct 118 G 118
Lambda K H
0.319 0.137 0.418
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128283502052
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40