BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1762c
Length=262
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608900|ref|NP_216278.1| hypothetical protein Rv1762c [Mycob... 541 5e-152
gi|240171226|ref|ZP_04749885.1| hypothetical protein MkanA1_1807... 530 1e-148
gi|183982645|ref|YP_001850936.1| hypothetical protein MMAR_2635 ... 468 3e-130
gi|296164738|ref|ZP_06847302.1| conserved hypothetical protein [... 462 3e-128
gi|340626772|ref|YP_004745224.1| hypothetical protein MCAN_17781... 454 4e-126
gi|118618487|ref|YP_906819.1| hypothetical protein MUL_3110 [Myc... 394 9e-108
gi|339628142|ref|YP_004719785.1| hypothetical protein TPY_1862 [... 265 4e-69
gi|312196147|ref|YP_004016208.1| hypothetical protein FraEuI1c_2... 263 3e-68
gi|331695283|ref|YP_004331522.1| hypothetical protein Psed_1430 ... 243 3e-62
gi|256371114|ref|YP_003108938.1| hypothetical protein Afer_0296 ... 241 8e-62
gi|300783264|ref|YP_003763555.1| hypothetical protein AMED_1339 ... 232 4e-59
gi|256390485|ref|YP_003112049.1| hypothetical protein Caci_1283 ... 231 9e-59
gi|344199526|ref|YP_004783852.1| hypothetical protein Acife_1375... 229 2e-58
gi|198283791|ref|YP_002220112.1| hypothetical protein Lferr_1683... 228 5e-58
gi|256390616|ref|YP_003112180.1| hypothetical protein Caci_1416 ... 226 2e-57
gi|255021072|ref|ZP_05293125.1| hypothetical protein ACA_2799 [A... 223 2e-56
gi|296135189|ref|YP_003642431.1| hypothetical protein Tint_0704 ... 218 6e-55
gi|256390484|ref|YP_003112048.1| hypothetical protein Caci_1282 ... 100 3e-19
gi|340787431|ref|YP_004752896.1| hypothetical protein CFU_2246 [... 84.7 1e-14
gi|254496490|ref|ZP_05109365.1| hypothetical protein LDG_1040 [L... 81.6 9e-14
gi|270157494|ref|ZP_06186151.1| conserved hypothetical protein [... 76.6 3e-12
gi|300783265|ref|YP_003763556.1| hypothetical protein AMED_1340 ... 72.4 6e-11
gi|331695282|ref|YP_004331521.1| hypothetical protein Psed_1429 ... 61.6 1e-07
gi|344199525|ref|YP_004783851.1| hypothetical protein Acife_1374... 59.3 6e-07
gi|255021071|ref|ZP_05293124.1| hypothetical protein ACA_2798 [A... 58.5 1e-06
gi|198283792|ref|YP_002220113.1| hypothetical protein Lferr_1684... 56.6 4e-06
gi|339628141|ref|YP_004719784.1| hypothetical protein TPY_1861 [... 50.4 3e-04
gi|159906208|ref|YP_001549870.1| hypothetical protein MmarC6_182... 48.9 8e-04
gi|148653977|ref|YP_001281070.1| hypothetical protein PsycPRwf_2... 48.1 0.001
gi|294339243|emb|CAZ87597.1| Hypothetical protein THI_0893 [Thio... 48.1 0.001
gi|296135188|ref|YP_003642430.1| hypothetical protein Tint_0703 ... 47.8 0.002
gi|254171979|ref|ZP_04878655.1| conserved hypothetical protein [... 47.4 0.002
gi|46199883|ref|YP_005550.1| hypothetical protein TTC1581 [Therm... 46.2 0.005
gi|333967570|gb|AEG34335.1| UPF0145 protein ybjQ [Thermus thermo... 46.2 0.005
gi|55981913|ref|YP_145210.1| hypothetical protein TTHA1944 [Ther... 45.8 0.006
gi|169828726|ref|YP_001698884.1| hypothetical protein Bsph_3249 ... 45.1 0.012
gi|126651535|ref|ZP_01723738.1| hypothetical protein BB14905_130... 45.1 0.012
gi|242399522|ref|YP_002994947.1| hypothetical protein TSIB_1547 ... 44.7 0.013
gi|262369361|ref|ZP_06062689.1| conserved hypothetical protein [... 44.7 0.014
gi|206900474|ref|YP_002250230.1| hypothetical protein DICTH_0347... 44.3 0.020
gi|76801942|ref|YP_326950.1| hypothetical protein NP2600A [Natro... 43.9 0.023
gi|255318814|ref|ZP_05360040.1| hypothetical protein ACIRA0001_0... 43.9 0.026
gi|343385828|gb|AEM21318.1| hypothetical protein Bint_0689 [Brac... 43.5 0.036
gi|299534805|ref|ZP_07048134.1| hypothetical protein BFZC1_02197... 43.1 0.038
gi|253682761|ref|ZP_04863556.1| conserved hypothetical protein [... 43.1 0.038
gi|225621364|ref|YP_002722622.1| hypothetical protein BHWA1_0246... 43.1 0.040
gi|57641861|ref|YP_184339.1| hypothetical protein TK1926 [Thermo... 43.1 0.041
gi|326563852|gb|EGE14103.1| hypothetical protein E9M_02908 [Mora... 43.1 0.044
gi|339441073|ref|YP_004707078.1| hypothetical protein CXIVA_0009... 43.1 0.045
gi|331270094|ref|YP_004396586.1| hypothetical protein CbC4_1917 ... 43.1 0.047
>gi|15608900|ref|NP_216278.1| hypothetical protein Rv1762c [Mycobacterium tuberculosis H37Rv]
gi|15841231|ref|NP_336268.1| hypothetical protein MT1811 [Mycobacterium tuberculosis CDC1551]
gi|31792952|ref|NP_855445.1| hypothetical protein Mb1793c [Mycobacterium bovis AF2122/97]
47 more sequence titles
Length=262
Score = 541 bits (1393), Expect = 5e-152, Method: Compositional matrix adjust.
Identities = 262/262 (100%), Positives = 262/262 (100%), Gaps = 0/262 (0%)
Query 1 MQSSSLDPVASERLSHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMR 60
MQSSSLDPVASERLSHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMR
Sbjct 1 MQSSSLDPVASERLSHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMR 60
Query 61 QQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTA 120
QQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTA
Sbjct 61 QQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTA 120
Query 121 VRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQ 180
VRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQ
Sbjct 121 VRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQ 180
Query 181 IGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAG 240
IGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAG
Sbjct 181 IGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAG 240
Query 241 TAVRRTGSGETITPSFVLPMDS 262
TAVRRTGSGETITPSFVLPMDS
Sbjct 241 TAVRRTGSGETITPSFVLPMDS 262
>gi|240171226|ref|ZP_04749885.1| hypothetical protein MkanA1_18076 [Mycobacterium kansasii ATCC
12478]
Length=262
Score = 530 bits (1364), Expect = 1e-148, Method: Compositional matrix adjust.
Identities = 254/262 (97%), Positives = 260/262 (99%), Gaps = 0/262 (0%)
Query 1 MQSSSLDPVASERLSHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMR 60
MQ SLDPVAS+RLSHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMR
Sbjct 1 MQPGSLDPVASDRLSHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMR 60
Query 61 QQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTA 120
QQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTA
Sbjct 61 QQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTA 120
Query 121 VRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQ 180
VRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGF+PVAFVMGNCVFHIAVQGFMQTL+Q
Sbjct 121 VRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFSPVAFVMGNCVFHIAVQGFMQTLKQ 180
Query 181 IGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAG 240
IGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWG+HTVE+YTAG
Sbjct 181 IGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGLHTVEYYTAG 240
Query 241 TAVRRTGSGETITPSFVLPMDS 262
TAVRRTGSGETITPSFVLPMD+
Sbjct 241 TAVRRTGSGETITPSFVLPMDN 262
>gi|183982645|ref|YP_001850936.1| hypothetical protein MMAR_2635 [Mycobacterium marinum M]
gi|183175971|gb|ACC41081.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=261
Score = 468 bits (1205), Expect = 3e-130, Method: Compositional matrix adjust.
Identities = 223/260 (86%), Positives = 244/260 (94%), Gaps = 1/260 (0%)
Query 1 MQSSSLDPVASERLSHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMR 60
MQ + LDPVA ERLSHA K FTSDLSINEFALLHGAGFEPIELVMGVSVYHVG+QF+G+R
Sbjct 1 MQPNQLDPVAKERLSHAAKVFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGYQFTGIR 60
Query 61 QQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTA 120
QQ EL VLT+ATYRARWNAM+RMQAEADAL ADGIVGVRL+WRH GEG +HLEF+AVGTA
Sbjct 61 QQSELPVLTDATYRARWNAMSRMQAEADALGADGIVGVRLDWRHQGEG-QHLEFIAVGTA 119
Query 121 VRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQ 180
VRYTAKPGAFRRPNGQAF+SHLSGQD+ TLLRSGFAPVAFVMGNCVFHIAVQGF++TL Q
Sbjct 120 VRYTAKPGAFRRPNGQAFTSHLSGQDLTTLLRSGFAPVAFVMGNCVFHIAVQGFLKTLSQ 179
Query 181 IGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAG 240
+GRN+EMPQWTQG+YQARELAMSRMQ+EAERDGA+GVVGVHFAISNYAWG HTVEFY AG
Sbjct 180 VGRNVEMPQWTQGSYQARELAMSRMQAEAERDGASGVVGVHFAISNYAWGHHTVEFYVAG 239
Query 241 TAVRRTGSGETITPSFVLPM 260
TAVRR+G ++ITPSFVLPM
Sbjct 240 TAVRRSGEPQSITPSFVLPM 259
>gi|296164738|ref|ZP_06847302.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295899909|gb|EFG79351.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=261
Score = 462 bits (1188), Expect = 3e-128, Method: Compositional matrix adjust.
Identities = 221/262 (85%), Positives = 239/262 (92%), Gaps = 1/262 (0%)
Query 1 MQSSSLDPVASERLSHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMR 60
MQ + LDPVASERLSHA K FTSDLSINEFALLHGAGFEPIELVMGVSVYHVG+QF+G+R
Sbjct 1 MQPNPLDPVASERLSHANKVFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGYQFTGIR 60
Query 61 QQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTA 120
QQ EL VLT+ATYRARWNAM+RMQAEAD+L ADG+VGVRL WRH GEG EHLEF+AVGTA
Sbjct 61 QQSELPVLTDATYRARWNAMSRMQAEADSLGADGVVGVRLEWRHQGEG-EHLEFIAVGTA 119
Query 121 VRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQ 180
VRYT KPGA+RR NGQAF+SHLSGQD+ TLLRSGFAPVAFVMGNCVFHIAVQGF++TL Q
Sbjct 120 VRYTQKPGAYRRANGQAFTSHLSGQDLTTLLRSGFAPVAFVMGNCVFHIAVQGFLRTLSQ 179
Query 181 IGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAG 240
+GRN EMPQWTQG+YQARELAMSRMQSEAERDG TGVVGVHFAISNYAWG HTVEFY AG
Sbjct 180 VGRNAEMPQWTQGSYQARELAMSRMQSEAERDGGTGVVGVHFAISNYAWGHHTVEFYVAG 239
Query 241 TAVRRTGSGETITPSFVLPMDS 262
TAVRR G +T+TPSFVLPM
Sbjct 240 TAVRRGGEAQTLTPSFVLPMSD 261
>gi|340626772|ref|YP_004745224.1| hypothetical protein MCAN_17781 [Mycobacterium canettii CIPT
140010059]
gi|340004962|emb|CCC44110.1| hypothetical protein MCAN_17781 [Mycobacterium canettii CIPT
140010059]
Length=220
Score = 454 bits (1169), Expect = 4e-126, Method: Compositional matrix adjust.
Identities = 219/220 (99%), Positives = 220/220 (100%), Gaps = 0/220 (0%)
Query 43 LVMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNW 102
+VMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNW
Sbjct 1 MVMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNW 60
Query 103 RHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVM 162
RHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVM
Sbjct 61 RHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVM 120
Query 163 GNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHF 222
GNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHF
Sbjct 121 GNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHF 180
Query 223 AISNYAWGVHTVEFYTAGTAVRRTGSGETITPSFVLPMDS 262
AISNYAWGVHTVEFYTAGTAVRRTGSGETITPSFVLPMDS
Sbjct 181 AISNYAWGVHTVEFYTAGTAVRRTGSGETITPSFVLPMDS 220
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 42/111 (38%), Positives = 61/111 (55%), Gaps = 5/111 (4%)
Query 19 KSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQ--FSGMRQ---QQELGVLTEATY 73
++F+S LS + L +GF P+ VMG V+H+ Q +RQ E+ T+ Y
Sbjct 94 QAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNY 153
Query 74 RARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYT 124
+AR AM+RMQ+EA+ A G+VGV ++ G +EF GTAVR T
Sbjct 154 QARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVRRT 204
>gi|118618487|ref|YP_906819.1| hypothetical protein MUL_3110 [Mycobacterium ulcerans Agy99]
gi|118570597|gb|ABL05348.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=218
Score = 394 bits (1011), Expect = 9e-108, Method: Compositional matrix adjust.
Identities = 186/218 (86%), Positives = 207/218 (95%), Gaps = 1/218 (0%)
Query 44 VMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWR 103
+MGVSVYHVG+QF+G+RQQ EL VLT+ATYRARWNAM+RMQAEADAL ADGIVGVRL+WR
Sbjct 1 MMGVSVYHVGYQFTGIRQQSELPVLTDATYRARWNAMSRMQAEADALGADGIVGVRLDWR 60
Query 104 HHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMG 163
H GEG +HLEF+AVGTAVRYTAKPGAFRRPNGQAF+SHLSGQD+ TLLRSGFAPVAFVMG
Sbjct 61 HQGEG-QHLEFIAVGTAVRYTAKPGAFRRPNGQAFTSHLSGQDLTTLLRSGFAPVAFVMG 119
Query 164 NCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFA 223
NCVFHIAVQGF++TLRQ+GRN+EMPQWTQG+YQARELAMSRMQ+EAERDGA+GVVGVHFA
Sbjct 120 NCVFHIAVQGFLKTLRQVGRNVEMPQWTQGSYQARELAMSRMQAEAERDGASGVVGVHFA 179
Query 224 ISNYAWGVHTVEFYTAGTAVRRTGSGETITPSFVLPMD 261
ISNYAWG HTVEFY AGTAVRR+G ++ITPSFVLPM
Sbjct 180 ISNYAWGHHTVEFYVAGTAVRRSGEPQSITPSFVLPMS 217
Score = 72.0 bits (175), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 44/114 (39%), Positives = 64/114 (57%), Gaps = 5/114 (4%)
Query 19 KSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQ--FSGMRQ---QQELGVLTEATY 73
++FTS LS + L +GF P+ VMG V+H+ Q +RQ E+ T+ +Y
Sbjct 92 QAFTSHLSGQDLTTLLRSGFAPVAFVMGNCVFHIAVQGFLKTLRQVGRNVEMPQWTQGSY 151
Query 74 RARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKP 127
+AR AM+RMQAEA+ A G+VGV ++ G +EF GTAVR + +P
Sbjct 152 QARELAMSRMQAEAERDGASGVVGVHFAISNYAWGHHTVEFYVAGTAVRRSGEP 205
>gi|339628142|ref|YP_004719785.1| hypothetical protein TPY_1862 [Sulfobacillus acidophilus TPY]
gi|339285931|gb|AEJ40042.1| hypothetical protein TPY_1862 [Sulfobacillus acidophilus TPY]
Length=273
Score = 265 bits (677), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 135/263 (52%), Positives = 184/263 (70%), Gaps = 7/263 (2%)
Query 4 SSLDPVASERLSHAEKS--FTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQ 61
+ L P A ERL+ FTSDLS+NEF L+ AGFEP+ LVMG S+YH+G+Q + +
Sbjct 9 ADLPPDALERLNRESPRYVFTSDLSVNEFVLVDEAGFEPLGLVMGSSIYHIGYQQAAWSK 68
Query 62 QQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHL-EFMAVGTA 120
QE+ VL++A Y+AR AM+RM+ EA+ LKADG++GVRL + E G+HL EF+A+GTA
Sbjct 69 NQEMTVLSQAMYQARELAMSRMEEEAEELKADGVIGVRLEVKL-AEWGQHLAEFIAIGTA 127
Query 121 VRYTAKPG-AFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLR 179
+R AK G ++R P G+ F+S LSGQD TL+++GF P++ VMG CV+HIA QG +Q+LR
Sbjct 128 IR--AKDGTSYRTPAGKPFTSDLSGQDFWTLMKTGFRPISLVMGTCVYHIAHQGVLQSLR 185
Query 180 QIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTA 239
Q+G+N E+ +TQG Y ARELAM RMQ+EA G G+VGV +++ WG H +EF+
Sbjct 186 QVGQNQELTIYTQGLYDARELAMERMQTEAADLGGEGIVGVQLHENHHEWGAHVIEFFAV 245
Query 240 GTAVRRTGSGETITPSFVLPMDS 262
GTAV+R E P VLPM+
Sbjct 246 GTAVKRAKPSEMPAPQLVLPMND 268
>gi|312196147|ref|YP_004016208.1| hypothetical protein FraEuI1c_2299 [Frankia sp. EuI1c]
gi|311227483|gb|ADP80338.1| protein of unknown function DUF74 [Frankia sp. EuI1c]
Length=285
Score = 263 bits (671), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 131/247 (54%), Positives = 170/247 (69%), Gaps = 5/247 (2%)
Query 21 FTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAM 80
FTSDLS++E+ LL+ AGFEP+ V+G S+YH+G Q +G + QELGVL++A Y AR AM
Sbjct 39 FTSDLSVSEYVLLNEAGFEPLGFVVGSSIYHIGLQVAGWKTNQELGVLSQAMYSARGLAM 98
Query 81 ARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSS 140
RMQ+EA+ L ADGIVGV+L +++ G LEF+A GTAVR PGA R PNG F+S
Sbjct 99 GRMQSEAERLGADGIVGVKLRVQYYAWGQNVLEFIAAGTAVRAIRSPGAHRAPNGHPFTS 158
Query 141 HLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQAREL 200
LS QD LL +G APV+FV+G CV+HIA QG Q+LRQ+G N EM +TQG YQAREL
Sbjct 159 DLSAQDFYRLLAAGSAPVSFVLGTCVYHIAHQGVFQSLRQVGVNQEMTTFTQGVYQAREL 218
Query 201 AMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVRRTGSGETIT-----PS 255
A++RMQ+EA + GA G+VGV +SN+ WG H EF+ GTA+R + P
Sbjct 219 ALTRMQAEAAQAGANGIVGVDLVVSNHVWGEHATEFFATGTAIRTLDPAHRLADTAPKPQ 278
Query 256 FVLPMDS 262
F L +D+
Sbjct 279 FTLGLDT 285
>gi|331695283|ref|YP_004331522.1| hypothetical protein Psed_1430 [Pseudonocardia dioxanivorans
CB1190]
gi|326949972|gb|AEA23669.1| protein of unknown function DUF74 [Pseudonocardia dioxanivorans
CB1190]
Length=277
Score = 243 bits (619), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 133/257 (52%), Positives = 170/257 (67%), Gaps = 5/257 (1%)
Query 9 VASERLSHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQQQELGVL 68
+A R A FTSDLS+NEF L+ AGF P+ LV+G S+YHVGFQ + QEL VL
Sbjct 18 LAEMRPGQATSLFTSDLSVNEFLLVREAGFRPLGLVLGSSIYHVGFQMGRWNRNQELTVL 77
Query 69 TEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAK-- 126
++A Y AR AM RM+AEAD L ADGIVGVRL+ G + EF+AVGTAV+ A
Sbjct 78 SQAMYHARELAMTRMEAEADQLGADGIVGVRLDIEFKEFGSDLAEFIAVGTAVKADAPTT 137
Query 127 PGAF--RRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGRN 184
P + R Q F+S LSGQD TL+++G+AP+ VMG+CV+H+A Q F Q + IG+N
Sbjct 138 PDGYGWRNNRNQPFTSDLSGQDFWTLVQAGYAPLGMVMGSCVYHVAHQRFWQAMGNIGQN 197
Query 185 MEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVR 244
+E+PQ+T+ Y ARELAMSRMQ+EAE+ A G+VGV + WG HT EF+ GTAVR
Sbjct 198 VELPQFTEALYDARELAMSRMQAEAEQLHAEGIVGVQLRSHGHRWGGHTTEFFAIGTAVR 257
Query 245 RTGSGETI-TPSFVLPM 260
+ TI TP VLP+
Sbjct 258 PLRADHTIATPQLVLPL 274
>gi|256371114|ref|YP_003108938.1| hypothetical protein Afer_0296 [Acidimicrobium ferrooxidans DSM
10331]
gi|256007698|gb|ACU53265.1| protein of unknown function DUF74 [Acidimicrobium ferrooxidans
DSM 10331]
Length=290
Score = 241 bits (615), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 134/270 (50%), Positives = 170/270 (63%), Gaps = 11/270 (4%)
Query 3 SSSLDP-VASERLSHA-----EKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQF 56
S + +P A ERL+H FTSDLS++E+ LL AGFEP+ V+G S+YHVG Q
Sbjct 20 SGAYEPRAAEERLAHTLGRDPHGVFTSDLSVSEYVLLEEAGFEPLGFVVGSSIYHVGLQV 79
Query 57 SGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMA 116
+ Q QEL VLT+A Y AR AM RM+ EAD L ADGIVGV L + + G + LEF+A
Sbjct 80 ARWGQSQELQVLTQAMYEARELAMTRMRTEADQLGADGIVGVNLTMQMYVWGQDVLEFIA 139
Query 117 VGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQ 176
GTAVR + GA R P+G F+S LS QD LL +G PVAFV+G CV+H+A Q
Sbjct 140 TGTAVRSLSGAGAHRAPDGGPFTSDLSAQDFYRLLATGIVPVAFVLGTCVYHVAHQSVFG 199
Query 177 TLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEF 236
+LRQ N EM +T+ YQARELA++RMQ+EA R GA G+VGV ++N+ WG H EF
Sbjct 200 SLRQSMWNQEMVTFTEAIYQARELALTRMQAEATRAGANGIVGVSVEVANHVWGEHATEF 259
Query 237 YTAGTAVRRTGSGETI-----TPSFVLPMD 261
GTAVRR + P+FVL +D
Sbjct 260 LAVGTAVRRLADEHRLPDTSPKPTFVLGLD 289
>gi|300783264|ref|YP_003763555.1| hypothetical protein AMED_1339 [Amycolatopsis mediterranei U32]
gi|299792778|gb|ADJ43153.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340524645|gb|AEK39850.1| hypothetical protein RAM_06790 [Amycolatopsis mediterranei S699]
Length=257
Score = 232 bits (591), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 123/241 (52%), Positives = 161/241 (67%), Gaps = 2/241 (0%)
Query 21 FTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAM 80
FTSDLS+NEF L+ AGF P+ LV+G S+YHVGFQ+ + QEL L++A Y AR AM
Sbjct 16 FTSDLSVNEFLLVREAGFRPLGLVLGSSIYHVGFQWRQWTKNQELDRLSQAMYHARELAM 75
Query 81 ARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSS 140
+RM+ EAD L ADGIV VRL G + EF+AVGTAV+ +PG +R G+ F+S
Sbjct 76 SRMETEADVLGADGIVAVRLEIEFKEFGTDLAEFIAVGTAVK-AEEPGEWRTNAGKPFTS 134
Query 141 HLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQAREL 200
LSGQD TL+++G+AP+ VMG CV+HIA Q F Q + G+N+E+PQ+T+ Y AREL
Sbjct 135 DLSGQDFWTLVQAGYAPLGMVMGTCVYHIAHQRFRQVVGNFGKNVEIPQYTEALYDAREL 194
Query 201 AMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVRRTGSGETI-TPSFVLP 259
AMSRMQ+EAE+ A G+VGV + WG T EF+ GTAV+ + I P VLP
Sbjct 195 AMSRMQAEAEQLEAEGIVGVQLLSLPHQWGGQTTEFFAIGTAVKPLRADHHIAKPQLVLP 254
Query 260 M 260
+
Sbjct 255 L 255
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 56/124 (46%), Positives = 74/124 (60%), Gaps = 11/124 (8%)
Query 15 SHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQ-----FSGMRQQQELGVLT 69
++A K FTSDLS +F L AG+ P+ +VMG VYH+ Q + E+ T
Sbjct 126 TNAGKPFTSDLSGQDFWTLVQAGYAPLGMVMGTCVYHIAHQRFRQVVGNFGKNVEIPQYT 185
Query 70 EATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVR------Y 123
EA Y AR AM+RMQAEA+ L+A+GIVGV+L H GG+ EF A+GTAV+ +
Sbjct 186 EALYDARELAMSRMQAEAEQLEAEGIVGVQLLSLPHQWGGQTTEFFAIGTAVKPLRADHH 245
Query 124 TAKP 127
AKP
Sbjct 246 IAKP 249
>gi|256390485|ref|YP_003112049.1| hypothetical protein Caci_1283 [Catenulispora acidiphila DSM
44928]
gi|256356711|gb|ACU70208.1| protein of unknown function DUF74 [Catenulispora acidiphila DSM
44928]
Length=277
Score = 231 bits (589), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 126/246 (52%), Positives = 161/246 (66%), Gaps = 5/246 (2%)
Query 21 FTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAM 80
FTSDLS+NEF L+ AGF P+ LV+G S+YHVG Q + + QEL VL++A Y AR AM
Sbjct 32 FTSDLSVNEFLLVKEAGFRPLGLVLGSSIYHVGLQVARWGKSQELDVLSQAMYHARELAM 91
Query 81 ARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAK----PGAFRRPNGQ 136
+RM+AEADAL ADGIVGVRL G + EF+A+GTAV+ +R Q
Sbjct 92 SRMEAEADALGADGIVGVRLEVEFKEFGADVAEFIAIGTAVKDDGSGQHGEVTWRNNKNQ 151
Query 137 AFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQ 196
F+S LSGQD TL+R+G+AP+ VMG+CV+HIA Q F L IG+N+E+ Q+TQ Y
Sbjct 152 PFTSDLSGQDFWTLIRAGYAPLGMVMGSCVYHIAHQRFGAALGNIGKNVELEQFTQALYD 211
Query 197 ARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVRRTGSGETIT-PS 255
ARELAM+RMQ+EAE A G+VGV ++ WG HT EF+ GTAVR I P+
Sbjct 212 ARELAMARMQAEAEALHAEGIVGVQLRNHSHTWGSHTTEFFALGTAVRPLRPDHQIQRPT 271
Query 256 FVLPMD 261
VL +D
Sbjct 272 MVLSLD 277
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 55/125 (44%), Positives = 71/125 (57%), Gaps = 5/125 (4%)
Query 15 SHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQ-----FSGMRQQQELGVLT 69
++ + FTSDLS +F L AG+ P+ +VMG VYH+ Q + + EL T
Sbjct 147 NNKNQPFTSDLSGQDFWTLIRAGYAPLGMVMGSCVYHIAHQRFGAALGNIGKNVELEQFT 206
Query 70 EATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGA 129
+A Y AR AMARMQAEA+AL A+GIVGV+L H G EF A+GTAVR
Sbjct 207 QALYDARELAMARMQAEAEALHAEGIVGVQLRNHSHTWGSHTTEFFALGTAVRPLRPDHQ 266
Query 130 FRRPN 134
+RP
Sbjct 267 IQRPT 271
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/125 (37%), Positives = 73/125 (59%), Gaps = 9/125 (7%)
Query 126 KPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNM 185
KPG+ G F+S LS + + + +GF P+ V+G+ ++H+ +Q + + G++
Sbjct 24 KPGS----PGSIFTSDLSVNEFLLVKEAGFRPLGLVLGSSIYHVGLQ-----VARWGKSQ 74
Query 186 EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVRR 245
E+ +Q Y ARELAMSRM++EA+ GA G+VGV + +G EF GTAV+
Sbjct 75 ELDVLSQAMYHARELAMSRMEAEADALGADGIVGVRLEVEFKEFGADVAEFIAIGTAVKD 134
Query 246 TGSGE 250
GSG+
Sbjct 135 DGSGQ 139
>gi|344199526|ref|YP_004783852.1| hypothetical protein Acife_1375 [Acidithiobacillus ferrivorans
SS3]
gi|343774970|gb|AEM47526.1| protein of unknown function DUF74 [Acidithiobacillus ferrivorans
SS3]
Length=274
Score = 229 bits (585), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 120/271 (45%), Positives = 170/271 (63%), Gaps = 12/271 (4%)
Query 3 SSSLDPVASERL-------SHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQ 55
+ L P A ERL SH E FTSDLS+NEF ++ A FEP+ LV+G +YH+G Q
Sbjct 5 TEQLAPHAVERLQGLRSQGSH-EGIFTSDLSVNEFVMVRKANFEPLGLVVGSCIYHMGIQ 63
Query 56 FSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFM 115
+ Q E+ VL++A Y+AR AM+RM+ EA L+ADGIVGVRL + + LEFM
Sbjct 64 YGNWNQNMEMDVLSQAMYQARELAMSRMEQEATLLQADGIVGVRLEVKRMEWDQDILEFM 123
Query 116 AVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFM 175
A+GTA+ + + F+ G+ F+S LSGQD LL++G+ P+ VMG+CV+H+A QG +
Sbjct 124 AIGTAIAHRSGASGFKGVGGKPFTSDLSGQDFWMLLQAGYRPMEMVMGSCVYHVAHQGLL 183
Query 176 QTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVE 235
Q++ IGRN EM Q+TQ Y ARELAM RMQ EA+ A G+ GV ++ W H +E
Sbjct 184 QSMGNIGRNTEMEQFTQAMYDARELAMERMQHEAQEAQADGITGVQLHEGSHNWKPHIIE 243
Query 236 FYTAGTAVRRTGSG----ETITPSFVLPMDS 262
F+ GTAV+ + + ++P V+P++
Sbjct 244 FFAIGTAVKAMDNADALVDALSPQLVIPVND 274
>gi|198283791|ref|YP_002220112.1| hypothetical protein Lferr_1683 [Acidithiobacillus ferrooxidans
ATCC 53993]
gi|218665786|ref|YP_002426422.1| hypothetical protein AFE_2018 [Acidithiobacillus ferrooxidans
ATCC 23270]
gi|198248312|gb|ACH83905.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans
ATCC 53993]
gi|218517999|gb|ACK78585.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans
ATCC 23270]
gi|339834456|gb|EGQ62219.1| hypothetical protein GGI1_11603 [Acidithiobacillus sp. GGI-221]
Length=274
Score = 228 bits (582), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 120/271 (45%), Positives = 170/271 (63%), Gaps = 12/271 (4%)
Query 3 SSSLDPVASERL-------SHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQ 55
+ L P A ERL SH E FTSDLS+NEF ++ A FEP+ LV+G +YH+G Q
Sbjct 5 AEQLAPHAVERLKGLRSQGSH-EGIFTSDLSVNEFVMVRKANFEPLGLVVGSCIYHMGIQ 63
Query 56 FSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFM 115
+ Q E+ VL++A Y+AR AM+RM+ EA L+ADGIVGVRL + E LEFM
Sbjct 64 YGNWNQNMEMDVLSQAMYQARELAMSRMEQEAILLQADGIVGVRLEVKRMEWDQEILEFM 123
Query 116 AVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFM 175
A+GTA+ + + F+ G+ F+S LSGQD LL++G+ P+ VMG+CV+H+A QG +
Sbjct 124 AIGTAIAHRSGASGFKGVGGKPFTSDLSGQDFWMLLQAGYRPMEMVMGSCVYHVAHQGML 183
Query 176 QTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVE 235
++L +GRN EM Q+TQ Y ARELAM RMQ EA+ A G+ GV ++ W H +E
Sbjct 184 KSLGNVGRNTEMEQFTQAMYDARELAMERMQHEAQEAQADGITGVQLHEGSHNWKPHIIE 243
Query 236 FYTAGTAVRRTGSG----ETITPSFVLPMDS 262
F+ GTAV+ + + ++P V+P++
Sbjct 244 FFAVGTAVKAMENADALVDALSPQLVIPVND 274
>gi|256390616|ref|YP_003112180.1| hypothetical protein Caci_1416 [Catenulispora acidiphila DSM
44928]
gi|256356842|gb|ACU70339.1| conserved hypothetical protein [Catenulispora acidiphila DSM
44928]
Length=275
Score = 226 bits (576), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 121/263 (47%), Positives = 162/263 (62%), Gaps = 4/263 (1%)
Query 2 QSSSLDPVASERLSHAEK---SFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSG 58
Q L A RL +++ FTSDLS+NEF L AGF P+ LVMG S+YH+G Q +
Sbjct 11 QQQPLPESARGRLVQSQQGRGCFTSDLSVNEFVLAAQAGFVPVGLVMGTSIYHIGIQPAR 70
Query 59 MRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVG 118
QEL VLT+A Y AR AMARM+AEAD L ADGI+GV + R + E +EF+A+G
Sbjct 71 WNASQELTVLTQAMYTARELAMARMEAEADVLGADGIIGVEVRARRYAFSAEIMEFVAIG 130
Query 119 TAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTL 178
TAV+ A R P G+ F+SHLSGQD TL + G+ P A V+G CV+H+A F Q L
Sbjct 131 TAVKAEGGTMALRTPAGRPFTSHLSGQDFWTLWQHGWVPRALVLGTCVYHVAHLTFRQAL 190
Query 179 RQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYT 238
+ G+N E+ +TQ Y ARE+AM RMQ E + GA G+VGV +++ WG H +EF+
Sbjct 191 QASGQNTELGTYTQAVYDAREIAMGRMQYEGNQVGADGIVGVTVHENDWVWGEHAIEFFA 250
Query 239 AGTAVRRT-GSGETITPSFVLPM 260
GTAV + ++P +P+
Sbjct 251 MGTAVSKLHQDASPVSPQMTMPL 273
>gi|255021072|ref|ZP_05293125.1| hypothetical protein ACA_2799 [Acidithiobacillus caldus ATCC
51756]
gi|340781701|ref|YP_004748308.1| hypothetical protein Atc_0959 [Acidithiobacillus caldus SM-1]
gi|254969486|gb|EET26995.1| hypothetical protein ACA_2799 [Acidithiobacillus caldus ATCC
51756]
gi|340555854|gb|AEK57608.1| conserved hypothetical protein [Acidithiobacillus caldus SM-1]
Length=272
Score = 223 bits (568), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/273 (44%), Positives = 169/273 (62%), Gaps = 12/273 (4%)
Query 1 MQSSSLDPVASERL-------SHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVG 53
M +L P A ERL SH E FTSDLS+NEF ++ A FEP+ LV+G +YH+G
Sbjct 1 MSEDTLAPHAVERLKGLRSQGSH-EGIFTSDLSVNEFVMVRKAEFEPLGLVVGSCIYHMG 59
Query 54 FQFSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLE 113
Q+ Q E+ VL++A Y+AR A++RM+ EA L+ADGIVGVRL + + LE
Sbjct 60 IQYGNWNQNMEMDVLSQAMYQARELAISRMEQEAILLQADGIVGVRLEVKRMEWDPDILE 119
Query 114 FMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQG 173
FMA+GTA+ + + F+ G+ F+S LSGQD LL++G+ P+ VMG+CV+H+A QG
Sbjct 120 FMAIGTAIAHRSGAPGFKGVGGKPFTSDLSGQDFWMLLQAGYRPMEMVMGSCVYHVAHQG 179
Query 174 FMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHT 233
++++ IGRN E+ Q+TQ Y ARELAM RMQ EA+ A G+ GV ++ W H
Sbjct 180 VLRSMGNIGRNTELEQFTQAMYDARELAMERMQHEAQEAEADGITGVQLHEGSHNWKPHI 239
Query 234 VEFYTAGTAVRRTGSGET----ITPSFVLPMDS 262
+EF+ GTAV+ + + P VLP++
Sbjct 240 IEFFAVGTAVKAMDGADAKVDELRPQLVLPVND 272
>gi|296135189|ref|YP_003642431.1| hypothetical protein Tint_0704 [Thiomonas intermedia K12]
gi|294339244|emb|CAZ87598.1| Conserved hypothetical protein [Thiomonas sp. 3As]
gi|295795311|gb|ADG30101.1| conserved hypothetical protein [Thiomonas intermedia K12]
Length=287
Score = 218 bits (556), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 120/272 (45%), Positives = 169/272 (63%), Gaps = 14/272 (5%)
Query 3 SSSLDPVASERL-------SHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQ 55
+ L P A ERL SH E FTSDLS+NEF ++ AGFEP+ LV+G VYH+G Q
Sbjct 18 AEQLAPHAVERLKGLRAQGSH-EGIFTSDLSVNEFVMVRQAGFEPLGLVVGSCVYHLGIQ 76
Query 56 FSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFM 115
+ E+ VL++A Y+AR AM+RM+ EA L+ADG+VG+RL + + LEFM
Sbjct 77 YGQWSTSMEMDVLSQAMYQARELAMSRMEQEAILLRADGVVGLRLEVKRMEWDKDILEFM 136
Query 116 AVGTAVRY-TAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGF 174
A+GTA+ + T PG F+ G+ F+S LSGQD LL+SG+ P+ VMG+CV+H+A QG
Sbjct 137 AIGTAIAHRTGDPG-FKGVGGKPFTSDLSGQDFWMLLKSGYRPMEMVMGSCVYHVAHQGA 195
Query 175 MQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTV 234
+++L +G N+E+ Q+TQ Y ARELAM RMQ EA A G+ GV ++ W H +
Sbjct 196 LRSLGNLGNNVELAQFTQAMYDARELAMERMQHEAAAAQADGITGVQLHEGSHNWQPHVI 255
Query 235 EFYTAGTAVRRTGSG----ETITPSFVLPMDS 262
EF+ GTAV+ + + P V+P++
Sbjct 256 EFFAVGTAVKAMEQADALVDALNPQLVIPVND 287
>gi|256390484|ref|YP_003112048.1| hypothetical protein Caci_1282 [Catenulispora acidiphila DSM
44928]
gi|256356710|gb|ACU70207.1| conserved hypothetical protein [Catenulispora acidiphila DSM
44928]
Length=299
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/176 (37%), Positives = 90/176 (52%), Gaps = 24/176 (13%)
Query 6 LDPVASERLSHAEKSFT--SDLSINEFALLHGAGFEPIELVMGVSVYHV----------- 52
L A +R+ S T S L+ NEFA + AGFEP+ V+G VYH+
Sbjct 4 LPQAARDRMDEIRNSGTWGSALTTNEFAAVRAAGFEPVGQVLGSCVYHLGYTGAYNCPGA 63
Query 53 -----GFQFSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGE 107
G+ + + G LT++ Y AR A+ RM E L DG+VGVRL R
Sbjct 64 WMGGAGYTSTSTGRYSAYGPLTQSLYEARRLAVDRMSTECTMLGGDGVVGVRLAIRRFPA 123
Query 108 GGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMG 163
GG LEF A+GTAVR R+P F+SH++GQ+ L+ +G+ PV+ V+G
Sbjct 124 GG--LEFTAIGTAVRAANSAVKLRQP----FTSHVTGQEFAKLMMNGWVPVSLVLG 173
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 44/135 (33%), Positives = 59/135 (44%), Gaps = 13/135 (9%)
Query 134 NGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQ-----------TLRQIG 182
N + S L+ + + +GF PV V+G+CV+H+ G T G
Sbjct 17 NSGTWGSALTTNEFAAVRAAGFEPVGQVLGSCVYHLGYTGAYNCPGAWMGGAGYTSTSTG 76
Query 183 RNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTA 242
R TQ Y+AR LA+ RM +E G GVVGV AI + G +EF GTA
Sbjct 77 RYSAYGPLTQSLYEARRLAVDRMSTECTMLGGDGVVGVRLAIRRFPAG--GLEFTAIGTA 134
Query 243 VRRTGSGETITPSFV 257
VR S + F
Sbjct 135 VRAANSAVKLRQPFT 149
>gi|340787431|ref|YP_004752896.1| hypothetical protein CFU_2246 [Collimonas fungivorans Ter331]
gi|340552698|gb|AEK62073.1| hypothetical protein CFU_2246 [Collimonas fungivorans Ter331]
Length=374
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/234 (31%), Positives = 115/234 (50%), Gaps = 21/234 (8%)
Query 21 FTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQQQEL--GVLTEATY---RA 75
FT+ E AG+EPI+ G Y +G M + L G ++E +
Sbjct 115 FTTAGDAQELYCHMDAGYEPIQHAFGNIAYSMGVGGGIMGSLKTLVRGEISEYSNIFNTT 174
Query 76 RWNAMARMQAEADALKADGIVGVR---LNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRR 132
R A+ R+ ++A A A+ +VG+R L W G H E + GTA R+ A P +
Sbjct 175 RHKALERLVSQAKADGANAVVGIRTTILPWM-----GTH-EMLMAGTASRHGALPAS--- 225
Query 133 PNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAV-QGFMQTLRQIGRNMEMPQWT 191
+ +S L+G+++ + G+APV +M ++ + V GFM + + E+ T
Sbjct 226 ADSNPVTSDLTGEELWAMTSLGYAPVKLLMSTSIYSLGVVGGFMAAFKSFTKG-EISDLT 284
Query 192 QGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVRR 245
+ ARE+A++R++ EA+ GA VVGV I+ G+ VEF GTAV++
Sbjct 285 TLIHDAREIAIARLKDEADELGAEEVVGVKTYIAEIGSGL--VEFMAIGTAVKK 336
Score = 39.7 bits (91), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 31/105 (30%), Positives = 52/105 (50%), Gaps = 9/105 (8%)
Query 142 LSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNM---EMPQWTQGNYQAR 198
LSG ++ L ++ V+GN V + GF+ + +NM E+ Q T ++ R
Sbjct 7 LSGNEIFCLALKNYSAGEIVVGNSVNSM---GFLGGIGAGLKNMLGGEITQVTAAIHEGR 63
Query 199 ELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAV 243
A RM+ EA + GA+GV GV + +++ + EF G+ V
Sbjct 64 ANAFERMRKEAAQHGASGVAGVSSELRSFSG---STEFLFVGSCV 105
>gi|254496490|ref|ZP_05109365.1| hypothetical protein LDG_1040 [Legionella drancourtii LLAP12]
gi|254354288|gb|EET12948.1| hypothetical protein LDG_1040 [Legionella drancourtii LLAP12]
Length=404
Score = 81.6 bits (200), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 60/211 (29%), Positives = 103/211 (49%), Gaps = 16/211 (7%)
Query 20 SFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGF---QFSGMRQQQ--ELGVLTEATYR 74
+ T+ LS NE L +EP +V+G SV+ +G SG++ EL +T
Sbjct 2 AVTTGLSGNEIYCLQLKNYEPGNIVVGNSVHSLGIIGSVGSGLKAMLGGELQQVTSLIEE 61
Query 75 ARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPN 134
R A RM EA A GI GV H G ++EF+++G+ V A +
Sbjct 62 GRETAYKRMLQEAQTSNATGITGVTSQLIFH---GSNVEFLSIGSMVH------AINDAD 112
Query 135 GQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAV-QGFMQTLRQIGRNMEMPQWTQG 193
FS+ GQ++ L +G+ P+ F GN + + V +G + ++ +GR E+ +++
Sbjct 113 KSKFSTSADGQELYAQLDAGYKPICFAFGNVAYSMGVGRGLLGKVKTLGRG-EIKEYSDI 171
Query 194 NYQARELAMSRMQSEAERDGATGVVGVHFAI 224
+ R LA++R+ + AE+ A V+G+ +
Sbjct 172 FNKTRHLALNRIIAHAEQYKANAVLGIRTTV 202
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 69/240 (29%), Positives = 117/240 (49%), Gaps = 16/240 (6%)
Query 14 LSHAEKS-FTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEAT 72
++ A+KS F++ E AG++PI G Y +G + + + LG
Sbjct 108 INDADKSKFSTSADGQELYAQLDAGYKPICFAFGNVAYSMGVGRGLLGKVKTLGRGEIKE 167
Query 73 YRARWN-----AMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKP 127
Y +N A+ R+ A A+ KA+ ++G+R G G + E + +GTA Y A+
Sbjct 168 YSDIFNKTRHLALNRIIAHAEQYKANAVLGIRTTVLPFG--GVN-EMLMIGTA-SYNAQ- 222
Query 128 GAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHI-AVQGFMQTLRQIGRNME 186
N + +S ++ +M + G+AP+ ++G V+ + V G TL+ R E
Sbjct 223 -LLTDANNRIVTSDMTNIEMWNMASIGYAPMKLLLGTSVYSLGVVGGITSTLKSFFRG-E 280
Query 187 MPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVRRT 246
+ + T+ Y ARE A++ + +EA+ GA VVGV + Y G +EF GTAV++T
Sbjct 281 INELTRMIYAARENALAIINNEAKAIGADDVVGVKTYV--YQLGNGLIEFLAIGTAVKKT 338
Score = 42.4 bits (98), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 32/107 (30%), Positives = 50/107 (47%), Gaps = 3/107 (2%)
Query 137 AFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQ 196
A ++ LSG ++ L + P V+GN V + + G + + + E+ Q T +
Sbjct 2 AVTTGLSGNEIYCLQLKNYEPGNIVVGNSVHSLGIIGSVGSGLKAMLGGELQQVTSLIEE 61
Query 197 ARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAV 243
RE A RM EA+ ATG+ GV S + VEF + G+ V
Sbjct 62 GRETAYKRMLQEAQTSNATGITGVT---SQLIFHGSNVEFLSIGSMV 105
>gi|270157494|ref|ZP_06186151.1| conserved hypothetical protein [Legionella longbeachae D-4968]
gi|289164119|ref|YP_003454257.1| hypothetical protein LLO_0775 [Legionella longbeachae NSW150]
gi|269989519|gb|EEZ95773.1| conserved hypothetical protein [Legionella longbeachae D-4968]
gi|288857292|emb|CBJ11119.1| hypothetical protein LLO_0775 [Legionella longbeachae NSW150]
Length=411
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/211 (28%), Positives = 105/211 (50%), Gaps = 16/211 (7%)
Query 20 SFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFS-----GMRQQQELGVLTEATYR 74
+ T+ LS NE L + P +V+G SV+ +G S + EL +T
Sbjct 2 TVTTGLSGNEIFCLQLKNYAPGPIVVGNSVHSLGIIGSVGSSFKVILGGELTQITSLIEE 61
Query 75 ARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPN 134
R A RM EA ++ A GI GV H G ++EF+++G+ + + GA N
Sbjct 62 GRETAYKRMLEEAVSMNATGITGVTSQLILH---GANVEFLSIGSVIY--GEEGA----N 112
Query 135 GQAFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAV-QGFMQTLRQIGRNMEMPQWTQG 193
+ FS+ GQ++ L +G+ P+ F GN + + + +G + +L+ +GR E+ +++
Sbjct 113 KEKFSTSADGQELYAQLDAGYKPICFSFGNVAYSMGLGRGILGSLKTLGRG-EIKEYSDI 171
Query 194 NYQARELAMSRMQSEAERDGATGVVGVHFAI 224
+ R LA++R+ + A + A V+G+ +
Sbjct 172 FNKTRHLALNRITAHARQYKANAVLGIKTTV 202
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 58/216 (27%), Positives = 99/216 (46%), Gaps = 15/216 (6%)
Query 36 AGFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWN-----AMARMQAEADAL 90
AG++PI G Y +G + + LG Y +N A+ R+ A A
Sbjct 131 AGYKPICFSFGNVAYSMGLGRGILGSLKTLGRGEIKEYSDIFNKTRHLALNRITAHARQY 190
Query 91 KADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTL 150
KA+ ++G++ G E L +GT + P P + +S ++ +M +
Sbjct 191 KANAVLGIKTTVLPFGGVNEML---MIGTG---SYNPQLMGDPGNEIVTSDMTNIEMWNM 244
Query 151 LRSGFAPVAFVMGNCVFHIA-VQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEA 209
R G+ P+ ++G V+ + + G T++ R E+ + T+ Y ARE A++ + EA
Sbjct 245 ARMGYTPMKLLLGTSVYSLGIIGGITSTIKAFVRG-EINELTRMIYHARENALAIINEEA 303
Query 210 ERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVRR 245
+ GA VVGV + Y G +EF GTAV++
Sbjct 304 QALGADDVVGVKTYV--YQLGNGLIEFLAIGTAVKK 337
Score = 42.0 bits (97), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 31/105 (30%), Positives = 49/105 (47%), Gaps = 3/105 (2%)
Query 139 SSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQAR 198
++ LSG ++ L +AP V+GN V + + G + + ++ E+ Q T + R
Sbjct 4 TTGLSGNEIFCLQLKNYAPGPIVVGNSVHSLGIIGSVGSSFKVILGGELTQITSLIEEGR 63
Query 199 ELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAV 243
E A RM EA ATG+ GV S VEF + G+ +
Sbjct 64 ETAYKRMLEEAVSMNATGITGV---TSQLILHGANVEFLSIGSVI 105
>gi|300783265|ref|YP_003763556.1| hypothetical protein AMED_1340 [Amycolatopsis mediterranei U32]
gi|299792779|gb|ADJ43154.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340524646|gb|AEK39851.1| hypothetical protein RAM_06795 [Amycolatopsis mediterranei S699]
Length=269
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 73/249 (30%), Positives = 112/249 (45%), Gaps = 21/249 (8%)
Query 22 TSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAMA 81
TS LS+ AGF P+ VMG V+ +GF + +G L A+
Sbjct 27 TSLLSVPGAVGADVAGFTPVGDVMGCVVHQIGFTGTDTWSADRIGQLAGILREGYETALD 86
Query 82 RMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSH 141
R+ EA A+ ADG++G+ ++ G EF+A+GTAVR A P RP F++
Sbjct 87 RLVEEAAAIGADGVLGI--DFAVTSPDGAAQEFVALGTAVRAKAGP----RPR-HPFTTG 139
Query 142 LSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQ--TLRQIGRNMEMPQWTQGNYQARE 199
L GQD+ L+ +G+ PV +G + A + Q + + N+E+P T+ R
Sbjct 140 LPGQDVAKLMLAGWVPVRVAVG--ISGRAFTDYWQRSLVNEWAGNVEIPLATELATAVRA 197
Query 200 LAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEF----YTAGTAVRRTGSGETITPS 255
A + E GA G G+ + W VH + GTA+ R G TP+
Sbjct 198 EARAEFAREIGDCGADG--GIVSDLKFDIWPVHDIAVAAMATVIGTAIARFHEGPA-TPA 254
Query 256 ---FVLPMD 261
+LP++
Sbjct 255 GTLKILPLN 263
>gi|331695282|ref|YP_004331521.1| hypothetical protein Psed_1429 [Pseudonocardia dioxanivorans
CB1190]
gi|326949971|gb|AEA23668.1| protein of unknown function DUF74 [Pseudonocardia dioxanivorans
CB1190]
Length=286
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 65/219 (30%), Positives = 93/219 (43%), Gaps = 28/219 (12%)
Query 36 AGFEPIELVMGVSVYHVGFQF------------------SGMRQQQELGVLTEATYRARW 77
AGF+P+ VMG V +G+Q S + +A Y
Sbjct 41 AGFDPVGEVMGSMVQRMGWQGYRGCGGYGWGGVPTTITSSDGNRFSGFAPYVQALYTGHG 100
Query 78 NAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQA 137
A+ R+ AEA A+ ADG+VGVRL G EF A+GTAVR ++ RP G+
Sbjct 101 TAIGRLVAEAAAIGADGVVGVRLA--QEPMFGNAHEFTAIGTAVRARSR----TRP-GRI 153
Query 138 FSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGR-NMEMPQWTQGNYQ 196
F++ L D+ LL++G+ PV V G V ++ G N+E+ +T
Sbjct 154 FTTVLPAPDVTALLQAGWVPVQIVYGISVALRHDDWATRSQASWGAGNVEVSGYTDLIGT 213
Query 197 ARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVE 235
R A ++ R GA G V +S W V E
Sbjct 214 TRADARRLLRDAVRRTGADGAVLDDMTLS--VWSVEPAE 250
>gi|344199525|ref|YP_004783851.1| hypothetical protein Acife_1374 [Acidithiobacillus ferrivorans
SS3]
gi|343774969|gb|AEM47525.1| protein of unknown function DUF74 [Acidithiobacillus ferrivorans
SS3]
Length=325
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 44/149 (30%), Positives = 67/149 (45%), Gaps = 19/149 (12%)
Query 15 SHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEATYR 74
S + + S +E L G +PI +V G YH G+ + T+ Y
Sbjct 68 SQGQLPWISTARASELLLQRSHGVQPIGMVTGNCWYHFGYSW------------TQGHYD 115
Query 75 ARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPN 134
A+ RM+ EA AL A+ +V VR+ HHGE E +++ GTA+R P P+
Sbjct 116 GWHTAVERMRLEALALGANAVVDVRMK-VHHGE-HEDMDYGVTGTAIRIRGLP-----PS 168
Query 135 GQAFSSHLSGQDMVTLLRSGFAPVAFVMG 163
+ + +S + V LL G PV +G
Sbjct 169 AEPVVATVSALEFVRLLEDGVVPVGIAIG 197
Score = 42.0 bits (97), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 31/102 (31%), Positives = 44/102 (44%), Gaps = 21/102 (20%)
Query 154 GFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDG 213
G P+ V GNC +H G+ WTQG+Y A+ RM+ EA G
Sbjct 90 GVQPIGMVTGNCWYHF---GY--------------SWTQGHYDGWHTAVERMRLEALALG 132
Query 214 ATGVVGVHFAISNYAWGVH-TVEFYTAGTAVRRTGSGETITP 254
A VV V + + G H +++ GTA+R G + P
Sbjct 133 ANAVVDVRMKVHH---GEHEDMDYGVTGTAIRIRGLPPSAEP 171
>gi|255021071|ref|ZP_05293124.1| hypothetical protein ACA_2798 [Acidithiobacillus caldus ATCC
51756]
gi|340781700|ref|YP_004748307.1| hypothetical protein Atc_0958 [Acidithiobacillus caldus SM-1]
gi|254969485|gb|EET26994.1| hypothetical protein ACA_2798 [Acidithiobacillus caldus ATCC
51756]
gi|340555853|gb|AEK57607.1| conserved hypothetical protein [Acidithiobacillus caldus SM-1]
Length=325
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/212 (27%), Positives = 92/212 (44%), Gaps = 31/212 (14%)
Query 21 FTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAM 80
+ S S+ + G P+ +V G YH G+ + T+ + +A+
Sbjct 74 WISSGSVAALLMERSHGIRPLGMVSGNCWYHFGYSW------------TQGHHDGWRSAV 121
Query 81 ARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSS 140
ARMQ EA A+ A+ +V VR+ H GE + ++F VGTAVR P P + +
Sbjct 122 ARMQLEAVAMGANAVVDVRMR-VHRGE-SQDMDFGLVGTAVRIQDLP-----PAAEPAVA 174
Query 141 HLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQT-LRQIG---------RNMEMPQW 190
+ + V LL G PV +G ++ Q F+ T L Q G +N+E+
Sbjct 175 TVPALEFVRLLEDGVVPVGISIG--AYYDWYQPFLGTALDQAGQMAPFGAAFQNLEITDL 232
Query 191 TQGNYQARELAMSRMQSEAERDGATGVVGVHF 222
+ R A+ +Q + +R GA + HF
Sbjct 233 SSFQENVRRRALYDLQQDGQRLGAGVLAHTHF 264
Score = 38.5 bits (88), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/91 (31%), Positives = 37/91 (41%), Gaps = 19/91 (20%)
Query 154 GFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDG 213
G P+ V GNC +H WTQG++ A++RMQ EA G
Sbjct 90 GIRPLGMVSGNCWYHFGYS-----------------WTQGHHDGWRSAVARMQLEAVAMG 132
Query 214 ATGVVGVHFAISNYAWGVHTVEFYTAGTAVR 244
A VV V + ++F GTAVR
Sbjct 133 ANAVVDVRMRVHRGE--SQDMDFGLVGTAVR 161
>gi|198283792|ref|YP_002220113.1| hypothetical protein Lferr_1684 [Acidithiobacillus ferrooxidans
ATCC 53993]
gi|218666390|ref|YP_002426423.1| hypothetical protein AFE_2019 [Acidithiobacillus ferrooxidans
ATCC 23270]
gi|198248313|gb|ACH83906.1| hypothetical protein Lferr_1684 [Acidithiobacillus ferrooxidans
ATCC 53993]
gi|218518603|gb|ACK79189.1| conserved domain protein [Acidithiobacillus ferrooxidans ATCC
23270]
gi|339834457|gb|EGQ62220.1| hypothetical protein GGI1_11608 [Acidithiobacillus sp. GGI-221]
Length=325
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/149 (30%), Positives = 65/149 (44%), Gaps = 19/149 (12%)
Query 15 SHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEATYR 74
S + S +E L G +PI +V G YH G+ + T+ Y
Sbjct 68 SQGRLPWISTAKASELLLQRSHGVQPIGMVTGNCWYHFGYSW------------TQGHYD 115
Query 75 ARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPN 134
A+ RM+ EA AL A+ IV VR+ H GE E +++ GTA+R P P+
Sbjct 116 GWHTAVERMRLEALALGANAIVDVRMQ-VHRGE-REDMDYGVTGTAIRIRGLP-----PS 168
Query 135 GQAFSSHLSGQDMVTLLRSGFAPVAFVMG 163
+ + +S + V LL G PV +G
Sbjct 169 AEPVVATVSALEFVRLLEDGVVPVGIAIG 197
Score = 38.9 bits (89), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 28/101 (28%), Positives = 41/101 (41%), Gaps = 19/101 (18%)
Query 154 GFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDG 213
G P+ V GNC +H G+ WTQG+Y A+ RM+ EA G
Sbjct 90 GVQPIGMVTGNCWYHF---GY--------------SWTQGHYDGWHTAVERMRLEALALG 132
Query 214 ATGVVGVHFAISNYAWGVHTVEFYTAGTAVRRTGSGETITP 254
A +V V + +++ GTA+R G + P
Sbjct 133 ANAIVDVRMQVHRGE--REDMDYGVTGTAIRIRGLPPSAEP 171
>gi|339628141|ref|YP_004719784.1| hypothetical protein TPY_1861 [Sulfobacillus acidophilus TPY]
gi|339285930|gb|AEJ40041.1| hypothetical protein TPY_1861 [Sulfobacillus acidophilus TPY]
Length=332
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 57/208 (28%), Positives = 85/208 (41%), Gaps = 20/208 (9%)
Query 23 SDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSG---MRQQQELGVLTEATYRARWNA 79
S S+ E AL PI V G S +H S + + L A YRA+ +A
Sbjct 58 STFSVPETALADLLHLTPIAQVTGSSYFHAATDQSQRIFLDDHFDAANLIRAHYRAKSDA 117
Query 80 MARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFS 139
+ R+ EA A ++ R+ + G + VGTAV + G RP
Sbjct 118 LGRLVEEARLAGAHAVIDTRIQFTREGT---VVTCSLVGTAVSF----GPTLRPPRTPLV 170
Query 140 SHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGR-----NMEMPQWTQGN 194
S L+G+D L G+ PV + +G H+ G+ R I N E+ T+
Sbjct 171 SPLTGEDFYRLWSRGYMPVTWAVG-YYRHVMPVGYRT--RSITSGWNFWNQEVTGVTERL 227
Query 195 YQARELAMSRMQSEAERDG--ATGVVGV 220
+ R A+ +M +A G G+VGV
Sbjct 228 SEVRRYAVQQMLRDARDSGQRVDGLVGV 255
>gi|159906208|ref|YP_001549870.1| hypothetical protein MmarC6_1828 [Methanococcus maripaludis C6]
gi|226696092|sp|A9ABB5.1|Y1828_METM6 RecName: Full=UPF0145 protein MmarC6_1828
gi|159887701|gb|ABX02638.1| protein of unknown function DUF74 [Methanococcus maripaludis
C6]
Length=112
Score = 48.9 bits (115), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/59 (45%), Positives = 37/59 (63%), Gaps = 2/59 (3%)
Query 186 EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVR 244
E+P++T+ ++RE A+ RM EA+R GA VVGV F S+ G + EF GTAV+
Sbjct 46 EIPEYTKMMDESREEALKRMYREAQRRGADAVVGVRFETSSILAG--SAEFLCYGTAVK 102
>gi|148653977|ref|YP_001281070.1| hypothetical protein PsycPRwf_2180 [Psychrobacter sp. PRwf-1]
gi|148573061|gb|ABQ95120.1| protein of unknown function DUF74 [Psychrobacter sp. PRwf-1]
Length=144
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/105 (37%), Positives = 53/105 (51%), Gaps = 10/105 (9%)
Query 32 LLHGAGF---EPIELVMGVSV--YHVGFQ-FSGMRQQQ--ELGVLTEATYRARWNAMARM 83
L H G+ E +++V G +V HVG F+ ++ EL TE +R A+ RM
Sbjct 8 LEHLPGYTITERLDVVYGSTVRSKHVGKDIFASLKNIVGGELTAYTELLEESRQEAIDRM 67
Query 84 QAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPG 128
A+ADAL AD +VG+R + +G E GTAVR A P
Sbjct 68 LAKADALGADAVVGLRFSTSSIAQGAS--ELFVYGTAVRAVANPA 110
>gi|294339243|emb|CAZ87597.1| Hypothetical protein THI_0893 [Thiomonas sp. 3As]
Length=336
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/127 (30%), Positives = 54/127 (43%), Gaps = 18/127 (14%)
Query 37 GFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIV 96
G EP+ +V G YH + +S + Y +A R++ EA L A+ +V
Sbjct 92 GIEPLGMVNGNCWYHFDYSWS------------DGHYDGWHHAFDRLRLEAALLGANAVV 139
Query 97 GVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFA 156
V L H G+G L+F GTAVR P P+ + +S V LL G
Sbjct 140 DVTLQ-AHAGQGPNDLDFALTGTAVRIRGLP-----PSTDPAVATVSALHFVRLLDDGAV 193
Query 157 PVAFVMG 163
PV +G
Sbjct 194 PVGIAIG 200
Score = 38.5 bits (88), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/102 (28%), Positives = 41/102 (41%), Gaps = 18/102 (17%)
Query 154 GFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDG 213
G P+ V GNC +H W+ G+Y A R++ EA G
Sbjct 92 GIEPLGMVNGNCWYHFDYS-----------------WSDGHYDGWHHAFDRLRLEAALLG 134
Query 214 ATGVVGVHFAISNYAWGVHTVEFYTAGTAVRRTGSGETITPS 255
A VV V ++ G + ++F GTAVR G + P+
Sbjct 135 ANAVVDVTLQ-AHAGQGPNDLDFALTGTAVRIRGLPPSTDPA 175
>gi|296135188|ref|YP_003642430.1| hypothetical protein Tint_0703 [Thiomonas intermedia K12]
gi|295795310|gb|ADG30100.1| conserved hypothetical protein [Thiomonas intermedia K12]
Length=336
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/127 (30%), Positives = 55/127 (44%), Gaps = 18/127 (14%)
Query 37 GFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAMARMQAEADALKADGIV 96
G EP+ +V G YH + +S + Y +A+ R++ EA L A+ +V
Sbjct 92 GIEPLGMVNGNCWYHFDYSWS------------DGHYDGWHHALDRLRLEAALLGANAVV 139
Query 97 GVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFA 156
V L H G+G ++F GTAVR P P+ + +S V LL G
Sbjct 140 DVTLQ-AHAGQGPNDVDFALTGTAVRIRGLP-----PSTDPAVATVSALHFVRLLDDGAV 193
Query 157 PVAFVMG 163
PV +G
Sbjct 194 PVGIAIG 200
Score = 40.4 bits (93), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 29/102 (29%), Positives = 42/102 (42%), Gaps = 18/102 (17%)
Query 154 GFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDG 213
G P+ V GNC +H W+ G+Y A+ R++ EA G
Sbjct 92 GIEPLGMVNGNCWYHFDYS-----------------WSDGHYDGWHHALDRLRLEAALLG 134
Query 214 ATGVVGVHFAISNYAWGVHTVEFYTAGTAVRRTGSGETITPS 255
A VV V ++ G + V+F GTAVR G + P+
Sbjct 135 ANAVVDVTLQ-AHAGQGPNDVDFALTGTAVRIRGLPPSTDPA 175
>gi|254171979|ref|ZP_04878655.1| conserved hypothetical protein [Thermococcus sp. AM4]
gi|214033875|gb|EEB74701.1| conserved hypothetical protein [Thermococcus sp. AM4]
Length=106
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/65 (45%), Positives = 37/65 (57%), Gaps = 5/65 (7%)
Query 183 RNM---EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTA 239
RN+ E+ ++T+ QARE+A+ RM AE GA V+GV F SN G E Y
Sbjct 40 RNLVGGEVKEYTEMMAQAREIALQRMIQSAEEMGANAVIGVRFMTSNV--GQRMAEVYAY 97
Query 240 GTAVR 244
GTAVR
Sbjct 98 GTAVR 102
>gi|46199883|ref|YP_005550.1| hypothetical protein TTC1581 [Thermus thermophilus HB27]
gi|81405422|sp|Q72HB4.1|YF81_THET2 RecName: Full=UPF0145 protein TT_C1581
gi|46197510|gb|AAS81923.1| hypothetical conserved protein [Thermus thermophilus HB27]
Length=114
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/59 (43%), Positives = 37/59 (63%), Gaps = 2/59 (3%)
Query 186 EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVR 244
E+P++T+ +ARE+A +RM EA R GA V+GV +A ++ G + Y GTAVR
Sbjct 51 ELPEYTEMLQEAREVAEARMLEEARRLGAHAVLGVRYATASVMQGAAEILVY--GTAVR 107
>gi|333967570|gb|AEG34335.1| UPF0145 protein ybjQ [Thermus thermophilus SG0.5JP17-16]
Length=111
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/59 (43%), Positives = 37/59 (63%), Gaps = 2/59 (3%)
Query 186 EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVR 244
E+P++T+ +ARE+A +RM EA R GA V+GV +A ++ G + Y GTAVR
Sbjct 48 ELPEYTEMLQEAREVAEARMLEEARRLGAHAVLGVRYATASVMQGAAEILVY--GTAVR 104
>gi|55981913|ref|YP_145210.1| hypothetical protein TTHA1944 [Thermus thermophilus HB8]
gi|81363683|sp|Q5SGY6.1|Y1944_THET8 RecName: Full=UPF0145 protein TTHA1944
gi|55773326|dbj|BAD71767.1| conserved hypothetical protein [Thermus thermophilus HB8]
Length=114
Score = 45.8 bits (107), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/59 (43%), Positives = 37/59 (63%), Gaps = 2/59 (3%)
Query 186 EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVR 244
E+P++T+ +ARE+A +RM EA R GA V+GV +A ++ G + Y GTAVR
Sbjct 51 ELPEYTEMLQEAREVAEARMLEEARRLGAHAVLGVRYATASVMQGAAEILVY--GTAVR 107
>gi|169828726|ref|YP_001698884.1| hypothetical protein Bsph_3249 [Lysinibacillus sphaericus C3-41]
gi|168993214|gb|ACA40754.1| UPF0145 protein [Lysinibacillus sphaericus C3-41]
Length=105
Score = 45.1 bits (105), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 29/76 (39%), Positives = 41/76 (54%), Gaps = 12/76 (15%)
Query 179 RQIGRNM----------EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYA 228
R IGR+M EM ++ + ++RE+A M+ EA++ GA VVGV +A S+
Sbjct 30 RNIGRDMMAGLRNIVGGEMKEYAEMLVRSREIATQAMEEEAKQLGADAVVGVRYATSSVM 89
Query 229 WGVHTVEFYTAGTAVR 244
G V Y GTAVR
Sbjct 90 DGTSEVLVY--GTAVR 103
>gi|126651535|ref|ZP_01723738.1| hypothetical protein BB14905_13020 [Bacillus sp. B14905]
gi|126591484|gb|EAZ85590.1| hypothetical protein BB14905_13020 [Bacillus sp. B14905]
Length=104
Score = 45.1 bits (105), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 29/76 (39%), Positives = 41/76 (54%), Gaps = 12/76 (15%)
Query 179 RQIGRNM----------EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYA 228
R IGR+M EM ++ + ++RE+A M+ EA++ GA VVGV +A S+
Sbjct 29 RNIGRDMMAGLRNIVGGEMKEYAEMLVRSREIATQAMEEEAKQLGADAVVGVRYATSSVM 88
Query 229 WGVHTVEFYTAGTAVR 244
G V Y GTAVR
Sbjct 89 DGTSEVLVY--GTAVR 102
>gi|242399522|ref|YP_002994947.1| hypothetical protein TSIB_1547 [Thermococcus sibiricus MM 739]
gi|242265916|gb|ACS90598.1| hypothetical protein TSIB_1547 [Thermococcus sibiricus MM 739]
Length=116
Score = 44.7 bits (104), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 25/64 (40%), Positives = 39/64 (61%), Gaps = 5/64 (7%)
Query 183 RNM---EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTA 239
RN+ E+ ++T+ +ARE+A+ RM +A++ GA G++GV F S A G E +
Sbjct 50 RNIAGGEVKEYTEMLAEAREVALERMIQDAKKRGANGIIGVRFMTSAVASG--AAEIFAY 107
Query 240 GTAV 243
GTAV
Sbjct 108 GTAV 111
>gi|262369361|ref|ZP_06062689.1| conserved hypothetical protein [Acinetobacter johnsonii SH046]
gi|262315429|gb|EEY96468.1| conserved hypothetical protein [Acinetobacter johnsonii SH046]
Length=118
Score = 44.7 bits (104), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 24/61 (40%), Positives = 38/61 (63%), Gaps = 2/61 (3%)
Query 186 EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVRR 245
E+ +T+ ++R+ AM+RM ++A+ GA +VG+ F+ SN A G E + GTAVR
Sbjct 46 ELTAYTELLEESRQEAMNRMIAKAQDLGANAIVGIRFSTSNIAQGAS--ELFVYGTAVRV 103
Query 246 T 246
T
Sbjct 104 T 104
>gi|206900474|ref|YP_002250230.1| hypothetical protein DICTH_0347 [Dictyoglomus thermophilum H-6-12]
gi|206739577|gb|ACI18635.1| conserved protein [Dictyoglomus thermophilum H-6-12]
Length=107
Score = 44.3 bits (103), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/97 (35%), Positives = 49/97 (51%), Gaps = 14/97 (14%)
Query 160 FVMGNCVFHI--AVQGFMQTLRQIGRNM----------EMPQWTQGNYQARELAMSRMQS 207
+V G V I V+G R +G+++ E+ ++T+ +ARE A+ RM
Sbjct 8 YVPGYEVIEILGIVKGSTARTRHVGKDILASLRNLVGGEVVEYTKLIGEAREQALDRMIE 67
Query 208 EAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVR 244
EA+R GA V+GV F + GV + Y GTAVR
Sbjct 68 EAKRLGADAVIGVRFGTTQIMQGVAEILAY--GTAVR 102
>gi|76801942|ref|YP_326950.1| hypothetical protein NP2600A [Natronomonas pharaonis DSM 2160]
gi|115311212|sp|Q3IR53.1|Y2600_NATPD RecName: Full=UPF0145 protein NP2600A
gi|76557807|emb|CAI49391.1| conserved hypothetical protein [Natronomonas pharaonis DSM 2160]
Length=104
Score = 43.9 bits (102), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 25/59 (43%), Positives = 36/59 (62%), Gaps = 2/59 (3%)
Query 64 ELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVR 122
EL +E +AR A++RM+A+A+ + AD +V VRL +GG E +A GTAVR
Sbjct 46 ELKAYSELLSKARDEALSRMEADAEEMGADAVVNVRLETSKVTDGGS--EVIAYGTAVR 102
>gi|255318814|ref|ZP_05360040.1| hypothetical protein ACIRA0001_0961 [Acinetobacter radioresistens
SK82]
gi|262378925|ref|ZP_06072082.1| conserved hypothetical protein [Acinetobacter radioresistens
SH164]
gi|255304070|gb|EET83261.1| hypothetical protein ACIRA0001_0961 [Acinetobacter radioresistens
SK82]
gi|262300210|gb|EEY88122.1| conserved hypothetical protein [Acinetobacter radioresistens
SH164]
Length=116
Score = 43.9 bits (102), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 31/89 (35%), Positives = 46/89 (52%), Gaps = 7/89 (7%)
Query 174 FMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHT 233
FM L+ I E+ +T+ ++R+ AM+RM +A+ GA +VG+ F+ SN A G
Sbjct 35 FMAGLKNI-VGGELKGYTELLEESRQEAMNRMIEKAQNLGANAIVGIRFSTSNIAQGAS- 92
Query 234 VEFYTAGTAVRRTGSGETITPSFVLPMDS 262
E + GTAV T + PS P S
Sbjct 93 -ELFVYGTAVTVT----PVAPSIPDPFSS 116
>gi|343385828|gb|AEM21318.1| hypothetical protein Bint_0689 [Brachyspira intermedia PWS/A]
Length=108
Score = 43.5 bits (101), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 26/83 (32%), Positives = 45/83 (55%), Gaps = 12/83 (14%)
Query 171 VQGFMQTLRQIGRNM----------EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGV 220
V+G + + IG+++ E+ +T+ +ARE+A RM EA++ GA ++G+
Sbjct 25 VKGNIAQAKHIGKDLLAGLKNIVGGEVEGYTEMINEAREIATKRMIEEAKKLGANAIIGI 84
Query 221 HFAISNYAWGVHTVEFYTAGTAV 243
H++ S+ G T E GTAV
Sbjct 85 HYSSSSVMEG--TTEVIAYGTAV 105
>gi|299534805|ref|ZP_07048134.1| hypothetical protein BFZC1_02197 [Lysinibacillus fusiformis ZC1]
gi|298729650|gb|EFI70196.1| hypothetical protein BFZC1_02197 [Lysinibacillus fusiformis ZC1]
Length=104
Score = 43.1 bits (100), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 37/118 (32%), Positives = 54/118 (46%), Gaps = 29/118 (24%)
Query 137 AFSSHLSGQDMVTLLRSGFAPVAFVMGNCVFHIAVQGFMQTLRQIGRNM----------E 186
A S ++G+D++ L V GN V R IGR+M E
Sbjct 4 ATSDMVAGKDIMETL-------GLVKGNSVQS----------RNIGRDMIAGLRNIVGGE 46
Query 187 MPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVR 244
M ++ + ++RE+A M EA++ GA VVGV +A S+ G V Y GTAV+
Sbjct 47 MKEYAEMLVRSREIATEAMVEEAKQLGADAVVGVRYATSSVMDGTSEVLVY--GTAVK 102
>gi|253682761|ref|ZP_04863556.1| conserved hypothetical protein [Clostridium botulinum D str.
1873]
gi|253560960|gb|EES90414.1| conserved hypothetical protein [Clostridium botulinum D str.
1873]
Length=103
Score = 43.1 bits (100), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 32/96 (34%), Positives = 49/96 (52%), Gaps = 13/96 (13%)
Query 160 FVMGNCVFHIA-VQGFMQTLRQIGRNM----------EMPQWTQGNYQARELAMSRMQSE 208
F+ G + IA V+G + IG+++ E+ ++T+ +ARE+A +RM E
Sbjct 8 FISGKEIETIALVKGSTIQSKNIGKDILSGLRTIVGGELNEYTEMMNEAREIATNRMIEE 67
Query 209 AERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVR 244
A+R GA V+ V +A S V Y GTAVR
Sbjct 68 AKRLGADAVINVRYATSAIMQAAAEVIVY--GTAVR 101
>gi|225621364|ref|YP_002722622.1| hypothetical protein BHWA1_02464 [Brachyspira hyodysenteriae
WA1]
gi|225216184|gb|ACN84918.1| hypothetical protein BHWA1_02464 [Brachyspira hyodysenteriae
WA1]
Length=108
Score = 43.1 bits (100), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 26/83 (32%), Positives = 45/83 (55%), Gaps = 12/83 (14%)
Query 171 VQGFMQTLRQIGRNM----------EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGV 220
V+G + + IG+++ E+ +T+ +ARE+A RM EA++ GA ++G+
Sbjct 25 VKGNIAQAKHIGKDLLAGLKNIVGGEVEGYTEMINEAREIATKRMIEEAKKLGANAIIGI 84
Query 221 HFAISNYAWGVHTVEFYTAGTAV 243
H++ S+ G T E GTAV
Sbjct 85 HYSSSSVMEG--TTEVIAYGTAV 105
>gi|57641861|ref|YP_184339.1| hypothetical protein TK1926 [Thermococcus kodakarensis KOD1]
gi|73921071|sp|Q5JET9.1|Y1926_PYRKO RecName: Full=UPF0145 protein TK1926
gi|57160185|dbj|BAD86115.1| hypothetical protein, conserved, DUF74 family [Thermococcus kodakarensis
KOD1]
Length=113
Score = 43.1 bits (100), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 30/70 (43%), Positives = 37/70 (53%), Gaps = 3/70 (4%)
Query 174 FMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHT 233
M LR I + E+ ++TQ +ARE A+ RM A+ GA VV V FA SN G
Sbjct 41 IMAVLRNI-KGGEVREYTQMMAEAREEALRRMALHAKELGANAVVNVRFATSN--VGSSV 97
Query 234 VEFYTAGTAV 243
E Y GTAV
Sbjct 98 AEVYAYGTAV 107
>gi|326563852|gb|EGE14103.1| hypothetical protein E9M_02908 [Moraxella catarrhalis 46P47B1]
Length=120
Score = 43.1 bits (100), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 30/85 (36%), Positives = 47/85 (56%), Gaps = 4/85 (4%)
Query 161 VMGNCVFHIAV-QGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVG 219
V GN V V + F+ L+ I E+ +T+ +AR+ AM+RM ++A+ GA +VG
Sbjct 21 VYGNTVRSKHVGKDFLSGLKNI-VGSELTTYTELLEEARQEAMNRMITKAQVLGADAIVG 79
Query 220 VHFAISNYAWGVHTVEFYTAGTAVR 244
+ F+ SN G E + GTAV+
Sbjct 80 IRFSTSNITAGAS--ELFVYGTAVK 102
>gi|339441073|ref|YP_004707078.1| hypothetical protein CXIVA_00090 [Clostridium sp. SY8519]
gi|338900474|dbj|BAK45976.1| uncharacterized ACR protein [Clostridium sp. SY8519]
Length=103
Score = 43.1 bits (100), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 29/84 (35%), Positives = 43/84 (52%), Gaps = 12/84 (14%)
Query 171 VQGFMQTLRQIGRNM----------EMPQWTQGNYQARELAMSRMQSEAERDGATGVVGV 220
V+G M + IG+++ E+ +T +A++ A RM++EAE+ GA VVGV
Sbjct 20 VEGCMIQSKHIGKDIGQSLKTMVGGELKSYTAMMAEAKDTARQRMRAEAEKLGADAVVGV 79
Query 221 HFAISNYAWGVHTVEFYTAGTAVR 244
+ S G V Y GTAVR
Sbjct 80 RYTTSAIVQGAAEVIAY--GTAVR 101
Score = 35.8 bits (81), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 25/59 (43%), Positives = 31/59 (53%), Gaps = 2/59 (3%)
Query 64 ELGVLTEATYRARWNAMARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVR 122
EL T A+ A RM+AEA+ L AD +VGVR +G E +A GTAVR
Sbjct 45 ELKSYTAMMAEAKDTARQRMRAEAEKLGADAVVGVRYTTSAIVQGAA--EVIAYGTAVR 101
>gi|331270094|ref|YP_004396586.1| hypothetical protein CbC4_1917 [Clostridium botulinum BKT015925]
gi|329126644|gb|AEB76589.1| conserved hypothetical protein [Clostridium botulinum BKT015925]
Length=108
Score = 43.1 bits (100), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 30/96 (32%), Positives = 49/96 (52%), Gaps = 13/96 (13%)
Query 160 FVMGNCVFHIA-VQGFMQTLRQIGRNM----------EMPQWTQGNYQARELAMSRMQSE 208
F+ G + IA V+G + IG+++ E+ ++T+ +ARE+A +RM E
Sbjct 13 FISGKEIETIALVKGSTIQSKNIGKDILSGLRTIVGGELNEYTEMMNEAREIATNRMIEE 72
Query 209 AERDGATGVVGVHFAISNYAWGVHTVEFYTAGTAVR 244
AER GA V+ + +A S V Y GTA++
Sbjct 73 AERLGADAVINIRYATSAIMQAAAEVIVY--GTAIK 106
Lambda K H
0.320 0.132 0.394
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 391676602750
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40