BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2044c
Length=105
Score E
Sequences producing significant alignments: (Bits) Value
gi|148823260|ref|YP_001288014.1| hypothetical protein TBFG_12081... 209 8e-53
gi|15841532|ref|NP_336569.1| hypothetical protein MT2104 [Mycoba... 209 1e-52
gi|15609181|ref|NP_216560.1| hypothetical protein Rv2044c [Mycob... 207 3e-52
gi|340627055|ref|YP_004745507.1| hypothetical protein MCAN_20671... 206 7e-52
gi|240169261|ref|ZP_04747920.1| hypothetical protein MkanA1_0810... 122 2e-26
gi|325676878|ref|ZP_08156551.1| hypothetical protein HMPREF0724_... 106 1e-21
gi|312139674|ref|YP_004007010.1| integral membrane protein [Rhod... 105 2e-21
gi|226306923|ref|YP_002766883.1| hypothetical protein RER_34360 ... 102 3e-20
gi|229491066|ref|ZP_04384897.1| conserved hypothetical protein [... 100 6e-20
gi|118472176|ref|YP_886687.1| hypothetical protein MSMEG_2341 [M... 98.2 4e-19
gi|333919614|ref|YP_004493195.1| hypothetical protein AS9A_1946 ... 97.4 6e-19
gi|226360045|ref|YP_002777823.1| hypothetical protein ROP_06310 ... 95.9 2e-18
gi|72161579|ref|YP_289236.1| hypothetical protein Tfu_1175 [Ther... 95.5 2e-18
gi|111017914|ref|YP_700886.1| hypothetical protein RHA1_ro00896 ... 95.1 3e-18
gi|297562104|ref|YP_003681078.1| hypothetical protein Ndas_3164 ... 93.2 1e-17
gi|72161580|ref|YP_289237.1| hypothetical protein Tfu_1176 [Ther... 91.7 4e-17
gi|254821733|ref|ZP_05226734.1| hypothetical protein MintA_17497... 90.1 1e-16
gi|343927302|ref|ZP_08766778.1| hypothetical protein GOALK_089_0... 84.0 6e-15
gi|296171230|ref|ZP_06852634.1| conserved hypothetical protein [... 83.6 8e-15
gi|33596019|ref|NP_883662.1| hypothetical protein BPP1360 [Borde... 80.5 8e-14
gi|54024522|ref|YP_118764.1| hypothetical protein nfa25530 [Noca... 79.0 2e-13
gi|218890414|ref|YP_002439278.1| hypothetical protein PLES_16741... 77.8 6e-13
gi|49074174|gb|AAT49385.1| PA3386 [synthetic construct] 77.4 6e-13
gi|107102918|ref|ZP_01366836.1| hypothetical protein PaerPA_0100... 77.4 7e-13
gi|116051404|ref|YP_789763.1| hypothetical protein PA14_20280 [P... 77.4 7e-13
gi|15598582|ref|NP_252076.1| hypothetical protein PA3386 [Pseudo... 77.4 7e-13
gi|229592050|ref|YP_002874169.1| hypothetical protein PFLU4648 [... 77.4 7e-13
gi|296388101|ref|ZP_06877576.1| hypothetical protein PaerPAb_080... 75.9 2e-12
gi|77459522|ref|YP_349029.1| hypothetical protein Pfl01_3300 [Ps... 75.5 3e-12
gi|56477664|ref|YP_159253.1| hypothetical protein ebB138 [Aromat... 74.7 4e-12
gi|312962508|ref|ZP_07776999.1| hypothetical protein PFWH6_4429 ... 74.3 6e-12
gi|297560980|ref|YP_003679954.1| hypothetical protein Ndas_2022 ... 73.9 7e-12
gi|121583245|ref|YP_973681.1| hypothetical protein Pnap_4875 [Po... 72.4 2e-11
gi|104781510|ref|YP_608008.1| hypothetical protein PSEEN2397 [Ps... 72.0 3e-11
gi|74317131|ref|YP_314871.1| hypothetical protein Tbd_1113 [Thio... 70.5 9e-11
gi|237653420|ref|YP_002889734.1| hypothetical protein Tmz1t_2757... 70.1 1e-10
gi|302038964|ref|YP_003799286.1| hypothetical protein NIDE3683 [... 70.1 1e-10
gi|330966201|gb|EGH66461.1| hypothetical protein PSYAC_16461 [Ps... 69.7 1e-10
gi|78223775|ref|YP_385522.1| hypothetical protein Gmet_2578 [Geo... 69.3 2e-10
gi|113869532|ref|YP_728021.1| hypothetical protein H16_A3595 [Ra... 69.3 2e-10
gi|339327625|ref|YP_004687318.1| hypothetical protein CNE_1c3544... 67.4 6e-10
gi|334112323|ref|ZP_08486586.1| hypothetical protein MetalDRAFT_... 67.4 6e-10
gi|73542968|ref|YP_297488.1| hypothetical protein Reut_A3285 [Ra... 65.5 3e-09
gi|330957602|gb|EGH57862.1| hypothetical protein PMA4326_03369 [... 65.1 3e-09
gi|89901879|ref|YP_524350.1| hypothetical protein Rfer_3110 [Rho... 65.1 3e-09
gi|256380915|ref|YP_003104575.1| hypothetical protein Amir_6934 ... 65.1 4e-09
gi|322421784|ref|YP_004201007.1| hypothetical protein GM18_4319 ... 64.7 5e-09
gi|302188444|ref|ZP_07265117.1| hypothetical protein Psyrps6_189... 64.3 6e-09
gi|194291124|ref|YP_002007031.1| hypothetical protein RALTA_A304... 63.9 7e-09
gi|343502383|ref|ZP_08740238.1| hypothetical protein VITU9109_05... 63.9 7e-09
>gi|148823260|ref|YP_001288014.1| hypothetical protein TBFG_12081 [Mycobacterium tuberculosis F11]
gi|253798899|ref|YP_003031900.1| hypothetical protein TBMG_01937 [Mycobacterium tuberculosis KZN
1435]
gi|254551069|ref|ZP_05141516.1| hypothetical protein Mtube_11491 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
34 more sequence titles
Length=116
Score = 209 bits (533), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 105/105 (100%), Positives = 105/105 (100%), Gaps = 0/105 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP
Sbjct 12 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 71
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR
Sbjct 72 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 116
>gi|15841532|ref|NP_336569.1| hypothetical protein MT2104 [Mycobacterium tuberculosis CDC1551]
gi|254232214|ref|ZP_04925541.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|13881776|gb|AAK46383.1| hypothetical protein MT2104 [Mycobacterium tuberculosis CDC1551]
gi|124601273|gb|EAY60283.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
Length=129
Score = 209 bits (532), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 105/105 (100%), Positives = 105/105 (100%), Gaps = 0/105 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP
Sbjct 25 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 84
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR
Sbjct 85 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 129
>gi|15609181|ref|NP_216560.1| hypothetical protein Rv2044c [Mycobacterium tuberculosis H37Rv]
gi|31793227|ref|NP_855720.1| hypothetical protein Mb2070c [Mycobacterium bovis AF2122/97]
gi|121637930|ref|YP_978153.1| hypothetical protein BCG_2063c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
37 more sequence titles
Length=105
Score = 207 bits (528), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 105/105 (100%), Positives = 105/105 (100%), Gaps = 0/105 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP
Sbjct 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR
Sbjct 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
>gi|340627055|ref|YP_004745507.1| hypothetical protein MCAN_20671 [Mycobacterium canettii CIPT
140010059]
gi|340005245|emb|CCC44399.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=105
Score = 206 bits (525), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 104/105 (99%), Positives = 104/105 (99%), Gaps = 0/105 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP
Sbjct 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIV ASWTLYLWLPRR
Sbjct 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVTASWTLYLWLPRR 105
>gi|240169261|ref|ZP_04747920.1| hypothetical protein MkanA1_08104 [Mycobacterium kansasii ATCC
12478]
Length=136
Score = 122 bits (305), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 59/105 (57%), Positives = 75/105 (72%), Gaps = 1/105 (0%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
HFA+IAY++ GGF+ALRWRRT+ H AV W + ++CPLT +ERW R A M PL
Sbjct 13 HFAYIAYLVVGGFVALRWRRTIACHAFAVAWSVVSFTGHLNCPLTGLERWGRAHAGMAPL 72
Query 62 SPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTL-YLWLPRR 105
PDGF+AHYITGV+YP+GW A A + +FA+VA SW L + W RR
Sbjct 73 PPDGFIAHYITGVVYPSGWSAPASIAVFALVAVSWILVFGWQARR 117
>gi|325676878|ref|ZP_08156551.1| hypothetical protein HMPREF0724_14334 [Rhodococcus equi ATCC
33707]
gi|325552426|gb|EGD22115.1| hypothetical protein HMPREF0724_14334 [Rhodococcus equi ATCC
33707]
Length=125
Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 53/105 (51%), Positives = 67/105 (64%), Gaps = 0/105 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H AF+ YV+AGGFLA RWRRT+WLHV AV WG DCPLT +E WAR KA
Sbjct 13 VHLAFLGYVVAGGFLAWRWRRTIWLHVAAVAWGFATVVVGFDCPLTHLENWARRKAGEQQ 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
L DGF+AHY+TGV+YP + + ++ V ASW ++ L R
Sbjct 73 LPSDGFIAHYLTGVLYPTDALDVVRALVALCVVASWIGFVTLRHR 117
>gi|312139674|ref|YP_004007010.1| integral membrane protein [Rhodococcus equi 103S]
gi|311889013|emb|CBH48326.1| putative integral membrane protein [Rhodococcus equi 103S]
Length=125
Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/105 (51%), Positives = 67/105 (64%), Gaps = 0/105 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H AF+ YV+AGGFLA RWRRT+WLHV AV WG DCPLT +E WAR KA
Sbjct 13 VHLAFLGYVVAGGFLAWRWRRTIWLHVAAVAWGFATVLVGFDCPLTHLENWARGKAGEQQ 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
L DGF+AHY+TGV+YP + + ++ V ASW ++ L R
Sbjct 73 LPSDGFIAHYLTGVLYPTDALDVVRALVALCVVASWIGFVTLRHR 117
>gi|226306923|ref|YP_002766883.1| hypothetical protein RER_34360 [Rhodococcus erythropolis PR4]
gi|226186040|dbj|BAH34144.1| conserved hypothetical membrane protein [Rhodococcus erythropolis
PR4]
Length=128
Score = 102 bits (253), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 50/105 (48%), Positives = 65/105 (62%), Gaps = 0/105 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+HFAF+ YV+ GGF+A +WRRT+ H+ AV+WG G A DCPLT +E WAR A +
Sbjct 13 VHFAFVLYVVLGGFVAWKWRRTIGFHLLAVVWGAGSIAVGYDCPLTNLENWARRMAGIAE 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
L GF++HYITGV YPA Q + +V SW Y+ L R
Sbjct 73 LPSTGFISHYITGVFYPASIEGLVQALAATVVVGSWVGYVLLGRH 117
>gi|229491066|ref|ZP_04384897.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229322047|gb|EEN87837.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=128
Score = 100 bits (250), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 49/105 (47%), Positives = 64/105 (61%), Gaps = 0/105 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+HF F+ YV+ GGF+A +WRRT+ H+ AV+WG G A DCPLT +E WAR A +
Sbjct 13 VHFGFVLYVVLGGFVAWKWRRTIGFHLLAVVWGAGSIAVGYDCPLTNLENWARRMAGIAE 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
L GF++HYITGV YPA Q + +V SW Y+ L R
Sbjct 73 LPSTGFISHYITGVFYPASIEGLVQALAATVVVGSWVGYVLLGRH 117
>gi|118472176|ref|YP_886687.1| hypothetical protein MSMEG_2341 [Mycobacterium smegmatis str.
MC2 155]
gi|118173463|gb|ABK74359.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=136
Score = 98.2 bits (243), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 52/104 (50%), Positives = 65/104 (63%), Gaps = 0/104 (0%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
HF ++ Y+ +GGFLALRW RT+ HVP VIWG + A R+ CPLT +E+WAR +A + PL
Sbjct 16 HFGYLLYLPSGGFLALRWPRTLAFHVPTVIWGAFVVALRLRCPLTHLEQWARVRAGLKPL 75
Query 62 SPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
D F+ HY GV+YPA AAQ F A SW L RR
Sbjct 76 PADDFIDHYADGVLYPANRTGAAQAAAFTAAALSWVLLFARRRR 119
>gi|333919614|ref|YP_004493195.1| hypothetical protein AS9A_1946 [Amycolicicoccus subflavus DQS3-9A1]
gi|333481835|gb|AEF40395.1| Hypothetical membrane protein [Amycolicicoccus subflavus DQS3-9A1]
Length=136
Score = 97.4 bits (241), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 49/110 (45%), Positives = 66/110 (60%), Gaps = 5/110 (4%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRV-----DCPLTWVERWARTK 55
+HFAFIAYV+ GGFLA RW +T HV +V+W IG+ + CPLT+VE WAR +
Sbjct 13 LHFAFIAYVMFGGFLAWRWAQTFAFHVISVVWVIGLVIVNLLGVALYCPLTYVENWARER 72
Query 56 AAMTPLSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
A L GF+ HY++GV+YP + A+ V+ V SW Y W+ R
Sbjct 73 AGQEGLDQAGFIDHYLSGVVYPQDHLGLARAVLLTCVLISWAGYFWIRTR 122
>gi|226360045|ref|YP_002777823.1| hypothetical protein ROP_06310 [Rhodococcus opacus B4]
gi|226238530|dbj|BAH48878.1| hypothetical membrane protein [Rhodococcus opacus B4]
Length=136
Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/104 (47%), Positives = 62/104 (60%), Gaps = 0/104 (0%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H FI YV+ GGFLA RW RT+ +H+ AV WG +DCPLT +E WAR +A PL
Sbjct 18 HLTFILYVVCGGFLAWRWPRTIVVHLAAVAWGFTGLIVGIDCPLTHLESWARVRAGQDPL 77
Query 62 SPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
GF++HY+TGV+YP A Q ++ V SW Y+ L R
Sbjct 78 PSSGFISHYLTGVVYPESAAGAIQTLVALCVVTSWAGYVVLRTR 121
>gi|72161579|ref|YP_289236.1| hypothetical protein Tfu_1175 [Thermobifida fusca YX]
gi|71915311|gb|AAZ55213.1| conserved hypothetical protein [Thermobifida fusca YX]
Length=128
Score = 95.5 bits (236), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/105 (47%), Positives = 64/105 (61%), Gaps = 0/105 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+HFAF+AY++ GG LA RW R W H+ A +G+G CPLT +E WAR+KA
Sbjct 13 VHFAFLAYLVCGGVLAWRWPRMFWPHLAAAAYGLGTVVVGWPCPLTRLENWARSKAGQQG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
L P GF++HY+TGV+YPA + Q + IV SW LW RR
Sbjct 73 LDPGGFISHYLTGVLYPADRLPHVQAAVGVIVVLSWAGALWTVRR 117
>gi|111017914|ref|YP_700886.1| hypothetical protein RHA1_ro00896 [Rhodococcus jostii RHA1]
gi|110817444|gb|ABG92728.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=135
Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 50/104 (49%), Positives = 61/104 (59%), Gaps = 0/104 (0%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H FI YV+ GGFLA W RT+ LHV AV WG +DCPLT VE WAR +A PL
Sbjct 17 HLIFILYVVCGGFLAWHWPRTIALHVAAVAWGFTGLLVGIDCPLTHVESWARVRAGQDPL 76
Query 62 SPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
GF++HY+TGV+YP A Q ++ V SW Y+ L R
Sbjct 77 PSSGFISHYLTGVVYPESAAGAIQTLVALGVVTSWVGYVVLRLR 120
>gi|297562104|ref|YP_003681078.1| hypothetical protein Ndas_3164 [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
gi|296846552|gb|ADH68572.1| conserved hypothetical protein [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
Length=127
Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 44/96 (46%), Positives = 62/96 (65%), Gaps = 0/96 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H AF+AY+ GGFLA +W RT+W+H+ ++G+ I CPLT VE WAR +A
Sbjct 13 VHLAFLAYLALGGFLAWKWPRTLWIHLGCALYGLAITLIGWPCPLTHVENWARRRAGQQG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASW 96
L P+GF+ HY+TG++YPA + AQL + V SW
Sbjct 73 LPPEGFIDHYLTGIVYPAEHLLTAQLTVALAVLTSW 108
>gi|72161580|ref|YP_289237.1| hypothetical protein Tfu_1176 [Thermobifida fusca YX]
gi|71915312|gb|AAZ55214.1| hypothetical protein Tfu_1176 [Thermobifida fusca YX]
Length=151
Score = 91.7 bits (226), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 62/96 (65%), Gaps = 0/96 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+HF FIAYV+ GGFLA RW R W H+ A +G+G+ CPLTW+E W R A
Sbjct 25 VHFLFIAYVVCGGFLAWRWPRMFWPHLAAAAYGLGVTVIGWLCPLTWLEHWGRVNAGQHG 84
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASW 96
L +GF++HY+TGVIYPA ++ QL A+VA SW
Sbjct 85 LDAEGFISHYLTGVIYPAEYLREIQLAAAAVVALSW 120
>gi|254821733|ref|ZP_05226734.1| hypothetical protein MintA_17497 [Mycobacterium intracellulare
ATCC 13950]
Length=144
Score = 90.1 bits (222), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/97 (50%), Positives = 60/97 (62%), Gaps = 0/97 (0%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H A++ YV +GGFLALRW RTM LHVPAV WGI + R+ CPLT +E W R A M PL
Sbjct 31 HLAYLLYVPSGGFLALRWPRTMALHVPAVGWGIAVVTLRLPCPLTILESWVRRSAGMDPL 90
Query 62 SPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTL 98
GF+ Y+ G P G + AAQ + F+ SW +
Sbjct 91 PTAGFIERYVAGHFVPTGRIGAAQALAFSAAGISWCV 127
>gi|343927302|ref|ZP_08766778.1| hypothetical protein GOALK_089_00270 [Gordonia alkanivorans NBRC
16433]
gi|343762795|dbj|GAA13704.1| hypothetical protein GOALK_089_00270 [Gordonia alkanivorans NBRC
16433]
Length=128
Score = 84.0 bits (206), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/96 (44%), Positives = 60/96 (63%), Gaps = 0/96 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H F+ YV+ GGF+A RW RT+ LH+ AV WG+ ++CPLT E WAR +A
Sbjct 13 LHLMFLLYVMFGGFIAWRWPRTIALHLVAVGWGMASVVVGLECPLTDAENWARRRAGEAG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASW 96
L P GF+ +Y+TGVIYP + + ++ A+V SW
Sbjct 73 LPPTGFIDNYLTGVIYPDSALGLVRGLVAALVVVSW 108
>gi|296171230|ref|ZP_06852634.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295894198|gb|EFG73955.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=151
Score = 83.6 bits (205), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 63/98 (65%), Gaps = 0/98 (0%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H A++AYV AGGFLALRW RT+ LH AV WG+ + A ++ CPLT +E WAR +A M PL
Sbjct 25 HLAYLAYVPAGGFLALRWPRTIVLHRAAVAWGVAVVALKLPCPLTALEAWARRRADMDPL 84
Query 62 SPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLY 99
GFV Y+ G+ P+G V AQ F A SW+ +
Sbjct 85 PTTGFVDRYVAGLFVPSGRVGVAQAFAFLSAAVSWSAF 122
>gi|33596019|ref|NP_883662.1| hypothetical protein BPP1360 [Bordetella parapertussis 12822]
gi|33601406|ref|NP_888966.1| hypothetical protein BB2426 [Bordetella bronchiseptica RB50]
gi|33573022|emb|CAE36662.1| putative membrane protein [Bordetella parapertussis]
gi|33575842|emb|CAE32920.1| putative membrane protein [Bordetella bronchiseptica RB50]
Length=125
Score = 80.5 bits (197), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 44/98 (45%), Positives = 55/98 (57%), Gaps = 1/98 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H F+A+ + GG LALRW R WLHVPA +WG GI CPLT +E R +A
Sbjct 13 LHALFVAFAVLGGLLALRWPRLAWLHVPAALWGAGIMLAGAVCPLTPLENHWRAQAGQQG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTL 98
DGF+AHY+ IYP G AQ V+ A+ A L
Sbjct 73 YE-DGFIAHYLLAAIYPEGLSRTAQTVLGALALAGNAL 109
>gi|54024522|ref|YP_118764.1| hypothetical protein nfa25530 [Nocardia farcinica IFM 10152]
gi|54016030|dbj|BAD57400.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=124
Score = 79.0 bits (193), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/106 (49%), Positives = 60/106 (57%), Gaps = 4/106 (3%)
Query 2 HFAFIAYVLAGGFLALRWRR--TMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMT 59
HFAF+ YV GGFLA W R W H+ AV WG G DCPLT VE WAR +A
Sbjct 14 HFAFVGYVALGGFLA--WSRPWAFWPHLAAVGWGFGTVLIGYDCPLTHVENWARRRAGDA 71
Query 60 PLSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
L GF+ HY+TGVIYP V +L++ +V ASW L RR
Sbjct 72 ELPATGFIDHYLTGVIYPEDAVDLVRLLVAVVVVASWAGALVTIRR 117
>gi|218890414|ref|YP_002439278.1| hypothetical protein PLES_16741 [Pseudomonas aeruginosa LESB58]
gi|254236345|ref|ZP_04929668.1| conserved hypothetical protein [Pseudomonas aeruginosa C3719]
gi|254242072|ref|ZP_04935394.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192]
gi|126168276|gb|EAZ53787.1| conserved hypothetical protein [Pseudomonas aeruginosa C3719]
gi|126195450|gb|EAZ59513.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192]
gi|218770637|emb|CAW26402.1| conserved hypothetical protein [Pseudomonas aeruginosa LESB58]
gi|334843429|gb|EGM22018.1| hypothetical protein PA13_05109 [Pseudomonas aeruginosa 138244]
Length=121
Score = 77.8 bits (190), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 43/92 (47%), Positives = 53/92 (58%), Gaps = 1/92 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H FI +VL GG L LRW R WLH+PAV WG + + CPLT +E R A
Sbjct 13 LHLGFILFVLFGGLLPLRWPRLAWLHLPAVAWGCAVEFLGLPCPLTPLENRLRRAAGDAG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIV 92
S GFV HY+ +IYPAG A Q ++ AIV
Sbjct 73 YS-GGFVEHYLLPLIYPAGLTPAVQWILGAIV 103
>gi|49074174|gb|AAT49385.1| PA3386 [synthetic construct]
Length=122
Score = 77.4 bits (189), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 43/92 (47%), Positives = 53/92 (58%), Gaps = 1/92 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H FI +VL GG L LRW R WLH+PAV WG + + CPLT +E R A
Sbjct 13 LHLGFILFVLFGGLLPLRWPRLAWLHLPAVAWGCAVEFLGLPCPLTPLENRLRRAAGDAG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIV 92
S GFV HY+ +IYPAG A Q ++ AIV
Sbjct 73 YS-GGFVEHYLLPLIYPAGLTPAVQWILGAIV 103
>gi|107102918|ref|ZP_01366836.1| hypothetical protein PaerPA_01003987 [Pseudomonas aeruginosa
PACS2]
Length=121
Score = 77.4 bits (189), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 43/92 (47%), Positives = 53/92 (58%), Gaps = 1/92 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H FI +VL GG L LRW R WLH+PAV WG + + CPLT +E R A
Sbjct 13 LHLGFILFVLFGGLLPLRWPRLAWLHLPAVAWGCAVEFLGLPCPLTPLENRLRRAAGDAG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIV 92
S GFV HY+ +IYPAG A Q ++ AIV
Sbjct 73 YS-GGFVEHYLLPLIYPAGLTPAVQWILGAIV 103
>gi|116051404|ref|YP_789763.1| hypothetical protein PA14_20280 [Pseudomonas aeruginosa UCBPP-PA14]
gi|313108740|ref|ZP_07794734.1| hypothetical protein PA39016_001450018 [Pseudomonas aeruginosa
39016]
gi|115586625|gb|ABJ12640.1| conserved hypothetical protein [Pseudomonas aeruginosa UCBPP-PA14]
gi|310881236|gb|EFQ39830.1| hypothetical protein PA39016_001450018 [Pseudomonas aeruginosa
39016]
Length=121
Score = 77.4 bits (189), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 43/92 (47%), Positives = 53/92 (58%), Gaps = 1/92 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H FI +VL GG L LRW R WLH+PAV WG + + CPLT +E R A
Sbjct 13 LHLGFILFVLFGGLLPLRWPRLAWLHLPAVAWGCAVEFLGLPCPLTPLENRLRRAAGDAG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIV 92
S GFV HY+ +IYPAG A Q ++ AIV
Sbjct 73 YS-GGFVEHYLLPLIYPAGLTPAVQWILGAIV 103
>gi|15598582|ref|NP_252076.1| hypothetical protein PA3386 [Pseudomonas aeruginosa PAO1]
gi|9949522|gb|AAG06774.1|AE004760_2 conserved hypothetical protein [Pseudomonas aeruginosa PAO1]
Length=121
Score = 77.4 bits (189), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 43/92 (47%), Positives = 53/92 (58%), Gaps = 1/92 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H FI +VL GG L LRW R WLH+PAV WG + + CPLT +E R A
Sbjct 13 LHLGFILFVLFGGLLPLRWPRLAWLHLPAVAWGCAVEFLGLPCPLTPLENRLRRAAGDAG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIV 92
S GFV HY+ +IYPAG A Q ++ AIV
Sbjct 73 YS-GGFVEHYLLPLIYPAGLTPAVQWILGAIV 103
>gi|229592050|ref|YP_002874169.1| hypothetical protein PFLU4648 [Pseudomonas fluorescens SBW25]
gi|229363916|emb|CAY51425.1| putative membrane protein [Pseudomonas fluorescens SBW25]
Length=124
Score = 77.4 bits (189), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 42/93 (46%), Positives = 53/93 (57%), Gaps = 1/93 (1%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H FI +VL GG LAL+WR +WLH+PAV WGI + + CPLT E R A
Sbjct 17 HLTFILFVLFGGLLALKWRPVVWLHLPAVAWGIAVEVFHLPCPLTRWENLFRHLAGQDGY 76
Query 62 SPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAA 94
GF+ HY+ +IYPAG QL + A+V A
Sbjct 77 G-GGFIEHYLLTLIYPAGLTPQIQLGLGALVLA 108
>gi|296388101|ref|ZP_06877576.1| hypothetical protein PaerPAb_08094 [Pseudomonas aeruginosa PAb1]
gi|334844517|gb|EGM23090.1| hypothetical protein PA15_05958 [Pseudomonas aeruginosa 152504]
Length=121
Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 42/92 (46%), Positives = 52/92 (57%), Gaps = 1/92 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H FI +VL GG L LRW R WLH+PAV WG + + CPLT +E R A
Sbjct 13 LHLGFILFVLFGGLLPLRWPRLAWLHLPAVAWGCAVEFLGLPCPLTPLENRLRRAAGDAG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIV 92
S GFV HY+ +IYPAG Q ++ AIV
Sbjct 73 YS-GGFVEHYLLPLIYPAGLTPTVQWILGAIV 103
>gi|77459522|ref|YP_349029.1| hypothetical protein Pfl01_3300 [Pseudomonas fluorescens Pf0-1]
gi|77383525|gb|ABA75038.1| putative membrane protein [Pseudomonas fluorescens Pf0-1]
Length=128
Score = 75.5 bits (184), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/84 (46%), Positives = 48/84 (58%), Gaps = 1/84 (1%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H FI +VL GG L L+W R +WLH+PA WG+ + + CPLT+ E R A T
Sbjct 14 HLLFILFVLFGGLLVLKWPRLVWLHLPAAAWGVAVEVLHLTCPLTYWENLMRHAAGQTEY 73
Query 62 SPDGFVAHYITGVIYPAGWVAAAQ 85
S GF+ HYI +IYPAG Q
Sbjct 74 S-GGFIEHYIWPIIYPAGLTPQIQ 96
>gi|56477664|ref|YP_159253.1| hypothetical protein ebB138 [Aromatoleum aromaticum EbN1]
gi|56313707|emb|CAI08352.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1]
Length=135
Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 43/93 (47%), Positives = 53/93 (57%), Gaps = 1/93 (1%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
HFAFI +V+AGG L LRW R WLH+P V WG GI CPLT +E R A
Sbjct 14 HFAFILFVVAGGALVLRWPRLAWLHMPVVAWGAGIELIGGVCPLTPLENSLRGAAGEAGY 73
Query 62 SPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAA 94
+ +GF+ HY+ +IYPAG A V+ V A
Sbjct 74 T-EGFIEHYLLPLIYPAGLTPAIGTVLGVFVLA 105
>gi|312962508|ref|ZP_07776999.1| hypothetical protein PFWH6_4429 [Pseudomonas fluorescens WH6]
gi|311283435|gb|EFQ62025.1| hypothetical protein PFWH6_4429 [Pseudomonas fluorescens WH6]
Length=124
Score = 74.3 bits (181), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 40/91 (44%), Positives = 51/91 (57%), Gaps = 1/91 (1%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H FI +VL GG LAL+WR +WLH+PA WG+ + + CPLT E R A
Sbjct 17 HLCFILFVLFGGLLALKWRPLIWLHLPAAAWGVAVEVFHLPCPLTRWENLFRHLAGQDGY 76
Query 62 SPDGFVAHYITGVIYPAGWVAAAQLVMFAIV 92
GF+ HYI +IYPAG Q V+ A+V
Sbjct 77 G-GGFIEHYILTLIYPAGLTPQIQWVLGAVV 106
>gi|297560980|ref|YP_003679954.1| hypothetical protein Ndas_2022 [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
gi|296845428|gb|ADH67448.1| conserved hypothetical protein [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
Length=125
Score = 73.9 bits (180), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 40/105 (39%), Positives = 60/105 (58%), Gaps = 1/105 (0%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+HF F+A+V+ GG LA RW R W+H+ + +GI C LT +E W+R +
Sbjct 13 LHFGFLAFVVFGGALAWRWPRMFWVHLAVAAYALGIVIVDWPCFLTEIENWSRARTGRAV 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLWLPRR 105
+ DGF+ Y+TG +YP + ++LVM IVA +W + WL R
Sbjct 73 ME-DGFIDFYLTGNLYPPEHLLTSRLVMAGIVALTWAVAGWLVLR 116
>gi|121583245|ref|YP_973681.1| hypothetical protein Pnap_4875 [Polaromonas naphthalenivorans
CJ2]
gi|120596503|gb|ABM39939.1| conserved hypothetical protein [Polaromonas naphthalenivorans
CJ2]
Length=142
Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/103 (40%), Positives = 58/103 (57%), Gaps = 2/103 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H FI + L GG LA+RWR +H+PA+ W + CPLT +E R +A +
Sbjct 13 LHLVFIVFALFGGALAIRWRWMPLVHLPAMAWAFFVELTGRLCPLTSIENGFRVRAGQSG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFA-IVAASWTLYLWL 102
+ FV HY+ GVIYP+G Q V+ A +VA + +YLWL
Sbjct 73 YAAS-FVEHYLLGVIYPSGLTREVQFVLAAMVVAINIAIYLWL 114
>gi|104781510|ref|YP_608008.1| hypothetical protein PSEEN2397 [Pseudomonas entomophila L48]
gi|95110497|emb|CAK15205.1| conserved hypothetical protein [Pseudomonas entomophila L48]
Length=124
Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/85 (45%), Positives = 50/85 (59%), Gaps = 1/85 (1%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H +FI VL GG LAL+ RR +++H+PA+ WG+ + ++CPLT E R A +
Sbjct 14 HLSFILLVLFGGLLALKHRRALFIHLPALAWGLAVEGLHLECPLTAWENRLRAAAGDSG- 72
Query 62 SPDGFVAHYITGVIYPAGWVAAAQL 86
P GFV HYI VIYP G QL
Sbjct 73 YPGGFVEHYIWPVIYPTGLTPQIQL 97
>gi|74317131|ref|YP_314871.1| hypothetical protein Tbd_1113 [Thiobacillus denitrificans ATCC
25259]
gi|74056626|gb|AAZ97066.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC
25259]
Length=123
Score = 70.5 bits (171), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 35/78 (45%), Positives = 49/78 (63%), Gaps = 1/78 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H F+A+V+ GGFL +RW R +WLH+PA +WG I CPLT +E R +A +
Sbjct 12 VHLLFVAFVVLGGFLLVRWPRLLWLHLPAAVWGAYIEFSGGICPLTPLENRLRVRAGESA 71
Query 61 LSPDGFVAHYITGVIYPA 78
+ D FV HY+ V+YPA
Sbjct 72 YAGD-FVEHYLLPVLYPA 88
>gi|237653420|ref|YP_002889734.1| hypothetical protein Tmz1t_2757 [Thauera sp. MZ1T]
gi|237624667|gb|ACR01357.1| conserved hypothetical protein [Thauera sp. MZ1T]
Length=134
Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 42/106 (40%), Positives = 57/106 (54%), Gaps = 2/106 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H FIA+ + GG LALRWR WLH+PAV W I A CPLT +E R A +
Sbjct 13 LHLGFIAFAVLGGLLALRWRLAPWLHLPAVAWAAWIEASGGLCPLTPLENRLRVLAGQSG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFA-IVAASWTLYLWLPRR 105
+ GF+ HY+ +IYPA Q + +V + +Y W+ RR
Sbjct 73 YA-GGFIEHYVLPLIYPAALTRELQFALAGFVVLVNLAVYAWVWRR 117
>gi|302038964|ref|YP_003799286.1| hypothetical protein NIDE3683 [Candidatus Nitrospira defluvii]
gi|300607028|emb|CBK43361.1| conserved protein of unknown function [Candidatus Nitrospira
defluvii]
Length=123
Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/100 (37%), Positives = 56/100 (56%), Gaps = 2/100 (2%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H AF+ +V+ GG L L+W R WLH+PA IWG + CPLT +E R A +
Sbjct 13 LHLAFVLFVVTGGLLLLKWPRLAWLHLPAAIWGAIVEYTDWICPLTPIENTLRAMARESA 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVM-FAIVAASWTLY 99
D F+ HY+ ++YPAG + Q ++ +V + T+Y
Sbjct 73 YGSD-FIGHYLLPLMYPAGLTSHIQFLLGTTVVLVNLTIY 111
>gi|330966201|gb|EGH66461.1| hypothetical protein PSYAC_16461 [Pseudomonas syringae pv. actinidiae
str. M302091]
Length=127
Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/85 (44%), Positives = 46/85 (55%), Gaps = 1/85 (1%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H FI +VL GG L LR LHVPA+IWG + + CPLT +E R+KA
Sbjct 16 HLLFIVFVLFGGLLVLRRPCLAMLHVPAIIWGTAVEFLHLYCPLTPLENALRSKAGEQGY 75
Query 62 SPDGFVAHYITGVIYPAGWVAAAQL 86
GF+ HY+ +IYPAG QL
Sbjct 76 D-GGFIEHYLIPLIYPAGLTPGVQL 99
>gi|78223775|ref|YP_385522.1| hypothetical protein Gmet_2578 [Geobacter metallireducens GS-15]
gi|78195030|gb|ABB32797.1| conserved hypothetical protein [Geobacter metallireducens GS-15]
Length=119
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/106 (41%), Positives = 59/106 (56%), Gaps = 2/106 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H F+ +V+ GG + LRW R WLHVPA IWG I CPLT++E R A T
Sbjct 13 VHGLFVLFVVLGGLVVLRWPRLAWLHVPAAIWGAMIELSGKVCPLTYLEVRFRRLAGETG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAA-SWTLYLWLPRR 105
+ + F+ HYI +IYP G Q+ + +VAA + +Y L RR
Sbjct 73 YT-ESFIEHYIVPLIYPPGLTRGWQIAVGLLVAALNLAVYGVLHRR 117
>gi|113869532|ref|YP_728021.1| hypothetical protein H16_A3595 [Ralstonia eutropha H16]
gi|113528308|emb|CAJ94653.1| conserved hypothetical protein [Ralstonia eutropha H16]
Length=126
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/86 (47%), Positives = 49/86 (57%), Gaps = 1/86 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H FI +V+AGG L LRW R WLH+PA +WG+ I CPLT +E R A
Sbjct 14 VHGLFILFVVAGGLLVLRWPRVAWLHLPAAVWGVLIEWSGWICPLTPLENMLRRAAGQAG 73
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQL 86
S GFV Y+ +IYPAG A QL
Sbjct 74 YS-GGFVERYLLPLIYPAGLTPAVQL 98
>gi|339327625|ref|YP_004687318.1| hypothetical protein CNE_1c35440 [Cupriavidus necator N-1]
gi|338167782|gb|AEI78837.1| hypothetical protein CNE_1c35440 [Cupriavidus necator N-1]
Length=122
Score = 67.4 bits (163), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 40/86 (47%), Positives = 48/86 (56%), Gaps = 1/86 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H FI +V+AGG L LRW R WLH+PA WG+ I CPLT +E R A
Sbjct 14 VHGLFILFVVAGGLLVLRWPRVAWLHLPAAAWGVLIEWSGWICPLTPLENTLRQAAGQAG 73
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQL 86
S GFV Y+ +IYPAG A QL
Sbjct 74 YS-GGFVERYLLPLIYPAGLTPAVQL 98
>gi|334112323|ref|ZP_08486586.1| hypothetical protein MetalDRAFT_3311 [Methylomicrobium album
BG8]
gi|333597680|gb|EGL02513.1| hypothetical protein MetalDRAFT_3311 [Methylomicrobium album
BG8]
Length=121
Score = 67.4 bits (163), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 57/100 (57%), Gaps = 3/100 (3%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H AFI +V+ GG L L++ R +W+H+PA IWG I CPLT +E R ++A
Sbjct 13 IHLAFILFVIFGGLLVLKYPRMIWVHLPAAIWGALIEFAGWLCPLTLLEN--RLRSASGD 70
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVM-FAIVAASWTLY 99
GF+ HY+ +IYP+ Q+V+ F ++ + +Y
Sbjct 71 HYAGGFIEHYLVPIIYPSMLTPNMQIVLGFGVILLNLAVY 110
>gi|73542968|ref|YP_297488.1| hypothetical protein Reut_A3285 [Ralstonia eutropha JMP134]
gi|72120381|gb|AAZ62644.1| conserved hypothetical protein [Ralstonia eutropha JMP134]
Length=120
Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/92 (43%), Positives = 49/92 (54%), Gaps = 1/92 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+HF FIA+V+ G L RW R W+H+PA WG+ + CPLT +E R A
Sbjct 12 VHFLFIAFVVGGALLLFRWPRVAWVHLPAAAWGVFVEWSGRMCPLTPLENALRRTAGEAG 71
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIV 92
GFV Y+ VIYPAG A QL + A V
Sbjct 72 YG-GGFVERYLLPVIYPAGLTPAIQLWLGAFV 102
>gi|330957602|gb|EGH57862.1| hypothetical protein PMA4326_03369 [Pseudomonas syringae pv.
maculicola str. ES4326]
Length=125
Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/85 (42%), Positives = 45/85 (53%), Gaps = 1/85 (1%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H FI +VL GG L LR LH+PA+ WG + + CPLT +E R++A
Sbjct 14 HLVFILFVLFGGLLVLRKPGLALLHIPAIAWGTAVEFLHLYCPLTPLENALRSQAGEQGY 73
Query 62 SPDGFVAHYITGVIYPAGWVAAAQL 86
GFV HY+ +IYPAG QL
Sbjct 74 D-GGFVEHYLIPLIYPAGLTPDIQL 97
>gi|89901879|ref|YP_524350.1| hypothetical protein Rfer_3110 [Rhodoferax ferrireducens T118]
gi|89346616|gb|ABD70819.1| conserved hypothetical protein [Rhodoferax ferrireducens T118]
Length=120
Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/103 (35%), Positives = 54/103 (53%), Gaps = 2/103 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
H +FI + L GG LA RWR +H+P W I CPLT++E R +A +
Sbjct 13 FHLSFILFALFGGALAARWRWMPLVHLPVAAWAFFIELTGRICPLTYLENDLRVRAGQSG 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVM-FAIVAASWTLYLWL 102
+ + F+ HY+ VIYP+G Q V+ +V + +Y+WL
Sbjct 73 YA-ESFIEHYLLDVIYPSGLTREVQFVLATTVVVVNIAIYVWL 114
>gi|256380915|ref|YP_003104575.1| hypothetical protein Amir_6934 [Actinosynnema mirum DSM 43827]
gi|255925218|gb|ACU40729.1| hypothetical protein Amir_6934 [Actinosynnema mirum DSM 43827]
Length=129
Score = 65.1 bits (157), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 35/96 (37%), Positives = 54/96 (57%), Gaps = 1/96 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H+A + +++ GGFLA RWR ++ HV WG+ + +DCPLT +E R P
Sbjct 13 LHYAALVFLVVGGFLARRWRWLVYPHVAMAAWGLSVVVFALDCPLTMLENELRVLGGEPP 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASW 96
L+ GF+ YI GV+YP Q+++ +V SW
Sbjct 73 LT-RGFIDTYIDGVLYPESLATPVQVLVAVLVLGSW 107
>gi|322421784|ref|YP_004201007.1| hypothetical protein GM18_4319 [Geobacter sp. M18]
gi|320128171|gb|ADW15731.1| Protein of unknown function DUF2784 [Geobacter sp. M18]
Length=130
Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 35/101 (35%), Positives = 55/101 (55%), Gaps = 2/101 (1%)
Query 1 MHFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTP 60
+H F+ +V+ GG L LRW + W+H+PA +WG+ I CPLT +E R + T
Sbjct 13 VHTLFVLFVIGGGLLVLRWPKMAWVHLPAALWGVLIEFGGWLCPLTNLENHWRRQGGGTA 72
Query 61 LSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLW 101
S D F+ Y+ ++YP G Q V+ ++A S L ++
Sbjct 73 YS-DSFIQQYLEPLLYPLGLSPQRQ-VLLGLLALSINLVIY 111
>gi|302188444|ref|ZP_07265117.1| hypothetical protein Psyrps6_18945 [Pseudomonas syringae pv.
syringae 642]
Length=121
Score = 64.3 bits (155), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 36/85 (43%), Positives = 45/85 (53%), Gaps = 1/85 (1%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H FI +VL GG L LR LHVPA WG + + CPLT +E R+ A +
Sbjct 10 HLLFIVFVLFGGLLVLRKPWLALLHVPAAAWGTAVEFLHLYCPLTPLENTLRSNAGVQGY 69
Query 62 SPDGFVAHYITGVIYPAGWVAAAQL 86
+ GFV HY+ +IYPAG QL
Sbjct 70 T-GGFVEHYLIPLIYPAGLTPGLQL 93
>gi|194291124|ref|YP_002007031.1| hypothetical protein RALTA_A3049 [Cupriavidus taiwanensis LMG
19424]
gi|193224959|emb|CAQ70970.1| conserved hypothetical protein; putative membrane protein [Cupriavidus
taiwanensis LMG 19424]
Length=123
Score = 63.9 bits (154), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/78 (47%), Positives = 45/78 (58%), Gaps = 1/78 (1%)
Query 2 HFAFIAYVLAGGFLALRWRRTMWLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMTPL 61
H FI +V+AGG L LRW R W+H+PA +WG+ I CPLT +E R A
Sbjct 15 HALFIVFVVAGGLLVLRWPRAAWVHLPAAVWGVLIEWAGWICPLTPLENSLRQAAGQAGY 74
Query 62 SPDGFVAHYITGVIYPAG 79
S GFV Y+ +IYPAG
Sbjct 75 S-GGFVERYLLPLIYPAG 91
>gi|343502383|ref|ZP_08740238.1| hypothetical protein VITU9109_05645 [Vibrio tubiashii ATCC 19109]
gi|342814806|gb|EGU49740.1| hypothetical protein VITU9109_05645 [Vibrio tubiashii ATCC 19109]
Length=127
Score = 63.9 bits (154), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 38/102 (38%), Positives = 55/102 (54%), Gaps = 4/102 (3%)
Query 1 MHFAFIAYVLAGGFLALRWRRTM-WLHVPAVIWGIGIAAKRVDCPLTWVERWARTKAAMT 59
+H FI + L GG L L WRR M ++H+PA +W + I+ CPLT +E R A +
Sbjct 13 LHLLFIIFALLGGMLVL-WRRYMLYVHIPAALWAVAISVNGWICPLTPLENKLRFSAGVE 71
Query 60 PLSPDGFVAHYITGVIYPAGWVAAAQLVMFAIVAASWTLYLW 101
P GFV HY+ +IYP G + Q V +VA + ++
Sbjct 72 G-YPGGFVEHYVVPIIYPPGLSSDIQ-VFLGVVAVGINIVVY 111
Lambda K H
0.332 0.140 0.512
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 131458853568
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40