BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2295
Length=212
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609432|ref|NP_216811.1| hypothetical protein Rv2295 [Mycoba... 438 2e-121
gi|15841786|ref|NP_336823.1| hypothetical protein MT2352 [Mycoba... 386 9e-106
gi|326445321|ref|ZP_08220055.1| hypothetical protein SclaA2_2984... 196 2e-48
gi|254389662|ref|ZP_05004887.1| conserved hypothetical protein [... 196 3e-48
gi|78059750|ref|YP_366325.1| hypothetical protein Bcep18194_C663... 170 1e-40
gi|297199656|ref|ZP_06917053.1| conserved hypothetical protein [... 163 1e-38
gi|94968131|ref|YP_590179.1| hypothetical protein Acid345_1102 [... 155 4e-36
gi|29827112|ref|NP_821746.1| hypothetical protein SAV_571 [Strep... 154 6e-36
gi|294811295|ref|ZP_06769938.1| Hypothetical protein SCLAV_0461 ... 154 1e-35
gi|326439758|ref|ZP_08214492.1| hypothetical protein SclaA2_0178... 153 1e-35
gi|323177728|gb|EFZ63312.1| hypothetical protein ECOK1180_3306 [... 136 2e-30
gi|260870448|ref|YP_003236850.1| hypothetical protein ECO111_454... 136 2e-30
gi|323934932|gb|EGB31310.1| cbrC [Escherichia coli E1520] 135 4e-30
gi|191165780|ref|ZP_03027618.1| conserved hypothetical protein [... 135 4e-30
gi|157163198|ref|YP_001460516.1| hypothetical protein EcHS_A3931... 135 4e-30
gi|15596733|ref|NP_250227.1| hypothetical protein PA1536 [Pseudo... 135 4e-30
gi|49081776|gb|AAT50288.1| PA1536 [synthetic construct] 135 4e-30
gi|157155247|ref|YP_001465201.1| hypothetical protein EcE24377A_... 135 5e-30
gi|317054997|ref|YP_004103464.1| hypothetical protein Rumal_0273... 135 5e-30
gi|16131585|ref|NP_418173.1| conserved protein, UPF0167 family [... 134 7e-30
gi|313106541|ref|ZP_07792769.1| hypothetical protein PA39016_000... 134 8e-30
gi|340732372|gb|EGR61510.1| hypothetical protein HUSEC41_20735 [... 134 1e-29
gi|345347147|gb|EGW79461.1| hypothetical protein ECSTEC94C_4457 ... 133 1e-29
gi|116049480|ref|YP_791717.1| hypothetical protein PA14_44580 [P... 133 2e-29
gi|218551250|ref|YP_002385042.1| hypothetical protein EFER_4015 ... 132 2e-29
gi|261823659|ref|YP_003261765.1| hypothetical protein Pecwa_4466... 131 6e-29
gi|153831830|ref|ZP_01984497.1| conserved hypothetical protein [... 131 6e-29
gi|281180775|dbj|BAI57105.1| conserved hypothetical protein [Esc... 131 8e-29
gi|157693409|ref|YP_001487871.1| hypothetical protein BPUM_2653 ... 130 9e-29
gi|332996018|gb|EGK15645.1| hypothetical protein SFVA6_4679 [Shi... 130 9e-29
gi|194017873|ref|ZP_03056482.1| protein YieJ [Bacillus pumilus A... 130 1e-28
gi|340752053|ref|ZP_08688863.1| hypothetical protein FMAG_01631 ... 129 2e-28
gi|160939502|ref|ZP_02086852.1| hypothetical protein CLOBOL_0439... 129 2e-28
gi|325680170|ref|ZP_08159735.1| hypothetical protein CUS_5950 [R... 129 3e-28
gi|239624306|ref|ZP_04667337.1| protein YieJ [Clostridiales bact... 129 4e-28
gi|153831823|ref|ZP_01984490.1| conserved hypothetical protein [... 128 4e-28
gi|170766744|ref|ZP_02901197.1| protein YieJ [Escherichia albert... 128 5e-28
gi|332655034|ref|ZP_08420775.1| conserved hypothetical protein [... 127 1e-27
gi|167770051|ref|ZP_02442104.1| hypothetical protein ANACOL_0139... 127 1e-27
gi|295115447|emb|CBL36294.1| Uncharacterized protein conserved i... 127 1e-27
gi|266621339|ref|ZP_06114274.1| conserved hypothetical protein [... 127 1e-27
gi|295089896|emb|CBK76003.1| Uncharacterized protein conserved i... 127 1e-27
gi|283795284|ref|ZP_06344437.1| conserved hypothetical protein [... 127 1e-27
gi|295102676|emb|CBL00221.1| Uncharacterized protein conserved i... 126 2e-27
gi|336429235|ref|ZP_08609203.1| hypothetical protein HMPREF0994_... 126 2e-27
gi|223985039|ref|ZP_03635137.1| hypothetical protein HOLDEFILI_0... 126 2e-27
gi|295100208|emb|CBK97753.1| Uncharacterized protein conserved i... 126 3e-27
gi|152987810|ref|YP_001349151.1| hypothetical protein PSPA7_3797... 126 3e-27
gi|124007367|ref|ZP_01692074.1| conserved hypothetical protein [... 125 3e-27
gi|254038935|ref|ZP_04872987.1| conserved hypothetical protein [... 125 3e-27
>gi|15609432|ref|NP_216811.1| hypothetical protein Rv2295 [Mycobacterium tuberculosis H37Rv]
gi|31793473|ref|NP_855966.1| hypothetical protein Mb2317 [Mycobacterium bovis AF2122/97]
gi|121638176|ref|YP_978400.1| hypothetical protein BCG_2311 [Mycobacterium bovis BCG str. Pasteur
1173P2]
53 more sequence titles
Length=212
Score = 438 bits (1127), Expect = 2e-121, Method: Compositional matrix adjust.
Identities = 212/212 (100%), Positives = 212/212 (100%), Gaps = 0/212 (0%)
Query 1 MDQSANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSIVADEVSCV 60
MDQSANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSIVADEVSCV
Sbjct 1 MDQSANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSIVADEVSCV 60
Query 61 SCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDAMWAVPDDVPEDVTEE 120
SCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDAMWAVPDDVPEDVTEE
Sbjct 61 SCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDAMWAVPDDVPEDVTEE 120
Query 121 VLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIE 180
VLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIE
Sbjct 121 VLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIE 180
Query 181 EFILTLDRNGLATAYLFRCLSCGVHLAYADFA 212
EFILTLDRNGLATAYLFRCLSCGVHLAYADFA
Sbjct 181 EFILTLDRNGLATAYLFRCLSCGVHLAYADFA 212
>gi|15841786|ref|NP_336823.1| hypothetical protein MT2352 [Mycobacterium tuberculosis CDC1551]
gi|308232091|ref|ZP_07414885.2| hypothetical protein TMAG_00483 [Mycobacterium tuberculosis SUMu001]
gi|308369680|ref|ZP_07418664.2| hypothetical protein TMBG_00840 [Mycobacterium tuberculosis SUMu002]
26 more sequence titles
Length=187
Score = 386 bits (992), Expect = 9e-106, Method: Compositional matrix adjust.
Identities = 187/187 (100%), Positives = 187/187 (100%), Gaps = 0/187 (0%)
Query 26 MPVEETSTPQKLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAIC 85
MPVEETSTPQKLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAIC
Sbjct 1 MPVEETSTPQKLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAIC 60
Query 86 PWCIADGSAASRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAA 145
PWCIADGSAASRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAA
Sbjct 61 PWCIADGSAASRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAA 120
Query 146 FLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVH 205
FLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVH
Sbjct 121 FLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVH 180
Query 206 LAYADFA 212
LAYADFA
Sbjct 181 LAYADFA 187
>gi|326445321|ref|ZP_08220055.1| hypothetical protein SclaA2_29842 [Streptomyces clavuligerus
ATCC 27064]
Length=218
Score = 196 bits (497), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 95/185 (52%), Positives = 117/185 (64%), Gaps = 1/185 (0%)
Query 28 VEETSTPQKLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAICPW 87
+ + + LP+F YHPDPV TG +V CV C + R + YTGPV+AE +L +CPW
Sbjct 35 LRSAAVSEALPEFPYHPDPVATGVVVPSPAVCVCCGRARGHLYTGPVHAEADLGRGLCPW 94
Query 88 CIADGSAASRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFL 147
CIADGSAA RFDA+FTD + +DVP DV V RTPGF W +W HCGD AFL
Sbjct 95 CIADGSAAGRFDASFTDGS-ILGEDVPLDVFSAVDRRTPGFRAWQAVQWFFHCGDGTAFL 153
Query 148 GPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLA 207
G G E+A PDAL LR + G+ WP +IE + TL A+AYLFRC CG+HLA
Sbjct 154 GEAGPDELAAHPDALGQLRRKASGWGWPPGQIEHHLNTLGTGSSASAYLFRCRHCGIHLA 213
Query 208 YADFA 212
Y+DFA
Sbjct 214 YSDFA 218
>gi|254389662|ref|ZP_05004887.1| conserved hypothetical protein [Streptomyces clavuligerus ATCC
27064]
gi|197703374|gb|EDY49186.1| conserved hypothetical protein [Streptomyces clavuligerus ATCC
27064]
Length=206
Score = 196 bits (497), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 95/185 (52%), Positives = 117/185 (64%), Gaps = 1/185 (0%)
Query 28 VEETSTPQKLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAICPW 87
+ + + LP+F YHPDPV TG +V CV C + R + YTGPV+AE +L +CPW
Sbjct 23 LRSAAVSEALPEFPYHPDPVATGVVVPSPAVCVCCGRARGHLYTGPVHAEADLGRGLCPW 82
Query 88 CIADGSAASRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFL 147
CIADGSAA RFDA+FTD + +DVP DV V RTPGF W +W HCGD AFL
Sbjct 83 CIADGSAAGRFDASFTDGS-ILGEDVPLDVFSAVDRRTPGFRAWQAVQWFFHCGDGTAFL 141
Query 148 GPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLA 207
G G E+A PDAL LR + G+ WP +IE + TL A+AYLFRC CG+HLA
Sbjct 142 GEAGPDELAAHPDALGQLRRKASGWGWPPGQIEHHLNTLGTGSSASAYLFRCRHCGIHLA 201
Query 208 YADFA 212
Y+DFA
Sbjct 202 YSDFA 206
>gi|78059750|ref|YP_366325.1| hypothetical protein Bcep18194_C6631 [Burkholderia sp. 383]
gi|77964300|gb|ABB05681.1| protein of unknown function UPF0167 [Burkholderia sp. 383]
Length=180
Score = 170 bits (430), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 90/182 (50%), Positives = 110/182 (61%), Gaps = 12/182 (6%)
Query 36 KLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAA 95
LP FRYHPDP+ TGS + + C C R Y Y GPVYA +E + ICPWCIADGSA
Sbjct 2 SLPAFRYHPDPLATGSAIRSDARCACCGVARGYVYAGPVYAVDEYEQCICPWCIADGSAH 61
Query 96 SRFDATFTD------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGP 149
+RFDA FTD W D+VP+ V +E+ CRTPGF GW QE W HCGD F+G
Sbjct 62 ARFDAIFTDTDGIGGGEW---DEVPDAVVDEIACRTPGFQGWQQERWWTHCGDGGQFIGR 118
Query 150 VGASEVADL-PDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAY 208
GA E+ L P A+ ++R E G D A + E F LD++G TAY+FRC+ CG Y
Sbjct 119 AGAGELTTLGPQAVASIR-ESAGLDEGA-EWERFFAALDKDGSPTAYMFRCIHCGELGGY 176
Query 209 AD 210
D
Sbjct 177 QD 178
>gi|297199656|ref|ZP_06917053.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|197713974|gb|EDY58008.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length=218
Score = 163 bits (413), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 86/175 (50%), Positives = 109/175 (63%), Gaps = 5/175 (2%)
Query 37 LPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAAS 96
LP FRYHPDPV +GSI +CV CE+ + YT Y ++++ CPWCIADGSAA+
Sbjct 46 LPVFRYHPDPVASGSIREGAETCVCCERSTGWIYTATFYTAQDVDGQFCPWCIADGSAAA 105
Query 97 RFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVA 156
RF+ FTD+ V E++ E V RTPGF W WL HC DAAAF+G VG +E+A
Sbjct 106 RFEGEFTDSYGLA--GVSEEILEHVTRRTPGFHAWQDPHWLVHCDDAAAFVGEVGHTELA 163
Query 157 DLPDALDALRNEYRGYDWP-ADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYAD 210
P+ALD LR + R W A ++E F+ L + A+A LFRC CG HLAYAD
Sbjct 164 AHPEALDQLRTDLRLGGWHDASQLESFLTHLGQG--ASAMLFRCTVCGTHLAYAD 216
>gi|94968131|ref|YP_590179.1| hypothetical protein Acid345_1102 [Candidatus Koribacter versatilis
Ellin345]
gi|94550181|gb|ABF40105.1| conserved hypothetical protein [Candidatus Koribacter versatilis
Ellin345]
Length=180
Score = 155 bits (392), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 84/178 (48%), Positives = 107/178 (61%), Gaps = 4/178 (2%)
Query 37 LPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAE-EELNEAICPWCIADGSAA 95
LP FRYHPDPV +G++V E +CV C+++R Y YT VYAE ++L A+CPWCIADGSA
Sbjct 3 LPNFRYHPDPVKSGNLVVSEETCVCCDKKRGYIYTVSVYAESDDLENALCPWCIADGSAH 62
Query 96 SRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEV 155
+FDA+F D + D++P +E+L RT GF GW E+WL C DA AFL PVG EV
Sbjct 63 RKFDASFVDDP-GLADEIPNSARQEILYRTLGFAGWQSEQWLACCDDAMAFLEPVGIVEV 121
Query 156 -ADLPDALDALRNEY-RGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
D P L +E ++ E + +L R TA FRCL CG H AY D
Sbjct 122 RRDYPKLEGTLMHEIVHEWERSGGAANELLNSLHREHGPTANAFRCLHCGEHKAYIDI 179
>gi|29827112|ref|NP_821746.1| hypothetical protein SAV_571 [Streptomyces avermitilis MA-4680]
gi|29604210|dbj|BAC68281.1| hypothetical protein [Streptomyces avermitilis MA-4680]
Length=268
Score = 154 bits (390), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 83/193 (44%), Positives = 110/193 (57%), Gaps = 6/193 (3%)
Query 20 RGQDHEM-PVEETSTPQKLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEE 78
RG+D + ++ LP FRYHPDPV +GSI C C + + YT Y
Sbjct 78 RGRDGPLFAAAGSAVSVSLPHFRYHPDPVASGSIGESAEVCACCNRSTGWIYTATFYTAH 137
Query 79 ELNEAICPWCIADGSAASRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLH 138
+++ + CPWCIADG+AA RF+ FTD D + ++ +V RTPG W WL
Sbjct 138 DVSGSFCPWCIADGTAAERFEGEFTDPYGL--DGISQETLVQVTRRTPGLHAWQDPHWLV 195
Query 139 HCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWP-ADKIEEFILTLDRNGLATAYLF 197
HC DAAAF+G VG +E+A P+ALD LR + R W A ++E F+ L + A+A LF
Sbjct 196 HCNDAAAFIGEVGYTELAAHPEALDQLRLDLRMGGWNDATQLEHFLTHLGQG--ASAMLF 253
Query 198 RCLSCGVHLAYAD 210
RC CG HLAYAD
Sbjct 254 RCTVCGTHLAYAD 266
>gi|294811295|ref|ZP_06769938.1| Hypothetical protein SCLAV_0461 [Streptomyces clavuligerus ATCC
27064]
gi|294323894|gb|EFG05537.1| Hypothetical protein SCLAV_0461 [Streptomyces clavuligerus ATCC
27064]
Length=190
Score = 154 bits (388), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 91/179 (51%), Positives = 115/179 (65%), Gaps = 2/179 (1%)
Query 35 QKLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEE-LNEAICPWCIADGS 93
+ LP F YHPDPV TG++V + C C + R + Y GPVYA L+ +CPWC+ADGS
Sbjct 13 EPLPPFPYHPDPVATGAVVPSDAVCAHCGRARGHVYAGPVYAGTPGLSGRLCPWCVADGS 72
Query 94 AASRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGAS 153
AA+ ++A FT + D++P DV V RTP FT W Q W HCGDAAAFLG GA+
Sbjct 73 AAAAYEAHFTSGE-VLGDEIPFDVLLAVDTRTPSFTAWQQTVWYAHCGDAAAFLGAAGAA 131
Query 154 EVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYADFA 212
E+A PDAL LR + + WP D++E + +L R+G TAYLFRC C HLAYADFA
Sbjct 132 ELAAFPDALRLLRAQADAWGWPDDQVEHHLASLHRDGDPTAYLFRCRHCATHLAYADFA 190
>gi|326439758|ref|ZP_08214492.1| hypothetical protein SclaA2_01780 [Streptomyces clavuligerus
ATCC 27064]
Length=183
Score = 153 bits (387), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 91/179 (51%), Positives = 115/179 (65%), Gaps = 2/179 (1%)
Query 35 QKLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEE-LNEAICPWCIADGS 93
+ LP F YHPDPV TG++V + C C + R + Y GPVYA L+ +CPWC+ADGS
Sbjct 6 EPLPPFPYHPDPVATGAVVPSDAVCAHCGRARGHVYAGPVYAGTPGLSGRLCPWCVADGS 65
Query 94 AASRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGAS 153
AA+ ++A FT + D++P DV V RTP FT W Q W HCGDAAAFLG GA+
Sbjct 66 AAAAYEAHFTSGE-VLGDEIPFDVLLAVDTRTPSFTAWQQTVWYAHCGDAAAFLGAAGAA 124
Query 154 EVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYADFA 212
E+A PDAL LR + + WP D++E + +L R+G TAYLFRC C HLAYADFA
Sbjct 125 ELAAFPDALRLLRAQADAWGWPDDQVEHHLASLHRDGDPTAYLFRCRHCATHLAYADFA 183
>gi|323177728|gb|EFZ63312.1| hypothetical protein ECOK1180_3306 [Escherichia coli 1180]
gi|345331288|gb|EGW63748.1| hypothetical protein EC253486_4805 [Escherichia coli 2534-86]
Length=194
Score = 136 bits (343), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/197 (38%), Positives = 105/197 (54%), Gaps = 19/197 (9%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LPQF+YHP P+ TG+ D+ V C CEQ+ Y+GP Y +E+ E +CPWCI
Sbjct 2 TQNIRPLPQFKYHPKPLETGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEV-EHLCPWCI 60
Query 90 ADGSAASRFDATFTD--------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQEE 135
ADGSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFTGSFQDDASIEGVEFEYDEEEFAGIKNTYPDEMLKELVERTPGYHGWQQEF 120
Query 136 WLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAY 195
WL HCGD AF+G VG +++ D D L + + + + L + G Y
Sbjct 121 WLAHCGDFCAFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKGGDCQGY 177
Query 196 LFRCLSCGVHLAYADFA 212
LFRCL CG + DF+
Sbjct 178 LFRCLHCGKLRLWGDFS 194
>gi|260870448|ref|YP_003236850.1| hypothetical protein ECO111_4544 [Escherichia coli O111:H- str.
11128]
gi|257766804|dbj|BAI38299.1| conserved predicted protein [Escherichia coli O111:H- str. 11128]
Length=194
Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/197 (38%), Positives = 105/197 (54%), Gaps = 19/197 (9%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LPQF+YHP P+ TG+ D+ V C CEQ+ Y+GP Y +E+ E +CPWCI
Sbjct 2 TQNIRPLPQFKYHPKPLETGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEV-EHLCPWCI 60
Query 90 ADGSAASRFDATFTD--------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQEE 135
ADGSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFAGSFQDDASIEGVEFEYDEEEFAGIKNTYPDEMLKELVERTPGYHGWQQEF 120
Query 136 WLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAY 195
WL HCGD AF+G VG +++ D D L + + + + L + G Y
Sbjct 121 WLAHCGDFCAFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKGGDCQGY 177
Query 196 LFRCLSCGVHLAYADFA 212
LFRCL CG + DF+
Sbjct 178 LFRCLHCGKLRLWGDFS 194
>gi|323934932|gb|EGB31310.1| cbrC [Escherichia coli E1520]
Length=195
Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/198 (37%), Positives = 105/198 (54%), Gaps = 20/198 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LPQF+YHP P+ TG+ D+ V C CEQ+ Y+GP Y +E+ E +CPWCI
Sbjct 2 TQNIRPLPQFKYHPKPLETGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEV-EHLCPWCI 60
Query 90 ADGSAASRFDATFTD---------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQE 134
ADGSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFAGSFQDDASIEGVEFEYDEEDEFAGIKNTYPDEMLKELVERTPGYHGWQQE 120
Query 135 EWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATA 194
WL HCGD AF+G VG +++ D D L + + + + L + G
Sbjct 121 LWLAHCGDFCAFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKGGDCQG 177
Query 195 YLFRCLSCGVHLAYADFA 212
YLFRCL CG + DF+
Sbjct 178 YLFRCLHCGKLRLWGDFS 195
>gi|191165780|ref|ZP_03027618.1| conserved hypothetical protein [Escherichia coli B7A]
gi|293464040|ref|ZP_06664454.1| cbrC protein [Escherichia coli B088]
gi|300815036|ref|ZP_07095261.1| conserved hypothetical protein [Escherichia coli MS 107-1]
20 more sequence titles
Length=195
Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/198 (37%), Positives = 105/198 (54%), Gaps = 20/198 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LPQF+YHP P+ TG+ D+ V C CEQ+ Y+GP Y +E+ E +CPWCI
Sbjct 2 TQNIRPLPQFKYHPKPLETGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEV-EHLCPWCI 60
Query 90 ADGSAASRFDATFTD---------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQE 134
ADGSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFAGSFQDDASIEGVEFEYDEEDEFAGIKNTYPDEMLKELVERTPGYHGWQQE 120
Query 135 EWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATA 194
WL HCGD AF+G VG +++ D D L + + + + L + G
Sbjct 121 FWLAHCGDFCAFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKRGDCQG 177
Query 195 YLFRCLSCGVHLAYADFA 212
YLFRCL CG + DF+
Sbjct 178 YLFRCLHCGKLRLWGDFS 195
>gi|157163198|ref|YP_001460516.1| hypothetical protein EcHS_A3931 [Escherichia coli HS]
gi|170022246|ref|YP_001727200.1| hypothetical protein EcolC_4277 [Escherichia coli ATCC 8739]
gi|188496451|ref|ZP_03003721.1| conserved hypothetical protein [Escherichia coli 53638]
53 more sequence titles
Length=195
Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/198 (37%), Positives = 105/198 (54%), Gaps = 20/198 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LPQF+YHP P+ TG+ D+ V C CEQ+ Y+GP Y +E+ E +CPWCI
Sbjct 2 TQNIRPLPQFKYHPKPLETGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEV-EHLCPWCI 60
Query 90 ADGSAASRFDATFTD---------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQE 134
ADGSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFAGSFQDDASIEGVEFEYDEEDEFAGIKNTYPDEMLKELVERTPGYHGWQQE 120
Query 135 EWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATA 194
WL HCGD AF+G VG +++ D D L + + + + L + G
Sbjct 121 FWLAHCGDFCAFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKGGDCQG 177
Query 195 YLFRCLSCGVHLAYADFA 212
YLFRCL CG + DF+
Sbjct 178 YLFRCLHCGKLRLWGDFS 195
>gi|15596733|ref|NP_250227.1| hypothetical protein PA1536 [Pseudomonas aeruginosa PAO1]
gi|107100967|ref|ZP_01364885.1| hypothetical protein PaerPA_01001997 [Pseudomonas aeruginosa
PACS2]
gi|218892508|ref|YP_002441375.1| hypothetical protein PLES_37921 [Pseudomonas aeruginosa LESB58]
8 more sequence titles
Length=179
Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 71/168 (43%), Positives = 93/168 (56%), Gaps = 3/168 (1%)
Query 37 LPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNE-AICPWCIADGSAA 95
LP FRYHP+P+ +GSI A +C C + R Y YTG Y+ EL ++CPWCIADGSAA
Sbjct 5 LPHFRYHPEPLASGSIEASATTCQCCGKARGYVYTGSPYSRHELPPGSLCPWCIADGSAA 64
Query 96 SRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEV 155
+R++A+F+D + V D+ EV RTPG+T W QE WL C DA AF G G E+
Sbjct 65 ARYEASFSDDYPLLDAGVAADIVTEVCERTPGYTSWQQERWLVCCEDACAFRGDAGREEI 124
Query 156 ADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCG 203
L + L + + WPA + + G Y F CL CG
Sbjct 125 GQL--GAEGLAQRFADFAWPAITWQRLVDAYTPGGNPAIYRFDCLHCG 170
>gi|49081776|gb|AAT50288.1| PA1536 [synthetic construct]
Length=180
Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 71/168 (43%), Positives = 93/168 (56%), Gaps = 3/168 (1%)
Query 37 LPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNE-AICPWCIADGSAA 95
LP FRYHP+P+ +GSI A +C C + R Y YTG Y+ EL ++CPWCIADGSAA
Sbjct 5 LPHFRYHPEPLASGSIEASATTCQCCGKARGYVYTGSPYSRHELPPGSLCPWCIADGSAA 64
Query 96 SRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEV 155
+R++A+F+D + V D+ EV RTPG+T W QE WL C DA AF G G E+
Sbjct 65 ARYEASFSDDYPLLDAGVAADIVTEVCERTPGYTSWQQERWLVCCEDACAFRGDAGREEI 124
Query 156 ADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCG 203
L + L + + WPA + + G Y F CL CG
Sbjct 125 GQL--GAEGLAQRFADFAWPAITWQRLVDAYTPGGNPAIYRFDCLHCG 170
>gi|157155247|ref|YP_001465201.1| hypothetical protein EcE24377A_4226 [Escherichia coli E24377A]
gi|157077277|gb|ABV16985.1| conserved hypothetical protein [Escherichia coli E24377A]
Length=195
Score = 135 bits (339), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 72/198 (37%), Positives = 105/198 (54%), Gaps = 20/198 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LPQF+YHP P+ TG+ D+ + C CEQ+ Y+GP Y +E+ E +CPWCI
Sbjct 2 TQNIRPLPQFKYHPKPLETGAFEQDKTIECDCCEQQTSVYYSGPFYCVDEV-EHLCPWCI 60
Query 90 ADGSAASRFDATFTD---------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQE 134
ADGSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFAGSFQDDASIEGVEFEYDEEDEFAGIKNTYPDEMLKELVERTPGYHGWQQE 120
Query 135 EWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATA 194
WL HCGD AF+G VG +++ D D L + + + + L + G
Sbjct 121 FWLAHCGDFCAFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKGGDCQG 177
Query 195 YLFRCLSCGVHLAYADFA 212
YLFRCL CG + DF+
Sbjct 178 YLFRCLHCGKLRLWGDFS 195
>gi|317054997|ref|YP_004103464.1| hypothetical protein Rumal_0273 [Ruminococcus albus 7]
gi|315447266|gb|ADU20830.1| protein of unknown function UPF0167 [Ruminococcus albus 7]
Length=180
Score = 135 bits (339), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 77/179 (44%), Positives = 104/179 (59%), Gaps = 8/179 (4%)
Query 35 QKLPQFRYHPDPVGTGSIV-ADEVS-CVSCEQRRPYTYTGPVYAEEELNEAICPWCIADG 92
+ P+FRYHPDP+GTG+ AD+ C C ++ Y Y P Y+ E++ E +CPWCIADG
Sbjct 5 KDFPKFRYHPDPIGTGAFKKADKPQICGCCGKKTEYVYESPFYSTEDV-ECLCPWCIADG 63
Query 93 SAASRFDATFTDAMWAVP-DDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVG 151
SAA +FD F DA +DV + +E++ RTPG+ GW QE WL HC D AF+G VG
Sbjct 64 SAAKKFDGEFQDAYSCEKINDVSK--LDELIHRTPGYCGWQQEVWLAHCNDYCAFVGYVG 121
Query 152 ASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYAD 210
+E+ + + D L + YR D I + + G YLFRCL CG + YAD
Sbjct 122 MTELEKMGLS-DKLEDIYRK-DEAMFDIGDIRECMTNGGSMQGYLFRCLHCGKYQLYAD 178
>gi|16131585|ref|NP_418173.1| conserved protein, UPF0167 family [Escherichia coli str. K-12
substr. MG1655]
gi|170083219|ref|YP_001732539.1| hypothetical protein ECDH10B_3904 [Escherichia coli str. K-12
substr. DH10B]
gi|238902808|ref|YP_002928604.1| hypothetical protein BWG_3408 [Escherichia coli BW2952]
18 more sequence titles
Length=195
Score = 134 bits (338), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 72/198 (37%), Positives = 104/198 (53%), Gaps = 20/198 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LPQF+YHP P+ TG+ D+ V C CEQ+ Y+GP Y +E+ E +CPWCI
Sbjct 2 TQNIRPLPQFKYHPKPLETGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEV-EHLCPWCI 60
Query 90 ADGSAASRFDATFTD---------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQE 134
ADGSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFAGSFQDDASIEGVEFEYDEEDEFAGIKNTYPDEMLKELVERTPGYHGWQQE 120
Query 135 EWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATA 194
WL HCGD F+G VG +++ D D L + + + + L + G
Sbjct 121 FWLAHCGDFCVFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKGGHCQG 177
Query 195 YLFRCLSCGVHLAYADFA 212
YLFRCL CG + DF+
Sbjct 178 YLFRCLHCGKLRLWGDFS 195
>gi|313106541|ref|ZP_07792769.1| hypothetical protein PA39016_000460004 [Pseudomonas aeruginosa
39016]
gi|310879271|gb|EFQ37865.1| hypothetical protein PA39016_000460004 [Pseudomonas aeruginosa
39016]
Length=179
Score = 134 bits (337), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 71/168 (43%), Positives = 93/168 (56%), Gaps = 3/168 (1%)
Query 37 LPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNE-AICPWCIADGSAA 95
LP FRYHP+P+ +GSI A +C C + R Y YTG Y+ EL ++CPWCIADGSAA
Sbjct 5 LPLFRYHPEPLASGSIEASATTCQCCGKARGYVYTGSPYSRHELPPGSLCPWCIADGSAA 64
Query 96 SRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEV 155
+R++A+F+D + V D+ EV RTPG+T W QE WL C DA AF G G E+
Sbjct 65 ARYEASFSDDYPLLDAGVAADIVTEVCERTPGYTSWQQERWLVCCEDACAFRGDAGREEI 124
Query 156 ADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCG 203
L + L + + WPA + + G Y F CL CG
Sbjct 125 GQL--GAEGLAQRFADFAWPAITWQRLVDAYTPGGNPAIYRFDCLHCG 170
>gi|340732372|gb|EGR61510.1| hypothetical protein HUSEC41_20735 [Escherichia coli O104:H4
str. 01-09591]
Length=195
Score = 134 bits (337), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/198 (37%), Positives = 104/198 (53%), Gaps = 20/198 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LPQF+YHP P+ TG+ D+ V C CEQ+ Y+GP Y +E+ E +CPWCI
Sbjct 2 TQNIRPLPQFKYHPKPLETGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEV-EHLCPWCI 60
Query 90 ADGSAASRFDATFTD---------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQE 134
ADGSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFAGSFQDDASIEGVEFEYDEEDEFAGIKNTYPDEMLKELVERTPGYHGWQQE 120
Query 135 EWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATA 194
WL HCGD AF+G VG +++ D D L + + + + L + G
Sbjct 121 FWLAHCGDFCAFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKGGDCQG 177
Query 195 YLFRCLSCGVHLAYADFA 212
YLFRCL CG DF+
Sbjct 178 YLFRCLHCGKLRLSGDFS 195
>gi|345347147|gb|EGW79461.1| hypothetical protein ECSTEC94C_4457 [Escherichia coli STEC_94C]
Length=195
Score = 133 bits (335), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/198 (37%), Positives = 104/198 (53%), Gaps = 20/198 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LPQF+YHP P+ G+ D+ V C CEQ+ Y+GP Y +E+ E +CPWCI
Sbjct 2 TQNIRPLPQFKYHPKPLEIGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEV-EHLCPWCI 60
Query 90 ADGSAASRFDATFTD---------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQE 134
ADGSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFAGSFQDDASIEGVEFEYDEEDEFAGIKNTYPDEMLKELVERTPGYHGWQQE 120
Query 135 EWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATA 194
WL HCGD AF+G VG +++ D D L + + + + L + G
Sbjct 121 FWLAHCGDFCAFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKRGDCQG 177
Query 195 YLFRCLSCGVHLAYADFA 212
YLFRCL CG + DF+
Sbjct 178 YLFRCLHCGKLRLWGDFS 195
>gi|116049480|ref|YP_791717.1| hypothetical protein PA14_44580 [Pseudomonas aeruginosa UCBPP-PA14]
gi|296390096|ref|ZP_06879571.1| hypothetical protein PaerPAb_18181 [Pseudomonas aeruginosa PAb1]
gi|115584701|gb|ABJ10716.1| conserved hypothetical protein [Pseudomonas aeruginosa UCBPP-PA14]
gi|334836934|gb|EGM15718.1| hypothetical protein PA15_23307 [Pseudomonas aeruginosa 152504]
Length=179
Score = 133 bits (335), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/168 (42%), Positives = 92/168 (55%), Gaps = 3/168 (1%)
Query 37 LPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNE-AICPWCIADGSAA 95
LP FRYHP+P+ +GSI A +C C + R Y YTG Y+ EL ++CPWCIADGSAA
Sbjct 5 LPHFRYHPEPLASGSIEASAATCQCCGKARGYVYTGSPYSRHELPPGSLCPWCIADGSAA 64
Query 96 SRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEV 155
+R++A+F+D + V ++ EV RTPG+ W QE WL C DA AF G G E+
Sbjct 65 ARYEASFSDDYPLLDAGVAANIVTEVCERTPGYASWQQERWLVCCEDACAFRGDAGREEI 124
Query 156 ADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCG 203
L + L + + WPA + + G Y F CL CG
Sbjct 125 GQL--GAEGLAQRFADFAWPASTWQRLVDAYTPGGNPAIYRFDCLHCG 170
>gi|218551250|ref|YP_002385042.1| hypothetical protein EFER_4015 [Escherichia fergusonii ATCC 35469]
gi|218358792|emb|CAQ91449.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469]
gi|324111615|gb|EGC05596.1| cbrC [Escherichia fergusonii B253]
gi|325499522|gb|EGC97381.1| hypothetical protein ECD227_3619 [Escherichia fergusonii ECD227]
Length=193
Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 75/197 (39%), Positives = 100/197 (51%), Gaps = 20/197 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LP F+YHP P+ TG+ D+ V C CEQ YTGP ++ +++ E +CPWCI
Sbjct 2 TQNIRPLPLFKYHPKPLETGAFEQDKIVECDCCEQPTSVYYTGPFFSVDDI-EYLCPWCI 60
Query 90 ADGSAASRFDATFTDAM--------------WAVPDDVPEDVTEEVLCRTPGFTGWLQEE 135
ADGSAA +F +F D A + D EE+L RTPG+ GW QE
Sbjct 61 ADGSAAKKFAGSFQDKASIEGVGTTYYDNDGTATTHSLSNDALEELLTRTPGYCGWQQEH 120
Query 136 WLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAY 195
WL HCG+ AF+G VG E+ D D L ++ + + E L G Y
Sbjct 121 WLTHCGELCAFVGYVGWDEIKDRLDEFAHLEDDCDSF----IRYEHLQECLKNGGYCQGY 176
Query 196 LFRCLSCGVHLAYADFA 212
LFRCL CG + DF+
Sbjct 177 LFRCLHCGKLRLWGDFS 193
>gi|261823659|ref|YP_003261765.1| hypothetical protein Pecwa_4466 [Pectobacterium wasabiae WPP163]
gi|261607672|gb|ACX90158.1| protein of unknown function UPF0167 [Pectobacterium wasabiae
WPP163]
Length=179
Score = 131 bits (330), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/167 (41%), Positives = 90/167 (54%), Gaps = 3/167 (1%)
Query 37 LPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEA-ICPWCIADGSAA 95
P FRYHP+P+ TGSI A + C+ C Q R Y YT Y +L E CPWCIADGSAA
Sbjct 5 FPSFRYHPNPLSTGSIKAADDVCLCCNQARGYVYTASCYTAHKLPEKKFCPWCIADGSAA 64
Query 96 SRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEV 155
+R+D F+D + + ++ EEV RTPG++ W QE WL C DA AF G E+
Sbjct 65 ARYDMHFSDEHPLFSEGIAVEIIEEVCSRTPGYSSWQQEIWLSCCDDACAFAGDASREEL 124
Query 156 ADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSC 202
L +AL ++ + WP + + + + G Y F CL C
Sbjct 125 VAL--GAEALAVQFADFSWPLETWKNVVESYQPGGETALYRFECLHC 169
>gi|153831830|ref|ZP_01984497.1| conserved hypothetical protein [Vibrio harveyi HY01]
gi|148871828|gb|EDL70651.1| conserved hypothetical protein [Vibrio harveyi HY01]
Length=175
Score = 131 bits (330), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 68/178 (39%), Positives = 100/178 (57%), Gaps = 5/178 (2%)
Query 36 KLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAA 95
+LP F+YHPDP+ TG++ + +C C R Y T +Y+E ++ E ICPWCI+DGSAA
Sbjct 2 ELPTFKYHPDPIKTGAVEVTDANCECCSVSRGYRATSTIYSEHDV-ETICPWCISDGSAA 60
Query 96 SRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEV 155
+FD F D + + + V +EV RTP + W QE WL HCGDA F G S++
Sbjct 61 KKFDREFADPHPLMKAGLDKSVVKEVCERTPSYISWQQEVWLSHCGDACEFHGDAEKSDL 120
Query 156 ADLPDA-LDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYADFA 212
+ DA L+AL N+ +++ + + ++ G Y F+C SCG+ DFA
Sbjct 121 LQVKDAELEALLNDQL---IGSNEWHQIVTYYEKGGNPAIYKFKCRSCGIFTYSLDFA 175
>gi|281180775|dbj|BAI57105.1| conserved hypothetical protein [Escherichia coli SE15]
gi|333971902|gb|AEG38707.1| Hypothetical protein ECNA114_3866 [Escherichia coli NA114]
Length=195
Score = 131 bits (329), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 71/198 (36%), Positives = 104/198 (53%), Gaps = 20/198 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LP F+YHP P+ TG+ D+ V C CEQ+ Y+GP Y +E+ E +CPWCI
Sbjct 2 TQNIRPLPLFKYHPKPLETGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEV-EHLCPWCI 60
Query 90 ADGSAASRFDATFTDA---------------MWAVPDDVPEDVTEEVLCRTPGFTGWLQE 134
A+GSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ANGSAAEKFAGSFQDDASIEGVEFEYDEEDDFAGIKNTYPDEMLKELVERTPGYHGWQQE 120
Query 135 EWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATA 194
WL HCGD AF+G VG +++ D D L + + + + L + G
Sbjct 121 FWLAHCGDFCAFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKGGDCQG 177
Query 195 YLFRCLSCGVHLAYADFA 212
YLFRCL CG + DF+
Sbjct 178 YLFRCLHCGKLRLWGDFS 195
>gi|157693409|ref|YP_001487871.1| hypothetical protein BPUM_2653 [Bacillus pumilus SAFR-032]
gi|157682167|gb|ABV63311.1| hypothetical protein BPUM_2653 [Bacillus pumilus SAFR-032]
Length=182
Score = 130 bits (328), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 72/189 (39%), Positives = 104/189 (56%), Gaps = 13/189 (6%)
Query 24 HEMPVEETSTPQKLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEA 83
H M V T LP F+Y+PDP+ I ++ +C CE+ R Y Y GP Y E++ E
Sbjct 3 HNMEVYMT-----LPTFKYNPDPISLHVIKKEQTTCPVCEKEREYVYHGPFYTVEDV-EG 56
Query 84 ICPWCIADGSAASRFDATFTDAMWAVPDDVPED-VTEEVLCRTPGFTGWLQEEWLHHCGD 142
ICPWCI DGSAA +++ F D A DDV E+ +E++ RTPG+ GW QE WL HCGD
Sbjct 57 ICPWCIKDGSAAKKYNGVFQDD--ASCDDVDEEKYIDELIYRTPGYRGWQQEYWLSHCGD 114
Query 143 AAAFLGPVGASEVADLPDAL-DALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLS 201
A + VG E+ L + L + + + G + ++++++ G YLF+C+
Sbjct 115 FCAIVQYVGWKEIEHLEEELTEDIEDICSGGGLTKENLKQWLVN---GGYLQGYLFQCVY 171
Query 202 CGVHLAYAD 210
C H Y D
Sbjct 172 CNKHRLYID 180
>gi|332996018|gb|EGK15645.1| hypothetical protein SFVA6_4679 [Shigella flexneri VA-6]
Length=195
Score = 130 bits (328), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 72/198 (37%), Positives = 104/198 (53%), Gaps = 20/198 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LPQF+YHP P+ TG+ D+ V C CEQ+ Y+GP Y +E+ E +CP CI
Sbjct 2 TQNIRPLPQFKYHPKPLETGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEV-EHLCPLCI 60
Query 90 ADGSAASRFDATFTD---------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQE 134
ADGSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFAGSFQDDASIEGVEFEYDEEDEFAGIKNTYPDEMLKELVERTPGYHGWQQE 120
Query 135 EWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATA 194
WL HCGD AF+G VG +++ D D L + + + + L + G
Sbjct 121 FWLAHCGDFCAFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKGGDCQG 177
Query 195 YLFRCLSCGVHLAYADFA 212
YLFRCL CG + DF+
Sbjct 178 YLFRCLHCGKLRLWGDFS 195
>gi|194017873|ref|ZP_03056482.1| protein YieJ [Bacillus pumilus ATCC 7061]
gi|194010525|gb|EDW20098.1| protein YieJ [Bacillus pumilus ATCC 7061]
Length=174
Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 70/176 (40%), Positives = 101/176 (58%), Gaps = 8/176 (4%)
Query 37 LPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAAS 96
LP F+Y+PDPV I + +C CE+ R Y Y GP Y+ E++ + ICPWCI DGSAA
Sbjct 3 LPTFKYNPDPVSLNVIKKEPTTCPVCEKDREYVYHGPFYSVEDV-KGICPWCIKDGSAAK 61
Query 97 RFDATFTDAMWAVPDDV-PEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEV 155
++D TF D A DDV E+ +E++ RTPG+ GW QE WL HCGD A + VG E+
Sbjct 62 KYDGTFQDD--ASCDDVEQEEYIDELIFRTPGYRGWQQEYWLSHCGDFCAIVQYVGWKEI 119
Query 156 ADLPDAL-DALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYAD 210
L + L + + + G + ++++++ G YLF+C+ C H Y D
Sbjct 120 EHLEEELTEDIEDICSGGRLTKENLKQWLVN---GGDLQGYLFQCVHCNKHRLYID 172
>gi|340752053|ref|ZP_08688863.1| hypothetical protein FMAG_01631 [Fusobacterium mortiferum ATCC
9817]
gi|229421022|gb|EEO36069.1| hypothetical protein FMAG_01631 [Fusobacterium mortiferum ATCC
9817]
Length=179
Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/180 (39%), Positives = 100/180 (56%), Gaps = 7/180 (3%)
Query 34 PQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADG 92
++LP F+Y+PDP+ TG DE V+C C + YTGP Y+ E++ E +CP CIA+G
Sbjct 2 KKELPFFKYYPDPLKTGEFETDETVTCECCGKETDVYYTGPFYSVEDI-EYLCPECIANG 60
Query 93 SAASRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGA 152
A+ +FD F + D E+ +E++ RTP + GW QE W+ HC D AF+ VGA
Sbjct 61 KASKKFDGDFVSLYFGKVSD--EEKIDELIHRTPSYCGWQQECWITHCDDFCAFIDYVGA 118
Query 153 SEVADLPDALDALRNEY--RGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYAD 210
E+ + + ++N G +W ++I E I + G YLFRCL CG H Y D
Sbjct 119 KELEKMGVLEEVIKNGNPDDGNEWSKEQI-EIIKNMVNGGHVQGYLFRCLHCGKHFLYFD 177
>gi|160939502|ref|ZP_02086852.1| hypothetical protein CLOBOL_04395 [Clostridium bolteae ATCC BAA-613]
gi|158437712|gb|EDP15474.1| hypothetical protein CLOBOL_04395 [Clostridium bolteae ATCC BAA-613]
Length=287
Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 82/211 (39%), Positives = 113/211 (54%), Gaps = 18/211 (8%)
Query 5 ANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSI--VADEVSCVSC 62
NH LP P + + + +E LP FRYHPDP+ TG+ + V C C
Sbjct 90 GNHYALPKP------KTPEEKQKEKERQAQLGLPAFRYHPDPLDTGAFEESKEGVICGCC 143
Query 63 EQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDAMWAVPDDV--PEDVTEE 120
+ YTGP Y+ +E+ +CP CIA G AA ++D +F D ++V D V PE + +E
Sbjct 144 GKTTHIYYTGPFYSVDEIA-YLCPECIASGEAARKYDGSFQDD-FSVDDGVDDPEKL-DE 200
Query 121 VLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIE 180
++ RTPG++GW QE W HCGD AFLG VGA E+ L D L+ + + + D I
Sbjct 201 LIHRTPGYSGWQQEYWRAHCGDYCAFLGYVGARELRAL-DVLEEVLGDPMWNEEQKDMIR 259
Query 181 EFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
E + G YLF+CL CG HL + DF
Sbjct 260 ESV----NGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|325680170|ref|ZP_08159735.1| hypothetical protein CUS_5950 [Ruminococcus albus 8]
gi|324108119|gb|EGC02370.1| hypothetical protein CUS_5950 [Ruminococcus albus 8]
Length=177
Score = 129 bits (323), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 76/177 (43%), Positives = 99/177 (56%), Gaps = 6/177 (3%)
Query 36 KLPQFRYHPDPVGTGSIV-ADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGS 93
+ P+F+YHP+P+GT + ADE C C ++ Y Y P ++ E + E +CP+CIADGS
Sbjct 3 EFPKFKYHPEPIGTKAFKKADEPRVCQCCGKKTEYVYEAPFFSAENV-EVLCPYCIADGS 61
Query 94 AASRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGAS 153
AA +FD F DA D P TEE+ RTPG+ GW QE WL HCGD AF+G VG
Sbjct 62 AAEKFDGEFQDAASCDKVDDPAK-TEELTKRTPGYIGWQQEYWLAHCGDYCAFVGYVGME 120
Query 154 EVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYAD 210
E+ + A D + YR D ++ L G YLFRCL CG + YAD
Sbjct 121 ELEKMGLA-DKTEDIYRK-DAAFFDLDTIREGLYNGGSLQGYLFRCLLCGKYQLYAD 175
>gi|239624306|ref|ZP_04667337.1| protein YieJ [Clostridiales bacterium 1_7_47_FAA]
gi|239520692|gb|EEQ60558.1| protein YieJ [Clostridiales bacterium 1_7_47FAA]
Length=287
Score = 129 bits (323), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 83/211 (40%), Positives = 110/211 (53%), Gaps = 18/211 (8%)
Query 5 ANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSIVADE--VSCVSC 62
NH +P P + D + E LP FRYHPDP+ TG+ E V C C
Sbjct 90 GNHYAIPKP------KTPDEKQKERERQAQLGLPTFRYHPDPMDTGAFEESEEGVVCDCC 143
Query 63 EQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDAMWAVPDDV--PEDVTEE 120
+ YT P YA E++ +CP CIA+G AA ++D +F D ++V D V PE + +E
Sbjct 144 GKTTHIFYTAPFYAVEDIA-YLCPECIANGEAARKYDGSFQDD-FSVDDGVDDPEKL-DE 200
Query 121 VLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIE 180
++ RTPG++GW QE W HCGD A+LG VGA E+ AL L W D+ +
Sbjct 201 LIHRTPGYSGWQQEYWRAHCGDYCAYLGHVGARELR----ALGVLEEVLDDPMWD-DEQK 255
Query 181 EFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
E I G YLF+CL CG HL + DF
Sbjct 256 EMIRESVNGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|153831823|ref|ZP_01984490.1| conserved hypothetical protein [Vibrio harveyi HY01]
gi|148871821|gb|EDL70644.1| conserved hypothetical protein [Vibrio harveyi HY01]
Length=202
Score = 128 bits (322), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 67/178 (38%), Positives = 99/178 (56%), Gaps = 5/178 (2%)
Query 36 KLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAA 95
+LP F+YHPDP+ TG++ + +C C R Y T +Y+ ++ E ICPWCI+DGSAA
Sbjct 29 ELPTFKYHPDPIKTGAVEVTDANCECCGVSRGYKATSTIYSVHDV-ETICPWCISDGSAA 87
Query 96 SRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEV 155
+FD F D + + + V +EV RTP + W QE WL HCGDA F G S++
Sbjct 88 KKFDGEFADPHPLMKAGLDKSVVKEVCERTPSYISWQQEVWLSHCGDACEFHGDAEKSDL 147
Query 156 ADLPDA-LDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYADFA 212
+ DA L+AL N+ +++ + + ++ G Y F+C SCG+ DFA
Sbjct 148 LQVKDAELEALLNDQL---IGSNEWHQIVTYYEKGGNPAIYKFKCRSCGIFTYSLDFA 202
>gi|170766744|ref|ZP_02901197.1| protein YieJ [Escherichia albertii TW07627]
gi|155675627|gb|ABU25145.1| YieJ [Escherichia albertii]
gi|170124182|gb|EDS93113.1| protein YieJ [Escherichia albertii TW07627]
Length=195
Score = 128 bits (321), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 72/198 (37%), Positives = 99/198 (50%), Gaps = 20/198 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LP F+YHP P+ TG+ D+ V C CEQ Y+ P Y +E+ E +CPWCI
Sbjct 2 THNTRPLPIFKYHPQPLETGAFKRDKTVECDCCEQETSVYYSSPFYCVDEI-EYLCPWCI 60
Query 90 ADGSAASRFDATFTD---------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQE 134
ADGSAA +F +F D + D P ++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFAGSFQDDTSIEGVEFEYDEEDEFAGIKDTYPAEMLKELVERTPGYHGWQQE 120
Query 135 EWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATA 194
WL HCGD AF+G VG E+ + D L + + + + L G
Sbjct 121 FWLAHCGDFCAFIGYVGWDEIKNRLDEFANLEEDCENF---GIRSLDLAKCLQNGGHCQG 177
Query 195 YLFRCLSCGVHLAYADFA 212
YLFRCL CG + DF+
Sbjct 178 YLFRCLHCGKLRLWGDFS 195
>gi|332655034|ref|ZP_08420775.1| conserved hypothetical protein [Ruminococcaceae bacterium D16]
gi|332515894|gb|EGJ45503.1| conserved hypothetical protein [Ruminococcaceae bacterium D16]
Length=287
Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 80/210 (39%), Positives = 111/210 (53%), Gaps = 16/210 (7%)
Query 5 ANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSI--VADEVSCVSC 62
NH LP P + + + +E LP FRYHP+P+ TG+ AD V C C
Sbjct 90 GNHYALPKP------KTPEEKQKEKERQAQLGLPAFRYHPNPLETGAFEESADGVVCDCC 143
Query 63 EQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDAMWAVPDDVPE-DVTEEV 121
+ YT P ++ E++ +CP CIA+G AA ++D +F D ++V D V E + +E+
Sbjct 144 GKTTHIFYTNPFFSVEDIA-YLCPECIANGEAARKYDGSFQDD-FSVDDGVDEPEKLDEL 201
Query 122 LCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEE 181
+ RTPG++GW QE W HCGD A+LG VGA E+ AL L W D+ +E
Sbjct 202 IHRTPGYSGWQQEYWRAHCGDYCAYLGHVGARELR----ALGVLEEVLDDPMWD-DEQKE 256
Query 182 FILTLDRNGLATAYLFRCLSCGVHLAYADF 211
I G YLF+CL CG HL + DF
Sbjct 257 MIRESVNGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|167770051|ref|ZP_02442104.1| hypothetical protein ANACOL_01393 [Anaerotruncus colihominis
DSM 17241]
gi|167667775|gb|EDS11905.1| hypothetical protein ANACOL_01393 [Anaerotruncus colihominis
DSM 17241]
Length=287
Score = 127 bits (318), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/211 (39%), Positives = 112/211 (54%), Gaps = 18/211 (8%)
Query 5 ANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSI--VADEVSCVSC 62
NH LP P + + +E LP FRYHPDP+ TG+ A+ V C C
Sbjct 90 GNHYALPKPKTPEETQNE------KERRAQLGLPAFRYHPDPLDTGAFEESAEGVVCDCC 143
Query 63 EQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDAMWAVPDDV--PEDVTEE 120
+ YT P ++ E++ +CP CIA G AA ++D +F D ++V D V PE + +E
Sbjct 144 GKMTHIFYTNPFFSVEDIA-YLCPACIASGEAARKYDGSFQDD-FSVDDGVDDPEKL-DE 200
Query 121 VLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIE 180
++ RTPG++GW QE W HCGD AFLG VGA E+ AL AL + W ++ +
Sbjct 201 LIHRTPGYSGWQQEYWRAHCGDYCAFLGYVGARELR----ALGALEDVLDDPMWDEEQ-K 255
Query 181 EFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
E I G YLF+CL CG HL + DF
Sbjct 256 EMIRESVNGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|295115447|emb|CBL36294.1| Uncharacterized protein conserved in bacteria [butyrate-producing
bacterium SM4/1]
Length=287
Score = 127 bits (318), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 78/179 (44%), Positives = 101/179 (57%), Gaps = 12/179 (6%)
Query 37 LPQFRYHPDPVGTGSIVADE--VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSA 94
LP FRYHPDP+ TG+ E V C C + YT P Y E++ E +CP CIA G A
Sbjct 116 LPSFRYHPDPLDTGAFEQSEESVVCDCCGKNIHIYYTDPFYTVEDI-EYLCPECIASGEA 174
Query 95 ASRFDATFTDAMWAVPDDV--PEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGA 152
A +++ +F D ++ D V PE + +E+L RTPG++GW QE W HCGD A+LG VGA
Sbjct 175 ARKYNGSFQDVC-SLEDGVDDPEKL-DELLHRTPGYSGWQQEYWRVHCGDYCAYLGNVGA 232
Query 153 SEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
SE+ ALD L W D+ +E I G YLF+CL CG HL + DF
Sbjct 233 SELR----ALDVLEEVLDDPMWD-DEQKEMIQESVNGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|266621339|ref|ZP_06114274.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479]
gi|288866986|gb|EFC99284.1| conserved hypothetical protein [Clostridium hathewayi DSM 13479]
Length=287
Score = 127 bits (318), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/211 (39%), Positives = 112/211 (54%), Gaps = 18/211 (8%)
Query 5 ANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSI--VADEVSCVSC 62
NH LP P + + + +E LP FRYHP+P+ TG+ AD V C C
Sbjct 90 GNHYALPKP------KTPEEKQKEKERQAQLGLPAFRYHPNPLETGAFEESADGVVCDCC 143
Query 63 EQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDAMWAVPDDV--PEDVTEE 120
+ YT P YA E++ +CP CIA+G AA ++D +F D ++V D V PE + +E
Sbjct 144 GKTTHIFYTAPFYAVEDIA-YLCPECIANGEAARKYDGSFQDD-FSVDDGVDDPEKL-DE 200
Query 121 VLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIE 180
++ RTPG++GW QE W HCGD A+LG VGA E+ AL L W D+ +
Sbjct 201 LIHRTPGYSGWQQEYWRAHCGDYCAYLGHVGARELR----ALGVLEEVLDDPMWD-DEQK 255
Query 181 EFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
+ I G YLF+CL CG HL + DF
Sbjct 256 KMIQESVNGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|295089896|emb|CBK76003.1| Uncharacterized protein conserved in bacteria [Clostridium cf.
saccharolyticum K10]
Length=287
Score = 127 bits (318), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 78/179 (44%), Positives = 101/179 (57%), Gaps = 12/179 (6%)
Query 37 LPQFRYHPDPVGTGSIVADE--VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSA 94
LP FRYHPDP+ TG+ E V C C + YT P Y E++ E +CP CIA G A
Sbjct 116 LPSFRYHPDPLDTGAFEQSEESVVCDCCGKNIHIYYTDPFYTVEDI-EYLCPECIASGEA 174
Query 95 ASRFDATFTDAMWAVPDDV--PEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGA 152
A +++ +F D ++ D V PE + +E+L RTPG++GW QE W HCGD A+LG VGA
Sbjct 175 ARKYNGSFQDVC-SLDDGVDDPEKL-DELLHRTPGYSGWQQEYWRVHCGDYCAYLGNVGA 232
Query 153 SEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
SE+ ALD L W D+ +E I G YLF+CL CG HL + DF
Sbjct 233 SELR----ALDVLEEVLDDPMWD-DEQKEMIQESVNGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|283795284|ref|ZP_06344437.1| conserved hypothetical protein [Clostridium sp. M62/1]
gi|291076933|gb|EFE14297.1| conserved hypothetical protein [Clostridium sp. M62/1]
Length=287
Score = 127 bits (318), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 78/179 (44%), Positives = 101/179 (57%), Gaps = 12/179 (6%)
Query 37 LPQFRYHPDPVGTGSIVADE--VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSA 94
LP FRYHPDP+ TG+ E V C C + YT P Y E++ E +CP CIA G A
Sbjct 116 LPSFRYHPDPLDTGAFEQSEESVVCDCCGKNIHIYYTDPFYTVEDI-EYLCPECIASGEA 174
Query 95 ASRFDATFTDAMWAVPDDV--PEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGA 152
A +++ +F D ++ D V PE + +E+L RTPG++GW QE W HCGD A+LG VGA
Sbjct 175 ARKYNGSFQDVC-SLDDGVDDPEKL-DELLHRTPGYSGWQQEYWRVHCGDYCAYLGNVGA 232
Query 153 SEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
SE+ ALD L W D+ +E I G YLF+CL CG HL + DF
Sbjct 233 SELR----ALDVLEEVLDDPMWD-DEQKEMIQESVNGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|295102676|emb|CBL00221.1| Uncharacterized protein conserved in bacteria [Faecalibacterium
prausnitzii L2-6]
Length=287
Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 83/211 (40%), Positives = 110/211 (53%), Gaps = 18/211 (8%)
Query 5 ANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSI--VADEVSCVSC 62
NH LP P + + + +E LP FRYHP+P+ TG+ AD V C C
Sbjct 90 GNHYALPRP------KTPEEKQKEKERQAQLGLPAFRYHPNPLETGAFEESADGVVCDCC 143
Query 63 EQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDAMWAVPDDV--PEDVTEE 120
+ YT P YA E++ +CP CIA+G AA ++D +F D ++V D V PE + E
Sbjct 144 GKTTHIFYTAPFYAVEDIA-YLCPECIANGEAARKYDGSFQDD-FSVDDGVDDPEKLDEP 201
Query 121 VLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIE 180
+ RTPG++GW QE W HCGD A+LG VGA E+ AL L W D+ +
Sbjct 202 IH-RTPGYSGWQQEYWRAHCGDYCAYLGHVGARELR----ALGVLEEVLDDPMWD-DEQK 255
Query 181 EFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
E I G YLF+CL CG HL + DF
Sbjct 256 EMIRESVNGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|336429235|ref|ZP_08609203.1| hypothetical protein HMPREF0994_05209 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336003151|gb|EGN33242.1| hypothetical protein HMPREF0994_05209 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length=287
Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 81/211 (39%), Positives = 112/211 (54%), Gaps = 18/211 (8%)
Query 5 ANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSI--VADEVSCVSC 62
NH LP P + + + +E LP FRYHP+P+ TG+ AD V C C
Sbjct 90 GNHYALPKP------KTPEEKQKEKERQAQLGLPAFRYHPNPLETGAFEESADGVVCDCC 143
Query 63 EQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDAMWAVPDDV--PEDVTEE 120
+ YT P ++ E++ +CP CIA+G AA ++D +F D ++V D V PE + +E
Sbjct 144 GKTTHIFYTNPFFSVEDIA-YLCPECIANGEAARKYDGSFQDD-FSVDDGVDDPEKL-DE 200
Query 121 VLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIE 180
++ RTPG++GW QE W HCGD A+LG VGA E+ AL L W D+ +
Sbjct 201 LIHRTPGYSGWQQEYWRAHCGDYCAYLGNVGARELR----ALGVLEEVLDDPMWD-DEQK 255
Query 181 EFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
E I G YLF+CL CG HL + DF
Sbjct 256 EMIRESVNGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|223985039|ref|ZP_03635137.1| hypothetical protein HOLDEFILI_02441 [Holdemania filiformis DSM
12042]
gi|223963011|gb|EEF67425.1| hypothetical protein HOLDEFILI_02441 [Holdemania filiformis DSM
12042]
Length=287
Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 82/211 (39%), Positives = 111/211 (53%), Gaps = 18/211 (8%)
Query 5 ANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSI--VADEVSCVSC 62
NH LP P + + + +E LP FRYHP+P+ TG+ AD V C C
Sbjct 90 GNHYALPKP------KTPEEKQKEKERQAQLGLPAFRYHPNPLETGAFEESADGVVCDCC 143
Query 63 EQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTD--AMWAVPDDVPEDVTEE 120
+ YTGP YA E++ E +CP CI+ G AA ++D F D ++ DD PE + +E
Sbjct 144 GKTTHIFYTGPFYAVEDI-EYLCPECISSGEAARKYDGCFQDDCSLDNGVDD-PEKL-DE 200
Query 121 VLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIE 180
++ RTPG++GW QE W HCGD A+LG VGA E+ AL L W D+ +
Sbjct 201 LIHRTPGYSGWQQEYWRAHCGDYCAYLGHVGARELR----ALGVLEEVLDDPMWD-DEQK 255
Query 181 EFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
+ I G YLF+CL CG HL + DF
Sbjct 256 KMIQESVNGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|295100208|emb|CBK97753.1| Uncharacterized protein conserved in bacteria [Faecalibacterium
prausnitzii L2-6]
Length=287
Score = 126 bits (316), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 81/211 (39%), Positives = 112/211 (54%), Gaps = 18/211 (8%)
Query 5 ANHACLPTPLASTTGRGQDHEMPVEETSTPQKLPQFRYHPDPVGTGSI--VADEVSCVSC 62
NH LP P + + + +E LP FRYHP+P+ TG+ AD V C C
Sbjct 90 GNHYALPKP------KTPEEKQKEKERQAQLGLPAFRYHPNPLETGAFEESADGVVCDCC 143
Query 63 EQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDAMWAVPDDV--PEDVTEE 120
+ YT P ++ E++ +CP CIA+G AA ++D +F D ++V D V PE + +E
Sbjct 144 GKTTHIFYTNPFFSVEDIA-YLCPECIANGEAARKYDGSFQDD-FSVDDGVDDPEKL-DE 200
Query 121 VLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIE 180
++ RTPG++GW QE W HCGD A+LG VGA E+ AL L W D+ +
Sbjct 201 LIHRTPGYSGWQQEYWRAHCGDYCAYLGHVGARELR----ALGVLEEVLDDPMWD-DEQK 255
Query 181 EFILTLDRNGLATAYLFRCLSCGVHLAYADF 211
E I G YLF+CL CG HL + DF
Sbjct 256 EMIRESVNGGHLQCYLFQCLHCGKHLVWMDF 286
>gi|152987810|ref|YP_001349151.1| hypothetical protein PSPA7_3797 [Pseudomonas aeruginosa PA7]
gi|150962968|gb|ABR84993.1| conserved hypothetical protein [Pseudomonas aeruginosa PA7]
Length=179
Score = 126 bits (316), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 67/168 (40%), Positives = 90/168 (54%), Gaps = 3/168 (1%)
Query 37 LPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNE-AICPWCIADGSAA 95
LP FRYHP+P+ +GSI A +C C + R Y YT Y+ EL ++CPWCIADGSAA
Sbjct 5 LPYFRYHPEPLASGSIEASAATCRCCGKARGYAYTVSPYSRHELPPGSLCPWCIADGSAA 64
Query 96 SRFDATFTDAMWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEV 155
+R++A+F D + + ++ EV RTPG+ W QE WL C DA AF G G E+
Sbjct 65 ARYEASFCDDHPLLEAGIAAEIVAEVCERTPGYASWQQERWLSCCEDACAFRGDAGREEI 124
Query 156 ADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCG 203
L + L + + WPA + + G Y F CL CG
Sbjct 125 GRL--GAEGLAQRFVDFAWPAITWKRLVDAYAPGGNPAIYRFDCLHCG 170
>gi|124007367|ref|ZP_01692074.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123987200|gb|EAY26940.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length=186
Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 73/184 (40%), Positives = 94/184 (52%), Gaps = 13/184 (7%)
Query 37 LPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAAS 96
LP F+Y+PDPV G I + C C+Q R Y YTGP Y ++ ICPWCI DGSAA
Sbjct 4 LPVFKYNPDPVRLGVIKKERTHCPVCQQERAYVYTGPFYTTAQV-RGICPWCIKDGSAAQ 62
Query 97 RFDATFTDAM---WAVPD-------DVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAF 146
RF T D + PD + DV +E+L RTPG+ GW QE WL HC + A
Sbjct 63 RFQGTLQDYLAIEGISPDPSTPHTINYASDVIDELLERTPGYRGWQQEVWLSHCNEPCAI 122
Query 147 LGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATAYLFRCLSCGVHL 206
+ VG E+A L + L ++ + W + E L L + G YLFRC+ C H
Sbjct 123 IDYVGWKEIAHLQEELMPDLSDIQS-RWNISQTELQGL-LTKPGDIQGYLFRCVKCNKHR 180
Query 207 AYAD 210
D
Sbjct 181 LTID 184
>gi|254038935|ref|ZP_04872987.1| conserved hypothetical protein [Escherichia sp. 1_1_43]
gi|226838900|gb|EEH70927.1| conserved hypothetical protein [Escherichia sp. 1_1_43]
Length=182
Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 68/185 (37%), Positives = 98/185 (53%), Gaps = 20/185 (10%)
Query 31 TSTPQKLPQFRYHPDPVGTGSIVADE-VSCVSCEQRRPYTYTGPVYAEEELNEAICPWCI 89
T + LPQF+YHP P+ TG+ D+ V C CEQ+ Y+GP Y +E+ E +CPWCI
Sbjct 2 TQNIRPLPQFKYHPKPLETGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEV-EHLCPWCI 60
Query 90 ADGSAASRFDATFTD---------------AMWAVPDDVPEDVTEEVLCRTPGFTGWLQE 134
ADGSAA +F +F D + + P+++ +E++ RTPG+ GW QE
Sbjct 61 ADGSAAEKFAGSFQDDASIEGVEFEYDEEDEFAGIKNTYPDEMLKELVERTPGYHGWQQE 120
Query 135 EWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILTLDRNGLATA 194
WL HCGD AF+G VG +++ D D L + + + + L + G
Sbjct 121 FWLAHCGDFCAFIGYVGWNDIKDRLDEFANLEEDCENF---GIRNSDLAKCLQKGGDCQG 177
Query 195 YLFRC 199
YLFRC
Sbjct 178 YLFRC 182
Lambda K H
0.319 0.135 0.441
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 252352077426
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40