BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2114
Length=207
Score E
Sequences producing significant alignments: (Bits) Value
gi|15841606|ref|NP_336643.1| hypothetical protein MT2174 [Mycoba... 426 1e-117
gi|253798824|ref|YP_003031825.1| hypothetical protein TBMG_01866... 425 2e-117
gi|15609251|ref|NP_216630.1| hypothetical protein Rv2114 [Mycoba... 424 3e-117
gi|289447734|ref|ZP_06437478.1| conserved hypothetical protein [... 419 2e-115
gi|308374493|ref|ZP_07436271.2| hypothetical protein TMFG_01073 ... 406 1e-111
gi|240171388|ref|ZP_04750047.1| hypothetical protein MkanA1_1889... 364 3e-99
gi|254775070|ref|ZP_05216586.1| hypothetical protein MaviaA2_104... 340 1e-91
gi|118464079|ref|YP_881601.1| hypothetical protein MAV_2401 [Myc... 339 1e-91
gi|254819645|ref|ZP_05224646.1| hypothetical protein MintA_06969... 336 1e-90
gi|41407936|ref|NP_960772.1| hypothetical protein MAP1838 [Mycob... 336 1e-90
gi|342859807|ref|ZP_08716460.1| hypothetical protein MCOL_13048 ... 326 1e-87
gi|296165186|ref|ZP_06847733.1| conserved hypothetical protein [... 325 2e-87
gi|183983089|ref|YP_001851380.1| hypothetical protein MMAR_3090 ... 324 5e-87
gi|118617846|ref|YP_906178.1| hypothetical protein MUL_2336 [Myc... 291 5e-77
gi|145223658|ref|YP_001134336.1| hypothetical protein Mflv_3071 ... 265 3e-69
gi|120404427|ref|YP_954256.1| hypothetical protein Mvan_3455 [My... 258 3e-67
gi|294993523|ref|ZP_06799214.1| hypothetical protein Mtub2_03189... 248 4e-64
gi|284038595|ref|YP_003388525.1| hypothetical protein Slin_3725 ... 162 3e-38
gi|145589984|ref|YP_001156581.1| hypothetical protein Pnuc_1804 ... 151 7e-35
gi|162451503|ref|YP_001613870.1| hypothetical protein sce3231 [S... 138 5e-31
gi|158421724|ref|YP_001523016.1| hypothetical protein AZC_0100 [... 136 2e-30
gi|270157528|ref|ZP_06186185.1| conserved hypothetical protein [... 133 2e-29
gi|52841347|ref|YP_095146.1| hypothetical protein lpg1113 [Legio... 132 2e-29
gi|148360208|ref|YP_001251415.1| hypothetical protein LPC_2140 [... 129 2e-28
gi|54297068|ref|YP_123437.1| hypothetical protein lpp1113 [Legio... 129 2e-28
gi|307609860|emb|CBW99383.1| hypothetical protein LPW_11601 [Leg... 126 2e-27
gi|54294054|ref|YP_126469.1| hypothetical protein lpl1117 [Legio... 124 9e-27
gi|20091164|ref|NP_617239.1| hypothetical protein MA2329 [Methan... 113 2e-23
gi|303245887|ref|ZP_07332169.1| conserved hypothetical protein [... 110 9e-23
gi|20091163|ref|NP_617238.1| hypothetical protein MA2328 [Methan... 83.2 2e-14
gi|171464106|ref|YP_001798219.1| hypothetical protein Pnec_1517 ... 80.1 2e-13
gi|239907715|ref|YP_002954456.1| hypothetical protein DMR_30790 ... 75.9 4e-12
gi|239904765|ref|YP_002951503.1| hypothetical protein DMR_01260 ... 71.6 6e-11
gi|158424379|ref|YP_001525671.1| hypothetical protein AZC_2755 [... 71.2 8e-11
gi|149919613|ref|ZP_01908092.1| hypothetical protein PPSIR1_0706... 63.2 3e-08
gi|323447988|gb|EGB03893.1| hypothetical protein AURANDRAFT_6764... 58.5 6e-07
gi|148262883|ref|YP_001229589.1| paraquat-inducible protein A [G... 43.5 0.017
gi|209965764|ref|YP_002298679.1| DNA polymerase I, putative [Rho... 38.5 0.65
gi|291410118|ref|XP_002721338.1| PREDICTED: runt-related transcr... 35.8 3.6
gi|149923996|ref|ZP_01912380.1| hypothetical protein PPSIR1_0653... 35.8 3.6
gi|338780768|gb|EGP45169.1| sigma-E factor regulatory protein [A... 35.0 6.9
gi|194673942|ref|XP_612405.4| PREDICTED: jumonji, AT rich intera... 34.7 9.3
>gi|15841606|ref|NP_336643.1| hypothetical protein MT2174 [Mycobacterium tuberculosis CDC1551]
gi|308232037|ref|ZP_07663979.1| hypothetical protein TMAG_00299 [Mycobacterium tuberculosis SUMu001]
gi|308369624|ref|ZP_07666762.1| hypothetical protein TMBG_00660 [Mycobacterium tuberculosis SUMu002]
21 more sequence titles
Length=220
Score = 426 bits (1094), Expect = 1e-117, Method: Compositional matrix adjust.
Identities = 207/207 (100%), Positives = 207/207 (100%), Gaps = 0/207 (0%)
Query 1 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 60
MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA
Sbjct 14 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 73
Query 61 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTV 120
ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTV
Sbjct 74 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTV 133
Query 121 FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS 180
FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS
Sbjct 134 FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS 193
Query 181 ELRVDTTNREARVLQDDLTNSYSLVTA 207
ELRVDTTNREARVLQDDLTNSYSLVTA
Sbjct 194 ELRVDTTNREARVLQDDLTNSYSLVTA 220
>gi|253798824|ref|YP_003031825.1| hypothetical protein TBMG_01866 [Mycobacterium tuberculosis KZN
1435]
gi|254364921|ref|ZP_04980967.1| hypothetical protein TBHG_02068 [Mycobacterium tuberculosis str.
Haarlem]
gi|289554100|ref|ZP_06443310.1| hypothetical protein TBXG_01850 [Mycobacterium tuberculosis KZN
605]
7 more sequence titles
Length=213
Score = 425 bits (1092), Expect = 2e-117, Method: Compositional matrix adjust.
Identities = 207/207 (100%), Positives = 207/207 (100%), Gaps = 0/207 (0%)
Query 1 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 60
MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA
Sbjct 7 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 66
Query 61 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTV 120
ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTV
Sbjct 67 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTV 126
Query 121 FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS 180
FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS
Sbjct 127 FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS 186
Query 181 ELRVDTTNREARVLQDDLTNSYSLVTA 207
ELRVDTTNREARVLQDDLTNSYSLVTA
Sbjct 187 ELRVDTTNREARVLQDDLTNSYSLVTA 213
>gi|15609251|ref|NP_216630.1| hypothetical protein Rv2114 [Mycobacterium tuberculosis H37Rv]
gi|31793294|ref|NP_855787.1| hypothetical protein Mb2138 [Mycobacterium bovis AF2122/97]
gi|121637996|ref|YP_978220.1| hypothetical protein BCG_2131 [Mycobacterium bovis BCG str. Pasteur
1173P2]
41 more sequence titles
Length=207
Score = 424 bits (1091), Expect = 3e-117, Method: Compositional matrix adjust.
Identities = 207/207 (100%), Positives = 207/207 (100%), Gaps = 0/207 (0%)
Query 1 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 60
MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA
Sbjct 1 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 60
Query 61 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTV 120
ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTV
Sbjct 61 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTV 120
Query 121 FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS 180
FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS
Sbjct 121 FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS 180
Query 181 ELRVDTTNREARVLQDDLTNSYSLVTA 207
ELRVDTTNREARVLQDDLTNSYSLVTA
Sbjct 181 ELRVDTTNREARVLQDDLTNSYSLVTA 207
>gi|289447734|ref|ZP_06437478.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289420692|gb|EFD17893.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=212
Score = 419 bits (1076), Expect = 2e-115, Method: Compositional matrix adjust.
Identities = 206/207 (99%), Positives = 206/207 (99%), Gaps = 1/207 (0%)
Query 1 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 60
MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA
Sbjct 7 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 66
Query 61 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTV 120
ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQ SRNTV
Sbjct 67 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQ-SRNTV 125
Query 121 FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS 180
FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS
Sbjct 126 FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS 185
Query 181 ELRVDTTNREARVLQDDLTNSYSLVTA 207
ELRVDTTNREARVLQDDLTNSYSLVTA
Sbjct 186 ELRVDTTNREARVLQDDLTNSYSLVTA 212
>gi|308374493|ref|ZP_07436271.2| hypothetical protein TMFG_01073 [Mycobacterium tuberculosis SUMu006]
gi|308341722|gb|EFP30573.1| hypothetical protein TMFG_01073 [Mycobacterium tuberculosis SUMu006]
Length=198
Score = 406 bits (1043), Expect = 1e-111, Method: Compositional matrix adjust.
Identities = 197/198 (99%), Positives = 198/198 (100%), Gaps = 0/198 (0%)
Query 10 LSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLNGPR 69
+SGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLNGPR
Sbjct 1 MSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLNGPR 60
Query 70 YWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAGEEV 129
YWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAGEEV
Sbjct 61 YWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAGEEV 120
Query 130 YELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTTNR 189
YELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTTNR
Sbjct 121 YELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTTNR 180
Query 190 EARVLQDDLTNSYSLVTA 207
EARVLQDDLTNSYSLVTA
Sbjct 181 EARVLQDDLTNSYSLVTA 198
>gi|240171388|ref|ZP_04750047.1| hypothetical protein MkanA1_18896 [Mycobacterium kansasii ATCC
12478]
Length=214
Score = 364 bits (935), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 173/198 (88%), Positives = 185/198 (94%), Gaps = 0/198 (0%)
Query 10 LSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLNGPR 69
LSG+RYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDP+A+ATEH AA ALLNGPR
Sbjct 17 LSGKRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPKAIATEHGAAAALLNGPR 76
Query 70 YWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAGEEV 129
YWLMNAIEKAPQGPPV K+FGGIEMLQQATVLLSSMNPAPYTV+QVSRNTVF+FNAGEE+
Sbjct 77 YWLMNAIEKAPQGPPVIKSFGGIEMLQQATVLLSSMNPAPYTVNQVSRNTVFIFNAGEEI 136
Query 130 YELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTTNR 189
YEL+DP+GQRWVMQTWSQVVDPNLSRADLPKL +RLNLP+GWSY LT ELR+DTT R
Sbjct 137 YELRDPEGQRWVMQTWSQVVDPNLSRADLPKLADRLNLPSGWSYQPNRLTDELRIDTTAR 196
Query 190 EARVLQDDLTNSYSLVTA 207
ARVLQDDL NSYSLV A
Sbjct 197 AARVLQDDLANSYSLVMA 214
>gi|254775070|ref|ZP_05216586.1| hypothetical protein MaviaA2_10411 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=214
Score = 340 bits (871), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 167/214 (79%), Positives = 182/214 (86%), Gaps = 7/214 (3%)
Query 1 MSAP-------ERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQA 53
MSAP + L+G+RYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDP A
Sbjct 1 MSAPGSDGAVGKHALDLAGKRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPHA 60
Query 54 LATEHKAATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVS 113
+A EH AA ALLNGPRYWLMN IEK PQGP +TKTFGGIEM+QQATVLLSSMNPAPY +
Sbjct 61 IAKEHGAAMALLNGPRYWLMNGIEKQPQGPRITKTFGGIEMIQQATVLLSSMNPAPYIPN 120
Query 114 QVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSY 173
V+R+TVFVF+AG+E+YEL DP+ Q WVMQTWSQV DP LSRADLP L RL+LPAGWSY
Sbjct 121 TVNRHTVFVFDAGQEIYELIDPENQHWVMQTWSQVSDPTLSRADLPGLAGRLDLPAGWSY 180
Query 174 HTRVLTSELRVDTTNREARVLQDDLTNSYSLVTA 207
RVLTSELRVDTT+R ARVLQDDLTNSYSLVTA
Sbjct 181 QPRVLTSELRVDTTSRPARVLQDDLTNSYSLVTA 214
>gi|118464079|ref|YP_881601.1| hypothetical protein MAV_2401 [Mycobacterium avium 104]
gi|118165366|gb|ABK66263.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=214
Score = 339 bits (870), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 163/198 (83%), Positives = 177/198 (90%), Gaps = 0/198 (0%)
Query 10 LSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLNGPR 69
L+G+RYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDP A+A EH AA ALLNGPR
Sbjct 17 LAGKRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPHAIAKEHGAAMALLNGPR 76
Query 70 YWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAGEEV 129
YWLMN IEK PQGP +TKTFGGIEM+QQATVLLSSMNPAPY + V+R+TVFVF+AG+E+
Sbjct 77 YWLMNGIEKQPQGPRITKTFGGIEMIQQATVLLSSMNPAPYIPNTVNRHTVFVFDAGQEI 136
Query 130 YELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTTNR 189
YEL DP+ Q WVMQTWSQV DP LSRADLP L RL+LPAGWSY RVLTSELRVDTT+R
Sbjct 137 YELIDPENQHWVMQTWSQVSDPTLSRADLPGLAGRLDLPAGWSYQPRVLTSELRVDTTSR 196
Query 190 EARVLQDDLTNSYSLVTA 207
ARVLQDDLTNSYSLVTA
Sbjct 197 PARVLQDDLTNSYSLVTA 214
>gi|254819645|ref|ZP_05224646.1| hypothetical protein MintA_06969 [Mycobacterium intracellulare
ATCC 13950]
Length=201
Score = 336 bits (862), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 163/198 (83%), Positives = 172/198 (87%), Gaps = 0/198 (0%)
Query 7 VTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLN 66
+T LSG+RYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWS LD QA+A EH AATALLN
Sbjct 1 MTDLSGKRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSKLDAQAIAKEHGAATALLN 60
Query 67 GPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAG 126
GPRYWLMNAIEK QGP +TKTFGGIEM+QQATVLLSSMNPAPYT +QV+R+TVFVFN G
Sbjct 61 GPRYWLMNAIEKQRQGPQITKTFGGIEMIQQATVLLSSMNPAPYTANQVNRHTVFVFNPG 120
Query 127 EEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDT 186
EEVYEL DP GQRWVMQTWSQV DP LSRADLP L RLNLP GW+Y RVLT ELRVDT
Sbjct 121 EEVYELLDPGGQRWVMQTWSQVADPTLSRADLPGLAARLNLPHGWAYQPRVLTEELRVDT 180
Query 187 TNREARVLQDDLTNSYSL 204
R A V QDDLTNSYSL
Sbjct 181 RTRSAHVTQDDLTNSYSL 198
>gi|41407936|ref|NP_960772.1| hypothetical protein MAP1838 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41396290|gb|AAS04155.1| hypothetical protein MAP_1838 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336461990|gb|EGO40839.1| hypothetical protein MAPs_24890 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=214
Score = 336 bits (862), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 166/214 (78%), Positives = 181/214 (85%), Gaps = 7/214 (3%)
Query 1 MSAP-------ERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQA 53
MSAP + L+G+RYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDP A
Sbjct 1 MSAPGSDGAVGKHALDLAGKRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPHA 60
Query 54 LATEHKAATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVS 113
+A EH AA ALLNGPRYWLMN IEK PQGP +TKTFGGIEM+QQATVLLSSMNPAP +
Sbjct 61 IAKEHGAAMALLNGPRYWLMNGIEKQPQGPRITKTFGGIEMIQQATVLLSSMNPAPCIPN 120
Query 114 QVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSY 173
V+R+TVFVF+AG+E+YEL DP+ Q WVMQTWSQV DP LSRADLP L RL+LPAGWSY
Sbjct 121 TVNRHTVFVFDAGQEIYELIDPENQHWVMQTWSQVSDPTLSRADLPGLAGRLDLPAGWSY 180
Query 174 HTRVLTSELRVDTTNREARVLQDDLTNSYSLVTA 207
RVLTSELRVDTT+R ARVLQDDLTNSYSLVTA
Sbjct 181 QPRVLTSELRVDTTSRPARVLQDDLTNSYSLVTA 214
>gi|342859807|ref|ZP_08716460.1| hypothetical protein MCOL_13048 [Mycobacterium colombiense CECT
3035]
gi|342132939|gb|EGT86159.1| hypothetical protein MCOL_13048 [Mycobacterium colombiense CECT
3035]
Length=214
Score = 326 bits (835), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 159/212 (75%), Positives = 177/212 (84%), Gaps = 7/212 (3%)
Query 1 MSAPE-------RVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQA 53
MSAPE LSG+RYGEVLLVTPGEAGPQATVYNSFPLNDCP ELWSALD A
Sbjct 1 MSAPESDHAVGKHALDLSGKRYGEVLLVTPGEAGPQATVYNSFPLNDCPQELWSALDAHA 60
Query 54 LATEHKAATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVS 113
+ATEH AA ALLNGPRYWLMNAIEK +GP +TK+FGGIEM+QQATVLLSSMNPAPY +
Sbjct 61 IATEHGAAAALLNGPRYWLMNAIEKEARGPQITKSFGGIEMIQQATVLLSSMNPAPYIPN 120
Query 114 QVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSY 173
V+R+TVFVFNAG+EVYEL DP+ + W+MQTWSQV D LSRADLP L +RL+LPAGW+Y
Sbjct 121 TVNRHTVFVFNAGQEVYELIDPQSRHWIMQTWSQVADATLSRADLPGLADRLDLPAGWAY 180
Query 174 HTRVLTSELRVDTTNREARVLQDDLTNSYSLV 205
RVLT ELRVDTT R A+VLQD+LTNSYSLV
Sbjct 181 QPRVLTDELRVDTTQRPAQVLQDNLTNSYSLV 212
>gi|296165186|ref|ZP_06847733.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295899375|gb|EFG78834.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=214
Score = 325 bits (834), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 160/213 (76%), Positives = 178/213 (84%), Gaps = 7/213 (3%)
Query 1 MSAPE-------RVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQA 53
MSAPE L+G+RYGEVLLVT GEAGPQATVYNSFPLNDCPAELWSALDP A
Sbjct 1 MSAPESDHAVGKHALDLAGKRYGEVLLVTSGEAGPQATVYNSFPLNDCPAELWSALDPHA 60
Query 54 LATEHKAATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVS 113
+A E+ A ALLNGPRYWLMNAIEK QGP VTKTFGGIEM+QQATVLLSS NPAPY +
Sbjct 61 IAAENGVAAALLNGPRYWLMNAIEKEAQGPQVTKTFGGIEMIQQATVLLSSTNPAPYVPN 120
Query 114 QVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSY 173
+V+R+TVFVFNAG+++YEL DP GQ WVMQT SQV DPNLS+ADLP+L +RL+LPAGWSY
Sbjct 121 KVNRHTVFVFNAGQQIYELIDPHGQHWVMQTLSQVSDPNLSQADLPRLADRLDLPAGWSY 180
Query 174 HTRVLTSELRVDTTNREARVLQDDLTNSYSLVT 206
RVLT ELRVDT R A+VLQD+LTNSYSLVT
Sbjct 181 QPRVLTEELRVDTRTRAAQVLQDNLTNSYSLVT 213
>gi|183983089|ref|YP_001851380.1| hypothetical protein MMAR_3090 [Mycobacterium marinum M]
gi|183176415|gb|ACC41525.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=220
Score = 324 bits (830), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 157/204 (77%), Positives = 175/204 (86%), Gaps = 0/204 (0%)
Query 1 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 60
+S PE+V LSG+RYGEVLLV GE+GPQATVYNSFPLNDCPAELWSALD QALA E+
Sbjct 14 VSVPEQVEDLSGKRYGEVLLVEIGESGPQATVYNSFPLNDCPAELWSALDAQALAAENGV 73
Query 61 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTV 120
A ALLNGPRYWLMN+IEK PQG P TK+FGGIEML+QATV +SSM+PAPYTV++V+R+TV
Sbjct 74 AAALLNGPRYWLMNSIEKEPQGLPETKSFGGIEMLKQATVQMSSMSPAPYTVNRVNRHTV 133
Query 121 FVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTS 180
FVFNAG E+YEL DP GQRWVMQTWSQVVDPNL+RADLP L RL+LP GWSY RVL
Sbjct 134 FVFNAGAEIYELIDPGGQRWVMQTWSQVVDPNLARADLPGLAARLDLPEGWSYEPRVLAE 193
Query 181 ELRVDTTNREARVLQDDLTNSYSL 204
LRVDTTNR A V QDDL+NSYSL
Sbjct 194 TLRVDTTNRPAHVTQDDLSNSYSL 217
>gi|118617846|ref|YP_906178.1| hypothetical protein MUL_2336 [Mycobacterium ulcerans Agy99]
gi|118569956|gb|ABL04707.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=191
Score = 291 bits (744), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 142/188 (76%), Positives = 160/188 (86%), Gaps = 1/188 (0%)
Query 18 VLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLNGPRYWLMNAIE 77
+LLV GE+GPQATVYNSFPLNDCPAELWSALD QALA E+ A ALLNGPRYWLMN+IE
Sbjct 1 MLLVEIGESGPQATVYNSFPLNDCPAELWSALDAQALAAENGVAAALLNGPRYWLMNSIE 60
Query 78 KAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVS-RNTVFVFNAGEEVYELQDPK 136
K PQG P +K+FGGIEML+QATV +SSM+PAPYTV++V+ R+TVFVFNAG E+YEL DP
Sbjct 61 KEPQGLPESKSFGGIEMLKQATVQMSSMSPAPYTVNRVNRRHTVFVFNAGAEIYELIDPG 120
Query 137 GQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTTNREARVLQD 196
GQ WVMQTWSQVVDPNL+RADLP L RL+LP GWSY RVL LRVDTT+R A V QD
Sbjct 121 GQHWVMQTWSQVVDPNLARADLPGLAARLDLPEGWSYEPRVLAETLRVDTTDRPAHVTQD 180
Query 197 DLTNSYSL 204
DL+NSYSL
Sbjct 181 DLSNSYSL 188
>gi|145223658|ref|YP_001134336.1| hypothetical protein Mflv_3071 [Mycobacterium gilvum PYR-GCK]
gi|315443984|ref|YP_004076863.1| hypothetical protein Mspyr1_23860 [Mycobacterium sp. Spyr1]
gi|145216144|gb|ABP45548.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315262287|gb|ADT99028.1| hypothetical protein Mspyr1_23860 [Mycobacterium sp. Spyr1]
Length=200
Score = 265 bits (677), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 127/199 (64%), Positives = 154/199 (78%), Gaps = 0/199 (0%)
Query 7 VTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLN 66
++ L G+RYGEVLLV G+AGP+ATV+N++PLNDCPAELW+ LD QA+A EH A ALLN
Sbjct 1 MSSLFGRRYGEVLLVRMGDAGPEATVFNTYPLNDCPAELWNRLDAQAIAAEHHCAAALLN 60
Query 67 GPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAG 126
GPRYWLM+ IEK +TFGGIEMLQQATV LSSMNP+PY+V++V R VFV++ G
Sbjct 61 GPRYWLMSRIEKVGGTETPRETFGGIEMLQQATVSLSSMNPSPYSVNEVDRKAVFVYDPG 120
Query 127 EEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDT 186
V+EL DP+ + WVMQT+SQ VDP LS DLP LG+RLNLP GW Y R L +RV+T
Sbjct 121 TPVFELIDPEDRCWVMQTYSQTVDPELSVDDLPGLGDRLNLPDGWRYRARTLDQTVRVET 180
Query 187 TNREARVLQDDLTNSYSLV 205
R+ARVLQDDL NSYSL+
Sbjct 181 ATRKARVLQDDLANSYSLL 199
>gi|120404427|ref|YP_954256.1| hypothetical protein Mvan_3455 [Mycobacterium vanbaalenii PYR-1]
gi|119957245|gb|ABM14250.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=200
Score = 258 bits (660), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 130/200 (65%), Positives = 156/200 (78%), Gaps = 0/200 (0%)
Query 7 VTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLN 66
+T + G+RYGEVLLV GE GPQATVYN++PLNDCPAELW+ LD Q +A EH AA ALLN
Sbjct 1 MTNVFGKRYGEVLLVYVGENGPQATVYNTYPLNDCPAELWTKLDTQTVAAEHGAAAALLN 60
Query 67 GPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAG 126
GPRYWLM+ IEK +TFGGIEML+QATV LSSMNPAPY+V++V R +FV++AG
Sbjct 61 GPRYWLMSGIEKPGGTESERRTFGGIEMLRQATVALSSMNPAPYSVNEVDRKAIFVYDAG 120
Query 127 EEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDT 186
V+EL DP G+RWVMQT+SQ VDP L+ DLP L RL LPAGW+Y +R L + L VDT
Sbjct 121 TPVFELVDPDGRRWVMQTYSQTVDPALTLEDLPGLAARLTLPAGWTYRSRTLDAPLTVDT 180
Query 187 TNREARVLQDDLTNSYSLVT 206
+NR+A VLQDDL NSYSL +
Sbjct 181 SNRKASVLQDDLANSYSLTS 200
>gi|294993523|ref|ZP_06799214.1| hypothetical protein Mtub2_03189 [Mycobacterium tuberculosis
210]
Length=137
Score = 248 bits (633), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 124/131 (95%), Positives = 125/131 (96%), Gaps = 1/131 (0%)
Query 77 EKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAGEEVYELQDPK 136
E AP GP + FGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAGEEVYELQDPK
Sbjct 8 EGAP-GPAGDEDFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAGEEVYELQDPK 66
Query 137 GQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTTNREARVLQD 196
GQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTTNREARVLQD
Sbjct 67 GQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTTNREARVLQD 126
Query 197 DLTNSYSLVTA 207
DLTNSYSLVTA
Sbjct 127 DLTNSYSLVTA 137
>gi|284038595|ref|YP_003388525.1| hypothetical protein Slin_3725 [Spirosoma linguale DSM 74]
gi|283817888|gb|ADB39726.1| conserved hypothetical protein [Spirosoma linguale DSM 74]
Length=235
Score = 162 bits (410), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 85/200 (43%), Positives = 121/200 (61%), Gaps = 8/200 (4%)
Query 12 GQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLNGPRYW 71
G RY E+L+V+ ATVYN+ N CPA W A+D L E A + L+NGPRY+
Sbjct 37 GARYCEILVVSGKLNDLTATVYNTLGCNSCPASQWKAIDADKLKNELGAKSVLMNGPRYF 96
Query 72 LMNAIEKAPQGPPVTKTFGGIEMLQQATVLLS-----SMNPAPYTVSQVSRNTVFVFNAG 126
LM+ I ++ PP+ T GG+++ ++ATV +S PYT + V R+T +VFN G
Sbjct 97 LMDKIGQSNAAPPMV-TLGGLQLKKRATVPVSLRTVFEGKAKPYTETSVKRSTKYVFNKG 155
Query 127 EEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDT 186
VYEL P Q ++MQ+++Q+ DPNL+ DL L RL LP GW + TR+L ++L + T
Sbjct 156 SRVYELVSPDHQ-YIMQSYAQIADPNLTEKDLATLQTRLKLPKGWHFQTRLLPADLVLQT 214
Query 187 TN-REARVLQDDLTNSYSLV 205
+ EA V QDDL N+Y +
Sbjct 215 IDGGEAHVTQDDLMNTYQRI 234
>gi|145589984|ref|YP_001156581.1| hypothetical protein Pnuc_1804 [Polynucleobacter necessarius
subsp. asymbioticus QLW-P1DMWA-1]
gi|145048390|gb|ABP35017.1| conserved hypothetical protein [Polynucleobacter necessarius
subsp. asymbioticus QLW-P1DMWA-1]
Length=231
Score = 151 bits (381), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 81/203 (40%), Positives = 120/203 (60%), Gaps = 6/203 (2%)
Query 5 ERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATAL 64
+ V+ L QRY E+L + V+N+ LN CP + W + + +A ++ A+
Sbjct 27 KSVSNLRDQRYCEILYGKRHWLNLEVKVFNTQGLNLCPEDQWKTITKEEVAKKYDASFVD 86
Query 65 LNGPRYWLMNAIEKA-PQGPPVTKTFGGIEMLQQATV----LLSSMNPAPYTVSQVSRNT 119
LNGPRYW+M+ I+ A V ++FGGIEM +ATV L + Y+ +Q++R T
Sbjct 87 LNGPRYWMMDEIQAAGATANNVKESFGGIEMNLRATVDIGLLKQILGSKSYSPNQINRTT 146
Query 120 VFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLT 179
F++ AG +YEL P GQ +VMQ++SQ+V+PNL+ DLP L + L LP GW Y + +L
Sbjct 147 NFIYKAGSPIYELVAPDGQVYVMQSYSQIVNPNLTMKDLPNLAKELKLPTGWVYRSTLLE 206
Query 180 SELRVDTTNREARVLQDDLTNSY 202
+L + N A VLQD+L NSY
Sbjct 207 KDLSL-VANGIAYVLQDNLANSY 228
>gi|162451503|ref|YP_001613870.1| hypothetical protein sce3231 [Sorangium cellulosum 'So ce 56']
gi|161162085|emb|CAN93390.1| hypothetical protein sce3231 [Sorangium cellulosum 'So ce 56']
Length=240
Score = 138 bits (347), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 75/199 (38%), Positives = 112/199 (57%), Gaps = 7/199 (3%)
Query 10 LSGQRYGEVLLVTPGEAGPQAT--VYNSFPLNDCPAELWSALDPQALATEHKAATALLNG 67
L G RY E+LL T VYN+ LN+CP W A+D + E A ++NG
Sbjct 38 LRGSRYCEILLGDADLVAGSVTIDVYNTQGLNECPEAAWVAVDEAEVKAETMADVVVMNG 97
Query 68 PRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLL----SSMNPAPYTVSQVSRNTVFVF 123
PR+W++++ E + P +T GGIEM + T+ + +S PY V R+T + +
Sbjct 98 PRHWMIDSFEGSKVLDPEVRTLGGIEMRKTGTLTVALAEASGKAKPYETRAVRRDTTWGY 157
Query 124 NAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELR 183
+AG+ VYEL DP+G + +Q++S V + A L KLGE L LP GW++ TRV+ +L
Sbjct 158 DAGKSVYELVDPEGAIYELQSYS-VQEVQQDEASLAKLGETLTLPDGWAFRTRVIDEKLE 216
Query 184 VDTTNREARVLQDDLTNSY 202
V+ + A V+QDD N+Y
Sbjct 217 VEAVDGLAVVVQDDFGNTY 235
>gi|158421724|ref|YP_001523016.1| hypothetical protein AZC_0100 [Azorhizobium caulinodans ORS 571]
gi|158328613|dbj|BAF86098.1| hypothetical protein [Azorhizobium caulinodans ORS 571]
Length=268
Score = 136 bits (343), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/196 (38%), Positives = 112/196 (58%), Gaps = 4/196 (2%)
Query 14 RYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLNGPRYWLM 73
RY E+L ++ G G A V+N+ +DCP W L L A NGPR++LM
Sbjct 72 RYCELLPMSIGLNGVSAQVFNTLGHSDCPQANWDGLTDGELRKAFDALYTARNGPRFFLM 131
Query 74 NAIEKA---PQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAGEEVY 130
+ I + +G VT +E + + L+ ++ PY + R+TV+ F+AG+ V+
Sbjct 132 DQIIASGATAKGEVVTVNGITLEKRAEVQLTLAELHDKPYQERAIDRSTVYRFDAGKPVF 191
Query 131 ELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTTNRE 190
EL P G +VMQ+++Q+VDP L+ ADLP LG +L LPAGW+Y + +L + + N +
Sbjct 192 ELTSPDGSVYVMQSYAQIVDPKLTYADLPGLGAKLKLPAGWTYAMKTPAQDL-ILSANGK 250
Query 191 ARVLQDDLTNSYSLVT 206
A VLQDDL N+Y +T
Sbjct 251 ATVLQDDLKNTYQKIT 266
>gi|270157528|ref|ZP_06186185.1| conserved hypothetical protein [Legionella longbeachae D-4968]
gi|289164086|ref|YP_003454224.1| hypothetical protein LLO_0742 [Legionella longbeachae NSW150]
gi|269989553|gb|EEZ95807.1| conserved hypothetical protein [Legionella longbeachae D-4968]
gi|288857259|emb|CBJ11086.1| hypothetical protein LLO_0742 [Legionella longbeachae NSW150]
Length=234
Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/206 (36%), Positives = 115/206 (56%), Gaps = 8/206 (3%)
Query 1 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 60
S + + L GQRY E+L+ ++ VYN+ LN+CP +W + P + +E +
Sbjct 19 FSFAAQKSHLRGQRYCEILI---EKSRTDFAVYNTIGLNNCPERMWDKITPAVVKSETGS 75
Query 61 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMN----PAPYTVSQVS 116
+ LNGPRYW+++ ++ + P KTF G++M + + +S + A Y QV+
Sbjct 76 SFVHLNGPRYWVIDGLKNSDLVNPEVKTFDGLKMREAGILHISFWDLFRTGASYKQLQVA 135
Query 117 RNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTR 176
R+T +V++AG+ VYEL DPKG +VMQ++S P ++ L +LG +L LP W + T
Sbjct 136 RHTTWVYDAGKPVYELIDPKGNVYVMQSYSVQKTPQTEQS-LAQLGTKLKLPKKWQFKTG 194
Query 177 VLTSELRVDTTNREARVLQDDLTNSY 202
VL V N A V+QDD N+Y
Sbjct 195 VLKKTGTVPAINNMAIVIQDDFLNTY 220
>gi|52841347|ref|YP_095146.1| hypothetical protein lpg1113 [Legionella pneumophila subsp. pneumophila
str. Philadelphia 1]
gi|52628458|gb|AAU27199.1| hypothetical protein lpg1113 [Legionella pneumophila subsp. pneumophila
str. Philadelphia 1]
Length=246
Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/205 (35%), Positives = 113/205 (56%), Gaps = 7/205 (3%)
Query 1 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 60
+S + + + G+RY E++L + VYN++ LNDCP +LWS + A+ E A
Sbjct 37 LSYGAKTSNMRGKRYCEIIL---SKTISSYAVYNTWGLNDCPEQLWSKVSMPAVKKETGA 93
Query 61 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMN---PAPYTVSQVSR 117
+ LNGPRYW+++ + P KT GI M + + LS ++ PY V R
Sbjct 94 SFVHLNGPRYWVIDGFKNTSLINPAIKTISGIPMREAGILHLSLIDLFKNKPYQSHIVDR 153
Query 118 NTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRV 177
T +++ G+ V+EL DP GQ +VMQ++S P + +L +LG++L LP GW + T V
Sbjct 154 KTTWIYQEGKPVFELIDPTGQVFVMQSYSVQKYPQI-MDNLKQLGDKLQLPKGWKFKTGV 212
Query 178 LTSELRVDTTNREARVLQDDLTNSY 202
L ++ N +A V+QD+ N+Y
Sbjct 213 LNKLETIEAVNNKAVVVQDNFLNTY 237
>gi|148360208|ref|YP_001251415.1| hypothetical protein LPC_2140 [Legionella pneumophila str. Corby]
gi|296106738|ref|YP_003618438.1| hypothetical protein lpa_01731 [Legionella pneumophila 2300/99
Alcoy]
gi|148281981|gb|ABQ56069.1| conserved hypothetical protein [Legionella pneumophila str. Corby]
gi|295648639|gb|ADG24486.1| hypothetical protein lpa_01731 [Legionella pneumophila 2300/99
Alcoy]
Length=236
Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/205 (36%), Positives = 109/205 (54%), Gaps = 7/205 (3%)
Query 1 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 60
+S + + G+RY E++L + VYN++ LNDCP +LWS + A+ E +
Sbjct 27 LSYGAETSNMRGKRYCEIIL---AKTISSYAVYNTWGLNDCPEQLWSKVSISAVKKETGS 83
Query 61 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMN---PAPYTVSQVSR 117
+ LNGPRYW+++ + P KT GI M + + LS M+ PY V R
Sbjct 84 SFVHLNGPRYWVIDGFKNTSLINPAIKTISGIPMREAGILHLSLMDLFKNKPYQSHVVDR 143
Query 118 NTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRV 177
T +V+ A + V+EL DP GQ +VMQ++S P + L +LG +L LP GW + T V
Sbjct 144 KTTWVYQADKPVFELIDPNGQVFVMQSYSVQKYPQ-TMNTLTQLGAKLQLPKGWKFKTGV 202
Query 178 LTSELRVDTTNREARVLQDDLTNSY 202
L + N +A V+QD+ N+Y
Sbjct 203 LNKPETIQAVNNKAVVVQDNFLNTY 227
>gi|54297068|ref|YP_123437.1| hypothetical protein lpp1113 [Legionella pneumophila str. Paris]
gi|53750853|emb|CAH12264.1| hypothetical protein lpp1113 [Legionella pneumophila str. Paris]
Length=201
Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/196 (37%), Positives = 107/196 (55%), Gaps = 7/196 (3%)
Query 10 LSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLNGPR 69
+ G+RY E++L + VYN++ LNDCP +LWS + A+ E ++ LNGPR
Sbjct 1 MRGKRYCEIIL---AKTISSYAVYNTWGLNDCPEQLWSKVSISAVKKETGSSFVHLNGPR 57
Query 70 YWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMN---PAPYTVSQVSRNTVFVFNAG 126
YW+++ + P KT GI M + + LS M+ PY V R T +V+ A
Sbjct 58 YWVIDGFKNTSLINPAIKTISGIPMREAGILHLSLMDLFKNKPYQSHVVDRKTTWVYQAD 117
Query 127 EEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDT 186
+ V+EL DP GQ +VMQ++S P + L +LG +L LP GW + T VL ++
Sbjct 118 KPVFELIDPNGQVFVMQSYSVQKYPQ-TMNTLTQLGAKLQLPKGWKFKTGVLNKPETIEA 176
Query 187 TNREARVLQDDLTNSY 202
N +A V+QD+ N+Y
Sbjct 177 VNNKAVVVQDNFLNTY 192
>gi|307609860|emb|CBW99383.1| hypothetical protein LPW_11601 [Legionella pneumophila 130b]
Length=236
Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/205 (35%), Positives = 109/205 (54%), Gaps = 7/205 (3%)
Query 1 MSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKA 60
+S + + G+RY E++L + VYN++ LNDCP +LW+ + A+ E +
Sbjct 27 LSYGAETSNMRGKRYCEIILT---KTISSYAVYNTWGLNDCPEQLWNKVSISAVKKETGS 83
Query 61 ATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMN---PAPYTVSQVSR 117
+ LNGPRYW+++ + P KT GI M + + LS ++ PY V R
Sbjct 84 SFVHLNGPRYWVIDGFKNTSLINPAIKTISGIPMREAGILHLSLIDLFKNKPYQSHVVDR 143
Query 118 NTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRV 177
T +V+ A + V+EL DP GQ +VMQ++S P + L +LG +L LP GW + T V
Sbjct 144 KTTWVYQADKPVFELIDPNGQVFVMQSYSVQKYPQ-TMNTLTQLGAKLQLPKGWKFKTGV 202
Query 178 LTSELRVDTTNREARVLQDDLTNSY 202
L + N +A V+QD+ N+Y
Sbjct 203 LNKPETIQAVNNKAVVVQDNFLNTY 227
>gi|54294054|ref|YP_126469.1| hypothetical protein lpl1117 [Legionella pneumophila str. Lens]
gi|53753886|emb|CAH15355.1| hypothetical protein lpl1117 [Legionella pneumophila str. Lens]
Length=201
Score = 124 bits (311), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 68/196 (35%), Positives = 106/196 (55%), Gaps = 7/196 (3%)
Query 10 LSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLNGPR 69
+ G+RY E++L + VYN++ LNDCP +LW+ + A+ E ++ LNGPR
Sbjct 1 MRGKRYCEIILT---KTISSYAVYNTWGLNDCPEQLWNKVSISAVKKETGSSFVHLNGPR 57
Query 70 YWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMN---PAPYTVSQVSRNTVFVFNAG 126
YW+++ + P KT GI M + + LS ++ PY V R T +V+ A
Sbjct 58 YWVIDGFKNTSLINPAIKTISGIPMREAGILHLSLIDLFKNKPYQSHVVDRKTTWVYQAD 117
Query 127 EEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDT 186
+ V+EL DP GQ +VMQ++S P + L +LG +L LP GW + T +L +
Sbjct 118 KPVFELIDPNGQVFVMQSYSVQKYPQ-TMNTLTQLGAKLQLPKGWKFKTGMLNKPETIQA 176
Query 187 TNREARVLQDDLTNSY 202
N +A V+QD+ N+Y
Sbjct 177 VNNKAVVVQDNFLNTY 192
>gi|20091164|ref|NP_617239.1| hypothetical protein MA2329 [Methanosarcina acetivorans C2A]
gi|19916271|gb|AAM05719.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
Length=238
Score = 113 bits (283), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/202 (34%), Positives = 108/202 (54%), Gaps = 13/202 (6%)
Query 10 LSGQRYGEVLLVTPGEAGPQATVYNSFPLND-------CPAELWSALDPQALATEHKAAT 62
L G RY EV L+ G+AG + YN+ LN+ CP + + +A+ ++
Sbjct 31 LRGLRYCEVFLMC-GDAG--SGFYNTMGLNNEEDPRDTCPDSIMANFSTEAVKEQYNVPG 87
Query 63 ALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLL-SSMNPAPYTVSQVSRNTVF 121
LN PRY+++++ + P P + + F G++ TV ++ PY ++V R +
Sbjct 88 VALNPPRYFVLDSGD-IPVAPTM-RDFDGLKARWMGTVQAGAAFGKEPYMPTKVDRKSEI 145
Query 122 VFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSE 181
F+ G+ V+ L DP G WVM++++ VD NL+ DL L ++L LP+GWSY +VL +
Sbjct 146 FFDKGKPVFILDDPDGTPWVMKSYTDFVDKNLTYEDLNTLDKKLKLPSGWSYRVKVLDED 205
Query 182 LRVDTTNREARVLQDDLTNSYS 203
L + AR+ QDDL N Y
Sbjct 206 LILRPFKGTARITQDDLQNVYD 227
>gi|303245887|ref|ZP_07332169.1| conserved hypothetical protein [Desulfovibrio fructosovorans
JJ]
gi|302492670|gb|EFL52538.1| conserved hypothetical protein [Desulfovibrio fructosovorans
JJ]
Length=255
Score = 110 bits (276), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 67/207 (33%), Positives = 102/207 (50%), Gaps = 16/207 (7%)
Query 10 LSGQRYGEV-LLVTPGEAGPQATVYNSFPLND-------CPAELWSALDPQALATEHKAA 61
L G +Y E+ +LV E G +N+ LND CP +WS +D +AL ++
Sbjct 41 LRGVQYCEIWMLVGSPETGITGHYFNTSNLNDGTNKMDTCPQAMWSKVDAKALHDDYDTY 100
Query 62 TALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSS-----MNPAPYTVSQVS 116
T NGPR W M+++ P GP TF G++ +L PY +
Sbjct 101 TVFKNGPRGWTMDSVT-IPVGP--VDTFDGLKARWWGKGVLPKGADFKKGLEPYKPLKSH 157
Query 117 RNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTR 176
R +VF F GE V+ ++D +G WVMQ +S++VDP +S L LG+R+ +GW Y
Sbjct 158 RKSVFTFKKGEPVFIIEDAQGTPWVMQAFSKIVDPAMSYNALKTLGDRIKPASGWKYRVA 217
Query 177 VLTSELRVDTTNREARVLQDDLTNSYS 203
+ +L V T ++QD+ N+Y
Sbjct 218 IPEKDLVVSTPKGYNWIVQDEFGNTYD 244
>gi|20091163|ref|NP_617238.1| hypothetical protein MA2328 [Methanosarcina acetivorans C2A]
gi|19916270|gb|AAM05718.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
Length=314
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/210 (30%), Positives = 104/210 (50%), Gaps = 21/210 (10%)
Query 10 LSGQRYGEVLLVTPGEAGPQATVYNSFPLN-------DCPAELWSALDPQALATEHKAAT 62
L RY E+LL P +AG ++N+ LN PA+L++ + + ++
Sbjct 95 LRDYRYAEILLSCP-DAG--TGIFNTIGLNIRENPRDSLPADLFANFSETDVEEHYDSSM 151
Query 63 ALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLL---SSMNPAP--YTVSQVSR 117
+NGP W M+A++ + G++ A +++ ++++ A Y V
Sbjct 152 VWMNGPSNWTMDAMDVLI--AIRVRNLDGLDTRWGADIVVPEGANLSEAENVYMAMPVQC 209
Query 118 NTVFVFNAGEEVYELQDPKGQR-WVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTR 176
N + F+ G+ V+ L+D +VMQ++ Q++D NL+ DL L RL LP GWSY
Sbjct 210 NRTWHFDKGKPVFILEDSNNNTTYVMQSYCQIIDKNLTYEDLQTLDTRLELPPGWSYRVE 269
Query 177 VLTSELRVD---TTNREARVLQDDLTNSYS 203
VL +L ++ T + +V QD L N+YS
Sbjct 270 VLPEDLEMNGIGTNGTDWQVTQDSLQNTYS 299
>gi|171464106|ref|YP_001798219.1| hypothetical protein Pnec_1517 [Polynucleobacter necessarius
subsp. necessarius STIR1]
gi|171193644|gb|ACB44605.1| hypothetical protein Pnec_1517 [Polynucleobacter necessarius
subsp. necessarius STIR1]
Length=203
Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/124 (37%), Positives = 70/124 (57%), Gaps = 5/124 (4%)
Query 7 VTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATALLN 66
V+ L QRY EVL+ + V+N+ LN CP W+AL +++A + A+ LLN
Sbjct 35 VSNLHNQRYCEVLVGKRDWLKLEVRVFNTQGLNLCPEAQWNALTKESIAKTYDASFVLLN 94
Query 67 GPRYWLMNAIEKAPQG-PPVTKTFGGIEMLQQATVLLSSMN----PAPYTVSQVSRNTVF 121
GPRYW+M+ I+ A V +FGGI+M +A + LS + YT ++++R T F
Sbjct 95 GPRYWMMDEIQAAGNTVNDVKASFGGIKMNLRAIIQLSLLKQFIGSKHYTPNEIARTTNF 154
Query 122 VFNA 125
V+ +
Sbjct 155 VYKS 158
>gi|239907715|ref|YP_002954456.1| hypothetical protein DMR_30790 [Desulfovibrio magneticus RS-1]
gi|239797581|dbj|BAH76570.1| hypothetical protein [Desulfovibrio magneticus RS-1]
Length=269
Score = 75.9 bits (185), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 58/198 (30%), Positives = 87/198 (44%), Gaps = 20/198 (10%)
Query 3 APERVTGLSGQRYGEVLLVTPG-EAGPQATVYNSFPLNDCPA------ELWSALDPQALA 55
P + G+ + E+L + + G +NS ND PA + AL + L
Sbjct 45 CPIKAENWRGRAFYEILFMFRQPDGGGIGNYFNSLS-NDLPAPNEEMDARFRALRAETLM 103
Query 56 TEHKAATALLNGPRYWLMNAI---------EKAPQGPPVTKTFGGIEMLQQATVLLSSMN 106
E+ + NGPR + N + ++ G P+ GI + S
Sbjct 104 KEYGSNGVFFNGPRRLVANTVSGMSWDGCKQRVIAGIPLK--LDGIFEVPNLEKFASGKM 161
Query 107 PAPYTVSQVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLN 166
P Y R + FVF+AGE VYEL P+G + M + S +DP + +LP LG+RL
Sbjct 162 PT-YEPMVSKRTSSFVFHAGETVYELITPEGAVYTMFSLSLKIDPKNTIENLPTLGKRLT 220
Query 167 LPAGWSYHTRVLTSELRV 184
LP GW + +R L EL +
Sbjct 221 LPKGWQFRSRKLDKELNL 238
>gi|239904765|ref|YP_002951503.1| hypothetical protein DMR_01260 [Desulfovibrio magneticus RS-1]
gi|239794628|dbj|BAH73617.1| hypothetical protein [Desulfovibrio magneticus RS-1]
Length=510
Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 57/200 (29%), Positives = 91/200 (46%), Gaps = 18/200 (9%)
Query 3 APERVTGLSGQRYGEVLLVTPGEAGPQ-ATVYNSF-----PLNDCPAELWSALDPQALAT 56
P ++ G+ + E+L + + G YNS ++ + AL+ L
Sbjct 33 CPIKIENWRGKPFYEILFMNRKDDGRGVGYYYNSLGKEFEATDEVMDARFRALNADTLKK 92
Query 57 EHKAATALLNGPRYWLMNAI---------EKAPQGPPVTKTFGGIEMLQQATVLLSSMNP 107
E+ + L NGPR + N I E+ P+ + G E + + ++ P
Sbjct 93 EYGSDGILFNGPRRLVTNGITGMAWDGCKERVITTIPL-RVLGIFETPDLSKAVSGTL-P 150
Query 108 APYTVSQVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNL 167
A Y V R+ F FNAGE VYEL P+G + M + S D N + +LP LG+RL L
Sbjct 151 A-YEVLVSKRSNTFSFNAGETVYELITPEGAVYTMFSLSLKKDTNNTIENLPTLGKRLTL 209
Query 168 PAGWSYHTRVLTSELRVDTT 187
P GW + +R L ++ + +T
Sbjct 210 PQGWQFRSRKLDKDMMLTST 229
Score = 42.0 bits (97), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 50/218 (23%), Positives = 83/218 (39%), Gaps = 27/218 (12%)
Query 6 RVTGLSGQRYGEVLLV-TPGEAGPQ-ATVYN-SFPLNDCPA------ELWS-ALDPQALA 55
R+ L R+ E+ L + G A YN S N PA + W+ L+ +
Sbjct 289 RIDNLHKVRFAEIFLAHRDAKTGKMVAECYNTSLAPNAVPASKDTAPQGWAKGLNFNKMK 348
Query 56 TEHKAATALLNGPRYWLMNAIEKAPQGPPVTKTFGG--------IEMLQQATVLLSSMNP 107
+ A NGP+ W+ + IE V + F G ++M A + S
Sbjct 349 NKFGVLGASFNGPKLWMPDWIETLNG---VVRDFNGRNVPWVGRLDMGDNAGGVSES--- 402
Query 108 APYTVSQVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGER--L 165
PY ++R + + G L D +G W+M+ + + P + G+
Sbjct 403 TPYKPVTIARGDIGWYK-GTTALLLDDAEGNTWIMKGFQVGLKPAYTFEQFVAAGQSQFK 461
Query 166 NLPAGWSYHTRVLTSELRVDTTNREARVLQDDLTNSYS 203
LP GW + +VL +L A ++ D+ N Y
Sbjct 462 KLPPGWKFRIKVLDKDLTERPEGGVATIMVDEFFNVYD 499
>gi|158424379|ref|YP_001525671.1| hypothetical protein AZC_2755 [Azorhizobium caulinodans ORS 571]
gi|158331268|dbj|BAF88753.1| hypothetical protein [Azorhizobium caulinodans ORS 571]
Length=264
Score = 71.2 bits (173), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 59/218 (28%), Positives = 92/218 (43%), Gaps = 21/218 (9%)
Query 3 APERVTGLSGQRYGEVLLVTPG-EAGPQATVYNSF-----PLNDCPAELWSALDPQALAT 56
P + G+ + E+L + + G +NS D + AL+ + L
Sbjct 40 CPIKAENWRGRAFYEILFMFRQPDGGGIGNYFNSLSNKLPKSKDVMDARFRALNAETLKK 99
Query 57 EHKAATALLNGPRYWLMNAI---------EKAPQGPPVTKTFGGIEMLQQATVLLSSMNP 107
E NGPR + N I ++ G P+ G+ + +S P
Sbjct 100 EFGGDGVFFNGPRRLVANTITGMSWDGCKQRVIAGIPLN--LDGVFEVPSLEKFVSGSMP 157
Query 108 APYTVSQVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGERLNL 167
A Y R + +F AGE VYEL P+G + M + S +DP + +LP LG+RL L
Sbjct 158 A-YKPMVSKRTSSMLFKAGETVYELITPEGAVYTMFSLSLKIDPKNTIENLPTLGKRLTL 216
Query 168 PAGWSYHTRVLTSELRVDTT---NREARVLQDDLTNSY 202
PAGW + +R L ++ + T N V+ D L +Y
Sbjct 217 PAGWQFRSRKLDKDMVLTATADSNPPNTVVLDQLEGNY 254
>gi|149919613|ref|ZP_01908092.1| hypothetical protein PPSIR1_07068 [Plesiocystis pacifica SIR-1]
gi|149819556|gb|EDM78984.1| hypothetical protein PPSIR1_07068 [Plesiocystis pacifica SIR-1]
Length=282
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 64/243 (27%), Positives = 100/243 (42%), Gaps = 53/243 (21%)
Query 4 PERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFPLNDCPAELWSALDPQALATEHKAATA 63
PE + G R E+ +T V+NS L+DCP +A+DPQ A
Sbjct 45 PEVRESVRGARVCELFELTLEGEHLAMDVWNSGDLHDCPDAWLAAVDPQRYA-------- 96
Query 64 LLNGPRYWLMNA-IEKAPQGPPVTKTFGGIE--------MLQQATVLL------------ 102
+ GPR+ ++ G PV +E M A VLL
Sbjct 97 -VGGPRWRSVDEQYTVDADGEPVGFDAEALEVPAGLGQDMFLAAQVLLMPLAVLEHMLGV 155
Query 103 --SSMNPAP----------------YTVSQVSR--NTVFVFNAGEEVYELQDPKGQRWVM 142
S++ P Y +++V R T V +AG EV+ L D + R+ M
Sbjct 156 DIESLDDLPPMVHQTILDGTLATEGYAINEVERALTTRMVHHAGSEVFVLDDGEC-RYAM 214
Query 143 QTWSQVVDPNLSRAD-LPKLGERL-NLPAGWSYHTRVLTSELRVDTTNREARVLQDDLTN 200
+ ++ +VDP L+ D + +LG++ +LP GW + +L V + A V+ D+ N
Sbjct 215 KYYTNIVDPTLTNEDAVAELGDKFEHLPQGWRFEVLSFEEDLVVAELDGVAHVIADEFGN 274
Query 201 SYS 203
SY
Sbjct 275 SYD 277
>gi|323447988|gb|EGB03893.1| hypothetical protein AURANDRAFT_67647 [Aureococcus anophagefferens]
Length=349
Score = 58.5 bits (140), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/102 (38%), Positives = 54/102 (53%), Gaps = 12/102 (11%)
Query 83 PPVTKTFGGIEMLQQATVLLSSM--------NPAPYTVSQVSRNTVFVFNAGEEVYELQD 134
PP + GG+E QA + S Y V+R+ V V+ AG V+EL D
Sbjct 113 PP--RALGGVEYAVQARLPFESAAAFEGWGDGGLAYEGVLVNRSAVMVWEAGSTVFELVD 170
Query 135 PKGQRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTR 176
G+R+VMQ+ SQ+V NL+ +DL L +LP GWS+ +R
Sbjct 171 AAGKRYVMQSLSQIVVENLAPSDLEALPR--DLPEGWSFRSR 210
>gi|148262883|ref|YP_001229589.1| paraquat-inducible protein A [Geobacter uraniireducens Rf4]
gi|146396383|gb|ABQ25016.1| Paraquat-inducible protein A [Geobacter uraniireducens Rf4]
Length=229
Score = 43.5 bits (101), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 19/49 (39%), Positives = 28/49 (58%), Gaps = 0/49 (0%)
Query 157 DLPKLGERLNLPAGWSYHTRVLTSELRVDTTNREARVLQDDLTNSYSLV 205
DL LG RL LP GW + + +L +L T N + + QD++ N+Y V
Sbjct 172 DLKDLGSRLKLPPGWKFRSPILEQDLVFMTDNGKTHITQDEIGNTYDRV 220
>gi|209965764|ref|YP_002298679.1| DNA polymerase I, putative [Rhodospirillum centenum SW]
gi|209959230|gb|ACI99866.1| DNA polymerase I, putative [Rhodospirillum centenum SW]
Length=987
Score = 38.5 bits (88), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 30/76 (40%), Positives = 38/76 (50%), Gaps = 9/76 (11%)
Query 7 VTGLSGQRYGEVLLVTPGEAGPQATVY-NSFPLNDCPAELWSALD-PQALATEHKAATAL 64
+ G+SG G L +TPGEAG Y FP EL + ++ +A A EH T L
Sbjct 824 IYGISGFGLGRQLGITPGEAGAFIRQYFERFP------ELQTYMETTKAFAREHGYVTTL 877
Query 65 LNGPRYWLMNAIEKAP 80
L G R W+ EKAP
Sbjct 878 L-GRRCWIQGIREKAP 892
>gi|291410118|ref|XP_002721338.1| PREDICTED: runt-related transcription factor 1-like isoform 4
[Oryctolagus cuniculus]
Length=399
Score = 35.8 bits (81), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 61/140 (44%), Gaps = 14/140 (10%)
Query 50 DPQALATEHKAATALLNGPRYWLMNAIEKAPQGPPVTKTF----GGIEMLQQATVLLSSM 105
+P +AT H+A ++GPR + + Q P + +F +E L++ + +S
Sbjct 155 NPPQVATYHRAIKITVDGPREPRRHRQKLDDQTKPGSLSFSERLSELEQLRRTAMRVSPH 214
Query 106 NPAPYTVSQVSRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVDPNLSRADLPKLGE-R 164
+PAP + S N FN P+GQ Q DP A LP + + R
Sbjct 215 HPAPTPNPRASLNHSTAFNP--------QPQGQMQGTQELGPFSDPRQFPA-LPSISDPR 265
Query 165 LNLPAGWSYHTRVLTSELRV 184
++ P ++Y +TS + +
Sbjct 266 MHYPGAFTYSPTPVTSGIGI 285
>gi|149923996|ref|ZP_01912380.1| hypothetical protein PPSIR1_06531 [Plesiocystis pacifica SIR-1]
gi|149815125|gb|EDM74677.1| hypothetical protein PPSIR1_06531 [Plesiocystis pacifica SIR-1]
Length=1330
Score = 35.8 bits (81), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 27/94 (29%), Positives = 43/94 (46%), Gaps = 7/94 (7%)
Query 69 RYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAGEE 128
R W+ NA A G GIE T +++P P S++ + T F N +
Sbjct 69 RAWISNAQLTAGSG--FVAYVAGIEA-TDVTATSIALSPDPLESSEIKKTTNFRSNTKVD 125
Query 129 VYELQDPK----GQRWVMQTWSQVVDPNLSRADL 158
+Y++++PK G++ V VV P +S DL
Sbjct 126 LYDVKEPKDAPGGKKQVTVKKGGVVAPTMSIGDL 159
>gi|338780768|gb|EGP45169.1| sigma-E factor regulatory protein [Achromobacter xylosoxidans
AXX-A]
Length=359
Score = 35.0 bits (79), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 33/97 (35%), Positives = 47/97 (49%), Gaps = 12/97 (12%)
Query 104 SMNPAPYTVSQVSRNTVFVFNAGEEVYELQDPKG--QRWVMQTWSQVVDPNLSRADLPKL 161
++N A V QVS + G EV DPK RW + W +V++P++ DL L
Sbjct 200 TLNAARGVVEQVSFTS---LRLGAEV----DPKSLSSRWNTRDW-KVLEPSMKTVDLGAL 251
Query 162 GERLNLPAGWSYHTRVLTSELRVDTTNREARVLQDDL 198
G R+ P G++ +V S R T N+ VL D L
Sbjct 252 GWRIPAPKGFTVVMQVARSMGRGATVNQ--MVLSDGL 286
>gi|194673942|ref|XP_612405.4| PREDICTED: jumonji, AT rich interactive domain 1B [Bos taurus]
Length=1723
Score = 34.7 bits (78), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 33/133 (25%), Positives = 51/133 (39%), Gaps = 7/133 (5%)
Query 53 ALATEHKAATALLNGPRYWLMNAIEKAPQGPPVTKTFGGIEMLQQATVLLSSMNPAPYTV 112
A T AA ++ GPR WL ++ + PP+ K + LQ+ V L + Y +
Sbjct 1377 AFHTSCVAAPSIPQGPRVWLCPNCRRS-EKPPLEKILPLLASLQRIRVRLPEGDALRYMI 1435
Query 113 SQV---SRNTVFVFNAGEEVYELQDPKGQRWVMQTWSQVVD--PNLSRADLPKLGERLNL 167
+ + ++G + LQDP G + W P S+ P +L
Sbjct 1436 ERTVSWQHRARQLLSSG-HLKSLQDPVGSGLLCGRWQATAGQVPETSKMSQPPGPTSFSL 1494
Query 168 PAGWSYHTRVLTS 180
P W T L S
Sbjct 1495 PDDWDNRTSYLHS 1507
Lambda K H
0.314 0.130 0.388
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 236380426956
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40