BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0347
Length=328
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value
gi|15607488|ref|NP_214861.1| hypothetical protein Rv0347 [Mycoba... 661 0.0
gi|308231528|ref|ZP_07412778.2| conserved membrane protein [Myco... 551 5e-155
gi|323721256|gb|EGB30314.1| membrane protein [Mycobacterium tube... 542 3e-152
gi|289568258|ref|ZP_06448485.1| LOW QUALITY PROTEIN: conserved m... 423 2e-116
gi|336120957|ref|YP_004575744.1| hypothetical protein MLP_53270 ... 130 4e-28
gi|339293846|gb|AEJ45957.1| hypothetical protein CCDC5079_0767 [... 120 2e-25
gi|15840242|ref|NP_335279.1| hypothetical protein MT0852 [Mycoba... 120 3e-25
gi|15607971|ref|NP_215346.1| hypothetical protein Rv0831c [Mycob... 120 4e-25
gi|289442234|ref|ZP_06431978.1| conserved hypothetical protein [... 120 4e-25
gi|240171876|ref|ZP_04750535.1| hypothetical protein MkanA1_2135... 118 1e-24
gi|54023545|ref|YP_117787.1| hypothetical protein nfa15770 [Noca... 117 2e-24
gi|183984817|ref|YP_001853108.1| hypothetical protein MMAR_4849 ... 108 1e-21
gi|289760960|ref|ZP_06520338.1| conserved hypothetical protein [... 108 1e-21
gi|240171632|ref|ZP_04750291.1| hypothetical protein MkanA1_2011... 74.3 3e-11
gi|336120556|ref|YP_004575342.1| hypothetical protein MLP_49250 ... 55.8 1e-05
gi|167566722|ref|ZP_02359638.1| hypothetical protein BoklE_29451... 53.9 3e-05
gi|220915153|ref|YP_002490457.1| hypothetical protein A2cp1_0030... 53.1 6e-05
gi|126179006|ref|YP_001046971.1| hypothetical protein Memar_1056... 47.0 0.004
gi|87306732|ref|ZP_01088879.1| hypothetical protein DSM3645_1037... 46.6 0.005
gi|219852641|ref|YP_002467073.1| hypothetical protein Mpal_2051 ... 41.6 0.16
gi|56478815|ref|YP_160404.1| hypothetical protein ebA5936 [Aroma... 40.8 0.32
gi|300113075|ref|YP_003759650.1| hypothetical protein Nwat_0359 ... 40.0 0.45
>gi|15607488|ref|NP_214861.1| hypothetical protein Rv0347 [Mycobacterium tuberculosis H37Rv]
gi|15839733|ref|NP_334770.1| hypothetical protein MT0362 [Mycobacterium tuberculosis CDC1551]
gi|31791525|ref|NP_854018.1| hypothetical protein Mb0355 [Mycobacterium bovis AF2122/97]
53 more sequence titles
Length=328
Score = 661 bits (1705), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/328 (99%), Positives = 328/328 (100%), Gaps = 0/328 (0%)
Query 1 VPGARELTLRVERGALFRRRWAASAASSARAAIRRDPRRCALGTRPRWVSFLVIVLVIMN 60
+PGARELTLRVERGALFRRRWAASAASSARAAIRRDPRRCALGTRPRWVSFLVIVLVIMN
Sbjct 1 MPGARELTLRVERGALFRRRWAASAASSARAAIRRDPRRCALGTRPRWVSFLVIVLVIMN 60
Query 61 VVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETG 120
VVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETG
Sbjct 61 VVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETG 120
Query 121 EHTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCI 180
EHTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCI
Sbjct 121 EHTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCI 180
Query 181 RIGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDSL 240
RIGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDSL
Sbjct 181 RIGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDSL 240
Query 241 TLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAER 300
TLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAER
Sbjct 241 TLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAER 300
Query 301 LHTPIGPLFESLITSELRTKVLQQPGQE 328
LHTPIGPLFESLITSELRTKVLQQPGQE
Sbjct 301 LHTPIGPLFESLITSELRTKVLQQPGQE 328
>gi|308231528|ref|ZP_07412778.2| conserved membrane protein [Mycobacterium tuberculosis SUMu001]
gi|308369370|ref|ZP_07417524.2| conserved membrane protein [Mycobacterium tuberculosis SUMu002]
gi|308370381|ref|ZP_07421296.2| conserved membrane protein [Mycobacterium tuberculosis SUMu003]
19 more sequence titles
Length=270
Score = 551 bits (1420), Expect = 5e-155, Method: Compositional matrix adjust.
Identities = 270/270 (100%), Positives = 270/270 (100%), Gaps = 0/270 (0%)
Query 59 MNVVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLE 118
MNVVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLE
Sbjct 1 MNVVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLE 60
Query 119 TGEHTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDG 178
TGEHTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDG
Sbjct 61 TGEHTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDG 120
Query 179 CIRIGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGD 238
CIRIGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGD
Sbjct 121 CIRIGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGD 180
Query 239 SLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVA 298
SLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVA
Sbjct 181 SLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVA 240
Query 299 ERLHTPIGPLFESLITSELRTKVLQQPGQE 328
ERLHTPIGPLFESLITSELRTKVLQQPGQE
Sbjct 241 ERLHTPIGPLFESLITSELRTKVLQQPGQE 270
>gi|323721256|gb|EGB30314.1| membrane protein [Mycobacterium tuberculosis CDC1551A]
gi|339293402|gb|AEJ45513.1| hypothetical protein CCDC5079_0323 [Mycobacterium tuberculosis
CCDC5079]
gi|339297047|gb|AEJ49157.1| hypothetical protein CCDC5180_0320 [Mycobacterium tuberculosis
CCDC5180]
Length=267
Score = 542 bits (1397), Expect = 3e-152, Method: Compositional matrix adjust.
Identities = 266/267 (99%), Positives = 267/267 (100%), Gaps = 0/267 (0%)
Query 62 VTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETGE 121
+TAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETGE
Sbjct 1 MTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETGE 60
Query 122 HTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIR 181
HTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIR
Sbjct 61 HTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIR 120
Query 182 IGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDSLT 241
IGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDSLT
Sbjct 121 IGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDSLT 180
Query 242 LRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAERL 301
LRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAERL
Sbjct 181 LRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAERL 240
Query 302 HTPIGPLFESLITSELRTKVLQQPGQE 328
HTPIGPLFESLITSELRTKVLQQPGQE
Sbjct 241 HTPIGPLFESLITSELRTKVLQQPGQE 267
>gi|289568258|ref|ZP_06448485.1| LOW QUALITY PROTEIN: conserved membrane protein [Mycobacterium
tuberculosis T17]
gi|289542011|gb|EFD45660.1| LOW QUALITY PROTEIN: conserved membrane protein [Mycobacterium
tuberculosis T17]
Length=211
Score = 423 bits (1087), Expect = 2e-116, Method: Compositional matrix adjust.
Identities = 210/211 (99%), Positives = 211/211 (100%), Gaps = 0/211 (0%)
Query 1 VPGARELTLRVERGALFRRRWAASAASSARAAIRRDPRRCALGTRPRWVSFLVIVLVIMN 60
+PGARELTLRVERGALFRRRWAASAASSARAAIRRDPRRCALGTRPRWVSFLVIVLVIMN
Sbjct 1 MPGARELTLRVERGALFRRRWAASAASSARAAIRRDPRRCALGTRPRWVSFLVIVLVIMN 60
Query 61 VVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETG 120
VVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETG
Sbjct 61 VVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETG 120
Query 121 EHTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCI 180
EHTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCI
Sbjct 121 EHTAHSQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCI 180
Query 181 RIGLRYINEIRASLAEPSGWAYWVAESLLGP 211
RIGLRYINEIRASLAEPSGWAYWVAESLLGP
Sbjct 181 RIGLRYINEIRASLAEPSGWAYWVAESLLGP 211
>gi|336120957|ref|YP_004575744.1| hypothetical protein MLP_53270 [Microlunatus phosphovorus NM-1]
gi|334688756|dbj|BAK38341.1| hypothetical protein MLP_53270 [Microlunatus phosphovorus NM-1]
Length=273
Score = 130 bits (326), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 92/276 (34%), Positives = 136/276 (50%), Gaps = 14/276 (5%)
Query 59 MNVVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLE 118
M+ V YP+ P+ L+ IE+RHP EP ++ + + P+ + V ++
Sbjct 1 MSCVREREIYPSAPIVLMAIEVRHPLCEPLDRKQVTDMSARVKHLLPLPSEMNEVSVTVQ 60
Query 119 TGEHTAHSQKKLV-------ARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQ 171
G Q+++V +RD+RTA++ RPD++ +E T+Y ++ R ++ ++ AR
Sbjct 61 AGSDGPPVQQQVVRSFPRWTSRDKRTALSVRPDSLVIETTNYGSYDRMRELLDIVLLARL 120
Query 172 DVAPVDGCIRIGLRYINEIRASLAEPSG---WAYWVAESLLGPGTQLADLKLTTTAQRHV 228
VA G RIGLRYI+EIR S W WV SLLGP A+L L V
Sbjct 121 AVAAPAGVERIGLRYIDEIRVPAENGSSVPTWEQWVDASLLGPAHVGAELSLVPVVNEGV 180
Query 229 IQCEGPEPGDSLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPA 288
G +L LRY +QSTP L+R G F +DIDS W +P
Sbjct 181 FVFSGGS-DHALVLRYGAQSDYAVQSTPDLRRPLP--PPGPLFKLDIDSFWQ-AADEVPE 236
Query 289 LDAHLVDEVAERLHTPIGPLFESLITSELRTKVLQQ 324
D L+ A+ LH P+ +FES+IT LR +VL+
Sbjct 237 FDVDLILRQADALHEPVRGVFESVITDRLREEVLRN 272
>gi|339293846|gb|AEJ45957.1| hypothetical protein CCDC5079_0767 [Mycobacterium tuberculosis
CCDC5079]
Length=271
Score = 120 bits (302), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/266 (34%), Positives = 131/266 (50%), Gaps = 16/266 (6%)
Query 69 PNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLET--GEHTAHS 126
PN P+ALV +E+RHP T+ SA LK L PI Q + + G T +
Sbjct 12 PNAPVALVTVEIRHPTTDSLTESANRALKHLLINDLPIERQAQDVSWGMTAPGGAPTPVA 71
Query 127 QK--KLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGL 184
+ + V RD TA + + A+ +E T Y +E F +V +V AR V+ + G RIGL
Sbjct 72 DRFVRYVNRDNTTAASLKNQAIVVETTAYRSFEAFTDVVMRVVDARAQVSSIVGLERIGL 131
Query 185 RYINEIRASLAEPSG------WAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGD 238
R++ EIR P+G W+ W+ E LLGP + T Q + E +PG
Sbjct 132 RFVLEIRV----PAGVDGRITWSNWIDEQLLGPQRFTPGGLVLTEWQGAAVYRE-LQPGK 186
Query 239 SLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVA 298
SL +RY G + L+R+ P G FFL+DIDS W+ IP + +
Sbjct 187 SLIVRYGPGMGQALDPNYHLRRIT-PAQTGPFFLLDIDSFWTPSGGSIPEYNRDALVSTF 245
Query 299 ERLHTPIGPLFESLITSELRTKVLQQ 324
+ L+ P +F+ +ITS L+ ++L+Q
Sbjct 246 QDLYGPAQVVFQEMITSRLKDELLRQ 271
>gi|15840242|ref|NP_335279.1| hypothetical protein MT0852 [Mycobacterium tuberculosis CDC1551]
gi|13880400|gb|AAK45093.1| hypothetical protein MT0852 [Mycobacterium tuberculosis CDC1551]
Length=305
Score = 120 bits (301), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 88/266 (34%), Positives = 131/266 (50%), Gaps = 16/266 (6%)
Query 69 PNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLET--GEHTAHS 126
PN P+ALV +E+RHP T+ SA LK L PI Q + + G T +
Sbjct 46 PNAPVALVTVEIRHPTTDSLTESANRELKHLLINDLPIERQAQDVSWGMTAPGGAPTPVA 105
Query 127 QK--KLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGL 184
+ + V RD TA + + A+ +E T Y +E F +V +V AR V+ + G RIGL
Sbjct 106 DRFVRYVNRDNTTAASLKNQAIVVETTAYRSFEAFTDVVMRVVDARAQVSSIVGLERIGL 165
Query 185 RYINEIRASLAEPSG------WAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGD 238
R++ EIR P+G W+ W+ E LLGP + T Q + E +PG
Sbjct 166 RFVLEIRV----PAGVDGRITWSNWIDEQLLGPQRFTPGGLVLTEWQGAAVYRE-LQPGK 220
Query 239 SLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVA 298
SL +RY G + L+R+ P G FFL+DIDS W+ IP + +
Sbjct 221 SLIVRYGPGMGQALDPNYHLRRIT-PAQTGPFFLLDIDSFWTPSGGSIPEYNRDALVSTF 279
Query 299 ERLHTPIGPLFESLITSELRTKVLQQ 324
+ L+ P +F+ +ITS L+ ++L+Q
Sbjct 280 QDLYGPAQVVFQEMITSRLKDELLRQ 305
>gi|15607971|ref|NP_215346.1| hypothetical protein Rv0831c [Mycobacterium tuberculosis H37Rv]
gi|31792019|ref|NP_854512.1| hypothetical protein Mb0854c [Mycobacterium bovis AF2122/97]
gi|121636755|ref|YP_976978.1| hypothetical protein BCG_0884c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
69 more sequence titles
Length=271
Score = 120 bits (300), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 88/266 (34%), Positives = 131/266 (50%), Gaps = 16/266 (6%)
Query 69 PNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLET--GEHTAHS 126
PN P+ALV +E+RHP T+ SA LK L PI Q + + G T +
Sbjct 12 PNAPVALVTVEIRHPTTDSLTESANRELKHLLINDLPIERQAQDVSWGMTAPGGAPTPVA 71
Query 127 QK--KLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGL 184
+ + V RD TA + + A+ +E T Y +E F +V +V AR V+ + G RIGL
Sbjct 72 DRFVRYVNRDNTTAASLKNQAIVVETTAYRSFEAFTDVVMRVVDARAQVSSIVGLERIGL 131
Query 185 RYINEIRASLAEPSG------WAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGD 238
R++ EIR P+G W+ W+ E LLGP + T Q + E +PG
Sbjct 132 RFVLEIRV----PAGVDGRITWSNWIDEQLLGPQRFTPGGLVLTEWQGAAVYRE-LQPGK 186
Query 239 SLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVA 298
SL +RY G + L+R+ P G FFL+DIDS W+ IP + +
Sbjct 187 SLIVRYGPGMGQALDPNYHLRRIT-PAQTGPFFLLDIDSFWTPSGGSIPEYNRDALVSTF 245
Query 299 ERLHTPIGPLFESLITSELRTKVLQQ 324
+ L+ P +F+ +ITS L+ ++L+Q
Sbjct 246 QDLYGPAQVVFQEMITSRLKDELLRQ 271
>gi|289442234|ref|ZP_06431978.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289568784|ref|ZP_06449011.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289749347|ref|ZP_06508725.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289415153|gb|EFD12393.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289542538|gb|EFD46186.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289689934|gb|EFD57363.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=271
Score = 120 bits (300), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 88/266 (34%), Positives = 131/266 (50%), Gaps = 16/266 (6%)
Query 69 PNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLET--GEHTAHS 126
PN P+ALV +E+RHP T+ SA LK L PI Q + + G T +
Sbjct 12 PNAPVALVTVEIRHPTTDSLTESANRELKHLLINDLPIERQAQDVSWGMTAPGGAPTPVA 71
Query 127 QK--KLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGL 184
+ + V RD TA + + A+ +E T Y +E F +V +V AR V+ + G RIGL
Sbjct 72 DRFVRYVNRDNTTAASLKNQAIVVETTAYRSFEAFTDVVMRVVDARAQVSSIVGLERIGL 131
Query 185 RYINEIRASLAEPSG------WAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGD 238
R++ EIR P+G W+ W+ E LLGP + T Q + E +PG
Sbjct 132 RFVLEIRV----PAGVDGRIMWSNWIDEQLLGPQRFTPGGLVLTEWQGAAVYRE-LQPGK 186
Query 239 SLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVA 298
SL +RY G + L+R+ P G FFL+DIDS W+ IP + +
Sbjct 187 SLIVRYGPGMGQALDPNYHLRRIT-PAQTGPFFLLDIDSFWTPSGGSIPEYNRDALVSTF 245
Query 299 ERLHTPIGPLFESLITSELRTKVLQQ 324
+ L+ P +F+ +ITS L+ ++L+Q
Sbjct 246 QDLYGPAQVVFQEMITSRLKDELLRQ 271
>gi|240171876|ref|ZP_04750535.1| hypothetical protein MkanA1_21350 [Mycobacterium kansasii ATCC
12478]
Length=267
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/262 (33%), Positives = 127/262 (49%), Gaps = 8/262 (3%)
Query 69 PNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQ-EEVRQVNLETGEHTAHSQ 127
PN P+ALV +E+RHP T+ S LK L PI Q ++V G
Sbjct 8 PNAPVALVTMEIRHPATDSLTESTSRELKHLLINDLPIERQAQDVSWGVTAPGAAPTPVA 67
Query 128 KKLV---ARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGL 184
+ V RD + + + A+ +E + Y +E F +V + AR V+ + G RIGL
Sbjct 68 DRFVRYGNRDNTVSASLKNQAIVVETSAYRDFETFCDLVLRVADARAQVSSIVGVERIGL 127
Query 185 RYINEIRASLAEPS--GWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDSLTL 242
RY+ EIR + W W+ E LLGP ++A L+ T + P+PG SL L
Sbjct 128 RYVLEIRVPVGVDGRVNWGNWIDEQLLGP-YRIAPGGLSLTEWQGAAVYREPQPGKSLIL 186
Query 243 RYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAERLH 302
RY G + + L+R+ PP G FFL+DIDS W+ IP + + L+
Sbjct 187 RYGPGVGQALDQSYHLRRIT-PPQTGPFFLMDIDSFWTPVGGSIPEYNRDALVSTLTDLY 245
Query 303 TPIGPLFESLITSELRTKVLQQ 324
P +F+ LIT+ L+ ++L+Q
Sbjct 246 GPAREVFQDLITARLKDELLRQ 267
>gi|54023545|ref|YP_117787.1| hypothetical protein nfa15770 [Nocardia farcinica IFM 10152]
gi|54015053|dbj|BAD56423.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=261
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/259 (32%), Positives = 118/259 (46%), Gaps = 8/259 (3%)
Query 68 YPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLE--TGEHTAH 125
Y N P+A+V +E+RH T+ ++++L PI + + E T
Sbjct 7 YRNPPIAMVAVEIRHSGTDTVTEEGYRAIRQQLRHQWPIELPAKDVAIEFEGTNPSPTVV 66
Query 126 SQKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGLR 185
++ +RD TAI RP A T+E DY GWE R + A + R V+ G +R+GLR
Sbjct 67 EYRRYASRDLATAIVVRPGATTVETVDYKGWETLRQTLKAALDVRAAVSEPSGYVRVGLR 126
Query 186 YINEIRA-SLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDSLTLRY 244
YI+E+R W+ W+ SLL Q D H + P G + LRY
Sbjct 127 YIDEVRVPGDGIAPDWSEWMHPSLL--AAQPDDTAGLPLHDWHGLSAFKPADGHMVVLRY 184
Query 245 AGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAERLHTP 304
G ++ L+R P G FFL+DIDS W + IP + + LH P
Sbjct 185 GPRTGYAVEPDGHLKRPSSP--TGPFFLLDIDSFW-EVTGSIPEFAPDELVTKCDNLHAP 241
Query 305 IGPLFESLITSELRTKVLQ 323
I LFE L+T +LR +V
Sbjct 242 IRKLFEGLVTDKLRKEVFD 260
>gi|183984817|ref|YP_001853108.1| hypothetical protein MMAR_4849 [Mycobacterium marinum M]
gi|183178143|gb|ACC43253.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=270
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 82/261 (32%), Positives = 125/261 (48%), Gaps = 9/261 (3%)
Query 69 PNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQ-EEVRQVNLETGEHTAHSQ 127
PN P+ALV E+RHP T+ S+ LK L PI Q ++V G
Sbjct 12 PNAPVALVTAEIRHPATDSLTESSSRELKHLLINDLPIERQAQDVSWGMTAPGAAPTPVA 71
Query 128 KKLV---ARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGL 184
+ V RD + + + A+ +E + Y ++ F I+ + AR V+ + G RIGL
Sbjct 72 DRFVRYGNRDNTVSASLKNQAIVVETSAYSSFDNFCDILLRVADARAQVSSIVGVERIGL 131
Query 185 RYINEIR--ASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDSLTL 242
RY+ EIR A + W+ W+ E LLGP ++A L+ + +PG SL L
Sbjct 132 RYVLEIRVPAGVDGRIAWSNWIDEQLLGP-QRIAPGGLSMAEWQGAAVYREAQPGKSLIL 190
Query 243 RYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAERLH 302
RY G + + L+R+ G FFL+DIDS W+ P IP + + + L+
Sbjct 191 RYGPGMGQALDANYHLRRVTA-AQTGPFFLMDIDSFWT-PLGSIPEFNRDALVSTLQDLY 248
Query 303 TPIGPLFESLITSELRTKVLQ 323
P +F+ LIT LR ++L+
Sbjct 249 GPAREVFQDLITPRLRDELLR 269
>gi|289760960|ref|ZP_06520338.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
gi|289708466|gb|EFD72482.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
Length=297
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 82/252 (33%), Positives = 120/252 (48%), Gaps = 16/252 (6%)
Query 69 PNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLET--GEHTAHS 126
PN P+ALV +E+RHP T+ SA LK L PI Q + + G T +
Sbjct 46 PNAPVALVTVEIRHPTTDSLTESANRELKHLLINDLPIERQAQDVSWGMTAPGGAPTPVA 105
Query 127 QK--KLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGL 184
+ + V RD TA + + A+ +E T Y +E F +V +V AR V+ + G RIGL
Sbjct 106 DRFVRYVNRDNTTAASLKNQAIVVETTAYRSFEAFTDVVMRVVDARAQVSSIVGLERIGL 165
Query 185 RYINEIRASLAEPSG------WAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGD 238
R++ EIR P+G W+ W+ E LLGP + T Q + E +PG
Sbjct 166 RFVLEIRV----PAGVDGRITWSNWIDEQLLGPQRFTPGGLVLTEWQGAAVYRE-LQPGK 220
Query 239 SLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVA 298
SL +RY G + L+R+ P G FFL+DIDS W+ IP + +
Sbjct 221 SLIVRYGPGMGQALDPNYHLRRIT-PAQTGPFFLLDIDSFWTPSGGSIPEYNRDALVSTF 279
Query 299 ERLHTPIGPLFE 310
+ L+ P +F+
Sbjct 280 QDLYGPAQVVFQ 291
>gi|240171632|ref|ZP_04750291.1| hypothetical protein MkanA1_20118 [Mycobacterium kansasii ATCC
12478]
Length=271
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 69/267 (26%), Positives = 108/267 (41%), Gaps = 25/267 (9%)
Query 68 YPNDPLALVLIELRHP-----RTEPPVPSAISILKEELARWTPILEQEEVRQVNLETGE- 121
+P PLALV E+R R + + + L++ TP V G
Sbjct 8 FPKAPLALVTTEIRFTDSPRLRQQETLDAVAIALEDRFPLNTP------QTSVTFNVGSL 61
Query 122 ------HTAHSQKKLVARDRRT-AITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVA 174
++ ++ RT ++T P + E T Y +++FR V A+ A D
Sbjct 62 GPGVLPQVEQERRVVLTNTTRTESVTITPSSFICETTAYREFDDFRVGVTAVCEALIDAN 121
Query 175 PVDGCIRIGLRYINEIRA--SLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCE 232
+R+GLRYI+E+R + + WA W+ + ++ P T D Q V
Sbjct 122 VRPALVRVGLRYIDEVRVPEPITDVRAWAKWIDDGIIRPLTIGPDDVAVRNVQGLVTFDL 181
Query 233 GPEPGDSLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAH 292
G G L +YA + FL R + P G FF++D D + + LDA
Sbjct 182 G--DGKGLNFQYAALNQTPVVQPQFLNRGQFEP--GPFFVLDFDGFRDFGEQDVVRLDAD 237
Query 293 LVDEVAERLHTPIGPLFESLITSELRT 319
V V +H P G +F+ IT + R
Sbjct 238 EVTNVLTAVHDPTGAMFQRAITEDARN 264
>gi|336120556|ref|YP_004575342.1| hypothetical protein MLP_49250 [Microlunatus phosphovorus NM-1]
gi|334688354|dbj|BAK37939.1| hypothetical protein MLP_49250 [Microlunatus phosphovorus NM-1]
Length=284
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/268 (25%), Positives = 101/268 (38%), Gaps = 31/268 (11%)
Query 72 PLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETGEHTAHSQKKL- 130
PL +IE+R P + L +E P+L E+ +++ + G +
Sbjct 17 PLVYAVIEIRVPFAPRLSKGETAELLQEALAELPVLRAEKRQRLVPKDGNIQVEIEDGWR 76
Query 131 ---VARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGLRYI 187
+A R +T ++ E T YPG+E F A + + A G R+GLRYI
Sbjct 77 FLDLANSRSLVVT--NTSIVYETTRYPGFETFLGEFIACLHLISEHARPAGYERLGLRYI 134
Query 188 NEIRAS--LAEPSGWAYWVAESLL---------------GPGTQLADLKLTTTAQRHVIQ 230
NE+ + + W W+A L+ G QL DL++
Sbjct 135 NEVWPTRPVQSFDDWKQWIAPELVTTLVRSEGEVHRNVDGDRPQLRDLEVHLQFSL-ADN 193
Query 231 CEGPEPGDSLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALD 290
C +LT R A G + L+R P A G+F +ID D W I D
Sbjct 194 C-------ALTTRVATQTGLGVVGNDPLKRWVTPAAAGNFCVIDFDGFWPRIPDSIQPFD 246
Query 291 AHLVDEVAERLHTPIGPLFESLITSELR 318
+ + +H P+ F T E R
Sbjct 247 IEKISRQLKAVHNPVKGGFGWATTHEFR 274
>gi|167566722|ref|ZP_02359638.1| hypothetical protein BoklE_29451 [Burkholderia oklahomensis EO147]
Length=255
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/91 (32%), Positives = 42/91 (47%), Gaps = 0/91 (0%)
Query 127 QKKLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGLRY 186
Q+ A DR + AM + T Y +EE ++ A + A P R GLRY
Sbjct 74 QRSFFAIDRSRQLVLAAQAMFINYTSYSTYEETKAQFVAAIDAISASFPEAKVARFGLRY 133
Query 187 INEIRASLAEPSGWAYWVAESLLGPGTQLAD 217
INEI L +P+ W ++ + LLG + D
Sbjct 134 INEITVPLDDPTQWETYIDDRLLGSRSFFGD 164
>gi|220915153|ref|YP_002490457.1| hypothetical protein A2cp1_0030 [Anaeromyxobacter dehalogenans
2CP-1]
gi|219953007|gb|ACL63391.1| hypothetical protein A2cp1_0030 [Anaeromyxobacter dehalogenans
2CP-1]
Length=263
Score = 53.1 bits (126), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 67/226 (30%), Positives = 101/226 (45%), Gaps = 27/226 (11%)
Query 66 PKYP-----NDPLALVLIELRHPRTEPPVPSAISI-LKEELARWTPIL--EQEEVRQVNL 117
P+YP PL LV+ ++R P +A + ++ LA P+ EQ+ QV
Sbjct 5 PEYPRVLFKKSPLRLVVGQIRFPLQLRLADTAFTAPFQDALADDYPVAAREQQVAFQVTP 64
Query 118 ETGEHTAHSQK--KLVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAP 175
+ G A S+ + +R+ A+ A+TLEV Y EEF S ++ A ++
Sbjct 65 KGGLQAAPSETLLRFASRNGDWAVVLGESALTLEVRGYSAVEEFSSRFEKVLGAAKERLR 124
Query 176 VDGCIRIGLRYINEIRA----SLAEPSGWAYWVAESLLG-PGTQLADLKLTTTAQRHVIQ 230
+ R+GLRYINE R SLA+ WA + LLG G L L T + +
Sbjct 125 LRERSRLGLRYINEFRHDAGRSLAD---WAKLMNPELLGFAGNNL----LGGTVEHMAHE 177
Query 231 CEGPEPGDSLTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDID 276
L +R+ G V++ P P AEG F+L+D+D
Sbjct 178 VRVRRDDGVLAIRHGLLVGGVVEPIP-----TAPVAEGRFYLLDMD 218
>gi|126179006|ref|YP_001046971.1| hypothetical protein Memar_1056 [Methanoculleus marisnigri JR1]
gi|125861800|gb|ABN56989.1| hypothetical protein Memar_1056 [Methanoculleus marisnigri JR1]
Length=257
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 67/262 (26%), Positives = 98/262 (38%), Gaps = 36/262 (13%)
Query 67 KYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETG------ 120
KY N P A V+ E R P P + IL L P +Q VR+V + G
Sbjct 6 KYVNPPAAEVICEFRFPEDTPWDLTYPGILYSHLKDTYPKRDQRYVREVVMLLGPEGLRE 65
Query 121 EHTAHSQKKLVARDRRTAITFRPDAMTLEVTD-YPGWEEFRSIVHAMVTARQDVAPVDGC 179
E + +A D A+ P +++ Y WE F +H ++V D
Sbjct 66 ELLVAERSIFLAEDEGCAVQVGPRLLSVSCQKPYVHWEAFSEQIHGAFDRFREVIGTDAI 125
Query 180 IRIGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDS 239
+ LRY+N I E + Y+ L P +L + CE D
Sbjct 126 GTMNLRYVNFIEIPEREVTLSDYFAFYPTLPP-------ELPQVPAGFITGCEFSFHDDR 178
Query 240 LTLRYAGARGAVIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAE 299
R AV +ST E + FL++ID ++ + IP D VA+
Sbjct 179 DNCR-VELTDAVPEST-----------EHNAFLLNIDYYLTEG-QNIPT------DGVAD 219
Query 300 RL---HTPIGPLFESLITSELR 318
L HT + +FE+ I LR
Sbjct 220 WLEIAHTHVRDIFEACIRDALR 241
>gi|87306732|ref|ZP_01088879.1| hypothetical protein DSM3645_10372 [Blastopirellula marina DSM
3645]
gi|87290911|gb|EAQ82798.1| hypothetical protein DSM3645_10372 [Blastopirellula marina DSM
3645]
Length=289
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/60 (39%), Positives = 33/60 (55%), Gaps = 0/60 (0%)
Query 132 ARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGLRYINEIR 191
R RR AI D +T EV+ Y +EEF + A + + A + ++IGLRY+N IR
Sbjct 102 GRKRREAIILTKDFVTYEVSSYTNFEEFVARFSAALDVIANYANITEAVQIGLRYVNVIR 161
>gi|219852641|ref|YP_002467073.1| hypothetical protein Mpal_2051 [Methanosphaerula palustris E1-9c]
gi|219546900|gb|ACL17350.1| conserved hypothetical protein [Methanosphaerula palustris E1-9c]
Length=290
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 20/62 (33%), Positives = 33/62 (54%), Gaps = 1/62 (1%)
Query 130 LVARDRRTAITFRPDAMTLEVTD-YPGWEEFRSIVHAMVTARQDVAPVDGCIRIGLRYIN 188
+ ++RR I F +++ YP WE+FR I+ + A V + G RIGL Y++
Sbjct 117 FLTQNRRMFIQFGERIVSIHCLKPYPSWEKFRPIIEQVYGALSKVTEIKGVDRIGLLYVD 176
Query 189 EI 190
+I
Sbjct 177 KI 178
>gi|56478815|ref|YP_160404.1| hypothetical protein ebA5936 [Aromatoleum aromaticum EbN1]
gi|56314858|emb|CAI09503.1| hypothetical protein ebA5936 [Aromatoleum aromaticum EbN1]
Length=266
Score = 40.8 bits (94), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 66/269 (25%), Positives = 104/269 (39%), Gaps = 23/269 (8%)
Query 64 AHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETGEHT 123
+PK PL LVL E R R PSA+++LKE +A + + +QV L T
Sbjct 6 GYPKLERQPLTLVLAEFRFARVTFD-PSALAMLKERMASRFGEMTEGVAQQVQLGDAGVT 64
Query 124 AHSQKKLVAR--DRRTAITFRPDAMTLEVTDYPGWEEFR----SIVHAMVTARQDVAPVD 177
++ R + ++ D + T YP +E F +++ A++ Q V
Sbjct 65 IMPAPYVIWRAPESGASVHLEVDRIAYATTMYPRFEGFARDCLAVLEALIETLQPVT--- 121
Query 178 GCIRIGLRYINEIRASLAEPSGWAYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPG 237
R+GLRY + I + P E+ L P + LA + RH+ +
Sbjct 122 -LHRVGLRYNDAI---IPMPDELLEQYVEAPLLPFSPLA--AGSGAVVRHLSETVVQTTA 175
Query 238 DSLTLR-YAGARGAVIQSTPFLQ-----RLKEPPAEGDFFLIDIDSAWSDPCKGIPALDA 291
+L +R AG G + Q R+ PP + ++D D W A D
Sbjct 176 GALVVRALAGMHGLGMMPDLLGQFGLPLRVDVPP-DRPVAVLDFDHYWEVAEAEGVAFDV 234
Query 292 HLVDEVAERLHTPIGPLFESLITSELRTK 320
+ RLH P F + RT+
Sbjct 235 DTASDRLTRLHEPAREAFWKVTKEFARTQ 263
>gi|300113075|ref|YP_003759650.1| hypothetical protein Nwat_0359 [Nitrosococcus watsonii C-113]
gi|299539012|gb|ADJ27329.1| conserved hypothetical protein [Nitrosococcus watsonii C-113]
Length=259
Score = 40.0 bits (92), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 29/131 (23%), Positives = 57/131 (44%), Gaps = 3/131 (2%)
Query 62 VTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEELARWTPILEQEEVRQVNLETGE 121
+ + K N PL VL E R + I ++E L + PI + + + V ++ G
Sbjct 1 MDGYKKLENQPLKFVLAEFRFSPV-MQIAEYIPKIQEALRKQYPIEKTQSEQTVQVQPGG 59
Query 122 HTAHSQKK--LVARDRRTAITFRPDAMTLEVTDYPGWEEFRSIVHAMVTARQDVAPVDGC 179
+ + ++ D+++AI + + +YP ++ F + + D+
Sbjct 60 IAVSTVNRWAFISADKKSAIEINQERLVYITAEYPRFDGFSAACKQAIETLVDIVEPSLI 119
Query 180 IRIGLRYINEI 190
+RIGLRY + I
Sbjct 120 LRIGLRYSDLI 130
Lambda K H
0.320 0.136 0.412
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 580492275184
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40