BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0477
Length=148
Score E
Sequences producing significant alignments: (Bits) Value
gi|15607618|ref|NP_214991.1| hypothetical protein Rv0477 [Mycoba... 297 5e-79
gi|15839867|ref|NP_334904.1| hypothetical protein MT0495 [Mycoba... 295 1e-78
gi|289441859|ref|ZP_06431603.1| conserved secreted protein [Myco... 295 1e-78
gi|297632962|ref|ZP_06950742.1| secreted protein [Mycobacterium ... 288 3e-76
gi|342859021|ref|ZP_08715675.1| hypothetical protein MCOL_09093 ... 176 1e-42
gi|296168079|ref|ZP_06850145.1| conserved hypothetical protein [... 168 3e-40
gi|336460415|gb|EGO39313.1| Protein of unknown function (DUF2599... 165 2e-39
gi|118466135|ref|YP_883801.1| hypothetical protein MAV_4673 [Myc... 165 2e-39
gi|41410068|ref|NP_962904.1| hypothetical protein MAP3970 [Mycob... 164 5e-39
gi|254777110|ref|ZP_05218626.1| hypothetical protein MaviaA2_209... 164 5e-39
gi|254819805|ref|ZP_05224806.1| hypothetical protein MintA_07769... 157 6e-37
gi|118619640|ref|YP_907972.1| hypothetical protein MUL_4545 [Myc... 155 2e-36
gi|183980826|ref|YP_001849117.1| hypothetical protein MMAR_0802 ... 154 3e-36
gi|240169860|ref|ZP_04748519.1| hypothetical protein MkanA1_1113... 145 3e-33
gi|118471304|ref|YP_885324.1| hypothetical protein MSMEG_0921 [M... 137 4e-31
gi|126433249|ref|YP_001068940.1| hypothetical protein Mjls_0638 ... 132 1e-29
gi|108797625|ref|YP_637822.1| hypothetical protein Mmcs_0645 [My... 132 2e-29
gi|15828327|ref|NP_302590.1| hypothetical protein ML2452 [Mycoba... 129 2e-28
gi|333989829|ref|YP_004522443.1| hypothetical protein JDM601_118... 124 4e-27
gi|145220704|ref|YP_001131382.1| hypothetical protein Mflv_0098 ... 124 5e-27
gi|262200979|ref|YP_003272187.1| hypothetical protein Gbro_0983 ... 117 7e-25
gi|333989105|ref|YP_004521719.1| hypothetical protein JDM601_046... 115 2e-24
gi|120401825|ref|YP_951654.1| hypothetical protein Mvan_0810 [My... 112 2e-23
gi|169631159|ref|YP_001704808.1| hypothetical protein MAB_4081c ... 110 1e-22
gi|343925936|ref|ZP_08765451.1| hypothetical protein GOALK_050_0... 109 2e-22
gi|326383070|ref|ZP_08204759.1| hypothetical protein SCNU_09036 ... 107 7e-22
gi|296395132|ref|YP_003660016.1| hypothetical protein Srot_2752 ... 102 2e-20
gi|54024574|ref|YP_118816.1| hypothetical protein nfa26050 [Noca... 101 4e-20
gi|319949075|ref|ZP_08023171.1| hypothetical protein ES5_06712 [... 99.0 2e-19
gi|317509085|ref|ZP_07966714.1| hypothetical protein HMPREF9336_... 92.4 2e-17
gi|170780692|ref|YP_001709024.1| putative secreted protein [Clav... 70.5 8e-11
gi|336321827|ref|YP_004601795.1| hypothetical protein Celgi_2728... 66.2 2e-09
gi|269796752|ref|YP_003316207.1| hypothetical protein Sked_34850... 62.8 2e-08
gi|256833575|ref|YP_003162302.1| hypothetical protein Jden_2365 ... 61.2 5e-08
gi|269957024|ref|YP_003326813.1| hypothetical protein Xcel_2237 ... 57.0 1e-06
gi|332671683|ref|YP_004454691.1| hypothetical protein Celf_3190 ... 56.2 1e-06
gi|296130755|ref|YP_003638005.1| hypothetical protein Cfla_2923 ... 52.8 2e-05
gi|77458733|ref|YP_348239.1| hypothetical protein Pfl01_2508 [Ps... 50.1 1e-04
gi|170721932|ref|YP_001749620.1| hypothetical protein PputW619_2... 50.1 1e-04
gi|152986634|ref|YP_001348730.1| hypothetical protein PSPA7_3370... 47.8 6e-04
gi|218892129|ref|YP_002440996.1| putative halovibrin [Pseudomona... 47.4 7e-04
gi|15597110|ref|NP_250604.1| hypothetical protein PA1914 [Pseudo... 47.4 7e-04
gi|330821454|ref|YP_004350316.1| hypothetical protein bgla_2g237... 47.4 7e-04
gi|313111354|ref|ZP_07797168.1| putative halovibrin [Pseudomonas... 47.4 7e-04
gi|293393465|ref|ZP_06637776.1| conserved hypothetical protein [... 47.0 0.001
gi|50955885|ref|YP_063173.1| hypothetical protein Lxx24425 [Leif... 47.0 0.001
gi|115265542|dbj|BAF32804.1| conserved hypothetical protein [Pse... 46.6 0.001
gi|330965276|gb|EGH65536.1| hypothetical protein PSYAC_11621 [Ps... 46.6 0.001
gi|70729853|ref|YP_259592.1| hypothetical protein PFL_2485 [Pseu... 46.6 0.001
gi|107101349|ref|ZP_01365267.1| hypothetical protein PaerPA_0100... 46.6 0.001
>gi|15607618|ref|NP_214991.1| hypothetical protein Rv0477 [Mycobacterium tuberculosis H37Rv]
gi|31791657|ref|NP_854150.1| hypothetical protein Mb0487 [Mycobacterium bovis AF2122/97]
gi|121636393|ref|YP_976616.1| putative secreted protein [Mycobacterium bovis BCG str. Pasteur
1173P2]
69 more sequence titles
Length=148
Score = 297 bits (760), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 148/148 (100%), Positives = 148/148 (100%), Gaps = 0/148 (0%)
Query 1 MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSL 60
MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSL
Sbjct 1 MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSL 60
Query 61 RVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSW 120
RVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSW
Sbjct 61 RVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSW 120
Query 121 NLEPWRPVVDDSEMLASGCNPGSPEESF 148
NLEPWRPVVDDSEMLASGCNPGSPEESF
Sbjct 121 NLEPWRPVVDDSEMLASGCNPGSPEESF 148
>gi|15839867|ref|NP_334904.1| hypothetical protein MT0495 [Mycobacterium tuberculosis CDC1551]
gi|13880002|gb|AAK44718.1| hypothetical protein MT0495 [Mycobacterium tuberculosis CDC1551]
Length=181
Score = 295 bits (756), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 148/148 (100%), Positives = 148/148 (100%), Gaps = 0/148 (0%)
Query 1 MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSL 60
MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSL
Sbjct 34 MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSL 93
Query 61 RVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSW 120
RVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSW
Sbjct 94 RVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSW 153
Query 121 NLEPWRPVVDDSEMLASGCNPGSPEESF 148
NLEPWRPVVDDSEMLASGCNPGSPEESF
Sbjct 154 NLEPWRPVVDDSEMLASGCNPGSPEESF 181
>gi|289441859|ref|ZP_06431603.1| conserved secreted protein [Mycobacterium tuberculosis T46]
gi|289568397|ref|ZP_06448624.1| conserved secreted protein [Mycobacterium tuberculosis T17]
gi|289748967|ref|ZP_06508345.1| conserved secreted protein [Mycobacterium tuberculosis T92]
gi|289414778|gb|EFD12018.1| conserved secreted protein [Mycobacterium tuberculosis T46]
gi|289542150|gb|EFD45799.1| conserved secreted protein [Mycobacterium tuberculosis T17]
gi|289689554|gb|EFD56983.1| conserved secreted protein [Mycobacterium tuberculosis T92]
Length=148
Score = 295 bits (755), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 147/148 (99%), Positives = 147/148 (99%), Gaps = 0/148 (0%)
Query 1 MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSL 60
MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSL
Sbjct 1 MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSL 60
Query 61 RVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSW 120
RVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADT GMRAQFICHWQYAEIRQPGKPSW
Sbjct 61 RVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTGGMRAQFICHWQYAEIRQPGKPSW 120
Query 121 NLEPWRPVVDDSEMLASGCNPGSPEESF 148
NLEPWRPVVDDSEMLASGCNPGSPEESF
Sbjct 121 NLEPWRPVVDDSEMLASGCNPGSPEESF 148
>gi|297632962|ref|ZP_06950742.1| secreted protein [Mycobacterium tuberculosis KZN 4207]
Length=144
Score = 288 bits (736), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 143/144 (99%), Positives = 144/144 (100%), Gaps = 0/144 (0%)
Query 5 VAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSLRVYP 64
+AVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSLRVYP
Sbjct 1 MAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSLRVYP 60
Query 65 SQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSWNLEP 124
SQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSWNLEP
Sbjct 61 SQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSWNLEP 120
Query 125 WRPVVDDSEMLASGCNPGSPEESF 148
WRPVVDDSEMLASGCNPGSPEESF
Sbjct 121 WRPVVDDSEMLASGCNPGSPEESF 144
>gi|342859021|ref|ZP_08715675.1| hypothetical protein MCOL_09093 [Mycobacterium colombiense CECT
3035]
gi|342133262|gb|EGT86465.1| hypothetical protein MCOL_09093 [Mycobacterium colombiense CECT
3035]
Length=143
Score = 176 bits (445), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/148 (64%), Positives = 108/148 (73%), Gaps = 7/148 (4%)
Query 1 MKALVAVSAVAVVAL-LGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPS 59
M+AL+ A+VAL V A A P DPG+G A+ P +P +DHT+WAQW S
Sbjct 1 MRALLTAPIAALVALSCPVPHAVAAP--DPGSGSAD----PPAPPYIDHTQWAQWQGRSS 54
Query 60 LRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPS 119
LRV+PS GR ASR A AD AWAEVLALSP+ADT GMRAQF+CHWQ+AE QPGK S
Sbjct 55 LRVFPSPAGRAASRLQATAPADEAWAEVLALSPDADTPGMRAQFLCHWQFAEFAQPGKVS 114
Query 120 WNLEPWRPVVDDSEMLASGCNPGSPEES 147
WNLEPWRPVVDD+EM+ASGCNPG PEES
Sbjct 115 WNLEPWRPVVDDTEMVASGCNPGGPEES 142
>gi|296168079|ref|ZP_06850145.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295896886|gb|EFG76514.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=137
Score = 168 bits (425), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 91/149 (62%), Positives = 103/149 (70%), Gaps = 13/149 (8%)
Query 1 MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSL 60
MKAL+ V +VAL G PG A G +SP VDHTEWAQW SL
Sbjct 1 MKALLTAPTVVLVALWG----------PPGVAPAEPAG--TSPPFVDHTEWAQWHGRSSL 48
Query 61 RVYPSQVGRTAS-RRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPS 119
RV+P+ GR A+ R A AD AWAEVLALSP+ADTAGMRAQF+CHWQ+AE+ +PGK S
Sbjct 49 RVFPTPSGRLAAGRSSDPALADEAWAEVLALSPDADTAGMRAQFVCHWQFAEVAEPGKAS 108
Query 120 WNLEPWRPVVDDSEMLASGCNPGSPEESF 148
WNLEPWRPVVDD+EM ASGCNPG PEE F
Sbjct 109 WNLEPWRPVVDDAEMAASGCNPGGPEERF 137
>gi|336460415|gb|EGO39313.1| Protein of unknown function (DUF2599)/Protein of unknown function
(DUF2516) [Mycobacterium avium subsp. paratuberculosis
S397]
Length=230
Score = 165 bits (418), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 77/109 (71%), Positives = 92/109 (85%), Gaps = 1/109 (0%)
Query 41 SSPRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLG-MAAADAAWAEVLALSPEADTAGM 99
+ P +DHT+WAQW SLRV+P+ GRTA++ G MA AD AWAEVLAL+P+ADTAGM
Sbjct 121 ADPPFIDHTDWAQWQGRSSLRVFPTPAGRTAAKTPGSMAMADEAWAEVLALAPDADTAGM 180
Query 100 RAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
RAQF+CHWQ+AE+ +PGK SWNLEPWRPVVDD+EM+ASGCNPG PEESF
Sbjct 181 RAQFLCHWQFAELGRPGKLSWNLEPWRPVVDDAEMVASGCNPGGPEESF 229
>gi|118466135|ref|YP_883801.1| hypothetical protein MAV_4673 [Mycobacterium avium 104]
gi|118167422|gb|ABK68319.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=141
Score = 165 bits (417), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 78/107 (73%), Positives = 91/107 (86%), Gaps = 1/107 (0%)
Query 43 PRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLG-MAAADAAWAEVLALSPEADTAGMRA 101
P +DHTEWAQW SLRV+P+ GRTA++ G MA AD AWAEVLAL+P+ADTAGMRA
Sbjct 34 PPFIDHTEWAQWQGRSSLRVFPTPAGRTAAKTPGSMAMADEAWAEVLALAPDADTAGMRA 93
Query 102 QFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
QF+CHWQ+AE+ +PGK SWNLEPWRPVVDD+EM+ASGCNPG PEESF
Sbjct 94 QFLCHWQFAELGRPGKVSWNLEPWRPVVDDAEMVASGCNPGGPEESF 140
>gi|41410068|ref|NP_962904.1| hypothetical protein MAP3970 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41398901|gb|AAS06520.1| hypothetical protein MAP_3970 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=141
Score = 164 bits (415), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 77/109 (71%), Positives = 92/109 (85%), Gaps = 1/109 (0%)
Query 41 SSPRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLG-MAAADAAWAEVLALSPEADTAGM 99
+ P +DHT+WAQW SLRV+P+ GRTA++ G MA AD AWAEVLAL+P+ADTAGM
Sbjct 32 ADPPFIDHTDWAQWQGRSSLRVFPTPAGRTAAKTPGSMAMADEAWAEVLALAPDADTAGM 91
Query 100 RAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
RAQF+CHWQ+AE+ +PGK SWNLEPWRPVVDD+EM+ASGCNPG PEESF
Sbjct 92 RAQFLCHWQFAELGRPGKLSWNLEPWRPVVDDAEMVASGCNPGGPEESF 140
>gi|254777110|ref|ZP_05218626.1| hypothetical protein MaviaA2_20924 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=141
Score = 164 bits (414), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 77/108 (72%), Positives = 91/108 (85%), Gaps = 1/108 (0%)
Query 42 SPRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLG-MAAADAAWAEVLALSPEADTAGMR 100
P +DHT+WAQW SLRV+P+ GRTA++ G MA AD AWAEVLAL+P+ADTAGMR
Sbjct 33 DPPFIDHTDWAQWQGRSSLRVFPTPAGRTAAKTPGSMAMADEAWAEVLALAPDADTAGMR 92
Query 101 AQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
AQF+CHWQ+AE+ +PGK SWNLEPWRPVVDD+EM+ASGCNPG PEESF
Sbjct 93 AQFLCHWQFAELGRPGKVSWNLEPWRPVVDDAEMVASGCNPGGPEESF 140
>gi|254819805|ref|ZP_05224806.1| hypothetical protein MintA_07769 [Mycobacterium intracellulare
ATCC 13950]
Length=134
Score = 157 bits (396), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 76/105 (73%), Positives = 87/105 (83%), Gaps = 1/105 (0%)
Query 45 LVDHTEWAQWGSLPSLRVYPSQVGRTASRR-LGMAAADAAWAEVLALSPEADTAGMRAQF 103
LVDHTEWAQW SLRV+P+ GRTASR MA AD AW+EVLAL+P+ADT GMRAQF
Sbjct 29 LVDHTEWAQWQGRSSLRVFPTPAGRTASRTPTSMAWADEAWSEVLALAPDADTPGMRAQF 88
Query 104 ICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
+CHWQ+AE+ +PGK SWNLEPWRPVVDD+EM+AS CNPG EESF
Sbjct 89 LCHWQFAELARPGKTSWNLEPWRPVVDDAEMVASDCNPGGGEESF 133
>gi|118619640|ref|YP_907972.1| hypothetical protein MUL_4545 [Mycobacterium ulcerans Agy99]
gi|118571750|gb|ABL06501.1| conserved hypothetical secreted protein [Mycobacterium ulcerans
Agy99]
Length=160
Score = 155 bits (391), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 76/111 (69%), Positives = 87/111 (79%), Gaps = 2/111 (1%)
Query 39 PPSSPRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAE--VLALSPEADT 96
P SP VDHTEWAQWGSL LRV+P+ GR A+ + GM AA A A VLAL+PEADT
Sbjct 35 PVHSPPFVDHTEWAQWGSLRGLRVFPTSAGRAAASQFGMTAAAAEEAWAEVLALAPEADT 94
Query 97 AGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEES 147
GMRAQF+CHW +AE+ PGK SWNLEPWRPVVDD+EM+ASGCNPG PEE+
Sbjct 95 PGMRAQFVCHWDFAEVTAPGKSSWNLEPWRPVVDDAEMIASGCNPGGPEEA 145
>gi|183980826|ref|YP_001849117.1| hypothetical protein MMAR_0802 [Mycobacterium marinum M]
gi|183174152|gb|ACC39262.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=160
Score = 154 bits (390), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 76/111 (69%), Positives = 87/111 (79%), Gaps = 2/111 (1%)
Query 39 PPSSPRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAE--VLALSPEADT 96
P SP VDHTEWAQWGSL LRV+P+ GR A+ + GM AA A A VLAL+PEADT
Sbjct 35 PVHSPPFVDHTEWAQWGSLRGLRVFPTSAGRAAASQFGMTAAAAEEAWAEVLALAPEADT 94
Query 97 AGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEES 147
GMRAQF+CHW +AE+ PGK SWNLEPWRPVVDD+EM+ASGCNPG PEE+
Sbjct 95 PGMRAQFVCHWDFAEVAAPGKSSWNLEPWRPVVDDAEMIASGCNPGGPEEA 145
>gi|240169860|ref|ZP_04748519.1| hypothetical protein MkanA1_11132 [Mycobacterium kansasii ATCC
12478]
Length=146
Score = 145 bits (365), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 81/125 (65%), Positives = 90/125 (72%), Gaps = 2/125 (1%)
Query 26 EADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWA 85
EA G G A+ GP SP VDHT+WA G L SLRVYP+ GR ASR G +AADA A
Sbjct 22 EAGSGGGVADPSGPIYSPPFVDHTQWAYSGRLSSLRVYPTPSGRAASRDPGSSAADADEA 81
Query 86 E--VLALSPEADTAGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGS 143
VL L+P+ADT GMRAQF CHWQ AE+ +PGK SWNLEPWRPVVDD +MLASGCNPG
Sbjct 82 WAEVLGLAPDADTPGMRAQFDCHWQLAEVAEPGKISWNLEPWRPVVDDEQMLASGCNPGG 141
Query 144 PEESF 148
EE F
Sbjct 142 VEEQF 146
>gi|118471304|ref|YP_885324.1| hypothetical protein MSMEG_0921 [Mycobacterium smegmatis str.
MC2 155]
gi|118172591|gb|ABK73487.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=134
Score = 137 bits (346), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 66/104 (64%), Positives = 77/104 (75%), Gaps = 1/104 (0%)
Query 46 VDHTEWAQWGSLPSLRVYPSQVGR-TASRRLGMAAADAAWAEVLALSPEADTAGMRAQFI 104
VDH EWA+WG L SLRVYP+ R +S+ G A AD AW +VLALSP+AD AGMR QF+
Sbjct 31 VDHVEWAKWGDLSSLRVYPTPSARELSSQPDGAALADEAWRQVLALSPDADIAGMREQFV 90
Query 105 CHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
CHW +AE+ +PGK SWNLEPWRP V EM A+GCNPG EE F
Sbjct 91 CHWTFAELVEPGKTSWNLEPWRPEVSPEEMAATGCNPGGTEEPF 134
>gi|126433249|ref|YP_001068940.1| hypothetical protein Mjls_0638 [Mycobacterium sp. JLS]
gi|126233049|gb|ABN96449.1| hypothetical protein Mjls_0638 [Mycobacterium sp. JLS]
Length=138
Score = 132 bits (333), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 63/104 (61%), Positives = 76/104 (74%), Gaps = 1/104 (0%)
Query 46 VDHTEWAQWGSLPSLRVYPSQVGRTASRRLG-MAAADAAWAEVLALSPEADTAGMRAQFI 104
+DH WA+WG L SLRVYP+ GR A+ ++G +A A+ AWAEVLA SP+A GMR QF
Sbjct 35 IDHVTWAKWGDLSSLRVYPTTAGRQAAGQVGTVAEAEHAWAEVLAASPDAAIPGMREQFD 94
Query 105 CHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
CHWQ+AE PGK SWNLEPWRP V +M+A+GCNPG EE F
Sbjct 95 CHWQFAEFAAPGKTSWNLEPWRPEVPVEQMVAAGCNPGGTEEPF 138
>gi|108797625|ref|YP_637822.1| hypothetical protein Mmcs_0645 [Mycobacterium sp. MCS]
gi|119866712|ref|YP_936664.1| hypothetical protein Mkms_0658 [Mycobacterium sp. KMS]
gi|108768044|gb|ABG06766.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119692801|gb|ABL89874.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=138
Score = 132 bits (332), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 63/104 (61%), Positives = 76/104 (74%), Gaps = 1/104 (0%)
Query 46 VDHTEWAQWGSLPSLRVYPSQVGRTASRRLG-MAAADAAWAEVLALSPEADTAGMRAQFI 104
+DH WA+WG L SLRVYP+ GR A+ ++G +A A+ AWAEVLA SP+A GMR QF
Sbjct 35 IDHVTWAKWGDLSSLRVYPTTAGRRAAGQVGTVAEAEHAWAEVLAASPDAAIPGMREQFD 94
Query 105 CHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
CHWQ+AE PGK SWNLEPWRP V +M+A+GCNPG EE F
Sbjct 95 CHWQFAEFAAPGKTSWNLEPWRPEVPVEQMVAAGCNPGGTEEPF 138
>gi|15828327|ref|NP_302590.1| hypothetical protein ML2452 [Mycobacterium leprae TN]
gi|221230804|ref|YP_002504220.1| hypothetical protein MLBr_02452 [Mycobacterium leprae Br4923]
gi|15214348|sp|Q9CB44.1|Y2452_MYCLE RecName: Full=Uncharacterized protein ML2452
gi|13094020|emb|CAC31969.1| hypothetical protein [Mycobacterium leprae]
gi|219933911|emb|CAR72551.1| hypothetical protein MLBr02452 [Mycobacterium leprae Br4923]
Length=123
Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 60/101 (60%), Positives = 74/101 (74%), Gaps = 0/101 (0%)
Query 45 LVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFI 104
VDHT+W +W SL SL+ Y + GRT++ + +AA D AW +VL LSP+ADTA M AQFI
Sbjct 13 FVDHTKWTRWRSLISLQAYSNLFGRTSAMQPDVAAGDEAWGDVLTLSPDADTADMHAQFI 72
Query 105 CHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPE 145
CH Q+AE QP S NLEPWRPVVDDSE+ A+GC+PG E
Sbjct 73 CHGQFAEFVQPSNTSSNLEPWRPVVDDSEIFAAGCHPGISE 113
>gi|333989829|ref|YP_004522443.1| hypothetical protein JDM601_1189 [Mycobacterium sp. JDM601]
gi|333485797|gb|AEF35189.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=416
Score = 124 bits (311), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 57/104 (55%), Positives = 73/104 (71%), Gaps = 1/104 (0%)
Query 46 VDHTEWAQWGSLPSLRVYPSQVGRTASRRLG-MAAADAAWAEVLALSPEADTAGMRAQFI 104
V+ T+W ++ SLRVYP+ GRT + G + + AW+EV+ +P+A+TAGMR QF+
Sbjct 313 VERTQWVEYDDGTSLRVYPTDTGRTEAGETGTLRQGEQAWSEVVNRNPDANTAGMREQFL 372
Query 105 CHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
CHWQ+AE PGK SWNLEPWRPVV MLA+GCNPG EE F
Sbjct 373 CHWQFAEFILPGKTSWNLEPWRPVVSTLSMLANGCNPGGAEEPF 416
>gi|145220704|ref|YP_001131382.1| hypothetical protein Mflv_0098 [Mycobacterium gilvum PYR-GCK]
gi|315442344|ref|YP_004075223.1| hypothetical protein Mspyr1_06850 [Mycobacterium sp. Spyr1]
gi|145213190|gb|ABP42594.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315260647|gb|ADT97388.1| Protein of unknown function (DUF2599) [Mycobacterium sp. Spyr1]
Length=136
Score = 124 bits (311), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 60/104 (58%), Positives = 70/104 (68%), Gaps = 1/104 (0%)
Query 46 VDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMA-AADAAWAEVLALSPEADTAGMRAQFI 104
+DH WA+WG L SLRVYP+ R AS G ADAAW E+LAL+P+A GMR QF+
Sbjct 33 IDHVTWAKWGDLSSLRVYPTPGARRASGHAGTQPLADAAWNEILALAPDAAIPGMRDQFL 92
Query 105 CHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
CHW AE +PGK SWNLEPWRP V +M+A CNPG EE F
Sbjct 93 CHWNLAEFVEPGKTSWNLEPWRPEVSYEQMVAKRCNPGGTEEPF 136
>gi|262200979|ref|YP_003272187.1| hypothetical protein Gbro_0983 [Gordonia bronchialis DSM 43247]
gi|262084326|gb|ACY20294.1| Protein of unknown function DUF2599 [Gordonia bronchialis DSM
43247]
Length=165
Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 69/158 (44%), Positives = 89/158 (57%), Gaps = 17/158 (10%)
Query 2 KALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRL-------------VDH 48
+A+V V A+ VALL A P A P + A S+ +DH
Sbjct 12 RAVVGVLALMCVALLSACGADDGP-AGPVSSTAEVATSSSTASTPPRPSTPSYPPPYIDH 70
Query 49 TEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQ 108
W Q PSL+VYP+ GR + +A +AAWAEV+ L ADT GM+AQF CHW+
Sbjct 71 VTWVQTAVGPSLQVYPTTSGRYTA---DASAMNAAWAEVVRLDRSADTPGMQAQFDCHWR 127
Query 109 YAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEE 146
+A I +P KPSWNLEP RPVVD++ M+A+ CNPG EE
Sbjct 128 FARIVEPEKPSWNLEPGRPVVDENTMIATRCNPGFAEE 165
>gi|333989105|ref|YP_004521719.1| hypothetical protein JDM601_0465 [Mycobacterium sp. JDM601]
gi|333485073|gb|AEF34465.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=133
Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/108 (58%), Positives = 71/108 (66%), Gaps = 4/108 (3%)
Query 45 LVDHTEWAQWGS---LPSLRVYPS-QVGRTASRRLGMAAADAAWAEVLALSPEADTAGMR 100
VD TEW + S L SLRVYP+ A+R A + AW EVLA +P+ADT MR
Sbjct 26 FVDRTEWVSYSSGSGLASLRVYPTPAGRAAAARVDAAAEGEQAWREVLAAAPDADTVSMR 85
Query 101 AQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
QF CHW YAE +PGK SWNLEPWRPVVD ML SGCNPG+ EE+F
Sbjct 86 DQFRCHWNYAEFARPGKTSWNLEPWRPVVDGVTMLESGCNPGAEEEAF 133
>gi|120401825|ref|YP_951654.1| hypothetical protein Mvan_0810 [Mycobacterium vanbaalenii PYR-1]
gi|119954643|gb|ABM11648.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=136
Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/104 (59%), Positives = 72/104 (70%), Gaps = 1/104 (0%)
Query 46 VDHTEWAQWGSLPSLRVYPS-QVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFI 104
VDH WA+WG L SLRVYP+ RT+ A A+AAWAEVL LSP+A GMR QF+
Sbjct 33 VDHVTWAKWGDLSSLRVYPTPSARRTSGHAGTAALAEAAWAEVLTLSPDAAIPGMREQFL 92
Query 105 CHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF 148
CHW +AE +PGK SWNLEPWRP V +M+A+ CNPG EE F
Sbjct 93 CHWNFAEFVEPGKVSWNLEPWRPEVSPEQMIATRCNPGGTEEPF 136
>gi|169631159|ref|YP_001704808.1| hypothetical protein MAB_4081c [Mycobacterium abscessus ATCC
19977]
gi|169243126|emb|CAM64154.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=150
Score = 110 bits (274), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 56/103 (55%), Positives = 66/103 (65%), Gaps = 4/103 (3%)
Query 43 PRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQ 102
P L+DH W +LR+YP+ +GRT A D AWAEV+AL+P+A T GMR Q
Sbjct 51 PPLIDHVSWGTTSLGRTLRIYPTPLGRTYD---APDAVDVAWAEVVALAPDAQTPGMRMQ 107
Query 103 FICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPE 145
F CHW Y + P KPSWNLEPWRP VD + M AS CNPG PE
Sbjct 108 FDCHW-YGRVFIPDKPSWNLEPWRPQVDGALMAASQCNPGGPE 149
>gi|343925936|ref|ZP_08765451.1| hypothetical protein GOALK_050_02320 [Gordonia alkanivorans NBRC
16433]
gi|343764287|dbj|GAA12377.1| hypothetical protein GOALK_050_02320 [Gordonia alkanivorans NBRC
16433]
Length=156
Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 53/104 (51%), Positives = 66/104 (64%), Gaps = 3/104 (2%)
Query 43 PRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQ 102
P +DH EW + PSL V+P+ GR MAAA W EV+ L ADT GM+AQ
Sbjct 55 PPFIDHVEWTETEVGPSLSVFPTTSGRYTQDPAAMAAA---WREVVGLDANADTPGMQAQ 111
Query 103 FICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEE 146
F CHW++A I +P KPSWNLEP RPVV + +M+ + CNPG EE
Sbjct 112 FDCHWRFARIIEPEKPSWNLEPGRPVVPEEDMIGARCNPGFAEE 155
>gi|326383070|ref|ZP_08204759.1| hypothetical protein SCNU_09036 [Gordonia neofelifaecis NRRL
B-59395]
gi|326198206|gb|EGD55391.1| hypothetical protein SCNU_09036 [Gordonia neofelifaecis NRRL
B-59395]
Length=156
Score = 107 bits (266), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 52/96 (55%), Positives = 64/96 (67%), Gaps = 3/96 (3%)
Query 51 WAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYA 110
WA+ PSL++ P+ GR S A D AW EVL + P ADT GM+AQF CHW +A
Sbjct 64 WAETEVGPSLQIAPTPNGRKVS---SPTAGDEAWREVLRIEPNADTPGMKAQFDCHWTFA 120
Query 111 EIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEE 146
I QP KP+WN+EP RPVV EM+A+ CNPG+PEE
Sbjct 121 RIVQPDKPTWNIEPNRPVVTPDEMIATRCNPGAPEE 156
>gi|296395132|ref|YP_003660016.1| hypothetical protein Srot_2752 [Segniliparus rotundus DSM 44985]
gi|296182279|gb|ADG99185.1| Protein of unknown function DUF2599 [Segniliparus rotundus DSM
44985]
Length=125
Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/142 (45%), Positives = 80/142 (57%), Gaps = 21/142 (14%)
Query 4 LVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSLRVY 63
+ A+ + +V G +A ADP A+ GG R ++H WA+ PSL V+
Sbjct 1 MRALFSALIVCAFGGEAAWADP--------ADRGG-----RWIEHVVWARASDGPSLHVF 47
Query 64 PSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSWNLE 123
P++ GR A L A W EVL+L+PEAD+ GMR QF CH +A+ K SWNLE
Sbjct 48 PTRAGRDAPAEL----QGAEWDEVLSLAPEADSPGMREQFDCHQSFAK----SKESWNLE 99
Query 124 PWRPVVDDSEMLASGCNPGSPE 145
PWRP V EMLASGCNPG E
Sbjct 100 PWRPAVGGVEMLASGCNPGKRE 121
>gi|54024574|ref|YP_118816.1| hypothetical protein nfa26050 [Nocardia farcinica IFM 10152]
gi|54016082|dbj|BAD57452.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=166
Score = 101 bits (251), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 49/102 (49%), Positives = 62/102 (61%), Gaps = 3/102 (2%)
Query 45 LVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFI 104
L+D EW L V P++ GR + A + AWAE++ LSP AD+ GMR QF+
Sbjct 68 LIDRVEWTDTVDGDRLLVVPTRAGRDTT---FPGAENRAWAEIVELSPAADSPGMRDQFV 124
Query 105 CHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEE 146
CHW +A + QP KPSWNLEPWRP V E + + CNPG PE
Sbjct 125 CHWHWARLVQPDKPSWNLEPWRPAVGYQETVRASCNPGGPER 166
>gi|319949075|ref|ZP_08023171.1| hypothetical protein ES5_06712 [Dietzia cinnamea P4]
gi|319437294|gb|EFV92318.1| hypothetical protein ES5_06712 [Dietzia cinnamea P4]
Length=182
Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/87 (56%), Positives = 58/87 (67%), Gaps = 3/87 (3%)
Query 59 SLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKP 118
+L V P+ GR R A+AAW EVL L+P+ADT GM QF CHW +A + +P KP
Sbjct 98 TLEVAPTDSGR---RAWAEGDAEAAWQEVLRLAPDADTPGMWEQFDCHWTWARLLEPDKP 154
Query 119 SWNLEPWRPVVDDSEMLASGCNPGSPE 145
+WN+EPWRPVV MLA GCNPG PE
Sbjct 155 TWNVEPWRPVVSPERMLAEGCNPGGPE 181
>gi|317509085|ref|ZP_07966714.1| hypothetical protein HMPREF9336_03086 [Segniliparus rugosus ATCC
BAA-974]
gi|316252614|gb|EFV12055.1| hypothetical protein HMPREF9336_03086 [Segniliparus rugosus ATCC
BAA-974]
Length=122
Score = 92.4 bits (228), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/105 (47%), Positives = 62/105 (60%), Gaps = 8/105 (7%)
Query 41 SSPRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMR 100
+ P +DH W PSL V+P+Q GR A AWAEVL L+P+AD+ MR
Sbjct 22 ADPVFIDHVAWTNTPDGPSLHVFPTQAGRDAP----ADEQGEAWAEVLRLAPDADSPSMR 77
Query 101 AQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPE 145
QF CH +A+ ++ SWNLEPWRPVV + +LAS CNPG E
Sbjct 78 EQFDCHQSFAKAKE----SWNLEPWRPVVGVAALLASACNPGKKE 118
>gi|170780692|ref|YP_001709024.1| putative secreted protein [Clavibacter michiganensis subsp. sepedonicus]
gi|169155260|emb|CAQ00361.1| putative secreted protein [Clavibacter michiganensis subsp. sepedonicus]
Length=345
Score = 70.5 bits (171), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 40/107 (38%), Positives = 55/107 (52%), Gaps = 8/107 (7%)
Query 45 LVDHTEWA---QWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEA--DTAGM 99
L+D W +WG P+ ++YP+ GR +A +AAW E L + + D +
Sbjct 241 LIDSVVWTAGDEWG--PTAQIYPTSAGRDTIFAPKIAN-EAAWGEALEKTDRSRLDHNNL 297
Query 100 RAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEE 146
QF CHWQ PGK SWNL+ RP V +E +A+ CNPG E
Sbjct 298 HDQFTCHWQVVRYDDPGKSSWNLDSARPDVGLAETIAARCNPGGGSE 344
>gi|336321827|ref|YP_004601795.1| hypothetical protein Celgi_2728 [Cellvibrio gilvus ATCC 13127]
gi|336105408|gb|AEI13227.1| Protein of unknown function DUF2599 [Cellvibrio gilvus ATCC 13127]
Length=271
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/98 (41%), Positives = 51/98 (53%), Gaps = 8/98 (8%)
Query 44 RLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQF 103
R V W + SL V PS R G+AA + ++ A P+A TA MR Q
Sbjct 181 RTVAALAWGEREGGRSLAVTPSDWARVG----GLAAHELVPVQLAAAEPDAGTATMRGQL 236
Query 104 ICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNP 141
CH ++ P K +WNLEPWRP VD EM+A+ CNP
Sbjct 237 ACH----QLGAPDKATWNLEPWRPAVDALEMIAARCNP 270
>gi|269796752|ref|YP_003316207.1| hypothetical protein Sked_34850 [Sanguibacter keddieii DSM 10542]
gi|269098937|gb|ACZ23373.1| Protein of unknown function (DUF2599) [Sanguibacter keddieii
DSM 10542]
Length=303
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/83 (44%), Positives = 45/83 (55%), Gaps = 8/83 (9%)
Query 59 SLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKP 118
SL V P+ GR + G + AWA++LA+ PEADT GM Q CH A K
Sbjct 228 SLAVTPTDWGRVS----GATGSATAWADLLAVEPEADTPGMETQLQCHLLGAR----DKA 279
Query 119 SWNLEPWRPVVDDSEMLASGCNP 141
+WNLEPWRP + E + CNP
Sbjct 280 TWNLEPWRPDLGLVEYALARCNP 302
>gi|256833575|ref|YP_003162302.1| hypothetical protein Jden_2365 [Jonesia denitrificans DSM 20603]
gi|256687106|gb|ACV09999.1| hypothetical protein Jden_2365 [Jonesia denitrificans DSM 20603]
Length=271
Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 35/83 (43%), Positives = 44/83 (54%), Gaps = 8/83 (9%)
Query 59 SLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKP 118
SL V PS GR G+ A WA+V+ +AD GM QF CH + K
Sbjct 194 SLAVTPSTWGRAG----GVTVARFGWADVVERDADADIPGMEEQFACH----ALGASAKD 245
Query 119 SWNLEPWRPVVDDSEMLASGCNP 141
+WNLEPWRP V +M+A+ CNP
Sbjct 246 TWNLEPWRPAVPLPQMMAALCNP 268
>gi|269957024|ref|YP_003326813.1| hypothetical protein Xcel_2237 [Xylanimonas cellulosilytica DSM
15894]
gi|269305705|gb|ACZ31255.1| hypothetical protein Xcel_2237 [Xylanimonas cellulosilytica DSM
15894]
Length=153
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/63 (45%), Positives = 36/63 (58%), Gaps = 4/63 (6%)
Query 79 AADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASG 138
A D W+E++ PEADT M Q +CH + P K +WNLEPWRP V +L +
Sbjct 94 AIDRLWSELIEAEPEADTTVMHDQLVCH----AVGAPDKDTWNLEPWRPDVGLVAVLRAR 149
Query 139 CNP 141
CNP
Sbjct 150 CNP 152
>gi|332671683|ref|YP_004454691.1| hypothetical protein Celf_3190 [Cellulomonas fimi ATCC 484]
gi|332340721|gb|AEE47304.1| Protein of unknown function DUF2599 [Cellulomonas fimi ATCC 484]
Length=272
Score = 56.2 bits (134), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/101 (37%), Positives = 50/101 (50%), Gaps = 14/101 (13%)
Query 44 RLVDHTEWAQWGSLPSLRVYPSQVGRTAS---RRLGMAAADAAWAEVLALSPEADTAGMR 100
R V+ T W SL V P+ R + RL W +++AL P AD+ MR
Sbjct 182 RTVEATAWGDREGGESLAVTPTAWARASGAAAERL-------TWTQLVALEPRADSPTMR 234
Query 101 AQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNP 141
Q CH ++ P K +WNLEPWRP V ++A+ CNP
Sbjct 235 DQLSCH----QLGAPDKATWNLEPWRPDVGWLSVVAARCNP 271
>gi|296130755|ref|YP_003638005.1| hypothetical protein Cfla_2923 [Cellulomonas flavigena DSM 20109]
gi|296022570|gb|ADG75806.1| Protein of unknown function DUF2599 [Cellulomonas flavigena DSM
20109]
Length=256
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/91 (44%), Positives = 47/91 (52%), Gaps = 8/91 (8%)
Query 51 WAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYA 110
W PSL V P + RR AA A ++V +PEA A M+AQ CH A
Sbjct 169 WGDADGGPSLLVSP----QPWVRRGSTAAQHALVSQVQVAAPEAAAASMQAQLWCHLLGA 224
Query 111 EIRQPGKPSWNLEPWRPVVDDSEMLASGCNP 141
P K SWN+EPWRP V MLA+GCNP
Sbjct 225 ----PDKDSWNIEPWRPEVGTLTMLATGCNP 251
>gi|77458733|ref|YP_348239.1| hypothetical protein Pfl01_2508 [Pseudomonas fluorescens Pf0-1]
gi|77382736|gb|ABA74249.1| Hypothetical protein Pfl01_2508 [Pseudomonas fluorescens Pf0-1]
Length=141
Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/51 (48%), Positives = 29/51 (57%), Gaps = 4/51 (7%)
Query 94 ADTAGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSP 144
+T G+R Q ICH A KP W LEPWRP V S+ +A GCN +P
Sbjct 89 TNTHGLRHQLICHLMVAR----NKPEWYLEPWRPDVGLSKTVAEGCNHETP 135
>gi|170721932|ref|YP_001749620.1| hypothetical protein PputW619_2759 [Pseudomonas putida W619]
gi|169759935|gb|ACA73251.1| conserved hypothetical protein [Pseudomonas putida W619]
Length=431
Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/94 (37%), Positives = 45/94 (48%), Gaps = 17/94 (18%)
Query 59 SLRVYPSQVGRTASRRLGMAAADAAWAEVLAL-------SPEADTAGMRAQFICHWQYA- 110
+L V P+ GR R+ DA +AE++ S GMR Q +CH +
Sbjct 337 TLSVTPTDCGR----RIKSDQTDAMYAELVRKYGADPRWSGAKYGGGMRRQLVCHLTHVS 392
Query 111 ---EIRQPGKPSWNLEPWRPVVDDSEMLASGCNP 141
E+R KP +NLEP RP V + LA GCNP
Sbjct 393 KGQEVRY--KPVYNLEPSRPDVSHDQSLAQGCNP 424
>gi|152986634|ref|YP_001348730.1| hypothetical protein PSPA7_3370 [Pseudomonas aeruginosa PA7]
gi|150961792|gb|ABR83817.1| conserved hypothetical protein [Pseudomonas aeruginosa PA7]
Length=408
Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 22/60 (37%), Positives = 33/60 (55%), Gaps = 5/60 (8%)
Query 81 DAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCN 140
D+AW + + +++ MR Q +CH+ A KP WNLEP RP + + +A GCN
Sbjct 352 DSAWKDN-PDNRDSNIGSMRRQLVCHFNIAR----DKPEWNLEPSRPYTSNEDSIAKGCN 406
>gi|218892129|ref|YP_002440996.1| putative halovibrin [Pseudomonas aeruginosa LESB58]
gi|218772355|emb|CAW28137.1| putative halovibrin [Pseudomonas aeruginosa LESB58]
Length=408
Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/48 (40%), Positives = 27/48 (57%), Gaps = 4/48 (8%)
Query 93 EADTAGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCN 140
+++ MR Q +CH+ A KP WNLEP RP + + +A GCN
Sbjct 363 DSNIGSMRRQLVCHFNVAR----NKPEWNLEPSRPYTSNEDSIAKGCN 406
>gi|15597110|ref|NP_250604.1| hypothetical protein PA1914 [Pseudomonas aeruginosa PAO1]
gi|116049866|ref|YP_791325.1| hypothetical protein PA14_39780 [Pseudomonas aeruginosa UCBPP-PA14]
gi|296389679|ref|ZP_06879154.1| hypothetical protein PaerPAb_16086 [Pseudomonas aeruginosa PAb1]
gi|9947907|gb|AAG05302.1|AE004617_6 conserved hypothetical protein [Pseudomonas aeruginosa PAO1]
gi|115585087|gb|ABJ11102.1| putative halovibrin [Pseudomonas aeruginosa UCBPP-PA14]
gi|334833171|gb|EGM12327.1| hypothetical protein PA15_31831 [Pseudomonas aeruginosa 152504]
Length=408
Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/48 (40%), Positives = 27/48 (57%), Gaps = 4/48 (8%)
Query 93 EADTAGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCN 140
+++ MR Q +CH+ A KP WNLEP RP + + +A GCN
Sbjct 363 DSNIGSMRRQLVCHFNVAR----NKPEWNLEPSRPYTSNEDSIAKGCN 406
>gi|330821454|ref|YP_004350316.1| hypothetical protein bgla_2g23740 [Burkholderia gladioli BSR3]
gi|327373449|gb|AEA64804.1| hypothetical protein bgla_2g23740 [Burkholderia gladioli BSR3]
Length=409
Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/94 (31%), Positives = 43/94 (46%), Gaps = 14/94 (14%)
Query 56 SLPSLRVYPSQVGRTA---------SRRLGMAAADAAWAEVLALSPEADTAGMRAQFICH 106
+ SL++ P+ GR + + DA W + + +++ A MR Q +CH
Sbjct 319 KIASLQLVPTDCGRAVKDEQTNNFFNELVVQHYLDAPWKDN-EDNRDSNLASMRRQLVCH 377
Query 107 WQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCN 140
+Q A K WNLEP RP E +A GCN
Sbjct 378 FQIAR----NKAEWNLEPSRPYTSQQESVAKGCN 407
>gi|313111354|ref|ZP_07797168.1| putative halovibrin [Pseudomonas aeruginosa 39016]
gi|310883670|gb|EFQ42264.1| putative halovibrin [Pseudomonas aeruginosa 39016]
Length=378
Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 19/48 (40%), Positives = 27/48 (57%), Gaps = 4/48 (8%)
Query 93 EADTAGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCN 140
+++ MR Q +CH+ A KP WNLEP RP + + +A GCN
Sbjct 333 DSNIGSMRRQLVCHFNVAR----NKPEWNLEPSRPYTSNEDSIAKGCN 376
>gi|293393465|ref|ZP_06637776.1| conserved hypothetical protein [Serratia odorifera DSM 4582]
gi|291424066|gb|EFE97284.1| conserved hypothetical protein [Serratia odorifera DSM 4582]
Length=125
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/105 (31%), Positives = 48/105 (46%), Gaps = 12/105 (11%)
Query 41 SSPRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPE----ADT 96
+ R + W + +L V P+ GR+ R D A+AE++ + +T
Sbjct 28 NCQRYIKSASWIERYGEMTLAVIPTDCGRSIYSR----ETDRAFAELVNKFGKDKYWRNT 83
Query 97 AGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNP 141
GMR QF CH+ A +Q WNL+P+R V +GCNP
Sbjct 84 RGMRDQFACHYSIARKKQV----WNLDPYRKDVGYEATEKAGCNP 124
>gi|50955885|ref|YP_063173.1| hypothetical protein Lxx24425 [Leifsonia xyli subsp. xyli str.
CTCB07]
gi|50952367|gb|AAT90068.1| conserved hypothetical protein [Leifsonia xyli subsp. xyli str.
CTCB07]
Length=43
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 21/43 (49%), Positives = 25/43 (59%), Gaps = 0/43 (0%)
Query 99 MRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNP 141
M QFICHW R P K SWNL+ RP V + +A+ CNP
Sbjct 1 MYYQFICHWDIVRYRAPNKVSWNLDKNRPNVGYAATVAAQCNP 43
>gi|115265542|dbj|BAF32804.1| conserved hypothetical protein [Pseudomonas syringae pv. actinidiae]
Length=415
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/94 (30%), Positives = 46/94 (49%), Gaps = 15/94 (15%)
Query 59 SLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEA-----DTAGMRAQFICHWQYA--- 110
SL+V P++ GR +++ AA+AE+ + + D M Q +CH +++
Sbjct 325 SLQVTPTECGRNMTKQ----QQAAAYAELFSKYGKDKQWNPDNGSMNQQLVCHLEWSGDD 380
Query 111 ---EIRQPGKPSWNLEPWRPVVDDSEMLASGCNP 141
++ K WNLEP RP V ++ GCNP
Sbjct 381 NGKKVYTRDKRFWNLEPVRPAVSWDDVFKQGCNP 414
>gi|330965276|gb|EGH65536.1| hypothetical protein PSYAC_11621 [Pseudomonas syringae pv. actinidiae
str. M302091]
Length=331
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/94 (30%), Positives = 46/94 (49%), Gaps = 15/94 (15%)
Query 59 SLRVYPSQVGRTASRRLGMAAADAAWAEVLALSPEA-----DTAGMRAQFICHWQYA--- 110
SL+V P++ GR +++ AA +AE+ + + D M Q +CH +++
Sbjct 241 SLQVTPTECGRNMTKQQQAAA----YAELFSKYGKDKQWNPDNGSMNQQLVCHLEWSGDD 296
Query 111 ---EIRQPGKPSWNLEPWRPVVDDSEMLASGCNP 141
++ K WNLEP RP V ++ GCNP
Sbjct 297 NGKKVYTRDKRFWNLEPVRPAVSWDDVFKQGCNP 330
>gi|70729853|ref|YP_259592.1| hypothetical protein PFL_2485 [Pseudomonas fluorescens Pf-5]
gi|68344152|gb|AAY91758.1| conserved hypothetical protein [Pseudomonas fluorescens Pf-5]
Length=408
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 19/48 (40%), Positives = 26/48 (55%), Gaps = 4/48 (8%)
Query 93 EADTAGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCN 140
++ MR Q +CH+ A KP WNLEP RP + + +A GCN
Sbjct 363 DSSVGSMRRQLVCHFNIAR----NKPEWNLEPSRPYTSNEDSIAKGCN 406
>gi|107101349|ref|ZP_01365267.1| hypothetical protein PaerPA_01002383 [Pseudomonas aeruginosa
PACS2]
gi|334835284|gb|EGM14171.1| putative halovibrin [Pseudomonas aeruginosa 138244]
Length=63
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 32/60 (54%), Gaps = 5/60 (8%)
Query 81 DAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCN 140
D+ W + + +++ MR Q +CH+ A KP WNLEP RP + + +A GCN
Sbjct 7 DSEWKDT-PDNRDSNIGSMRRQLVCHFNVAR----NKPEWNLEPSRPYTSNEDSIAKGCN 61
Lambda K H
0.313 0.128 0.405
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130586878330
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40