BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2084
Length=378
                                                                     Score   E
Sequences producing significant alignments:                          (Bits)  Value
gi|15841577|ref|NP_336614.1| hypothetical protein MT2146 [Mycoba...  747     0.0
gi|15609221|ref|NP_216600.1| hypothetical protein Rv2084 [Mycoba...  745     0.0
gi|254232249|ref|ZP_04925576.1| hypothetical protein TBCG_02036 ...  604     6e-171
gi|289447705|ref|ZP_06437449.1| conserved hypothetical protein [...  514     1e-143
gi|31793267|ref|NP_855760.1| hypothetical protein Mb2110 [Mycoba...  513     2e-143
gi|289754195|ref|ZP_06513573.1| conserved hypothetical protein [...  512     4e-143
gi|340627096|ref|YP_004745548.1| hypothetical protein MCAN_21091...  506     4e-141
gi|289443590|ref|ZP_06433334.1| LOW QUALITY PROTEIN: hypothetica...  273     3e-71
gi|289758203|ref|ZP_06517581.1| hypothetical protein TBEG_00882 ...  244     2e-62
gi|289570197|ref|ZP_06450424.1| hypothetical protein TBJG_00564 ...  235     8e-60
gi|240173503|ref|ZP_04752161.1| hypothetical protein MkanA1_2960...  168     1e-39
gi|240173502|ref|ZP_04752160.1| hypothetical protein MkanA1_2959...  50.1    6e-04
gi|303246451|ref|ZP_07332730.1| Baseplate J family protein [Desu...  37.4    4.2
gi|341938225|gb|AEL08364.1| carboxylesterase [Xanthomonas campes...  37.0    5.6
>gi|15841577|ref|NP_336614.1| hypothetical protein MT2146 [Mycobacterium tuberculosis CDC1551]
gi|167966799|ref|ZP_02549076.1| hypothetical protein MtubH3_01510 [Mycobacterium tuberculosis
H37Ra]
gi|254551117|ref|ZP_05141564.1| hypothetical protein Mtube_11746 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|297634665|ref|ZP_06952445.1| hypothetical protein MtubK4_11111 [Mycobacterium tuberculosis
KZN 4207]
gi|297731653|ref|ZP_06960771.1| hypothetical protein MtubKR_11216 [Mycobacterium tuberculosis
KZN R506]
gi|313658988|ref|ZP_07815868.1| hypothetical protein MtubKV_11231 [Mycobacterium tuberculosis
KZN V2475]
gi|18314346|sp|Q10692.2|Y2084_MYCTU RecName: Full=Uncharacterized protein Rv2084/MT2146
gi|13881825|gb|AAK46428.1| hypothetical protein MT2146 [Mycobacterium tuberculosis CDC1551]
Length=380
Score = 747 bits (1928), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/378 (100%), Positives = 378/378 (100%), Gaps = 0/378 (0%)
Query 1 VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 60
VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA
Sbjct 3 VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 62
Query 61 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA
Sbjct 63 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 122
Query 121 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 180
YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV
Sbjct 123 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 182
Query 181 IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 240
IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV
Sbjct 183 IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 242
Query 241 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGTAGDGGGGRRQDSRRRN 300
QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGTAGDGGGGRRQDSRRRN
Sbjct 243 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGTAGDGGGGRRQDSRRRN 302
Query 301 GPRRPARRGTSRGRRCAPRVAIGWHTPIGDPLAVEGVEEIGASLPGRESTPSDDGGSLHP 360
GPRRPARRGTSRGRRCAPRVAIGWHTPIGDPLAVEGVEEIGASLPGRESTPSDDGGSLHP
Sbjct 303 GPRRPARRGTSRGRRCAPRVAIGWHTPIGDPLAVEGVEEIGASLPGRESTPSDDGGSLHP 362
Query 361 SGRPRRVHRRRWCGLGLC 378
SGRPRRVHRRRWCGLGLC
Sbjct 363 SGRPRRVHRRRWCGLGLC 380
>gi|15609221|ref|NP_216600.1| hypothetical protein Rv2084 [Mycobacterium tuberculosis H37Rv]
gi|148661900|ref|YP_001283423.1| hypothetical protein MRA_2100 [Mycobacterium tuberculosis H37Ra]
gi|148823300|ref|YP_001288054.1| hypothetical protein TBFG_12121 [Mycobacterium tuberculosis F11]
30 more sequence titles
Length=378
Score = 745 bits (1924), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/378 (99%), Positives = 378/378 (100%), Gaps = 0/378 (0%)
Query 1 VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 60
+SDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA
Sbjct 1 MSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 60
Query 61 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA
Sbjct 61 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
Query 121 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 180
YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV
Sbjct 121 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 180
Query 181 IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 240
IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV
Sbjct 181 IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 240
Query 241 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGTAGDGGGGRRQDSRRRN 300
QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGTAGDGGGGRRQDSRRRN
Sbjct 241 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGTAGDGGGGRRQDSRRRN 300
Query 301 GPRRPARRGTSRGRRCAPRVAIGWHTPIGDPLAVEGVEEIGASLPGRESTPSDDGGSLHP 360
GPRRPARRGTSRGRRCAPRVAIGWHTPIGDPLAVEGVEEIGASLPGRESTPSDDGGSLHP
Sbjct 301 GPRRPARRGTSRGRRCAPRVAIGWHTPIGDPLAVEGVEEIGASLPGRESTPSDDGGSLHP 360
Query 361 SGRPRRVHRRRWCGLGLC 378
SGRPRRVHRRRWCGLGLC
Sbjct 361 SGRPRRVHRRRWCGLGLC 378
>gi|254232249|ref|ZP_04925576.1| hypothetical protein TBCG_02036 [Mycobacterium tuberculosis C]
gi|124601308|gb|EAY60318.1| hypothetical protein TBCG_02036 [Mycobacterium tuberculosis C]
Length=349
Score = 604 bits (1558), Expect = 6e-171, Method: Compositional matrix adjust.
Identities = 348/349 (99%), Positives = 348/349 (99%), Gaps = 0/349 (0%)
Query 30 AAASELLLTVRYQLDTQPRPLVIVHGPLFQAVKAARAQVYGRLIQLRHARCEVLDERWQL 89
AAASELLL VRYQLDTQPRPLVIVHGPLFQAVKAARAQVYGRLIQLRHARCEVLDERWQL
Sbjct 1 AAASELLLIVRYQLDTQPRPLVIVHGPLFQAVKAARAQVYGRLIQLRHARCEVLDERWQL 60
Query 90 RPTGQRDVRALLIDVLNVLLAAITAAGVERAYACAERRAMAAAVVAKNYRDALGVELQCN 149
RPTGQRDVRALLIDVLNVLLAAITAAGVERAYACAERRAMAAAVVAKNYRDALGVELQCN
Sbjct 61 RPTGQRDVRALLIDVLNVLLAAITAAGVERAYACAERRAMAAAVVAKNYRDALGVELQCN 120
Query 150 SVCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVVAAGELVIAARH 209
SVCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVVAAGELVIAARH
Sbjct 121 SVCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVVAAGELVIAARH 180
Query 210 LLDPMPRGELSYGPLHEGGNAARKSVYRRLVQLWQARRAVTDGDVDLRDARTLLTDLDSI 269
LLDPMPRGELSYGPLHEGGNAARKSVYRRLVQLWQARRAVTDGDVDLRDARTLLTDLDSI
Sbjct 181 LLDPMPRGELSYGPLHEGGNAARKSVYRRLVQLWQARRAVTDGDVDLRDARTLLTDLDSI 240
Query 270 LREMRTAATIQQSGTAGDGGGGRRQDSRRRNGPRRPARRGTSRGRRCAPRVAIGWHTPIG 329
LREMRTAATIQQSGTAGDGGGGRRQDSRRRNGPRRPARRGTSRGRRCAPRVAIGWHTPIG
Sbjct 241 LREMRTAATIQQSGTAGDGGGGRRQDSRRRNGPRRPARRGTSRGRRCAPRVAIGWHTPIG 300
Query 330 DPLAVEGVEEIGASLPGRESTPSDDGGSLHPSGRPRRVHRRRWCGLGLC 378
DPLAVEGVEEIGASLPGRESTPSDDGGSLHPSGRPRRVHRRRWCGLGLC
Sbjct 301 DPLAVEGVEEIGASLPGRESTPSDDGGSLHPSGRPRRVHRRRWCGLGLC 349
Score = 60.5 bits (145), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/117 (38%), Positives = 64/117 (55%), Gaps = 7/117 (5%)
Query 4 DSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQAVKA 63
D D+I A++ R++ G E+ D AA EL++ R+ LD PR + +GPL + A
Sbjct 144 DCLPPVDVIHADVTRRMHG-EVATDVVAAGELVIAARHLLDPMPRG-ELSYGPLHEGGNA 201
Query 64 ARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
AR VY RL+QL AR V D L RD R LL D+ ++L TAA ++++
Sbjct 202 ARKSVYRRLVQLWQARRAVTDGDVDL-----RDARTLLTDLDSILREMRTAATIQQS 253
>gi|289447705|ref|ZP_06437449.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289574765|ref|ZP_06454992.1| hypothetical protein TBOG_02609 [Mycobacterium tuberculosis K85]
gi|289745361|ref|ZP_06504739.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|294997028|ref|ZP_06802719.1| hypothetical protein Mtub2_21618 [Mycobacterium tuberculosis
210]
gi|289420663|gb|EFD17864.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289539196|gb|EFD43774.1| hypothetical protein TBOG_02609 [Mycobacterium tuberculosis K85]
gi|289685889|gb|EFD53377.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=333
Score = 514 bits (1323), Expect = 1e-143, Method: Compositional matrix adjust.
Identities = 282/284 (99%), Positives = 283/284 (99%), Gaps = 0/284 (0%)
Query 1 VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 60
VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA
Sbjct 3 VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 62
Query 61 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA
Sbjct 63 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 122
Query 121 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 180
YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV
Sbjct 123 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 182
Query 181 IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 240
IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV
Sbjct 183 IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 242
Query 241 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGT 284
QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQ+ T
Sbjct 243 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQAYT 286
Score = 79.7 bits (195), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 72/159 (46%), Positives = 98/159 (62%), Gaps = 7/159 (4%)
Query 4 DSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQAVKA 63
D D+I A++ R++ GE+ D AA EL++ R+ LD PR + +GPL + A
Sbjct 175 DCLPPVDVIHADVTRRMH-GEVATDVVAAGELVIAARHLLDPMPRG-ELSYGPLHEGGNA 232
Query 64 ARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERAYAC 123
AR VY RL+QL AR V D L RD R LL D+ ++L TAA +++AY
Sbjct 233 ARKSVYRRLVQLWQARRAVTDGDVDL-----RDARTLLTDLDSILREMRTAATIQQAYTR 287
Query 124 AERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHAL 162
AERRAMAAAVVAK DA+G++ Q ++V RAAA+A+HAL
Sbjct 288 AERRAMAAAVVAKIRGDAMGLDAQRDAVHRAAADALHAL 326
>gi|31793267|ref|NP_855760.1| hypothetical protein Mb2110 [Mycobacterium bovis AF2122/97]
gi|121637969|ref|YP_978193.1| hypothetical protein BCG_2103 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224990463|ref|YP_002645150.1| hypothetical protein JTY_2097 [Mycobacterium bovis BCG str. Tokyo
172]
8 more sequence titles
Length=331
Score = 513 bits (1321), Expect = 2e-143, Method: Compositional matrix adjust.
Identities = 281/284 (99%), Positives = 283/284 (99%), Gaps = 0/284 (0%)
Query 1 VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 60
+SDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA
Sbjct 1 MSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 60
Query 61 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA
Sbjct 61 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
Query 121 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 180
YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV
Sbjct 121 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 180
Query 181 IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 240
IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV
Sbjct 181 IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 240
Query 241 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGT 284
QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQ+ T
Sbjct 241 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQAYT 284
Score = 79.3 bits (194), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 71/154 (47%), Positives = 97/154 (63%), Gaps = 7/154 (4%)
Query 9 FDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQAVKAARAQV 68
D+I A++ R++ GE+ D AA EL++ R+ LD PR + +GPL + AAR V
Sbjct 178 VDVIHADVTRRMH-GEVATDVVAAGELVIAARHLLDPMPRG-ELSYGPLHEGGNAARKSV 235
Query 69 YGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERAYACAERRA 128
Y RL+QL AR V D L RD R LL D+ ++L TAA +++AY AERRA
Sbjct 236 YRRLVQLWQARRAVTDGDVDL-----RDARTLLTDLDSILREMRTAATIQQAYTRAERRA 290
Query 129 MAAAVVAKNYRDALGVELQCNSVCRAAAEAIHAL 162
MAAAVVAK DA+G++ Q ++V RAAA+A+HAL
Sbjct 291 MAAAVVAKIRGDAMGLDAQRDAVHRAAADALHAL 324
>gi|289754195|ref|ZP_06513573.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289694782|gb|EFD62211.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
Length=333
Score = 512 bits (1319), Expect = 4e-143, Method: Compositional matrix adjust.
Identities = 281/284 (99%), Positives = 282/284 (99%), Gaps = 0/284 (0%)
Query 1 VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 60
VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA
Sbjct 3 VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 62
Query 61 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA
Sbjct 63 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 122
Query 121 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 180
YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV
Sbjct 123 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 182
Query 181 IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 240
IHADVTRRMHGEVATDVV AGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV
Sbjct 183 IHADVTRRMHGEVATDVVGAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 242
Query 241 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGT 284
QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQ+ T
Sbjct 243 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQAYT 286
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 71/159 (45%), Positives = 97/159 (62%), Gaps = 7/159 (4%)
Query 4 DSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQAVKA 63
D D+I A++ R++ GE+ D A EL++ R+ LD PR + +GPL + A
Sbjct 175 DCLPPVDVIHADVTRRMH-GEVATDVVGAGELVIAARHLLDPMPRG-ELSYGPLHEGGNA 232
Query 64 ARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERAYAC 123
AR VY RL+QL AR V D L RD R LL D+ ++L TAA +++AY
Sbjct 233 ARKSVYRRLVQLWQARRAVTDGDVDL-----RDARTLLTDLDSILREMRTAATIQQAYTR 287
Query 124 AERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHAL 162
AERRAMAAAVVAK DA+G++ Q ++V RAAA+A+HAL
Sbjct 288 AERRAMAAAVVAKIRGDAMGLDAQRDAVHRAAADALHAL 326
>gi|340627096|ref|YP_004745548.1| hypothetical protein MCAN_21091 [Mycobacterium canettii CIPT
140010059]
gi|340005286|emb|CCC44441.1| putative uncharacterized protein, no significant Pfam matches
[Mycobacterium canettii CIPT 140010059]
Length=331
Score = 506 bits (1302), Expect = 4e-141, Method: Compositional matrix adjust.
Identities = 278/284 (98%), Positives = 280/284 (99%), Gaps = 0/284 (0%)
Query 1 VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 60
+SDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA
Sbjct 1 MSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 60
Query 61 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
VKAARAQVYGRLIQLRHARCEVLDERWQL PTGQRDVRALLIDVLNVLLAAITAAGVERA
Sbjct 61 VKAARAQVYGRLIQLRHARCEVLDERWQLPPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
Query 121 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 180
YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV
Sbjct 121 YACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDV 180
Query 181 IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 240
IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV
Sbjct 181 IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLV 240
Query 241 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGT 284
QLWQARRAVTDGDVDLRDARTLLTDLDSILREM TAATI Q+ T
Sbjct 241 QLWQARRAVTDGDVDLRDARTLLTDLDSILREMHTAATIHQAYT 284
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/159 (46%), Positives = 97/159 (62%), Gaps = 7/159 (4%)
Query 4 DSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQAVKA 63
D D+I A++ R++ GE+ D AA EL++ R+ LD PR + +GPL + A
Sbjct 173 DCLPPVDVIHADVTRRMH-GEVATDVVAAGELVIAARHLLDPMPRG-ELSYGPLHEGGNA 230
Query 64 ARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERAYAC 123
AR VY RL+QL AR V D L RD R LL D+ ++L TAA + +AY
Sbjct 231 ARKSVYRRLVQLWQARRAVTDGDVDL-----RDARTLLTDLDSILREMHTAATIHQAYTR 285
Query 124 AERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHAL 162
AERRAMAAAVVAK DA+G++ Q ++V RAAA+A+HAL
Sbjct 286 AERRAMAAAVVAKIRGDAMGLDAQRDAVHRAAADALHAL 324
>gi|289443590|ref|ZP_06433334.1| LOW QUALITY PROTEIN: hypothetical protein TBLG_00692 [Mycobacterium
tuberculosis T46]
gi|289416509|gb|EFD13749.1| LOW QUALITY PROTEIN: hypothetical protein TBLG_00692 [Mycobacterium
tuberculosis T46]
Length=226
Score = 273 bits (699), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 149/152 (99%), Positives = 151/152 (99%), Gaps = 0/152 (0%)
Query 133 VVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGE 192
VVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGE
Sbjct 28 VVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGE 87
Query 193 VATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLVQLWQARRAVTDG 252
VATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGG+AARKSVYRRLVQLWQARRAVTDG
Sbjct 88 VATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGHAARKSVYRRLVQLWQARRAVTDG 147
Query 253 DVDLRDARTLLTDLDSILREMRTAATIQQSGT 284
DVDLRDARTLLTDLDSILREMRTAATIQQ+ T
Sbjct 148 DVDLRDARTLLTDLDSILREMRTAATIQQAYT 179
Score = 82.8 bits (203), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 72/159 (46%), Positives = 98/159 (62%), Gaps = 7/159 (4%)
Query 4 DSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQAVKA 63
D D+I A++ R++ GE+ D AA EL++ R+ LD PR + +GPL + A
Sbjct 68 DCLPPVDVIHADVTRRMH-GEVATDVVAAGELVIAARHLLDPMPRG-ELSYGPLHEGGHA 125
Query 64 ARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERAYAC 123
AR VY RL+QL AR V D L RD R LL D+ ++L TAA +++AY
Sbjct 126 ARKSVYRRLVQLWQARRAVTDGDVDL-----RDARTLLTDLDSILREMRTAATIQQAYTR 180
Query 124 AERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHAL 162
AERRAMAAAVVAK DA+G++ Q ++V RAAA+A+HAL
Sbjct 181 AERRAMAAAVVAKIRGDAMGLDAQRDAVHRAAADALHAL 219
>gi|289758203|ref|ZP_06517581.1| hypothetical protein TBEG_00882 [Mycobacterium tuberculosis T85]
gi|289713767|gb|EFD77779.1| hypothetical protein TBEG_00882 [Mycobacterium tuberculosis T85]
Length=184
Score = 244 bits (623), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 133/136 (98%), Positives = 135/136 (99%), Gaps = 0/136 (0%)
Query 149 NSVCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVVAAGELVIAAR 208
+SVCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVVAAGELVIAAR
Sbjct 2 HSVCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVVAAGELVIAAR 61
Query 209 HLLDPMPRGELSYGPLHEGGNAARKSVYRRLVQLWQARRAVTDGDVDLRDARTLLTDLDS 268
HLLDPMPRGELSYGPLHEGGNAARKSVYRRLVQLWQARRAVTDGDVDLRDARTLLTDLDS
Sbjct 62 HLLDPMPRGELSYGPLHEGGNAARKSVYRRLVQLWQARRAVTDGDVDLRDARTLLTDLDS 121
Query 269 ILREMRTAATIQQSGT 284
ILREMRTAATIQQ+ T
Sbjct 122 ILREMRTAATIQQAYT 137
Score = 83.2 bits (204), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 71/154 (47%), Positives = 97/154 (63%), Gaps = 7/154 (4%)
Query 9 FDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQAVKAARAQV 68
D+I A++ R++ GE+ D AA EL++ R+ LD PR + +GPL + AAR V
Sbjct 31 VDVIHADVTRRMH-GEVATDVVAAGELVIAARHLLDPMPRG-ELSYGPLHEGGNAARKSV 88
Query 69 YGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERAYACAERRA 128
Y RL+QL AR V D L RD R LL D+ ++L TAA +++AY AERRA
Sbjct 89 YRRLVQLWQARRAVTDGDVDL-----RDARTLLTDLDSILREMRTAATIQQAYTRAERRA 143
Query 129 MAAAVVAKNYRDALGVELQCNSVCRAAAEAIHAL 162
MAAAVVAK DA+G++ Q ++V RAAA+A+HAL
Sbjct 144 MAAAVVAKIRGDAMGLDAQRDAVHRAAADALHAL 177
>gi|289570197|ref|ZP_06450424.1| hypothetical protein TBJG_00564 [Mycobacterium tuberculosis T17]
gi|289750680|ref|ZP_06510058.1| hypothetical protein TBDG_00890 [Mycobacterium tuberculosis T92]
gi|289543951|gb|EFD47599.1| hypothetical protein TBJG_00564 [Mycobacterium tuberculosis T17]
gi|289691267|gb|EFD58696.1| hypothetical protein TBDG_00890 [Mycobacterium tuberculosis T92]
Length=178
Score = 235 bits (600), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 120/121 (99%), Positives = 121/121 (100%), Gaps = 0/121 (0%)
Query 1 VSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 60
+SDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA
Sbjct 1 MSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQA 60
Query 61 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA
Sbjct 61 VKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
Query 121 Y 121
Y
Sbjct 121 Y 121
Score = 62.4 bits (150), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/112 (39%), Positives = 63/112 (57%), Gaps = 7/112 (6%)
Query 178 VDVIHADVTRRMHG-EVATDVVAAGELVIAARHLLDPMPRG-ELSYGPLHEGGNAARKSV 235
D+I A++ R++ G E+ D AA EL++ R+ LD PR + +GPL + AAR V
Sbjct 9 FDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQAVKAARAQV 68
Query 236 YRRLVQLWQARRAVTDGDVDL-----RDARTLLTDLDSILREMRTAATIQQS 282
Y RL+QL AR V D L RD R LL D+ ++L TAA ++++
Sbjct 69 YGRLIQLRHARCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERA 120
>gi|240173503|ref|ZP_04752161.1| hypothetical protein MkanA1_29601 [Mycobacterium kansasii ATCC
12478]
Length=189
Score = 168 bits (425), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 104/186 (56%), Positives = 126/186 (68%), Gaps = 1/186 (0%)
Query 23 GELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQAVKAARAQVYGRLIQLRHARCEV 82
GE+ D AA ELL V QLD +PRP I G L++A+K ARA++YG L +L HAR EV
Sbjct 5 GEVFADERAAFELLHAV-IQLDIEPRPATIAQGVLYEALKTARAELYGHLTRLWHARREV 63
Query 83 LDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERAYACAERRAMAAAVVAKNYRDAL 142
L+ R + GQR+VRALL DVLNVLLAAI AA VER Y AE++AMAAAV+A+ D
Sbjct 64 LEARREQWLAGQREVRALLRDVLNVLLAAIAAAEVERTYVLAEQQAMAAAVIAEIRGDTT 123
Query 143 GVELQCNSVCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVVAAGE 202
V ++ N+V RA AEAI AH T ED D P++VI ADV RR+ GEV TD AAGE
Sbjct 124 VVAVRRNAVHRAVAEAIRVSAHGTLVAEDVDPSAPIEVIRADVARRLRGEVPTDFRAAGE 183
Query 203 LVIAAR 208
L+IA R
Sbjct 184 LMIAVR 189
Score = 43.1 bits (100), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 52/100 (52%), Gaps = 7/100 (7%)
Query 189 MHGEVATDVVAAGELVIAARHL-LDPMPRGELSYGPLHEGGNAARKSVYRRLVQLWQARR 247
M GEV D AA EL+ A L ++P P ++ G L+E AR +Y L +LW ARR
Sbjct 3 MPGEVFADERAAFELLHAVIQLDIEPRP-ATIAQGVLYEALKTARAELYGHLTRLWHARR 61
Query 248 AVTDGDVD-----LRDARTLLTDLDSILREMRTAATIQQS 282
V + + R+ R LL D+ ++L AA ++++
Sbjct 62 EVLEARREQWLAGQREVRALLRDVLNVLLAAIAAAEVERT 101
>gi|240173502|ref|ZP_04752160.1| hypothetical protein MkanA1_29596 [Mycobacterium kansasii ATCC
12478]
Length=91
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/52 (49%), Positives = 36/52 (70%), Gaps = 0/52 (0%)
Query 230 AARKSVYRRLVQLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQ 281
AAR+ VY+RLV LW+ARRA T+ ++RD L DLDS++ ++ T I+Q
Sbjct 3 AAREPVYQRLVALWRARRAKTNTGREMRDLDALFIDLDSVVAQVSTIIAIEQ 54
>gi|303246451|ref|ZP_07332730.1| Baseplate J family protein [Desulfovibrio fructosovorans JJ]
gi|302492161|gb|EFL52036.1| Baseplate J family protein [Desulfovibrio fructosovorans JJ]
Length=354
Score = 37.4 bits (85), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 43/143 (31%), Positives = 64/143 (45%), Gaps = 8/143 (5%)
Query 111 AITAAGVERAYACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEAIHALAHRTGATE 170
A+T AG+ RA+ RR + + VA LG + AAA+A+ + HR A +
Sbjct 201 ALTVAGISRAWTFPNRRDLGSVDVA-----VLGPDGPATPSAIAAAQAVVDV-HRPAACK 254
Query 171 DADCLPPVDV-IHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGN 229
DA L P V + V R+ V T + +L A L +P G + Y E +
Sbjct 255 DAWVLSPTPVDVAVKVAVRLDASVTTLALYTAQLQDALEAALADLPPGGVVYRSKIEAVS 314
Query 230 AARKSVYRRLVQLWQARR-AVTD 251
++ V R V++ QA AV D
Sbjct 315 SSLPGVIDRQVKVPQANFVAVVD 337
>gi|341938225|gb|AEL08364.1| carboxylesterase [Xanthomonas campestris pv. raphani 756C]
Length=502
Score = 37.0 bits (84), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 53/178 (30%), Positives = 76/178 (43%), Gaps = 23/178 (12%)
Query 149 NSVCRAAAEAIHALAHRTGATEDADCLPPVDVIH--ADVTRRMHGEVATDVVAAGELVIA 206
N++ + E + +T A + A LP IH A+V G + T++VA LV+
Sbjct 96 NAMAKRGTEDCLYVEVQTPALQPAKPLPVFVWIHGGANVAGGADGHLPTNLVAQDMLVVT 155
Query 207 ARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLVQL----W-----------QARRAVTD 251
++ L G LS LH+G NAA + Y L Q+ W AR +
Sbjct 156 LQYRLGAF--GFLSLPELHDGDNAAAGN-YALLDQIAALQWVHDNIAQFGGDPARVTIAG 212
Query 252 GDVDLRDARTLLTDLDSILREMRTAATIQQSGTAGDGGGGRRQDSRRRNGPRRPARRG 309
+D L+ L + R + +AA I+QSGTAG G R R G AR G
Sbjct 213 QSAGGQDVGLLM--LSPLARGLFSAA-IEQSGTAGFGLPARSLQDNRALGVSIAARAG 267
Lambda K H
0.321 0.136 0.409
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 728181445350
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40