BLASTP 2.2.25+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.


Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters


Query= Rv0057

Length=173
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|254233456|ref|ZP_04926782.1| hypothetical protein TBCG_00056 ...   351    2e-95
gi|15607199|ref|NP_214571.1| hypothetical protein Rv0057 [Mycoba...   351    2e-95
gi|328456762|gb|AEB02185.1| hypothetical protein TBSG_00056 [Myc...   350    4e-95
gi|340625090|ref|YP_004743542.1| hypothetical protein MCAN_00561...   350    5e-95
gi|289441424|ref|ZP_06431168.1| conserved hypothetical protein [...   350    6e-95
gi|289445583|ref|ZP_06435327.1| conserved hypothetical protein [...   349    9e-95
gi|289567943|ref|ZP_06448170.1| hypothetical protein TBJG_03075 ...   333    4e-90
gi|289764171|ref|ZP_06523549.1| conserved hypothetical protein [...   325    2e-87
gi|254548989|ref|ZP_05139436.1| hypothetical protein Mtube_00746...   299    9e-80
gi|294995673|ref|ZP_06801364.1| hypothetical protein Mtub2_14493...   147    4e-34
gi|118082088|ref|XP_425420.2| PREDICTED: hypothetical protein [G...   39.7   0.18
gi|322387386|ref|ZP_08060996.1| beta-galactosidase [Streptococcu...   33.9   9.6


>gi|254233456|ref|ZP_04926782.1| hypothetical protein TBCG_00056 [Mycobacterium tuberculosis C]
 gi|289747826|ref|ZP_06507204.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
 gi|124603249|gb|EAY61524.1| hypothetical protein TBCG_00056 [Mycobacterium tuberculosis C]
 gi|289688354|gb|EFD55842.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=175

 Score = 351 bits (900),  Expect = 2e-95, Method: Compositional matrix adjust.
 Identities = 173/173 (100%), Positives = 173/173 (100%), Gaps = 0/173 (0%)

Query  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60
            MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT
Sbjct  3    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  62

Query  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120
            WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE
Sbjct  63   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  122

Query  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  173
            YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT
Sbjct  123  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  175


>gi|15607199|ref|NP_214571.1| hypothetical protein Rv0057 [Mycobacterium tuberculosis H37Rv]
 gi|15839434|ref|NP_334471.1| hypothetical protein MT0063 [Mycobacterium tuberculosis CDC1551]
 gi|31791234|ref|NP_853727.1| hypothetical protein Mb0058 [Mycobacterium bovis AF2122/97]
 44 more sequence titles
Length=173

 Score = 351 bits (900),  Expect = 2e-95, Method: Compositional matrix adjust.
 Identities = 173/173 (100%), Positives = 173/173 (100%), Gaps = 0/173 (0%)

Query  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60
            MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT
Sbjct  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60

Query  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120
            WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE
Sbjct  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120

Query  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  173
            YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT
Sbjct  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  173


>gi|328456762|gb|AEB02185.1| hypothetical protein TBSG_00056 [Mycobacterium tuberculosis KZN 4207]
Length=173

 Score = 350 bits (898),  Expect = 4e-95, Method: Compositional matrix adjust.
 Identities = 172/173 (99%), Positives = 173/173 (100%), Gaps = 0/173 (0%)

Query  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60
            MPVVTAVGRRRGFAMPWVSTARSGAVM+ANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT
Sbjct  1    MPVVTAVGRRRGFAMPWVSTARSGAVMMANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60

Query  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120
            WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE
Sbjct  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120

Query  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  173
            YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT
Sbjct  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  173


>gi|340625090|ref|YP_004743542.1| hypothetical protein MCAN_00561 [Mycobacterium canettii CIPT 140010059]
 gi|340003280|emb|CCC42397.1| hypothetical protein MCAN_00561 [Mycobacterium canettii CIPT 140010059]
Length=173

 Score = 350 bits (898),  Expect = 5e-95, Method: Compositional matrix adjust.
 Identities = 172/173 (99%), Positives = 173/173 (100%), Gaps = 0/173 (0%)

Query  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60
            MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT
Sbjct  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60

Query  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120
            WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNT+VDGYAHAMHSSINSGPLE
Sbjct  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTEVDGYAHAMHSSINSGPLE  120

Query  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  173
            YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT
Sbjct  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  173


>gi|289441424|ref|ZP_06431168.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
 gi|289414343|gb|EFD11583.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
Length=180

 Score = 350 bits (897),  Expect = 6e-95, Method: Compositional matrix adjust.
 Identities = 172/173 (99%), Positives = 172/173 (99%), Gaps = 0/173 (0%)

Query  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60
             PVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT
Sbjct  8    QPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  67

Query  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120
            WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE
Sbjct  68   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  127

Query  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  173
            YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT
Sbjct  128  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  180


>gi|289445583|ref|ZP_06435327.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
 gi|289418541|gb|EFD15742.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
Length=173

 Score = 349 bits (895),  Expect = 9e-95, Method: Compositional matrix adjust.
 Identities = 172/173 (99%), Positives = 172/173 (99%), Gaps = 0/173 (0%)

Query  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60
            MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKAN PGAVT
Sbjct  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANMPGAVT  60

Query  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120
            WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE
Sbjct  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120

Query  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  173
            YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT
Sbjct  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGESPWRSLMT  173


>gi|289567943|ref|ZP_06448170.1| hypothetical protein TBJG_03075 [Mycobacterium tuberculosis T17]
 gi|289541696|gb|EFD45345.1| hypothetical protein TBJG_03075 [Mycobacterium tuberculosis T17]
Length=172

 Score = 333 bits (855),  Expect = 4e-90, Method: Compositional matrix adjust.
 Identities = 164/165 (99%), Positives = 164/165 (99%), Gaps = 0/165 (0%)

Query  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60
             PVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT
Sbjct  8    QPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  67

Query  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120
            WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE
Sbjct  68   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  127

Query  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGE  165
            YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGE
Sbjct  128  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGACVGGGE  172


>gi|289764171|ref|ZP_06523549.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503]
 gi|289711677|gb|EFD75693.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503]
Length=165

 Score = 325 bits (832),  Expect = 2e-87, Method: Compositional matrix adjust.
 Identities = 160/160 (100%), Positives = 160/160 (100%), Gaps = 0/160 (0%)

Query  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60
            MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT
Sbjct  3    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  62

Query  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120
            WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE
Sbjct  63   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  122

Query  121  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGAC  160
            YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGAC
Sbjct  123  YLPATFSVFPALGDVGDLGGGVGAATYALDRLSNMRSGAC  162


>gi|254548989|ref|ZP_05139436.1| hypothetical protein Mtube_00746 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM']
Length=147

 Score = 299 bits (766),  Expect = 9e-80, Method: Compositional matrix adjust.
 Identities = 147/147 (100%), Positives = 147/147 (100%), Gaps = 0/147 (0%)

Query  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60
            MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT
Sbjct  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60

Query  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120
            WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE
Sbjct  61   WLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLNTDVDGYAHAMHSSINSGPLE  120

Query  121  YLPATFSVFPALGDVGDLGGGVGAATY  147
            YLPATFSVFPALGDVGDLGGGVGAATY
Sbjct  121  YLPATFSVFPALGDVGDLGGGVGAATY  147


>gi|294995673|ref|ZP_06801364.1| hypothetical protein Mtub2_14493 [Mycobacterium tuberculosis 210]
Length=80

 Score = 147 bits (372),  Expect = 4e-34, Method: Compositional matrix adjust.
 Identities = 72/72 (100%), Positives = 72/72 (100%), Gaps = 0/72 (0%)

Query  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60
            MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT
Sbjct  1    MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMCLKANTPGAVT  60

Query  61   WLDTPKRFLSTQ  72
            WLDTPKRFLSTQ
Sbjct  61   WLDTPKRFLSTQ  72


>gi|118082088|ref|XP_425420.2| PREDICTED: hypothetical protein [Gallus gallus]
Length=929

 Score = 39.7 bits (91),  Expect = 0.18, Method: Compositional matrix adjust.
 Identities = 36/109 (34%), Positives = 50/109 (46%), Gaps = 22/109 (20%)

Query  63   DTPKR-FLSTQTASRCMAVNSSD---VVTGRIDPQVLHTPLNTD------VDGYAHAMHS  112
            D+P R F S Q++    ++N S     VTGRID  VL   L+ +      V+   H  HS
Sbjct  360  DSPSRLFYSIQSSDTYFSINPSTGVLQVTGRIDRDVLPLQLHPNISVIVRVEDSPHGGHS  419

Query  113  S----------INSGPLEYLPATFSVFPALGDVGDLGGGVGAATYALDR  151
            S          IN  P E  P+TFS++ + G +  LG   G   Y + R
Sbjct  420  SEMEITVIIGDINDNPPECNPSTFSLYYSYGTI--LGIQEGDHIYRMQR  466


>gi|322387386|ref|ZP_08060996.1| beta-galactosidase [Streptococcus infantis ATCC 700779]
 gi|321141915|gb|EFX37410.1| beta-galactosidase [Streptococcus infantis ATCC 700779]
Length=2307

 Score = 33.9 bits (76),  Expect = 9.6, Method: Composition-based stats.
 Identities = 22/70 (32%), Positives = 37/70 (53%), Gaps = 4/70 (5%)

Query  90    IDPQVLHTPLNTDVDGYAHAMHSSINSGPLEYLPATFSVFPALGDVGDLGGGVGAATYAL  149
             ++P V   P  T   G  +A+ +++N  P EY     +VF A+ +V +  GGV AA  A+
Sbjct  2152  VEPAVHEVPEYT---GGVNAVEAAVNEVP-EYKGGVNAVFAAVNEVPEYTGGVNAAEAAV  2207

Query  150   DRLSNMRSGA  159
             + +   + GA
Sbjct  2208  NDVPEYKGGA  2217


Lambda      K        H
   0.319    0.132    0.410

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Effective search space used: 143230884104


  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
  excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011  4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40