BLASTP 2.2.25+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters

Query= Rv0961
Length=115

                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|15608101|ref|NP_215476.1| integral membrane protein [Mycobact...   224     2e-57
gi|148660741|ref|YP_001282264.1| putative integral membrane prot...   221     3e-56
gi|167967734|ref|ZP_02550011.1| hypothetical integral membrane p...   210     5e-53
gi|340625974|ref|YP_004744426.1| putative integral membrane prot...   203     7e-51
gi|313657797|ref|ZP_07814677.1| putative integral membrane prote...   179     1e-43
gi|289749489|ref|ZP_06508867.1| integral membrane protein [Mycob...   97.1    9e-19
gi|296140479|ref|YP_003647722.1| hypothetical protein Tpau_2785 ...   40.4    0.083
gi|334128953|ref|ZP_08502829.1| sulfatase [Centipeda periodontii...   35.8    2.3

>gi|15608101|ref|NP_215476.1| integral membrane protein [Mycobacterium tuberculosis H37Rv]
gi|15840386|ref|NP_335423.1| hypothetical protein MT0989 [Mycobacterium tuberculosis CDC1551]
gi|31792150|ref|NP_854643.1| integral membrane protein [Mycobacterium bovis AF2122/97]
42 more sequence titles
Length=115

Score = 224 bits (572), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 114/115 (99%), Positives = 115/115 (100%), Gaps = 0/115 (0%)

Query  1    VRVPSQWMISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHV  60
            +RVPSQWMISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHV
Sbjct  1    MRVPSQWMISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHV  60

Query  61   FPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH  115
            FPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH
Sbjct  61   FPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH  115

>gi|148660741|ref|YP_001282264.1| putative integral membrane protein [Mycobacterium tuberculosis H37Ra]
gi|297730469|ref|ZP_06959587.1| hypothetical protein MtubKR_05226 [Mycobacterium tuberculosis KZN R506]
gi|298524454|ref|ZP_07011863.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
29 more sequence titles
Length=113

Score = 221 bits (563), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 112/113 (99%), Positives = 113/113 (100%), Gaps = 0/113 (0%)

Query  3    VPSQWMISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHVFP  62
            +PSQWMISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHVFP
Sbjct  1    MPSQWMISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHVFP  60

Query  63   AMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH  115
            AMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH
Sbjct  61   AMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH  113

>gi|167967734|ref|ZP_02550011.1| hypothetical integral membrane protein [Mycobacterium tuberculosis H37Ra]
gi|254549942|ref|ZP_05140389.1| hypothetical protein Mtube_05683 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM']
gi|297633484|ref|ZP_06951264.1| hypothetical protein MtubK4_05146 [Mycobacterium tuberculosis KZN 4207]
Length=108

Score = 210 bits (535), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 108/108 (100%), Positives = 108/108 (100%), Gaps = 0/108 (0%)

Query  8    MISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHVFPAMVTM  67
            MISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHVFPAMVTM
Sbjct  1    MISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHVFPAMVTM  60

Query  68   WIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH  115
            WIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH
Sbjct  61   WIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH  108

>gi|340625974|ref|YP_004744426.1| putative integral membrane protein [Mycobacterium canettii CIPT 140010059]
gi|340004164|emb|CCC43302.1| putative integral membrane protein [Mycobacterium canettii CIPT 140010059]
Length=115

Score = 203 bits (516), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 112/115 (98%), Positives = 114/115 (99%), Gaps = 0/115 (0%)

Query  1    VRVPSQWMISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHV  60
            +RVPSQW ISSRVTVAWNIVGYL+YAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHV
Sbjct  1    MRVPSQWTISSRVTVAWNIVGYLLYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHV  60

Query  61   FPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH  115
            FPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH
Sbjct  61   FPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLALGLVYVAADAVLH  115

>gi|313657797|ref|ZP_07814677.1| putative integral membrane protein [Mycobacterium tuberculosis KZN V2475]
Length=92

Score = 179 bits (454), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 91/92 (99%), Positives = 92/92 (100%), Gaps = 0/92 (0%)

Query  24   VYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHVFPAMVTMWIGVGAVLLLTLVVMV  83
            +YAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHVFPAMVTMWIGVGAVLLLTLVVMV
Sbjct  1    MYAALAFVGGFAVWFSLFFAMATDGCHDSACDASYHVFPAMVTMWIGVGAVLLLTLVVMV  60

Query  84   RNSSRGNVVIGWPFVGLLALGLVYVAADAVLH  115
            RNSSRGNVVIGWPFVGLLALGLVYVAADAVLH
Sbjct  61   RNSSRGNVVIGWPFVGLLALGLVYVAADAVLH  92

>gi|289749489|ref|ZP_06508867.1| integral membrane protein [Mycobacterium tuberculosis T92]
gi|289690076|gb|EFD57505.1| integral membrane protein [Mycobacterium tuberculosis T92]
Length=80

Score = 97.1 bits (240), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 56/62 (91%), Positives = 57/62 (92%), Gaps = 0/62 (0%)

Query  44   MATDGCHDSACDASYHVFPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGLLAL  103
            MATDGCHDSACDASYHVFPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWP +G   L
Sbjct  1    MATDGCHDSACDASYHVFPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPLLGCWRL  60

Query  104  GL  105
             L
Sbjct  61   AL  62

>gi|296140479|ref|YP_003647722.1| hypothetical protein Tpau_2785 [Tsukamurella paurometabola DSM 20162]
gi|296028613|gb|ADG79383.1| hypothetical protein Tpau_2785 [Tsukamurella paurometabola DSM 20162]
Length=134

Score = 40.4 bits (93), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 24/70 (35%), Positives = 35/70 (50%), Gaps = 1/70 (1%)

Query  32   GGFAVWFSLFFAMATDGCH-DSACDASYHVFPAMVTMWIGVGAVLLLTLVVMVRNSSRGN  90
            GG   +FSLFF M  +GC  +  C     V  A  T+W GV   L++ L+  +  + R
Sbjct  45   GGICAFFSLFFGMNLNGCQANRDCPQWDQVLRATWTVWGGVTLALVVALIGTIVCAVRRR  104

Query  91   VVIGWPFVGL  100
             V  WP +G+
Sbjct  105  YVSYWPLIGM  114

>gi|334128953|ref|ZP_08502829.1| sulfatase [Centipeda periodontii DSM 2778]
gi|333385980|gb|EGK57205.1| sulfatase [Centipeda periodontii DSM 2778]
Length=673

Score = 35.8 bits (81), Expect = 2.3, Method: Composition-based stats.
Identities = 22/61 (37%), Positives = 31/61 (51%), Gaps = 2/61 (3%)

Query  17   WNIVGYLVYAALAFVGGFAVWF--SLFFAMATDGCHDSACDASYHVFPAMVTMWIGVGAV  74
            W ++GYL Y AL F+ G AV    SL ++  TDG    A +A   V   ++  W+  G
Sbjct  147  WLLLGYLPYLALCFLAGNAVLSLPSLSYSTVTDGAGQYAVNALVFVLSIVLFYWLRYGGT  206

Query  75   L  75
            L
Sbjct  207  L  207

Lambda      K        H
   0.331    0.140    0.465

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Effective search space used: 131043835296

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
  Posted date: Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database: 15,229,318

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40