BLASTP 2.2.25+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters

Query= Rv1424c
Length=253

                                                                     Score     E
Sequences producing significant alignments:                         (Bits)  Value

gi|15608562|ref|NP_215940.1| hypothetical protein Rv1424c [Mycob...   509    2e-142
gi|289569441|ref|ZP_06449668.1| membrane protein [Mycobacterium ...   464    5e-129
gi|167968463|ref|ZP_02550740.1| hypothetical membrane protein [M...   410    1e-112
gi|15840881|ref|NP_335918.1| hypothetical protein MT1467 [Mycoba...   229    4e-58
gi|300715081|ref|YP_003739884.1| conserved uncharacterized prote...  37.0    2.6

>gi|15608562|ref|NP_215940.1| hypothetical protein Rv1424c [Mycobacterium tuberculosis H37Rv]
 gi|31792618|ref|NP_855111.1| hypothetical protein Mb1459c [Mycobacterium bovis AF2122/97]
 gi|121637354|ref|YP_977577.1| hypothetical protein BCG_1485c [Mycobacterium bovis BCG str. Pasteur 1173P2]
 45 more sequence titles
Length=253

Score = 509 bits (1310), Expect = 2e-142, Method: Compositional matrix adjust.
Identities = 253/253 (100%), Positives = 253/253 (100%), Gaps = 0/253 (0%)

Query  1    MTVVPGAPSRPASAVSRPSYRQCVQASAQTSARRYSFPSYRRPPAEKLVFPVLLGILTLL  60
            MTVVPGAPSRPASAVSRPSYRQCVQASAQTSARRYSFPSYRRPPAEKLVFPVLLGILTLL
Sbjct  1    MTVVPGAPSRPASAVSRPSYRQCVQASAQTSARRYSFPSYRRPPAEKLVFPVLLGILTLL  60

Query  61   LSACQTASASGYNEPRGYDRATLKLVFSMDLGMCLNRFTYDSKLAPSRPQVVACDSREAR  120
            LSACQTASASGYNEPRGYDRATLKLVFSMDLGMCLNRFTYDSKLAPSRPQVVACDSREAR
Sbjct  61   LSACQTASASGYNEPRGYDRATLKLVFSMDLGMCLNRFTYDSKLAPSRPQVVACDSREAR  120

Query  121  IRNDGFHANAPSCMRIDYELITQNHRAYYCLKYLVRVGYCYPAVTTPGKPPSVLLYAPSA  180
            IRNDGFHANAPSCMRIDYELITQNHRAYYCLKYLVRVGYCYPAVTTPGKPPSVLLYAPSA
Sbjct  121  IRNDGFHANAPSCMRIDYELITQNHRAYYCLKYLVRVGYCYPAVTTPGKPPSVLLYAPSA  180

Query  181  CDESLPSPRVATALVPGTRSANREFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPAI  240
            CDESLPSPRVATALVPGTRSANREFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPAI
Sbjct  181  CDESLPSPRVATALVPGTRSANREFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPAI  240

Query  241  PPASSQLVCVAPK  253
            PPASSQLVCVAPK
Sbjct  241  PPASSQLVCVAPK  253

>gi|289569441|ref|ZP_06449668.1| membrane protein [Mycobacterium tuberculosis T17]
 gi|289543195|gb|EFD46843.1| membrane protein [Mycobacterium tuberculosis T17]
Length=230

Score = 464 bits (1194), Expect = 5e-129, Method: Compositional matrix adjust.
Identities = 229/230 (99%), Positives = 230/230 (100%), Gaps = 0/230 (0%)

Query  24   VQASAQTSARRYSFPSYRRPPAEKLVFPVLLGILTLLLSACQTASASGYNEPRGYDRATL  83
            +QASAQTSARRYSFPSYRRPPAEKLVFPVLLGILTLLLSACQTASASGYNEPRGYDRATL
Sbjct  1    MQASAQTSARRYSFPSYRRPPAEKLVFPVLLGILTLLLSACQTASASGYNEPRGYDRATL  60

Query  84   KLVFSMDLGMCLNRFTYDSKLAPSRPQVVACDSREARIRNDGFHANAPSCMRIDYELITQ  143
            KLVFSMDLGMCLNRFTYDSKLAPSRPQVVACDSREARIRNDGFHANAPSCMRIDYELITQ
Sbjct  61   KLVFSMDLGMCLNRFTYDSKLAPSRPQVVACDSREARIRNDGFHANAPSCMRIDYELITQ  120

Query  144  NHRAYYCLKYLVRVGYCYPAVTTPGKPPSVLLYAPSACDESLPSPRVATALVPGTRSANR  203
            NHRAYYCLKYLVRVGYCYPAVTTPGKPPSVLLYAPSACDESLPSPRVATALVPGTRSANR
Sbjct  121  NHRAYYCLKYLVRVGYCYPAVTTPGKPPSVLLYAPSACDESLPSPRVATALVPGTRSANR  180

Query  204  EFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPAIPPASSQLVCVAPK  253
            EFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPAIPPASSQLVCVAPK
Sbjct  181  EFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPAIPPASSQLVCVAPK  230

>gi|167968463|ref|ZP_02550740.1| hypothetical membrane protein [Mycobacterium tuberculosis H37Ra]
 gi|254550438|ref|ZP_05140885.1| hypothetical protein Mtube_08247 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM']
 gi|294994986|ref|ZP_06800677.1| hypothetical protein Mtub2_10855 [Mycobacterium tuberculosis 210]
 30 more sequence titles
Length=202

Score = 410 bits (1054), Expect = 1e-112, Method: Compositional matrix adjust.
Identities = 201/202 (99%), Positives = 202/202 (100%), Gaps = 0/202 (0%)

Query  52   VLLGILTLLLSACQTASASGYNEPRGYDRATLKLVFSMDLGMCLNRFTYDSKLAPSRPQV  111
            +LLGILTLLLSACQTASASGYNEPRGYDRATLKLVFSMDLGMCLNRFTYDSKLAPSRPQV
Sbjct  1    MLLGILTLLLSACQTASASGYNEPRGYDRATLKLVFSMDLGMCLNRFTYDSKLAPSRPQV  60

Query  112  VACDSREARIRNDGFHANAPSCMRIDYELITQNHRAYYCLKYLVRVGYCYPAVTTPGKPP  171
            VACDSREARIRNDGFHANAPSCMRIDYELITQNHRAYYCLKYLVRVGYCYPAVTTPGKPP
Sbjct  61   VACDSREARIRNDGFHANAPSCMRIDYELITQNHRAYYCLKYLVRVGYCYPAVTTPGKPP  120

Query  172  SVLLYAPSACDESLPSPRVATALVPGTRSANREFSRFVVTEIKSLGAGGRCDSASVSLQP  231
            SVLLYAPSACDESLPSPRVATALVPGTRSANREFSRFVVTEIKSLGAGGRCDSASVSLQP
Sbjct  121  SVLLYAPSACDESLPSPRVATALVPGTRSANREFSRFVVTEIKSLGAGGRCDSASVSLQP  180

Query  232  PEEIEGPAIPPASSQLVCVAPK  253
            PEEIEGPAIPPASSQLVCVAPK
Sbjct  181  PEEIEGPAIPPASSQLVCVAPK  202

>gi|15840881|ref|NP_335918.1| hypothetical protein MT1467 [Mycobacterium tuberculosis CDC1551]
 gi|13881082|gb|AAK45732.1| hypothetical protein MT1467 [Mycobacterium tuberculosis CDC1551]
Length=114

Score = 229 bits (583), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 113/114 (99%), Positives = 114/114 (100%), Gaps = 0/114 (0%)

Query  140  LITQNHRAYYCLKYLVRVGYCYPAVTTPGKPPSVLLYAPSACDESLPSPRVATALVPGTR  199
            +ITQNHRAYYCLKYLVRVGYCYPAVTTPGKPPSVLLYAPSACDESLPSPRVATALVPGTR
Sbjct  1    MITQNHRAYYCLKYLVRVGYCYPAVTTPGKPPSVLLYAPSACDESLPSPRVATALVPGTR  60

Query  200  SANREFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPAIPPASSQLVCVAPK  253
            SANREFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPAIPPASSQLVCVAPK
Sbjct  61   SANREFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPAIPPASSQLVCVAPK  114

>gi|300715081|ref|YP_003739884.1| conserved uncharacterized protein [Erwinia billingiae Eb661]
 gi|299060917|emb|CAX58024.1| conserved uncharacterized protein [Erwinia billingiae Eb661]
Length=381

Score = 37.0 bits (84), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 35/107 (33%), Positives = 45/107 (43%), Gaps = 11/107 (10%)

Query  70   SGYNEPRGYDRATL---KLVFSMDLGM---CLNRFTYDSKLAPSRPQVVACDSREARIRN  123
            S +N P  +  AT    K VFS DLG+      R + D KL P+ P  +   S  A  R+
Sbjct  172  SDHNGPHAHMIATDLSGKFVFSTDLGLDRIYQYRLSADGKLQPNTPAWIPASSAGAGPRH  231

Query  124  DGFHANAPSCMRIDYELITQNHRAYYCLKYLVRVGYCYPAVTTPGKP  170
              FH +  S   I+ E  T  H   Y L    + G    A T P  P
Sbjct  232  FVFHPDGHSVYLINEEASTLTH---YLLNR--KTGTLSEAATVPALP  273

Lambda      K        H
   0.319    0.133    0.402

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Effective search space used: 368391130380

  Database: All non-redundant GenBank CDS
  translations+PDB+SwissProt+PIR+PRF excluding environmental samples
  from WGS projects
    Posted date:  Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database: 15,229,318

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
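
Note on reading the Expect column: the E-values in this report can be roughly cross-checked from the bit scores and the "Effective search space used" value with the standard Karlin-Altschul relation E = N * 2^(-S'), where S' is the bit score and N the effective search space. The sketch below is only an approximation under that assumption. The function name is hypothetical, the bit scores are the rounded values printed in the report, and composition-based adjustment may shift the reported E-values slightly, so agreement is expected only to about one significant figure.

```python
# Effective search space as printed in the report footer.
EFFECTIVE_SEARCH_SPACE = 368391130380


def expect_from_bits(bit_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value from a bit score: E = N * 2**(-S').

    Hypothetical helper for illustration; not part of BLAST itself.
    """
    return search_space * 2.0 ** (-bit_score)


# (bit score, E-value as reported) pairs taken from the hit table.
hits = [(509, "2e-142"), (464, "5e-129"), (410, "1e-112"),
        (229, "4e-58"), (37.0, "2.6")]

for bits, reported in hits:
    estimate = expect_from_bits(bits)
    print(f"{bits:>6} bits  estimated E = {estimate:.1e}  reported E = {reported}")
```

The last hit (Erwinia billingiae, 37.0 bits) has an E-value above 1, meaning an alignment at least this good is expected by chance in a database of this size; only the four Mycobacterium hits are statistically meaningful.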