BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0603 Length=103 Score E Sequences producing significant alignments: (Bits) Value gi|15607743|ref|NP_215117.1| hypothetical protein Rv0603 [Mycoba... 194 4e-48 gi|31791785|ref|NP_854278.1| hypothetical protein Mb0619 [Mycoba... 190 7e-47 gi|308373533|ref|ZP_07432685.2| hypothetical exported protein [M... 183 6e-45 gi|292659485|pdb|2KGY|A Chain A, Solution Structure Of Rv0603 Pr... 149 2e-34 gi|289573202|ref|ZP_06453429.1| LOW QUALITY PROTEIN: hypothetica... 117 4e-25 gi|307083100|ref|ZP_07492213.1| hypothetical protein TMLG_03348 ... 81.3 4e-14 gi|170783571|ref|YP_001740088.1| hypothetical protein pChr15_26 ... 65.9 2e-09 gi|119715845|ref|YP_922810.1| hypothetical protein Noca_1609 [No... 48.9 2e-04 gi|15842558|ref|NP_337595.1| hypothetical protein MT3080.1 [Myco... 48.5 3e-04 gi|167967831|ref|ZP_02550108.1| hypothetical protein MtubH3_0728... 45.1 0.003 gi|111024795|ref|YP_707215.1| hypothetical protein RHA1_ro08010 ... 41.6 0.039 gi|339630671|ref|YP_004722313.1| hypothetical protein MAF_06100 ... 40.0 0.11 gi|329850292|ref|ZP_08265137.1| hypothetical protein ABI_31930 [... 34.7 5.4 >gi|15607743|ref|NP_215117.1| hypothetical protein Rv0603 [Mycobacterium tuberculosis H37Rv] gi|121636521|ref|YP_976744.1| hypothetical protein BCG_0649 [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|148660373|ref|YP_001281896.1| hypothetical protein MRA_0610 [Mycobacterium tuberculosis H37Ra] 61 more sequence titlesLength=103 Score = 194 bits (493), Expect = 4e-48, Method: Compositional matrix adjust. Identities = 103/103 (100%), Positives = 103/103 (100%), Gaps = 0/103 (0%) Query 1 MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVE 60 MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVE Sbjct 1 MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVE 60 Query 61 TETGEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDGG 103 TETGEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDGG Sbjct 61 TETGEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDGG 103 >gi|31791785|ref|NP_854278.1| hypothetical protein Mb0619 [Mycobacterium bovis AF2122/97] gi|31617372|emb|CAD93481.1| POSSIBLE EXPORTED PROTEIN [Mycobacterium bovis AF2122/97] Length=103 Score = 190 bits (482), Expect = 7e-47, Method: Compositional matrix adjust. Identities = 101/103 (99%), Positives = 102/103 (99%), Gaps = 0/103 (0%) Query 1 MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVE 60 MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVE Sbjct 1 MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVE 60 Query 61 TETGEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDGG 103 TETGEGAAAYGVLVTR DGTRVEVHLDRDFRVLDT+PADGDGG Sbjct 61 TETGEGAAAYGVLVTRADGTRVEVHLDRDFRVLDTKPADGDGG 103 >gi|308373533|ref|ZP_07432685.2| hypothetical exported protein [Mycobacterium tuberculosis SUMu005] gi|308375209|ref|ZP_07443088.2| hypothetical exported protein [Mycobacterium tuberculosis SUMu007] gi|308337276|gb|EFP26127.1| hypothetical exported protein [Mycobacterium tuberculosis SUMu005] gi|308347070|gb|EFP35921.1| hypothetical exported protein [Mycobacterium tuberculosis SUMu007] Length=99 Score = 183 bits (465), Expect = 6e-45, Method: Compositional matrix adjust. Identities = 98/99 (99%), Positives = 99/99 (100%), Gaps = 0/99 (0%) Query 5 VQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVETETG 64 +QFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVETETG Sbjct 1 MQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVETETG 60 Query 65 EGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDGG 103 EGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDGG Sbjct 61 EGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDGG 99 >gi|292659485|pdb|2KGY|A Chain A, Solution Structure Of Rv0603 Protein From Mycobacterium Tuberculosis H37rv Length=102 Score = 149 bits (375), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 77/78 (99%), Positives = 77/78 (99%), Gaps = 0/78 (0%) Query 26 AAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVETETGEGAAAYGVLVTRPDGTRVEVH 85 A AFDGEDEVTGPDADRARAAAVQAVPGGTAGEVETETGEGAAAYGVLVTRPDGTRVEVH Sbjct 25 AMAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVETETGEGAAAYGVLVTRPDGTRVEVH 84 Query 86 LDRDFRVLDTEPADGDGG 103 LDRDFRVLDTEPADGDGG Sbjct 85 LDRDFRVLDTEPADGDGG 102 >gi|289573202|ref|ZP_06453429.1| LOW QUALITY PROTEIN: hypothetical exported protein [Mycobacterium tuberculosis K85] gi|289537633|gb|EFD42211.1| LOW QUALITY PROTEIN: hypothetical exported protein [Mycobacterium tuberculosis K85] Length=67 Score = 117 bits (294), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 61/61 (100%), Positives = 61/61 (100%), Gaps = 0/61 (0%) Query 43 ARAAAVQAVPGGTAGEVETETGEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDG 102 ARAAAVQAVPGGTAGEVETETGEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDG Sbjct 7 ARAAAVQAVPGGTAGEVETETGEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDG 66 Query 103 G 103 G Sbjct 67 G 67 >gi|307083100|ref|ZP_07492213.1| hypothetical protein TMLG_03348 [Mycobacterium tuberculosis SUMu012] gi|308367175|gb|EFP56026.1| hypothetical protein TMLG_03348 [Mycobacterium tuberculosis SUMu012] Length=40 Score = 81.3 bits (199), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 40/40 (100%), Positives = 40/40 (100%), Gaps = 0/40 (0%) Query 64 GEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDGG 103 GEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDGG Sbjct 1 GEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGDGG 40 >gi|170783571|ref|YP_001740088.1| hypothetical protein pChr15_26 [Arthrobacter sp. Chr15] gi|150035079|gb|ABR67075.1| hypothetical protein [Arthrobacter sp. Chr15] Length=100 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 49/93 (53%), Positives = 69/93 (75%), Gaps = 2/93 (2%) Query 11 AVAAAAIGIGA-GSGIAAAFDGE-DEVTGPDADRARAAAVQAVPGGTAGEVETETGEGAA 68 A+AA + +GA G+GIA+ DG+ ++TGP AD+ARAAAVQA+PG AG+VE + G Sbjct 2 AIAAGVLALGATGAGIASFADGDGRDITGPAADQARAAAVQAIPGARAGKVEADNESGTD 61 Query 69 AYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGD 101 +Y V VT+PDGT+++V LD+ ++VL T P DGD Sbjct 62 SYRVDVTKPDGTQLQVRLDKAYQVLGTGPVDGD 94 >gi|119715845|ref|YP_922810.1| hypothetical protein Noca_1609 [Nocardioides sp. JS614] gi|119536506|gb|ABL81123.1| conserved hypothetical protein [Nocardioides sp. JS614] Length=107 Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 28/60 (47%), Positives = 39/60 (65%), Gaps = 1/60 (1%) Query 34 EVTGPDADRARAAAVQAVPGGTAGEVETETGEGAAAYGVLVTRPDGTRVEVHLDRDFRVL 93 + T AD+A AA+ A GGTA VET++ E A Y V VT+ DGT V+V LD +++V+ Sbjct 38 QYTQRQADKATEAALAATGGGTANSVETDS-ENGATYEVEVTKSDGTTVDVRLDENYQVV 96 >gi|15842558|ref|NP_337595.1| hypothetical protein MT3080.1 [Mycobacterium tuberculosis CDC1551] gi|148662851|ref|YP_001284374.1| hypothetical protein MRA_3030 [Mycobacterium tuberculosis H37Ra] gi|289571201|ref|ZP_06451428.1| conserved hypothetical protein [Mycobacterium tuberculosis T17] gi|13882870|gb|AAK47409.1| hypothetical protein MT3080.1 [Mycobacterium tuberculosis CDC1551] gi|148507003|gb|ABQ74812.1| hypothetical protein MRA_3030 [Mycobacterium tuberculosis H37Ra] gi|289544955|gb|EFD48603.1| conserved hypothetical protein [Mycobacterium tuberculosis T17] gi|323718381|gb|EGB27555.1| hypothetical protein TMMG_03528 [Mycobacterium tuberculosis CDC1551A] Length=72 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 25/43 (59%), Positives = 28/43 (66%), Gaps = 0/43 (0%) Query 59 VETETGEGAAAYGVLVTRPDGTRVEVHLDRDFRVLDTEPADGD 101 V T T A VLVT+PDGT+VEVHLD+ FR L TE D D Sbjct 30 VTTTTTRRDEAMRVLVTKPDGTQVEVHLDQGFRFLGTETVDND 72 >gi|167967831|ref|ZP_02550108.1| hypothetical protein MtubH3_07282 [Mycobacterium tuberculosis H37Ra] gi|254552080|ref|ZP_05142527.1| hypothetical protein Mtube_16752 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|294993910|ref|ZP_06799601.1| hypothetical protein Mtub2_05183 [Mycobacterium tuberculosis 210] gi|297635627|ref|ZP_06953407.1| hypothetical protein MtubK4_15967 [Mycobacterium tuberculosis KZN 4207] gi|297732625|ref|ZP_06961743.1| hypothetical protein MtubKR_16127 [Mycobacterium tuberculosis KZN R506] gi|313659957|ref|ZP_07816837.1| hypothetical protein MtubKV_16127 [Mycobacterium tuberculosis KZN V2475] Length=32 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. Identities = 21/30 (70%), Positives = 24/30 (80%), Gaps = 0/30 (0%) Query 72 VLVTRPDGTRVEVHLDRDFRVLDTEPADGD 101 VLVT+PDGT+VEVHLD+ FR L TE D D Sbjct 3 VLVTKPDGTQVEVHLDQGFRFLGTETVDND 32 >gi|111024795|ref|YP_707215.1| hypothetical protein RHA1_ro08010 [Rhodococcus jostii RHA1] gi|110823774|gb|ABG99057.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length=112 Score = 41.6 bits (96), Expect = 0.039, Method: Compositional matrix adjust. Identities = 30/78 (39%), Positives = 44/78 (57%), Gaps = 2/78 (2%) Query 19 IGAGSGIAAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVETETGEGAAAYGVLVTRPD 78 IG G +A++ D E +TG +A AAA+ G TE G+ + Y V VT PD Sbjct 19 IGTGVAVASSGDQETPITGDALTKASAAALAHT--GGGTVTGTEIGDEESLYEVEVTLPD 76 Query 79 GTRVEVHLDRDFRVLDTE 96 G +V+V LD++F V+ T+ Sbjct 77 GNQVDVQLDQNFTVVGTK 94 >gi|339630671|ref|YP_004722313.1| hypothetical protein MAF_06100 [Mycobacterium africanum GM041182] gi|339330027|emb|CCC25682.1| putative exported protein [Mycobacterium africanum GM041182] Length=103 Score = 40.0 bits (92), Expect = 0.11, Method: Compositional matrix adjust. Identities = 41/41 (100%), Positives = 41/41 (100%), Gaps = 0/41 (0%) Query 1 MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDAD 41 MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDAD Sbjct 1 MNRIVQFGVSAVAAAAIGIGAGSGIAAAFDGEDEVTGPDAD 41 >gi|329850292|ref|ZP_08265137.1| hypothetical protein ABI_31930 [Asticcacaulis biprosthecum C19] gi|328840607|gb|EGF90178.1| hypothetical protein ABI_31930 [Asticcacaulis biprosthecum C19] Length=183 Score = 34.7 bits (78), Expect = 5.4, Method: Compositional matrix adjust. Identities = 29/87 (34%), Positives = 45/87 (52%), Gaps = 13/87 (14%) Query 15 AAIGIGAGSGI--AAAFDGEDEVTGPDADRARAAAVQAVPGGTAGEVETETGEGAAAYGV 72 AA+ + AG I AA D EVT A ++A+PG T E + + +G Y V Sbjct 34 AAMSVTAGDLITEVAAADLPPEVT--------ATVLKAIPGMTIAEAQRKERDGRVYYDV 85 Query 73 LVTRPDGTRVEVHLDRD---FRVLDTE 96 RPDG+ VE+ L ++ F+V++ + Sbjct 86 EGKRPDGSDVELDLLQEGDAFKVVEIQ 112 Lambda K H 0.313 0.134 0.377 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 127822873252 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40