BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2076c Length=83 Score E Sequences producing significant alignments: (Bits) Value gi|15841565|ref|NP_336602.1| hypothetical protein MT2136 [Mycoba... 162 1e-38 gi|15609213|ref|NP_216592.1| hypothetical protein Rv2076c [Mycob... 160 7e-38 gi|183984079|ref|YP_001852370.1| hypothetical protein MMAR_5576 ... 67.0 1e-09 gi|118619177|ref|YP_907509.1| hypothetical protein MUL_3974 [Myc... 63.9 8e-09 gi|309362254|emb|CAP28389.2| CBR-SRSX-13 protein [Caenorhabditis... 34.7 4.8 gi|51556650|ref|YP_068284.1| rh195 [Macacine herpesvirus 3] >gi|... 34.3 5.8 >gi|15841565|ref|NP_336602.1| hypothetical protein MT2136 [Mycobacterium tuberculosis CDC1551] gi|254551103|ref|ZP_05141550.1| hypothetical protein Mtube_11676 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|297731640|ref|ZP_06960758.1| hypothetical protein MtubKR_11151 [Mycobacterium tuberculosis KZN R506] gi|13881812|gb|AAK46416.1| hypothetical protein MT2136 [Mycobacterium tuberculosis CDC1551] Length=99 Score = 162 bits (411), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 83/83 (100%), Positives = 83/83 (100%), Gaps = 0/83 (0%) Query 1 VVVCLIGGVAGSLWPRPAGRLRGGCYFAFMGVAWVLLAISAIANAVKGSLWWDIWSLGLL 60 VVVCLIGGVAGSLWPRPAGRLRGGCYFAFMGVAWVLLAISAIANAVKGSLWWDIWSLGLL Sbjct 17 VVVCLIGGVAGSLWPRPAGRLRGGCYFAFMGVAWVLLAISAIANAVKGSLWWDIWSLGLL 76 Query 61 VLIPAVVYGKMRRSRRISSDQDR 83 VLIPAVVYGKMRRSRRISSDQDR Sbjct 77 VLIPAVVYGKMRRSRRISSDQDR 99 >gi|15609213|ref|NP_216592.1| hypothetical protein Rv2076c [Mycobacterium tuberculosis H37Rv] gi|31793258|ref|NP_855751.1| hypothetical protein Mb2101c [Mycobacterium bovis AF2122/97] gi|121637960|ref|YP_978184.1| hypothetical protein BCG_2094c [Mycobacterium bovis BCG str. Pasteur 1173P2] 62 more sequence titlesLength=83 Score = 160 bits (404), Expect = 7e-38, Method: Compositional matrix adjust. Identities = 82/83 (99%), Positives = 83/83 (100%), Gaps = 0/83 (0%) Query 1 VVVCLIGGVAGSLWPRPAGRLRGGCYFAFMGVAWVLLAISAIANAVKGSLWWDIWSLGLL 60 +VVCLIGGVAGSLWPRPAGRLRGGCYFAFMGVAWVLLAISAIANAVKGSLWWDIWSLGLL Sbjct 1 MVVCLIGGVAGSLWPRPAGRLRGGCYFAFMGVAWVLLAISAIANAVKGSLWWDIWSLGLL 60 Query 61 VLIPAVVYGKMRRSRRISSDQDR 83 VLIPAVVYGKMRRSRRISSDQDR Sbjct 61 VLIPAVVYGKMRRSRRISSDQDR 83 >gi|183984079|ref|YP_001852370.1| hypothetical protein MMAR_5576 [Mycobacterium marinum M] gi|183177405|gb|ACC42515.1| conserved hypothetical protein [Mycobacterium marinum M] Length=91 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 37/61 (61%), Positives = 43/61 (71%), Gaps = 5/61 (8%) Query 19 GRLRGGC-----YFAFMGVAWVLLAISAIANAVKGSLWWDIWSLGLLVLIPAVVYGKMRR 73 RLR G Y A MGVAW+LLA+SAIANA++GSLW + SL LLVLIP VY K+R Sbjct 28 ARLRPGVLHARRYVAIMGVAWLLLAVSAIANAIRGSLWGTVGSLVLLVLIPTAVYSKLRN 87 Query 74 S 74 S Sbjct 88 S 88 >gi|118619177|ref|YP_907509.1| hypothetical protein MUL_3974 [Mycobacterium ulcerans Agy99] gi|118571287|gb|ABL06038.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] Length=91 Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 37/66 (57%), Positives = 45/66 (69%), Gaps = 1/66 (1%) Query 9 VAGSLWPRPAGRLRGGCYFAFMGVAWVLLAISAIANAVKGSLWWDIWSLGLLVLIPAVVY 68 +AG RP G L Y A MG AW+LLA+SAIANA++GSLW + SL LLVLIP VY Sbjct 24 IAGFARLRP-GVLHARRYVAIMGFAWLLLAVSAIANAIRGSLWGTVGSLVLLVLIPTAVY 82 Query 69 GKMRRS 74 K+R + Sbjct 83 SKLRNN 88 >gi|309362254|emb|CAP28389.2| CBR-SRSX-13 protein [Caenorhabditis briggsae AF16] Length=291 Score = 34.7 bits (78), Expect = 4.8, Method: Composition-based stats. Identities = 23/78 (30%), Positives = 38/78 (49%), Gaps = 11/78 (14%) Query 2 VVCLIGGVAGSLWPRPAGRLRGGC-----YFAFMGVAWVLLAISAIAN---AVKGSLWW- 52 +VCL G V G P+ R R C ++ F A VL+ + +A+ +VK ++++ Sbjct 55 IVCLFGTVLGGFMMDPSTRTRQNCFVHISFYIFCQAAQVLIMLIIMADILVSVKFAIFYH 114 Query 53 --DIWSLGLLVLIPAVVY 68 W L V IP ++Y Sbjct 115 NLSTWKYTLSVSIPVLIY 132 >gi|51556650|ref|YP_068284.1| rh195 [Macacine herpesvirus 3] gi|31378067|gb|AAP50717.1| rh195 [Rhesus cytomegalovirus strain 68-1] gi|73695839|gb|AAZ80713.1| rh195 [Macacine herpesvirus 3] Length=247 Score = 34.3 bits (77), Expect = 5.8, Method: Compositional matrix adjust. Identities = 23/69 (34%), Positives = 34/69 (50%), Gaps = 5/69 (7%) Query 14 WPRPAGRLRGGCYFAFMGVAWVLLAI-SAIANAVKGSLWWDIWSLGLLVLIPAVVYGKMR 72 W RP GC+ VA+ L+A+ S + + + + I+SLG ++LIP Y R Sbjct 137 WIRPLA----GCFITSFAVAFSLIALFSKNLSIILRATGYAIFSLGFVLLIPTQFYLVHR 192 Query 73 RSRRISSDQ 81 R RI S Sbjct 193 RPERIKSSS 201 Lambda K H 0.329 0.143 0.488 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 127525940264 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40