BLASTP 2.2.25+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters


Query= Rv2312

Length=89

                                                                     Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|15609449|ref|NP_216828.1| hypothetical protein Rv2312 [Mycoba...  177     3e-43
gi|340627322|ref|YP_004745774.1| hypothetical protein MCAN_23431...  177     4e-43
gi|240169986|ref|ZP_04748645.1| hypothetical protein MkanA1_1178...  106     1e-21
gi|145222998|ref|YP_001133676.1| hypothetical protein Mflv_2411 ...  83.2    1e-14
gi|108800739|ref|YP_640936.1| hypothetical protein Mmcs_3774 [My...  81.6    4e-14
gi|319950294|ref|ZP_08024214.1| hypothetical protein ES5_11991 [...  48.9    2e-04
gi|162454992|ref|YP_001617359.1| hypothetical protein sce6710 [S...  37.7    0.58
gi|345138891|dbj|BAK68500.1| hypothetical protein SLG_38250 [Sph...  35.0    4.1


>gi|15609449|ref|NP_216828.1| hypothetical protein Rv2312 [Mycobacterium tuberculosis H37Rv]
 gi|15841816|ref|NP_336853.1| hypothetical protein MT2374 [Mycobacterium tuberculosis CDC1551]
 gi|31793495|ref|NP_855988.1| hypothetical protein Mb2339 [Mycobacterium bovis AF2122/97]
 50 more sequence titles
Length=89

 Score = 177 bits (450), Expect = 3e-43, Method: Compositional matrix adjust.
 Identities = 89/89 (100%), Positives = 89/89 (100%), Gaps = 0/89 (0%)

Query  1   MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPINTPGPGRTKQFMEELSQ  60
           MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPINTPGPGRTKQFMEELSQ
Sbjct  1   MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPINTPGPGRTKQFMEELSQ  60

Query  61  LASAPGPDIDGGIDLTDDEFQAFLQAARS  89
           LASAPGPDIDGGIDLTDDEFQAFLQAARS
Sbjct  61  LASAPGPDIDGGIDLTDDEFQAFLQAARS  89


>gi|340627322|ref|YP_004745774.1| hypothetical protein MCAN_23431 [Mycobacterium canettii CIPT 140010059]
 gi|340005512|emb|CCC44674.1| hypothetical protein MCAN_23431 [Mycobacterium canettii CIPT 140010059]
Length=89

 Score = 177 bits (449), Expect = 4e-43, Method: Compositional matrix adjust.
 Identities = 88/89 (99%), Positives = 89/89 (100%), Gaps = 0/89 (0%)

Query  1   MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPINTPGPGRTKQFMEELSQ  60
           MMKEIELHLVDAAAPSGEIA+KDLAALATALQELTTRISRDPINTPGPGRTKQFMEELSQ
Sbjct  1   MMKEIELHLVDAAAPSGEIAVKDLAALATALQELTTRISRDPINTPGPGRTKQFMEELSQ  60

Query  61  LASAPGPDIDGGIDLTDDEFQAFLQAARS  89
           LASAPGPDIDGGIDLTDDEFQAFLQAARS
Sbjct  61  LASAPGPDIDGGIDLTDDEFQAFLQAARS  89


>gi|240169986|ref|ZP_04748645.1| hypothetical protein MkanA1_11784 [Mycobacterium kansasii ATCC 12478]
Length=285

 Score = 106 bits (265), Expect = 1e-21, Method: Compositional matrix adjust.
 Identities = 51/61 (84%), Positives = 56/61 (92%), Gaps = 0/61 (0%)

Query  1   MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPINTPGPGRTKQFMEELSQ  60
           M++EIEL LVDA APSGE+ +KDLAA+ATALQELTTRISRD IN PGPGRTKQFMEELSQ
Sbjct  1   MIREIELRLVDAPAPSGEVTVKDLAAIATALQELTTRISRDVINMPGPGRTKQFMEELSQ  60

Query  61  L  61
           L
Sbjct  61  L  61


 Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust.
 Identities = 28/29 (97%), Positives = 29/29 (100%), Gaps = 0/29 (0%)

Query  61   LASAPGPDIDGGIDLTDDEFQAFLQAARS  89
            LASAPGPDIDGGIDLTDDEFQAFL+AARS
Sbjct  257  LASAPGPDIDGGIDLTDDEFQAFLEAARS  285


>gi|145222998|ref|YP_001133676.1| hypothetical protein Mflv_2411 [Mycobacterium gilvum PYR-GCK]
 gi|145215484|gb|ABP44888.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=329

 Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust.
 Identities = 37/61 (61%), Positives = 51/61 (84%), Gaps = 0/61 (0%)

Query  1    MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPINTPGPGRTKQFMEELSQ  60
            M+K+IEL LVD +APSGEIA+KDL+ +A ALQEL TR+SR+  +  GPGR+KQ++EE ++
Sbjct  45   MIKQIELRLVDGSAPSGEIALKDLSGIAAALQELVTRLSREAADAAGPGRSKQYVEEFAE  104

Query  61   L  61
            L
Sbjct  105  L  105


 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust.
 Identities = 20/29 (69%), Positives = 26/29 (90%), Gaps = 0/29 (0%)

Query  61   LASAPGPDIDGGIDLTDDEFQAFLQAARS  89
            LASAPGPD +GG+DLT++E+ AFL+A RS
Sbjct  301  LASAPGPDPNGGLDLTEEEWAAFLEAIRS  329


>gi|108800739|ref|YP_640936.1| hypothetical protein Mmcs_3774 [Mycobacterium sp. MCS]
 gi|119869878|ref|YP_939830.1| hypothetical protein Mkms_3847 [Mycobacterium sp. KMS]
 gi|108771158|gb|ABG09880.1| hypothetical protein Mmcs_3774 [Mycobacterium sp. MCS]
 gi|119695967|gb|ABL93040.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=285

 Score = 81.6 bits (200), Expect = 4e-14, Method: Compositional matrix adjust.
 Identities = 36/61 (60%), Positives = 50/61 (82%), Gaps = 0/61 (0%)

Query  1   MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPINTPGPGRTKQFMEELSQ  60
           M+K+IEL LVD +APSGEI +KDL+ +A ALQEL TR+SR+  +  GPGR+KQ++EE ++
Sbjct  1   MIKQIELRLVDGSAPSGEITLKDLSGIAAALQELVTRLSREAADAAGPGRSKQYVEEFAE  60

Query  61  L  61
           L
Sbjct  61  L  61


 Score = 43.9 bits (102), Expect = 0.008, Method: Compositional matrix adjust.
 Identities = 21/29 (73%), Positives = 24/29 (83%), Gaps = 0/29 (0%)

Query  61   LASAPGPDIDGGIDLTDDEFQAFLQAARS  89
            LASAPGPD +GGIDLTD+E   FL+A RS
Sbjct  257  LASAPGPDPNGGIDLTDEEAAEFLRAIRS  285


>gi|319950294|ref|ZP_08024214.1| hypothetical protein ES5_11991 [Dietzia cinnamea P4]
 gi|319436050|gb|EFV91250.1| hypothetical protein ES5_11991 [Dietzia cinnamea P4]
Length=283

 Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust.
 Identities = 25/65 (39%), Positives = 37/65 (57%), Gaps = 0/65 (0%)

Query  2   MKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPINTPGPGRTKQFMEELSQL  61
           M+   + L+D  AP GEI   DL A+  A++ LT R++R+ +  PG GR    +E LS+
Sbjct  1   MRHYGIRLIDMDAPDGEIDTDDLVAIVAAMKRLTRRLTREVLEQPGQGRPSGMVESLSRS  60

Query  62  ASAPG  66
             A G
Sbjct  61  RVAMG  65


>gi|162454992|ref|YP_001617359.1| hypothetical protein sce6710 [Sorangium cellulosum 'So ce 56']
 gi|161165574|emb|CAN96879.1| putative exported protein [Sorangium cellulosum 'So ce 56']
Length=370

 Score = 37.7 bits (86), Expect = 0.58, Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 28/45 (63%), Gaps = 0/45 (0%)

Query  1    MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPINT  45
            +  E  LH+V A AP GE A + + A+A A  EL+  +SR P+++
Sbjct  164  LFVEPGLHMVRARAPGGEDAFQSVRAVAKAEHELSLHVSRAPVSS  208


>gi|345138891|dbj|BAK68500.1| hypothetical protein SLG_38250 [Sphingobium sp. SYK-6]
Length=184

 Score = 35.0 bits (79), Expect = 4.1, Method: Compositional matrix adjust.
 Identities = 20/68 (30%), Positives = 35/68 (52%), Gaps = 4/68 (5%)

Query  16  SGEIA---IKDLAALATALQELTTRISRDPINTPGPGRTKQFMEELSQLASAPGPDIDGG  72
           +GEIA   ++D+    T + E+ TR    P  +PG G  +  + ++++L    GPD  G
Sbjct  39  AGEIATQPVRDVGIQKTEIPEVLTRAVSQPYASPGSG-CRTLVAQIAELNDVLGPDFGGN  97

Query  73  IDLTDDEF  80
               +D+F
Sbjct  98  SKENEDKF  105



Lambda     K        H
   0.314    0.132    0.360

Gapped
Lambda     K        H
   0.267   0.0410    0.140

Effective search space used: 129638988780


  Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
    Posted date:  Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database: 15,229,318


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40