BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2526 Length=75 Score E Sequences producing significant alignments: (Bits) Value gi|15609663|ref|NP_217042.1| hypothetical protein Rv2526 [Mycoba... 141 3e-32 gi|289762693|ref|ZP_06522071.1| LOW QUALITY PROTEIN: conserved h... 139 1e-31 gi|313659459|ref|ZP_07816339.1| antitoxin [Mycobacterium tubercu... 112 1e-23 gi|294994366|ref|ZP_06800057.1| hypothetical protein Mtub2_07578... 94.7 4e-18 gi|326904140|gb|EGE51073.1| antitoxin [Mycobacterium tuberculosi... 94.7 4e-18 gi|333992735|ref|YP_004525349.1| hypothetical protein JDM601_409... 94.0 7e-18 gi|336120114|ref|YP_004574892.1| hypothetical protein MLP_44750 ... 63.9 8e-09 gi|298531197|ref|ZP_07018598.1| Protein of unknown function DUF2... 35.0 3.7 gi|163741026|ref|ZP_02148419.1| penicillin-binding protein 1A [P... 35.0 4.0 gi|163739754|ref|ZP_02147162.1| Penicillin-binding protein 1A [P... 34.3 6.9 gi|335034130|ref|ZP_08527491.1| hypothetical protein AGRO_1470 [... 33.9 7.6 gi|159184572|ref|NP_354029.2| hypothetical protein Atu1005 [Agro... 33.5 9.9 >gi|15609663|ref|NP_217042.1| hypothetical protein Rv2526 [Mycobacterium tuberculosis H37Rv] gi|15842060|ref|NP_337097.1| hypothetical protein MT2601.2 [Mycobacterium tuberculosis CDC1551] gi|31793707|ref|NP_856200.1| hypothetical protein Mb2555 [Mycobacterium bovis AF2122/97] 61 more sequence titlesLength=75 Score = 141 bits (356), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 75/75 (100%), Positives = 75/75 (100%), Gaps = 0/75 (0%) Query 1 MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAG 60 MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAG Sbjct 1 MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAG 60 Query 61 THVDADVLLSEQAWR 75 THVDADVLLSEQAWR Sbjct 61 THVDADVLLSEQAWR 75 >gi|289762693|ref|ZP_06522071.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|289710199|gb|EFD74215.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] Length=77 Score = 139 bits (351), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 74/74 (100%), Positives = 74/74 (100%), Gaps = 0/74 (0%) Query 1 MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAG 60 MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAG Sbjct 1 MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAG 60 Query 61 THVDADVLLSEQAW 74 THVDADVLLSEQAW Sbjct 61 THVDADVLLSEQAW 74 >gi|313659459|ref|ZP_07816339.1| antitoxin [Mycobacterium tuberculosis KZN V2475] Length=61 Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 60/61 (99%), Positives = 61/61 (100%), Gaps = 0/61 (0%) Query 15 VRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVLLSEQAW 74 +RAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVLLSEQAW Sbjct 1 MRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVLLSEQAW 60 Query 75 R 75 R Sbjct 61 R 61 >gi|294994366|ref|ZP_06800057.1| hypothetical protein Mtub2_07578 [Mycobacterium tuberculosis 210] Length=66 Score = 94.7 bits (234), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 50/61 (82%), Positives = 51/61 (84%), Gaps = 0/61 (0%) Query 15 VRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVLLSEQAW 74 + Q RATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVLLSEQAW Sbjct 6 ILCGQPRPSPGTRATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVLLSEQAW 65 Query 75 R 75 R Sbjct 66 R 66 >gi|326904140|gb|EGE51073.1| antitoxin [Mycobacterium tuberculosis W-148] Length=60 Score = 94.7 bits (234), Expect = 4e-18, Method: Compositional matrix adjust. Identities = 50/59 (85%), Positives = 50/59 (85%), Gaps = 0/59 (0%) Query 17 AAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVLLSEQAWR 75 Q RATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVLLSEQAWR Sbjct 2 CGQPRPSPGTRATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVLLSEQAWR 60 >gi|333992735|ref|YP_004525349.1| hypothetical protein JDM601_4095 [Mycobacterium sp. JDM601] gi|333488703|gb|AEF38095.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=73 Score = 94.0 bits (232), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 48/73 (66%), Positives = 58/73 (80%), Gaps = 0/73 (0%) Query 3 VKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTH 62 +KRTTIELDE+LVR AQ+VTG TLR+TVE AL++L+A A + RRRRI +HLA AG Sbjct 1 MKRTTIELDEELVRKAQSVTGSTLRSTVESALRRLIAEAQGEDDERRRRITEHLASAGKQ 60 Query 63 VDADVLLSEQAWR 75 VD D LLS+QAWR Sbjct 61 VDTDALLSDQAWR 73 >gi|336120114|ref|YP_004574892.1| hypothetical protein MLP_44750 [Microlunatus phosphovorus NM-1] gi|334687904|dbj|BAK37489.1| hypothetical protein MLP_44750 [Microlunatus phosphovorus NM-1] Length=75 Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust. Identities = 40/75 (54%), Positives = 47/75 (63%), Gaps = 0/75 (0%) Query 1 MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAG 60 M VKRTTIELD L A VTG TLRATVE+ L+ L+A + + R HLA AG Sbjct 1 MAVKRTTIELDVTLAAQAVDVTGSTLRATVEQGLRLLIAQSRLEEQHRLDLFNRHLAGAG 60 Query 61 THVDADVLLSEQAWR 75 VD +VLLS +AWR Sbjct 61 ESVDLEVLLSGEAWR 75 >gi|298531197|ref|ZP_07018598.1| Protein of unknown function DUF2191 [Desulfonatronospira thiodismutans ASO3-1] gi|298509220|gb|EFI33125.1| Protein of unknown function DUF2191 [Desulfonatronospira thiodismutans ASO3-1] Length=66 Score = 35.0 bits (79), Expect = 3.7, Method: Compositional matrix adjust. Identities = 19/36 (53%), Positives = 26/36 (73%), Gaps = 1/36 (2%) Query 3 VKRTTIELDEDLVRAAQAVTG-ETLRATVERALQQL 37 +KRT + LDE+LVR Q +TG T RA ++ ALQ+L Sbjct 1 MKRTNVVLDEELVRECQVLTGIRTRRALIDYALQEL 36 >gi|163741026|ref|ZP_02148419.1| penicillin-binding protein 1A [Phaeobacter gallaeciensis 2.10] gi|161386017|gb|EDQ10393.1| penicillin-binding protein 1A [Phaeobacter gallaeciensis 2.10] Length=833 Score = 35.0 bits (79), Expect = 4.0, Method: Composition-based stats. Identities = 20/67 (30%), Positives = 32/67 (48%), Gaps = 0/67 (0%) Query 9 ELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVL 68 +L ED G T+RAT++ +Q A+A + R + G +DA+ L Sbjct 282 QLSEDFGEGEFFTGGFTVRATIDEEMQTEAASALRGGLEKYDRSRGNWRGTGVTLDAEAL 341 Query 69 LSEQAWR 75 SE++WR Sbjct 342 TSEESWR 348 >gi|163739754|ref|ZP_02147162.1| Penicillin-binding protein 1A [Phaeobacter gallaeciensis BS107] gi|161386984|gb|EDQ11345.1| Penicillin-binding protein 1A [Phaeobacter gallaeciensis BS107] Length=727 Score = 34.3 bits (77), Expect = 6.9, Method: Compositional matrix adjust. Identities = 20/67 (30%), Positives = 31/67 (47%), Gaps = 0/67 (0%) Query 9 ELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVL 68 +L ED G T+RAT++ +Q A A + R + G +DA+ L Sbjct 282 QLSEDFGEGEFFTGGFTVRATIDEEMQTEAAVALRGGLEKYDRSRGNWRGTGVTLDAEAL 341 Query 69 LSEQAWR 75 SE++WR Sbjct 342 ASEESWR 348 >gi|335034130|ref|ZP_08527491.1| hypothetical protein AGRO_1470 [Agrobacterium sp. ATCC 31749] gi|333794448|gb|EGL65784.1| hypothetical protein AGRO_1470 [Agrobacterium sp. ATCC 31749] Length=89 Score = 33.9 bits (76), Expect = 7.6, Method: Compositional matrix adjust. Identities = 19/35 (55%), Positives = 25/35 (72%), Gaps = 1/35 (2%) Query 5 RTTIELDEDLVRAAQAVTG-ETLRATVERALQQLV 38 RT IELD+ L+ A +TG T +ATVE+AL+ LV Sbjct 15 RTNIELDDALIAEAMEITGLSTKKATVEKALRDLV 49 >gi|159184572|ref|NP_354029.2| hypothetical protein Atu1005 [Agrobacterium tumefaciens str. C58] gi|159139876|gb|AAK86814.2| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length=76 Score = 33.5 bits (75), Expect = 9.9, Method: Compositional matrix adjust. Identities = 19/35 (55%), Positives = 25/35 (72%), Gaps = 1/35 (2%) Query 5 RTTIELDEDLVRAAQAVTG-ETLRATVERALQQLV 38 RT IELD+ L+ A +TG T +ATVE+AL+ LV Sbjct 2 RTNIELDDALIAEAMEITGLPTKKATVEKALRDLV 36 Lambda K H 0.319 0.125 0.335 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 131059142040 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40