BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0100 Length=78 Score E Sequences producing significant alignments: (Bits) Value gi|254233495|ref|ZP_04926821.1| conserved hypothetical protein [... 155 2e-36 gi|15607242|ref|NP_214614.1| hypothetical protein Rv0100 [Mycoba... 154 4e-36 gi|340625136|ref|YP_004743588.1| hypothetical protein MCAN_01031... 152 1e-35 gi|183980289|ref|YP_001848580.1| hypothetical protein MMAR_0257 ... 113 8e-24 gi|118619897|ref|YP_908229.1| hypothetical protein MUL_4857 [Myc... 111 3e-23 gi|15828083|ref|NP_302346.1| hypothetical protein ML1995 [Mycoba... 106 1e-21 gi|317506389|ref|ZP_07964198.1| hypothetical protein HMPREF9336_... 92.0 2e-17 gi|169627765|ref|YP_001701414.1| hypothetical protein MAB_0662 [... 90.5 8e-17 gi|169629210|ref|YP_001702859.1| putative phenyloxazoline syntha... 36.6 1.3 gi|226306183|ref|YP_002766143.1| hypothetical protein RER_26960 ... 36.2 1.7 gi|158314872|ref|YP_001507380.1| phosphopantetheine-binding prot... 35.4 3.0 gi|298293285|ref|YP_003695224.1| phosphopantetheine-binding prot... 34.7 4.4 gi|117168626|gb|ABK32290.1| JerD [Sorangium cellulosum] 34.7 4.6 gi|345012844|ref|YP_004815198.1| beta-ketoacyl synthase [Strepto... 34.7 5.0 gi|229918248|ref|YP_002886894.1| beta-lactamase domain protein [... 34.3 6.1 gi|345011738|ref|YP_004814092.1| acyl transferase [Streptomyces ... 34.3 6.8 gi|20150009|gb|AAM12911.1| MmpII [Pseudomonas fluorescens] 34.3 7.0 gi|37521510|ref|NP_924887.1| polyketide synthase [Gloeobacter vi... 34.3 7.2 gi|308174884|ref|YP_003921589.1| isochorismatase [Bacillus amylo... 33.9 8.3 gi|328554838|gb|AEB25330.1| isochorismatase [Bacillus amylolique... 33.9 8.7 gi|305665878|ref|YP_003862165.1| putative thymidine kinase [Mari... 33.5 9.9 >gi|254233495|ref|ZP_04926821.1| conserved hypothetical protein [Mycobacterium tuberculosis C] gi|308231462|ref|ZP_07663864.1| putative phosphopantetheine attachment site [Mycobacterium tuberculosis SUMu001] gi|308372803|ref|ZP_07429942.2| hypothetical protein TMEG_00526 [Mycobacterium tuberculosis SUMu005] 11 more sequence titlesLength=82 Score = 155 bits (392), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 78/78 (100%), Positives = 78/78 (100%), Gaps = 0/78 (0%) Query 1 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN 60 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN Sbjct 5 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN 64 Query 61 PSIAGWLRELEAVCTEFG 78 PSIAGWLRELEAVCTEFG Sbjct 65 PSIAGWLRELEAVCTEFG 82 >gi|15607242|ref|NP_214614.1| hypothetical protein Rv0100 [Mycobacterium tuberculosis H37Rv] gi|15839480|ref|NP_334517.1| hypothetical protein MT0109 [Mycobacterium tuberculosis CDC1551] gi|31791278|ref|NP_853771.1| hypothetical protein Mb0103 [Mycobacterium bovis AF2122/97] 62 more sequence titles Length=78 Score = 154 bits (389), Expect = 4e-36, Method: Compositional matrix adjust. Identities = 77/78 (99%), Positives = 78/78 (100%), Gaps = 0/78 (0%) Query 1 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN 60 +RDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN Sbjct 1 MRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN 60 Query 61 PSIAGWLRELEAVCTEFG 78 PSIAGWLRELEAVCTEFG Sbjct 61 PSIAGWLRELEAVCTEFG 78 >gi|340625136|ref|YP_004743588.1| hypothetical protein MCAN_01031 [Mycobacterium canettii CIPT 140010059] gi|340003326|emb|CCC42444.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059] Length=78 Score = 152 bits (385), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 76/78 (98%), Positives = 77/78 (99%), Gaps = 0/78 (0%) Query 1 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN 60 +RDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN Sbjct 1 MRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN 60 Query 61 PSIAGWLRELEAVCTEFG 78 PSIAGWLRELEA CTEFG Sbjct 61 PSIAGWLRELEAACTEFG 78 >gi|183980289|ref|YP_001848580.1| hypothetical protein MMAR_0257 [Mycobacterium marinum M] gi|183173615|gb|ACC38725.1| conserved hypothetical protein [Mycobacterium marinum M] Length=83 Score = 113 bits (283), Expect = 8e-24, Method: Compositional matrix adjust. Identities = 58/70 (83%), Positives = 62/70 (89%), Gaps = 0/70 (0%) Query 1 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN 60 VR RIL AVCDVLYIDE DLIDGD TDLRDLGLDSVRFVLLMK+LGV+R+SELPSRLA N Sbjct 5 VRQRILLAVCDVLYIDETDLIDGDATDLRDLGLDSVRFVLLMKRLGVDRESELPSRLAEN 64 Query 61 PSIAGWLREL 70 SI GW+ EL Sbjct 65 LSIEGWVSEL 74 >gi|118619897|ref|YP_908229.1| hypothetical protein MUL_4857 [Mycobacterium ulcerans Agy99] gi|118572007|gb|ABL06758.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] Length=79 Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 56/70 (80%), Positives = 62/70 (89%), Gaps = 0/70 (0%) Query 1 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN 60 +R+RIL A CDVLYIDE DLIDGD TDLRDLGLDSVRFVLLMK+LGV+R+SELPSRLA N Sbjct 1 MRERILLAACDVLYIDETDLIDGDATDLRDLGLDSVRFVLLMKRLGVDRESELPSRLAEN 60 Query 61 PSIAGWLREL 70 SI GW+ EL Sbjct 61 LSIEGWVSEL 70 >gi|15828083|ref|NP_302346.1| hypothetical protein ML1995 [Mycobacterium leprae TN] gi|221230560|ref|YP_002503976.1| hypothetical protein MLBr_01995 [Mycobacterium leprae Br4923] gi|13093637|emb|CAC30950.1| conserved hypothetical protein [Mycobacterium leprae] gi|219933667|emb|CAR72092.1| conserved hypothetical protein [Mycobacterium leprae Br4923] Length=78 Score = 106 bits (264), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 49/71 (70%), Positives = 63/71 (89%), Gaps = 0/71 (0%) Query 2 RDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAANP 61 R+RIL AVC+VLYI E+DL+DGDETDLRDLGLDSVRF LLMKQLG+++++E+ S+L N Sbjct 6 RERILTAVCEVLYIAESDLVDGDETDLRDLGLDSVRFTLLMKQLGLSQEAEMQSKLMDNF 65 Query 62 SIAGWLRELEA 72 SIA W+R+LE+ Sbjct 66 SIANWVRQLES 76 >gi|317506389|ref|ZP_07964198.1| hypothetical protein HMPREF9336_00568 [Segniliparus rugosus ATCC BAA-974] gi|316255318|gb|EFV14579.1| hypothetical protein HMPREF9336_00568 [Segniliparus rugosus ATCC BAA-974] Length=79 Score = 92.0 bits (227), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 45/66 (69%), Positives = 58/66 (88%), Gaps = 0/66 (0%) Query 1 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN 60 +R+++LAA+C+VLYI E DL+DGD TDLRDLGLDSVRFVLLM++LGV+R S+L SRL + Sbjct 1 MREKVLAALCEVLYIAEDDLVDGDATDLRDLGLDSVRFVLLMQRLGVDRGSDLWSRLGES 60 Query 61 PSIAGW 66 S+AGW Sbjct 61 LSVAGW 66 >gi|169627765|ref|YP_001701414.1| hypothetical protein MAB_0662 [Mycobacterium abscessus ATCC 19977] gi|169239732|emb|CAM60760.1| Conserved hypothetical protein [Mycobacterium abscessus] Length=71 Score = 90.5 bits (223), Expect = 8e-17, Method: Compositional matrix adjust. Identities = 44/63 (70%), Positives = 52/63 (83%), Gaps = 0/63 (0%) Query 9 VCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAANPSIAGWLR 68 +C+VLYI EADL +GD TDLR+LGLDSVRFVLLMKQLGV R SEL RL ++ S+AGW + Sbjct 1 MCEVLYISEADLFEGDSTDLRELGLDSVRFVLLMKQLGVTRGSELQKRLVSDLSVAGWAQ 60 Query 69 ELE 71 LE Sbjct 61 ILE 63 >gi|169629210|ref|YP_001702859.1| putative phenyloxazoline synthase MbtB [Mycobacterium abscessus ATCC 19977] gi|169241177|emb|CAM62205.1| Putative phenyloxazoline synthase MbtB [Mycobacterium abscessus] Length=1155 Score = 36.6 bits (83), Expect = 1.3, Method: Compositional matrix adjust. Identities = 26/71 (37%), Positives = 35/71 (50%), Gaps = 2/71 (2%) Query 2 RDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAANP 61 RD I V D++ + D+ DGD DL LGLDS+R + L ++LAA P Sbjct 6 RDEIKLTVADLIGLTAQDISDGD--DLITLGLDSIRMMTLAGGWRKRGSRVTFAQLAAEP 63 Query 62 SIAGWLRELEA 72 S+ W L A Sbjct 64 SVDSWYALLRA 74 >gi|226306183|ref|YP_002766143.1| hypothetical protein RER_26960 [Rhodococcus erythropolis PR4] gi|229490894|ref|ZP_04384729.1| isochorismatase [Rhodococcus erythropolis SK121] gi|226185300|dbj|BAH33404.1| hypothetical protein RER_26960 [Rhodococcus erythropolis PR4] gi|229322284|gb|EEN88070.1| isochorismatase [Rhodococcus erythropolis SK121] Length=76 Score = 36.2 bits (82), Expect = 1.7, Method: Compositional matrix adjust. Identities = 26/70 (38%), Positives = 41/70 (59%), Gaps = 4/70 (5%) Query 2 RDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQL-GVNRQSELPSRLAAN 60 RD+I+ V +VL + L GD+ DL D GLDSVR + L+++ G ++ + LA Sbjct 7 RDQIVRDVAEVLSMRAEQL--GDDADLGDEGLDSVRLMTLVERWRGAGADADF-AELAEE 63 Query 61 PSIAGWLREL 70 P+I W++ L Sbjct 64 PTINAWVQVL 73 >gi|158314872|ref|YP_001507380.1| phosphopantetheine-binding protein [Frankia sp. EAN1pec] gi|158110277|gb|ABW12474.1| phosphopantetheine-binding [Frankia sp. EAN1pec] Length=84 Score = 35.4 bits (80), Expect = 3.0, Method: Compositional matrix adjust. Identities = 22/42 (53%), Positives = 31/42 (74%), Gaps = 2/42 (4%) Query 3 DRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQ 44 +R+LA V DVLYI EA+ +D D DLR G+DSVR + L+++ Sbjct 6 NRVLADVADVLYI-EANDVDQD-ADLRAQGMDSVRLMELVER 45 >gi|298293285|ref|YP_003695224.1| phosphopantetheine-binding protein [Starkeya novella DSM 506] gi|296929796|gb|ADH90605.1| phosphopantetheine-binding protein [Starkeya novella DSM 506] Length=95 Score = 34.7 bits (78), Expect = 4.4, Method: Compositional matrix adjust. Identities = 25/69 (37%), Positives = 41/69 (60%), Gaps = 8/69 (11%) Query 1 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVR---FVLLMKQLGVNRQSELPSRL 57 R+++ A + +L+ D + IDG+++ L DLGLDS+R VLL + G+ + S Sbjct 19 TREKMRADIAKMLHEDPEE-IDGEDS-LIDLGLDSMRAMNLVLLWSEGGLELEF---SEF 73 Query 58 AANPSIAGW 66 A NP++ GW Sbjct 74 AENPTLDGW 82 >gi|117168626|gb|ABK32290.1| JerD [Sorangium cellulosum] Length=1880 Score = 34.7 bits (78), Expect = 4.6, Method: Compositional matrix adjust. Identities = 26/74 (36%), Positives = 39/74 (53%), Gaps = 5/74 (6%) Query 9 VCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLA---ANP-SIA 64 V +VL + + D+ LRDLGLDS+ V L +LG ++ LP+ LA P ++A Sbjct 1730 VAEVLQLPGPAHVPADQ-PLRDLGLDSLMTVELRNRLGARAETTLPTTLAFDYPTPRALA 1788 Query 65 GWLRELEAVCTEFG 78 +L L + E G Sbjct 1789 SYLETLLGISDENG 1802 >gi|345012844|ref|YP_004815198.1| beta-ketoacyl synthase [Streptomyces violaceusniger Tu 4113] gi|344039193|gb|AEM84918.1| Beta-ketoacyl synthase [Streptomyces violaceusniger Tu 4113] Length=5479 Score = 34.7 bits (78), Expect = 5.0, Method: Compositional matrix adjust. Identities = 24/58 (42%), Positives = 33/58 (57%), Gaps = 1/58 (1%) Query 1 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLA 58 V D +L++ VL D+A IDG E +DLG DS+ V L +L +ELP+ LA Sbjct 3766 VLDVVLSSAAAVLGHDDASAIDG-ERAFQDLGFDSLNVVRLRNRLRELTGAELPTTLA 3822 >gi|229918248|ref|YP_002886894.1| beta-lactamase domain protein [Exiguobacterium sp. AT1b] gi|229469677|gb|ACQ71449.1| beta-lactamase domain protein [Exiguobacterium sp. AT1b] Length=466 Score = 34.3 bits (77), Expect = 6.1, Method: Composition-based stats. Identities = 15/42 (36%), Positives = 28/42 (67%), Gaps = 0/42 (0%) Query 11 DVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSE 52 D+ I AD +D +TDL+ +GLD +RF++ +++L + +E Sbjct 321 DITLIASADDVDQVQTDLQSIGLDRLRFIVPVEELDASLLTE 362 >gi|345011738|ref|YP_004814092.1| acyl transferase [Streptomyces violaceusniger Tu 4113] gi|344038087|gb|AEM83812.1| Acyl transferase [Streptomyces violaceusniger Tu 4113] Length=3456 Score = 34.3 bits (77), Expect = 6.8, Method: Compositional matrix adjust. Identities = 22/57 (39%), Positives = 28/57 (50%), Gaps = 1/57 (1%) Query 1 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRL 57 +RD + A DVL D AD + D RDLG DS+ V L ++G LP L Sbjct 1503 LRDLVRAQAADVLAHDSADAVASDRA-FRDLGFDSLTAVELRNRIGTATGLRLPVSL 1558 >gi|20150009|gb|AAM12911.1| MmpII [Pseudomonas fluorescens] Length=2076 Score = 34.3 bits (77), Expect = 7.0, Method: Compositional matrix adjust. Identities = 24/71 (34%), Positives = 37/71 (53%), Gaps = 3/71 (4%) Query 3 DRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSEL-PSRLAANP 61 D ++A VC +L D ID TDL D G DSV L+ Q+ EL P + + Sbjct 1544 DELVALVCQLLKTVAGD-ID-PHTDLHDFGFDSVLLTQLLAQISSTYGVELDPGSVLEDA 1601 Query 62 SIAGWLRELEA 72 ++AG + +++A Sbjct 1602 TVAGLVAQVQA 1612 >gi|37521510|ref|NP_924887.1| polyketide synthase [Gloeobacter violaceus PCC 7421] gi|35212507|dbj|BAC89882.1| gll1941 [Gloeobacter violaceus PCC 7421] Length=963 Score = 34.3 bits (77), Expect = 7.2, Method: Compositional matrix adjust. Identities = 28/67 (42%), Positives = 34/67 (51%), Gaps = 6/67 (8%) Query 2 RDRILA----AVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSEL-PSR 56 RDR+ A V DVL + A + E L DLGLDS+ V L + Q L P+R Sbjct 626 RDRLAAYVQKQVMDVLKLHSAQTFE-PERSLSDLGLDSLMLVELRDHFTRDLQVTLSPAR 684 Query 57 LAANPSI 63 L NPSI Sbjct 685 LLRNPSI 691 >gi|308174884|ref|YP_003921589.1| isochorismatase [Bacillus amyloliquefaciens DSM 7] gi|307607748|emb|CBI44119.1| isochorismatase [Bacillus amyloliquefaciens DSM 7] gi|328913200|gb|AEB64796.1| isochorismatase [Bacillus amyloliquefaciens LL3] Length=306 Score = 33.9 bits (76), Expect = 8.3, Method: Compositional matrix adjust. Identities = 22/72 (31%), Positives = 38/72 (53%), Gaps = 2/72 (2%) Query 1 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN 60 ++R+ + +L +D++D + DL D GLDSVR + L++Q + LA N Sbjct 225 TKERMREQIAAILQESPSDILD--QEDLLDRGLDSVRIMSLVEQWRRDGAEVTFVELAEN 282 Query 61 PSIAGWLRELEA 72 P++ W R L + Sbjct 283 PTLEEWWRLLSS 294 >gi|328554838|gb|AEB25330.1| isochorismatase [Bacillus amyloliquefaciens TA208] gi|341829114|gb|AEK90365.1| isochorismatase [Bacillus amyloliquefaciens XH7] Length=306 Score = 33.9 bits (76), Expect = 8.7, Method: Compositional matrix adjust. Identities = 22/72 (31%), Positives = 38/72 (53%), Gaps = 2/72 (2%) Query 1 VRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAAN 60 ++R+ + +L +D++D + DL D GLDSVR + L++Q + LA N Sbjct 225 TKERMREQIAAILQESPSDILD--QEDLLDRGLDSVRIMSLVEQWRRDGAEVTFVELAEN 282 Query 61 PSIAGWLRELEA 72 P++ W R L + Sbjct 283 PTLEEWWRLLSS 294 >gi|305665878|ref|YP_003862165.1| putative thymidine kinase [Maribacter sp. HTCC2170] gi|88710653|gb|EAR02885.1| putative thymidine kinase [Maribacter sp. HTCC2170] Length=214 Score = 33.5 bits (75), Expect = 9.9, Method: Compositional matrix adjust. Identities = 29/79 (37%), Positives = 39/79 (50%), Gaps = 9/79 (11%) Query 4 RILAAVCDVLYIDEADLIDGDE--TDLRDLGLDSVRFVL--LMKQLGVNRQSELPSRLAA 59 RILA CDV+ IDEA D DE T DL VR ++ L N +P+ +A Sbjct 81 RILADTCDVIGIDEAQFFD-DEIVTVCNDLANRGVRVIVAGLDMDFKGNPFGPMPALMAT 139 Query 60 NPSIAGWLRELEAVCTEFG 78 A ++ ++ AVCT G Sbjct 140 ----AEYVTKVHAVCTRTG 154 Lambda K H 0.323 0.141 0.411 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 129734191374 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40