BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0471c Length=162 Score E Sequences producing significant alignments: (Bits) Value gi|15607612|ref|NP_214985.1| hypothetical protein Rv0471c [Mycob... 321 2e-86 gi|289441853|ref|ZP_06431597.1| hypothetical protein TBLG_02596 ... 320 5e-86 gi|289446014|ref|ZP_06435758.1| conserved hypothetical protein [... 319 1e-85 gi|306806281|ref|ZP_07442949.1| hypothetical protein TMGG_03480 ... 318 1e-85 gi|289568390|ref|ZP_06448617.1| hypothetical protein TBJG_03494 ... 270 4e-71 gi|320159719|ref|YP_004172943.1| putative prenyltransferase [Ana... 59.3 2e-07 gi|88706563|ref|ZP_01104267.1| UbiA prenyltransferase [Congregib... 55.8 2e-06 gi|88798571|ref|ZP_01114155.1| 1,4-dihydroxy-2-naphthoate octapr... 49.3 2e-04 gi|39997062|ref|NP_953013.1| hypothetical protein GSU1964 [Geoba... 40.4 0.081 gi|298251774|ref|ZP_06975577.1| ATPase associated with various c... 37.4 0.78 gi|254514686|ref|ZP_05126747.1| UbiA prenyltransferase [gamma pr... 36.6 1.3 gi|343921262|gb|EGV31983.1| 1,4-dihydroxy-2-naphthoate octapreny... 35.8 2.1 gi|327310902|ref|YP_004337799.1| prenyltransferase [Thermoproteu... 35.0 3.7 gi|312129981|ref|YP_003997321.1| 1,4-dihydroxy-2-naphtoate preny... 33.9 8.8 >gi|15607612|ref|NP_214985.1| hypothetical protein Rv0471c [Mycobacterium tuberculosis H37Rv] gi|15839860|ref|NP_334897.1| hypothetical protein MT0488 [Mycobacterium tuberculosis CDC1551] gi|31791651|ref|NP_854144.1| hypothetical protein Mb0481c [Mycobacterium bovis AF2122/97] 60 more sequence titlesLength=162 Score = 321 bits (823), Expect = 2e-86, Method: Compositional matrix adjust. Identities = 162/162 (100%), Positives = 162/162 (100%), Gaps = 0/162 (0%) Query 1 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE 60 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE Sbjct 1 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE 60 Query 61 PGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT 120 PGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT Sbjct 61 PGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT 120 Query 121 PRRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAPRC 162 PRRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAPRC Sbjct 121 PRRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAPRC 162 >gi|289441853|ref|ZP_06431597.1| hypothetical protein TBLG_02596 [Mycobacterium tuberculosis T46] gi|289748961|ref|ZP_06508339.1| hypothetical protein TBDG_01505 [Mycobacterium tuberculosis T92] gi|289414772|gb|EFD12012.1| hypothetical protein TBLG_02596 [Mycobacterium tuberculosis T46] gi|289689548|gb|EFD56977.1| hypothetical protein TBDG_01505 [Mycobacterium tuberculosis T92] Length=162 Score = 320 bits (820), Expect = 5e-86, Method: Compositional matrix adjust. Identities = 161/162 (99%), Positives = 162/162 (100%), Gaps = 0/162 (0%) Query 1 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE 60 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE Sbjct 1 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE 60 Query 61 PGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT 120 PGLDWRWLVLWW+SHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT Sbjct 61 PGLDWRWLVLWWKSHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT 120 Query 121 PRRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAPRC 162 PRRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAPRC Sbjct 121 PRRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAPRC 162 >gi|289446014|ref|ZP_06435758.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A] gi|289418972|gb|EFD16173.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A] Length=162 Score = 319 bits (817), Expect = 1e-85, Method: Compositional matrix adjust. Identities = 161/162 (99%), Positives = 161/162 (99%), Gaps = 0/162 (0%) Query 1 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE 60 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE Sbjct 1 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE 60 Query 61 PGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT 120 PGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT Sbjct 61 PGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT 120 Query 121 PRRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAPRC 162 PRRTTSCGSP RALEPTTPRWARAVGRSCWRRSPTGCCAPRC Sbjct 121 PRRTTSCGSPARALEPTTPRWARAVGRSCWRRSPTGCCAPRC 162 >gi|306806281|ref|ZP_07442949.1| hypothetical protein TMGG_03480 [Mycobacterium tuberculosis SUMu007] gi|306966477|ref|ZP_07479138.1| hypothetical protein TMIG_01365 [Mycobacterium tuberculosis SUMu009] gi|308347290|gb|EFP36141.1| hypothetical protein TMGG_03480 [Mycobacterium tuberculosis SUMu007] gi|308355873|gb|EFP44724.1| hypothetical protein TMIG_01365 [Mycobacterium tuberculosis SUMu009] Length=162 Score = 318 bits (816), Expect = 1e-85, Method: Compositional matrix adjust. Identities = 161/162 (99%), Positives = 161/162 (99%), Gaps = 0/162 (0%) Query 1 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE 60 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE Sbjct 1 MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGE 60 Query 61 PGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT 120 PGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT Sbjct 61 PGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTT 120 Query 121 PRRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAPRC 162 RRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAPRC Sbjct 121 LRRTTSCGSPERALEPTTPRWARAVGRSCWRRSPTGCCAPRC 162 >gi|289568390|ref|ZP_06448617.1| hypothetical protein TBJG_03494 [Mycobacterium tuberculosis T17] gi|289542143|gb|EFD45792.1| hypothetical protein TBJG_03494 [Mycobacterium tuberculosis T17] Length=136 Score = 270 bits (691), Expect = 4e-71, Method: Compositional matrix adjust. Identities = 135/136 (99%), Positives = 136/136 (100%), Gaps = 0/136 (0%) Query 27 PTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGEPGLDWRWLVLWWESHAPHIANNLMND 86 PTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGEPGLDWRWLVLWW+SHAPHIANNLMND Sbjct 1 PTDTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGEPGLDWRWLVLWWKSHAPHIANNLMND 60 Query 87 LYDTDVGTDSATYARARYAQHPAATGANRAAYTTPRRTTSCGSPERALEPTTPRWARAVG 146 LYDTDVGTDSATYARARYAQHPAATGANRAAYTTPRRTTSCGSPERALEPTTPRWARAVG Sbjct 61 LYDTDVGTDSATYARARYAQHPAATGANRAAYTTPRRTTSCGSPERALEPTTPRWARAVG 120 Query 147 RSCWRRSPTGCCAPRC 162 RSCWRRSPTGCCAPRC Sbjct 121 RSCWRRSPTGCCAPRC 136 >gi|320159719|ref|YP_004172943.1| putative prenyltransferase [Anaerolinea thermophila UNI-1] gi|319993572|dbj|BAJ62343.1| putative prenyltransferase [Anaerolinea thermophila UNI-1] Length=335 Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 35/85 (42%), Positives = 45/85 (53%), Gaps = 1/85 (1%) Query 29 DTVTRWLVVTRAAVLPMTLVSGLVAGLLAIGEPGLDWRWLVLWWESHA-PHIANNLMNDL 87 D ++RWL+ TRAAVL MT S + GLLA + + +L H NNL+NDL Sbjct 27 DVISRWLISTRAAVLIMTFTSATIGGLLAARDGKFNLVLWLLVALGLVLAHALNNLLNDL 86 Query 88 YDTDVGTDSATYARARYAQHPAATG 112 D + G D Y RA+Y HP G Sbjct 87 TDYERGIDENNYFRAQYGPHPLQQG 111 >gi|88706563|ref|ZP_01104267.1| UbiA prenyltransferase [Congregibacter litoralis KT71] gi|88699275|gb|EAQ96390.1| UbiA prenyltransferase [Congregibacter litoralis KT71] Length=329 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 35/85 (42%), Positives = 45/85 (53%), Gaps = 7/85 (8%) Query 32 TRWLVVTRAAVLPMTLVSGLVAGLLA----IGEPGLDWRWLVLWWESHAPHIANNLMNDL 87 RWL+ R++VL MTL+S + GLLA G+ GL WL+ H NNL+NDL Sbjct 29 VRWLLAVRSSVLFMTLMSATLGGLLAWREGAGDLGL---WLLCMLGLMLAHATNNLLNDL 85 Query 88 YDTDVGTDSATYARARYAQHPAATG 112 D+ G DS Y R +Y H G Sbjct 86 TDSARGVDSGNYYRNQYGIHVLEDG 110 >gi|88798571|ref|ZP_01114155.1| 1,4-dihydroxy-2-naphthoate octaprenyltransferase [Reinekea sp. MED297] gi|88778671|gb|EAR09862.1| 1,4-dihydroxy-2-naphthoate octaprenyltransferase [Reinekea sp. MED297] Length=333 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 31/83 (38%), Positives = 39/83 (47%), Gaps = 1/83 (1%) Query 31 VTRWLVVTRAAVLPMTLVSGLVAGLLAIGEPGLDWRWLVLWWESHA-PHIANNLMNDLYD 89 + RW + +R+AV MTL S L+ LLA+ DW VL H NNL+ND D Sbjct 32 LKRWFIASRSAVFIMTLFSALIGLLLAVPAATFDWLNAVLVTIGLVLAHATNNLINDWTD 91 Query 90 TDVGTDSATYARARYAQHPAATG 112 G D Y R +Y P G Sbjct 92 YRKGVDRDNYFRTQYGPQPLEAG 114 >gi|39997062|ref|NP_953013.1| hypothetical protein GSU1964 [Geobacter sulfurreducens PCA] gi|39983952|gb|AAR35340.1| hypothetical protein GSU1964 [Geobacter sulfurreducens PCA] Length=360 Score = 40.4 bits (93), Expect = 0.081, Method: Compositional matrix adjust. Identities = 18/44 (41%), Positives = 26/44 (60%), Gaps = 1/44 (2%) Query 71 WWESHAPHIANNL-MNDLYDTDVGTDSATYARARYAQHPAATGA 113 +++ H HIANN+ N L+ DVGTD+ R R +H A +G Sbjct 117 YYDIHMRHIANNVDHNRLFLIDVGTDTLEIGRQRIEEHNAGSGV 160 >gi|298251774|ref|ZP_06975577.1| ATPase associated with various cellular activities AAA_5 [Ktedonobacter racemifer DSM 44963] gi|297546366|gb|EFH80234.1| ATPase associated with various cellular activities AAA_5 [Ktedonobacter racemifer DSM 44963] Length=786 Score = 37.4 bits (85), Expect = 0.78, Method: Composition-based stats. Identities = 27/85 (32%), Positives = 38/85 (45%), Gaps = 9/85 (10%) Query 61 PGLD----WRWLVLWW----ESHAPHIANNLMNDLYDTDVGTDSATYARARYAQHPAATG 112 PGLD W +W E + AN L+ + +D+ TDS+T A R + + Sbjct 214 PGLDFIPYWSEFFFYWVLQQEQSSKAGANTLIKEDDPSDLLTDSSTQA-GRVQERSSDWS 272 Query 113 ANRAAYTTPRRTTSCGSPERALEPT 137 N+ YTTP T P L+PT Sbjct 273 GNKEGYTTPSFDTKLIIPNEPLKPT 297 >gi|254514686|ref|ZP_05126747.1| UbiA prenyltransferase [gamma proteobacterium NOR5-3] gi|219676929|gb|EED33294.1| UbiA prenyltransferase [gamma proteobacterium NOR5-3] Length=288 Score = 36.6 bits (83), Expect = 1.3, Method: Compositional matrix adjust. Identities = 15/35 (43%), Positives = 19/35 (55%), Gaps = 0/35 (0%) Query 78 HIANNLMNDLYDTDVGTDSATYARARYAQHPAATG 112 H NNL+NDL D+ G D+ Y R +Y H G Sbjct 35 HATNNLLNDLTDSSRGIDAGNYYRNQYGVHVLEDG 69 >gi|343921262|gb|EGV31983.1| 1,4-dihydroxy-2-naphthoate octaprenyltransferase [Thiorhodococcus drewsii AZ1] Length=306 Score = 35.8 bits (81), Expect = 2.1, Method: Compositional matrix adjust. Identities = 26/69 (38%), Positives = 38/69 (56%), Gaps = 4/69 (5%) Query 31 VTRWLVVTRAAVLPMTLVSGLVAGL-LAIGEPGLDWRWLVL--WWESHAPHIANNLMNDL 87 +TRW+ R LP+T V+ +VAG+ +A+ E G W+ L + A I NL ND Sbjct 20 LTRWIAAARPKTLPLT-VTPVVAGIAIAVAETGSLSLWIALCTLLGAVAIQIGTNLYNDA 78 Query 88 YDTDVGTDS 96 D + GTD+ Sbjct 79 SDFERGTDT 87 >gi|327310902|ref|YP_004337799.1| prenyltransferase [Thermoproteus uzoniensis 768-20] gi|326947381|gb|AEA12487.1| prenyltransferase [Thermoproteus uzoniensis 768-20] Length=294 Score = 35.0 bits (79), Expect = 3.7, Method: Compositional matrix adjust. Identities = 19/51 (38%), Positives = 26/51 (51%), Gaps = 5/51 (9%) Query 78 HIANNLMNDLYDTDVGTDSATYARARYAQHPAATGANRAAYTTPRRTTSCG 128 H A N++ND YDT G D+ T A Y HP +G +PR+ + G Sbjct 53 HAAVNVINDYYDTIRGVDTPTSPTALYRPHPLLSG-----LFSPRQALAVG 98 >gi|312129981|ref|YP_003997321.1| 1,4-dihydroxy-2-naphtoate prenyltransferase [Leadbetterella byssophila DSM 17132] gi|311906527|gb|ADQ16968.1| 1,4-dihydroxy-2-naphtoate prenyltransferase [Leadbetterella byssophila DSM 17132] Length=292 Score = 33.9 bits (76), Expect = 8.8, Method: Compositional matrix adjust. Identities = 22/70 (32%), Positives = 33/70 (48%), Gaps = 5/70 (7%) Query 31 VTRWLVVTRAAVLPMTLVSGLVAGLLAIGEPGLDWRWLVLWWE---SHAPHIANNLMNDL 87 + +W+ R LP+ L S L+ G LA + +RW V W + + +N ND Sbjct 1 MKKWISAARLRTLPLALSSILMGGFLA--QSVFMFRWDVFLWAVITTILLQVMSNFANDY 58 Query 88 YDTDVGTDSA 97 DT G DS+ Sbjct 59 GDTQNGADSS 68 Lambda K H 0.319 0.131 0.450 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 130518307686 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40