BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1957 Length=181 Score E Sequences producing significant alignments: (Bits) Value gi|15609094|ref|NP_216473.1| hypothetical protein Rv1957 [Mycoba... 363 7e-99 gi|15841427|ref|NP_336464.1| hypothetical protein MT2006 [Mycoba... 362 1e-98 gi|341601889|emb|CCC64563.1| hypothetical protein BCGM_1970 [Myc... 361 2e-98 gi|306803674|ref|ZP_07440342.1| hypothetical protein TMHG_01134 ... 361 2e-98 gi|167970526|ref|ZP_02552803.1| hypothetical protein MtubH3_2183... 254 3e-66 gi|297625508|ref|YP_003687271.1| hypothetical protein PFREUD_029... 84.7 5e-15 gi|167970527|ref|ZP_02552804.1| hypothetical protein MtubH3_2183... 58.9 3e-07 gi|328953932|ref|YP_004371266.1| hypothetical protein Desac_2261... 47.4 9e-04 gi|297201292|ref|ZP_06918689.1| conserved hypothetical protein [... 41.6 0.055 gi|336120578|ref|YP_004575364.1| hypothetical protein MLP_49470 ... 41.2 0.068 gi|292493495|ref|YP_003528934.1| hypothetical protein Nhal_3523 ... 39.7 0.19 gi|340781887|ref|YP_004748494.1| hypothetical protein Atc_1145 [... 38.9 0.32 gi|47568159|ref|ZP_00238863.1| 5'-nucleotidase [Bacillus cereus ... 38.1 0.49 gi|229061774|ref|ZP_04199107.1| 5'-Nucleotidase domain protein [... 38.1 0.60 gi|336237046|ref|YP_004589662.1| hypothetical protein Geoth_3764... 37.7 0.73 gi|344225576|gb|EGV51930.1| hypothetical protein Rifp1Sym_au0010... 37.4 1.0 gi|308274082|emb|CBX30681.1| hypothetical protein N47_E41930 [un... 37.0 1.4 gi|126724533|ref|ZP_01740376.1| protein-export protein SecB [Rho... 35.8 2.6 gi|326335748|ref|ZP_08201932.1| hypothetical protein HMPREF9071_... 35.8 2.7 gi|110680199|ref|YP_683206.1| hypothetical protein RD1_3001 [Ros... 35.0 4.3 gi|85857881|ref|YP_460083.1| protein translocase subunit [Syntro... 35.0 4.8 gi|302534223|ref|ZP_07286565.1| predicted protein [Streptomyces ... 34.7 5.8 >gi|15609094|ref|NP_216473.1| hypothetical protein Rv1957 [Mycobacterium tuberculosis H37Rv] gi|31793149|ref|NP_855642.1| hypothetical protein Mb1992 [Mycobacterium bovis AF2122/97] gi|121637862|ref|YP_978085.1| hypothetical protein BCG_1996 [Mycobacterium bovis BCG str. Pasteur 1173P2] 68 more sequence titlesLength=181 Score = 363 bits (931), Expect = 7e-99, Method: Compositional matrix adjust. Identities = 181/181 (100%), Positives = 181/181 (100%), Gaps = 0/181 (0%) Query 1 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD 60 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD Sbjct 1 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD 60 Query 61 PATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP 120 PATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP Sbjct 61 PATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP 120 Query 121 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPATRGT 180 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPATRGT Sbjct 121 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPATRGT 180 Query 181 P 181 P Sbjct 181 P 181 >gi|15841427|ref|NP_336464.1| hypothetical protein MT2006 [Mycobacterium tuberculosis CDC1551] gi|254232127|ref|ZP_04925454.1| hypothetical protein TBCG_01905 [Mycobacterium tuberculosis C] gi|254364774|ref|ZP_04980820.1| hypothetical protein TBHG_01911 [Mycobacterium tuberculosis str. Haarlem] gi|13881665|gb|AAK46278.1| hypothetical protein MT2006 [Mycobacterium tuberculosis CDC1551] gi|124601186|gb|EAY60196.1| hypothetical protein TBCG_01905 [Mycobacterium tuberculosis C] gi|134150288|gb|EBA42333.1| hypothetical protein TBHG_01911 [Mycobacterium tuberculosis str. Haarlem] gi|323719521|gb|EGB28647.1| hypothetical protein TMMG_01217 [Mycobacterium tuberculosis CDC1551A] Length=181 Score = 362 bits (929), Expect = 1e-98, Method: Compositional matrix adjust. Identities = 180/181 (99%), Positives = 181/181 (100%), Gaps = 0/181 (0%) Query 1 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD 60 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD Sbjct 1 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD 60 Query 61 PATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP 120 PATISAFVVRISCHLRIQNQAAD+DVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP Sbjct 61 PATISAFVVRISCHLRIQNQAADNDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP 120 Query 121 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPATRGT 180 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPATRGT Sbjct 121 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPATRGT 180 Query 181 P 181 P Sbjct 181 P 181 >gi|341601889|emb|CCC64563.1| hypothetical protein BCGM_1970 [Mycobacterium bovis BCG str. Moreau RDJ] Length=181 Score = 361 bits (927), Expect = 2e-98, Method: Compositional matrix adjust. Identities = 180/181 (99%), Positives = 180/181 (99%), Gaps = 0/181 (0%) Query 1 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD 60 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD Sbjct 1 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD 60 Query 61 PATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP 120 P TISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP Sbjct 61 PGTISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP 120 Query 121 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPATRGT 180 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPATRGT Sbjct 121 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPATRGT 180 Query 181 P 181 P Sbjct 181 P 181 >gi|306803674|ref|ZP_07440342.1| hypothetical protein TMHG_01134 [Mycobacterium tuberculosis SUMu008] gi|308349677|gb|EFP38528.1| hypothetical protein TMHG_01134 [Mycobacterium tuberculosis SUMu008] Length=181 Score = 361 bits (927), Expect = 2e-98, Method: Compositional matrix adjust. Identities = 180/181 (99%), Positives = 180/181 (99%), Gaps = 0/181 (0%) Query 1 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD 60 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD Sbjct 1 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDAD 60 Query 61 PATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP 120 PATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP Sbjct 61 PATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP 120 Query 121 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPATRGT 180 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPG QWPATRGT Sbjct 121 TEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGTQWPATRGT 180 Query 181 P 181 P Sbjct 181 P 181 >gi|167970526|ref|ZP_02552803.1| hypothetical protein MtubH3_21833 [Mycobacterium tuberculosis H37Ra] Length=125 Score = 254 bits (650), Expect = 3e-66, Method: Compositional matrix adjust. Identities = 124/125 (99%), Positives = 125/125 (100%), Gaps = 0/125 (0%) Query 57 VDADPATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEG 116 +DADPATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEG Sbjct 1 MDADPATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEG 60 Query 117 EDDPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPA 176 EDDPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPA Sbjct 61 EDDPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQWPA 120 Query 177 TRGTP 181 TRGTP Sbjct 121 TRGTP 125 >gi|297625508|ref|YP_003687271.1| hypothetical protein PFREUD_02960 [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921273|emb|CBL55825.1| Hypothetical protein PFREUD_02960 [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length=159 Score = 84.7 bits (208), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 59/156 (38%), Positives = 83/156 (54%), Gaps = 15/156 (9%) Query 9 DLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLE--FEPAVDADPATISA 66 ++DL+ AR+A +A +RDIR A V P+P L YDL+ FE A+ D ++ Sbjct 3 EMDLRMKAARVAGQADLRDIRTAALHAEVDFPPQPGSNLGYDLDSNFEFALPQDEGDLTV 62 Query 67 FVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDPTEEELT 126 + + + L VKEGD K E A F AL++ +G D + EEL Sbjct 63 VMGQYTASL---------SVKEGDEKKE---FARLGFTLMALYEVGTPQG-DAFSHEELE 109 Query 127 AYAATTGRFALYPYIREYVYDLTGRLALPPLTLEIL 162 A+ T+G+FALYPY RE + LT RL +P LTL +L Sbjct 110 AFVRTSGQFALYPYARETMSMLTTRLGVPNLTLPVL 145 >gi|167970527|ref|ZP_02552804.1| hypothetical protein MtubH3_21838 [Mycobacterium tuberculosis H37Ra] Length=47 Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 33/41 (81%), Positives = 33/41 (81%), Gaps = 5/41 (12%) Query 1 MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAP 41 MTDRTDADDLDLQRVGARLAARAQIRDIR AA H P Sbjct 1 MTDRTDADDLDLQRVGARLAARAQIRDIR-----AAAHSGP 36 >gi|328953932|ref|YP_004371266.1| hypothetical protein Desac_2261 [Desulfobacca acetoxidans DSM 11109] gi|328454256|gb|AEB10085.1| hypothetical protein Desac_2261 [Desulfobacca acetoxidans DSM 11109] Length=181 Score = 47.4 bits (111), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 40/156 (26%), Positives = 71/156 (46%), Gaps = 19/156 (12%) Query 14 RVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDADP-ATISAFVVRIS 72 +V R+A +A + DI L+ + + +G T L F D P +I V Sbjct 19 KVIQRVAEKASLEDIFLIDAEIKSDPVHRDGRGATLRLVF--GSDIRPKGSIDKLAVL-- 74 Query 73 CHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDPTEEELTAYAATT 132 C+ + VKEGD +D + ++ F+ + H D + ++ +A Sbjct 75 CNFLVSA------VKEGDKEDFSLEIKAT---FSVNYKIH---SPDTFSASDIEMFAKIN 122 Query 133 GRFALYPYIREYVYDLTGRLALPPLTLEI--LSRPM 166 + +PY RE+V ++T R+ LP LT+ + +S+PM Sbjct 123 PIYNCWPYWREFVQNITARMGLPTLTIPLFKISKPM 158 >gi|297201292|ref|ZP_06918689.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] gi|197712846|gb|EDY56880.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083] Length=152 Score = 41.6 bits (96), Expect = 0.055, Method: Compositional matrix adjust. Identities = 19/58 (33%), Positives = 31/58 (54%), Gaps = 1/58 (1%) Query 108 LFDYHLQEGEDDPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRP 165 L L EG D+ +E ++ YPY+R++++DL GR+ L TL +L +P Sbjct 81 LLTSFLFEG-DEVSESVYESFGTNIAVMTAYPYLRQHIHDLAGRIGLANFTLGLLKQP 137 >gi|336120578|ref|YP_004575364.1| hypothetical protein MLP_49470 [Microlunatus phosphovorus NM-1] gi|334688376|dbj|BAK37961.1| hypothetical protein MLP_49470 [Microlunatus phosphovorus NM-1] Length=122 Score = 41.2 bits (95), Expect = 0.068, Method: Compositional matrix adjust. Identities = 32/88 (37%), Positives = 42/88 (48%), Gaps = 12/88 (13%) Query 67 FVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDP---TEE 123 FV+ +RI + DD G+ KD +A AL Y L +GE E Sbjct 21 FVIDGEYEVRIFQELEDDG---GNRKD----IAEVSLNVGAL--YELPDGETGAGTYEEA 71 Query 124 ELTAYAATTGRFALYPYIREYVYDLTGR 151 E+ A+ TT R ALYPY+R V D+T R Sbjct 72 EVAAFTHTTARLALYPYVRALVADMTVR 99 >gi|292493495|ref|YP_003528934.1| hypothetical protein Nhal_3523 [Nitrosococcus halophilus Nc4] gi|291582090|gb|ADE16547.1| conserved hypothetical protein [Nitrosococcus halophilus Nc4] Length=161 Score = 39.7 bits (91), Expect = 0.19, Method: Compositional matrix adjust. Identities = 39/147 (27%), Positives = 63/147 (43%), Gaps = 21/147 (14%) Query 24 QIRDIRLLRTQAAVHRAPKPAQGLTYD---LEFEPAV--------DADPATISAFVVRIS 72 +IRD+ L ++A++ A +P D ++F+ V D TI+ F V + Sbjct 13 KIRDVYLHSSRASLEDAFEPKYDSDLDKLEVQFKHVVTRSSVLELDEGNRTINLFRVFVE 72 Query 73 CHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDPTEEELTAYAATT 132 R + K GD E + A E + +Y + DDP E L +A Sbjct 73 LGTR---WIISGERKNGDKALEVK----AYIEGTMVAEYLML---DDPGPEALNQFAMKN 122 Query 133 GRFALYPYIREYVYDLTGRLALPPLTL 159 F ++PY REY+ + R+ LP L + Sbjct 123 ASFHIWPYWREYLTSQSVRMNLPKLVM 149 >gi|340781887|ref|YP_004748494.1| hypothetical protein Atc_1145 [Acidithiobacillus caldus SM-1] gi|340556040|gb|AEK57794.1| conserved hypothetical protein [Acidithiobacillus caldus SM-1] Length=150 Score = 38.9 bits (89), Expect = 0.32, Method: Compositional matrix adjust. Identities = 21/60 (35%), Positives = 33/60 (55%), Gaps = 9/60 (15%) Query 119 DPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEILSRPMPVSPGAQ---WP 175 +P + + +A + ++PY REYV +L+ R++LP LTL L +PGA WP Sbjct 93 EPDKATIDNFALNVTPYNVWPYFREYVANLSARMSLPRLTLPAL------TPGANTPTWP 146 >gi|47568159|ref|ZP_00238863.1| 5'-nucleotidase [Bacillus cereus G9241] gi|47555149|gb|EAL13496.1| 5'-nucleotidase [Bacillus cereus G9241] Length=529 Score = 38.1 bits (87), Expect = 0.49, Method: Compositional matrix adjust. Identities = 20/64 (32%), Positives = 33/64 (52%), Gaps = 3/64 (4%) Query 103 FEFAALFDYHLQEGEDDPTEEELTAYAATTGRF--ALYPYIREYVYDL-TGRLALPPLTL 159 F+ + ++ EG D+ Y TG F A +PY+ Y+ TGRL LPP T+ Sbjct 114 FDVGTIGNHEFDEGIDEMNRLIYGGYHEKTGNFKGANFPYVAANFYNKSTGRLFLPPFTV 173 Query 160 EILS 163 ++++ Sbjct 174 KMVN 177 >gi|229061774|ref|ZP_04199107.1| 5'-Nucleotidase domain protein [Bacillus cereus AH603] gi|228717520|gb|EEL69184.1| 5'-Nucleotidase domain protein [Bacillus cereus AH603] Length=490 Score = 38.1 bits (87), Expect = 0.60, Method: Compositional matrix adjust. Identities = 20/63 (32%), Positives = 33/63 (53%), Gaps = 3/63 (4%) Query 103 FEFAALFDYHLQEGEDDPTEEELTAYAATTGRF--ALYPYIREYVYDL-TGRLALPPLTL 159 F+ + ++ EG D+ Y TG+F A +PY+ Y+ TGRL LPP T+ Sbjct 75 FDVGTIGNHEFDEGIDEMQRLIYGGYHEKTGKFKGANFPYVAANFYNKSTGRLFLPPFTV 134 Query 160 EIL 162 +++ Sbjct 135 KMV 137 >gi|336237046|ref|YP_004589662.1| hypothetical protein Geoth_3764 [Geobacillus thermoglucosidasius C56-YS93] gi|335363901|gb|AEH49581.1| hypothetical protein Geoth_3764 [Geobacillus thermoglucosidasius C56-YS93] Length=152 Score = 37.7 bits (86), Expect = 0.73, Method: Compositional matrix adjust. Identities = 19/63 (31%), Positives = 31/63 (50%), Gaps = 5/63 (7%) Query 102 DFEFAALFDYHLQEGED-----DPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPP 156 D EF+ + +Y LQ+ D + EE + + ++PY RE + LT R+ PP Sbjct 76 DIEFSYILEYRLQKSNDIHLEKEGLEEAIKLFVQRNVPVNIWPYARELISQLTMRMGFPP 135 Query 157 LTL 159 L + Sbjct 136 LLI 138 >gi|344225576|gb|EGV51930.1| hypothetical protein Rifp1Sym_au00100 [endosymbiont of Riftia pachyptila (vent Ph05)] Length=178 Score = 37.4 bits (85), Expect = 1.0, Method: Compositional matrix adjust. Identities = 22/71 (31%), Positives = 37/71 (53%), Gaps = 4/71 (5%) Query 89 GDTKDETQDVATADFEFAALFDYHLQEGEDDPTEEELTAYAATTGRFALYPYIREYVYDL 148 D D+ DV A E L +Y + E++P ++ L A+A + ++PY REY+ + Sbjct 94 SDESDDEADV-KAVIEATFLAEYLM---ENEPGQDALDAFALKNASYHVWPYWREYLMNQ 149 Query 149 TGRLALPPLTL 159 R+ LP + L Sbjct 150 CMRMNLPKIAL 160 >gi|308274082|emb|CBX30681.1| hypothetical protein N47_E41930 [uncultured Desulfobacterium sp.] Length=149 Score = 37.0 bits (84), Expect = 1.4, Method: Compositional matrix adjust. Identities = 17/43 (40%), Positives = 25/43 (59%), Gaps = 0/43 (0%) Query 117 EDDPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTL 159 E+ P++EE+ A ++ Y+RE V DLT R +PPL L Sbjct 77 EEIPSKEEVERIAHINCASIIFAYVRESVADLTRRAGMPPLNL 119 >gi|126724533|ref|ZP_01740376.1| protein-export protein SecB [Rhodobacterales bacterium HTCC2150] gi|126705697|gb|EBA04787.1| protein-export protein SecB [Rhodobacterales bacterium HTCC2150] Length=163 Score = 35.8 bits (81), Expect = 2.6, Method: Compositional matrix adjust. Identities = 29/113 (26%), Positives = 51/113 (46%), Gaps = 18/113 (15%) Query 50 DLEFEPAVDADPATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALF 109 D+ + +DA + + + V I C + +N+A DE Q + + ++A +F Sbjct 47 DINVQVNLDAKKRSETQYEVSIKCVINSKNKA-----------DEAQ-IFLLEIDYAGVF 94 Query 110 DYHLQEGEDDPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEIL 162 E+ P EE+L + L+P++R V D+T PPL LE + Sbjct 95 QI-----ENVP-EEQLHPFLLIECPRMLFPFLRRIVSDVTRDGGFPPLNLETI 141 >gi|326335748|ref|ZP_08201932.1| hypothetical protein HMPREF9071_1398 [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325692091|gb|EGD34046.1| hypothetical protein HMPREF9071_1398 [Capnocytophaga sp. oral taxon 338 str. F0234] Length=541 Score = 35.8 bits (81), Expect = 2.7, Method: Compositional matrix adjust. Identities = 21/69 (31%), Positives = 38/69 (56%), Gaps = 1/69 (1%) Query 52 EFEPAVDADPATISAFVVRISCHLRIQNQAADDDVKEGDTKDETQDVATADFEFAALFDY 111 ++EP + ++ IS F++RI H R + + D + GDTK ++ D+A D + DY Sbjct 48 DYEPMITSNKLIISDFMIRIRPHSRWEGEFGFDWFRTGDTKRKS-DIAFYDILGRYISDY 106 Query 112 HLQEGEDDP 120 + + +DD Sbjct 107 NKEPVKDDS 115 >gi|110680199|ref|YP_683206.1| hypothetical protein RD1_3001 [Roseobacter denitrificans OCh 114] gi|109456315|gb|ABG32520.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length=269 Score = 35.0 bits (79), Expect = 4.3, Method: Compositional matrix adjust. Identities = 32/143 (23%), Positives = 57/143 (40%), Gaps = 20/143 (13%) Query 29 RLLRTQAAVHRAPKPAQGLTYDLEFEPAVDADPATISAFVVRISCHLRIQNQAADDDVKE 88 +LL + HR P +G L ++ + + ++ H+++ DD VK Sbjct 59 QLLNPRDVPHRKPGSPEGRVALLHAVAHIELNAVDLHWDIIARFAHVKMPMGFYDDWVKA 118 Query 89 GDTKDETQDVATADFEFAALFDYHLQEGEDDPTEEELTAYAATTGRFALYPYIREYVYDL 148 D + + ++ D EE + Y A ++ + VYD Sbjct 119 ADEESKHFNLMC------------------DCLEEFGSHYGALPAHAGMWRAAEDTVYDF 160 Query 149 TGRLALPPLTLEILSRPMPVSPG 171 GRLA+ P+ LE +R + V+PG Sbjct 161 MGRLAVVPMVLE--ARGLDVTPG 181 >gi|85857881|ref|YP_460083.1| protein translocase subunit [Syntrophus aciditrophicus SB] gi|85720972|gb|ABC75915.1| protein translocase subunit [Syntrophus aciditrophicus SB] Length=148 Score = 35.0 bits (79), Expect = 4.8, Method: Compositional matrix adjust. Identities = 16/40 (40%), Positives = 23/40 (58%), Gaps = 0/40 (0%) Query 120 PTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTL 159 P++EEL A ++PY+RE + DLT R + PL L Sbjct 79 PSDEELERIARINCASIIFPYVRETIADLTRRANMTPLNL 118 >gi|302534223|ref|ZP_07286565.1| predicted protein [Streptomyces sp. C] gi|302443118|gb|EFL14934.1| predicted protein [Streptomyces sp. C] Length=166 Score = 34.7 bits (78), Expect = 5.8, Method: Compositional matrix adjust. Identities = 16/45 (36%), Positives = 25/45 (56%), Gaps = 0/45 (0%) Query 118 DDPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLEIL 162 +D E+ + AY ++PY+RE V ++G PPLTL+ L Sbjct 109 EDVPEDVIEAYGENVALATVHPYVRELVRRISGDFGFPPLTLDNL 153 Lambda K H 0.318 0.134 0.393 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 165240920448 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40