BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1439c Length=141 Score E Sequences producing significant alignments: (Bits) Value gi|15608577|ref|NP_215955.1| hypothetical protein Rv1439c [Mycob... 283 7e-75 gi|289745189|ref|ZP_06504567.1| conserved hypothetical protein [... 282 1e-74 gi|340626453|ref|YP_004744905.1| hypothetical protein MCAN_14551... 273 5e-72 gi|342858907|ref|ZP_08715561.1| hypothetical protein MCOL_08523 ... 100 1e-19 gi|325000440|ref|ZP_08121552.1| hypothetical protein PseP1_16812... 68.9 2e-10 gi|329937552|ref|ZP_08287110.1| hypothetical protein SGM_2602 [S... 42.4 0.021 gi|85707873|ref|ZP_01038939.1| hypothetical protein NAP1_01520 [... 41.2 0.048 gi|254282865|ref|ZP_04957833.1| conserved hypothetical protein [... 41.2 0.058 gi|299771218|ref|YP_003733244.1| putative cell wall-associated h... 38.9 0.26 gi|269139236|ref|YP_003295937.1| thiamine kinase [Edwardsiella t... 38.9 0.28 gi|304559146|gb|ADM41810.1| hypothetical protein ETAF_1702 [Edwa... 38.9 0.29 gi|148271759|ref|YP_001221320.1| hypothetical protein CMM_0580 [... 37.7 0.58 gi|254773078|ref|ZP_05214594.1| hypothetical protein MaviaA2_001... 37.4 0.70 gi|294669560|ref|ZP_06734627.1| hypothetical protein NEIELOOT_01... 37.4 0.82 gi|111025301|ref|YP_707721.1| hypothetical protein RHA1_ro08519 ... 37.0 0.86 gi|342860258|ref|ZP_08716910.1| hypothetical protein MCOL_15315 ... 37.0 0.96 gi|260557531|ref|ZP_05829746.1| cell wall-associated hydrolase [... 36.6 1.2 gi|342224241|gb|EGT89289.1| putative cell wall-associated hydrol... 36.6 1.3 gi|261339725|ref|ZP_05967583.1| putative transcriptional regulat... 35.8 1.9 gi|111221925|ref|YP_712719.1| hypothetical protein FRAAL2500 [Fr... 35.8 2.4 gi|126734591|ref|ZP_01750337.1| hypothetical protein RCCS2_11979... 35.4 2.6 gi|46205181|ref|ZP_00209734.1| hypothetical protein Magn03002146... 34.3 6.1 gi|294339004|emb|CAZ87349.1| Type I site-specific deoxyribonucle... 34.3 7.0 gi|114797389|ref|YP_760265.1| limonene-1,2-epoxide hydrolase [Hy... 33.9 8.2 gi|333983669|ref|YP_004512879.1| type I site-specific deoxyribon... 33.9 8.3 >gi|15608577|ref|NP_215955.1| hypothetical protein Rv1439c [Mycobacterium tuberculosis H37Rv] gi|15840897|ref|NP_335934.1| hypothetical protein MT1484 [Mycobacterium tuberculosis CDC1551] gi|31792633|ref|NP_855126.1| hypothetical protein Mb1474c [Mycobacterium bovis AF2122/97] 70 more sequence titlesLength=141 Score = 283 bits (723), Expect = 7e-75, Method: Compositional matrix adjust. Identities = 141/141 (100%), Positives = 141/141 (100%), Gaps = 0/141 (0%) Query 1 MQMSASNAFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLL 60 MQMSASNAFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLL Sbjct 1 MQMSASNAFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLL 60 Query 61 PDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLLRGDKAVERVSYFDSLPLL 120 PDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLLRGDKAVERVSYFDSLPLL Sbjct 61 PDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLLRGDKAVERVSYFDSLPLL 120 Query 121 IKVVKHPSAWRGWLTTMRSRA 141 IKVVKHPSAWRGWLTTMRSRA Sbjct 121 IKVVKHPSAWRGWLTTMRSRA 141 >gi|289745189|ref|ZP_06504567.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289757546|ref|ZP_06516924.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|294995000|ref|ZP_06800691.1| hypothetical protein Mtub2_10935 [Mycobacterium tuberculosis 210] gi|289685717|gb|EFD53205.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289713110|gb|EFD77122.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|326903060|gb|EGE49993.1| hypothetical protein TBPG_00921 [Mycobacterium tuberculosis W-148] gi|339294415|gb|AEJ46526.1| hypothetical protein CCDC5079_1336 [Mycobacterium tuberculosis CCDC5079] gi|339298053|gb|AEJ50163.1| hypothetical protein CCDC5180_1326 [Mycobacterium tuberculosis CCDC5180] Length=141 Score = 282 bits (721), Expect = 1e-74, Method: Compositional matrix adjust. Identities = 140/141 (99%), Positives = 141/141 (100%), Gaps = 0/141 (0%) Query 1 MQMSASNAFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLL 60 MQMSASNAFVEGFADFWKAPSPDRLTDH+HPDVVLVRPLSPPRHGLGAAQREFTRILGLL Sbjct 1 MQMSASNAFVEGFADFWKAPSPDRLTDHIHPDVVLVRPLSPPRHGLGAAQREFTRILGLL 60 Query 61 PDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLLRGDKAVERVSYFDSLPLL 120 PDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLLRGDKAVERVSYFDSLPLL Sbjct 61 PDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLLRGDKAVERVSYFDSLPLL 120 Query 121 IKVVKHPSAWRGWLTTMRSRA 141 IKVVKHPSAWRGWLTTMRSRA Sbjct 121 IKVVKHPSAWRGWLTTMRSRA 141 >gi|340626453|ref|YP_004744905.1| hypothetical protein MCAN_14551 [Mycobacterium canettii CIPT 140010059] gi|340004643|emb|CCC43787.1| hypothetical protein MCAN_14551 [Mycobacterium canettii CIPT 140010059] Length=141 Score = 273 bits (698), Expect = 5e-72, Method: Compositional matrix adjust. Identities = 134/141 (96%), Positives = 138/141 (98%), Gaps = 0/141 (0%) Query 1 MQMSASNAFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLL 60 MQMSASNAFVEGFA+FWKAPSPDRLTDHLHPDVVL++PLSPPRHGL AAQREFTRILGLL Sbjct 1 MQMSASNAFVEGFANFWKAPSPDRLTDHLHPDVVLIQPLSPPRHGLAAAQREFTRILGLL 60 Query 61 PDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLLRGDKAVERVSYFDSLPLL 120 PDLHGEVDRWSQAGDVVFIEFRLIARLG EVVEWPVVDRFLLRGDKAVERVSYFDSLPLL Sbjct 61 PDLHGEVDRWSQAGDVVFIEFRLIARLGREVVEWPVVDRFLLRGDKAVERVSYFDSLPLL 120 Query 121 IKVVKHPSAWRGWLTTMRSRA 141 IKV+KHPSAWRGWLTTMR RA Sbjct 121 IKVLKHPSAWRGWLTTMRGRA 141 >gi|342858907|ref|ZP_08715561.1| hypothetical protein MCOL_08523 [Mycobacterium colombiense CECT 3035] gi|342133148|gb|EGT86351.1| hypothetical protein MCOL_08523 [Mycobacterium colombiense CECT 3035] Length=112 Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 51/102 (50%), Positives = 68/102 (67%), Gaps = 1/102 (0%) Query 33 VVLVRPLSPPRHGLGAAQREFTRILGLLPDLHGEVDRWSQAGDVVFIEFRLIARLGSEVV 92 +VL++PL P G+ AAQ +F R LP LH VD WS DVVFIEFRL A + ++ Sbjct 1 MVLMQPLVPRMVGIEAAQTQFQRFGYCLPGLHAHVDYWSGDEDVVFIEFRLHAHIAGHLI 60 Query 93 EWPVVDRFLLRGDKAVERVSYFDSLPLLIKVVKHPS-AWRGW 133 EWP V+R LR +A+ERV+YFD + +L +++HPS AWR W Sbjct 61 EWPNVNRLCLRDGRAIERVTYFDPVAMLPTLLRHPSIAWRWW 102 >gi|325000440|ref|ZP_08121552.1| hypothetical protein PseP1_16812 [Pseudonocardia sp. P1] Length=290 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 46/125 (37%), Positives = 58/125 (47%), Gaps = 1/125 (0%) Query 10 VEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLLPDLHGEVDR 69 VE FA W P P L +V L +PL G + EF R+ LPD+ GEV Sbjct 6 VEEFAAGWHDPGPRAWDSLLAEEVELHQPLMLDGVGPAHWRDEFERLQRFLPDIRGEVVA 65 Query 70 WSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLLRGD-KAVERVSYFDSLPLLIKVVKHPS 128 W+ +G +F+E A VE+ VDR D K R SY D LPLL ++ PS Sbjct 66 WASSGQTLFVEIVCRATAAGRPVEFRAVDRLTFSDDGKVTARSSYLDPLPLLRSLLGRPS 125 Query 129 AWRGW 133 W W Sbjct 126 VWLRW 130 >gi|329937552|ref|ZP_08287110.1| hypothetical protein SGM_2602 [Streptomyces griseoaurantiacus M045] gi|329303428|gb|EGG47315.1| hypothetical protein SGM_2602 [Streptomyces griseoaurantiacus M045] Length=268 Score = 42.4 bits (98), Expect = 0.021, Method: Compositional matrix adjust. Identities = 30/87 (35%), Positives = 43/87 (50%), Gaps = 4/87 (4%) Query 10 VEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRH--GLGAAQREFTRILGLLPDLHGEV 67 ++ D + P+R+ D D V RPL P G G +T+IL LPDLH E+ Sbjct 6 LQSLRDAINSHVPERIADCFTDDYVAERPLRPAEGFIGSGEVAANWTKILAGLPDLHVEI 65 Query 68 DRWSQAGDVVFI--EFRLIARLGSEVV 92 R +Q GD + E+R A G+ V+ Sbjct 66 LRHAQNGDESWSEWEYRGTAPTGATVL 92 >gi|85707873|ref|ZP_01038939.1| hypothetical protein NAP1_01520 [Erythrobacter sp. NAP1] gi|85689407|gb|EAQ29410.1| hypothetical protein NAP1_01520 [Erythrobacter sp. NAP1] Length=131 Score = 41.2 bits (95), Expect = 0.048, Method: Compositional matrix adjust. Identities = 38/122 (32%), Positives = 57/122 (47%), Gaps = 10/122 (8%) Query 3 MSASNA-FVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRIL-GLL 60 +SA +A F F FW PS +R+ + + PD V+ G E+ ++ GLL Sbjct 9 LSAEHADFYRRFQSFWADPSGERVAEIIGPDAVIHFSGQGTFTG-----EEYKAVMQGLL 63 Query 61 PDLHG-EVDRWSQAG--DVVFIEFRLIARLGSEVVEWPVVDRFLLRGDKAVERVSYFDSL 117 + G EV AG D ++I +R + + VE+ DRF +R AVE FD+ Sbjct 64 DQMEGLEVTPLDCAGNGDQLYIHWRTSVPIDGKRVEYVGFDRFTVRDGMAVEEYVIFDTA 123 Query 118 PL 119 L Sbjct 124 VL 125 >gi|254282865|ref|ZP_04957833.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] gi|219679068|gb|EED35417.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] Length=128 Score = 41.2 bits (95), Expect = 0.058, Method: Compositional matrix adjust. Identities = 26/102 (26%), Positives = 52/102 (51%), Gaps = 9/102 (8%) Query 23 DRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLLPDLHGEVD---RWSQAGD---V 76 + H HPD +V S P G + FT+++ + ++ E + + AGD + Sbjct 21 ETFAKHTHPDFAVVEADSLPFAGTFSGMEGFTQLVTKVFEMFSEFEPNVKAMAAGDDTVM 80 Query 77 VFIEFRLIARLGSEVVEWPVVDRFLLRGDKAVERVS--YFDS 116 V+I+ +L R + + P+++ F GDK V++++ Y+D+ Sbjct 81 VWIDMKLTGRESGKTINVPMIEVFSFEGDK-VKQITPFYYDT 121 >gi|299771218|ref|YP_003733244.1| putative cell wall-associated hydrolase [Acinetobacter sp. DR1] gi|298701306|gb|ADI91871.1| putative cell wall-associated hydrolase [Acinetobacter sp. DR1] Length=148 Score = 38.9 bits (89), Expect = 0.26, Method: Compositional matrix adjust. Identities = 28/81 (35%), Positives = 37/81 (46%), Gaps = 10/81 (12%) Query 25 LTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLLPDLHGEVDRWSQAGDVVFIEFRLI 84 L DHL P RP P H QR ILG+ + G Q GD+V +F Sbjct 52 LMDHLDP-----RPYPPDWHLHQMGQRYLELILGVCEQIEGP----PQPGDIVLYQFGKC 102 Query 85 ARLGSEVVEWP-VVDRFLLRG 104 G+ V+EWP V+ +L +G Sbjct 103 ISHGAIVIEWPQVIHSYLHQG 123 >gi|269139236|ref|YP_003295937.1| thiamine kinase [Edwardsiella tarda EIB202] gi|267984897|gb|ACY84726.1| thiamine kinase [Edwardsiella tarda EIB202] Length=280 Score = 38.9 bits (89), Expect = 0.28, Method: Compositional matrix adjust. Identities = 26/74 (36%), Positives = 38/74 (52%), Gaps = 14/74 (18%) Query 8 AFVEGFA-DFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRIL------GLL 60 + +G + D W+A + D +T +L+RP++PPR LG +R R L GL Sbjct 27 SLTQGLSGDNWRAQNGDSVT-------ILLRPITPPRRHLGIDRRREGRALRMGAAAGLG 79 Query 61 PDLHGEVDRWSQAG 74 P +G VD W AG Sbjct 80 PTCYGTVDGWLLAG 93 >gi|304559146|gb|ADM41810.1| hypothetical protein ETAF_1702 [Edwardsiella tarda FL6-60] Length=273 Score = 38.9 bits (89), Expect = 0.29, Method: Compositional matrix adjust. Identities = 26/74 (36%), Positives = 38/74 (52%), Gaps = 14/74 (18%) Query 8 AFVEGFA-DFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRIL------GLL 60 + +G + D W+A + D +T +L+RP++PPR LG +R R L GL Sbjct 20 SLTQGLSGDNWRAQNGDSVT-------ILLRPITPPRRHLGIDRRREGRALRMGAAAGLG 72 Query 61 PDLHGEVDRWSQAG 74 P +G VD W AG Sbjct 73 PTCYGTVDGWLLAG 86 >gi|148271759|ref|YP_001221320.1| hypothetical protein CMM_0580 [Clavibacter michiganensis subsp. michiganensis NCPPB 382] gi|147829689|emb|CAN00605.1| conserved hypothetical protein [Clavibacter michiganensis subsp. michiganensis NCPPB 382] Length=140 Score = 37.7 bits (86), Expect = 0.58, Method: Compositional matrix adjust. Identities = 35/134 (27%), Positives = 55/134 (42%), Gaps = 10/134 (7%) Query 8 AFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSP-PRHGLGAAQREFTRILGLLPDLHGE 66 A ++ F +F + + +HPD + P+SP P GL + + G +PD+ Sbjct 7 ATMQRFLEFINSGDEAIGREVIHPDAAFMTPISPEPLRGLDGYLQILGVMRGAIPDVQWT 66 Query 67 VDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLLRGDKA---------VERVSYFDSL 117 +++ GD V F L G E V P R + G A ++ V D L Sbjct 67 LEQLVVEGDHVAARFELRGTHGGESVGVPATGRPIAVGASAFYRFADGRIIDEVGQPDLL 126 Query 118 PLLIKVVKHPSAWR 131 LL ++ P A R Sbjct 127 ALLGQIGALPPALR 140 >gi|254773078|ref|ZP_05214594.1| hypothetical protein MaviaA2_00129 [Mycobacterium avium subsp. avium ATCC 25291] Length=120 Score = 37.4 bits (85), Expect = 0.70, Method: Compositional matrix adjust. Identities = 28/83 (34%), Positives = 38/83 (46%), Gaps = 8/83 (9%) Query 8 AFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLS----PPRHGL---GAAQREF-TRILGL 59 AF E + W A D + D H DVV P++ P G+ AA R++ T L Sbjct 7 AFSERWVRNWNAHDVDAVLDEFHDDVVFTSPVAAQLLPETQGIIRGKAALRQYWTAALAS 66 Query 60 LPDLHGEVDRWSQAGDVVFIEFR 82 +PDL V+ Q D V I +R Sbjct 67 IPDLRFSVEGVYQGIDTVVIAYR 89 >gi|294669560|ref|ZP_06734627.1| hypothetical protein NEIELOOT_01459 [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291308473|gb|EFE49716.1| hypothetical protein NEIELOOT_01459 [Neisseria elongata subsp. glycolytica ATCC 29315] Length=173 Score = 37.4 bits (85), Expect = 0.82, Method: Compositional matrix adjust. Identities = 31/140 (23%), Positives = 58/140 (42%), Gaps = 16/140 (11%) Query 1 MQMSASNAFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLL 60 M ++AF++ F + + SP + + PD V ++ H + IL L Sbjct 34 MSQIGNHAFMQKFVELLNSASPKLAVELIAPDAVFNSVITGREH---TGPSGYLEILAAL 90 Query 61 ----PDLHGEVDRWSQAGDVVFIEFRLIA---------RLGSEVVEWPVVDRFLLRGDKA 107 PD+ E+ G+VV F + + G + V ++ + L+ K Sbjct 91 KAGFPDVRWELQELVGEGNVVAARFNISGTHLGEFQGIKPGGKEVRISAMNLYRLKSGKV 150 Query 108 VERVSYFDSLPLLIKVVKHP 127 V+ + FD+L LL ++ +P Sbjct 151 VKEFNQFDNLSLLWQIGVYP 170 >gi|111025301|ref|YP_707721.1| hypothetical protein RHA1_ro08519 [Rhodococcus jostii RHA1] gi|110824280|gb|ABG99563.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length=141 Score = 37.0 bits (84), Expect = 0.86, Method: Compositional matrix adjust. Identities = 21/65 (33%), Positives = 32/65 (50%), Gaps = 2/65 (3%) Query 23 DRLTDHLHPDVVLVRPLSPPRHGLGAAQ--REFTRILGLLPDLHGEVDRWSQAGDVVFIE 80 D + D D + P P R G+ Q R +T I G +PDL V RW+ GD ++ E Sbjct 26 DAVVDCFASDYLNETPAHPARGFTGSEQVRRNWTEIFGGVPDLRARVLRWAVDGDTIWGE 85 Query 81 FRLIA 85 + ++ Sbjct 86 WEMVG 90 >gi|342860258|ref|ZP_08716910.1| hypothetical protein MCOL_15315 [Mycobacterium colombiense CECT 3035] gi|342132636|gb|EGT85865.1| hypothetical protein MCOL_15315 [Mycobacterium colombiense CECT 3035] Length=129 Score = 37.0 bits (84), Expect = 0.96, Method: Compositional matrix adjust. Identities = 26/83 (32%), Positives = 37/83 (45%), Gaps = 8/83 (9%) Query 8 AFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLS----PPRHGL----GAAQREFTRILGL 59 AF E + W A + + H DVV P++ P HG+ A +R +T L Sbjct 17 AFSERWVQNWNAHDVEAVLADFHDDVVFTSPVAAEIFPETHGVVRGKAALRRYWTGALAR 76 Query 60 LPDLHGEVDRWSQAGDVVFIEFR 82 +PDL VD Q D + I +R Sbjct 77 IPDLRFSVDGVYQGIDTIVIAYR 99 >gi|260557531|ref|ZP_05829746.1| cell wall-associated hydrolase [Acinetobacter baumannii ATCC 19606] gi|260409157|gb|EEX02460.1| cell wall-associated hydrolase [Acinetobacter baumannii ATCC 19606] Length=140 Score = 36.6 bits (83), Expect = 1.2, Method: Compositional matrix adjust. Identities = 28/81 (35%), Positives = 36/81 (45%), Gaps = 10/81 (12%) Query 25 LTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLLPDLHGEVDRWSQAGDVVFIEFRLI 84 L DHL P RP P H QR ILG+ + G Q GD+V F Sbjct 44 LMDHLDP-----RPYPPDWHLHQMGQRYLELILGVCDPVEGP----PQPGDIVLYHFGKC 94 Query 85 ARLGSEVVEWP-VVDRFLLRG 104 G+ V+EWP V+ +L +G Sbjct 95 ISHGAIVIEWPQVIHSYLHQG 115 >gi|342224241|gb|EGT89289.1| putative cell wall-associated hydrolase [Acinetobacter baumannii ABNIH1] gi|342237133|gb|EGU01621.1| putative cell wall-associated hydrolase [Acinetobacter baumannii ABNIH4] Length=140 Score = 36.6 bits (83), Expect = 1.3, Method: Compositional matrix adjust. Identities = 27/81 (34%), Positives = 36/81 (45%), Gaps = 10/81 (12%) Query 25 LTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLLPDLHGEVDRWSQAGDVVFIEFRLI 84 L DHL P RP P H QR ILG+ + G Q GD+V F Sbjct 44 LMDHLDP-----RPYPPDWHMHQMGQRYLELILGVCDPIEGP----PQPGDIVLYHFGKC 94 Query 85 ARLGSEVVEWP-VVDRFLLRG 104 G+ V+EWP V+ ++ +G Sbjct 95 ISHGAIVIEWPQVIHSYIHQG 115 >gi|261339725|ref|ZP_05967583.1| putative transcriptional regulator [Enterobacter cancerogenus ATCC 35316] gi|288318554|gb|EFC57492.1| putative transcriptional regulator [Enterobacter cancerogenus ATCC 35316] Length=144 Score = 35.8 bits (81), Expect = 1.9, Method: Compositional matrix adjust. Identities = 32/129 (25%), Positives = 53/129 (42%), Gaps = 12/129 (9%) Query 3 MSASNAFVEGFADFWKA---PSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGL 59 MS + F +++ A P L + H D LV P HGL A QR FT +L Sbjct 1 MSTLPTVISRFVNYYVALDTQPPSALAEIYHSDATLVDPFGE-HHGLLALQRYFTHLLAN 59 Query 60 LPDLHGEVDRWSQAGDVVFIEFRL------IARLGSEVVEWPVVDRFLLRGDKAVERVSY 113 + + VD Q GD + + + IA G + P + D+ + + Y Sbjct 60 VQECRFTVDAPLQQGDRFVVTWVMHWSHPRIA--GGALRHLPGCSVVNMHNDRIIRQRDY 117 Query 114 FDSLPLLIK 122 +D+ ++ + Sbjct 118 YDAGEMIYE 126 >gi|111221925|ref|YP_712719.1| hypothetical protein FRAAL2500 [Frankia alni ACN14a] gi|111149457|emb|CAJ61147.1| hypothetical protein FRAAL2500 [Frankia alni ACN14a] gi|116633611|emb|CAH69513.1| hypothetical protein [Frankia alni] Length=267 Score = 35.8 bits (81), Expect = 2.4, Method: Compositional matrix adjust. Identities = 26/75 (35%), Positives = 37/75 (50%), Gaps = 4/75 (5%) Query 22 PDRLTDHLHPDVVLVRPLSPPRHGLGA--AQREFTRILGLLPDLHGEVDRWSQAGDVVFI 79 P+R+ D V RPL P +G+ + +IL LPDL E+ R +Q GD ++ Sbjct 18 PERIASCFTDDYVAERPLKPHEGFIGSDHVAANWRKILAGLPDLQAEILRHAQNGDELWS 77 Query 80 --EFRLIARLGSEVV 92 E R A G+ VV Sbjct 78 EWEMRGTAPTGATVV 92 >gi|126734591|ref|ZP_01750337.1| hypothetical protein RCCS2_11979 [Roseobacter sp. CCS2] gi|126715146|gb|EBA12011.1| hypothetical protein RCCS2_11979 [Roseobacter sp. CCS2] Length=263 Score = 35.4 bits (80), Expect = 2.6, Method: Compositional matrix adjust. Identities = 22/53 (42%), Positives = 32/53 (61%), Gaps = 4/53 (7%) Query 1 MQMSASN-AFVEGF-ADFWKAPSPDRLTDHLHPDVVLVRP--LSPPRHGLGAA 49 + ++A N A VEGF AD + +PDR+TD++ PD+ L + GLGAA Sbjct 138 IDLTADNKALVEGFIADVLQGAAPDRITDYISPDLYLQHNPMVGDGLDGLGAA 190 >gi|46205181|ref|ZP_00209734.1| hypothetical protein Magn03002146 [Magnetospirillum magnetotacticum MS-1] Length=85 Score = 34.3 bits (77), Expect = 6.1, Method: Compositional matrix adjust. Identities = 18/46 (40%), Positives = 22/46 (48%), Gaps = 0/46 (0%) Query 14 ADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGL 59 AD W+ PSP R L VV RP P G + + RILG+ Sbjct 40 ADIWRRPSPSRRQILLGMCVVQARPSPSPLQGKASEVENYQRILGI 85 >gi|294339004|emb|CAZ87349.1| Type I site-specific deoxyribonuclease HsdR [Thiomonas sp. 3As] gi|294341831|emb|CAZ90260.1| Type I site-specific deoxyribonuclease HsdR [Thiomonas sp. 3As] Length=1069 Score = 34.3 bits (77), Expect = 7.0, Method: Compositional matrix adjust. Identities = 29/99 (30%), Positives = 48/99 (49%), Gaps = 8/99 (8%) Query 32 DVVLVRPLSPPRHGLGAAQREFTRILGLLPDLHGEVDRWSQAGDVVFIEFRLIARLGSEV 91 D L L+P GL AA R+F R L + + D ++ V I+F + R Sbjct 72 DAALRLVLNPNTPGLVAANRQFHRWLVEGVPVEYQRDGETRGDRVRLIDFSDVGR----- 126 Query 92 VEWPVVDRFLLRGDKAVER---VSYFDSLPLLIKVVKHP 127 +W V++F ++G K R V + + LPL++ +K+P Sbjct 127 NDWLAVNQFTVQGPKHTRRPDLVLFLNGLPLVVLELKNP 165 >gi|114797389|ref|YP_760265.1| limonene-1,2-epoxide hydrolase [Hyphomonas neptunium ATCC 15444] gi|114737563|gb|ABI75688.1| limonene-1,2-epoxide hydrolase [Hyphomonas neptunium ATCC 15444] Length=125 Score = 33.9 bits (76), Expect = 8.2, Method: Compositional matrix adjust. Identities = 27/113 (24%), Positives = 43/113 (39%), Gaps = 0/113 (0%) Query 3 MSASNAFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQREFTRILGLLPD 62 M+ + + F W D + + D V P G GA + L + Sbjct 1 MTENEKIIREFIAAWSRLGVDEIVSYFADDGVYHNMPFRPVTGHGALKAFVGNFLKDWTE 60 Query 63 LHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLLRGDKAVERVSYFD 115 + E+ +GD+V E RLG++ V+ P F++ G K YFD Sbjct 61 TNWEILTLVSSGDIVVAERLDRTRLGTQAVDLPCCGVFVMEGGKIKVWRDYFD 113 >gi|333983669|ref|YP_004512879.1| type I site-specific deoxyribonuclease, HsdR family [Methylomonas methanica MC09] gi|333807710|gb|AEG00380.1| type I site-specific deoxyribonuclease, HsdR family [Methylomonas methanica MC09] Length=1044 Score = 33.9 bits (76), Expect = 8.3, Method: Compositional matrix adjust. Identities = 25/87 (29%), Positives = 43/87 (50%), Gaps = 8/87 (9%) Query 45 GLGAAQREFTRILGLLPDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRFLLRG 104 GL AA R F R+L ++G + + GD RLI EW +++F ++G Sbjct 78 GLLAANRHFHRLL-----VNGVPVEYQKDGDTRGDRVRLIEFAEPSSNEWLAINQFAIKG 132 Query 105 DKAVER---VSYFDSLPLLIKVVKHPS 128 K R + + + LPL++ +K+P+ Sbjct 133 PKHSRRPDIILFINGLPLVLLELKNPA 159 Lambda K H 0.325 0.140 0.448 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 130347035928 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40