BLASTP 2.2.25+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters


Query= Rv1761c
Length=127

                                                                     Score  E
Sequences producing significant alignments:                          (Bits) Value

gi|15608899|ref|NP_216277.1| hypothetical protein Rv1761c [Mycob...  253    9e-66
gi|340626771|ref|YP_004745223.1| hypothetical protein MCAN_17771...  251    3e-65
gi|289574436|ref|ZP_06454663.1| hypothetical exported protein [M...  250    4e-65
gi|219689176|pdb|2K3M|A Chain A, Rv1761c                             248    2e-64
gi|240171225|ref|ZP_04749884.1| hypothetical protein MkanA1_1807...  214    5e-54
gi|118618488|ref|YP_906820.1| hypothetical protein MUL_3111 [Myc...  168    2e-40
gi|183982646|ref|YP_001850937.1| hypothetical protein MMAR_2636 ...  166    8e-40
gi|296164739|ref|ZP_06847303.1| conserved hypothetical protein [...  166    1e-39
gi|148257261|ref|YP_001241846.1| delta-1-pyrroline-5-carboxylate...  35.8   2.4
gi|110677975|ref|YP_680982.1| type I secretion target domain-con...  35.4   2.5
gi|52222874|gb|AAU34215.1| two component sensor kinase MppV [Str...  35.4   3.2
gi|163857754|ref|YP_001632052.1| hypothetical protein Bpet3441 [...  35.0   3.8
gi|301770405|ref|XP_002920603.1| PREDICTED: LOW QUALITY PROTEIN:...  34.3   5.7


>gi|15608899|ref|NP_216277.1| hypothetical protein Rv1761c [Mycobacterium tuberculosis H37Rv]
 gi|15841230|ref|NP_336267.1| hypothetical protein MT1810 [Mycobacterium tuberculosis CDC1551]
 gi|31792951|ref|NP_855444.1| hypothetical protein Mb1792c [Mycobacterium bovis AF2122/97]
 43 more sequence titles
Length=127

 Score = 253 bits (645), Expect = 9e-66, Method: Compositional matrix adjust.
 Identities = 127/127 (100%), Positives = 127/127 (100%), Gaps = 0/127 (0%)

Query  1    MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY  60
            MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY
Sbjct  1    MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY  60

Query  61   EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG  120
            EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG
Sbjct  61   EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG  120

Query  121  TGTDYRF  127
            TGTDYRF
Sbjct  121  TGTDYRF  127


>gi|340626771|ref|YP_004745223.1| hypothetical protein MCAN_17771 [Mycobacterium canettii CIPT 140010059]
 gi|340004961|emb|CCC44109.1| hypothetical exported protein [Mycobacterium canettii CIPT 140010059]
Length=127

 Score = 251 bits (640), Expect = 3e-65, Method: Compositional matrix adjust.
 Identities = 126/127 (99%), Positives = 126/127 (99%), Gaps = 0/127 (0%)

Query  1    MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY  60
            MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTP RRGFFRSNPERIQIGDWRY
Sbjct  1    MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPVRRGFFRSNPERIQIGDWRY  60

Query  61   EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG  120
            EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG
Sbjct  61   EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG  120

Query  121  TGTDYRF  127
            TGTDYRF
Sbjct  121  TGTDYRF  127


>gi|289574436|ref|ZP_06454663.1| hypothetical exported protein [Mycobacterium tuberculosis K85]
 gi|289538867|gb|EFD43445.1| hypothetical exported protein [Mycobacterium tuberculosis K85]
Length=127

 Score = 250 bits (639), Expect = 4e-65, Method: Compositional matrix adjust.
 Identities = 126/127 (99%), Positives = 126/127 (99%), Gaps = 0/127 (0%)

Query  1    MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY  60
            MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY
Sbjct  1    MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY  60

Query  61   EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG  120
            EVA DGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG
Sbjct  61   EVAQDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG  120

Query  121  TGTDYRF  127
            TGTDYRF
Sbjct  121  TGTDYRF  127


>gi|219689176|pdb|2K3M|A Chain A, Rv1761c
Length=151

 Score = 248 bits (634), Expect = 2e-64, Method: Compositional matrix adjust.
 Identities = 124/127 (98%), Positives = 124/127 (98%), Gaps = 0/127 (0%)

Query  1    MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY  60
            MSDFDTERVSRAVAAALVGPGGVALVVKV AGLPGVIHTPARRGFFR NPERIQIGDWRY
Sbjct  25   MSDFDTERVSRAVAAALVGPGGVALVVKVCAGLPGVIHTPARRGFFRCNPERIQIGDWRY  84

Query  61   EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG  120
            EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIV RYGATVIPNINAAIEVLG
Sbjct  85   EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVCRYGATVIPNINAAIEVLG  144

Query  121  TGTDYRF  127
            TGTDYRF
Sbjct  145  TGTDYRF  151


>gi|240171225|ref|ZP_04749884.1| hypothetical protein MkanA1_18071 [Mycobacterium kansasii ATCC 12478]
Length=127

 Score = 214 bits (544), Expect = 5e-54, Method: Compositional matrix adjust.
 Identities = 104/127 (82%), Positives = 113/127 (89%), Gaps = 0/127 (0%)

Query  1    MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY  60
            M+ FD E+VSR + AAL GPGGVALVV VFA LPGVIHT ARRG FRSNPERIQIGDWRY
Sbjct  1    MTHFDAEQVSRTIGAALAGPGGVALVVNVFANLPGVIHTAARRGLFRSNPERIQIGDWRY  60

Query  61   EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG  120
            EVAHDGRLLAAHMVNGIVIAED L A+AVGPH++RALGQIVSRYG TVIPNINAA+E+LG
Sbjct  61   EVAHDGRLLAAHMVNGIVIAEDILAADAVGPHVSRALGQIVSRYGPTVIPNINAAVEILG  120

Query  121  TGTDYRF  127
            T T YR+
Sbjct  121  TSTGYRY  127


>gi|118618488|ref|YP_906820.1| hypothetical protein MUL_3111 [Mycobacterium ulcerans Agy99]
 gi|118570598|gb|ABL05349.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=127

 Score = 168 bits (426), Expect = 2e-40, Method: Compositional matrix adjust.
 Identities = 82/121 (68%), Positives = 97/121 (81%), Gaps = 0/121 (0%)

Query  1    MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY  60
            M+ +D +RVS AV AAL GPGGV +VVKVF  +PGV+ TPARRGFFRS PERI IGDWRY
Sbjct  1    MTGYDQQRVSDAVGAALAGPGGVGMVVKVFCAVPGVVLTPARRGFFRSEPERILIGDWRY  60

Query  61   EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG  120
            EV  DGRL A+H+VNGIV+AE  L A AVGPH+ARAL Q+VS YG T+ P I+AA+E+L
Sbjct  61   EVTSDGRLSASHLVNGIVLAEQVLAAGAVGPHIARALAQLVSHYGPTIQPGIDAALEMLE  120

Query  121  T  121
            T
Sbjct  121  T  121


>gi|183982646|ref|YP_001850937.1| hypothetical protein MMAR_2636 [Mycobacterium marinum M]
 gi|183175972|gb|ACC41082.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=127

 Score = 166 bits (421), Expect = 8e-40, Method: Compositional matrix adjust.
 Identities = 81/121 (67%), Positives = 96/121 (80%), Gaps = 0/121 (0%)

Query  1    MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY  60
            M+ +D +RVS AV AAL GPGGV +VVKVF  +PGV+ TPARRGFFRS PERI IGDWRY
Sbjct  1    MTGYDQQRVSDAVGAALAGPGGVGMVVKVFCAVPGVVLTPARRGFFRSEPERILIGDWRY  60

Query  61   EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG  120
            EV  DGRL A+H+VNGIV+AE  L A  VGPH+ARAL Q+VS YG T+ P I+AA+E+L
Sbjct  61   EVTSDGRLSASHLVNGIVLAEQVLAAGEVGPHIARALAQLVSHYGPTIQPGIDAALEMLE  120

Query  121  T  121
            T
Sbjct  121  T  121


>gi|296164739|ref|ZP_06847303.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614]
 gi|295899910|gb|EFG79352.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614]
Length=126

 Score = 166 bits (420), Expect = 1e-39, Method: Compositional matrix adjust.
 Identities = 79/126 (63%), Positives = 99/126 (79%), Gaps = 0/126 (0%)

Query  1    MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRY  60
            M+  D ERV  AV  AL GPGGV +V++VF G+PGV+  PARRGF RS PER+QIGDWRY
Sbjct  1    MASLDHERVGHAVGTALSGPGGVGMVLRVFCGVPGVVFHPARRGFLRSQPERVQIGDWRY  60

Query  61   EVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRYGATVIPNINAAIEVLG  120
            EV  DGRL AAH+V+GIV++E+ L A AVGPH+ARALGQ+V  YG T++PNI+AA++VL
Sbjct  61   EVTADGRLSAAHLVSGIVLSEEILAAGAVGPHIARALGQLVGSYGPTIVPNIDAALDVLE  120

Query  121  TGTDYR  126
             G+  R
Sbjct  121  AGSGPR  126


>gi|148257261|ref|YP_001241846.1| delta-1-pyrroline-5-carboxylate dehydrogenase / L-proline dehydrogenase [Bradyrhizobium sp. BTAi1]
 gi|146409434|gb|ABQ37940.1| L-proline dehydrogenase [Bradyrhizobium sp. BTAi1]
Length=1226

 Score = 35.8 bits (81), Expect = 2.4, Method: Compositional matrix adjust.
 Identities = 34/106 (33%), Positives = 49/106 (47%), Gaps = 6/106 (5%)

Query  16   ALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNPERIQIGDWRYEVAHDGRLLAAHMVN  75
            AL    GVAL+    A L  V  T  R    R   ++I  GDWR  + H   LL
Sbjct  91   ALSSQEGVALMCLAEA-LLRVPDTATRDALIR---DKIATGDWRAHIGHSPSLLVNAATW  146

Query  76   GIVIAEDALIAEAVGPHLARALGQIVSRYGATVI-PNINAAIEVLG  120
            G+++    L A A    LA AL ++++R G  +I   +  A+ +LG
Sbjct  147  GLIVT-GKLTAAASEDSLASALTRLIARGGEPIIRQGVTLAMRLLG  191


>gi|110677975|ref|YP_680982.1| type I secretion target domain-containing protein [Roseobacter denitrificans OCh 114]
 gi|109454091|gb|ABG30296.1| type I secretion target domain protein, putative [Roseobacter denitrificans OCh 114]
Length=1021

 Score = 35.4 bits (80), Expect = 2.5, Method: Compositional matrix adjust.
 Identities = 20/76 (27%), Positives = 34/76 (45%), Gaps = 0/76 (0%)

Query  37   IHTPARRGFFRSNPERIQIGDWRYEVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARA  96
            + + +   F RS+P  I+ GDW +     G   AA  ++G+ +  D+      G +
Sbjct  453  LQSDSGNTFLRSDPGTIEAGDWSHVAVSFGPEGAALFLDGVQVDTDSYTGGLAGNNEPWT  512

Query  97   LGQIVSRYGATVIPNI  112
            LG   +R G  V  N+
Sbjct  513  LGASQNRSGDGVATNL  528


>gi|52222874|gb|AAU34215.1| two component sensor kinase MppV [Streptomyces hygroscopicus]
Length=390

 Score = 35.4 bits (80), Expect = 3.2, Method: Compositional matrix adjust.
 Identities = 22/64 (35%), Positives = 34/64 (54%), Gaps = 1/64 (1%)

Query  57   DWRYEVAHDGRLLAAHMVNGIVI-AEDALIAEAVGPHLARALGQIVSRYGATVIPNINAA  115
            + R EVA D     AH V GIV+ A+ A ++E  GP   RAL Q + + G   + +++
Sbjct  178  EQRLEVARDLHDFVAHEVTGIVLEAQAAQVSEDAGPEEHRALLQRIEKAGLRALDSMDQT  237

Query  116  IEVL  119
            +  L
Sbjct  238  VTTL  241


>gi|163857754|ref|YP_001632052.1| hypothetical protein Bpet3441 [Bordetella petrii DSM 12804]
 gi|163261482|emb|CAP43784.1| unnamed protein product [Bordetella petrii]
Length=822

 Score = 35.0 bits (79), Expect = 3.8, Method: Compositional matrix adjust.
 Identities = 19/47 (41%), Positives = 27/47 (58%), Gaps = 2/47 (4%)

Query  56   GDWRYEVAHDGRLLAAHMVNGI--VIAEDALIAEAVGPHLARALGQI  100
            GDW YEV +DG  + A + +G   +   +     A  PHLARA+GQ+
Sbjct  234  GDWLYEVKYDGYRILARLEDGKARLYTRNGHDWTARLPHLARAIGQL  280


>gi|301770405|ref|XP_002920603.1| PREDICTED: LOW QUALITY PROTEIN: DALR anticodon-binding domain-containing protein 3-like [Ailuropoda melanoleuca]
Length=508

 Score = 34.3 bits (77), Expect = 5.7, Method: Compositional matrix adjust.
 Identities = 44/127 (35%), Positives = 55/127 (44%), Gaps = 23/127 (18%)

Query  7    ERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFF----RSNPERIQIGDWRYEV  62
            ERV RAVA  L GPG VA V++   GL  V+H PA RG       S    + + D
Sbjct  58   ERVVRAVAG-LQGPG-VAPVLRCTLGLRVVLHCPALRGALGALRLSQLRAVLVAD-----  110

Query  63   AHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQIVSRY-GATVIPNI----NAAIE  117
             H  R L AH       A   L+     PH+   L Q+   +  A+  PNI    N A+
Sbjct  111  -HLARALRAHG------ASVRLVPAVRDPHMPTFLQQLCVDWPSASQRPNIEALRNLALA  163

Query  118  VLGTGTD  124
            VL    D
Sbjct  164  VLSPARD  170



Lambda      K        H
   0.324    0.140    0.414

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Effective search space used: 129319095676


  Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
    Posted date:  Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
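

Note on the statistics above: the bit scores and E-values in this report can be approximately re-derived from the raw scores (the numbers in parentheses after "bits"), the gapped Lambda and K, and the effective search space. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations S' = (Lambda*S - ln K)/ln 2 and E = (search space) * 2^(-S'); the function names are hypothetical. Because this search used compositional matrix adjustment, the reported E-values, especially for the weak hits, need not match this simple calculation exactly.

```python
import math

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 129319095676


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Normalize a raw alignment score: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits: E = search_space * 2 ** (-S')."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores 645 and 81 are the top hit and the first weak hit above.
    for raw in (645, 81):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E~{expect_value(bits):.1e}")
```

For the top hit (raw score 645) this gives a bit score close to the reported 253 and an E-value of the same order as the reported 9e-66.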