BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0612 Length=201 Score E Sequences producing significant alignments: (Bits) Value gi|15607752|ref|NP_215126.1| hypothetical protein Rv0612 [Mycoba... 392 1e-107 gi|308231600|ref|ZP_07413056.2| hypothetical protein TMAG_02492 ... 356 1e-96 gi|307083110|ref|ZP_07492223.1| hypothetical protein TMLG_03359 ... 342 1e-92 gi|333989262|ref|YP_004521876.1| hypothetical protein JDM601_062... 171 5e-41 gi|260907600|ref|ZP_05915922.1| hypothetical protein BlinB_19847... 119 2e-25 gi|31791792|ref|NP_854285.1| hypothetical protein Mb0626 [Mycoba... 50.1 2e-04 gi|15840010|ref|NP_335047.1| hypothetical protein MT0638.1 [Myco... 49.3 4e-04 gi|336115309|ref|YP_004570076.1| hypothetical protein BCO26_2632... 48.1 7e-04 gi|255280369|ref|ZP_05344924.1| hypothetical protein BRYFOR_0570... 47.0 0.001 gi|289441999|ref|ZP_06431743.1| conserved hypothetical protein [... 47.0 0.002 gi|229542672|ref|ZP_04431732.1| hypothetical protein BcoaDRAFT_5... 45.1 0.007 gi|337769103|emb|CCB77816.1| putative FMN-dependent monooxygenas... 41.2 0.095 gi|124006831|ref|ZP_01691661.1| hypothetical protein M23134_0653... 38.5 0.57 gi|317038352|ref|XP_001402108.2| pH-response regulator protein p... 34.7 7.1 gi|134074717|emb|CAK38902.1| unnamed protein product [Aspergillu... 34.7 7.4 gi|310791717|gb|EFQ27244.1| NACHT and TPR domain-containing prot... 34.7 8.5 >gi|15607752|ref|NP_215126.1| hypothetical protein Rv0612 [Mycobacterium tuberculosis H37Rv] gi|15840014|ref|NP_335051.1| hypothetical protein MT0642 [Mycobacterium tuberculosis CDC1551] gi|31791795|ref|NP_854288.1| hypothetical protein Mb0629 [Mycobacterium bovis AF2122/97] 60 more sequence titlesLength=201 Score = 392 bits (1008), Expect = 1e-107, Method: Compositional matrix adjust. Identities = 200/201 (99%), Positives = 201/201 (100%), Gaps = 0/201 (0%) Query 1 VLGPIRQPRLTVRPGRLPGMIAGVAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTAN 60 +LGPIRQPRLTVRPGRLPGMIAGVAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTAN Sbjct 1 MLGPIRQPRLTVRPGRLPGMIAGVAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTAN 60 Query 61 MRERIEAELASAGRARPARKIKPPADPDIVGWEVDEFVSLARSGAYLGGDRRVSPRERSR 120 MRERIEAELASAGRARPARKIKPPADPDIVGWEVDEFVSLARSGAYLGGDRRVSPRERSR Sbjct 61 MRERIEAELASAGRARPARKIKPPADPDIVGWEVDEFVSLARSGAYLGGDRRVSPRERSR 120 Query 121 WRFTFKRLAAEAQDALRAEDAEPAASALEQLIDLAREADGYDYFRSDDPVAAAGFVVSDV 180 WRFTFKRLAAEAQDALRAEDAEPAASALEQLIDLAREADGYDYFRSDDPVAAAGFVVSDV Sbjct 121 WRFTFKRLAAEAQDALRAEDAEPAASALEQLIDLAREADGYDYFRSDDPVAAAGFVVSDV 180 Query 181 AAAGHPHFREFAAEIGAAIPP 201 AAAGHPHFREFAAEIGAAIPP Sbjct 181 AAAGHPHFREFAAEIGAAIPP 201 >gi|308231600|ref|ZP_07413056.2| hypothetical protein TMAG_02492 [Mycobacterium tuberculosis SUMu001] gi|308369992|ref|ZP_07419888.2| hypothetical protein TMBG_03473 [Mycobacterium tuberculosis SUMu002] gi|308370461|ref|ZP_07421579.2| hypothetical protein TMCG_03818 [Mycobacterium tuberculosis SUMu003] 15 more sequence titles Length=182 Score = 356 bits (913), Expect = 1e-96, Method: Compositional matrix adjust. Identities = 182/182 (100%), Positives = 182/182 (100%), Gaps = 0/182 (0%) Query 20 MIAGVAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTANMRERIEAELASAGRARPAR 79 MIAGVAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTANMRERIEAELASAGRARPAR Sbjct 1 MIAGVAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTANMRERIEAELASAGRARPAR 60 Query 80 KIKPPADPDIVGWEVDEFVSLARSGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDALRAE 139 KIKPPADPDIVGWEVDEFVSLARSGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDALRAE Sbjct 61 KIKPPADPDIVGWEVDEFVSLARSGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDALRAE 120 Query 140 DAEPAASALEQLIDLAREADGYDYFRSDDPVAAAGFVVSDVAAAGHPHFREFAAEIGAAI 199 DAEPAASALEQLIDLAREADGYDYFRSDDPVAAAGFVVSDVAAAGHPHFREFAAEIGAAI Sbjct 121 DAEPAASALEQLIDLAREADGYDYFRSDDPVAAAGFVVSDVAAAGHPHFREFAAEIGAAI 180 Query 200 PP 201 PP Sbjct 181 PP 182 >gi|307083110|ref|ZP_07492223.1| hypothetical protein TMLG_03359 [Mycobacterium tuberculosis SUMu012] gi|308367186|gb|EFP56037.1| hypothetical protein TMLG_03359 [Mycobacterium tuberculosis SUMu012] Length=189 Score = 342 bits (878), Expect = 1e-92, Method: Compositional matrix adjust. Identities = 176/188 (94%), Positives = 177/188 (95%), Gaps = 0/188 (0%) Query 1 VLGPIRQPRLTVRPGRLPGMIAGVAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTAN 60 +LGPIRQPRLTVRPGRLPGMIAGVAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTAN Sbjct 1 MLGPIRQPRLTVRPGRLPGMIAGVAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTAN 60 Query 61 MRERIEAELASAGRARPARKIKPPADPDIVGWEVDEFVSLARSGAYLGGDRRVSPRERSR 120 MRERIEAELASAGRARPARKIKPPADPDIVGWEVDEFVSLARSGAYLGGDRRVSPRERSR Sbjct 61 MRERIEAELASAGRARPARKIKPPADPDIVGWEVDEFVSLARSGAYLGGDRRVSPRERSR 120 Query 121 WRFTFKRLAAEAQDALRAEDAEPAASALEQLIDLAREADGYDYFRSDDPVAAAGFVVSDV 180 WRFTFKRLAAEAQDALRAEDAEPAASALEQLIDLAREADGYDYFRSDDPVAAAGF Sbjct 121 WRFTFKRLAAEAQDALRAEDAEPAASALEQLIDLAREADGYDYFRSDDPVAAAGFRRVRC 180 Query 181 AAAGHPHF 188 G P Sbjct 181 GGGGPPTL 188 >gi|333989262|ref|YP_004521876.1| hypothetical protein JDM601_0622 [Mycobacterium sp. JDM601] gi|333485230|gb|AEF34622.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=256 Score = 171 bits (433), Expect = 5e-41, Method: Compositional matrix adjust. Identities = 105/161 (66%), Positives = 120/161 (75%), Gaps = 5/161 (3%) Query 24 VAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTANMRERIEAELASAGRARPARKIKP 83 +A RMNREQFF A SG D+ LRKALWNLYWRGTA++R RIE ELA G RPA IKP Sbjct 1 MAVDRMNREQFFSAMSGHDDAALRKALWNLYWRGTADVRRRIETELA--GDIRPA--IKP 56 Query 84 -PADPDIVGWEVDEFVSLARSGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDALRAEDAE 142 P DP V V EFV+LAR+GAYLG DRRVSP+ER+RWRFTFK+LA A +AL A + Sbjct 57 EPLDPQAVHDAVTEFVALARAGAYLGRDRRVSPKERTRWRFTFKQLATAAVEALHAGEPT 116 Query 143 PAASALEQLIDLAREADGYDYFRSDDPVAAAGFVVSDVAAA 183 PA +AL LIDL +A YFRS+DPV+AAGFVVSD AAA Sbjct 117 PAITALTLLIDLILDARDTYYFRSEDPVSAAGFVVSDAAAA 157 >gi|260907600|ref|ZP_05915922.1| hypothetical protein BlinB_19847 [Brevibacterium linens BL2] Length=299 Score = 119 bits (298), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 78/163 (48%), Positives = 103/163 (64%), Gaps = 10/163 (6%) Query 24 VAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTANMRERIEAELASAGRARPARKIKP 83 + AK+ +R QFFR SG D D L K LW LYWRG A RER+E EL + + Sbjct 1 MVAKKFDRTQFFRTTSGFDRDDLEKVLWTLYWRGDARTRERVE-ELIDPSQV----TVTA 55 Query 84 PADP--DIVGWEVDEFVSLARSGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDALRA--- 138 PA P ++V V EF +LAR+ AYL DRRVSP+ER+RWRFT+K A++ AL A Sbjct 56 PAPPSAEVVRRNVKEFAALARARAYLARDRRVSPKERTRWRFTYKDHFAQSFAALSAGTG 115 Query 139 EDAEPAASALEQLIDLAREADGYDYFRSDDPVAAAGFVVSDVA 181 E+ PA A+ LI LA E +G+DYFRS+DP+ A+ V+S++ Sbjct 116 EEIRPAVEAVSTLITLACETEGFDYFRSEDPIEASKVVISEMV 158 >gi|31791792|ref|NP_854285.1| hypothetical protein Mb0626 [Mycobacterium bovis AF2122/97] gi|57116759|ref|YP_177628.1| hypothetical protein Rv0609A [Mycobacterium tuberculosis H37Rv] gi|121636528|ref|YP_976751.1| hypothetical protein BCG_0656 [Mycobacterium bovis BCG str. Pasteur 1173P2] 27 more sequence titles Length=75 Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 26/35 (75%), Positives = 28/35 (80%), Gaps = 2/35 (5%) Query 152 IDLAR--EADGYDYFRSDDPVAAAGFVVSDVAAAG 184 + LAR EA GYDYFRSDDPVAAAGFVVS V + G Sbjct 22 VSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCG 56 >gi|15840010|ref|NP_335047.1| hypothetical protein MT0638.1 [Mycobacterium tuberculosis CDC1551] gi|13880153|gb|AAK44861.1| hypothetical protein MT0638.1 [Mycobacterium tuberculosis CDC1551] Length=57 Score = 49.3 bits (116), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 26/35 (75%), Positives = 28/35 (80%), Gaps = 2/35 (5%) Query 152 IDLAR--EADGYDYFRSDDPVAAAGFVVSDVAAAG 184 + LAR EA GYDYFRSDDPVAAAGFVVS V + G Sbjct 4 VSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCG 38 >gi|336115309|ref|YP_004570076.1| hypothetical protein BCO26_2632 [Bacillus coagulans 2-6] gi|335368739|gb|AEH54690.1| hypothetical protein BCO26_2632 [Bacillus coagulans 2-6] Length=337 Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 24/83 (29%), Positives = 41/83 (50%), Gaps = 1/83 (1%) Query 93 EVDEFVSLARSGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDA-LRAEDAEPAASALEQL 151 E+ +F+ A Y+ +R VS +ER +WRF K + Q + + + A LE+L Sbjct 71 EITQFIQNAYKQNYIAPNRYVSKKERPKWRFKVKSYIKQLQQVPVEGNEGKRATDLLEKL 130 Query 152 IDLAREADGYDYFRSDDPVAAAG 174 ++ GY F +D+P + G Sbjct 131 YEMLCYGCGYYIFNTDNPFRSIG 153 >gi|255280369|ref|ZP_05344924.1| hypothetical protein BRYFOR_05702 [Bryantella formatexigens DSM 14469] gi|255268834|gb|EET62039.1| hypothetical protein BRYFOR_05702 [Bryantella formatexigens DSM 14469] Length=344 Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust. Identities = 37/144 (26%), Positives = 63/144 (44%), Gaps = 3/144 (2%) Query 42 DEDRLRKALWNLYWRGTANMRERIEAELASAGRARPARKIK--PPADPDIVGWEVDEFVS 99 D + KA Y + T + +E I+ + R A K K + + E+++F+ Sbjct 14 DRSYVEKAFAESYKQLTKSQKEEIDPVIIDILEGREAEKKKKGSAVSFEKLEQEIEDFIE 73 Query 100 LARSGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDAL-RAEDAEPAASALEQLIDLAREA 158 A + Y +R + +R +WRF K E + L +E+ + A L +L L EA Sbjct 74 NAYAQNYFAPNRVIPKSQRPKWRFMVKNFIKELEKILVESENYDRAVKLLTELYKLICEA 133 Query 159 DGYDYFRSDDPVAAAGFVVSDVAA 182 Y F +DD + G+ D+ A Sbjct 134 CNYYLFSTDDAFRSIGWSQPDLFA 157 >gi|289441999|ref|ZP_06431743.1| conserved hypothetical protein [Mycobacterium tuberculosis T46] gi|289568544|ref|ZP_06448771.1| conserved hypothetical protein [Mycobacterium tuberculosis T17] gi|289749108|ref|ZP_06508486.1| conserved hypothetical protein [Mycobacterium tuberculosis T92] gi|289414918|gb|EFD12158.1| conserved hypothetical protein [Mycobacterium tuberculosis T46] gi|289542298|gb|EFD45946.1| conserved hypothetical protein [Mycobacterium tuberculosis T17] gi|289689695|gb|EFD57124.1| conserved hypothetical protein [Mycobacterium tuberculosis T92] Length=75 Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust. Identities = 25/35 (72%), Positives = 27/35 (78%), Gaps = 2/35 (5%) Query 152 IDLAR--EADGYDYFRSDDPVAAAGFVVSDVAAAG 184 + LAR EA GYDYFRS DPVAAAGFVVS V + G Sbjct 22 VSLARRCEAHGYDYFRSVDPVAAAGFVVSAVWSCG 56 >gi|229542672|ref|ZP_04431732.1| hypothetical protein BcoaDRAFT_5242 [Bacillus coagulans 36D1] gi|229327092|gb|EEN92767.1| hypothetical protein BcoaDRAFT_5242 [Bacillus coagulans 36D1] Length=340 Score = 45.1 bits (105), Expect = 0.007, Method: Compositional matrix adjust. Identities = 23/83 (28%), Positives = 40/83 (49%), Gaps = 1/83 (1%) Query 93 EVDEFVSLARSGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDA-LRAEDAEPAASALEQL 151 E+ +F+ A Y+ +R VS +ER +WRF K + Q + + + A L+ L Sbjct 71 EITQFIQNAYKQNYIAPNRYVSKKERPKWRFKVKSYIKQLQQVPVEGNEGKRATDLLDAL 130 Query 152 IDLAREADGYDYFRSDDPVAAAG 174 ++ GY F +D+P + G Sbjct 131 YEMLCYGCGYYIFNTDNPFRSIG 153 >gi|337769103|emb|CCB77816.1| putative FMN-dependent monooxygenase [Streptomyces cattleya NRRL 8057] Length=349 Score = 41.2 bits (95), Expect = 0.095, Method: Compositional matrix adjust. Identities = 55/162 (34%), Positives = 75/162 (47%), Gaps = 24/162 (14%) Query 3 GPIRQPRLTVRPGR--LPGMIAGVAAKRMNREQFFRAASGLDEDRLRKALWNLYWRGTAN 60 GP + +LTV P R +P IA + K N EQ A G AL L + + Sbjct 146 GPGKPLKLTVHPVREYIPLYIAAIGPK--NLEQTGEIADG--------AL--LIFPSAEH 193 Query 61 MRERIEAELASAGRARPARKIK----PPADPDIVGWEVDEFVSLAR--SGAYLGG--DRR 112 + E A L AGR R R ++ P P VG +VD L R + Y+GG R+ Sbjct 194 LAETALAPL-RAGRERAGRTLEGFDVCPTLPIAVGEDVDALADLFRPYTALYVGGMGSRK 252 Query 113 VSPRERSRWRFTFKRLAAEAQDALRAEDAEPAASAL-EQLID 153 + R R +++ AAE QD A D E AA+A+ +LID Sbjct 253 QNFYNRLAQRMGYEKAAAEIQDKYLAGDKEGAAAAVPRELID 294 >gi|124006831|ref|ZP_01691661.1| hypothetical protein M23134_06531 [Microscilla marina ATCC 23134] gi|123987512|gb|EAY27221.1| hypothetical protein M23134_06531 [Microscilla marina ATCC 23134] Length=336 Score = 38.5 bits (88), Expect = 0.57, Method: Compositional matrix adjust. Identities = 22/83 (27%), Positives = 38/83 (46%), Gaps = 1/83 (1%) Query 93 EVDEFVSLARSGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDALRAE-DAEPAASALEQL 151 E + F+ A+ G Y+ ++ V ++RS+WRF K+L R + D L L Sbjct 65 ETEVFILNAKEGNYIKPNQMVPKKDRSKWRFLVKQLYKALSKHNRPDKDLGLQVQLLSGL 124 Query 152 IDLAREADGYDYFRSDDPVAAAG 174 + +A+ YF + P + G Sbjct 125 YGVLCQAESLSYFTTQSPFNSVG 147 >gi|317038352|ref|XP_001402108.2| pH-response regulator protein palF/RIM8 [Aspergillus niger CBS 513.88] Length=739 Score = 34.7 bits (78), Expect = 7.1, Method: Compositional matrix adjust. Identities = 25/90 (28%), Positives = 41/90 (46%), Gaps = 5/90 (5%) Query 37 AASGLDEDRLRK-----ALWNLYWRGTANMRERIEAELASAGRARPARKIKPPADPDIVG 91 A + LD D++R+ A+ GT + + ++A+ + A P +PPA PD Sbjct 469 AGNILDTDQIRREKGVVAVMFEVVIGTRDSQRGVKAKERTPSTAAPVDPNQPPAGPDGET 528 Query 92 WEVDEFVSLARSGAYLGGDRRVSPRERSRW 121 W D+ + G Y+ + PRE S W Sbjct 529 WTTDQSPTPNAEGEYVAQEDYGFPREPSHW 558 >gi|134074717|emb|CAK38902.1| unnamed protein product [Aspergillus niger] Length=759 Score = 34.7 bits (78), Expect = 7.4, Method: Compositional matrix adjust. Identities = 25/90 (28%), Positives = 41/90 (46%), Gaps = 5/90 (5%) Query 37 AASGLDEDRLRK-----ALWNLYWRGTANMRERIEAELASAGRARPARKIKPPADPDIVG 91 A + LD D++R+ A+ GT + + ++A+ + A P +PPA PD Sbjct 489 AGNILDTDQIRREKGVVAVMFEVVIGTRDSQRGVKAKERTPSTAAPVDPNQPPAGPDGET 548 Query 92 WEVDEFVSLARSGAYLGGDRRVSPRERSRW 121 W D+ + G Y+ + PRE S W Sbjct 549 WTTDQSPTPNAEGEYVAQEDYGFPREPSHW 578 >gi|310791717|gb|EFQ27244.1| NACHT and TPR domain-containing protein [Glomerella graminicola M1.001] Length=1442 Score = 34.7 bits (78), Expect = 8.5, Method: Composition-based stats. Identities = 29/96 (31%), Positives = 44/96 (46%), Gaps = 12/96 (12%) Query 93 EVDEFVSLARSGAYLGGDRRVSPRERSRWRFTFKRLAA-------EAQDALRAEDAEPAA 145 E D +S+ S ++ R S R + +FK+LA E LR+E EP Sbjct 1138 EYDNAISIQESICFVEYKARGSLAVRVEYIASFKQLARAYALKALEVDQVLRSETVEPWV 1197 Query 146 SALEQLIDLAREADGYDYFRSDDPVAAAGFVVSDVA 181 LEQL++ R+ Y S+ P+ AGF ++ A Sbjct 1198 RKLEQLMEQQRK-----YQNSNVPLHMAGFDSNEAA 1228 Lambda K H 0.321 0.136 0.409 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 217214446392 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40