BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1724c Length=139 Score E Sequences producing significant alignments: (Bits) Value gi|15608862|ref|NP_216240.1| hypothetical protein Rv1724c [Mycob... 288 1e-76 gi|340626729|ref|YP_004745181.1| hypothetical protein MCAN_17351... 287 4e-76 gi|289753814|ref|ZP_06513192.1| conserved hypothetical protein [... 286 5e-76 gi|254364561|ref|ZP_04980607.1| hypothetical protein TBHG_01681 ... 285 2e-75 gi|330817203|ref|YP_004360908.1| hypothetical protein bgla_1g232... 36.6 1.2 gi|110597193|ref|ZP_01385482.1| transcription antitermination fa... 36.2 1.6 gi|294673017|ref|YP_003573633.1| hypothetical protein PRU_0242 [... 35.4 2.6 gi|221486941|gb|EEE25187.1| conserved hypothetical protein [Toxo... 35.4 2.7 gi|221506628|gb|EEE32245.1| conserved hypothetical protein [Toxo... 35.4 2.8 gi|242399492|ref|YP_002994917.1| Glutamate synthase beta chain-r... 35.4 3.0 gi|20089624|ref|NP_615699.1| hypothetical protein MA0739 [Methan... 35.4 3.1 gi|220909077|ref|YP_002484388.1| hypothetical protein Cyan7425_3... 35.4 3.2 gi|237831827|ref|XP_002365211.1| hypothetical protein, conserved... 35.4 3.2 >gi|15608862|ref|NP_216240.1| hypothetical protein Rv1724c [Mycobacterium tuberculosis H37Rv] gi|15841186|ref|NP_336223.1| hypothetical protein MT1765 [Mycobacterium tuberculosis CDC1551] gi|31792912|ref|NP_855405.1| hypothetical protein Mb1753c [Mycobacterium bovis AF2122/97] 42 more sequence titlesLength=139 Score = 288 bits (738), Expect = 1e-76, Method: Compositional matrix adjust. Identities = 139/139 (100%), Positives = 139/139 (100%), Gaps = 0/139 (0%) Query 1 MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQ 60 MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQ Sbjct 1 MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQ 60 Query 61 RQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQAALDH 120 RQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQAALDH Sbjct 61 RQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQAALDH 120 Query 121 KLAEVRKRRMDTWDESYFR 139 KLAEVRKRRMDTWDESYFR Sbjct 121 KLAEVRKRRMDTWDESYFR 139 >gi|340626729|ref|YP_004745181.1| hypothetical protein MCAN_17351 [Mycobacterium canettii CIPT 140010059] gi|340004919|emb|CCC44067.1| hypothetical protein MCAN_17351 [Mycobacterium canettii CIPT 140010059] Length=139 Score = 287 bits (734), Expect = 4e-76, Method: Compositional matrix adjust. Identities = 138/139 (99%), Positives = 138/139 (99%), Gaps = 0/139 (0%) Query 1 MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQ 60 MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQ Sbjct 1 MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQ 60 Query 61 RQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQAALDH 120 RQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQ ALDH Sbjct 61 RQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQVALDH 120 Query 121 KLAEVRKRRMDTWDESYFR 139 KLAEVRKRRMDTWDESYFR Sbjct 121 KLAEVRKRRMDTWDESYFR 139 >gi|289753814|ref|ZP_06513192.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054] gi|289694401|gb|EFD61830.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054] Length=139 Score = 286 bits (733), Expect = 5e-76, Method: Compositional matrix adjust. Identities = 138/138 (100%), Positives = 138/138 (100%), Gaps = 0/138 (0%) Query 1 MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQ 60 MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQ Sbjct 1 MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQ 60 Query 61 RQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQAALDH 120 RQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQAALDH Sbjct 61 RQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQAALDH 120 Query 121 KLAEVRKRRMDTWDESYF 138 KLAEVRKRRMDTWDESYF Sbjct 121 KLAEVRKRRMDTWDESYF 138 >gi|254364561|ref|ZP_04980607.1| hypothetical protein TBHG_01681 [Mycobacterium tuberculosis str. Haarlem] gi|289554504|ref|ZP_06443714.1| hypothetical protein TBXG_02254 [Mycobacterium tuberculosis KZN 605] gi|298525222|ref|ZP_07012631.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] 27 more sequence titles Length=138 Score = 285 bits (729), Expect = 2e-75, Method: Compositional matrix adjust. Identities = 137/138 (99%), Positives = 138/138 (100%), Gaps = 0/138 (0%) Query 2 VGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQR 61 +GNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQR Sbjct 1 MGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQR 60 Query 62 QLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQAALDHK 121 QLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQAALDHK Sbjct 61 QLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQAALDHK 120 Query 122 LAEVRKRRMDTWDESYFR 139 LAEVRKRRMDTWDESYFR Sbjct 121 LAEVRKRRMDTWDESYFR 138 >gi|330817203|ref|YP_004360908.1| hypothetical protein bgla_1g23250 [Burkholderia gladioli BSR3] gi|327369596|gb|AEA60952.1| hypothetical protein bgla_1g23250 [Burkholderia gladioli BSR3] Length=127 Score = 36.6 bits (83), Expect = 1.2, Method: Compositional matrix adjust. Identities = 21/55 (39%), Positives = 30/55 (55%), Gaps = 1/55 (1%) Query 27 IGVYNGEQAIIVYDLRPVPHWPKYWIQAL-AKHFQRQLKPSPKIDISLLDDRIRF 80 I + E ++IV PV WP +AL A+HF+ L PSP++ I R+RF Sbjct 24 INLARHEHSVIVLPAAPVMAWPVDAFEALEARHFELLLDPSPEVVIFGSGARLRF 78 >gi|110597193|ref|ZP_01385482.1| transcription antitermination factor NusB [Chlorobium ferrooxidans DSM 13031] gi|110341384|gb|EAT59849.1| transcription antitermination factor NusB [Chlorobium ferrooxidans DSM 13031] Length=191 Score = 36.2 bits (82), Expect = 1.6, Method: Compositional matrix adjust. Identities = 16/50 (32%), Positives = 32/50 (64%), Gaps = 0/50 (0%) Query 83 FVSTDVSAKDLCKLDDAVYNAVRNAGRAIENEQAALDHKLAEVRKRRMDT 132 F STD S+K + + DA++N ++ G+ +N + +DH A+V+K +++ Sbjct 142 FTSTDKSSKFVNGILDAIFNELKAEGKVHKNGRGLIDHSTAKVQKPEIES 191 >gi|294673017|ref|YP_003573633.1| hypothetical protein PRU_0242 [Prevotella ruminicola 23] gi|294472325|gb|ADE81714.1| conserved hypothetical protein [Prevotella ruminicola 23] Length=513 Score = 35.4 bits (80), Expect = 2.6, Method: Compositional matrix adjust. Identities = 25/76 (33%), Positives = 37/76 (49%), Gaps = 7/76 (9%) Query 55 LAKHFQRQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIENE 114 L KHF R P PKI ++ DD++R ++ TD+ + L DA+ GR E Sbjct 59 LTKHFSRLQMPVPKI-LAASDDQLR---YLQTDLGSMSLF---DAIRGGREAGGRYTLKE 111 Query 115 QAALDHKLAEVRKRRM 130 Q L + E+ K +M Sbjct 112 QELLRRTIRELPKMQM 127 >gi|221486941|gb|EEE25187.1| conserved hypothetical protein [Toxoplasma gondii GT1] Length=1108 Score = 35.4 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 28/94 (30%), Positives = 44/94 (47%), Gaps = 1/94 (1%) Query 19 CFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQRQLKPSPKIDISLLDDRI 78 C + + + GE +IV D RPV + +QA +QR + ++SLL D++ Sbjct 1008 CLAESASVRTAGEGEYEVIVSDDRPVGAGVFWGMQAGEAFWQRPGSDAGLWEVSLLRDQV 1067 Query 79 RFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIE 112 + V DV K+L K N RN +A+E Sbjct 1068 PLDMHVGGDVRVKNLMKELTECLNR-RNQKKALE 1100 >gi|221506628|gb|EEE32245.1| conserved hypothetical protein [Toxoplasma gondii VEG] Length=1108 Score = 35.4 bits (80), Expect = 2.8, Method: Composition-based stats. Identities = 28/94 (30%), Positives = 44/94 (47%), Gaps = 1/94 (1%) Query 19 CFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQRQLKPSPKIDISLLDDRI 78 C + + + GE +IV D RPV + +QA +QR + ++SLL D++ Sbjct 1008 CLAESASVRTAGEGEYEVIVSDDRPVGAGVFWGMQAGEAFWQRPGSDAGLWEVSLLRDQV 1067 Query 79 RFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIE 112 + V DV K+L K N RN +A+E Sbjct 1068 PLDMHVGGDVRVKNLMKELTECLNR-RNQKKALE 1100 >gi|242399492|ref|YP_002994917.1| Glutamate synthase beta chain-related oxidoreductase, containing 2Fe- 2S and 4Fe-4S clusters [Thermococcus sibiricus MM 739] gi|242265886|gb|ACS90568.1| Glutamate synthase beta chain-related oxidoreductase, containing 2Fe- 2S and 4Fe-4S clusters [Thermococcus sibiricus MM 739] Length=963 Score = 35.4 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 20/71 (29%), Positives = 37/71 (53%), Gaps = 5/71 (7%) Query 29 VYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQRQLKPSPKIDISLLDDRIRFSVFVSTDV 88 +Y+ + +++DLRP HW K + +H +R+ P++ + LLD IR S F + Sbjct 522 IYDEDLYRVLFDLRPYNHWKKV-TEKDYEHVERK----PRVKVKLLDPEIRKSNFKEVEP 576 Query 89 SAKDLCKLDDA 99 + + L +A Sbjct 577 TMDEETVLTEA 587 >gi|20089624|ref|NP_615699.1| hypothetical protein MA0739 [Methanosarcina acetivorans C2A] gi|19914545|gb|AAM04179.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length=219 Score = 35.4 bits (80), Expect = 3.1, Method: Compositional matrix adjust. Identities = 23/73 (32%), Positives = 40/73 (55%), Gaps = 6/73 (8%) Query 3 GNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQRQ 62 G E+ L+ LRN++ + A V +G+ A+ + D+ V H Y++ A K + RQ Sbjct 19 GGMEDPLEFLRNIK------SVAVATVDDGKPAVRMSDVMLVEHEKLYFLTARGKPYYRQ 72 Query 63 LKPSPKIDISLLD 75 LK +P+I + +D Sbjct 73 LKKNPEIALVGMD 85 >gi|220909077|ref|YP_002484388.1| hypothetical protein Cyan7425_3708 [Cyanothece sp. PCC 7425] gi|219865688|gb|ACL46027.1| hypothetical protein Cyan7425_3708 [Cyanothece sp. PCC 7425] Length=207 Score = 35.4 bits (80), Expect = 3.2, Method: Compositional matrix adjust. Identities = 14/33 (43%), Positives = 23/33 (70%), Gaps = 0/33 (0%) Query 12 LRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPV 44 L+N P FS A+ P+G+ N ++ +IV +LRP+ Sbjct 135 LKNGELPVFSAAQLPVGLTNDDKVMIVGELRPI 167 >gi|237831827|ref|XP_002365211.1| hypothetical protein, conserved [Toxoplasma gondii ME49] gi|211962875|gb|EEA98070.1| hypothetical protein, conserved [Toxoplasma gondii ME49] Length=1108 Score = 35.4 bits (80), Expect = 3.2, Method: Composition-based stats. Identities = 28/94 (30%), Positives = 44/94 (47%), Gaps = 1/94 (1%) Query 19 CFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKYWIQALAKHFQRQLKPSPKIDISLLDDRI 78 C + + + GE +IV D RPV + +QA +QR + ++SLL D++ Sbjct 1008 CLAESASVRTAGEGEYEVIVSDDRPVGAGVFWGMQAGEAFWQRPGSDAGLWEVSLLRDQV 1067 Query 79 RFSVFVSTDVSAKDLCKLDDAVYNAVRNAGRAIE 112 + V DV K+L K N RN +A+E Sbjct 1068 PLDMHVGGDVRVKNLMKELTECLNR-RNQEKALE 1100 Lambda K H 0.321 0.136 0.416 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 131443546824 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40