BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2658c
Length=120
Score E
Sequences producing significant alignments: (Bits) Value
gi|15842198|ref|NP_337235.1| hypothetical protein MT2734.1 [Myco... 251 3e-65
gi|15609795|ref|NP_217174.1| prophage protein [Mycobacterium tub... 251 3e-65
gi|253798260|ref|YP_003031261.1| phiRv2 phage protein [Mycobacte... 251 3e-65
gi|167967174|ref|ZP_02549451.1| hypothetical protein MtubH3_0368... 130 8e-29
gi|301307858|ref|ZP_07213814.1| two-component system sensor hist... 35.0 3.4
gi|262382720|ref|ZP_06075857.1| two-component system sensor hist... 35.0 3.4
gi|256838767|ref|ZP_05544277.1| two-component system sensor hist... 35.0 3.4
>gi|15842198|ref|NP_337235.1| hypothetical protein MT2734.1 [Mycobacterium tuberculosis CDC1551]
gi|254551712|ref|ZP_05142159.1| hypothetical protein Mtube_14849 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|294994247|ref|ZP_06799938.1| hypothetical protein Mtub2_06963 [Mycobacterium tuberculosis
210]
gi|13882486|gb|AAK47049.1| hypothetical protein MT2734.1 [Mycobacterium tuberculosis CDC1551]
Length=122
Score = 251 bits (641), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 120/120 (100%), Positives = 120/120 (100%), Gaps = 0/120 (0%)
Query 1 MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRA 60
MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRA
Sbjct 3 MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRA 62
Query 61 CRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPSIAEARHVIPFSALCLRLSQLGG 120
CRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPSIAEARHVIPFSALCLRLSQLGG
Sbjct 63 CRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPSIAEARHVIPFSALCLRLSQLGG 122
>gi|15609795|ref|NP_217174.1| prophage protein [Mycobacterium tuberculosis H37Rv]
gi|148662500|ref|YP_001284023.1| hypothetical protein MRA_2688 [Mycobacterium tuberculosis H37Ra]
gi|148823849|ref|YP_001288603.1| phiRv2 prophage protein [Mycobacterium tuberculosis F11]
27 more sequence titles
Length=120
Score = 251 bits (641), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 120/120 (100%), Positives = 120/120 (100%), Gaps = 0/120 (0%)
Query 1 MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRA 60
MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRA
Sbjct 1 MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRA 60
Query 61 CRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPSIAEARHVIPFSALCLRLSQLGG 120
CRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPSIAEARHVIPFSALCLRLSQLGG
Sbjct 61 CRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPSIAEARHVIPFSALCLRLSQLGG 120
>gi|253798260|ref|YP_003031261.1| phiRv2 phage protein [Mycobacterium tuberculosis KZN 1435]
gi|254232767|ref|ZP_04926094.1| hypothetical protein TBCG_02598 [Mycobacterium tuberculosis C]
gi|289746459|ref|ZP_06505837.1| phiRv2 prophage protein [Mycobacterium tuberculosis 02_1987]
25 more sequence titles
Length=121
Score = 251 bits (641), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 120/120 (100%), Positives = 120/120 (100%), Gaps = 0/120 (0%)
Query 1 MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRA 60
MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRA
Sbjct 2 MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRA 61
Query 61 CRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPSIAEARHVIPFSALCLRLSQLGG 120
CRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPSIAEARHVIPFSALCLRLSQLGG
Sbjct 62 CRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPSIAEARHVIPFSALCLRLSQLGG 121
>gi|167967174|ref|ZP_02549451.1| hypothetical protein MtubH3_03682 [Mycobacterium tuberculosis
H37Ra]
Length=95
Score = 130 bits (326), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 60/63 (96%), Positives = 60/63 (96%), Gaps = 0/63 (0%)
Query 1 MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRA 60
MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECR
Sbjct 2 MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRG 61
Query 61 CRK 63
K
Sbjct 62 MPK 64
>gi|301307858|ref|ZP_07213814.1| two-component system sensor histidine kinase [Bacteroides sp.
20_3]
gi|300834201|gb|EFK64815.1| two-component system sensor histidine kinase [Bacteroides sp.
20_3]
Length=767
Score = 35.0 bits (79), Expect = 3.4, Method: Composition-based stats.
Identities = 23/84 (28%), Positives = 38/84 (46%), Gaps = 6/84 (7%)
Query 5 VKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRACRKY 64
VKY DDE L++ I ER GH+ + T+W + I+ + + ++
Sbjct 486 VKYRSKTKWDDEWQTLLVTGIQVERDKEGHVTRYTGITINNTKWEKMIQ---QLKELKEK 542
Query 65 APISEMTAAAILDGFGAKLHELRT 88
A +S+ +A L HE+RT
Sbjct 543 AELSDRLKSAFLANMS---HEIRT 563
>gi|262382720|ref|ZP_06075857.1| two-component system sensor histidine kinase [Bacteroides sp.
2_1_33B]
gi|262295598|gb|EEY83529.1| two-component system sensor histidine kinase [Bacteroides sp.
2_1_33B]
Length=767
Score = 35.0 bits (79), Expect = 3.4, Method: Composition-based stats.
Identities = 23/84 (28%), Positives = 38/84 (46%), Gaps = 6/84 (7%)
Query 5 VKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRACRKY 64
VKY DDE L++ I ER GH+ + T+W + I+ + + ++
Sbjct 486 VKYRSKTKWDDEWQTLLVTGIQVERDKEGHVTRYTGITINNTKWEKMIQ---QLKELKEK 542
Query 65 APISEMTAAAILDGFGAKLHELRT 88
A +S+ +A L HE+RT
Sbjct 543 AELSDRLKSAFLANMS---HEIRT 563
>gi|256838767|ref|ZP_05544277.1| two-component system sensor histidine kinase [Parabacteroides
sp. D13]
gi|256739686|gb|EEU53010.1| two-component system sensor histidine kinase [Parabacteroides
sp. D13]
Length=657
Score = 35.0 bits (79), Expect = 3.4, Method: Composition-based stats.
Identities = 23/84 (28%), Positives = 38/84 (46%), Gaps = 6/84 (7%)
Query 5 VKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRACRKY 64
VKY DDE L++ I ER GH+ + T+W + I+ + + ++
Sbjct 376 VKYRSKTKWDDEWQTLLVTGIQVERDKEGHVTRYTGITINNTKWEKMIQ---QLKELKEK 432
Query 65 APISEMTAAAILDGFGAKLHELRT 88
A +S+ +A L HE+RT
Sbjct 433 AELSDRLKSAFLANMS---HEIRT 453
Lambda K H
0.323 0.136 0.438
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128530997826
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40