BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2126c
Length=256
Score E
Sequences producing significant alignments: (Bits) Value
gi|57116949|ref|YP_177862.1| PE-PGRS family protein [Mycobacteri... 390 1e-106
gi|340627135|ref|YP_004745587.1| PE-PGRS family protein [Mycobac... 381 5e-104
gi|148823337|ref|YP_001288091.1| PE-PGRS family protein [Mycobac... 56.2 5e-06
gi|31793306|ref|NP_855799.1| hypothetical protein Mb2150c [Mycob... 55.8 5e-06
gi|289447746|ref|ZP_06437490.1| predicted protein [Mycobacterium... 52.8 5e-05
>gi|57116949|ref|YP_177862.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Rv]
gi|121638008|ref|YP_978232.1| PE-PGRS family protein [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|148661941|ref|YP_001283464.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Ra]
13 more sequence titles
Length=256
Score = 390 bits (1001), Expect = 1e-106, Method: Compositional matrix adjust.
Identities = 255/256 (99%), Positives = 256/256 (100%), Gaps = 0/256 (0%)
Query 1 LIGDGANGGPGQPGGPGGLLYGNGGHGGAGAAGQDRGAGNSAGLIGNGGAGGAGGNGGIG 60
+IGDGANGGPGQPGGPGGLLYGNGGHGGAGAAGQDRGAGNSAGLIGNGGAGGAGGNGGIG
Sbjct 1 MIGDGANGGPGQPGGPGGLLYGNGGHGGAGAAGQDRGAGNSAGLIGNGGAGGAGGNGGIG 60
Query 61 GAGAPGGLGGDGGKGGFADEFTGGFAQGGRGGFGGNGNTGASGGMGGAGGAGGAGGAGGL 120
GAGAPGGLGGDGGKGGFADEFTGGFAQGGRGGFGGNGNTGASGGMGGAGGAGGAGGAGGL
Sbjct 61 GAGAPGGLGGDGGKGGFADEFTGGFAQGGRGGFGGNGNTGASGGMGGAGGAGGAGGAGGL 120
Query 121 LIGDGGAGGAGGIGGAGGVGGGGGAGGTGGGGVASAFGGGNAFGGRGGDGGDGGDGGTGG 180
LIGDGGAGGAGGIGGAGGVGGGGGAGGTGGGGVASAFGGGNAFGGRGGDGGDGGDGGTGG
Sbjct 121 LIGDGGAGGAGGIGGAGGVGGGGGAGGTGGGGVASAFGGGNAFGGRGGDGGDGGDGGTGG 180
Query 181 AGGARGAGGAGGAGGWLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKA 240
AGGARGAGGAGGAGGWLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKA
Sbjct 181 AGGARGAGGAGGAGGWLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKA 240
Query 241 GAPGTQGDSGDPGPPG 256
GAPGTQGDSGDPGPPG
Sbjct 241 GAPGTQGDSGDPGPPG 256
>gi|340627135|ref|YP_004745587.1| PE-PGRS family protein [Mycobacterium canettii CIPT 140010059]
gi|340005325|emb|CCC44482.1| PE-PGRS family protein [Mycobacterium canettii CIPT 140010059]
Length=256
Score = 381 bits (979), Expect = 5e-104, Method: Compositional matrix adjust.
Identities = 252/256 (99%), Positives = 253/256 (99%), Gaps = 0/256 (0%)
Query 1 LIGDGANGGPGQPGGPGGLLYGNGGHGGAGAAGQDRGAGNSAGLIGNGGAGGAGGNGGIG 60
+IGDGANGGPGQPGGPGGLLYGNGGHGGAGAAGQDRGAGNSAGLIGNGGAGGAGGNGGIG
Sbjct 1 MIGDGANGGPGQPGGPGGLLYGNGGHGGAGAAGQDRGAGNSAGLIGNGGAGGAGGNGGIG 60
Query 61 GAGAPGGLGGDGGKGGFADEFTGGFAQGGRGGFGGNGNTGASGGMGGAGGAGGAGGAGGL 120
GAGAPGGLGGDGGKGGFADEFTGGFAQGGRGGFGGNGNTGASGGMGGAGGAGGAGGAGGL
Sbjct 61 GAGAPGGLGGDGGKGGFADEFTGGFAQGGRGGFGGNGNTGASGGMGGAGGAGGAGGAGGL 120
Query 121 LIGDGGAGGAGGIGGAGGVGGGGGAGGTGGGGVASAFGGGNAFGGRGGDGGDGGDGGTGG 180
LIGDGGAGGAGGIGGAGGVGGGGGAGGTGGGGVASAFGGGNAFGGRGGDGGDGGDGGTGG
Sbjct 121 LIGDGGAGGAGGIGGAGGVGGGGGAGGTGGGGVASAFGGGNAFGGRGGDGGDGGDGGTGG 180
Query 181 AGGARGAGGAGGAGGWLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKA 240
AG A GAGGAGGAGGWLSGHSGAHGAMG GGEGGAGGGGGARGEAGAGGGTSTGTNPGKA
Sbjct 181 AGAAGGAGGAGGAGGWLSGHSGAHGAMGGGGEGGAGGGGGARGEAGAGGGTSTGTNPGKA 240
Query 241 GAPGTQGDSGDPGPPG 256
GAPGTQGDSGDPGPPG
Sbjct 241 GAPGTQGDSGDPGPPG 256
>gi|148823337|ref|YP_001288091.1| PE-PGRS family protein [Mycobacterium tuberculosis F11]
gi|148721864|gb|ABR06489.1| PE-PGRS family protein [Mycobacterium tuberculosis F11]
Length=362
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 61/61 (100%), Positives = 61/61 (100%), Gaps = 0/61 (0%)
Query 196 WLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKAGAPGTQGDSGDPGPP 255
WLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKAGAPGTQGDSGDPGPP
Sbjct 302 WLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKAGAPGTQGDSGDPGPP 361
Query 256 G 256
G
Sbjct 362 G 362
>gi|31793306|ref|NP_855799.1| hypothetical protein Mb2150c [Mycobacterium bovis AF2122/97]
gi|289554088|ref|ZP_06443298.1| predicted protein [Mycobacterium tuberculosis KZN 605]
gi|31618898|emb|CAD97003.1| conserved hypothetical protein, PE_PGRS [Mycobacterium bovis
AF2122/97]
gi|289438720|gb|EFD21213.1| predicted protein [Mycobacterium tuberculosis KZN 605]
gi|328458573|gb|AEB03996.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 4207]
Length=355
Score = 55.8 bits (133), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 61/61 (100%), Positives = 61/61 (100%), Gaps = 0/61 (0%)
Query 196 WLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKAGAPGTQGDSGDPGPP 255
WLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKAGAPGTQGDSGDPGPP
Sbjct 295 WLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKAGAPGTQGDSGDPGPP 354
Query 256 G 256
G
Sbjct 355 G 355
>gi|289447746|ref|ZP_06437490.1| predicted protein [Mycobacterium tuberculosis CPHL_A]
gi|289420704|gb|EFD17905.1| predicted protein [Mycobacterium tuberculosis CPHL_A]
Length=118
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 61/61 (100%), Positives = 61/61 (100%), Gaps = 0/61 (0%)
Query 196 WLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKAGAPGTQGDSGDPGPP 255
WLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKAGAPGTQGDSGDPGPP
Sbjct 58 WLSGHSGAHGAMGSGGEGGAGGGGGARGEAGAGGGTSTGTNPGKAGAPGTQGDSGDPGPP 117
Query 256 G 256
G
Sbjct 118 G 118
Lambda K H
0.307 0.147 0.468
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 377837056800
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40