BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3344c Length=484 Score E Sequences producing significant alignments: (Bits) Value gi|57117093|ref|YP_177961.1| PE-PGRS family protein [Mycobacteri... 711 0.0 >gi|57117093|ref|YP_177961.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Rv] gi|7442084|pir||G70846 hypothetical glycine-rich protein Rv3344c - Mycobacterium tuberculosis (strain H37RV) gi|38490358|emb|CAE55586.1| PE-PGRS FAMILY PROTEIN [Mycobacterium tuberculosis H37Rv] Length=484 Score = 711 bits (1834), Expect = 0.0, Method: Compositional matrix adjust. Identities = 484/484 (100%), Positives = 484/484 (100%), Gaps = 0/484 (0%) Query 1 AQASPAAHGGSGGAGGNGGAGSAGNGGAGGAGGNGGAGGNGGGGDAGNAGSGGNGGKGGD 60 AQASPAAHGGSGGAGGNGGAGSAGNGGAGGAGGNGGAGGNGGGGDAGNAGSGGNGGKGGD Sbjct 1 AQASPAAHGGSGGAGGNGGAGSAGNGGAGGAGGNGGAGGNGGGGDAGNAGSGGNGGKGGD 60 Query 61 GVGPGSTGGAGGKGGAGANGGSSNGNARGGNAGNGGHGGAGGSGDTGGAGGAGGQGGFGG 120 GVGPGSTGGAGGKGGAGANGGSSNGNARGGNAGNGGHGGAGGSGDTGGAGGAGGQGGFGG Sbjct 61 GVGPGSTGGAGGKGGAGANGGSSNGNARGGNAGNGGHGGAGGSGDTGGAGGAGGQGGFGG 120 Query 121 TGGSGSGIGGGAGGNGGNGGAGGTGVVLGGKGGDGGNGDHGGPATNPGSGSRGGAGGSGG 180 TGGSGSGIGGGAGGNGGNGGAGGTGVVLGGKGGDGGNGDHGGPATNPGSGSRGGAGGSGG Sbjct 121 TGGSGSGIGGGAGGNGGNGGAGGTGVVLGGKGGDGGNGDHGGPATNPGSGSRGGAGGSGG 180 Query 181 NGGAGGNATGSGGKGGAGGNGGDGSFGATSGPASIGVTGAPGGNGGKGGAGGSNPNGSGG 240 NGGAGGNATGSGGKGGAGGNGGDGSFGATSGPASIGVTGAPGGNGGKGGAGGSNPNGSGG Sbjct 181 NGGAGGNATGSGGKGGAGGNGGDGSFGATSGPASIGVTGAPGGNGGKGGAGGSNPNGSGG 240 Query 241 DGGKGGNGGAGGNGGSIGANSGIVGGSGGAGGAGGAGGNGSLSSGEGGKGGDGGHGGDGV 300 DGGKGGNGGAGGNGGSIGANSGIVGGSGGAGGAGGAGGNGSLSSGEGGKGGDGGHGGDGV Sbjct 241 DGGKGGNGGAGGNGGSIGANSGIVGGSGGAGGAGGAGGNGSLSSGEGGKGGDGGHGGDGV 300 Query 301 GGNSSVTQGGSGGGGGAGGAGGSGFFGGKGGFGGDGGQGGPNGGGTVGTVAGGGGNGGVG 360 GGNSSVTQGGSGGGGGAGGAGGSGFFGGKGGFGGDGGQGGPNGGGTVGTVAGGGGNGGVG Sbjct 301 GGNSSVTQGGSGGGGGAGGAGGSGFFGGKGGFGGDGGQGGPNGGGTVGTVAGGGGNGGVG 360 Query 361 GRGGDGVFAGAGGQGGLGGQGGNGGGSTGGNGGLGGAGGGGGNAPDGGFGGNGGKGGQGG 420 GRGGDGVFAGAGGQGGLGGQGGNGGGSTGGNGGLGGAGGGGGNAPDGGFGGNGGKGGQGG Sbjct 361 GRGGDGVFAGAGGQGGLGGQGGNGGGSTGGNGGLGGAGGGGGNAPDGGFGGNGGKGGQGG 420 Query 421 IGGGTQSATGLGGDGGDGGDGGNGGNSGAKAGGAGGKGQAGQPNSGTEPGFGGDGGLGGA 480 IGGGTQSATGLGGDGGDGGDGGNGGNSGAKAGGAGGKGQAGQPNSGTEPGFGGDGGLGGA Sbjct 421 IGGGTQSATGLGGDGGDGGDGGNGGNSGAKAGGAGGKGQAGQPNSGTEPGFGGDGGLGGA 480 Query 481 GATP 484 GATP Sbjct 481 GATP 484 Lambda K H 0.302 0.143 0.445 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 1029114582640 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40