BLASTP 2.2.25+


Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.



Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters



Query= Rv2288

Length=125
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|15609425|ref|NP_216804.1|  hypothetical protein Rv2288 [Mycoba...   251    3e-65
gi|340627292|ref|YP_004745744.1|  hypothetical protein MCAN_23101...   248    2e-64
gi|313239781|emb|CBY14661.1|  unnamed protein product [Oikopleura...  35.0    4.2  


>gi|15609425|ref|NP_216804.1| hypothetical protein Rv2288 [Mycobacterium tuberculosis H37Rv]
 gi|15841779|ref|NP_336816.1| hypothetical protein MT2345.1 [Mycobacterium tuberculosis CDC1551]
 gi|31793466|ref|NP_855959.1| hypothetical protein Mb2310 [Mycobacterium bovis AF2122/97]
 52 more sequence titles
 Length=125

 Score =  251 bits (641),  Expect = 3e-65, Method: Compositional matrix adjust.
 Identities = 124/125 (99%), Positives = 125/125 (100%), Gaps = 0/125 (0%)

Query  1    VSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP  60
            +SRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP
Sbjct  1    MSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP  60

Query  61   PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHEPSGARCPKA  120
            PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHEPSGARCPKA
Sbjct  61   PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHEPSGARCPKA  120

Query  121  DGQSV  125
            DGQSV
Sbjct  121  DGQSV  125


>gi|340627292|ref|YP_004745744.1| hypothetical protein MCAN_23101 [Mycobacterium canettii CIPT 
140010059]
 gi|340005482|emb|CCC44642.1| hypothetical protein MCAN_23101 [Mycobacterium canettii CIPT 
140010059]
Length=125

 Score =  248 bits (633),  Expect = 2e-64, Method: Compositional matrix adjust.
 Identities = 123/125 (99%), Positives = 124/125 (99%), Gaps = 0/125 (0%)

Query  1    VSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP  60
            +SRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP
Sbjct  1    MSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP  60

Query  61   PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHEPSGARCPKA  120
            PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHE SGARCPKA
Sbjct  61   PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHELSGARCPKA  120

Query  121  DGQSV  125
            DGQSV
Sbjct  121  DGQSV  125


>gi|313239781|emb|CBY14661.1| unnamed protein product [Oikopleura dioica]
Length=1286

 Score = 35.0 bits (79),  Expect = 4.2, Method: Compositional matrix adjust.
 Identities = 29/96 (31%), Positives = 44/96 (46%), Gaps = 8/96 (8%)

Query  13   VQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVD---GRKLLPPARRTGTQQ  69
            VQ+L   F +  ++S H   R+ GC  A + P R  CD + +    R L P    T T++
Sbjct  354  VQILPTGFREVMTLSTHLQGRQTGCIHASVLPQRGGCDPEHEIMLTRSLFPDLPETTTKE  413

Query  70   ----RRIRPAAPRVYTTGDILRDRKGI-APWQEQRE  100
                R +  A  +  T     R+RK + A  QE R+
Sbjct  414  YNLNREVLLAVEKSKTVRMATRNRKVVQAQEQELRD  449



Lambda     K      H
   0.322    0.137    0.461 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Effective search space used: 130354689300


  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011  4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318



Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40