BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2288 Length=125 Score E Sequences producing significant alignments: (Bits) Value gi|15609425|ref|NP_216804.1| hypothetical protein Rv2288 [Mycoba... 251 3e-65 gi|340627292|ref|YP_004745744.1| hypothetical protein MCAN_23101... 248 2e-64 gi|313239781|emb|CBY14661.1| unnamed protein product [Oikopleura... 35.0 4.2 >gi|15609425|ref|NP_216804.1| hypothetical protein Rv2288 [Mycobacterium tuberculosis H37Rv] gi|15841779|ref|NP_336816.1| hypothetical protein MT2345.1 [Mycobacterium tuberculosis CDC1551] gi|31793466|ref|NP_855959.1| hypothetical protein Mb2310 [Mycobacterium bovis AF2122/97] 52 more sequence titlesLength=125 Score = 251 bits (641), Expect = 3e-65, Method: Compositional matrix adjust. Identities = 124/125 (99%), Positives = 125/125 (100%), Gaps = 0/125 (0%) Query 1 VSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP 60 +SRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP Sbjct 1 MSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP 60 Query 61 PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHEPSGARCPKA 120 PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHEPSGARCPKA Sbjct 61 PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHEPSGARCPKA 120 Query 121 DGQSV 125 DGQSV Sbjct 121 DGQSV 125 >gi|340627292|ref|YP_004745744.1| hypothetical protein MCAN_23101 [Mycobacterium canettii CIPT 140010059] gi|340005482|emb|CCC44642.1| hypothetical protein MCAN_23101 [Mycobacterium canettii CIPT 140010059] Length=125 Score = 248 bits (633), Expect = 2e-64, Method: Compositional matrix adjust. Identities = 123/125 (99%), Positives = 124/125 (99%), Gaps = 0/125 (0%) Query 1 VSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP 60 +SRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP Sbjct 1 MSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVDGRKLLP 60 Query 61 PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHEPSGARCPKA 120 PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHE SGARCPKA Sbjct 61 PARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQREPGWAPFGWLHELSGARCPKA 120 Query 121 DGQSV 125 DGQSV Sbjct 121 DGQSV 125 >gi|313239781|emb|CBY14661.1| unnamed protein product [Oikopleura dioica] Length=1286 Score = 35.0 bits (79), Expect = 4.2, Method: Compositional matrix adjust. Identities = 29/96 (31%), Positives = 44/96 (46%), Gaps = 8/96 (8%) Query 13 VQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCDGDVD---GRKLLPPARRTGTQQ 69 VQ+L F + ++S H R+ GC A + P R CD + + R L P T T++ Sbjct 354 VQILPTGFREVMTLSTHLQGRQTGCIHASVLPQRGGCDPEHEIMLTRSLFPDLPETTTKE 413 Query 70 ----RRIRPAAPRVYTTGDILRDRKGI-APWQEQRE 100 R + A + T R+RK + A QE R+ Sbjct 414 YNLNREVLLAVEKSKTVRMATRNRKVVQAQEQELRD 449 Lambda K H 0.322 0.137 0.461 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 130354689300 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40