BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0964c Length=160 Score E Sequences producing significant alignments: (Bits) Value gi|15608104|ref|NP_215479.1| hypothetical protein Rv0964c [Mycob... 322 9e-87 gi|31792153|ref|NP_854646.1| hypothetical protein Mb0989c [Mycob... 320 6e-86 gi|15840390|ref|NP_335427.1| hypothetical protein MT0992.1 [Myco... 275 2e-72 gi|240169358|ref|ZP_04748017.1| hypothetical protein MkanA1_0859... 133 8e-30 gi|340625977|ref|YP_004744429.1| hypothetical protein MCAN_09681... 132 2e-29 gi|294996454|ref|ZP_06802145.1| hypothetical protein Mtub2_18604... 117 7e-25 gi|339640529|ref|ZP_08661973.1| RHS repeat-associated core domai... 38.1 0.43 gi|227833423|ref|YP_002835130.1| chromosome segregation protein ... 35.4 2.5 gi|262184413|ref|ZP_06043834.1| chromosome segregation protein [... 35.0 3.7 gi|301632042|ref|XP_002945100.1| PREDICTED: hypothetical protein... 34.7 5.4 gi|254428041|ref|ZP_05041748.1| undecaprenyl diphosphate synthas... 34.3 6.9 >gi|15608104|ref|NP_215479.1| hypothetical protein Rv0964c [Mycobacterium tuberculosis H37Rv] gi|148660744|ref|YP_001282267.1| hypothetical protein MRA_0971A [Mycobacterium tuberculosis H37Ra] gi|167967738|ref|ZP_02550015.1| hypothetical protein MtubH3_06758 [Mycobacterium tuberculosis H37Ra] 11 more sequence titlesLength=160 Score = 322 bits (826), Expect = 9e-87, Method: Compositional matrix adjust. Identities = 160/160 (100%), Positives = 160/160 (100%), Gaps = 0/160 (0%) Query 1 MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAA 60 MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAA Sbjct 1 MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAA 60 Query 61 REHHLTIAYATGTALPPPDLSSYSPADQQAILNTAIRRASNVCWPTPRPPMRIWPRRFDA 120 REHHLTIAYATGTALPPPDLSSYSPADQQAILNTAIRRASNVCWPTPRPPMRIWPRRFDA Sbjct 61 REHHLTIAYATGTALPPPDLSSYSPADQQAILNTAIRRASNVCWPTPRPPMRIWPRRFDA 120 Query 121 PPGPCRASRSMPNSAMRHPQCRRCRRRTATLRRSSGGGIR 160 PPGPCRASRSMPNSAMRHPQCRRCRRRTATLRRSSGGGIR Sbjct 121 PPGPCRASRSMPNSAMRHPQCRRCRRRTATLRRSSGGGIR 160 >gi|31792153|ref|NP_854646.1| hypothetical protein Mb0989c [Mycobacterium bovis AF2122/97] gi|121636890|ref|YP_977113.1| hypothetical protein BCG_1018c [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|148822173|ref|YP_001286927.1| hypothetical protein TBFG_10982 [Mycobacterium tuberculosis F11] 53 more sequence titles Length=160 Score = 320 bits (819), Expect = 6e-86, Method: Compositional matrix adjust. Identities = 159/160 (99%), Positives = 159/160 (99%), Gaps = 0/160 (0%) Query 1 MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAA 60 MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAA Sbjct 1 MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAA 60 Query 61 REHHLTIAYATGTALPPPDLSSYSPADQQAILNTAIRRASNVCWPTPRPPMRIWPRRFDA 120 REHHLTIAYATGTALPPPDLSSYSPADQQAILNTAIRRASNVCWPTPRPPMRIWPRRFDA Sbjct 61 REHHLTIAYATGTALPPPDLSSYSPADQQAILNTAIRRASNVCWPTPRPPMRIWPRRFDA 120 Query 121 PPGPCRASRSMPNSAMRHPQCRRCRRRTATLRRSSGGGIR 160 PPG CRASRSMPNSAMRHPQCRRCRRRTATLRRSSGGGIR Sbjct 121 PPGTCRASRSMPNSAMRHPQCRRCRRRTATLRRSSGGGIR 160 >gi|15840390|ref|NP_335427.1| hypothetical protein MT0992.1 [Mycobacterium tuberculosis CDC1551] gi|254231271|ref|ZP_04924598.1| hypothetical protein TBCG_00956 [Mycobacterium tuberculosis C] gi|13880558|gb|AAK45241.1| hypothetical protein MT0992.1 [Mycobacterium tuberculosis CDC1551] gi|124600330|gb|EAY59340.1| hypothetical protein TBCG_00956 [Mycobacterium tuberculosis C] Length=138 Score = 275 bits (702), Expect = 2e-72, Method: Compositional matrix adjust. Identities = 136/138 (99%), Positives = 137/138 (99%), Gaps = 0/138 (0%) Query 23 VLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAAREHHLTIAYATGTALPPPDLSS 82 +LLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAAREHHLTIAYATGTALPPPDLSS Sbjct 1 MLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAAREHHLTIAYATGTALPPPDLSS 60 Query 83 YSPADQQAILNTAIRRASNVCWPTPRPPMRIWPRRFDAPPGPCRASRSMPNSAMRHPQCR 142 YSPADQQAILNTAIRRASNVCWPTPRPPMRIWPRRFDAPPG CRASRSMPNSAMRHPQCR Sbjct 61 YSPADQQAILNTAIRRASNVCWPTPRPPMRIWPRRFDAPPGTCRASRSMPNSAMRHPQCR 120 Query 143 RCRRRTATLRRSSGGGIR 160 RCRRRTATLRRSSGGGIR Sbjct 121 RCRRRTATLRRSSGGGIR 138 >gi|240169358|ref|ZP_04748017.1| hypothetical protein MkanA1_08596 [Mycobacterium kansasii ATCC 12478] Length=546 Score = 133 bits (335), Expect = 8e-30, Method: Compositional matrix adjust. Identities = 73/96 (77%), Positives = 81/96 (85%), Gaps = 0/96 (0%) Query 1 MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAA 60 M LL FGG AA+AA ATH TT+LLD HA ACEAVARAAEK+AEEVAAIK+RL IRD+A Sbjct 23 MRLLDFGGDAADAALAATHRTTMLLDDHADACEAVARAAEKSAEEVAAIKLRLHAIRDSA 82 Query 61 REHHLTIAYATGTALPPPDLSSYSPADQQAILNTAI 96 R++HLTI ATGTALPP DL SYSPADQQ+ILN AI Sbjct 83 RDYHLTIDNATGTALPPADLWSYSPADQQSILNAAI 118 >gi|340625977|ref|YP_004744429.1| hypothetical protein MCAN_09681 [Mycobacterium canettii CIPT 140010059] gi|340004167|emb|CCC43305.1| hypothetical protein MCAN_09681 [Mycobacterium canettii CIPT 140010059] Length=163 Score = 132 bits (331), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 86/102 (85%), Positives = 90/102 (89%), Gaps = 0/102 (0%) Query 1 MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAA 60 M LL FGGAAAEAAQVATHHTTVLLD HAGACEAVARAAEKAA+EV AIKMRLQ+IRDAA Sbjct 1 MRLLDFGGAAAEAAQVATHHTTVLLDDHAGACEAVARAAEKAADEVTAIKMRLQLIRDAA 60 Query 61 REHHLTIAYATGTALPPPDLSSYSPADQQAILNTAIRRASNV 102 R +HL I ATGTALPPPDLSSYSPADQQAILNTAIR + Sbjct 61 RGYHLMIDDATGTALPPPDLSSYSPADQQAILNTAIRLTEGI 102 >gi|294996454|ref|ZP_06802145.1| hypothetical protein Mtub2_18604 [Mycobacterium tuberculosis 210] Length=65 Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust. Identities = 59/59 (100%), Positives = 59/59 (100%), Gaps = 0/59 (0%) Query 1 MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDA 59 MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDA Sbjct 1 MGLLGFGGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDA 59 >gi|339640529|ref|ZP_08661973.1| RHS repeat-associated core domain protein [Streptococcus sp. oral taxon 056 str. F0418] gi|339453798|gb|EGP66413.1| RHS repeat-associated core domain protein [Streptococcus sp. oral taxon 056 str. F0418] Length=3176 Score = 38.1 bits (87), Expect = 0.43, Method: Composition-based stats. Identities = 31/92 (34%), Positives = 44/92 (48%), Gaps = 13/92 (14%) Query 68 AYATGTALPPPDLSSYSPADQQAILNTAIRRASNVCWPTPRPPMRIWPRRFDAPPGPCRA 127 ++ATGT PP+LS A ++ N A +R S + PT + P R+ P FD R Sbjct 3075 SWATGTR--PPNLSP-EGAGRKGAFNEA-KRHSGI--PTSQSPDRVEPN-FD------RR 3121 Query 128 SRSMPNSAMRHPQCRRCRRRTATLRRSSGGGI 159 R P Q + R+ T T+R +GG I Sbjct 3122 GRRQPGRTYEFDQIQNGRKNTITIRDDAGGHI 3153 >gi|227833423|ref|YP_002835130.1| chromosome segregation protein [Corynebacterium aurimucosum ATCC 700975] gi|227454439|gb|ACP33192.1| chromosome segregation protein [Corynebacterium aurimucosum ATCC 700975] Length=1167 Score = 35.4 bits (80), Expect = 2.5, Method: Compositional matrix adjust. Identities = 26/60 (44%), Positives = 33/60 (55%), Gaps = 5/60 (8%) Query 8 GAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAAREHHLTI 67 GA AEA +VA L+H A E + A +AA E AA RL+ +RDA REH T+ Sbjct 425 GAQAEADEVADK-----LEHLAAEREPLEEAHVRAASESAAADKRLEQLRDAQREHERTV 479 >gi|262184413|ref|ZP_06043834.1| chromosome segregation protein [Corynebacterium aurimucosum ATCC 700975] Length=806 Score = 35.0 bits (79), Expect = 3.7, Method: Compositional matrix adjust. Identities = 26/60 (44%), Positives = 33/60 (55%), Gaps = 5/60 (8%) Query 8 GAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAAREHHLTI 67 GA AEA +VA L+H A E + A +AA E AA RL+ +RDA REH T+ Sbjct 425 GAQAEADEVADK-----LEHLAAEREPLEEAHVRAASESAAADKRLEQLRDAQREHERTV 479 >gi|301632042|ref|XP_002945100.1| PREDICTED: hypothetical protein LOC100486317 [Xenopus (Silurana) tropicalis] Length=1166 Score = 34.7 bits (78), Expect = 5.4, Method: Composition-based stats. Identities = 38/152 (25%), Positives = 56/152 (37%), Gaps = 7/152 (4%) Query 10 AAEAAQVATHHTTVLLDH---HAGACEAVARAAEKAAEEVAAIKMRLQVIRDAAREHHLT 66 AA+A +V L H A + +E+A A +++ A R Sbjct 35 AAQAGEVERRLIAELCLHPLPQGATSGAESPPSEEATSPAAPSSAPSALLKRAPRRGSAP 94 Query 67 IAYATGTALPPPDLSSYSPADQQ--AILNTAIRRASNVCWPTPRPPMRIWPRRFDAPPGP 124 GT +S +PA ++ A+ T+ RAS P P WP +PP P Sbjct 95 SGSKVGTQRKRNTGASKAPAKKRKPALSFTSPSRASASGSQDPALPQATWPAM--SPPAP 152 Query 125 CRASRSMPNSAMRHPQCRRCRRRTATLRRSSG 156 CR +S + + C TA RS G Sbjct 153 CRQQAPAASSGVSAVTPQSCNTVTAASDRSDG 184 >gi|254428041|ref|ZP_05041748.1| undecaprenyl diphosphate synthase [Alcanivorax sp. DG881] gi|196194210|gb|EDX89169.1| undecaprenyl diphosphate synthase [Alcanivorax sp. DG881] Length=260 Score = 34.3 bits (77), Expect = 6.9, Method: Compositional matrix adjust. Identities = 28/82 (35%), Positives = 43/82 (53%), Gaps = 5/82 (6%) Query 7 GGAAAEAAQVATHHTTVLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAAREHHLT 66 G A AE+ A + TV++ + G +A AA AE+VAA +++ + I + HL+ Sbjct 126 GMAEAESLTGANNRMTVVIAVNYGGQWDIAHAARIMAEKVAAGEIQPEQISAESMAPHLS 185 Query 67 IAYATGTALPPPDLSSYSPADQ 88 +A LPPPDL + DQ Sbjct 186 MA-----DLPPPDLLIRTGGDQ 202 Lambda K H 0.322 0.131 0.412 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 127750398496 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40