BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3891c Length=107 Score E Sequences producing significant alignments: (Bits) Value gi|15611027|ref|NP_218408.1| ESAT-6 like protein EsxD [Mycobacte... 215 1e-54 gi|294995576|ref|ZP_06801267.1| esat-6 like protein esxD [Mycoba... 214 5e-54 gi|31795064|ref|NP_857557.1| hypothetical protein Mb3920c [Mycob... 213 6e-54 gi|240168371|ref|ZP_04747030.1| hypothetical protein MkanA1_0360... 171 4e-41 gi|336457493|gb|EGO36500.1| WXG repeat protein [Mycobacterium av... 162 1e-38 gi|118466409|ref|YP_879447.1| hypothetical protein MAV_0153 [Myc... 162 2e-38 gi|296167007|ref|ZP_06849420.1| conserved hypothetical protein [... 160 7e-38 gi|8919126|emb|CAB96048.1| hypothetical protein [Mycobacterium a... 158 3e-37 gi|41406258|ref|NP_959094.1| hypothetical protein MAP0160 [Mycob... 156 9e-37 gi|342860126|ref|ZP_08716778.1| hypothetical protein MCOL_14645 ... 147 6e-34 gi|254773209|ref|ZP_05214725.1| hypothetical protein MaviaA2_008... 131 3e-29 gi|254821253|ref|ZP_05226254.1| hypothetical protein MintA_15047... 130 4e-29 gi|333988694|ref|YP_004521308.1| ESAT-6 like protein EsxD [Mycob... 105 2e-21 gi|332243340|ref|XP_003270836.1| PREDICTED: uncharacterized prot... 37.7 0.65 gi|329954846|ref|ZP_08295863.1| tetratricopeptide repeat protein... 35.4 2.5 >gi|15611027|ref|NP_218408.1| ESAT-6 like protein EsxD [Mycobacterium tuberculosis H37Rv] gi|15843522|ref|NP_338559.1| hypothetical protein MT4006 [Mycobacterium tuberculosis CDC1551] gi|148663758|ref|YP_001285281.1| putative esat-6 like protein EsxD [Mycobacterium tuberculosis H37Ra] 55 more sequence titlesLength=107 Score = 215 bits (548), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 106/107 (99%), Positives = 107/107 (100%), Gaps = 0/107 (0%) Query 1 VADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE 60 +ADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE Sbjct 1 MADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE 60 Query 61 ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS Sbjct 61 ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 >gi|294995576|ref|ZP_06801267.1| esat-6 like protein esxD [Mycobacterium tuberculosis 210] Length=107 Score = 214 bits (544), Expect = 5e-54, Method: Compositional matrix adjust. Identities = 105/107 (99%), Positives = 106/107 (99%), Gaps = 0/107 (0%) Query 1 VADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE 60 +ADTIQVTPQMLRST NDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE Sbjct 1 MADTIQVTPQMLRSTGNDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE 60 Query 61 ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS Sbjct 61 ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 >gi|31795064|ref|NP_857557.1| hypothetical protein Mb3920c [Mycobacterium bovis AF2122/97] gi|121639802|ref|YP_980026.1| hypothetical protein BCG_3947c [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224992297|ref|YP_002646987.1| hypothetical protein JTY_3949 [Mycobacterium bovis BCG str. Tokyo 172] 18 more sequence titles Length=107 Score = 213 bits (543), Expect = 6e-54, Method: Compositional matrix adjust. Identities = 105/107 (99%), Positives = 106/107 (99%), Gaps = 0/107 (0%) Query 1 VADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE 60 +ADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSG GVVASHMTATE Sbjct 1 MADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGAGVVASHMTATE 60 Query 61 ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS Sbjct 61 ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 >gi|240168371|ref|ZP_04747030.1| hypothetical protein MkanA1_03602 [Mycobacterium kansasii ATCC 12478] Length=97 Score = 171 bits (432), Expect = 4e-41, Method: Compositional matrix adjust. Identities = 84/97 (87%), Positives = 87/97 (90%), Gaps = 0/97 (0%) Query 11 MLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELNKVLT 70 MLRSTA+DIQANME AM IA+GYLANQENVMNPATWSG GVVASH TATE+ NELNKVLT Sbjct 1 MLRSTAHDIQANMEHAMAIAQGYLANQENVMNPATWSGAGVVASHATATEVANELNKVLT 60 Query 71 GGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 GGTRLAEGL QAAALME HEADSQ AFQALFG HGS Sbjct 61 GGTRLAEGLTQAAALMESHEADSQHAFQALFGGGHGS 97 >gi|336457493|gb|EGO36500.1| WXG repeat protein [Mycobacterium avium subsp. paratuberculosis S397] Length=107 Score = 162 bits (410), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 80/104 (77%), Positives = 88/104 (85%), Gaps = 1/104 (0%) Query 4 TIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITN 63 TI+VTPQMLR T+N IQANME A+GI +GY+ANQENVMNPATWSG V ASH TA E+ N Sbjct 5 TIKVTPQMLRDTSNAIQANMEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQN 64 Query 64 ELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 +LNKVLTGGTRLAEGL +AAALMEGHEADS AF ALFG HGS Sbjct 65 DLNKVLTGGTRLAEGLTKAAALMEGHEADSSHAFSALFGG-HGS 107 >gi|118466409|ref|YP_879447.1| hypothetical protein MAV_0153 [Mycobacterium avium 104] gi|118167696|gb|ABK68593.1| conserved hypothetical protein [Mycobacterium avium 104] Length=104 Score = 162 bits (410), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 80/104 (77%), Positives = 88/104 (85%), Gaps = 1/104 (0%) Query 4 TIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITN 63 TI+VTPQMLR T+N IQANME A+GI +GY+ANQENVMNPATWSG V ASH TA E+ N Sbjct 2 TIKVTPQMLRDTSNAIQANMEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQN 61 Query 64 ELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 +LNKVLTGGTRLAEGL +AAALMEGHEADS AF ALFG HGS Sbjct 62 DLNKVLTGGTRLAEGLTKAAALMEGHEADSSHAFSALFGG-HGS 104 >gi|296167007|ref|ZP_06849420.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295897637|gb|EFG77230.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=104 Score = 160 bits (405), Expect = 7e-38, Method: Compositional matrix adjust. Identities = 79/103 (77%), Positives = 87/103 (85%), Gaps = 2/103 (1%) Query 4 TIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITN 63 TI+VTPQMLR T+N IQANME A+GI +GY+ANQENVMNP+TWSG VVASH TA E+ N Sbjct 2 TIKVTPQMLRDTSNAIQANMEHAIGIGQGYVANQENVMNPSTWSGDAVVASHATAIEVQN 61 Query 64 ELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHG 106 +LNKVL GGTRLAEGL QAAALMEGHEADS AF ALFG HG Sbjct 62 DLNKVLNGGTRLAEGLKQAAALMEGHEADSSHAFSALFG--HG 102 >gi|8919126|emb|CAB96048.1| hypothetical protein [Mycobacterium avium subsp. paratuberculosis] Length=100 Score = 158 bits (399), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 78/101 (78%), Positives = 85/101 (85%), Gaps = 1/101 (0%) Query 7 VTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELN 66 VTPQMLR T+N IQANME A+GI +GY+ANQENVMNPATWSG V ASH TA E+ N+LN Sbjct 1 VTPQMLRDTSNAIQANMEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQNDLN 60 Query 67 KVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 KVLTGGTRLAEGL +AAALMEGHEADS AF ALFG HGS Sbjct 61 KVLTGGTRLAEGLTKAAALMEGHEADSSHAFSALFGG-HGS 100 >gi|41406258|ref|NP_959094.1| hypothetical protein MAP0160 [Mycobacterium avium subsp. paratuberculosis K-10] gi|41394606|gb|AAS02477.1| hypothetical protein MAP_0160 [Mycobacterium avium subsp. paratuberculosis K-10] Length=100 Score = 156 bits (395), Expect = 9e-37, Method: Compositional matrix adjust. Identities = 77/101 (77%), Positives = 85/101 (85%), Gaps = 1/101 (0%) Query 7 VTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELN 66 +TPQMLR T+N IQANME A+GI +GY+ANQENVMNPATWSG V ASH TA E+ N+LN Sbjct 1 MTPQMLRDTSNAIQANMEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQNDLN 60 Query 67 KVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 KVLTGGTRLAEGL +AAALMEGHEADS AF ALFG HGS Sbjct 61 KVLTGGTRLAEGLTKAAALMEGHEADSSHAFSALFGG-HGS 100 >gi|342860126|ref|ZP_08716778.1| hypothetical protein MCOL_14645 [Mycobacterium colombiense CECT 3035] gi|342132504|gb|EGT85733.1| hypothetical protein MCOL_14645 [Mycobacterium colombiense CECT 3035] Length=96 Score = 147 bits (370), Expect = 6e-34, Method: Compositional matrix adjust. Identities = 72/97 (75%), Positives = 81/97 (84%), Gaps = 1/97 (1%) Query 11 MLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELNKVLT 70 MLR +N IQANME A+GI +GY+ANQENVMNP+TWSG+ V ASH TA E+ N+LNKVLT Sbjct 1 MLRDASNAIQANMEHAIGIGQGYVANQENVMNPSTWSGSAVTASHATAIEVQNDLNKVLT 60 Query 71 GGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS 107 GGTRLAEGL +AAALMEGHEADS AF ALFG HGS Sbjct 61 GGTRLAEGLTKAAALMEGHEADSSHAFSALFGG-HGS 96 >gi|254773209|ref|ZP_05214725.1| hypothetical protein MaviaA2_00806 [Mycobacterium avium subsp. avium ATCC 25291] Length=84 Score = 131 bits (330), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 65/85 (77%), Positives = 71/85 (84%), Gaps = 1/85 (1%) Query 23 MEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELNKVLTGGTRLAEGLVQA 82 ME A+GI +GY+ANQENVMNPATWSG V ASH TA E+ N+LNKVLTGGTRLAEGL +A Sbjct 1 MEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQNDLNKVLTGGTRLAEGLTKA 60 Query 83 AALMEGHEADSQTAFQALFGASHGS 107 AALMEGHEADS AF ALFG HGS Sbjct 61 AALMEGHEADSSHAFSALFGG-HGS 84 >gi|254821253|ref|ZP_05226254.1| hypothetical protein MintA_15047 [Mycobacterium intracellulare ATCC 13950] Length=84 Score = 130 bits (328), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 65/85 (77%), Positives = 71/85 (84%), Gaps = 1/85 (1%) Query 23 MEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELNKVLTGGTRLAEGLVQA 82 ME A+GI +GY+ANQENVMNPATWSG V ASH TA E+ N+LNKVLTGGTRLAEGL +A Sbjct 1 MEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQNDLNKVLTGGTRLAEGLTKA 60 Query 83 AALMEGHEADSQTAFQALFGASHGS 107 AALMEGHEADS AF ALFG HGS Sbjct 61 AALMEGHEADSSHAFTALFGG-HGS 84 >gi|333988694|ref|YP_004521308.1| ESAT-6 like protein EsxD [Mycobacterium sp. JDM601] gi|333484662|gb|AEF34054.1| ESAT-6 like protein EsxD [Mycobacterium sp. JDM601] Length=105 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 51/98 (53%), Positives = 69/98 (71%), Gaps = 0/98 (0%) Query 5 IQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNE 64 I VTP+++R+TA+ + ++E A IA YLA+ EN++ TW G G AS +TA +I + Sbjct 4 IVVTPELMRNTASKLAQHIEHAQAIANQYLADHENILGAGTWDGAGSKASFVTAGQIHED 63 Query 65 LNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFG 102 + KVL GGTRL EGL QAAALME HE+ S+ AF +LFG Sbjct 64 MQKVLIGGTRLTEGLNQAAALMESHESHSEHAFHSLFG 101 >gi|332243340|ref|XP_003270836.1| PREDICTED: uncharacterized protein C2orf16-like [Nomascus leucogenys] Length=2027 Score = 37.7 bits (86), Expect = 0.65, Method: Composition-based stats. Identities = 28/94 (30%), Positives = 36/94 (39%), Gaps = 6/94 (6%) Query 9 PQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELNKV 68 PQ RS + QA G+ K +L Q NV W T + N L Sbjct 843 PQSWRSLSRTFQAESGVQKGLIKSFLGRQHNVWESHAWRQRLPRKYLSTMLMLGNNL--- 899 Query 69 LTGGTRLAEGLVQAAALMEGHEADSQTAFQALFG 102 GT + L +L EG AD+ + Q LFG Sbjct 900 ---GTTMERKLCSQTSLAEGATADTCQSIQNLFG 930 >gi|329954846|ref|ZP_08295863.1| tetratricopeptide repeat protein [Bacteroides clarus YIT 12056] gi|328526950|gb|EGF53961.1| tetratricopeptide repeat protein [Bacteroides clarus YIT 12056] Length=420 Score = 35.4 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 16/51 (32%), Positives = 24/51 (48%), Gaps = 0/51 (0%) Query 13 RSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITN 63 +S AN+++ N QA + G L N E N TW G + + E+ N Sbjct 44 KSIANEVKPNFAQAEKLINGALTNAETKDNAETWDVAGFIQKRINEKEMEN 94 Lambda K H 0.311 0.122 0.333 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 130484177216 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40