BLASTP 2.2.25+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics: Alejandro A. Schäffer,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri
I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids
Res. 29:2994-3005.


Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters



Query= Rv3890c

Length=95
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|15611026|ref|NP_218407.1| ESAT-6 like protein ESXC (ESAT-6 li...    191    4e-47
gi|148825098|ref|YP_001289852.1| Esat-6 like protein esxC (Esat-...    189    1e-46
gi|240168370|ref|ZP_04747029.1| esat-6 like protein EsxC [Mycoba...    168    3e-40
gi|342860125|ref|ZP_08716777.1| hypothetical protein MCOL_14640 ...    152    1e-35
gi|254773210|ref|ZP_05214726.1| hypothetical protein MaviaA2_008...    151    3e-35
gi|41406259|ref|NP_959095.1| hypothetical protein MAP0161 [Mycob...    151    4e-35
gi|254821254|ref|ZP_05226255.1| hypothetical protein MintA_15052...    150    7e-35
gi|296167006|ref|ZP_06849419.1| ESAT-6 like protein ESXC (ESAT-6...    150    8e-35
gi|31795063|ref|NP_857556.1| putative ESAT-6 like protein 11 [My...    118    2e-25
gi|339633880|ref|YP_004725522.1| ESAT-6 like protein ESXC (ESAT-...    118    3e-25
gi|333988693|ref|YP_004521307.1| ESAT-6 like protein ESXC (ESAT-...    115    3e-24
gi|257054512|ref|YP_003132344.1| hypothetical protein Svir_04410...   38.5    0.37
gi|154508575|ref|ZP_02044217.1| hypothetical protein ACTODO_0107...   37.0    1.0
gi|302524038|ref|ZP_07276380.1| predicted protein [Streptomyces ...   35.0    4.0
gi|296164027|ref|ZP_06846650.1| conserved hypothetical protein [...   34.7    5.3
gi|195996253|ref|XP_002107995.1| hypothetical protein TRIADDRAFT...   33.9    8.2
gi|153962582|gb|ABS53459.1| putative sulfide-quinone reductase [...   33.9    8.8


>gi|15611026|ref|NP_218407.1| ESAT-6 like protein ESXC (ESAT-6 like protein 11) [Mycobacterium tuberculosis H37Rv]
 gi|15843521|ref|NP_338558.1| hypothetical protein MT4005 [Mycobacterium tuberculosis CDC1551]
 gi|148663757|ref|YP_001285280.1| esat-6 like protein EsxC [Mycobacterium tuberculosis H37Ra]
 62 more sequence titles
Length=95

Score = 191 bits (484), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 95/95 (100%), Positives = 95/95 (100%), Gaps = 0/95 (0%)

Query  1   MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML  60
           MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML
Sbjct  1   MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML  60

Query  61  SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF  95
           SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF
Sbjct  61  SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF  95


>gi|148825098|ref|YP_001289852.1| Esat-6 like protein esxC (Esat-6 like protein 11) [Mycobacterium tuberculosis F11]
 gi|148723625|gb|ABR08250.1| Esat-6 like protein esxC (Esat-6 like protein 11) [Mycobacterium tuberculosis F11]
Length=95

Score = 189 bits (479), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 94/95 (99%), Positives = 94/95 (99%), Gaps = 0/95 (0%)

Query  1   MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML  60
           MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML
Sbjct  1   MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML  60

Query  61  SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF  95
           SGLQGLIETVGQHGTTTGHVLDNAIG DQAIAGLF
Sbjct  61  SGLQGLIETVGQHGTTTGHVLDNAIGADQAIAGLF  95


>gi|240168370|ref|ZP_04747029.1| esat-6 like protein EsxC [Mycobacterium kansasii ATCC 12478]
Length=95

Score = 168 bits (425), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 82/95 (87%), Positives = 88/95 (93%), Gaps = 0/95 (0%) Query 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML 60 M DQITYNP AVSDFA+DVGSRAGQLH I+EDT++KTNALQEFFAGHGAQGFFDAQAQML Sbjct 1 MGDQITYNPAAVSDFATDVGSRAGQLHEIHEDTSNKTNALQEFFAGHGAQGFFDAQAQML 60 Query 61 SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF 95 SGLQGLIETVGQHGTTT HVLDNA+ TD AI+ LF Sbjct 61 SGLQGLIETVGQHGTTTSHVLDNALTTDSAISNLF 95 >gi|342860125|ref|ZP_08716777.1| hypothetical protein MCOL_14640 [Mycobacterium colombiense CECT 3035] gi|342132503|gb|EGT85732.1| hypothetical protein MCOL_14640 [Mycobacterium colombiense CECT 3035] Length=95 Score = 152 bits (385), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 73/95 (77%), Positives = 85/95 (90%), Gaps = 0/95 (0%) Query 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML 60 MSD ITYNPGAV+DFA+DV SRAGQL I++DT+++T+ALQEFFAGHGA GFF+AQAQML Sbjct 1 MSDPITYNPGAVADFATDVASRAGQLQSIFDDTSNRTHALQEFFAGHGASGFFEAQAQML 60 Query 61 SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF 95 SGLQGLI+T+ QHG TT HVLDNA+ TDQ IAGLF Sbjct 61 SGLQGLIDTIRQHGVTTSHVLDNALSTDQHIAGLF 95 >gi|254773210|ref|ZP_05214726.1| hypothetical protein MaviaA2_00811 [Mycobacterium avium subsp. avium ATCC 25291] Length=97 Score = 151 bits (381), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 73/95 (77%), Positives = 85/95 (90%), Gaps = 0/95 (0%) Query 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML 60 MSD ITYNPGAV+DFA+DV SRAGQL I++DT+++T+ALQEFFAGHGA GFF+AQAQML Sbjct 3 MSDPITYNPGAVADFATDVASRAGQLQSIFDDTSNRTHALQEFFAGHGASGFFEAQAQML 62 Query 61 SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF 95 SGLQGLI+T+ QHG TT HVLD+AI TDQ IAGLF Sbjct 63 SGLQGLIDTIRQHGQTTSHVLDSAISTDQHIAGLF 97 >gi|41406259|ref|NP_959095.1| hypothetical protein MAP0161 [Mycobacterium avium subsp. 
paratuberculosis K-10] gi|118467234|ref|YP_879448.1| hypothetical protein MAV_0154 [Mycobacterium avium 104] gi|14548042|sp|Q9K548.1|ES6LB_MYCPA RecName: Full=Putative ESAT-6-like protein 11; AltName: Full=ORF3890c gi|8919125|emb|CAB96047.1| hypothetical protein [Mycobacterium avium subsp. paratuberculosis] gi|41394607|gb|AAS02478.1| hypothetical protein MAP_0161 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118168521|gb|ABK69418.1| conserved hypothetical protein [Mycobacterium avium 104] gi|336457494|gb|EGO36501.1| hypothetical protein MAPs_22300 [Mycobacterium avium subsp. paratuberculosis S397] Length=95 Score = 151 bits (381), Expect = 4e-35, Method: Compositional matrix adjust. Identities = 73/95 (77%), Positives = 85/95 (90%), Gaps = 0/95 (0%) Query 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML 60 MSD ITYNPGAV+DFA+DV SRAGQL I++DT+++T+ALQEFFAGHGA GFF+AQAQML Sbjct 1 MSDPITYNPGAVADFATDVASRAGQLQSIFDDTSNRTHALQEFFAGHGASGFFEAQAQML 60 Query 61 SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF 95 SGLQGLI+T+ QHG TT HVLD+AI TDQ IAGLF Sbjct 61 SGLQGLIDTIRQHGQTTSHVLDSAISTDQHIAGLF 95 >gi|254821254|ref|ZP_05226255.1| hypothetical protein MintA_15052 [Mycobacterium intracellulare ATCC 13950] Length=95 Score = 150 bits (378), Expect = 7e-35, Method: Compositional matrix adjust. 
Identities = 72/95 (76%), Positives = 85/95 (90%), Gaps = 0/95 (0%) Query 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML 60 MSD ITYNPGAV+DFA+DV SRAGQL I++DT+++T+ALQEFFAGHGA GFF+AQAQML Sbjct 1 MSDPITYNPGAVADFATDVASRAGQLQSIFDDTSNRTHALQEFFAGHGASGFFEAQAQML 60 Query 61 SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF 95 SGLQGLI+T+ QHG TT HVLD+A+ TDQ IAGLF Sbjct 61 SGLQGLIDTIRQHGQTTSHVLDSALSTDQHIAGLF 95 >gi|296167006|ref|ZP_06849419.1| ESAT-6 like protein ESXC (ESAT-6 like protein 11) [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295897636|gb|EFG77229.1| ESAT-6 like protein ESXC (ESAT-6 like protein 11) [Mycobacterium parascrofulaceum ATCC BAA-614] Length=95 Score = 150 bits (378), Expect = 8e-35, Method: Compositional matrix adjust. Identities = 73/95 (77%), Positives = 84/95 (89%), Gaps = 0/95 (0%) Query 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML 60 MSD ITYNPGAV+DFA+DV SRAGQL I++DT+++TNALQEFFAGHGA GFF+AQAQML Sbjct 1 MSDPITYNPGAVADFATDVASRAGQLQGIFDDTSNRTNALQEFFAGHGASGFFEAQAQML 60 Query 61 SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF 95 SGLQGLI+T+ QHG TT HVLD A+ TDQ IAGLF Sbjct 61 SGLQGLIDTIRQHGQTTSHVLDGALSTDQHIAGLF 95 >gi|31795063|ref|NP_857556.1| putative ESAT-6 like protein 11 [Mycobacterium bovis AF2122/97] gi|121639801|ref|YP_980025.1| putative ESAT-6 like protein 11 esxC [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224992296|ref|YP_002646986.1| putative EsaT-6 like protein 11 [Mycobacterium bovis BCG str. Tokyo 172] 6 more sequence titles Length=124 Score = 118 bits (296), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 55/55 (100%), Positives = 55/55 (100%), Gaps = 0/55 (0%) Query 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDA 55 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDA Sbjct 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDA 55 >gi|339633880|ref|YP_004725522.1| ESAT-6 like protein ESXC (ESAT-6 like protein 11) [Mycobacterium africanum GM041182] gi|339333236|emb|CCC28973.1| ESAT-6 like protein ESXC (ESAT-6 like protein 11) [Mycobacterium africanum GM041182] Length=96 Score = 118 bits (295), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 55/55 (100%), Positives = 55/55 (100%), Gaps = 0/55 (0%) Query 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDA 55 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDA Sbjct 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDA 55 >gi|333988693|ref|YP_004521307.1| ESAT-6 like protein ESXC (ESAT-6 like protein 11) [Mycobacterium sp. JDM601] gi|333484661|gb|AEF34053.1| ESAT-6 like protein ESXC (ESAT-6 like protein 11) [Mycobacterium sp. JDM601] Length=95 Score = 115 bits (287), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 60/95 (64%), Positives = 70/95 (74%), Gaps = 0/95 (0%) Query 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML 60 M+D ITYNPG V+D A V + AG L I+ D T L E+FAGHGA GFF+AQAQML Sbjct 1 MADGITYNPGPVADQAHSVITSAGTLDQIHADAHQLTQMLTEYFAGHGATGFFEAQAQML 60 Query 61 SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF 95 SGLQGLIET+GQHG+T G VL+ AI TDQ I+ LF Sbjct 61 SGLQGLIETIGQHGSTIGSVLEGAIQTDQTISSLF 95 >gi|257054512|ref|YP_003132344.1| hypothetical protein Svir_04410 [Saccharomonospora viridis DSM 43017] gi|256584384|gb|ACU95517.1| uncharacterized conserved protein [Saccharomonospora viridis DSM 43017] Length=95 Score = 38.5 bits (88), Expect = 0.37, Method: Compositional matrix adjust. 
Identities = 20/95 (22%), Positives = 41/95 (44%), Gaps = 0/95 (0%) Query 1 MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQML 60 M + I + + A D G+L ++ED ++ L + ++G + + Q + Sbjct 1 MPNGIVVDYATIHTAAEDCQRTGGELEALFEDLKARLAPLVDSWSGEAMEAWMQCQNEWN 60 Query 61 SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF 95 L + + + Q T + D TD++I G+F Sbjct 61 QSLDEMKQVLAQIATALPQIADGYQATDKSIQGMF 95 >gi|154508575|ref|ZP_02044217.1| hypothetical protein ACTODO_01076 [Actinomyces odontolyticus ATCC 17982] gi|293191453|ref|ZP_06609195.1| putative low molecular weight protein antigen 7 [Actinomyces odontolyticus F0309] gi|153798209|gb|EDN80629.1| hypothetical protein ACTODO_01076 [Actinomyces odontolyticus ATCC 17982] gi|292820554|gb|EFF79530.1| putative low molecular weight protein antigen 7 [Actinomyces odontolyticus F0309] Length=93 Score = 37.0 bits (84), Expect = 1.0, Method: Compositional matrix adjust. Identities = 24/88 (28%), Positives = 39/88 (45%), Gaps = 0/88 (0%) Query 8 NPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQMLSGLQGLI 67 N GA+ A+D+ + A L +D N L+ + G + + AQ Q GL+GL Sbjct 6 NYGALDAAAADINTGAANLQNCLDDLEQTLNQLRSSWEGQTQEAYDVAQRQWNQGLEGLK 65 Query 68 ETVGQHGTTTGHVLDNAIGTDQAIAGLF 95 + + + + N TDQ+ A F Sbjct 66 DVLRRTSSAVDSARSNYQQTDQSNAARF 93 >gi|302524038|ref|ZP_07276380.1| predicted protein [Streptomyces sp. AA4] gi|302432933|gb|EFL04749.1| predicted protein [Streptomyces sp. AA4] Length=96 Score = 35.0 bits (79), Expect = 4.0, Method: Compositional matrix adjust. 
Identities = 20/96 (21%), Positives = 40/96 (42%), Gaps = 1/96 (1%) Query 1 MSD-QITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQM 59 M D +I +PG + A D + G+L ++++ S + L + G + AQ + Sbjct 1 MPDGRIVVDPGTIHRAAEDCTATGGELKTLFDNLQSDLSPLTNSWTGEAKDQYHQAQNEW 60 Query 60 LSGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF 95 + + + Q + D TD+++ LF Sbjct 61 NQKFEEFTQLLAQIAAVLPQIADGYQATDRSVQNLF 96 >gi|296164027|ref|ZP_06846650.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295900575|gb|EFG79958.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length=99 Score = 34.7 bits (78), Expect = 5.3, Method: Compositional matrix adjust. Identities = 21/93 (23%), Positives = 38/93 (41%), Gaps = 0/93 (0%) Query 3 DQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQMLSG 62 D + Y+ ++D S ++ + N + + HG+ A ++ Sbjct 4 DSMRYDHAMIADHVSAQAQLVAHMNDLRTRAMGTINQVATVWTQHGSDAAQVAMHEIDQA 63 Query 63 LQGLIETVGQHGTTTGHVLDNAIGTDQAIAGLF 95 Q + T+ +HG GH NA+GTD A+ F Sbjct 64 FQAVFTTIERHGQAQGHASTNALGTDHAVQAGF 96 >gi|195996253|ref|XP_002107995.1| hypothetical protein TRIADDRAFT_11246 [Trichoplax adhaerens] gi|190588771|gb|EDV28793.1| hypothetical protein TRIADDRAFT_11246 [Trichoplax adhaerens] Length=604 Score = 33.9 bits (76), Expect = 8.2, Method: Composition-based stats. Identities = 18/67 (27%), Positives = 36/67 (54%), Gaps = 1/67 (1%) Query 21 SRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQMLSGLQGLIETVGQHGTTTGHV 80 + +G L+ YE A+ + F+AG G +G + A + ++ G+ GL+ + H T +V Sbjct 44 TESGSLYSAYEVAAAIIGIVTSFYAGQGHKGRYLAVSAVIIGIGGLVFAL-PHWITDNYV 102 Query 81 LDNAIGT 87 + +G+ Sbjct 103 AEGTVGS 109 >gi|153962582|gb|ABS53459.1| putative sulfide-quinone reductase [uncultured bacterium] Length=114 Score = 33.9 bits (76), Expect = 8.8, Method: Compositional matrix adjust. 
Identities = 15/34 (45%), Positives = 23/34 (68%), Gaps = 0/34 (0%)

Query  61  SGLQGLIETVGQHGTTTGHVLDNAIGTDQAIAGL  94
           +G++GL ET+GQHG T+ +  D A  T + + GL
Sbjct  67  AGVEGLTETLGQHGVTSNYCFDLAPYTWKLVQGL  100



Lambda      K        H
   0.317    0.132    0.377

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Effective search space used: 131599744116


  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318



Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40