BLASTP 2.2.25+


Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.



Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.



Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters



Query= Rv3891c

Length=107
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|15611027|ref|NP_218408.1|  ESAT-6 like protein EsxD [Mycobacte...   215    1e-54
gi|294995576|ref|ZP_06801267.1|  esat-6 like protein esxD [Mycoba...   214    5e-54
gi|31795064|ref|NP_857557.1|  hypothetical protein Mb3920c [Mycob...   213    6e-54
gi|240168371|ref|ZP_04747030.1|  hypothetical protein MkanA1_0360...   171    4e-41
gi|336457493|gb|EGO36500.1|  WXG repeat protein [Mycobacterium av...   162    1e-38
gi|118466409|ref|YP_879447.1|  hypothetical protein MAV_0153 [Myc...   162    2e-38
gi|296167007|ref|ZP_06849420.1|  conserved hypothetical protein [...   160    7e-38
gi|8919126|emb|CAB96048.1|  hypothetical protein [Mycobacterium a...   158    3e-37
gi|41406258|ref|NP_959094.1|  hypothetical protein MAP0160 [Mycob...   156    9e-37
gi|342860126|ref|ZP_08716778.1|  hypothetical protein MCOL_14645 ...   147    6e-34
gi|254773209|ref|ZP_05214725.1|  hypothetical protein MaviaA2_008...   131    3e-29
gi|254821253|ref|ZP_05226254.1|  hypothetical protein MintA_15047...   130    4e-29
gi|333988694|ref|YP_004521308.1|  ESAT-6 like protein EsxD [Mycob...   105    2e-21
gi|332243340|ref|XP_003270836.1|  PREDICTED: uncharacterized prot...  37.7    0.65 
gi|329954846|ref|ZP_08295863.1|  tetratricopeptide repeat protein...  35.4    2.5  


>gi|15611027|ref|NP_218408.1| ESAT-6 like protein EsxD [Mycobacterium tuberculosis H37Rv]
 gi|15843522|ref|NP_338559.1| hypothetical protein MT4006 [Mycobacterium tuberculosis CDC1551]
 gi|148663758|ref|YP_001285281.1| putative esat-6 like protein EsxD [Mycobacterium tuberculosis 
H37Ra]
 55 more sequence titles
 Length=107

 Score =  215 bits (548),  Expect = 1e-54, Method: Compositional matrix adjust.
 Identities = 106/107 (99%), Positives = 107/107 (100%), Gaps = 0/107 (0%)

Query  1    VADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE  60
            +ADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE
Sbjct  1    MADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE  60

Query  61   ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107
            ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS
Sbjct  61   ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107


>gi|294995576|ref|ZP_06801267.1| esat-6 like protein esxD [Mycobacterium tuberculosis 210]
Length=107

 Score =  214 bits (544),  Expect = 5e-54, Method: Compositional matrix adjust.
 Identities = 105/107 (99%), Positives = 106/107 (99%), Gaps = 0/107 (0%)

Query  1    VADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE  60
            +ADTIQVTPQMLRST NDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE
Sbjct  1    MADTIQVTPQMLRSTGNDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE  60

Query  61   ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107
            ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS
Sbjct  61   ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107


>gi|31795064|ref|NP_857557.1| hypothetical protein Mb3920c [Mycobacterium bovis AF2122/97]
 gi|121639802|ref|YP_980026.1| hypothetical protein BCG_3947c [Mycobacterium bovis BCG str. 
Pasteur 1173P2]
 gi|224992297|ref|YP_002646987.1| hypothetical protein JTY_3949 [Mycobacterium bovis BCG str. Tokyo 
172]
 18 more sequence titles
 Length=107

 Score =  213 bits (543),  Expect = 6e-54, Method: Compositional matrix adjust.
 Identities = 105/107 (99%), Positives = 106/107 (99%), Gaps = 0/107 (0%)

Query  1    VADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATE  60
            +ADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSG GVVASHMTATE
Sbjct  1    MADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGAGVVASHMTATE  60

Query  61   ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107
            ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS
Sbjct  61   ITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107


>gi|240168371|ref|ZP_04747030.1| hypothetical protein MkanA1_03602 [Mycobacterium kansasii ATCC 
12478]
Length=97

 Score =  171 bits (432),  Expect = 4e-41, Method: Compositional matrix adjust.
 Identities = 84/97 (87%), Positives = 87/97 (90%), Gaps = 0/97 (0%)

Query  11   MLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELNKVLT  70
            MLRSTA+DIQANME AM IA+GYLANQENVMNPATWSG GVVASH TATE+ NELNKVLT
Sbjct  1    MLRSTAHDIQANMEHAMAIAQGYLANQENVMNPATWSGAGVVASHATATEVANELNKVLT  60

Query  71   GGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107
            GGTRLAEGL QAAALME HEADSQ AFQALFG  HGS
Sbjct  61   GGTRLAEGLTQAAALMESHEADSQHAFQALFGGGHGS  97


>gi|336457493|gb|EGO36500.1| WXG repeat protein [Mycobacterium avium subsp. paratuberculosis 
S397]
Length=107

 Score =  162 bits (410),  Expect = 1e-38, Method: Compositional matrix adjust.
 Identities = 80/104 (77%), Positives = 88/104 (85%), Gaps = 1/104 (0%)

Query  4    TIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITN  63
            TI+VTPQMLR T+N IQANME A+GI +GY+ANQENVMNPATWSG  V ASH TA E+ N
Sbjct  5    TIKVTPQMLRDTSNAIQANMEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQN  64

Query  64   ELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107
            +LNKVLTGGTRLAEGL +AAALMEGHEADS  AF ALFG  HGS
Sbjct  65   DLNKVLTGGTRLAEGLTKAAALMEGHEADSSHAFSALFGG-HGS  107


>gi|118466409|ref|YP_879447.1| hypothetical protein MAV_0153 [Mycobacterium avium 104]
 gi|118167696|gb|ABK68593.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=104

 Score =  162 bits (410),  Expect = 2e-38, Method: Compositional matrix adjust.
 Identities = 80/104 (77%), Positives = 88/104 (85%), Gaps = 1/104 (0%)

Query  4    TIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITN  63
            TI+VTPQMLR T+N IQANME A+GI +GY+ANQENVMNPATWSG  V ASH TA E+ N
Sbjct  2    TIKVTPQMLRDTSNAIQANMEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQN  61

Query  64   ELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107
            +LNKVLTGGTRLAEGL +AAALMEGHEADS  AF ALFG  HGS
Sbjct  62   DLNKVLTGGTRLAEGLTKAAALMEGHEADSSHAFSALFGG-HGS  104


>gi|296167007|ref|ZP_06849420.1| conserved hypothetical protein [Mycobacterium parascrofulaceum 
ATCC BAA-614]
 gi|295897637|gb|EFG77230.1| conserved hypothetical protein [Mycobacterium parascrofulaceum 
ATCC BAA-614]
Length=104

 Score =  160 bits (405),  Expect = 7e-38, Method: Compositional matrix adjust.
 Identities = 79/103 (77%), Positives = 87/103 (85%), Gaps = 2/103 (1%)

Query  4    TIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITN  63
            TI+VTPQMLR T+N IQANME A+GI +GY+ANQENVMNP+TWSG  VVASH TA E+ N
Sbjct  2    TIKVTPQMLRDTSNAIQANMEHAIGIGQGYVANQENVMNPSTWSGDAVVASHATAIEVQN  61

Query  64   ELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHG  106
            +LNKVL GGTRLAEGL QAAALMEGHEADS  AF ALFG  HG
Sbjct  62   DLNKVLNGGTRLAEGLKQAAALMEGHEADSSHAFSALFG--HG  102


>gi|8919126|emb|CAB96048.1| hypothetical protein [Mycobacterium avium subsp. paratuberculosis]
Length=100

 Score =  158 bits (399),  Expect = 3e-37, Method: Compositional matrix adjust.
 Identities = 78/101 (78%), Positives = 85/101 (85%), Gaps = 1/101 (0%)

Query  7    VTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELN  66
            VTPQMLR T+N IQANME A+GI +GY+ANQENVMNPATWSG  V ASH TA E+ N+LN
Sbjct  1    VTPQMLRDTSNAIQANMEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQNDLN  60

Query  67   KVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107
            KVLTGGTRLAEGL +AAALMEGHEADS  AF ALFG  HGS
Sbjct  61   KVLTGGTRLAEGLTKAAALMEGHEADSSHAFSALFGG-HGS  100


>gi|41406258|ref|NP_959094.1| hypothetical protein MAP0160 [Mycobacterium avium subsp. paratuberculosis 
K-10]
 gi|41394606|gb|AAS02477.1| hypothetical protein MAP_0160 [Mycobacterium avium subsp. paratuberculosis 
K-10]
Length=100

 Score =  156 bits (395),  Expect = 9e-37, Method: Compositional matrix adjust.
 Identities = 77/101 (77%), Positives = 85/101 (85%), Gaps = 1/101 (0%)

Query  7    VTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELN  66
            +TPQMLR T+N IQANME A+GI +GY+ANQENVMNPATWSG  V ASH TA E+ N+LN
Sbjct  1    MTPQMLRDTSNAIQANMEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQNDLN  60

Query  67   KVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107
            KVLTGGTRLAEGL +AAALMEGHEADS  AF ALFG  HGS
Sbjct  61   KVLTGGTRLAEGLTKAAALMEGHEADSSHAFSALFGG-HGS  100


>gi|342860126|ref|ZP_08716778.1| hypothetical protein MCOL_14645 [Mycobacterium colombiense CECT 
3035]
 gi|342132504|gb|EGT85733.1| hypothetical protein MCOL_14645 [Mycobacterium colombiense CECT 
3035]
Length=96

 Score =  147 bits (370),  Expect = 6e-34, Method: Compositional matrix adjust.
 Identities = 72/97 (75%), Positives = 81/97 (84%), Gaps = 1/97 (1%)

Query  11   MLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELNKVLT  70
            MLR  +N IQANME A+GI +GY+ANQENVMNP+TWSG+ V ASH TA E+ N+LNKVLT
Sbjct  1    MLRDASNAIQANMEHAIGIGQGYVANQENVMNPSTWSGSAVTASHATAIEVQNDLNKVLT  60

Query  71   GGTRLAEGLVQAAALMEGHEADSQTAFQALFGASHGS  107
            GGTRLAEGL +AAALMEGHEADS  AF ALFG  HGS
Sbjct  61   GGTRLAEGLTKAAALMEGHEADSSHAFSALFGG-HGS  96


>gi|254773209|ref|ZP_05214725.1| hypothetical protein MaviaA2_00806 [Mycobacterium avium subsp. 
avium ATCC 25291]
Length=84

 Score =  131 bits (330),  Expect = 3e-29, Method: Compositional matrix adjust.
 Identities = 65/85 (77%), Positives = 71/85 (84%), Gaps = 1/85 (1%)

Query  23   MEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELNKVLTGGTRLAEGLVQA  82
            ME A+GI +GY+ANQENVMNPATWSG  V ASH TA E+ N+LNKVLTGGTRLAEGL +A
Sbjct  1    MEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQNDLNKVLTGGTRLAEGLTKA  60

Query  83   AALMEGHEADSQTAFQALFGASHGS  107
            AALMEGHEADS  AF ALFG  HGS
Sbjct  61   AALMEGHEADSSHAFSALFGG-HGS  84


>gi|254821253|ref|ZP_05226254.1| hypothetical protein MintA_15047 [Mycobacterium intracellulare 
ATCC 13950]
Length=84

 Score =  130 bits (328),  Expect = 4e-29, Method: Compositional matrix adjust.
 Identities = 65/85 (77%), Positives = 71/85 (84%), Gaps = 1/85 (1%)

Query  23   MEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELNKVLTGGTRLAEGLVQA  82
            ME A+GI +GY+ANQENVMNPATWSG  V ASH TA E+ N+LNKVLTGGTRLAEGL +A
Sbjct  1    MEHAIGIGQGYVANQENVMNPATWSGDAVAASHATAIEVQNDLNKVLTGGTRLAEGLTKA  60

Query  83   AALMEGHEADSQTAFQALFGASHGS  107
            AALMEGHEADS  AF ALFG  HGS
Sbjct  61   AALMEGHEADSSHAFTALFGG-HGS  84


>gi|333988694|ref|YP_004521308.1| ESAT-6 like protein EsxD [Mycobacterium sp. JDM601]
 gi|333484662|gb|AEF34054.1| ESAT-6 like protein EsxD [Mycobacterium sp. JDM601]
Length=105

 Score =  105 bits (263),  Expect = 2e-21, Method: Compositional matrix adjust.
 Identities = 51/98 (53%), Positives = 69/98 (71%), Gaps = 0/98 (0%)

Query  5    IQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNE  64
            I VTP+++R+TA+ +  ++E A  IA  YLA+ EN++   TW G G  AS +TA +I  +
Sbjct  4    IVVTPELMRNTASKLAQHIEHAQAIANQYLADHENILGAGTWDGAGSKASFVTAGQIHED  63

Query  65   LNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQALFG  102
            + KVL GGTRL EGL QAAALME HE+ S+ AF +LFG
Sbjct  64   MQKVLIGGTRLTEGLNQAAALMESHESHSEHAFHSLFG  101


>gi|332243340|ref|XP_003270836.1| PREDICTED: uncharacterized protein C2orf16-like [Nomascus leucogenys]
Length=2027

 Score = 37.7 bits (86),  Expect = 0.65, Method: Composition-based stats.
 Identities = 28/94 (30%), Positives = 36/94 (39%), Gaps = 6/94 (6%)

Query  9    PQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELNKV  68
            PQ  RS +   QA      G+ K +L  Q NV     W          T   + N L   
Sbjct  843  PQSWRSLSRTFQAESGVQKGLIKSFLGRQHNVWESHAWRQRLPRKYLSTMLMLGNNL---  899

Query  69   LTGGTRLAEGLVQAAALMEGHEADSQTAFQALFG  102
               GT +   L    +L EG  AD+  + Q LFG
Sbjct  900  ---GTTMERKLCSQTSLAEGATADTCQSIQNLFG  930


>gi|329954846|ref|ZP_08295863.1| tetratricopeptide repeat protein [Bacteroides clarus YIT 12056]
 gi|328526950|gb|EGF53961.1| tetratricopeptide repeat protein [Bacteroides clarus YIT 12056]
Length=420

 Score = 35.4 bits (80),  Expect = 2.5, Method: Composition-based stats.
 Identities = 16/51 (32%), Positives = 24/51 (48%), Gaps = 0/51 (0%)

Query  13  RSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITN  63
           +S AN+++ N  QA  +  G L N E   N  TW   G +   +   E+ N
Sbjct  44  KSIANEVKPNFAQAEKLINGALTNAETKDNAETWDVAGFIQKRINEKEMEN  94



Lambda     K      H
   0.311    0.122    0.333 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Effective search space used: 130484177216


  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011  4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318



Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
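

Note on the statistics above: the bit scores and E-values in this report can be roughly reproduced from the gapped Lambda and K values and the effective search space. The sketch below is a minimal illustration, not BLAST's own implementation. It assumes the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = N * 2^(-S'), and the function and constant names are hypothetical.

```python
import math

# Values copied from the "Gapped" statistics block and the
# "Effective search space used" line of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 130484177216


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: S' = (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`: E = N * 2**(-S')."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores are the parenthesised numbers on the "Score =" lines.
    for raw in (548, 86, 80):
        bits = bit_score(raw)
        print(f"raw={raw:4d}  bits={bits:6.1f}  E~{expect_value(bits):.1e}")
```

The results agree with the reported values only approximately (same order of magnitude for E). This is probably because Lambda and K are printed rounded and because the compositional adjustment is applied per alignment.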