BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3904c
Length=90
Score E
Sequences producing significant alignments: (Bits) Value
gi|254366442|ref|ZP_04982486.1| Esat-6 like protein esxE [Mycoba... 177 4e-43
gi|308378513|ref|ZP_07482817.2| esat-6 like protein esxE [Mycoba... 177 6e-43
gi|15611040|ref|NP_218421.1| hypothetical protein Rv3904c [Mycob... 176 1e-42
gi|31795077|ref|NP_857570.1| hypothetical protein Mb3934c [Mycob... 154 4e-36
gi|240168382|ref|ZP_04747041.1| Esat-6 like protein esxE [Mycoba... 127 7e-28
gi|183985435|ref|YP_001853726.1| EsaT-6 like protein EsxE [Mycob... 123 1e-26
gi|118465918|ref|YP_883772.1| hypothetical protein MAV_4643 [Myc... 114 4e-24
gi|41410097|ref|NP_962933.1| hypothetical protein MAP3999c [Myco... 112 1e-23
gi|296168165|ref|ZP_06850186.1| Esat-6 like protein esxE [Mycoba... 109 2e-22
gi|342858989|ref|ZP_08715643.1| hypothetical protein MCOL_08933 ... 108 3e-22
gi|118620057|ref|YP_908389.1| EsaT-6 like protein EsxE [Mycobact... 103 1e-20
gi|296395184|ref|YP_003660068.1| hypothetical protein Srot_2805 ... 74.7 4e-12
gi|54026241|ref|YP_120483.1| hypothetical protein nfa42700 [Noca... 72.0 2e-11
gi|120402132|ref|YP_951961.1| hypothetical protein Mvan_1119 [My... 68.2 4e-10
gi|118473642|ref|YP_886237.1| hypothetical protein MSMEG_1871 [M... 67.0 9e-10
gi|118464199|ref|YP_883956.1| hypothetical protein MAV_4832 [Myc... 66.2 2e-09
gi|296392743|ref|YP_003657627.1| hypothetical protein Srot_0309 ... 65.9 2e-09
gi|317505964|ref|ZP_07963796.1| hypothetical protein HMPREF9336_... 63.5 1e-08
gi|296393520|ref|YP_003658404.1| hypothetical protein Srot_1100 ... 63.2 1e-08
gi|317505870|ref|ZP_07963711.1| hypothetical protein HMPREF9336_... 62.0 3e-08
gi|317507008|ref|ZP_07964775.1| hypothetical protein HMPREF9336_... 61.2 5e-08
gi|296393211|ref|YP_003658095.1| hypothetical protein Srot_0785 ... 60.5 8e-08
gi|54025389|ref|YP_119631.1| hypothetical protein nfa34195 [Noca... 57.4 6e-07
gi|343926518|ref|ZP_08766022.1| hypothetical protein GOALK_061_0... 57.0 1e-06
gi|119867412|ref|YP_937364.1| hypothetical protein Mkms_1362 [My... 57.0 1e-06
gi|108798315|ref|YP_638512.1| hypothetical protein Mmcs_1344 [My... 56.6 1e-06
gi|126433980|ref|YP_001069671.1| hypothetical protein Mjls_1378 ... 56.2 1e-06
gi|240171657|ref|ZP_04750316.1| hypothetical protein MkanA1_2025... 55.8 2e-06
gi|119717952|ref|YP_924917.1| hypothetical protein Noca_3730 [No... 48.5 4e-04
gi|296395325|ref|YP_003660209.1| hypothetical protein Srot_2949 ... 46.2 0.002
gi|184201481|ref|YP_001855688.1| hypothetical protein KRH_18350 ... 45.4 0.003
gi|54023529|ref|YP_117771.1| hypothetical protein nfa15615 [Noca... 42.4 0.021
gi|238062376|ref|ZP_04607085.1| hypothetical protein MCAG_03342 ... 42.0 0.026
gi|269124874|ref|YP_003298244.1| hypothetical protein Tcur_0611 ... 40.4 0.078
gi|291298518|ref|YP_003509796.1| hypothetical protein Snas_0993 ... 38.5 0.33
gi|46117048|ref|XP_384542.1| hypothetical protein FG04366.1 [Gib... 38.5 0.36
gi|154508575|ref|ZP_02044217.1| hypothetical protein ACTODO_0107... 38.1 0.44
gi|54027560|ref|YP_121802.1| hypothetical protein nfa55860 [Noca... 37.4 0.67
gi|256390225|ref|YP_003111789.1| hypothetical protein Caci_1020 ... 37.4 0.79
gi|54022450|ref|YP_116692.1| hypothetical protein nfa4855 [Nocar... 37.4 0.84
gi|302865073|ref|YP_003833710.1| hypothetical protein Micau_0567... 37.0 0.93
gi|345002677|ref|YP_004805531.1| hypothetical protein SACTE_5173... 36.6 1.3
gi|169630195|ref|YP_001703844.1| hypothetical protein MAB_3113 [... 35.8 2.1
gi|296119294|ref|ZP_06837862.1| early secretory antigenic target... 35.8 2.4
gi|225420537|ref|ZP_03762840.1| hypothetical protein CLOSTASPAR_... 35.4 2.5
gi|54022472|ref|YP_116714.1| hypothetical protein nfa5064 [Nocar... 35.4 2.8
gi|226305392|ref|YP_002765350.1| hypothetical protein RER_19030 ... 35.4 3.0
gi|170699241|ref|ZP_02890292.1| conserved hypothetical protein [... 35.4 3.1
gi|297202748|ref|ZP_06920145.1| conserved hypothetical protein [... 35.4 3.1
gi|256380551|ref|YP_003104211.1| hypothetical protein Amir_6566 ... 34.7 4.5
>gi|254366442|ref|ZP_04982486.1| Esat-6 like protein esxE [Mycobacterium tuberculosis str. Haarlem]
gi|308369275|ref|ZP_07417135.2| esat-6 like protein esxE [Mycobacterium tuberculosis SUMu002]
gi|308371499|ref|ZP_07425147.2| esat-6 like protein esxE [Mycobacterium tuberculosis SUMu003]
18 more sequence titles
Length=96
Score = 177 bits (450), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 90/90 (100%), Positives = 90/90 (100%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM
Sbjct 7 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 66
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
RQALAQLTAAGQSAHANYTGAMATNLGMWS
Sbjct 67 RQALAQLTAAGQSAHANYTGAMATNLGMWS 96
>gi|308378513|ref|ZP_07482817.2| esat-6 like protein esxE [Mycobacterium tuberculosis SUMu009]
gi|308380860|ref|ZP_07491266.2| esat-6 like protein esxE [Mycobacterium tuberculosis SUMu011]
gi|308352280|gb|EFP41131.1| esat-6 like protein esxE [Mycobacterium tuberculosis SUMu009]
gi|308360168|gb|EFP49019.1| esat-6 like protein esxE [Mycobacterium tuberculosis SUMu011]
Length=100
Score = 177 bits (448), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 90/90 (100%), Positives = 90/90 (100%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM
Sbjct 11 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 70
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
RQALAQLTAAGQSAHANYTGAMATNLGMWS
Sbjct 71 RQALAQLTAAGQSAHANYTGAMATNLGMWS 100
>gi|15611040|ref|NP_218421.1| hypothetical protein Rv3904c [Mycobacterium tuberculosis H37Rv]
gi|15843536|ref|NP_338573.1| hypothetical protein MT4023 [Mycobacterium tuberculosis CDC1551]
gi|148663771|ref|YP_001285294.1| putative esat-6 like protein EsxE [Mycobacterium tuberculosis
H37Ra]
47 more sequence titles
Length=90
Score = 176 bits (446), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 89/90 (99%), Positives = 90/90 (100%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
+DPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM
Sbjct 1 MDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
RQALAQLTAAGQSAHANYTGAMATNLGMWS
Sbjct 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
>gi|31795077|ref|NP_857570.1| hypothetical protein Mb3934c [Mycobacterium bovis AF2122/97]
gi|121639815|ref|YP_980039.1| putative ESAT-6 like protein 12 esxE [Mycobacterium bovis BCG
str. Pasteur 1173P2]
gi|224992310|ref|YP_002647000.1| putative EsaT-6 like protein 12 [Mycobacterium bovis BCG str.
Tokyo 172]
gi|31620675|emb|CAD96120.1| PUTATIVE ESAT-6 LIKE PROTEIN 12 (HYPOTHETICAL ALANINE RICH PROTEIN)
[Mycobacterium bovis AF2122/97]
gi|121495463|emb|CAL73951.1| Putative ESAT-6 like protein 12 esxE [Mycobacterium bovis BCG
str. Pasteur 1173P2]
gi|224775426|dbj|BAH28232.1| putative EsaT-6 like protein 12 [Mycobacterium bovis BCG str.
Tokyo 172]
gi|341603836|emb|CCC66518.1| putative ESAT-6 like protein 12 esxE [Mycobacterium bovis BCG
str. Moreau RDJ]
Length=90
Score = 154 bits (389), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 88/90 (98%), Positives = 90/90 (100%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
+DPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM
Sbjct 1 MDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
RQALAQLTAAGQSAHANYTGA+ATNLGMWS
Sbjct 61 RQALAQLTAAGQSAHANYTGAVATNLGMWS 90
>gi|240168382|ref|ZP_04747041.1| Esat-6 like protein esxE [Mycobacterium kansasii ATCC 12478]
Length=90
Score = 127 bits (318), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 73/90 (82%), Positives = 78/90 (87%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
+DP LADAV RMAEFGRH E ++AEIE+LVTRLHVTWTGEGAAAHAEA RHWA GEAMM
Sbjct 1 MDPAALADAVERMAEFGRHAESMLAEIEALVTRLHVTWTGEGAAAHAEAHRHWALGEAMM 60
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
RQALAQL AAG SA ANYTGAM+TN MWS
Sbjct 61 RQALAQLKAAGHSAKANYTGAMSTNFTMWS 90
>gi|183985435|ref|YP_001853726.1| EsaT-6 like protein EsxE [Mycobacterium marinum M]
gi|183178761|gb|ACC43871.1| EsaT-6 like protein EsxE [Mycobacterium marinum M]
Length=96
Score = 123 bits (308), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 70/90 (78%), Positives = 76/90 (85%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VDP L+DAVARMAEFGRH E ++AEIESLVTRLHVTWTGEGAAAHAEA RHWA GEAMM
Sbjct 7 VDPASLSDAVARMAEFGRHAEAMIAEIESLVTRLHVTWTGEGAAAHAEAHRHWAGGEAMM 66
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
R+ALAQL +AG SAH NYT M+ N MWS
Sbjct 67 REALAQLRSAGHSAHTNYTAVMSANQSMWS 96
>gi|118465918|ref|YP_883772.1| hypothetical protein MAV_4643 [Mycobacterium avium 104]
gi|254777080|ref|ZP_05218596.1| hypothetical protein MaviaA2_20774 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|118167205|gb|ABK68102.1| conserved hypothetical protein [Mycobacterium avium 104]
gi|336460442|gb|EGO39338.1| WXG repeat protein [Mycobacterium avium subsp. paratuberculosis
S397]
Length=96
Score = 114 bits (285), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 65/90 (73%), Positives = 74/90 (83%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VDP LADAV RMAEF R+ E+++AEI+S VTRLH TWTGEGAAAHAEA +HW GEAMM
Sbjct 7 VDPAALADAVQRMAEFQRYAEDMIAEIDSRVTRLHQTWTGEGAAAHAEAHQHWVRGEAMM 66
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
R+ALAQL A +AH NYTGAM+ NLGMWS
Sbjct 67 REALAQLGKAASTAHGNYTGAMSKNLGMWS 96
>gi|41410097|ref|NP_962933.1| hypothetical protein MAP3999c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41398930|gb|AAS06549.1| hypothetical protein MAP_3999c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=90
Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/90 (72%), Positives = 74/90 (83%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
+DP LADAV RMAEF R+ E+++AEI+S VTRLH TWTGEGAAAHAEA +HW GEAMM
Sbjct 1 MDPAALADAVQRMAEFQRYAEDMIAEIDSRVTRLHQTWTGEGAAAHAEAHQHWVRGEAMM 60
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
R+ALAQL A +AH NYTGAM+ NLGMWS
Sbjct 61 REALAQLGKAASTAHGNYTGAMSKNLGMWS 90
>gi|296168165|ref|ZP_06850186.1| Esat-6 like protein esxE [Mycobacterium parascrofulaceum ATCC
BAA-614]
gi|295896843|gb|EFG76472.1| Esat-6 like protein esxE [Mycobacterium parascrofulaceum ATCC
BAA-614]
Length=96
Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/90 (68%), Positives = 71/90 (79%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VDP LADA RMA F R+ E+++ EI+S VTRLH TWTG+ AAAHAEA +HW GEAMM
Sbjct 7 VDPQALADAAQRMAAFQRYAEDMITEIDSRVTRLHATWTGQAAAAHAEAHQHWVRGEAMM 66
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
R+ALAQL A +AH NYTGAM+TNLGMWS
Sbjct 67 REALAQLEKAAATAHGNYTGAMSTNLGMWS 96
>gi|342858989|ref|ZP_08715643.1| hypothetical protein MCOL_08933 [Mycobacterium colombiense CECT
3035]
gi|342133230|gb|EGT86433.1| hypothetical protein MCOL_08933 [Mycobacterium colombiense CECT
3035]
Length=96
Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 60/90 (67%), Positives = 71/90 (79%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VDP LA+AV RMA F R+ E+++AEI+S VTRLH +WTG AAAHAEA +HW GEAMM
Sbjct 7 VDPQALAEAVQRMAAFQRYAEDMIAEIDSRVTRLHTSWTGAAAAAHAEAHQHWVRGEAMM 66
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
R+ALAQL +AH NYTGAM+TNLGMWS
Sbjct 67 REALAQLEKVATTAHGNYTGAMSTNLGMWS 96
>gi|118620057|ref|YP_908389.1| EsaT-6 like protein EsxE [Mycobacterium ulcerans Agy99]
gi|118572167|gb|ABL06918.1| EsaT-6 like protein EsxE [Mycobacterium ulcerans Agy99]
Length=90
Score = 103 bits (256), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/90 (70%), Positives = 71/90 (79%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
+DP L+DAVA+MAEFGRH E ++AEIESLVTRLHVT TGE AAAHAEA RHWA GEAMM
Sbjct 1 MDPASLSDAVAQMAEFGRHAEAMIAEIESLVTRLHVTLTGESAAAHAEAHRHWAGGEAMM 60
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
R+AL QL +AG SAH N T M+ N WS
Sbjct 61 REALVQLRSAGHSAHINCTAVMSANQSTWS 90
>gi|296395184|ref|YP_003660068.1| hypothetical protein Srot_2805 [Segniliparus rotundus DSM 44985]
gi|296182331|gb|ADG99237.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=99
Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 34/89 (39%), Positives = 51/89 (58%), Gaps = 0/89 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VD L + V R EF + VE+ + +IE +TRLHV+W+G+ AAA + HW G M
Sbjct 10 VDLDQLLEHVERCEEFDKKVEDWLDQIEQAITRLHVSWSGDAAAAQRDYHDHWVQGAQEM 69
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMW 89
+ +L +L Q+ H +Y+G + GMW
Sbjct 70 KDSLGKLRDRAQTNHDSYSGLIEHQRGMW 98
>gi|54026241|ref|YP_120483.1| hypothetical protein nfa42700 [Nocardia farcinica IFM 10152]
gi|54017749|dbj|BAD59119.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=98
Score = 72.0 bits (175), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/89 (41%), Positives = 52/89 (59%), Gaps = 0/89 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
V+ L DA+ MA+FG VE + E+++ + LH++W+ + A A A W AG M
Sbjct 9 VNLQQLDDAITTMADFGTEVEGWLREVDAKIAALHLSWSSQAAEAQRAAHNRWMAGAEEM 68
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMW 89
R+ L +L + AH NYT A+ TNLGMW
Sbjct 69 RENLDELREVARRAHTNYTAAVQTNLGMW 97
>gi|120402132|ref|YP_951961.1| hypothetical protein Mvan_1119 [Mycobacterium vanbaalenii PYR-1]
gi|119954950|gb|ABM11955.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=94
Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/79 (46%), Positives = 45/79 (57%), Gaps = 0/79 (0%)
Query 10 VARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTA 69
V R+ F +H E + ++ + LH TW+G+GAAAH W A A MR+ALAQL A
Sbjct 14 VDRLQAFDQHAEAVATRVDGQIANLHDTWSGDGAAAHRARHDEWMAAAAQMREALAQLRA 73
Query 70 AGQSAHANYTGAMATNLGM 88
A AH NYT A NL M
Sbjct 74 AAHRAHRNYTEAAQLNLDM 92
>gi|118473642|ref|YP_886237.1| hypothetical protein MSMEG_1871 [Mycobacterium smegmatis str.
MC2 155]
gi|118174929|gb|ABK75825.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=92
Score = 67.0 bits (162), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 34/90 (38%), Positives = 49/90 (55%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VD + DA+A MA + R V + +I+ + L +W GE + A A AQ W G M
Sbjct 2 VDFGQMQDAIAHMASYEREVTSCLEDIDRTMASLRQSWHGEASDAQAHAQAQWREGAEQM 61
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
R+AL+QL Q+A +NY+ A+ N MW
Sbjct 62 REALSQLQKIAQTARSNYSDAVTKNSEMWD 91
>gi|118464199|ref|YP_883956.1| hypothetical protein MAV_4832 [Mycobacterium avium 104]
gi|254777269|ref|ZP_05218785.1| hypothetical protein MaviaA2_21729 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|118165486|gb|ABK66383.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=96
Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/89 (36%), Positives = 46/89 (52%), Gaps = 0/89 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VDP L+ R+A+F E++ A ++ + LH +W G GA A E + W A + M
Sbjct 7 VDPEALSHFADRLAKFTASAEQIAAAVDQCIAELHGSWLGRGADAEREYHQRWVAADKQM 66
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMW 89
R+AL +L + AH NY G N MW
Sbjct 67 REALTELRTNVERAHRNYDGVAQHNTAMW 95
>gi|296392743|ref|YP_003657627.1| hypothetical protein Srot_0309 [Segniliparus rotundus DSM 44985]
gi|296179890|gb|ADG96796.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=99
Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 44/80 (55%), Gaps = 0/80 (0%)
Query 10 VARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTA 69
V R EF + VE+ + +I++ + RLH +W+GE AAA E W AG MR ALA+L
Sbjct 19 VERCEEFDKKVEDWLDQIDAAIARLHTSWSGEAAAAQREQHDRWVAGAREMRDALARLKD 78
Query 70 AGQSAHANYTGAMATNLGMW 89
+ H Y G + GMW
Sbjct 79 QAAADHGRYAGLIGHQRGMW 98
>gi|317505964|ref|ZP_07963796.1| hypothetical protein HMPREF9336_00165 [Segniliparus rugosus ATCC
BAA-974]
gi|316255740|gb|EFV14978.1| hypothetical protein HMPREF9336_00165 [Segniliparus rugosus ATCC
BAA-974]
Length=97
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/90 (37%), Positives = 45/90 (50%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VD + + + +M +F E E+++ V LH TW+GE A+A A W AG M
Sbjct 7 VDLSHMQQTIDKMTQFTAQAESWCQEVDADVQSLHGTWSGEAASAQKAAHDQWVAGLNAM 66
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
+ L +L Q HANYT A N MWS
Sbjct 67 KADLEKLKKIAQDGHANYTDAGNKNKEMWS 96
>gi|296393520|ref|YP_003658404.1| hypothetical protein Srot_1100 [Segniliparus rotundus DSM 44985]
gi|296180667|gb|ADG97573.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=96
Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 45/90 (50%), Gaps = 0/90 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VD LA+ V +A+F + EE +AE E L W A + A W+ G A M
Sbjct 7 VDHETLAERVDDLADFEKRAEEDIAEAEQLAAAYKHLWDSTAAEEYQRAHALWSKGAAEM 66
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
R ALA L +G +AH NYT AM N G W
Sbjct 67 RAALAHLRQSGANAHGNYTAAMGANTGFWK 96
>gi|317505870|ref|ZP_07963711.1| hypothetical protein HMPREF9336_00080 [Segniliparus rugosus ATCC
BAA-974]
gi|316255825|gb|EFV15054.1| hypothetical protein HMPREF9336_00080 [Segniliparus rugosus ATCC
BAA-974]
Length=97
Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/88 (36%), Positives = 45/88 (52%), Gaps = 0/88 (0%)
Query 2 DPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMR 61
D + V+RM F V+ EI++ V +LHV+W+G+ A+A W +
Sbjct 8 DLDQMDSTVSRMEAFENQVQSWTEEIDAEVKKLHVSWSGDAASAQQADHDRWTKDAQAAK 67
Query 62 QALAQLTAAGQSAHANYTGAMATNLGMW 89
+AL +L A + AH NYTGA N MW
Sbjct 68 EALEKLKAVMRKAHENYTGANTANKAMW 95
>gi|317507008|ref|ZP_07964775.1| hypothetical protein HMPREF9336_01146 [Segniliparus rugosus ATCC
BAA-974]
gi|316254692|gb|EFV13995.1| hypothetical protein HMPREF9336_01146 [Segniliparus rugosus ATCC
BAA-974]
Length=97
Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 30/89 (34%), Positives = 45/89 (51%), Gaps = 0/89 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
+D L+ + F + VEE V +I++ + RLH +W+G+ A WA G A M
Sbjct 8 IDLDRLSAMIEDCEAFDKKVEEWVGQIDAAIARLHASWSGQAAEQQRAYHDRWAKGLAQM 67
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMW 89
R+ L +L + H YTG +A GMW
Sbjct 68 REGLQELRGQAKGNHQAYTGVIAHQKGMW 96
>gi|296393211|ref|YP_003658095.1| hypothetical protein Srot_0785 [Segniliparus rotundus DSM 44985]
gi|296180358|gb|ADG97264.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=97
Score = 60.5 bits (145), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 30/89 (34%), Positives = 45/89 (51%), Gaps = 0/89 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
+D L++ + F + VEE V +I++ + RLH W+G+ A WA G A M
Sbjct 8 IDLEQLSEHIEACEAFDKKVEEWVGQIDAAIARLHAAWSGQAAEQQRAYHERWAKGLAEM 67
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMW 89
R+ L +L + H YTG +A GMW
Sbjct 68 REGLKELREQAKGNHQAYTGVIAHQKGMW 96
>gi|54025389|ref|YP_119631.1| hypothetical protein nfa34195 [Nocardia farcinica IFM 10152]
gi|54016897|dbj|BAD58267.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=95
Score = 57.4 bits (137), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/89 (32%), Positives = 46/89 (52%), Gaps = 0/89 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VD +L +AR+ F + + ++ ++RL W G+ A A A+A R A +
Sbjct 5 VDLDLLDSVIARLKGFEEFFVDQITAFDTAISRLQTGWEGDAATAQADAHRRLMAAAQDI 64
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMW 89
R + + A ++AH NYT A+A N+ MW
Sbjct 65 RDGVGDMRRAAEAAHTNYTQAIAANVAMW 93
>gi|343926518|ref|ZP_08766022.1| hypothetical protein GOALK_061_00040 [Gordonia alkanivorans NBRC
16433]
gi|343763521|dbj|GAA12948.1| hypothetical protein GOALK_061_00040 [Gordonia alkanivorans NBRC
16433]
Length=98
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/80 (33%), Positives = 42/80 (53%), Gaps = 0/80 (0%)
Query 10 VARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTA 69
V +A F V+E + ++ V LH+ WTGEGA AH +A W G +R+A+ ++
Sbjct 18 VEDLARFATAVDERIHTLDKRVDDLHLVWTGEGAVAHRDAHTQWREGVKDIREAIVEIKK 77
Query 70 AGQSAHANYTGAMATNLGMW 89
A +H+ ++G MW
Sbjct 78 AANRSHSAFSGLQEHQRKMW 97
>gi|119867412|ref|YP_937364.1| hypothetical protein Mkms_1362 [Mycobacterium sp. KMS]
gi|119693501|gb|ABL90574.1| putative EsaT-6 like protein 12 (hypothetical alanine rich protein)
[Mycobacterium sp. KMS]
Length=93
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/81 (42%), Positives = 46/81 (57%), Gaps = 0/81 (0%)
Query 9 AVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLT 68
A+ M EF R V E + ++E +T L TW GE + A A+AQ+ W G M++ALAQL
Sbjct 12 AIDHMREFERDVAECLDDVERAMTALRATWHGEASDAQAQAQQQWEDGAEQMKKALAQLQ 71
Query 69 AAGQSAHANYTGAMATNLGMW 89
+A NY A+A N MW
Sbjct 72 TIADTARKNYADAVAKNGRMW 92
>gi|108798315|ref|YP_638512.1| hypothetical protein Mmcs_1344 [Mycobacterium sp. MCS]
gi|108768734|gb|ABG07456.1| putative EsaT-6 like protein 12 (hypothetical alanine rich protein)
[Mycobacterium sp. MCS]
Length=95
Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/81 (42%), Positives = 46/81 (57%), Gaps = 0/81 (0%)
Query 9 AVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLT 68
A+ M EF R V E + ++E +T L TW GE + A A+AQ+ W G M++ALAQL
Sbjct 14 AIDHMREFERDVAECLDDVERAMTALRATWHGEASDAQAQAQQQWEDGAEQMKKALAQLQ 73
Query 69 AAGQSAHANYTGAMATNLGMW 89
+A NY A+A N MW
Sbjct 74 TIADTARKNYADAVAKNGRMW 94
>gi|126433980|ref|YP_001069671.1| hypothetical protein Mjls_1378 [Mycobacterium sp. JLS]
gi|126233780|gb|ABN97180.1| putative EsaT-6 like protein 12 (hypothetical alanine rich protein)
[Mycobacterium sp. JLS]
Length=95
Score = 56.2 bits (134), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/81 (42%), Positives = 46/81 (57%), Gaps = 0/81 (0%)
Query 9 AVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLT 68
A+ M EF R V E + ++E +T L TW GE + A A+AQ+ W G M++ALAQL
Sbjct 14 AIDHMREFERDVAECLDDVERAMTALRATWHGEASDAQAQAQQQWEDGAEQMKKALAQLQ 73
Query 69 AAGQSAHANYTGAMATNLGMW 89
+A NY A+A N MW
Sbjct 74 TIADTARRNYADAVAKNGRMW 94
>gi|240171657|ref|ZP_04750316.1| hypothetical protein MkanA1_20253 [Mycobacterium kansasii ATCC
12478]
Length=96
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/89 (34%), Positives = 43/89 (49%), Gaps = 0/89 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VD L R+A+F E++ ++ V LH +W G GA A E + W A + M
Sbjct 7 VDLDALLAFADRLAKFTARAEQIATAVDQCVAELHGSWLGAGADAEREYHQKWMAADQQM 66
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMW 89
R+ L +L + AH NYTG + MW
Sbjct 67 REGLDELRRNAELAHRNYTGVAELSTTMW 95
>gi|119717952|ref|YP_924917.1| hypothetical protein Noca_3730 [Nocardioides sp. JS614]
gi|119538613|gb|ABL83230.1| putative EsaT-6 like protein 12 (hypothetical alanine rich protein)
[Nocardioides sp. JS614]
Length=100
Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/89 (39%), Positives = 45/89 (51%), Gaps = 0/89 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VD L D V + G ++ L+ ++ V L TW+G A A A AQ W +G M
Sbjct 8 VDLDELLDVVDELVRCGVALDALLDDVVGRVAALQQTWSGSAADAQAGAQAEWESGFREM 67
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMW 89
ALA + +A +AH NY A TNL MW
Sbjct 68 SAALAVMRSAADTAHDNYERAATTNLRMW 96
>gi|296395325|ref|YP_003660209.1| hypothetical protein Srot_2949 [Segniliparus rotundus DSM 44985]
gi|296182472|gb|ADG99378.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=95
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/73 (35%), Positives = 36/73 (50%), Gaps = 0/73 (0%)
Query 17 GRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAAGQSAHA 76
G+ V+E +A ++ V L V W G A H + W MR+AL +L AH
Sbjct 22 GKRVDEHLAAVDREVASLDVHWHGNAKAQHEAKHQAWMKAAREMREALTELKTNVNHAHQ 81
Query 77 NYTGAMATNLGMW 89
Y+ +A N+GMW
Sbjct 82 VYSDNIAHNVGMW 94
>gi|184201481|ref|YP_001855688.1| hypothetical protein KRH_18350 [Kocuria rhizophila DC2201]
gi|183581711|dbj|BAG30182.1| hypothetical protein [Kocuria rhizophila DC2201]
Length=96
Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/74 (38%), Positives = 40/74 (55%), Gaps = 3/74 (4%)
Query 19 HVEELVAEIE---SLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAAGQSAH 75
H+E L AE+ S +T L +WTG +A+ EA R W + +A M QA+AQ+ A +A
Sbjct 21 HIERLTAEVNGMTSTLTDLQGSWTGTASASFQEAFRSWRSAQAQMEQAVAQINQALAAAG 80
Query 76 ANYTGAMATNLGMW 89
Y A N M+
Sbjct 81 TQYAETEAANTRMF 94
>gi|54023529|ref|YP_117771.1| hypothetical protein nfa15615 [Nocardia farcinica IFM 10152]
gi|54015037|dbj|BAD56407.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=99
Score = 42.4 bits (98), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 30/91 (33%), Positives = 46/91 (51%), Gaps = 1/91 (1%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVT-WTGEGAAAHAEAQRHWAAGEAM 59
VD L + VAR++ + E + EI+ V L T W A A+AEA WAA
Sbjct 7 VDLDHLEEIVARLSGLAGFIGEHLDEIDDRVATLTGTGWESVAARAYAEAHTQWAAAARE 66
Query 60 MRQALAQLTAAGQSAHANYTGAMATNLGMWS 90
+ + ++ A ++AH YT A+ TN M++
Sbjct 67 FVEGVRDMSDAAKAAHTRYTRAVDTNYKMFN 97
>gi|238062376|ref|ZP_04607085.1| hypothetical protein MCAG_03342 [Micromonospora sp. ATCC 39149]
gi|237884187|gb|EEP73015.1| hypothetical protein MCAG_03342 [Micromonospora sp. ATCC 39149]
Length=93
Score = 42.0 bits (97), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 25/90 (28%), Positives = 42/90 (47%), Gaps = 2/90 (2%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VD VL + +M R ++E + + S++T+LH W GE A+ + Q W A +
Sbjct 5 VDYAVLESSNQQMTAISRTIDEKLDTLRSMLTKLH--WEGEDRTAYQQHQAQWDAAVRDI 62
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMWS 90
+ L + A A NY +N +W+
Sbjct 63 NKILNDIGGAVGIARENYVTTEMSNSKVWN 92
>gi|269124874|ref|YP_003298244.1| hypothetical protein Tcur_0611 [Thermomonospora curvata DSM 43183]
gi|268309832|gb|ACY96206.1| hypothetical protein Tcur_0611 [Thermomonospora curvata DSM 43183]
Length=97
Score = 40.4 bits (93), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 25/80 (32%), Positives = 37/80 (47%), Gaps = 3/80 (3%)
Query 14 AEFGRHVEELVAEIESLVTRLHVT---WTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAA 70
A+F L+ E+ L +L W G+ AA+ Q+ W A M+ +AQL A
Sbjct 17 AQFSMTARALLDELTDLEGKLRAKLDRWDGDAQAAYWNYQKEWDAAAKDMQNVVAQLGVA 76
Query 71 GQSAHANYTGAMATNLGMWS 90
+ AH NY A N +W+
Sbjct 77 IREAHDNYQAAERANTSIWA 96
>gi|291298518|ref|YP_003509796.1| hypothetical protein Snas_0993 [Stackebrandtia nassauensis DSM
44728]
gi|290567738|gb|ADD40703.1| hypothetical protein Snas_0993 [Stackebrandtia nassauensis DSM
44728]
Length=91
Score = 38.5 bits (88), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 22/77 (29%), Positives = 38/77 (50%), Gaps = 1/77 (1%)
Query 14 AEFGRHVEELVAEIESLVTRLHVT-WTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAAGQ 72
AE G VEEL+ IE L+ R++ T W + ++ E Q W + + ++ L ++
Sbjct 15 AELGTKVEELIQVIEDLMARVNSTEWEADDRNSYQELQEMWNSEDNSLQDVLREIAGQVG 74
Query 73 SAHANYTGAMATNLGMW 89
+A Y +A N G +
Sbjct 75 TAREGYQSTIAHNAGRF 91
>gi|46117048|ref|XP_384542.1| hypothetical protein FG04366.1 [Gibberella zeae PH-1]
Length=1028
Score = 38.5 bits (88), Expect = 0.36, Method: Composition-based stats.
Identities = 27/86 (32%), Positives = 39/86 (46%), Gaps = 2/86 (2%)
Query 2 DPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMR 61
D TVL V + E RH+++ +A + +V + T T +GAA + R A M R
Sbjct 572 DRTVLVLNVEALDEVKRHIDDGIASLTEIVRDVKQTVTDQGAALQRVSDRQQEAANEMAR 631
Query 62 QALAQLTAAGQS--AHANYTGAMATN 85
AL+ TA S A +T N
Sbjct 632 LALSPPTAGASSDGARTQFTSGRKLN 657
>gi|154508575|ref|ZP_02044217.1| hypothetical protein ACTODO_01076 [Actinomyces odontolyticus
ATCC 17982]
gi|293191453|ref|ZP_06609195.1| putative low molecular weight protein antigen 7 [Actinomyces
odontolyticus F0309]
gi|153798209|gb|EDN80629.1| hypothetical protein ACTODO_01076 [Actinomyces odontolyticus
ATCC 17982]
gi|292820554|gb|EFF79530.1| putative low molecular weight protein antigen 7 [Actinomyces
odontolyticus F0309]
Length=93
Score = 38.1 bits (87), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 17/67 (26%), Positives = 35/67 (53%), Gaps = 0/67 (0%)
Query 19 HVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAAGQSAHANY 78
+++ + ++E + +L +W G+ A+ AQR W G ++ L + ++A SA +NY
Sbjct 23 NLQNCLDDLEQTLNQLRSSWEGQTQEAYDVAQRQWNQGLEGLKDVLRRTSSAVDSARSNY 82
Query 79 TGAMATN 85
+N
Sbjct 83 QQTDQSN 89
>gi|54027560|ref|YP_121802.1| hypothetical protein nfa55860 [Nocardia farcinica IFM 10152]
gi|54019068|dbj|BAD60438.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=99
Score = 37.4 bits (85), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 27/73 (37%), Positives = 39/73 (54%), Gaps = 2/73 (2%)
Query 16 FGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAAGQSAH 75
F H++ L ++ L+ WTG AAA+AE R W A + LAQ+ AG++AH
Sbjct 26 FADHLDALDRTVQGLMQSGQ--WTGAAAAAYAEEHRVWVAAARELLDGLAQMEQAGRTAH 83
Query 76 ANYTGAMATNLGM 88
+Y+ A NL M
Sbjct 84 ESYSDATTVNLRM 96
>gi|256390225|ref|YP_003111789.1| hypothetical protein Caci_1020 [Catenulispora acidiphila DSM
44928]
gi|256356451|gb|ACU69948.1| protein of unknown function DUF909 [Catenulispora acidiphila
DSM 44928]
Length=98
Score = 37.4 bits (85), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 18/64 (29%), Positives = 32/64 (50%), Gaps = 0/64 (0%)
Query 26 EIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAAGQSAHANYTGAMATN 85
++ +L+ L TW+G+ AA + QR W +A + + L Q+ + A Y A N
Sbjct 34 DLRTLLQPLTQTWSGQAAADYQARQRQWDQAQADLNEVLQQIGKVLEVAQQQYGDAERAN 93
Query 86 LGMW 89
+ +W
Sbjct 94 IDVW 97
>gi|54022450|ref|YP_116692.1| hypothetical protein nfa4855 [Nocardia farcinica IFM 10152]
gi|54013958|dbj|BAD55328.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=78
Score = 37.4 bits (85), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 23/73 (32%), Positives = 39/73 (54%), Gaps = 1/73 (1%)
Query 19 HVEELVAEIESLVTRLHVT-WTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAAGQSAHAN 77
++ E + E+++ V L T W G A A+ +A W AG + + ++ A + AHA
Sbjct 4 YIGECLDEVDAKVATLSGTGWEGVAAQAYVDAHVQWLAGARAFAEGVRDMSDAARVAHAR 63
Query 78 YTGAMATNLGMWS 90
YT A+ TN M++
Sbjct 64 YTRAVETNHKMFN 76
>gi|302865073|ref|YP_003833710.1| hypothetical protein Micau_0567 [Micromonospora aurantiaca ATCC
27029]
gi|315501573|ref|YP_004080460.1| hypothetical protein ML5_0762 [Micromonospora sp. L5]
gi|302567932|gb|ADL44134.1| hypothetical protein Micau_0567 [Micromonospora aurantiaca ATCC
27029]
gi|315408192|gb|ADU06309.1| hypothetical protein ML5_0762 [Micromonospora sp. L5]
Length=93
Score = 37.0 bits (84), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 23/89 (26%), Positives = 42/89 (48%), Gaps = 2/89 (2%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VD VL + +M R ++E + + +++T+L W G+ AA+ + Q W A +
Sbjct 5 VDYAVLESSNQQMQAISRTIDEKLDTLRAMLTKLE--WEGQDRAAYQQHQAQWDAAVRDI 62
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMW 89
+ L ++ A A NY +N +W
Sbjct 63 NKVLNEIGGAVGIARENYMTTEMSNSKVW 91
>gi|345002677|ref|YP_004805531.1| hypothetical protein SACTE_5173 [Streptomyces sp. SirexAA-E]
gi|344318303|gb|AEN12991.1| protein of unknown function DUF909 [Streptomyces sp. SirexAA-E]
Length=109
Score = 36.6 bits (83), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/89 (23%), Positives = 42/89 (48%), Gaps = 0/89 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VD + + +A M + + ++ ++ +E+ + L TW G+ A +AE Q W M
Sbjct 14 VDYSHMQNAADDMVQQTKAIDSILTNLEAELQELQKTWEGDDKAVYAEKQASWNHAVEEM 73
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMW 89
++ LA+ +A +Y + + +W
Sbjct 74 KRILAENSALLNDVSDSYQYSEKSLTSLW 102
>gi|169630195|ref|YP_001703844.1| hypothetical protein MAB_3113 [Mycobacterium abscessus ATCC 19977]
gi|169242162|emb|CAM63190.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=103
Score = 35.8 bits (81), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 43/90 (48%), Gaps = 2/90 (2%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVT--WTGEGAAAHAEAQRHWAAGEA 58
D + +R F V E + ++ES +L + WTG AAA++EA + W
Sbjct 11 FDLDHIEQVTSRARGFKEFVTENLDQLESRAQKLVQSGQWTGAAAAAYSEAHKEWMDAAR 70
Query 59 MMRQALAQLTAAGQSAHANYTGAMATNLGM 88
+ + L Q+ A ++AH Y+ A NL M
Sbjct 71 ELVEGLNQMEEAARTAHGAYSEAQEANLRM 100
>gi|296119294|ref|ZP_06837862.1| early secretory antigenic target EsaT6 [Corynebacterium ammoniagenes
DSM 20306]
gi|295967686|gb|EFG80943.1| early secretory antigenic target EsaT6 [Corynebacterium ammoniagenes
DSM 20306]
Length=95
Score = 35.8 bits (81), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 17/63 (27%), Positives = 33/63 (53%), Gaps = 0/63 (0%)
Query 6 LADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALA 65
+++A A + + L+ +++ ++ + TW GE AAA+ EAQ W A + LA
Sbjct 11 ISNAAADINSTSSRINGLLDDLKGVIAPMAATWEGESAAAYNEAQAKWDNAAAELNTVLA 70
Query 66 QLT 68
++
Sbjct 71 TIS 73
>gi|225420537|ref|ZP_03762840.1| hypothetical protein CLOSTASPAR_06882 [Clostridium asparagiforme
DSM 15981]
gi|225040822|gb|EEG51068.1| hypothetical protein CLOSTASPAR_06882 [Clostridium asparagiforme
DSM 15981]
Length=195
Score = 35.4 bits (80), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/74 (30%), Positives = 39/74 (53%), Gaps = 1/74 (1%)
Query 13 MAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAAGQ 72
M+ + VE+L IE+L R+ +++G G+AA AQ+ E+ R+A A+L +
Sbjct 11 MSTLTKAVEKLSGLIENLTNRMEGSFSGAGSAAQTAAQKVDTVAES-ARKAAAELERLQK 69
Query 73 SAHANYTGAMATNL 86
A + G + N+
Sbjct 70 EKDATFRGTIENNV 83
>gi|54022472|ref|YP_116714.1| hypothetical protein nfa5064 [Nocardia farcinica IFM 10152]
gi|54013980|dbj|BAD55350.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=99
Score = 35.4 bits (80), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 24/89 (27%), Positives = 41/89 (47%), Gaps = 1/89 (1%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVT-WTGEGAAAHAEAQRHWAAGEAM 59
VD L + V +++ ++E + I+ + L T W A A+ EA R W+
Sbjct 7 VDLDHLDEIVTKLSALAAFIDERLDGIDGRIATLSGTGWESIAAQAYTEAHRQWSVSARE 66
Query 60 MRQALAQLTAAGQSAHANYTGAMATNLGM 88
+ + L+ A + AHA+ T A+ N M
Sbjct 67 FVEGVKDLSDAAKKAHADLTYAVELNSNM 95
>gi|226305392|ref|YP_002765350.1| hypothetical protein RER_19030 [Rhodococcus erythropolis PR4]
gi|229489620|ref|ZP_04383483.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|226184507|dbj|BAH32611.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
gi|229323717|gb|EEN89475.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=97
Score = 35.4 bits (80), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 17/59 (29%), Positives = 30/59 (51%), Gaps = 0/59 (0%)
Query 12 RMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAA 70
+M + + + + +I+ +V +L W G G+ A +AQR WA + LAQ+ A
Sbjct 18 QMKQQAKAINDKQGDIKVVVDKLQAGWEGAGSEAWNDAQRRWAQASEELNMVLAQIAGA 76
>gi|170699241|ref|ZP_02890292.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
gi|170135845|gb|EDT04122.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
Length=840
Score = 35.4 bits (80), Expect = 3.1, Method: Composition-based stats.
Identities = 20/59 (34%), Positives = 36/59 (62%), Gaps = 6/59 (10%)
Query 38 WTGEGAAAHAEAQRHWA-----AGEAMMRQALAQLTAAGQSAHA-NYTGAMATNLGMWS 90
+T G+AA + AQR+ + +G A++ +L+ +T G SA + NY ++ +NLG W+
Sbjct 138 FTNHGSAAASSAQRYLSVPTHGSGGALLSHSLSFVTDPGSSASSLNYDLSLTSNLGGWT 196
>gi|297202748|ref|ZP_06920145.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|197716736|gb|EDY60770.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length=97
Score = 35.4 bits (80), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 20/64 (32%), Positives = 29/64 (46%), Gaps = 0/64 (0%)
Query 26 EIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAAGQSAHANYTGAMATN 85
E+ES V R+ +W G ++ Q W A M+Q L + A SA NY + N
Sbjct 32 ELESGVKRISASWEGSAQESYRAKQAEWDQRAASMQQTLEAIAKALDSAAQNYQATESKN 91
Query 86 LGMW 89
+W
Sbjct 92 AQIW 95
>gi|256380551|ref|YP_003104211.1| hypothetical protein Amir_6566 [Actinosynnema mirum DSM 43827]
gi|255924854|gb|ACU40365.1| protein of unknown function DUF909 [Actinosynnema mirum DSM 43827]
Length=96
Score = 34.7 bits (78), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 24/89 (27%), Positives = 39/89 (44%), Gaps = 0/89 (0%)
Query 1 VDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMM 60
VD L+ A ++ V+ + E++ + + W G + + EAQR W A +
Sbjct 7 VDFGQLSGAAGDISGHATKVQTELDELKQRIAPVVAQWDGVSSETYQEAQRKWDTAAADL 66
Query 61 RQALAQLTAAGQSAHANYTGAMATNLGMW 89
+Q LA + A +A Y A N G W
Sbjct 67 QQVLASIGTAVAAATEAYQSAEQRNTGRW 95
Lambda K H
0.316 0.123 0.364
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 129182109240
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40