BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3872
Length=99
Score E
Sequences producing significant alignments: (Bits) Value
gi|57117163|ref|YP_178021.1| PE family-related protein [Mycobact... 194 3e-48
gi|15843503|ref|NP_338540.1| hypothetical protein MT3986 [Mycoba... 192 1e-47
gi|31795046|ref|NP_857539.1| PE family-like protein [Mycobacteri... 192 1e-47
gi|340628844|ref|YP_004747296.1| PE family-like protein [Mycobac... 190 6e-47
gi|308406276|ref|ZP_07495784.2| PE family protein [Mycobacterium... 187 5e-46
gi|289445474|ref|ZP_06435218.1| PE family protein [Mycobacterium... 185 2e-45
gi|323717455|gb|EGB26659.1| PE family protein [Mycobacterium tub... 157 6e-37
gi|240168351|ref|ZP_04747010.1| PPE family protein [Mycobacteriu... 125 2e-27
gi|240170200|ref|ZP_04748859.1| PE family protein, PE34 [Mycobac... 100 1e-19
gi|240168503|ref|ZP_04747162.1| PE family protein, PE34 [Mycobac... 99.8 1e-19
gi|118619500|ref|YP_907832.1| PE family protein [Mycobacterium u... 99.8 1e-19
gi|289443461|ref|ZP_06433205.1| PE family protein [Mycobacterium... 97.1 7e-19
gi|289447594|ref|ZP_06437338.1| PE family protein [Mycobacterium... 97.1 8e-19
gi|289750551|ref|ZP_06509929.1| predicted protein [Mycobacterium... 97.1 9e-19
gi|289754074|ref|ZP_06513452.1| predicted protein [Mycobacterium... 97.1 9e-19
gi|183982898|ref|YP_001851189.1| PE family protein [Mycobacteriu... 96.7 1e-18
gi|289746084|ref|ZP_06505462.1| PE family protein [Mycobacterium... 96.7 1e-18
gi|240170199|ref|ZP_04748858.1| PE family protein [Mycobacterium... 95.9 2e-18
gi|289763927|ref|ZP_06523305.1| PE family protein [Mycobacterium... 95.9 2e-18
gi|118467647|ref|YP_884481.1| PE family protein [Mycobacterium s... 91.7 3e-17
gi|183985418|ref|YP_001853709.1| PE family protein [Mycobacteriu... 88.2 3e-16
gi|31794916|ref|NP_857409.1| PE family protein [Mycobacterium bo... 84.0 7e-15
gi|15843367|ref|NP_338404.1| PE family protein [Mycobacterium tu... 83.2 1e-14
gi|308372643|ref|ZP_07429349.2| PE family protein [Mycobacterium... 80.9 6e-14
gi|145221364|ref|YP_001132042.1| PE-like protein [Mycobacterium ... 77.8 6e-13
gi|296164967|ref|ZP_06847522.1| PE family protein [Mycobacterium... 75.1 3e-12
gi|120401104|ref|YP_950933.1| PE-like protein [Mycobacterium van... 72.8 2e-11
gi|183980219|ref|YP_001848510.1| PE family protein, PE35 [Mycoba... 67.4 7e-10
gi|108797049|ref|YP_637246.1| PE-like protein [Mycobacterium sp.... 52.4 2e-05
gi|145226055|ref|YP_001136709.1| PE domain-containing protein [M... 42.4 0.026
gi|118617769|ref|YP_906101.1| PE family protein [Mycobacterium u... 37.4 0.80
gi|317506395|ref|ZP_07964203.1| PE family protein [Segniliparus ... 37.4 0.82
gi|320161456|ref|YP_004174680.1| thiol-disulfide oxidoreductase ... 36.2 1.5
gi|169627151|ref|YP_001700800.1| PE family protein [Mycobacteriu... 35.4 2.9
gi|15839671|ref|NP_334708.1| PE family protein [Mycobacterium tu... 35.4 2.9
gi|118616921|ref|YP_905253.1| PE family protein [Mycobacterium u... 35.0 3.6
gi|240168499|ref|ZP_04747158.1| PE family protein, PE5 [Mycobact... 34.7 4.5
gi|118616216|ref|YP_904548.1| PE-PGRS family protein family prot... 34.3 5.6
gi|183984754|ref|YP_001853045.1| PE-PGRS family protein [Mycobac... 33.9 8.8
>gi|57117163|ref|YP_178021.1| PE family-related protein [Mycobacterium tuberculosis H37Rv]
gi|148663739|ref|YP_001285262.1| PPE family protein [Mycobacterium tuberculosis H37Ra]
gi|167967454|ref|ZP_02549731.1| hypothetical protein MtubH3_05222 [Mycobacterium tuberculosis
H37Ra]
9 more sequence titles
Length=99
Score = 194 bits (494), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 99/99 (100%), Positives = 99/99 (100%), Gaps = 0/99 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ
Sbjct 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE 99
LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE
Sbjct 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE 99
>gi|15843503|ref|NP_338540.1| hypothetical protein MT3986 [Mycobacterium tuberculosis CDC1551]
gi|289572130|ref|ZP_06452357.1| PE family protein [Mycobacterium tuberculosis T17]
gi|289764062|ref|ZP_06523440.1| PE family protein [Mycobacterium tuberculosis GM 1503]
gi|13883877|gb|AAK48354.1| hypothetical protein MT3986 [Mycobacterium tuberculosis CDC1551]
gi|289545885|gb|EFD49532.1| PE family protein [Mycobacterium tuberculosis T17]
gi|289711568|gb|EFD75584.1| PE family protein [Mycobacterium tuberculosis GM 1503]
Length=112
Score = 192 bits (489), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 98/98 (100%), Positives = 98/98 (100%), Gaps = 0/98 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ
Sbjct 15 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 74
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA 98
LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA
Sbjct 75 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA 112
>gi|31795046|ref|NP_857539.1| PE family-like protein [Mycobacterium bovis AF2122/97]
gi|148825080|ref|YP_001289834.1| PE family protein [Mycobacterium tuberculosis F11]
gi|253800922|ref|YP_003033924.1| hypothetical protein TBMG_03920 [Mycobacterium tuberculosis KZN
1435]
47 more sequence titles
Length=98
Score = 192 bits (489), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 98/98 (100%), Positives = 98/98 (100%), Gaps = 0/98 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ
Sbjct 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA 98
LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA
Sbjct 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA 98
>gi|340628844|ref|YP_004747296.1| PE family-like protein [Mycobacterium canettii CIPT 140010059]
gi|340007034|emb|CCC46225.1| PE family-related protein [Mycobacterium canettii CIPT 140010059]
Length=98
Score = 190 bits (483), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 96/98 (98%), Positives = 97/98 (99%), Gaps = 0/98 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ
Sbjct 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA 98
LLASN SAQDQLHRAGEAVQDVARTYSQ+DDGAAGVFA
Sbjct 61 LLASNVSAQDQLHRAGEAVQDVARTYSQVDDGAAGVFA 98
>gi|308406276|ref|ZP_07495784.2| PE family protein [Mycobacterium tuberculosis SUMu012]
gi|308363936|gb|EFP52787.1| PE family protein [Mycobacterium tuberculosis SUMu012]
Length=96
Score = 187 bits (475), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 96/96 (100%), Positives = 96/96 (100%), Gaps = 0/96 (0%)
Query 4 MSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLA 63
MSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLA
Sbjct 1 MSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLA 60
Query 64 SNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE 99
SNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE
Sbjct 61 SNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE 96
>gi|289445474|ref|ZP_06435218.1| PE family protein [Mycobacterium tuberculosis CPHL_A]
gi|298527345|ref|ZP_07014754.1| PE family protein [Mycobacterium tuberculosis 94_M4241A]
gi|289418432|gb|EFD15633.1| PE family protein [Mycobacterium tuberculosis CPHL_A]
gi|298497139|gb|EFI32433.1| PE family protein [Mycobacterium tuberculosis 94_M4241A]
Length=95
Score = 185 bits (469), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 95/95 (100%), Positives = 95/95 (100%), Gaps = 0/95 (0%)
Query 4 MSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLA 63
MSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLA
Sbjct 1 MSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLA 60
Query 64 SNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA 98
SNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA
Sbjct 61 SNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA 95
>gi|323717455|gb|EGB26659.1| PE family protein [Mycobacterium tuberculosis CDC1551A]
Length=82
Score = 157 bits (396), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 81/82 (99%), Positives = 82/82 (100%), Gaps = 0/82 (0%)
Query 17 VSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAG 76
+SDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAG
Sbjct 1 MSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAG 60
Query 77 EAVQDVARTYSQIDDGAAGVFA 98
EAVQDVARTYSQIDDGAAGVFA
Sbjct 61 EAVQDVARTYSQIDDGAAGVFA 82
>gi|240168351|ref|ZP_04747010.1| PPE family protein [Mycobacterium kansasii ATCC 12478]
Length=98
Score = 125 bits (313), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 76/98 (78%), Positives = 85/98 (87%), Gaps = 0/98 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
ME+MSH AADIG QVSDNAL GV AG+TA+TSVTGLVPAGADEVSAQAA AF S G Q
Sbjct 1 MEEMSHAAAAADIGGQVSDNALGGVAAGATAVTSVTGLVPAGADEVSAQAAAAFASAGAQ 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFA 98
+LASN+SAQ +L RAG+AVQD+ARTYSQ+DDGAAGV A
Sbjct 61 MLASNSSAQAELQRAGDAVQDIARTYSQVDDGAAGVVA 98
>gi|240170200|ref|ZP_04748859.1| PE family protein, PE34 [Mycobacterium kansasii ATCC 12478]
Length=111
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/94 (54%), Positives = 70/94 (75%), Gaps = 0/94 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
M+ MS DP+ ADIG+QV+D + AG+TA++SVTGL PAGADEVSAQA TAF ++
Sbjct 1 MQPMSFDPVVADIGSQVADLGTRSLQAGATAVSSVTGLAPAGADEVSAQAVTAFHTQAAS 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAA 94
+LA N +AQ++L R G A + VA++Y+ +D+ AA
Sbjct 61 MLALNQAAQEELVRTGAAFRQVAQSYTDVDEAAA 94
>gi|240168503|ref|ZP_04747162.1| PE family protein, PE34 [Mycobacterium kansasii ATCC 12478]
Length=111
Score = 99.8 bits (247), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/94 (62%), Positives = 71/94 (76%), Gaps = 0/94 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
M+ MS DP AADIG QV+DNA G+ AG+TA T +T L+PAGADEVSAQA AFT+E Q
Sbjct 1 MQSMSIDPAAADIGAQVADNASQGLQAGATASTPLTSLLPAGADEVSAQAVAAFTAEAAQ 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAA 94
LLA N +AQ++L RAGEA D+AR Y+ +D AA
Sbjct 61 LLALNQAAQEELRRAGEAFADIARMYTDVDTTAA 94
>gi|118619500|ref|YP_907832.1| PE family protein [Mycobacterium ulcerans Agy99]
gi|183985254|ref|YP_001853545.1| PE family protein, PE34 [Mycobacterium marinum M]
gi|118571610|gb|ABL06361.1| PE family protein [Mycobacterium ulcerans Agy99]
gi|183178580|gb|ACC43690.1| PE family protein, PE34 [Mycobacterium marinum M]
Length=111
Score = 99.8 bits (247), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/90 (56%), Positives = 65/90 (73%), Gaps = 0/90 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
M+ MS DP+AADIG Q+++ A G+ AG+TA TS+T + PAGADEVS QA AFT Q
Sbjct 1 MQSMSIDPVAADIGAQLAEGAFRGLQAGATAATSITSVRPAGADEVSTQAMLAFTKHAGQ 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQID 90
+LA N +AQ++L RAGEAV +AR Y+ D
Sbjct 61 MLALNQAAQEELRRAGEAVNAIARMYADTD 90
>gi|289443461|ref|ZP_06433205.1| PE family protein [Mycobacterium tuberculosis T46]
gi|289416380|gb|EFD13620.1| PE family protein [Mycobacterium tuberculosis T46]
Length=215
Score = 97.1 bits (240), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 68/98 (70%), Gaps = 1/98 (1%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTA-LTSVTGLVPAGADEVSAQAATAFTSEGI 59
M+ +SHDP A DIG+Q+ + G+ AG+ A + ++TGLVPAG +EVSAQA AF +E
Sbjct 1 MDSLSHDPAAGDIGSQLVEIGSRGLAAGNAATMPTMTGLVPAGGEEVSAQAVMAFATEAA 60
Query 60 QLLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 97
++ASN +AQ++L RAG A+ D+AR Y DD AAG
Sbjct 61 SMIASNTAAQEELMRAGTALTDIARMYGDTDDNAAGAL 98
>gi|289447594|ref|ZP_06437338.1| PE family protein [Mycobacterium tuberculosis CPHL_A]
gi|340626986|ref|YP_004745438.1| pe family protein [Mycobacterium canettii CIPT 140010059]
gi|289420552|gb|EFD17753.1| PE family protein [Mycobacterium tuberculosis CPHL_A]
gi|340005176|emb|CCC44325.1| pe family protein [Mycobacterium canettii CIPT 140010059]
Length=215
Score = 97.1 bits (240), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 68/98 (70%), Gaps = 1/98 (1%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTA-LTSVTGLVPAGADEVSAQAATAFTSEGI 59
M+ +SHDP A DIG+Q+ + G+ AG+ A + ++TGLVPAG +EVSAQA AF +E
Sbjct 1 MDSLSHDPAAGDIGSQLVEIGSRGLAAGNAATMPTMTGLVPAGGEEVSAQAVMAFATEAA 60
Query 60 QLLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 97
++ASN +AQ++L RAG A+ D+AR Y DD AAG
Sbjct 61 SMIASNTAAQEELMRAGTALTDIARMYGDTDDNAAGAL 98
>gi|289750551|ref|ZP_06509929.1| predicted protein [Mycobacterium tuberculosis T92]
gi|289691138|gb|EFD58567.1| predicted protein [Mycobacterium tuberculosis T92]
Length=255
Score = 97.1 bits (240), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 68/98 (70%), Gaps = 1/98 (1%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTA-LTSVTGLVPAGADEVSAQAATAFTSEGI 59
M+ +SHDP A DIG+Q+ + G+ AG+ A + ++TGLVPAG +EVSAQA AF +E
Sbjct 41 MDSLSHDPAAGDIGSQLVEIGSRGLAAGNAATMPTMTGLVPAGGEEVSAQAVMAFATEAA 100
Query 60 QLLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 97
++ASN +AQ++L RAG A+ D+AR Y DD AAG
Sbjct 101 SMIASNTAAQEELMRAGTALTDIARMYGDTDDNAAGAL 138
>gi|289754074|ref|ZP_06513452.1| predicted protein [Mycobacterium tuberculosis EAS054]
gi|289694661|gb|EFD62090.1| predicted protein [Mycobacterium tuberculosis EAS054]
Length=255
Score = 97.1 bits (240), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 68/98 (70%), Gaps = 1/98 (1%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTA-LTSVTGLVPAGADEVSAQAATAFTSEGI 59
M+ +SHDP A DIG+Q+ + G+ AG+ A + ++TGLVPAG +EVSAQA AF +E
Sbjct 41 MDSLSHDPAAGDIGSQLVEIGSRGLAAGNAATMPTMTGLVPAGGEEVSAQAVMAFATEAA 100
Query 60 QLLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 97
++ASN +AQ++L RAG A+ D+AR Y DD AAG
Sbjct 101 SMIASNTAAQEELMRAGTALTDIARMYGDTDDNAAGAL 138
>gi|183982898|ref|YP_001851189.1| PE family protein [Mycobacterium marinum M]
gi|183176224|gb|ACC41334.1| PE family protein [Mycobacterium marinum M]
Length=230
Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 48/87 (56%), Positives = 64/87 (74%), Gaps = 1/87 (1%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTA-LTSVTGLVPAGADEVSAQAATAFTSEGI 59
M+ MSHDP A DIG+Q+ D G++AG+TA +T +TGL+PAGA+EVSAQA AF E
Sbjct 1 MDPMSHDPAAGDIGSQLVDIGTQGISAGTTAAMTVLTGLIPAGAEEVSAQAVLAFAQEAA 60
Query 60 QLLASNASAQDQLHRAGEAVQDVARTY 86
+LASN +AQ++L R G A+ D+AR Y
Sbjct 61 TMLASNVAAQEELMRTGTALSDIARMY 87
>gi|289746084|ref|ZP_06505462.1| PE family protein [Mycobacterium tuberculosis 02_1987]
gi|294996901|ref|ZP_06802592.1| PE family protein [Mycobacterium tuberculosis 210]
gi|289686612|gb|EFD54100.1| PE family protein [Mycobacterium tuberculosis 02_1987]
gi|326903584|gb|EGE50517.1| hypothetical protein TBPG_01460 [Mycobacterium tuberculosis W-148]
gi|339294902|gb|AEJ47013.1| PE family protein [Mycobacterium tuberculosis CCDC5079]
gi|339298525|gb|AEJ50635.1| PE family protein [Mycobacterium tuberculosis CCDC5180]
Length=163
Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 68/98 (70%), Gaps = 1/98 (1%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTA-LTSVTGLVPAGADEVSAQAATAFTSEGI 59
M+ +SHDP A DIG+Q+ + G+ AG+ A + ++TGLVPAG +EVSAQA AF +E
Sbjct 1 MDSLSHDPAAGDIGSQLVEIGSRGLAAGNAATMPTMTGLVPAGGEEVSAQAVMAFATEAA 60
Query 60 QLLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 97
++ASN +AQ++L RAG A+ D+AR Y DD AAG
Sbjct 61 SMIASNTAAQEELMRAGTALTDIARMYGDTDDNAAGAL 98
>gi|240170199|ref|ZP_04748858.1| PE family protein [Mycobacterium kansasii ATCC 12478]
Length=214
Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/87 (56%), Positives = 64/87 (74%), Gaps = 1/87 (1%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTA-LTSVTGLVPAGADEVSAQAATAFTSEGI 59
M+ MSHDP A DIG+Q+ D G++AGSTA +T +TGL+PAGA+EVSAQA AF E
Sbjct 1 MDTMSHDPSAGDIGSQLVDIGSQGISAGSTAAMTVLTGLIPAGAEEVSAQAVMAFAQEAA 60
Query 60 QLLASNASAQDQLHRAGEAVQDVARTY 86
+LASN +AQ++L R G A+ ++AR Y
Sbjct 61 SMLASNVAAQEELMRTGSALTNIARMY 87
>gi|289763927|ref|ZP_06523305.1| PE family protein [Mycobacterium tuberculosis GM 1503]
gi|289711433|gb|EFD75449.1| PE family protein [Mycobacterium tuberculosis GM 1503]
Length=111
Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/99 (50%), Positives = 64/99 (65%), Gaps = 0/99 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
M+ MS DP ADIG+QV +NA G+ AG+ A S++ L+PAGA+EVSA A AFT+
Sbjct 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVMAFTTAATG 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE 99
LLA N +AQ++L +AGE +AR YS D AA E
Sbjct 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLE 99
>gi|118467647|ref|YP_884481.1| PE family protein [Mycobacterium smegmatis str. MC2 155]
gi|118168934|gb|ABK69830.1| PE family protein [Mycobacterium smegmatis str. MC2 155]
Length=97
Score = 91.7 bits (226), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 47/90 (53%), Positives = 63/90 (70%), Gaps = 0/90 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
M+ M+H+P A + QV NA G+ G+TA +VT LVPAGADEVSA AA AF SEG++
Sbjct 1 MQPMTHNPGAEAVAAQVIANAARGLAGGTTASAAVTALVPAGADEVSALAAVAFASEGVE 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQID 90
LA+NA AQ++L RAG A ++A Y+ +D
Sbjct 61 ALAANAFAQEELTRAGAAFAEIAGIYNAVD 90
>gi|183985418|ref|YP_001853709.1| PE family protein [Mycobacterium marinum M]
gi|183178744|gb|ACC43854.1| PE family protein [Mycobacterium marinum M]
Length=98
Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/97 (58%), Positives = 72/97 (75%), Gaps = 0/97 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
ME+ SH ADIGT +S NA GVT+ + AL SVTG+VPAGADEVS QAATAF +EG Q
Sbjct 1 MEQKSHGAAIADIGTLLSGNARIGVTSDAAALASVTGVVPAGADEVSTQAATAFAAEGAQ 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 97
LLAS+++AQ ++HRAGE+ +A +++ DGAA V
Sbjct 61 LLASSSAAQREIHRAGESPHRIAPNPAEVSDGAASVI 97
>gi|31794916|ref|NP_857409.1| PE family protein [Mycobacterium bovis AF2122/97]
gi|57117152|ref|YP_178011.1| PE family protein [Mycobacterium tuberculosis H37Rv]
gi|121639660|ref|YP_979884.1| putative PE family protein [Mycobacterium bovis BCG str. Pasteur
1173P2]
65 more sequence titles
Length=111
Score = 84.0 bits (206), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 50/99 (51%), Positives = 65/99 (66%), Gaps = 0/99 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
M+ MS DP ADIG+QV +NA G+ AG+ A S++ L+PAGA+EVSA A TAFT+
Sbjct 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE 99
LLA N +AQ++L +AGE +AR YS D AA E
Sbjct 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLE 99
>gi|15843367|ref|NP_338404.1| PE family protein [Mycobacterium tuberculosis CDC1551]
gi|13883731|gb|AAK48218.1| PE family protein [Mycobacterium tuberculosis CDC1551]
Length=123
Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/99 (51%), Positives = 65/99 (66%), Gaps = 0/99 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
M+ MS DP ADIG+QV +NA G+ AG+ A S++ L+PAGA+EVSA A TAFT+
Sbjct 13 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 72
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE 99
LLA N +AQ++L +AGE +AR YS D AA E
Sbjct 73 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLE 111
>gi|308372643|ref|ZP_07429349.2| PE family protein [Mycobacterium tuberculosis SUMu004]
gi|308373806|ref|ZP_07433713.2| PE family protein [Mycobacterium tuberculosis SUMu006]
gi|308376214|ref|ZP_07438050.2| PE family protein [Mycobacterium tuberculosis SUMu008]
9 more sequence titles
Length=108
Score = 80.9 bits (198), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 49/96 (52%), Positives = 63/96 (66%), Gaps = 0/96 (0%)
Query 4 MSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLA 63
MS DP ADIG+QV +NA G+ AG+ A S++ L+PAGA+EVSA A TAFT+ LLA
Sbjct 1 MSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATGLLA 60
Query 64 SNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE 99
N +AQ++L +AGE +AR YS D AA E
Sbjct 61 LNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLE 96
>gi|145221364|ref|YP_001132042.1| PE-like protein [Mycobacterium gilvum PYR-GCK]
gi|315441752|ref|YP_004074631.1| hypothetical protein Mspyr1_00620 [Mycobacterium sp. Spyr1]
gi|145213850|gb|ABP43254.1| PE-like protein [Mycobacterium gilvum PYR-GCK]
gi|315260055|gb|ADT96796.1| hypothetical protein Mspyr1_00620 [Mycobacterium sp. Spyr1]
Length=98
Score = 77.8 bits (190), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 41/90 (46%), Positives = 55/90 (62%), Gaps = 0/90 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
M+ ++H+P AA IG QV+ N G+ G+ A V+ L PAG DE+SA AA +F SEGIQ
Sbjct 1 MQPLNHNPAAAGIGGQVTANGARGLGVGTAATAEVSALAPAGVDEISAVAALSFASEGIQ 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQID 90
L NA AQ ++ RAG V + + Y D
Sbjct 61 TLGINAMAQQEIARAGATVIEASVAYEATD 90
>gi|296164967|ref|ZP_06847522.1| PE family protein [Mycobacterium parascrofulaceum ATCC BAA-614]
gi|295899615|gb|EFG79066.1| PE family protein [Mycobacterium parascrofulaceum ATCC BAA-614]
Length=237
Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/97 (50%), Positives = 64/97 (66%), Gaps = 0/97 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
ME M+HDP A IG QV + A G+ +G+ A +VT L PAGADEVS QA AF +EG
Sbjct 1 MEPMTHDPAAGAIGLQVVEIATQGLASGAAASVAVTALAPAGADEVSIQAVAAFAAEGAA 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 97
+LA N +AQ+++ R G A+ D+AR Y+Q+D AG
Sbjct 61 MLALNTAAQEEMARTGVALTDIARMYAQVDGETAGTL 97
>gi|120401104|ref|YP_950933.1| PE-like protein [Mycobacterium vanbaalenii PYR-1]
gi|119953922|gb|ABM10927.1| PE-like protein [Mycobacterium vanbaalenii PYR-1]
Length=98
Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/90 (46%), Positives = 59/90 (66%), Gaps = 0/90 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
M+ +SH+P A +G QV+ N G+ G+ A ++V+ L PAGADEVSA AA F +EGIQ
Sbjct 1 MQPLSHNPGAIGVGGQVTANGARGLATGTAATSAVSALAPAGADEVSAAAAVTFAAEGIQ 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQID 90
L NA AQ+++ RAG ++ + A Y +D
Sbjct 61 TLGINALAQEEIARAGASIIEAAGAYQAVD 90
>gi|183980219|ref|YP_001848510.1| PE family protein, PE35 [Mycobacterium marinum M]
gi|183173545|gb|ACC38655.1| PE family protein, PE35 [Mycobacterium marinum M]
Length=98
Score = 67.4 bits (163), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 48/91 (53%), Positives = 65/91 (72%), Gaps = 0/91 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
M MS DP AA + +S +A G+ AG+ A +SVTGL PAGADE+SAQ A AF +EG Q
Sbjct 1 MRSMSFDPAAASVAAAISAHASRGLDAGTAAASSVTGLAPAGADEISAQFAAAFAAEGAQ 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDD 91
+LA N +AQD+L RAG+A++ +A YS +D+
Sbjct 61 VLALNTAAQDELARAGQALRQIAGMYSAVDN 91
>gi|108797049|ref|YP_637246.1| PE-like protein [Mycobacterium sp. MCS]
gi|119866134|ref|YP_936086.1| PE domain-containing protein [Mycobacterium sp. KMS]
gi|126432671|ref|YP_001068362.1| PE domain-containing protein [Mycobacterium sp. JLS]
gi|108767468|gb|ABG06190.1| PE-like protein [Mycobacterium sp. MCS]
gi|119692223|gb|ABL89296.1| PE domain protein [Mycobacterium sp. KMS]
gi|126232471|gb|ABN95871.1| PE domain protein [Mycobacterium sp. JLS]
Length=97
Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/97 (47%), Positives = 60/97 (62%), Gaps = 0/97 (0%)
Query 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
M+ + H+P AA IG QV N G+ G+ A +VT LVPAGADEVSA AA F +EG +
Sbjct 1 MQPLEHNPGAAGIGGQVVANGARGLAGGTAATAAVTALVPAGADEVSAMAAATFAAEGAE 60
Query 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 97
LA N AQ++L RAG A +++ Y+ +D A F
Sbjct 61 TLALNTFAQEELSRAGAAFTEISGIYAAVDAANASTF 97
>gi|145226055|ref|YP_001136709.1| PE domain-containing protein [Mycobacterium gilvum PYR-GCK]
gi|145218518|gb|ABP47921.1| PE domain protein [Mycobacterium gilvum PYR-GCK]
Length=97
Score = 42.4 bits (98), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 23/52 (45%), Positives = 30/52 (58%), Gaps = 0/52 (0%)
Query 35 VTGLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAGEAVQDVARTY 86
V+ LVP+G DEVS AA AF ++GI+ A +A L AGE + V Y
Sbjct 34 VSALVPSGVDEVSVMAAAAFGAQGIEFSAMSAEGAAMLTLAGEGLTAVGAAY 85
>gi|118617769|ref|YP_906101.1| PE family protein [Mycobacterium ulcerans Agy99]
gi|183983006|ref|YP_001851297.1| PE family protein [Mycobacterium marinum M]
gi|118569879|gb|ABL04630.1| PE family protein [Mycobacterium ulcerans Agy99]
gi|183176332|gb|ACC41442.1| PE family protein [Mycobacterium marinum M]
Length=101
Score = 37.4 bits (85), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 22/66 (34%), Positives = 37/66 (57%), Gaps = 3/66 (4%)
Query 35 VTGLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAGEAVQDVARTYSQIDDGAA 94
+T +VP AD VS +AA F++ G++ +A ++L R+G V + +Y+ D AA
Sbjct 35 ITAVVPPAADPVSLEAAIGFSAHGVEHVAVTTEGIEELGRSGVGVGESGLSYASGDAAAA 94
Query 95 ---GVF 97
G+F
Sbjct 95 LTYGLF 100
>gi|317506395|ref|ZP_07964203.1| PE family protein [Segniliparus rugosus ATCC BAA-974]
gi|316255311|gb|EFV14573.1| PE family protein [Segniliparus rugosus ATCC BAA-974]
Length=102
Score = 37.4 bits (85), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 33/60 (55%), Gaps = 0/60 (0%)
Query 38 LVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 97
+VP AD VS QAA F++ G A A A ++L R+G V + A +Y+ D A VF
Sbjct 38 VVPPAADPVSLQAAAGFSARGSSHTAVAAEAVEELGRSGLGVAETAESYAVGDLQGAAVF 97
>gi|320161456|ref|YP_004174680.1| thiol-disulfide oxidoreductase [Anaerolinea thermophila UNI-1]
gi|319995309|dbj|BAJ64080.1| thiol-disulfide oxidoreductase [Anaerolinea thermophila UNI-1]
Length=200
Score = 36.2 bits (82), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/95 (29%), Positives = 44/95 (47%), Gaps = 12/95 (12%)
Query 14 GTQVSDNALHGVTAGSTALTSVTGLV----------PAGADEVSA--QAATAFTSEGIQL 61
G D L + G+ L+ + G V P +E+ A + A+ S+G+++
Sbjct 58 GFLAPDFELRSIDGGTIRLSDLRGKVVILNFWASWCPPCREEMPALQRVYQAYQSQGVEV 117
Query 62 LASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGV 96
+A NA++QD L VQD T+S + D GV
Sbjct 118 IAVNATSQDTLSDVLNFVQDNGLTFSVLLDEQGGV 152
>gi|169627151|ref|YP_001700800.1| PE family protein [Mycobacterium abscessus ATCC 19977]
gi|169629317|ref|YP_001702966.1| PE family protein [Mycobacterium abscessus ATCC 19977]
gi|169239118|emb|CAM60146.1| Probable PE family protein [Mycobacterium abscessus]
gi|169241284|emb|CAM62312.1| Hypothetical PE family protein [Mycobacterium abscessus]
Length=102
Score = 35.4 bits (80), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/60 (37%), Positives = 33/60 (55%), Gaps = 0/60 (0%)
Query 38 LVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 97
+VP AD VS + A F++ GI+ A A ++L RAG V + A +Y+ D AA +
Sbjct 38 VVPPAADPVSLETAAGFSARGIEHSGVAAQAVEELGRAGLGVSESAASYTTGDMQAAAAY 97
>gi|15839671|ref|NP_334708.1| PE family protein [Mycobacterium tuberculosis CDC1551]
gi|31791464|ref|NP_853957.1| PE family protein [Mycobacterium bovis AF2122/97]
gi|57116715|ref|YP_177710.1| PE family protein [Mycobacterium tuberculosis H37Rv]
81 more sequence titles
Length=102
Score = 35.4 bits (80), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 18/52 (35%), Positives = 29/52 (56%), Gaps = 0/52 (0%)
Query 35 VTGLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAGEAVQDVARTY 86
+T +VP AD VS Q A F+++G++ A ++L RAG V + +Y
Sbjct 35 ITAVVPPAADPVSLQTAAGFSAQGVEHAVVTAEGVEELGRAGVGVGESGASY 86
>gi|118616921|ref|YP_905253.1| PE family protein [Mycobacterium ulcerans Agy99]
gi|183980572|ref|YP_001848863.1| PE family protein, PE5 [Mycobacterium marinum M]
gi|118569031|gb|ABL03782.1| PE family protein [Mycobacterium ulcerans Agy99]
gi|183173898|gb|ACC39008.1| PE family protein, PE5 [Mycobacterium marinum M]
Length=102
Score = 35.0 bits (79), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 19/52 (37%), Positives = 30/52 (58%), Gaps = 0/52 (0%)
Query 35 VTGLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAGEAVQDVARTY 86
++ +VPA D VS Q A F+++GI+ A A ++L RAG V + +Y
Sbjct 35 ISAVVPAAVDPVSLQTAAGFSAQGIEHAAVAAEGVEELGRAGLGVGESGVSY 86
>gi|240168499|ref|ZP_04747158.1| PE family protein, PE5 [Mycobacterium kansasii ATCC 12478]
Length=104
Score = 34.7 bits (78), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 18/53 (34%), Positives = 30/53 (57%), Gaps = 0/53 (0%)
Query 35 VTGLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAGEAVQDVARTYS 87
+T +VP AD VS Q A F+ +G++ A A ++L R+G V + +Y+
Sbjct 35 ITAVVPPAADPVSLQTAAGFSGQGVEHAAIVAEGVEELGRSGVGVGEAGVSYA 87
>gi|118616216|ref|YP_904548.1| PE-PGRS family protein family protein [Mycobacterium ulcerans
Agy99]
gi|118568326|gb|ABL03077.1| PE-PGRS family protein family protein [Mycobacterium ulcerans
Agy99]
Length=99
Score = 34.3 bits (77), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 25/74 (34%), Positives = 37/74 (50%), Gaps = 5/74 (6%)
Query 22 LHGVTAGSTA-----LTSVTGLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAG 76
LH V AG++A + +TG+VPA ADEVSA A F + G +A AQ
Sbjct 18 LHSVGAGTSAGNAAAMAPITGVVPAAADEVSALTAAHFAAHGAMCQTLSAQAQAIYEMFV 77
Query 77 EAVQDVARTYSQID 90
+Q +Y++ +
Sbjct 78 TTLQACGGSYAETE 91
>gi|183984754|ref|YP_001853045.1| PE-PGRS family protein [Mycobacterium marinum M]
gi|183178080|gb|ACC43190.1| PE-PGRS family protein [Mycobacterium marinum M]
Length=99
Score = 33.9 bits (76), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 24/58 (42%), Positives = 33/58 (57%), Gaps = 7/58 (12%)
Query 22 LHGVTAG-----STALTSVTGLVPAGADEVSAQAATAFTSEG--IQLLASNASAQDQL 72
LH V AG + A+ +TG+VPA ADEVSA A F + G Q L++ A A ++
Sbjct 18 LHSVGAGMSAGNAAAMAPITGVVPAAADEVSALTAAHFAAHGAMYQTLSAQAQAIHEM 75
Lambda K H
0.309 0.121 0.321
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 129711308684
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40