BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3746c
Length=111
Sequences producing significant alignments: Score (Bits) E Value
gi|31794916|ref|NP_857409.1| PE family protein [Mycobacterium bo... 220 7e-56
gi|15843367|ref|NP_338404.1| PE family protein [Mycobacterium tu... 219 8e-56
gi|289763927|ref|ZP_06523305.1| PE family protein [Mycobacterium... 218 3e-55
gi|308372643|ref|ZP_07429349.2| PE family protein [Mycobacterium... 213 6e-54
gi|118619500|ref|YP_907832.1| PE family protein [Mycobacterium u... 120 5e-26
gi|240168503|ref|ZP_04747162.1| PE family protein, PE34 [Mycobac... 119 1e-25
gi|240170200|ref|ZP_04748859.1| PE family protein, PE34 [Mycobac... 103 6e-21
gi|57117163|ref|YP_178021.1| PE family-related protein [Mycobact... 97.8 5e-19
gi|289443461|ref|ZP_06433205.1| PE family protein [Mycobacterium... 96.7 1e-18
gi|289447594|ref|ZP_06437338.1| PE family protein [Mycobacterium... 96.7 1e-18
gi|31795046|ref|NP_857539.1| PE family-like protein [Mycobacteri... 96.3 1e-18
gi|340628844|ref|YP_004747296.1| PE family-like protein [Mycobac... 96.3 1e-18
gi|15843503|ref|NP_338540.1| hypothetical protein MT3986 [Mycoba... 95.9 2e-18
gi|289754074|ref|ZP_06513452.1| predicted protein [Mycobacterium... 95.9 2e-18
gi|289750551|ref|ZP_06509929.1| predicted protein [Mycobacterium... 95.9 2e-18
gi|289746084|ref|ZP_06505462.1| PE family protein [Mycobacterium... 95.1 3e-18
gi|308406276|ref|ZP_07495784.2| PE family protein [Mycobacterium... 94.7 4e-18
gi|289445474|ref|ZP_06435218.1| PE family protein [Mycobacterium... 93.2 1e-17
gi|240170199|ref|ZP_04748858.1| PE family protein [Mycobacterium... 89.7 1e-16
gi|183982898|ref|YP_001851189.1| PE family protein [Mycobacteriu... 86.3 1e-15
gi|240168351|ref|ZP_04747010.1| PPE family protein [Mycobacteriu... 74.7 4e-12
gi|118467647|ref|YP_884481.1| PE family protein [Mycobacterium s... 72.4 2e-11
gi|323717455|gb|EGB26659.1| PE family protein [Mycobacterium tub... 72.4 2e-11
gi|296164967|ref|ZP_06847522.1| PE family protein [Mycobacterium... 71.6 4e-11
gi|183980219|ref|YP_001848510.1| PE family protein, PE35 [Mycoba... 63.5 9e-09
gi|145221364|ref|YP_001132042.1| PE-like protein [Mycobacterium ... 60.5 9e-08
gi|120401104|ref|YP_950933.1| PE-like protein [Mycobacterium van... 56.6 1e-06
gi|108797049|ref|YP_637246.1| PE-like protein [Mycobacterium sp.... 54.7 5e-06
gi|183985418|ref|YP_001853709.1| PE family protein [Mycobacteriu... 53.1 1e-05
gi|154295459|ref|XP_001548165.1| predicted protein [Botryotinia ... 34.3 5.6
gi|333917216|ref|YP_004490948.1| HipA domain-containing protein ... 33.9 7.5
gi|224154639|ref|XP_002337498.1| predicted protein [Populus tric... 33.5 9.4
>gi|31794916|ref|NP_857409.1| PE family protein [Mycobacterium bovis AF2122/97]
gi|57117152|ref|YP_178011.1| PE family protein [Mycobacterium tuberculosis H37Rv]
gi|121639660|ref|YP_979884.1| putative PE family protein [Mycobacterium bovis BCG str. Pasteur
1173P2]
65 more sequence titles
Length=111
Score = 220 bits (560), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 111/111 (100%), Positives = 111/111 (100%), Gaps = 0/111 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG
Sbjct 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE 111
LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE
Sbjct 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE 111
>gi|15843367|ref|NP_338404.1| PE family protein [Mycobacterium tuberculosis CDC1551]
gi|13883731|gb|AAK48218.1| PE family protein [Mycobacterium tuberculosis CDC1551]
Length=123
Score = 219 bits (559), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 111/111 (100%), Positives = 111/111 (100%), Gaps = 0/111 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG
Sbjct 13 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 72
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE 111
LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE
Sbjct 73 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE 123
>gi|289763927|ref|ZP_06523305.1| PE family protein [Mycobacterium tuberculosis GM 1503]
gi|289711433|gb|EFD75449.1| PE family protein [Mycobacterium tuberculosis GM 1503]
Length=111
Score = 218 bits (554), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 110/111 (99%), Positives = 110/111 (99%), Gaps = 0/111 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAV AFTTAATG
Sbjct 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVMAFTTAATG 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE 111
LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE
Sbjct 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE 111
>gi|308372643|ref|ZP_07429349.2| PE family protein [Mycobacterium tuberculosis SUMu004]
gi|308373806|ref|ZP_07433713.2| PE family protein [Mycobacterium tuberculosis SUMu006]
gi|308376214|ref|ZP_07438050.2| PE family protein [Mycobacterium tuberculosis SUMu008]
9 more sequence titles
Length=108
Score = 213 bits (543), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 108/108 (100%), Positives = 108/108 (100%), Gaps = 0/108 (0%)
Query 4 MSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATGLLA 63
MSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATGLLA
Sbjct 1 MSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATGLLA 60
Query 64 LNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE 111
LNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE
Sbjct 61 LNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLARE 108
>gi|118619500|ref|YP_907832.1| PE family protein [Mycobacterium ulcerans Agy99]
gi|183985254|ref|YP_001853545.1| PE family protein, PE34 [Mycobacterium marinum M]
gi|118571610|gb|ABL06361.1| PE family protein [Mycobacterium ulcerans Agy99]
gi|183178580|gb|ACC43690.1| PE family protein, PE34 [Mycobacterium marinum M]
Length=111
Score = 120 bits (302), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 65/109 (60%), Positives = 78/109 (72%), Gaps = 0/109 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
MQSMS DP ADIG+Q+ AF+GLQAGA A S++S+ PAGA+EVS A+ AFT A
Sbjct 1 MQSMSIDPVAADIGAQLAEGAFRGLQAGATAATSITSVRPAGADEVSTQAMLAFTKHAGQ 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPGQTLA 109
+LALNQAAQEELR+AGE AIARMY+D DV A L++ R G LA
Sbjct 61 MLALNQAAQEELRRAGEAVNAIARMYADTDVAVARNLIDVGWRSGSALA 109
>gi|240168503|ref|ZP_04747162.1| PE family protein, PE34 [Mycobacterium kansasii ATCC 12478]
Length=111
Score = 119 bits (299), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/105 (68%), Positives = 77/105 (74%), Gaps = 0/105 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
MQSMS DPA ADIG+QV +NA QGLQAGA A L+SLLPAGA+EVSA AV AFT A
Sbjct 1 MQSMSIDPAAADIGAQVADNASQGLQAGATASTPLTSLLPAGADEVSAQAVAAFTAEAAQ 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAIPRPG 105
LLALNQAAQEELR+AGE F IARMY+D D AA L PG
Sbjct 61 LLALNQAAQEELRRAGEAFADIARMYTDVDTTAATSLTGVGLLPG 105
>gi|240170200|ref|ZP_04748859.1| PE family protein, PE34 [Mycobacterium kansasii ATCC 12478]
Length=111
Score = 103 bits (258), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 59/101 (59%), Positives = 71/101 (71%), Gaps = 0/101 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
MQ MSFDP VADIGSQV + + LQAGA A S++ L PAGA+EVSA AVTAF T A
Sbjct 1 MQPMSFDPVVADIGSQVADLGTRSLQAGATAVSSVTGLAPAGADEVSAQAVTAFHTQAAS 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLEAI 101
+LALNQAAQEEL + G F +A+ Y+D D AA +L A+
Sbjct 61 MLALNQAAQEELVRTGAAFRQVAQSYTDVDEAAAESVLLAL 101
>gi|57117163|ref|YP_178021.1| PE family-related protein [Mycobacterium tuberculosis H37Rv]
gi|148663739|ref|YP_001285262.1| PPE family protein [Mycobacterium tuberculosis H37Ra]
gi|167967454|ref|ZP_02549731.1| hypothetical protein MtubH3_05222 [Mycobacterium tuberculosis
H37Ra]
9 more sequence titles
Length=99
Score = 97.8 bits (242), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/99 (51%), Positives = 65/99 (66%), Gaps = 0/99 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
M+ MS DP ADIG+QV +NA G+ AG+ A S++ L+PAGA+EVSA A TAFT+
Sbjct 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLE 99
LLA N +AQ++L +AGE +AR YS D AA E
Sbjct 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE 99
>gi|289443461|ref|ZP_06433205.1| PE family protein [Mycobacterium tuberculosis T46]
gi|289416380|gb|EFD13620.1| PE family protein [Mycobacterium tuberculosis T46]
Length=215
Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/98 (55%), Positives = 64/98 (66%), Gaps = 1/98 (1%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWV-SLSSLLPAGAEEVSAWAVTAFTTAAT 59
M S+S DPA DIGSQ+V +GL AG A + +++ L+PAG EEVSA AV AF T A
Sbjct 1 MDSLSHDPAAGDIGSQLVEIGSRGLAAGNAATMPTMTGLVPAGGEEVSAQAVMAFATEAA 60
Query 60 GLLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
++A N AAQEEL +AG T IARMY D D AA L
Sbjct 61 SMIASNTAAQEELMRAGTALTDIARMYGDTDDNAAGAL 98
>gi|289447594|ref|ZP_06437338.1| PE family protein [Mycobacterium tuberculosis CPHL_A]
gi|340626986|ref|YP_004745438.1| pe family protein [Mycobacterium canettii CIPT 140010059]
gi|289420552|gb|EFD17753.1| PE family protein [Mycobacterium tuberculosis CPHL_A]
gi|340005176|emb|CCC44325.1| pe family protein [Mycobacterium canettii CIPT 140010059]
Length=215
Score = 96.7 bits (239), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/98 (55%), Positives = 64/98 (66%), Gaps = 1/98 (1%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWV-SLSSLLPAGAEEVSAWAVTAFTTAAT 59
M S+S DPA DIGSQ+V +GL AG A + +++ L+PAG EEVSA AV AF T A
Sbjct 1 MDSLSHDPAAGDIGSQLVEIGSRGLAAGNAATMPTMTGLVPAGGEEVSAQAVMAFATEAA 60
Query 60 GLLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
++A N AAQEEL +AG T IARMY D D AA L
Sbjct 61 SMIASNTAAQEELMRAGTALTDIARMYGDTDDNAAGAL 98
>gi|31795046|ref|NP_857539.1| PE family-like protein [Mycobacterium bovis AF2122/97]
gi|148825080|ref|YP_001289834.1| PE family protein [Mycobacterium tuberculosis F11]
gi|253800922|ref|YP_003033924.1| hypothetical protein TBMG_03920 [Mycobacterium tuberculosis KZN
1435]
47 more sequence titles
Length=98
Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/97 (51%), Positives = 64/97 (66%), Gaps = 0/97 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
M+ MS DP ADIG+QV +NA G+ AG+ A S++ L+PAGA+EVSA A TAFT+
Sbjct 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
LLA N +AQ++L +AGE +AR YS D AA
Sbjct 61 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 97
>gi|340628844|ref|YP_004747296.1| PE family-like protein [Mycobacterium canettii CIPT 140010059]
gi|340007034|emb|CCC46225.1| PE family-related protein [Mycobacterium canettii CIPT 140010059]
Length=98
Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/97 (51%), Positives = 64/97 (66%), Gaps = 0/97 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
M+ MS DP ADIG+QV +NA G+ AG+ A S++ L+PAGA+EVSA A TAFT+
Sbjct 1 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
LLA N +AQ++L +AGE +AR YS D AA
Sbjct 61 LLASNVSAQDQLHRAGEAVQDVARTYSQVDDGAAGVF 97
>gi|15843503|ref|NP_338540.1| hypothetical protein MT3986 [Mycobacterium tuberculosis CDC1551]
gi|289572130|ref|ZP_06452357.1| PE family protein [Mycobacterium tuberculosis T17]
gi|289764062|ref|ZP_06523440.1| PE family protein [Mycobacterium tuberculosis GM 1503]
gi|13883877|gb|AAK48354.1| hypothetical protein MT3986 [Mycobacterium tuberculosis CDC1551]
gi|289545885|gb|EFD49532.1| PE family protein [Mycobacterium tuberculosis T17]
gi|289711568|gb|EFD75584.1| PE family protein [Mycobacterium tuberculosis GM 1503]
Length=112
Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/97 (51%), Positives = 64/97 (66%), Gaps = 0/97 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
M+ MS DP ADIG+QV +NA G+ AG+ A S++ L+PAGA+EVSA A TAFT+
Sbjct 15 MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 74
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
LLA N +AQ++L +AGE +AR YS D AA
Sbjct 75 LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 111
>gi|289754074|ref|ZP_06513452.1| predicted protein [Mycobacterium tuberculosis EAS054]
gi|289694661|gb|EFD62090.1| predicted protein [Mycobacterium tuberculosis EAS054]
Length=255
Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/98 (55%), Positives = 64/98 (66%), Gaps = 1/98 (1%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWV-SLSSLLPAGAEEVSAWAVTAFTTAAT 59
M S+S DPA DIGSQ+V +GL AG A + +++ L+PAG EEVSA AV AF T A
Sbjct 41 MDSLSHDPAAGDIGSQLVEIGSRGLAAGNAATMPTMTGLVPAGGEEVSAQAVMAFATEAA 100
Query 60 GLLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
++A N AAQEEL +AG T IARMY D D AA L
Sbjct 101 SMIASNTAAQEELMRAGTALTDIARMYGDTDDNAAGAL 138
>gi|289750551|ref|ZP_06509929.1| predicted protein [Mycobacterium tuberculosis T92]
gi|289691138|gb|EFD58567.1| predicted protein [Mycobacterium tuberculosis T92]
Length=255
Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/98 (55%), Positives = 64/98 (66%), Gaps = 1/98 (1%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWV-SLSSLLPAGAEEVSAWAVTAFTTAAT 59
M S+S DPA DIGSQ+V +GL AG A + +++ L+PAG EEVSA AV AF T A
Sbjct 41 MDSLSHDPAAGDIGSQLVEIGSRGLAAGNAATMPTMTGLVPAGGEEVSAQAVMAFATEAA 100
Query 60 GLLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
++A N AAQEEL +AG T IARMY D D AA L
Sbjct 101 SMIASNTAAQEELMRAGTALTDIARMYGDTDDNAAGAL 138
>gi|289746084|ref|ZP_06505462.1| PE family protein [Mycobacterium tuberculosis 02_1987]
gi|294996901|ref|ZP_06802592.1| PE family protein [Mycobacterium tuberculosis 210]
gi|289686612|gb|EFD54100.1| PE family protein [Mycobacterium tuberculosis 02_1987]
gi|326903584|gb|EGE50517.1| hypothetical protein TBPG_01460 [Mycobacterium tuberculosis W-148]
gi|339294902|gb|AEJ47013.1| PE family protein [Mycobacterium tuberculosis CCDC5079]
gi|339298525|gb|AEJ50635.1| PE family protein [Mycobacterium tuberculosis CCDC5180]
Length=163
Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 53/98 (55%), Positives = 64/98 (66%), Gaps = 1/98 (1%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWV-SLSSLLPAGAEEVSAWAVTAFTTAAT 59
M S+S DPA DIGSQ+V +GL AG A + +++ L+PAG EEVSA AV AF T A
Sbjct 1 MDSLSHDPAAGDIGSQLVEIGSRGLAAGNAATMPTMTGLVPAGGEEVSAQAVMAFATEAA 60
Query 60 GLLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
++A N AAQEEL +AG T IARMY D D AA L
Sbjct 61 SMIASNTAAQEELMRAGTALTDIARMYGDTDDNAAGAL 98
>gi|308406276|ref|ZP_07495784.2| PE family protein [Mycobacterium tuberculosis SUMu012]
gi|308363936|gb|EFP52787.1| PE family protein [Mycobacterium tuberculosis SUMu012]
Length=96
Score = 94.7 bits (234), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 49/96 (52%), Positives = 63/96 (66%), Gaps = 0/96 (0%)
Query 4 MSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATGLLA 63
MS DP ADIG+QV +NA G+ AG+ A S++ L+PAGA+EVSA A TAFT+ LLA
Sbjct 1 MSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLA 60
Query 64 LNQAAQEELRKAGEVFTAIARMYSDADVRAAACLLE 99
N +AQ++L +AGE +AR YS D AA E
Sbjct 61 SNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE 96
>gi|289445474|ref|ZP_06435218.1| PE family protein [Mycobacterium tuberculosis CPHL_A]
gi|298527345|ref|ZP_07014754.1| PE family protein [Mycobacterium tuberculosis 94_M4241A]
gi|289418432|gb|EFD15633.1| PE family protein [Mycobacterium tuberculosis CPHL_A]
gi|298497139|gb|EFI32433.1| PE family protein [Mycobacterium tuberculosis 94_M4241A]
Length=95
Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/94 (52%), Positives = 62/94 (66%), Gaps = 0/94 (0%)
Query 4 MSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATGLLA 63
MS DP ADIG+QV +NA G+ AG+ A S++ L+PAGA+EVSA A TAFT+ LLA
Sbjct 1 MSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLA 60
Query 64 LNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
N +AQ++L +AGE +AR YS D AA
Sbjct 61 SNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVF 94
>gi|240170199|ref|ZP_04748858.1| PE family protein [Mycobacterium kansasii ATCC 12478]
Length=214
Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 48/87 (56%), Positives = 60/87 (69%), Gaps = 1/87 (1%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVS-LSSLLPAGAEEVSAWAVTAFTTAAT 59
M +MS DP+ DIGSQ+V+ QG+ AG+ A ++ L+ L+PAGAEEVSA AV AF A
Sbjct 1 MDTMSHDPSAGDIGSQLVDIGSQGISAGSTAAMTVLTGLIPAGAEEVSAQAVMAFAQEAA 60
Query 60 GLLALNQAAQEELRKAGEVFTAIARMY 86
+LA N AAQEEL + G T IARMY
Sbjct 61 SMLASNVAAQEELMRTGSALTNIARMY 87
>gi|183982898|ref|YP_001851189.1| PE family protein [Mycobacterium marinum M]
gi|183176224|gb|ACC41334.1| PE family protein [Mycobacterium marinum M]
Length=230
Score = 86.3 bits (212), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/87 (56%), Positives = 58/87 (67%), Gaps = 1/87 (1%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVS-LSSLLPAGAEEVSAWAVTAFTTAAT 59
M MS DPA DIGSQ+V+ QG+ AG A ++ L+ L+PAGAEEVSA AV AF A
Sbjct 1 MDPMSHDPAAGDIGSQLVDIGTQGISAGTTAAMTVLTGLIPAGAEEVSAQAVLAFAQEAA 60
Query 60 GLLALNQAAQEELRKAGEVFTAIARMY 86
+LA N AAQEEL + G + IARMY
Sbjct 61 TMLASNVAAQEELMRTGTALSDIARMY 87
>gi|240168351|ref|ZP_04747010.1| PPE family protein [Mycobacterium kansasii ATCC 12478]
Length=98
Score = 74.7 bits (182), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 48/97 (50%), Positives = 62/97 (64%), Gaps = 0/97 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
M+ MS A ADIG QV +NA G+ AGA A S++ L+PAGA+EVSA A AF +A
Sbjct 1 MEEMSHAAAAADIGGQVSDNALGGVAAGATAVTSVTGLVPAGADEVSAQAAAAFASAGAQ 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
+LA N +AQ EL++AG+ IAR YS D AA +
Sbjct 61 MLASNSSAQAELQRAGDAVQDIARTYSQVDDGAAGVV 97
>gi|118467647|ref|YP_884481.1| PE family protein [Mycobacterium smegmatis str. MC2 155]
gi|118168934|gb|ABK69830.1| PE family protein [Mycobacterium smegmatis str. MC2 155]
Length=97
Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/97 (43%), Positives = 58/97 (60%), Gaps = 0/97 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
MQ M+ +P + +QV+ NA +GL G A ++++L+PAGA+EVSA A AF +
Sbjct 1 MQPMTHNPGAEAVAAQVIANAARGLAGGTTASAAVTALVPAGADEVSALAAVAFASEGVE 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
LA N AQEEL +AG F IA +Y+ D AA +
Sbjct 61 ALAANAFAQEELTRAGAAFAEIAGIYNAVDAANAATM 97
>gi|323717455|gb|EGB26659.1| PE family protein [Mycobacterium tuberculosis CDC1551A]
Length=82
Score = 72.4 bits (176), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/79 (49%), Positives = 51/79 (65%), Gaps = 0/79 (0%)
Query 19 NNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATGLLALNQAAQEELRKAGEV 78
+NA G+ AG+ A S++ L+PAGA+EVSA A TAFT+ LLA N +AQ++L +AGE
Sbjct 3 DNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAGEA 62
Query 79 FTAIARMYSDADVRAAACL 97
+AR YS D AA
Sbjct 63 VQDVARTYSQIDDGAAGVF 81
>gi|296164967|ref|ZP_06847522.1| PE family protein [Mycobacterium parascrofulaceum ATCC BAA-614]
gi|295899615|gb|EFG79066.1| PE family protein [Mycobacterium parascrofulaceum ATCC BAA-614]
Length=237
Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 49/97 (51%), Positives = 61/97 (63%), Gaps = 0/97 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
M+ M+ DPA IG QVV A QGL +GA A V++++L PAGA+EVS AV AF
Sbjct 1 MEPMTHDPAAGAIGLQVVEIATQGLASGAAASVAVTALAPAGADEVSIQAVAAFAAEGAA 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
+LALN AAQEE+ + G T IARMY+ D A L
Sbjct 61 MLALNTAAQEEMARTGVALTDIARMYAQVDGETAGTL 97
>gi|183980219|ref|YP_001848510.1| PE family protein, PE35 [Mycobacterium marinum M]
gi|183173545|gb|ACC38655.1| PE family protein, PE35 [Mycobacterium marinum M]
Length=98
Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 43/90 (48%), Positives = 57/90 (64%), Gaps = 0/90 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
M+SMSFDPA A + + + +A +GL AG A S++ L PAGA+E+SA AF
Sbjct 1 MRSMSFDPAAASVAAAISAHASRGLDAGTAAASSVTGLAPAGADEISAQFAAAFAAEGAQ 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDAD 90
+LALN AAQ+EL +AG+ IA MYS D
Sbjct 61 VLALNTAAQDELARAGQALRQIAGMYSAVD 90
>gi|145221364|ref|YP_001132042.1| PE-like protein [Mycobacterium gilvum PYR-GCK]
gi|315441752|ref|YP_004074631.1| hypothetical protein Mspyr1_00620 [Mycobacterium sp. Spyr1]
gi|145213850|gb|ABP43254.1| PE-like protein [Mycobacterium gilvum PYR-GCK]
gi|315260055|gb|ADT96796.1| hypothetical protein Mspyr1_00620 [Mycobacterium sp. Spyr1]
Length=98
Score = 60.5 bits (145), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 35/98 (36%), Positives = 51/98 (53%), Gaps = 0/98 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
MQ ++ +PA A IG QV N +GL G A +S+L PAG +E+SA A +F +
Sbjct 1 MQPLNHNPAAAGIGGQVTANGARGLGVGTAATAEVSALAPAGVDEISAVAALSFASEGIQ 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLL 98
L +N AQ+E+ +AG + Y D A L+
Sbjct 61 TLGINAMAQQEIARAGATVIEASVAYEATDAANGAKLI 98
>gi|120401104|ref|YP_950933.1| PE-like protein [Mycobacterium vanbaalenii PYR-1]
gi|119953922|gb|ABM10927.1| PE-like protein [Mycobacterium vanbaalenii PYR-1]
Length=98
Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/98 (38%), Positives = 49/98 (50%), Gaps = 0/98 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
MQ +S +P +G QV N +GL G A ++S+L PAGA+EVSA A F
Sbjct 1 MQPLSHNPGAIGVGGQVTANGARGLATGTAATSAVSALAPAGADEVSAAAAVTFAAEGIQ 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACLL 98
L +N AQEE+ +AG A Y D A L+
Sbjct 61 TLGINALAQEEIARAGASIIEAAGAYQAVDASNATTLI 98
>gi|108797049|ref|YP_637246.1| PE-like protein [Mycobacterium sp. MCS]
gi|119866134|ref|YP_936086.1| PE domain-containing protein [Mycobacterium sp. KMS]
gi|126432671|ref|YP_001068362.1| PE domain-containing protein [Mycobacterium sp. JLS]
gi|108767468|gb|ABG06190.1| PE-like protein [Mycobacterium sp. MCS]
gi|119692223|gb|ABL89296.1| PE domain protein [Mycobacterium sp. KMS]
gi|126232471|gb|ABN95871.1| PE domain protein [Mycobacterium sp. JLS]
Length=97
Score = 54.7 bits (130), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/90 (46%), Positives = 54/90 (60%), Gaps = 0/90 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
MQ + +P A IG QVV N +GL G A ++++L+PAGA+EVSA A F
Sbjct 1 MQPLEHNPGAAGIGGQVVANGARGLAGGTAATAAVTALVPAGADEVSAMAAATFAAEGAE 60
Query 61 LLALNQAAQEELRKAGEVFTAIARMYSDAD 90
LALN AQEEL +AG FT I+ +Y+ D
Sbjct 61 TLALNTFAQEELSRAGAAFTEISGIYAAVD 90
>gi|183985418|ref|YP_001853709.1| PE family protein [Mycobacterium marinum M]
gi|183178744|gb|ACC43854.1| PE family protein [Mycobacterium marinum M]
Length=98
Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/83 (44%), Positives = 50/83 (61%), Gaps = 0/83 (0%)
Query 1 MQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTAATG 60
M+ S A+ADIG+ + NA G+ + A A S++ ++PAGA+EVS A TAF
Sbjct 1 MEQKSHGAAIADIGTLLSGNARIGVTSDAAALASVTGVVPAGADEVSTQAATAFAAEGAQ 60
Query 61 LLALNQAAQEELRKAGEVFTAIA 83
LLA + AAQ E+ +AGE IA
Sbjct 61 LLASSSAAQREIHRAGESPHRIA 83
>gi|154295459|ref|XP_001548165.1| predicted protein [Botryotinia fuckeliana B05.10]
gi|150844076|gb|EDN19269.1| predicted protein [Botryotinia fuckeliana B05.10]
Length=242
Score = 34.3 bits (77), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 18/68 (27%), Positives = 33/68 (49%), Gaps = 0/68 (0%)
Query 38 LLPAGAEEVSAWAVTAFTTAATGLLALNQAAQEELRKAGEVFTAIARMYSDADVRAAACL 97
++P ++W++T T A G+L QA + + KA + + R + +VR
Sbjct 131 IMPNAEVRSTSWSLTEETQKAAGILGREQAIKNAISKADDYARVLGRKVAATEVRDNGTN 190
Query 98 LEAIPRPG 105
L+A+ R G
Sbjct 191 LKAVKRKG 198
>gi|333917216|ref|YP_004490948.1| HipA domain-containing protein [Delftia sp. Cs1-4]
gi|333747416|gb|AEF92593.1| HipA domain protein [Delftia sp. Cs1-4]
Length=462
Score = 33.9 bits (76), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 31/97 (32%), Positives = 41/97 (43%), Gaps = 9/97 (9%)
Query 1 MQSMSFD---PAVADI-----GSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVT 52
++S FD PA D+ G V+ A G G + VSL L EV WA T
Sbjct 277 LESERFDRSQPAPGDVPSVVPGDMPVSQAHPGSNPGRIGMVSLQVLNAQYVGEVDNWAAT 336
Query 53 AFTTAATGLLALNQAAQEELRKA-GEVFTAIARMYSD 88
A AA GL+ A L +A G++ R Y +
Sbjct 337 ANRLAARGLITKADARSLRLLEAYGQLIANTDRHYGN 373
>gi|224154639|ref|XP_002337498.1| predicted protein [Populus trichocarpa]
gi|222839475|gb|EEE77812.1| predicted protein [Populus trichocarpa]
Length=445
Score = 33.5 bits (75), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 31/92 (34%), Positives = 40/92 (44%), Gaps = 5/92 (5%)
Query 1 MQSMSFD---PAVADIGSQVVNNAFQGLQAGAVAWVSLSSLLPAGAEEVSAWAVTAFTTA 57
++S FD PA D G V+ A G G + VSL L EV WA TA A
Sbjct 80 LESERFDRSQPAPGD-GDVPVSQAHPGSNPGRIGMVSLQVLNAQYVGEVDNWAATANRLA 138
Query 58 ATGLLALNQAAQEELRKA-GEVFTAIARMYSD 88
A GL+ A L +A G++ R Y +
Sbjct 139 ARGLITEADARSLRLLEAYGQLIANTDRHYGN 170
Lambda K H
0.316 0.125 0.349
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128534824512
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
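
The gapped Lambda and K values and the effective search space reported above link the raw scores (the numbers in parentheses on each "Score =" line) to the bit scores and Expect values. A minimal sketch of that arithmetic follows, assuming the standard Karlin-Altschul relations bits = (Lambda * S - ln K) / ln 2 and E = search_space * 2^(-bits). The function and constant names are illustrative and are not part of the BLAST output.

```python
import math

# Gapped Karlin-Altschul parameters and search space copied from the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 128534824512


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value: search_space * 2 ** (-bit_score)."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from three hits in this report (560, 242, 77).
    for raw in (560, 242, 77):
        print(raw, round(bit_score(raw), 2), f"{expect_value(raw):.1e}")
```

For the raw scores 560, 242 and 77 this gives roughly 220.3, 97.8 and 34.3 bits, in line with the 220, 97.8 and 34.3 bits listed in the report. The Expect values computed this way are of the same order as the reported ones but not identical (for example, about 6 versus the reported 5.6 for the raw score of 77). One plausible reason is the "Compositional matrix adjust" method named on each hit, which this sketch does not model.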