BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3706c
Length=106
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610842|ref|NP_218223.1| proline rich protein [Mycobacterium... 204 4e-51
gi|340628680|ref|YP_004747132.1| hypothetical protein MCAN_37281... 201 3e-50
gi|297636384|ref|ZP_06954164.1| hypothetical protein MtubK4_1976... 197 5e-49
gi|167970861|ref|ZP_02553138.1| hypothetical protein MtubH3_2358... 163 8e-39
gi|240172818|ref|ZP_04751477.1| hypothetical protein MkanA1_2613... 95.1 3e-18
gi|183985188|ref|YP_001853479.1| hypothetical protein MMAR_5219 ... 86.3 1e-15
gi|118619449|ref|YP_907781.1| hypothetical protein MUL_4293 [Myc... 84.0 7e-15
gi|41406406|ref|NP_959242.1| hypothetical protein MAP0308c [Myco... 64.7 4e-09
gi|336460883|gb|EGO39768.1| hypothetical protein MAPs_37060 [Myc... 63.9 8e-09
gi|145221899|ref|YP_001132577.1| hypothetical protein Mflv_1307 ... 62.8 2e-08
gi|118472795|ref|YP_890473.1| hypothetical protein MSMEG_6254 [M... 57.4 6e-07
gi|120404607|ref|YP_954436.1| hypothetical protein Mvan_3639 [My... 52.8 2e-05
gi|254818518|ref|ZP_05223519.1| hypothetical protein MintA_01269... 52.0 3e-05
gi|120406440|ref|YP_956269.1| hypothetical protein Mvan_5494 [My... 51.2 5e-05
gi|254773400|ref|ZP_05214916.1| hypothetical protein MaviaA2_017... 49.7 1e-04
gi|118463566|ref|YP_879681.1| hypothetical protein MAV_0396 [Myc... 49.7 1e-04
gi|15843325|ref|NP_338362.1| hypothetical protein MT3808.1 [Myco... 49.7 1e-04
gi|308371427|ref|ZP_07667161.1| conserved proline rich protein [... 49.7 2e-04
gi|308380776|ref|ZP_07669282.1| conserved proline rich protein [... 49.3 2e-04
gi|308406223|ref|ZP_07669546.1| conserved proline rich protein [... 49.3 2e-04
gi|183981450|ref|YP_001849741.1| hypothetical protein MMAR_1428 ... 49.3 2e-04
gi|340628679|ref|YP_004747131.1| hypothetical protein MCAN_37271... 48.9 2e-04
gi|126436474|ref|YP_001072165.1| hypothetical protein Mjls_3898 ... 48.5 3e-04
gi|308369203|ref|ZP_07666685.1| conserved proline rich protein [... 48.5 3e-04
gi|289572356|ref|ZP_06452583.1| predicted protein [Mycobacterium... 48.1 4e-04
gi|342861928|ref|ZP_08718572.1| hypothetical protein MCOL_23675 ... 47.8 6e-04
gi|120401524|ref|YP_951353.1| hypothetical protein Mvan_0502 [My... 47.0 9e-04
gi|183982552|ref|YP_001850843.1| hypothetical protein MMAR_2539 ... 44.3 0.007
gi|120405589|ref|YP_955418.1| hypothetical protein Mvan_4637 [My... 43.5 0.011
gi|240173179|ref|ZP_04751837.1| hypothetical protein MkanA1_2795... 41.6 0.036
gi|296166797|ref|ZP_06849216.1| hypothetical protein HMPREF0591_... 41.2 0.058
gi|296165795|ref|ZP_06848301.1| conserved hypothetical protein [... 39.7 0.17
gi|254773401|ref|ZP_05214917.1| hypothetical protein MaviaA2_018... 38.5 0.30
gi|254821104|ref|ZP_05226105.1| hypothetical protein MintA_14302... 38.1 0.42
gi|296166798|ref|ZP_06849217.1| hypothetical protein HMPREF0591_... 38.1 0.47
gi|336460882|gb|EGO39767.1| hypothetical protein MAPs_37050 [Myc... 36.6 1.4
gi|118466855|ref|YP_879682.1| hypothetical protein MAV_0397 [Myc... 36.6 1.4
gi|342861929|ref|ZP_08718573.1| hypothetical protein MCOL_23680 ... 34.7 4.3
gi|183985187|ref|YP_001853478.1| hypothetical protein MMAR_5218 ... 34.7 4.6
gi|118619448|ref|YP_907780.1| hypothetical protein MUL_4292 [Myc... 33.9 9.4
>gi|15610842|ref|NP_218223.1| proline rich protein [Mycobacterium tuberculosis H37Rv]
gi|15843326|ref|NP_338363.1| hypothetical protein MT3809 [Mycobacterium tuberculosis CDC1551]
gi|31794878|ref|NP_857371.1| proline rich protein [Mycobacterium bovis AF2122/97]
70 more sequence titles
Length=106
Score = 204 bits (519), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 106/106 (100%), Positives = 106/106 (100%), Gaps = 0/106 (0%)
Query 1 MRHMSETSETPTPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHR 60
MRHMSETSETPTPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHR
Sbjct 1 MRHMSETSETPTPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHR 60
Query 61 QHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATPAP 106
QHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATPAP
Sbjct 61 QHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATPAP 106
>gi|340628680|ref|YP_004747132.1| hypothetical protein MCAN_37281 [Mycobacterium canettii CIPT
140010059]
gi|340006870|emb|CCC46059.1| conserved hypothetical proline rich protein [Mycobacterium canettii
CIPT 140010059]
Length=106
Score = 201 bits (511), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 104/106 (99%), Positives = 105/106 (99%), Gaps = 0/106 (0%)
Query 1 MRHMSETSETPTPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHR 60
MRHMSETSETPTP PHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAG+GGFHHR
Sbjct 1 MRHMSETSETPTPSPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGYGGFHHR 60
Query 61 QHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATPAP 106
QHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATPAP
Sbjct 61 QHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATPAP 106
>gi|297636384|ref|ZP_06954164.1| hypothetical protein MtubK4_19760 [Mycobacterium tuberculosis
KZN 4207]
gi|339296516|gb|AEJ48627.1| hypothetical protein CCDC5079_3438 [Mycobacterium tuberculosis
CCDC5079]
gi|339300116|gb|AEJ52226.1| hypothetical protein CCDC5180_3389 [Mycobacterium tuberculosis
CCDC5180]
Length=103
Score = 197 bits (501), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 103/103 (100%), Positives = 103/103 (100%), Gaps = 0/103 (0%)
Query 4 MSETSETPTPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHRQHH 63
MSETSETPTPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHRQHH
Sbjct 1 MSETSETPTPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHRQHH 60
Query 64 QHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATPAP 106
QHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATPAP
Sbjct 61 QHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATPAP 103
>gi|167970861|ref|ZP_02553138.1| hypothetical protein MtubH3_23580 [Mycobacterium tuberculosis
H37Ra]
gi|254552817|ref|ZP_05143264.1| hypothetical protein Mtube_20617 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|294995384|ref|ZP_06801075.1| hypothetical protein Mtub2_12946 [Mycobacterium tuberculosis
210]
gi|297733378|ref|ZP_06962496.1| hypothetical protein MtubKR_19900 [Mycobacterium tuberculosis
KZN R506]
gi|313660709|ref|ZP_07817589.1| hypothetical protein MtubKV_19895 [Mycobacterium tuberculosis
KZN V2475]
Length=86
Score = 163 bits (412), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 85/86 (99%), Positives = 86/86 (100%), Gaps = 0/86 (0%)
Query 21 VFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHRQHHQHPAMMLRPGSPHGGPA 80
+FKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHRQHHQHPAMMLRPGSPHGGPA
Sbjct 1 MFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHRQHHQHPAMMLRPGSPHGGPA 60
Query 81 AVRPGPGPGGPGQVPSSVSPPATPAP 106
AVRPGPGPGGPGQVPSSVSPPATPAP
Sbjct 61 AVRPGPGPGGPGQVPSSVSPPATPAP 86
>gi|240172818|ref|ZP_04751477.1| hypothetical protein MkanA1_26132 [Mycobacterium kansasii ATCC
12478]
Length=119
Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 69/117 (59%), Positives = 77/117 (66%), Gaps = 16/117 (13%)
Query 4 MSETSETPTPP----------------PHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYI 47
MSET E PT P PHQTPKVFKAAAWV I AG VFIVAVIFFTG+
Sbjct 1 MSETPEIPTAPTSTAVAPPPPPAVPPAPHQTPKVFKAAAWVVIVAGIVFIVAVIFFTGFR 60
Query 48 LGKHAGHGGFHHRQHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATP 104
LG +GHGG+ H +HH+ AMM R G P+GG AV P GPGGP QVP+SV+P TP
Sbjct 61 LGMQSGHGGYGHHRHHKPHAMMHRMGGPNGGVPAVSPSTGPGGPTQVPTSVAPATTP 117
>gi|183985188|ref|YP_001853479.1| hypothetical protein MMAR_5219 [Mycobacterium marinum M]
gi|183178514|gb|ACC43624.1| conserved hypothetical proline rich protein [Mycobacterium marinum
M]
Length=115
Score = 86.3 bits (212), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/113 (61%), Positives = 73/113 (65%), Gaps = 12/113 (10%)
Query 4 MSETSETPTPPP-----------HQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGK-H 51
MSETSET P HQTPKVFKAAAWVAI AG VFIV+VIFFTG+ LG
Sbjct 1 MSETSETSAAPTSTVVAPPPPPQHQTPKVFKAAAWVAIVAGIVFIVSVIFFTGFRLGMHS 60
Query 52 AGHGGFHHRQHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATP 104
G H HPAMMLRPG HGG AV PG GPGGPG+VP+SV+P TP
Sbjct 61 GHGGFHRGHHHKHHPAMMLRPGMHHGGAPAVSPGSGPGGPGEVPTSVAPSTTP 113
>gi|118619449|ref|YP_907781.1| hypothetical protein MUL_4293 [Mycobacterium ulcerans Agy99]
gi|118571559|gb|ABL06310.1| conserved hypothetical proline rich protein [Mycobacterium ulcerans
Agy99]
Length=115
Score = 84.0 bits (206), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 67/113 (60%), Positives = 72/113 (64%), Gaps = 12/113 (10%)
Query 4 MSETSETPTPPP-----------HQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGK-H 51
MSETSET P H TPKVFKAAAWVAI AG VFIV+VIFFTG+ LG
Sbjct 1 MSETSETSAAPTSTVVAPPPPPQHLTPKVFKAAAWVAIVAGIVFIVSVIFFTGFRLGMHS 60
Query 52 AGHGGFHHRQHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPPATP 104
G H HPAMMLRPG HGG AV PG GPGGPG+VP+SV+P TP
Sbjct 61 GHGGFHRGHHHKHHPAMMLRPGMHHGGAPAVSPGSGPGGPGEVPTSVAPSTTP 113
>gi|41406406|ref|NP_959242.1| hypothetical protein MAP0308c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41394755|gb|AAS02625.1| hypothetical protein MAP_0308c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=181
Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 35/58 (61%), Positives = 40/58 (69%), Gaps = 13/58 (22%)
Query 4 MSETSETPT-------------PPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYIL 48
MSETSETPT P++TP+VF+ AAWVAI AG VFIVAVIFFTG+IL
Sbjct 29 MSETSETPTVRTSTATAPAPAAAAPYRTPRVFQVAAWVAIVAGIVFIVAVIFFTGFIL 86
>gi|336460883|gb|EGO39768.1| hypothetical protein MAPs_37060 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=153
Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 35/58 (61%), Positives = 40/58 (69%), Gaps = 13/58 (22%)
Query 4 MSETSETPT-------------PPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYIL 48
MSETSETPT P++TP+VF+ AAWVAI AG VFIVAVIFFTG+IL
Sbjct 1 MSETSETPTVRTSTATAPAPAAAAPYRTPRVFQVAAWVAIVAGIVFIVAVIFFTGFIL 58
>gi|145221899|ref|YP_001132577.1| hypothetical protein Mflv_1307 [Mycobacterium gilvum PYR-GCK]
gi|315446365|ref|YP_004079244.1| hypothetical protein Mspyr1_48720 [Mycobacterium sp. Spyr1]
gi|145214385|gb|ABP43789.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315264668|gb|ADU01410.1| hypothetical protein Mspyr1_48720 [Mycobacterium sp. Spyr1]
Length=132
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/79 (50%), Positives = 48/79 (61%), Gaps = 5/79 (6%)
Query 15 PHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHRQHHQHPAMML--RP 72
P ++ ++ KAAAWV IAAG VFIVAV+FF G++LG++ GG R HH P MM
Sbjct 30 PQESNRLNKAAAWVGIAAGAVFIVAVVFFAGFLLGQNVDGGG---RAHHGGPGMMQPGPA 86
Query 73 GSPHGGPAAVRPGPGPGGP 91
G P G P GPG GP
Sbjct 87 GFPMGPPGGFHHGPGFAGP 105
>gi|118472795|ref|YP_890473.1| hypothetical protein MSMEG_6254 [Mycobacterium smegmatis str.
MC2 155]
gi|118174082|gb|ABK74978.1| hypothetical protein MSMEG_6254 [Mycobacterium smegmatis str.
MC2 155]
Length=156
Score = 57.4 bits (137), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 29/51 (57%), Positives = 37/51 (73%), Gaps = 1/51 (1%)
Query 20 KVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHRQHHQHPAMML 70
+V +AAAWV I AG VFIVAV+F TG++LGK++G G HHR H + MM
Sbjct 52 RVVQAAAWVGIVAGVVFIVAVVFGTGFVLGKNSG-PGHHHRGHDRPEIMMF 101
>gi|120404607|ref|YP_954436.1| hypothetical protein Mvan_3639 [Mycobacterium vanbaalenii PYR-1]
gi|119957425|gb|ABM14430.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=153
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/84 (39%), Positives = 41/84 (49%), Gaps = 15/84 (17%)
Query 4 MSETSETPTPPPHQTP--------------KVFKAAAWVAIAAGTVFIVAVIFFTGYILG 49
M+ET ET T P T ++ + AA V I AG VFI+A IFF+G++LG
Sbjct 1 MTETPETRTEPTAATTDRRESSAVQRDRPNRLNRIAALVGIVAGVVFIIAAIFFSGFVLG 60
Query 50 KHAGHGGFHHRQHHQHPAMMLRPG 73
H+G G F MM R G
Sbjct 61 AHSG-GDFGRDHRGDEFGMMNRDG 83
>gi|254818518|ref|ZP_05223519.1| hypothetical protein MintA_01269 [Mycobacterium intracellulare
ATCC 13950]
Length=83
Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/57 (55%), Positives = 33/57 (58%), Gaps = 12/57 (21%)
Query 4 MSETSETPTPPPHQTPK------------VFKAAAWVAIAAGTVFIVAVIFFTGYIL 48
MSET ETPT VF+ AAWVAI AG VFIVAVIFFTG+IL
Sbjct 1 MSETPETPTVRTSTATAPAAAAAAYRAPRVFQLAAWVAIVAGIVFIVAVIFFTGFIL 57
>gi|120406440|ref|YP_956269.1| hypothetical protein Mvan_5494 [Mycobacterium vanbaalenii PYR-1]
gi|119959258|gb|ABM16263.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=136
Score = 51.2 bits (121), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/72 (45%), Positives = 40/72 (56%), Gaps = 3/72 (4%)
Query 20 KVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHRQHHQHPAMMLRPGSPHGGP 79
++ KAA WV I AG+VFIVA IF G +GK+ G G +H H M+ P P GG
Sbjct 42 RLNKAALWVGIVAGSVFIVAAIFGAGVFVGKNIGDGPRNHHIGVMHHGPMMSPMGPQGG- 100
Query 80 AAVRPGPGPGGP 91
+ GPG GP
Sbjct 101 --FQRGPGSAGP 110
>gi|254773400|ref|ZP_05214916.1| hypothetical protein MaviaA2_01796 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=144
Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/28 (83%), Positives = 25/28 (90%), Gaps = 0/28 (0%)
Query 21 VFKAAAWVAIAAGTVFIVAVIFFTGYIL 48
VF+ AAWVAI AG VFIVAVIFFTG+IL
Sbjct 22 VFQVAAWVAIVAGIVFIVAVIFFTGFIL 49
>gi|118463566|ref|YP_879681.1| hypothetical protein MAV_0396 [Mycobacterium avium 104]
gi|118164853|gb|ABK65750.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=144
Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/28 (83%), Positives = 25/28 (90%), Gaps = 0/28 (0%)
Query 21 VFKAAAWVAIAAGTVFIVAVIFFTGYIL 48
VF+ AAWVAI AG VFIVAVIFFTG+IL
Sbjct 22 VFQVAAWVAIVAGIVFIVAVIFFTGFIL 49
>gi|15843325|ref|NP_338362.1| hypothetical protein MT3808.1 [Mycobacterium tuberculosis CDC1551]
gi|31794877|ref|NP_857370.1| proline rich protein [Mycobacterium bovis AF2122/97]
gi|57117146|ref|YP_178006.1| proline rich protein [Mycobacterium tuberculosis H37Rv]
47 more sequence titles
Length=129
Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/72 (48%), Positives = 42/72 (59%), Gaps = 13/72 (18%)
Query 19 PKVFKAAAWVAIAAGTVFIVAVIFFTG-YILGKHAGHGGFHHRQHHQHPAMMLRPGSPHG 77
P++++AAAWV I AG VF VAVIFF+G +LG+ G +H HH M RP P
Sbjct 29 PRLYRAAAWVVIVAGIVFTVAVIFFSGALVLGQ--GKCPYHRYYHHG----MFRPVGP-- 80
Query 78 GPAAVRPGPGPG 89
V PGPG G
Sbjct 81 ----VAPGPGMG 88
>gi|308371427|ref|ZP_07667161.1| conserved proline rich protein [Mycobacterium tuberculosis SUMu003]
gi|308372625|ref|ZP_07667424.1| conserved proline rich protein [Mycobacterium tuberculosis SUMu004]
gi|308372713|ref|ZP_07667441.1| conserved proline rich protein [Mycobacterium tuberculosis SUMu005]
11 more sequence titles
Length=129
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/72 (48%), Positives = 42/72 (59%), Gaps = 13/72 (18%)
Query 19 PKVFKAAAWVAIAAGTVFIVAVIFFTG-YILGKHAGHGGFHHRQHHQHPAMMLRPGSPHG 77
P++++AAAWV I AG VF VAVIFF+G +LG+ G +H HH M RP P
Sbjct 29 PRLYRAAAWVVIVAGIVFTVAVIFFSGALVLGQ--GKCPYHRYYHHG----MFRPVGP-- 80
Query 78 GPAAVRPGPGPG 89
V PGPG G
Sbjct 81 ----VAPGPGMG 88
>gi|308380776|ref|ZP_07669282.1| conserved proline rich protein [Mycobacterium tuberculosis SUMu011]
gi|308360399|gb|EFP49250.1| conserved proline rich protein [Mycobacterium tuberculosis SUMu011]
Length=154
Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/72 (48%), Positives = 42/72 (59%), Gaps = 13/72 (18%)
Query 19 PKVFKAAAWVAIAAGTVFIVAVIFFTG-YILGKHAGHGGFHHRQHHQHPAMMLRPGSPHG 77
P++++AAAWV I AG VF VAVIFF+G +LG+ G +H HH M RP P
Sbjct 29 PRLYRAAAWVVIVAGIVFTVAVIFFSGALVLGQ--GKCPYHRYYHHG----MFRPVGP-- 80
Query 78 GPAAVRPGPGPG 89
V PGPG G
Sbjct 81 ----VAPGPGMG 88
>gi|308406223|ref|ZP_07669546.1| conserved proline rich protein [Mycobacterium tuberculosis SUMu012]
gi|308364095|gb|EFP52946.1| conserved proline rich protein [Mycobacterium tuberculosis SUMu012]
Length=154
Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/72 (48%), Positives = 42/72 (59%), Gaps = 13/72 (18%)
Query 19 PKVFKAAAWVAIAAGTVFIVAVIFFTG-YILGKHAGHGGFHHRQHHQHPAMMLRPGSPHG 77
P++++AAAWV I AG VF VAVIFF+G +LG+ G +H HH M RP P
Sbjct 29 PRLYRAAAWVVIVAGIVFTVAVIFFSGALVLGQ--GKCPYHRYYHHG----MFRPVGP-- 80
Query 78 GPAAVRPGPGPG 89
V PGPG G
Sbjct 81 ----VAPGPGMG 88
>gi|183981450|ref|YP_001849741.1| hypothetical protein MMAR_1428 [Mycobacterium marinum M]
gi|183174776|gb|ACC39886.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=123
Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/82 (35%), Positives = 41/82 (50%), Gaps = 20/82 (24%)
Query 4 MSETSETPTPPPHQTP-------------------KVFKAAAWVAIAAGTVFIVAVIFFT 44
M+E+ E+PT P TP ++ + WV I AG +FI+AVIFF
Sbjct 1 MTESPESPTQPSGSTPEDRAAEPALPHQRQQAQPSRLTQVLEWVGIVAGVLFIIAVIFFW 60
Query 45 GYILGKHAGHG-GFHHRQHHQH 65
G+ +G+ +G G+HH H H
Sbjct 61 GFFMGRASGDSYGWHHGDHAAH 82
>gi|340628679|ref|YP_004747131.1| hypothetical protein MCAN_37271 [Mycobacterium canettii CIPT
140010059]
gi|340006869|emb|CCC46058.1| conserved hypothetical proline rich protein [Mycobacterium canettii
CIPT 140010059]
Length=129
Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/72 (48%), Positives = 42/72 (59%), Gaps = 13/72 (18%)
Query 19 PKVFKAAAWVAIAAGTVFIVAVIFFTG-YILGKHAGHGGFHHRQHHQHPAMMLRPGSPHG 77
P++++AAAWV I AG VF VAVIFF+G +LG+ G +H HH M RP P
Sbjct 29 PRLYRAAAWVVIVAGIVFTVAVIFFSGALVLGQ--GKCPYHRYYHHG----MFRPVGP-- 80
Query 78 GPAAVRPGPGPG 89
V PGPG G
Sbjct 81 ----VAPGPGMG 88
>gi|126436474|ref|YP_001072165.1| hypothetical protein Mjls_3898 [Mycobacterium sp. JLS]
gi|126436481|ref|YP_001072172.1| hypothetical protein Mjls_3906 [Mycobacterium sp. JLS]
gi|126236274|gb|ABN99674.1| hypothetical protein Mjls_3898 [Mycobacterium sp. JLS]
gi|126236281|gb|ABN99681.1| hypothetical protein Mjls_3906 [Mycobacterium sp. JLS]
Length=145
Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/62 (41%), Positives = 38/62 (62%), Gaps = 6/62 (9%)
Query 11 PTPPPHQTPKVFKAAA---WVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHRQHHQHPA 67
+P P+ + ++++ WV I AG VFIVAVIFF+G+ +G+H+ G F R + P
Sbjct 37 ESPSPYDDGRRNRSSSILVWVGIVAGVVFIVAVIFFSGFFIGRHS-DGNF--RGGYHQPG 93
Query 68 MM 69
MM
Sbjct 94 MM 95
>gi|308369203|ref|ZP_07666685.1| conserved proline rich protein [Mycobacterium tuberculosis SUMu002]
gi|308328324|gb|EFP17175.1| conserved proline rich protein [Mycobacterium tuberculosis SUMu002]
Length=138
Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/72 (48%), Positives = 42/72 (59%), Gaps = 13/72 (18%)
Query 19 PKVFKAAAWVAIAAGTVFIVAVIFFTG-YILGKHAGHGGFHHRQHHQHPAMMLRPGSPHG 77
P++++AAAWV I AG VF VAVIFF+G +LG+ G +H HH M RP P
Sbjct 29 PRLYRAAAWVVIVAGIVFTVAVIFFSGALVLGQ--GKCPYHRYYHHG----MFRPVGP-- 80
Query 78 GPAAVRPGPGPG 89
V PGPG G
Sbjct 81 ----VAPGPGMG 88
>gi|289572356|ref|ZP_06452583.1| predicted protein [Mycobacterium tuberculosis K85]
gi|289536787|gb|EFD41365.1| predicted protein [Mycobacterium tuberculosis K85]
Length=129
Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/72 (48%), Positives = 41/72 (57%), Gaps = 13/72 (18%)
Query 19 PKVFKAAAWVAIAAGTVFIVAVIFFTG-YILGKHAGHGGFHHRQHHQHPAMMLRPGSPHG 77
P+++ AAAWV I AG VF VAVIFF+G +LG+ G +H HH M RP P
Sbjct 29 PRLYLAAAWVVIVAGIVFTVAVIFFSGALVLGQ--GKCPYHRYYHHG----MFRPVGP-- 80
Query 78 GPAAVRPGPGPG 89
V PGPG G
Sbjct 81 ----VAPGPGMG 88
>gi|342861928|ref|ZP_08718572.1| hypothetical protein MCOL_23675 [Mycobacterium colombiense CECT
3035]
gi|342130468|gb|EGT83777.1| hypothetical protein MCOL_23675 [Mycobacterium colombiense CECT
3035]
Length=145
Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 22/27 (82%), Positives = 24/27 (89%), Gaps = 0/27 (0%)
Query 22 FKAAAWVAIAAGTVFIVAVIFFTGYIL 48
F+ AAWVAI AG VFIVAVIFFTG+IL
Sbjct 24 FQLAAWVAIVAGIVFIVAVIFFTGFIL 50
>gi|120401524|ref|YP_951353.1| hypothetical protein Mvan_0502 [Mycobacterium vanbaalenii PYR-1]
gi|120406355|ref|YP_956184.1| hypothetical protein Mvan_5408 [Mycobacterium vanbaalenii PYR-1]
gi|145225983|ref|YP_001136637.1| hypothetical protein Mflv_5388 [Mycobacterium gilvum PYR-GCK]
gi|119954342|gb|ABM11347.1| hypothetical protein Mvan_0502 [Mycobacterium vanbaalenii PYR-1]
gi|119959173|gb|ABM16178.1| hypothetical protein Mvan_5408 [Mycobacterium vanbaalenii PYR-1]
gi|145218446|gb|ABP47849.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=158
Score = 47.0 bits (110), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 29/40 (73%), Gaps = 0/40 (0%)
Query 12 TPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKH 51
+P + ++ ++AAWV I AG VFIVAVIFF+G+ +GK
Sbjct 24 SPGSDRPNRLTQSAAWVGIVAGVVFIVAVIFFSGFFVGKQ 63
>gi|183982552|ref|YP_001850843.1| hypothetical protein MMAR_2539 [Mycobacterium marinum M]
gi|183175878|gb|ACC40988.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=341
Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/81 (38%), Positives = 39/81 (49%), Gaps = 11/81 (13%)
Query 4 MSETSETPTPPPHQTP--------KVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHG 55
M+ET E+ T P T ++ + AWV I AG VFI AVIFF+ LG ++G
Sbjct 1 MAETPESTTKPATVTSQPRYDRSGRLSQVLAWVGIIAGAVFIAAVIFFSATFLGWYSGG- 59
Query 56 GFHHRQHHQHPAMMLRPGSPH 76
H+ H A L P S
Sbjct 60 --HYSWHRGGAAGQLSPRSSQ 78
>gi|120405589|ref|YP_955418.1| hypothetical protein Mvan_4637 [Mycobacterium vanbaalenii PYR-1]
gi|119958407|gb|ABM15412.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=155
Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query 16 HQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKH-AGHGGFHHRQHHQ 64
++ +V AAWV I AG +FIV ++F G +LG+ AG GF ++H
Sbjct 35 DRSGRVTHVAAWVGIVAGALFIVFLVFLAGVLLGRQSAGDDGFGRWRYHD 84
>gi|240173179|ref|ZP_04751837.1| hypothetical protein MkanA1_27951 [Mycobacterium kansasii ATCC
12478]
Length=107
Score = 41.6 bits (96), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 20/39 (52%), Positives = 26/39 (67%), Gaps = 1/39 (2%)
Query 23 KAAAWVAIAAGTVFIVAVIFFTGYILGKHAGHGGFHHRQ 61
+ WV I AG VFIVA+IFF+G+ LG+ A HG + R
Sbjct 31 QLLTWVGIIAGVVFIVALIFFSGFFLGR-ATHGPYGGRD 68
>gi|296166797|ref|ZP_06849216.1| hypothetical protein HMPREF0591_2657 [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897846|gb|EFG77433.1| hypothetical protein HMPREF0591_2657 [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=116
Score = 41.2 bits (95), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 17/27 (63%), Positives = 22/27 (82%), Gaps = 0/27 (0%)
Query 19 PKVFKAAAWVAIAAGTVFIVAVIFFTG 45
P+++KAAAWV I AG VFI+A +FF G
Sbjct 33 PRLYKAAAWVVIVAGIVFIIATVFFAG 59
>gi|296165795|ref|ZP_06848301.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898849|gb|EFG78349.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=113
Score = 39.7 bits (91), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 22/63 (35%), Positives = 35/63 (56%), Gaps = 9/63 (14%)
Query 1 MRHMSETSETPTPPPH--------QTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGKHA 52
M H SE++E+ +P + ++ V I AG VF+V++IFF+G+ LG+ A
Sbjct 1 MTHESESTESISPAESGQSDSHSGRADRLDLLLTVVGIVAGVVFVVSLIFFSGFFLGR-A 59
Query 53 GHG 55
HG
Sbjct 60 THG 62
>gi|254773401|ref|ZP_05214917.1| hypothetical protein MaviaA2_01801 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=141
Score = 38.5 bits (88), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 23/32 (72%), Gaps = 0/32 (0%)
Query 19 PKVFKAAAWVAIAAGTVFIVAVIFFTGYILGK 50
P+++ AAAWV I AG VFI++ +FF G + K
Sbjct 4 PRLYTAAAWVVIVAGIVFIISSVFFVGAFIWK 35
>gi|254821104|ref|ZP_05226105.1| hypothetical protein MintA_14302 [Mycobacterium intracellulare
ATCC 13950]
Length=129
Score = 38.1 bits (87), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 23/32 (72%), Gaps = 0/32 (0%)
Query 19 PKVFKAAAWVAIAAGTVFIVAVIFFTGYILGK 50
P+++ AAAWV I AG VFI++ +FF G + K
Sbjct 31 PRLYTAAAWVVIVAGIVFILSSVFFVGAFIWK 62
>gi|296166798|ref|ZP_06849217.1| hypothetical protein HMPREF0591_2658 [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897847|gb|EFG77434.1| hypothetical protein HMPREF0591_2658 [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=122
Score = 38.1 bits (87), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 51/123 (42%), Positives = 60/123 (49%), Gaps = 23/123 (18%)
Query 4 MSETSETPTP--------------PPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILG 49
MSE SE+PT P ++TP+VF AAAWVAI AG VFIV+VIFFTG LG
Sbjct 1 MSEASESPTARTSTATAPAPAAAQPLYKTPRVFIAAAWVAIVAGVVFIVSVIFFTGMALG 60
Query 50 KHAGH--------GGFHHRQHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSPP 101
H GH + H H + PG GPGQ+PSSV+P
Sbjct 61 HHGGHHHHHHKHPAAWMH-PHRMGGPGGPGGQAGVQQGGPASATPGAPGPGQIPSSVAPS 119
Query 102 ATP 104
TP
Sbjct 120 RTP 122
>gi|336460882|gb|EGO39767.1| hypothetical protein MAPs_37050 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=172
Score = 36.6 bits (83), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 15/30 (50%), Positives = 21/30 (70%), Gaps = 0/30 (0%)
Query 21 VFKAAAWVAIAAGTVFIVAVIFFTGYILGK 50
++ AAAWV I AG VFI++ +FF G + K
Sbjct 37 LYTAAAWVVIVAGIVFIISSVFFVGAFIWK 66
>gi|118466855|ref|YP_879682.1| hypothetical protein MAV_0397 [Mycobacterium avium 104]
gi|118168142|gb|ABK69039.1| hypothetical protein MAV_0397 [Mycobacterium avium 104]
Length=172
Score = 36.6 bits (83), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 15/30 (50%), Positives = 21/30 (70%), Gaps = 0/30 (0%)
Query 21 VFKAAAWVAIAAGTVFIVAVIFFTGYILGK 50
++ AAAWV I AG VFI++ +FF G + K
Sbjct 37 LYTAAAWVVIVAGIVFIISSVFFVGAFIWK 66
>gi|342861929|ref|ZP_08718573.1| hypothetical protein MCOL_23680 [Mycobacterium colombiense CECT
3035]
gi|342130469|gb|EGT83778.1| hypothetical protein MCOL_23680 [Mycobacterium colombiense CECT
3035]
Length=183
Score = 34.7 bits (78), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 16/31 (52%), Positives = 20/31 (65%), Gaps = 0/31 (0%)
Query 21 VFKAAAWVAIAAGTVFIVAVIFFTGYILGKH 51
++ AAAWV I AG VFI+ FF G + KH
Sbjct 39 LYTAAAWVVIVAGVVFILTSAFFVGAFIWKH 69
>gi|183985187|ref|YP_001853478.1| hypothetical protein MMAR_5218 [Mycobacterium marinum M]
gi|183178513|gb|ACC43623.1| conserved hypothetical proline rich protein [Mycobacterium marinum
M]
Length=145
Score = 34.7 bits (78), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 23/55 (42%), Positives = 29/55 (53%), Gaps = 13/55 (23%)
Query 4 MSETSETPTPPP-------------HQTPKVFKAAAWVAIAAGTVFIVAVIFFTG 45
MSETSE TPPP + P +++ AAWV I AG FI + +FF G
Sbjct 1 MSETSEPATPPPAVATAAPPPPPPVEKVPALYRVAAWVVIVAGITFIASTLFFAG 55
>gi|118619448|ref|YP_907780.1| hypothetical protein MUL_4292 [Mycobacterium ulcerans Agy99]
gi|118571558|gb|ABL06309.1| conserved hypothetical proline rich protein [Mycobacterium ulcerans
Agy99]
Length=124
Score = 33.9 bits (76), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 13/30 (44%), Positives = 19/30 (64%), Gaps = 0/30 (0%)
Query 16 HQTPKVFKAAAWVAIAAGTVFIVAVIFFTG 45
+ P +++ AAWV I AG FI + +FF G
Sbjct 2 EKVPALYRVAAWVVIVAGITFIASTLFFAG 31
Lambda K H
0.318 0.136 0.453
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130971515392
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40