BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1906c
Length=156
Score E
Sequences producing significant alignments: (Bits) Value
gi|15841377|ref|NP_336414.1| hypothetical protein MT1957 [Mycoba... 308 1e-82
gi|15609043|ref|NP_216422.1| hypothetical protein Rv1906c [Mycob... 308 2e-82
gi|339294841|gb|AEJ46952.1| hypothetical protein CCDC5079_1762 [... 258 2e-67
gi|240172583|ref|ZP_04751242.1| hypothetical protein MkanA1_2493... 213 6e-54
gi|15828088|ref|NP_302351.1| lipoprotein [Mycobacterium leprae T... 184 5e-45
gi|183982808|ref|YP_001851099.1| hypothetical protein MMAR_2802 ... 178 2e-43
gi|118618336|ref|YP_906668.1| hypothetical protein MUL_2927 [Myc... 176 8e-43
gi|342859558|ref|ZP_08716211.1| hypothetical protein MCOL_11783 ... 139 2e-31
gi|41407728|ref|NP_960564.1| hypothetical protein MAP1630c [Myco... 135 3e-30
gi|118464436|ref|YP_881982.1| hypothetical protein MAV_2795 [Myc... 131 3e-29
gi|336457388|gb|EGO36397.1| hypothetical protein MAPs_22530 [Myc... 131 4e-29
gi|296164931|ref|ZP_06847487.1| conserved hypothetical protein [... 129 2e-28
gi|254775275|ref|ZP_05216791.1| hypothetical protein MaviaA2_114... 113 7e-24
gi|254821873|ref|ZP_05226874.1| hypothetical protein MintA_18207... 109 2e-22
gi|118471492|ref|YP_887795.1| hypothetical protein MSMEG_3491 [M... 103 7e-21
gi|145221660|ref|YP_001132338.1| hypothetical protein Mflv_1068 ... 100 1e-19
gi|126438120|ref|YP_001073811.1| hypothetical protein Mjls_5557 ... 94.7 4e-18
gi|108802139|ref|YP_642336.1| hypothetical protein Mmcs_5176 [My... 93.2 1e-17
gi|333992833|ref|YP_004525447.1| hypothetical protein JDM601_419... 92.4 2e-17
gi|120406698|ref|YP_956527.1| hypothetical protein Mvan_5756 [My... 90.5 7e-17
gi|118616960|ref|YP_905292.1| hypothetical protein MUL_1252 [Myc... 71.2 5e-11
gi|183983592|ref|YP_001851883.1| hypothetical protein MMAR_3613 ... 70.9 6e-11
gi|342857900|ref|ZP_08714556.1| hypothetical protein MCOL_03460 ... 69.3 2e-10
gi|240170959|ref|ZP_04749618.1| hypothetical protein MkanA1_1673... 68.2 4e-10
gi|297157986|gb|ADI07698.1| hypothetical protein SBI_04578 [Stre... 67.8 5e-10
gi|254823128|ref|ZP_05228129.1| hypothetical protein MintA_24580... 67.8 5e-10
gi|240167884|ref|ZP_04746543.1| hypothetical protein MkanA1_0111... 66.6 1e-09
gi|336460995|gb|EGO39875.1| hypothetical protein MAPs_34340 [Myc... 63.5 9e-09
gi|269836269|ref|YP_003318497.1| hypothetical protein Sthe_0236 ... 63.2 1e-08
gi|258651339|ref|YP_003200495.1| hypothetical protein Namu_1099 ... 63.2 1e-08
gi|118466415|ref|YP_880895.1| hypothetical protein MAV_1662 [Myc... 60.1 1e-07
gi|41408419|ref|NP_961255.1| hypothetical protein MAP2321 [Mycob... 60.1 1e-07
gi|254774489|ref|ZP_05216005.1| hypothetical protein MaviaA2_074... 60.1 1e-07
gi|302534876|ref|ZP_07287218.1| predicted protein [Streptomyces ... 59.3 2e-07
gi|297190749|ref|ZP_06908147.1| predicted protein [Streptomyces ... 58.5 3e-07
gi|15608492|ref|NP_215868.1| hypothetical protein Rv1352 [Mycoba... 55.8 2e-06
gi|269794246|ref|YP_003313701.1| hypothetical protein Sked_09170... 54.3 6e-06
gi|118472896|ref|YP_885966.1| hypothetical protein MSMEG_1588 [M... 53.1 1e-05
gi|183983957|ref|YP_001852248.1| hypothetical protein MMAR_3985 ... 52.4 2e-05
gi|118619072|ref|YP_907404.1| hypothetical protein MUL_3846 [Myc... 51.2 5e-05
gi|342857983|ref|ZP_08714639.1| hypothetical protein MCOL_03875 ... 51.2 5e-05
gi|183983999|ref|YP_001852290.1| hypothetical protein MMAR_4028 ... 50.4 8e-05
gi|260907602|ref|ZP_05915924.1| hypothetical protein BlinB_19857... 48.9 3e-04
gi|334336394|ref|YP_004541546.1| hypothetical protein Isova_0868... 48.5 3e-04
gi|118619107|ref|YP_907439.1| hypothetical protein MUL_3893 [Myc... 48.1 4e-04
gi|145225497|ref|YP_001136175.1| hypothetical protein Mflv_4921 ... 47.0 9e-04
gi|120402508|ref|YP_952337.1| hypothetical protein Mvan_1499 [My... 47.0 0.001
gi|254819192|ref|ZP_05224193.1| hypothetical protein MintA_04659... 44.7 0.005
gi|261736091|ref|YP_003257433.1| hypothetical protein pZL12.69 [... 44.3 0.005
gi|41408501|ref|NP_961337.1| hypothetical protein MAP2403c [Myco... 44.3 0.006
>gi|15841377|ref|NP_336414.1| hypothetical protein MT1957 [Mycobacterium tuberculosis CDC1551]
gi|167970394|ref|ZP_02552671.1| hypothetical protein MtubH3_21128 [Mycobacterium tuberculosis
H37Ra]
gi|254232082|ref|ZP_04925409.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
27 more sequence titles
Length=157
Score = 308 bits (790), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 156/156 (100%), Positives = 156/156 (100%), Gaps = 0/156 (0%)
Query 1 MRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGT 60
MRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGT
Sbjct 2 MRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGT 61
Query 61 YSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEG 120
YSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEG
Sbjct 62 YSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEG 121
Query 121 AAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVPQP 156
AAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVPQP
Sbjct 122 AAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVPQP 157
>gi|15609043|ref|NP_216422.1| hypothetical protein Rv1906c [Mycobacterium tuberculosis H37Rv]
gi|31793099|ref|NP_855592.1| hypothetical protein Mb1941c [Mycobacterium bovis AF2122/97]
gi|121637812|ref|YP_978035.1| hypothetical protein BCG_1945c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
48 more sequence titles
Length=156
Score = 308 bits (789), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 156/156 (100%), Positives = 156/156 (100%), Gaps = 0/156 (0%)
Query 1 MRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGT 60
MRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGT
Sbjct 1 MRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGT 60
Query 61 YSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEG 120
YSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEG
Sbjct 61 YSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEG 120
Query 121 AAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVPQP 156
AAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVPQP
Sbjct 121 AAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVPQP 156
>gi|339294841|gb|AEJ46952.1| hypothetical protein CCDC5079_1762 [Mycobacterium tuberculosis
CCDC5079]
gi|339298466|gb|AEJ50576.1| hypothetical protein CCDC5180_1739 [Mycobacterium tuberculosis
CCDC5180]
Length=130
Score = 258 bits (660), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 129/130 (99%), Positives = 130/130 (100%), Gaps = 0/130 (0%)
Query 27 VGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALID 86
+GLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALID
Sbjct 1 MGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALID 60
Query 87 NALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLL 146
NALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLL
Sbjct 61 NALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLL 120
Query 147 GPTGGRVPQP 156
GPTGGRVPQP
Sbjct 121 GPTGGRVPQP 130
>gi|240172583|ref|ZP_04751242.1| hypothetical protein MkanA1_24935 [Mycobacterium kansasii ATCC
12478]
Length=154
Score = 213 bits (543), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 119/150 (80%), Positives = 127/150 (85%), Gaps = 3/150 (2%)
Query 8 SPAAAFAVAGLILAGWAGSVGLAGADPEPAPTP--KTAIDSDGTYAVGIDIAPGTYSSAG 65
+PAA F VAGLILAGWAGS G+A ADP+PAP KT ID DGTY VG DIAPGTYSSAG
Sbjct 5 APAARFLVAGLILAGWAGSAGVAAADPQPAPPSAPKTTIDHDGTYTVGTDIAPGTYSSAG 64
Query 66 PVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEGAAPAG 125
PVG+GTCYWKR+GNP+GAL+DNALSKKPQVV IEPTDKAF+T GCQPWQ TG EG APA
Sbjct 65 PVGNGTCYWKRLGNPEGALLDNALSKKPQVVQIEPTDKAFRTSGCQPWQITG-EGTAPAE 123
Query 126 VPGPEAGAQLQNQLGILNGLLGPTGGRVPQ 155
VPGP AGAQLQ L ILNGLLGPTG RVPQ
Sbjct 124 VPGPAAGAQLQTDLAILNGLLGPTGQRVPQ 153
>gi|15828088|ref|NP_302351.1| lipoprotein [Mycobacterium leprae TN]
gi|221230565|ref|YP_002503981.1| putative lipoprotein [Mycobacterium leprae Br4923]
gi|2276321|emb|CAA04160.1| putative lipoprotein [Mycobacterium leprae]
gi|4582356|emb|CAB40304.1| hypothetical protein MLCL672.01 [Mycobacterium leprae]
gi|13093642|emb|CAC30965.1| putative lipoprotein [Mycobacterium leprae]
gi|219933672|emb|CAR72107.1| putative lipoprotein [Mycobacterium leprae Br4923]
Length=153
Score = 184 bits (466), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 98/155 (64%), Positives = 114/155 (74%), Gaps = 3/155 (1%)
Query 1 MRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGT 60
M L AP+ AA F VA +I A A S G+AGADP+ PKT ID DGTYAVG DIAPGT
Sbjct 1 MWLTAAPA-AARFVVASVIAASCAASTGVAGADPQSPSAPKTTIDHDGTYAVGTDIAPGT 59
Query 61 YSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTGSEG 120
YSSAGPVG+GTCYWKR+ NPDG IDNA+SKKP++V IE ++KAFKT GCQPWQ T S
Sbjct 60 YSSAGPVGNGTCYWKRIDNPDGP-IDNAMSKKPKIVQIEASNKAFKTTGCQPWQQT-SNT 117
Query 121 AAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVPQ 155
+PGP AG QL++ LGILNGLL G +VP+
Sbjct 118 TVSTDLPGPIAGIQLESNLGILNGLLASNGQQVPR 152
>gi|183982808|ref|YP_001851099.1| hypothetical protein MMAR_2802 [Mycobacterium marinum M]
gi|183176134|gb|ACC41244.1| conserved membrane protein [Mycobacterium marinum M]
Length=172
Score = 178 bits (452), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 87/114 (77%), Positives = 95/114 (84%), Gaps = 1/114 (0%)
Query 41 KTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEP 100
KT +D DGTYAVG DIAPGTY+SAGPVGDGTCYWKRM NPDGALIDNA++KKPQVV IEP
Sbjct 58 KTTMDHDGTYAVGTDIAPGTYTSAGPVGDGTCYWKRMSNPDGALIDNAMTKKPQVVQIEP 117
Query 101 TDKAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVP 154
TDKAFKT GCQPWQ+T + PA +PGP AG QLQ LGILNGLL P G +VP
Sbjct 118 TDKAFKTSGCQPWQST-PDADVPAELPGPAAGGQLQMNLGILNGLLAPNGQQVP 170
>gi|118618336|ref|YP_906668.1| hypothetical protein MUL_2927 [Mycobacterium ulcerans Agy99]
gi|118570446|gb|ABL05197.1| conserved membrane protein [Mycobacterium ulcerans Agy99]
Length=155
Score = 176 bits (447), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 86/114 (76%), Positives = 94/114 (83%), Gaps = 1/114 (0%)
Query 41 KTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEP 100
KT +D DGTYAVG DI PGTY+SAGPVGDGTCYWKRM NPDGALIDNA++KKPQVV IEP
Sbjct 41 KTTMDHDGTYAVGTDIVPGTYTSAGPVGDGTCYWKRMSNPDGALIDNAMTKKPQVVQIEP 100
Query 101 TDKAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVP 154
TDKAFKT GCQPWQ+T + PA +PGP AG QLQ LGILNGLL P G +VP
Sbjct 101 TDKAFKTSGCQPWQST-PDADVPAELPGPAAGGQLQMNLGILNGLLAPNGQQVP 153
>gi|342859558|ref|ZP_08716211.1| hypothetical protein MCOL_11783 [Mycobacterium colombiense CECT
3035]
gi|342132690|gb|EGT85910.1| hypothetical protein MCOL_11783 [Mycobacterium colombiense CECT
3035]
Length=209
Score = 139 bits (349), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/111 (68%), Positives = 80/111 (73%), Gaps = 11/111 (9%)
Query 40 PKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIE 99
PKT+ID DG YAVG DI PG YSS GPV +GTCYWKRM NPDGALIDNA+SKKPQVV I+
Sbjct 56 PKTSIDKDGIYAVGTDIVPGIYSSGGPVDNGTCYWKRMSNPDGALIDNAMSKKPQVVQID 115
Query 100 PTDKAFKTHGCQPWQNTGSEGAAPAGVP---GPEAGAQLQNQLGILNGLLG 147
DKAFKT GCQPWQ T P VP P AG +Q LGILN LLG
Sbjct 116 AGDKAFKTSGCQPWQLT------PDAVPPSTAPPAG--VQGTLGILNSLLG 158
>gi|41407728|ref|NP_960564.1| hypothetical protein MAP1630c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41396081|gb|AAS03947.1| hypothetical protein MAP_1630c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=208
Score = 135 bits (339), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 73/135 (55%), Positives = 82/135 (61%), Gaps = 35/135 (25%)
Query 17 GLILAGWAGSVGLAGADPEPAP-----------------------------------TPK 41
GL+ AGWA S +AGADP+ PK
Sbjct 14 GLMAAGWAASTVVAGADPDETHGPAPGPAATPSPAPSTSPSPAASPSPSASPAPAPAAPK 73
Query 42 TAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPT 101
T ID DG YAVG DIAPG YSS GPVG+GTCYWKRMGNPDGA+IDNA++KKPQVV I+P
Sbjct 74 TTIDKDGIYAVGTDIAPGVYSSGGPVGNGTCYWKRMGNPDGAVIDNAMTKKPQVVQIDPG 133
Query 102 DKAFKTHGCQPWQNT 116
DK+FKT GCQPWQ T
Sbjct 134 DKSFKTSGCQPWQLT 148
>gi|118464436|ref|YP_881982.1| hypothetical protein MAV_2795 [Mycobacterium avium 104]
gi|118165723|gb|ABK66620.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=193
Score = 131 bits (329), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 60/76 (79%), Positives = 66/76 (87%), Gaps = 0/76 (0%)
Query 41 KTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEP 100
KT ID DG YAVG DIAPG YSS GPVG+GTCYWKRMGNPDGA+IDNA++KKPQVV I+P
Sbjct 58 KTTIDKDGIYAVGTDIAPGVYSSGGPVGNGTCYWKRMGNPDGAVIDNAMTKKPQVVQIDP 117
Query 101 TDKAFKTHGCQPWQNT 116
DK+FKT GCQPWQ T
Sbjct 118 GDKSFKTSGCQPWQLT 133
>gi|336457388|gb|EGO36397.1| hypothetical protein MAPs_22530 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=193
Score = 131 bits (329), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 60/76 (79%), Positives = 66/76 (87%), Gaps = 0/76 (0%)
Query 41 KTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEP 100
KT ID DG YAVG DIAPG YSS GPVG+GTCYWKRMGNPDGA+IDNA++KKPQVV I+P
Sbjct 58 KTTIDKDGIYAVGTDIAPGVYSSGGPVGNGTCYWKRMGNPDGAVIDNAMTKKPQVVQIDP 117
Query 101 TDKAFKTHGCQPWQNT 116
DK+FKT GCQPWQ T
Sbjct 118 GDKSFKTSGCQPWQLT 133
>gi|296164931|ref|ZP_06847487.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295899773|gb|EFG79223.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=191
Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/148 (50%), Positives = 82/148 (56%), Gaps = 44/148 (29%)
Query 13 FAVAGLILAGWAGSVGLAGADPEPAPTP-------------------------------- 40
AVAGL++AG A S G+AGADP+ P P
Sbjct 1 MAVAGLMIAGLAASAGVAGADPDQTPAPSPSPAASSRGPAPGPPPSSPAPSPAASSAPAP 60
Query 41 ------------KTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNA 88
KT ID DG Y VG DI PG YSS GP +G CYWKRMGNPDG +IDN+
Sbjct 61 AASPPASPAPGPKTTIDKDGIYNVGSDIVPGIYSSGGPTDNGACYWKRMGNPDGNIIDNS 120
Query 89 LSKKPQVVTIEPTDKAFKTHGCQPWQNT 116
+SKKPQVV I+PTDKAFKT GCQPWQ T
Sbjct 121 MSKKPQVVQIDPTDKAFKTSGCQPWQLT 148
>gi|254775275|ref|ZP_05216791.1| hypothetical protein MaviaA2_11476 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=125
Score = 113 bits (283), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 51/65 (79%), Positives = 58/65 (90%), Gaps = 0/65 (0%)
Query 52 VGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTDKAFKTHGCQ 111
+G DIAPG YSS GPVG+GTCYWKRMGNPDGA+IDNA++KKPQVV I+P DK+FKT GCQ
Sbjct 1 MGTDIAPGVYSSGGPVGNGTCYWKRMGNPDGAVIDNAMTKKPQVVQIDPGDKSFKTSGCQ 60
Query 112 PWQNT 116
PWQ T
Sbjct 61 PWQLT 65
>gi|254821873|ref|ZP_05226874.1| hypothetical protein MintA_18207 [Mycobacterium intracellulare
ATCC 13950]
Length=124
Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/65 (77%), Positives = 55/65 (85%), Gaps = 0/65 (0%)
Query 52 VGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEPTDKAFKTHGCQ 111
+G DI PG YSS GPVG+GTCYWKR GNPDGALIDNA++KK QVV I+P DKAFKT GCQ
Sbjct 1 MGTDIVPGIYSSGGPVGNGTCYWKRTGNPDGALIDNAMTKKAQVVQIDPGDKAFKTSGCQ 60
Query 112 PWQNT 116
PWQ T
Sbjct 61 PWQLT 65
>gi|118471492|ref|YP_887795.1| hypothetical protein MSMEG_3491 [Mycobacterium smegmatis str.
MC2 155]
gi|118172779|gb|ABK73675.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=166
Score = 103 bits (258), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 61/123 (50%), Positives = 72/123 (59%), Gaps = 9/123 (7%)
Query 43 AIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGA-LIDNALSKKPQVVTIEPT 101
+D DG Y VG DI PGTY++ GP TCYWKRM + G +IDNA +KKP VV IE T
Sbjct 44 VMDHDGIYVVGTDIQPGTYATGGPADGQTCYWKRMADLHGGDIIDNAFTKKPDVVQIEAT 103
Query 102 DKAFKTHGCQPWQNTGSEGA-------APAGVPGPEAGAQLQNQLG-ILNGLLGPTGGRV 153
DKAFKT GCQPW T + AP +PG A A+L+ L I NG G +V
Sbjct 104 DKAFKTSGCQPWTQTTAAPTDTPTGEGAPGDLPGLLAKAKLRTWLDTINNGARQYDGSQV 163
Query 154 PQP 156
P P
Sbjct 164 PLP 166
>gi|145221660|ref|YP_001132338.1| hypothetical protein Mflv_1068 [Mycobacterium gilvum PYR-GCK]
gi|315446602|ref|YP_004079481.1| hypothetical protein Mspyr1_51200 [Mycobacterium sp. Spyr1]
gi|145214146|gb|ABP43550.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315264905|gb|ADU01647.1| hypothetical protein Mspyr1_51200 [Mycobacterium sp. Spyr1]
Length=145
Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/95 (54%), Positives = 60/95 (64%), Gaps = 7/95 (7%)
Query 29 LAGADPEPAPTP-----KTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGA 83
+A A PEP P P KT ID+ G+YAVG +I PG Y SAGPV DG CYWKR+ +
Sbjct 23 VASAQPEPTPAPDAEGPKTTIDASGSYAVGTEILPGVYQSAGPVDDGACYWKRVAGEE-- 80
Query 84 LIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTGS 118
L+DNAL+KKP V I PTD F T CQ W T +
Sbjct 81 LVDNALTKKPAFVQIMPTDTTFTTSDCQAWTLTNA 115
>gi|126438120|ref|YP_001073811.1| hypothetical protein Mjls_5557 [Mycobacterium sp. JLS]
gi|126237920|gb|ABO01321.1| hypothetical protein Mjls_5557 [Mycobacterium sp. JLS]
Length=148
Score = 94.7 bits (234), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 58/114 (51%), Positives = 71/114 (63%), Gaps = 7/114 (6%)
Query 42 TAIDSDGTYAVGIDIAPGTYSSAGPVGDG-TCYWKRMGNPDGALIDNALSKKPQVVTIEP 100
+ ID DGTY VG+DI PGTY +AG GDG TCYWKR+G DG L+DNAL+KK V IE
Sbjct 41 STIDKDGTYKVGVDILPGTY-NAGASGDGMTCYWKRVG-ADGKLLDNALTKKAASVRIEA 98
Query 101 TDKAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVP 154
TD +F T+ CQ WQ + G A P P ++ QLG G+LG G +P
Sbjct 99 TDASFTTNDCQTWQ-LAACGTACPPPPPPPGPLEMLGQLG---GMLGRAKGPMP 148
>gi|108802139|ref|YP_642336.1| hypothetical protein Mmcs_5176 [Mycobacterium sp. MCS]
gi|119871292|ref|YP_941244.1| hypothetical protein Mkms_5265 [Mycobacterium sp. KMS]
gi|108772558|gb|ABG11280.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119697381|gb|ABL94454.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=148
Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 71/114 (63%), Gaps = 7/114 (6%)
Query 42 TAIDSDGTYAVGIDIAPGTYSSAGPVGDG-TCYWKRMGNPDGALIDNALSKKPQVVTIEP 100
+ I+ DGTY VG+DI PGTY+ AG GDG TCYWKR+G DG L+DNAL+KK V IE
Sbjct 41 STIEKDGTYKVGVDILPGTYN-AGASGDGMTCYWKRVG-ADGKLLDNALTKKAASVRIEA 98
Query 101 TDKAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLLGPTGGRVP 154
TD +F T+ CQ WQ + G A P P ++ QLG G+LG G +P
Sbjct 99 TDASFTTNDCQTWQ-LAACGTACPPPPPPPGPLEMLGQLG---GMLGRAKGPMP 148
>gi|333992833|ref|YP_004525447.1| hypothetical protein JDM601_4193 [Mycobacterium sp. JDM601]
gi|333488801|gb|AEF38193.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=147
Score = 92.4 bits (228), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/106 (47%), Positives = 63/106 (60%), Gaps = 8/106 (7%)
Query 41 KTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEP 100
K ID+DGT+ VG DIAPGTY S GP CYW+R N ++D+A++KKPQVV IEP
Sbjct 35 KQVIDTDGTFTVGKDIAPGTYRSDGPREGNACYWRR--NSGDKIVDSAMTKKPQVVLIEP 92
Query 101 TDKAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLL 146
+D F+T C PWQ T P P P+ A L + +L G +
Sbjct 93 SDTTFRTDRCAPWQLT---ECPPTCAPVPQQSAGLPD---VLKGFV 132
>gi|120406698|ref|YP_956527.1| hypothetical protein Mvan_5756 [Mycobacterium vanbaalenii PYR-1]
gi|119959516|gb|ABM16521.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=151
Score = 90.5 bits (223), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 42/78 (54%), Positives = 51/78 (66%), Gaps = 2/78 (2%)
Query 41 KTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIEP 100
KT ID++G+YAVG +IAPG Y SAGP+ G CYWKR + +DN L+KKP V I P
Sbjct 47 KTTIDANGSYAVGTEIAPGVYQSAGPIDGGACYWKRTAGEE--TVDNRLTKKPSFVQIMP 104
Query 101 TDKAFKTHGCQPWQNTGS 118
TD F T CQ WQ T +
Sbjct 105 TDTTFTTSDCQTWQLTNA 122
>gi|118616960|ref|YP_905292.1| hypothetical protein MUL_1252 [Mycobacterium ulcerans Agy99]
gi|118569070|gb|ABL03821.1| conserved hypothetical secreted protein [Mycobacterium ulcerans
Agy99]
Length=257
Score = 71.2 bits (173), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 39/77 (51%), Positives = 46/77 (60%), Gaps = 6/77 (7%)
Query 40 PKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGAL---IDNALSKKPQVV 96
P + I DGT+ VG+DI GTY SAG GD +CYW+R+ G L I N PQVV
Sbjct 73 PASGIPGDGTFRVGVDIKAGTYRSAG--GD-SCYWERLRGVGGTLDEVIANGAGTGPQVV 129
Query 97 TIEPTDKAFKTHGCQPW 113
I P+D AFKT C PW
Sbjct 130 QIRPSDAAFKTQHCPPW 146
>gi|183983592|ref|YP_001851883.1| hypothetical protein MMAR_3613 [Mycobacterium marinum M]
gi|183176918|gb|ACC42028.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=260
Score = 70.9 bits (172), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 39/77 (51%), Positives = 46/77 (60%), Gaps = 6/77 (7%)
Query 40 PKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGAL---IDNALSKKPQVV 96
P + I DGT+ VG+DI GTY SAG GD +CYW+R+ G L I N PQVV
Sbjct 76 PASGIPGDGTFRVGVDIKAGTYRSAG--GD-SCYWERLRGVGGTLDEVIANGAGTGPQVV 132
Query 97 TIEPTDKAFKTHGCQPW 113
I P+D AFKT C PW
Sbjct 133 QIRPSDAAFKTQHCPPW 149
>gi|342857900|ref|ZP_08714556.1| hypothetical protein MCOL_03460 [Mycobacterium colombiense CECT
3035]
gi|342135233|gb|EGT88399.1| hypothetical protein MCOL_03460 [Mycobacterium colombiense CECT
3035]
Length=134
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/133 (37%), Positives = 63/133 (48%), Gaps = 23/133 (17%)
Query 3 LKPAPSPAAAFAVA----GLILAGWAGSVGLAGA--DPEPAPTP--------------KT 42
++P + A FA+ L LAG GS A PA T KT
Sbjct 1 MRPGAAAVAGFALFLAGFALFLAGCGGSSHTATTVTSTLPAETKTVTKTVAPPPPPGPKT 60
Query 43 AIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPD-GALIDNALSKKPQVVTIEPT 101
+I+++GTY V DIA GTY + G G CYW R+ + + +IDN + QVV I PT
Sbjct 61 SIENNGTYIVNKDIAAGTYRTDG--GRNACYWARLRSFNTNDVIDNNVGDGQQVVQILPT 118
Query 102 DKAFKTHGCQPWQ 114
D AF T C WQ
Sbjct 119 DAAFMTRSCGKWQ 131
>gi|240170959|ref|ZP_04749618.1| hypothetical protein MkanA1_16733 [Mycobacterium kansasii ATCC
12478]
Length=259
Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 37/77 (49%), Positives = 41/77 (54%), Gaps = 6/77 (7%)
Query 40 PKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGA---LIDNALSKKPQVV 96
P I DGTY VG+DI PGTY S G CYW+R+ G +I N PQVV
Sbjct 68 PANTIPGDGTYRVGVDIRPGTYRSQ---GGNACYWERLRGLGGTVDDIIANGAGTGPQVV 124
Query 97 TIEPTDKAFKTHGCQPW 113
I P D AFKT GC W
Sbjct 125 QIAPADVAFKTQGCPAW 141
>gi|297157986|gb|ADI07698.1| hypothetical protein SBI_04578 [Streptomyces bingchenggensis
BCW-1]
Length=239
Score = 67.8 bits (164), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 39/96 (41%), Positives = 52/96 (55%), Gaps = 4/96 (4%)
Query 25 GSVGLAGADPEPAPTPKTAIDSDG-TYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNP--- 80
G G A + + TPK+ +D +G Y +G DIAPGTY S G D TCYW+R +
Sbjct 143 GCHGWAAVPAKASGTPKSEVDGNGGMYRIGADIAPGTYKSTGNANDATCYWERSKDALHG 202
Query 81 DGALIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNT 116
++I N VVTI D FKT+GC+ W+ T
Sbjct 203 IDSIIANENVSGNGVVTIAAGDGYFKTNGCKNWKKT 238
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/85 (42%), Positives = 46/85 (55%), Gaps = 4/85 (4%)
Query 32 ADPEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDG---ALIDNA 88
A+ P + AI DGT+ VG D+ PGTY S DG CYW+R + G +LI N
Sbjct 64 ANKADKPKAEPAIGGDGTFQVGSDVKPGTYMSTDNT-DGMCYWERAKDSSGETDSLIAND 122
Query 89 LSKKPQVVTIEPTDKAFKTHGCQPW 113
VTI+ +DK FK++GC W
Sbjct 123 NVMGNAYVTIKASDKIFKSNGCHGW 147
>gi|254823128|ref|ZP_05228129.1| hypothetical protein MintA_24580 [Mycobacterium intracellulare
ATCC 13950]
Length=125
Score = 67.8 bits (164), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/75 (48%), Positives = 45/75 (60%), Gaps = 3/75 (4%)
Query 41 KTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPD-GALIDNALSKKPQVVTIE 99
KT+I+++GTY V DIA GTY + G G CYW R+ + D +IDN + PQVV I
Sbjct 50 KTSIEANGTYIVNKDIAAGTYRTDG--GKYGCYWARLRSFDTNDIIDNNVGDGPQVVRIM 107
Query 100 PTDKAFKTHGCQPWQ 114
PTD AF T C W
Sbjct 108 PTDTAFMTRSCGNWH 122
>gi|240167884|ref|ZP_04746543.1| hypothetical protein MkanA1_01110 [Mycobacterium kansasii ATCC
12478]
Length=121
Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/81 (45%), Positives = 46/81 (57%), Gaps = 4/81 (4%)
Query 35 EPAPTPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPD-GALIDNALSKKP 93
+P T ID DGTY VG+DI PG Y +A G TC W R+G+ D G +ID+ P
Sbjct 33 DPFGTYTGVIDRDGTYLVGVDIMPGMYRTA---GGATCSWARLGSLDTGDIIDSWKGSGP 89
Query 94 QVVTIEPTDKAFKTHGCQPWQ 114
Q + I+ +D AF T C WQ
Sbjct 90 QRIQIKESDTAFLTQDCGTWQ 110
>gi|336460995|gb|EGO39875.1| hypothetical protein MAPs_34340 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=86
Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 45/75 (60%), Gaps = 4/75 (5%)
Query 41 KTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPD-GALIDNALSKKPQVVTIE 99
KT+I+++GT+ V +IA GTY + G G CYW R+ + D +IDN + QVV I
Sbjct 12 KTSIETNGTFIVNKEIAAGTYRTDGKSG---CYWARLRSFDTNDIIDNNVGDGAQVVRIL 68
Query 100 PTDKAFKTHGCQPWQ 114
PTD AF T C W+
Sbjct 69 PTDVAFMTRSCGIWR 83
>gi|269836269|ref|YP_003318497.1| hypothetical protein Sthe_0236 [Sphaerobacter thermophilus DSM
20745]
gi|269785532|gb|ACZ37675.1| hypothetical protein Sthe_0236 [Sphaerobacter thermophilus DSM
20745]
Length=473
Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/84 (43%), Positives = 42/84 (50%), Gaps = 5/84 (5%)
Query 33 DPEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALID---NAL 89
+P P P DGTY VGIDIAPGTY + CYW+R+ G L D N
Sbjct 303 EPSEQPDPGPLSFPDGTYRVGIDIAPGTYRNTD--SSQFCYWERLSGFTGELNDIIANGF 360
Query 90 SKKPQVVTIEPTDKAFKTHGCQPW 113
S Q+VTI PTD F + C W
Sbjct 361 SDARQIVTIAPTDAGFVSRRCGRW 384
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/88 (39%), Positives = 41/88 (47%), Gaps = 5/88 (5%)
Query 33 DPEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALID---NAL 89
D P + TA DG Y VG DIAPG + ++ G CYW+R+ G D N
Sbjct 387 DDTPLTSSPTAPFGDGMYRVGRDIAPGLWRNSDSTG--FCYWERLRGFSGEFDDIIANEF 444
Query 90 SKKPQVVTIEPTDKAFKTHGCQPWQNTG 117
S PQ+V I PTD F C W G
Sbjct 445 SNAPQIVRIAPTDVGFSASRCGIWTRIG 472
>gi|258651339|ref|YP_003200495.1| hypothetical protein Namu_1099 [Nakamurella multipartita DSM
44233]
gi|258554564|gb|ACV77506.1| hypothetical protein Namu_1099 [Nakamurella multipartita DSM
44233]
Length=184
Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/80 (44%), Positives = 46/80 (58%), Gaps = 6/80 (7%)
Query 35 EPAPTPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDN-ALSKKP 93
E A P T I DG VG+D+ PGTY S +G CYW R+ G+L D + S
Sbjct 106 EAAAGPSTTIK-DGINLVGVDVQPGTYRSE----NGDCYWARLSGTSGSLNDIISNSNGA 160
Query 94 QVVTIEPTDKAFKTHGCQPW 113
VVTI+P+D+AF++ C PW
Sbjct 161 TVVTIDPSDRAFESRRCAPW 180
>gi|118466415|ref|YP_880895.1| hypothetical protein MAV_1662 [Mycobacterium avium 104]
gi|118167702|gb|ABK68599.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=141
Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/128 (37%), Positives = 61/128 (48%), Gaps = 20/128 (15%)
Query 4 KPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAP----------------TPKTAIDSD 47
+ +P AA A A L LAG GS A P PKT+I+++
Sbjct 14 RRSPGKTAAVAGAALFLAGCGGSTPTATTVISTPPAQTKTVTVTVTPPPPPGPKTSIETN 73
Query 48 GTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPD-GALIDNALSKKPQVVTIEPTDKAFK 106
GT+ V +IA GTY + G G CYW R+ + D +IDN + QVV I PTD AF
Sbjct 74 GTFIVNKEIAAGTYRTDGKSG---CYWARLRSFDTNDIIDNNVGDGAQVVRILPTDVAFM 130
Query 107 THGCQPWQ 114
T C W+
Sbjct 131 TRSCGIWR 138
>gi|41408419|ref|NP_961255.1| hypothetical protein MAP2321 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41396775|gb|AAS04638.1| hypothetical protein MAP_2321 [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=112
Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/73 (43%), Positives = 43/73 (59%), Gaps = 4/73 (5%)
Query 43 AIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPD-GALIDNALSKKPQVVTIEPT 101
+I+++GT+ V +IA GTY + G G CYW R+ + D +IDN + QVV I PT
Sbjct 40 SIETNGTFIVNKEIAAGTYRTDGKSG---CYWARLRSFDTNDIIDNNVGDGAQVVRILPT 96
Query 102 DKAFKTHGCQPWQ 114
D AF T C W+
Sbjct 97 DVAFMTRSCGIWR 109
>gi|254774489|ref|ZP_05216005.1| hypothetical protein MaviaA2_07438 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=114
Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/73 (43%), Positives = 43/73 (59%), Gaps = 4/73 (5%)
Query 43 AIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPD-GALIDNALSKKPQVVTIEPT 101
+I+++GT+ V +IA GTY + G G CYW R+ + D +IDN + QVV I PT
Sbjct 42 SIETNGTFIVNKEIAAGTYRTDGKSG---CYWARLRSFDTNDIIDNNVGDGAQVVRILPT 98
Query 102 DKAFKTHGCQPWQ 114
D AF T C W+
Sbjct 99 DAAFMTRSCGIWR 111
>gi|302534876|ref|ZP_07287218.1| predicted protein [Streptomyces sp. C]
gi|302443771|gb|EFL15587.1| predicted protein [Streptomyces sp. C]
Length=249
Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/82 (42%), Positives = 44/82 (54%), Gaps = 5/82 (6%)
Query 40 PKTAIDSDG-TYAVGIDIAPGTYSSAGP-VGDGTCYWKRMGNPDGALIDNALSKKPQ--- 94
PKT DG + VG+D+APGTY S+GP G CYW+R N + A + P
Sbjct 168 PKTEAAGDGGMFKVGLDLAPGTYKSSGPAAGSAGCYWERAKNASHDVDSIAANDNPTGPA 227
Query 95 VVTIEPTDKAFKTHGCQPWQNT 116
VVT+ D FKT GC W+ +
Sbjct 228 VVTVSAQDGYFKTTGCATWKKS 249
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/77 (41%), Positives = 43/77 (56%), Gaps = 4/77 (5%)
Query 41 KTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDG---ALIDNALSKKPQVVT 97
K A+ GT+ VG D+ PGTY + G G G CYW+R + G +++ N VT
Sbjct 82 KAAMRGSGTFQVGSDLQPGTYRTTGNKGLG-CYWERAKDASGEVESILANDNVVGTSYVT 140
Query 98 IEPTDKAFKTHGCQPWQ 114
IE DK FK++GC W+
Sbjct 141 IEAGDKLFKSNGCNDWE 157
>gi|297190749|ref|ZP_06908147.1| predicted protein [Streptomyces pristinaespiralis ATCC 25486]
gi|297150607|gb|EFH30685.1| predicted protein [Streptomyces pristinaespiralis ATCC 25486]
Length=195
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/83 (41%), Positives = 44/83 (54%), Gaps = 4/83 (4%)
Query 38 PTPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDG---ALIDNALSKKPQ 94
P P T+ DG Y VG DIA GTY +AG G CYW R+ + G A+I N + P
Sbjct 114 PGPATSFAGDGEYLVGTDIAAGTYKTAGADGSFGCYWARLKDASGEFDAIIANNNLEGPG 173
Query 95 VVTIEPTDKAFKTHGCQPWQNTG 117
VT+ + F+T C W+ TG
Sbjct 174 RVTLN-KGEYFQTQRCAQWKKTG 195
>gi|15608492|ref|NP_215868.1| hypothetical protein Rv1352 [Mycobacterium tuberculosis H37Rv]
gi|15840807|ref|NP_335844.1| hypothetical protein MT1395 [Mycobacterium tuberculosis CDC1551]
gi|31792548|ref|NP_855041.1| hypothetical protein Mb1387 [Mycobacterium bovis AF2122/97]
82 more sequence titles
Length=123
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/83 (38%), Positives = 40/83 (49%), Gaps = 14/83 (16%)
Query 47 DGTYAVGIDIAPGTYSSAGP----------VGD-GTCYWKRMGNPDGA---LIDNALSKK 92
DG + VG DIAPGTY + GP V + TC W P+ + ++D S
Sbjct 39 DGVFLVGTDIAPGTYRTEGPSNPLILVFGRVSELSTCSWSTHSAPEVSNENIVDTNTSMG 98
Query 93 PQVVTIEPTDKAFKTHGCQPWQN 115
P V I PT AF+TH C+ W
Sbjct 99 PMSVVIPPTVAAFQTHNCKLWMR 121
>gi|269794246|ref|YP_003313701.1| hypothetical protein Sked_09170 [Sanguibacter keddieii DSM 10542]
gi|269096431|gb|ACZ20867.1| hypothetical protein Sked_09170 [Sanguibacter keddieii DSM 10542]
Length=808
Score = 54.3 bits (129), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/82 (38%), Positives = 42/82 (52%), Gaps = 5/82 (6%)
Query 39 TPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDG---ALIDNALSKKPQV 95
TPKT++ DG YAVG +APG Y + G GD CY+ + + G A+ N S Q
Sbjct 729 TPKTSVTQDGVYAVGTQLAPGRYRALG-TGDF-CYYSVVSSFSGELDAIRINGFSAYTQE 786
Query 96 VTIEPTDKAFKTHGCQPWQNTG 117
V I D F++ C W+ G
Sbjct 787 VVITSRDVGFESSNCSTWRKVG 808
Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/71 (40%), Positives = 36/71 (51%), Gaps = 6/71 (8%)
Query 47 DGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGAL---IDNALSKKPQVVTIEPTDK 103
DG + VG I+PGTY S P CYW+R N L I N + ++VTI P DK
Sbjct 655 DGIFRVGSQISPGTYISKNP---DFCYWERRSNATDDLEGIIANDIGNGQRIVTISPNDK 711
Query 104 AFKTHGCQPWQ 114
F + C W+
Sbjct 712 YFSSDWCGGWK 722
>gi|118472896|ref|YP_885966.1| hypothetical protein MSMEG_1588 [Mycobacterium smegmatis str.
MC2 155]
gi|118174183|gb|ABK75079.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=111
Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/82 (36%), Positives = 37/82 (46%), Gaps = 15/82 (18%)
Query 47 DGTYAVGIDIAPGTYSSAG------------PVGDGTCYWKRMGNPDGA---LIDNALSK 91
+G + VG DIAPG Y++ G P D C W PD ++ +S
Sbjct 26 EGFFLVGTDIAPGLYTTGGSESPFAVWINDVPTQDSMCMWFTYSTPDANKDNVVATNMSV 85
Query 92 KPQVVTIEPTDKAFKTHGCQPW 113
P I T KAF+TH CQPW
Sbjct 86 GPMNANINSTVKAFETHNCQPW 107
>gi|183983957|ref|YP_001852248.1| hypothetical protein MMAR_3985 [Mycobacterium marinum M]
gi|183177283|gb|ACC42393.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=122
Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/122 (32%), Positives = 50/122 (41%), Gaps = 20/122 (16%)
Query 7 PSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAG- 65
P P A AGL+ W ++ LA P P I +G + VG DIA G Y + G
Sbjct 2 PLPRLRRAAAGLV--TWVATIALA---PPALADPIDPIPGNGFFLVGPDIATGLYHTVGS 56
Query 66 -----------PVGDGTCYWKRMGNPDG---ALIDNALSKKPQVVTIEPTDKAFKTHGCQ 111
P D C W PD +++ +S P I T KAF++ CQ
Sbjct 57 ASDWTVWINNVPTQDSMCVWFTYSTPDANKDHVVETNISVGPMYANINSTVKAFESQNCQ 116
Query 112 PW 113
PW
Sbjct 117 PW 118
>gi|118619072|ref|YP_907404.1| hypothetical protein MUL_3846 [Mycobacterium ulcerans Agy99]
gi|118571182|gb|ABL05933.1| conserved hypothetical secreted protein [Mycobacterium ulcerans
Agy99]
Length=122
Score = 51.2 bits (121), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 38/122 (32%), Positives = 50/122 (41%), Gaps = 20/122 (16%)
Query 7 PSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAG- 65
P P A AGL+ W ++ LA P P I +G + VG DIA G Y + G
Sbjct 2 PLPRLRRAAAGLV--TWVATIALA---PPALADPIDPIPGNGFFLVGPDIATGLYRTVGS 56
Query 66 -----------PVGDGTCYWKRMGNPDG---ALIDNALSKKPQVVTIEPTDKAFKTHGCQ 111
P D C W PD +++ +S P I T KAF++ CQ
Sbjct 57 ASDWTVWINNVPTQDSMCVWFTYSTPDANKDHVVETNISVGPIYANINSTVKAFESQNCQ 116
Query 112 PW 113
PW
Sbjct 117 PW 118
>gi|342857983|ref|ZP_08714639.1| hypothetical protein MCOL_03875 [Mycobacterium colombiense CECT
3035]
gi|342135316|gb|EGT88482.1| hypothetical protein MCOL_03875 [Mycobacterium colombiense CECT
3035]
Length=106
Score = 51.2 bits (121), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/98 (33%), Positives = 40/98 (41%), Gaps = 15/98 (15%)
Query 31 GADPEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAG------------PVGDGTCYWKRMG 78
A P P I +G + VG DIAPG Y +AG P D C W
Sbjct 5 AAAPTVVADPVDPIPGNGVFVVGPDIAPGLYRTAGSASTFGVWINNVPTQDSMCAWFTYS 64
Query 79 NPDG---ALIDNALSKKPQVVTIEPTDKAFKTHGCQPW 113
PD ++ +S P I T KAF++ CQPW
Sbjct 65 TPDANKDHVVQTNMSVGPMFANINTTVKAFESQNCQPW 102
>gi|183983999|ref|YP_001852290.1| hypothetical protein MMAR_4028 [Mycobacterium marinum M]
gi|183177325|gb|ACC42435.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=121
Score = 50.4 bits (119), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 37/111 (34%), Positives = 45/111 (41%), Gaps = 17/111 (15%)
Query 24 AGSVGLAGADPEPAPT---PKTAIDSDGTYAVGIDIAPGTYSSAGPVG-----------D 69
AG V A APT P I +G + VG D+ PGTY + GP
Sbjct 11 AGIVACMTACVFAAPTVADPGDQIPGNGVFLVGTDMLPGTYRTEGPANPLILVFGRVSEL 70
Query 70 GTCYWKRMGNPDGA---LIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTG 117
TC W P + +ID S P V I PT AF+T C+ W G
Sbjct 71 STCSWSTHSTPAASTDDIIDTNTSMGPMSVVIPPTVAAFQTMNCKLWMRVG 121
>gi|260907602|ref|ZP_05915924.1| hypothetical protein BlinB_19857 [Brevibacterium linens BL2]
Length=265
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/83 (34%), Positives = 39/83 (47%), Gaps = 3/83 (3%)
Query 34 PEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNAL--SK 91
E + I +DGTY VG D+ GTY V CYW+R+ + G +I+N S
Sbjct 182 KEEEARDRGEIFNDGTYRVGDDVKAGTYVVESDVPFEGCYWERL-DASGEIIENNFVNSG 240
Query 92 KPQVVTIEPTDKAFKTHGCQPWQ 114
VTI +D +F C W+
Sbjct 241 FRVEVTISSSDYSFSAERCGEWR 263
>gi|334336394|ref|YP_004541546.1| hypothetical protein Isova_0868 [Isoptericola variabilis 225]
gi|334106762|gb|AEG43652.1| hypothetical protein Isova_0868 [Isoptericola variabilis 225]
Length=224
Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/71 (40%), Positives = 38/71 (54%), Gaps = 7/71 (9%)
Query 47 DGTYAVGIDIAPGTYSSAGPVGDGTCYW--KRMGNPDGALIDNALSK--KPQVVTIEPTD 102
+GT+ VG+DI PGTY + PV +G CYW R G+ ++DN + P V E D
Sbjct 154 EGTWTVGVDIEPGTYRTTEPV-NGDCYWGIYRSGSNKSDIVDNDIVTGGYPTVTLREGQD 212
Query 103 KAFKTHGCQPW 113
F+T C W
Sbjct 213 --FQTSRCGSW 221
>gi|118619107|ref|YP_907439.1| hypothetical protein MUL_3893 [Mycobacterium ulcerans Agy99]
gi|118571217|gb|ABL05968.1| conserved hypothetical secreted protein [Mycobacterium ulcerans
Agy99]
Length=121
Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/111 (34%), Positives = 44/111 (40%), Gaps = 17/111 (15%)
Query 24 AGSVGLAGADPEPAPT---PKTAIDSDGTYAVGIDIAPGTYSSAGPVG-----------D 69
AG V A APT P I +G + VG D+ PGTY + GP
Sbjct 11 AGIVACTTACVFAAPTVADPGDQIPGNGVFLVGTDMLPGTYRTEGPANPLILVFGRVSEL 70
Query 70 GTCYWKRMGNPDGA---LIDNALSKKPQVVTIEPTDKAFKTHGCQPWQNTG 117
TC W P +ID S P V I PT AF+T C+ W G
Sbjct 71 STCSWSTPSTPAANTDDIIDTNTSMGPMSVVIPPTVAAFQTMNCKLWMRVG 121
>gi|145225497|ref|YP_001136175.1| hypothetical protein Mflv_4921 [Mycobacterium gilvum PYR-GCK]
gi|315445850|ref|YP_004078729.1| hypothetical protein Mspyr1_43380 [Mycobacterium sp. Spyr1]
gi|145217983|gb|ABP47387.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315264153|gb|ADU00895.1| hypothetical protein Mspyr1_43380 [Mycobacterium sp. Spyr1]
Length=122
Score = 47.0 bits (110), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 36/115 (32%), Positives = 45/115 (40%), Gaps = 20/115 (17%)
Query 14 AVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGTYAVGIDIAPGTYSSAG-------- 65
A AGL+ A S G P P I +G + VG DIAPG Y + G
Sbjct 9 AAAGLVAVTAAVSFG-----PPAGAEPVNPIPGNGIFLVGQDIAPGLYRTDGTASVFGVW 63
Query 66 ----PVGDGTCYWKRMGNPD---GALIDNALSKKPQVVTIEPTDKAFKTHGCQPW 113
P D C W PD ++ +S P I T KAF++ C PW
Sbjct 64 INDVPTQDSMCLWFTYSTPDTNKDNVVATNMSIGPMYANINATVKAFESRNCAPW 118
>gi|120402508|ref|YP_952337.1| hypothetical protein Mvan_1499 [Mycobacterium vanbaalenii PYR-1]
gi|119955326|gb|ABM12331.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=122
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 43/100 (43%), Gaps = 23/100 (23%)
Query 30 AGADP-EPAPTPKTAIDSDGTYAVGIDIAPGTYSSAG------------PVGDGTCYWKR 76
AGADP +P P +G + VG DIAPG Y + G P D C W
Sbjct 26 AGADPMDPIP-------GNGFFLVGPDIAPGLYRTGGSASPFGVWINDVPTQDSMCAWFT 78
Query 77 MGNPD---GALIDNALSKKPQVVTIEPTDKAFKTHGCQPW 113
PD ++ +S P I T +AF++ CQPW
Sbjct 79 YSTPDTNKDHVVATNMSIGPMFANINSTVQAFESRNCQPW 118
>gi|254819192|ref|ZP_05224193.1| hypothetical protein MintA_04659 [Mycobacterium intracellulare
ATCC 13950]
Length=113
Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/82 (32%), Positives = 33/82 (41%), Gaps = 15/82 (18%)
Query 47 DGTYAVGIDIAPGTYSSAG------------PVGDGTCYWKRMGNPDG---ALIDNALSK 91
+G + VG DIAPG Y + G P D C W PD ++ S
Sbjct 28 NGVFVVGPDIAPGLYHTGGSGSAFGVWINNVPTQDSMCSWFTYSTPDANKDHVLQTNTSI 87
Query 92 KPQVVTIEPTDKAFKTHGCQPW 113
P I KAF++ CQPW
Sbjct 88 GPMYANINSAVKAFESQNCQPW 109
>gi|261736091|ref|YP_003257433.1| hypothetical protein pZL12.69 [Streptomyces sp. ZL12]
gi|261349214|gb|ACX71146.1| hypothetical protein pZL12.69 [Streptomyces sp. ZL12]
Length=252
Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/66 (40%), Positives = 35/66 (54%), Gaps = 5/66 (7%)
Query 54 IDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDN--ALSKKPQVVTIEPTDKAFKTHGCQ 111
+ I PGTY ++G V D CYW+R G +IDN A S + VTI +D F + C
Sbjct 190 VTIPPGTYRASGRVED--CYWERTSE-GGDIIDNNFATSARSITVTIASSDGQFTSERCA 246
Query 112 PWQNTG 117
W+ G
Sbjct 247 VWKPVG 252
>gi|41408501|ref|NP_961337.1| hypothetical protein MAP2403c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41396857|gb|AAS04720.1| hypothetical protein MAP_2403c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336461075|gb|EGO39955.1| hypothetical protein MAPs_35170 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=120
Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/91 (31%), Positives = 35/91 (39%), Gaps = 15/91 (16%)
Query 38 PTPKTAIDSDGTYAVGIDIAPGTYSSAG------------PVGDGTCYWKRMGNPDG--- 82
P P I +G + VG DIAPG Y + G P D C W D
Sbjct 26 PDPMDPIPGNGVFVVGPDIAPGLYHTGGSGSAFGVWINNVPTQDSMCSWFTYSTADANKE 85
Query 83 ALIDNALSKKPQVVTIEPTDKAFKTHGCQPW 113
++ S P I KAF++ CQPW
Sbjct 86 HVLQTNTSIGPMYANINSAVKAFESQNCQPW 116
Lambda K H
0.312 0.135 0.422
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130065254832
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40