BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1870c
Length=211
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609007|ref|NP_216386.1| hypothetical protein Rv1870c [Mycob... 421 4e-116
gi|31793060|ref|NP_855553.1| hypothetical protein Mb1901c [Mycob... 421 4e-116
gi|148823081|ref|YP_001287835.1| hypothetical protein TBFG_11898... 418 3e-115
gi|289443347|ref|ZP_06433091.1| conserved hypothetical protein [... 407 4e-112
gi|167970352|ref|ZP_02552629.1| hypothetical protein MtubH3_2091... 407 4e-112
gi|289753964|ref|ZP_06513342.1| conserved hypothetical protein [... 406 9e-112
gi|339294806|gb|AEJ46917.1| hypothetical protein CCDC5079_1727 [... 399 1e-109
gi|342861226|ref|ZP_08717874.1| hypothetical protein MCOL_20181 ... 298 3e-79
gi|240170506|ref|ZP_04749165.1| hypothetical protein MkanA1_1443... 291 5e-77
gi|296164869|ref|ZP_06847425.1| endonuclease III family protein ... 285 4e-75
gi|41407679|ref|NP_960515.1| hypothetical protein MAP1581c [Myco... 284 7e-75
gi|336457518|gb|EGO36524.1| hypothetical protein MAPs_21900 [Myc... 281 5e-74
gi|254775323|ref|ZP_05216839.1| hypothetical protein MaviaA2_117... 280 1e-73
gi|118463692|ref|YP_882033.1| hypothetical protein MAV_2847 [Myc... 277 6e-73
gi|254819321|ref|ZP_05224322.1| hypothetical protein MintA_05318... 275 4e-72
gi|333990597|ref|YP_004523211.1| hypothetical protein JDM601_195... 255 3e-66
gi|119868853|ref|YP_938805.1| hypothetical protein Mkms_2821 [My... 252 2e-65
gi|108799743|ref|YP_639940.1| hypothetical protein Mmcs_2777 [My... 251 4e-65
gi|126435384|ref|YP_001071075.1| hypothetical protein Mjls_2804 ... 251 5e-65
gi|315444255|ref|YP_004077134.1| hypothetical protein Mspyr1_266... 250 1e-64
gi|145223923|ref|YP_001134601.1| hypothetical protein Mflv_3337 ... 236 1e-60
gi|226359642|ref|YP_002777420.1| hypothetical protein ROP_02280 ... 194 5e-48
gi|343928029|ref|ZP_08767494.1| hypothetical protein GOALK_100_0... 194 9e-48
gi|312139889|ref|YP_004007225.1| hypothetical protein REQ_25050 ... 188 5e-46
gi|325674232|ref|ZP_08153921.1| endonuclease III family protein ... 188 6e-46
gi|328882550|emb|CCA55789.1| hypothetical protein SVEN_2503 [Str... 173 2e-41
gi|256393971|ref|YP_003115535.1| hypothetical protein Caci_4833 ... 172 4e-41
gi|326333016|ref|ZP_08199272.1| hypothetical protein NBCG_04456 ... 171 8e-41
gi|291299531|ref|YP_003510809.1| hypothetical protein Snas_2021 ... 170 1e-40
gi|297563725|ref|YP_003682699.1| endonuclease III-like protein [... 164 6e-39
gi|159038109|ref|YP_001537362.1| hypothetical protein Sare_2528 ... 163 2e-38
gi|145594918|ref|YP_001159215.1| hypothetical protein Strop_2390... 160 1e-37
gi|117165263|emb|CAJ88824.1| putative endonuclease III-like prot... 159 2e-37
gi|302526875|ref|ZP_07279217.1| conserved hypothetical protein [... 159 2e-37
gi|29828148|ref|NP_822782.1| endonuclease III-like protein [Stre... 159 3e-37
gi|302541039|ref|ZP_07293381.1| conserved hypothetical protein [... 154 6e-36
gi|302562117|ref|ZP_07314459.1| conserved hypothetical protein [... 154 1e-35
gi|269125834|ref|YP_003299204.1| hypothetical protein Tcur_1590 ... 152 3e-35
gi|320006885|gb|ADW01735.1| hypothetical protein Sfla_0267 [Stre... 152 4e-35
gi|345010736|ref|YP_004813090.1| endonuclease III-like protein [... 152 4e-35
gi|337769409|emb|CCB78122.1| conserved protein of unknown functi... 150 8e-35
gi|134096776|ref|YP_001102437.1| endonuclease III-like protein [... 148 4e-34
gi|291435768|ref|ZP_06575158.1| conserved hypothetical protein [... 148 4e-34
gi|302555579|ref|ZP_07307921.1| conserved hypothetical protein [... 147 1e-33
gi|297196328|ref|ZP_06913726.1| conserved hypothetical protein [... 146 2e-33
gi|345003861|ref|YP_004806715.1| hypothetical protein SACTE_6405... 145 3e-33
gi|257067724|ref|YP_003153979.1| hypothetical protein Bfae_05210... 143 1e-32
gi|302867699|ref|YP_003836336.1| hypothetical protein Micau_3231... 142 2e-32
gi|330468131|ref|YP_004405874.1| hypothetical protein VAB18032_2... 138 6e-31
gi|182434279|ref|YP_001821998.1| hypothetical protein SGR_486 [S... 137 9e-31
>gi|15609007|ref|NP_216386.1| hypothetical protein Rv1870c [Mycobacterium tuberculosis H37Rv]
gi|15841339|ref|NP_336376.1| hypothetical protein MT1919 [Mycobacterium tuberculosis CDC1551]
gi|148661676|ref|YP_001283199.1| hypothetical protein MRA_1881 [Mycobacterium tuberculosis H37Ra]
26 more sequence titles
Length=211
Score = 421 bits (1081), Expect = 4e-116, Method: Compositional matrix adjust.
Identities = 210/211 (99%), Positives = 211/211 (100%), Gaps = 0/211 (0%)
Query 1 LPPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIG 60
+PPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIG
Sbjct 1 MPPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIG 60
Query 61 AATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDE 120
AATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDE
Sbjct 61 AATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDE 120
Query 121 YSGDLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAA 180
YSGDLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAA
Sbjct 121 YSGDLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAA 180
Query 181 KQLGLPTDPKKLASVAPSSNALLAAALVRVA 211
KQLGLPTDPKKLASVAPSSNALLAAALVRVA
Sbjct 181 KQLGLPTDPKKLASVAPSSNALLAAALVRVA 211
>gi|31793060|ref|NP_855553.1| hypothetical protein Mb1901c [Mycobacterium bovis AF2122/97]
gi|121637773|ref|YP_977996.1| hypothetical protein BCG_1906c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224990257|ref|YP_002644944.1| hypothetical protein JTY_1890 [Mycobacterium bovis BCG str. Tokyo
172]
gi|31618651|emb|CAD94604.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
gi|121493420|emb|CAL71893.1| Conserved hypothetical protein [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224773370|dbj|BAH26176.1| hypothetical protein JTY_1890 [Mycobacterium bovis BCG str. Tokyo
172]
gi|341601800|emb|CCC64474.1| conserved hypothetical protein [Mycobacterium bovis BCG str.
Moreau RDJ]
Length=222
Score = 421 bits (1081), Expect = 4e-116, Method: Compositional matrix adjust.
Identities = 210/211 (99%), Positives = 211/211 (100%), Gaps = 0/211 (0%)
Query 1 LPPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIG 60
+PPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIG
Sbjct 1 MPPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIG 60
Query 61 AATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDE 120
AATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDE
Sbjct 61 AATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDE 120
Query 121 YSGDLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAA 180
YSGDLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAA
Sbjct 121 YSGDLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAA 180
Query 181 KQLGLPTDPKKLASVAPSSNALLAAALVRVA 211
KQLGLPTDPKKLASVAPSSNALLAAALVRVA
Sbjct 181 KQLGLPTDPKKLASVAPSSNALLAAALVRVA 211
>gi|148823081|ref|YP_001287835.1| hypothetical protein TBFG_11898 [Mycobacterium tuberculosis F11]
gi|148721608|gb|ABR06233.1| conserved hypothetical protein [Mycobacterium tuberculosis F11]
Length=211
Score = 418 bits (1074), Expect = 3e-115, Method: Compositional matrix adjust.
Identities = 209/211 (99%), Positives = 210/211 (99%), Gaps = 0/211 (0%)
Query 1 LPPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIG 60
+PPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIG
Sbjct 1 MPPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIG 60
Query 61 AATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDE 120
AATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDE
Sbjct 61 AATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDE 120
Query 121 YSGDLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAA 180
YS DLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAA
Sbjct 121 YSDDLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAA 180
Query 181 KQLGLPTDPKKLASVAPSSNALLAAALVRVA 211
KQLGLPTDPKKLASVAPSSNALLAAALVRVA
Sbjct 181 KQLGLPTDPKKLASVAPSSNALLAAALVRVA 211
>gi|289443347|ref|ZP_06433091.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289447484|ref|ZP_06437228.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289569947|ref|ZP_06450174.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
7 more sequence titles
Length=215
Score = 407 bits (1047), Expect = 4e-112, Method: Compositional matrix adjust.
Identities = 204/204 (100%), Positives = 204/204 (100%), Gaps = 0/204 (0%)
Query 8 MRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARA 67
MRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARA
Sbjct 1 MRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARA 60
Query 68 ARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRE 127
ARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRE
Sbjct 61 ARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRE 120
Query 128 LAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPT 187
LAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPT
Sbjct 121 LAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPT 180
Query 188 DPKKLASVAPSSNALLAAALVRVA 211
DPKKLASVAPSSNALLAAALVRVA
Sbjct 181 DPKKLASVAPSSNALLAAALVRVA 204
>gi|167970352|ref|ZP_02552629.1| hypothetical protein MtubH3_20918 [Mycobacterium tuberculosis
H37Ra]
gi|254550881|ref|ZP_05141328.1| hypothetical protein Mtube_10551 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|294996779|ref|ZP_06802470.1| hypothetical protein Mtub2_20303 [Mycobacterium tuberculosis
210]
gi|297634431|ref|ZP_06952211.1| hypothetical protein MtubK4_09931 [Mycobacterium tuberculosis
KZN 4207]
gi|297731418|ref|ZP_06960536.1| hypothetical protein MtubKR_10031 [Mycobacterium tuberculosis
KZN R506]
gi|313658752|ref|ZP_07815632.1| hypothetical protein MtubKV_10046 [Mycobacterium tuberculosis
KZN V2475]
gi|323719611|gb|EGB28734.1| hypothetical protein TMMG_01127 [Mycobacterium tuberculosis CDC1551A]
Length=204
Score = 407 bits (1047), Expect = 4e-112, Method: Compositional matrix adjust.
Identities = 204/204 (100%), Positives = 204/204 (100%), Gaps = 0/204 (0%)
Query 8 MRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARA 67
MRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARA
Sbjct 1 MRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARA 60
Query 68 ARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRE 127
ARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRE
Sbjct 61 ARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRE 120
Query 128 LAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPT 187
LAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPT
Sbjct 121 LAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPT 180
Query 188 DPKKLASVAPSSNALLAAALVRVA 211
DPKKLASVAPSSNALLAAALVRVA
Sbjct 181 DPKKLASVAPSSNALLAAALVRVA 204
>gi|289753964|ref|ZP_06513342.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289694551|gb|EFD61980.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
Length=215
Score = 406 bits (1044), Expect = 9e-112, Method: Compositional matrix adjust.
Identities = 203/204 (99%), Positives = 204/204 (100%), Gaps = 0/204 (0%)
Query 8 MRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARA 67
MRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARA
Sbjct 1 MRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARA 60
Query 68 ARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRE 127
ARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRE
Sbjct 61 ARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRE 120
Query 128 LAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPT 187
LAQRTRPDVSAAKRMLKTFNGIGDTGADIFLR+VQDVWIWVRPYFDDRATAAAKQLGLPT
Sbjct 121 LAQRTRPDVSAAKRMLKTFNGIGDTGADIFLRQVQDVWIWVRPYFDDRATAAAKQLGLPT 180
Query 188 DPKKLASVAPSSNALLAAALVRVA 211
DPKKLASVAPSSNALLAAALVRVA
Sbjct 181 DPKKLASVAPSSNALLAAALVRVA 204
>gi|339294806|gb|AEJ46917.1| hypothetical protein CCDC5079_1727 [Mycobacterium tuberculosis
CCDC5079]
gi|339298432|gb|AEJ50542.1| hypothetical protein CCDC5180_1705 [Mycobacterium tuberculosis
CCDC5180]
Length=200
Score = 399 bits (1025), Expect = 1e-109, Method: Compositional matrix adjust.
Identities = 199/200 (99%), Positives = 200/200 (100%), Gaps = 0/200 (0%)
Query 12 VIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAAREL 71
+IKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAAREL
Sbjct 1 MIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAAREL 60
Query 72 FCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQR 131
FCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQR
Sbjct 61 FCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQR 120
Query 132 TRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKK 191
TRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKK
Sbjct 121 TRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKK 180
Query 192 LASVAPSSNALLAAALVRVA 211
LASVAPSSNALLAAALVRVA
Sbjct 181 LASVAPSSNALLAAALVRVA 200
>gi|342861226|ref|ZP_08717874.1| hypothetical protein MCOL_20181 [Mycobacterium colombiense CECT
3035]
gi|342131126|gb|EGT84407.1| hypothetical protein MCOL_20181 [Mycobacterium colombiense CECT
3035]
Length=208
Score = 298 bits (763), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 148/192 (78%), Positives = 162/192 (85%), Gaps = 0/192 (0%)
Query 20 RRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTP 79
RRLL +AGTTYAA+A I + DKPMPLF+LLVLCMLASKPI A A AARELF +GLRTP
Sbjct 6 RRLLDVAGTTYAAQARITMSDKPMPLFELLVLCMLASKPIDAGIATAAARELFKAGLRTP 65
Query 80 KAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAA 139
KAVL A RQTMI AFGRAHYVRYDESSATRL +A RVRDEYSGDLR LA+R+R D +AA
Sbjct 66 KAVLQANRQTMIDAFGRAHYVRYDESSATRLADMAERVRDEYSGDLRLLAERSRHDSAAA 125
Query 140 KRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSS 199
KRMLK F GIGDTGADI+LREVQDVW WVRP+FDDR T AK+LGLP DPKKL S+AP
Sbjct 126 KRMLKQFKGIGDTGADIYLREVQDVWTWVRPHFDDRTTGTAKRLGLPADPKKLGSLAPQD 185
Query 200 NALLAAALVRVA 211
NA LAAALVRV+
Sbjct 186 NARLAAALVRVS 197
>gi|240170506|ref|ZP_04749165.1| hypothetical protein MkanA1_14430 [Mycobacterium kansasii ATCC
12478]
Length=226
Score = 291 bits (744), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 141/180 (79%), Positives = 155/180 (87%), Gaps = 0/180 (0%)
Query 14 KPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFC 73
KPEP +RLL++AGT+YAAEAGIRI DKPMPLFQLLVLCMLASKPI A A RAARELF
Sbjct 18 KPEPRVKRLLEVAGTSYAAEAGIRIDDKPMPLFQLLVLCMLASKPIDATIAMRAARELFK 77
Query 74 SGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTR 133
+GLRTPK VL + RQTMI AFGRAHYVRYDESSATRLT +A RVRDE+ GDLRE+A+R+
Sbjct 78 AGLRTPKGVLESRRQTMIDAFGRAHYVRYDESSATRLTEMAERVRDEFRGDLREIARRSD 137
Query 134 PDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLA 193
D S AKR+LK F GIGDTGADIFLREVQDVW W RPYFDDRATA AK+LGLPTDP KL+
Sbjct 138 HDPSKAKRILKQFKGIGDTGADIFLREVQDVWTWARPYFDDRATATAKELGLPTDPAKLS 197
>gi|296164869|ref|ZP_06847425.1| endonuclease III family protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295899711|gb|EFG79161.1| endonuclease III family protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=214
Score = 285 bits (728), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 149/196 (77%), Positives = 171/196 (88%), Gaps = 0/196 (0%)
Query 16 EPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSG 75
E L RRLL +AGTTYAAEA I++ DKPMPLFQLL++CMLASKPI AA A A RELF +G
Sbjct 8 ERLVRRLLDVAGTTYAAEARIKLGDKPMPLFQLLIVCMLASKPIDAAIAMAAGRELFKAG 67
Query 76 LRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPD 135
LRTPKAVL+A+R+ MI AFGRAHYVRYDESSATRLT +A RVRDEYSGDLRELA+R+R D
Sbjct 68 LRTPKAVLAADRRAMIEAFGRAHYVRYDESSATRLTDMAERVRDEYSGDLRELAKRSRHD 127
Query 136 VSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASV 195
V+A KR+L F GIGDTGADI+LREVQDVW WVRPYFDDRATAAA+QLGLPT P+KL ++
Sbjct 128 VAATKRLLTQFKGIGDTGADIYLREVQDVWTWVRPYFDDRATAAAQQLGLPTRPEKLGAL 187
Query 196 APSSNALLAAALVRVA 211
AP +NA LAAAL+RV+
Sbjct 188 APRANARLAAALIRVS 203
>gi|41407679|ref|NP_960515.1| hypothetical protein MAP1581c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41396032|gb|AAS03898.1| hypothetical protein MAP_1581c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=222
Score = 284 bits (726), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 143/200 (72%), Positives = 161/200 (81%), Gaps = 0/200 (0%)
Query 1 LPPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIG 60
+P G R + + RRLL +AG TYAA+A I++ DKPMPLFQLLVLCMLASKPI
Sbjct 1 MPAHAGGTRTGMSNRDQRVRRLLDVAGQTYAAQARIKLSDKPMPLFQLLVLCMLASKPID 60
Query 61 AATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDE 120
AA A AA ELF +GLRTPKAVL A+RQTMI AFGRAHYVRYDESSATRLT +A RVRD+
Sbjct 61 AAIAVGAAAELFKAGLRTPKAVLDADRQTMIDAFGRAHYVRYDESSATRLTDMAERVRDD 120
Query 121 YSGDLRELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAA 180
YSGDLRELA R+ D ++AKRMLK F GIGDTGADIFLREVQDVW WVRPYFDDRAT AA
Sbjct 121 YSGDLRELAARSEHDTASAKRMLKKFKGIGDTGADIFLREVQDVWTWVRPYFDDRATGAA 180
Query 181 KQLGLPTDPKKLASVAPSSN 200
K+LGLP +P KL S+AP +N
Sbjct 181 KKLGLPAEPDKLGSLAPQAN 200
>gi|336457518|gb|EGO36524.1| hypothetical protein MAPs_21900 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=211
Score = 281 bits (718), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 140/181 (78%), Positives = 155/181 (86%), Gaps = 0/181 (0%)
Query 20 RRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTP 79
RRLL +AG TYAA+A I++ DKPMPLFQLLVLCMLASKPI AA A AA ELF +GLRTP
Sbjct 9 RRLLDVAGQTYAAQARIKLSDKPMPLFQLLVLCMLASKPIDAAIAVGAAAELFKAGLRTP 68
Query 80 KAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAA 139
KAVL A+RQTMI AFGRAHYVRYDESSATRLT +A RVRD+YSGDLRELA R+ D ++A
Sbjct 69 KAVLDADRQTMIDAFGRAHYVRYDESSATRLTDMAERVRDDYSGDLRELAARSEHDTASA 128
Query 140 KRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSS 199
KRMLK F GIGDTGADIFLREVQDVW WVRPYFDDRAT AAK+LGLP +P KL S+AP +
Sbjct 129 KRMLKKFKGIGDTGADIFLREVQDVWTWVRPYFDDRATGAAKKLGLPAEPDKLGSLAPQA 188
Query 200 N 200
N
Sbjct 189 N 189
>gi|254775323|ref|ZP_05216839.1| hypothetical protein MaviaA2_11716 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=211
Score = 280 bits (715), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 139/181 (77%), Positives = 155/181 (86%), Gaps = 0/181 (0%)
Query 20 RRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTP 79
RRLL +AG TYAA+A I++ DKPMPLFQLLVLCMLASKPI AA A AA ELF +GLRTP
Sbjct 9 RRLLDVAGQTYAAQARIKLSDKPMPLFQLLVLCMLASKPIDAAIAVGAAAELFKAGLRTP 68
Query 80 KAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAA 139
KAVL A+RQTMI AFGRAHYVRYDESSATRLT +A RVRD+YSGDLRELA R+ D ++A
Sbjct 69 KAVLDADRQTMIDAFGRAHYVRYDESSATRLTDMAERVRDDYSGDLRELAARSEHDTASA 128
Query 140 KRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSS 199
KRMLK F GIGDTGADIFLREVQDVW WV+PYFDDRAT AAK+LGLP +P KL S+AP +
Sbjct 129 KRMLKKFKGIGDTGADIFLREVQDVWTWVQPYFDDRATGAAKKLGLPAEPDKLGSLAPQA 188
Query 200 N 200
N
Sbjct 189 N 189
>gi|118463692|ref|YP_882033.1| hypothetical protein MAV_2847 [Mycobacterium avium 104]
gi|118164979|gb|ABK65876.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=211
Score = 277 bits (709), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 138/181 (77%), Positives = 154/181 (86%), Gaps = 0/181 (0%)
Query 20 RRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTP 79
RRLL +A TYAA+A I++ DKPMPLFQLLVLCMLASKPI AA A AA ELF +GLRTP
Sbjct 9 RRLLDVASQTYAAQARIKLSDKPMPLFQLLVLCMLASKPIDAAIAVGAAAELFKAGLRTP 68
Query 80 KAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAA 139
KAVL A+RQTMI AFGRAHYVRYDESSATRLT +A RVRD+YSGDLRELA R+ D ++A
Sbjct 69 KAVLDADRQTMIDAFGRAHYVRYDESSATRLTDMAERVRDDYSGDLRELAARSEHDTASA 128
Query 140 KRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSS 199
KRMLK F GIGDTGADIFLREVQDVW WV+PYFDDRAT AAK+LGLP +P KL S+AP +
Sbjct 129 KRMLKKFKGIGDTGADIFLREVQDVWTWVQPYFDDRATGAAKKLGLPAEPDKLGSLAPQA 188
Query 200 N 200
N
Sbjct 189 N 189
>gi|254819321|ref|ZP_05224322.1| hypothetical protein MintA_05318 [Mycobacterium intracellulare
ATCC 13950]
Length=208
Score = 275 bits (702), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 136/181 (76%), Positives = 155/181 (86%), Gaps = 0/181 (0%)
Query 20 RRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTP 79
RRLL +AGTTYAA+A I + DKPMPLFQLLVLCMLASKPI A A AARELF +GLRTP
Sbjct 6 RRLLDVAGTTYAAQARITLSDKPMPLFQLLVLCMLASKPIDATIATAAARELFKAGLRTP 65
Query 80 KAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAA 139
KAVL+++R+TMI AFGRAHYVRYDESSATRLT +A R+RD+YSGD+RELA R+ DV+ A
Sbjct 66 KAVLASDRKTMIDAFGRAHYVRYDESSATRLTDMAERLRDDYSGDMRELADRSGHDVATA 125
Query 140 KRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSS 199
KRMLK F GIGDTGADI+LREVQDVW WVRPYFDDRAT AK+ GLP +PKKL S+AP +
Sbjct 126 KRMLKKFKGIGDTGADIYLREVQDVWTWVRPYFDDRATGTAKRFGLPAEPKKLGSLAPQA 185
Query 200 N 200
N
Sbjct 186 N 186
>gi|333990597|ref|YP_004523211.1| hypothetical protein JDM601_1957 [Mycobacterium sp. JDM601]
gi|333486565|gb|AEF35957.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=211
Score = 255 bits (652), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 131/196 (67%), Positives = 152/196 (78%), Gaps = 0/196 (0%)
Query 16 EPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSG 75
+ L RRLL+ AG TYA EAGIR+R++PMPLFQLL LCMLASKPI AA AARA+RELF SG
Sbjct 5 KELVRRLLRRAGKTYAQEAGIRLRNQPMPLFQLLTLCMLASKPIDAAIAARASRELFRSG 64
Query 76 LRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPD 135
LRTP VL A+R+TMI A GRAHY RYDESSATRL IA V+ +Y GDLR LA+R+ +
Sbjct 65 LRTPHKVLEADRRTMIEAMGRAHYRRYDESSATRLVEIAEAVQQDYRGDLRLLAERSGRN 124
Query 136 VSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASV 195
V AA ++LK F GIGDTGA IFLREVQDVW W R FDD A AAA+ LGLP DP +L ++
Sbjct 125 VEAAVQLLKGFKGIGDTGASIFLREVQDVWPWARSTFDDLALAAARDLGLPGDPSELGAL 184
Query 196 APSSNALLAAALVRVA 211
+ NA LAAALVR +
Sbjct 185 SRGRNAELAAALVRYS 200
>gi|119868853|ref|YP_938805.1| hypothetical protein Mkms_2821 [Mycobacterium sp. KMS]
gi|119694942|gb|ABL92015.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=210
Score = 252 bits (644), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 126/180 (70%), Positives = 140/180 (78%), Gaps = 2/180 (1%)
Query 21 RLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPK 80
RL AG TYAAEAGIR+ DKPMPLFQLLVLCMLASKPI A A RAARELF GLRTP+
Sbjct 10 RLHDQAGQTYAAEAGIRLADKPMPLFQLLVLCMLASKPIDATVAMRAARELFAEGLRTPQ 69
Query 81 AVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAK 140
AVL A R+TMI AFGRA Y RYDESSATRL +AH V ++Y GDLR LA+ DV+AA+
Sbjct 70 AVLDANRKTMIDAFGRAGYARYDESSATRLVEMAHAVNNDYGGDLRRLAEAR--DVAAAR 127
Query 141 RMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSN 200
R LK F GIGDTG+DIFLREVQD W W+RPYFD+RATA A +LGLP DP KL +AP
Sbjct 128 RKLKQFKGIGDTGSDIFLREVQDTWTWLRPYFDERATATAARLGLPDDPVKLQRLAPDGT 187
>gi|108799743|ref|YP_639940.1| hypothetical protein Mmcs_2777 [Mycobacterium sp. MCS]
gi|108770162|gb|ABG08884.1| conserved hypothetical protein [Mycobacterium sp. MCS]
Length=234
Score = 251 bits (642), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 126/180 (70%), Positives = 140/180 (78%), Gaps = 2/180 (1%)
Query 21 RLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPK 80
RL AG TYAAEAGIR+ DKPMPLFQLLVLCMLASKPI A A RAARELF GLRTP+
Sbjct 34 RLHDQAGQTYAAEAGIRLADKPMPLFQLLVLCMLASKPIDATVAMRAARELFAEGLRTPQ 93
Query 81 AVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAK 140
AVL A R+TMI AFGRA Y RYDESSATRL +AH V ++Y GDLR LA+ DV+AA+
Sbjct 94 AVLDANRKTMIDAFGRAGYARYDESSATRLVEMAHAVNNDYGGDLRRLAEAR--DVAAAR 151
Query 141 RMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSN 200
R LK F GIGDTG+DIFLREVQD W W+RPYFD+RATA A +LGLP DP KL +AP
Sbjct 152 RKLKQFKGIGDTGSDIFLREVQDTWTWLRPYFDERATATAARLGLPDDPVKLQRLAPDGT 211
>gi|126435384|ref|YP_001071075.1| hypothetical protein Mjls_2804 [Mycobacterium sp. JLS]
gi|126235184|gb|ABN98584.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=210
Score = 251 bits (641), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 125/180 (70%), Positives = 139/180 (78%), Gaps = 2/180 (1%)
Query 21 RLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPK 80
RL AG TYA EAGIR+ DKPMPLFQLLVLCMLASKPI A A RAARELF GLRTP+
Sbjct 10 RLHDQAGQTYAEEAGIRLADKPMPLFQLLVLCMLASKPIDATVAMRAARELFAEGLRTPQ 69
Query 81 AVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAK 140
AVL A R+TMI AFGRA Y RYDESSATRL +AH V ++Y GDLR LA+ DV+AA+
Sbjct 70 AVLDANRKTMIDAFGRAGYARYDESSATRLVEMAHAVNNDYGGDLRRLAEAQ--DVAAAR 127
Query 141 RMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSN 200
R LK F GIGDTG+DIFLREVQD W W+RPYFD+RATA A +LGLP DP KL +AP
Sbjct 128 RKLKQFKGIGDTGSDIFLREVQDTWTWLRPYFDERATATAARLGLPDDPVKLQRLAPDGT 187
>gi|315444255|ref|YP_004077134.1| hypothetical protein Mspyr1_26680 [Mycobacterium sp. Spyr1]
gi|315262558|gb|ADT99299.1| hypothetical protein Mspyr1_26680 [Mycobacterium sp. Spyr1]
Length=210
Score = 250 bits (638), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 135/199 (68%), Positives = 149/199 (75%), Gaps = 0/199 (0%)
Query 13 IKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELF 72
+ + A RLL+ AGTTYA EAGI + ++PMPLF+LLVLCMLASKPI A A RAARE+F
Sbjct 1 MSADDTADRLLREAGTTYAEEAGITLANRPMPLFELLVLCMLASKPIDATVATRAAREIF 60
Query 73 CSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRT 132
+ LRTP AVL A R TMI AFGRA Y RYDESSATRL IA VRD+Y GDLR LA R
Sbjct 61 GAKLRTPDAVLDATRPTMIRAFGRAGYARYDESSATRLVDIAAAVRDDYGGDLRGLADRA 120
Query 133 RPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKL 192
DV AKR+LK F GIGDTGADIFLREVQDVW WVRPYFD RA AAA LGLP D +L
Sbjct 121 EQDVGKAKRLLKRFTGIGDTGADIFLREVQDVWTWVRPYFDARAMAAAADLGLPDDAAEL 180
Query 193 ASVAPSSNALLAAALVRVA 211
A +A A LAAALVRV+
Sbjct 181 ADLARDDCARLAAALVRVS 199
>gi|145223923|ref|YP_001134601.1| hypothetical protein Mflv_3337 [Mycobacterium gilvum PYR-GCK]
gi|145216409|gb|ABP45813.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=210
Score = 236 bits (602), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 133/196 (68%), Positives = 150/196 (77%), Gaps = 0/196 (0%)
Query 16 EPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSG 75
+ A RLL+ AGTTYA EAGI + ++PMPLF+LLVLCMLASKPI A A RAARE+F +
Sbjct 4 DDTADRLLREAGTTYAEEAGITLANRPMPLFELLVLCMLASKPIDATVATRAAREIFGAK 63
Query 76 LRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPD 135
LRTP AVL A R TMI AFGRA Y RYDESSATRL IA VRD+Y GDLR LA R D
Sbjct 64 LRTPDAVLDATRPTMIRAFGRAGYARYDESSATRLVDIAAAVRDDYGGDLRGLADRAEQD 123
Query 136 VSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASV 195
V+ AK++LK F GIGDTGADIFLREVQDVW WVRPY+D RA AAA LGLP D +LA++
Sbjct 124 VAKAKQLLKRFTGIGDTGADIFLREVQDVWTWVRPYYDARAMAAAADLGLPDDAAELAAL 183
Query 196 APSSNALLAAALVRVA 211
A A LAAALVRV+
Sbjct 184 ARDDCARLAAALVRVS 199
>gi|226359642|ref|YP_002777420.1| hypothetical protein ROP_02280 [Rhodococcus opacus B4]
gi|226238127|dbj|BAH48475.1| hypothetical protein [Rhodococcus opacus B4]
Length=223
Score = 194 bits (494), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 113/193 (59%), Positives = 135/193 (70%), Gaps = 1/193 (0%)
Query 20 RRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTP 79
R LL AG TYA+ + I ++D P PLFQLL L +L S I + A AARELF + LR+P
Sbjct 20 RMLLDKAGPTYASASHITLKDTPTPLFQLLTLSLLLSTRISSEIAVSAARELFRARLRSP 79
Query 80 KAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAA 139
+A+ A+R+T+ISA GR +YVRYDES+ATRL A A RV +EY GDLR LA SAA
Sbjct 80 RAMREADRRTVISALGRGNYVRYDESTATRLHAAAVRVDEEYGGDLRRLASACDHATSAA 139
Query 140 KRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSS 199
R+L F+GIG GADIFLREVQDVW WVRPYFD RA +A+ LGLP DP L +AP
Sbjct 140 VRLLTEFDGIGPVGADIFLREVQDVWSWVRPYFDTRARESARDLGLPDDPHALYDLAPPG 199
Query 200 N-ALLAAALVRVA 211
A LAAALVRV+
Sbjct 200 RVAELAAALVRVS 212
>gi|343928029|ref|ZP_08767494.1| hypothetical protein GOALK_100_00330 [Gordonia alkanivorans NBRC
16433]
gi|343762037|dbj|GAA14420.1| hypothetical protein GOALK_100_00330 [Gordonia alkanivorans NBRC
16433]
Length=214
Score = 194 bits (492), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 107/190 (57%), Positives = 132/190 (70%), Gaps = 0/190 (0%)
Query 22 LLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPKA 81
L+ AG TYA +AGI ++D P PLF+LLVL +L S I A A AARELF +G RTP+
Sbjct 13 LMARAGRTYAEDAGIALKDTPSPLFELLVLSLLLSTRISADIAVAAARELFAAGYRTPEK 72
Query 82 VLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKR 141
+ A Q+++ A GR HY RYDES+ATRL +V D+Y GDLRELA + PD AA +
Sbjct 73 MARASWQSLVDALGRGHYKRYDESTATRLGEAGRKVLDDYHGDLRELAAKAEPDSDAAAK 132
Query 142 MLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSNA 201
+L+ F GIG GADIFLREVQ VW WV P+FD+RA AAAK LGLP DP +LA +A
Sbjct 133 LLQEFTGIGPVGADIFLREVQSVWDWVAPHFDERARAAAKDLGLPDDPGRLADLADGKPE 192
Query 202 LLAAALVRVA 211
+LAAALVR +
Sbjct 193 VLAAALVRAS 202
>gi|312139889|ref|YP_004007225.1| hypothetical protein REQ_25050 [Rhodococcus equi 103S]
gi|311889228|emb|CBH48542.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=219
Score = 188 bits (477), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 111/194 (58%), Positives = 133/194 (69%), Gaps = 1/194 (0%)
Query 18 LARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLR 77
+ R LL AG T+A EA I ++D P PLFQLLVL ML S I AA A RAARELF +G R
Sbjct 10 VVRDLLDRAGRTFAEEARITLKDTPKPLFQLLVLSMLLSSRISAAIATRAARELFAAGWR 69
Query 78 TPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVS 137
TP + +A R +I+A R Y RYDES+ATRL +AHRV EY GDLRELAQR+ D
Sbjct 70 TPDTMEAAPRAEVIAALQRGRYTRYDESTATRLRKMAHRVTVEYRGDLRELAQRSDHDAG 129
Query 138 AAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAP 197
AA R+L+ F+GIG GA+IFLREVQD W W+RP+ DDRA A+ L LP D L ++A
Sbjct 130 AAARLLEEFDGIGPVGAEIFLREVQDTWTWLRPHLDDRALDGAEALHLPRDRGGLGALAG 189
Query 198 SSN-ALLAAALVRV 210
+ A LAAALVRV
Sbjct 190 TRGMAPLAAALVRV 203
>gi|325674232|ref|ZP_08153921.1| endonuclease III family protein [Rhodococcus equi ATCC 33707]
gi|325554912|gb|EGD24585.1| endonuclease III family protein [Rhodococcus equi ATCC 33707]
Length=219
Score = 188 bits (477), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 110/194 (57%), Positives = 133/194 (69%), Gaps = 1/194 (0%)
Query 18 LARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLR 77
+ R LL AG T+A EA I ++D P PLFQLLVL ML S I AA A RAARELF +G R
Sbjct 10 VVRDLLDRAGRTFAEEARITLKDTPKPLFQLLVLSMLLSSRISAAIATRAARELFAAGWR 69
Query 78 TPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVS 137
TP + +A R +I+A R Y RYDES+ATRL +AHRV EY GDLRELAQR+ D
Sbjct 70 TPDTMEAAPRAEVIAALQRGRYTRYDESTATRLRKMAHRVTAEYRGDLRELAQRSDHDAG 129
Query 138 AAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAP 197
AA R+L+ F+GIG GA+IFLRE+QD W W+RP+ DDRA A+ L LP D L ++A
Sbjct 130 AAARLLEEFDGIGPVGAEIFLREIQDTWTWLRPHLDDRALDGAEALHLPRDRGGLGALAG 189
Query 198 SSN-ALLAAALVRV 210
+ A LAAALVRV
Sbjct 190 TRGMAPLAAALVRV 203
>gi|328882550|emb|CCA55789.1| hypothetical protein SVEN_2503 [Streptomyces venezuelae ATCC
10712]
Length=214
Score = 173 bits (438), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 101/192 (53%), Positives = 122/192 (64%), Gaps = 1/192 (0%)
Query 21 RLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPK 80
RLL+ G TYA EAGI + DKP PL+QLLVL +L S I A A AARELF GLRTP
Sbjct 9 RLLREHGRTYADEAGIVLHDKPAPLYQLLVLTVLCSVRIRADIATAAARELFSDGLRTPS 68
Query 81 AVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAK 140
A+ + Q + A GRAHYVRYDES+AT L A V D Y GDLR+L + D A +
Sbjct 69 AMADSSWQQRVDALGRAHYVRYDESTATALGEGARLVLDRYRGDLRQLREEAAGDPDALR 128
Query 141 RMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLA-SVAPSS 199
+L+ IG GA IF RE Q VW +RPYFD+R+ AA +LGLP P LA V P+
Sbjct 129 GLLREVPRIGPVGAGIFCREAQAVWPELRPYFDERSLTAAARLGLPHTPAGLARHVDPAD 188
Query 200 NALLAAALVRVA 211
+ LAAAL+RV+
Sbjct 189 LSRLAAALIRVS 200
>gi|256393971|ref|YP_003115535.1| hypothetical protein Caci_4833 [Catenulispora acidiphila DSM
44928]
gi|256360197|gb|ACU73694.1| conserved hypothetical protein [Catenulispora acidiphila DSM
44928]
Length=207
Score = 172 bits (435), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 100/192 (53%), Positives = 121/192 (64%), Gaps = 1/192 (0%)
Query 19 ARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRT 78
A RLL G TYA EAGIR+RDKP PL+QLLVL L S I A A AARE+F +G RT
Sbjct 13 AERLLAAGGETYAHEAGIRLRDKPSPLYQLLVLSTLLSARITAHIAVDAAREIFTAGWRT 72
Query 79 PKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSA 138
P+AV A Q ++ A GRAHY RYDES+AT L A + D + GDLR L + D +
Sbjct 73 PRAVADASWQELVDALGRAHYRRYDESTATALHEGAELLIDRWHGDLRRLRKEAGGDPAR 132
Query 139 AKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKL-ASVAP 197
+++L+ F IG GA+IF RE Q W + P FD RA A++ GLP DP +L ASVAP
Sbjct 133 IRQLLQEFPRIGPVGAEIFCREAQGEWEELCPAFDRRALDGAEKNGLPKDPDRLAASVAP 192
Query 198 SSNALLAAALVR 209
LAAALVR
Sbjct 193 QDVPRLAAALVR 204
>gi|326333016|ref|ZP_08199272.1| hypothetical protein NBCG_04456 [Nocardioidaceae bacterium Broad-1]
gi|325949210|gb|EGD41294.1| hypothetical protein NBCG_04456 [Nocardioidaceae bacterium Broad-1]
Length=222
Score = 171 bits (432), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 100/193 (52%), Positives = 120/193 (63%), Gaps = 1/193 (0%)
Query 18 LARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLR 77
L RRLL AGTTY+ EAGIR+RDKP PL++LLVL ML+S I A A AAREL +G R
Sbjct 19 LVRRLLDAAGTTYSEEAGIRLRDKPSPLYRLLVLAMLSSTRITADIAVAAARELSAAGWR 78
Query 78 TPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVS 137
TP +L + Q + A GRAHY RYDES+AT+L A + D + GDLR L +R D
Sbjct 79 TPHRLLDSTWQQRVDALGRAHYRRYDESTATKLEEQAQWLLDTHRGDLRRLRPTSRDDAD 138
Query 138 AAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAP 197
A L + IG GA IF REVQDVW + P+ D A +LGLP DP +LAS+ P
Sbjct 139 ALIDALTSSPRIGPVGARIFCREVQDVWPALSPFLDGPLLEQAGELGLPEDPDELASLVP 198
Query 198 SSN-ALLAAALVR 209
A LAAAL R
Sbjct 199 DGRCAPLAAALTR 211
>gi|291299531|ref|YP_003510809.1| hypothetical protein Snas_2021 [Stackebrandtia nassauensis DSM
44728]
gi|290568751|gb|ADD41716.1| conserved hypothetical protein [Stackebrandtia nassauensis DSM
44728]
Length=210
Score = 170 bits (431), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 96/195 (50%), Positives = 118/195 (61%), Gaps = 2/195 (1%)
Query 18 LARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLR 77
+ R L++ GTTYA +AGI +RD P PL+QLL+L L S IG+ A AA+ELF SG R
Sbjct 7 VVRGLIERHGTTYAEQAGINLRDTPAPLYQLLMLSTLLSARIGSDIAVAAAKELFTSGYR 66
Query 78 TPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVS 137
TPKA+ A Q + A GR HY RYDE ++T L A D + GDLR L D+S
Sbjct 67 TPKAMREASWQDRVDALGRGHYRRYDEKTSTMLGDGAQLALDRWKGDLRRLHTEA-DDLS 125
Query 138 AAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLAS-VA 196
R+L+ F GIG TGA IF REVQ VW + PY D+R A +LGLPTD +LA V
Sbjct 126 DVTRLLREFPGIGSTGASIFCREVQGVWTDLAPYVDERTAKGADKLGLPTDANELAKLVK 185
Query 197 PSSNALLAAALVRVA 211
P+ L A VRVA
Sbjct 186 PADFPRLVAGCVRVA 200
>gi|297563725|ref|YP_003682699.1| endonuclease III-like protein [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
gi|296848173|gb|ADH70193.1| endonuclease III-like protein [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
Length=215
Score = 164 bits (416), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 98/195 (51%), Positives = 122/195 (63%), Gaps = 1/195 (0%)
Query 18 LARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLR 77
+AR ++ AGTT AAEAG+R+ D+P L+QLLVL L S I A A AAREL +G
Sbjct 7 VARAVVDEAGTTLAAEAGLRMADRPSALWQLLVLVNLLSARISARIAIAAARELNEAGGT 66
Query 78 TPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVS 137
TP + S Q + A GR HYVRYDES+ATRL RVR+EY GDLR L +R+ S
Sbjct 67 TPDGMASLSWQDRVDALGRGHYVRYDESTATRLGECVDRVREEYQGDLRRLGERSEHRAS 126
Query 138 AAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAP 197
+ M++ F+GIG TGA +F REVQ VW W+RPY D AK++GLP D KLA +
Sbjct 127 RVEAMVREFSGIGPTGAVMFCREVQAVWPWLRPYTDKYVLGGAKKVGLPEDGGKLAKLVE 186
Query 198 SSN-ALLAAALVRVA 211
A LAA L R+A
Sbjct 187 GDEVARLAAGLARIA 201
>gi|159038109|ref|YP_001537362.1| hypothetical protein Sare_2528 [Salinispora arenicola CNS-205]
gi|157916944|gb|ABV98371.1| conserved hypothetical protein [Salinispora arenicola CNS-205]
Length=216
Score = 163 bits (412), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/194 (51%), Positives = 124/194 (64%), Gaps = 1/194 (0%)
Query 19 ARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRT 78
A+ LL+ G TY EAGIR+ D+P PL+QLLVL L S I A A AARELF +G RT
Sbjct 8 AQALLERQGQTYTEEAGIRLADRPGPLYQLLVLATLLSTRIRAGVAVAAARELFAAGYRT 67
Query 79 PKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSA 138
P A+ +A Q + A GR HY RYDE +AT L A D + GDLR L + + +
Sbjct 68 PSAMEAASWQDRVDALGRGHYRRYDERTATILGTGARLCLDRWHGDLRRLHREAQGQPAQ 127
Query 139 AKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLAS-VAP 197
+R+L +F GIG TGADI+LREVQ VW +RPY D R + A++LGLP P++LAS VA
Sbjct 128 LRRLLTSFPGIGPTGADIYLREVQAVWPDLRPYADRRTLSGAERLGLPKTPQRLASLVAE 187
Query 198 SSNALLAAALVRVA 211
+ A+ALVRVA
Sbjct 188 AEFGRFASALVRVA 201
>gi|145594918|ref|YP_001159215.1| hypothetical protein Strop_2390 [Salinispora tropica CNB-440]
gi|145304255|gb|ABP54837.1| hypothetical protein Strop_2390 [Salinispora tropica CNB-440]
Length=286
Score = 160 bits (405), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 101/206 (50%), Positives = 124/206 (61%), Gaps = 1/206 (0%)
Query 7 GMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAAR 66
G LV AR LL+ G TY EAGIR+ D+P PL+QLLVL L S I A A
Sbjct 66 GYAPLVGDNHETARVLLERRGQTYTEEAGIRLADRPGPLYQLLVLATLLSSRIRAGVAVA 125
Query 67 AARELFCSGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLR 126
AARELF +G RTP A+ +A Q + A GR HY RYDE +AT L A D + GDLR
Sbjct 126 AARELFAAGYRTPSAMEAATWQDRVDALGRGHYRRYDERTATLLGTGARLCLDRWQGDLR 185
Query 127 ELAQRTRPDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLP 186
L + + +R+L F GIG TGADIFLRE Q VW +RP+ D R + A++LGLP
Sbjct 186 RLHHAAQGRPTQLRRLLTAFPGIGPTGADIFLREAQAVWPDLRPFADRRTLSGAERLGLP 245
Query 187 TDPKKLAS-VAPSSNALLAAALVRVA 211
P++LAS VA + A+ALVRVA
Sbjct 246 QTPRRLASLVAEAEFGRFASALVRVA 271
>gi|117165263|emb|CAJ88824.1| putative endonuclease III-like protein [Streptomyces ambofaciens
ATCC 23877]
Length=216
Score = 159 bits (403), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 86/186 (47%), Positives = 115/186 (62%), Gaps = 0/186 (0%)
Query 15 PEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCS 74
P + R LL G TYA EAGIR+RD P PL++LLVL L S I + A AR L +
Sbjct 5 PGRVLRELLDAHGRTYAEEAGIRLRDTPQPLYRLLVLAHLLSARIRGSIAVATARALHEA 64
Query 75 GLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRP 134
GLR P+ + A+ Q + A GR Y RYDE +AT+L A + + + GDLR L ++
Sbjct 65 GLRDPRRMAGADWQERVDALGRGGYRRYDERTATQLGDAAELLTERWGGDLRRLREQADG 124
Query 135 DVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLAS 194
+V +R+L+ F GIG TGADIFLREVQ VW + PY D +A A++LGLP DP +L
Sbjct 125 EVPETRRLLQEFPGIGPTGADIFLREVQRVWPGIAPYLDRKALQGAQRLGLPDDPGRLLE 184
Query 195 VAPSSN 200
+A S++
Sbjct 185 LAGSTD 190
>gi|302526875|ref|ZP_07279217.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302435770|gb|EFL07586.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=200
Score = 159 bits (403), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 99/189 (53%), Positives = 120/189 (64%), Gaps = 1/189 (0%)
Query 22 LLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPKA 81
LL GTT+A EAGI++ DKP PL++LLVL L S I A A AAREL SG RTP A
Sbjct 7 LLAEHGTTFAEEAGIKLADKPQPLYRLLVLATLLSTRISADIAVAAARELSRSGWRTPAA 66
Query 82 VLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKR 141
+ + Q + A GRAHYVRYDES++ L A A +RD Y DLR +A D ++
Sbjct 67 MRDSTWQQRVDALGRAHYVRYDESTSQHLGAGAEFIRDRYRDDLRRMADDADGDPRRLEK 126
Query 142 MLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSN- 200
+L F IG TGA IF RE Q VW W+RPYFD +A AK+ GLPTDP +LA + P +
Sbjct 127 LLAEFPRIGPTGAQIFCREAQAVWPWLRPYFDRKALDGAKKAGLPTDPDRLAKLVPGEDS 186
Query 201 ALLAAALVR 209
A LAAALVR
Sbjct 187 ARLAAALVR 195
>gi|29828148|ref|NP_822782.1| endonuclease III-like protein [Streptomyces avermitilis MA-4680]
gi|29605250|dbj|BAC69317.1| putative endonuclease III-like protein [Streptomyces avermitilis
MA-4680]
Length=215
Score = 159 bits (402), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 88/191 (47%), Positives = 121/191 (64%), Gaps = 1/191 (0%)
Query 22 LLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPKA 81
L+ GTTYA EAGIR++D P PL++LLV+ L S I ++ A A R L+ +GLR P+
Sbjct 11 LIAAHGTTYADEAGIRLKDAPQPLYRLLVMACLLSARIRSSVALAATRALYDAGLRDPRR 70
Query 82 VLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKR 141
+ A+ Q + A GR Y RYDE +AT+L A + + + GDLR + V +R
Sbjct 71 MAEADWQERVDALGRGGYRRYDERTATQLGEGAELLIERWGGDLRRMRDEADGKVPELRR 130
Query 142 MLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSN- 200
+L+ G+G GADIFLREVQ VW V P+ D++A + A++LGLP DP+KL A +
Sbjct 131 LLREIPGMGPAGADIFLREVQHVWPGVAPHLDNKALSGAERLGLPKDPRKLMERAGDTEP 190
Query 201 ALLAAALVRVA 211
A+LAAALVRVA
Sbjct 191 AVLAAALVRVA 201
>gi|302541039|ref|ZP_07293381.1| conserved hypothetical protein [Streptomyces hygroscopicus ATCC
53653]
gi|302458657|gb|EFL21750.1| conserved hypothetical protein [Streptomyces himastatinicus ATCC
53653]
Length=212
Score = 154 bits (390), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 91/199 (46%), Positives = 115/199 (58%), Gaps = 6/199 (3%)
Query 14 KPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFC 73
K + R LL GTTYAAEAGI +R+ P PL+QLLVL L S I A A +AR LF
Sbjct 6 KNTSVVRSLLDQQGTTYAAEAGITLRNTPGPLYQLLVLAHLLSARIKADIAVASARALFD 65
Query 74 SGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTR 133
+G+R P+ + A Q + A G Y RYDE +AT+L A VR EY GDLR + +
Sbjct 66 AGMRDPRTMADATWQQRVDALGEGGYRRYDERTATQLGEAAELVRREYQGDLRRMREAGD 125
Query 134 PDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLA 193
P +++L+ GIG G DIFLRE Q VW PYFD +A A+++GLP L
Sbjct 126 P-----RKLLREITGIGPAGVDIFLREAQGVWPEFAPYFDRKALEGAERVGLPKTAASLE 180
Query 194 SVAPSSN-ALLAAALVRVA 211
+ P + LAAALVRVA
Sbjct 181 KLVPGKDLPRLAAALVRVA 199
>gi|302562117|ref|ZP_07314459.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
gi|302479735|gb|EFL42828.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
Length=218
Score = 154 bits (388), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 88/191 (47%), Positives = 115/191 (61%), Gaps = 1/191 (0%)
Query 22 LLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPKA 81
L+ G TYA EAGI +RD P PL++LLVL L S I + A AAR L +GLR P+
Sbjct 12 LVGAHGRTYAEEAGITLRDTPQPLYRLLVLAHLLSARIRGSIAVAAARALHEAGLRDPRR 71
Query 82 VLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKR 141
+ A Q + A GR Y RYDE +AT+L A + + + GDLR L + D +R
Sbjct 72 MAGARWQERVDALGRGGYRRYDERTATQLGEAAELLNERWGGDLRRLRREADGDTGELRR 131
Query 142 MLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSN- 200
+L+ F G+G GADIFLREVQ VW P D +A A++LGLP DP++L +A +
Sbjct 132 LLQEFPGMGPAGADIFLREVQRVWPETSPRLDAKALQGAERLGLPKDPERLLGLAGRTEP 191
Query 201 ALLAAALVRVA 211
A+LAAALVR A
Sbjct 192 AVLAAALVRSA 202
>gi|269125834|ref|YP_003299204.1| hypothetical protein Tcur_1590 [Thermomonospora curvata DSM 43183]
gi|268310792|gb|ACY97166.1| conserved hypothetical protein [Thermomonospora curvata DSM 43183]
Length=212
Score = 152 bits (384), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/196 (46%), Positives = 124/196 (64%), Gaps = 1/196 (0%)
Query 16 EPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSG 75
+ + ++LL+ AG TYA EAG+ ++D+P LF+LLVL L S I A A +ARELF +G
Sbjct 2 DAVVKKLLREAGRTYAEEAGVVVKDQPPALFKLLVLSSLLSSRIPADIAVASARELFAAG 61
Query 76 LRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPD 135
T + + Q ++ A GR +R+DES+ TRL+ A + +Y GDLR LA+ + D
Sbjct 62 GGTARGLARMSWQDLVDALGRDRCMRHDESAPTRLSDTAELAQHKYDGDLRRLARDSGRD 121
Query 136 VSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASV 195
A +L+ F G G G D F RE Q +W W+RPYFD++A A A++LGLPTDPK+LA +
Sbjct 122 RVRAAELLQEFPGFGPAGVDAFCREAQAIWPWLRPYFDEQARAGAERLGLPTDPKRLADL 181
Query 196 APSSN-ALLAAALVRV 210
P + A A ALVRV
Sbjct 182 VPDKDLARFAVALVRV 197
>gi|320006885|gb|ADW01735.1| hypothetical protein Sfla_0267 [Streptomyces flavogriseus ATCC
33331]
Length=204
Score = 152 bits (383), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 99/191 (52%), Positives = 118/191 (62%), Gaps = 1/191 (0%)
Query 22 LLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPKA 81
LL+ GTTYAAEAGIR+RD P PL+QLLVL L S I A A AAR LF GLRTP+
Sbjct 10 LLERHGTTYAAEAGIRLRDTPQPLYQLLVLSDLLSARIRAGIAVAAARALFSRGLRTPRR 69
Query 82 VLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKR 141
+ A Q + A G Y RYDE +AT+L A + D + GDLR L + DV A +
Sbjct 70 MADATWQERVDALGEGGYRRYDERTATQLGDGALLLLDAFGGDLRRLRREAGGDVHALRS 129
Query 142 MLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSN- 200
L+ F GIG TGADIFLREVQ VW PY D +A A++LGLP P L +A +
Sbjct 130 GLRRFPGIGPTGADIFLREVQTVWPETAPYLDKKALQGAERLGLPGTPAALTRLARDEDP 189
Query 201 ALLAAALVRVA 211
A+LAAALVR A
Sbjct 190 AVLAAALVRGA 200
>gi|345010736|ref|YP_004813090.1| endonuclease III-like protein [Streptomyces violaceusniger Tu
4113]
gi|344037085|gb|AEM82810.1| endonuclease III-like protein [Streptomyces violaceusniger Tu
4113]
Length=210
Score = 152 bits (383), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 91/197 (47%), Positives = 114/197 (58%), Gaps = 6/197 (3%)
Query 16 EPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSG 75
E +AR LL GTT+A +AGI++RD P PL+QLLVL L S I + A AAR LF +G
Sbjct 6 ESVARSLLDQQGTTFATQAGIKLRDTPGPLYQLLVLAHLLSARISSDIAVAAARALFDAG 65
Query 76 LRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPD 135
+R P+ + A Q + A G Y RYDE +AT+L A V EY GDLR + + P
Sbjct 66 MRDPRRMAEATWQQRVDALGEGGYRRYDERTATQLGEAAELVNREYGGDLRRMREAGDP- 124
Query 136 VSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLAS- 194
K++L GIG G +IFLREVQ VW PYFD +A A+++GLP L
Sbjct 125 ----KKLLPEIKGIGPAGVNIFLREVQGVWPEFAPYFDRKALEGAERVGLPKSAGALGRL 180
Query 195 VAPSSNALLAAALVRVA 211
VA LAAALVRVA
Sbjct 181 VAEKDLPRLAAALVRVA 197
>gi|337769409|emb|CCB78122.1| conserved protein of unknown function [Streptomyces cattleya
NRRL 8057]
Length=220
Score = 150 bits (380), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 91/193 (48%), Positives = 117/193 (61%), Gaps = 1/193 (0%)
Query 20 RRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTP 79
R LL G TYA EAGI ++D P PL+QL+VL L S I A A AAR LF +GL P
Sbjct 12 RALLDEHGRTYAEEAGIHLKDTPQPLYQLVVLATLLSARIKAQVATAAARALFEAGLGDP 71
Query 80 KAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAA 139
+ + A+ Q + A G HY RYDE +AT+L A + D Y GD+R + Q + D A
Sbjct 72 RRMADADWQRRVDALGEGHYRRYDERTATQLGEGAALLLDRYGGDVRRMRQASDGDPDAL 131
Query 140 KRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSS 199
+R L+ GIG GADIF+RE Q +W P+FDD+A A ++GLPT P++LA +A
Sbjct 132 RRSLREVPGIGPVGADIFIREAQSLWPEFGPFFDDKAVRGAGRVGLPTSPEELAHLAGRE 191
Query 200 N-ALLAAALVRVA 211
A LAAALVR A
Sbjct 192 RWAALAAALVRSA 204
>gi|134096776|ref|YP_001102437.1| endonuclease III-like protein [Saccharopolyspora erythraea NRRL
2338]
gi|291005180|ref|ZP_06563153.1| endonuclease III-like protein [Saccharopolyspora erythraea NRRL
2338]
gi|133909399|emb|CAL99511.1| endonuclease III-like protein [Saccharopolyspora erythraea NRRL
2338]
Length=209
Score = 148 bits (374), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 89/181 (50%), Positives = 111/181 (62%), Gaps = 2/181 (1%)
Query 20 RRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTP 79
R LL AGTTYA EAGI++ DKP PL++LLVL +L S I A A AAREL G TP
Sbjct 8 RDLLDRAGTTYAEEAGIKLEDKPAPLYRLLVLSVLLSTRIKAGIAVSAARELAEFG--TP 65
Query 80 KAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAA 139
+ + A Q + A GR HYVRYDES+AT L A + D Y GDLR+L + A
Sbjct 66 QKMRDATWQQRVDALGRGHYVRYDESTATSLGDGAEYLLDRYRGDLRKLRDEAGGGIKAL 125
Query 140 KRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSS 199
K L+ G+G GADIF RE Q VW +RPYFD +A + AK++GLP D K+LA +
Sbjct 126 KSKLQEVKGLGPVGADIFCREAQAVWPELRPYFDKKALSGAKKVGLPEDAKRLAELVGDK 185
Query 200 N 200
+
Sbjct 186 D 186
>gi|291435768|ref|ZP_06575158.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC
14672]
gi|291338663|gb|EFE65619.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC
14672]
Length=217
Score = 148 bits (374), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 81/183 (45%), Positives = 110/183 (61%), Gaps = 0/183 (0%)
Query 14 KPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFC 73
+ E + R L+ G TYA EAGIR+RD P PL++LLVL L S I + A AR L
Sbjct 4 RDERVVRALVDAHGRTYAEEAGIRLRDTPQPLYRLLVLAHLLSARIRGSIAVATARALHE 63
Query 74 SGLRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTR 133
+GLR P+ + +A+ Q + A GR Y RYDE +AT+L A + + + GDLR L +
Sbjct 64 AGLRDPRRMAAADWQERVDALGRGGYRRYDERTATQLGEEAELLIERWGGDLRGLREEAG 123
Query 134 PDVSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLA 193
+V +R+L+ F G+G GADIFLREVQ VW PY D +A A++L LP DP +L
Sbjct 124 GEVPELRRLLQVFPGVGPAGADIFLREVQRVWPGTAPYLDAKALQGARRLELPQDPNRLT 183
Query 194 SVA 196
+A
Sbjct 184 GLA 186
>gi|302555579|ref|ZP_07307921.1| conserved hypothetical protein [Streptomyces viridochromogenes
DSM 40736]
gi|302473197|gb|EFL36290.1| conserved hypothetical protein [Streptomyces viridochromogenes
DSM 40736]
Length=215
Score = 147 bits (371), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 81/185 (44%), Positives = 109/185 (59%), Gaps = 0/185 (0%)
Query 16 EPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSG 75
E + R L+ G T+A EAGIR+RD P PL++LLVL L S I + A AR L +G
Sbjct 5 ERVVRELVGAHGRTFADEAGIRLRDTPQPLYRLLVLSHLLSARIRGSIALATARALHEAG 64
Query 76 LRTPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPD 135
LR P+ + A Q + A GR Y RYDE +AT+L A + D + GDLR L +
Sbjct 65 LRDPRRMAEASWQERVDALGRGGYRRYDERTATQLGEEAELLTDRWGGDLRRLRREADGK 124
Query 136 VSAAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASV 195
VS + +L+ F G+G G+DIFLRE Q VW PY D +A A++L LP DP+ LA +
Sbjct 125 VSELRHLLQDFPGMGPAGSDIFLREAQGVWPEAAPYLDAKALQGAERLNLPKDPEHLADL 184
Query 196 APSSN 200
A S++
Sbjct 185 AGSTD 189
>gi|297196328|ref|ZP_06913726.1| conserved hypothetical protein [Streptomyces pristinaespiralis
ATCC 25486]
gi|297153169|gb|EFH32182.1| conserved hypothetical protein [Streptomyces pristinaespiralis
ATCC 25486]
Length=224
Score = 146 bits (369), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 89/191 (47%), Positives = 118/191 (62%), Gaps = 1/191 (0%)
Query 22 LLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPKA 81
LL+ G TYA EAGI +RD P PL++LLVL L S I A+ A AAREL +G+R+P+A
Sbjct 19 LLEDHGRTYAEEAGITLRDTPQPLYRLLVLAHLLSARIRASVAVAAARELSDAGMRSPEA 78
Query 82 VLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKR 141
+ A Q + A GR Y RYDE +AT+L A + + Y GDLR + ++ DV +R
Sbjct 79 MKKASWQERVDALGRGGYRRYDERTATQLGDGAELLTERYGGDLRRMRKQADGDVDELRR 138
Query 142 MLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSNA 201
+L+ GIG GADIFLREVQ VW P D++A A++LG+ DP L +A ++A
Sbjct 139 LLREVPGIGPAGADIFLREVQAVWPEAGPLLDEKALQGARRLGVADDPGTLLGLAGDADA 198
Query 202 L-LAAALVRVA 211
LAA LVR A
Sbjct 199 AHLAAGLVRAA 209
>gi|345003861|ref|YP_004806715.1| hypothetical protein SACTE_6405 [Streptomyces sp. SirexAA-E]
gi|344319487|gb|AEN14175.1| conserved hypothetical protein [Streptomyces sp. SirexAA-E]
Length=213
Score = 145 bits (367), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 85/175 (49%), Positives = 105/175 (60%), Gaps = 0/175 (0%)
Query 22 LLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPKA 81
LL+ GTTYA EAGIR+RD P PL+QLLVL L S I A+ A +AR LF G+R+P+
Sbjct 10 LLEEHGTTYAEEAGIRLRDTPQPLYQLLVLSHLLSARIRASVAVASARALFSHGMRSPRR 69
Query 82 VLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKR 141
+ A Q + A G Y RYDE +AT+L A + D Y GDLR L D A +
Sbjct 70 MADATWQQRVDALGEGGYRRYDERTATQLGDGAELLLDAYGGDLRRLRGDADGDTDALRS 129
Query 142 MLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVA 196
L+ F G+G GADIFLREVQ VW PY D +A A++LGLPT P L +A
Sbjct 130 GLRRFPGMGPAGADIFLREVQAVWPEAAPYLDSKALQGAERLGLPTSPTGLRRLA 184
>gi|257067724|ref|YP_003153979.1| hypothetical protein Bfae_05210 [Brachybacterium faecium DSM
4810]
gi|256558542|gb|ACU84389.1| hypothetical protein Bfae_05210 [Brachybacterium faecium DSM
4810]
Length=215
Score = 143 bits (361), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 94/194 (49%), Positives = 117/194 (61%), Gaps = 0/194 (0%)
Query 18 LARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLR 77
LA RLL G T+AA+A I +RDKP PL+QLLVL +L S I + A ARELF +G R
Sbjct 7 LAHRLLTAHGRTFAADAAITLRDKPAPLWQLLVLSLLLSTRISSDIAVATARELFSAGWR 66
Query 78 TPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVS 137
TP + + Q + A GR Y RYDES+ATRL A + D + GDLR L D
Sbjct 67 TPGHLRESTWQERVDALGRGGYRRYDESTATRLDDAAALLLDRWKGDLRRLRDEAESDPR 126
Query 138 AAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAP 197
+L+TF+GIG TGA IFLREVQ VW VRP+ D+ A+ GLP D + LA +
Sbjct 127 RIAELLQTFDGIGPTGASIFLREVQQVWPTVRPFADELVLKGARAAGLPQDAESLAGLVD 186
Query 198 SSNALLAAALVRVA 211
A LA+ALVR+A
Sbjct 187 DDFASLASALVRIA 200
>gi|302867699|ref|YP_003836336.1| hypothetical protein Micau_3231 [Micromonospora aurantiaca ATCC
27029]
gi|315505900|ref|YP_004084787.1| hypothetical protein ML5_5163 [Micromonospora sp. L5]
gi|302570558|gb|ADL46760.1| hypothetical protein Micau_3231 [Micromonospora aurantiaca ATCC
27029]
gi|315412519|gb|ADU10636.1| hypothetical protein ML5_5163 [Micromonospora sp. L5]
Length=218
Score = 142 bits (359), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 99/195 (51%), Positives = 120/195 (62%), Gaps = 1/195 (0%)
Query 18 LARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLR 77
+AR LL+ TYA EAGI + D+P PL+QLLVL L S I A+ A AARELF +G R
Sbjct 7 VARALLERQNRTYAEEAGIALADRPGPLYQLLVLTTLLSTRIRASVAVAAARELFAAGYR 66
Query 78 TPKAVLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVS 137
TP+A+ +A Q + A GR HY RYDE +AT L A D + GDLR L + D +
Sbjct 67 TPQAMEAASWQDRVDALGRGHYRRYDERTATMLGTGARLCLDRWHGDLRRLHKEAGADRA 126
Query 138 AAKRMLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLAS-VA 196
+R+L F GIG TGADIFLRE Q VW VRPY D R A A++LGLP P L V
Sbjct 127 VLRRLLTEFPGIGPTGADIFLREAQSVWPDVRPYADRRTLAGARRLGLPASPGDLVGLVG 186
Query 197 PSSNALLAAALVRVA 211
+ LA+ALVRVA
Sbjct 187 EADFGRLASALVRVA 201
>gi|330468131|ref|YP_004405874.1| hypothetical protein VAB18032_20870 [Verrucosispora maris AB-18-032]
gi|328811102|gb|AEB45274.1| hypothetical protein VAB18032_20870 [Verrucosispora maris AB-18-032]
Length=215
Score = 138 bits (347), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 95/184 (52%), Positives = 113/184 (62%), Gaps = 1/184 (0%)
Query 29 TYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPKAVLSAERQ 88
TYA EAGI + D+P PL+QLLVL L S I A A AARELF +G RT A+ +A Q
Sbjct 16 TYAEEAGISLADRPAPLYQLLVLVTLLSTRIRAQVAVAAARELFAAGYRTAAAMEAASWQ 75
Query 89 TMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKRMLKTFNG 148
+ A GR HY RYDE +AT L A D + GDLR L ++ D +R+L F G
Sbjct 76 QRVDALGRGHYRRYDERTATMLGTGARLCLDRWHGDLRRLRRQADGDQDQLRRLLVEFPG 135
Query 149 IGDTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSN-ALLAAAL 207
IG TGADIFLREVQ VW +RPY D RA A AK+LGLP +LA + LA+AL
Sbjct 136 IGPTGADIFLREVQTVWPELRPYADRRAVAGAKRLGLPAATDRLAGLVDEGEFGRLASAL 195
Query 208 VRVA 211
VRVA
Sbjct 196 VRVA 199
>gi|182434279|ref|YP_001821998.1| hypothetical protein SGR_486 [Streptomyces griseus subsp. griseus
NBRC 13350]
gi|178462795|dbj|BAG17315.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
NBRC 13350]
Length=223
Score = 137 bits (346), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 85/183 (47%), Positives = 102/183 (56%), Gaps = 8/183 (4%)
Query 22 LLKLAGTTYAAEAGIRIRDKPMPLFQLLVLCMLASKPIGAATAARAARELFCSGLRTPKA 81
LL G TYA EAGIR+RD P PL++LLVL L S I A+ A AAR LF GLRTP+
Sbjct 16 LLDGYGRTYAQEAGIRLRDTPQPLYRLLVLSHLLSARIRASIAVSAARALFADGLRTPRR 75
Query 82 VLSAERQTMISAFGRAHYVRYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKR 141
+ A Q + A G Y RYDE ++T+L A V D + GDLR L A
Sbjct 76 MAGASWQQRVDALGEGGYRRYDERTSTQLGEGARLVLDVWKGDLRRLRAEADGSADAVCD 135
Query 142 MLKTFNGIGDTGADIFLREVQDVWIWVRPYFDDRATA----AAKQLGLPTDPKKLASVAP 197
L+ GIG GADIF+REVQD+W P RA A A++LGLPT P LA +AP
Sbjct 136 GLQRVRGIGPAGADIFVREVQDLW----PETGFRAGAKGLRGAEKLGLPTSPGALARLAP 191
Query 198 SSN 200
Sbjct 192 DGG 194
Lambda K H
0.323 0.135 0.389
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 249157747332
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40