BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1056
Length=254
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value
gi|15608196|ref|NP_215572.1| hypothetical protein Rv1056 [Mycoba... 517 5e-145
gi|340626067|ref|YP_004744519.1| hypothetical protein MCAN_10621... 516 1e-144
gi|254231345|ref|ZP_04924672.1| conserved hypothetical protein [... 516 2e-144
gi|253799909|ref|YP_003032910.1| hypothetical protein TBMG_02930... 516 2e-144
gi|289573696|ref|ZP_06453923.1| conserved hypothetical protein [... 514 3e-144
gi|339294053|gb|AEJ46164.1| hypothetical protein CCDC5079_0974 [... 460 8e-128
gi|254822125|ref|ZP_05227126.1| hypothetical protein MintA_19477... 441 4e-122
gi|254774067|ref|ZP_05215583.1| hypothetical protein MaviaA2_052... 441 7e-122
gi|118465664|ref|YP_880430.1| hypothetical protein MAV_1184 [Myc... 440 8e-122
gi|41407104|ref|NP_959940.1| hypothetical protein MAP1006 [Mycob... 439 2e-121
gi|296169923|ref|ZP_06851533.1| conserved hypothetical protein [... 438 4e-121
gi|240170310|ref|ZP_04748969.1| hypothetical protein MkanA1_1343... 435 3e-120
gi|342861651|ref|ZP_08718297.1| hypothetical protein MCOL_22296 ... 434 8e-120
gi|183984385|ref|YP_001852676.1| hypothetical protein MMAR_4414 ... 427 6e-118
gi|120405671|ref|YP_955500.1| hypothetical protein Mvan_4720 [My... 387 9e-106
gi|88856690|ref|ZP_01131346.1| hypothetical protein A20C1_10925 ... 274 9e-72
gi|302526356|ref|ZP_07278698.1| conserved hypothetical protein [... 259 3e-67
gi|297161047|gb|ADI10759.1| hypothetical protein SBI_07639 [Stre... 257 1e-66
gi|284030481|ref|YP_003380412.1| hypothetical protein Kfla_2542 ... 251 6e-65
gi|345011838|ref|YP_004814192.1| hypothetical protein Strvi_4261... 250 2e-64
gi|320006809|gb|ADW01659.1| protein of unknown function DUF427 [... 246 3e-63
gi|298524554|ref|ZP_07011963.1| conserved hypothetical protein [... 245 4e-63
gi|308371845|ref|ZP_07426438.2| hypothetical protein TMDG_03875 ... 243 2e-62
gi|302556748|ref|ZP_07309090.1| conserved hypothetical protein [... 238 5e-61
gi|289767449|ref|ZP_06526827.1| conserved hypothetical protein [... 237 9e-61
gi|21225412|ref|NP_631191.1| hypothetical protein SCO7130 [Strep... 236 3e-60
gi|269128733|ref|YP_003302103.1| hypothetical protein Tcur_4538 ... 231 1e-58
gi|337768973|emb|CCB77686.1| conserved protein of unknown functi... 228 7e-58
gi|333920476|ref|YP_004494057.1| hypothetical protein AS9A_2810 ... 214 1e-53
gi|111224144|ref|YP_714938.1| hypothetical protein FRAAL4754 [Fr... 203 2e-50
gi|291435757|ref|ZP_06575147.1| conserved hypothetical protein [... 201 7e-50
gi|256390991|ref|YP_003112555.1| hypothetical protein Caci_1793 ... 201 7e-50
gi|288916063|ref|ZP_06410444.1| protein of unknown function DUF4... 201 8e-50
gi|312197395|ref|YP_004017456.1| hypothetical protein FraEuI1c_3... 195 6e-48
gi|108803134|ref|YP_643071.1| hypothetical protein Rxyl_0283 [Ru... 194 1e-47
gi|302555590|ref|ZP_07307932.1| conserved hypothetical protein [... 188 7e-46
gi|300784890|ref|YP_003765181.1| hypothetical protein AMED_2986 ... 186 3e-45
gi|297197885|ref|ZP_06915282.1| conserved hypothetical protein [... 185 5e-45
gi|336180036|ref|YP_004585411.1| hypothetical protein FsymDg_422... 180 2e-43
gi|297189910|ref|ZP_06907308.1| conserved hypothetical protein [... 176 3e-42
gi|302675254|ref|XP_003027311.1| hypothetical protein SCHCODRAFT... 174 8e-42
gi|330925525|ref|XP_003301086.1| hypothetical protein PTT_12502 ... 173 2e-41
gi|336363527|gb|EGN91912.1| hypothetical protein SERLA73DRAFT_19... 171 1e-40
gi|189207523|ref|XP_001940095.1| conserved hypothetical protein ... 170 2e-40
gi|292490297|ref|YP_003525736.1| hypothetical protein Nhal_0132 ... 170 2e-40
gi|115387671|ref|XP_001211341.1| predicted protein [Aspergillus ... 169 4e-40
gi|134098508|ref|YP_001104169.1| hypothetical protein SACE_1933 ... 169 4e-40
gi|331698863|ref|YP_004335102.1| hypothetical protein Psed_5112 ... 168 7e-40
gi|242205892|ref|XP_002468803.1| predicted protein [Postia place... 167 1e-39
gi|330465367|ref|YP_004403110.1| hypothetical protein VAB18032_0... 166 2e-39
>gi|15608196|ref|NP_215572.1| hypothetical protein Rv1056 [Mycobacterium tuberculosis H37Rv]
gi|15840488|ref|NP_335525.1| hypothetical protein MT1085 [Mycobacterium tuberculosis CDC1551]
gi|31792247|ref|NP_854740.1| hypothetical protein Mb1085 [Mycobacterium bovis AF2122/97]
57 more sequence titles
Length=254
Score = 517 bits (1332), Expect = 5e-145, Method: Compositional matrix adjust.
Identities = 254/254 (100%), Positives = 254/254 (100%), Gaps = 0/254 (0%)
Query 1 MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF 60
MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF
Sbjct 1 MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF 60
Query 61 LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE 120
LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE
Sbjct 61 LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE 120
Query 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH
Sbjct 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
Query 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT
Sbjct 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
Query 241 VDGVALPRPHTQFS 254
VDGVALPRPHTQFS
Sbjct 241 VDGVALPRPHTQFS 254
>gi|340626067|ref|YP_004744519.1| hypothetical protein MCAN_10621 [Mycobacterium canettii CIPT
140010059]
gi|340004257|emb|CCC43398.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=254
Score = 516 bits (1329), Expect = 1e-144, Method: Compositional matrix adjust.
Identities = 253/254 (99%), Positives = 254/254 (100%), Gaps = 0/254 (0%)
Query 1 MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF 60
MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVR+EF
Sbjct 1 MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRLEF 60
Query 61 LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE 120
LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE
Sbjct 61 LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE 120
Query 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH
Sbjct 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
Query 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT
Sbjct 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
Query 241 VDGVALPRPHTQFS 254
VDGVALPRPHTQFS
Sbjct 241 VDGVALPRPHTQFS 254
>gi|254231345|ref|ZP_04924672.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|124600404|gb|EAY59414.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
Length=254
Score = 516 bits (1328), Expect = 2e-144, Method: Compositional matrix adjust.
Identities = 253/254 (99%), Positives = 253/254 (99%), Gaps = 0/254 (0%)
Query 1 MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF 60
MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF
Sbjct 1 MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF 60
Query 61 LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE 120
LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE
Sbjct 61 LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE 120
Query 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH
Sbjct 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
Query 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLP VAPIAGLVAFYNEKVDLT
Sbjct 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPVVAPIAGLVAFYNEKVDLT 240
Query 241 VDGVALPRPHTQFS 254
VDGVALPRPHTQFS
Sbjct 241 VDGVALPRPHTQFS 254
>gi|253799909|ref|YP_003032910.1| hypothetical protein TBMG_02930 [Mycobacterium tuberculosis KZN
1435]
gi|289555160|ref|ZP_06444370.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
gi|297633587|ref|ZP_06951367.1| hypothetical protein MtubK4_05663 [Mycobacterium tuberculosis
KZN 4207]
6 more sequence titles
Length=254
Score = 516 bits (1328), Expect = 2e-144, Method: Compositional matrix adjust.
Identities = 253/254 (99%), Positives = 254/254 (100%), Gaps = 0/254 (0%)
Query 1 MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF 60
MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF
Sbjct 1 MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF 60
Query 61 LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE 120
LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE
Sbjct 61 LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE 120
Query 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
PI+GHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH
Sbjct 121 PIHGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
Query 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT
Sbjct 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
Query 241 VDGVALPRPHTQFS 254
VDGVALPRPHTQFS
Sbjct 241 VDGVALPRPHTQFS 254
>gi|289573696|ref|ZP_06453923.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339631120|ref|YP_004722762.1| hypothetical protein MAF_10690 [Mycobacterium africanum GM041182]
gi|289538127|gb|EFD42705.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339330476|emb|CCC26141.1| conserved hypothetical protein [Mycobacterium africanum GM041182]
Length=254
Score = 514 bits (1325), Expect = 3e-144, Method: Compositional matrix adjust.
Identities = 253/254 (99%), Positives = 253/254 (99%), Gaps = 0/254 (0%)
Query 1 MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF 60
MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF
Sbjct 1 MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF 60
Query 61 LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE 120
LR ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE
Sbjct 61 LRGENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDE 120
Query 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH
Sbjct 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
Query 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT
Sbjct 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
Query 241 VDGVALPRPHTQFS 254
VDGVALPRPHTQFS
Sbjct 241 VDGVALPRPHTQFS 254
>gi|339294053|gb|AEJ46164.1| hypothetical protein CCDC5079_0974 [Mycobacterium tuberculosis
CCDC5079]
gi|339297693|gb|AEJ49803.1| hypothetical protein CCDC5180_0966 [Mycobacterium tuberculosis
CCDC5180]
Length=226
Score = 460 bits (1184), Expect = 8e-128, Method: Compositional matrix adjust.
Identities = 225/226 (99%), Positives = 226/226 (100%), Gaps = 0/226 (0%)
Query 29 VLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDENHPQRVQLGPSRLHSLVSAGQTHR 88
+LVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDENHPQRVQLGPSRLHSLVSAGQTHR
Sbjct 1 MLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDENHPQRVQLGPSRLHSLVSAGQTHR 60
Query 89 SAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPIYGHPRNPYQRADALRSHRHVRVELDG 148
SAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPIYGHPRNPYQRADALRSHRHVRVELDG
Sbjct 61 SAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPIYGHPRNPYQRADALRSHRHVRVELDG 120
Query 149 IVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCPYKGTTSGYWSVRVGDAV 208
IVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCPYKGTTSGYWSVRVGDAV
Sbjct 121 IVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCPYKGTTSGYWSVRVGDAV 180
Query 209 HRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQFS 254
HRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQFS
Sbjct 181 HRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQFS 226
>gi|254822125|ref|ZP_05227126.1| hypothetical protein MintA_19477 [Mycobacterium intracellulare
ATCC 13950]
Length=253
Score = 441 bits (1135), Expect = 4e-122, Method: Compositional matrix adjust.
Identities = 208/252 (83%), Positives = 223/252 (89%), Gaps = 0/252 (0%)
Query 3 VDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLR 62
DYPQMAA RGRIEPAPRRVRG+LG LVFDT+AARYVWEVPYYPQYYIPLADVR EFLR
Sbjct 2 TDYPQMAAARGRIEPAPRRVRGFLGDALVFDTTAARYVWEVPYYPQYYIPLADVRTEFLR 61
Query 63 DENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPI 122
DENH Q VQ GPSRL+SLV+ GQTH SAARVFD D DSP+AGTVRF W+PLRWFEEDEP+
Sbjct 62 DENHAQTVQFGPSRLYSLVAEGQTHASAARVFDADSDSPLAGTVRFEWNPLRWFEEDEPV 121
Query 123 YGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLE 182
YGHPRNPY R DALRSHRHVRVE DGI LA T+SPVLLFETG+PTRYYIDP D+ FEHL+
Sbjct 122 YGHPRNPYSRVDALRSHRHVRVEFDGITLAATKSPVLLFETGLPTRYYIDPTDVVFEHLQ 181
Query 183 PTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVD 242
P++TQTLCPYKGTTSGYWSVRVGD VH DLAWTYHYPLPAV IAGL+AFYNEKVD+ VD
Sbjct 182 PSTTQTLCPYKGTTSGYWSVRVGDIVHEDLAWTYHYPLPAVGQIAGLIAFYNEKVDIVVD 241
Query 243 GVALPRPHTQFS 254
G L RP TQFS
Sbjct 242 GAPLARPQTQFS 253
>gi|254774067|ref|ZP_05215583.1| hypothetical protein MaviaA2_05258 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=256
Score = 441 bits (1133), Expect = 7e-122, Method: Compositional matrix adjust.
Identities = 211/251 (85%), Positives = 225/251 (90%), Gaps = 0/251 (0%)
Query 4 DYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
DYPQMAA RGRIEPAPRRVRGYLG VLVFDT+AARYVWEVPYYPQYYIPLADVR E LRD
Sbjct 6 DYPQMAAARGRIEPAPRRVRGYLGDVLVFDTTAARYVWEVPYYPQYYIPLADVRTELLRD 65
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPIY 123
ENH QRVQ GPSRL+S+V+ +T SAARVFD DGD P+AGTVRF WDPLRWFEEDEPIY
Sbjct 66 ENHAQRVQFGPSRLYSVVAGDRTCESAARVFDADGDGPLAGTVRFEWDPLRWFEEDEPIY 125
Query 124 GHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEP 183
GHPRNPY R DALRSHRHVRVE +GI LADTRSPVLLFETG+PTRYYIDP D+ F HLEP
Sbjct 126 GHPRNPYARVDALRSHRHVRVEHEGITLADTRSPVLLFETGLPTRYYIDPTDVDFAHLEP 185
Query 184 TSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDG 243
++TQTLCPYKGTTSGYWSVRVGD VH DLAWTYHYPLPAVA IAGL+AFYNEK+D+ VDG
Sbjct 186 SATQTLCPYKGTTSGYWSVRVGDVVHEDLAWTYHYPLPAVAQIAGLIAFYNEKLDIVVDG 245
Query 244 VALPRPHTQFS 254
LPRPHTQFS
Sbjct 246 TPLPRPHTQFS 256
>gi|118465664|ref|YP_880430.1| hypothetical protein MAV_1184 [Mycobacterium avium 104]
gi|118166951|gb|ABK67848.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=256
Score = 440 bits (1132), Expect = 8e-122, Method: Compositional matrix adjust.
Identities = 211/251 (85%), Positives = 224/251 (90%), Gaps = 0/251 (0%)
Query 4 DYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
DYPQMAA RGRIEPAPRRVRGYLG VLVFDT+AARYVWEVPYYPQYYIPLADVR E LRD
Sbjct 6 DYPQMAAARGRIEPAPRRVRGYLGDVLVFDTTAARYVWEVPYYPQYYIPLADVRTELLRD 65
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPIY 123
ENH QRVQ GPSRL+S+V+ G+T SAARVFD DGD P+AGTVRF WDPLRWFEEDEPIY
Sbjct 66 ENHAQRVQFGPSRLYSVVAGGRTCESAARVFDADGDGPLAGTVRFEWDPLRWFEEDEPIY 125
Query 124 GHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEP 183
GHPRNPY R DALRSHRHV VE DGI LADTRSPVLLFETG+PTRYYID D+ F HLEP
Sbjct 126 GHPRNPYARVDALRSHRHVHVERDGITLADTRSPVLLFETGLPTRYYIDATDVDFAHLEP 185
Query 184 TSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDG 243
++TQTLCPYKGTTSGYWSVRVGD VH DLAWTYHYPLPAVA IAGL+AFYNEK+D+ VDG
Sbjct 186 SATQTLCPYKGTTSGYWSVRVGDVVHEDLAWTYHYPLPAVAQIAGLIAFYNEKLDIVVDG 245
Query 244 VALPRPHTQFS 254
LPRPHTQFS
Sbjct 246 TPLPRPHTQFS 256
>gi|41407104|ref|NP_959940.1| hypothetical protein MAP1006 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41395455|gb|AAS03323.1| hypothetical protein MAP_1006 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336461472|gb|EGO40342.1| hypothetical protein MAPs_30520 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=256
Score = 439 bits (1129), Expect = 2e-121, Method: Compositional matrix adjust.
Identities = 210/251 (84%), Positives = 224/251 (90%), Gaps = 0/251 (0%)
Query 4 DYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
DYPQMAA RGRIEPAPRRVRGYLG VLVFDT+AARYVWEVPYYPQYYIPLADVR E LRD
Sbjct 6 DYPQMAAARGRIEPAPRRVRGYLGDVLVFDTTAARYVWEVPYYPQYYIPLADVRTELLRD 65
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPIY 123
ENH QRVQ GPSRL+S+V+ G+T SAARVFD DGD P+AGTVRF WDPLRWFEEDEPIY
Sbjct 66 ENHAQRVQFGPSRLYSVVAGGRTCESAARVFDADGDGPLAGTVRFEWDPLRWFEEDEPIY 125
Query 124 GHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEP 183
GHPRNPY R DALRSHRHV VE +GI LADT SPVLLFETG+PTRYYIDP D+ F HLEP
Sbjct 126 GHPRNPYARVDALRSHRHVHVEREGITLADTSSPVLLFETGLPTRYYIDPTDVDFAHLEP 185
Query 184 TSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDG 243
++TQTLCPYKGTTSGYWSVRVGD VH DLAWTYHYPLPAVA IAGL+AFYNEK+D+ VDG
Sbjct 186 SATQTLCPYKGTTSGYWSVRVGDVVHEDLAWTYHYPLPAVAQIAGLIAFYNEKLDIVVDG 245
Query 244 VALPRPHTQFS 254
LPRPHTQFS
Sbjct 246 TPLPRPHTQFS 256
>gi|296169923|ref|ZP_06851533.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295895420|gb|EFG75124.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=256
Score = 438 bits (1127), Expect = 4e-121, Method: Compositional matrix adjust.
Identities = 207/251 (83%), Positives = 223/251 (89%), Gaps = 0/251 (0%)
Query 4 DYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
DYPQMAA RGR+EPAPRRVRGYLG LVFDT+AARYVWEVPYYPQYYIPLADVR EFLRD
Sbjct 6 DYPQMAAARGRVEPAPRRVRGYLGDALVFDTTAARYVWEVPYYPQYYIPLADVRAEFLRD 65
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPIY 123
E+HPQ+VQ GPSRLHSL +AG+TH SAARVFD DGD PVAGTVRF W+ LRWFEEDEPIY
Sbjct 66 EDHPQQVQFGPSRLHSLRAAGETHPSAARVFDADGDGPVAGTVRFEWNALRWFEEDEPIY 125
Query 124 GHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEP 183
GHPRNPY R DALRSHRH+RVELDGI LAD+ SPVLLFETG+PTRYYIDP D+AFE LEP
Sbjct 126 GHPRNPYSRVDALRSHRHIRVELDGITLADSSSPVLLFETGLPTRYYIDPTDVAFEQLEP 185
Query 184 TSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDG 243
++TQTLCPYKG TSGYWSVR G + DLAWTYHYPLPAV IAGLVAFYNEK+D+ VDG
Sbjct 186 SATQTLCPYKGVTSGYWSVRTGSGLQPDLAWTYHYPLPAVGQIAGLVAFYNEKLDIVVDG 245
Query 244 VALPRPHTQFS 254
ALPRP TQFS
Sbjct 246 TALPRPQTQFS 256
>gi|240170310|ref|ZP_04748969.1| hypothetical protein MkanA1_13438 [Mycobacterium kansasii ATCC
12478]
Length=256
Score = 435 bits (1118), Expect = 3e-120, Method: Compositional matrix adjust.
Identities = 206/251 (83%), Positives = 227/251 (91%), Gaps = 0/251 (0%)
Query 4 DYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
DYPQMAA RGRIEPAPRR+RGYL LVFDT+AARYVWE+PYYP YY+P+ DVR EFLRD
Sbjct 6 DYPQMAAARGRIEPAPRRIRGYLDDALVFDTTAARYVWELPYYPTYYVPITDVRREFLRD 65
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPIY 123
E+HPQ+VQ GPSRL+SLV A +TH SAARVFD DGDSP+AGTVRF+WDPLRWFEEDE IY
Sbjct 66 EDHPQKVQFGPSRLYSLVGANRTHPSAARVFDADGDSPLAGTVRFDWDPLRWFEEDEQIY 125
Query 124 GHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEP 183
GHPRNPY R DALRSHRHVRV+LDG+VLADTRSPVL+FETG+PTRYYIDP D+AFEHLE
Sbjct 126 GHPRNPYTRVDALRSHRHVRVQLDGVVLADTRSPVLVFETGLPTRYYIDPTDVAFEHLEL 185
Query 184 TSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDG 243
+ST+TLCPYKGTTSGYWSVRVGD +H DLAWTY YPLPAVA IAGLVAFYNEK+D+ VDG
Sbjct 186 SSTRTLCPYKGTTSGYWSVRVGDTLHADLAWTYQYPLPAVAAIAGLVAFYNEKLDIIVDG 245
Query 244 VALPRPHTQFS 254
V LPRP TQFS
Sbjct 246 VVLPRPRTQFS 256
>gi|342861651|ref|ZP_08718297.1| hypothetical protein MCOL_22296 [Mycobacterium colombiense CECT
3035]
gi|342130785|gb|EGT84081.1| hypothetical protein MCOL_22296 [Mycobacterium colombiense CECT
3035]
Length=256
Score = 434 bits (1115), Expect = 8e-120, Method: Compositional matrix adjust.
Identities = 207/250 (83%), Positives = 221/250 (89%), Gaps = 0/250 (0%)
Query 5 YPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDE 64
YPQMAA RGRIEPAPRRVRGYLG LVFDT+AARYVWEVPYYPQYYIPL DVR EFL DE
Sbjct 7 YPQMAAARGRIEPAPRRVRGYLGDTLVFDTTAARYVWEVPYYPQYYIPLDDVRSEFLHDE 66
Query 65 NHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPIYG 124
NH Q+VQ GPSRL+SLV AGQ+H SAARVFD DG P+AGTVRF W+PLRWFEEDEPIYG
Sbjct 67 NHAQKVQFGPSRLYSLVGAGQSHESAARVFDADGGGPLAGTVRFEWNPLRWFEEDEPIYG 126
Query 125 HPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPT 184
HPRNPY R DALRSHRHVRVELDGIVLADT +PVLLFETG+PTRYYIDP DI+FEHLE +
Sbjct 127 HPRNPYSRVDALRSHRHVRVELDGIVLADTTTPVLLFETGLPTRYYIDPTDISFEHLESS 186
Query 185 STQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGV 244
T+TLCPYKG TSGYWSVRVGDAVH DLAWTYHYPLPAV IAGL+AFYNEK+D+ VDG
Sbjct 187 PTRTLCPYKGVTSGYWSVRVGDAVHEDLAWTYHYPLPAVGHIAGLIAFYNEKLDIAVDGS 246
Query 245 ALPRPHTQFS 254
L RP TQFS
Sbjct 247 RLARPQTQFS 256
>gi|183984385|ref|YP_001852676.1| hypothetical protein MMAR_4414 [Mycobacterium marinum M]
gi|183177711|gb|ACC42821.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=261
Score = 427 bits (1099), Expect = 6e-118, Method: Compositional matrix adjust.
Identities = 207/253 (82%), Positives = 225/253 (89%), Gaps = 2/253 (0%)
Query 4 DYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
DYPQMAA RGRIEPAPRRVRGYLGH LVFDT+ ARYVWEVPYYP YY+PLADVR EFLRD
Sbjct 9 DYPQMAAARGRIEPAPRRVRGYLGHELVFDTTQARYVWEVPYYPAYYVPLADVRAEFLRD 68
Query 64 ENHPQRVQLGPSRLHSLVSAG--QTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEP 121
ENH QRVQLG S L+S+V +G QTH SAARVFD DG SPVAGTVRF+WD LRWFEEDE
Sbjct 69 ENHAQRVQLGASHLYSVVGSGATQTHPSAARVFDADGASPVAGTVRFDWDVLRWFEEDEQ 128
Query 122 IYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHL 181
I+GHPRNPY R DALRS RHVRVELDG+VLADT +PVLLFETG+PTRYYIDP D+AFEHL
Sbjct 129 IHGHPRNPYSRVDALRSQRHVRVELDGVVLADTGAPVLLFETGLPTRYYIDPTDVAFEHL 188
Query 182 EPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTV 241
EP++TQTLCPYKGTT+GYWSVRVGD+VH DLAWTYHYPLPAVA IAGLVAFYNEK+D++V
Sbjct 189 EPSATQTLCPYKGTTTGYWSVRVGDSVHPDLAWTYHYPLPAVASIAGLVAFYNEKLDISV 248
Query 242 DGVALPRPHTQFS 254
DGV L RP T F
Sbjct 249 DGVNLSRPRTHFG 261
>gi|120405671|ref|YP_955500.1| hypothetical protein Mvan_4720 [Mycobacterium vanbaalenii PYR-1]
gi|119958489|gb|ABM15494.1| protein of unknown function DUF427 [Mycobacterium vanbaalenii
PYR-1]
Length=258
Score = 387 bits (994), Expect = 9e-106, Method: Compositional matrix adjust.
Identities = 189/253 (75%), Positives = 207/253 (82%), Gaps = 1/253 (0%)
Query 2 SVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFL 61
+ DYP+ AA RGR+EP PRRVRGY+G LVFDT+AARYVWEVPYYPQYYIPL DVR L
Sbjct 7 AADYPRTAADRGRVEPVPRRVRGYVGAELVFDTNAARYVWEVPYYPQYYIPLRDVRPGLL 66
Query 62 RDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEP 121
RD+ PQ+VQ GPSR+ S+V+ +T SAARVFD DGD PVAG V+F WD L WFEEDEP
Sbjct 67 RDDGRPQKVQFGPSRVFSVVAGSRTAVSAARVFD-DGDGPVAGLVKFEWDALTWFEEDEP 125
Query 122 IYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHL 181
IYGHPRNPY R DALRSHRHV VELDG+ LADT SPV+LFETG+PTRYYID DIAFEHL
Sbjct 126 IYGHPRNPYARVDALRSHRHVAVELDGVSLADTHSPVMLFETGLPTRYYIDRTDIAFEHL 185
Query 182 EPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTV 241
EP+ TQTLCPYKG TSGYWSVR VH DLAWTY PLPAVA IA +VAFYNEKVD+TV
Sbjct 186 EPSGTQTLCPYKGVTSGYWSVRTDHGVHADLAWTYQTPLPAVAAIANMVAFYNEKVDITV 245
Query 242 DGVALPRPHTQFS 254
DGV L RP T FS
Sbjct 246 DGVQLSRPKTHFS 258
>gi|88856690|ref|ZP_01131346.1| hypothetical protein A20C1_10925 [marine actinobacterium PHSC20C1]
gi|88814151|gb|EAR24017.1| hypothetical protein A20C1_10925 [marine actinobacterium PHSC20C1]
Length=260
Score = 274 bits (701), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 136/251 (55%), Positives = 167/251 (67%), Gaps = 1/251 (0%)
Query 4 DYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
DYP+M + + I+P PRR+RGY L+FDT+ A YVWE YPQYYIP+ DV + L D
Sbjct 3 DYPRMISEKNLIQPVPRRIRGYFAGQLMFDTTRAIYVWEWSPYPQYYIPIEDVNDDLLVD 62
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDEPI 122
E G + H +AA+ + D ++G VRF WD L WFEEDE I
Sbjct 63 EVRESHETRGTYMRLGFTVGEREHPAAAKKYTDDSLEGLSGMVRFEWDALDSWFEEDEQI 122
Query 123 YGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLE 182
+ HPRNPY R DA+RS R VRVELDG VLA++ SPV++FETG+PTRYY++ D+ FEHL
Sbjct 123 FVHPRNPYTRVDAIRSTRTVRVELDGEVLAESSSPVMVFETGLPTRYYLNRTDVNFEHLI 182
Query 183 PTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVD 242
P T T CPYKGTT+ YWSV VG VH DLAW+Y +P + PIAGLVAFYNEKVD+ +D
Sbjct 183 PNDTVTECPYKGTTTDYWSVNVGGTVHADLAWSYSFPTRQLLPIAGLVAFYNEKVDIFID 242
Query 243 GVALPRPHTQF 253
V LPR T F
Sbjct 243 DVELPRAKTHF 253
>gi|302526356|ref|ZP_07278698.1| conserved hypothetical protein [Streptomyces sp. AA4]
gi|302435251|gb|EFL07067.1| conserved hypothetical protein [Streptomyces sp. AA4]
Length=244
Score = 259 bits (661), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 144/254 (57%), Positives = 164/254 (65%), Gaps = 16/254 (6%)
Query 4 DYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
DYP A G +EP PRRVRG L + D++ A+YVWE PYYPQ+Y PL DV L
Sbjct 3 DYPAAAVETGHVEPVPRRVRGMLAGKTIVDSTRAKYVWEWPYYPQFYFPLDDVLPGALVP 62
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDEPI 122
E R HSL G+ + AA DS + G VRF WD L WFEEDE +
Sbjct 63 EEEAGR--------HSL-HVGEVEKPAAAWVT---DSVLPGHVRFAWDALDAWFEEDEQV 110
Query 123 YGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHL- 181
+ HPRNPY R DALRS RHVRV LDG+ LA++ SPVLLFETG+PTRYY + ++ F HL
Sbjct 111 FVHPRNPYTRVDALRSTRHVRVRLDGVTLAESSSPVLLFETGLPTRYYFNRTEVDFTHLV 170
Query 182 -EPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
EP T CPYKG TSGYWSVRVGD +H LAWTY YP AV IAGLVAFYNE VD+
Sbjct 171 AEPDMV-TACPYKGETSGYWSVRVGDVLHEHLAWTYAYPTVAVQAIAGLVAFYNEMVDIE 229
Query 241 VDGVALPRPHTQFS 254
VDG LPRP T FS
Sbjct 230 VDGELLPRPRTHFS 243
>gi|297161047|gb|ADI10759.1| hypothetical protein SBI_07639 [Streptomyces bingchenggensis
BCW-1]
Length=263
Score = 257 bits (656), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 133/249 (54%), Positives = 163/249 (66%), Gaps = 1/249 (0%)
Query 2 SVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFL 61
SV +P + G +EP PRR+RG +G +VFDT A YVWE P YPQ+ IP+ D+ L
Sbjct 5 SVQHPSLIVPIGHVEPVPRRIRGLIGGRVVFDTRRALYVWERPAYPQFSIPVEDMVEGVL 64
Query 62 RDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDE 120
D++H + + GP+R HSL + + AA ++D P+ GTVRF W+ L WFEEDE
Sbjct 65 TDDHHTEPLGAGPARRHSLHIGPEVRQGAAWLWDDGAPEPLRGTVRFEWEALDSWFEEDE 124
Query 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
P++ HPR+PY R DALRS VRVELDG VLAD V LFETG+PTRYY+D I +
Sbjct 125 PVFVHPRSPYSRVDALRSSSSVRVELDGTVLADAPHCVKLFETGLPTRYYLDRTHIDWPR 184
Query 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
L PT T T CPYKGTTSGYWS A H D+AW Y +P IAGLVAFYNE+VDL
Sbjct 185 LRPTDTVTSCPYKGTTSGYWSFDSDVATHEDIAWAYDFPTAHANRIAGLVAFYNEQVDLY 244
Query 241 VDGVALPRP 249
+DG LPRP
Sbjct 245 IDGTLLPRP 253
>gi|284030481|ref|YP_003380412.1| hypothetical protein Kfla_2542 [Kribbella flavida DSM 17836]
gi|283809774|gb|ADB31613.1| protein of unknown function DUF427 [Kribbella flavida DSM 17836]
Length=245
Score = 251 bits (642), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 132/252 (53%), Positives = 171/252 (68%), Gaps = 11/252 (4%)
Query 4 DYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
+YP+ G++ P PRR+R LG V DT++A+YVWE+P +PQYYIP+ADV L D
Sbjct 3 NYPEAIVPPGQLAPVPRRIRATLGGRTVLDTTSAQYVWEIPPFPQYYIPVADVAGGVLAD 62
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDEPI 122
+ LG R+H+ AG+ A ++D D P+AG VRF WD L WFEEDE +
Sbjct 63 TGDTRPSDLGVGRVHT-AGAGR-----AWLYD---DGPLAGLVRFEWDALDSWFEEDEEV 113
Query 123 YGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLE 182
+ HPRNPY R DAL+S R VRV LD +VLAD+ S V++FETG+ R+Y +AF+HLE
Sbjct 114 FVHPRNPYSRCDALKSGRRVRVCLDDVVLADSTSTVIVFETGLSPRHYFPRTAVAFDHLE 173
Query 183 PTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVD 242
P+ T+T CPYKG TS YWS+R D +H DLAW+Y +P A+ PIAG VAF+ EK+DLTVD
Sbjct 174 PSDTETACPYKGRTSAYWSIRT-DTLHPDLAWSYDFPTAALLPIAGHVAFFTEKLDLTVD 232
Query 243 GVALPRPHTQFS 254
GV + RP T FS
Sbjct 233 GVPVARPVTPFS 244
>gi|345011838|ref|YP_004814192.1| hypothetical protein Strvi_4261 [Streptomyces violaceusniger
Tu 4113]
gi|344038187|gb|AEM83912.1| protein of unknown function DUF427 [Streptomyces violaceusniger
Tu 4113]
Length=266
Score = 250 bits (638), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 131/247 (54%), Positives = 160/247 (65%), Gaps = 1/247 (0%)
Query 4 DYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
DYP M G +EP PRR+RGY+ LVFDT ARYVW P YPQY +P D+ L D
Sbjct 7 DYPGMIVPVGHVEPVPRRIRGYVAGRLVFDTVRARYVWLWPGYPQYCVPRDDIGEGALVD 66
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDEPI 122
E ++ G +R +L T AA + D S + G V F W+ + WFEEDE +
Sbjct 67 EGRSLTLKAGGARRQTLQLGSLTRPGAAWEWAEDAPSGIVGHVSFRWEAIDAWFEEDEQV 126
Query 123 YGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLE 182
+ HPR+PY R DALRS R VRVELDG VLAD + V++ ETG+PTRYY+D + + +E
Sbjct 127 FVHPRSPYTRVDALRSGRGVRVELDGTVLADAPNSVMVLETGLPTRYYLDRVYLDWTRME 186
Query 183 PTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVD 242
PT T T CPYKG TSGYW+VR G A + DLAW Y +P V+PIAGLVAFYNEKVDL +D
Sbjct 187 PTDTVTSCPYKGMTSGYWAVRTGTATYPDLAWAYDFPTRQVSPIAGLVAFYNEKVDLYLD 246
Query 243 GVALPRP 249
G LPRP
Sbjct 247 GRPLPRP 253
>gi|320006809|gb|ADW01659.1| protein of unknown function DUF427 [Streptomyces flavogriseus
ATCC 33331]
Length=256
Score = 246 bits (627), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 129/249 (52%), Positives = 156/249 (63%), Gaps = 2/249 (0%)
Query 2 SVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFL 61
S+ YP + G +EP PRRVR +G VFDT A YVWE P YPQ+ IP+ D+ L
Sbjct 5 SIQYPGLIVPAGHVEPVPRRVRATIGGSTVFDTRRALYVWEWPPYPQFSIPVEDLSEGVL 64
Query 62 RDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDE 120
D+ + GP+R H+L + AA V+ S + GTVRF W+ L WFEEDE
Sbjct 65 TDDGRTEERGAGPARRHTLTVGSEVREGAAWVWTDGAPSALLGTVRFEWEALDSWFEEDE 124
Query 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
P++ HPR+PY R DALRS +RVELDG VLA+ V LFETG+PTRYY+D +
Sbjct 125 PVFVHPRSPYSRVDALRSSSSIRVELDGAVLAEAPGCVKLFETGLPTRYYLDLTHVDRAR 184
Query 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
L + T T CPYKGTTS YWS GDA H D+AW Y +P V IAGLVAFYNE+VDL
Sbjct 185 LRRSDTVTRCPYKGTTSSYWSFD-GDATHEDIAWAYDFPTVHVDRIAGLVAFYNERVDLH 243
Query 241 VDGVALPRP 249
VDG LPRP
Sbjct 244 VDGTKLPRP 252
>gi|298524554|ref|ZP_07011963.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298494348|gb|EFI29642.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=160
Score = 245 bits (626), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 119/120 (99%), Positives = 120/120 (100%), Gaps = 0/120 (0%)
Query 135 ALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCPYKG 194
+LRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCPYKG
Sbjct 41 SLRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCPYKG 100
Query 195 TTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQFS 254
TTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQFS
Sbjct 101 TTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQFS 160
>gi|308371845|ref|ZP_07426438.2| hypothetical protein TMDG_03875 [Mycobacterium tuberculosis SUMu004]
gi|308335268|gb|EFP24119.1| hypothetical protein TMDG_03875 [Mycobacterium tuberculosis SUMu004]
Length=119
Score = 243 bits (619), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 118/119 (99%), Positives = 119/119 (100%), Gaps = 0/119 (0%)
Query 136 LRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCPYKGT 195
+RSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCPYKGT
Sbjct 1 MRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCPYKGT 60
Query 196 TSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQFS 254
TSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQFS
Sbjct 61 TSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQFS 119
>gi|302556748|ref|ZP_07309090.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
gi|302474366|gb|EFL37459.1| conserved hypothetical protein [Streptomyces griseoflavus Tu4000]
Length=259
Score = 238 bits (608), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 127/249 (52%), Positives = 155/249 (63%), Gaps = 1/249 (0%)
Query 2 SVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFL 61
SV +P + G +EP PRRVRG +G +VFDT A YVWE YPQ+ IPL D+ L
Sbjct 5 SVLHPGLIVPVGHVEPVPRRVRGTIGGRVVFDTRRALYVWERRAYPQFSIPLGDLAEGVL 64
Query 62 RDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDE 120
E ++ GP+R HSL AA V++ + GTVRF W L WFEEDE
Sbjct 65 TAEERVEQRGAGPARRHSLRVGPDVREGAAWVWEDGAPEALHGTVRFVWAALDSWFEEDE 124
Query 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
P++ HPR+PY R DALRS VRVELDG+VLAD V LFETG+PTRYY+D A +
Sbjct 125 PVFVHPRSPYARVDALRSSSGVRVELDGVVLADAPHCVKLFETGLPTRYYLDRAHVDLTR 184
Query 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
L + T T CPYKGTTSGYW+ H D+AW Y +P P+AG+VAF+NE+VDL
Sbjct 185 LRRSDTVTRCPYKGTTSGYWAFDGDAGTHEDIAWAYDFPTVQAHPVAGMVAFFNERVDLH 244
Query 241 VDGVALPRP 249
VDG LPRP
Sbjct 245 VDGSPLPRP 253
>gi|289767449|ref|ZP_06526827.1| conserved hypothetical protein [Streptomyces lividans TK24]
gi|289697648|gb|EFD65077.1| conserved hypothetical protein [Streptomyces lividans TK24]
Length=262
Score = 237 bits (605), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 123/249 (50%), Positives = 155/249 (63%), Gaps = 1/249 (0%)
Query 2 SVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFL 61
SV +P + G +EP PRR+RG +G + FDT A YVWE YPQ+ IP+ D+ L
Sbjct 5 SVLHPSLIVPIGHVEPVPRRIRGLVGGRVAFDTRRALYVWEWQAYPQFSIPVEDLVEGVL 64
Query 62 RDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDE 120
D+ H +++ GP+ H+L + AA V+ + TVRF W+ L WFEEDE
Sbjct 65 DDDKHTEQLGAGPAHRHTLRVGPEVRAGAAWVWGEGSPEALRDTVRFEWEALDAWFEEDE 124
Query 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
P++ HPR+PY R DALRS VRVE+DG+VLA+ V LFETG+PTRYY+DP DI +
Sbjct 125 PVFVHPRSPYSRVDALRSRSTVRVEVDGVVLAEASGCVKLFETGLPTRYYLDPMDIDWTR 184
Query 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
L + T T CPYKGTTS YWS H D+AWTY +P IAGL AFYNE VDL
Sbjct 185 LRHSDTVTRCPYKGTTSDYWSFDGETGAHEDIAWTYDFPTIHANRIAGLTAFYNEHVDLY 244
Query 241 VDGVALPRP 249
VDG LP+P
Sbjct 245 VDGFLLPKP 253
>gi|21225412|ref|NP_631191.1| hypothetical protein SCO7130 [Streptomyces coelicolor A3(2)]
gi|9885228|emb|CAC04236.1| conserved hypothetical protein [Streptomyces coelicolor A3(2)]
Length=262
Score = 236 bits (601), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 122/249 (49%), Positives = 155/249 (63%), Gaps = 1/249 (0%)
Query 2 SVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFL 61
SV +P + G +EP PRR+RG +G + FDT A YVWE YPQ+ IP+ D+ L
Sbjct 5 SVLHPSLIVPIGHVEPVPRRIRGLVGGRVAFDTRRALYVWEWQAYPQFSIPVEDLVEGVL 64
Query 62 RDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDE 120
D+ H +++ GP+ H+L + AA V+ + TVRF W+ L WFEEDE
Sbjct 65 DDDKHTEQLGAGPAHRHTLRVGPEVRAGAAWVWGEGSPEALRDTVRFEWEALDAWFEEDE 124
Query 121 PIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEH 180
P++ HPR+PY R DALRS VRVE+DG+VLA+ V LFETG+PTRYY+DP +I +
Sbjct 125 PVFVHPRSPYSRVDALRSRSTVRVEVDGVVLAEASGCVKLFETGLPTRYYLDPMNIDWTR 184
Query 181 LEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLT 240
L + T T CPYKGTTS YWS H D+AWTY +P IAGL AFYNE VDL
Sbjct 185 LRHSDTVTRCPYKGTTSDYWSFDGETGAHEDIAWTYDFPTIHANRIAGLTAFYNEHVDLY 244
Query 241 VDGVALPRP 249
VDG LP+P
Sbjct 245 VDGFLLPKP 253
>gi|269128733|ref|YP_003302103.1| hypothetical protein Tcur_4538 [Thermomonospora curvata DSM 43183]
gi|268313691|gb|ACZ00066.1| protein of unknown function DUF427 [Thermomonospora curvata DSM
43183]
Length=249
Score = 231 bits (588), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 126/247 (52%), Positives = 158/247 (64%), Gaps = 11/247 (4%)
Query 14 RIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFL--RDENHPQRVQ 71
RIEP+ +RVR YLG + D+ VWEVPYYP YY P+ DVR + L +E+ P
Sbjct 8 RIEPSAKRVRAYLGGEAIADSLRPFLVWEVPYYPTYYFPVEDVRTDLLVPEEESKPSPT- 66
Query 72 LGPSRLHSLVSAGQTHRSAARVFDVDGDSPVA---GTVRFNWDPLR-WFEEDEPIYGHPR 127
LG R+ ++ + T AA + DSPV G VR WD + WFEEDE ++ HPR
Sbjct 67 LGEGRVFTVKTEKATAPKAALRYP---DSPVEALRGLVRLEWDAMDGWFEEDEEVFTHPR 123
Query 128 NPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQ 187
+PY R D L S RHVRVE+DG+ +A++ SP LLFETG+PTRYY+ + + LEPT T
Sbjct 124 DPYHRVDVLASSRHVRVEVDGVTVAESSSPRLLFETGLPTRYYLPKPHVRTDLLEPTGTV 183
Query 188 TLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALP 247
T CPYKG YWSVR+GD + DLAW+Y PLP IAGL+AFYNEKVD+ VDGV
Sbjct 184 THCPYKGQAE-YWSVRIGDRTYPDLAWSYRSPLPESQKIAGLIAFYNEKVDIYVDGVKQE 242
Query 248 RPHTQFS 254
RP T F+
Sbjct 243 RPQTPFA 249
>gi|337768973|emb|CCB77686.1| conserved protein of unknown function [Streptomyces cattleya
NRRL 8057]
Length=261
Score = 228 bits (581), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 119/240 (50%), Positives = 153/240 (64%), Gaps = 1/240 (0%)
Query 5 YPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDE 64
+P M G +EP PRR+RG++ VFDT ARYVW P YPQY +P DV L DE
Sbjct 8 HPGMIVPVGHVEPVPRRIRGFVAGRPVFDTVRARYVWLWPGYPQYCVPYEDVADGALADE 67
Query 65 NHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDEPIY 123
+++G R H+L + AA + D S V G V F W+ + WFEEDE ++
Sbjct 68 GRDDNLEVGQGRRHTLRLGALSRPGAAWRWGDDAPSGVTGHVTFRWEAVDAWFEEDEEVF 127
Query 124 GHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEP 183
HPR+PY R DALRS R VRV LDG+VLAD S V++FETG+PTRYY+D + + L P
Sbjct 128 VHPRSPYTRVDALRSGRPVRVTLDGVVLADAPSSVMVFETGLPTRYYLDRVHLDWTRLHP 187
Query 184 TSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDG 243
T+T T CPYKG T+GYWSV + DLAW+Y +P ++P+AGLVAFYNE VD+ +DG
Sbjct 188 TATVTNCPYKGRTTGYWSVTTDRGTYPDLAWSYDFPTRQLSPVAGLVAFYNEHVDIDLDG 247
>gi|333920476|ref|YP_004494057.1| hypothetical protein AS9A_2810 [Amycolicicoccus subflavus DQS3-9A1]
gi|333482697|gb|AEF41257.1| hypothetical protein AS9A_2810 [Amycolicicoccus subflavus DQS3-9A1]
Length=251
Score = 214 bits (545), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 111/247 (45%), Positives = 151/247 (62%), Gaps = 8/247 (3%)
Query 9 AATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDENHPQ 68
+A+ R EP +R+R YL +V DT+ A YVWE P++P YY P D++ E + ++
Sbjct 5 SASAVRFEPCAKRIRAYLAGHVVVDTTRALYVWEWPHFPTYYFPTDDIQAELIELDDTAD 64
Query 69 RVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVA---GTVRFNWDPL-RWFEEDEPIYG 124
LG + L+ L G AAR + DSP+ G VR +W + W EEDEP+Y
Sbjct 65 PTNLGVAALYDLAVDGSVASRAARRY---VDSPLEELRGRVRLSWTAMDEWLEEDEPVYT 121
Query 125 HPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPT 184
H R+PY R D L S RH+++ LDG+V+AD+R +LFETG+P RYY+ DI + L +
Sbjct 122 HARDPYARIDILASSRHIQIMLDGVVVADSRHARILFETGLPPRYYLPLTDIRMDLLRRS 181
Query 185 STQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGV 244
T + CPYKG T+ YWSV +GD VHRDL W Y PLP +AGL +FY+E VD+ +DGV
Sbjct 182 DTTSQCPYKG-TANYWSVVIGDTVHRDLVWMYRAPLPESQKVAGLASFYSESVDVYLDGV 240
Query 245 ALPRPHT 251
RP T
Sbjct 241 LQKRPVT 247
>gi|111224144|ref|YP_714938.1| hypothetical protein FRAAL4754 [Frankia alni ACN14a]
gi|111151676|emb|CAJ63395.1| conserved hypothetical protein [Frankia alni ACN14a]
Length=245
Score = 203 bits (516), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 117/245 (48%), Positives = 155/245 (64%), Gaps = 14/245 (5%)
Query 7 QMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFL---RD 63
Q A R R+E +RVR YL LV DT++ VWE P+YP YY+P ADV E + R
Sbjct 4 QQARGRVRLEQGRKRVRAYLAGRLVVDTTSPALVWENPHYPAYYLPRADVVAELVPTART 63
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGT---VRFNWDPL-RWFEED 119
E+ P R G + + +V G+T +AA + SP+ G VR +W+ + RW EED
Sbjct 64 EHSPSR---GEAVYYDVVVEGRTAPAAAWAYP---QSPLEGLRDLVRLDWEAMDRWLEED 117
Query 120 EPIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFE 179
EP+Y HPR+PY R DAL S RHVRVE+DG+V+A++ PV+LFETG+ RYY+ D+ E
Sbjct 118 EPVYVHPRSPYTRIDALPSSRHVRVEIDGVVVAESHRPVVLFETGLVPRYYLPLVDVRQE 177
Query 180 HLEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDL 239
L P+ T+T CPYKG+ Y+SV V H D+ WTY PLP A I GLV FY+E+V +
Sbjct 178 LLRPSDTRTHCPYKGSAE-YFSVEVDGRRHDDVVWTYRTPLPESARITGLVCFYDERVTV 236
Query 240 TVDGV 244
+VDGV
Sbjct 237 SVDGV 241
Score = 36.6 bits (83), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 21/58 (37%), Positives = 32/58 (56%), Gaps = 0/58 (0%)
Query 5 YPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLR 62
+P+ TR P+ R VR + V+V ++ ++E P+YY+PL DVR E LR
Sbjct 123 HPRSPYTRIDALPSSRHVRVEIDGVVVAESHRPVVLFETGLVPRYYLPLVDVRQELLR 180
>gi|291435757|ref|ZP_06575147.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC
14672]
gi|291338652|gb|EFE65608.1| conserved hypothetical protein [Streptomyces ghanaensis ATCC
14672]
Length=252
Score = 201 bits (512), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 119/248 (48%), Positives = 149/248 (61%), Gaps = 13/248 (5%)
Query 16 EPAPRRVRGYLGHVLVFDTSAARYVWE--VPYYPQYYIPLADVRMEFLRDENHPQRVQ-L 72
EP+ R VR G V V D+ VWE +P PQY P +VR + LR +P +
Sbjct 4 EPSERWVRATAGGVTVVDSRHPLLVWEPRLPV-PQYAFPREEVRTDLLRPARNPLTGRHT 62
Query 73 GPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL------RWFEEDEPIYGHP 126
G + + L + G+ AA F D +AG + F W P RW+EE+E I+ HP
Sbjct 63 GSTVFYDLEAGGEVRPDAAWTFPADD---LAGHIAFEWFPRTGTGLDRWYEEEEEIFVHP 119
Query 127 RNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTST 186
R+P+ R DA+ S RHVRVE++G V+ADTRSPVLLFET +PTRYY+ D+ + E T
Sbjct 120 RDPHTRVDAVPSSRHVRVEIEGTVVADTRSPVLLFETSLPTRYYLPRQDVRLDLFEATDH 179
Query 187 QTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVAL 246
T CPYKGT YWS R G AV D+ W+Y PLPAVA I G +AF+NE VDLTVDG L
Sbjct 180 STRCPYKGTADQYWSWRGGGAVPPDIVWSYPDPLPAVAAIRGRLAFFNEAVDLTVDGERL 239
Query 247 PRPHTQFS 254
PRP T FS
Sbjct 240 PRPVTSFS 247
>gi|256390991|ref|YP_003112555.1| hypothetical protein Caci_1793 [Catenulispora acidiphila DSM
44928]
gi|256357217|gb|ACU70714.1| protein of unknown function DUF427 [Catenulispora acidiphila
DSM 44928]
Length=265
Score = 201 bits (512), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 108/248 (44%), Positives = 146/248 (59%), Gaps = 7/248 (2%)
Query 9 AATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFL---RDEN 65
A R ++E +RVR YL + LV DT VWE P+YP YY+P DV + E+
Sbjct 22 ARGRVKVETGAKRVRLYLENRLVADTLTPLLVWEKPFYPTYYVPAKDVLADLKPTGESEH 81
Query 66 HPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPIYGH 125
P R G +++H ++ AG T AR + VRF++D WFEEDEPIY H
Sbjct 82 SPSR---GDAQVHDVLLAGATAPGKARTVPESPLEELRDAVRFDFDAFDWFEEDEPIYTH 138
Query 126 PRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTS 185
PR+PY R D + S RH R ELDG++LAD+ + ++ FETG+P RYY+ + + L P+
Sbjct 139 PRDPYSRIDVVASSRHFRAELDGVLLADSPNSMIAFETGLPPRYYVPITALNQDILRPSE 198
Query 186 TQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVA 245
T T CPYKG + YWSV++G+ V D+ W Y P V IAGL A YNEKVD+ +DGV
Sbjct 199 TVTHCPYKGAAT-YWSVQIGEEVRDDIIWGYRTPFAEVQKIAGLAAVYNEKVDIFLDGVL 257
Query 246 LPRPHTQF 253
RP ++
Sbjct 258 QERPKPRY 265
>gi|288916063|ref|ZP_06410444.1| protein of unknown function DUF427 [Frankia sp. EUN1f]
gi|288352459|gb|EFC86655.1| protein of unknown function DUF427 [Frankia sp. EUN1f]
Length=268
Score = 201 bits (511), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 114/244 (47%), Positives = 146/244 (60%), Gaps = 10/244 (4%)
Query 6 PQMAATRGRI--EPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
P RGR+ E A +RVR LG +V DT VWE P+YP YY+P DVR +
Sbjct 26 PATTIARGRVHAEQANKRVRALLGGHVVVDTIRPVLVWEGPHYPVYYLPAEDVRATLEPN 85
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVA---GTVRFNWDPL-RWFEED 119
+ G + H +V G+T AA + DSP+ G VR +WD + W EED
Sbjct 86 GKIARSPSRGDAVRHDVVIGGRTAPDAAGTYP---DSPIPQLRGLVRLDWDAMDEWLEED 142
Query 120 EPIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFE 179
E +YGH RNPY R D L S R VRVE+DG+ +A++ PV+LFE+GI RYY+ D+ E
Sbjct 143 EVVYGHARNPYHRIDILSSSRQVRVEIDGVTVAESTRPVVLFESGIRPRYYVPLTDVRTE 202
Query 180 HLEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDL 239
L P+ + T CPYKG T+GY+SV+V D VH D+ W Y PLP IAGLV FY+EKVD+
Sbjct 203 LLVPSESSTHCPYKG-TAGYFSVQVNDKVHEDVVWIYRTPLPESIRIAGLVCFYDEKVDV 261
Query 240 TVDG 243
VDG
Sbjct 262 YVDG 265
>gi|312197395|ref|YP_004017456.1| hypothetical protein FraEuI1c_3579 [Frankia sp. EuI1c]
gi|311228731|gb|ADP81586.1| protein of unknown function DUF427 [Frankia sp. EuI1c]
Length=273
Score = 195 bits (495), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 111/244 (46%), Positives = 141/244 (58%), Gaps = 10/244 (4%)
Query 6 PQMAATRGRI--EPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
P A RGR+ E A +RVR L +V DT VWE P+YP YY+P DVR +
Sbjct 31 PSAANARGRVHAEQAHKRVRALLAGHVVVDTIRPVLVWEGPHYPVYYVPAEDVRAALEPN 90
Query 64 ENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPV---AGTVRFNWDPL-RWFEED 119
+ G + H +V G AA + DSPV G VR +WD + W EED
Sbjct 91 GKTVRSPSRGDAARHDVVIGGHRAEDAAGTYP---DSPVPEFQGLVRLDWDAMDTWLEED 147
Query 120 EPIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFE 179
E +YGH RNPY R D + S RHV VE+ G+ +AD+ PV+LFETG+ RYY+ D+ E
Sbjct 148 EIVYGHARNPYHRVDVMASSRHVTVEIGGVTVADSVRPVVLFETGLRPRYYLPLTDVKTE 207
Query 180 HLEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDL 239
L P+ + T CPYKG T+GY+SV V VH D+ W Y PLP +AGLV FY+EKVD+
Sbjct 208 LLRPSDSATHCPYKG-TAGYFSVEVDGRVHEDVVWIYRTPLPESIKVAGLVCFYDEKVDV 266
Query 240 TVDG 243
VDG
Sbjct 267 YVDG 270
>gi|108803134|ref|YP_643071.1| hypothetical protein Rxyl_0283 [Rubrobacter xylanophilus DSM
9941]
gi|108764377|gb|ABG03259.1| protein of unknown function DUF427 [Rubrobacter xylanophilus
DSM 9941]
Length=274
Score = 194 bits (492), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 111/255 (44%), Positives = 151/255 (60%), Gaps = 8/255 (3%)
Query 7 QMAATRGRI---EPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRD 63
++ A RG + E +PRRVR LG V D+ + + E P YY P DVR E L
Sbjct 20 EVKAPRGHVLYFEDSPRRVRVELGGETVADSRRMKLLHETGLLPVYYFPEEDVRTELLER 79
Query 64 ENHPQRVQL-GPSRLHSLVSAGQTHRSAARVFD--VDGDSPVAGTVRFNWDPL-RWFEED 119
+H R G + ++ + G+T +AA + ++G P+ G + F WD + RWFEED
Sbjct 80 TDHTTRCPFKGEAVYWTVRAGGRTAENAAWAYPEPLEGAPPLGGHIAFYWDRMDRWFEED 139
Query 120 EPIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFE 179
E + HPR+PY R DAL S RHVRV ++G ++A+TR PV+LFETG+P RYYI D+ E
Sbjct 140 EEVDVHPRDPYHRIDALPSSRHVRVTVNGELVAETRRPVILFETGLPPRYYIPREDVREE 199
Query 180 HLEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDL 239
L P+ + ++CPYKG S YWSVR G DL W+Y P + GL+ F+NE+VDL
Sbjct 200 LLVPSESSSVCPYKGVAS-YWSVRAGGETVEDLVWSYPEPRRDAERVGGLLCFFNERVDL 258
Query 240 TVDGVALPRPHTQFS 254
VDG RP TQ+S
Sbjct 259 EVDGERQERPETQWS 273
>gi|302555590|ref|ZP_07307932.1| conserved hypothetical protein [Streptomyces viridochromogenes
DSM 40736]
gi|302473208|gb|EFL36301.1| conserved hypothetical protein [Streptomyces viridochromogenes
DSM 40736]
Length=271
Score = 188 bits (477), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 115/248 (47%), Positives = 147/248 (60%), Gaps = 14/248 (5%)
Query 16 EPAPRRVRGYLGHVLVFDTSAARYVWEVPYYP--QYYIPLADVRMEFLR-DENHPQRVQL 72
EP+ R VRG G V V D+ VWE P+ P QY P ADVR + LR +N P
Sbjct 23 EPSERWVRGRKGDVTVVDSRRPVLVWE-PHLPVPQYVFPDADVRTDLLRPAKNPPTGTHT 81
Query 73 GPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL------RWFEEDEPIYGHP 126
G + L + G+ +AA F D +AG + F W P W+EEDE I+ HP
Sbjct 82 GSRTFYDLDADGEVRANAAFRFPADD---LAGHLAFEWFPRTDTGLDHWYEEDEEIFIHP 138
Query 127 RNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTST 186
R+P++R DAL S RHVRVE+DG ++ADT +PVLLFET +PTRYYI D+ + + T
Sbjct 139 RDPHKRVDALPSSRHVRVEIDGRLVADTHAPVLLFETSLPTRYYIPREDVRLDFFDATDH 198
Query 187 QTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVAL 246
T CPYKGT YWS R V ++ W+Y PLPAVA + G +AF+NE VD+T+DG L
Sbjct 199 STGCPYKGTAE-YWSWRGEGDVPPNIVWSYPDPLPAVAAVQGRLAFFNEVVDITLDGERL 257
Query 247 PRPHTQFS 254
RP T FS
Sbjct 258 ERPATPFS 265
>gi|300784890|ref|YP_003765181.1| hypothetical protein AMED_2986 [Amycolatopsis mediterranei U32]
gi|299794404|gb|ADJ44779.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340526320|gb|AEK41525.1| hypothetical protein RAM_15185 [Amycolatopsis mediterranei S699]
Length=236
Score = 186 bits (472), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 112/243 (47%), Positives = 139/243 (58%), Gaps = 20/243 (8%)
Query 20 RRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDENHPQRVQLGPSRLHS 79
+RVR +LG +V DT VWEVPYYP YYIP ADV L R PSR +
Sbjct 6 KRVRAFLGGQVVADTVHPLLVWEVPYYPTYYIPRADVVSGVLTPSG---RTSHSPSRGEA 62
Query 80 LVSAGQTHRSAARVFDVDG-----DSPVA---GTVRFNWDPLRWFEEDEPIYGHPRNPYQ 131
++S + + A VDG DSP+ G VRF + WFEEDE I+ HPR+P
Sbjct 63 VLSTIKGAGAEA----VDGALEYPDSPIEELRGHVRFEFGAFDWFEEDEQIFTHPRDPGV 118
Query 132 RADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCP 191
R D L S RHVR+E+DG+ +ADT P LLFETG+PTRYY+ D+ + LE T CP
Sbjct 119 RVDILPSSRHVRIEVDGVTVADTVRPHLLFETGLPTRYYLPRVDVRMDLLEKIDVVTHCP 178
Query 192 YKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHT 251
YKG + H DLAW+Y PLP +AGLVAF +EKVD+ VD V RP T
Sbjct 179 YKGAAEHFDVTG-----HEDLAWSYPTPLPESTRVAGLVAFLDEKVDVYVDDVRQERPKT 233
Query 252 QFS 254
+F+
Sbjct 234 KFA 236
>gi|297197885|ref|ZP_06915282.1| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
gi|297146904|gb|EDY61651.2| conserved hypothetical protein [Streptomyces sviceus ATCC 29083]
Length=268
Score = 185 bits (470), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 115/245 (47%), Positives = 147/245 (60%), Gaps = 12/245 (4%)
Query 16 EPAPRRVRGYLGHVLVFDTSAARYVWE--VPYYPQYYIPLADVRMEFLRDENHPQR-VQL 72
EP+ R VRG G V V D+ VWE VP P Y P ADVR + LR +P
Sbjct 26 EPSERWVRGRKGDVTVVDSRRPVLVWEPDVPV-PLYAFPRADVREDLLRPAKNPATGTHT 84
Query 73 GPSRLHSLVSAGQTHRSAARVF---DVDGDSPVAGTVRFNWDPLRWFEEDEPIYGHPRNP 129
G + L G+ +AA F D+ A R+ W+EE+E I+ HPR+P
Sbjct 85 GSQVFYDLEVDGELVENAAWTFPAADLADHIAFAWFRRWGTGLDHWYEEEEEIFVHPRDP 144
Query 130 YQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTL 189
++R DA+ S RHV+VE+DG V+ADTR PVLLFETG+PTRYYI D+ + L+ T T
Sbjct 145 HKRVDAMPSSRHVQVEIDGTVVADTRRPVLLFETGLPTRYYIPREDVRLDLLDATDHHTA 204
Query 190 CPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRP 249
CPYKG T+GYWS VGD H ++ W+Y PLPAV + GL+AF+NE VD+TVDG L RP
Sbjct 205 CPYKG-TAGYWS--VGD--HANIVWSYPDPLPAVGAVKGLLAFFNEAVDITVDGERLERP 259
Query 250 HTQFS 254
T F+
Sbjct 260 VTPFT 264
>gi|336180036|ref|YP_004585411.1| hypothetical protein FsymDg_4227 [Frankia symbiont of Datisca
glomerata]
gi|334861016|gb|AEH11490.1| protein of unknown function DUF427 [Frankia symbiont of Datisca
glomerata]
Length=279
Score = 180 bits (457), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 104/246 (43%), Positives = 145/246 (59%), Gaps = 6/246 (2%)
Query 14 RIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDENHPQRV-QL 72
R EP RR+R + ++ D+ YV+E + P YY P ADVR + L +H R +
Sbjct 35 RTEPNGRRIRVFFNGQVIADSIRTLYVFETGHLPVYYFPRADVRFDLLTPTDHHTRCPRK 94
Query 73 GPSRLHSLVSAGQTHRSAARVFD--VDGDSPVAGTVRFNWDPLR-WFEEDEPIYGHPRNP 129
G + ++ ++ +A + + + +A V F WD W+EEDE ++ HPR+P
Sbjct 95 GDASYFTITVGDRSAENAVWAYPDPIPDVAELADHVAFYWDSADAWYEEDEEVFVHPRDP 154
Query 130 YQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTL 189
Y+R DAL S RHV V + G +LADT P LLFETG+P RYY+ D+ ++ L P T+T
Sbjct 155 YKRVDALPSSRHVEVRVGGELLADTHHPTLLFETGLPIRYYLPKLDVRWDRLTPAPTRTR 214
Query 190 CPYKGTTSGYWSVRVGDAVH-RDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPR 248
CPYKG + YWS D D+AW+Y +P + IAGLVAF+NE+VDLTVDGV PR
Sbjct 215 CPYKG-EARYWSYEGPDGTRIDDIAWSYAESVPEIPKIAGLVAFFNERVDLTVDGVRQPR 273
Query 249 PHTQFS 254
P T +S
Sbjct 274 PGTPWS 279
>gi|297189910|ref|ZP_06907308.1| conserved hypothetical protein [Streptomyces pristinaespiralis
ATCC 25486]
gi|297150307|gb|EDY62475.2| conserved hypothetical protein [Streptomyces pristinaespiralis
ATCC 25486]
Length=276
Score = 176 bits (446), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 110/246 (45%), Positives = 138/246 (57%), Gaps = 10/246 (4%)
Query 16 EPAPRRVRGYLGHVLVFDTSAARYVWEVPY-YPQYYIPLADVRMEFLRDENHPQ--RVQL 72
EP+ R VR G V V D+ VWE P Y P DVRM+ LR P R
Sbjct 27 EPSERWVRAMKGEVKVVDSRRPVLVWEPGRPVPLYAFPADDVRMDLLRATARPANPRRHA 86
Query 73 GPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNW---DPL-RWFEEDEPIYGHPRN 128
G + + LV A T +AA + +A V F W D L W+EEDE I+ HPR+
Sbjct 87 GATLFYDLVLADGTVPAAAWTY---PGEELADHVSFEWFGRDVLDHWYEEDEEIFVHPRD 143
Query 129 PYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQT 188
P++R DAL S RHV+VE++G V+ADTR+PVLLFET +P RYY D+ + PT + T
Sbjct 144 PHKRVDALPSSRHVQVEIEGTVVADTRTPVLLFETDLPVRYYFPREDVRLDLFTPTGSHT 203
Query 189 LCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPR 248
CPYKG + YWS V D+AW+Y PLP+V I VAFYNE VD+ VDG R
Sbjct 204 RCPYKGVATDYWSWAGSGDVRPDIAWSYPDPLPSVGIIKDRVAFYNESVDIVVDGERQQR 263
Query 249 PHTQFS 254
P + FS
Sbjct 264 PVSFFS 269
>gi|302675254|ref|XP_003027311.1| hypothetical protein SCHCODRAFT_61269 [Schizophyllum commune
H4-8]
gi|300100997|gb|EFI92408.1| hypothetical protein SCHCODRAFT_61269 [Schizophyllum commune
H4-8]
Length=239
Score = 174 bits (442), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 91/242 (38%), Positives = 144/242 (60%), Gaps = 9/242 (3%)
Query 15 IEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDENHPQRVQLGP 74
+E P+R+R + + DT A+ VWE P YP Y+ P ++ +L + ++L P
Sbjct 5 MEDCPKRIRVVFEGIYIIDTKRAKLVWEKPQYPTYFFPNNELPAWYLHN------MRLIP 58
Query 75 SRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDEPIYGHPRNPYQRA 133
++AG + SP+ G + +++ + WFEEDE I+ HP++PY+R
Sbjct 59 DGALYDIAAGHKRAPNGLTKYSNPISPLEGFFKLDFNAMDAWFEEDEEIFVHPKDPYKRV 118
Query 134 DALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCPYK 193
D L+S RHVR+E++G+++A+TR+P +L+ET +P R YI D E L P+ T + CPYK
Sbjct 119 DVLQSSRHVRIEINGLMVAETRAPRMLYETTLPPRTYIPQTDCQVELLVPSETTSRCPYK 178
Query 194 GTTSGYWSVRVGDA-VHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRPHTQ 252
G + YW+V++ + + +D+AW+Y YP + I G V FY+EKVD+ VDG RP TQ
Sbjct 179 G-EARYWNVQLLNGEIIKDIAWSYRYPTLESSSIRGYVCFYDEKVDMWVDGEKQARPATQ 237
Query 253 FS 254
F+
Sbjct 238 FA 239
>gi|330925525|ref|XP_003301086.1| hypothetical protein PTT_12502 [Pyrenophora teres f. teres 0-1]
gi|311324444|gb|EFQ90817.1| hypothetical protein PTT_12502 [Pyrenophora teres f. teres 0-1]
Length=252
Score = 173 bits (439), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 96/232 (42%), Positives = 136/232 (59%), Gaps = 7/232 (3%)
Query 14 RIEPAPRRVRGYLGHVLVFDTSAARYVWEV-PYYPQYYIPLADVRMEFLRDENHPQRVQL 72
+ E RRVR FDT+ A +VWE P YPQ+Y+PL+ + + P
Sbjct 21 KFEHTSRRVRALFNGKYAFDTTKAYHVWEYEPRYPQFYVPLSSFTRDAEICKAAPVDGTD 80
Query 73 GPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDEPIYGHPRNPYQ 131
G + L L +RS+ RV + + V+ ++ + +WFEED PIY HP++PY+
Sbjct 81 GGAHLAKLTVG---NRSSNRVI-IFNTGVLRDFVKVDFGAVDQWFEEDMPIYCHPKDPYK 136
Query 132 RADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLCP 191
R D L S R V+V +DG+ LA++ + + L ET +PTRYY+ P + +E L P+ T TLCP
Sbjct 137 RIDILPSTRCVKVAIDGVTLAESSNALFLLETTLPTRYYVPPTSVNWECLTPSDTATLCP 196
Query 192 YKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDG 243
YKG + Y+++ V V+RDL W Y YP APIAG + FYNEKVD+ VDG
Sbjct 197 YKG-KANYYNITVNGRVYRDLVWHYRYPTTESAPIAGHLCFYNEKVDIWVDG 247
>gi|336363527|gb|EGN91912.1| hypothetical protein SERLA73DRAFT_191826 [Serpula lacrymans var.
lacrymans S7.3]
gi|336383303|gb|EGO24452.1| hypothetical protein SERLADRAFT_467794 [Serpula lacrymans var.
lacrymans S7.9]
Length=243
Score = 171 bits (433), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 96/235 (41%), Positives = 138/235 (59%), Gaps = 7/235 (2%)
Query 14 RIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDENHPQRVQLG 73
RIEP P+R+R + DT A+ VWE YYP YY P++D+ +LR E + G
Sbjct 9 RIEPCPKRIRVLFHGKYIVDTLNAKLVWEHAYYPSYYFPVSDLSPTYLR-ETQAATGEEG 67
Query 74 PSRLHSLVSAGQTHRSAARVFDVDGDSP--VAGTVRFNWDPL-RWFEEDEPIYGHPRNPY 130
+++ LV + ++A + F G +AG ++ + W EEDE IY HP++PY
Sbjct 68 -VKIYDLVVGDRHAKAAVKEFTGKGSGTEDLAGLLKVAFSVADAWLEEDEQIYVHPKDPY 126
Query 131 QRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLC 190
+R D L+S RHVRVE++G+ +A+T P LLFET + R YI D+ + L P+ T T C
Sbjct 127 KRVDVLQSSRHVRVEINGVEVANTHKPRLLFETLLRVRTYIPLTDVRVDLLRPSDTTTQC 186
Query 191 PYKGTTSGYWSVRVGDA-VHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGV 244
PYKG + Y++V + + VHRD+ W Y P I G +AFY+EKVD+ VDGV
Sbjct 187 PYKG-VANYYNVELPNGEVHRDVVWYYRTAQPECGQITGFLAFYDEKVDVWVDGV 240
>gi|189207523|ref|XP_001940095.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187976188|gb|EDU42814.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length=253
Score = 170 bits (431), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 97/234 (42%), Positives = 136/234 (59%), Gaps = 10/234 (4%)
Query 14 RIEPAPRRVRGYLGHVLVFDTSAARYVWEV-PYYPQYYIPLADVRMEFLRDENHPQRVQL 72
+ E PRRVR FDT+ A +VWE P YPQ+YIPL+ F R+ + +
Sbjct 21 KYEHTPRRVRALFNGKYAFDTTKAYHVWEYEPRYPQFYIPLS----SFTREASISKATTP 76
Query 73 GPSRLHS--LVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-RWFEEDEPIYGHPRNP 129
P L + +RS RV + ++ V+ ++ + +WFEED PIY HP++P
Sbjct 77 IPDTNSGAHLATLTIGNRSTNRVI-IFTTGVLSDLVKIDFRAVDQWFEEDMPIYCHPKDP 135
Query 130 YQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTL 189
Y+R D L S R V+V +DG+ LA+ + + L ET +PTRYY+ P + +E+L + T+TL
Sbjct 136 YKRIDILPSTRSVKVAVDGVTLAECSNALFLMETTLPTRYYVPPTSVNWEYLTASGTETL 195
Query 190 CPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDG 243
CPYKG + Y+ V V V+RDL W Y YP APIAG + FYNE VD+ VDG
Sbjct 196 CPYKG-KAEYYDVDVKGRVYRDLVWYYRYPTTESAPIAGHLCFYNEMVDIWVDG 248
>gi|292490297|ref|YP_003525736.1| hypothetical protein Nhal_0132 [Nitrosococcus halophilus Nc4]
gi|291578892|gb|ADE13349.1| protein of unknown function DUF427 [Nitrosococcus halophilus
Nc4]
Length=261
Score = 170 bits (430), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 98/242 (41%), Positives = 135/242 (56%), Gaps = 6/242 (2%)
Query 7 QMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDENH 66
Q A R + P P+RVR + D++ + E P YY P DVRME+L+ +H
Sbjct 14 QGPAHRVEVVPIPKRVRVLFNQETIVDSTQVLLLRETYLPPVYYFPPQDVRMEWLQRTDH 73
Query 67 PQRVQL-GPSRLHSLVSAGQTHRSAARVFD--VDGDSPVAGTVRFNWDPL-RWFEEDEPI 122
R G + S+ ++ + A + ++ P+ + F WD + W+EEDEP+
Sbjct 74 SSRCPFKGEAAYWSVTVRERSAENGAWSYPEPLEQVVPIKNHIAFYWDKMDAWYEEDEPV 133
Query 123 YGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLE 182
+ HP +PY R D S R VRV L G V+A+TR LFETG+PTRYYI D+ + LE
Sbjct 134 FVHPCDPYVRIDVRESFRPVRVVLGGKVVAETRRARFLFETGLPTRYYIPQEDVQMDWLE 193
Query 183 PTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVD-LTV 241
P+ T T CPYKG S YWSVR+GD +DL W+Y PLP + + +AFY EKV+ V
Sbjct 194 PSETHTACPYKGKAS-YWSVRIGDQYFKDLVWSYPDPLPEASQVKNYLAFYQEKVEAFYV 252
Query 242 DG 243
DG
Sbjct 253 DG 254
>gi|115387671|ref|XP_001211341.1| predicted protein [Aspergillus terreus NIH2624]
gi|114195425|gb|EAU37125.1| predicted protein [Aspergillus terreus NIH2624]
Length=248
Score = 169 bits (428), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 105/256 (42%), Positives = 141/256 (56%), Gaps = 10/256 (3%)
Query 1 MSVDYPQMAATRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEF 60
MS+ +P G E RRVR ++ D+ + VWE PYYP YY P+ D+ + +
Sbjct 1 MSIPFPYA----GYSEDVARRVRVVFNGEMIVDSHTPKLVWEHPYYPVYYFPIKDITISY 56
Query 61 LRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLR-WFEED 119
+N G ++ LV +T S A V V P+ + +D W EED
Sbjct 57 DCLQNE-TIASDGDEAIYDLVIGHRT--SPAAVTRVLKAGPLMDHYKIGFDKADLWLEED 113
Query 120 EPIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFE 179
E + GHPR+PY+R L+S +HVRVE+DG+V+ADT P LL+ETG+P R YI AD+ +E
Sbjct 114 ERMLGHPRDPYKRIQILQSSKHVRVEIDGVVVADTTRPKLLYETGLPVRKYIPFADVKWE 173
Query 180 HL-EPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVD 238
L + T CPYKG S Y+ VR+ LAW Y PLP I G VAFY+EKVD
Sbjct 174 LLRDDVGRSTSCPYKGDAS-YYIVRLPSGEKTGLAWWYKTPLPESTEIRGHVAFYDEKVD 232
Query 239 LTVDGVALPRPHTQFS 254
+ VDG +P T+FS
Sbjct 233 VWVDGKKQEKPATKFS 248
>gi|134098508|ref|YP_001104169.1| hypothetical protein SACE_1933 [Saccharopolyspora erythraea NRRL
2338]
gi|291003276|ref|ZP_06561249.1| hypothetical protein SeryN2_01974 [Saccharopolyspora erythraea
NRRL 2338]
gi|133911131|emb|CAM01244.1| protein of unknown function DUF427 [Saccharopolyspora erythraea
NRRL 2338]
Length=264
Score = 169 bits (428), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 103/260 (40%), Positives = 146/260 (57%), Gaps = 7/260 (2%)
Query 1 MSVDYPQMAATRGRIEPAP-----RRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLAD 55
+ +D +++ + G+ + P +RVR YL LV DT VWE +YP YY+P D
Sbjct 3 LELDVSEVSMSSGQSQDVPTEVSHKRVRAYLRGGLVADTRRPVLVWEHQHYPTYYLPAED 62
Query 56 VRMEFLRDENHPQRVQLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL-R 114
V + +LG ++ + + +AA + + G VR W+ +
Sbjct 63 VLARLEPTGATRRSGRLGDGTVYDVRAGEAVAEAAAIGYPESPVPELRGLVRIAWEAMDH 122
Query 115 WFEEDEPIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPA 174
WFEEDEP+Y HPR+P++R D L S RHV V + +V+AD+ P +LFETG+P RYY+
Sbjct 123 WFEEDEPVYVHPRDPHKRVDVLASSRHVVVRIGDVVVADSHRPHILFETGLPPRYYLPIT 182
Query 175 DIAFEHLEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLVAFYN 234
D+ + L P+ +T CPYKGT S YW V +GD H + W+Y PLP IAGL FY+
Sbjct 183 DVRIDLLRPSDHRTQCPYKGTAS-YWDVVIGDTEHAGIVWSYPVPLPESQKIAGLACFYD 241
Query 235 EKVDLTVDGVALPRPHTQFS 254
E+VD+TVDG RP T FS
Sbjct 242 ERVDITVDGEPQQRPRTPFS 261
>gi|331698863|ref|YP_004335102.1| hypothetical protein Psed_5112 [Pseudonocardia dioxanivorans
CB1190]
gi|326953552|gb|AEA27249.1| protein of unknown function DUF427 [Pseudonocardia dioxanivorans
CB1190]
Length=274
Score = 168 bits (425), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 107/264 (41%), Positives = 144/264 (55%), Gaps = 29/264 (10%)
Query 14 RIEPAPRRVRGYLGHVLVFDTSAARYVWEVP-YYPQYYIPLADVRMEFLRDENH------ 66
R EP RRVR + G L+ D+S A VWE PQY +P+ DV D +
Sbjct 17 RHEPIARRVRAWSGGTLLLDSSRAALVWEPGRVVPQYAVPVDDVVATLTPDPGYAGRRDG 76
Query 67 PQRVQLGPSRLHSLV--SAGQTHRSAARVFDVD-GDSPVAGTVRFNWDPL---------- 113
P V +GP+ L + H + V + G +P+ G DP
Sbjct 77 PGAVPVGPAGAQVLTPETGFGVHSTPGAVLTMSTGAAPLRGAAFRPEDPDLAGHVVVDFA 136
Query 114 ---RWFEEDEPIYGHPRNPYQRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYY 170
W+EEDEP+ GHPR+PY R DA RS RHVR+ DG++LA++R+P +FET +P R+Y
Sbjct 137 GPDTWWEEDEPVVGHPRDPYHRVDARRSSRHVRISADGVLLAESRTPTAVFETNLPVRHY 196
Query 171 IDPADIAFEHLEPTSTQTLCPYKGTTSGYWSVRVGDAVHRDLAWTYHYPLPAVAPIAGLV 230
+ AD+ + L P+ T T C YKG S Y S D+AWTY PLP P+AGLV
Sbjct 197 LPRADLVAD-LAPSDTVTTCAYKGVAS-YLSA----GGLPDVAWTYPQPLPDATPLAGLV 250
Query 231 AFYNEKVDLTVDGVALPRPHTQFS 254
AF++E+VD+ +DGVAL RP T +S
Sbjct 251 AFFDERVDVEIDGVALARPRTPWS 274
>gi|242205892|ref|XP_002468803.1| predicted protein [Postia placenta Mad-698-R]
gi|220732188|gb|EED86026.1| predicted protein [Postia placenta Mad-698-R]
Length=242
Score = 167 bits (424), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 95/245 (39%), Positives = 136/245 (56%), Gaps = 9/245 (3%)
Query 11 TRGRIEPAPRRVRGYLGHVLVFDTSAARYVWEVPYYPQYYIPLADVRMEFLRDENHPQRV 70
T+ IE PRRVR + DT A+ VW P YP ++ ADV ++L + +
Sbjct 6 TQPHIETLPRRVRVLFAGQYIVDTKKAKLVWLKPNYPTFFFDSADVPQKYLSQRSTSDEL 65
Query 71 QLGPSRLHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPLRWFEEDEPIYGHPRNPY 130
Q + +V + +AA + + GD T+ F+ WFEEDE ++ HP++PY
Sbjct 66 QQ-----YDIVVGSRKAEAAATEY-LGGDLKGLITIAFS-SMDAWFEEDEQVFVHPKDPY 118
Query 131 QRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLC 190
+R D L+S RHVRVE++G+ LA+T P LLFETG+P R YI D + L+P+ T C
Sbjct 119 KRVDVLQSSRHVRVEVNGVELANTTKPRLLFETGLPVRTYIPKTDCRVDLLKPSQLTTEC 178
Query 191 PYKGTTSGYWSVRVGDA-VHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDGVALPRP 249
PYKG + Y++V + ++ W Y P P I G VAFY+EKVD+ VDG PRP
Sbjct 179 PYKG-IANYYNVSISSGETFENIVWWYRVPQPECVDIKGFVAFYDEKVDVWVDGELQPRP 237
Query 250 HTQFS 254
+ +S
Sbjct 238 RSPWS 242
>gi|330465367|ref|YP_004403110.1| hypothetical protein VAB18032_06930 [Verrucosispora maris AB-18-032]
gi|328808338|gb|AEB42510.1| hypothetical protein VAB18032_06930 [Verrucosispora maris AB-18-032]
Length=260
Score = 166 bits (421), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 113/243 (47%), Positives = 139/243 (58%), Gaps = 17/243 (6%)
Query 20 RRVRGYLGHVLVFDTSAARYVWE--VPYYPQYYIPLADVRMEFLRD-ENHPQRVQLGPSR 76
R VRG +G +V D+ VWE +P P Y PLAD+ LR E P+ S
Sbjct 22 RWVRGRIGDTVVVDSRRPLLVWEPGLPV-PFYVFPLADLVGGTLRPAEQPPEPGSRAGSS 80
Query 77 LHSLVSAGQTHRSAARVFDVDGDSPVAGTVRFNWDPL------RWFEEDEPIYGHPRNPY 130
H L G T +AA + D A TV W RW+EEDE ++ HPR+P+
Sbjct 81 FHDLTVDGVTLPNAAWTYPGDV---FAQTVCLAWREWFGQGVERWYEEDEEVFVHPRDPF 137
Query 131 QRADALRSHRHVRVELDGIVLADTRSPVLLFETGIPTRYYIDPADIAFEHLEPTSTQTLC 190
R D+L S RHV V +G+VLADTR PVLLFETG+PTRYYI D+ E L P+ T C
Sbjct 138 SRVDSLPSTRHVVVAHEGVVLADTRRPVLLFETGLPTRYYIPADDLVQELLLPSEHHTRC 197
Query 191 PYKGTTSGYWSVR-VGDAVHRDLAWTYHYPLPAVAPIAGLVAFYNEKVDLTVDG--VALP 247
PYKG S YWS+R V A R++AW Y PLP+VA IAG AFY E+V + VDG V+ P
Sbjct 198 PYKGVAS-YWSLRQVPGAAGRNIAWYYPDPLPSVANIAGFTAFYPERVTILVDGESVSPP 256
Query 248 RPH 250
PH
Sbjct 257 TPH 259
Lambda K H
0.322 0.139 0.443
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 371539772520
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40