BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3395A
Length=208
Score E
Sequences producing significant alignments: (Bits) Value
gi|148824602|ref|YP_001289357.1| hypothetical protein TBFG_13430... 422 1e-116
gi|289555671|ref|ZP_06444881.1| membrane protein [Mycobacterium ... 421 3e-116
gi|15842989|ref|NP_338026.1| hypothetical protein MT3503 [Mycoba... 421 5e-116
gi|31794576|ref|NP_857069.1| hypothetical protein Mb3428 [Mycoba... 419 1e-115
gi|289449093|ref|ZP_06438837.1| membrane protein [Mycobacterium ... 417 3e-115
gi|289747222|ref|ZP_06506600.1| conserved hypothetical protein [... 296 1e-78
gi|289763577|ref|ZP_06522955.1| hypothetical membrane protein [M... 280 1e-73
gi|240167852|ref|ZP_04746511.1| hypothetical protein MkanA1_0095... 236 2e-60
gi|183981171|ref|YP_001849462.1| hypothetical protein MMAR_1149 ... 231 5e-59
gi|118616682|ref|YP_905014.1| hypothetical protein MUL_0914 [Myc... 229 1e-58
gi|289571632|ref|ZP_06451859.1| membrane protein [Mycobacterium ... 229 2e-58
gi|302538436|ref|ZP_07290778.1| conserved hypothetical protein [... 140 1e-31
gi|328887122|emb|CCA60361.1| hypothetical protein SVEN_7075 [Str... 139 2e-31
gi|169631750|ref|YP_001705399.1| hypothetical protein MAB_4677c ... 134 7e-30
gi|37595048|gb|AAQ94239.1| unknown [Saccharopolyspora erythraea] 123 1e-26
gi|134100681|ref|YP_001106342.1| hypothetical protein SACE_4148 ... 122 3e-26
gi|302520204|ref|ZP_07272546.1| conserved hypothetical protein [... 110 2e-22
gi|318061702|ref|ZP_07980423.1| hypothetical protein SSA3_27438 ... 107 1e-21
gi|333026046|ref|ZP_08454110.1| hypothetical protein STTU_3550 [... 106 3e-21
gi|318079665|ref|ZP_07986997.1| hypothetical protein SSA3_23986 ... 101 8e-20
gi|111219823|ref|YP_710617.1| putative branched chain amino acid... 38.9 0.44
gi|312219773|emb|CBX99715.1| similar to fatty acid synthase beta... 35.8 4.5
gi|189204113|ref|XP_001938392.1| conserved hypothetical protein ... 35.0 6.2
gi|326446723|ref|ZP_08221457.1| hypothetical protein SclaA2_3690... 35.0 7.0
gi|227504004|ref|ZP_03934053.1| L-aminopeptidase/D-esterase [Cor... 35.0 7.3
gi|299470765|emb|CBN79811.1| aspartyl/glutamyl-tRNA amidotransfe... 35.0 7.6
gi|171683419|ref|XP_001906652.1| hypothetical protein [Podospora... 35.0 7.7
>gi|148824602|ref|YP_001289357.1| hypothetical protein TBFG_13430 [Mycobacterium tuberculosis F11]
gi|167968719|ref|ZP_02550996.1| hypothetical membrane protein [Mycobacterium tuberculosis H37Ra]
gi|253800442|ref|YP_003033443.1| hypothetical protein TBMG_03446 [Mycobacterium tuberculosis KZN
1435]
25 more sequence titles
Length=236
Score = 422 bits (1086), Expect = 1e-116, Method: Compositional matrix adjust.
Identities = 208/208 (100%), Positives = 208/208 (100%), Gaps = 0/208 (0%)
Query 1 VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 60
VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL
Sbjct 29 VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 88
Query 61 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 120
TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF
Sbjct 89 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 148
Query 121 DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL 180
DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL
Sbjct 149 DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL 208
Query 181 DVARRLVEEAGGDWNATTIAHGRREFVN 208
DVARRLVEEAGGDWNATTIAHGRREFVN
Sbjct 209 DVARRLVEEAGGDWNATTIAHGRREFVN 236
>gi|289555671|ref|ZP_06444881.1| membrane protein [Mycobacterium tuberculosis KZN 605]
gi|289440303|gb|EFD22796.1| membrane protein [Mycobacterium tuberculosis KZN 605]
Length=219
Score = 421 bits (1083), Expect = 3e-116, Method: Compositional matrix adjust.
Identities = 208/208 (100%), Positives = 208/208 (100%), Gaps = 0/208 (0%)
Query 1 VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 60
VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL
Sbjct 12 VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 71
Query 61 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 120
TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF
Sbjct 72 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 131
Query 121 DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL 180
DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL
Sbjct 132 DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL 191
Query 181 DVARRLVEEAGGDWNATTIAHGRREFVN 208
DVARRLVEEAGGDWNATTIAHGRREFVN
Sbjct 192 DVARRLVEEAGGDWNATTIAHGRREFVN 219
>gi|15842989|ref|NP_338026.1| hypothetical protein MT3503 [Mycobacterium tuberculosis CDC1551]
gi|254233997|ref|ZP_04927322.1| hypothetical protein TBCG_03338 [Mycobacterium tuberculosis C]
gi|13883329|gb|AAK47840.1| hypothetical protein MT3503 [Mycobacterium tuberculosis CDC1551]
gi|124599526|gb|EAY58630.1| hypothetical protein TBCG_03338 [Mycobacterium tuberculosis C]
gi|326905239|gb|EGE52172.1| membrane protein [Mycobacterium tuberculosis W-148]
gi|328460174|gb|AEB05597.1| membrane protein [Mycobacterium tuberculosis KZN 4207]
Length=213
Score = 421 bits (1081), Expect = 5e-116, Method: Compositional matrix adjust.
Identities = 208/208 (100%), Positives = 208/208 (100%), Gaps = 0/208 (0%)
Query 1 VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 60
VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL
Sbjct 6 VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 65
Query 61 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 120
TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF
Sbjct 66 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 125
Query 121 DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL 180
DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL
Sbjct 126 DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL 185
Query 181 DVARRLVEEAGGDWNATTIAHGRREFVN 208
DVARRLVEEAGGDWNATTIAHGRREFVN
Sbjct 186 DVARRLVEEAGGDWNATTIAHGRREFVN 213
>gi|31794576|ref|NP_857069.1| hypothetical protein Mb3428 [Mycobacterium bovis AF2122/97]
gi|57117103|ref|YP_177969.1| hypothetical protein Rv3395A [Mycobacterium tuberculosis H37Rv]
gi|121639320|ref|YP_979544.1| hypothetical protein BCG_3465 [Mycobacterium bovis BCG str. Pasteur
1173P2]
35 more sequence titles
Length=208
Score = 419 bits (1077), Expect = 1e-115, Method: Compositional matrix adjust.
Identities = 207/208 (99%), Positives = 208/208 (100%), Gaps = 0/208 (0%)
Query 1 VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 60
+QSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL
Sbjct 1 MQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 60
Query 61 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 120
TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF
Sbjct 61 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 120
Query 121 DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL 180
DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL
Sbjct 121 DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL 180
Query 181 DVARRLVEEAGGDWNATTIAHGRREFVN 208
DVARRLVEEAGGDWNATTIAHGRREFVN
Sbjct 181 DVARRLVEEAGGDWNATTIAHGRREFVN 208
>gi|289449093|ref|ZP_06438837.1| membrane protein [Mycobacterium tuberculosis CPHL_A]
gi|289422051|gb|EFD19252.1| membrane protein [Mycobacterium tuberculosis CPHL_A]
Length=208
Score = 417 bits (1073), Expect = 3e-115, Method: Compositional matrix adjust.
Identities = 206/208 (99%), Positives = 208/208 (100%), Gaps = 0/208 (0%)
Query 1 VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 60
+QSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL
Sbjct 1 MQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 60
Query 61 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 120
TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVV+APTLHNAAEALHRQFNQEAVLTF
Sbjct 61 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVEAPTLHNAAEALHRQFNQEAVLTF 120
Query 121 DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL 180
DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL
Sbjct 121 DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDL 180
Query 181 DVARRLVEEAGGDWNATTIAHGRREFVN 208
DVARRLVEEAGGDWNATTIAHGRREFVN
Sbjct 181 DVARRLVEEAGGDWNATTIAHGRREFVN 208
>gi|289747222|ref|ZP_06506600.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289687750|gb|EFD55238.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=145
Score = 296 bits (759), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 144/145 (99%), Positives = 145/145 (100%), Gaps = 0/145 (0%)
Query 64 VVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTFDYL 123
+VTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTFDYL
Sbjct 1 MVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTFDYL 60
Query 124 PQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDLDVA 183
PQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDLDVA
Sbjct 61 PQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDLDVA 120
Query 184 RRLVEEAGGDWNATTIAHGRREFVN 208
RRLVEEAGGDWNATTIAHGRREFVN
Sbjct 121 RRLVEEAGGDWNATTIAHGRREFVN 145
>gi|289763577|ref|ZP_06522955.1| hypothetical membrane protein [Mycobacterium tuberculosis GM
1503]
gi|289711083|gb|EFD75099.1| hypothetical membrane protein [Mycobacterium tuberculosis GM
1503]
Length=179
Score = 280 bits (716), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 137/137 (100%), Positives = 137/137 (100%), Gaps = 0/137 (0%)
Query 1 VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 60
VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL
Sbjct 29 VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 88
Query 61 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 120
TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF
Sbjct 89 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 148
Query 121 DYLPQNAPEADAILITV 137
DYLPQNAPEADAILITV
Sbjct 149 DYLPQNAPEADAILITV 165
>gi|240167852|ref|ZP_04746511.1| hypothetical protein MkanA1_00950 [Mycobacterium kansasii ATCC
12478]
Length=217
Score = 236 bits (601), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 121/199 (61%), Positives = 139/199 (70%), Gaps = 17/199 (8%)
Query 27 PATGGGPACRPAELFATDNT-----------------TDGFELPAVATIALTGTVVTGST 69
PA+ +C+PAELFATDNT + FE AV TIA TG V GST
Sbjct 18 PASSAAGSCQPAELFATDNTWVITGSDAKATGQLHGDLEPFERQAVVTIAQTGAEVRGST 77
Query 70 LVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTFDYLPQNAPE 129
LVDGVFWS QQI YER+R+FHLCVVD PTLH AAEA++RQF+QE LTF+YLPQ AP
Sbjct 78 LVDGVFWSATLQQIAYERARQFHLCVVDEPTLHTAAEAMNRQFHQETALTFEYLPQGAPR 137
Query 130 ADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDLDVARRLVEE 189
ADA+LI VP I +ARF DA +D AA RL GGS+TTAD TL+LVA N DLD+AR LVE+
Sbjct 138 ADAMLIAVPGIDLARFGDALTADSAARQRLLGGSITTADRTLLLVAANRDLDIARHLVEQ 197
Query 190 AGGDWNATTIAHGRREFVN 208
AGG W A TI +GRRE V
Sbjct 198 AGGSWTAATINYGRRELVQ 216
>gi|183981171|ref|YP_001849462.1| hypothetical protein MMAR_1149 [Mycobacterium marinum M]
gi|183174497|gb|ACC39607.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=229
Score = 231 bits (589), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 131/222 (60%), Positives = 151/222 (69%), Gaps = 15/222 (6%)
Query 1 VQSRKTTSVLAAALLFCGLLGPGTAP-PATGGGPACRPAELFATDNT---TDG------- 49
V+SR+T S LAA L+ CG AP PA G PACRPAE+FAT+NT TD
Sbjct 6 VRSRRTLSALAALLVACGGSSSWVAPAPARGEEPACRPAEIFATNNTAVSTDPDSSQSHD 65
Query 50 ----FELPAVATIALTGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAA 105
FE+ A ATIA G +TGS LV+GV WS+E Q YERSREFH+CVVDAPTLHN A
Sbjct 66 QLQLFEMQAAATIAQNGAAMTGSRLVNGVLWSDELHQNTYERSREFHVCVVDAPTLHNVA 125
Query 106 EALHRQFNQEAVLTFDYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVT 165
EAL QF+Q +VLTF+YLPQ AP A+A I VPDI R A +D A RL GGS++
Sbjct 126 EALRNQFDQGSVLTFEYLPQGAPAANAFTIDVPDIDAPRLGQALMADAVARDRLLGGSIS 185
Query 166 TADHTLILVAGNGDLDVARRLVEEAGGDWNATTIAHGRREFV 207
T DHTLILVAGN D+D+ARRLV EAGG W A TI +GRRE V
Sbjct 186 TDDHTLILVAGNDDVDIARRLVNEAGGCWRAATITYGRRELV 227
>gi|118616682|ref|YP_905014.1| hypothetical protein MUL_0914 [Mycobacterium ulcerans Agy99]
gi|118568792|gb|ABL03543.1| conserved hypothetical secreted protein [Mycobacterium ulcerans
Agy99]
Length=229
Score = 229 bits (585), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 130/222 (59%), Positives = 150/222 (68%), Gaps = 15/222 (6%)
Query 1 VQSRKTTSVLAAALLFCGLLGPGTAP-PATGGGPACRPAELFATDNT---TDG------- 49
V+SR+T S LAA L+ CG AP PA G PACRPAE+FAT+NT TD
Sbjct 6 VRSRRTLSALAALLVACGGSSSWVAPAPARGEEPACRPAEIFATNNTAVSTDPDSSQSHD 65
Query 50 ----FELPAVATIALTGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAA 105
FE+ A ATIA G +TGS LV+GV WS+E Q YERSREFH+CVVDAPTLHN A
Sbjct 66 QLQLFEMQAAATIAQNGAAMTGSRLVNGVLWSDELHQNTYERSREFHVCVVDAPTLHNVA 125
Query 106 EALHRQFNQEAVLTFDYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVT 165
EAL QF+Q +VLTF+YLPQ AP A+A I VPDI R A +D A RL GGS++
Sbjct 126 EALRNQFDQGSVLTFEYLPQGAPAANAFTIDVPDIDAPRLGQALMADAVARDRLLGGSIS 185
Query 166 TADHTLILVAGNGDLDVARRLVEEAGGDWNATTIAHGRREFV 207
T DHTLILVAGN D+D+ARRLV EAGG W TI +GRRE V
Sbjct 186 TDDHTLILVAGNDDVDIARRLVNEAGGCWRTATITYGRRELV 227
>gi|289571632|ref|ZP_06451859.1| membrane protein [Mycobacterium tuberculosis T17]
gi|289545386|gb|EFD49034.1| membrane protein [Mycobacterium tuberculosis T17]
Length=117
Score = 229 bits (583), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 111/115 (97%), Positives = 114/115 (99%), Gaps = 0/115 (0%)
Query 1 VQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 60
+QSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL
Sbjct 1 MQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIAL 60
Query 61 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQE 115
TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQF ++
Sbjct 61 TGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFIKK 115
>gi|302538436|ref|ZP_07290778.1| conserved hypothetical protein [Streptomyces sp. C]
gi|302447331|gb|EFL19147.1| conserved hypothetical protein [Streptomyces sp. C]
Length=221
Score = 140 bits (353), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 90/219 (42%), Positives = 113/219 (52%), Gaps = 20/219 (9%)
Query 8 SVLAAALLFCGLLGPGTA-----PPATGGGPACRPAELFATDNT---TD----------- 48
+++A +L G GTA P + G C AE+FATDNT TD
Sbjct 3 ALVAVGVLALASAGTGTAQADAPSPTSSGTGGCPAAEVFATDNTAVITDPADPRLRTHLL 62
Query 49 GFELPAVATIALTGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEAL 108
F+ I G STL+DGVFWS E QQ YERSREF + VDA LH+ A+ +
Sbjct 63 RFDREVREIIRSHGARTESSTLLDGVFWSEEEQQATYERSREFDVARVDANGLHHIADVI 122
Query 109 HRQFNQEAVLTFDYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTAD 168
+Q+ QE+VLTF LP+ +PE DA+ I + R RDA +D A RL GGSV
Sbjct 123 RKQYRQESVLTFRCLPRTSPETDAVEIQADGVSATRLRDALLADPVARERLGGGSVALGG 182
Query 169 HTLILVAGNGDLDVARRLVEEAGGDWNATTIAHGRREFV 207
LILVA DL +AR G DW + I +G EFV
Sbjct 183 R-LILVAPLADLPLAREFTARLGVDWKSAEIRYGDEEFV 220
>gi|328887122|emb|CCA60361.1| hypothetical protein SVEN_7075 [Streptomyces venezuelae ATCC
10712]
Length=225
Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 91/226 (41%), Positives = 123/226 (55%), Gaps = 22/226 (9%)
Query 1 VQSRKTTSVLAAAL--LFCGLLGPGT--APPATGGGPACRPAELFATDNTT--------- 47
+ +R+ +LA L L GL G + A PA G C AE+FATDNT+
Sbjct 4 IATRRRLRLLATGLAALVLGLTGTQSTYASPADQG---CPTAEIFATDNTSIITDPADPR 60
Query 48 -----DGFELPAVATIALTGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLH 102
F+ I G STL+DGVFWS++ +Q YERSREF + VD LH
Sbjct 61 LRTRLTRFDHEVRTLIRAHGARPAASTLLDGVFWSDDLKQATYERSREFDVNRVDRDGLH 120
Query 103 NAAEALHRQFNQEAVLTFDYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGG 162
+ A + ++++QE+VLTF LP+ +PE DA+ I VP + ++ RDA +D A +L GG
Sbjct 121 HIAGVIAKEYDQESVLTFRCLPRTSPETDAVEIEVPGVRVSGLRDALVADPEAREKLGGG 180
Query 163 SVTTADHTLILVAGNGDLDVARRLVEEAGGDWNATTIAHGRREFVN 208
SVT D L+LVA +L AR V G DWN + +G REFVN
Sbjct 181 SVTL-DGRLLLVAPIAELPFARTFVTGLGADWNEARVRYGDREFVN 225
>gi|169631750|ref|YP_001705399.1| hypothetical protein MAB_4677c [Mycobacterium abscessus ATCC
19977]
gi|169243717|emb|CAM64745.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=257
Score = 134 bits (338), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 82/210 (40%), Positives = 110/210 (53%), Gaps = 21/210 (10%)
Query 19 LLGPGTA-----PPATGGGPACRP-AELFATDNTT--------------DGFELPAVATI 58
++GP + P A G RP ELFA++NT D F + A
Sbjct 46 VVGPSSGYARADPQACVPGDVARPQGELFASNNTATITDPADARLQDPLDDFSVQVSAMT 105
Query 59 ALTGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVL 118
+ ST VDGV+WS + ++ YERSR F L VD L++ E + R+F QE+VL
Sbjct 106 VQNLALPVRSTRVDGVYWSQDNDRMTYERSRAFELACVDGDDLYSIGEQVGRRFGQESVL 165
Query 119 TFDYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNG 178
TF+YLP +A+ + VP + RF D +D AA L GGSVT D LIL+A
Sbjct 166 TFEYLPAGDARVNAVAVEVPGVDRVRFHDVLLTDPAARAALSGGSVTE-DGWLILIADVK 224
Query 179 DLDVARRLVEEAGGDWNATTIAHGRREFVN 208
D+ +ARRLV+ AGG W I +G+REFV
Sbjct 225 DIGIARRLVDAAGGRWQDVAIQYGKREFVE 254
>gi|37595048|gb|AAQ94239.1| unknown [Saccharopolyspora erythraea]
Length=220
Score = 123 bits (309), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 86/218 (40%), Positives = 113/218 (52%), Gaps = 22/218 (10%)
Query 5 KTTSVLAAALLFCGLLGPGTAPPATGGGPACRP-AELFATDNT---TD-----------G 49
+ + V +LLF GTA A PA P A LFAT NT TD
Sbjct 9 RRSVVAVVSLLFA--FTAGTAQAA----PAEEPQALLFATSNTAVITDPGDPRLDTPLTE 62
Query 50 FELPAVATIALTGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALH 109
F I G S L+DGVFWS E Q+ YERSR F + D LH+ A+ +
Sbjct 63 FARAVRGIIRDNGARDGRSELLDGVFWSGELQRATYERSRSFDVRETDPVELHHIADLVR 122
Query 110 RQFNQEAVLTFDYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADH 169
++F QE+VLTF++LP+++ DA L+ VP + + D A+D A RL GGSVT D
Sbjct 123 KEFGQESVLTFEHLPRDSARTDAFLVEVPGVDVTDLHDGLATDPQARERLGGGSVTM-DG 181
Query 170 TLILVAGNGDLDVARRLVEEAGGDWNATTIAHGRREFV 207
L+LVA DL++AR V GG W+ + +G REFV
Sbjct 182 ELVLVAELADLELAREFVVRLGGRWDGAGLRYGDREFV 219
>gi|134100681|ref|YP_001106342.1| hypothetical protein SACE_4148 [Saccharopolyspora erythraea NRRL
2338]
gi|291006515|ref|ZP_06564488.1| hypothetical protein SeryN2_18511 [Saccharopolyspora erythraea
NRRL 2338]
gi|133913304|emb|CAM03417.1| probable membrane protein [Saccharopolyspora erythraea NRRL 2338]
Length=206
Score = 122 bits (307), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/140 (45%), Positives = 87/140 (63%), Gaps = 1/140 (0%)
Query 68 STLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTFDYLPQNA 127
S L+DGVFWS E Q+ YERSR F + D LH+ A+ + ++F QE+VLTF++LP+++
Sbjct 67 SELLDGVFWSGELQRATYERSRSFDVRETDPVELHHIADLVRKEFGQESVLTFEHLPRDS 126
Query 128 PEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDLDVARRLV 187
DA L+ VP + + D A+D A RL GGSVT D L+LVA DL++AR V
Sbjct 127 ARTDAFLVEVPGVDVTDLHDGLATDPQARERLGGGSVTM-DGELVLVAELADLELAREFV 185
Query 188 EEAGGDWNATTIAHGRREFV 207
GG W+ + +G REFV
Sbjct 186 VRLGGRWDGAGLRYGDREFV 205
>gi|302520204|ref|ZP_07272546.1| conserved hypothetical protein [Streptomyces sp. SPB78]
gi|302429099|gb|EFL00915.1| conserved hypothetical protein [Streptomyces sp. SPB78]
Length=220
Score = 110 bits (274), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 75/200 (38%), Positives = 104/200 (52%), Gaps = 17/200 (8%)
Query 25 APPATGGGPA--CRPAELFATDNT---TD-----------GFELPAVATIALTGTVVTGS 68
A PA GG C E+FAT+NT TD F+ I G S
Sbjct 22 AAPARSGGAHRDCPSGEVFATNNTAVVTDPADPRLRTRLTRFDREVRGIIRAHGARPGAS 81
Query 69 TLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTFDYLPQNAP 128
TL+DGVFWS ++ +ERSREF + L + A L ++++QE+VLTF LP++AP
Sbjct 82 TLLDGVFWSAGLRKTTFERSREFDVDGTGRDGLRHLAGVLAKRYHQESVLTFRCLPRHAP 141
Query 129 EADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDLDVARRLVE 188
DA I P + A R+A + A L GGSVT D L+LV+ +L +AR+ +
Sbjct 142 ATDAARIEAPGVSAAALREALRTHPGAREELGGGSVTE-DGRLVLVSPLEELPLARKFTK 200
Query 189 EAGGDWNATTIAHGRREFVN 208
+ G DWN + +G REFV+
Sbjct 201 DLGVDWNTAEVRYGEREFVS 220
>gi|318061702|ref|ZP_07980423.1| hypothetical protein SSA3_27438 [Streptomyces sp. SA3_actG]
Length=238
Score = 107 bits (267), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 75/200 (38%), Positives = 102/200 (51%), Gaps = 17/200 (8%)
Query 25 APPATGGGPA--CRPAELFATDNT---TD-----------GFELPAVATIALTGTVVTGS 68
A PA GG C E+FAT+NT TD F+ I G S
Sbjct 40 AAPARSGGAHRDCPSGEVFATNNTAVVTDPADPRLRTRLTRFDREVRGIIRAHGARPGAS 99
Query 69 TLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTFDYLPQNAP 128
TL+DGVFWS +ERSREF + L + A L ++++QE+VLTF LP++AP
Sbjct 100 TLLDGVFWSAGLGTTTFERSREFDVDGTGRDGLRHLAGVLAKRYHQESVLTFRCLPRHAP 159
Query 129 EADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDLDVARRLVE 188
DA I P + A R+A + A L GGSVT D L+LV+ +L +AR+ +
Sbjct 160 ATDAARIEAPGVSAAALREALRTHPGAREELGGGSVTE-DGRLVLVSPLEELPLARKFTK 218
Query 189 EAGGDWNATTIAHGRREFVN 208
+ G DWN + +G REFV+
Sbjct 219 DLGVDWNTAEVRYGEREFVS 238
>gi|333026046|ref|ZP_08454110.1| hypothetical protein STTU_3550 [Streptomyces sp. Tu6071]
gi|332745898|gb|EGJ76339.1| hypothetical protein STTU_3550 [Streptomyces sp. Tu6071]
Length=238
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 102/200 (51%), Gaps = 17/200 (8%)
Query 25 APPATGGGPA--CRPAELFATDNT---TD-----------GFELPAVATIALTGTVVTGS 68
A PA GG C E+FAT+NT TD F+ I G S
Sbjct 40 AAPARSGGAHRDCPSGEVFATNNTAVVTDPADPRLRTRLTRFDREVRGIIRAHGARPGAS 99
Query 69 TLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTFDYLPQNAP 128
TL+DGVFWS + +ERSREF + L + A L ++++QE+VLTF LP++AP
Sbjct 100 TLLDGVFWSAGLGKTTFERSREFDVDGTGRDGLRHLAGVLAKRYHQESVLTFRCLPRHAP 159
Query 129 EADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDLDVARRLVE 188
DA I P + A R+A + A L GGSVT D L+LV+ +L +A + +
Sbjct 160 ATDAARIEAPGVSAAALREALRTHPGAREELGGGSVTE-DGRLVLVSPLEELPLAWKFTK 218
Query 189 EAGGDWNATTIAHGRREFVN 208
+ G DWN + +G REFV+
Sbjct 219 DLGVDWNTAEVCYGEREFVS 238
>gi|318079665|ref|ZP_07986997.1| hypothetical protein SSA3_23986 [Streptomyces sp. SA3_actF]
Length=157
Score = 101 bits (251), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 58/141 (42%), Positives = 82/141 (59%), Gaps = 1/141 (0%)
Query 68 STLVDGVFWSNERQQIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTFDYLPQNA 127
STL+DGVFWS +ERSREF + L + A L ++++QE+VLTF LP++A
Sbjct 18 STLLDGVFWSAGLGTTTFERSREFDVDGTGRDGLRHLAGVLAKRYHQESVLTFRCLPRHA 77
Query 128 PEADAILITVPDIGIARFRDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDLDVARRLV 187
P DA I P + A R+A + A L GGSVT D L+LV+ +L +AR+
Sbjct 78 PATDAARIEAPGVSAAALREALRTHPGAREELGGGSVTE-DGRLVLVSPLEELPLARKFT 136
Query 188 EEAGGDWNATTIAHGRREFVN 208
++ G DWN + +G REFV+
Sbjct 137 KDLGVDWNTAEVRYGEREFVS 157
>gi|111219823|ref|YP_710617.1| putative branched chain amino acid ABC transporter ATP-binding
protein [Frankia alni ACN14a]
gi|111147355|emb|CAJ59006.1| putative branched chain amino acid ABC transporter ATP-binding
protein [Frankia alni ACN14a]
Length=940
Score = 38.9 bits (89), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 45/143 (32%), Positives = 59/143 (42%), Gaps = 16/143 (11%)
Query 62 GTVVTGSTLVDGVFWSNERQ-QIGYERSREFHLCVVDAPTLHNAAEALHRQFNQEAVLTF 120
G V G T +DG WS ER+ + G RS + D L N A R+ + A +T
Sbjct 718 GEVRLGETRIDG--WSRERRARAGLGRSFQSLELFEDLTVLENLQSACDRR-DSLAYVTN 774
Query 121 DYLPQNAPEADAILITVPDIGIARFRDAFASDLAAHHR--LRGGSVTTADHTLILV---- 174
+P A V D G+A F D DL HR L H+++L+
Sbjct 775 LVVPDRGRLTPAAWAAVADFGLAPFLDTPVQDLGYAHRRMLAVARAVAGGHSVLLLDEPA 834
Query 175 AGNGDLD------VARRLVEEAG 191
AG GD V RRL +E G
Sbjct 835 AGLGDAQTRELGAVLRRLADERG 857
>gi|312219773|emb|CBX99715.1| similar to fatty acid synthase beta subunit [Leptosphaeria maculans]
Length=2109
Score = 35.8 bits (81), Expect = 4.5, Method: Composition-based stats.
Identities = 32/130 (25%), Positives = 53/130 (41%), Gaps = 14/130 (10%)
Query 36 RPAELFATDNTTDGFELPAVATIALTGTVVTGSTLVDGVFWSNERQQIGYERSR----EF 91
R L D+T G L + +A V+ +LV VF+ Q+ ER +
Sbjct 1843 RAKGLVQADSTFAGHSLGEYSALAALAEVMPIESLVSVVFYRGLTMQVAVERDETGRSNY 1902
Query 92 HLCVVDAPTLHNAAEALHRQFNQEAVLTFDYLPQNAPEADAILITVPDIGIARFRDAFAS 151
+C V+ + + FN++A+ Y+ +N E L+ + + IA + A
Sbjct 1903 SMCAVN-------PSRISKTFNEQAL---QYVVENIAETTGWLLEIVNYNIANMQYVAAG 1952
Query 152 DLAAHHRLRG 161
DL A L G
Sbjct 1953 DLRALDCLTG 1962
>gi|189204113|ref|XP_001938392.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187985491|gb|EDU50979.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length=614
Score = 35.0 bits (79), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 34/99 (35%), Positives = 46/99 (47%), Gaps = 13/99 (13%)
Query 41 FATDNTTDGFELPAVATIALTGTVVTGST---LVDGVFWSNERQQIGYERSREFHLCVVD 97
FATD TD E A A I+ G +TGS LV V WS R+Q G+ + R + +V
Sbjct 123 FATDERTDAHEFRA-ARISPIGAALTGSATFILVWLVHWS--RRQEGFSKGRTLLVLLVF 179
Query 98 APTLHNAAEALHRQFNQEAVLTFDYLPQNAPEADAILIT 136
A A + RQ+ YL Q A + + L+T
Sbjct 180 AAMATAAYGYMRRQW-------LYYLRQEAVKGASALVT 211
>gi|326446723|ref|ZP_08221457.1| hypothetical protein SclaA2_36900 [Streptomyces clavuligerus
ATCC 27064]
Length=155
Score = 35.0 bits (79), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 35/117 (30%), Positives = 50/117 (43%), Gaps = 10/117 (8%)
Query 4 RKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFA--TDNTTDGFELPAVATIALT 61
R +VL AAL LG TAPPA G P P F D+ DG AVA I+
Sbjct 32 RTRAAVLGAALTAALSLGAVTAPPAAAGNPYFCPQSKFCLWEDSNYDGAMATAVAGISWI 91
Query 62 GTVVT--GSTLVDGVFWSNERQQIGYERSREFH-LCVVDAPTLHNAAEALHRQFNQE 115
G + GS+ +W+ + + R +F C++ + +A L Q N +
Sbjct 92 GPYMNDRGSS-----YWNRTGEWVTLYRDIDFQGGCLMGSIAPQESATVLSAQANDQ 143
>gi|227504004|ref|ZP_03934053.1| L-aminopeptidase/D-esterase [Corynebacterium striatum ATCC 6940]
gi|227199398|gb|EEI79446.1| L-aminopeptidase/D-esterase [Corynebacterium striatum ATCC 6940]
Length=310
Score = 35.0 bits (79), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 34/113 (31%), Positives = 43/113 (39%), Gaps = 21/113 (18%)
Query 14 LLFCGLLGPGTAPPATGGGPACRPAELFATDNTTDGFELPAVATIALTGTVVTGSTLVDG 73
+L+CG G + GGGP R +L NT + V I L G G DG
Sbjct 15 VLYCGSQGAVASIDVRGGGPGTRETDLLEPHNTVE-----RVHAITLAGGSAFGLAAADG 69
Query 74 VFWSNERQQIGYERSRE---------------FHLCVVDA-PTLHNAAEALHR 110
V E Q IG+ E F L + D PT + AEA+ R
Sbjct 70 VMRELESQGIGFPVLGEGKPGPRVPIVPGAVIFDLLLGDERPTAEDGAEAVKR 122
>gi|299470765|emb|CBN79811.1| aspartyl/glutamyl-tRNA amidotransferase subunit B [Ectocarpus
siliculosus]
Length=560
Score = 35.0 bits (79), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 23/85 (28%), Positives = 36/85 (43%), Gaps = 0/85 (0%)
Query 23 GTAPPATGGGPACRPAELFATDNTTDGFELPAVATIALTGTVVTGSTLVDGVFWSNERQQ 82
G P GG R A+ + GF +PA+ + V +G +L D ER
Sbjct 129 GPTKPKAGGAAKARSADELQGGSNDPGFHVPAIGITRIQLEVDSGKSLHDRKGEGAERSL 188
Query 83 IGYERSREFHLCVVDAPTLHNAAEA 107
+ R+ + +V P + +AAEA
Sbjct 189 VDLNRAGTALMEIVFEPEIRSAAEA 213
>gi|171683419|ref|XP_001906652.1| hypothetical protein [Podospora anserina S mat+]
gi|170941669|emb|CAP67323.1| unnamed protein product [Podospora anserina S mat+]
Length=2091
Score = 35.0 bits (79), Expect = 7.7, Method: Composition-based stats.
Identities = 31/126 (25%), Positives = 53/126 (43%), Gaps = 14/126 (11%)
Query 40 LFATDNTTDGFELPAVATIALTGTVVTGSTLVDGVFWSNERQQIGYERSRE----FHLCV 95
L D+T G L + +A V+ +LV VF+ Q+ ER + + +C
Sbjct 1828 LVPRDSTFAGHSLGEYSALAALADVMPIESLVSVVFYRGLTMQVAVERDEQGRSNYSMCA 1887
Query 96 VDAPTLHNAAEALHRQFNQEAVLTFDYLPQNAPEADAILITVPDIGIARFRDAFASDLAA 155
V+ + + FN+EA+ ++ N E+ L+ + + IA + A DL A
Sbjct 1888 VN-------PSRISKTFNEEAL---RFVVSNIAESTGWLLEIVNFNIANMQYVCAGDLRA 1937
Query 156 HHRLRG 161
L G
Sbjct 1938 LDTLAG 1943
Lambda K H
0.320 0.135 0.405
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 239574757050
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40