BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0628c
Length=383
Sequences producing significant alignments: Score (Bits) E Value
gi|15607768|ref|NP_215142.1| hypothetical protein Rv0628c [Mycob... 743 0.0
gi|289442020|ref|ZP_06431764.1| conserved hypothetical protein [... 743 0.0
gi|306774736|ref|ZP_07413073.1| hypothetical protein TMAG_02508 ... 742 0.0
gi|289749126|ref|ZP_06508504.1| conserved hypothetical protein [... 741 0.0
gi|254230962|ref|ZP_04924289.1| conserved hypothetical protein [... 740 0.0
gi|340625645|ref|YP_004744097.1| hypothetical protein MCAN_06251... 729 0.0
gi|308396149|ref|ZP_07492241.2| hypothetical protein TMLG_03378 ... 656 0.0
gi|240169380|ref|ZP_04748039.1| hypothetical protein MkanA1_0870... 607 9e-172
gi|15608014|ref|NP_215389.1| hypothetical protein Rv0874c [Mycob... 587 2e-165
gi|289756959|ref|ZP_06516337.1| conserved hypothetical protein [... 585 4e-165
gi|289744604|ref|ZP_06503982.1| conserved hypothetical protein [... 576 3e-162
gi|307078563|ref|ZP_07487733.1| hypothetical protein TMKG_03909 ... 568 4e-160
gi|15840288|ref|NP_335325.1| hypothetical protein MT0897 [Mycoba... 568 6e-160
gi|339293886|gb|AEJ45997.1| hypothetical protein CCDC5079_0807 [... 562 4e-158
gi|289573230|ref|ZP_06453457.1| LOW QUALITY PROTEIN: conserved h... 536 3e-150
gi|308375278|ref|ZP_07667983.1| hypothetical protein TMGG_02939 ... 489 4e-136
gi|306796378|ref|ZP_07434680.1| hypothetical protein TMFG_03295 ... 407 2e-111
gi|289568838|ref|ZP_06449065.1| conserved hypothetical protein [... 392 7e-107
gi|289744342|ref|ZP_06503720.1| conserved hypothetical protein [... 389 3e-106
gi|306796379|ref|ZP_07434681.1| hypothetical protein TMFG_03296 ... 353 3e-95
gi|289749395|ref|ZP_06508773.1| conserved hypothetical protein [... 353 3e-95
gi|289744343|ref|ZP_06503721.1| conserved hypothetical protein [... 329 4e-88
gi|283778153|ref|YP_003368908.1| hypothetical protein Psta_0358 ... 286 3e-75
gi|87306450|ref|ZP_01088597.1| hypothetical protein DSM3645_0896... 270 4e-70
gi|302035705|ref|YP_003796027.1| hypothetical protein NIDE0322 [... 268 1e-69
gi|271969747|ref|YP_003343943.1| hypothetical protein Sros_8558 ... 268 2e-69
gi|325111105|ref|YP_004272173.1| hypothetical protein Plabr_4580... 266 6e-69
gi|297171923|gb|ADI22910.1| uncharacterized protein conserved in... 265 1e-68
gi|284044707|ref|YP_003395047.1| hypothetical protein Cwoe_3254 ... 264 2e-68
gi|296121655|ref|YP_003629433.1| hypothetical protein Plim_1400 ... 242 6e-62
gi|296271068|ref|YP_003653700.1| hypothetical protein Tbis_3113 ... 242 6e-62
gi|269125309|ref|YP_003298679.1| hypothetical protein Tcur_1055 ... 241 1e-61
gi|72160848|ref|YP_288505.1| hypothetical protein Tfu_0444 [Ther... 240 3e-61
gi|117929098|ref|YP_873649.1| hypothetical protein Acel_1891 [Ac... 236 4e-60
gi|297559074|ref|YP_003678048.1| hypothetical protein Ndas_0091 ... 230 3e-58
gi|149923652|ref|ZP_01912048.1| hypothetical protein PPSIR1_1692... 226 7e-57
gi|223939736|ref|ZP_03631608.1| protein of unknown function DUF1... 223 6e-56
gi|262196432|ref|YP_003267641.1| hypothetical protein Hoch_3246 ... 215 8e-54
gi|86609276|ref|YP_478038.1| hypothetical protein CYB_1819 [Syne... 210 3e-52
gi|320103039|ref|YP_004178630.1| hypothetical protein Isop_1496 ... 209 7e-52
gi|86606541|ref|YP_475304.1| hypothetical protein CYA_1894 [Syne... 208 1e-51
gi|294055462|ref|YP_003549120.1| hypothetical protein Caka_1932 ... 202 7e-50
gi|37520395|ref|NP_923772.1| hypothetical protein gll0826 [Gloeo... 201 1e-49
gi|153006881|ref|YP_001381206.1| hypothetical protein Anae109_40... 198 2e-48
gi|159028345|emb|CAO87243.1| unnamed protein product [Microcysti... 196 7e-48
gi|166366981|ref|YP_001659254.1| hypothetical protein MAE_42400 ... 190 3e-46
gi|298246483|ref|ZP_06970289.1| protein of unknown function DUF1... 190 4e-46
gi|298490695|ref|YP_003720872.1| hypothetical protein Aazo_1561 ... 189 5e-46
gi|158336704|ref|YP_001517878.1| hypothetical protein AM1_3572 [... 188 1e-45
gi|254412137|ref|ZP_05025912.1| conserved domain protein [Microc... 182 7e-44
>gi|15607768|ref|NP_215142.1| hypothetical protein Rv0628c [Mycobacterium tuberculosis H37Rv]
gi|15840029|ref|NP_335066.1| hypothetical protein MT0656 [Mycobacterium tuberculosis CDC1551]
gi|31791810|ref|NP_854303.1| hypothetical protein Mb0644c [Mycobacterium bovis AF2122/97]
56 more sequence titles
Length=383
Score = 743 bits (1919), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/383 (99%), Positives = 383/383 (100%), Gaps = 0/383 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV
Sbjct 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPVAGHNALHGFTASMALFVD 383
EIGPVAGHNALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|289442020|ref|ZP_06431764.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289568565|ref|ZP_06448792.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289414939|gb|EFD12179.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289542319|gb|EFD45967.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=383
Score = 743 bits (1917), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/383 (99%), Positives = 382/383 (99%), Gaps = 0/383 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV
Sbjct 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
KDLRLAVER AAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERVAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPVAGHNALHGFTASMALFVD 383
EIGPVAGHNALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|306774736|ref|ZP_07413073.1| hypothetical protein TMAG_02508 [Mycobacterium tuberculosis SUMu001]
gi|306970840|ref|ZP_07483501.1| hypothetical protein TMJG_02372 [Mycobacterium tuberculosis SUMu010]
gi|308216629|gb|EFO76028.1| hypothetical protein TMAG_02508 [Mycobacterium tuberculosis SUMu001]
gi|308359625|gb|EFP48476.1| hypothetical protein TMJG_02372 [Mycobacterium tuberculosis SUMu010]
Length=383
Score = 742 bits (1916), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/383 (99%), Positives = 383/383 (100%), Gaps = 0/383 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV
Sbjct 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
SRGLQIGIVVDEHLAVPGQG+FLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGNFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPVAGHNALHGFTASMALFVD 383
EIGPVAGHNALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|289749126|ref|ZP_06508504.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289689713|gb|EFD57142.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=383
Score = 741 bits (1912), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/383 (99%), Positives = 381/383 (99%), Gaps = 0/383 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV
Sbjct 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPT GAIGIGEVVEVGATVQFQVRDAAAAD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTKGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
KDLRLAVER AAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERVAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPVAGHNALHGFTASMALFVD 383
EIGPVAGHNALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|254230962|ref|ZP_04924289.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|124600021|gb|EAY59031.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
Length=383
Score = 740 bits (1911), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/383 (99%), Positives = 382/383 (99%), Gaps = 0/383 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
HDLHLLLPDPYSFPSNLLIE LNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG
Sbjct 121 HDLHLLLPDPYSFPSNLLIERLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV
Sbjct 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPVAGHNALHGFTASMALFVD 383
EIGPVAGHNALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|340625645|ref|YP_004744097.1| hypothetical protein MCAN_06251 [Mycobacterium canettii CIPT 140010059]
gi|340003835|emb|CCC42965.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=383
Score = 729 bits (1883), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/383 (99%), Positives = 378/383 (99%), Gaps = 0/383 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGVSTAPDVRRAAAEAAAHA EELAGGTPALAVLLGSRSHTDQAVDLLAAVQ SVE
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAHEELAGGTPALAVLLGSRSHTDQAVDLLAAVQESVE 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
PAALIGCVAQGIVAGRHELENEPAVAVWLASG PAETFHLDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGSPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
HDLHLLLPDPYSFPSNLLI+HLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG
Sbjct 121 HDLHLLLPDPYSFPSNLLIDHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPGAHSVSVVSQ CRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV
Sbjct 181 VRLPGAHSVSVVSQSCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
KDLRLAVERAAAELPGPPVGGLLFT NGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERAAAELPGPPVGGLLFTGNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPVAGHNALHGFTASMALFVD 383
EIGPVAGHNALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|308396149|ref|ZP_07492241.2| hypothetical protein TMLG_03378 [Mycobacterium tuberculosis SUMu012]
gi|308367164|gb|EFP56015.1| hypothetical protein TMLG_03378 [Mycobacterium tuberculosis SUMu012]
Length=335
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/335 (99%), Positives = 335/335 (100%), Gaps = 0/335 (0%)
Query 49 VDLLAAVQASVEPAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSG 108
+DLLAAVQASVEPAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSG
Sbjct 1 MDLLAAVQASVEPAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSG 60
Query 109 ALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLF 168
ALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLF
Sbjct 61 ALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLF 120
Query 169 RDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREI 228
RDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREI
Sbjct 121 RDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREI 180
Query 229 VLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGAT 288
VLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGAT
Sbjct 181 VLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGAT 240
Query 289 VQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLL 348
VQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLL
Sbjct 241 VQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLL 300
Query 349 GGIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD 383
GGIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD
Sbjct 301 GGIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD 335
>gi|240169380|ref|ZP_04748039.1| hypothetical protein MkanA1_08708 [Mycobacterium kansasii ATCC 12478]
Length=383
Score = 607 bits (1565), Expect = 9e-172, Method: Compositional matrix adjust.
Identities = 313/383 (82%), Positives = 336/383 (88%), Gaps = 0/383 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVG STAPD R+AA EAA A +ELAG P+LAVLLGSRSH+DQA D+L AVQ V
Sbjct 1 MRIGVGFSTAPDARKAAVEAATQACDELAGEMPSLAVLLGSRSHSDQAADVLNAVQEIVG 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
LIGCVAQ +VAGRHE+E++PAVAVWLASG AETF LDFVRTGSG L+TGYRFDRTA
Sbjct 61 SPPLIGCVAQAVVAGRHEIEDQPAVAVWLASGLAAETFQLDFVRTGSGGLLTGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
HDLHLLLPDPY+FPS+LLIEHLN+DLPGTTVVGG+ SGGR G TRLFRDR V +SGLVG
Sbjct 121 HDLHLLLPDPYTFPSSLLIEHLNSDLPGTTVVGGLASGGRGPGGTRLFRDRGVFSSGLVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPG HS+ +VSQGCRPIG PYIVTGADGAVITELGGRPPL RLREIV G+ EQELV
Sbjct 181 VRLPGVHSIPIVSQGCRPIGRPYIVTGADGAVITELGGRPPLVRLREIVEGLPLHEQELV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
SRGLQIGIVVDEHLA PGQGDFLIRGLLGADP+TG I IGEVVEVG TVQFQVRDAA+AD
Sbjct 241 SRGLQIGIVVDEHLAAPGQGDFLIRGLLGADPSTGVIEIGEVVEVGTTVQFQVRDAASAD 300
Query 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
KDL LAVERAAAEL G P G LLFTCNGRGRRMFGV DHDASTIEDLLGGIPLAGFFAAG
Sbjct 301 KDLHLAVERAAAELGGRPAGALLFTCNGRGRRMFGVADHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPVAGHNALHGFTASMALFVD 383
EIGPV G NALHG+TAS+ALFVD
Sbjct 361 EIGPVFGRNALHGYTASLALFVD 383
>gi|15608014|ref|NP_215389.1| hypothetical protein Rv0874c [Mycobacterium tuberculosis H37Rv]
gi|31792062|ref|NP_854555.1| hypothetical protein Mb0898c [Mycobacterium bovis AF2122/97]
gi|121636797|ref|YP_977020.1| hypothetical protein BCG_0926c [Mycobacterium bovis BCG str. Pasteur 1173P2]
62 more sequence titles
Length=386
Score = 587 bits (1512), Expect = 2e-165, Method: Compositional matrix adjust.
Identities = 312/383 (82%), Positives = 339/383 (89%), Gaps = 0/383 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LV
Sbjct 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
S GLQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA AD
Sbjct 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
Query 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
KDLRL VERAAA LPG G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAG
Sbjct 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
Query 361 EIGPVAGHNALHGFTASMALFVD 383
EIGP+AG NALHGFTASMALFVD
Sbjct 361 EIGPIAGRNALHGFTASMALFVD 383
>gi|289756959|ref|ZP_06516337.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|294996354|ref|ZP_06802045.1| hypothetical protein Mtub2_18086 [Mycobacterium tuberculosis 210]
gi|298524366|ref|ZP_07011775.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|289712523|gb|EFD76535.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|298494160|gb|EFI29454.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|326904907|gb|EGE51840.1| hypothetical protein TBPG_02828 [Mycobacterium tuberculosis W-148]
gi|339297527|gb|AEJ49637.1| hypothetical protein CCDC5180_0800 [Mycobacterium tuberculosis CCDC5180]
Length=386
Score = 585 bits (1508), Expect = 4e-165, Method: Compositional matrix adjust.
Identities = 311/383 (82%), Positives = 338/383 (89%), Gaps = 0/383 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LV
Sbjct 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
S LQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA AD
Sbjct 241 SHSLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
Query 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
KDLRL VERAAA LPG G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAG
Sbjct 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
Query 361 EIGPVAGHNALHGFTASMALFVD 383
EIGP+AG NALHGFTASMALFVD
Sbjct 361 EIGPIAGRNALHGFTASMALFVD 383
>gi|289744604|ref|ZP_06503982.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289685132|gb|EFD52620.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=385
Score = 576 bits (1484), Expect = 3e-162, Method: Compositional matrix adjust.
Identities = 307/383 (81%), Positives = 334/383 (88%), Gaps = 0/383 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LV
Sbjct 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
S LQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA AD
Sbjct 241 SHSLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
Query 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
KDLRL VERAAA LPG G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAG
Sbjct 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
Query 361 EIGPVAGHNALHGFTASMALFVD 383
EIGP+AG NAL GFTASM L D
Sbjct 361 EIGPIAGRNALQGFTASMGLVFD 383
>gi|307078563|ref|ZP_07487733.1| hypothetical protein TMKG_03909 [Mycobacterium tuberculosis SUMu011]
gi|308363552|gb|EFP52403.1| hypothetical protein TMKG_03909 [Mycobacterium tuberculosis SUMu011]
Length=290
Score = 568 bits (1465), Expect = 4e-160, Method: Compositional matrix adjust.
Identities = 289/290 (99%), Positives = 290/290 (100%), Gaps = 0/290 (0%)
Query 94 PAETFHLDFVRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVG 153
PAETFHLDFVRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVG
Sbjct 1 PAETFHLDFVRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVG 60
Query 154 GVVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVI 213
GVVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVI
Sbjct 61 GVVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVI 120
Query 214 TELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPT 273
TELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQG+FLIRGLLGADPT
Sbjct 121 TELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGNFLIRGLLGADPT 180
Query 274 TGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRM 333
TGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRM
Sbjct 181 TGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRM 240
Query 334 FGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD 383
FGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD
Sbjct 241 FGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD 290
>gi|15840288|ref|NP_335325.1| hypothetical protein MT0897 [Mycobacterium tuberculosis CDC1551]
gi|13880449|gb|AAK45139.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
Length=427
Score = 568 bits (1464), Expect = 6e-160, Method: Compositional matrix adjust.
Identities = 303/370 (82%), Positives = 329/370 (89%), Gaps = 0/370 (0%)
Query 14 RRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEPAALIGCVAQGIV 73
R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++P AL+GC+AQ IV
Sbjct 55 RQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIV 114
Query 74 AGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTAHDLHLLLPDPYSF 133
AGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA DLHLLLPDPY+F
Sbjct 115 AGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTF 174
Query 134 PSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHSVSVVS 193
PSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VGVRLPG V VVS
Sbjct 175 PSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVS 234
Query 194 QGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEH 253
QGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LVS GLQIGIVVDEH
Sbjct 235 QGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEH 294
Query 254 LAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAE 313
LA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA ADKDLRL VERAAA
Sbjct 295 LAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAAR 354
Query 314 LPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHG 373
LPG G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAGEIGP+AG NALHG
Sbjct 355 LPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHG 414
Query 374 FTASMALFVD 383
FTASMALFVD
Sbjct 415 FTASMALFVD 424
>gi|339293886|gb|AEJ45997.1| hypothetical protein CCDC5079_0807 [Mycobacterium tuberculosis CCDC5079]
Length=369
Score = 562 bits (1448), Expect = 4e-158, Method: Compositional matrix adjust.
Identities = 299/365 (82%), Positives = 324/365 (89%), Gaps = 0/365 (0%)
Query 19 EAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEPAALIGCVAQGIVAGRHE 78
EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++P AL+GC+AQ IVAGRHE
Sbjct 2 EAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIVAGRHE 61
Query 79 LENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLL 138
+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA DLHLLLPDPY+FPSNLL
Sbjct 62 IEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNLL 121
Query 139 IEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHSVSVVSQGCRP 198
IEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VGVRLPG V VVSQGCRP
Sbjct 122 IEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCRP 181
Query 199 IGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPG 258
IG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LVS LQIGIVVDEHLA PG
Sbjct 182 IGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHSLQIGIVVDEHLAAPG 241
Query 259 QGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPP 318
QGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA ADKDLRL VERAAA LPG
Sbjct 242 QGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRA 301
Query 319 VGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASM 378
G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAGEIGP+AG NALHGFTASM
Sbjct 302 AGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHGFTASM 361
Query 379 ALFVD 383
ALFVD
Sbjct 362 ALFVD 366
>gi|289573230|ref|ZP_06453457.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|289537661|gb|EFD42239.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium tuberculosis K85]
Length=320
Score = 536 bits (1380), Expect = 3e-150, Method: Compositional matrix adjust.
Identities = 297/315 (95%), Positives = 298/315 (95%), Gaps = 1/315 (0%)
Query 70 QGIVAGRHEL-ENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTAHDLHLLLP 128
+GIVAG E P PAETFHLDFVRTGSGALITGYRFDRTAHDLHLLLP
Sbjct 6 KGIVAGSPRAGERAPRWRCGWRPAHPAETFHLDFVRTGSGALITGYRFDRTAHDLHLLLP 65
Query 129 DPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHS 188
DPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHS
Sbjct 66 DPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHS 125
Query 189 VSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGI 248
VSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGI
Sbjct 126 VSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGI 185
Query 249 VVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVE 308
VVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVE
Sbjct 186 VVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVE 245
Query 309 RAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGH 368
RAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGH
Sbjct 246 RAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGH 305
Query 369 NALHGFTASMALFVD 383
NALHGFTASMALFVD
Sbjct 306 NALHGFTASMALFVD 320
>gi|308375278|ref|ZP_07667983.1| hypothetical protein TMGG_02939 [Mycobacterium tuberculosis SUMu007]
gi|308346735|gb|EFP35586.1| hypothetical protein TMGG_02939 [Mycobacterium tuberculosis SUMu007]
Length=347
Score = 489 bits (1258), Expect = 4e-136, Method: Compositional matrix adjust.
Identities = 263/330 (80%), Positives = 288/330 (88%), Gaps = 0/330 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LV
Sbjct 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
S GLQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA AD
Sbjct 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
Query 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRG 330
KDLRL VERAAA LPG G LLFTCNGRG
Sbjct 301 KDLRLTVERAAARLPGRAAGALLFTCNGRG 330
>gi|306796378|ref|ZP_07434680.1| hypothetical protein TMFG_03295 [Mycobacterium tuberculosis SUMu006]
gi|308343226|gb|EFP32077.1| hypothetical protein TMFG_03295 [Mycobacterium tuberculosis SUMu006]
Length=209
Score = 407 bits (1046), Expect = 2e-111, Method: Compositional matrix adjust.
Identities = 209/209 (100%), Positives = 209/209 (100%), Gaps = 0/209 (0%)
Query 175 TSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAP 234
TSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAP
Sbjct 1 TSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAP 60
Query 235 DEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVR 294
DEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVR
Sbjct 61 DEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVR 120
Query 295 DAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLA 354
DAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLA
Sbjct 121 DAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLA 180
Query 355 GFFAAGEIGPVAGHNALHGFTASMALFVD 383
GFFAAGEIGPVAGHNALHGFTASMALFVD
Sbjct 181 GFFAAGEIGPVAGHNALHGFTASMALFVD 209
>gi|289568838|ref|ZP_06449065.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289542592|gb|EFD46240.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=304
Score = 392 bits (1006), Expect = 7e-107, Method: Compositional matrix adjust.
Identities = 212/266 (80%), Positives = 233/266 (88%), Gaps = 0/266 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
Query 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
VRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LV
Sbjct 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
Query 241 SRGLQIGIVVDEHLAVPGQGDFLIRG 266
S GLQIGIVVDEHLA PGQGDF+IRG
Sbjct 241 SHGLQIGIVVDEHLAAPGQGDFVIRG 266
>gi|289744342|ref|ZP_06503720.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289684870|gb|EFD52358.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=201
Score = 389 bits (1000), Expect = 3e-106, Method: Compositional matrix adjust.
Identities = 199/201 (99%), Positives = 200/201 (99%), Gaps = 0/201 (0%)
Query 183 LPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSR 242
+PGAH VSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSR
Sbjct 1 MPGAHRVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSR 60
Query 243 GLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKD 302
GLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKD
Sbjct 61 GLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKD 120
Query 303 LRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEI 362
LRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEI
Sbjct 121 LRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEI 180
Query 363 GPVAGHNALHGFTASMALFVD 383
GPVAGHNALHGFTASMALFVD
Sbjct 181 GPVAGHNALHGFTASMALFVD 201
>gi|306796379|ref|ZP_07434681.1| hypothetical protein TMFG_03296 [Mycobacterium tuberculosis SUMu006]
gi|308343156|gb|EFP32007.1| hypothetical protein TMFG_03296 [Mycobacterium tuberculosis SUMu006]
Length=181
Score = 353 bits (906), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 180/181 (99%), Positives = 181/181 (100%), Gaps = 0/181 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 V 181
V
Sbjct 181 V 181
>gi|289749395|ref|ZP_06508773.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289689982|gb|EFD57411.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=311
Score = 353 bits (906), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 198/248 (80%), Positives = 213/248 (86%), Gaps = 0/248 (0%)
Query 100 LDFVRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGG 159
+DFVRTGSGALITGYRFDRTA DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGG
Sbjct 1 MDFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGG 60
Query 160 RRRGDTRLFRDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGR 219
RRRGDTRLFRD DVLTSG+VGVRLPG V VVSQGCRPIG PYIVTGADG +ITELGGR
Sbjct 61 RRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGR 120
Query 220 PPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGI 279
PPL RLREIV G++PDE+ LVS GLQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I
Sbjct 121 PPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEI 180
Query 280 GEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDH 339
EVV+VGAT+QFQVRDAA ADKDLRL VERAAA LPG G LFTC+ R +FGV
Sbjct 181 DEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRAAGAPLFTCHARRTTIFGVPRP 240
Query 340 DASTIEDL 347
TIE+L
Sbjct 241 RRVTIEEL 248
>gi|289744343|ref|ZP_06503721.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289684871|gb|EFD52359.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=168
Score = 329 bits (844), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 167/168 (99%), Positives = 168/168 (100%), Gaps = 0/168 (0%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLF 168
HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLF
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLF 168
>gi|283778153|ref|YP_003368908.1| hypothetical protein Psta_0358 [Pirellula staleyi DSM 6068]
gi|283436606|gb|ADB15048.1| domain of unknown function DUF1745 [Pirellula staleyi DSM 6068]
Length=400
Score = 286 bits (733), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 171/382 (45%), Positives = 223/382 (59%), Gaps = 7/382 (1%)
Query 7 VSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEPAALIG 66
+S+ D A A A + TP L ++ S H +A L + A + LIG
Sbjct 18 LSSTADAVEEVARKALTALQSSGPRTPDLGLVFFSNHHAPEADFLAKKLCALLGTENLIG 77
Query 67 CVAQGIVAGRHELENEPAVAVWLASGPP--AETFHLDFVRTGSGALITGY----RFDRTA 120
C + IV E+E PA+++WLAS A +L +T G +I G+ + +
Sbjct 78 CSGESIVGTGVEVEGSPAISLWLASFATGTATPMYLHLEQTAEGGVIDGWPEAISGEWSG 137
Query 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
LLL +PYSFP++LL+E LN D G VVGG+ SGG G+ RL G V
Sbjct 138 DTFLLLLGEPYSFPADLLLERLNEDRAGVPVVGGMASGGDSPGEHRLILGPQTYAEGAVA 197
Query 181 VRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQEL 239
V + A + +VVSQGCRPIG+P+IVT A+ VI ELGGRP L +L+E+ + EQ L
Sbjct 198 VLIQNAAKLHTVVSQGCRPIGKPFIVTRAERNVIQELGGRPALLQLKELFDTLPTREQAL 257
Query 240 VSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAA 299
V R L +G VV E+ QGDFL+R ++G DP GAI IG+ + VG TVQF VRD AA
Sbjct 258 VQRKLHLGRVVSEYRDHFEQGDFLVRNVVGIDPQAGAIAIGDYIRVGQTVQFHVRDQDAA 317
Query 300 DKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAA 359
D +L+ + A + G PVG LLFTCNGRG RMF HDA+ I + LG IPLAGFFAA
Sbjct 318 DAELKQLLAVAKSGAAGVPVGALLFTCNGRGSRMFKEPHHDAACIAEKLGDIPLAGFFAA 377
Query 360 GEIGPVAGHNALHGFTASMALF 381
GEIGP+ G N +HGFTAS+ +F
Sbjct 378 GEIGPIGGQNFVHGFTASIVIF 399
>gi|87306450|ref|ZP_01088597.1| hypothetical protein DSM3645_08962 [Blastopirellula marina DSM
3645]
gi|87290629|gb|EAQ82516.1| hypothetical protein DSM3645_08962 [Blastopirellula marina DSM
3645]
Length=395
Score = 270 bits (689), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 157/388 (41%), Positives = 218/388 (57%), Gaps = 11/388 (2%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
++ +ST A A+ A E+L+ LA + S H D+ + + +
Sbjct 6 LKFAAALSTHEATEDAIAQVVREALEQLSAPV-DLAFVFVSPQHADKLETIATQLCGLLG 64
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAET--FHLDFVRTGSGALITGYR--- 115
L G + IV E+E PA+++WLA P E HL+F RT G G+
Sbjct 65 TENLFGGTGEAIVGVGREIEQAPAISLWLAHLPGVEVTPMHLEFQRTPDGGSFIGWSGKL 124
Query 116 -FDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVL 174
LL+ +P+SFP++ L+ +N D PG ++GG+ SGG G+ L R+V
Sbjct 125 PLQWPKEATLLLMGEPFSFPADALLARMNEDQPGIPIIGGMASGGHAPGENLLVHGREVK 184
Query 175 TSGLVGVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMA 233
+G + L GA V SVVSQGCRPIGEP ++T ++ I LGGRPPL +REI +
Sbjct 185 KTGASAIYLHGAVRVRSVVSQGCRPIGEPMVITKSERNEIHLLGGRPPLEIIREIFAQLP 244
Query 234 PDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQV 293
+Q+LV+RGL IG VVDE+ GDF+IR ++G + TG I +G+ V G T+QF V
Sbjct 245 TSDQQLVNRGLHIGQVVDEYREKFEPGDFIIRNVIGVNQETGGIAVGDYVRPGQTIQFHV 304
Query 294 RDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPL 353
RD +AD DL+ + A E G P+G L+FTCNGRG R+F HDA ++ G IP
Sbjct 305 RDENSADADLK---QLLATESSGQPLGALVFTCNGRGTRLFSAPHHDAECLQAACGDIPA 361
Query 354 AGFFAAGEIGPVAGHNALHGFTASMALF 381
AG FA GE+GP+AG N +HGFTAS+ALF
Sbjct 362 AGIFAMGELGPIAGQNFMHGFTASLALF 389
>gi|302035705|ref|YP_003796027.1| hypothetical protein NIDE0322 [Candidatus Nitrospira defluvii]
gi|300603769|emb|CBK40101.1| conserved exported protein of unknown function [Candidatus Nitrospira
defluvii]
Length=408
Score = 268 bits (685), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 164/389 (43%), Positives = 228/389 (59%), Gaps = 6/389 (1%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+R ++ DV+ AA E RE+L +A L S H DQA L A++ ++
Sbjct 9 LRFASALTRHADVQTAADELIRSIREQLGSSRIDVAFLFISVQHADQAETLSHALRTALG 68
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGP--PAETFHLDFVRTGSGALITGY---R 115
P L+GC +G++A E+E PA +W A P A L F + +
Sbjct 69 PDTLVGCTGEGVIATGREVETGPAATLWAAHLPGVIAHPLRLSFSSVHDQFSLRDWPDLD 128
Query 116 FDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLT 175
+ + + LL DP+S P ++ + P +GG+ GG+ + RLF D +V +
Sbjct 129 YGGESAPVMLLFADPFSTPLQDVLGLIEERYPHARALGGLAGGGQDLAENRLFLDDEVYS 188
Query 176 SGLVGVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAP 234
GLVGV L G SV +V+SQGCRPIG+ +IVT A+ VI ELGG P LH L+ + ++
Sbjct 189 DGLVGVALSGNISVRTVISQGCRPIGDRFIVTKAEHNVIQELGGIPALHCLQTVFGQLSM 248
Query 235 DEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVR 294
DE+ R L IGI +DE A +GDFLIR LLGAD TGAI +G+V++ G TVQFQVR
Sbjct 249 DERAQAQRALHIGIAMDEQRAQFTRGDFLIRNLLGADQQTGAIVVGDVIQEGQTVQFQVR 308
Query 295 DAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLA 354
DA +AD+DL + + + P+G LLF+C GRG+ +FGV +HDAS + + LG IPLA
Sbjct 309 DAQSADEDLHALLAASRLDESQRPLGALLFSCCGRGKGLFGVPNHDASVLGEQLGAIPLA 368
Query 355 GFFAAGEIGPVAGHNALHGFTASMALFVD 383
GFFA GE+GPV G N LHG+TAS+A+F +
Sbjct 369 GFFAQGELGPVGGRNFLHGYTASIAIFSE 397
>gi|271969747|ref|YP_003343943.1| hypothetical protein Sros_8558 [Streptosporangium roseum DSM
43021]
gi|270512922|gb|ACZ91200.1| conserved hypothetical protein [Streptosporangium roseum DSM
43021]
Length=398
Score = 268 bits (684), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 153/330 (47%), Positives = 205/330 (63%), Gaps = 4/330 (1%)
Query 55 VQASVEPAALIGCVAQGIVAGRHELENEPAVAVWLAS--GPPAETFHLDFVRTGSGALIT 112
V + A++IGC A G++ +E P+V+VW A+ G TF LD +RT ++
Sbjct 58 VMSMASDASVIGCSATGVIGDGQGIEVTPSVSVWAATLEGARLTTFALDTLRTDDRFVVV 117
Query 113 GYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRD 172
G +L DPYSFP++ +E L ++GG+ + + RG RLF D +
Sbjct 118 GLPERHPDDHAAILFADPYSFPTDGFVERSQEVLGDLPLIGGLANAIQGRGAVRLFADGE 177
Query 173 VLTSGLVGVRLPGAHSVS-VVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLG 231
+ T G VGV L G ++S VVSQGCRPIG VT + ++ EL G+P L RL EIV
Sbjct 178 IYTEGAVGVLLSGPVNISTVVSQGCRPIGPTMAVTAVEDNLLLELAGQPALARLEEIVSA 237
Query 232 MAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQF 291
+ D+++LV+ GLQIGI +DE+ +GDFLIRG+LG DP A+ IG+VVE+G TV+F
Sbjct 238 LDEDDRDLVASGLQIGIAMDEYAERHERGDFLIRGVLGIDPEREAVAIGDVVEIGRTVRF 297
Query 292 QVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGI 351
QVRDAA AD+DL ++ E G G LLF+CNGRG MFG DHDA + D LG I
Sbjct 298 QVRDAATADEDLYELLDAHREEF-GRVDGALLFSCNGRGSAMFGTADHDAVALRDTLGPI 356
Query 352 PLAGFFAAGEIGPVAGHNALHGFTASMALF 381
+AGFFAAGE+GPV GHN +HGFTAS+ +F
Sbjct 357 SVAGFFAAGEVGPVGGHNHVHGFTASVLVF 386
>gi|325111105|ref|YP_004272173.1| hypothetical protein Plabr_4580 [Planctomyces brasiliensis DSM
5305]
gi|324971373|gb|ADY62151.1| domain of unknown function DUF1745 [Planctomyces brasiliensis
DSM 5305]
Length=407
Score = 266 bits (679), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 149/387 (39%), Positives = 216/387 (56%), Gaps = 6/387 (1%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
++I V ST + RA E E+L G P L L S H D L +++ +
Sbjct 1 MKIHVQYSTEAETPRAVDEVVNGLLEKLDGAHPELTFLFVSHHHEDHFSTLAGQIRSRLN 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLA--SGPPAETFHLDFVRTGSGALITGYRFDR 118
L+G A+GIVAG ELE P + ++ SG + FH++F R L G +
Sbjct 61 SKHLVGSTAEGIVAGDRELEERPGLVAYVIADSGAVIQPFHMEFQRDDEQILCFGGPENI 120
Query 119 TAHDLH---LLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLT 175
+ + L +PYS + + + L+ + GGV SGG G+ LF D + +
Sbjct 121 GSEGDNGAVFLFCEPYSSSAPVALPELSESQGHLPIFGGVASGGIGPGENCLFLDGEKID 180
Query 176 SGLVGVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAP 234
G +GV + +VSQGCRPIG +++T ++ +I ELGG P + + RE+ +
Sbjct 181 HGAIGVVYRCKQKLRQIVSQGCRPIGYTFVITKSEKNIIYELGGLPAMQQFREMFKELTE 240
Query 235 DEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVR 294
D+QELV +G +G+V +E+ + +GDFL+ +LG+DP +GAI + + V G TVQF VR
Sbjct 241 DDQELVRQGPHLGVVTNEYKEIFERGDFLVSNVLGSDPESGAIAVSQAVRPGRTVQFHVR 300
Query 295 DAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLA 354
DA AD+DLRL +E+ + +G LLFTCNGRG ++FG +HD I+D G IP A
Sbjct 301 DAITADEDLRLMIEQDKSYHSNKVIGSLLFTCNGRGEKLFGAANHDVKAIQDAYGPIPTA 360
Query 355 GFFAAGEIGPVAGHNALHGFTASMALF 381
GFFA GEIGP+A + LHGFTAS+ LF
Sbjct 361 GFFAQGEIGPLADRSYLHGFTASIVLF 387
>gi|297171923|gb|ADI22910.1| uncharacterized protein conserved in bacteria [uncultured Rhizobium
sp. HF0500_35F13]
Length=395
Score = 265 bits (676), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 153/389 (40%), Positives = 216/389 (56%), Gaps = 10/389 (2%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
R +S + D ++A E + R P L V+ S H + A L A + ++
Sbjct 7 TRFASALSESVDWQQAVDEVCSQVRGP-DDPPPDLVVMFFSSDHAEVAEQLAAEIHRRLQ 65
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETF--HLDFVRTGSGALITGYRFDR 118
AL+G A+ ++ E+E +PA+++W P A LDF RT G +I G+ D
Sbjct 66 CDALLGTSAESVLGRGQEVEQQPALSLWAGWLPGASLLPMKLDFERTPEGGVILGWP-DD 124
Query 119 TAHDLH-----LLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDV 173
D L+L DP+SFP LL+E N D PG + GG+ SG G++RL D
Sbjct 125 LPQDWQDPAALLVLADPFSFPMELLLERFNADQPGMPICGGMASGCSVPGESRLVLAGDC 184
Query 174 LTSGLVGVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGM 232
++ G V VRL G + ++VSQGCRPIGE ++T ++ V+ +L G + RL+E+ +
Sbjct 185 MSEGAVAVRLGGELKIRTLVSQGCRPIGEHMVITQSEHNVVQQLRGESAMLRLKEVFDRL 244
Query 233 APDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQ 292
++QE V +GL +G VV E+ QGDFLIR ++G DP G I + + + G TVQF
Sbjct 245 PANDQERVQQGLFLGRVVSEYQDDFEQGDFLIRNVIGMDPEQGTITVADYMRAGQTVQFH 304
Query 293 VRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIP 352
+RD A +L + A+ P GGLLFTCNGRG R+F HDA+ ++ L IP
Sbjct 305 IRDQETASAELVQLLSSLQADDSFQPAGGLLFTCNGRGSRLFDTPHHDATMVQQHLADIP 364
Query 353 LAGFFAAGEIGPVAGHNALHGFTASMALF 381
LAGFFA GEIGP+ G N LHGFTAS+ LF
Sbjct 365 LAGFFAQGEIGPIGGENFLHGFTASVILF 393
>gi|284044707|ref|YP_003395047.1| hypothetical protein Cwoe_3254 [Conexibacter woesei DSM 14684]
gi|283948928|gb|ADB51672.1| domain of unknown function DUF1745 [Conexibacter woesei DSM 14684]
Length=385
Score = 264 bits (674), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 168/382 (44%), Positives = 212/382 (56%), Gaps = 3/382 (0%)
Query 2 RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEP 61
RIG G+ST D R A EAA A LAG +A++ + +H L V ++ P
Sbjct 4 RIGTGISTHGDARVGAIEAAHAAGVALAGERADVAIVFAAGAHLAAPEATLEGVHEALRP 63
Query 62 AALIGCVAQGIVAGRHELENEPAVAVWLAS--GPPAETFHLDFVRTGSGALITGYRFDRT 119
LIGC A G++ E E AVAVW AS A TFH + +TG D
Sbjct 64 PELIGCGAGGVLGCGAEHEGGTAVAVWAASLGDGHATTFHASAEQLDDSIAVTGME-DLA 122
Query 120 AHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLV 179
+LLPDP+SFP++ L++ L T PG +VGG+ S G T LF V SG V
Sbjct 123 GSRGAILLPDPFSFPTDALLQDLATRAPGVPIVGGLASARTAEGATALFHGERVCESGAV 182
Query 180 GVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQEL 239
GVR G + VSQG P+G VT A+G VI EL GRP L +RE++ + E+EL
Sbjct 183 GVRFDGVELLPCVSQGATPVGPEMTVTAAEGNVIAELAGRPALDHIRELIEQLDAREREL 242
Query 240 VSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAA 299
V+ GL +G+V+D GDFL+RGLLGADP G I I VE G ++ RDAA A
Sbjct 243 VAGGLLVGVVLDGGKPEYSHGDFLVRGLLGADPVAGTIAIAAPVEPGQVLRLHARDAAEA 302
Query 300 DKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAA 359
D+D + L G P G L F+C+ RGR MFGV DHDA + D L G P AGFFAA
Sbjct 303 DRDFHDQLRVRVEALGGAPAGALAFSCHSRGREMFGVADHDAGMLADELAGAPSAGFFAA 362
Query 360 GEIGPVAGHNALHGFTASMALF 381
GEIGPV G + +H FTA++ALF
Sbjct 363 GEIGPVGGASFMHSFTATVALF 384
>gi|296121655|ref|YP_003629433.1| hypothetical protein Plim_1400 [Planctomyces limnophilus DSM
3776]
gi|296013995|gb|ADG67234.1| domain of unknown function DUF1745 [Planctomyces limnophilus
DSM 3776]
Length=398
Score = 242 bits (618), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 154/393 (40%), Positives = 214/393 (55%), Gaps = 15/393 (3%)
Query 2 RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEP 61
R +T + RA + A + +L G P L ++ S + D +L A + ++
Sbjct 7 RYAAAWTTEVSLVRAMEQVAIEIQSQLEGRHPDLLLVFCSHHYADAWQNLSAGLVSTTGA 66
Query 62 AALIGCVAQGIVAGRHELENEPAVAVWLAS--GPPAETFHLDFVRTGSGALITGYR---- 115
L+GC + IVA ELEN PA+++W AS G F F RT G + TG
Sbjct 67 KVLLGCSGESIVATGRELENGPALSIWAASWDGVGMIPFQATFERTPDGIVTTGLPQGVN 126
Query 116 -FDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFR----- 169
+ ++L DPYS ++L+ +HL DLP V+GG+ SGG + RLF
Sbjct 127 GLLQGNARCAIVLADPYSSLTDLITDHLAEDLPNLPVIGGMASGGGPG-ENRLFYAHKAI 185
Query 170 DRDVLTSGLVGVRLPGAHSVS-VVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREI 228
+ V G +GV L G + + VVSQGC+P+G Y+VT AD I ELGG PPL RL ++
Sbjct 186 EPQVFEEGAIGVILSGNLTFTPVVSQGCKPVGTTYVVTKADRNFIVELGGEPPLARLEQL 245
Query 229 VLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGAT 288
++ +Q L+ GL +G+ + E+ +GDFLI ++GAD TG + IG VG T
Sbjct 246 YADLSATDQRLIENGLHLGLAMTEYRDQFRRGDFLIANVIGADRNTGVLAIGGKARVGQT 305
Query 289 VQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLL 348
VQF +RD A +DL ++ A + P P LLFTCNGRG R+F HDA +E+
Sbjct 306 VQFHLRDHVTASEDLVEMLKTARSSHPAPQ-AALLFTCNGRGTRLFSAPHHDAQKLEEFF 364
Query 349 GGIPLAGFFAAGEIGPVAGHNALHGFTASMALF 381
G IP+AGFFA GE+G V N LHGFTAS+ LF
Sbjct 365 GSIPVAGFFAQGELGQVGTKNFLHGFTASIGLF 397
>gi|296271068|ref|YP_003653700.1| hypothetical protein Tbis_3113 [Thermobispora bispora DSM 43833]
gi|296093855|gb|ADG89807.1| domain of unknown function DUF1745 [Thermobispora bispora DSM
43833]
Length=397
Score = 242 bits (618), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 156/386 (41%), Positives = 215/386 (56%), Gaps = 9/386 (2%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAG--GTPALAVLLGSRSHTDQAVDLLAAVQAS 58
R G++ D+ AA A R LAG G P L D+ V
Sbjct 6 CRFADGLAVGGDLEEAAETAV---RRALAGLSGPPDLLCFFICGQDPDEVGRAGLRVMDM 62
Query 59 VEPAALIGCVAQGIVAGRHELENEPAVAVWLASGPPA--ETFHLDFVRTGSGALITGYRF 116
A +IGC A G++ G +E PAV+ A A TF L+ RT ++ G
Sbjct 63 APTAEVIGCSATGVIGGDRGIELRPAVSALAACFGEAAVTTFALETFRTEDRFVVVGLPE 122
Query 117 DRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTS 176
A +L DPYSFP + +E + G +VGG+ +G + G RLF +V T
Sbjct 123 RGPADRAMILFTDPYSFPVDAFVERSGEVIGGLPIVGGLANGWQGPGSVRLFAGGEVYTE 182
Query 177 GLVGVRLPGAHSVS-VVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPD 235
G VG + G +V+ +VSQGCRPIG +VT A ++ EL G P L RL +IV + +
Sbjct 183 GAVGAVISGPVNVTAMVSQGCRPIGPSMVVTRAQENLLLELAGEPALARLEDIVSALDEE 242
Query 236 EQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRD 295
++ELV+ GLQIG+V+DE+ +GDFLIRG++G DP ++ IG+++E+G TV+FQVRD
Sbjct 243 DRELVAAGLQIGVVMDEYAERQERGDFLIRGVIGIDPERESVAIGDMLEIGRTVRFQVRD 302
Query 296 AAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAG 355
A AD+DLR A+ + G G LL CNGRG MFG DHD + + LG I +AG
Sbjct 303 AETADEDLR-AILDEHKPMIGRAEGALLICCNGRGSAMFGTADHDPVAVREALGPIGVAG 361
Query 356 FFAAGEIGPVAGHNALHGFTASMALF 381
FFAAGE+GPVAGHN +HG +A++ +F
Sbjct 362 FFAAGEVGPVAGHNHVHGCSAALLVF 387
>gi|269125309|ref|YP_003298679.1| hypothetical protein Tcur_1055 [Thermomonospora curvata DSM 43183]
gi|268310267|gb|ACY96641.1| domain of unknown function DUF1745 [Thermomonospora curvata DSM
43183]
Length=389
Score = 241 bits (616), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 160/385 (42%), Positives = 213/385 (56%), Gaps = 9/385 (2%)
Query 2 RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRS--HTDQAVDLLAAVQASV 59
R G G++ PD+ AA A A E L+ + V L +A V +
Sbjct 3 RFGDGLALGPDLIGAAESAVKQALEPLSAPPDLVCVFLACEDVGAVGEAARRAMRVADAA 62
Query 60 EPAALIGCVAQGIVAGRHELENEPAVAVWLASGPPA--ETFHLDFVRTGSGALITGYRFD 117
+IGC G++ G +E AV+ W P A E F L+ +R ++ G
Sbjct 63 GARLVIGCNGSGVIGGDRGVEETSAVSAWAGVLPGAHLEPFRLETLRAEDRLVVVGMPEG 122
Query 118 RTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSG 177
+ +LL DPYSFP + +E LPG +VG + G TRL D +V G
Sbjct 123 SDEDVVAVLLADPYSFPVDAFVERSEEALPGLPMVGALAGGQGAG-RTRLLLDGEVYDDG 181
Query 178 LVGVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDE 236
VGV L G S +VVSQG RPIG +VT AD V+ EL G P L +L +IVL + +E
Sbjct 182 AVGVVLGGPISAATVVSQGARPIGPDMVVTKADENVLYELAGTPALEKLEQIVLALPEEE 241
Query 237 QELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDA 296
Q++ S+GL IG+ +DE+ GDFL+RG++GAD TGAI IG+VVEVG TV+FQVRDA
Sbjct 242 QQMASQGLLIGVAMDEYAEQHEHGDFLVRGVVGADADTGAIAIGDVVEVGRTVRFQVRDA 301
Query 297 AAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGF 356
AA++DL ++R + P G LLF+CNGRGR MF +DHD + G + GF
Sbjct 302 EAAEEDLTALLQRFDLK---PVEGALLFSCNGRGRAMFPDSDHDVKLLRRTFGPAGVGGF 358
Query 357 FAAGEIGPVAGHNALHGFTASMALF 381
FAAGEIGPV+G N +HGFTAS+ F
Sbjct 359 FAAGEIGPVSGRNHVHGFTASILAF 383
>gi|72160848|ref|YP_288505.1| hypothetical protein Tfu_0444 [Thermobifida fusca YX]
gi|71914580|gb|AAZ54482.1| conserved hypothetical protein [Thermobifida fusca YX]
Length=412
Score = 240 bits (612), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 163/384 (43%), Positives = 210/384 (55%), Gaps = 6/384 (1%)
Query 2 RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEP 61
R ++T D+ AA A A E L G + V + + A+ A+ A E
Sbjct 29 RFSDALATGVDLVSAAERATRQALERLDGPADLVCVFVSGIDPEEVALAGERAM-ALAEG 87
Query 62 AALIGCVAQGIVAGRHELENEPAVAVWLASGP--PAETFHLDFVRTGSGALITGYRFDRT 119
A IGC A G++ G E + AV+VW A P F L + G + G
Sbjct 88 ATTIGCSAGGVIGGGRGTEGQGAVSVWAAMLPGVTMTPFELAAIAEGDQLAVIGVLEPTP 147
Query 120 AHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLV 179
A LLL +PY FP++ +EH NT L G +VGG+ G RLF + + +G V
Sbjct 148 ADQAALLLANPYVFPTHTFVEHSNTILDGLPIVGGLADGTYGGDSVRLFLQGETVQAGAV 207
Query 180 GVRLPGAHSV--SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQ 237
G+ L G + V +VVSQGCRPIG +VT A+ V+ EL G P +L IV + P+EQ
Sbjct 208 GL-LFGGNGVLGTVVSQGCRPIGPSMVVTKAEDNVLIELAGTPAYAKLESIVSALPPEEQ 266
Query 238 ELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAA 297
+LV+ GL IGI +DE+ GDFLIRG+L ADP I IG+VV+VG TV+FQVRD A
Sbjct 267 QLVADGLHIGIAIDEYADRHESGDFLIRGVLDADPEQSTITIGDVVDVGQTVRFQVRDQA 326
Query 298 AADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFF 357
AD DL + A + G G LLF+CNGRG MF DHD ++ +LG + GFF
Sbjct 327 TADSDLLERLRLFAHDTGGTAEGALLFSCNGRGSGMFPSADHDVRRVQQILGIDAVGGFF 386
Query 358 AAGEIGPVAGHNALHGFTASMALF 381
AAGEIGPVAG N LHGFTA M F
Sbjct 387 AAGEIGPVAGRNHLHGFTACMLAF 410
>gi|117929098|ref|YP_873649.1| hypothetical protein Acel_1891 [Acidothermus cellulolyticus 11B]
gi|117649561|gb|ABK53663.1| domain of unknown function DUF1745 [Acidothermus cellulolyticus
11B]
Length=391
Score = 236 bits (603), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 151/357 (43%), Positives = 198/357 (56%), Gaps = 5/357 (1%)
Query 28 LAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEPAALIGCVAQGIVAGRHELENEPAVAV 87
L G P LA++ + L A+V +IGC A G++ +E A +V
Sbjct 29 LGGHNPDLALVFVCGDDPAETARALERAAAAVHARTVIGCSASGVIGAGRAVERRAAASV 88
Query 88 W--LASGPPAETFHLDFVRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTD 145
W + G FHL+ +RT G + G A L ++L DPYSFP++ +E N
Sbjct 89 WAGVLPGVRIRAFHLEVIRTPQGMAVLGLPPVDDADVLGIVLADPYSFPADGFVEQANRT 148
Query 146 LPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHSV-SVVSQGCRPIGEPYI 204
+ +VGG+ G G TRL DR + G VGV L G V + VSQGCRPIG P
Sbjct 149 V-SVPLVGGMAFGAAGPGSTRLSLDRRSVERGAVGVLLGGPVGVRTAVSQGCRPIGPPMT 207
Query 205 VTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLI 264
VT A V+ EL G P + +L ++ ++ ++Q L S GLQIGI +DE+ GDFL+
Sbjct 208 VTAARDNVLLELAGMPAVRKLERVLAELSAEDQALASAGLQIGIAMDEYAEDHDMGDFLV 267
Query 265 RGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLF 324
RG+LG DP I IG+VV VG TV+F VRDAA+A DLR V+R E LLF
Sbjct 268 RGILGIDPARQGIAIGDVVPVGRTVRFHVRDAASAGDDLRSTVKRLREEFTAVE-SALLF 326
Query 325 TCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALF 381
+CNGRG +F HD S + +LG +AGFFAAGEIGPVAG LHGF+AS+A F
Sbjct 327 SCNGRGSHLFPDAAHDVSVVRGVLGVQAVAGFFAAGEIGPVAGRTYLHGFSASIAAF 383
>gi|297559074|ref|YP_003678048.1| hypothetical protein Ndas_0091 [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
gi|296843522|gb|ADH65542.1| domain of unknown function DUF1745 [Nocardiopsis dassonvillei
subsp. dassonvillei DSM 43111]
Length=383
Score = 230 bits (586), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 154/384 (41%), Positives = 212/384 (56%), Gaps = 8/384 (2%)
Query 2 RIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEP 61
R G ++T D+ AA A A E++ G T L + ++ V
Sbjct 3 RFGDALTTGADLVNAAERAVLSALEQVDGPTD-LVCFFVCGADPEEVTLAGKRVMELAGD 61
Query 62 AALIGCVAQGIVAGRHELENEPAVAVWLASGPPAET--FHLDFVRTGSGALITGYRFDRT 119
AA +GC + G++ G +E + +V+VW A P E F LD V + G +
Sbjct 62 AATLGCSSTGVIGGGRSVEGQGSVSVWCAGLPGVEITPFRLDTVVEDDHLAVIGMQEPGP 121
Query 120 AHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLV 179
+ +LL +PY FP+ + L G +VGG+ G R RLF D +V G +
Sbjct 122 RDSVAILLTNPYEFPTQAFVRESTEALGGLPLVGGMADGMRGEESVRLFCDGEVAEHGAI 181
Query 180 GVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQE 238
GV + G + + +VVSQGCRPIG P VT A+G ++ EL G +L E+V ++ +++E
Sbjct 182 GVLVGGENVLGTVVSQGCRPIGSPMTVTKAEGNLLLELAGTNAYEKLEELVESLSEEDRE 241
Query 239 LVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAA 298
L GL IGI +DE++ QGDFLIR L GADP GA+ I ++VEVG TV+FQVRDA
Sbjct 242 LAEHGLHIGIAMDEYVDRHEQGDFLIRTLAGADPELGALTIDDMVEVGQTVRFQVRDAGT 301
Query 299 ADKDLRLAVERAAAELPGPPVG-GLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFF 357
AD+DL + AE PVG GLLF+CNGRG +F +DHD + +LG +AGFF
Sbjct 302 ADEDLARRLSDFGAE---HPVGAGLLFSCNGRGSSLFPQSDHDVLAVHRVLGVDAVAGFF 358
Query 358 AAGEIGPVAGHNALHGFTASMALF 381
AAGEIGPV G N +HGFTA + F
Sbjct 359 AAGEIGPVGGVNHVHGFTACLLAF 382
>gi|149923652|ref|ZP_01912048.1| hypothetical protein PPSIR1_16925 [Plesiocystis pacifica SIR-1]
gi|149815467|gb|EDM75004.1| hypothetical protein PPSIR1_16925 [Plesiocystis pacifica SIR-1]
Length=409
Score = 226 bits (575), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 145/403 (36%), Positives = 211/403 (53%), Gaps = 22/403 (5%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+R + +P + A A E+L P L + +R H + ++ A++
Sbjct 1 MRWAASIDNSPTLEVALARGEESLSEQLGDQRPDLVLAFATRDHQARWHEIPEALRQRFP 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAET--FHLDF-----VRTGSGALITG 113
AA++GC A G++A ELE+ P +A+ A P E FH+D + GSG
Sbjct 61 DAAVVGCSAGGVLANGTELEDGPGLALCAARLPGVERTPFHIDAEALEALVGGSGDSGES 120
Query 114 YRFDRTAHDLH------------LLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRR 161
R D A L +L PDP+S+P ++ L+ P TVVGG+ SGG R
Sbjct 121 ERDDLRARWLAAIGIAEGPDPLLMLFPDPFSWPGPEVLGSLDRAFPQGTVVGGLASGGAR 180
Query 162 RGDTRLFRDRDVLTSGLVGVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRP 220
G+ RLF DR G+VG+ L G V ++V+QGCRP+G P VT ++ EL GRP
Sbjct 181 PGEHRLFCDRSTHHRGMVGLALRGNLEVETIVAQGCRPVGAPMFVTRRQANIVYELDGRP 240
Query 221 PLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIG 280
+ L+++ + PD++ L IG+ + L V QGDFL+R L+G DP++GA+GI
Sbjct 241 AVEALQQLFTTLEPDDRARARTSLLIGLSMHPQLEVHDQGDFLVRNLIGVDPSSGAVGIA 300
Query 281 EVVEVGATVQFQVRDAAAADKDLR-LAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDH 339
+ VQF +RDA A +L LA E P LLF+C GRG ++G T H
Sbjct 301 AELHGHPVVQFHLRDAQTAASELHDLAAEHQRIHGERAPAVALLFSCLGRGEHLYGRTGH 360
Query 340 DASTIEDLLGG-IPLAGFFAAGEIGPVAGHNALHGFTASMALF 381
D+ + + LG +PLAGFF GEIGP+AG +HG+T+S+ L
Sbjct 361 DSEVLREHLGATLPLAGFFCNGEIGPIAGRTFMHGYTSSILLL 403
>gi|223939736|ref|ZP_03631608.1| protein of unknown function DUF1745 [bacterium Ellin514]
gi|223891607|gb|EEF58096.1| protein of unknown function DUF1745 [bacterium Ellin514]
Length=396
Score = 223 bits (567), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 136/381 (36%), Positives = 205/381 (54%), Gaps = 9/381 (2%)
Query 12 DVRRAAAEAAAHA-REELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEPAALIGCVAQ 70
+ AA +A A R EL +L ++ S QA +L ++ + L GC +
Sbjct 14 EFEEAAFQAWARKLRAELHAPKVSLGLVFMSPKMFPQAEQILEILRVDGQIPLLAGCSSN 73
Query 71 GIVAGRHELENEPAVAVWLASGPPAETFHLDFVR----TGSGALITGYRFDRTAHDLH-- 124
++ G HE E++ + V L S P AE F + GSG ++ T +
Sbjct 74 SLITGVHEFEDDGGLVVALYSLPGAELKAFRFTQADLEQGSGRAYWQHKTGVTPEQTNGW 133
Query 125 LLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVGVRLP 184
L DP++ + N ++GG+ SG + T+L+ + +V G V + +
Sbjct 134 LAFADPFNMDCEAWLGSWNEAYAPAPILGGLASGEQTTQQTQLYLNGEVYEEGGVAISIG 193
Query 185 G-AHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRG 243
G V V+SQGC PIG+ + +T + +I E+G RP L E + DEQ+
Sbjct 194 GDVKLVGVISQGCTPIGDTWTLTKVEKNLIQEIGNRPAFEVLAETFGTLTQDEQQASRGN 253
Query 244 LQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDL 303
L IG+V++E+L +GDFL+R L+G DP +G I +G + +G T+QFQ RDAAAA +D+
Sbjct 254 LFIGLVMNEYLEEYHRGDFLVRNLIGVDPQSGIIAVGALPRLGQTIQFQRRDAAAATEDM 313
Query 304 RLAVERAAAELPGPPV-GGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEI 362
+ + RA +L G V GG L +CNGRG+ +FG DHDA I+++LG + ++GFF GEI
Sbjct 314 KALLARARKQLAGATVYGGCLCSCNGRGQGLFGEPDHDAKMIQEMLGPVGMSGFFCNGEI 373
Query 363 GPVAGHNALHGFTASMALFVD 383
GPV N LHG+TAS+ALFV
Sbjct 374 GPVGERNFLHGYTASLALFVK 394
>gi|262196432|ref|YP_003267641.1| hypothetical protein Hoch_3246 [Haliangium ochraceum DSM 14365]
gi|262079779|gb|ACY15748.1| domain of unknown function DUF1745 [Haliangium ochraceum DSM
14365]
Length=396
Score = 215 bits (548), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 137/379 (37%), Positives = 194/379 (52%), Gaps = 11/379 (2%)
Query 7 VSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEPAALIG 66
V+ + A EA H +L G P L V + D L+ V+ L+G
Sbjct 7 VANTAHLEDALDEAVEHIDADLNGAAPDLMVAFAHNDYGDHLQRLVEVVRERYPGVVLLG 66
Query 67 CVAQGIVAGRHELENEPAVAVWLASGPPAET--FHLDFVRTGSGALITGYRFDRTAHDLH 124
C A G++ G +E+E +PA+++ A P E FHLD + I G + +
Sbjct 67 CSADGVIGGGNEIEYQPALSLTAAVLPGVELVPFHLDGAPASWRSRI-GMQTGQPPS--F 123
Query 125 LLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVGVRLP 184
+L+PDP+S P + + P + +GG+ SG G T LF + SG VGV +
Sbjct 124 VLIPDPFSCPVEDTLRWFDAVYPNSPKIGGLASGAGMAGTTTLFAGGHLARSGAVGVAMR 183
Query 185 GAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRG 243
GA + ++V+QGCRPIG P VT D V+ EL GRP L + +A +QEL
Sbjct 184 GALEMRTLVAQGCRPIGAPMFVTRHDEDVVFELDGRPALQAIEATFASLASADQELFRHS 243
Query 244 LQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDL 303
L +G+V D V G+GDFL+R +LG DP GA+ + +E VQF +RDAA + DL
Sbjct 244 LYLGVVTDRSKQVYGRGDFLVRNILGVDPELGAVAVDAELEDNQVVQFHLRDAATSAADL 303
Query 304 RLAVERAAAELPG-PPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEI 362
E + G PP G L+F C GRG+ ++G +HD+ G +PL GFF GEI
Sbjct 304 ----EHLLSTYDGPPPRGALMFPCLGRGQALYGHANHDSDAFRARFGEVPLGGFFCNGEI 359
Query 363 GPVAGHNALHGFTASMALF 381
GP G +HG+T +MALF
Sbjct 360 GPFGGRTFVHGYTTAMALF 378
>gi|86609276|ref|YP_478038.1| hypothetical protein CYB_1819 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86557818|gb|ABD02775.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length=441
Score = 210 bits (535), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 147/374 (40%), Positives = 205/374 (55%), Gaps = 25/374 (6%)
Query 33 PALAVLLGSRSHTDQAVDLLAAVQASVEPAALIGCVAQGIVAGRHELENEPAVAVWLASG 92
P L VL S + + + +L + +E LIGC GIV G HE+E+ PA+++ LA
Sbjct 64 PNLGVLFVSAAFASEYIRVLPLLSGLLEVDVLIGCSGGGIVGGGHEIEDGPALSLSLAVM 123
Query 93 PPA--ETFHLD-----FVRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTD 145
P FHL + A + + LLL D +S + L++ L+
Sbjct 124 PEVVLHPFHLRGNQLPDLDAAPSAWVDCVGVSPQSKPHFLLLADGFSSGISELLQGLDFA 183
Query 146 LPGTTVVGGVVSGGR-RRGDTRLFRD-------RDVLTSGLVGVRLPGAHSV-SVVSQGC 196
PG+ VGG+ SGGR RG+ D R++ G VG+ L G + +VV+QGC
Sbjct 184 YPGSVKVGGLASGGRGPRGNALFLLDARTLTPRRELYREGTVGLALYGNVVLDAVVAQGC 243
Query 197 RPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAV 256
RPIG+P VT A+G VI L GRPPL L+++ ++P +Q L L IG+++DE +
Sbjct 244 RPIGDPLRVTEAEGNVILGLEGRPPLAVLQDLAERLSPVDQRLARHSLFIGLLMDEFKSE 303
Query 257 PGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAE--- 313
P GDFLIR +LG DP GA+ IG+ V G TVQF +RDA + +DLR A+ R AE
Sbjct 304 PTPGDFLIRVILGVDPRVGALAIGDQVRPGQTVQFHLRDAQTSAEDLRWALSRYCAERNL 363
Query 314 -----LPGP-PVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAG 367
P P P G L+F+C GRG+ ++G D D+ +LLG +PL GFF GEIGPV G
Sbjct 364 RQSPSQPRPEPCGALMFSCLGRGKGLYGTPDFDSQRFRELLGELPLGGFFCNGEIGPVGG 423
Query 368 HNALHGFTASMALF 381
LHG+T+ +F
Sbjct 424 STFLHGYTSCFGIF 437
>gi|320103039|ref|YP_004178630.1| hypothetical protein Isop_1496 [Isosphaera pallida ATCC 43644]
gi|319750321|gb|ADV62081.1| domain of unknown function DUF1745 [Isosphaera pallida ATCC 43644]
Length=401
Score = 209 bits (532), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 135/334 (41%), Positives = 183/334 (55%), Gaps = 20/334 (5%)
Query 64 LIGCVAQGIVAGRHELENEPAVAVW---LASGPPAETFHLDFVRTGSGALITGYRFD--- 117
+IG A+ + E+E PA+ W L G +TF L G + R D
Sbjct 62 VIGVTAESVAGVAREVEGLPALTAWAIQLPEGSRCDTFRLTSSEAPLGDWVDSVRIDPAP 121
Query 118 --------RTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFR 169
+ + L +LL DP+SF ++ L + G V+GG+ SG R G RL
Sbjct 122 VSRVSLTEKDKNKLVILLADPFSFAADEWFSRLEEEKIGLRVIGGMASGANRPGGNRLVI 181
Query 170 DRDVLTSGLVGVRLPGAH-SVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREI 228
D V+ G VGV L G + +VVSQGCRPIG ++VT D ++ ELG RP + LRE
Sbjct 182 DGAVVQQGAVGVALSGPFVAETVVSQGCRPIGRHFVVTKVDRNILHELGRRPVIEVLREQ 241
Query 229 VLGMAPDEQ-ELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGA 287
+ ++ E +L + GL IG V++E+ +GDFLIR ++G ++ I ++ VG
Sbjct 242 LETLSDAETAKLRNGGLHIGRVINEYQERFERGDFLIRNVIGI-AEEQSLAISDLPRVGQ 300
Query 288 TVQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDL 347
TVQFQ+RDA AD+DL + R EL G G L+FTCNGRG R+F HDA + +
Sbjct 301 TVQFQLRDAQTADEDLTDLLGRP--ELKGTK-GALMFTCNGRGTRLFDQPHHDAQALANA 357
Query 348 LGGIPLAGFFAAGEIGPVAGHNALHGFTASMALF 381
+G IP AGFFA GE GPV G N +HGFTAS ALF
Sbjct 358 VGPIPAAGFFAMGEFGPVGGRNFIHGFTASFALF 391
>gi|86606541|ref|YP_475304.1| hypothetical protein CYA_1894 [Synechococcus sp. JA-3-3Ab]
gi|86555083|gb|ABD00041.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length=446
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 146/380 (39%), Positives = 204/380 (54%), Gaps = 32/380 (8%)
Query 33 PALAVLLGSRSHTDQAVDLLAAVQASVEPAALIGCVAQGIVAGRHELENEPAVAVWLASG 92
P L +L S + + + +L + +E LIGC GIV G HE+E PA+++ LA
Sbjct 64 PNLGILFVSAAFASEYIRVLPLLSELLEVDVLIGCSGGGIVGGGHEIEEGPALSLSLAVL 123
Query 93 PPAETFHLDFVRTGS--------GALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNT 144
P H ++R A I + LLL D +S + L++ L+
Sbjct 124 PDV-ALHPFYLRGNQLPDLDAPPSAWIDLVGVLPQSKPHFLLLADGFSSRISELLQGLDF 182
Query 145 DLPGTTVVGGVVSGGR-RRGDTRLFRD-------RDVLTSGLVGVRLPGAHSV-SVVSQG 195
PG VGG+ SGGR RG+ D R++ G VG+ L G + +VV+QG
Sbjct 183 AYPGAVKVGGLASGGRGPRGNALFLLDARTPTPRRELYREGTVGLALSGNVVLDAVVAQG 242
Query 196 CRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLA 255
CRPIG+P VT A+G VI L GRPPL L+++ ++P +Q L + L IG+++DE +
Sbjct 243 CRPIGDPLRVTEAEGNVILSLEGRPPLAVLQDLAERLSPSDQRLARQALFIGLLMDEFKS 302
Query 256 VPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAE-- 313
P GDFLIR +LG DP GAI IG+ V G TVQF +RDA + +DLR A+ R AE
Sbjct 303 EPTSGDFLIRVILGIDPRVGAIAIGDRVRPGQTVQFHLRDAQTSAEDLRWALSRYCAERN 362
Query 314 -----------LPGP-PVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGE 361
P P P G L+F+C GRG+ ++G + D+ +LLG +PL GFF GE
Sbjct 363 LQQSYPAERSSQPKPDPCGALMFSCLGRGKGLYGTPNFDSQRFRELLGELPLGGFFCNGE 422
Query 362 IGPVAGHNALHGFTASMALF 381
IGPV G LHG+T+ +F
Sbjct 423 IGPVGGSTFLHGYTSCFGIF 442
>gi|294055462|ref|YP_003549120.1| hypothetical protein Caka_1932 [Coraliomargarita akajimensis
DSM 45221]
gi|293614795|gb|ADE54950.1| domain of unknown function DUF1745 [Coraliomargarita akajimensis
DSM 45221]
Length=402
Score = 202 bits (514), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 124/371 (34%), Positives = 185/371 (50%), Gaps = 9/371 (2%)
Query 21 AAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEPAALIGCVAQGIVAGRHELE 80
+A R EL GG A++ S+ H D DL+ VQ ++GC G++A E+E
Sbjct 26 SAQQRREL-GGPATFALIFCSQEHVDDISDLIEIVQIYAHVPTVVGCSGVGLIANSDEIE 84
Query 81 NEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRT------AHDLHLLLPDPYSFP 134
N+ V++ L P + + G + T F R + +L S
Sbjct 85 NDAGVSIALYRLPGTQAIAHHIPTSCFGTVDTPASFKRDLGSSLDQANAWMLFASSESIG 144
Query 135 SNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHSVS-VVS 193
+ + N G +GG S + LF + G V + L G ++ +++
Sbjct 145 HDSWLPAWNQATGGKVTIGGFASSPSENPQSHLFLNGQHYQDGAVALSLEGHVTIEPLLT 204
Query 194 QGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEH 253
QGCRPIG P+IVT A+ +I ++G RP L LR+ + M+ D+Q+L + IG+V+DE+
Sbjct 205 QGCRPIGSPWIVTEAEHNLIHKIGNRPILEVLRDTLENMSDDDQQLAHGNIFIGLVLDEY 264
Query 254 LAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAE 313
+ G GDFL+R L DP TGAI I +G +QFQ+RD A D+ ++R A
Sbjct 265 KSSFGTGDFLVRNLAAIDPQTGAIAIATPPRIGQNLQFQIRDPHTAAIDMEELLKRKKAR 324
Query 314 LPGPPV-GGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALH 372
L G + GG L C GRG ++G + D S I++ L GIPL+G F GE V LH
Sbjct 325 LQGRRIYGGCLCDCIGRGASLYGAPNQDVSAIQNALPGIPLSGIFCNGEFATVKQQTQLH 384
Query 373 GFTASMALFVD 383
G+ AS+ LFV+
Sbjct 385 GYAASLGLFVE 395
>gi|37520395|ref|NP_923772.1| hypothetical protein gll0826 [Gloeobacter violaceus PCC 7421]
gi|35211388|dbj|BAC88767.1| gll0826 [Gloeobacter violaceus PCC 7421]
Length=407
Score = 201 bits (512), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 112/260 (44%), Positives = 159/260 (62%), Gaps = 3/260 (1%)
Query 125 LLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVGVRLP 184
+L+ D SFP ++LI L+ P VGG+ SGG R G RLF + SG VGV L
Sbjct 140 VLMVDGSSFPVDVLIGGLDFAFPKAIKVGGLASGGNRPGQNRLFFGDQAVGSGAVGVVLA 199
Query 185 GAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSRG 243
G +V + V+QGCRP+GE + +T A+G ++ EL G+P L L+ ++ + ++Q L
Sbjct 200 GDIAVEAAVAQGCRPVGETFQITRAEGNLLWELDGQPALQVLQTVLQQLDENDQRLARNA 259
Query 244 LQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDL 303
L +G+ + E + QGDFL+R L+G D TG + +GE + G TV+F +RDAA + DL
Sbjct 260 LFVGVRMSEFHSGSEQGDFLVRNLMGVDSRTGGLAVGEWLRTGQTVRFHLRDAATSRDDL 319
Query 304 RLAVERAAAELPG-PPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLG-GIPLAGFFAAGE 361
+L ++R E G PP G LLF+C GRG ++G D D++ +LG G+PLAGFF GE
Sbjct 320 QLVLQRHRLEHSGAPPAGALLFSCLGRGESLYGEPDVDSTLFAQVLGEGVPLAGFFCNGE 379
Query 362 IGPVAGHNALHGFTASMALF 381
IGPV LHG+T+S LF
Sbjct 380 IGPVGSTTFLHGYTSSFGLF 399
>gi|153006881|ref|YP_001381206.1| hypothetical protein Anae109_4044 [Anaeromyxobacter sp. Fw109-5]
gi|152030454|gb|ABS28222.1| domain of unknown function DUF1745 [Anaeromyxobacter sp. Fw109-5]
Length=401
Score = 198 bits (503), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 141/387 (37%), Positives = 196/387 (51%), Gaps = 7/387 (1%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
+R +S P A AEAAA L G P L V S H ++ L+
Sbjct 1 MRWSSAISRQPRAVDAFAEAAAPLEARLEGDPPDLLVAFVSPHHAGESEQLVDLAARRFP 60
Query 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRT- 119
A L+GC A G++ HE+E+ PA+++ A P E V G+ L R
Sbjct 61 RALLVGCTAGGVIGDAHEVEDGPALSLTAAVLPGVELSPFR-VEPGAQPLDPSAWRARVG 119
Query 120 ----AHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLT 175
A LLL DP++ L+E L+ P GG+ SGGR RL DV
Sbjct 120 CPPEARPKLLLLADPFTVDIGALVEGLDGAYPAAPKFGGLASGGRGLDQNRLLVAEDVHR 179
Query 176 SGLVGVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAP 234
+G VGV G V ++++QGCR IG P +VT V+ EL GRPPL + E+ + P
Sbjct 180 NGGVGVVFTGNLEVDTLIAQGCRAIGAPMLVTRCQHGVLQELDGRPPLQVIAELYASLEP 239
Query 235 DEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVR 294
++EL+ L +G+ + G+ L+R L+GAD TGA+ +G + VQF +R
Sbjct 240 RDRELMQTSLFLGLELRSDEVEFQPGELLVRNLIGADEDTGALAVGAELRPLTVVQFVLR 299
Query 295 DAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLA 354
DA +A+++LR + R G P G LLF+C GRG +FG DHD S E+ LG PL
Sbjct 300 DAHSAEQELRRMLARHRRAATGRPAGALLFSCVGRGAGLFGHPDHDTSLFEEQLGPAPLG 359
Query 355 GFFAAGEIGPVAGHNALHGFTASMALF 381
GFF GEIGPV G +HG+T++ A+F
Sbjct 360 GFFCNGEIGPVGGTTFVHGYTSAFAMF 386
>gi|159028345|emb|CAO87243.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length=417
Score = 196 bits (497), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 138/406 (34%), Positives = 205/406 (51%), Gaps = 34/406 (8%)
Query 7 VSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEPAALIG 66
+ST P + A E +++L G + +A++ S ++ L+ + + LIG
Sbjct 11 LSTRPSLEAAVTEVVEKVQDKLVG-SADIAIIFISSAYASDYPRLVPLILDKLPVPVLIG 69
Query 67 CVAQGIV-----AGRHELENEPAVAVWLASGPPAET--FHLDFVRT-------GSGALIT 112
C GIV E+E PA+++ +A P E F+++ S +
Sbjct 70 CGGAGIVGMGDREKAREIEASPALSLTVAHLPDVEVQPFYIEAAEMPDLDSSPSSWTELL 129
Query 113 GYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGG--RRRG-----DT 165
G + +LL DP+S N L+E L+ PG+ +GG+VSGG R G D
Sbjct 130 GVEAAKNPQ--FILLADPFSSRINDLLEGLDFAYPGSAKIGGLVSGGMIERSGGLFYHDQ 187
Query 166 RLFRDRDVLTSGLVGVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGR----- 219
+ R+ + G VG+ L G V ++V+QGCRPIG Y V+ + +I + G+
Sbjct 188 QKPRNSYLYRQGTVGIALSGNIIVETIVAQGCRPIGPIYQVSEGERNIIISMTGKGADGT 247
Query 220 --PPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAI 277
PPL+ LR+++ + ++ELV L IGI DE GDFLIR +LG DP GAI
Sbjct 248 PQPPLNLLRDLIPSLREKDRELVQNSLFIGIARDEFKMQLRAGDFLIRSVLGVDPRQGAI 307
Query 278 GIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPP--VGGLLFTCNGRGRRMFG 335
IG+ V G VQF +RDA + DL L ++ E P +G L+F+C GRG ++
Sbjct 308 AIGDRVRPGQRVQFHLRDADTSALDLELLLQAFPQERPNSSEVLGALIFSCLGRGENLYE 367
Query 336 VTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALF 381
D D+ + +PLAGFF GEIGPVAG LHG+T++ ALF
Sbjct 368 KPDFDSGLFQRYFANVPLAGFFCNGEIGPVAGRTFLHGYTSAFALF 413
>gi|166366981|ref|YP_001659254.1| hypothetical protein MAE_42400 [Microcystis aeruginosa NIES-843]
gi|166089354|dbj|BAG04062.1| hypothetical protein MAE_42400 [Microcystis aeruginosa NIES-843]
Length=417
Score = 190 bits (483), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 136/406 (34%), Positives = 201/406 (50%), Gaps = 34/406 (8%)
Query 7 VSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEPAALIG 66
+ST P + A E +++L G + LA++ S ++ L+ + + LIG
Sbjct 11 LSTRPSLEAAVTEVVEKVQDKLVG-SADLAIIFISSAYASDYPRLVPLILDKLSVPVLIG 69
Query 67 CVAQGIV-----AGRHELENEPAVAVWLASGPPAET--FHLDFVRT-------GSGALIT 112
C GIV E+E PA+++ +A P E F+++ S +
Sbjct 70 CGGAGIVGMDDREKAREIEASPALSLTVAHLPNVEVQPFYIEAAEMPDLDSSPSSWTELL 129
Query 113 GYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGG--RRRG-----DT 165
G + +LL DP+S N L+E L+ P + +GG+VSGG R G D
Sbjct 130 GVEAAKNPQ--FILLADPFSSRINDLLEGLDFAYPSSAKIGGLVSGGMIERSGGLFYHDQ 187
Query 166 RLFRDRDVLTSGLVGVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGR----- 219
+ R+ + G VG+ L G V ++V+QGCRPIG Y V+ + +I + G+
Sbjct 188 QKPRNTYLYRQGTVGIALSGNIIVETIVAQGCRPIGPIYQVSEGERNIIISMTGKGADGT 247
Query 220 --PPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAI 277
PPL+ LR ++ + ++EL L IGI DE GDFLIR +LG DP GAI
Sbjct 248 PQPPLNLLRALIPSLREKDRELAQHSLFIGIARDEFKMQLRAGDFLIRNVLGVDPRQGAI 307
Query 278 GIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPP--VGGLLFTCNGRGRRMFG 335
IG+ V G VQF +RDA + DL L ++ E P +G L+F+C GRG ++
Sbjct 308 AIGDRVRPGQRVQFHLRDAETSALDLELLLQAFPQEKPASSDILGALIFSCLGRGENLYE 367
Query 336 VTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALF 381
D D+ + +PLAGFF GEIGPV G LHG+T++ ALF
Sbjct 368 KPDFDSGLFQRYFANVPLAGFFGNGEIGPVGGRTFLHGYTSAFALF 413
>gi|298246483|ref|ZP_06970289.1| protein of unknown function DUF1745 [Ktedonobacter racemifer
DSM 44963]
gi|297553964|gb|EFH87829.1| protein of unknown function DUF1745 [Ktedonobacter racemifer
DSM 44963]
Length=400
Score = 190 bits (482), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 124/362 (35%), Positives = 183/362 (51%), Gaps = 15/362 (4%)
Query 35 LAVLLGSRSHTDQAVDLLAAVQASVEPAALIGCVAQGIVAGRHELENEPAVAVWLASGPP 94
+A+L S + + ++L ++ + ++GC QGI+ ELE+ PA+++ S P
Sbjct 36 VALLFASGEYEEHFPEMLRIIKKETGASIVLGCSGQGIIGTGVELEDVPALSLMTMSLPG 95
Query 95 AETFHL-----DFVRTGSGALITGYRFDRTAHDLH--LLLPDPYSFPSNLLIEHLNTDLP 147
A T H D V + D D++ LL DP+ S LI+ L P
Sbjct 96 A-TLHATRLPPDIVEMFNTPEELRTLLDVPLDDVNGWLLFLDPFHLNSESLIDALARAYP 154
Query 148 GTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHSV-SVVSQGCRPIGEPYIVT 206
++GG+ S + F + V G +G+ + G + + S+VSQGC PIGEP+ +T
Sbjct 155 QVPMMGGLASNDMQDSPCYFFFNDTVYNDGGIGLAIGGPYKILSIVSQGCEPIGEPWTIT 214
Query 207 GA-DGAVITELGGRPPLHRLREIVLGMAPDEQELVSRGLQ-IGIVVDEHLAVPGQGDFLI 264
D ++I + RP L + ++P Q R L +G+ DE+ G+G FLI
Sbjct 215 KVQDNSLIETISNRPAYDMLVDTFQKLSPAAQIRAQRNLLLVGLAADEYSERFGRGSFLI 274
Query 265 RGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPP----VG 320
R LLG D A+ IG VG T+QFQ+RD+ AD DLR + + L V
Sbjct 275 RNLLGVDRRNKALAIGAQPRVGQTIQFQMRDSETADLDLRELLNKLHYRLKKAEAYQIVS 334
Query 321 GLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMAL 380
G+L TCNGRG +F +HDA +E++LG +P G F GEIGPV + LH FTA +AL
Sbjct 335 GILCTCNGRGESLFPTPNHDAGMVEEILGPLPTIGLFCNGEIGPVGDRSFLHSFTACLAL 394
Query 381 FV 382
V
Sbjct 395 IV 396
>gi|298490695|ref|YP_003720872.1| hypothetical protein Aazo_1561 ['Nostoc azollae' 0708]
gi|298232613|gb|ADI63749.1| domain of unknown function DUF1745 ['Nostoc azollae' 0708]
Length=404
Score = 189 bits (481), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 130/390 (34%), Positives = 201/390 (52%), Gaps = 18/390 (4%)
Query 7 VSTAPDVRRAAAEAAAHAREELAGGTPA-LAVLLGSRSHTDQAVDLLAAVQASVEPAALI 65
+ST + A + A L PA L ++ S + T + LL + + LI
Sbjct 11 LSTHHSLETAVTDVVQQAVSSLTA--PADLGLVFISSAFTSEYSRLLPLLTEKLSVPMLI 68
Query 66 GCVAQGIVAGR-----HELENEPAVAVWLASGPPAE--TFH-----LDFVRTGSGALITG 113
GC A G+V + E+E+EPA+++ LA P + FH L + A I
Sbjct 69 GCSAAGVVGTKSGNKTQEIESEPAISLTLAHLPGVDIRAFHILGDQLPDLDCSPDAWIDL 128
Query 114 YRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDV 173
++ +LL +S +N L++ L+ P + +VGG SGG LF + +
Sbjct 129 VGVLPSSAPQFILLSSAFSSGTNDLLQGLDFAYPSSVIVGGQASGGFVSDRIALFCNDRL 188
Query 174 LTSGLVGVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGM 232
G VG+ L G + ++V+QGCRPIGE VT A+ +I EL + PL LR ++ +
Sbjct 189 YRQGTVGLALSGDIVLETIVAQGCRPIGELLQVTKAERNIILELDEQVPLVVLRNLISSL 248
Query 233 APDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQ 292
+ +E+ L L +G+ ++E QGDFLIR LLG DP+ GAI IG+ V G +QF
Sbjct 249 SEEEKMLTQHSLFVGLAMNEFQLSLKQGDFLIRNLLGVDPSAGAIAIGDRVRPGQRLQFH 308
Query 293 VRDAAAADKDLRLAVERAAAELP--GPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGG 350
+RDA A+ +DL L ++ + P+ L+F+C GRG ++G + D+ +
Sbjct 309 LRDAQASAEDLELILQEYQEQSTSGSSPLAALMFSCVGRGAGLYGKANFDSELFKRYFHD 368
Query 351 IPLAGFFAAGEIGPVAGHNALHGFTASMAL 380
IP+ G+F AGEIGPV+G LHG+T+ A+
Sbjct 369 IPMGGYFCAGEIGPVSGRTFLHGYTSVFAI 398
>gi|158336704|ref|YP_001517878.1| hypothetical protein AM1_3572 [Acaryochloris marina MBIC11017]
gi|158306945|gb|ABW28562.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length=417
Score = 188 bits (478), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 139/414 (34%), Positives = 200/414 (49%), Gaps = 35/414 (8%)
Query 1 VRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPA-LAVLLGSRSHTDQAVDLLAAVQASV 59
++ +ST P + A E A A + L PA LA+L S + + L + +
Sbjct 1 MKWASALSTQPSLEAALDEVIATAMQSL--DAPADLAILFISTTFASEFPRLQPLLADKL 58
Query 60 EPAALIGCVAQGIVAGRH-----ELENEPAVAVWLASGP--PAETFHL--DFVRTGSGAL 110
IGC G++ E+E EP + + LAS P +TFH+ D + A
Sbjct 59 PVQHFIGCGGNGVIGPTQGGSTAEVEEEPGITLTLASLPGVDIQTFHIYEDELPDPDSAP 118
Query 111 ITGYRFDRT--AHDLHLLL-PDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRL 167
+T AH H +L DP S + L++ L+ PG +GG+ SG + L
Sbjct 119 LTWTELLEVDPAHQPHFILFADPSSSKISDLLQGLDYAYPGAVKIGGLASGRSSWSGSGL 178
Query 168 FRDRDVLTSGLVGVRLPGAHSV-SVVSQGCRPIGEPYIVTGADGAVI------------- 213
F D + G VGV L G V ++V+QGCRPIG+PY V A+ V+
Sbjct 179 FCDDQLYREGTVGVALSGNIMVETIVAQGCRPIGQPYRVAEAERNVVLQVEEQTVPVEAT 238
Query 214 ---TELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGA 270
E+ + PL L+ +V + DE+EL L +GIV +E GDFLIR L+G
Sbjct 239 FNADEVELQTPLEALQTLVQDLDEDERELAQHSLSVGIVCNEFKQNLEPGDFLIRNLIGV 298
Query 271 DPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELP---GPPVGGLLFTCN 327
DP GAI IG+ + G +QF +RDA A+ +L ++ + P P+ LLF C
Sbjct 299 DPRIGAIAIGDRIRPGQRIQFHLRDAQASADELEELLQHYFQKSPPDQSQPIAALLFDCL 358
Query 328 GRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALF 381
GRG R +G D D+ IP++GFF GEIGP+AG LHG+TA+ +F
Sbjct 359 GRGERFYGEPDFDSQLFRRYFHNIPVSGFFCNGEIGPIAGTTFLHGYTAAFGIF 412
>gi|254412137|ref|ZP_05025912.1| conserved domain protein [Microcoleus chthonoplastes PCC 7420]
gi|196181103|gb|EDX76092.1| conserved domain protein [Microcoleus chthonoplastes PCC 7420]
Length=416
Score = 182 bits (463), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 119/330 (37%), Positives = 175/330 (54%), Gaps = 32/330 (9%)
Query 77 HELENEPAVAVWLASGPPAET--FHL------DFVRTGSGAL-ITGYRFDRTAHDLHLLL 127
E+E EPA+++ LAS P FH+ D + S + + G H +LL
Sbjct 81 QEIEAEPALSISLASMPEVSVRAFHIPGSDLPDLDSSPSTWVDLIGVSPQDQPH--FILL 138
Query 128 PDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLF-RDRDVLT------SGLVG 180
DP+S N L++ L+ PG+ VGG+ S + LF RD + + G +G
Sbjct 139 ADPFSSKINDLLQGLDFAYPGSVKVGGLASASAMGVQSGLFYRDSERYSGGTLHREGTIG 198
Query 181 VRLPGAHSVS-VVSQGCRPIGEPYIVTG-----------ADGAVITELGGRPPLHRLREI 228
V L G + +VSQGCRPIG+PY +T ++G +E+ +PPL LR++
Sbjct 199 VALSGNVVLDPIVSQGCRPIGQPYQITKGERNIVLELADSNGMSFSEVESQPPLAVLRDV 258
Query 229 VLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGAT 288
+ ++ ++EL L IGI DE GQGDFLIR LLG DP GAI IG+ V G
Sbjct 259 IQNLSESDRELAQHSLFIGIARDEFKQSLGQGDFLIRNLLGVDPRLGAIAIGDRVRPGQR 318
Query 289 VQFQVRDAAAADKDLRLAVERAAAELPGPP--VGGLLFTCNGRGRRMFGVTDHDASTIED 346
+QF +RDA +++DL L ++ ++ P G L+F+C GRG+ ++G D D+ +
Sbjct 319 IQFHLRDARTSEEDLELLLQNYQNQVNSTPETAGALMFSCLGRGQGLYGKPDFDSQLLCR 378
Query 347 LLGGIPLAGFFAAGEIGPVAGHNALHGFTA 376
+ I + GFF GEIGPV G LHG+T+
Sbjct 379 YINNISVGGFFCNGEIGPVGGSTFLHGYTS 408
Lambda      K        H
   0.320    0.140    0.412
Gapped
Lambda      K        H
   0.267   0.0410    0.140
Effective search space used: 743543923100
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
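
Note on reading the scores above: the footer values can be tied back to the per-hit "Score = ... bits (raw)" and "Expect" lines with the standard Karlin-Altschul relations, bits = (Lambda * raw - ln K) / ln 2 and E = search_space * 2^(-bits). The sketch below is a minimal illustration, not part of the BLAST output. It assumes the rounded Gapped Lambda and K printed in this report and the "Effective search space used" value apply to every hit; the function and constant names are hypothetical. Because the hits use compositional matrix adjustment and the printed parameters are rounded, recomputed E-values should only be expected to land near the reported ones. The integer bit scores in the report appear consistent with truncating the computed value, which is an observation from these hits rather than a documented rule.

```python
import math

# Copied from the "Gapped" statistics block and the search-space line of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 743543923100


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (Karlin-Altschul normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits` in the given search space."""
    return search_space * 2.0 ** (-bits)


# Raw scores taken from the Haliangium, Synechococcus JA-2-3B'a and Isosphaera hits.
for raw in (548, 535, 532):
    bits = bit_score(raw)
    print(f"raw={raw}  bits={bits:.1f}  E={expect_value(bits):.1e}")
```

For the three raw scores used here, the truncated bit scores match the 215, 210 and 209 bits reported in the corresponding hit records.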