BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0874c
Length=386
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value
gi|15608014|ref|NP_215389.1| hypothetical protein Rv0874c [Mycob... 754 0.0
gi|289756959|ref|ZP_06516337.1| conserved hypothetical protein [... 753 0.0
gi|289744604|ref|ZP_06503982.1| conserved hypothetical protein [... 737 0.0
gi|15840288|ref|NP_335325.1| hypothetical protein MT0897 [Mycoba... 728 0.0
gi|339293886|gb|AEJ45997.1| hypothetical protein CCDC5079_0807 [... 720 0.0
gi|308375278|ref|ZP_07667983.1| hypothetical protein TMGG_02939 ... 645 0.0
gi|240169380|ref|ZP_04748039.1| hypothetical protein MkanA1_0870... 610 2e-172
gi|15607768|ref|NP_215142.1| hypothetical protein Rv0628c [Mycob... 556 2e-156
gi|306774736|ref|ZP_07413073.1| hypothetical protein TMAG_02508 ... 555 4e-156
gi|289442020|ref|ZP_06431764.1| conserved hypothetical protein [... 555 5e-156
gi|289749126|ref|ZP_06508504.1| conserved hypothetical protein [... 553 2e-155
gi|254230962|ref|ZP_04924289.1| conserved hypothetical protein [... 553 2e-155
gi|340625645|ref|YP_004744097.1| hypothetical protein MCAN_06251... 553 2e-155
gi|289568838|ref|ZP_06449065.1| conserved hypothetical protein [... 523 3e-146
gi|308396149|ref|ZP_07492241.2| hypothetical protein TMLG_03378 ... 518 9e-145
gi|289573230|ref|ZP_06453457.1| LOW QUALITY PROTEIN: conserved h... 455 6e-126
gi|307078563|ref|ZP_07487733.1| hypothetical protein TMKG_03909 ... 454 1e-125
gi|289749395|ref|ZP_06508773.1| conserved hypothetical protein [... 425 6e-117
gi|306796378|ref|ZP_07434680.1| hypothetical protein TMFG_03295 ... 342 5e-92
gi|289744342|ref|ZP_06503720.1| conserved hypothetical protein [... 328 1e-87
gi|283778153|ref|YP_003368908.1| hypothetical protein Psta_0358 ... 268 1e-69
gi|284044707|ref|YP_003395047.1| hypothetical protein Cwoe_3254 ... 265 8e-69
gi|271969747|ref|YP_003343943.1| hypothetical protein Sros_8558 ... 259 8e-67
gi|325111105|ref|YP_004272173.1| hypothetical protein Plabr_4580... 254 2e-65
gi|302035705|ref|YP_003796027.1| hypothetical protein NIDE0322 [... 250 3e-64
gi|87306450|ref|ZP_01088597.1| hypothetical protein DSM3645_0896... 249 5e-64
gi|297171923|gb|ADI22910.1| uncharacterized protein conserved in... 248 2e-63
gi|296271068|ref|YP_003653700.1| hypothetical protein Tbis_3113 ... 241 1e-61
gi|72160848|ref|YP_288505.1| hypothetical protein Tfu_0444 [Ther... 239 4e-61
gi|296121655|ref|YP_003629433.1| hypothetical protein Plim_1400 ... 238 1e-60
gi|306796379|ref|ZP_07434681.1| hypothetical protein TMFG_03296 ... 228 1e-57
gi|117929098|ref|YP_873649.1| hypothetical protein Acel_1891 [Ac... 228 1e-57
gi|269125309|ref|YP_003298679.1| hypothetical protein Tcur_1055 ... 223 4e-56
gi|297559074|ref|YP_003678048.1| hypothetical protein Ndas_0091 ... 221 1e-55
gi|223939736|ref|ZP_03631608.1| protein of unknown function DUF1... 216 6e-54
gi|289744343|ref|ZP_06503721.1| conserved hypothetical protein [... 208 1e-51
gi|320103039|ref|YP_004178630.1| hypothetical protein Isop_1496 ... 208 1e-51
gi|149923652|ref|ZP_01912048.1| hypothetical protein PPSIR1_1692... 207 2e-51
gi|294055462|ref|YP_003549120.1| hypothetical protein Caka_1932 ... 197 2e-48
gi|86609276|ref|YP_478038.1| hypothetical protein CYB_1819 [Syne... 196 5e-48
gi|262196432|ref|YP_003267641.1| hypothetical protein Hoch_3246 ... 196 5e-48
gi|153006881|ref|YP_001381206.1| hypothetical protein Anae109_40... 192 7e-47
gi|86606541|ref|YP_475304.1| hypothetical protein CYA_1894 [Syne... 190 4e-46
gi|37520395|ref|NP_923772.1| hypothetical protein gll0826 [Gloeo... 188 1e-45
gi|159028345|emb|CAO87243.1| unnamed protein product [Microcysti... 188 2e-45
gi|298490695|ref|YP_003720872.1| hypothetical protein Aazo_1561 ... 187 3e-45
gi|166366981|ref|YP_001659254.1| hypothetical protein MAE_42400 ... 185 1e-44
gi|17230343|ref|NP_486891.1| hypothetical protein alr2851 [Nosto... 184 2e-44
gi|75907272|ref|YP_321568.1| hypothetical protein Ava_1049 [Anab... 184 2e-44
gi|254412137|ref|ZP_05025912.1| conserved domain protein [Microc... 176 6e-42
>gi|15608014|ref|NP_215389.1| hypothetical protein Rv0874c [Mycobacterium tuberculosis H37Rv]
gi|31792062|ref|NP_854555.1| hypothetical protein Mb0898c [Mycobacterium bovis AF2122/97]
gi|121636797|ref|YP_977020.1| hypothetical protein BCG_0926c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
62 more sequence titles
Length=386
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/386 (99%), Positives = 386/386 (100%), Gaps = 0/386 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID
Sbjct 1 MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA
Sbjct 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG
Sbjct 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV
Sbjct 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD
Sbjct 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG
Sbjct 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
Query 361 EIGPIAGRNALHGFTASMALFVDDME 386
EIGPIAGRNALHGFTASMALFVDDME
Sbjct 361 EIGPIAGRNALHGFTASMALFVDDME 386
>gi|289756959|ref|ZP_06516337.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|294996354|ref|ZP_06802045.1| hypothetical protein Mtub2_18086 [Mycobacterium tuberculosis
210]
gi|298524366|ref|ZP_07011775.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|289712523|gb|EFD76535.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|298494160|gb|EFI29454.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|326904907|gb|EGE51840.1| hypothetical protein TBPG_02828 [Mycobacterium tuberculosis W-148]
gi|339297527|gb|AEJ49637.1| hypothetical protein CCDC5180_0800 [Mycobacterium tuberculosis
CCDC5180]
Length=386
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/386 (99%), Positives = 385/386 (99%), Gaps = 0/386 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID
Sbjct 1 MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA
Sbjct 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG
Sbjct 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV
Sbjct 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
SH LQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD
Sbjct 241 SHSLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG
Sbjct 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
Query 361 EIGPIAGRNALHGFTASMALFVDDME 386
EIGPIAGRNALHGFTASMALFVDDME
Sbjct 361 EIGPIAGRNALHGFTASMALFVDDME 386
>gi|289744604|ref|ZP_06503982.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289685132|gb|EFD52620.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=385
Score = 737 bits (1902), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/383 (99%), Positives = 378/383 (99%), Gaps = 0/383 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID
Sbjct 1 MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA
Sbjct 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG
Sbjct 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV
Sbjct 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
SH LQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD
Sbjct 241 SHSLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG
Sbjct 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
Query 361 EIGPIAGRNALHGFTASMALFVD 383
EIGPIAGRNAL GFTASM L D
Sbjct 361 EIGPIAGRNALQGFTASMGLVFD 383
>gi|15840288|ref|NP_335325.1| hypothetical protein MT0897 [Mycobacterium tuberculosis CDC1551]
gi|13880449|gb|AAK45139.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
Length=427
Score = 728 bits (1879), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/373 (100%), Positives = 373/373 (100%), Gaps = 0/373 (0%)
Query 14 RQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIV 73
RQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIV
Sbjct 55 RQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIV 114
Query 74 AGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTF 133
AGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTF
Sbjct 115 AGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTF 174
Query 134 PSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVS 193
PSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVS
Sbjct 175 PSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVS 234
Query 194 QGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEH 253
QGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEH
Sbjct 235 QGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEH 294
Query 254 LAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAAR 313
LAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAAR
Sbjct 295 LAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAAR 354
Query 314 LPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHG 373
LPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHG
Sbjct 355 LPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHG 414
Query 374 FTASMALFVDDME 386
FTASMALFVDDME
Sbjct 415 FTASMALFVDDME 427
>gi|339293886|gb|AEJ45997.1| hypothetical protein CCDC5079_0807 [Mycobacterium tuberculosis
CCDC5079]
Length=369
Score = 720 bits (1859), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/369 (99%), Positives = 368/369 (99%), Gaps = 0/369 (0%)
Query 18 VEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIVAGRH 77
+EAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIVAGRH
Sbjct 1 MEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIVAGRH 60
Query 78 EIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNL 137
EIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNL
Sbjct 61 EIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNL 120
Query 138 LIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCR 197
LIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCR
Sbjct 121 LIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCR 180
Query 198 PIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAP 257
PIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSH LQIGIVVDEHLAAP
Sbjct 181 PIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHSLQIGIVVDEHLAAP 240
Query 258 GQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGR 317
GQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGR
Sbjct 241 GQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGR 300
Query 318 AAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHGFTAS 377
AAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHGFTAS
Sbjct 301 AAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHGFTAS 360
Query 378 MALFVDDME 386
MALFVDDME
Sbjct 361 MALFVDDME 369
>gi|308375278|ref|ZP_07667983.1| hypothetical protein TMGG_02939 [Mycobacterium tuberculosis SUMu007]
gi|308346735|gb|EFP35586.1| hypothetical protein TMGG_02939 [Mycobacterium tuberculosis SUMu007]
Length=347
Score = 645 bits (1664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/330 (99%), Positives = 330/330 (100%), Gaps = 0/330 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID
Sbjct 1 MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA
Sbjct 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG
Sbjct 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV
Sbjct 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD
Sbjct 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRG 330
KDLRLTVERAAARLPGRAAGALLFTCNGRG
Sbjct 301 KDLRLTVERAAARLPGRAAGALLFTCNGRG 330
>gi|240169380|ref|ZP_04748039.1| hypothetical protein MkanA1_08708 [Mycobacterium kansasii ATCC
12478]
Length=383
Score = 610 bits (1572), Expect = 2e-172, Method: Compositional matrix adjust.
Identities = 310/383 (81%), Positives = 341/383 (90%), Gaps = 0/383 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVG T PDAR+AAVEAA QA DELAGE PSLAVLLGSR+H+D+AADVL+AV +++
Sbjct 1 MRIGVGFSTAPDARKAAVEAATQACDELAGEMPSLAVLLGSRSHSDQAADVLNAVQEIVG 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
P L+GC+AQA+VAGRHEIED+PAV VWLASGLAAETFQLDFVRTGSG L+TGYRFDRTA
Sbjct 61 SPPLIGCVAQAVVAGRHEIEDQPAVAVWLASGLAAETFQLDFVRTGSGGLLTGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
DLHLLLPDPYTFPS+LLIEH N+DLPGT VVGG+ SGGR G TRLFRD V +SG+VG
Sbjct 121 HDLHLLLPDPYTFPSSLLIEHLNSDLPGTTVVGGLASGGRGPGGTRLFRDRGVFSSGLVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPG+ +P+VSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIVEGL E+ LV
Sbjct 181 VRLPGVHSIPIVSQGCRPIGRPYIVTGADGAVITELGGRPPLVRLREIVEGLPLHEQELV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
S GLQIGIVVDEHLAAPGQGDF+IRGLLGADPSTG IEI EVV+VG T+QFQVRDAA AD
Sbjct 241 SRGLQIGIVVDEHLAAPGQGDFLIRGLLGADPSTGVIEIGEVVEVGTTVQFQVRDAASAD 300
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
KDL L VERAAA L GR AGALLFTCNGRGRRMFGVADHDASTIE+LLGGIPLAGFFAAG
Sbjct 301 KDLHLAVERAAAELGGRPAGALLFTCNGRGRRMFGVADHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPIAGRNALHGFTASMALFVD 383
EIGP+ GRNALHG+TAS+ALFVD
Sbjct 361 EIGPVFGRNALHGYTASLALFVD 383
>gi|15607768|ref|NP_215142.1| hypothetical protein Rv0628c [Mycobacterium tuberculosis H37Rv]
gi|15840029|ref|NP_335066.1| hypothetical protein MT0656 [Mycobacterium tuberculosis CDC1551]
gi|31791810|ref|NP_854303.1| hypothetical protein Mb0644c [Mycobacterium bovis AF2122/97]
56 more sequence titles
Length=383
Score = 556 bits (1434), Expect = 2e-156, Method: Compositional matrix adjust.
Identities = 312/383 (82%), Positives = 339/383 (89%), Gaps = 0/383 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LV
Sbjct 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
S GLQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA AD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
KDLRL VERAAA LPG G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPIAGRNALHGFTASMALFVD 383
EIGP+AG NALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|306774736|ref|ZP_07413073.1| hypothetical protein TMAG_02508 [Mycobacterium tuberculosis SUMu001]
gi|306970840|ref|ZP_07483501.1| hypothetical protein TMJG_02372 [Mycobacterium tuberculosis SUMu010]
gi|308216629|gb|EFO76028.1| hypothetical protein TMAG_02508 [Mycobacterium tuberculosis SUMu001]
gi|308359625|gb|EFP48476.1| hypothetical protein TMJG_02372 [Mycobacterium tuberculosis SUMu010]
Length=383
Score = 555 bits (1431), Expect = 4e-156, Method: Compositional matrix adjust.
Identities = 311/383 (82%), Positives = 339/383 (89%), Gaps = 0/383 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LV
Sbjct 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
S GLQIGIVVDEHLA PGQG+F+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA AD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGNFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
KDLRL VERAAA LPG G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPIAGRNALHGFTASMALFVD 383
EIGP+AG NALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|289442020|ref|ZP_06431764.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289568565|ref|ZP_06448792.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289414939|gb|EFD12179.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289542319|gb|EFD45967.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=383
Score = 555 bits (1430), Expect = 5e-156, Method: Compositional matrix adjust.
Identities = 311/383 (82%), Positives = 338/383 (89%), Gaps = 0/383 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LV
Sbjct 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
S GLQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA AD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
KDLRL VER AA LPG G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERVAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPIAGRNALHGFTASMALFVD 383
EIGP+AG NALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|289749126|ref|ZP_06508504.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289689713|gb|EFD57142.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=383
Score = 553 bits (1425), Expect = 2e-155, Method: Compositional matrix adjust.
Identities = 310/383 (81%), Positives = 337/383 (88%), Gaps = 0/383 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LV
Sbjct 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
S GLQIGIVVDEHLA PGQGDF+IRGLLGADP+ G+I I EVV+VGAT+QFQVRDAA AD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTKGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
KDLRL VER AA LPG G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERVAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPIAGRNALHGFTASMALFVD 383
EIGP+AG NALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|254230962|ref|ZP_04924289.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|124600021|gb|EAY59031.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
Length=383
Score = 553 bits (1425), Expect = 2e-155, Method: Compositional matrix adjust.
Identities = 311/383 (82%), Positives = 338/383 (89%), Gaps = 0/383 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
DLHLLLPDPY+FPSNLLIE NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 HDLHLLLPDPYSFPSNLLIERLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LV
Sbjct 181 VRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
S GLQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA AD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
KDLRL VERAAA LPG G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPIAGRNALHGFTASMALFVD 383
EIGP+AG NALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|340625645|ref|YP_004744097.1| hypothetical protein MCAN_06251 [Mycobacterium canettii CIPT
140010059]
gi|340003835|emb|CCC42965.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=383
Score = 553 bits (1425), Expect = 2e-155, Method: Compositional matrix adjust.
Identities = 308/383 (81%), Positives = 337/383 (88%), Gaps = 0/383 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGV T PD R+AA EAA A +ELAG P+LAVLLGSR+HTD+A D+L+AV + ++
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAHEELAGGTPALAVLLGSRSHTDQAVDLLAAVQESVE 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGSPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
DLHLLLPDPY+FPSNLLI+H NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 HDLHLLLPDPYSFPSNLLIDHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPG V VVSQ CRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LV
Sbjct 181 VRLPGAHSVSVVSQSCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
S GLQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA AD
Sbjct 241 SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAAD 300
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
KDLRL VERAAA LPG G LLFT NGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAG
Sbjct 301 KDLRLAVERAAAELPGPPVGGLLFTGNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAG 360
Query 361 EIGPIAGRNALHGFTASMALFVD 383
EIGP+AG NALHGFTASMALFVD
Sbjct 361 EIGPVAGHNALHGFTASMALFVD 383
>gi|289568838|ref|ZP_06449065.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289542592|gb|EFD46240.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=304
Score = 523 bits (1346), Expect = 3e-146, Method: Compositional matrix adjust.
Identities = 265/266 (99%), Positives = 266/266 (100%), Gaps = 0/266 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID
Sbjct 1 MRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA
Sbjct 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG
Sbjct 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
Query 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV
Sbjct 181 VRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRG 266
SHGLQIGIVVDEHLAAPGQGDFVIRG
Sbjct 241 SHGLQIGIVVDEHLAAPGQGDFVIRG 266
>gi|308396149|ref|ZP_07492241.2| hypothetical protein TMLG_03378 [Mycobacterium tuberculosis SUMu012]
gi|308367164|gb|EFP56015.1| hypothetical protein TMLG_03378 [Mycobacterium tuberculosis SUMu012]
Length=335
Score = 518 bits (1333), Expect = 9e-145, Method: Compositional matrix adjust.
Identities = 278/334 (84%), Positives = 299/334 (90%), Gaps = 0/334 (0%)
Query 50 DVLSAVLQMIDPPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGA 109
D+L+AV ++P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGA
Sbjct 2 DLLAAVQASVEPAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGA 61
Query 110 LITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFR 169
LITGYRFDRTA DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFR
Sbjct 62 LITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFR 121
Query 170 DHDVLTSGVVGVRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIV 229
D DVLTSG+VGVRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV
Sbjct 122 DRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIV 181
Query 230 EGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATM 289
G++PDE+ LVS GLQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+
Sbjct 182 LGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATV 241
Query 290 QFQVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLG 349
QFQVRDAA ADKDLRL VERAAA LPG G LLFTCNGRGRRMFGV DHDASTIE+LLG
Sbjct 242 QFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLG 301
Query 350 GIPLAGFFAAGEIGPIAGRNALHGFTASMALFVD 383
GIPLAGFFAAGEIGP+AG NALHGFTASMALFVD
Sbjct 302 GIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD 335
>gi|289573230|ref|ZP_06453457.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis K85]
gi|289537661|gb|EFD42239.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis K85]
Length=320
Score = 455 bits (1171), Expect = 6e-126, Method: Compositional matrix adjust.
Identities = 249/289 (87%), Positives = 262/289 (91%), Gaps = 0/289 (0%)
Query 95 AETFQLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGG 154
AETF LDFVRTGSGALITGYRFDRTA DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGG
Sbjct 32 AETFHLDFVRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGG 91
Query 155 VVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCRPIGYPYIVTGADGILIT 214
VVSGGRRRGDTRLFRD DVLTSG+VGVRLPG V VVSQGCRPIG PYIVTGADG +IT
Sbjct 92 VVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVIT 151
Query 215 ELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPST 274
ELGGRPPL RLREIV G++PDE+ LVS GLQIGIVVDEHLA PGQGDF+IRGLLGADP+T
Sbjct 152 ELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTT 211
Query 275 GSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMF 334
G+I I EVV+VGAT+QFQVRDAA ADKDLRL VERAAA LPG G LLFTCNGRGRRMF
Sbjct 212 GAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMF 271
Query 335 GVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHGFTASMALFVD 383
GV DHDASTIE+LLGGIPLAGFFAAGEIGP+AG NALHGFTASMALFVD
Sbjct 272 GVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD 320
>gi|307078563|ref|ZP_07487733.1| hypothetical protein TMKG_03909 [Mycobacterium tuberculosis SUMu011]
gi|308363552|gb|EFP52403.1| hypothetical protein TMKG_03909 [Mycobacterium tuberculosis SUMu011]
Length=290
Score = 454 bits (1169), Expect = 1e-125, Method: Compositional matrix adjust.
Identities = 248/289 (86%), Positives = 262/289 (91%), Gaps = 0/289 (0%)
Query 95 AETFQLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGG 154
AETF LDFVRTGSGALITGYRFDRTA DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGG
Sbjct 2 AETFHLDFVRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGG 61
Query 155 VVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCRPIGYPYIVTGADGILIT 214
VVSGGRRRGDTRLFRD DVLTSG+VGVRLPG V VVSQGCRPIG PYIVTGADG +IT
Sbjct 62 VVSGGRRRGDTRLFRDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVIT 121
Query 215 ELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPST 274
ELGGRPPL RLREIV G++PDE+ LVS GLQIGIVVDEHLA PGQG+F+IRGLLGADP+T
Sbjct 122 ELGGRPPLHRLREIVLGMAPDEQELVSRGLQIGIVVDEHLAVPGQGNFLIRGLLGADPTT 181
Query 275 GSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMF 334
G+I I EVV+VGAT+QFQVRDAA ADKDLRL VERAAA LPG G LLFTCNGRGRRMF
Sbjct 182 GAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMF 241
Query 335 GVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHGFTASMALFVD 383
GV DHDASTIE+LLGGIPLAGFFAAGEIGP+AG NALHGFTASMALFVD
Sbjct 242 GVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD 290
>gi|289749395|ref|ZP_06508773.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289689982|gb|EFD57411.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=311
Score = 425 bits (1093), Expect = 6e-117, Method: Compositional matrix adjust.
Identities = 234/248 (95%), Positives = 237/248 (96%), Gaps = 0/248 (0%)
Query 100 LDFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGG 159
+DFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGG
Sbjct 1 MDFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGG 60
Query 160 RRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGR 219
RRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGR
Sbjct 61 RRRGDTRLFRDHDVLTSGVVGVRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGR 120
Query 220 PPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEI 279
PPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEI
Sbjct 121 PPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEI 180
Query 280 DEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADH 339
DEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRAAGA LFTC+ R +FGV
Sbjct 181 DEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRAAGAPLFTCHARRTTIFGVPRP 240
Query 340 DASTIEEL 347
TIEEL
Sbjct 241 RRVTIEEL 248
>gi|306796378|ref|ZP_07434680.1| hypothetical protein TMFG_03295 [Mycobacterium tuberculosis SUMu006]
gi|308343226|gb|EFP32077.1| hypothetical protein TMFG_03295 [Mycobacterium tuberculosis SUMu006]
Length=209
Score = 342 bits (878), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 175/209 (84%), Positives = 187/209 (90%), Gaps = 0/209 (0%)
Query 175 TSGVVGVRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSP 234
TSG+VGVRLPG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++P
Sbjct 1 TSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAP 60
Query 235 DERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVR 294
DE+ LVS GLQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVR
Sbjct 61 DEQELVSRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVR 120
Query 295 DAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLA 354
DAA ADKDLRL VERAAA LPG G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLA
Sbjct 121 DAAAADKDLRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLA 180
Query 355 GFFAAGEIGPIAGRNALHGFTASMALFVD 383
GFFAAGEIGP+AG NALHGFTASMALFVD
Sbjct 181 GFFAAGEIGPVAGHNALHGFTASMALFVD 209
>gi|289744342|ref|ZP_06503720.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289684870|gb|EFD52358.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=201
Score = 328 bits (840), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 167/201 (84%), Positives = 179/201 (90%), Gaps = 0/201 (0%)
Query 183 LPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSH 242
+PG V VVSQGCRPIG PYIVTGADG +ITELGGRPPL RLREIV G++PDE+ LVS
Sbjct 1 MPGAHRVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELVSR 60
Query 243 GLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKD 302
GLQIGIVVDEHLA PGQGDF+IRGLLGADP+TG+I I EVV+VGAT+QFQVRDAA ADKD
Sbjct 61 GLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKD 120
Query 303 LRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEI 362
LRL VERAAA LPG G LLFTCNGRGRRMFGV DHDASTIE+LLGGIPLAGFFAAGEI
Sbjct 121 LRLAVERAAAELPGPPVGGLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEI 180
Query 363 GPIAGRNALHGFTASMALFVD 383
GP+AG NALHGFTASMALFVD
Sbjct 181 GPVAGHNALHGFTASMALFVD 201
>gi|283778153|ref|YP_003368908.1| hypothetical protein Psta_0358 [Pirellula staleyi DSM 6068]
gi|283436606|gb|ADB15048.1| domain of unknown function DUF1745 [Pirellula staleyi DSM 6068]
Length=400
Score = 268 bits (684), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 160/383 (42%), Positives = 218/383 (57%), Gaps = 9/383 (2%)
Query 7 VCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVG 66
+ +T DA + A A P L ++ S H A + + ++ L+G
Sbjct 18 LSSTADAVEEVARKALTALQSSGPRTPDLGLVFFSNHHAPEADFLAKKLCALLGTENLIG 77
Query 67 CIAQAIVAGRHEIEDEPAVVVWLAS---GLAAETFQLDFVRTGSGALITGY----RFDRT 119
C ++IV E+E PA+ +WLAS G A + L +T G +I G+ + +
Sbjct 78 CSGESIVGTGVEVEGSPAISLWLASFATGTATPMY-LHLEQTAEGGVIDGWPEAISGEWS 136
Query 120 ARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVV 179
LLL +PY+FP++LL+E N D G VVGG+ SGG G+ RL G V
Sbjct 137 GDTFLLLLGEPYSFPADLLLERLNEDRAGVPVVGGMASGGDSPGEHRLILGPQTYAEGAV 196
Query 180 GVRLPGMRGV-PVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERA 238
V + + VVSQGCRPIG P+IVT A+ +I ELGGRP L +L+E+ + L E+A
Sbjct 197 AVLIQNAAKLHTVVSQGCRPIGKPFIVTRAERNVIQELGGRPALLQLKELFDTLPTREQA 256
Query 239 LVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAG 298
LV L +G VV E+ QGDF++R ++G DP G+I I + ++VG T+QF VRD
Sbjct 257 LVQRKLHLGRVVSEYRDHFEQGDFLVRNVVGIDPQAGAIAIGDYIRVGQTVQFHVRDQDA 316
Query 299 ADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFA 358
AD +L+ + A + G GALLFTCNGRG RMF HDA+ I E LG IPLAGFFA
Sbjct 317 ADAELKQLLAVAKSGAAGVPVGALLFTCNGRGSRMFKEPHHDAACIAEKLGDIPLAGFFA 376
Query 359 AGEIGPIAGRNALHGFTASMALF 381
AGEIGPI G+N +HGFTAS+ +F
Sbjct 377 AGEIGPIGGQNFVHGFTASIVIF 399
>gi|284044707|ref|YP_003395047.1| hypothetical protein Cwoe_3254 [Conexibacter woesei DSM 14684]
gi|283948928|gb|ADB51672.1| domain of unknown function DUF1745 [Conexibacter woesei DSM 14684]
Length=385
Score = 265 bits (678), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 165/382 (44%), Positives = 216/382 (57%), Gaps = 3/382 (0%)
Query 2 RIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDP 61
RIG G+ T DAR A+EAA A LAGE +A++ + AH L V + + P
Sbjct 4 RIGTGISTHGDARVGAIEAAHAAGVALAGERADVAIVFAAGAHLAAPEATLEGVHEALRP 63
Query 62 PALVGCIAQAIVAGRHEIEDEPAVVVWLAS--GLAAETFQLDFVRTGSGALITGYRFDRT 119
P L+GC A ++ E E AV VW AS A TF + +TG D
Sbjct 64 PELIGCGAGGVLGCGAEHEGGTAVAVWAASLGDGHATTFHASAEQLDDSIAVTGME-DLA 122
Query 120 ARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVV 179
+LLPDP++FP++ L++ T PG +VGG+ S G T LF V SG V
Sbjct 123 GSRGAILLPDPFSFPTDALLQDLATRAPGVPIVGGLASARTAEGATALFHGERVCESGAV 182
Query 180 GVRLPGMRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERAL 239
GVR G+ +P VSQG P+G VT A+G +I EL GRP L +RE++E L ER L
Sbjct 183 GVRFDGVELLPCVSQGATPVGPEMTVTAAEGNVIAELAGRPALDHIRELIEQLDAREREL 242
Query 240 VSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGA 299
V+ GL +G+V+D GDF++RGLLGADP G+I I V+ G ++ RDAA A
Sbjct 243 VAGGLLVGVVLDGGKPEYSHGDFLVRGLLGADPVAGTIAIAAPVEPGQVLRLHARDAAEA 302
Query 300 DKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAA 359
D+D + L G AGAL F+C+ RGR MFGVADHDA + + L G P AGFFAA
Sbjct 303 DRDFHDQLRVRVEALGGAPAGALAFSCHSRGREMFGVADHDAGMLADELAGAPSAGFFAA 362
Query 360 GEIGPIAGRNALHGFTASMALF 381
GEIGP+ G + +H FTA++ALF
Sbjct 363 GEIGPVGGASFMHSFTATVALF 384
>gi|271969747|ref|YP_003343943.1| hypothetical protein Sros_8558 [Streptosporangium roseum DSM
43021]
gi|270512922|gb|ACZ91200.1| conserved hypothetical protein [Streptosporangium roseum DSM
43021]
Length=398
Score = 259 bits (661), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 147/330 (45%), Positives = 200/330 (61%), Gaps = 4/330 (1%)
Query 55 VLQMIDPPALVGCIAQAIVAGRHEIEDEPAVVVWLAS--GLAAETFQLDFVRTGSGALIT 112
V+ M +++GC A ++ IE P+V VW A+ G TF LD +RT ++
Sbjct 58 VMSMASDASVIGCSATGVIGDGQGIEVTPSVSVWAATLEGARLTTFALDTLRTDDRFVVV 117
Query 113 GYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHD 172
G +L DPY+FP++ +E L ++GG+ + + RG RLF D +
Sbjct 118 GLPERHPDDHAAILFADPYSFPTDGFVERSQEVLGDLPLIGGLANAIQGRGAVRLFADGE 177
Query 173 VLTSGVVGVRLPGMRGVP-VVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEG 231
+ T G VGV L G + VVSQGCRPIG VT + L+ EL G+P L RL EIV
Sbjct 178 IYTEGAVGVLLSGPVNISTVVSQGCRPIGPTMAVTAVEDNLLLELAGQPALARLEEIVSA 237
Query 232 LSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQF 291
L D+R LV+ GLQIGI +DE+ +GDF+IRG+LG DP ++ I +VV++G T++F
Sbjct 238 LDEDDRDLVASGLQIGIAMDEYAERHERGDFLIRGVLGIDPEREAVAIGDVVEIGRTVRF 297
Query 292 QVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGI 351
QVRDAA AD+DL ++ GR GALLF+CNGRG MFG ADHDA + + LG I
Sbjct 298 QVRDAATADEDLYELLDAHREEF-GRVDGALLFSCNGRGSAMFGTADHDAVALRDTLGPI 356
Query 352 PLAGFFAAGEIGPIAGRNALHGFTASMALF 381
+AGFFAAGE+GP+ G N +HGFTAS+ +F
Sbjct 357 SVAGFFAAGEVGPVGGHNHVHGFTASVLVF 386
>gi|325111105|ref|YP_004272173.1| hypothetical protein Plabr_4580 [Planctomyces brasiliensis DSM
5305]
gi|324971373|gb|ADY62151.1| domain of unknown function DUF1745 [Planctomyces brasiliensis
DSM 5305]
Length=407
Score = 254 bits (648), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/390 (37%), Positives = 219/390 (57%), Gaps = 12/390 (3%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
++I V T + +A E ++L G P L L S H D + + + ++
Sbjct 1 MKIHVQYSTEAETPRAVDEVVNGLLEKLDGAHPELTFLFVSHHHEDHFSTLAGQIRSRLN 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLA--SGLAAETFQLDFVRTGSGALI------T 112
LVG A+ IVAG E+E+ P +V ++ SG + F ++F R L
Sbjct 61 SKHLVGSTAEGIVAGDRELEERPGLVAYVIADSGAVIQPFHMEFQRDDEQILCFGGPENI 120
Query 113 GYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHD 172
G D A L +PY+ + + + + + GGV SGG G+ LF D +
Sbjct 121 GSEGDNGAV---FLFCEPYSSSAPVALPELSESQGHLPIFGGVASGGIGPGENCLFLDGE 177
Query 173 VLTSGVVGVRLPGMRGV-PVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEG 231
+ G +GV + + +VSQGCRPIGY +++T ++ +I ELGG P +Q+ RE+ +
Sbjct 178 KIDHGAIGVVYRCKQKLRQIVSQGCRPIGYTFVITKSEKNIIYELGGLPAMQQFREMFKE 237
Query 232 LSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQF 291
L+ D++ LV G +G+V +E+ +GDF++ +LG+DP +G+I + + V+ G T+QF
Sbjct 238 LTEDDQELVRQGPHLGVVTNEYKEIFERGDFLVSNVLGSDPESGAIAVSQAVRPGRTVQF 297
Query 292 QVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGI 351
VRDA AD+DLRL +E+ + + G+LLFTCNGRG ++FG A+HD I++ G I
Sbjct 298 HVRDAITADEDLRLMIEQDKSYHSNKVIGSLLFTCNGRGEKLFGAANHDVKAIQDAYGPI 357
Query 352 PLAGFFAAGEIGPIAGRNALHGFTASMALF 381
P AGFFA GEIGP+A R+ LHGFTAS+ LF
Sbjct 358 PTAGFFAQGEIGPLADRSYLHGFTASIVLF 387
>gi|302035705|ref|YP_003796027.1| hypothetical protein NIDE0322 [Candidatus Nitrospira defluvii]
gi|300603769|emb|CBK40101.1| conserved exported protein of unknown function [Candidatus Nitrospira
defluvii]
Length=408
Score = 250 bits (639), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 161/391 (42%), Positives = 222/391 (57%), Gaps = 10/391 (2%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+R + D + AA E R++L +A L S H D+A + A+ +
Sbjct 9 LRFASALTRHADVQTAADELIRSIREQLGSSRIDVAFLFISVQHADQAETLSHALRTALG 68
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLAS--GLAAETFQLDFVRTGSGALITGY---R 115
P LVGC + ++A E+E PA +W A G+ A +L F + +
Sbjct 69 PDTLVGCTGEGVIATGREVETGPAATLWAAHLPGVIAHPLRLSFSSVHDQFSLRDWPDLD 128
Query 116 FDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLT 175
+ + + LL DP++ P ++ P +GG+ GG+ + RLF D +V +
Sbjct 129 YGGESAPVMLLFADPFSTPLQDVLGLIEERYPHARALGGLAGGGQDLAENRLFLDDEVYS 188
Query 176 SGVVGVRLPGMRGV-PVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSP 234
G+VGV L G V V+SQGCRPIG +IVT A+ +I ELGG P L L+ + LS
Sbjct 189 DGLVGVALSGNISVRTVISQGCRPIGDRFIVTKAEHNVIQELGGIPALHCLQTVFGQLSM 248
Query 235 DERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVR 294
DERA L IGI +DE A +GDF+IR LLGAD TG+I + +V+Q G T+QFQVR
Sbjct 249 DERAQAQRALHIGIAMDEQRAQFTRGDFLIRNLLGADQQTGAIVVGDVIQEGQTVQFQVR 308
Query 295 DAAGADKDLRLTVERAAARL--PGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIP 352
DA AD+DL + AA+RL R GALLF+C GRG+ +FGV +HDAS + E LG IP
Sbjct 309 DAQSADEDLHALL--AASRLDESQRPLGALLFSCCGRGKGLFGVPNHDASVLGEQLGAIP 366
Query 353 LAGFFAAGEIGPIAGRNALHGFTASMALFVD 383
LAGFFA GE+GP+ GRN LHG+TAS+A+F +
Sbjct 367 LAGFFAQGELGPVGGRNFLHGYTASIAIFSE 397
>gi|87306450|ref|ZP_01088597.1| hypothetical protein DSM3645_08962 [Blastopirellula marina DSM
3645]
gi|87290629|gb|EAQ82516.1| hypothetical protein DSM3645_08962 [Blastopirellula marina DSM
3645]
Length=395
Score = 249 bits (636), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 149/389 (39%), Positives = 214/389 (56%), Gaps = 13/389 (3%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAP-SLAVLLGSRAHTDRAADVLSAVLQMI 59
++ + T A + +A ++L+ AP LA + S H D+ + + + ++
Sbjct 6 LKFAAALSTHEATEDAIAQVVREALEQLS--APVDLAFVFVSPQHADKLETIATQLCGLL 63
Query 60 DPPALVGCIAQAIVAGRHEIEDEPAVVVWLAS--GLAAETFQLDFVRTGSGALITGYR-- 115
L G +AIV EIE PA+ +WLA G+ L+F RT G G+
Sbjct 64 GTENLFGGTGEAIVGVGREIEQAPAISLWLAHLPGVEVTPMHLEFQRTPDGGSFIGWSGK 123
Query 116 --FDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDV 173
LL+ +P++FP++ L+ N D PG ++GG+ SGG G+ L +V
Sbjct 124 LPLQWPKEATLLLMGEPFSFPADALLARMNEDQPGIPIIGGMASGGHAPGENLLVHGREV 183
Query 174 LTSGVVGVRLPG-MRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGL 232
+G + L G +R VVSQGCRPIG P ++T ++ I LGGRPPL+ +REI L
Sbjct 184 KKTGASAIYLHGAVRVRSVVSQGCRPIGEPMVITKSERNEIHLLGGRPPLEIIREIFAQL 243
Query 233 SPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQ 292
++ LV+ GL IG VVDE+ GDF+IR ++G + TG I + + V+ G T+QF
Sbjct 244 PTSDQQLVNRGLHIGQVVDEYREKFEPGDFIIRNVIGVNQETGGIAVGDYVRPGQTIQFH 303
Query 293 VRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIP 352
VRD AD DL+ + A G+ GAL+FTCNGRG R+F HDA ++ G IP
Sbjct 304 VRDENSADADLK---QLLATESSGQPLGALVFTCNGRGTRLFSAPHHDAECLQAACGDIP 360
Query 353 LAGFFAAGEIGPIAGRNALHGFTASMALF 381
AG FA GE+GPIAG+N +HGFTAS+ALF
Sbjct 361 AAGIFAMGELGPIAGQNFMHGFTASLALF 389
>gi|297171923|gb|ADI22910.1| uncharacterized protein conserved in bacteria [uncultured Rhizobium
sp. HF0500_35F13]
Length=395
Score = 248 bits (632), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 145/389 (38%), Positives = 217/389 (56%), Gaps = 10/389 (2%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
R + + D +QA E Q R P L V+ S H + A + + + + +
Sbjct 7 TRFASALSESVDWQQAVDEVCSQVRGP-DDPPPDLVVMFFSSDHAEVAEQLAAEIHRRLQ 65
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVW--LASGLAAETFQLDFVRTGSGALITGYRFDR 118
AL+G A++++ E+E +PA+ +W G + +LDF RT G +I G+ D
Sbjct 66 CDALLGTSAESVLGRGQEVEQQPALSLWAGWLPGASLLPMKLDFERTPEGGVILGWP-DD 124
Query 119 TARDLH-----LLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDV 173
+D L+L DP++FP LL+E N D PG + GG+ SG G++RL D
Sbjct 125 LPQDWQDPAALLVLADPFSFPMELLLERFNADQPGMPICGGMASGCSVPGESRLVLAGDC 184
Query 174 LTSGVVGVRLPG-MRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGL 232
++ G V VRL G ++ +VSQGCRPIG ++T ++ ++ +L G + RL+E+ + L
Sbjct 185 MSEGAVAVRLGGELKIRTLVSQGCRPIGEHMVITQSEHNVVQQLRGESAMLRLKEVFDRL 244
Query 233 SPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQ 292
+++ V GL +G VV E+ QGDF+IR ++G DP G+I + + ++ G T+QF
Sbjct 245 PANDQERVQQGLFLGRVVSEYQDDFEQGDFLIRNVIGMDPEQGTITVADYMRAGQTVQFH 304
Query 293 VRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIP 352
+RD A +L + A + AG LLFTCNGRG R+F HDA+ +++ L IP
Sbjct 305 IRDQETASAELVQLLSSLQADDSFQPAGGLLFTCNGRGSRLFDTPHHDATMVQQHLADIP 364
Query 353 LAGFFAAGEIGPIAGRNALHGFTASMALF 381
LAGFFA GEIGPI G N LHGFTAS+ LF
Sbjct 365 LAGFFAQGEIGPIGGENFLHGFTASVILF 393
>gi|296271068|ref|YP_003653700.1| hypothetical protein Tbis_3113 [Thermobispora bispora DSM 43833]
gi|296093855|gb|ADG89807.1| domain of unknown function DUF1745 [Thermobispora bispora DSM
43833]
Length=397
Score = 241 bits (615), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 153/384 (40%), Positives = 211/384 (55%), Gaps = 5/384 (1%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
R G+ D +AA A +A L+G P L D V+ M
Sbjct 6 CRFADGLAVGGDLEEAAETAVRRALAGLSG-PPDLLCFFICGQDPDEVGRAGLRVMDMAP 64
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLAS--GLAAETFQLDFVRTGSGALITGYRFDR 118
++GC A ++ G IE PAV A A TF L+ RT ++ G
Sbjct 65 TAEVIGCSATGVIGGDRGIELRPAVSALAACFGEAAVTTFALETFRTEDRFVVVGLPERG 124
Query 119 TARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGV 178
A +L DPY+FP + +E + G +VGG+ +G + G RLF +V T G
Sbjct 125 PADRAMILFTDPYSFPVDAFVERSGEVIGGLPIVGGLANGWQGPGSVRLFAGGEVYTEGA 184
Query 179 VGVRLPGMRGVP-VVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDER 237
VG + G V +VSQGCRPIG +VT A L+ EL G P L RL +IV L ++R
Sbjct 185 VGAVISGPVNVTAMVSQGCRPIGPSMVVTRAQENLLLELAGEPALARLEDIVSALDEEDR 244
Query 238 ALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAA 297
LV+ GLQIG+V+DE+ +GDF+IRG++G DP S+ I +++++G T++FQVRDA
Sbjct 245 ELVAAGLQIGVVMDEYAERQERGDFLIRGVIGIDPERESVAIGDMLEIGRTVRFQVRDAE 304
Query 298 GADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFF 357
AD+DLR ++ + GRA GALL CNGRG MFG ADHD + E LG I +AGFF
Sbjct 305 TADEDLRAILDEHKPMI-GRAEGALLICCNGRGSAMFGTADHDPVAVREALGPIGVAGFF 363
Query 358 AAGEIGPIAGRNALHGFTASMALF 381
AAGE+GP+AG N +HG +A++ +F
Sbjct 364 AAGEVGPVAGHNHVHGCSAALLVF 387
>gi|72160848|ref|YP_288505.1| hypothetical protein Tfu_0444 [Thermobifida fusca YX]
gi|71914580|gb|AAZ54482.1| conserved hypothetical protein [Thermobifida fusca YX]
Length=412
Score = 239 bits (611), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 159/385 (42%), Positives = 209/385 (55%), Gaps = 6/385 (1%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
R + T D AA A QA + L G A + V + S + A + + +
Sbjct 28 TRFSDALATGVDLVSAAERATRQALERLDGPADLVCVFV-SGIDPEEVALAGERAMALAE 86
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLA--SGLAAETFQLDFVRTGSGALITGYRFDR 118
+GC A ++ G E + AV VW A G+ F+L + G + G
Sbjct 87 GATTIGCSAGGVIGGGRGTEGQGAVSVWAAMLPGVTMTPFELAAIAEGDQLAVIGVLEPT 146
Query 119 TARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGV 178
A LLL +PY FP++ +EH NT L G +VGG+ G RLF + + +G
Sbjct 147 PADQAALLLANPYVFPTHTFVEHSNTILDGLPIVGGLADGTYGGDSVRLFLQGETVQAGA 206
Query 179 VGVRLPGMRGV--PVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDE 236
VG+ L G GV VVSQGCRPIG +VT A+ ++ EL G P +L IV L P+E
Sbjct 207 VGL-LFGGNGVLGTVVSQGCRPIGPSMVVTKAEDNVLIELAGTPAYAKLESIVSALPPEE 265
Query 237 RALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDA 296
+ LV+ GL IGI +DE+ GDF+IRG+L ADP +I I +VV VG T++FQVRD
Sbjct 266 QQLVADGLHIGIAIDEYADRHESGDFLIRGVLDADPEQSTITIGDVVDVGQTVRFQVRDQ 325
Query 297 AGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGF 356
A AD DL + A G A GALLF+CNGRG MF ADHD ++++LG + GF
Sbjct 326 ATADSDLLERLRLFAHDTGGTAEGALLFSCNGRGSGMFPSADHDVRRVQQILGIDAVGGF 385
Query 357 FAAGEIGPIAGRNALHGFTASMALF 381
FAAGEIGP+AGRN LHGFTA M F
Sbjct 386 FAAGEIGPVAGRNHLHGFTACMLAF 410
>gi|296121655|ref|YP_003629433.1| hypothetical protein Plim_1400 [Planctomyces limnophilus DSM
3776]
gi|296013995|gb|ADG67234.1| domain of unknown function DUF1745 [Planctomyces limnophilus
DSM 3776]
Length=398
Score = 238 bits (607), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 151/394 (39%), Positives = 218/394 (56%), Gaps = 17/394 (4%)
Query 2 RIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDP 61
R T +A + A + + +L G P L ++ S + D ++ + ++
Sbjct 7 RYAAAWTTEVSLVRAMEQVAIEIQSQLEGRHPDLLLVFCSHHYADAWQNLSAGLVSTTGA 66
Query 62 PALVGCIAQAIVAGRHEIEDEPAVVVWLAS--GLAAETFQLDFVRTGSGALITGY----- 114
L+GC ++IVA E+E+ PA+ +W AS G+ FQ F RT G + TG
Sbjct 67 KVLLGCSGESIVATGRELENGPALSIWAASWDGVGMIPFQATFERTPDGIVTTGLPQGVN 126
Query 115 -RFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDH-- 171
AR ++L DPY+ ++L+ +H DLP V+GG+ SGG + RLF H
Sbjct 127 GLLQGNAR-CAIVLADPYSSLTDLITDHLAEDLPNLPVIGGMASGGGPG-ENRLFYAHKA 184
Query 172 ---DVLTSGVVGVRLPG-MRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLRE 227
V G +GV L G + PVVSQGC+P+G Y+VT AD I ELGG PPL RL +
Sbjct 185 IEPQVFEEGAIGVILSGNLTFTPVVSQGCKPVGTTYVVTKADRNFIVELGGEPPLARLEQ 244
Query 228 IVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGA 287
+ LS ++ L+ +GL +G+ + E+ +GDF+I ++GAD +TG + I +VG
Sbjct 245 LYADLSATDQRLIENGLHLGLAMTEYRDQFRRGDFLIANVIGADRNTGVLAIGGKARVGQ 304
Query 288 TMQFQVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEEL 347
T+QF +RD A +DL ++ A + P A ALLFTCNGRG R+F HDA +EE
Sbjct 305 TVQFHLRDHVTASEDLVEMLKTARSSHPAPQA-ALLFTCNGRGTRLFSAPHHDAQKLEEF 363
Query 348 LGGIPLAGFFAAGEIGPIAGRNALHGFTASMALF 381
G IP+AGFFA GE+G + +N LHGFTAS+ LF
Sbjct 364 FGSIPVAGFFAQGELGQVGTKNFLHGFTASIGLF 397
>gi|306796379|ref|ZP_07434681.1| hypothetical protein TMFG_03296 [Mycobacterium tuberculosis SUMu006]
gi|308343156|gb|EFP32007.1| hypothetical protein TMFG_03296 [Mycobacterium tuberculosis SUMu006]
Length=181
Score = 228 bits (582), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 143/181 (80%), Positives = 159/181 (88%), Gaps = 0/181 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVG 180
DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLFRD DVLTSG+VG
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLFRDRDVLTSGLVG 180
Query 181 V 181
V
Sbjct 181 V 181
>gi|117929098|ref|YP_873649.1| hypothetical protein Acel_1891 [Acidothermus cellulolyticus 11B]
gi|117649561|gb|ABK53663.1| domain of unknown function DUF1745 [Acidothermus cellulolyticus
11B]
Length=391
Score = 228 bits (581), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 139/321 (44%), Positives = 187/321 (59%), Gaps = 5/321 (1%)
Query 64 LVGCIAQAIVAGRHEIEDEPAVVVW--LASGLAAETFQLDFVRTGSGALITGYRFDRTAR 121
++GC A ++ +E A VW + G+ F L+ +RT G + G A
Sbjct 65 VIGCSASGVIGAGRAVERRAAASVWAGVLPGVRIRAFHLEVIRTPQGMAVLGLPPVDDAD 124
Query 122 DLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGV 181
L ++L DPY+FP++ +E N + +VGG+ G G TRL D + G VGV
Sbjct 125 VLGIVLADPYSFPADGFVEQANRTV-SVPLVGGMAFGAAGPGSTRLSLDRRSVERGAVGV 183
Query 182 RLPGMRGV-PVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALV 240
L G GV VSQGCRPIG P VT A ++ EL G P +++L ++ LS +++AL
Sbjct 184 LLGGPVGVRTAVSQGCRPIGPPMTVTAARDNVLLELAGMPAVRKLERVLAELSAEDQALA 243
Query 241 SHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGAD 300
S GLQIGI +DE+ GDF++RG+LG DP+ I I +VV VG T++F VRDAA A
Sbjct 244 SAGLQIGIAMDEYAEDHDMGDFLVRGILGIDPARQGIAIGDVVPVGRTVRFHVRDAASAG 303
Query 301 KDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
DLR TV+R ALLF+CNGRG +F A HD S + +LG +AGFFAAG
Sbjct 304 DDLRSTVKRLREEFTA-VESALLFSCNGRGSHLFPDAAHDVSVVRGVLGVQAVAGFFAAG 362
Query 361 EIGPIAGRNALHGFTASMALF 381
EIGP+AGR LHGF+AS+A F
Sbjct 363 EIGPVAGRTYLHGFSASIAAF 383
>gi|269125309|ref|YP_003298679.1| hypothetical protein Tcur_1055 [Thermomonospora curvata DSM 43183]
gi|268310267|gb|ACY96641.1| domain of unknown function DUF1745 [Thermomonospora curvata DSM
43183]
Length=389
Score = 223 bits (568), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 150/392 (39%), Positives = 209/392 (54%), Gaps = 23/392 (5%)
Query 2 RIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSR---------AHTDRAADVL 52
R G G+ PD AA A QA + L+ + V L R AD
Sbjct 3 RFGDGLALGPDLIGAAESAVKQALEPLSAPPDLVCVFLACEDVGAVGEAARRAMRVADAA 62
Query 53 SAVLQMIDPPALVGCIAQAIVAGRHEIEDEPAVVVW--LASGLAAETFQLDFVRTGSGAL 110
A L ++GC ++ G +E+ AV W + G E F+L+ +R +
Sbjct 63 GARL-------VIGCNGSGVIGGDRGVEETSAVSAWAGVLPGAHLEPFRLETLRAEDRLV 115
Query 111 ITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRD 170
+ G + +LL DPY+FP + +E LPG +VG + G TRL D
Sbjct 116 VVGMPEGSDEDVVAVLLADPYSFPVDAFVERSEEALPGLPMVGALAGGQGAG-RTRLLLD 174
Query 171 HDVLTSGVVGVRLPG-MRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIV 229
+V G VGV L G + VVSQG RPIG +VT AD ++ EL G P L++L +IV
Sbjct 175 GEVYDDGAVGVVLGGPISAATVVSQGARPIGPDMVVTKADENVLYELAGTPALEKLEQIV 234
Query 230 EGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATM 289
L +E+ + S GL IG+ +DE+ GDF++RG++GAD TG+I I +VV+VG T+
Sbjct 235 LALPEEEQQMASQGLLIGVAMDEYAEQHEHGDFLVRGVVGADADTGAIAIGDVVEVGRTV 294
Query 290 QFQVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLG 349
+FQVRDA A++DL ++R + GALLF+CNGRGR MF +DHD + G
Sbjct 295 RFQVRDAEAAEEDLTALLQRFDLK---PVEGALLFSCNGRGRAMFPDSDHDVKLLRRTFG 351
Query 350 GIPLAGFFAAGEIGPIAGRNALHGFTASMALF 381
+ GFFAAGEIGP++GRN +HGFTAS+ F
Sbjct 352 PAGVGGFFAAGEIGPVSGRNHVHGFTASILAF 383
>gi|297559074|ref|YP_003678048.1| hypothetical protein Ndas_0091 [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
gi|296843522|gb|ADH65542.1| domain of unknown function DUF1745 [Nocardiopsis dassonvillei
subsp. dassonvillei DSM 43111]
Length=383
Score = 221 bits (564), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 136/330 (42%), Positives = 190/330 (58%), Gaps = 5/330 (1%)
Query 55 VLQMIDPPALVGCIAQAIVAGRHEIEDEPAVVVWLAS--GLAAETFQLDFVRTGSGALIT 112
V+++ A +GC + ++ G +E + +V VW A G+ F+LD V +
Sbjct 55 VMELAGDAATLGCSSTGVIGGGRSVEGQGSVSVWCAGLPGVEITPFRLDTVVEDDHLAVI 114
Query 113 GYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHD 172
G + + +LL +PY FP+ + L G +VGG+ G R RLF D +
Sbjct 115 GMQEPGPRDSVAILLTNPYEFPTQAFVRESTEALGGLPLVGGMADGMRGEESVRLFCDGE 174
Query 173 VLTSGVVGVRLPGMRGV-PVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEG 231
V G +GV + G + VVSQGCRPIG P VT A+G L+ EL G ++L E+VE
Sbjct 175 VAEHGAIGVLVGGENVLGTVVSQGCRPIGSPMTVTKAEGNLLLELAGTNAYEKLEELVES 234
Query 232 LSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQF 291
LS ++R L HGL IGI +DE++ QGDF+IR L GADP G++ ID++V+VG T++F
Sbjct 235 LSEEDRELAEHGLHIGIAMDEYVDRHEQGDFLIRTLAGADPELGALTIDDMVEVGQTVRF 294
Query 292 QVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGI 351
QVRDA AD+DL + A P LLF+CNGRG +F +DHD + +LG
Sbjct 295 QVRDAGTADEDLARRLSDFGAEHP--VGAGLLFSCNGRGSSLFPQSDHDVLAVHRVLGVD 352
Query 352 PLAGFFAAGEIGPIAGRNALHGFTASMALF 381
+AGFFAAGEIGP+ G N +HGFTA + F
Sbjct 353 AVAGFFAAGEIGPVGGVNHVHGFTACLLAF 382
>gi|223939736|ref|ZP_03631608.1| protein of unknown function DUF1745 [bacterium Ellin514]
gi|223891607|gb|EEF58096.1| protein of unknown function DUF1745 [bacterium Ellin514]
Length=396
Score = 216 bits (549), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 135/382 (36%), Positives = 206/382 (54%), Gaps = 9/382 (2%)
Query 12 DARQAAVEA-AGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQ 70
+ +AA +A A + R EL SL ++ S +A +L + P L GC +
Sbjct 14 EFEEAAFQAWARKLRAELHAPKVSLGLVFMSPKMFPQAEQILEILRVDGQIPLLAGCSSN 73
Query 71 AIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRT----GSGALITGYRFDRTARDLH-- 124
+++ G HE ED+ +VV L S AE F + GSG ++ T +
Sbjct 74 SLITGVHEFEDDGGLVVALYSLPGAELKAFRFTQADLEQGSGRAYWQHKTGVTPEQTNGW 133
Query 125 LLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLP 184
L DP+ + N ++GG+ SG + T+L+ + +V G V + +
Sbjct 134 LAFADPFNMDCEAWLGSWNEAYAPAPILGGLASGEQTTQQTQLYLNGEVYEEGGVAISIG 193
Query 185 G-MRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHG 243
G ++ V V+SQGC PIG + +T + LI E+G RP + L E L+ DE+
Sbjct 194 GDVKLVGVISQGCTPIGDTWTLTKVEKNLIQEIGNRPAFEVLAETFGTLTQDEQQASRGN 253
Query 244 LQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDL 303
L IG+V++E+L +GDF++R L+G DP +G I + + ++G T+QFQ RDAA A +D+
Sbjct 254 LFIGLVMNEYLEEYHRGDFLVRNLIGVDPQSGIIAVGALPRLGQTIQFQRRDAAAATEDM 313
Query 304 RLTVERAAARLPGRAA-GALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEI 362
+ + RA +L G G L +CNGRG+ +FG DHDA I+E+LG + ++GFF GEI
Sbjct 314 KALLARARKQLAGATVYGGCLCSCNGRGQGLFGEPDHDAKMIQEMLGPVGMSGFFCNGEI 373
Query 363 GPIAGRNALHGFTASMALFVDD 384
GP+ RN LHG+TAS+ALFV
Sbjct 374 GPVGERNFLHGYTASLALFVKK 395
>gi|289744343|ref|ZP_06503721.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289684871|gb|EFD52359.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=168
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 132/168 (79%), Positives = 147/168 (88%), Gaps = 0/168 (0%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+RIGVGV T PD R+AA EAA AR+ELAG P+LAVLLGSR+HTD+A D+L+AV ++
Sbjct 1 MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVE 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRTA 120
P AL+GC+AQ IVAGRHE+E+EPAV VWLASG AETF LDFVRTGSGALITGYRFDRTA
Sbjct 61 PAALIGCVAQGIVAGRHELENEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTA 120
Query 121 RDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLF 168
DLHLLLPDPY+FPSNLLIEH NTDLPGT VVGGVVSGGRRRGDTRLF
Sbjct 121 HDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGRRRGDTRLF 168
>gi|320103039|ref|YP_004178630.1| hypothetical protein Isop_1496 [Isosphaera pallida ATCC 43644]
gi|319750321|gb|ADV62081.1| domain of unknown function DUF1745 [Isosphaera pallida ATCC 43644]
Length=401
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 135/334 (41%), Positives = 184/334 (56%), Gaps = 20/334 (5%)
Query 64 LVGCIAQAIVAGRHEIEDEPAVVVW---LASGLAAETFQLDFVRTGSGALITGYRFD--- 117
++G A+++ E+E PA+ W L G +TF+L G + R D
Sbjct 62 VIGVTAESVAGVAREVEGLPALTAWAIQLPEGSRCDTFRLTSSEAPLGDWVDSVRIDPAP 121
Query 118 --------RTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFR 169
+ L +LL DP++F ++ + G V+GG+ SG R G RL
Sbjct 122 VSRVSLTEKDKNKLVILLADPFSFAADEWFSRLEEEKIGLRVIGGMASGANRPGGNRLVI 181
Query 170 DHDVLTSGVVGVRLPG-MRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREI 228
D V+ G VGV L G VVSQGCRPIG ++VT D ++ ELG RP ++ LRE
Sbjct 182 DGAVVQQGAVGVALSGPFVAETVVSQGCRPIGRHFVVTKVDRNILHELGRRPVIEVLREQ 241
Query 229 VEGLSPDERA-LVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGA 287
+E LS E A L + GL IG V++E+ +GDF+IR ++G S+ I ++ +VG
Sbjct 242 LETLSDAETAKLRNGGLHIGRVINEYQERFERGDFLIRNVIGI-AEEQSLAISDLPRVGQ 300
Query 288 TMQFQVRDAAGADKDLRLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEEL 347
T+QFQ+RDA AD+DL + R L G GAL+FTCNGRG R+F HDA +
Sbjct 301 TVQFQLRDAQTADEDLTDLLGRP--ELKG-TKGALMFTCNGRGTRLFDQPHHDAQALANA 357
Query 348 LGGIPLAGFFAAGEIGPIAGRNALHGFTASMALF 381
+G IP AGFFA GE GP+ GRN +HGFTAS ALF
Sbjct 358 VGPIPAAGFFAMGEFGPVGGRNFIHGFTASFALF 391
>gi|149923652|ref|ZP_01912048.1| hypothetical protein PPSIR1_16925 [Plesiocystis pacifica SIR-1]
gi|149815467|gb|EDM75004.1| hypothetical protein PPSIR1_16925 [Plesiocystis pacifica SIR-1]
Length=409
Score = 207 bits (527), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 139/403 (35%), Positives = 203/403 (51%), Gaps = 22/403 (5%)
Query 1 VRIGVGVCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMID 60
+R + +P A ++L + P L + +R H R ++ A+ Q
Sbjct 1 MRWAASIDNSPTLEVALARGEESLSEQLGDQRPDLVLAFATRDHQARWHEIPEALRQRFP 60
Query 61 PPALVGCIAQAIVAGRHEIEDEPAVVVWLAS--GLAAETFQLDF-----VRTGSGALITG 113
A+VGC A ++A E+ED P + + A G+ F +D + GSG
Sbjct 61 DAAVVGCSAGGVLANGTELEDGPGLALCAARLPGVERTPFHIDAEALEALVGGSGDSGES 120
Query 114 YRFDRTAR------------DLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRR 161
R D AR L +L PDP+++P ++ + P VVGG+ SGG R
Sbjct 121 ERDDLRARWLAAIGIAEGPDPLLMLFPDPFSWPGPEVLGSLDRAFPQGTVVGGLASGGAR 180
Query 162 RGDTRLFRDHDVLTSGVVGVRLPGMRGVP-VVSQGCRPIGYPYIVTGADGILITELGGRP 220
G+ RLF D G+VG+ L G V +V+QGCRP+G P VT ++ EL GRP
Sbjct 181 PGEHRLFCDRSTHHRGMVGLALRGNLEVETIVAQGCRPVGAPMFVTRRQANIVYELDGRP 240
Query 221 PLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEID 280
++ L+++ L PD+RA L IG+ + L QGDF++R L+G DPS+G++ I
Sbjct 241 AVEALQQLFTTLEPDDRARARTSLLIGLSMHPQLEVHDQGDFLVRNLIGVDPSSGAVGIA 300
Query 281 EVVQVGATMQFQVRDAAGADKDLR-LTVERAAARLPGRAAGALLFTCNGRGRRMFGVADH 339
+ +QF +RDA A +L L E A ALLF+C GRG ++G H
Sbjct 301 AELHGHPVVQFHLRDAQTAASELHDLAAEHQRIHGERAPAVALLFSCLGRGEHLYGRTGH 360
Query 340 DASTIEELLGG-IPLAGFFAAGEIGPIAGRNALHGFTASMALF 381
D+ + E LG +PLAGFF GEIGPIAGR +HG+T+S+ L
Sbjct 361 DSEVLREHLGATLPLAGFFCNGEIGPIAGRTFMHGYTSSILLL 403
>gi|294055462|ref|YP_003549120.1| hypothetical protein Caka_1932 [Coraliomargarita akajimensis
DSM 45221]
gi|293614795|gb|ADE54950.1| domain of unknown function DUF1745 [Coraliomargarita akajimensis
DSM 45221]
Length=402
Score = 197 bits (502), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 122/374 (33%), Positives = 188/374 (51%), Gaps = 9/374 (2%)
Query 21 AGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIVAGRHEIE 80
+ Q R EL G A + A++ S+ H D +D++ V P +VGC ++A EIE
Sbjct 26 SAQQRRELGGPA-TFALIFCSQEHVDDISDLIEIVQIYAHVPTVVGCSGVGLIANSDEIE 84
Query 81 DEPAVVVWLASGLAAETFQLDFVRTGSGALITGYRFDRT------ARDLHLLLPDPYTFP 134
++ V + L + + G + T F R + +L +
Sbjct 85 NDAGVSIALYRLPGTQAIAHHIPTSCFGTVDTPASFKRDLGSSLDQANAWMLFASSESIG 144
Query 135 SNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGV-PVVS 193
+ + N G +GG S + LF + G V + L G + P+++
Sbjct 145 HDSWLPAWNQATGGKVTIGGFASSPSENPQSHLFLNGQHYQDGAVALSLEGHVTIEPLLT 204
Query 194 QGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEH 253
QGCRPIG P+IVT A+ LI ++G RP L+ LR+ +E +S D++ L + IG+V+DE+
Sbjct 205 QGCRPIGSPWIVTEAEHNLIHKIGNRPILEVLRDTLENMSDDDQQLAHGNIFIGLVLDEY 264
Query 254 LAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAAR 313
++ G GDF++R L DP TG+I I ++G +QFQ+RD A D+ ++R AR
Sbjct 265 KSSFGTGDFLVRNLAAIDPQTGAIAIATPPRIGQNLQFQIRDPHTAAIDMEELLKRKKAR 324
Query 314 LPG-RAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALH 372
L G R G L C GRG ++G + D S I+ L GIPL+G F GE + + LH
Sbjct 325 LQGRRIYGGCLCDCIGRGASLYGAPNQDVSAIQNALPGIPLSGIFCNGEFATVKQQTQLH 384
Query 373 GFTASMALFVDDME 386
G+ AS+ LFV+ E
Sbjct 385 GYAASLGLFVEKNE 398
>gi|86609276|ref|YP_478038.1| hypothetical protein CYB_1819 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86557818|gb|ABD02775.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length=441
Score = 196 bits (499), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 141/374 (38%), Positives = 200/374 (54%), Gaps = 25/374 (6%)
Query 33 PSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIVAGRHEIEDEPAVVVWLA-- 90
P+L VL S A VL + +++ L+GC IV G HEIED PA+ + LA
Sbjct 64 PNLGVLFVSAAFASEYIRVLPLLSGLLEVDVLIGCSGGGIVGGGHEIEDGPALSLSLAVM 123
Query 91 SGLAAETF-----QLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTD 145
+ F QL + A + ++ LLL D ++ + L++ +
Sbjct 124 PEVVLHPFHLRGNQLPDLDAAPSAWVDCVGVSPQSKPHFLLLADGFSSGISELLQGLDFA 183
Query 146 LPGTAVVGGVVSGGR-RRGDTRLFRDHDVLT-------SGVVGVRLPGMRGV-PVVSQGC 196
PG+ VGG+ SGGR RG+ D LT G VG+ L G + VV+QGC
Sbjct 184 YPGSVKVGGLASGGRGPRGNALFLLDARTLTPRRELYREGTVGLALYGNVVLDAVVAQGC 243
Query 197 RPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAA 256
RPIG P VT A+G +I L GRPPL L+++ E LSP ++ L H L IG+++DE +
Sbjct 244 RPIGDPLRVTEAEGNVILGLEGRPPLAVLQDLAERLSPVDQRLARHSLFIGLLMDEFKSE 303
Query 257 PGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPG 316
P GDF+IR +LG DP G++ I + V+ G T+QF +RDA + +DLR + R A
Sbjct 304 PTPGDFLIRVILGVDPRVGALAIGDQVRPGQTVQFHLRDAQTSAEDLRWALSRYCAERNL 363
Query 317 RAA---------GALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIGPIAG 367
R + GAL+F+C GRG+ ++G D D+ ELLG +PL GFF GEIGP+ G
Sbjct 364 RQSPSQPRPEPCGALMFSCLGRGKGLYGTPDFDSQRFRELLGELPLGGFFCNGEIGPVGG 423
Query 368 RNALHGFTASMALF 381
LHG+T+ +F
Sbjct 424 STFLHGYTSCFGIF 437
>gi|262196432|ref|YP_003267641.1| hypothetical protein Hoch_3246 [Haliangium ochraceum DSM 14365]
gi|262079779|gb|ACY15748.1| domain of unknown function DUF1745 [Haliangium ochraceum DSM
14365]
Length=396
Score = 196 bits (499), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 128/378 (34%), Positives = 189/378 (50%), Gaps = 9/378 (2%)
Query 7 VCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVG 66
V T A EA +L G AP L V + D ++ V + L+G
Sbjct 7 VANTAHLEDALDEAVEHIDADLNGAAPDLMVAFAHNDYGDHLQRLVEVVRERYPGVVLLG 66
Query 67 CIAQAIVAGRHEIEDEPAVVVWLA--SGLAAETFQLDFVRTGSGALITGYRFDRTARDLH 124
C A ++ G +EIE +PA+ + A G+ F LD + I G + +
Sbjct 67 CSADGVIGGGNEIEYQPALSLTAAVLPGVELVPFHLDGAPASWRSRI-GMQTGQPPS--F 123
Query 125 LLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLP 184
+L+PDP++ P + + P + +GG+ SG G T LF + SG VGV +
Sbjct 124 VLIPDPFSCPVEDTLRWFDAVYPNSPKIGGLASGAGMAGTTTLFAGGHLARSGAVGVAMR 183
Query 185 G-MRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHG 243
G + +V+QGCRPIG P VT D ++ EL GRP LQ + L+ ++ L H
Sbjct 184 GALEMRTLVAQGCRPIGAPMFVTRHDEDVVFELDGRPALQAIEATFASLASADQELFRHS 243
Query 244 LQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDL 303
L +G+V D G+GDF++R +LG DP G++ +D ++ +QF +RDAA + DL
Sbjct 244 LYLGVVTDRSKQVYGRGDFLVRNILGVDPELGAVAVDAELEDNQVVQFHLRDAATSAADL 303
Query 304 RLTVERAAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIG 363
+ P GAL+F C GRG+ ++G A+HD+ G +PL GFF GEIG
Sbjct 304 EHLLSTYDGPPP---RGALMFPCLGRGQALYGHANHDSDAFRARFGEVPLGGFFCNGEIG 360
Query 364 PIAGRNALHGFTASMALF 381
P GR +HG+T +MALF
Sbjct 361 PFGGRTFVHGYTTAMALF 378
>gi|153006881|ref|YP_001381206.1| hypothetical protein Anae109_4044 [Anaeromyxobacter sp. Fw109-5]
gi|152030454|gb|ABS28222.1| domain of unknown function DUF1745 [Anaeromyxobacter sp. Fw109-5]
Length=401
Score = 192 bits (488), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 131/362 (37%), Positives = 185/362 (52%), Gaps = 11/362 (3%)
Query 28 LAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIVAGRHEIEDEPAVVV 87
L G+ P L V S H + ++ + LVGC A ++ HE+ED PA+ +
Sbjct 28 LEGDPPDLLVAFVSPHHAGESEQLVDLAARRFPRALLVGCTAGGVIGDAHEVEDGPALSL 87
Query 88 WLA--SGLAAETFQLDFVRTGSGALITGYRFDRT-----ARDLHLLLPDPYTFPSNLLIE 140
A G+ F+ V G+ L R AR LLL DP+T L+E
Sbjct 88 TAAVLPGVELSPFR---VEPGAQPLDPSAWRARVGCPPEARPKLLLLADPFTVDIGALVE 144
Query 141 HPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLPGMRGV-PVVSQGCRPI 199
+ P GG+ SGGR RL DV +G VGV G V +++QGCR I
Sbjct 145 GLDGAYPAAPKFGGLASGGRGLDQNRLLVAEDVHRNGGVGVVFTGNLEVDTLIAQGCRAI 204
Query 200 GYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQ 259
G P +VT ++ EL GRPPLQ + E+ L P +R L+ L +G+ +
Sbjct 205 GAPMLVTRCQHGVLQELDGRPPLQVIAELYASLEPRDRELMQTSLFLGLELRSDEVEFQP 264
Query 260 GDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRAA 319
G+ ++R L+GAD TG++ + ++ +QF +RDA A+++LR + R GR A
Sbjct 265 GELLVRNLIGADEDTGALAVGAELRPLTVVQFVLRDAHSAEQELRRMLARHRRAATGRPA 324
Query 320 GALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHGFTASMA 379
GALLF+C GRG +FG DHD S EE LG PL GFF GEIGP+ G +HG+T++ A
Sbjct 325 GALLFSCVGRGAGLFGHPDHDTSLFEEQLGPAPLGGFFCNGEIGPVGGTTFVHGYTSAFA 384
Query 380 LF 381
+F
Sbjct 385 MF 386
>gi|86606541|ref|YP_475304.1| hypothetical protein CYA_1894 [Synechococcus sp. JA-3-3Ab]
gi|86555083|gb|ABD00041.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length=446
Score = 190 bits (482), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 138/379 (37%), Positives = 200/379 (53%), Gaps = 30/379 (7%)
Query 33 PSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIVAGRHEIEDEPAVVVWLA-- 90
P+L +L S A VL + ++++ L+GC IV G HEIE+ PA+ + LA
Sbjct 64 PNLGILFVSAAFASEYIRVLPLLSELLEVDVLIGCSGGGIVGGGHEIEEGPALSLSLAVL 123
Query 91 SGLAAETF-----QLDFVRTGSGALITGYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTD 145
+A F QL + A I ++ LLL D ++ + L++ +
Sbjct 124 PDVALHPFYLRGNQLPDLDAPPSAWIDLVGVLPQSKPHFLLLADGFSSRISELLQGLDFA 183
Query 146 LPGTAVVGGVVSGGR-RRGDTRLFRD-------HDVLTSGVVGVRLPGMRGV-PVVSQGC 196
PG VGG+ SGGR RG+ D ++ G VG+ L G + VV+QGC
Sbjct 184 YPGAVKVGGLASGGRGPRGNALFLLDARTPTPRRELYREGTVGLALSGNVVLDAVVAQGC 243
Query 197 RPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAA 256
RPIG P VT A+G +I L GRPPL L+++ E LSP ++ L L IG+++DE +
Sbjct 244 RPIGDPLRVTEAEGNVILSLEGRPPLAVLQDLAERLSPSDQRLARQALFIGLLMDEFKSE 303
Query 257 PGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAAR--- 313
P GDF+IR +LG DP G+I I + V+ G T+QF +RDA + +DLR + R A
Sbjct 304 PTSGDFLIRVILGIDPRVGAIAIGDRVRPGQTVQFHLRDAQTSAEDLRWALSRYCAERNL 363
Query 314 ---LPGRAA--------GALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAGEI 362
P + GAL+F+C GRG+ ++G + D+ ELLG +PL GFF GEI
Sbjct 364 QQSYPAERSSQPKPDPCGALMFSCLGRGKGLYGTPNFDSQRFRELLGELPLGGFFCNGEI 423
Query 363 GPIAGRNALHGFTASMALF 381
GP+ G LHG+T+ +F
Sbjct 424 GPVGGSTFLHGYTSCFGIF 442
>gi|37520395|ref|NP_923772.1| hypothetical protein gll0826 [Gloeobacter violaceus PCC 7421]
gi|35211388|dbj|BAC88767.1| gll0826 [Gloeobacter violaceus PCC 7421]
Length=407
Score = 188 bits (477), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 106/260 (41%), Positives = 158/260 (61%), Gaps = 3/260 (1%)
Query 125 LLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRLP 184
+L+ D +FP ++LI + P VGG+ SGG R G RLF + SG VGV L
Sbjct 140 VLMVDGSSFPVDVLIGGLDFAFPKAIKVGGLASGGNRPGQNRLFFGDQAVGSGAVGVVLA 199
Query 185 GMRGVPV-VSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSHG 243
G V V+QGCRP+G + +T A+G L+ EL G+P LQ L+ +++ L +++ L +
Sbjct 200 GDIAVEAAVAQGCRPVGETFQITRAEGNLLWELDGQPALQVLQTVLQQLDENDQRLARNA 259
Query 244 LQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKDL 303
L +G+ + E + QGDF++R L+G D TG + + E ++ G T++F +RDAA + DL
Sbjct 260 LFVGVRMSEFHSGSEQGDFLVRNLMGVDSRTGGLAVGEWLRTGQTVRFHLRDAATSRDDL 319
Query 304 RLTVERAAARLPGR-AAGALLFTCNGRGRRMFGVADHDASTIEELLG-GIPLAGFFAAGE 361
+L ++R G AGALLF+C GRG ++G D D++ ++LG G+PLAGFF GE
Sbjct 320 QLVLQRHRLEHSGAPPAGALLFSCLGRGESLYGEPDVDSTLFAQVLGEGVPLAGFFCNGE 379
Query 362 IGPIAGRNALHGFTASMALF 381
IGP+ LHG+T+S LF
Sbjct 380 IGPVGSTTFLHGYTSSFGLF 399
>gi|159028345|emb|CAO87243.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length=417
Score = 188 bits (477), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 132/404 (33%), Positives = 200/404 (50%), Gaps = 30/404 (7%)
Query 7 VCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVG 66
+ T P A E + +D+L G A +A++ S A+ ++ +L + P L+G
Sbjct 11 LSTRPSLEAAVTEVVEKVQDKLVGSA-DIAIIFISSAYASDYPRLVPLILDKLPVPVLIG 69
Query 67 CIAQAIVA-----GRHEIEDEPAVVVWLA-------SGLAAETFQLDFVRTGSGALITGY 114
C IV EIE PA+ + +A E ++ + + +
Sbjct 70 CGGAGIVGMGDREKAREIEASPALSLTVAHLPDVEVQPFYIEAAEMPDLDSSPSSWTELL 129
Query 115 RFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGG--RRRG-----DTRL 167
+ +LL DP++ N L+E + PG+A +GG+VSGG R G D +
Sbjct 130 GVEAAKNPQFILLADPFSSRINDLLEGLDFAYPGSAKIGGLVSGGMIERSGGLFYHDQQK 189
Query 168 FRDHDVLTSGVVGVRLPGMRGVP-VVSQGCRPIGYPYIVTGADGILITELGGR------- 219
R+ + G VG+ L G V +V+QGCRPIG Y V+ + +I + G+
Sbjct 190 PRNSYLYRQGTVGIALSGNIIVETIVAQGCRPIGPIYQVSEGERNIIISMTGKGADGTPQ 249
Query 220 PPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEI 279
PPL LR+++ L +R LV + L IGI DE GDF+IR +LG DP G+I I
Sbjct 250 PPLNLLRDLIPSLREKDRELVQNSLFIGIARDEFKMQLRAGDFLIRSVLGVDPRQGAIAI 309
Query 280 DEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRAA--GALLFTCNGRGRRMFGVA 337
+ V+ G +QF +RDA + DL L ++ P + GAL+F+C GRG ++
Sbjct 310 GDRVRPGQRVQFHLRDADTSALDLELLLQAFPQERPNSSEVLGALIFSCLGRGENLYEKP 369
Query 338 DHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHGFTASMALF 381
D D+ + +PLAGFF GEIGP+AGR LHG+T++ ALF
Sbjct 370 DFDSGLFQRYFANVPLAGFFCNGEIGPVAGRTFLHGYTSAFALF 413
>gi|298490695|ref|YP_003720872.1| hypothetical protein Aazo_1561 ['Nostoc azollae' 0708]
gi|298232613|gb|ADI63749.1| domain of unknown function DUF1745 ['Nostoc azollae' 0708]
Length=404
Score = 187 bits (474), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 126/380 (34%), Positives = 197/380 (52%), Gaps = 16/380 (4%)
Query 16 AAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVGCIAQAIVAG 75
A + QA L A L ++ S A T + +L + + + P L+GC A +V
Sbjct 20 AVTDVVQQAVSSLTAPA-DLGLVFISSAFTSEYSRLLPLLTEKLSVPMLIGCSAAGVVGT 78
Query 76 R-----HEIEDEPAVVVWLAS--GLAAETF-----QLDFVRTGSGALITGYRFDRTARDL 123
+ EIE EPA+ + LA G+ F QL + A I ++
Sbjct 79 KSGNKTQEIESEPAISLTLAHLPGVDIRAFHILGDQLPDLDCSPDAWIDLVGVLPSSAPQ 138
Query 124 HLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVLTSGVVGVRL 183
+LL ++ +N L++ + P + +VGG SGG LF + + G VG+ L
Sbjct 139 FILLSSAFSSGTNDLLQGLDFAYPSSVIVGGQASGGFVSDRIALFCNDRLYRQGTVGLAL 198
Query 184 PG-MRGVPVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLSPDERALVSH 242
G + +V+QGCRPIG VT A+ +I EL + PL LR ++ LS +E+ L H
Sbjct 199 SGDIVLETIVAQGCRPIGELLQVTKAERNIILELDEQVPLVVLRNLISSLSEEEKMLTQH 258
Query 243 GLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQVRDAAGADKD 302
L +G+ ++E + QGDF+IR LLG DPS G+I I + V+ G +QF +RDA + +D
Sbjct 259 SLFVGLAMNEFQLSLKQGDFLIRNLLGVDPSAGAIAIGDRVRPGQRLQFHLRDAQASAED 318
Query 303 LRLTVERAAARLPGRAA--GALLFTCNGRGRRMFGVADHDASTIEELLGGIPLAGFFAAG 360
L L ++ + ++ AL+F+C GRG ++G A+ D+ + IP+ G+F AG
Sbjct 319 LELILQEYQEQSTSGSSPLAALMFSCVGRGAGLYGKANFDSELFKRYFHDIPMGGYFCAG 378
Query 361 EIGPIAGRNALHGFTASMAL 380
EIGP++GR LHG+T+ A+
Sbjct 379 EIGPVSGRTFLHGYTSVFAI 398
>gi|166366981|ref|YP_001659254.1| hypothetical protein MAE_42400 [Microcystis aeruginosa NIES-843]
gi|166089354|dbj|BAG04062.1| hypothetical protein MAE_42400 [Microcystis aeruginosa NIES-843]
Length=417
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 133/406 (33%), Positives = 199/406 (50%), Gaps = 34/406 (8%)
Query 7 VCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVG 66
+ T P A E + +D+L G A LA++ S A+ ++ +L + P L+G
Sbjct 11 LSTRPSLEAAVTEVVEKVQDKLVGSA-DLAIIFISSAYASDYPRLVPLILDKLSVPVLIG 69
Query 67 CIAQAIVA-----GRHEIEDEPAVVVWLAS--GLAAETFQLDFVRT-------GSGALIT 112
C IV EIE PA+ + +A + + F ++ S +
Sbjct 70 CGGAGIVGMDDREKAREIEASPALSLTVAHLPNVEVQPFYIEAAEMPDLDSSPSSWTELL 129
Query 113 GYRFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGG--RRRG-----DT 165
G + + +LL DP++ N L+E + P +A +GG+VSGG R G D
Sbjct 130 GVEAAKNPQ--FILLADPFSSRINDLLEGLDFAYPSSAKIGGLVSGGMIERSGGLFYHDQ 187
Query 166 RLFRDHDVLTSGVVGVRLPGMRGVP-VVSQGCRPIGYPYIVTGADGILITELGGR----- 219
+ R+ + G VG+ L G V +V+QGCRPIG Y V+ + +I + G+
Sbjct 188 QKPRNTYLYRQGTVGIALSGNIIVETIVAQGCRPIGPIYQVSEGERNIIISMTGKGADGT 247
Query 220 --PPLQRLREIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSI 277
PPL LR ++ L +R L H L IGI DE GDF+IR +LG DP G+I
Sbjct 248 PQPPLNLLRALIPSLREKDRELAQHSLFIGIARDEFKMQLRAGDFLIRNVLGVDPRQGAI 307
Query 278 EIDEVVQVGATMQFQVRDAAGADKDLRLTVERAAARLPGRA--AGALLFTCNGRGRRMFG 335
I + V+ G +QF +RDA + DL L ++ P + GAL+F+C GRG ++
Sbjct 308 AIGDRVRPGQRVQFHLRDAETSALDLELLLQAFPQEKPASSDILGALIFSCLGRGENLYE 367
Query 336 VADHDASTIEELLGGIPLAGFFAAGEIGPIAGRNALHGFTASMALF 381
D D+ + +PLAGFF GEIGP+ GR LHG+T++ ALF
Sbjct 368 KPDFDSGLFQRYFANVPLAGFFGNGEIGPVGGRTFLHGYTSAFALF 413
>gi|17230343|ref|NP_486891.1| hypothetical protein alr2851 [Nostoc sp. PCC 7120]
gi|17131945|dbj|BAB74550.1| alr2851 [Nostoc sp. PCC 7120]
Length=406
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 125/385 (33%), Positives = 196/385 (51%), Gaps = 16/385 (4%)
Query 7 VCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVG 66
+ T P A + +A L A L ++ S A + VL + + + P ++G
Sbjct 11 LSTRPSLEAAVTDVVQRAVSTLTAPA-DLGLVFISSAFASEYSRVLPLLAEQLSVPVMIG 69
Query 67 C-----IAQAIVAGRHEIEDEPAVVVWLAS--GLAAETF-----QLDFVRTGSGALITGY 114
C I A E+E E A+ + LA G+ + F +L + + I
Sbjct 70 CSGGGVIGTAASGQTQELEAEAALSLTLAHLPGVNLQVFHVLGEELPDLDSPPDTWINLI 129
Query 115 RFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVL 174
+ +LL ++ N L++ + PG+ ++GG S G G LF + +
Sbjct 130 GVPPSPTPHFILLSSAFSSGINDLLQGLDFAYPGSVILGGQASVGGMGGRLALFCNGSLH 189
Query 175 TSGVVGVRLPGMRGV-PVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLS 233
G VG+ L G + P+V+QGCRPIG P VT A+ +I EL + PL LR+++ LS
Sbjct 190 REGTVGLALSGNIVLEPIVAQGCRPIGEPLQVTKAERNIILELDEKAPLVVLRDLIASLS 249
Query 234 PDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQV 293
ERAL H L +G+ +DE + QGDF+IR +LG DPS G+I I ++V+ G +QF +
Sbjct 250 EHERALAQHSLFVGVAMDEFKLSLQQGDFLIRSILGVDPSGGAIAIGDLVRPGQRLQFHL 309
Query 294 RDAAGADKDLRLTVER--AAARLPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGI 351
RD+ + ++L +ER A A GAL+F+C GRG ++G + D+ + + +
Sbjct 310 RDSQASAEELEFLLERYQTKAEFDNAAVGALMFSCVGRGEGLYGKPNFDSELFKRYIQDV 369
Query 352 PLAGFFAAGEIGPIAGRNALHGFTA 376
P+ GFF GEIGP+ GR LHG+T+
Sbjct 370 PVGGFFCGGEIGPVGGRTFLHGYTS 394
>gi|75907272|ref|YP_321568.1| hypothetical protein Ava_1049 [Anabaena variabilis ATCC 29413]
gi|75700997|gb|ABA20673.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length=406
Score = 184 bits (467), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 124/385 (33%), Positives = 196/385 (51%), Gaps = 16/385 (4%)
Query 7 VCTTPDARQAAVEAAGQARDELAGEAPSLAVLLGSRAHTDRAADVLSAVLQMIDPPALVG 66
+ T P A + +A L A L ++ S A + VL + + + P ++G
Sbjct 11 LSTRPSLEAAVTDVVQRAVSTLTAPA-DLGLVFISSAFASEYSRVLPLLAEQLSVPVMIG 69
Query 67 C-----IAQAIVAGRHEIEDEPAVVVWLAS--GLAAETF-----QLDFVRTGSGALITGY 114
C I A E+E E A+ + LA G+ + F +L + + I
Sbjct 70 CSGGGVIGTAASGQTQELEAEAALSLTLAHLPGVNLQVFHVLGEELPDLDSPPDTWINLI 129
Query 115 RFDRTARDLHLLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLFRDHDVL 174
+ +LL ++ N L++ + PG+ ++GG S G G LF + +
Sbjct 130 GVPPSPTPHFILLSSAFSSGINDLLQGLDFAYPGSVILGGQASVGGMGGRLALFCNGSLH 189
Query 175 TSGVVGVRLPGMRGV-PVVSQGCRPIGYPYIVTGADGILITELGGRPPLQRLREIVEGLS 233
G VG+ L G + P+V+QGCRPIG P VT A+ +I EL + PL LR+++ LS
Sbjct 190 REGTVGLALSGNIVLEPIVAQGCRPIGEPLQVTKAERNIILELDEKVPLVVLRDLIASLS 249
Query 234 PDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQVGATMQFQV 293
ERAL H L +G+ +DE + QGDF+IR +LG DPS G+I I ++V+ G +QF +
Sbjct 250 EKERALAQHSLFVGVAMDEFKLSLQQGDFLIRSILGVDPSGGAIAIGDLVRPGQRLQFHL 309
Query 294 RDAAGADKDLRLTVERAAAR--LPGRAAGALLFTCNGRGRRMFGVADHDASTIEELLGGI 351
RD+ + ++L +ER + A GAL+F+C GRG ++G + D+ + + +
Sbjct 310 RDSQASAEELEFLLERYQTKPEFDNSAVGALMFSCVGRGEGLYGKPNFDSELFKRYIQDV 369
Query 352 PLAGFFAAGEIGPIAGRNALHGFTA 376
P+ GFF GEIGP+ GR LHG+T+
Sbjct 370 PVGGFFCGGEIGPVGGRTFLHGYTS 394
>gi|254412137|ref|ZP_05025912.1| conserved domain protein [Microcoleus chthonoplastes PCC 7420]
gi|196181103|gb|EDX76092.1| conserved domain protein [Microcoleus chthonoplastes PCC 7420]
Length=416
Score = 176 bits (446), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 103/273 (38%), Positives = 155/273 (57%), Gaps = 21/273 (7%)
Query 125 LLLPDPYTFPSNLLIEHPNTDLPGTAVVGGVVSGGRRRGDTRLF-RDHDVLT------SG 177
+LL DP++ N L++ + PG+ VGG+ S + LF RD + + G
Sbjct 136 ILLADPFSSKINDLLQGLDFAYPGSVKVGGLASASAMGVQSGLFYRDSERYSGGTLHREG 195
Query 178 VVGVRLPGMRGV-PVVSQGCRPIGYPYIVTG-----------ADGILITELGGRPPLQRL 225
+GV L G + P+VSQGCRPIG PY +T ++G+ +E+ +PPL L
Sbjct 196 TIGVALSGNVVLDPIVSQGCRPIGQPYQITKGERNIVLELADSNGMSFSEVESQPPLAVL 255
Query 226 REIVEGLSPDERALVSHGLQIGIVVDEHLAAPGQGDFVIRGLLGADPSTGSIEIDEVVQV 285
R++++ LS +R L H L IGI DE + GQGDF+IR LLG DP G+I I + V+
Sbjct 256 RDVIQNLSESDRELAQHSLFIGIARDEFKQSLGQGDFLIRNLLGVDPRLGAIAIGDRVRP 315
Query 286 GATMQFQVRDAAGADKDLRLTVERAAARLPG--RAAGALLFTCNGRGRRMFGVADHDAST 343
G +QF +RDA +++DL L ++ ++ AGAL+F+C GRG+ ++G D D+
Sbjct 316 GQRIQFHLRDARTSEEDLELLLQNYQNQVNSTPETAGALMFSCLGRGQGLYGKPDFDSQL 375
Query 344 IEELLGGIPLAGFFAAGEIGPIAGRNALHGFTA 376
+ + I + GFF GEIGP+ G LHG+T+
Sbjct 376 LCRYINNISVGGFFCNGEIGPVGGSTFLHGYTS 408
Lambda K H
0.321 0.140 0.413
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 752761409750
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40