BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3732
Length=352
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610868|ref|NP_218249.1| hypothetical protein Rv3732 [Mycoba... 708 0.0
gi|298527209|ref|ZP_07014618.1| conserved hypothetical protein [... 707 0.0
gi|289572383|ref|ZP_06452610.1| conserved hypothetical protein [... 706 0.0
gi|308374968|ref|ZP_07442246.2| hypothetical protein TMGG_01275 ... 702 0.0
gi|167970890|ref|ZP_02553167.1| hypothetical protein MtubH3_2373... 664 0.0
gi|339300142|gb|AEJ52252.1| hypothetical protein CCDC5180_3415 [... 578 4e-163
gi|240172303|ref|ZP_04750962.1| hypothetical protein MkanA1_2351... 456 2e-126
gi|183985237|ref|YP_001853528.1| hypothetical protein MMAR_5269 ... 441 7e-122
gi|118619485|ref|YP_907817.1| hypothetical protein MUL_4343 [Myc... 436 3e-120
gi|8515855|gb|AAF76209.1|AF272032_1 hypothetical protein [Mycoba... 351 1e-94
gi|54022529|ref|YP_116771.1| hypothetical protein nfa5620 [Nocar... 264 2e-68
gi|120401831|ref|YP_951660.1| hypothetical protein Mvan_0816 [My... 251 2e-64
gi|290960889|ref|YP_003492071.1| hypothetical protein SCAB_65291... 177 3e-42
gi|344999420|ref|YP_004802274.1| hypothetical protein SACTE_1826... 174 1e-41
gi|229822485|ref|YP_002884011.1| hypothetical protein Bcav_4008 ... 174 2e-41
gi|317508507|ref|ZP_07966174.1| hypothetical protein HMPREF9336_... 152 6e-35
gi|29832381|ref|NP_827015.1| hypothetical protein SAV_5838 [Stre... 152 8e-35
gi|297194633|ref|ZP_06912031.1| conserved hypothetical protein [... 152 1e-34
gi|256390782|ref|YP_003112346.1| hypothetical protein Caci_1584 ... 142 8e-32
gi|328882096|emb|CCA55335.1| hypothetical protein SVEN_2048 [Str... 138 2e-30
gi|297157354|gb|ADI07066.1| hypothetical protein SBI_03945 [Stre... 122 8e-26
gi|309791058|ref|ZP_07685594.1| hypothetical protein OSCT_1545 [... 102 1e-19
gi|254386705|ref|ZP_05001999.1| conserved hypothetical protein [... 96.7 5e-18
gi|159897385|ref|YP_001543632.1| hypothetical protein Haur_0856 ... 93.6 5e-17
gi|149922676|ref|ZP_01911103.1| hypothetical protein PPSIR1_1990... 75.9 9e-12
gi|119483235|ref|ZP_01618649.1| hypothetical protein L8106_04261... 74.3 3e-11
gi|149917671|ref|ZP_01906167.1| hypothetical protein PPSIR1_2812... 72.8 8e-11
gi|282896858|ref|ZP_06304864.1| conserved hypothetical protein [... 71.2 2e-10
gi|158335199|ref|YP_001516371.1| hypothetical protein AM1_2042 [... 70.9 3e-10
gi|218246787|ref|YP_002372158.1| hypothetical protein PCC8801_19... 70.1 5e-10
gi|172036939|ref|YP_001803440.1| hypothetical protein cce_2024 [... 69.7 7e-10
gi|149924456|ref|ZP_01912818.1| hypothetical protein PPSIR1_4108... 69.3 8e-10
gi|170078040|ref|YP_001734678.1| hypothetical protein SYNPCC7002... 68.6 1e-09
gi|300866679|ref|ZP_07111363.1| conserved exported hypothetical ... 68.6 2e-09
gi|67922512|ref|ZP_00516021.1| similar to Uncharacterized protei... 68.2 2e-09
gi|71909668|ref|YP_287255.1| hypothetical protein Daro_4059 [Dec... 67.4 3e-09
gi|126658382|ref|ZP_01729531.1| hypothetical protein CY0110_2752... 67.0 4e-09
gi|17229490|ref|NP_486038.1| hypothetical protein alr1998 [Nosto... 67.0 4e-09
gi|149918942|ref|ZP_01907428.1| hypothetical protein PPSIR1_1676... 67.0 5e-09
gi|148259008|ref|YP_001243593.1| hypothetical protein BBta_7862 ... 66.6 6e-09
gi|257059829|ref|YP_003137717.1| hypothetical protein Cyan8802_1... 66.2 7e-09
gi|149918055|ref|ZP_01906548.1| hypothetical protein PPSIR1_4168... 65.9 1e-08
gi|254415125|ref|ZP_05028887.1| hypothetical protein MC7420_2551... 65.1 2e-08
gi|220907398|ref|YP_002482709.1| hypothetical protein Cyan7425_1... 64.3 3e-08
gi|149918943|ref|ZP_01907429.1| hypothetical protein PPSIR1_1677... 63.5 5e-08
gi|302036533|ref|YP_003796855.1| hypothetical protein NIDE1172 [... 63.2 6e-08
gi|75911164|ref|YP_325460.1| hypothetical protein Ava_4968 [Anab... 63.2 7e-08
gi|223937185|ref|ZP_03629092.1| conserved hypothetical protein [... 62.8 7e-08
gi|1230542|gb|AAA93039.1| ORFI [Synechocystis sp.] 62.0 2e-07
gi|149919171|ref|ZP_01907655.1| hypothetical protein PPSIR1_3538... 61.2 2e-07
>gi|15610868|ref|NP_218249.1| hypothetical protein Rv3732 [Mycobacterium tuberculosis H37Rv]
gi|15843353|ref|NP_338390.1| hypothetical protein MT3837 [Mycobacterium tuberculosis CDC1551]
gi|31794904|ref|NP_857397.1| hypothetical protein Mb3759 [Mycobacterium bovis AF2122/97]
63 more sequence titles
Length=352
Score = 708 bits (1827), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/352 (100%), Positives = 352/352 (100%), Gaps = 0/352 (0%)
Query 1 MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGT 60
MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGT
Sbjct 1 MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGT 60
Query 61 TETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGV 120
TETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGV
Sbjct 61 TETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGV 120
Query 121 GASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYV 180
GASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYV
Sbjct 121 GASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYV 180
Query 181 RDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQ 240
RDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQ
Sbjct 181 RDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQ 240
Query 241 RTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGN 300
RTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGN
Sbjct 241 RTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGN 300
Query 301 APNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG 352
APNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG
Sbjct 301 APNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG 352
>gi|298527209|ref|ZP_07014618.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298497003|gb|EFI32297.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=378
Score = 707 bits (1824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/352 (100%), Positives = 352/352 (100%), Gaps = 0/352 (0%)
Query 1 MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGT 60
MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGT
Sbjct 27 MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGT 86
Query 61 TETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGV 120
TETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGV
Sbjct 87 TETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGV 146
Query 121 GASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYV 180
GASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYV
Sbjct 147 GASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYV 206
Query 181 RDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQ 240
RDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQ
Sbjct 207 RDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQ 266
Query 241 RTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGN 300
RTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGN
Sbjct 267 RTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGN 326
Query 301 APNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG 352
APNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG
Sbjct 327 APNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG 378
>gi|289572383|ref|ZP_06452610.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339633723|ref|YP_004725365.1| hypothetical protein MAF_37410 [Mycobacterium africanum GM041182]
gi|289536814|gb|EFD41392.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339333079|emb|CCC28810.1| conserved hypothetical protein [Mycobacterium africanum GM041182]
Length=352
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/352 (99%), Positives = 351/352 (99%), Gaps = 0/352 (0%)
Query 1 MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGT 60
MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGT
Sbjct 1 MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGT 60
Query 61 TETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGV 120
TETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRR V
Sbjct 61 TETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRSV 120
Query 121 GASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYV 180
GASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYV
Sbjct 121 GASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYV 180
Query 181 RDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQ 240
RDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQ
Sbjct 181 RDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQ 240
Query 241 RTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGN 300
RTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGN
Sbjct 241 RTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGN 300
Query 301 APNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG 352
APNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG
Sbjct 301 APNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG 352
>gi|308374968|ref|ZP_07442246.2| hypothetical protein TMGG_01275 [Mycobacterium tuberculosis SUMu007]
gi|308347876|gb|EFP36727.1| hypothetical protein TMGG_01275 [Mycobacterium tuberculosis SUMu007]
Length=349
Score = 702 bits (1812), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/349 (99%), Positives = 349/349 (100%), Gaps = 0/349 (0%)
Query 4 LPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTET 63
+PACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTET
Sbjct 1 MPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTET 60
Query 64 IVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGAS 123
IVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGAS
Sbjct 61 IVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGAS 120
Query 124 GPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDG 183
GPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDG
Sbjct 121 GPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDG 180
Query 184 WAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTD 243
WAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTD
Sbjct 181 WAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTD 240
Query 244 ADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPN 303
ADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPN
Sbjct 241 ADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPN 300
Query 304 DDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG 352
DDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG
Sbjct 301 DDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG 349
>gi|167970890|ref|ZP_02553167.1| hypothetical protein MtubH3_23735 [Mycobacterium tuberculosis
H37Ra]
gi|254552846|ref|ZP_05143293.1| hypothetical protein Mtube_20776 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|294995355|ref|ZP_06801046.1| hypothetical protein Mtub2_12801 [Mycobacterium tuberculosis
210]
gi|297636413|ref|ZP_06954193.1| hypothetical protein MtubK4_19905 [Mycobacterium tuberculosis
KZN 4207]
gi|297733407|ref|ZP_06962525.1| hypothetical protein MtubKR_20045 [Mycobacterium tuberculosis
KZN R506]
gi|313660738|ref|ZP_07817618.1| hypothetical protein MtubKV_20040 [Mycobacterium tuberculosis
KZN V2475]
Length=329
Score = 664 bits (1714), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/329 (100%), Positives = 329/329 (100%), Gaps = 0/329 (0%)
Query 24 MVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPT 83
MVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPT
Sbjct 1 MVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPT 60
Query 84 PTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGPQEAAARAPHVLNQVRLGP 143
PTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGPQEAAARAPHVLNQVRLGP
Sbjct 61 PTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGPQEAAARAPHVLNQVRLGP 120
Query 144 LEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTDLIVGGLDP 203
LEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTDLIVGGLDP
Sbjct 121 LEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTDLIVGGLDP 180
Query 204 VRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVRFAGDMSTAV 263
VRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVRFAGDMSTAV
Sbjct 181 VRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVRFAGDMSTAV 240
Query 264 RDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQVVTVYDDVALPPLL 323
RDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQVVTVYDDVALPPLL
Sbjct 241 RDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQVVTVYDDVALPPLL 300
Query 324 LVVVSAIAVGAAGGAVVVVLRRRRRAHTG 352
LVVVSAIAVGAAGGAVVVVLRRRRRAHTG
Sbjct 301 LVVVSAIAVGAAGGAVVVVLRRRRRAHTG 329
>gi|339300142|gb|AEJ52252.1| hypothetical protein CCDC5180_3415 [Mycobacterium tuberculosis
CCDC5180]
Length=287
Score = 578 bits (1491), Expect = 4e-163, Method: Compositional matrix adjust.
Identities = 287/287 (100%), Positives = 287/287 (100%), Gaps = 0/287 (0%)
Query 66 MQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGP 125
MQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGP
Sbjct 1 MQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGP 60
Query 126 QEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWA 185
QEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWA
Sbjct 61 QEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWA 120
Query 186 FVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDAD 245
FVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDAD
Sbjct 121 FVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDAD 180
Query 246 AATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDD 305
AATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDD
Sbjct 181 AATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDD 240
Query 306 PYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG 352
PYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG
Sbjct 241 PYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAHTG 287
>gi|240172303|ref|ZP_04750962.1| hypothetical protein MkanA1_23513 [Mycobacterium kansasii ATCC
12478]
Length=372
Score = 456 bits (1174), Expect = 2e-126, Method: Compositional matrix adjust.
Identities = 226/329 (69%), Positives = 268/329 (82%), Gaps = 2/329 (0%)
Query 1 MAVLPACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGT 60
MAV CRL V+ + +V AT VLA P ACACGAA+ G++AT+NHEVAL+HWDG
Sbjct 13 MAVPRFCRLVFVIGLLLSVTMATTVLAAPGRACACGAAIAPGGARATMNHEVALVHWDGA 72
Query 61 TETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGV 120
TETIVMQLAM+A TDNVALVVPTP PA V AD++TF ELD L+AP ++H+R W L G+
Sbjct 73 TETIVMQLAMDATTDNVALVVPTPAPASVAAADKATFVELDALTAPQVQHKRRWILGIGM 132
Query 121 GASGPQEAAA--RAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDP 178
S P+E AA AP V++QVRLGPLEATTL GGDL+GLQ WL+ NGYAIRPAV+AALDP
Sbjct 133 VGSAPREGAATAHAPDVVSQVRLGPLEATTLAGGDLAGLQNWLAGNGYAIRPAVAAALDP 192
Query 179 YVRDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHR 238
YVRDGWAFVAIRLTST IVGGLDPVRMTF + +LVYPMRLSVAA +PQHV ++TLS+HR
Sbjct 193 YVRDGWAFVAIRLTSTAPIVGGLDPVRMTFPAPQLVYPMRLSVAALDPQHVVVYTLSEHR 252
Query 239 QQRTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTF 298
QQRTDAD + Q T V+FAG ++ VRDP+LREL GNHGSYLTK +VD+YQTS+ISSDFTF
Sbjct 253 QQRTDADRSRQFTQVQFAGTVAGQVRDPVLRELAGNHGSYLTKTQVDVYQTSQISSDFTF 312
Query 299 GNAPNDDPYRQVVTVYDDVALPPLLLVVV 327
GNA NDD YRQVV VYD+VA+P ++++ V
Sbjct 313 GNAANDDAYRQVVVVYDNVAIPIVVILFV 341
>gi|183985237|ref|YP_001853528.1| hypothetical protein MMAR_5269 [Mycobacterium marinum M]
gi|183178563|gb|ACC43673.1| conserved hypothetical transmembrane protein [Mycobacterium marinum
M]
Length=364
Score = 441 bits (1135), Expect = 7e-122, Method: Compositional matrix adjust.
Identities = 226/335 (68%), Positives = 265/335 (80%), Gaps = 8/335 (2%)
Query 1 MAVLPACRLGLV----VCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLH 56
M VL CR GLV VCV AV A VLA P YACACGAA+ +QAT+NHEVAL+H
Sbjct 1 MTVLRVCRSGLVAIFLVCVTAAVALAATVLAAPGYACACGAAIAPGDAQATMNHEVALVH 60
Query 57 WDGTTETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSL 116
WDGTTETIV+QLA++A TDN+ALVVPTP PA V D++ F ELD L+ P I HQR W+L
Sbjct 61 WDGTTETIVVQLAVDATTDNLALVVPTPMPATVAPGDKAAFLELDALTTPEIRHQRRWNL 120
Query 117 RRGVGASGPQEA--AARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSA 174
G A GP E AAR P V+NQV LGPLEATTL GGDL GLQ WL+ NGYA+RPAV+A
Sbjct 121 DLGFRAGGPDEGVRAARPPEVINQVHLGPLEATTLAGGDLPGLQAWLASNGYALRPAVAA 180
Query 175 ALDPYVRDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTL 234
ALDPYVR+GWAFVA+RLTST IVGGL+PVRMTFRSS+LVYPMR+S AA +PQHV +FTL
Sbjct 181 ALDPYVREGWAFVAMRLTSTVPIVGGLNPVRMTFRSSQLVYPMRMSAAALDPQHVVVFTL 240
Query 235 SDHRQQRTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISS 294
+DHRQ RTDAD A Q T V+FAG+++ V DPLLREL+GNHGSYLTK +VD+Y+TSRISS
Sbjct 241 TDHRQVRTDADTAIQATEVQFAGNIANHVHDPLLRELVGNHGSYLTKTQVDVYETSRISS 300
Query 295 DFTFGNAPNDDPYRQVVTVYDDVALPPLLLVVVSA 329
DFTF +APNDD YR V+ VYD+VA+P L+V++ A
Sbjct 301 DFTFADAPNDDAYRPVIVVYDNVAIP--LVVILFA 333
>gi|118619485|ref|YP_907817.1| hypothetical protein MUL_4343 [Mycobacterium ulcerans Agy99]
gi|118571595|gb|ABL06346.1| conserved hypothetical transmembrane protein [Mycobacterium ulcerans
Agy99]
Length=364
Score = 436 bits (1121), Expect = 3e-120, Method: Compositional matrix adjust.
Identities = 221/331 (67%), Positives = 262/331 (80%), Gaps = 6/331 (1%)
Query 1 MAVLPACRLGLV----VCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLH 56
M VL CR GLV VCV AV A VLA P YACACG A+ +QAT+NHEVAL+H
Sbjct 1 MTVLRVCRSGLVAVFLVCVTAAVALAATVLAAPGYACACGTAIAPGDAQATMNHEVALVH 60
Query 57 WDGTTETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSL 116
WDGTTETIV+QLA++A TDN+ALVVPTP A V D++ F ELD L+ P I HQR W+L
Sbjct 61 WDGTTETIVIQLAVDATTDNLALVVPTPMAATVAPGDKAAFLELDALTTPEIRHQRRWNL 120
Query 117 RRGVGASGPQEA--AARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSA 174
G A GP E AA +P V+NQV LGPLEATTL GGDL GLQ WL+DNGYA+RPAV+A
Sbjct 121 DLGFRAGGPDERVRAAHSPEVINQVHLGPLEATTLAGGDLPGLQAWLADNGYALRPAVAA 180
Query 175 ALDPYVRDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTL 234
ALDPYVR+GWAFVA+RLTST IVGGLDPVRMTFRSS+LVYPM+LS AA +PQHV +FTL
Sbjct 181 ALDPYVREGWAFVAMRLTSTVPIVGGLDPVRMTFRSSQLVYPMQLSAAALDPQHVVVFTL 240
Query 235 SDHRQQRTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISS 294
+DHRQ RTDAD A Q T V+F+G+++ V DPLLREL+GNHGSYLTK +VD+Y+TSRISS
Sbjct 241 TDHRQVRTDADTAIQATEVQFSGNIANHVHDPLLRELVGNHGSYLTKTQVDVYETSRISS 300
Query 295 DFTFGNAPNDDPYRQVVTVYDDVALPPLLLV 325
DFTF +APNDD YR V+ VYD+VA+P ++++
Sbjct 301 DFTFADAPNDDAYRPVIVVYDNVAIPLVVIL 331
>gi|8515855|gb|AAF76209.1|AF272032_1 hypothetical protein [Mycobacterium smegmatis]
Length=511
Score = 351 bits (900), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 174/177 (99%), Positives = 174/177 (99%), Gaps = 0/177 (0%)
Query 174 AALDPYVRDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFT 233
A DPYVRDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFT
Sbjct 71 GARDPYVRDGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFT 130
Query 234 LSDHRQQRTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRIS 293
LSDHRQQRTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRIS
Sbjct 131 LSDHRQQRTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRIS 190
Query 294 SDFTFGNAPNDDPYRQVVTVYDDVALPPLLLVVVSAIAVGAAGGAVVVVLRRRRRAH 350
SDFTFGNAPNDDPYRQVVTVYDDVALPP LLVVVSAIAVGAAGGAVVVVLRRRRRAH
Sbjct 191 SDFTFGNAPNDDPYRQVVTVYDDVALPPALLVVVSAIAVGAAGGAVVVVLRRRRRAH 247
>gi|54022529|ref|YP_116771.1| hypothetical protein nfa5620 [Nocardia farcinica IFM 10152]
gi|54014037|dbj|BAD55407.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=348
Score = 264 bits (674), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 149/292 (52%), Positives = 191/292 (66%), Gaps = 5/292 (1%)
Query 29 PSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPTPAI 88
P+ ACACG V+ G A ++ E A+L WDG ETI+M+LA+ A++ + AL+VPTP PA
Sbjct 27 PASACACGGVVSP-GDTARVDQETAVLAWDGRRETILMRLALTAESAHAALIVPTPRPAT 85
Query 89 VTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGPQEAAARAPHVLNQVRLGPLEATT 148
VT TF EL L+AP + W G+ P AA AP VL+QVRLGPLEATT
Sbjct 86 VTAGSPDTFAELSRLTAPEFVVETEWFADAADGSGAP---AAVAPTVLDQVRLGPLEATT 142
Query 149 LTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTDLIVGGLDPVRMTF 208
L+GGDL+GL+TWL NGYA+RP V+A L PYVR+GW+FVA+RLT + G LDPVR++F
Sbjct 143 LSGGDLTGLRTWLGANGYALRPEVTATLAPYVREGWSFVAMRLTGAQPLDGALDPVRLSF 202
Query 209 RSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVRFAGDMSTA-VRDPL 267
S RLVYPMR+S AA+ PQ V ++ L HR R D DAA + V FAG + A V DPL
Sbjct 203 DSDRLVYPMRMSAAARTPQSVHLYVLDRHRVARADDDAAHHYSSVEFAGRVDPADVADPL 262
Query 268 LRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQVVTVYDDVAL 319
LREL YLT+++V I + +++DFTF AP D YR+ D+V L
Sbjct 263 LRELTAAGQDYLTEMQVHIADPTTVTTDFTFTAAPEDADYRRRFVQTDEVML 314
>gi|120401831|ref|YP_951660.1| hypothetical protein Mvan_0816 [Mycobacterium vanbaalenii PYR-1]
gi|119954649|gb|ABM11654.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=354
Score = 251 bits (640), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 153/327 (47%), Positives = 198/327 (61%), Gaps = 15/327 (4%)
Query 32 ACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPTPAIVTT 91
ACACG ++ S + E+ALL DG TETIVM+L ++ DN ALV+PTP PA V+
Sbjct 28 ACACGGLLSVDPSM-RIADELALLTADGDTETIVMRLNLSTSADNAALVMPTPAPATVSA 86
Query 92 ADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGPQEAAARAPH-------VLNQVRLGPL 144
A F +L LSAP IE R W++ GA + A ARAP VL QV+LGPL
Sbjct 87 APADLFDDLAELSAPRIETVRRWTIGWD-GAMASEGATARAPGAGPGDPTVLQQVQLGPL 145
Query 145 EATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTDL--IVGGLD 202
EATTL+GGDL G++ WL DNGY +R +SA LDPY+R+GW+ VA+RLT TD + G L
Sbjct 146 EATTLSGGDLDGIRKWLDDNGYQLRDEISAGLDPYLREGWSVVAMRLT-TDAASLAGPLA 204
Query 203 PVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVRFAGDMSTA 262
PV + F S LVYPMR+S A Q VTI+TL +HR +R DADA+ +AG ++
Sbjct 205 PVMLRFASEELVYPMRMSAQAATGQTVTIYTLGEHRMRRDDADASVHIVRQDYAGSIAGR 264
Query 263 VRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQVVTVYD--DVALP 320
+ L L G GSYLTKV I + + I+SDF F AP+DDPY+ VV Y+ D+ +P
Sbjct 265 TDNAALTGLAG-AGSYLTKVTTTIVEPASITSDFEFVEAPDDDPYQAVVYRYERIDLTIP 323
Query 321 PLLLVVVSAIAVGAAGGAVVVVLRRRR 347
L+ V + V GA + RRR
Sbjct 324 VLIGAAVLVLTVAVLLGARLTRGMRRR 350
>gi|290960889|ref|YP_003492071.1| hypothetical protein SCAB_65291 [Streptomyces scabiei 87.22]
gi|260650415|emb|CBG73531.1| putative membrane protein [Streptomyces scabiei 87.22]
Length=411
Score = 177 bits (448), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 121/342 (36%), Positives = 169/342 (50%), Gaps = 27/342 (7%)
Query 26 LATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPT 85
L P+YAC CGA V +N E +++ WDG E IVM L ++ D A ++P P
Sbjct 13 LVAPAYACGCGAMVPDGRRNVYVNRETSVVRWDGREEQIVMSLTVSGDARTAAWIMPVPH 72
Query 86 PAIVTTADQSTFGELDTLSAPLIEHQ-------RHW--SLRRGVGASGPQEAAARAP--- 133
A V D + F L +AP+ E + R W L GA+G +AA R+P
Sbjct 73 RATVRLGDAAVFERLAAETAPVYERREYFWPRSRDWPFDLFESDGAAG--DAAPRSPGAP 130
Query 134 -HVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLT 192
V+ + RLGP + LT D L WL +NG+++ + AL+PYVR W +VA+RL
Sbjct 131 VEVVGRERLGPFDVARLTATDSGALGDWLDENGFSLPDRLDTALEPYVRQEWEYVAVRLA 190
Query 193 STDL-----IVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAA 247
D + G LDP+ +TF + VYPMRLS A PQ + ++ L+ HR + +
Sbjct 191 PEDTAAGRPLTGTLDPLHLTFAADAPVYPMRLSRLATTPQALDLYVLAGHRME-PGSSIG 249
Query 248 TQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPY 307
V FAG ++ R L L G +LT VE D + IS D T A D+PY
Sbjct 250 GDAPQVTFAGRLTG--RSGPLAGLTGGGTDFLTAVEQDFPRPELISGDHTLRRAATDEPY 307
Query 308 RQVVTV---YDDVALPPLLLVVVSAIAVGAAGG-AVVVVLRR 345
R+V+ V + +P L+ AV A G AV +VLRR
Sbjct 308 RKVIHVDEMWTVWGIPGWLVTFGIGFAVLAIKGVAVAIVLRR 349
>gi|344999420|ref|YP_004802274.1| hypothetical protein SACTE_1826 [Streptomyces sp. SirexAA-E]
gi|344315046|gb|AEN09734.1| Protein of unknown function DUF2330 [Streptomyces sp. SirexAA-E]
Length=365
Score = 174 bits (442), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 105/295 (36%), Positives = 156/295 (53%), Gaps = 11/295 (3%)
Query 26 LATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPT 85
L P+YAC CGA V + S+ + E +++HWDG TE IVM+L + D A ++P P
Sbjct 29 LVAPAYACGCGAMVPSERSRIAVGQETSVVHWDGRTEQIVMRLTVRGDAREAAWIMPVPH 88
Query 86 PAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGPQE-------AAARAPHVLNQ 138
A V D + F EL ++AP+ E + H+ R G ++ AA V+++
Sbjct 89 RASVELGDAALFDELAEITAPVRETRHHFWPRDGDWPFAERDGAGAPAPGAAAPVGVVDR 148
Query 139 VRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTD--- 195
RLGP + LT D L+TWL NG+A+ + AAL YV GW +VA+RL +
Sbjct 149 RRLGPFDVARLTATDPEALRTWLEGNGFALPGPLEAALRTYVDQGWEYVAVRLAPEEAGA 208
Query 196 LIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVRF 255
++ G LDP+R+ F S R VYPMRLS A+ Q + ++ +++HR + + A + V F
Sbjct 209 VLTGALDPLRLRFASDRPVYPMRLSRLARTAQSLGLYVIAEHRMEPSGAIGGRE-PEVTF 267
Query 256 AGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQV 310
AG + V D + L+G+ ++LT + SRI D A D PYR V
Sbjct 268 AGRLERTVPDGAVAGLVGDGPAFLTAFDQYFPDPSRIDGDHELRAAAADTPYRTV 322
>gi|229822485|ref|YP_002884011.1| hypothetical protein Bcav_4008 [Beutenbergia cavernae DSM 12333]
gi|229568398|gb|ACQ82249.1| conserved hypothetical protein [Beutenbergia cavernae DSM 12333]
Length=375
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 118/320 (37%), Positives = 161/320 (51%), Gaps = 12/320 (3%)
Query 6 ACRLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIV 65
A R LV A A +V+ P+ AC CG VT+ + HE LL WDG+TE ++
Sbjct 17 ARRRALVAVGALVGAGALVVVPGPAQACGCGGVVTSEAYDVAITHERVLLQWDGSTERLL 76
Query 66 MQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHW---SLRRGVGA 122
M+L +D AL++PTP PA V D + LD S P + W G+
Sbjct 77 MELDAISDAPEAALLLPTPEPAEVELGDPAVLDALDEASRPEVVRVSDWWPEVGGFGLDG 136
Query 123 SGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRD 182
VL+QV LGP+EATTL D L WL +NGY + + +AL PYV +
Sbjct 137 GAGGAPGDPGVDVLDQVDLGPVEATTLAARDAGALTDWLDENGYVLSAGLESALVPYVAE 196
Query 183 GWAFVAIRLT--STDLIV--GGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHR 238
GW++VA+RLT +D + G L P+++TF S VYPMRLS AA++ Q V + L+ R
Sbjct 197 GWSYVAVRLTPEGSDAVALTGELQPLQVTFGSDTFVYPMRLSSAAEDTQRVRTYVLAPQR 256
Query 239 QQRTDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGS-YLTKVEVDIYQ-TSRISSDF 296
R D A VRFAG++S L L N GS YLT + ++I +DF
Sbjct 257 TDRVDDQAG--DAEVRFAGEVSAQDWPALADVLAENDGSAYLTTTDQTFTDPPTQIYADF 314
Query 297 TFGNAPNDDPYRQVVTVYDD 316
F + D R+V T+ D
Sbjct 315 VFAPSSGGD-VREVETIVVD 333
>gi|317508507|ref|ZP_07966174.1| hypothetical protein HMPREF9336_02546 [Segniliparus rugosus ATCC
BAA-974]
gi|316253198|gb|EFV12601.1| hypothetical protein HMPREF9336_02546 [Segniliparus rugosus ATCC
BAA-974]
Length=355
Score = 152 bits (385), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 101/319 (32%), Positives = 165/319 (52%), Gaps = 14/319 (4%)
Query 12 VVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDG-----TTETIVM 66
++ +AT ++ V + ACACGA V A S+ T E AL+ D T++T+V+
Sbjct 7 ILEIATLLLATGFVAPGQAEACACGAFVAAD-SRLTAVEETALIEVDSLAPGRTSQTVVL 65
Query 67 QLAMNADTD--NVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEH--QRHWSLRRGVGA 122
L + ++ + A V+P P PA + A + F LD +S P +++ + H++L
Sbjct 66 NLGLRSEASVTDAAFVMPVPGPAQFSLAGPTLFTALDEMSKPKVQYDVEHHFALTLPFLM 125
Query 123 SG-PQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVR 181
G P+ +A V N+V LGP + LTG + S ++ WL +G+ + + L Y+
Sbjct 126 GGVPRAGSAPGVVVENRVTLGPYDVVALTGSEASAVRDWLHVHGFELSAELGEGLTEYLA 185
Query 182 DGWAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQR 241
GW VA++LTS D + G L P+R+T+ S +VYPMRLS A+ Q + ++ L+DHR
Sbjct 186 KGWQIVAVKLTSADGLQGVLPPMRITYESDGVVYPMRLSAHAKSQQQLRVYVLADHRASI 245
Query 242 TDADAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNA 301
T+ + FAG + + PL E G +LT+ + + + + ++D A
Sbjct 246 TNPTPDSAAPEETFAGWVRSEDVPPLQTEFSGQR--FLTRYDQEFHPAAN-AADIRVAAA 302
Query 302 PNDDPYRQVVTVYDDVALP 320
P+D P+R VV V D P
Sbjct 303 PDDAPFRAVVHVVDREPWP 321
>gi|29832381|ref|NP_827015.1| hypothetical protein SAV_5838 [Streptomyces avermitilis MA-4680]
gi|29609500|dbj|BAC73550.1| hypothetical protein [Streptomyces avermitilis MA-4680]
Length=382
Score = 152 bits (384), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 107/298 (36%), Positives = 150/298 (51%), Gaps = 15/298 (5%)
Query 26 LATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPT 85
L P+YAC CGA V T+N EV+ + WD E I M L ++ D A ++P P
Sbjct 28 LLAPAYACGCGAMVPDGRQYVTVNREVSAVRWDDGREQIAMSLTVSGDARRAAWIMPVPH 87
Query 86 PAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGPQE---------AAARAPHVL 136
A V D++ F +L +AP + ++ R G + AAA V+
Sbjct 88 RATVRLGDRALFDQLAEATAPEYRTRHYFWPRDGDWPFDKSDNAAAAPGAGAAAPPVGVV 147
Query 137 NQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRL--TST 194
++ RLGP + LT D L TWL+DNG+ + + AL PYV GW +VA+RL S
Sbjct 148 DRERLGPFDVARLTATDPGALGTWLTDNGFHLPDRLDRALRPYVDQGWEYVAVRLAPKSA 207
Query 195 DLIVGG-LDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHV 253
D +GG LDP+R+TF S R VYPMRLS A+ PQ + ++ L+ HR + A + V
Sbjct 208 DAALGGTLDPLRLTFASDRPVYPMRLSRLARTPQSLRLYILAAHRTEPRSAIGGDR-PRV 266
Query 254 RFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQVV 311
FAG + TA PL L +LT V+ + + SRIS D D RQ++
Sbjct 267 WFAGRV-TAASGPLA-GLTEGGTDFLTTVDQEFPRPSRISGDHELRRTAADTTRRQII 322
>gi|297194633|ref|ZP_06912031.1| conserved hypothetical protein [Streptomyces pristinaespiralis
ATCC 25486]
gi|297152370|gb|EDY64208.2| conserved hypothetical protein [Streptomyces pristinaespiralis
ATCC 25486]
Length=354
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 100/304 (33%), Positives = 158/304 (52%), Gaps = 21/304 (6%)
Query 26 LATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPT 85
L +P+YAC CGA V ++ ++ E +++ WDG TE IVM+ ++++ A ++P P
Sbjct 13 LISPAYACGCGAMVVDRNAEISVARESSVIDWDGRTEQIVMRFTVDSNAPEAAWIMPVPN 72
Query 86 PAIVTTADQSTFGELDTLSAPLIEHQ-RHWSLRRG---------VGASGPQEAAARAPHV 135
A V AD + F EL ++AP +H+ RH+ RG + AAA V
Sbjct 73 RATVELADGALFDELVRIAAP--QHRTRHYFWPRGGDWPFDDTDGAGAPAPGAAAPGVGV 130
Query 136 LNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTD 195
+ + RLG + LT D L WL++NG+ + + L+PYV GW +VA++L +
Sbjct 131 VGRERLGDFDVARLTATDPDALGDWLAENGFELPEGLGQDLEPYVDAGWEYVAVKLAPSS 190
Query 196 ---LIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTH 252
+ G LDP+R++F S +LVYPMRLS A PQ + +F L+ HR + D ++
Sbjct 191 EGTTLDGTLDPLRLSFASEKLVYPMRLSRRATTPQSLGLFVLAGHRMEPRDNIGGSE-PE 249
Query 253 VRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQVVT 312
V +AG + R + R L G +LT ++ + RI D D P+R+V
Sbjct 250 VTYAGKVEP--RGAVGR-LTGGEERFLTALDQHFPEPGRIDGDHELVATAQDTPFRRV-- 304
Query 313 VYDD 316
++DD
Sbjct 305 IWDD 308
>gi|256390782|ref|YP_003112346.1| hypothetical protein Caci_1584 [Catenulispora acidiphila DSM
44928]
gi|256357008|gb|ACU70505.1| conserved hypothetical protein [Catenulispora acidiphila DSM
44928]
Length=363
Score = 142 bits (358), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 103/335 (31%), Positives = 159/335 (48%), Gaps = 23/335 (6%)
Query 26 LATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPT 85
+A P++AC CGA + G ++ E A++ +DGTTE +VM+ ++ + A V+P P
Sbjct 23 VADPAWACGCGAMIPGSGGTMSVAREQAVVRFDGTTENVVMRFFTQSNVTDAAWVMPVPA 82
Query 86 PAIVTTADQSTFGELDTLSAPLIE-HQRHW------SLRRGVGASGPQEAAARAPHVLNQ 138
A DQ+ F +L P+ H W S RGV P A A VL+
Sbjct 83 QATAKLGDQALFSDLTDAEEPVAAVHHYFWPHIGGSSGNRGVYEGAPAAAPPSAVQVLSD 142
Query 139 VRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTDL-- 196
R+G E L D L WL+ + + ++ + ++ L Y GW FVA+RL S
Sbjct 143 QRIGEFEVANLASSDPKALGDWLNQHSFTLKDSTASRLAAYTSQGWKFVAVRLASGSAAD 202
Query 197 IVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVRFA 256
+ G LDP+ ++F + VYPMRLS A PQ+V + L+ HR + A ++T F
Sbjct 203 LNGVLDPISLSFPAKSAVYPMRLSAGATTPQNVQVSVLAPHRMDAASSPIAAESTPSAFG 262
Query 257 GDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQVVTVYDD 316
+ + P L L +LT + + S I+ D+ F A +D + Q T YD
Sbjct 263 DWIDPSKVGPALASLAAGR-MFLTVYDGFFSEPSLITQDYAFAPASSD--WVQHST-YDK 318
Query 317 VAL-----PPLLLVVVSAIAVGAAGGAVVVVLRRR 346
L P+ L+V+ +A GA V+++R R
Sbjct 319 EELLTVLGIPVYLIVLLVLAAGA-----VLLVRWR 348
>gi|328882096|emb|CCA55335.1| hypothetical protein SVEN_2048 [Streptomyces venezuelae ATCC
10712]
Length=365
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 95/296 (33%), Positives = 138/296 (47%), Gaps = 15/296 (5%)
Query 26 LATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPT 85
L P+YAC CGA + + ++ E + + WDG TET+VM+ ++ + + A ++P P
Sbjct 22 LVAPAYACGCGAMIPTKEQRIGVDREESAVRWDGRTETVVMRFNVHGNARHAAWIMPVPH 81
Query 86 PAIVTTADQSTFGELDTLSAPLIEHQRH-------WSLRRGVGASGPQEAAARAP-HVLN 137
A V+ D F E+ L+ P + H W G A V+
Sbjct 82 RADVSLGDPELFDEIGRLTEPEQRDRFHFWPRADDWPFDTDYGDGAGAPAPGAGTVGVVG 141
Query 138 QVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTD-- 195
+ RLGP + LT D L WL +GY + ++ AL PYV W +VA+RL +
Sbjct 142 RERLGPFDVARLTATDPEALGDWLRTHGYELPERLTGALQPYVDRRWEYVAVRLAPDEKD 201
Query 196 -LIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVR 254
+ G L P+R+TF S+ LVYPMRLS A Q + + LS+HR + + V
Sbjct 202 ATLQGELTPLRITFASTELVYPMRLSRLATTSQSLGLSILSEHRME-PRSPIGGDAPEVT 260
Query 255 FAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQV 310
FAG + L L G+ +LT +E + RI D A D PYRQV
Sbjct 261 FAGRIERP--SGALAALAGDRPVHLTVLEQEFPHPERIDDDHHL-RAVADTPYRQV 313
>gi|297157354|gb|ADI07066.1| hypothetical protein SBI_03945 [Streptomyces bingchenggensis
BCW-1]
Length=354
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 86/283 (31%), Positives = 134/283 (48%), Gaps = 30/283 (10%)
Query 55 LHWDGTTETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHW 114
+ WDG TE I+M L + D A ++P P A V D++ F E+ ++AP + ++
Sbjct 1 MRWDGHTEEIIMSLTVGGDAHEAAWIMPVPNRATVRLGDRALFDEVGEVTAPAHRTRHYF 60
Query 115 SLRRGVGASGPQE---------------AAARAPHVLNQVRLGPLEATTLTGGDLSGLQT 159
R G ++ +A V+ + RLGP + LT D L+
Sbjct 61 WPRTGDWPFDDRKIRYVDGAGAGAGAPPSAPPRVGVVGRERLGPFDVARLTATDPDALRD 120
Query 160 WLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLT----------STDLIVGGLDPVRMTFR 209
WL +G+ + ++ L PYVR W +VA+RL + D++ G LDP+ +TF
Sbjct 121 WLEKHGFQLPDRLADGLKPYVRAKWEYVAVRLAPAAVADRGDGAKDVLGGTLDPLWLTFD 180
Query 210 SSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVRFAGDMSTAVRDPL-- 267
S+RL+YPMRLS A+ PQ++ ++ L+ HR + D V +AG T + P
Sbjct 181 SNRLIYPMRLSRLAKTPQNLELYVLAPHRME-PRGDIGGGAPRVTYAG-WVTPGQAPRGD 238
Query 268 LRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQV 310
L EL G +LT + + I D F A DD Y++V
Sbjct 239 LAELAGKR-MFLTSFQQSFPRPELIYGDHEFRRAEKDDTYQRV 280
>gi|309791058|ref|ZP_07685594.1| hypothetical protein OSCT_1545 [Oscillochloris trichoides DG6]
gi|308226913|gb|EFO80605.1| hypothetical protein OSCT_1545 [Oscillochloris trichoides DG6]
Length=335
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/324 (28%), Positives = 154/324 (48%), Gaps = 30/324 (9%)
Query 37 AAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTP-TPAIVTTAD-Q 94
AA G+Q +++E ALL G +++ + + + N ALV+P P P I +D
Sbjct 23 AASLPLGTQ--ISYERALLIDAGKQHHLIISIDVRGASPNAALVIPVPGIPTIDQASDLD 80
Query 95 STFGELDTLSAPLIEHQRH--WSLRRGVGASGPQEAAARAPHVLNQVRLGPLEATTLTGG 152
+ F L+ + P ++ Q W +R + P +L +LG LE ++
Sbjct 81 ALFPYLNMATQPDVDEQNRYVWRVRTTPTPTPPNV------DLLGHQQLGDLEIASVNSS 134
Query 153 DLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRL--TSTDLIVGGLDPVRMTFRS 210
D LQ WL N Y + A + LD YV +GW+FV +RL ++D G P+R+++ +
Sbjct 135 DAVALQAWLHANQYELPAASAPLLDAYVAEGWSFVLVRLRNPASD---GATPPIRISYTA 191
Query 211 SRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDADAATQTTHVRFAGDMS--TAVRDPLL 268
+ LVYP+R++ + P + ++ LS HR Q A T + FAG ++ T D +
Sbjct 192 NELVYPLRMAALSASPIGLDLYVLSAHRYQ------AAGLTPI-FAGPVTSLTPPPDAGV 244
Query 269 RELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPNDDPYRQVVTVYDDVALPP---LLLV 325
+L+ YLT++ + +S D A +D PYR + + V+ +L+V
Sbjct 245 ADLLAT-APYLTRLHSSTLDPAMLSGDLQLERAADDSPYRATIIRAETVSFADRYGVLMV 303
Query 326 VVSAIAVGAAGGAVVVVLRRRRRA 349
++ A + + +RRR RA
Sbjct 304 LLCLAAFSPTSFVIALSIRRRIRA 327
>gi|254386705|ref|ZP_05001999.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194345544|gb|EDX26510.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length=232
Score = 96.7 bits (239), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 64/187 (35%), Positives = 91/187 (49%), Gaps = 7/187 (3%)
Query 128 AAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFV 187
A A A V+ + RLG + LT D L+ WL NG+ + +S L PYV W +V
Sbjct 3 AGAPAVGVVGRERLGDFDVARLTATDPDALRDWLRSNGFELPDRLSTELRPYVDQKWEYV 62
Query 188 AIRLTSTD---LIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDHRQQRTDA 244
A+RL + + G LDP+R+ F S RLVYPMRLS A+ Q + ++ L+DHR + + +
Sbjct 63 AVRLAPREPGTPLRGTLDPLRIRFDSDRLVYPMRLSRMARTAQSLGLYVLADHRMEPS-S 121
Query 245 DAATQTTHVRFAGDMSTAVRDPLLRELIGNHGSYLTKVEVDIYQTSRISSDFTFGNAPND 304
V FAG T L L ++LT ++ + RI D D
Sbjct 122 PIGGDAPEVTFAG---TVTPHGPLAGLTDGKPAFLTAIDQRFPEPGRIDGDHELRRTAAD 178
Query 305 DPYRQVV 311
PYR+ V
Sbjct 179 TPYRRAV 185
>gi|159897385|ref|YP_001543632.1| hypothetical protein Haur_0856 [Herpetosiphon aurantiacus DSM
785]
gi|159890424|gb|ABX03504.1| conserved hypothetical protein [Herpetosiphon aurantiacus DSM
785]
Length=404
Score = 93.6 bits (231), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 84/323 (27%), Positives = 139/323 (44%), Gaps = 43/323 (13%)
Query 20 ITATMVLATPSYACACGAAVTAHGS--QATLNHEVALLHWDGTTE--TIVMQLAMNADTD 75
+ + + PS A ACGA + A QA LN A+ DG T +Q+ D
Sbjct 10 LCSILSFTLPSIAAACGALIPADDQIRQAGLNVIFAV---DGQANQTTAYIQINYVGDPA 66
Query 76 NVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGPQEAAARAPHV 135
A ++P P+ V + STF EL TL+ P + + + P + A +AP+V
Sbjct 67 EFAWILPVPSNPKVDVIEASTFAELHTLTDPRVTFPSPPECFPAIVGAAP-DGAGQAPNV 125
Query 136 LNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTD 195
L Q ++GP + + + D + L+TWL NGY + AAL PY G +A++L
Sbjct 126 LQQGQVGPYDYSVIEDRDPAALETWLKTNGYQTPAGLEAALKPYTEAGMPLIAMKL-KPG 184
Query 196 LIVGGLDPVRMTFRSSRLVYPMRLSVAAQEPQ----------------HVTIFTLSDHRQ 239
+ PV ++F + + P+RL+ + EP+ + FT+ ++
Sbjct 185 ADTNDIQPVAISFTGTTPMLPLRLAALSSEPKTPITVWIFGEAQAIPTNTERFTMRENDL 244
Query 240 QRTDADAATQTTHVRFAGDMSTAVR----------------DPLLRELIGNHGSYLTKVE 283
T D + +R S A + D LL+EL + ++LT++
Sbjct 245 ALTAYDGSNNYKELRSGVLASVAGKGFLTEYAQQSKFLNPQDSLLKELTSKY-AFLTRLY 303
Query 284 VDIYQTSRISSDFTFGNAPNDDP 306
+I + D TFG +P+ P
Sbjct 304 AEI-SPEEMLFDPTFGYSPDLPP 325
>gi|149922676|ref|ZP_01911103.1| hypothetical protein PPSIR1_19904 [Plesiocystis pacifica SIR-1]
gi|149816473|gb|EDM75972.1| hypothetical protein PPSIR1_19904 [Plesiocystis pacifica SIR-1]
Length=573
Score = 75.9 bits (185), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 65/226 (29%), Positives = 95/226 (43%), Gaps = 22/226 (9%)
Query 34 ACGAAVTAHGSQA---TLNHEVALLH-WDGTTET-IVMQLAMNADTDNVALVVPTPTPAI 88
ACG G QA E L H DG E I +Q +A+ + A V+P
Sbjct 27 ACGGTFCDQGPQAMPVDQTGENILFHIGDGFVEAHIQIQYDPDAEAEQFAWVIPVTAIPT 86
Query 89 VTTADQSTFGELDTLSAPLI-----------EHQRHWSLRR-GVGASGPQEA---AARAP 133
+ + F + S P E + W G +G E P
Sbjct 87 FSVGSDNLFSTMLNASVPSYGLTVSNEFCGEETEDGWGGDDLGEDEAGSDEGTDGGNGNP 146
Query 134 HVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTS 193
+V+ + +G E L GG + G+ TWL+DNGY PA L Y+ DG FVA++LT+
Sbjct 147 NVVLETTVGAFEIAVLDGGTVEGVMTWLNDNGYQQDPAAEPILGEYLADGHLFVALKLTN 206
Query 194 TDLIVGGLDPVRMTFRSSRLVYPMRLS-VAAQEPQHVTIFTLSDHR 238
D V + PV + + P++L+ +AA E + +F L D R
Sbjct 207 -DAEVSEIHPVTLRYDGDESCVPIKLTRIAAVENMDIRVFFLQDAR 251
>gi|119483235|ref|ZP_01618649.1| hypothetical protein L8106_04261 [Lyngbya sp. PCC 8106]
gi|119458002|gb|EAW39124.1| hypothetical protein L8106_04261 [Lyngbya sp. PCC 8106]
Length=437
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 69/244 (29%), Positives = 104/244 (43%), Gaps = 30/244 (12%)
Query 22 ATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVV 81
A + +PS CG V S+ +L DG + M + + AL+V
Sbjct 16 AIFTVFSPSAWAFCGFYVAKADSRLYNEASQVILARDGYRTVLTMANDYKGEVKDFALIV 75
Query 82 PTPTPAIVTTADQSTFGE------LDTLSAP-LIEHQRH----WSLR--RGVG----ASG 124
P P +V T +Q GE +D SAP L+E+ + R RG G S
Sbjct 76 PVP---VVLTEEQVRIGEPKIIERIDAFSAPRLVEYFDENPCTYYPRGTRGGGDVFLQSA 132
Query 125 PQEAAARAPHVL-----NQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPY 179
P EA A++ L ++ +G + L+ + GL+TWL NGY I S L PY
Sbjct 133 P-EAEAQSDKTLGITIESRFTVGEYDIIILSAKESDGLETWLIQNGYQIPQGASQLLQPY 191
Query 180 VRDGWAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAAQEPQHVTIFTLS 235
+R F ++ T+ G L P+ M + S R + P+RL + A Q + ++ LS
Sbjct 192 IRQNLKFFVAKVNLTEFDRAGFQSLRPLMMAYESPRFMLPIRLGMINATGEQDLIVYLLS 251
Query 236 DHRQ 239
Q
Sbjct 252 PQGQ 255
>gi|149917671|ref|ZP_01906167.1| hypothetical protein PPSIR1_28123 [Plesiocystis pacifica SIR-1]
gi|149821453|gb|EDM80853.1| hypothetical protein PPSIR1_28123 [Plesiocystis pacifica SIR-1]
Length=558
Score = 72.8 bits (177), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 63/243 (26%), Positives = 101/243 (42%), Gaps = 26/243 (10%)
Query 24 MVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPT 83
MV A S CG V ++ N + ++ DG + MQ T++ A+VVP
Sbjct 31 MVTAPSSAEAFCGFYVAGADAELYNNATMVVMMRDGKRTVLAMQNNYQGPTEDFAMVVPV 90
Query 84 P---TPAIVTTADQSTFGELDTLSAP-LIEH------------QRHWSLRRGVGASG--P 125
P A V T ++ F +D L+AP L+E+ +R +++ S P
Sbjct 91 PVVLQEADVLTLERDVFDRVDQLAAPRLVEYWEQDPCYRPPRPKRSRAMKSMAVESSMPP 150
Query 126 QEAAARAPH---VLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRD 182
EA A + + + +G E L D +GL +WL DNGY+I + L PYV+
Sbjct 151 SEAEGDADYGVTIEAEFTVGEYEIVVLGAQDSTGLDSWLRDNGYSIPEGAADVLGPYVQS 210
Query 183 GWAFVAIRLTSTDLIVGG-----LDPVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLSDH 237
G F ++ + + L P+R + S P+RL + I +
Sbjct 211 GMKFFVAKVDAQKVRFDANGQAQLSPLRFHYDSDTFSLPVRLGLINANGDQDLIIHILGQ 270
Query 238 RQQ 240
RQ+
Sbjct 271 RQR 273
>gi|282896858|ref|ZP_06304864.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281198267|gb|EFA73157.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length=447
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/244 (26%), Positives = 99/244 (41%), Gaps = 26/244 (10%)
Query 22 ATMVLAT--PSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVAL 79
A +VL + P+ CG V ++ ++ DGT + M +D + A+
Sbjct 2 AILVLISFAPTAWAFCGFYVAKADTRLYNQASQVIIARDGTKTVLTMANDFQSDIKDFAV 61
Query 80 VVPTPT---PAIVTTADQSTFGELDTLSAP-LIEHQRHWSLRRGVGASGP--QEAAARAP 133
VVP PT V D LD +AP L+E+ RR S E R P
Sbjct 62 VVPVPTIIQEHQVRVPDPKIIQRLDAFTAPRLVEYFDQDPCRRRYYDSPGVIPETGTRRP 121
Query 134 HVLN--------------QVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPY 179
+ Q +G + L+ + GL+TWL+ NGY I + L PY
Sbjct 122 SAVEKIPGDNTLGVTIEAQFNVGEYDIVILSAKESDGLETWLNLNGYKIPRGANRLLQPY 181
Query 180 VRDGWAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAAQEPQHVTIFTLS 235
VR G F ++ G L P+++ + SS+ + P+RL + A Q + ++ +S
Sbjct 182 VRSGMKFFVAKVNLDKFEQSGYQFLRPLQIAYESSKFILPIRLGMINATTEQDLIVYIIS 241
Query 236 DHRQ 239
Q
Sbjct 242 PRGQ 245
>gi|158335199|ref|YP_001516371.1| hypothetical protein AM1_2042 [Acaryochloris marina MBIC11017]
gi|158305440|gb|ABW27057.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length=445
Score = 70.9 bits (172), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 71/256 (28%), Positives = 111/256 (44%), Gaps = 35/256 (13%)
Query 11 LVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNH--EVALLHWDGTTETIVMQL 68
L C+ ++ ++ +P+ CG V A A N +VA+ DG+ + M
Sbjct 7 LSACLVAVLMWTSL---SPNALAFCGFYV-AKADTALYNEASQVAIAK-DGSRTVLTMAN 61
Query 69 AMNADTDNVALVVPTPTPAIVTTADQSTFGE------LDTLSAP-LIEHQRH-------W 114
D + A+VVP P ++ DQ G+ LD SAP L+E+
Sbjct 62 DFKGDVKDFAMVVPVP---VLLQEDQVHVGDPTILQRLDDFSAPRLVEYFDENPCQVFRK 118
Query 115 SLRRGVGA---SGP-QEAAARAP---HVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYA 167
SL R + A S P QE+ A V Q +G + L+ + +GLQTWL+ NGY
Sbjct 119 SLDRELSAAPSSAPLQESRTEADLGVTVEAQFTVGEYDIVILSAKESNGLQTWLNRNGYK 178
Query 168 IRPAVSAALDPYVRDGWAFVAIRLTSTDL---IVGGLDPVRMTFRSSRLVYPMRLS-VAA 223
I + L PY+R F ++ + L P++M F S + + P+RL + A
Sbjct 179 IPRGARSLLKPYIRQNMKFFVAKVNLEEFEKSEFQKLRPLQMAFDSPKFMLPIRLGMINA 238
Query 224 QEPQHVTIFTLSDHRQ 239
Q + ++ LS Q
Sbjct 239 NTEQDLIVYLLSPKGQ 254
>gi|218246787|ref|YP_002372158.1| hypothetical protein PCC8801_1964 [Cyanothece sp. PCC 8801]
gi|218167265|gb|ACK66002.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
Length=444
Score = 70.1 bits (170), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 64/254 (26%), Positives = 102/254 (41%), Gaps = 29/254 (11%)
Query 10 GLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLA 69
G ++ + T+++ + L P+ A CG V + ++ DG + M
Sbjct 6 GFLISLLTSILM-LVFLIKPALAF-CGFYVAKADTSLYNKASQVIIARDGNRTVLTMAND 63
Query 70 MNADTDNVALVVPTPTPAIVTTADQSTFGE------LDTLSAPLI--------------E 109
+ + ALVVP P +V T +Q GE LD SAP + E
Sbjct 64 YQGEAKDFALVVPVP---VVITEEQVNIGEPEILTRLDGFSAPRLVEYFDTNPCAIYRTE 120
Query 110 HQRHWSLRRGVGASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIR 169
Q S + Q A A + Q +G + L+ GL+TWL N Y I
Sbjct 121 EQIFPSSAARDSFAEKQSANALGVTIEEQFSVGEYDIVILSAKQSDGLETWLKQNDYKIP 180
Query 170 PAVSAALDPYVRDGWAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAAQE 225
VS L PY+R F ++ ++ G L P+ + + S + + P+RL + AQ
Sbjct 181 QGVSELLLPYIRQNMKFFVAKVNLSEYSKQGFKSLRPLMIAYESPKFILPIRLGMLNAQG 240
Query 226 PQHVTIFTLSDHRQ 239
Q + ++ LS Q
Sbjct 241 EQDLIVYLLSPKGQ 254
>gi|172036939|ref|YP_001803440.1| hypothetical protein cce_2024 [Cyanothece sp. ATCC 51142]
gi|171698393|gb|ACB51374.1| unknown [Cyanothece sp. ATCC 51142]
Length=482
Score = 69.7 bits (169), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 61/240 (26%), Positives = 99/240 (42%), Gaps = 29/240 (12%)
Query 26 LATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPT 85
++ S A CG V S+ ++ DG + M + ++ A+VVP P
Sbjct 63 ISMNSAAAFCGFYVAKADSELYNQASQVIIARDGKRTVLTMANDYQGEVNDFAMVVPVP- 121
Query 86 PAIVTTADQSTFGE------LDTLSAP-LIEHQRHWSLRRGV-------GASGPQEAAAR 131
++ T +Q GE LD SAP L+E+ R + PQ A+
Sbjct 122 --VILTEEQVKVGEPKIIERLDAFSAPRLVEYFDEDPCTRSYLEDELFRTPAAPQAASEM 179
Query 132 APHVLNQV--------RLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDG 183
N + +G + L+ + SGL+ WL +NGY I S L PY++
Sbjct 180 KESRNNALGVTVEEAFSVGEYDIVILSAKESSGLEVWLRENGYKIPNGASEILRPYIQQN 239
Query 184 WAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAAQEPQHVTIFTLSDHRQ 239
F ++ T+ G L P+ M + S + + P+RL + AQ Q + ++ LS Q
Sbjct 240 LKFFVAKVNLTEYENTGFKSLRPLMMAYESPKFMLPIRLGMLNAQGEQDLIVYLLSPKGQ 299
>gi|149924456|ref|ZP_01912818.1| hypothetical protein PPSIR1_41084 [Plesiocystis pacifica SIR-1]
gi|149814659|gb|EDM74236.1| hypothetical protein PPSIR1_41084 [Plesiocystis pacifica SIR-1]
Length=567
Score = 69.3 bits (168), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 62/238 (27%), Positives = 98/238 (42%), Gaps = 32/238 (13%)
Query 35 CGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPT---PAIVTT 91
CG V ++ + V +L G + MQ + ++ A+VVP P V T
Sbjct 38 CGFYVADADTEMFNDATVVVLMRQGQRTVLSMQNSYAGPPEDFAMVVPVPVVLQEHQVVT 97
Query 92 ADQSTFGELDTLSAP-LIEH------QRHWSLRRGVGASG-------------PQEAAAR 131
Q F +D LSAP L+E+ + GVGA + +
Sbjct 98 LPQGVFERIDRLSAPRLVEYWERDPCESPMDFGYGVGAGAIGLGGSGAGFGLAAESVSPP 157
Query 132 AP--HVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFV-- 187
P V ++ +G + L+ D GL+TWL GY I L PYV+ G+ F
Sbjct 158 RPVVKVESEFVVGEYDIVVLSAEDSVGLETWLHQEGYQIPKGAEKQLRPYVQQGYKFFVA 217
Query 188 AIRLTSTDLIVG--GLDPVRMTFRSSRLVYPMRLSV---AAQEPQHVTIFTLSDHRQQ 240
+ ++ + G L P+RM + S P+RL + + Q PQ + + L+D R +
Sbjct 218 KVDVSKVKFVDGRVALSPLRMHYDSDSFSLPIRLGLINASKQGPQDLIVHILADDRYE 275
>gi|170078040|ref|YP_001734678.1| hypothetical protein SYNPCC7002_A1431 [Synechococcus sp. PCC
7002]
gi|169885709|gb|ACA99422.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length=446
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 63/245 (26%), Positives = 105/245 (43%), Gaps = 33/245 (13%)
Query 24 MVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPT 83
++L P++A CG V + ++ DG + M + + ALVVP
Sbjct 20 LLLTQPAWAF-CGFYVAKADTDLYNQASQVIIARDGDRTVLTMANDYEGEVSDFALVVPV 78
Query 84 PTPAIVTTADQSTFGE------LDTLSAP-LIEH--QRHWSLRR---GVG----ASGPQE 127
P ++ +Q GE L+ SAP L+E+ + ++RR G G A+ P
Sbjct 79 P---VILKEEQVHIGEASIIKRLNDFSAPRLVEYFDENPCAVRRFDDGFGLLQNAAPPMP 135
Query 128 AAARAPH---------VLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDP 178
AA + + Q +G + L+ + GL+TWL N Y + S L P
Sbjct 136 MAAESMRETAADLGVTIEEQFSVGEYDILILSAKESDGLETWLRQNDYQLPQGASELLRP 195
Query 179 YVRDGWAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLSVA-AQEPQHVTIFTL 234
Y+R+ F ++ + G L P+ M F S + + P+RL + AQ+ Q + ++ L
Sbjct 196 YIRNKLKFFVAKVNLEEFDRSGVNQLRPLMMAFESPKYMLPIRLGMMNAQQAQDLIVYIL 255
Query 235 SDHRQ 239
S Q
Sbjct 256 SPKGQ 260
>gi|300866679|ref|ZP_07111363.1| conserved exported hypothetical protein [Oscillatoria sp. PCC
6506]
gi|300335279|emb|CBN56523.1| conserved exported hypothetical protein [Oscillatoria sp. PCC
6506]
Length=398
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/217 (27%), Positives = 94/217 (44%), Gaps = 17/217 (7%)
Query 35 CGAAVTAHGSQATLNH-EVALLHWDGTTETIVMQLAMNADTDNVALVVPTP---TPAIVT 90
CG V +Q N EVA+ H + T T + D ALV+P P V
Sbjct 30 CGFFVAKVDAQLFNNRSEVAIAHRNNDT-TYSLAFDYKGDPKEFALVLPVPIVLKKQDVK 88
Query 91 TADQSTFGELDTLSAPLIEHQRHWSLRRGVGASGPQEAAA----RAP-HVLNQVRLGPLE 145
D F LD +AP + R+ R+ A P+ A RAP V+ + +G +
Sbjct 89 VIDAKLFQRLDDFTAPRLV--RYQDFRQNAPAGAPRSATETKRDRAPVTVVERFTVGEYD 146
Query 146 ATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTD---LIVGGLD 202
L+ + + L+TWL N Y + + L PY+ F +R+ + L L
Sbjct 147 VVILSATESNALETWLRQNKYRLPNNAARYLKPYIDQKLYFFVVRINFKEQQRLGFQNLR 206
Query 203 PVRMTF-RSSRLVYPMRL-SVAAQEPQHVTIFTLSDH 237
P++ T S++++ P +L + ++ Q + ++ LSD
Sbjct 207 PLQFTVANSNQIMLPFQLGKINSEGTQDIIVYFLSDK 243
>gi|67922512|ref|ZP_00516021.1| similar to Uncharacterized protein conserved in bacteria [Crocosphaera
watsonii WH 8501]
gi|67855683|gb|EAM50933.1| similar to Uncharacterized protein conserved in bacteria [Crocosphaera
watsonii WH 8501]
Length=443
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/240 (26%), Positives = 96/240 (40%), Gaps = 37/240 (15%)
Query 30 SYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPTPAIV 89
S A CG V S ++ DG + M + D+ A+VVP P ++
Sbjct 23 SAAAFCGFYVAKADSNLYNQASQVVIARDGKRTVLTMANDYQGEVDDFAMVVPVP---VI 79
Query 90 TTADQSTFGE------LDTLSAP-LIEHQRHWSLRRGVGASGPQEAAARAPH-------- 134
+Q GE LD SAP L+E+ G S ++ RAP
Sbjct 80 LKEEQVKVGEPEIIERLDAFSAPRLVEYFGE----DPCGRSYLEDEVLRAPTAPRSAPEM 135
Query 135 -----------VLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDG 183
V + +G + L+ + +GL+ WL +NGY I S L PY++
Sbjct 136 SSSRDNALEVTVEEEFTVGEYDIVILSAKESNGLEIWLRENGYKIPNGASTILRPYIQQN 195
Query 184 WAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAAQEPQHVTIFTLSDHRQ 239
F ++ T+ G L P+ M + S + + P+RL + AQ Q + ++ LS Q
Sbjct 196 LKFFVAKVNLTEYENTGFKSLRPLMMAYESPKFMLPIRLGMLNAQGEQDLIVYLLSPKGQ 255
>gi|71909668|ref|YP_287255.1| hypothetical protein Daro_4059 [Dechloromonas aromatica RCB]
gi|71849289|gb|AAZ48785.1| conserved hypothetical protein [Dechloromonas aromatica RCB]
Length=464
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/202 (27%), Positives = 83/202 (42%), Gaps = 22/202 (10%)
Query 58 DGTTETIVMQLAMNADTDNVALVVPTPT---PAIVTTADQSTFGELDTLSAP-LIEH--- 110
DG + M D ALVVP PT + D+ TF LD SAP L E+
Sbjct 44 DGDKTVVSMLNDYKGDAKEFALVVPVPTVLQKGQINVGDKKTFDRLDAYSAPRLAEYYDP 103
Query 111 ----QRHWSLRRG-------VGASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQT 159
+R + +R G A A + +G + L+ GL+T
Sbjct 104 NPCDRRLYEMRSKDAAVAPMAAPVGSAAAKALGVTIEASYTVGEYDIVILSATQSDGLET 163
Query 160 WLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYP 216
WL +GY I + AL PY+R F ++ + G L P++ + S + + P
Sbjct 164 WLKQSGYRIPANSAKALAPYIRQNMKFFVAKVNLAEQAKSGFTMLRPLQFAYESEKFMLP 223
Query 217 MRLSVA-AQEPQHVTIFTLSDH 237
+RL +A A PQ + + L+ +
Sbjct 224 IRLGMANANGPQDLIAYMLTKN 245
>gi|126658382|ref|ZP_01729531.1| hypothetical protein CY0110_27520 [Cyanothece sp. CCY0110]
gi|126620314|gb|EAZ91034.1| hypothetical protein CY0110_27520 [Cyanothece sp. CCY0110]
Length=442
Score = 67.0 bits (162), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 58/227 (26%), Positives = 92/227 (41%), Gaps = 29/227 (12%)
Query 35 CGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPTPAIVTTADQ 94
CG V S ++ DG + M ++ A+VVP P +V T +Q
Sbjct 29 CGFYVAKADSNLYNQASQVIIARDGKRTVLTMANDYQGSVNDFAMVVPVP---VVLTEEQ 85
Query 95 STFGE------LDTLSAP-LIEHQRHWSLRRGV-------GASGPQEAAARAPH------ 134
GE LD SAP L+E+ R + PQ A+
Sbjct 86 VKVGEPAIIERLDAFSAPRLVEYFDEDPCDRSYLEDEVFRTPAAPQAASEMKESRDNALG 145
Query 135 --VLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLT 192
V + +G + L+ + +GL+ WL +NGY I S L PY++ F ++
Sbjct 146 VTVEEEFTVGEYDIVILSAKESNGLEVWLRENGYEIPNGASEILQPYIQQNLKFFVAKVN 205
Query 193 STDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAAQEPQHVTIFTLS 235
T+ G L P+ M + S + + P+RL + AQ Q + ++ LS
Sbjct 206 LTEYENTGFKSLRPLMMAYESPKFMLPIRLGMLNAQGEQDLIVYLLS 252
>gi|17229490|ref|NP_486038.1| hypothetical protein alr1998 [Nostoc sp. PCC 7120]
gi|17131088|dbj|BAB73697.1| alr1998 [Nostoc sp. PCC 7120]
Length=455
Score = 67.0 bits (162), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 67/256 (27%), Positives = 107/256 (42%), Gaps = 26/256 (10%)
Query 8 RLGLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQ 67
RL L+V + V+ A + A ++A CG V ++ +L DG + M
Sbjct 3 RLKLLVPLFL-VMLAVLCFAPAAWAF-CGFYVAKADTKLYNKASQVVLARDGDRTVLTMA 60
Query 68 LAMNADTDNVALVVPTPT---PAIVTTADQSTFGELDTLSAP-LIEHQ-------RHWSL 116
+ + A+VVP PT V A+ LD SAP L+E+ +SL
Sbjct 61 NDYQGEVKDFAMVVPVPTVIKKEQVRVAEPKIIERLDAFSAPRLVEYFDSNPCAVEDFSL 120
Query 117 RR------GVGASG-PQEAAARAPHVLNQVRL--GPLEATTLTGGDLSGLQTWLSDNGYA 167
+ V SG + R V + R G + L+ + GL+TWL+ NGY
Sbjct 121 QALPAPSAAVNESGVARRRGDRNLGVTVEARFNVGEYDIVVLSAKESGGLETWLNRNGYK 180
Query 168 IRPAVSAALDPYVRDGWAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAA 223
I L PY+R F ++ G L P+++ ++SS+ + P+RL + A
Sbjct 181 IPRGAKQLLKPYIRSSMKFFVAKVNLDKFEQSGYQFLRPLQIAYKSSKFMLPIRLGMINA 240
Query 224 QEPQHVTIFTLSDHRQ 239
Q + ++ LS Q
Sbjct 241 TTEQDLIVYVLSPKGQ 256
>gi|149918942|ref|ZP_01907428.1| hypothetical protein PPSIR1_16765 [Plesiocystis pacifica SIR-1]
gi|149820316|gb|EDM79733.1| hypothetical protein PPSIR1_16765 [Plesiocystis pacifica SIR-1]
Length=590
Score = 67.0 bits (162), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 57/233 (25%), Positives = 93/233 (40%), Gaps = 24/233 (10%)
Query 29 PSYACACGAAVTAHGSQA-----TLNHEVALLHWDGTTET-IVMQLAMNADTDNVALVVP 82
P+ A ACG G Q+ T + + ++ + E I +Q +AD D A V+P
Sbjct 25 PNAAEACGGTFCDTGPQSMPVDQTGENILFVMGDENIVEAHIQIQYDPDADADKFAWVIP 84
Query 83 TPTPAIVTTADQSTFGELDTLSAPLIEHQ--------------RHW--SLRRGVGASGPQ 126
+ F + + P + W G G +
Sbjct 85 MTAVPEFEVGSERLFQNMLAGTVPTYGYSTTQESCGGEGDDEAGGWGDEGESGTGDESTE 144
Query 127 EAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAF 186
++ P V+ + +G E L GG + G+ WL DNGY P L Y+ +G F
Sbjct 145 DSGGGGPTVVLEEIVGAFEIAVLEGGTIEGVMQWLEDNGYQQDPNAEPILAEYLEEGHLF 204
Query 187 VAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLS-VAAQEPQHVTIFTLSDHR 238
VAI+L + V + P+ + ++ P+RL+ +AA E V +F + D R
Sbjct 205 VAIKL-GMNAEVDEIHPIVLRYQGDETCVPLRLTRIAAVEDMDVRVFVIGDGR 256
>gi|148259008|ref|YP_001243593.1| hypothetical protein BBta_7862 [Bradyrhizobium sp. BTAi1]
gi|146411181|gb|ABQ39687.1| putative exported protein of unknown function [Bradyrhizobium
sp. BTAi1]
Length=442
Score = 66.6 bits (161), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 63/232 (28%), Positives = 99/232 (43%), Gaps = 25/232 (10%)
Query 35 CGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPT---PAIVTT 91
CG V ++ +L DG +I M D A+VVP PT +
Sbjct 31 CGFYVAQADAKLFNKSSKVVLARDGEQTSITMASDFEGDVKEFAVVVPVPTFIERKQIGV 90
Query 92 ADQSTFGELDTLSAP-LIEHQRH-----WSLRRGVGASGPQEAAARAPHVLNQVRLGPLE 145
+ T LD+ +AP L+E+ R GA P A AR L++ +E
Sbjct 91 VEPKTIDHLDSYTAPRLVEYHDEDPCHPIMYRMAPGAPMPS-AVARGEASLSRQYGVTIE 149
Query 146 AT---------TLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLTSTDL 196
A+ L+ + GL WL+DN Y I A L Y+R G F ++ +
Sbjct 150 ASYDVAEYDVLILSAQESDGLTRWLTDNDYRIPQGAEAVLGSYIRQGMRFFVAKVNVERM 209
Query 197 IV---GGLDPVRMTFRSSRLVYPMRL-SVAAQEPQHVTIFTLSDHRQQRTDA 244
G L P+++ ++S++ + P+RL +V A PQ + I+ L+ R R +A
Sbjct 210 KAVGNGTLRPLQVRYQSAKFMVPLRLGTVNAAGPQDLIIYALT--RSGRIEA 259
>gi|257059829|ref|YP_003137717.1| hypothetical protein Cyan8802_1991 [Cyanothece sp. PCC 8802]
gi|256589995|gb|ACV00882.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
Length=444
Score = 66.2 bits (160), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 63/254 (25%), Positives = 104/254 (41%), Gaps = 29/254 (11%)
Query 10 GLVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLA 69
G ++ + T+++ + L P+ A CG V + ++ DG + M
Sbjct 6 GFLISLLTSILM-LVFLIKPALAF-CGFYVAKADTSLYNKASQVIIARDGNRTVLTMAND 63
Query 70 MNADTDNVALVVPTPTPAIVTTADQSTFGE------LDTLSAP-LIEH--QRHWSLRRGV 120
+ + ALVVP P +V T +Q G+ LD SAP L+E+ ++ R
Sbjct 64 YQGEAKDFALVVPVP---VVITEEQVNIGDPEILTRLDGFSAPRLVEYFDANPCAIYRTE 120
Query 121 GASGP-----------QEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIR 169
P Q A A + Q +G + L+ GL+TWL N Y I
Sbjct 121 EQILPSSAARDSFAEKQSANALGVTIEEQFTVGEYDIVILSAKQSDGLETWLKQNDYKIP 180
Query 170 PAVSAALDPYVRDGWAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAAQE 225
S L PY+R F ++ ++ G L P+ + + S + + P+RL + AQ
Sbjct 181 QGASELLHPYIRQNMKFFVAKVNLSEYSKQGFKSLRPLMIAYESPKFILPIRLGMLNAQG 240
Query 226 PQHVTIFTLSDHRQ 239
Q + ++ LS Q
Sbjct 241 EQDLIVYLLSPKGQ 254
>gi|149918055|ref|ZP_01906548.1| hypothetical protein PPSIR1_41684 [Plesiocystis pacifica SIR-1]
gi|149821060|gb|EDM80466.1| hypothetical protein PPSIR1_41684 [Plesiocystis pacifica SIR-1]
Length=684
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 63/238 (27%), Positives = 94/238 (40%), Gaps = 32/238 (13%)
Query 12 VVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMN 71
+ A+ A ++ TPS+A CG V ++ + V +L DG + MQ A
Sbjct 3 IALTASLAAGAISLIPTPSHAF-CGFYVAGADAELFNDATVVVLMRDGKRTVLSMQNAYR 61
Query 72 ADTDNVALVVPTPTPAIVTTADQ------STFGELDTLSAP-LIEHQRHWSLRRGVGASG 124
+ A+VVP P +V + D F ++TLS+P L+E+ G G +G
Sbjct 62 GPPEAFAMVVPVP---VVLSEDDVKVLRPELFDRVETLSSPRLVEYWEQDPCGDGYGVAG 118
Query 125 PQEAAARAPH----------------VLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAI 168
V + +G E L+ + +GL TWL DNGY+I
Sbjct 119 LGLIGTGRGGGGTGSGYGFGGVPTVTVEAEFEVGEYEVVILSATESTGLDTWLRDNGYSI 178
Query 169 RPAVSAALDPYVRDGWAFVA-----IRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLSV 221
L PYV G F R+ D L P+RM + S P+RL +
Sbjct 179 PAGAEPVLRPYVEAGSKFFVAKVDPARVQFDDQGRAMLSPLRMHYDSEEFSLPVRLGL 236
>gi|254415125|ref|ZP_05028887.1| hypothetical protein MC7420_2551 [Microcoleus chthonoplastes
PCC 7420]
gi|196177931|gb|EDX72933.1| hypothetical protein MC7420_2551 [Microcoleus chthonoplastes
PCC 7420]
Length=439
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/253 (26%), Positives = 103/253 (41%), Gaps = 24/253 (9%)
Query 11 LVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAM 70
L V +A A+ +++ P CG V ++ ++ DG + M
Sbjct 5 LQVLIALALAVFAIMIFAPKALAFCGFYVAKADTKLYNQASQVIIARDGDRTILTMANDY 64
Query 71 NADTDNVALVVPTP---TPAIVTTADQSTFGELDTLSAP-LIEH-----------QRHWS 115
D + A+VVP P V A+ LD SAP L+E+ + S
Sbjct 65 QGDVKDFAVVVPVPVVLEQEQVQVANPKIIERLDGFSAPRLVEYFDPNPCIPPAPRELRS 124
Query 116 LRRGVGASGPQEAAARAPHVL-----NQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRP 170
L GV +S A R L ++ +G + L+ + GL+TWL N Y I
Sbjct 125 LSGGVTSSADNSAGNRGDSALGVTIESRFSVGEYDILILSAKESDGLETWLRRNDYRIPR 184
Query 171 AVSAALDPYVRDGWAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAAQEP 226
S L PY+R F ++ + GG L P++M + S R + P+RL + A
Sbjct 185 GASRLLQPYIRQNMKFFVAKVNLKEFESGGSQLLRPLQMAYESPRFMLPIRLGMINATTE 244
Query 227 QHVTIFTLSDHRQ 239
Q + ++ LS + Q
Sbjct 245 QDLIVYILSRNGQ 257
>gi|220907398|ref|YP_002482709.1| hypothetical protein Cyan7425_1983 [Cyanothece sp. PCC 7425]
gi|219864009|gb|ACL44348.1| conserved hypothetical protein [Cyanothece sp. PCC 7425]
Length=450
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 64/255 (26%), Positives = 100/255 (40%), Gaps = 32/255 (12%)
Query 11 LVVCVATAVITATMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAM 70
L+ C +I ++ TP+ CG V ++ + +G + M
Sbjct 8 LITCGLIVMICCSL---TPAAWAFCGFYVAKADTKLYNRASQVAIARNGNRTVLTMANDY 64
Query 71 NADTDNVALVVPTPTPAIVTTADQSTFG------ELDTLSAP-LIEH---------QRHW 114
D + A+VVP PT V +Q G LD SAP L+E+ R
Sbjct 65 QGDVKDFAIVVPVPT---VLKKEQVQVGHPKIMERLDAFSAPRLVEYFDPDPCAPPARLE 121
Query 115 SLRRGVGASGPQEAAARAPH------VLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAI 168
+ AS P + + V + +G L+ + GL+TWL NGY I
Sbjct 122 DRGLQMPASAPMRSEMKRRDNALGVTVEAKFNVGEYNILILSAKESGGLETWLVRNGYKI 181
Query 169 RPAVSAALDPYVRDGWAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAAQ 224
L PYVR F ++ + G L P+++++ S R + P+RL V AQ
Sbjct 182 PQGARQLLQPYVRQQMKFFVAKVNLAEFNKAGYQNLRPLQISYDSPRFMLPIRLGMVNAQ 241
Query 225 EPQHVTIFTLSDHRQ 239
Q + ++ LS Q
Sbjct 242 TAQDLMVYILSPQGQ 256
>gi|149918943|ref|ZP_01907429.1| hypothetical protein PPSIR1_16770 [Plesiocystis pacifica SIR-1]
gi|149820317|gb|EDM79734.1| hypothetical protein PPSIR1_16770 [Plesiocystis pacifica SIR-1]
Length=571
Score = 63.5 bits (153), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 38/116 (33%), Positives = 57/116 (50%), Gaps = 2/116 (1%)
Query 124 GPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDG 183
G + P V+ Q +G E L GG + G+ WL DN Y PA L Y+ +G
Sbjct 99 GEESTTGGEPVVVLQEIVGAFEIAVLDGGTIEGVMQWLGDNDYQQDPAAEPILAEYLNEG 158
Query 184 WAFVAIRLTSTDLIVGGLDPVRMTFRSSRLVYPMRLS-VAAQEPQHVTIFTLSDHR 238
FVAI+L + + V + P+ + ++ P+RL+ +AA E V +F L D R
Sbjct 159 HLFVAIKL-AMNTEVDEIHPIVLRYQGDETCVPLRLTRIAAVEDMDVRVFILGDGR 213
>gi|302036533|ref|YP_003796855.1| hypothetical protein NIDE1172 [Candidatus Nitrospira defluvii]
gi|300604597|emb|CBK40929.1| conserved membrane protein of unknown function (modular protein)
[Candidatus Nitrospira defluvii]
Length=759
Score = 63.2 bits (152), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 65/235 (28%), Positives = 103/235 (44%), Gaps = 30/235 (12%)
Query 35 CGAAVTAHGSQATLNH--EVALLHWDGTTETIVMQLAMNADTDNVALVVPTPT------- 85
CG V +Q N EVA+ + T I M D ALVVP PT
Sbjct 337 CGFYVGKADTQ-LFNKASEVAIARHENKT-VITMANDFRGDVKEFALVVPVPTLLEREQI 394
Query 86 ----PAIVT-TADQST------FGELDTLSAPLIEHQRHWSLRRGVGASGPQEAAARAPH 134
PA++ AD S F E L L+E + +++ AS P +A
Sbjct 395 HVGNPAVLKHLADYSAPRLVEYFDENPCLRHELMERRSMDAMKSMAPASAPARERDKALG 454
Query 135 VLNQVR--LGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVAIRLT 192
V + +G + L+ + +GL++WL++NGY I S L Y++ G F ++
Sbjct 455 VTVEAEYMVGEYDILILSAKESNGLESWLTENGYRIPNGASVVLHSYLKQGMKFFVAKVN 514
Query 193 ---STDLIVGGLDPVRMTFRSSRLVYPMRL-SVAAQEPQHVTIFTLSDHRQQRTD 243
T L + L P+++ F S + + P+RL +V A Q + I+ L+ +Q R +
Sbjct 515 LGEQTKLGLTHLRPLQIAFESPKFMLPIRLGTVNADGAQELFIYFLT--KQGRVE 567
>gi|75911164|ref|YP_325460.1| hypothetical protein Ava_4968 [Anabaena variabilis ATCC 29413]
gi|75704889|gb|ABA24565.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length=455
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 59/235 (26%), Positives = 95/235 (41%), Gaps = 24/235 (10%)
Query 29 PSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVPTPT--- 85
P+ CG V ++ +L DG + M + + A+VVP PT
Sbjct 22 PAAWAFCGFYVAKADAKLYNKASQVVLARDGDRTVLTMANDYQGEVKDFAMVVPVPTVIK 81
Query 86 PAIVTTADQSTFGELDTLSAP-LIEHQ-------RHWSLRR--GVGASGPQEAAAR---- 131
V A+ LD SAP L+E+ ++L A+ + AAR
Sbjct 82 KEQVRVAEPKIIERLDAFSAPRLVEYFDSNPCAVEDFALEALPAPSAALNESGAARRRGD 141
Query 132 ---APHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYVRDGWAFVA 188
V + +G + L+ + GL+TWL+ NGY I L PY+R F
Sbjct 142 RSLGVTVEARFNVGEYDIVVLSAKESGGLETWLNRNGYKIPRGAKQLLKPYIRSSMKFFV 201
Query 189 IRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAAQEPQHVTIFTLSDHRQ 239
++ G L P+++ ++SSR + P+RL + A Q + ++ LS Q
Sbjct 202 AKVNLDRFEQSGYQFLRPLQIAYKSSRFMLPIRLGMINATTEQDLIVYVLSPQGQ 256
>gi|223937185|ref|ZP_03629092.1| conserved hypothetical protein [bacterium Ellin514]
gi|223894207|gb|EEF60661.1| conserved hypothetical protein [bacterium Ellin514]
Length=715
Score = 62.8 bits (151), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 43/213 (21%), Positives = 85/213 (40%), Gaps = 29/213 (13%)
Query 51 EVALLHWDGTTETIVMQLAMNADTDNVALVVPTPTPAIVTTADQSTFGELDTLSAPLIEH 110
+ A++ D E +++Q+ ++ ++PTP V F EL L+
Sbjct 34 QKAIIFHDAGREDLLLQVKYEGPLEDFGWLIPTPNLPDVREGTMGPFYELSKLTQRHFGS 93
Query 111 QRHWSLRRGVGASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRP 170
W RG+ +A V+ +G E + L+ D LQ WL + Y+
Sbjct 94 GEGWGRGRGLDTLS-NGGSAEDVKVIQIKTVGAYEVSILSPKDAGSLQRWLKAHAYSFPE 152
Query 171 AVSAALDPYVRDGWAFVAIRLT----------------------------STDLIVGGLD 202
S ++ Y+R GW F+A ++ + L G L
Sbjct 153 GKSEIVEEYIRLGWYFIAAKIELNKGLGFKKVPATSPKEAPGAATARTTLQSKLSSGELH 212
Query 203 PVRMTFRSSRLVYPMRLSVAAQEPQHVTIFTLS 235
P+ ++F + + V+P+++S +P V+++ ++
Sbjct 213 PLLISFDTPKAVFPLKISAVGGKPSEVSLYVIA 245
>gi|1230542|gb|AAA93039.1| ORFI [Synechocystis sp.]
Length=387
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/251 (27%), Positives = 104/251 (42%), Gaps = 37/251 (14%)
Query 15 VATAVITATMVLATPSYACACGAAVTAHGSQATLNH-EVALLHWDGTTETIVMQLAMNAD 73
+ A + + + P+ A CG V A + NH ++ DG + M
Sbjct 17 ILVACLLSLLFFVRPALAF-CGFYV-AQADTSLYNHASQVIIAKDGDQTVLTMANDYQGK 74
Query 74 TDNVALVVPTPTPAIVTTADQSTFGE------LDTLSAP-LIEH----------QRHW-- 114
+ ALVVP +V DQ GE LD SAP L+E+ R +
Sbjct 75 AQDFALVVPV---PVVLQEDQVNVGERKIIERLDNFSAPRLVEYFDNNPCETYGGRQFMD 131
Query 115 ------SLRRGVGASGPQEAAARAPHVLNQVRLGPLEATTLTGGDLSGLQTWLSDNGYAI 168
S+ RG+ EA + NQ +G + L+ + +GL+TWL+ N Y I
Sbjct 132 AMPAAPSMTRGLQEKISNEALGVT--IENQFSVGEYDILILSAKESNGLETWLNQNNYRI 189
Query 169 RPAVSAALDPYVRDGWAFVAIRLTSTDLIVGG---LDPVRMTFRSSRLVYPMRLS-VAAQ 224
P + L Y++ G F ++ + G L P+ M + S R + P+RL V A
Sbjct 190 PPGATDVLGAYIKQGLKFFVAKVNLKEFDRQGFQALRPLMMAYESPRFMLPIRLGMVNAD 249
Query 225 EPQHVTIFTLS 235
PQ + ++ LS
Sbjct 250 GPQELIVYLLS 260
>gi|149919171|ref|ZP_01907655.1| hypothetical protein PPSIR1_35387 [Plesiocystis pacifica SIR-1]
gi|149820101|gb|EDM79522.1| hypothetical protein PPSIR1_35387 [Plesiocystis pacifica SIR-1]
Length=566
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/241 (25%), Positives = 92/241 (39%), Gaps = 31/241 (12%)
Query 23 TMVLATPSYACACGAAVTAHGSQATLNHEVALLHWDGTTETIVMQLAMNADTDNVALVVP 82
T+ L A CG V ++ N + +L DGT + M ++ A+VVP
Sbjct 35 TLSLVPSQAAAFCGFYVAGADAELYNNATMVVLMRDGTRTVLSMANNYEGPPEDFAMVVP 94
Query 83 TPTPAIVTTADQ------STFGELDTLSAP-LIEHQR------HWSLRRGVGASGPQEAA 129
P +V D F +D L+AP L+E+ +W E A
Sbjct 95 VP---VVLDEDDVRVLPADVFDRVDKLAAPRLVEYWEQDPCNPYWGYPEPDTVEDAMEMA 151
Query 130 ----ARAPHVLN-----QVRLGPLEATTLTGGDLSGLQTWLSDNGYAIRPAVSAALDPYV 180
R P L + +G + L+ + +GL TWL Y I L+PYV
Sbjct 152 PAGGGREPKDLGVTIEAEFEVGEYQVVILSAKESTGLDTWLRQEQYNIPAGAQPLLEPYV 211
Query 181 RDGWAFVAIRLTSTDLIVGG-----LDPVRMTFRSSRLVYPMRLS-VAAQEPQHVTIFTL 234
G F ++ S + L P+R + S P+RL + AQ PQ + + L
Sbjct 212 ASGSKFFVAKVDSEKVTFDANGQAELSPLRFHYDSQDFALPVRLGLINAQGPQDLLVHIL 271
Query 235 S 235
+
Sbjct 272 A 272
Lambda K H
0.320 0.132 0.392
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 654597672016
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40