BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0918
Length=158
Score E
Sequences producing significant alignments: (Bits) Value
gi|15840340|ref|NP_335377.1| hypothetical protein MT0944 [Mycoba... 315 1e-84
gi|15608058|ref|NP_215433.1| hypothetical protein Rv0918 [Mycoba... 315 2e-84
gi|31792106|ref|NP_854599.1| hypothetical protein Mb0942 [Mycoba... 313 8e-84
gi|339630981|ref|YP_004722623.1| hypothetical protein MAF_09270 ... 299 9e-80
gi|308231671|ref|ZP_07413385.2| hypothetical protein TMAG_00854 ... 186 1e-45
gi|145221381|ref|YP_001132059.1| hypothetical protein Mflv_0786 ... 137 5e-31
gi|333988665|ref|YP_004521279.1| hypothetical protein JDM601_002... 135 2e-30
gi|315441736|ref|YP_004074615.1| hypothetical protein Mspyr1_004... 131 4e-29
gi|336178777|ref|YP_004584152.1| hypothetical protein FsymDg_290... 112 3e-23
gi|302343978|ref|YP_003808507.1| hypothetical protein Deba_2558 ... 80.5 7e-14
gi|302344014|ref|YP_003808543.1| hypothetical protein Deba_2595 ... 76.3 1e-12
gi|240169137|ref|ZP_04747796.1| hypothetical protein MkanA1_0747... 73.9 7e-12
gi|154508312|ref|ZP_02043954.1| hypothetical protein ACTODO_0080... 71.6 4e-11
gi|116620423|ref|YP_822579.1| hypothetical protein Acid_1300 [Ca... 69.7 1e-10
gi|194336777|ref|YP_002018571.1| hypothetical protein Ppha_1718 ... 68.6 3e-10
gi|258404534|ref|YP_003197276.1| hypothetical protein Dret_0400 ... 68.2 4e-10
gi|333990697|ref|YP_004523311.1| hypothetical protein JDM601_205... 68.2 4e-10
gi|86739879|ref|YP_480279.1| CopG-like DNA-binding [Frankia sp. ... 67.4 7e-10
gi|296134812|ref|YP_003642054.1| Protein of unknown function DUF... 65.9 2e-09
gi|315605436|ref|ZP_07880476.1| conserved hypothetical protein [... 65.5 2e-09
gi|288916910|ref|ZP_06411282.1| Protein of unknown function DUF1... 65.1 3e-09
gi|94265184|ref|ZP_01288946.1| conserved hypothetical protein [d... 64.7 4e-09
gi|83648276|ref|YP_436711.1| hypothetical protein HCH_05628 [Hah... 64.3 5e-09
gi|304311929|ref|YP_003811527.1| hypothetical protein HDN1F_2300... 64.3 6e-09
gi|220935245|ref|YP_002514144.1| hypothetical protein Tgr7_2077 ... 63.9 8e-09
gi|158314003|ref|YP_001506511.1| hypothetical protein Franean1_2... 63.5 8e-09
gi|90425040|ref|YP_533410.1| hypothetical protein RPC_3552 [Rhod... 62.8 1e-08
gi|294338764|emb|CAZ87098.1| conserved hypothetical protein; put... 62.8 2e-08
gi|340627803|ref|YP_004746255.1| hypothetical protein MCAN_28301... 62.8 2e-08
gi|306807392|ref|ZP_07444060.1| hypothetical protein TMGG_02066 ... 61.2 4e-08
gi|307085503|ref|ZP_07494616.1| hypothetical protein TMLG_02519 ... 61.2 4e-08
gi|119898317|ref|YP_933530.1| hypothetical protein azo2026 [Azoa... 61.2 4e-08
gi|15842341|ref|NP_337378.1| hypothetical protein MT2871 [Mycoba... 61.2 5e-08
gi|110597351|ref|ZP_01385639.1| conserved hypothetical protein [... 60.8 6e-08
gi|308232248|ref|ZP_07664041.1| hypothetical protein TMAG_03186 ... 60.8 6e-08
gi|289444352|ref|ZP_06434096.1| conserved hypothetical protein [... 60.8 7e-08
gi|344201044|ref|YP_004785370.1| hypothetical protein Acife_2970... 59.7 1e-07
gi|87300639|ref|ZP_01083481.1| putative cytoplasmic protein [Syn... 59.7 2e-07
gi|333989585|ref|YP_004522199.1| hypothetical protein JDM601_094... 59.3 2e-07
gi|111221314|ref|YP_712108.1| hypothetical protein FRAAL1874 [Fr... 58.5 3e-07
gi|334112599|ref|ZP_08486858.1| protein of unknown function DUF1... 58.2 4e-07
gi|320095182|ref|ZP_08026888.1| hypothetical protein HMPREF9005_... 58.2 4e-07
gi|344337719|ref|ZP_08768653.1| protein of unknown function DUF1... 57.4 6e-07
gi|316934787|ref|YP_004109769.1| hypothetical protein Rpdx1_3466... 57.4 7e-07
gi|227113264|ref|ZP_03826920.1| hypothetical protein PcarbP_0988... 57.0 1e-06
gi|237808068|ref|YP_002892508.1| hypothetical protein Tola_1305 ... 56.2 2e-06
gi|344199686|ref|YP_004784012.1| hypothetical protein Acife_1545... 55.8 2e-06
gi|82778765|ref|YP_405114.1| hypothetical protein SDY_3657 [Shig... 54.7 5e-06
gi|15803925|ref|NP_289961.1| hypothetical protein Z4776 [Escheri... 54.7 5e-06
gi|254498060|ref|ZP_05110820.1| conserved hypothetical protein [... 53.9 7e-06
>gi|15840340|ref|NP_335377.1| hypothetical protein MT0944 [Mycobacterium tuberculosis CDC1551]
gi|13880504|gb|AAK45191.1| hypothetical protein MT0944 [Mycobacterium tuberculosis CDC1551]
Length=178
Score = 315 bits (808), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 158/158 (100%), Positives = 158/158 (100%), Gaps = 0/158 (0%)
Query 1 LHRAGAAVTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLGSSGLPALNCTPIRHT 60
LHRAGAAVTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLGSSGLPALNCTPIRHT
Sbjct 21 LHRAGAAVTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLGSSGLPALNCTPIRHT 80
Query 61 IDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLF 120
IDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLF
Sbjct 81 IDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLF 140
Query 121 VLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG 158
VLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG
Sbjct 141 VLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG 178
>gi|15608058|ref|NP_215433.1| hypothetical protein Rv0918 [Mycobacterium tuberculosis H37Rv]
gi|148660697|ref|YP_001282220.1| hypothetical protein MRA_0926 [Mycobacterium tuberculosis H37Ra]
gi|148822127|ref|YP_001286881.1| hypothetical protein TBFG_10936 [Mycobacterium tuberculosis F11]
18 more sequence titles
Length=158
Score = 315 bits (806), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 157/158 (99%), Positives = 158/158 (100%), Gaps = 0/158 (0%)
Query 1 LHRAGAAVTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLGSSGLPALNCTPIRHT 60
+HRAGAAVTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLGSSGLPALNCTPIRHT
Sbjct 1 MHRAGAAVTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLGSSGLPALNCTPIRHT 60
Query 61 IDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLF 120
IDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLF
Sbjct 61 IDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLF 120
Query 121 VLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG 158
VLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG
Sbjct 121 VLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG 158
>gi|31792106|ref|NP_854599.1| hypothetical protein Mb0942 [Mycobacterium bovis AF2122/97]
gi|121636841|ref|YP_977064.1| hypothetical protein BCG_0970 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224989312|ref|YP_002643999.1| hypothetical protein JTY_0940 [Mycobacterium bovis BCG str. Tokyo
172]
26 more sequence titles
Length=158
Score = 313 bits (801), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 156/158 (99%), Positives = 157/158 (99%), Gaps = 0/158 (0%)
Query 1 LHRAGAAVTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLGSSGLPALNCTPIRHT 60
+HRAGAAVTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLG SGLPALNCTPIRHT
Sbjct 1 MHRAGAAVTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLGGSGLPALNCTPIRHT 60
Query 61 IDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLF 120
IDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLF
Sbjct 61 IDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLF 120
Query 121 VLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG 158
VLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG
Sbjct 121 VLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG 158
>gi|339630981|ref|YP_004722623.1| hypothetical protein MAF_09270 [Mycobacterium africanum GM041182]
gi|339330337|emb|CCC25999.1| conserved hypothetical protein [Mycobacterium africanum GM041182]
Length=151
Score = 299 bits (766), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 149/151 (99%), Positives = 150/151 (99%), Gaps = 0/151 (0%)
Query 8 VTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLGSSGLPALNCTPIRHTIDVMATK 67
+TANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLG SGLPALNCTPIRHTIDVMATK
Sbjct 1 MTANVWCRAGGIRMAPRPVIPVATQQRLRRQADRQSLGGSGLPALNCTPIRHTIDVMATK 60
Query 68 PERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAW 127
PERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAW
Sbjct 61 PERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAW 120
Query 128 TEFLAALDRPVSHKPRLEKLFAARSIFDTEG 158
TEFLAALDRPVSHKPRLEKLFAARSIFDTEG
Sbjct 121 TEFLAALDRPVSHKPRLEKLFAARSIFDTEG 151
>gi|308231671|ref|ZP_07413385.2| hypothetical protein TMAG_00854 [Mycobacterium tuberculosis SUMu001]
gi|308370126|ref|ZP_07420376.2| hypothetical protein TMBG_01699 [Mycobacterium tuberculosis SUMu002]
gi|308370536|ref|ZP_07421917.2| hypothetical protein TMCG_03176 [Mycobacterium tuberculosis SUMu003]
22 more sequence titles
Length=95
Score = 186 bits (472), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 95/95 (100%), Positives = 95/95 (100%), Gaps = 0/95 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT
Sbjct 1 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 60
Query 124 DAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG 158
DAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG
Sbjct 61 DAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG 95
>gi|145221381|ref|YP_001132059.1| hypothetical protein Mflv_0786 [Mycobacterium gilvum PYR-GCK]
gi|145213867|gb|ABP43271.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=101
Score = 137 bits (345), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 71/102 (70%), Positives = 81/102 (80%), Gaps = 4/102 (3%)
Query 54 CTPIRHTIDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDV 113
C P HT++ +A K +ER A RLT EQDALIRRAAE EGTDLTNFT+TA LAHAR+V
Sbjct 3 CVPDGHTMEAVAAK----SERFAVRLTAEQDALIRRAAEVEGTDLTNFTLTATLAHAREV 58
Query 114 LADRRLFVLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFD 155
LADRRLF+L + AW+EFL+ LDRPVS KPRL KLFA SIFD
Sbjct 59 LADRRLFMLGEPAWSEFLSVLDRPVSPKPRLAKLFAEPSIFD 100
>gi|333988665|ref|YP_004521279.1| hypothetical protein JDM601_0025 [Mycobacterium sp. JDM601]
gi|333484633|gb|AEF34025.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=87
Score = 135 bits (339), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 67/85 (79%), Positives = 74/85 (88%), Gaps = 0/85 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
KTER A RLT EQDALIRRAAE EGTDLTNFTVTA LAHAR+VLADRRLF+L +AW+EF
Sbjct 2 KTERFAVRLTAEQDALIRRAAEVEGTDLTNFTVTATLAHAREVLADRRLFMLDASAWSEF 61
Query 131 LAALDRPVSHKPRLEKLFAARSIFD 155
++ LDRPVS +PRLEKLFA SIFD
Sbjct 62 VSVLDRPVSQQPRLEKLFAEPSIFD 86
>gi|315441736|ref|YP_004074615.1| hypothetical protein Mspyr1_00440 [Mycobacterium sp. Spyr1]
gi|315260039|gb|ADT96780.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1]
Length=92
Score = 131 bits (329), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 66/85 (78%), Positives = 73/85 (86%), Gaps = 0/85 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
K+ER A RLT EQDALIRRAAE EGTDLTNFT+TA LAHAR+VLADRRLF+L + AW+EF
Sbjct 7 KSERFAVRLTAEQDALIRRAAEVEGTDLTNFTLTATLAHAREVLADRRLFMLGEPAWSEF 66
Query 131 LAALDRPVSHKPRLEKLFAARSIFD 155
L+ LDRPVS KPRL KLFA SIFD
Sbjct 67 LSVLDRPVSPKPRLAKLFAEPSIFD 91
>gi|336178777|ref|YP_004584152.1| hypothetical protein FsymDg_2905 [Frankia symbiont of Datisca
glomerata]
gi|334859757|gb|AEH10231.1| protein of unknown function DUF1778 [Frankia symbiont of Datisca
glomerata]
Length=87
Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 55/85 (65%), Positives = 66/85 (78%), Gaps = 0/85 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
K+ R+A RL EQDALIR AAE EG+ +T FTV AA+A A+DVLAD+RLF L AWTEF
Sbjct 3 KSSRIAVRLPEEQDALIRAAAEVEGSTVTEFTVNAAVARAKDVLADQRLFTLGSEAWTEF 62
Query 131 LAALDRPVSHKPRLEKLFAARSIFD 155
L LDRPV+ KPRLE+LF +SIF+
Sbjct 63 LTILDRPVTRKPRLERLFTEKSIFE 87
>gi|302343978|ref|YP_003808507.1| hypothetical protein Deba_2558 [Desulfarculus baarsii DSM 2075]
gi|301640591|gb|ADK85913.1| Protein of unknown function DUF1778 [Desulfarculus baarsii DSM
2075]
Length=92
Score = 80.5 bits (197), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 40/92 (44%), Positives = 55/92 (60%), Gaps = 0/92 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
M+ + KTER+ RL+P AL++ AA+A +++ F + A + A LADRRLF+L
Sbjct 1 MSPTTQTKTERIDVRLSPSSKALLQEAAKASHKNVSEFILEAGIVAANQALADRRLFLLD 60
Query 124 DAAWTEFLAALDRPVSHKPRLEKLFAARSIFD 155
+A W EF LDRPV KPRL KL + D
Sbjct 61 EARWKEFQEVLDRPVQKKPRLSKLLNEPGVLD 92
>gi|302344014|ref|YP_003808543.1| hypothetical protein Deba_2595 [Desulfarculus baarsii DSM 2075]
gi|301640627|gb|ADK85949.1| Protein of unknown function DUF1778 [Desulfarculus baarsii DSM
2075]
Length=92
Score = 76.3 bits (186), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/92 (42%), Positives = 55/92 (60%), Gaps = 0/92 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
M+ + KTER+ RL+P AL++ AA+A +++ F + A + A LA+RR+F+L
Sbjct 1 MSPTTQTKTERIDVRLSPSSKALLQEAAKATHKNVSEFILEAGIVAANQALAERRIFLLD 60
Query 124 DAAWTEFLAALDRPVSHKPRLEKLFAARSIFD 155
+A W EF LDRPV KPRL KL + D
Sbjct 61 EARWKEFQEVLDRPVQKKPRLGKLLNDPGVLD 92
>gi|240169137|ref|ZP_04747796.1| hypothetical protein MkanA1_07474 [Mycobacterium kansasii ATCC
12478]
Length=129
Score = 73.9 bits (180), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 42/97 (44%), Positives = 56/97 (58%), Gaps = 0/97 (0%)
Query 59 HTIDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRR 118
+T + E K+ R R+T QDA++RR + G L ++ V A+ A LADRR
Sbjct 32 YTNSAYTSGVETKSGRWHLRVTAAQDAVVRRVLDVTGESLNDYVVRHAVQAAEADLADRR 91
Query 119 LFVLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFD 155
+FVL DAAWT+ A LDRP S KP L +L A SI +
Sbjct 92 VFVLDDAAWTDLQALLDRPPSPKPELARLLANPSILE 128
>gi|154508312|ref|ZP_02043954.1| hypothetical protein ACTODO_00809 [Actinomyces odontolyticus
ATCC 17982]
gi|153797946|gb|EDN80366.1| hypothetical protein ACTODO_00809 [Actinomyces odontolyticus
ATCC 17982]
Length=102
Score = 71.6 bits (174), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/86 (46%), Positives = 52/86 (61%), Gaps = 0/86 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
KTERL RLT EQDALIRRAAE + +++F ++ A A+ LAD+R+F + D + F
Sbjct 5 KTERLDLRLTKEQDALIRRAAEQDARSISDFVLSTATLEAQRRLADQRIFFMNDEQYARF 64
Query 131 LAALDRPVSHKPRLEKLFAARSIFDT 156
L P + P+L KLF S F T
Sbjct 65 EEILQTPPTDDPKLRKLFERPSPFGT 90
>gi|116620423|ref|YP_822579.1| hypothetical protein Acid_1300 [Candidatus Solibacter usitatus
Ellin6076]
gi|116223585|gb|ABJ82294.1| conserved hypothetical protein [Candidatus Solibacter usitatus
Ellin6076]
Length=97
Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/88 (47%), Positives = 53/88 (61%), Gaps = 2/88 (2%)
Query 64 MATK-PER-KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFV 121
MAT PE+ K R R T ++ LIR AAE G ++T F + AA A + L D+ FV
Sbjct 1 MATSLPEQPKDRRFQLRATAREETLIRVAAERRGINVTEFILGAAREKAEESLTDQTKFV 60
Query 122 LTDAAWTEFLAALDRPVSHKPRLEKLFA 149
L + W +F+ ALDRP KPRL+KLFA
Sbjct 61 LDEKQWKQFMEALDRPAREKPRLKKLFA 88
>gi|194336777|ref|YP_002018571.1| hypothetical protein Ppha_1718 [Pelodictyon phaeoclathratiforme
BU-1]
gi|194309254|gb|ACF43954.1| Protein of unknown function DUF1778 [Pelodictyon phaeoclathratiforme
BU-1]
Length=92
Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/85 (45%), Positives = 52/85 (62%), Gaps = 0/85 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
KTE+L RL+ + ++R AA A L+ F + +AL+ A + LADRR F L WT F
Sbjct 7 KTEKLDLRLSQQAKQVLRSAAIASNRTLSEFVLESALSRAEETLADRRSFSLDVTQWTAF 66
Query 131 LAALDRPVSHKPRLEKLFAARSIFD 155
+AALD PRL+KLF +SIF+
Sbjct 67 IAALDASPRELPRLKKLFQEQSIFE 91
>gi|258404534|ref|YP_003197276.1| hypothetical protein Dret_0400 [Desulfohalobium retbaense DSM
5692]
gi|257796761|gb|ACV67698.1| Protein of unknown function DUF1778 [Desulfohalobium retbaense
DSM 5692]
Length=98
Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 35/90 (39%), Positives = 52/90 (58%), Gaps = 0/90 (0%)
Query 66 TKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDA 125
TK E+ +ER + R++ + LIR AA+ + + + +A A LADRR F L++
Sbjct 5 TKREKCSERFSFRISARNNELIRSAAQLKNMSASQYAAESAANQAEMDLADRRSFSLSEG 64
Query 126 AWTEFLAALDRPVSHKPRLEKLFAARSIFD 155
EFL ALDRP+ K RL +LF ++I D
Sbjct 65 QMQEFLNALDRPIQEKTRLRQLFREKTILD 94
>gi|333990697|ref|YP_004523311.1| hypothetical protein JDM601_2057 [Mycobacterium sp. JDM601]
gi|333486665|gb|AEF36057.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=56
Score = 68.2 bits (165), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 33/56 (59%), Positives = 40/56 (72%), Gaps = 0/56 (0%)
Query 103 VTAALAHARDVLADRRLFVLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTEG 158
+ A +AHARDVLADRR F+L DA W EF A LDRPV ++P L K+F S+FD E
Sbjct 1 MCATVAHARDVLADRRHFLLDDANWAEFSALLDRPVQYRPALAKMFDKPSLFDPEA 56
>gi|86739879|ref|YP_480279.1| CopG-like DNA-binding [Frankia sp. CcI3]
gi|86566741|gb|ABD10550.1| CopG-like DNA-binding [Frankia sp. CcI3]
Length=92
Score = 67.4 bits (163), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 36/78 (47%), Positives = 48/78 (62%), Gaps = 0/78 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
+ +RLA RL ++ +IR AAE EG L +F A +AR+VLADRRLF + DA W++
Sbjct 5 RDDRLAVRLPEDEAEIIRSAAEGEGLSLEDFAAEAMKRYAREVLADRRLFRVPDADWSKL 64
Query 131 LAALDRPVSHKPRLEKLF 148
A LD P PRL +L
Sbjct 65 EALLDAPPPDSPRLRRLL 82
>gi|296134812|ref|YP_003642054.1| Protein of unknown function DUF1778 [Thiomonas intermedia K12]
gi|295794934|gb|ADG29724.1| Protein of unknown function DUF1778 [Thiomonas intermedia K12]
Length=92
Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/85 (45%), Positives = 49/85 (58%), Gaps = 0/85 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
M+T KTER+ R + L++ AA A +++ F + A + A LADRR FVL
Sbjct 1 MSTLNLSKTERIDVRASAPVKKLLQEAARACHKNVSEFLLDAGVTAAAQTLADRRQFVLG 60
Query 124 DAAWTEFLAALDRPVSHKPRLEKLF 148
DA W F ALDRPV KPRL+KL
Sbjct 61 DAQWQAFQDALDRPVQSKPRLKKLL 85
>gi|315605436|ref|ZP_07880476.1| conserved hypothetical protein [Actinomyces sp. oral taxon 180
str. F0310]
gi|315312843|gb|EFU60920.1| conserved hypothetical protein [Actinomyces sp. oral taxon 180
str. F0310]
Length=102
Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/86 (44%), Positives = 50/86 (59%), Gaps = 0/86 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
KT RL RLT +QD LIRRAAE EG ++ FT++ A+ LAD+R+F+L++ F
Sbjct 5 KTRRLDLRLTDKQDTLIRRAAEQEGRSVSEFTLSTVTMEAQRRLADQRVFILSEEDHARF 64
Query 131 LAALDRPVSHKPRLEKLFAARSIFDT 156
L P+ P+L KL S F T
Sbjct 65 EEILQTPLKDDPKLRKLLNRPSPFGT 90
>gi|288916910|ref|ZP_06411282.1| Protein of unknown function DUF1778 [Frankia sp. EUN1f]
gi|288351619|gb|EFC85824.1| Protein of unknown function DUF1778 [Frankia sp. EUN1f]
Length=100
Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/69 (53%), Positives = 44/69 (64%), Gaps = 1/69 (1%)
Query 81 PEQDA-LIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEFLAALDRPVS 139
PE DA +IR AAEAEG L F V + HARDVLADRRLF + ++AW E A L P +
Sbjct 14 PEDDAEIIRSAAEAEGLSLAMFAVESMRRHARDVLADRRLFRVAESAWAELEAMLQAPPA 73
Query 140 HKPRLEKLF 148
PR+ +L
Sbjct 74 DNPRMRRLL 82
>gi|94265184|ref|ZP_01288946.1| conserved hypothetical protein [delta proteobacterium MLMS-1]
gi|94267974|ref|ZP_01291049.1| conserved hypothetical protein [delta proteobacterium MLMS-1]
gi|93451779|gb|EAT02537.1| conserved hypothetical protein [delta proteobacterium MLMS-1]
gi|93454325|gb|EAT04632.1| conserved hypothetical protein [delta proteobacterium MLMS-1]
Length=92
Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 37/92 (41%), Positives = 50/92 (55%), Gaps = 0/92 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
M T KTER+ R L++ AA A +++ F + A + A LA+RR FVL
Sbjct 1 MKTLNISKTERIDVRANRSVKLLLQEAARACHKNVSEFLLDAGVTAANQALAERRHFVLN 60
Query 124 DAAWTEFLAALDRPVSHKPRLEKLFAARSIFD 155
+ W EF ALDRPV KPRL+KL + + D
Sbjct 61 EDQWREFQQALDRPVQAKPRLKKLLSEPGVLD 92
>gi|83648276|ref|YP_436711.1| hypothetical protein HCH_05628 [Hahella chejuensis KCTC 2396]
gi|83636319|gb|ABC32286.1| uncharacterized protein conserved in bacteria [Hahella chejuensis
KCTC 2396]
Length=92
Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/85 (43%), Positives = 48/85 (57%), Gaps = 0/85 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
M+T KTER+ R + L++ AA +++ F + A + A LADRR FVL
Sbjct 1 MSTPNVSKTERIDVRASTPVKQLLQEAARVCHKNVSEFLLDAGVTAATQTLADRRQFVLG 60
Query 124 DAAWTEFLAALDRPVSHKPRLEKLF 148
+A W F ALDRPV KPRL+KL
Sbjct 61 EAQWQAFQEALDRPVQSKPRLKKLL 85
>gi|304311929|ref|YP_003811527.1| hypothetical protein HDN1F_23000 [gamma proteobacterium HdN1]
gi|301797662|emb|CBL45883.1| Conserved hypothetical protein [gamma proteobacterium HdN1]
Length=92
Score = 64.3 bits (155), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 37/85 (44%), Positives = 48/85 (57%), Gaps = 0/85 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
M+T KTER+ R + L++ AA A +++ F + A + A LADRR FVL
Sbjct 1 MSTLNLSKTERIDVRASTPVKQLLQDAARACHKNVSEFLLDAGVTAAAQTLADRRQFVLD 60
Query 124 DAAWTEFLAALDRPVSHKPRLEKLF 148
D W F ALDRPV KPRL+KL
Sbjct 61 DTQWQAFQEALDRPVQSKPRLKKLL 85
>gi|220935245|ref|YP_002514144.1| hypothetical protein Tgr7_2077 [Thioalkalivibrio sulfidophilus
HL-EbGr7]
gi|219996555|gb|ACL73157.1| Protein of unknown function DUF1778 [Thioalkalivibrio sulfidophilus
HL-EbGr7]
Length=92
Score = 63.9 bits (154), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 37/85 (44%), Positives = 49/85 (58%), Gaps = 0/85 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
M+T KTER+ R + L++ AA A +++ F + A + A LADRR FVL
Sbjct 1 MSTLNLPKTERIDVRASTPVKQLLQEAARACHKNVSEFLLDAGVTAAAQTLADRRQFVLD 60
Query 124 DAAWTEFLAALDRPVSHKPRLEKLF 148
+A W F ALDRPV KPRL+KL
Sbjct 61 EAQWQAFQEALDRPVQSKPRLKKLL 85
>gi|158314003|ref|YP_001506511.1| hypothetical protein Franean1_2169 [Frankia sp. EAN1pec]
gi|158109408|gb|ABW11605.1| Protein of unknown function DUF1778 [Frankia sp. EAN1pec]
Length=135
Score = 63.5 bits (153), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 43/101 (43%), Positives = 56/101 (56%), Gaps = 7/101 (6%)
Query 50 PALNC-TPIRHTIDVMATKPERKTERLAARLTPEQDA-LIRRAAEAEGTDLTNFTVTAAL 107
P + C P R T + + + ER T L P DA +IR AA+AEG L F V +
Sbjct 33 PGVACRGPGRQTASMTSAEDERITFGL-----PADDAEIIRSAADAEGLSLELFAVESMR 87
Query 108 AHARDVLADRRLFVLTDAAWTEFLAALDRPVSHKPRLEKLF 148
+AR+VLADRRLF +++ AW E A L P + PRL KL
Sbjct 88 RYAREVLADRRLFRVSETAWAELDAMLQAPPTDNPRLRKLL 128
>gi|90425040|ref|YP_533410.1| hypothetical protein RPC_3552 [Rhodopseudomonas palustris BisB18]
gi|90107054|gb|ABD89091.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18]
Length=95
Score = 62.8 bits (151), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/85 (42%), Positives = 51/85 (60%), Gaps = 0/85 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
++E+L RLTPE L+ AA+A ++ F + +AL A++ L DR+ F L A W F
Sbjct 6 RSEKLDLRLTPEAKRLLNAAAQASRRSVSEFVLESALDRAQEALPDRQHFGLDAAGWEAF 65
Query 131 LAALDRPVSHKPRLEKLFAARSIFD 155
LAALD P PRL++L S+F+
Sbjct 66 LAALDAPPRPLPRLQRLLTEPSVFE 90
>gi|294338764|emb|CAZ87098.1| conserved hypothetical protein; putative Methionine repressor-like
[Thiomonas sp. 3As]
Length=92
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/85 (44%), Positives = 48/85 (57%), Gaps = 0/85 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
M+T KTER+ R + L++ AA A +++ F + A + A LADRR FVL
Sbjct 1 MSTLNLSKTERIDVRASAPVKKLLQEAARACHKNVSEFLLDAGVTAAAQTLADRRQFVLD 60
Query 124 DAAWTEFLAALDRPVSHKPRLEKLF 148
DA W F ALDRPV K RL+KL
Sbjct 61 DAQWQAFQDALDRPVQSKSRLKKLL 85
>gi|340627803|ref|YP_004746255.1| hypothetical protein MCAN_28301 [Mycobacterium canettii CIPT
140010059]
gi|340005993|emb|CCC45161.1| putative uncharacterized protein [Mycobacterium canettii CIPT
140010059]
Length=91
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 49/80 (62%), Gaps = 0/80 (0%)
Query 68 PERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAW 127
PE T RL RL+PEQ+ +RRAA G L+ F ++AA+ HA D+LA L++AA+
Sbjct 2 PETLTGRLNFRLSPEQEQALRRAAALTGQSLSGFVLSAAVDHAHDLLARANRIELSEAAF 61
Query 128 TEFLAALDRPVSHKPRLEKL 147
F+AALD P P L +L
Sbjct 62 RRFVAALDEPDEAAPELVRL 81
>gi|306807392|ref|ZP_07444060.1| hypothetical protein TMGG_02066 [Mycobacterium tuberculosis SUMu007]
gi|308346195|gb|EFP35046.1| hypothetical protein TMGG_02066 [Mycobacterium tuberculosis SUMu007]
Length=274
Score = 61.2 bits (147), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/80 (44%), Positives = 48/80 (60%), Gaps = 0/80 (0%)
Query 68 PERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAW 127
PE T RL RL+PEQ+ +R AA G L+ F ++AA+ HA D+LA L++AA+
Sbjct 66 PETLTGRLNFRLSPEQEQALRHAAALTGQSLSGFVLSAAVDHAHDLLARANRIELSEAAF 125
Query 128 TEFLAALDRPVSHKPRLEKL 147
F+AALD P P L +L
Sbjct 126 RRFVAALDEPDEAAPELVRL 145
>gi|307085503|ref|ZP_07494616.1| hypothetical protein TMLG_02519 [Mycobacterium tuberculosis SUMu012]
gi|308364960|gb|EFP53811.1| hypothetical protein TMLG_02519 [Mycobacterium tuberculosis SUMu012]
Length=180
Score = 61.2 bits (147), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/80 (44%), Positives = 48/80 (60%), Gaps = 0/80 (0%)
Query 68 PERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAW 127
PE T RL RL+PEQ+ +R AA G L+ F ++AA+ HA D+LA L++AA+
Sbjct 66 PETLTGRLNFRLSPEQEQALRHAAALTGQSLSGFVLSAAVDHAHDLLARANRIELSEAAF 125
Query 128 TEFLAALDRPVSHKPRLEKL 147
F+AALD P P L +L
Sbjct 126 RRFVAALDEPDEAAPELVRL 145
>gi|119898317|ref|YP_933530.1| hypothetical protein azo2026 [Azoarcus sp. BH72]
gi|119670730|emb|CAL94643.1| conserved hypothetical protein [Azoarcus sp. BH72]
Length=97
Score = 61.2 bits (147), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/92 (35%), Positives = 55/92 (60%), Gaps = 0/92 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
+ + P ++ RL R TPEQ+A++RRAAEA LT+F + +A A L D+RLF+++
Sbjct 4 ITSSPAVRSARLGLRATPEQEAVLRRAAEAAHKSLTDFILDSACLAAEQTLLDQRLFMVS 63
Query 124 DAAWTEFLAALDRPVSHKPRLEKLFAARSIFD 155
+ + + L+RP L +LF+ ++ +D
Sbjct 64 GSQYQALMDLLERPEQSNEGLRELFSRKAPWD 95
>gi|15842341|ref|NP_337378.1| hypothetical protein MT2871 [Mycobacterium tuberculosis CDC1551]
gi|31793979|ref|NP_856472.1| hypothetical protein Mb2826 [Mycobacterium bovis AF2122/97]
gi|57117026|ref|YP_177678.1| hypothetical protein Rv2803 [Mycobacterium tuberculosis H37Rv]
58 more sequence titles
Length=155
Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 35/80 (44%), Positives = 48/80 (60%), Gaps = 0/80 (0%)
Query 68 PERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAW 127
PE T RL RL+PEQ+ +R AA G L+ F ++AA+ HA D+LA L++AA+
Sbjct 66 PETLTGRLNFRLSPEQEQALRHAAALTGQSLSGFVLSAAVDHAHDLLARANRIELSEAAF 125
Query 128 TEFLAALDRPVSHKPRLEKL 147
F+AALD P P L +L
Sbjct 126 RRFVAALDEPDEAAPELVRL 145
>gi|110597351|ref|ZP_01385639.1| conserved hypothetical protein [Chlorobium ferrooxidans DSM 13031]
gi|110341187|gb|EAT59655.1| conserved hypothetical protein [Chlorobium ferrooxidans DSM 13031]
Length=115
Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 38/98 (39%), Positives = 54/98 (56%), Gaps = 1/98 (1%)
Query 60 TIDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRL 119
T MAT+ KT +L RL+ + RAA A + L+ F + +AL+ A ++L DRR
Sbjct 19 TTFFMATQT-TKTAKLDIRLSMATKDTLSRAATAAHSSLSQFVLESALSRAEEMLVDRRS 77
Query 120 FVLTDAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTE 157
F L W F+ ALD P PR+ KLF +S+F+ E
Sbjct 78 FNLDAEQWALFMEALDAPDRTLPRVNKLFKQQSVFENE 115
>gi|308232248|ref|ZP_07664041.1| hypothetical protein TMAG_03186 [Mycobacterium tuberculosis SUMu001]
gi|308369864|ref|ZP_07419323.2| hypothetical protein TMBG_02938 [Mycobacterium tuberculosis SUMu002]
gi|308371134|ref|ZP_07667098.1| hypothetical protein TMCG_02049 [Mycobacterium tuberculosis SUMu003]
11 more sequence titles
Length=91
Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 35/80 (44%), Positives = 48/80 (60%), Gaps = 0/80 (0%)
Query 68 PERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAW 127
PE T RL RL+PEQ+ +R AA G L+ F ++AA+ HA D+LA L++AA+
Sbjct 2 PETLTGRLNFRLSPEQEQALRHAAALTGQSLSGFVLSAAVDHAHDLLARANRIELSEAAF 61
Query 128 TEFLAALDRPVSHKPRLEKL 147
F+AALD P P L +L
Sbjct 62 RRFVAALDEPDEAAPELVRL 81
>gi|289444352|ref|ZP_06434096.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289417271|gb|EFD14511.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
Length=157
Score = 60.8 bits (146), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 35/80 (44%), Positives = 48/80 (60%), Gaps = 0/80 (0%)
Query 68 PERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAW 127
PE T RL RL+PEQ+ +R AA G L+ F ++AA+ HA D+LA L++AA+
Sbjct 68 PETLTGRLNFRLSPEQEQALRHAAALTGQSLSGFVLSAAVDHAHDLLARANRIELSEAAF 127
Query 128 TEFLAALDRPVSHKPRLEKL 147
F+AALD P P L +L
Sbjct 128 RRFVAALDEPDEAAPELVRL 147
>gi|344201044|ref|YP_004785370.1| hypothetical protein Acife_2970 [Acidithiobacillus ferrivorans
SS3]
gi|343776488|gb|AEM49044.1| protein of unknown function DUF1778 [Acidithiobacillus ferrivorans
SS3]
Length=92
Score = 59.7 bits (143), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/78 (43%), Positives = 43/78 (56%), Gaps = 0/78 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
KTER+ R + L++ AA A +++ F + A + A LADRR FVL W F
Sbjct 8 KTERIDVRASSSVKQLLQDAAHACHKNVSEFLLDAGVTAANQALADRRHFVLNKEQWQAF 67
Query 131 LAALDRPVSHKPRLEKLF 148
ALDRPV KP L+KL
Sbjct 68 QDALDRPVQAKPHLKKLL 85
>gi|87300639|ref|ZP_01083481.1| putative cytoplasmic protein [Synechococcus sp. WH 5701]
gi|87284510|gb|EAQ76462.1| putative cytoplasmic protein [Synechococcus sp. WH 5701]
Length=97
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/94 (37%), Positives = 53/94 (57%), Gaps = 0/94 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
+ P ++ RL R TPEQ+A++RRAAEA LT+F + +A A L D+RLF+++
Sbjct 4 IGASPPARSARLGLRATPEQEAVLRRAAEATHKSLTDFILDSACLAAEQTLLDQRLFMVS 63
Query 124 DAAWTEFLAALDRPVSHKPRLEKLFAARSIFDTE 157
+ + LDRP L LFA ++ +D +
Sbjct 64 ASQAQALMDLLDRPEQANEGLRDLFARQAPWDAQ 97
>gi|333989585|ref|YP_004522199.1| hypothetical protein JDM601_0945 [Mycobacterium sp. JDM601]
gi|333485553|gb|AEF34945.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=100
Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/87 (38%), Positives = 47/87 (55%), Gaps = 0/87 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
+ +RL R + +Q LIR+AAEA +T+F + + L A +LADR+ FV D W F
Sbjct 13 RDQRLNFRASAQQQLLIRQAAEAADRTVTDFILGSVLESAERILADRKWFVANDEQWERF 72
Query 131 LAALDRPVSHKPRLEKLFAARSIFDTE 157
LD P+ P+L +L S F E
Sbjct 73 QELLDAPLQPMPKLRQLLQRESPFAEE 99
>gi|111221314|ref|YP_712108.1| hypothetical protein FRAAL1874 [Frankia alni ACN14a]
gi|111148846|emb|CAJ60524.1| hypothetical protein FRAAL1874 [Frankia alni ACN14a]
Length=79
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 31/63 (50%), Positives = 39/63 (62%), Gaps = 0/63 (0%)
Query 86 LIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEFLAALDRPVSHKPRLE 145
+IR AAE EG L +F V A +A++VLADRRLF + D W F A LD P + PRL
Sbjct 10 VIRSAAEGEGLSLEDFAVEAMKRYAQEVLADRRLFRVADTDWGAFEALLDGPPADTPRLR 69
Query 146 KLF 148
+L
Sbjct 70 RLL 72
>gi|334112599|ref|ZP_08486858.1| protein of unknown function DUF1778 [Methylomicrobium album BG8]
gi|333597353|gb|EGL02190.1| protein of unknown function DUF1778 [Methylomicrobium album BG8]
Length=96
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/92 (36%), Positives = 49/92 (54%), Gaps = 0/92 (0%)
Query 67 KPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAA 126
K ++++R+ R+ P+ ++ AA + ++ F + AL+ A +VLADRR FVL
Sbjct 2 KTNKRSDRIDLRIDPKAKEALQAAALLKHKTVSEFILENALSAANEVLADRRYFVLNAEQ 61
Query 127 WTEFLAALDRPVSHKPRLEKLFAARSIFDTEG 158
W F AALD P PRLE+L FD
Sbjct 62 WEAFQAALDAPPRALPRLERLMKEPGFFDVHA 93
>gi|320095182|ref|ZP_08026888.1| hypothetical protein HMPREF9005_1500 [Actinomyces sp. oral taxon
178 str. F0338]
gi|319977887|gb|EFW09524.1| hypothetical protein HMPREF9005_1500 [Actinomyces sp. oral taxon
178 str. F0338]
Length=97
Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/87 (37%), Positives = 51/87 (59%), Gaps = 0/87 (0%)
Query 68 PERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAW 127
P KT R+ R+T Q+ LIR+AA +++F +T+A A+ LAD+R+F +++ +
Sbjct 2 PPAKTHRMDLRVTERQNLLIRQAAALTDRSVSDFVLTSATLEAQRALADQRVFPVSEDQF 61
Query 128 TEFLAALDRPVSHKPRLEKLFAARSIF 154
F +DRPV+ P+L KL S F
Sbjct 62 AHFQEIVDRPVTDMPKLRKLLNRPSPF 88
>gi|344337719|ref|ZP_08768653.1| protein of unknown function DUF1778 [Thiocapsa marina 5811]
gi|343802672|gb|EGV20612.1| protein of unknown function DUF1778 [Thiocapsa marina 5811]
Length=96
Score = 57.4 bits (137), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/89 (39%), Positives = 49/89 (56%), Gaps = 0/89 (0%)
Query 67 KPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAA 126
P+ ++ +L RLTPE A + AA + F +++AL A + LADRR F L
Sbjct 3 NPQARSTKLDLRLTPEAKARLSAAAWERHQSVGQFVLSSALERADETLADRRHFGLNAER 62
Query 127 WTEFLAALDRPVSHKPRLEKLFAARSIFD 155
W+ F+AAL+ P PRLE+L + FD
Sbjct 63 WSAFMAALEMPPRALPRLERLLREPTPFD 91
>gi|316934787|ref|YP_004109769.1| hypothetical protein Rpdx1_3466 [Rhodopseudomonas palustris DX-1]
gi|315602501|gb|ADU45036.1| Protein of unknown function DUF1778 [Rhodopseudomonas palustris
DX-1]
Length=96
Score = 57.4 bits (137), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/84 (41%), Positives = 46/84 (55%), Gaps = 0/84 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
+TE+L RLT E + AA+A+ L+ F + +AL A + LADRR+F L W F
Sbjct 7 RTEKLDLRLTAEAKHTLTAAAQAQRRSLSEFVLESALRRAEETLADRRVFQLPPEQWEAF 66
Query 131 LAALDRPVSHKPRLEKLFAARSIF 154
+AALD P PRL L +F
Sbjct 67 VAALDAPPRDLPRLRTLLEEPGVF 90
>gi|227113264|ref|ZP_03826920.1| hypothetical protein PcarbP_09885 [Pectobacterium carotovorum
subsp. brasiliensis PBR1692]
Length=119
Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/102 (36%), Positives = 53/102 (52%), Gaps = 4/102 (3%)
Query 53 NCTPIRHTIDVMATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARD 112
+C I +T M KPE K + R Q LI AA T+F + AA A+D
Sbjct 17 SCIAIHYT--GMMMKPETKEAPINIRAKASQRDLIDMAANLVSKSRTDFMLEAACREAQD 74
Query 113 VLADRRLFVLTDAAWTEFLAALDRPVSH--KPRLEKLFAARS 152
+L D+RLF+L + + EFLA LD P++ + R+ L + +S
Sbjct 75 ILLDQRLFILDNEQFDEFLAELDAPITEERQARINDLMSRKS 116
>gi|237808068|ref|YP_002892508.1| hypothetical protein Tola_1305 [Tolumonas auensis DSM 9187]
gi|237500329|gb|ACQ92922.1| Protein of unknown function DUF1778 [Tolumonas auensis DSM 9187]
Length=98
Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/75 (39%), Positives = 43/75 (58%), Gaps = 0/75 (0%)
Query 75 LAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEFLAAL 134
L R TPEQ+A++RRAA+ LT+F + +A A L D+RLFV++ + L L
Sbjct 17 LGLRATPEQEAVLRRAADISHKSLTDFILDSACHAAEQTLLDQRLFVVSGEQYQTLLDML 76
Query 135 DRPVSHKPRLEKLFA 149
D+P L+ LF+
Sbjct 77 DQPAQENQGLQDLFS 91
>gi|344199686|ref|YP_004784012.1| hypothetical protein Acife_1545 [Acidithiobacillus ferrivorans
SS3]
gi|343775130|gb|AEM47686.1| protein of unknown function DUF1778 [Acidithiobacillus ferrivorans
SS3]
Length=96
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/85 (35%), Positives = 50/85 (59%), Gaps = 0/85 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
++ RL TPEQ+ ++RRAA+ LT+F + +A A L D+RLF++T + +F
Sbjct 11 RSTRLGLCATPEQEVILRRAADVAHKSLTDFVLDSACLSAEQTLLDQRLFMVTGRQYQDF 70
Query 131 LAALDRPVSHKPRLEKLFAARSIFD 155
L L++P L+ LF+ ++ +D
Sbjct 71 LDLLEQPEQVNEGLQDLFSRKAPWD 95
>gi|82778765|ref|YP_405114.1| hypothetical protein SDY_3657 [Shigella dysenteriae Sd197]
gi|81242913|gb|ABB63623.1| conserved hypothetical protein [Shigella dysenteriae Sd197]
Length=94
Score = 54.7 bits (130), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/76 (41%), Positives = 41/76 (54%), Gaps = 0/76 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
M KPE K + R Q LI AA T+F + AA A+D+L D+RLF+L
Sbjct 1 MNMKPESKEAPINIRAKASQRDLIDMAANLVAKSRTDFMLDAACREAQDILLDQRLFILD 60
Query 124 DAAWTEFLAALDRPVS 139
D + FLAALD P++
Sbjct 61 DEQYDAFLAALDAPIT 76
>gi|15803925|ref|NP_289961.1| hypothetical protein Z4776 [Escherichia coli O157:H7 EDL933]
gi|15833515|ref|NP_312288.1| hypothetical protein ECs4261 [Escherichia coli O157:H7 str. Sakai]
gi|188024510|ref|ZP_02997135.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
EC4113]
12 more sequence titles
Length=94
Score = 54.7 bits (130), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/76 (41%), Positives = 41/76 (54%), Gaps = 0/76 (0%)
Query 64 MATKPERKTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLT 123
M KPE K + R Q LI AA T+F + AA A+D+L D+RLF+L
Sbjct 1 MNMKPESKEAPINIRAKASQRDLIDMAANLVAKSRTDFMLDAACREAQDILLDQRLFILD 60
Query 124 DAAWTEFLAALDRPVS 139
D + FLAALD P++
Sbjct 61 DEQYDAFLAALDAPIT 76
>gi|254498060|ref|ZP_05110820.1| conserved hypothetical protein [Legionella drancourtii LLAP12]
gi|254352725|gb|EET11500.1| conserved hypothetical protein [Legionella drancourtii LLAP12]
Length=123
Score = 53.9 bits (128), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/85 (35%), Positives = 46/85 (55%), Gaps = 0/85 (0%)
Query 71 KTERLAARLTPEQDALIRRAAEAEGTDLTNFTVTAALAHARDVLADRRLFVLTDAAWTEF 130
KT RL R + Q +IR A++ LT F +A A V+ D+R+F+++D + +F
Sbjct 32 KTSRLGFRASESQAKIIRMASDISHKSLTEFITDSAYQAAEKVILDQRIFMVSDDIFNKF 91
Query 131 LAALDRPVSHKPRLEKLFAARSIFD 155
LDRP L+ LF++ S +D
Sbjct 92 SEILDRPEQDNEGLKDLFSSSSPWD 116
Lambda K H
0.322 0.132 0.389
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128907826664
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40