BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2826c
Length=294
Score E
Sequences producing significant alignments: (Bits) Value
gi|15842367|ref|NP_337404.1| hypothetical protein MT2893 [Mycoba... 586 1e-165
gi|289746625|ref|ZP_06506003.1| conserved hypothetical protein [... 585 3e-165
gi|15609963|ref|NP_217342.1| hypothetical protein Rv2826c [Mycob... 585 4e-165
gi|289758944|ref|ZP_06518322.1| conserved hypothetical protein [... 582 2e-164
gi|340626053|ref|YP_004744505.1| hypothetical protein MCAN_10481... 576 1e-162
gi|254232920|ref|ZP_04926247.1| hypothetical protein TBCG_02762 ... 455 5e-126
gi|289751485|ref|ZP_06510863.1| conserved hypothetical protein [... 257 2e-66
gi|289751486|ref|ZP_06510864.1| conserved hypothetical protein [... 156 4e-36
gi|295106028|emb|CBL03571.1| Domain of unknown function (DUF1814... 99.4 6e-19
gi|335433879|ref|ZP_08558694.1| hypothetical protein HLRTI_02314... 47.4 0.003
gi|257053998|ref|YP_003131831.1| hypothetical protein Huta_2937 ... 45.4 0.010
gi|48477106|ref|YP_022812.1| hypothetical protein PTO0034 [Picro... 44.7 0.019
gi|23466016|ref|NP_696619.1| hypothetical protein BL1460 [Bifido... 43.1 0.050
gi|227547359|ref|ZP_03977408.1| conserved hypothetical protein [... 43.1 0.051
gi|301310079|ref|ZP_07216018.1| conserved hypothetical protein [... 43.1 0.056
gi|336451022|ref|ZP_08621468.1| hypothetical protein A28LD_1129 ... 42.7 0.067
gi|239621312|ref|ZP_04664343.1| conserved hypothetical protein [... 42.7 0.067
gi|291516788|emb|CBK70404.1| Uncharacterized conserved protein [... 42.0 0.11
gi|192360269|ref|YP_001984087.1| hypothetical protein CJA_3634 [... 42.0 0.12
gi|255514035|gb|EET90299.1| hypothetical protein UNLARM2_0328 [C... 41.6 0.16
gi|118576239|ref|YP_875982.1| hypothetical protein CENSYa_1047 [... 41.6 0.17
gi|284172940|ref|YP_003406321.1| Domain of unknown function DUF1... 41.6 0.17
gi|197286340|ref|YP_002152212.1| hypothetical protein PMI2493 [P... 41.2 0.19
gi|323488128|ref|ZP_08093379.1| hypothetical protein GPDM_02255 ... 41.2 0.19
gi|315231742|ref|YP_004072178.1| hypothetical protein TERMP_0198... 40.8 0.23
gi|312796950|ref|YP_004029872.1| hypothetical protein RBRH_02471... 40.8 0.24
gi|212225016|ref|YP_002308252.1| protein TON_1864 [Thermococcus ... 40.8 0.28
gi|157364064|ref|YP_001470831.1| CRISPR-associated Csx11 family ... 40.4 0.30
gi|10803613|ref|NP_046011.1| hypothetical protein VNG7066 [Halob... 40.4 0.34
gi|253827855|ref|ZP_04870740.1| conserved hypothetical protein [... 40.0 0.42
gi|334128959|ref|ZP_08502835.1| hypothetical protein HMPREF9081_... 39.7 0.58
gi|239621824|ref|ZP_04664855.1| conserved hypothetical protein [... 39.7 0.60
gi|14590279|ref|NP_142345.1| hypothetical protein PH0371 [Pyroco... 39.7 0.63
gi|315231632|ref|YP_004072068.1| hypothetical protein TERMP_0187... 39.3 0.76
gi|242400011|ref|YP_002995436.1| hypothetical protein TSIB_2040 ... 38.5 1.2
gi|337284997|ref|YP_004624471.1| hypothetical protein PYCH_15330... 38.1 1.4
gi|254173885|ref|ZP_04880556.1| conserved domain protein [Thermo... 38.1 1.5
gi|160873178|ref|YP_001552494.1| hypothetical protein Sbal195_00... 38.1 1.7
gi|88803569|ref|ZP_01119094.1| pigmentation and extracellular pr... 37.7 2.2
gi|121608391|ref|YP_996198.1| hypothetical protein Veis_1419 [Ve... 37.7 2.3
gi|226325170|ref|ZP_03800688.1| hypothetical protein COPCOM_0296... 37.4 3.0
gi|325830613|ref|ZP_08164034.1| hypothetical protein HMPREF9404_... 37.0 3.2
gi|317487825|ref|ZP_07946418.1| hypothetical protein HMPREF1023_... 37.0 3.3
gi|148550950|ref|YP_001260380.1| hypothetical protein Swit_4997 ... 37.0 3.4
gi|89255316|ref|NP_659987.2| hypothetical protein RHE_PD00050 [R... 37.0 3.5
gi|327192828|gb|EGE59754.1| hypothetical protein RHECNPAF_19005 ... 37.0 3.8
gi|86134342|ref|ZP_01052924.1| DegT/DnrJ/EryC1/StrS aminotransfe... 36.6 4.2
gi|336036620|gb|AEH82551.1| conserved hypothetical protein [Sino... 36.2 6.0
gi|209883381|ref|YP_002287238.1| hypothetical protein OCAR_4224 ... 36.2 6.0
gi|16262679|ref|NP_435472.1| hypothetical protein SMa0429 [Sinor... 36.2 6.1
>gi|15842367|ref|NP_337404.1| hypothetical protein MT2893 [Mycobacterium tuberculosis CDC1551]
gi|148824015|ref|YP_001288769.1| hypothetical protein TBFG_12840 [Mycobacterium tuberculosis F11]
gi|167968180|ref|ZP_02550457.1| hypothetical protein MtubH3_09159 [Mycobacterium tuberculosis
H37Ra]
27 more sequence titles
Length=296
Score = 586 bits (1510), Expect = 1e-165, Method: Compositional matrix adjust.
Identities = 294/294 (100%), Positives = 294/294 (100%), Gaps = 0/294 (0%)
Query 1 VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 60
VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL
Sbjct 3 VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 62
Query 61 GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE 120
GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE
Sbjct 63 GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE 122
Query 121 PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL 180
PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL
Sbjct 123 PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL 182
Query 181 ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS 240
ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS
Sbjct 183 ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS 242
Query 241 IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS 294
IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS
Sbjct 243 IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS 296
>gi|289746625|ref|ZP_06506003.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289687153|gb|EFD54641.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
Length=296
Score = 585 bits (1508), Expect = 3e-165, Method: Compositional matrix adjust.
Identities = 293/294 (99%), Positives = 294/294 (100%), Gaps = 0/294 (0%)
Query 1 VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 60
VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL
Sbjct 3 VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 62
Query 61 GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE 120
GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE
Sbjct 63 GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE 122
Query 121 PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL 180
PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL
Sbjct 123 PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL 182
Query 181 ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS 240
ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS
Sbjct 183 ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS 242
Query 241 IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS 294
IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRRE+ENALAVLRS
Sbjct 243 IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRRELENALAVLRS 296
>gi|15609963|ref|NP_217342.1| hypothetical protein Rv2826c [Mycobacterium tuberculosis H37Rv]
gi|31794002|ref|NP_856495.1| hypothetical protein Mb2850c [Mycobacterium bovis AF2122/97]
gi|121638705|ref|YP_978929.1| hypothetical protein BCG_2845c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
38 more sequence titles
Length=294
Score = 585 bits (1507), Expect = 4e-165, Method: Compositional matrix adjust.
Identities = 293/294 (99%), Positives = 294/294 (100%), Gaps = 0/294 (0%)
Query 1 VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 60
+AGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL
Sbjct 1 MAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 60
Query 61 GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE 120
GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE
Sbjct 61 GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE 120
Query 121 PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL 180
PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL
Sbjct 121 PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL 180
Query 181 ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS 240
ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS
Sbjct 181 ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS 240
Query 241 IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS 294
IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS
Sbjct 241 IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS 294
>gi|289758944|ref|ZP_06518322.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|289714508|gb|EFD78520.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
Length=294
Score = 582 bits (1501), Expect = 2e-164, Method: Compositional matrix adjust.
Identities = 292/294 (99%), Positives = 293/294 (99%), Gaps = 0/294 (0%)
Query 1 VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 60
+AGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL
Sbjct 1 MAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 60
Query 61 GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE 120
GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE
Sbjct 61 GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE 120
Query 121 PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL 180
PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYR VAL
Sbjct 121 PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRHVAL 180
Query 181 ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS 240
ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS
Sbjct 181 ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS 240
Query 241 IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS 294
IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS
Sbjct 241 IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS 294
>gi|340626053|ref|YP_004744505.1| hypothetical protein MCAN_10481 [Mycobacterium canettii CIPT
140010059]
gi|340004243|emb|CCC43384.1| hypothetical protein MCAN_10481 [Mycobacterium canettii CIPT
140010059]
Length=294
Score = 576 bits (1484), Expect = 1e-162, Method: Compositional matrix adjust.
Identities = 288/294 (98%), Positives = 290/294 (99%), Gaps = 0/294 (0%)
Query 1 VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 60
+AGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL
Sbjct 1 MAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 60
Query 61 GNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE 120
GN GRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE
Sbjct 61 GNAGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGE 120
Query 121 PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL 180
PRI ASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL
Sbjct 121 PRIAASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVAL 180
Query 181 ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS 240
ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS
Sbjct 181 ARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDS 240
Query 241 IGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS 294
IGVLTRPVAMAAWEARVR RFAFLTDLDADEQRWAACDERHRREVENALA L+S
Sbjct 241 IGVLTRPVAMAAWEARVRTRFAFLTDLDADEQRWAACDERHRREVENALAALQS 294
>gi|254232920|ref|ZP_04926247.1| hypothetical protein TBCG_02762 [Mycobacterium tuberculosis C]
gi|124601979|gb|EAY60989.1| hypothetical protein TBCG_02762 [Mycobacterium tuberculosis C]
Length=238
Score = 455 bits (1170), Expect = 5e-126, Method: Compositional matrix adjust.
Identities = 227/229 (99%), Positives = 228/229 (99%), Gaps = 0/229 (0%)
Query 66 FSTDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGEPRIVA 125
FSTDL+FSAPDDEVVLEVCELIDGARVGGFEFGVQ TRGDGRHWQLRVRHTELGEPRIVA
Sbjct 10 FSTDLNFSAPDDEVVLEVCELIDGARVGGFEFGVQITRGDGRHWQLRVRHTELGEPRIVA 69
Query 126 SVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVALARDLY 185
SVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVALARDLY
Sbjct 70 SVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVALARDLY 129
Query 186 DLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDSIGVLT 245
DLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDSIGVLT
Sbjct 130 DLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVEDVLAARSEHDFQPDSIGVLT 189
Query 246 RPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS 294
RPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS
Sbjct 190 RPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREVENALAVLRS 238
>gi|289751485|ref|ZP_06510863.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289692072|gb|EFD59501.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=129
Score = 257 bits (656), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 129/129 (100%), Positives = 129/129 (100%), Gaps = 0/129 (0%)
Query 166 EACAEKLARYRRVALARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVE 225
EACAEKLARYRRVALARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVE
Sbjct 1 EACAEKLARYRRVALARDLYDLNHFASRTIDEPLVRRLWVLKVWGDVVDDRRGTRPLRVE 60
Query 226 DVLAARSEHDFQPDSIGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREV 285
DVLAARSEHDFQPDSIGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREV
Sbjct 61 DVLAARSEHDFQPDSIGVLTRPVAMAAWEARVRKRFAFLTDLDADEQRWAACDERHRREV 120
Query 286 ENALAVLRS 294
ENALAVLRS
Sbjct 121 ENALAVLRS 129
>gi|289751486|ref|ZP_06510864.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289692073|gb|EFD59502.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=101
Score = 156 bits (394), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 77/77 (100%), Positives = 77/77 (100%), Gaps = 0/77 (0%)
Query 1 VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 60
VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL
Sbjct 25 VAGLTRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRL 84
Query 61 GNVGRFSTDLDFSAPDD 77
GNVGRFSTDLDFSAPDD
Sbjct 85 GNVGRFSTDLDFSAPDD 101
>gi|295106028|emb|CBL03571.1| Domain of unknown function (DUF1814). [Gordonibacter pamelaeae
7-10-1-b]
Length=295
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 93/296 (32%), Positives = 129/296 (44%), Gaps = 54/296 (18%)
Query 9 VARHAL--GRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVGRF 66
+ARH A+ +AA++DVAQD LL RL + G
Sbjct 11 IARHTPRNAGAQGREAAVVDVAQDLLLQ------------------------RLHDDG-- 44
Query 67 STDLDFSAPD-----DEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGEP 121
DLDFS D DEV +D +G F + V+ RG W + + EP
Sbjct 45 --DLDFSVSDFDLGRDEVAEAFASAVDRLSIGPFRYSVRERRG---KWSVVFESGFVREP 99
Query 122 RIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVALA 181
+ ++F+ P P E ++ +PIHK Y LP + V E AEK+AR R A
Sbjct 100 SLATKLDFSPAPWLEPVE-RTWVAMPIHKQYAAPLPAIKTVRLEENIAEKVARLNRTTTA 158
Query 182 RDLYDLNHFA-----SRTIDEPLVRRLWVLKVWGDVVDDRRGTRP---------LRVEDV 227
RD+YDL +R++D LVRRL VLK+W D G+ VE
Sbjct 159 RDMYDLAWIMGKAPLARSLDLDLVRRLSVLKIWVDSNGLHSGSMTWPPGHEKSVFDVERW 218
Query 228 LAARSEHDFQPDSIGVLTRPV-AMAAWEARVRKRFAFLTDLDADEQRWAACDERHR 282
L RS+ +F + IG L P + VR F+FL++L +E+ A D R R
Sbjct 219 LRERSDGEFDLEDIGALAVPAPSPKELSESVRIGFSFLSNLTDEEEVLAKADNRDR 274
>gi|335433879|ref|ZP_08558694.1| hypothetical protein HLRTI_02314 [Halorhabdus tiamatea SARL4B]
gi|335438228|ref|ZP_08560977.1| hypothetical protein HLRTI_13840 [Halorhabdus tiamatea SARL4B]
gi|334892686|gb|EGM30916.1| hypothetical protein HLRTI_13840 [Halorhabdus tiamatea SARL4B]
gi|334898369|gb|EGM36478.1| hypothetical protein HLRTI_02314 [Halorhabdus tiamatea SARL4B]
Length=269
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/162 (32%), Positives = 76/162 (47%), Gaps = 23/162 (14%)
Query 39 TVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAP----DDEVVLEVCELIDGARVGG 94
T +GDN L+FKGGT+L K R+S DLDF E L+ L D AR G
Sbjct 36 TSSYGDN-LLFKGGTALSKLYFPETWRYSEDLDFGVEGAYRGSETGLQDA-LEDAARTSG 93
Query 95 FEFGVQSTRGDGR-----HW-QLRVRHTELGEPRIVASVEFARRPLALPSELLAFIQLPI 148
+F V R + H+ + +++T + + S++ + E + F +
Sbjct 94 IDFEVTKHRELQKEAYPTHYVDIDIQYTAVLGQKNTTSLD------VMIDEYVVFDSVSH 147
Query 149 HKAYGFGLPTLPVVAEA--EACAEKL-ARYRRVALARDLYDL 187
H +Y +P + A + E AEKL A Y+R + ARD YDL
Sbjct 148 HHSYE-DVPEFELTAYSLEEIFAEKLRALYQR-SQARDYYDL 187
>gi|257053998|ref|YP_003131831.1| hypothetical protein Huta_2937 [Halorhabdus utahensis DSM 12940]
gi|256692761|gb|ACV13098.1| Domain of unknown function DUF1814 [Halorhabdus utahensis DSM
12940]
Length=269
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 52/186 (28%), Positives = 77/186 (42%), Gaps = 16/186 (8%)
Query 39 TVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDDEVVLEV---CELIDGARVGGF 95
T Q+GDN L+FKGGT+L K R+S DLDF + EV L D R G
Sbjct 36 TSQYGDN-LLFKGGTALSKLYFPETWRYSEDLDFGVEGEYQGSEVELRDVLEDATRASGI 94
Query 96 EFGVQSTRGDGRHWQLRVRHTELGEPRIVASVEFARRPLA----LPSELLAFIQLPIHKA 151
+F V R Q T + I + + + E + F + +
Sbjct 95 DFEVTKH----RELQKEAYPTHYVDIDIQYNAVLGHKNTTSLDVMIDEYVVFDSVNHRHS 150
Query 152 YGFGLPTLPVVAEA--EACAEKLARYRRVALARDLYDLNHFASRT-IDEPLVRRLWVLKV 208
Y +P + A + E AEKL + + ARD YDL + +D+ ++R + K
Sbjct 151 YE-DVPEFELTAYSVEEIFAEKLRALYQRSKARDHYDLYRMITEADVDDSVIRPAFTRKC 209
Query 209 WGDVVD 214
D +D
Sbjct 210 EHDGLD 215
>gi|48477106|ref|YP_022812.1| hypothetical protein PTO0034 [Picrophilus torridus DSM 9790]
gi|48429754|gb|AAT42619.1| conserved hypothetical protein [Picrophilus torridus DSM 9790]
Length=262
Score = 44.7 bits (104), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 44/177 (25%), Positives = 82/177 (47%), Gaps = 23/177 (12%)
Query 27 VAQDHLLYLLSQTV--QFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDDEVVLE-- 82
+ +D+LL LL + +F D L+FKGGTSL+ N+ RFS DLDFS + L+
Sbjct 20 LEKDYLLTLLLYEIYNEFND-ELIFKGGTSLK--YFYNLNRFSEDLDFSYLSKKHSLKSI 76
Query 83 VCELIDGARVGGFEFGVQSTRGDGR---------HWQLRVRHTELGEPRIVASVEF---A 130
++ + ++ + +T G +++LR++ + + +++
Sbjct 77 YAKMNRAFKHVNLQYDIINTEHRGHKVGDTVVRINFELRIKGPLYNKLNYMENIDIDLSL 136
Query 131 RRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVALARDLYDL 187
R + LP ++ + P + + +PV+ E +EK+A RD+YDL
Sbjct 137 RNDVILPPDIKYLV--PTYP--DIPMFPVPVMNLNEIISEKVASIIERNKMRDIYDL 189
>gi|23466016|ref|NP_696619.1| hypothetical protein BL1460 [Bifidobacterium longum NCC2705]
gi|23326735|gb|AAN25255.1| hypothetical protein BL1460 [Bifidobacterium longum NCC2705]
gi|338754396|gb|AEI97385.1| hypothetical protein BLNIAS_01218 [Bifidobacterium longum subsp.
longum KACC 91563]
Length=317
Score = 43.1 bits (100), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 79/289 (28%), Positives = 116/289 (41%), Gaps = 41/289 (14%)
Query 6 RALVARHALGRAEAYDAALLDVAQDHLLY--LLSQTVQFGD-NRLVFKGGTSLRKCRLGN 62
+A+ A L E LL V + LL+ +L ++ G + LVF+GGTSLR C
Sbjct 11 KAMAAGIVLAGGEGM-GNLLPVVEKELLHYRILDAMMREGFFSSLVFQGGTSLRLCH--G 67
Query 63 VGRFSTDLDFSAP---DDEVVLEVCELIDGARVG---GFEFGVQSTRGDG----RHWQLR 112
R+S DLDF+ D + + + I + G V+ R D R W++
Sbjct 68 SPRYSEDLDFAGGTSFDMDTLKGLGSCISDSLSGMGDDVTVRVKEPRPDADGLTRRWRIA 127
Query 113 VRHTELGE--PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAE 170
+R + P +E A P P A + P+ A G L V + E A+
Sbjct 128 IRTAGQRKDLPSQTIKLEVASIPAYEPQHRPALVNYPMFPALS-GQIILDVESPTEILAD 186
Query 171 KLARYRRVALA--RDLYDLNHFASRT-IDEPLVRRLWVLK--------VWGDVVDDRRGT 219
KL Y + RDL+D+ ASR +D L LK +W D R
Sbjct 187 KLLSYACASHLRRRDLWDMCWLASRGDVDSRRAMELAELKSSDYGEEGLWAD-----RAD 241
Query 220 RPLRVEDVLAARSEHD----FQPDSIGVLTRPVAMAAWEARVRKRFAFL 264
R V DV+ + + D F P G++T V W A ++ L
Sbjct 242 RVAGVADVIGSDAFADEMRRFLP--AGLMTSTVESPRWSAWAIEQIGTL 288
>gi|227547359|ref|ZP_03977408.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis ATCC 55813]
gi|227212174|gb|EEI80070.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis ATCC 55813]
Length=318
Score = 43.1 bits (100), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 78/280 (28%), Positives = 113/280 (41%), Gaps = 41/280 (14%)
Query 6 RALVARHALGRAEAYDAALLDVAQDHLLY--LLSQTVQFGD-NRLVFKGGTSLRKCRLGN 62
+A+ A L E LL V + LL+ +L ++ G + LVF+GGTSLR C
Sbjct 12 KAMAAGIVLAGGEGM-GNLLPVVEKELLHYRILDAMMREGFFSSLVFQGGTSLRLCH--G 68
Query 63 VGRFSTDLDFSAP---DDEVVLEVCELIDGARVG---GFEFGVQSTRGDG----RHWQLR 112
R+S DLDF+ D + + + I + G V+ R D R W++
Sbjct 69 SPRYSEDLDFAGGTSFDMDTLKGLGSCISDSLSGMGDDVTVRVKEPRPDADGLTRRWRIA 128
Query 113 VRHTELGE--PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAE 170
+R + P +E A P P A + P+ A G L V + E A+
Sbjct 129 IRTAGQRKDLPSQTIKLEVASIPAYEPQHRPALVNYPMFPALS-GQIILDVESPTEILAD 187
Query 171 KLARYRRVALA--RDLYDLNHFASRT-IDEPLVRRLWVLK--------VWGDVVDDRRGT 219
KL Y + RDL+D+ ASR +D L LK +W D R
Sbjct 188 KLLSYACASHLRRRDLWDMCWLASRGDVDSRRAMELAELKSSDYGEEGLWAD-----RAD 242
Query 220 RPLRVEDVLAARSEHD----FQPDSIGVLTRPVAMAAWEA 255
R V DV+ + + D F P G++T V W A
Sbjct 243 RVAGVADVIGSDAFADEMRRFLP--AGLMTSTVESPRWSA 280
>gi|301310079|ref|ZP_07216018.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|300831653|gb|EFK62284.1| conserved hypothetical protein [Bacteroides sp. 20_3]
Length=262
Score = 43.1 bits (100), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 45/166 (28%), Positives = 70/166 (43%), Gaps = 30/166 (18%)
Query 46 RLVFKGGTSLRKCRLGNVGRFSTDLDFSAPD---DEVVLEVCELIDGARVGGFEFGVQST 102
++ F GGT+LR + + RFS DLDF + DE + E+ +G G++
Sbjct 46 KMAFIGGTNLRLVK--GIDRFSEDLDFDCKNLSKDEFI----EMTNGVIQFLERSGLRVE 99
Query 103 RGDGRHWQL-----RVRHTEL---------GEPRIVASVEFARRPLALPSELLAFIQLPI 148
D ++ +L + EL E R + VE + +A P +
Sbjct 100 AKDKKNPKLTAFRRNIHFPELLFDLGLSGHKEERFLIKVESQYQGIAYPPVITNI----- 154
Query 149 HKAYGFGLPTLPVVAEAEACAEKLARYRRVALARDLYDLNHFASRT 194
K YGF P PV ++ C+ K+A A RD YDL S++
Sbjct 155 -KGYGFFFP-FPVPSDGVLCSMKIAAMLARAKGRDFYDLMFLLSQS 198
>gi|336451022|ref|ZP_08621468.1| hypothetical protein A28LD_1129 [Idiomarina sp. A28L]
gi|336282278|gb|EGN75516.1| hypothetical protein A28LD_1129 [Idiomarina sp. A28L]
Length=306
Score = 42.7 bits (99), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 54/171 (32%), Positives = 79/171 (47%), Gaps = 39/171 (22%)
Query 45 NRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDDEVVLEVCEL------IDGARVG----- 93
+ L+F+GGTSLR C GN RFS DLDF+ D ++ E+ G R G
Sbjct 48 DSLIFQGGTSLRLCYGGN--RFSEDLDFAGGYDFSSSQLAEMKACIETYIGNRYGLEVTV 105
Query 94 --GFEFGVQSTRGDGR--HWQLRV----RHTELGEPRI---VASVE-FARRPLALPSELL 141
E + T + R WQ+ V + L + RI VA+V + ++PLAL +
Sbjct 106 KEPNELKAEPTYAELRIEKWQIAVVTAPENKSLPKQRIKLEVANVPAYTKQPLALQAN-- 163
Query 142 AFIQLPIHKAYGFGLPTLPVVAEA--EACAEK---LARYRRVALARDLYDL 187
+ LP G L V+ E+ E A+K LA ++ RD++DL
Sbjct 164 -YQFLPS------GYSDLLVMTESLDEIMADKIVSLAATKKYTRNRDIWDL 207
>gi|239621312|ref|ZP_04664343.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis CCUG 52486]
gi|239515773|gb|EEQ55640.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis CCUG 52486]
Length=305
Score = 42.7 bits (99), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 74/262 (29%), Positives = 107/262 (41%), Gaps = 40/262 (15%)
Query 24 LLDVAQDHLLY--LLSQTVQFGD-NRLVFKGGTSLRKCRLGNVGRFSTDLDFSAP---DD 77
LL V + LL+ +L ++ G + LVF+GGTSLR C R+S DLDF+ D
Sbjct 16 LLPVVEKELLHYRILDAMMREGFFSSLVFQGGTSLRLCH--GSPRYSEDLDFAGGTSFDM 73
Query 78 EVVLEVCELIDGARVG---GFEFGVQSTRGDG----RHWQLRVRHTELGE--PRIVASVE 128
+ + + I + G V+ R D R W++ +R + P +E
Sbjct 74 DTLKGLGSCISDSLSGMGDDVTVRVKEPRPDADGLTRRWRIAIRTAGQRKDLPSQTIKLE 133
Query 129 FARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVALA--RDLYD 186
A P P A + P+ A G L V + E A+KL Y + RDL+D
Sbjct 134 VASIPAYEPQHRPALVNYPMFPALS-GQIILDVESPTEILADKLLSYACASHLRRRDLWD 192
Query 187 LNHFASRT-IDEPLVRRLWVLK--------VWGDVVDDRRGTRPLRVEDVLAARSEHD-- 235
+ ASR +D L LK +W D R R V DV+ + + D
Sbjct 193 MCWLASRGDVDSRRAMELAELKSSDYGEEGLWAD-----RADRVAGVADVIGSDAFADEM 247
Query 236 --FQPDSIGVLTRPVAMAAWEA 255
F P G++T V W A
Sbjct 248 RRFLP--AGLMTSTVESPRWSA 267
>gi|291516788|emb|CBK70404.1| Uncharacterized conserved protein [Bifidobacterium longum subsp.
longum F8]
Length=318
Score = 42.0 bits (97), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 77/280 (28%), Positives = 112/280 (40%), Gaps = 41/280 (14%)
Query 6 RALVARHALGRAEAYDAALLDVAQDHLLY--LLSQTVQFGD-NRLVFKGGTSLRKCRLGN 62
+A+ A L E LL V + LL+ +L ++ G + LVF+GGTSLR C
Sbjct 12 KAMAAGIVLAGGEGM-GNLLPVVEKELLHYRILDAMMREGFFSSLVFQGGTSLRLCH--G 68
Query 63 VGRFSTDLDFSAP---DDEVVLEVCELIDGARVG---GFEFGVQSTRGDG----RHWQLR 112
R+S DLDF+ D + + + I + G V+ R D R W++
Sbjct 69 SPRYSEDLDFAGGTSFDMDTLKGLGSCISDSLSGMGDDVTVRVKEPRPDADGLTRRWRIA 128
Query 113 VRHTELGE--PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAE 170
+R + P +E A P P A + P+ A G L + E A+
Sbjct 129 IRTAGQRKDLPSQTIKLEVASIPAYEPQHRPALVNYPMFPALS-GQIILDAESPTEILAD 187
Query 171 KLARYRRVALA--RDLYDLNHFASRT-IDEPLVRRLWVLK--------VWGDVVDDRRGT 219
KL Y + RDL+D+ ASR +D L LK +W D R
Sbjct 188 KLLSYACASHLRRRDLWDMCWLASRGDVDSRRAMELAELKSSDYGEEGLWAD-----RAD 242
Query 220 RPLRVEDVLAARSEHD----FQPDSIGVLTRPVAMAAWEA 255
R V DV+ + + D F P G++T V W A
Sbjct 243 RAAGVADVIGSDAFADEMRRFLP--AGLMTSTVESPRWSA 280
>gi|192360269|ref|YP_001984087.1| hypothetical protein CJA_3634 [Cellvibrio japonicus Ueda107]
gi|190686434|gb|ACE84112.1| conserved hypothetical protein [Cellvibrio japonicus Ueda107]
Length=306
Score = 42.0 bits (97), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/33 (64%), Positives = 24/33 (73%), Gaps = 2/33 (6%)
Query 45 NRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDD 77
+ LVF+GGTSLR CR GN RFS DLDF+ D
Sbjct 48 DNLVFQGGTSLRLCRGGN--RFSEDLDFAGGKD 78
>gi|255514035|gb|EET90299.1| hypothetical protein UNLARM2_0328 [Candidatus Micrarchaeum acidiphilum
ARMAN-2]
Length=265
Score = 41.6 bits (96), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 51/176 (29%), Positives = 79/176 (45%), Gaps = 25/176 (14%)
Query 29 QDHLL-YLLSQTVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDDEVVLEVCELI 87
+D+LL LL + N LVFKGGT+L+ + RFS DLDFS +
Sbjct 22 RDYLLTLLLDEICSVFSNELVFKGGTALK--YFYGLNRFSEDLDFSYSGTNDTRSRKSIN 79
Query 88 DGARVGGFEFGVQ-----------STRGD--GRHWQLRVR---HTELGEPRIVASVEFAR 131
DG + FG+Q +G G ++ +RV + LG+ + + SV+ +
Sbjct 80 DGISIALKRFGMQYEVVSQERRAKKEKGVVLGINYIIRVAGPLNKALGQLQNI-SVDLSL 138
Query 132 RPLALPSELLAFIQLPIH-KAYGFGLPTLPVVAEAEACAEKLARYRRVALARDLYD 186
R + +L ++ PI+ F + T+ V E AEK+A RD+YD
Sbjct 139 RNDIIEKPVLKYMS-PIYPDITTFSVLTMGV---EEILAEKIAAIIERDKMRDIYD 190
>gi|118576239|ref|YP_875982.1| hypothetical protein CENSYa_1047 [Cenarchaeum symbiosum A]
gi|118194760|gb|ABK77678.1| conserved hypothetical protein [Cenarchaeum symbiosum A]
Length=254
Score = 41.6 bits (96), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 48/162 (30%), Positives = 75/162 (47%), Gaps = 19/162 (11%)
Query 35 LLSQTVQF-GDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDDEVVLEVCELIDGARVG 93
LLS F G +++VFKGGTS++K + R+S DLDF+ +D V ++ E + G +G
Sbjct 29 LLSIIADFPGIDKIVFKGGTSVKKMFFRDF-RYSEDLDFNGLED-VTEDLIEHLRG-NMG 85
Query 94 GFEFGVQSTRGDGR---HWQLRVRHTELGEPRIVASVEFARRP--LALPS--ELLA-FIQ 145
G R RV + + R +V+ + R + P E+L +
Sbjct 86 GLNVDFTEIIPKDRTRVSASFRVMYKSVNGTRSSVNVDMSMRMNLMMKPQTREMLTDYED 145
Query 146 LPIHKAYGFGLPTLPVVAEAEACAEKLARYRRVALARDLYDL 187
LP G +PV+ E AEK++ A AR +YD+
Sbjct 146 LP-------GPYHIPVMDLEEIMAEKISAVTYSAHARHVYDV 180
>gi|284172940|ref|YP_003406321.1| Domain of unknown function DUF1814 [Haloterrigena turkmenica
DSM 5511]
gi|284017700|gb|ADB63648.1| Domain of unknown function DUF1814 [Haloterrigena turkmenica
DSM 5511]
Length=267
Score = 41.6 bits (96), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 45/158 (29%), Positives = 69/158 (44%), Gaps = 15/158 (9%)
Query 39 TVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAP-----DDEVVLEVCELIDGARVG 93
T FG+N L+FKGGT+L K RFS DLDF ++ + +V + +
Sbjct 36 TSGFGEN-LMFKGGTALSKLYFPQSWRFSEDLDFGVEGQYKGSEDGLRDVLDTV--TDRS 92
Query 94 GFEFGVQSTRGDGRHWQLRVRHTELG-EPRIVASVEFARRPLALPSELLAFIQLPIHKAY 152
G EF + S + R + ++ + R V + E +AF P+H +
Sbjct 93 GIEFTI-SEHHESRQQHYPTHYVDMSIQYRAVLDHPNTTSLDVMVDEYVAFD--PVHYTH 149
Query 153 GF-GLPTLPVVAEA--EACAEKLARYRRVALARDLYDL 187
+ +P + A + E AEKL + ARD YDL
Sbjct 150 SYEDIPEFELQAYSVEEIFAEKLRAIFQRGAARDYYDL 187
>gi|197286340|ref|YP_002152212.1| hypothetical protein PMI2493 [Proteus mirabilis HI4320]
gi|194683827|emb|CAR44928.1| conserved hypothetical protein [Proteus mirabilis HI4320]
Length=306
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 20/33 (61%), Positives = 23/33 (70%), Gaps = 2/33 (6%)
Query 45 NRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDD 77
N+L F+GGTSLR C GN RFS DLDF+ D
Sbjct 48 NKLTFQGGTSLRLCYGGN--RFSEDLDFAGGKD 78
>gi|323488128|ref|ZP_08093379.1| hypothetical protein GPDM_02255 [Planococcus donghaensis MPA1U2]
gi|323398132|gb|EGA90927.1| hypothetical protein GPDM_02255 [Planococcus donghaensis MPA1U2]
Length=316
Score = 41.2 bits (95), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/59 (41%), Positives = 32/59 (55%), Gaps = 1/59 (1%)
Query 17 AEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAP 75
A AY + +D+ + LL + + +VFKGGTSL KC + RFS DLD S P
Sbjct 18 AAAYGLQNFQIEKDYYVSLLLKKLVSNFPGVVFKGGTSLSKC-YDVIKRFSEDLDLSVP 75
>gi|315231742|ref|YP_004072178.1| hypothetical protein TERMP_01981 [Thermococcus barophilus MP]
gi|315184770|gb|ADT84955.1| hypothetical protein TERMP_01981 [Thermococcus barophilus MP]
Length=312
Score = 40.8 bits (94), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 25/59 (43%), Positives = 32/59 (55%), Gaps = 2/59 (3%)
Query 26 DVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDDEVVLEVC 84
D+ +L L F N L FKGGT L KC LG RFS DLDF++ D + +E+
Sbjct 25 DIILHSILRELYSNEYFSSNYL-FKGGTCLIKCYLGYY-RFSVDLDFTSRDPQTWIELS 81
>gi|312796950|ref|YP_004029872.1| hypothetical protein RBRH_02471 [Burkholderia rhizoxinica HKI
454]
gi|312168725|emb|CBW75728.1| unnamed protein product [Burkholderia rhizoxinica HKI 454]
Length=307
Score = 40.8 bits (94), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 52/214 (25%), Positives = 84/214 (40%), Gaps = 38/214 (17%)
Query 7 ALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVGRF 66
AL ++A R + + ++ +LY L Q+ L F+GGT+LR C G R+
Sbjct 15 ALAGQYANDRKVPTNTIMKEILHYEILYALLQSGAAA--ALTFQGGTALRLCYQGT--RY 70
Query 67 STDLDFSAPDDEVVLEVCELIDGARVGGFEFGVQSTRGDGRHWQLRVRHTELGEP----- 121
S DLDF+ D+ D + F +Q D Q+ ++ + P
Sbjct 71 SEDLDFAGGDN---------FDPRLMAPFAELLQKEIADAYGLQIEIKAPKEKPPSDGVN 121
Query 122 --RIVASVEFARRPLALPSELLAFIQ---LPIHKA----YGFGLPTLP-----VVAEAEA 167
R A V + ++P + I+ +P H A P LP ++ AE
Sbjct 122 VTRWSAKVHIPQIDPSVPQNQIINIEVASVPAHDADLVSIAANYPHLPAPHRQLIITAET 181
Query 168 CAEKLARY------RRVALARDLYDLNHFASRTI 195
E LA R ARD++D+ + R +
Sbjct 182 PNEILADKLLALGARPFLKARDIWDIKYLTDRQV 215
>gi|212225016|ref|YP_002308252.1| protein TON_1864 [Thermococcus onnurineus NA1]
gi|212009973|gb|ACJ17355.1| hypothetical protein TON_1864 [Thermococcus onnurineus NA1]
Length=263
Score = 40.8 bits (94), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 46/173 (27%), Positives = 78/173 (46%), Gaps = 10/173 (5%)
Query 29 QDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVG--RFSTDLDFSAPDDEVVLEVCEL 86
++ + +LLSQ + + + +GGT+L + L +G RFS D+D D ++ E+
Sbjct 23 EERISFLLSQLWEIFGEKAILRGGTALNRVYLAKIGAARFSEDIDIDYFDGDIGRAAEEI 82
Query 87 IDGAR-VGGFEFGVQSTRGDGRHWQLRVRHTELGEPRIVASVEFARRPLALPSELLAFIQ 145
G + V GF+ ++ R R ++ + R VEF L+ P + A I+
Sbjct 83 KKGMKLVEGFD--IKGPRILHRTFRFDCYYRNPLGNRDRVKVEFY---LSRPPYVEAGIE 137
Query 146 LPIHKAYGFGLPTLPVVAEAE-ACAEKLARYRRVALARDLYDLNHFASRTIDE 197
L + + PT+ V E A+KLA +D+YD H + DE
Sbjct 138 L-VKSPFVSEYPTMFRVYSFEDLLAKKLAALYNRTEGKDIYDSFHALNMEFDE 189
>gi|157364064|ref|YP_001470831.1| CRISPR-associated Csx11 family protein [Thermotoga lettingae
TMO]
gi|157314668|gb|ABV33767.1| CRISPR-associated protein Csx11 [Thermotoga lettingae TMO]
Length=1218
Score = 40.4 bits (93), Expect = 0.30, Method: Composition-based stats.
Identities = 56/195 (29%), Positives = 83/195 (43%), Gaps = 27/195 (13%)
Query 26 DVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPD----DEVVL 81
D AQ+ LL LS LV KGGT +RK + N RFS DLDF+ + +E
Sbjct 24 DYAQNWLLMALSSL------PLVLKGGTGIRKVYISNY-RFSDDLDFTLLEEFSAEEFKT 76
Query 82 EVCELIDGAR-------VGGFEFGVQSTRGDGRHWQLRVRHTELGEPRIVASVEFARRPL 134
+ ++I+ AR FEF +G + + GE R ++ +
Sbjct 77 TIDKVIEKAREESGMNFFEDFEF---QKNNNGFEIDTYFQFMQRGENRTKIKLDITKA-- 131
Query 135 ALPSELLAFIQLPIHKAYGFGLP-TLPVVAEAEACAEKLARYRRVALARDLYDLNHFASR 193
LL ++ I Y L + V + E AEK+ + RDLYD+ + S+
Sbjct 132 KNERILLPVLREKIIHLYSDDLDCEVKVYSLEEIVAEKIRSLFQRTRPRDLYDVWYLWSK 191
Query 194 TIDEPLVRRLWVLKV 208
T D + R VLK+
Sbjct 192 TND---IDRRKVLKI 203
>gi|10803613|ref|NP_046011.1| hypothetical protein VNG7066 [Halobacterium sp. NRC-1]
gi|10803690|ref|NP_046088.1| hypothetical protein VNG7143 [Halobacterium sp. NRC-1]
gi|16120051|ref|NP_395639.1| hypothetical protein VNG6087C [Halobacterium sp. NRC-1]
7 more sequence titles
Length=267
Score = 40.4 bits (93), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 44/158 (28%), Positives = 68/158 (44%), Gaps = 15/158 (9%)
Query 39 TVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAP-----DDEVVLEVCELIDGARVG 93
T FG+N L+FKGGT+L K RFS DLDF ++ + +V + +
Sbjct 36 TSDFGEN-LMFKGGTALSKLYFPQSWRFSEDLDFGVEGQYNGSEDDLRDVLDTV--TERS 92
Query 94 GFEFGVQSTRGDGRHWQLRVRHTELG-EPRIVASVEFARRPLALPSELLAFIQLPIHKAY 152
G EF + S + R + ++ + R V + E +AF +H +
Sbjct 93 GIEFTI-SEHHESRQQHYPTHYVDMSIQYRAVLDHPNTTSLDVMVDEYVAFDS--VHHTH 149
Query 153 GF-GLPTLPVVAEA--EACAEKLARYRRVALARDLYDL 187
+ +P + A + E AEKL + ARD YDL
Sbjct 150 SYEDIPEFELQAYSVEEIFAEKLRAIFQRGAARDYYDL 187
>gi|253827855|ref|ZP_04870740.1| conserved hypothetical protein [Helicobacter canadensis MIT 98-5491]
gi|313142416|ref|ZP_07804609.1| conserved hypothetical protein [Helicobacter canadensis MIT 98-5491]
gi|253511261|gb|EES89920.1| conserved hypothetical protein [Helicobacter canadensis MIT 98-5491]
gi|313131447|gb|EFR49064.1| conserved hypothetical protein [Helicobacter canadensis MIT 98-5491]
Length=292
Score = 40.0 bits (92), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 27/68 (40%), Positives = 38/68 (56%), Gaps = 4/68 (5%)
Query 12 HALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLD 71
+A DA + ++ +L LSQ+ D +VF+GGTSLR C GN R S DLD
Sbjct 11 YAFELGNTKDAVIKEILHYDILQSLSQSDIAND--IVFQGGTSLRLC-YGN-NRHSEDLD 66
Query 72 FSAPDDEV 79
F+ D++V
Sbjct 67 FALKDEKV 74
>gi|334128959|ref|ZP_08502835.1| hypothetical protein HMPREF9081_2423 [Centipeda periodontii DSM
2778]
gi|333385986|gb|EGK57211.1| hypothetical protein HMPREF9081_2423 [Centipeda periodontii DSM
2778]
Length=314
Score = 39.7 bits (91), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 24/76 (32%), Positives = 41/76 (54%), Gaps = 7/76 (9%)
Query 27 VAQDHLLYLLSQTVQFGD--NRLVFKGGTSLRKCRLGNVGRFSTDLDFS-----APDDEV 79
V +D+ + +L Q ++ ++ VFKGGTSL KC + RFS D+D + D+
Sbjct 28 VRRDYFIVMLLQQLEVSAYADQCVFKGGTSLSKCYPETIKRFSEDIDITFLMGECATDKK 87
Query 80 VLEVCELIDGARVGGF 95
++ +L++ A G F
Sbjct 88 YDKMLKLVEKAIAGKF 103
>gi|239621824|ref|ZP_04664855.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis CCUG 52486]
gi|239515015|gb|EEQ54882.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis CCUG 52486]
Length=321
Score = 39.7 bits (91), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 63/222 (29%), Positives = 93/222 (42%), Gaps = 29/222 (13%)
Query 13 ALGRAEAYDAALLDVAQDHLLYLLSQTVQFGD--NRLVFKGGTSLRKCRLGNVGRFSTDL 70
+ R+E A L V ++ L Y + +Q G +VF+GGTSLR C R+S DL
Sbjct 11 GIARSEGMGALLPVVEKELLHYRILSAMQDGGFFGPIVFQGGTSLRLCH--GSPRYSEDL 68
Query 71 DFSAPDDEVVLEVCELIDGAR--VGGFEFGVQ-----STRGDG--RHWQLRVRHTELGE- 120
DF+ V ++ L + R + G VQ R +G R W++ +R
Sbjct 69 DFAGGTGFGVDDLRGLGECVRSSLAGMSPDVQVKVREPVRDEGLVRRWRISIRTAAQRRD 128
Query 121 -PRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTLPVVAEAEACAEKLARY---- 175
P +E A P P + P A G L V + E A+KL +
Sbjct 129 LPSQSIKLEVASVPAHEPQTRPVRVNYPSVSAIA-GDIILAVESPTEILADKLLSFACSS 187
Query 176 --RRVALARDLYDLNHFASRT-IDEPLVRRLWVLKV--WGDV 212
RR RDL+D+ +SR +D R+ LK +G+V
Sbjct 188 HIRR----RDLWDMCWLSSRADVDASRAFRMATLKAGEYGEV 225
>gi|14590279|ref|NP_142345.1| hypothetical protein PH0371 [Pyrococcus horikoshii OT3]
gi|3256762|dbj|BAA29445.1| 261aa long hypothetical protein [Pyrococcus horikoshii OT3]
Length=261
Score = 39.7 bits (91), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 45/163 (28%), Positives = 70/163 (43%), Gaps = 11/163 (6%)
Query 30 DHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLG--NVGRFSTDLDFSAPDDEVVLEVCELI 87
+ L YLL Q + +++ KGGT+L + L N RFS D+D DD + E + I
Sbjct 24 EKLSYLLFQLWEIFGRKVILKGGTALNRVYLSKLNASRFSEDIDLDYFDDIPLNEKIKDI 83
Query 88 DGARVGGFEFGVQSTRGDGRHWQLRVRH-TELGEPRIVASVEFARRPLALPSELLAFIQL 146
+F V+ R R + + ELG R +EF +A ++
Sbjct 84 KEKMALIKDFDVKGPRILHRTLRFDCYYINELGN-RDRVKIEFYLSQPPFVEANIALVKS 142
Query 147 PIHKAYGFGLPTLPVVAEAEACAEK--LARYRRVALARDLYDL 187
P ++Y PT+ V E K +A Y R +D+YD+
Sbjct 143 PFVESY----PTMFRVYSFEDLLAKKLIALYNRTE-GKDIYDV 180
>gi|315231632|ref|YP_004072068.1| hypothetical protein TERMP_01870 [Thermococcus barophilus MP]
gi|315184660|gb|ADT84845.1| hypothetical protein TERMP_01870 [Thermococcus barophilus MP]
Length=338
Score = 39.3 bits (90), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 23/49 (47%), Positives = 29/49 (60%), Gaps = 2/49 (4%)
Query 36 LSQTVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDDEVVLEVC 84
L + F +N VFKGGT L KC LG RFS DLDF+ + E + E+
Sbjct 47 LKKDPHFREN-YVFKGGTYLVKCHLG-YYRFSRDLDFAYRNSEELQEMS 93
>gi|242400011|ref|YP_002995436.1| hypothetical protein TSIB_2040 [Thermococcus sibiricus MM 739]
gi|242266405|gb|ACS91087.1| hypothetical protein TSIB_2040 [Thermococcus sibiricus MM 739]
Length=136
Score = 38.5 bits (88), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/110 (30%), Positives = 53/110 (49%), Gaps = 6/110 (5%)
Query 29 QDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVG--RFSTDLDFSAPDDEVVLEVCEL 86
++ + LLSQ + + + KGGT L + L +G RFS D+D + +V E+
Sbjct 23 EEKISLLLSQLWEIFGEKAILKGGTGLNRVYLARIGTVRFSEDMDIDYFNGDVETSAQEI 82
Query 87 IDGARVGGFE-FGVQSTRGDGRHWQLRVRHTELGEPRIVASVEFA-RRPL 134
++G + G E F V+ +R R ++ +T R VEF RP+
Sbjct 83 VEGMK--GIEGFNVKGSRILHRTFRFDCYYTNTLGNRDRVKVEFYLSRPV 130
>gi|337284997|ref|YP_004624471.1| hypothetical protein PYCH_15330 [Pyrococcus yayanosii CH1]
gi|334900931|gb|AEH25199.1| hypothetical protein PYCH_15330 [Pyrococcus yayanosii CH1]
Length=253
Score = 38.1 bits (87), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/48 (48%), Positives = 29/48 (61%), Gaps = 2/48 (4%)
Query 36 LSQTVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDDEVVLEV 83
L + F +N VFKGGT L KC LG RFS DLDF+ + E + E+
Sbjct 47 LEKDPYFREN-YVFKGGTCLVKCHLGYY-RFSRDLDFAYRNSEELQEM 92
>gi|254173885|ref|ZP_04880556.1| conserved domain protein [Thermococcus sp. AM4]
gi|214032134|gb|EEB72965.1| conserved domain protein [Thermococcus sp. AM4]
Length=284
Score = 38.1 bits (87), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 44/168 (27%), Positives = 71/168 (43%), Gaps = 27/168 (16%)
Query 47 LVFKGGTSLRKCRLGNVGRFSTDLDFS----APD-DEVVLEVCELIDGARVGGFEF---- 97
L FKGGT L+K + RFS DLD++ PD +V ++ E ++ A G +F
Sbjct 42 LAFKGGTCLKKAYFSDY-RFSEDLDYTLLLEEPDIGDVQAKIAEAVEAANEGLVQFLDFE 100
Query 98 -----GVQSTRGDGRHWQLRVRHTEL----GEPRIVASVEFARRPLALPSELLAFIQLPI 148
GV+ G+ +++R+ L P+I + + LL + PI
Sbjct 101 LRPRYGVKLFPGELLGFEVRIPFRLLSRTGNPPKIKMDITLEK----YEKILLPLQERPI 156
Query 149 HKAYG----FGLPTLPVVAEAEACAEKLARYRRVALARDLYDLNHFAS 192
Y F + ++ + E AEK+ + RDLYD+ S
Sbjct 157 LHGYSDSPRFSVVSVRTYSLEEILAEKIRSLFQRTRPRDLYDIWFLKS 204
>gi|160873178|ref|YP_001552494.1| hypothetical protein Sbal195_0052 [Shewanella baltica OS195]
gi|160858700|gb|ABX47234.1| Domain of unknown function DUF1814 [Shewanella baltica OS195]
gi|315265403|gb|ADT92256.1| Domain of unknown function DUF1814 [Shewanella baltica OS678]
Length=355
Score = 38.1 bits (87), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/48 (46%), Positives = 27/48 (57%), Gaps = 1/48 (2%)
Query 26 DVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFS 73
DV +L LL GD + FKGGT+L KC G + RFS D+D S
Sbjct 40 DVWVAEILRLLYDERLLGDCSVAFKGGTALSKC-WGAIERFSEDIDLS 86
>gi|88803569|ref|ZP_01119094.1| pigmentation and extracellular proteinase regulator [Polaribacter
irgensii 23-P]
gi|88780581|gb|EAR11761.1| pigmentation and extracellular proteinase regulator [Polaribacter
irgensii 23-P]
Length=387
Score = 37.7 bits (86), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/63 (31%), Positives = 34/63 (54%), Gaps = 1/63 (1%)
Query 5 TRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVG 64
T+A+V H G+ +A ++++A++H L+++ Q FK GT + +GNVG
Sbjct 126 TKAIVPVHLFGQVANMEA-VMEIAKEHNLFVIEDNAQAIGANYTFKDGTKQKAGTIGNVG 184
Query 65 RFS 67
S
Sbjct 185 TTS 187
>gi|121608391|ref|YP_996198.1| hypothetical protein Veis_1419 [Verminephrobacter eiseniae EF01-2]
gi|121553031|gb|ABM57180.1| hypothetical protein Veis_1419 [Verminephrobacter eiseniae EF01-2]
Length=447
Score = 37.7 bits (86), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 49/154 (32%), Positives = 70/154 (46%), Gaps = 19/154 (12%)
Query 42 FGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDDEVVLEVCELIDGARVGGFEF-GVQ 100
GD LV KGGT+L ++ RFS DLDF AP L + I + G V
Sbjct 8 IGDTPLVLKGGTALLLAY--DLSRFSEDLDFDAPHK---LNLESRIQRSVPMGITLDDVA 62
Query 101 STRGDGRHWQLRVR-HTELGEPRIVASVEFARRPLALPSELLAFIQLPIHKAYGFGLPTL 159
+ + G + R + HTE G PR + +E + R SE + +G + +L
Sbjct 63 ALKDTGTVTRYRAKYHTEHG-PRSL-KLEVSYRTPTPDSE--------VRSVHGIRVASL 112
Query 160 PVVAEAEACAEKLARYRRVALARDLYDLNHFASR 193
P + + + A R A RDLYDL+ FA+R
Sbjct 113 PRIIDQKLKAAHDGHDPR-AKVRDLYDLD-FAAR 144
>gi|226325170|ref|ZP_03800688.1| hypothetical protein COPCOM_02962 [Coprococcus comes ATCC 27758]
gi|225206518|gb|EEG88872.1| hypothetical protein COPCOM_02962 [Coprococcus comes ATCC 27758]
Length=921
Score = 37.4 bits (85), Expect = 3.0, Method: Composition-based stats.
Identities = 21/47 (45%), Positives = 26/47 (56%), Gaps = 2/47 (4%)
Query 31 HLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVGRFSTDLDFSAPDD 77
H+ L S Q + LVF+GGT+LR C R+S DLDFS D
Sbjct 36 HIDLLNSFVPQMQNTSLVFQGGTALRLCY--GAPRYSEDLDFSVGSD 80
>gi|325830613|ref|ZP_08164034.1| hypothetical protein HMPREF9404_5510 [Eggerthella sp. HGA1]
gi|325487359|gb|EGC89801.1| hypothetical protein HMPREF9404_5510 [Eggerthella sp. HGA1]
Length=304
Score = 37.0 bits (84), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 25/67 (38%), Positives = 35/67 (53%), Gaps = 9/67 (13%)
Query 8 LVARHA--LGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVGR 65
LVAR A AEA+ V +D+ ++L + + +VFKGGT L KC + R
Sbjct 13 LVARAAKRYALAEAF------VIKDYFIFLALKLITQEYPEIVFKGGTCLSKCH-NAIAR 65
Query 66 FSTDLDF 72
FS D+D
Sbjct 66 FSEDVDL 72
>gi|317487825|ref|ZP_07946418.1| hypothetical protein HMPREF1023_00116 [Eggerthella sp. 1_3_56FAA]
gi|316913100|gb|EFV34616.1| hypothetical protein HMPREF1023_00116 [Eggerthella sp. 1_3_56FAA]
Length=302
Score = 37.0 bits (84), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 25/67 (38%), Positives = 35/67 (53%), Gaps = 9/67 (13%)
Query 8 LVARHA--LGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVGR 65
LVAR A AEA+ V +D+ ++L + + +VFKGGT L KC + R
Sbjct 11 LVARAAKRYALAEAF------VIKDYFIFLALKLITQEYPEIVFKGGTCLSKCH-NAIAR 63
Query 66 FSTDLDF 72
FS D+D
Sbjct 64 FSEDVDL 70
>gi|148550950|ref|YP_001260380.1| hypothetical protein Swit_4997 [Sphingomonas wittichii RW1]
gi|148503361|gb|ABQ71613.1| Domain of unknown function DUF1814 [Sphingomonas wittichii RW1]
Length=247
Score = 37.0 bits (84), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 18/27 (67%), Positives = 20/27 (75%), Gaps = 1/27 (3%)
Query 47 LVFKGGTSLRKCRLGNVGRFSTDLDFS 73
L FKGGT+LR+C N RFS DLDFS
Sbjct 49 LAFKGGTALRRCWFENY-RFSEDLDFS 74
>gi|89255316|ref|NP_659987.2| hypothetical protein RHE_PD00050 [Rhizobium etli CFN 42]
gi|89213270|gb|AAM55000.2| hypothetical conserved protein [Rhizobium etli CFN 42]
Length=144
Score = 37.0 bits (84), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 27/74 (37%), Positives = 34/74 (46%), Gaps = 21/74 (28%)
Query 47 LVFKGGTSLRKCRLGNVGRFSTDLDFS------APDDEVVLEVCELIDGARVGGFEFGVQ 100
LVFKGGTSL K G + RFS D+D + APD VG + +
Sbjct 46 LVFKGGTSLSKA-YGVIKRFSEDVDLTYDIRALAPD--------------LVGDNDEALP 90
Query 101 STRGDGRHWQLRVR 114
TR + +HW VR
Sbjct 91 KTRSEEKHWTSEVR 104
>gi|327192828|gb|EGE59754.1| hypothetical protein RHECNPAF_19005 [Rhizobium etli CNPAF512]
Length=135
Score = 37.0 bits (84), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 27/74 (37%), Positives = 34/74 (46%), Gaps = 21/74 (28%)
Query 47 LVFKGGTSLRKCRLGNVGRFSTDLDFS------APDDEVVLEVCELIDGARVGGFEFGVQ 100
LVFKGGTSL K G + RFS D+D + APD VG + +
Sbjct 37 LVFKGGTSLSKA-YGVIKRFSEDVDLTYDIRALAPD--------------LVGDNDEALP 81
Query 101 STRGDGRHWQLRVR 114
TR + +HW VR
Sbjct 82 KTRSEEKHWTSEVR 95
>gi|86134342|ref|ZP_01052924.1| DegT/DnrJ/EryC1/StrS aminotransferase family protein [Polaribacter
sp. MED152]
gi|85821205|gb|EAQ42352.1| DegT/DnrJ/EryC1/StrS aminotransferase family protein [Polaribacter
sp. MED152]
Length=387
Score = 36.6 bits (83), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 19/63 (31%), Positives = 34/63 (54%), Gaps = 1/63 (1%)
Query 5 TRALVARHALGRAEAYDAALLDVAQDHLLYLLSQTVQFGDNRLVFKGGTSLRKCRLGNVG 64
T+A+V H G+ DA +L++A++H L+++ Q FK G+ + +G+VG
Sbjct 126 TKAIVPVHLFGQVANMDA-ILEIAKEHNLFVIEDNAQAIGANYTFKDGSQQKAGTIGDVG 184
Query 65 RFS 67
S
Sbjct 185 TTS 187
>gi|336036620|gb|AEH82551.1| conserved hypothetical protein [Sinorhizobium meliloti SM11]
Length=340
Score = 36.2 bits (82), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 35/106 (34%), Positives = 46/106 (44%), Gaps = 31/106 (29%)
Query 47 LVFKGGTSLRKCRLGNVGRFSTDLDFS------APDDEVVLEVCELIDGARVGGFEFGVQ 100
LVFKGGTSL K G + RFS D+D + APD VG + +
Sbjct 53 LVFKGGTSLSKA-YGAIRRFSEDIDLTYDIRALAPD--------------LVGDNDEALP 97
Query 101 STRGDGRHWQLRVRH------TELGEPRIVASVEFARRPLALPSEL 140
TR + + W VR E EP I A+V R +LP+ +
Sbjct 98 KTRSEEKRWTSEVRKRLPVWVAESVEPVIAAAV----RGQSLPARI 139
>gi|209883381|ref|YP_002287238.1| hypothetical protein OCAR_4224 [Oligotropha carboxidovorans OM5]
gi|337739534|ref|YP_004631262.1| hypothetical protein OCA5_c02920 [Oligotropha carboxidovorans
OM5]
gi|209871577|gb|ACI91373.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5]
gi|336093620|gb|AEI01446.1| hypothetical protein OCA4_c02910 [Oligotropha carboxidovorans
OM4]
gi|336097198|gb|AEI05021.1| hypothetical protein OCA5_c02920 [Oligotropha carboxidovorans
OM5]
Length=338
Score = 36.2 bits (82), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 32/87 (37%), Positives = 40/87 (46%), Gaps = 25/87 (28%)
Query 38 QTVQFGD---NRLVFKGGTSLRKCRLGNVGRFSTDLDFS------APDDEVVLEVCELID 88
QTV FG + LVFKGGTSL K G + RFS D+D + APD
Sbjct 42 QTV-FGSALGDHLVFKGGTSLSKA-YGVIQRFSEDVDLTYDIRAIAPD------------ 87
Query 89 GARVGGFEFGVQSTRGDGRHWQLRVRH 115
VG + +TR + + W VRH
Sbjct 88 --LVGDNGEALPATRSEEKRWSKAVRH 112
>gi|16262679|ref|NP_435472.1| hypothetical protein SMa0429 [Sinorhizobium meliloti 1021]
gi|14523302|gb|AAK64884.1| conserved hypothetical protein [Sinorhizobium meliloti 1021]
Length=338
Score = 36.2 bits (82), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 35/106 (34%), Positives = 46/106 (44%), Gaps = 31/106 (29%)
Query 47 LVFKGGTSLRKCRLGNVGRFSTDLDFS------APDDEVVLEVCELIDGARVGGFEFGVQ 100
LVFKGGTSL K G + RFS D+D + APD VG + +
Sbjct 53 LVFKGGTSLSKA-YGAIRRFSEDIDLTYDIRALAPD--------------LVGDNDEALP 97
Query 101 STRGDGRHWQLRVRH------TELGEPRIVASVEFARRPLALPSEL 140
TR + + W VR E EP I A+V R +LP+ +
Sbjct 98 KTRSEEKRWTSEVRKRLPVWVAESVEPVIAAAV----RGQSLPARI 139
Lambda      K        H
   0.324    0.139    0.420

Gapped
Lambda      K        H
   0.267   0.0410    0.140
Effective search space used: 486436626624
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
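
Note on reading the scores: the bit scores and E-values in the alignment headers can be approximately recomputed from the gapped Lambda and K values and the effective search space listed in the statistics footer. The sketch below assumes the standard Karlin-Altschul relations, bits = (Lambda * S - ln K) / ln 2 and E = search_space * 2^(-bits). The function names are illustrative and are not part of BLAST. Because the hits in this report were scored with compositional adjustment, the recomputed E-value is only close to the reported one: for raw score 94 it comes out near 0.25, while the report lists 0.24 and 0.28 for the two hits with that score.

```python
import math

# Values copied from the statistics footer of this report
# (gapped Karlin-Altschul parameters and effective search space).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 486436626624


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value: search space times 2 to the minus bit score.

    This ignores the per-hit compositional adjustment, so it will not
    match the reported E-values exactly.
    """
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from the parenthesised values in the report,
    # e.g. "Score = 40.8 bits (94)".
    for raw in (94, 90, 84, 82):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.2g}")
```

The raw score is the number in parentheses after the bit score in each "Score =" line, so the same two functions can be used to sanity-check any hit in the report.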