BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2990c Length=286 Score E Sequences producing significant alignments: (Bits) Value gi|167969989|ref|ZP_02552266.1| hypothetical protein MtubH3_1893... 582 2e-164 gi|15610127|ref|NP_217506.1| hypothetical protein Rv2990c [Mycob... 581 4e-164 gi|31794166|ref|NP_856659.1| hypothetical protein Mb3014c [Mycob... 580 1e-163 gi|289575697|ref|ZP_06455924.1| conserved hypothetical protein [... 578 3e-163 gi|340627979|ref|YP_004746431.1| hypothetical protein MCAN_30121... 578 3e-163 gi|254233073|ref|ZP_04926400.1| hypothetical protein TBCG_02928 ... 574 5e-162 gi|289755106|ref|ZP_06514484.1| conserved hypothetical protein [... 376 3e-102 gi|289209598|ref|YP_003461664.1| hypothetical protein TK90_2438 ... 322 4e-86 gi|333991640|ref|YP_004524254.1| hypothetical protein JDM601_300... 291 9e-77 gi|94495628|ref|ZP_01302208.1| hypothetical protein SKA58_06250 ... 270 2e-70 gi|298707328|emb|CBJ25955.1| conserved unknown protein [Ectocarp... 211 8e-53 gi|224012811|ref|XP_002295058.1| predicted protein [Thalassiosir... 163 3e-38 gi|301111438|ref|XP_002904798.1| conserved hypothetical protein ... 152 5e-35 gi|219120308|ref|XP_002180895.1| predicted protein [Phaeodactylu... 147 1e-33 gi|320164022|gb|EFW40921.1| conserved hypothetical protein [Caps... 145 1e-32 gi|284008259|emb|CBA74578.1| conserved hypothetical protein [Ars... 133 4e-29 gi|284008258|emb|CBA74576.1| conserved hypothetical protein [Ars... 119 6e-25 gi|323454902|gb|EGB10771.1| hypothetical protein AURANDRAFT_5992... 88.2 1e-15 gi|114706990|ref|ZP_01439889.1| SAM (and some other nucleotide) ... 37.4 2.6 gi|91792858|ref|YP_562509.1| peptidoglycan binding domain-contai... 37.0 3.7 >gi|167969989|ref|ZP_02552266.1| hypothetical protein MtubH3_18938 [Mycobacterium tuberculosis H37Ra] gi|254552067|ref|ZP_05142514.1| hypothetical protein Mtube_16687 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] Length=308 Score = 582 bits (1501), Expect = 2e-164, Method: Compositional matrix adjust. Identities = 286/286 (100%), Positives = 286/286 (100%), Gaps = 0/286 (0%) Query 1 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG 60 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG Sbjct 23 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG 82 Query 61 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT 120 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT Sbjct 83 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT 142 Query 121 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG 180 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG Sbjct 143 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG 202 Query 181 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI 240 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI Sbjct 203 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI 262 Query 241 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 286 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM Sbjct 263 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 308 >gi|15610127|ref|NP_217506.1| hypothetical protein Rv2990c [Mycobacterium tuberculosis H37Rv] gi|15842546|ref|NP_337583.1| hypothetical protein MT3068 [Mycobacterium tuberculosis CDC1551] gi|121638871|ref|YP_979095.1| hypothetical protein BCG_3011c [Mycobacterium bovis BCG str. Pasteur 1173P2] 53 more sequence titlesLength=286 Score = 581 bits (1498), Expect = 4e-164, Method: Compositional matrix adjust. Identities = 286/286 (100%), Positives = 286/286 (100%), Gaps = 0/286 (0%) Query 1 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG 60 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG Sbjct 1 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG 60 Query 61 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT 120 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT Sbjct 61 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT 120 Query 121 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG 180 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG Sbjct 121 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG 180 Query 181 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI 240 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI Sbjct 181 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI 240 Query 241 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 286 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM Sbjct 241 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 286 >gi|31794166|ref|NP_856659.1| hypothetical protein Mb3014c [Mycobacterium bovis AF2122/97] gi|31619761|emb|CAD96701.1| HYPOTHETICAL PROTEIN Mb3014c [Mycobacterium bovis AF2122/97] Length=286 Score = 580 bits (1494), Expect = 1e-163, Method: Compositional matrix adjust. Identities = 285/286 (99%), Positives = 285/286 (99%), Gaps = 0/286 (0%) Query 1 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG 60 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG Sbjct 1 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG 60 Query 61 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT 120 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT Sbjct 61 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT 120 Query 121 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG 180 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG Sbjct 121 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG 180 Query 181 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI 240 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILE RRFPIRYRARYVNGQLNMCLARI Sbjct 181 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEERRFPIRYRARYVNGQLNMCLARI 240 Query 241 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 286 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM Sbjct 241 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 286 >gi|289575697|ref|ZP_06455924.1| conserved hypothetical protein [Mycobacterium tuberculosis K85] gi|339632996|ref|YP_004724638.1| hypothetical protein MAF_29950 [Mycobacterium africanum GM041182] gi|289540128|gb|EFD44706.1| conserved hypothetical protein [Mycobacterium tuberculosis K85] gi|339332352|emb|CCC28065.1| hypothetical protein MAF_29950 [Mycobacterium africanum GM041182] Length=286 Score = 578 bits (1491), Expect = 3e-163, Method: Compositional matrix adjust. Identities = 285/286 (99%), Positives = 285/286 (99%), Gaps = 0/286 (0%) Query 1 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG 60 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG Sbjct 1 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG 60 Query 61 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT 120 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT Sbjct 61 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT 120 Query 121 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG 180 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG Sbjct 121 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG 180 Query 181 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI 240 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI Sbjct 181 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI 240 Query 241 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 286 ERFSSN LGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM Sbjct 241 ERFSSNELGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 286 >gi|340627979|ref|YP_004746431.1| hypothetical protein MCAN_30121 [Mycobacterium canettii CIPT 140010059] gi|340006169|emb|CCC45343.1| hypothetical protein MCAN_30121 [Mycobacterium canettii CIPT 140010059] Length=286 Score = 578 bits (1490), Expect = 3e-163, Method: Compositional matrix adjust. Identities = 284/286 (99%), Positives = 286/286 (100%), Gaps = 0/286 (0%) Query 1 MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTG 60 MCVTWAEMP+IAALIRHIEDLHARHGRSY+LRAGISSLFRYIEGVHGERPWGTVLDAGTG Sbjct 1 MCVTWAEMPEIAALIRHIEDLHARHGRSYLLRAGISSLFRYIEGVHGERPWGTVLDAGTG 60 Query 61 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT 120 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT Sbjct 61 VKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDT 120 Query 121 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG 180 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG Sbjct 121 ILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIG 180 Query 181 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI 240 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI Sbjct 181 RVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARI 240 Query 241 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 286 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM Sbjct 241 ERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 286 >gi|254233073|ref|ZP_04926400.1| hypothetical protein TBCG_02928 [Mycobacterium tuberculosis C] gi|308232307|ref|ZP_07415620.2| hypothetical protein TMAG_01196 [Mycobacterium tuberculosis SUMu001] gi|308369922|ref|ZP_07419531.2| hypothetical protein TMBG_03140 [Mycobacterium tuberculosis SUMu002] 12 more sequence titles Length=284 Score = 574 bits (1480), Expect = 5e-162, Method: Compositional matrix adjust. Identities = 283/284 (99%), Positives = 284/284 (100%), Gaps = 0/284 (0%) Query 3 VTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTGVK 62 +TWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTGVK Sbjct 1 MTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTGVK 60 Query 63 SLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDTIL 122 SLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDTIL Sbjct 61 SLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDTIL 120 Query 123 VDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIGRV 182 VDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIGRV Sbjct 121 VDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIGRV 180 Query 183 RDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARIER 242 RDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARIER Sbjct 181 RDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARIER 240 Query 243 FSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 286 FSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM Sbjct 241 FSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM 284 >gi|289755106|ref|ZP_06514484.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054] gi|289695693|gb|EFD63122.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054] Length=185 Score = 376 bits (965), Expect = 3e-102, Method: Compositional matrix adjust. Identities = 184/185 (99%), Positives = 185/185 (100%), Gaps = 0/185 (0%) Query 102 LLVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLE 161 +LVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLE Sbjct 1 MLVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLE 60 Query 162 PYVQFEPETESGKIIWEIGRVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFP 221 PYVQFEPETESGKIIWEIGRVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFP Sbjct 61 PYVQFEPETESGKIIWEIGRVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFP 120 Query 222 IRYRARYVNGQLNMCLARIERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVI 281 IRYRARYVNGQLNMCLARIERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVI Sbjct 121 IRYRARYVNGQLNMCLARIERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVI 180 Query 282 AVEPM 286 AVEPM Sbjct 181 AVEPM 185 >gi|289209598|ref|YP_003461664.1| hypothetical protein TK90_2438 [Thioalkalivibrio sp. K90mix] gi|288945229|gb|ADC72928.1| conserved hypothetical protein [Thioalkalivibrio sp. K90mix] Length=286 Score = 322 bits (825), Expect = 4e-86, Method: Compositional matrix adjust. Identities = 152/253 (61%), Positives = 191/253 (76%), Gaps = 2/253 (0%) Query 34 GISSLFRYIEGVHGERPWGTVLDAGTGVKSLQWIQTLPTERWTAVTAARSLADKTRAALG 93 G S +F+ IE G++PWG+ LDAGTG KS++WI L TERWTAVTA++ +A TR A G Sbjct 15 GTSPIFQAIEKAQGDQPWGSFLDAGTGRKSIEWISRLDTERWTAVTASQEMARTTRKAAG 74 Query 94 SAMRPQDRLLVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLADHG 153 +A R QDR+LVGNW+ D LL GE FDT+L+DY +GAIEGFAPYWQDR RLRPH+ D Sbjct 75 TARRRQDRILVGNWMSDQLLFGERFDTVLLDYFIGAIEGFAPYWQDRALHRLRPHVGD-- 132 Query 154 RLYLVGLEPYVQFEPETESGKIIWEIGRVRDACLLLAGERPYREFPLDWMLGRLGLAGFR 213 RLYLVG+EPYV EP+ E+G ++ EIGR+RDACLL+AG RPYRE+P W++ +LG+AGFR Sbjct 133 RLYLVGVEPYVLVEPKDEAGALVREIGRLRDACLLIAGNRPYREYPSSWVMRQLGIAGFR 192 Query 214 ILEARRFPIRYRARYVNGQLNMCLARIERFSSNGLGMAMRAYVEELRARALQLNERQDGL 273 +L+ R FPI YR R+VNGQL++CL ++ FS GL AM +E LR RAL L Q+GL Sbjct 193 VLDVRYFPIHYRERFVNGQLDLCLRQLPHFSDEGLAKAMHHQIEVLRERALPLARSQEGL 252 Query 274 WHGNDYVIAVEPM 286 HG DY+I EPM Sbjct 253 KHGADYLIVAEPM 265 >gi|333991640|ref|YP_004524254.1| hypothetical protein JDM601_3000 [Mycobacterium sp. JDM601] gi|333487608|gb|AEF37000.1| conserved hypothetical protein [Mycobacterium sp. JDM601] Length=203 Score = 291 bits (744), Expect = 9e-77, Method: Compositional matrix adjust. Identities = 141/203 (70%), Positives = 169/203 (84%), Gaps = 1/203 (0%) Query 84 LADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFE 143 + D TRA G +RPQDRL++GNW+DD+LLAGETFDT+LVDYLVGAIEGFAPYWQDR+FE Sbjct 1 MVDATRATPGD-IRPQDRLMLGNWMDDNLLAGETFDTVLVDYLVGAIEGFAPYWQDRLFE 59 Query 144 RLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIGRVRDACLLLAGERPYREFPLDWM 203 RLRP +AD GRLY+ GLEPYVQ+ P TESG +IWEIGR RDACLLLAGERPYRE+PL+W+ Sbjct 60 RLRPLVADGGRLYVTGLEPYVQYRPNTESGHVIWEIGRARDACLLLAGERPYREYPLEWI 119 Query 204 LGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARIERFSSNGLGMAMRAYVEELRARA 263 L +L AGF +E+R FPIRY AR+V GQLNMC R+ERF S LG +MR Y+++L++RA Sbjct 120 LRQLEQAGFLAVESRYFPIRYGARHVYGQLNMCRNRLERFHSQALGSSMRQYIDDLQSRA 179 Query 264 LQLNERQDGLWHGNDYVIAVEPM 286 L L ER+ L +G DYVIAVEPM Sbjct 180 LALIEREGSLRYGRDYVIAVEPM 202 >gi|94495628|ref|ZP_01302208.1| hypothetical protein SKA58_06250 [Sphingomonas sp. SKA58] gi|94425016|gb|EAT10037.1| hypothetical protein SKA58_06250 [Sphingomonas sp. SKA58] Length=241 Score = 270 bits (690), Expect = 2e-70, Method: Compositional matrix adjust. Identities = 136/240 (57%), Positives = 172/240 (72%), Gaps = 2/240 (0%) Query 45 VHGERPWGTVLDAGTGVKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDRLLV 104 + G+RPWGT LDAGTG S+ W+ L T+RW AVT A A + R A RPQDR+++ Sbjct 2 LQGDRPWGTFLDAGTGTNSIGWVSGLATDRWVAVTGAAGHAVQVRDASDRVRRPQDRIIL 61 Query 105 GNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYV 164 GNW + +LLAGE FDTIL DYL+GAIEGFAPY+Q+R+F RLR LA GRLYLVGLEPYV Sbjct 62 GNWANPTLLAGERFDTILADYLIGAIEGFAPYFQERMFARLRT-LA-RGRLYLVGLEPYV 119 Query 165 QFEPETESGKIIWEIGRVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRY 224 PET G+I+ +IGR RDA LL AGERPYREFP++W+L ++ +GFR++ A RFPIRY Sbjct 120 AERPETPDGRILCDIGRWRDAVLLQAGERPYREFPMEWVLEQMTASGFRVVSAHRFPIRY 179 Query 225 RARYVNGQLNMCLARIERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVE 284 + ++VN Q++MC +R+ R L A+ A E LR AL + R+ GL HG DYVIA E Sbjct 180 QEKFVNSQIDMCASRLSRLGDRSLAAALHARGEALRQDALAIIGREGGLRHGFDYVIAAE 239 >gi|298707328|emb|CBJ25955.1| conserved unknown protein [Ectocarpus siliculosus] Length=358 Score = 211 bits (538), Expect = 8e-53, Method: Compositional matrix adjust. Identities = 110/256 (43%), Positives = 160/256 (63%), Gaps = 11/256 (4%) Query 36 SSLFRYIEGVH--GERPWGTVLDAGTGVKSLQWIQTLPTERWTAVTAARSLADKTRAALG 93 +LFR IEG+ +PWG LDAGTG SL+WI TL TE +TAVTA A+ TR +G Sbjct 64 DALFRSIEGMQKAANKPWGKFLDAGTGTHSLKWINTLNTEGFTAVTADPQFAENTRKEIG 123 Query 94 SAMRPQDRLLVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLADHG 153 ++ D ++VGNW D+ L G FDT+L DYLVGAI+GFAPY+QD+VFERL+ H+A G Sbjct 124 FKVKTPDEIVVGNWRDEKFLEGRVFDTVLADYLVGAIDGFAPYYQDQVFERLKRHVAPGG 183 Query 154 RLYLVGLEPYVQFEPETESG--KIIWEIGRVRDACLLLAGERPYREFPLDWMLGRLGLAG 211 R+YLVG++P P+ G +++ E R+RD+C+LLAG RPYRE+PLDW+ ++ +G Sbjct 184 RIYLVGMQPL----PDHPGGAAELVCEAARLRDSCILLAGHRPYREYPLDWITRQMKKSG 239 Query 212 FRILEARRFPIRYRARYVNGQLNMCLARIERFSSNG--LGMAMRAYVEELRARALQLNER 269 + A++ P+ Y V QL++ ++ ++ L A+ ++ +L R + Sbjct 240 MVVTSAKKMPVLYAPHTVKRQLDVASRKLPIIAATDPKLAAALERHISDLDGRVRKELAG 299 Query 270 QDG-LWHGNDYVIAVE 284 G + G DYV+A E Sbjct 300 AGGRVEVGFDYVVAAE 315 >gi|224012811|ref|XP_002295058.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220969497|gb|EED87838.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length=331 Score = 163 bits (412), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 106/300 (36%), Positives = 156/300 (52%), Gaps = 47/300 (15%) Query 28 SYILRAGISSLFRYIEGVHGERP---WGTVLDAGTGVKSLQWIQTL-------------- 70 S + G LF YIE E +G LDAGTG SL+WI ++ Sbjct 31 SKFAKKGGDVLFGYIEKSQAESSSPSFGRFLDAGTGSHSLRWIASVIHREHLLTDSLGDA 90 Query 71 ----PTERWTAVTAARSLADKT-RAALGSAMRPQDRLLVGNWVD----------DS---- 111 E ++A+TA + + A + + +L+GNW D DS Sbjct 91 APLVSLESYSAITADEVMMRRVIEEAESLGIADKGDVLIGNWKDGVDKNGNIEFDSDAGG 150 Query 112 ---LLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEP 168 LL G FDTIL DYLVGA++GF+PY+QD + +RL PHLA GRLY++GL+P P Sbjct 151 KKLLLEGREFDTILADYLVGAVDGFSPYFQDLIIQRLVPHLAPGGRLYIIGLQPI----P 206 Query 169 ETESG--KIIWEIGRVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRA 226 + G + I +VRDAC+ LA R YRE+P+DW+ + AG R++E R++PIRY Sbjct 207 DNVQGDADVFCRITKVRDACIKLANHRCYREYPVDWIERHVRRAGLRVVETRQYPIRYDH 266 Query 227 RYVNGQLNMCLARIERFSSNGLGMAMRAYVEELRARALQLNERQ-DG-LWHGNDYVIAVE 284 + Q+N+ ++++ F S GL M ++ L + ++ +Q DG + G DYV+ E Sbjct 267 ATMLRQINVGRSKLKLFPSKGLADEMGKVLDSLEKESKEVTAKQADGRITLGFDYVVVAE 326 >gi|301111438|ref|XP_002904798.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262095128|gb|EEY53180.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length=285 Score = 152 bits (384), Expect = 5e-35, Method: Compositional matrix adjust. Identities = 107/275 (39%), Positives = 142/275 (52%), Gaps = 30/275 (10%) Query 37 SLFRYIEGVHGE----RPWGTVLDAGTGVKSLQW-----IQTLPTERWTAVTAARSLADK 87 SLFR+IE PWG VLDAGTG SL W + +L E AVT + LA+ Sbjct 7 SLFRWIEEREHHDSSISPWGRVLDAGTGRHSLSWLLHGGVSSL-IEEVVAVTGEKPLAND 65 Query 88 TRAALGSAMRPQD---RLLVGNWVDDSLLAGET-FDTILVDYLVGAIEGFAPYWQDRVFE 143 A + P ++ GNW + + L+ E FD I+ DYLVGAIEGFAPY+QD++ + Sbjct 66 LSAEYDPSKTPHATPFKVHAGNWQNATFLSNEKPFDIIIADYLVGAIEGFAPYYQDQICD 125 Query 144 RLRPHLADHGRLYLVGLEPYVQFEP-------ETESGKIIWEIGRVRDACLLLAGERPYR 196 RL LA GR+YLVGL+P + + E E+GK+I E+ R RDACLLLAG R YR Sbjct 126 RLEKLLAPGGRIYLVGLQPLSESQTPAGSSDAEIEAGKLIQEVARTRDACLLLAGRRCYR 185 Query 197 EFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARIERFSSNGLGMAMRAYV 256 E+P++W +L G + + R Y + QL + I F L M+ + Sbjct 186 EYPIEWSQRQLEKVGLEVTNSVRLTNVYGRSAITRQLEVGRRHIPLFWDPVLAGHMQQAL 245 Query 257 --------EELRARALQLNERQDGLWHGNDYVIAV 283 EE + AL E Q + G DYVIA Sbjct 246 DCVDERLEEEFGSGALP-KEEQRRIRFGFDYVIAA 279 >gi|219120308|ref|XP_002180895.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217407611|gb|EEC47547.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length=331 Score = 147 bits (372), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 88/265 (34%), Positives = 146/265 (56%), Gaps = 18/265 (6%) Query 37 SLFRYIEGVHGERPWGTVLDAGTGVKSLQWIQTLPTERWT---AVTAARSLADKTRAALG 93 +LF +IE R +G VLDAGTG+ SL+W+ TL + A+TA R++ + + Sbjct 65 ALFGWIEEQQEGRDFGKVLDAGTGLHSLRWLATLELKGMVSVDAITADRTMQKNVQQEVD 124 Query 94 S-AMRPQDRLLVGNWVDDSLLAGET----------FDTILVDYLVGAIEGFAPYWQDRVF 142 + + R+L+GNW DS+ + +D IL DYL+GA++GF+PY QD++ Sbjct 125 ALGVSHLSRVLIGNWFPDSITEPDQNPLLQDISSDYDVILADYLIGAMDGFSPYKQDQMI 184 Query 143 ERLRPHLADHGRLYLVGLEPYVQFEPETESGKIIWEIGRVRDACLLLAGERPYREFPLDW 202 +L L GRLY+VGL+P P ++ +I + + RDAC+LLAG R YRE+P+DW Sbjct 185 SQLVGLLKPGGRLYVVGLQPIPDKTPGNDAANVICRVRQARDACILLAGHRCYREYPVDW 244 Query 203 MLGRL-GLAGFRILEARRFPIRYRARYVNGQLNMCLARIERFSSNGLGMAMRAYVEELRA 261 + ++ +L +R+FPI YR + Q+ + ++ + F L +M A +++L Sbjct 245 VQRQVEDHPDLELLPSRQFPILYRHETICKQIQVGRSKFKLFRPE-LVSSMGALLDDLEK 303 Query 262 RALQLNERQDG--LWHGNDYVIAVE 284 ++ + + + G DYV+ E Sbjct 304 QSFEATSKAPNGKIQLGFDYVVTAE 328 >gi|320164022|gb|EFW40921.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864] Length=354 Score = 145 bits (365), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 97/291 (34%), Positives = 141/291 (49%), Gaps = 59/291 (20%) Query 49 RP-WGTVLDAGTGVKSLQWIQTLPTERW-----------TAVTAARSLADKTRAALGSAM 96 RP WG +LDAGTG SL W +L R TAVTA+ + T AL + Sbjct 54 RPLWGRLLDAGTGTDSLNWALSLAPSRIQEETALDPASITAVTASVDMYRTTLRALQTYQ 113 Query 97 RPQD-----------RLLVGNWVDDSLLAG------------------------------ 115 QD L+ G W++ SLLA Sbjct 114 ARQDANPRWKQDETVHLVRGAWLNPSLLAKPTSQAWDAQAVWVQSLTSNCEEEEDSDKTA 173 Query 116 --ETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQFEPETESG 173 E FDTIL DYL+GA++GF P+ Q V RL H+ R++++G+EPY T G Sbjct 174 AYEKFDTILADYLIGAVDGFTPFHQHTVLSRLARHMVHGSRMFVLGMEPYPD-SASTPGG 232 Query 174 KIIWEIGRVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARRFPIRYRARYVNGQL 233 +++ ++ +RDAC+LLAG+RPYRE+P++W+ L LA I E+ FP+ Y AR + QL Sbjct 233 ELVLKVAALRDACILLAGQRPYREYPIEWIQDHLRLANLTIRESITFPVVYGARKLISQL 292 Query 234 NMCLARIERFSSNG---LGMAMRAYVEELRARALQLNERQDGLWHGNDYVI 281 +C +++ + G + A++ V+ L+ E GL G DYV+ Sbjct 293 EVCEYKLQLMTELGEQTIREALQERVDALKVAVENDPEIAAGLCFGADYVV 343 >gi|284008259|emb|CBA74578.1| conserved hypothetical protein [Arsenophonus nasoniae] Length=108 Score = 133 bits (334), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 55/104 (53%), Positives = 74/104 (72%), Gaps = 0/104 (0%) Query 35 ISSLFRYIEGVHGERPWGTVLDAGTGVKSLQWIQTLPTERWTAVTAARSLADKTRAALGS 94 +S+LFR+IE +HG PWG +LDAGTG+ SL WI L +E WTAVT A ++ + + + Sbjct 2 VSTLFRHIEMIHGNNPWGKILDAGTGINSLSWISQLKSESWTAVTCAINMKADIQQIISA 61 Query 95 AMRPQDRLLVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQ 138 RPQDRLL+GNW D + E FDT++ DYL+GA++GF PYWQ Sbjct 62 RQRPQDRLLLGNWADSDFMVNERFDTVIADYLLGAVDGFVPYWQ 105 >gi|284008258|emb|CBA74576.1| conserved hypothetical protein [Arsenophonus nasoniae] Length=127 Score = 119 bits (297), Expect = 6e-25, Method: Compositional matrix adjust. Identities = 53/125 (43%), Positives = 78/125 (63%), Gaps = 0/125 (0%) Query 160 LEPYVQFEPETESGKIIWEIGRVRDACLLLAGERPYREFPLDWMLGRLGLAGFRILEARR 219 +EPYV + +G ++ IGR+RDACLLLAGERPYRE+P DW++ L GF I++ + Sbjct 1 MEPYVPYNANCRAGHLVVSIGRLRDACLLLAGERPYREYPADWVIYHLQQMGFEIVDLKH 60 Query 220 FPIRYRARYVNGQLNMCLARIERFSSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDY 279 +PI Y ++ GQ+ MC R+ F L M+M ++ +L +AL +Q L HG DY Sbjct 61 YPINYGHNWLTGQMEMCRQRVNTFVDRQLAMSMLEHINQLEQQALLCIAQQGSLKHGADY 120 Query 280 VIAVE 284 VI+ + Sbjct 121 VISAK 125 >gi|323454902|gb|EGB10771.1| hypothetical protein AURANDRAFT_59920 [Aureococcus anophagefferens] Length=217 Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 63/221 (29%), Positives = 105/221 (48%), Gaps = 12/221 (5%) Query 70 LPTERWTAVTA----ARSLADKTRAALGSAMRPQDRLLVGNWVDDSLLAGETFDTILVDY 125 +P TAVTA + D+ R A A ++VGNW + LAGE +D ++ DY Sbjct 1 MPCSTLTAVTARAAGTDAYGDRLRDAFAGAAVD---VVVGNWREAGFLAGERYDVVVADY 57 Query 126 LVGAIEGFAPYWQDRVFERLRPHLADHGRLYLVGLEPYVQ-FEPETESGKIIWEIGRVRD 184 L+GA+E P+ D V RL L G L VG+EPY + ++ +++ ++ + D Sbjct 58 LLGAVELHWPHGADAVLARLLGALKPGGTLLFVGVEPYESLLDRADDADRLVLDVESLGD 117 Query 185 ACLLLAGERPYREFPLDWMLGRLGL-AGFRILEARRFPIRYRARYVNGQLNMCLARIERF 243 + LAGE YRE P W+ ++ G+ ++ + FP+ A + Q+ + Sbjct 118 SAAALAGEATYREVPEAWITRQVDARDGYAVVASETFPMTLSAASLRKQVTYARTTSAKI 177 Query 244 SSNGLGMAMRAYVEELRARALQLNERQDGLWHGNDYVIAVE 284 + GL A V EL +++N + G +Y + V+ Sbjct 178 ADAGLRKAYERRVAEL---TIEVNAWKGTHRKGRNYALVVK 215 >gi|114706990|ref|ZP_01439889.1| SAM (and some other nucleotide) binding motif:Generic methyltransferase:Bacterial regulatory protein, ArsR [Fulvimarina pelagi HTCC2506] gi|114537540|gb|EAU40665.1| SAM (and some other nucleotide) binding motif:Generic methyltransferase:Bacterial regulatory protein, ArsR [Fulvimarina pelagi HTCC2506] Length=337 Score = 37.4 bits (85), Expect = 2.6, Method: Compositional matrix adjust. Identities = 33/125 (27%), Positives = 56/125 (45%), Gaps = 12/125 (9%) Query 42 IEGVHGERPWGTVLDAGTGVKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQDR 101 ++ V G++ GT+LD GTG + + ER V +R + RA L A + Sbjct 148 LDRVLGKQRIGTMLDIGTGTGRMMEMLANRCERMLGVDTSREMISAARAKLDDAKVKNAQ 207 Query 102 LLVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPH---LADHGRLYLV 158 L VG+ + GET+D +++ ++ ++ D +R LA GRL +V Sbjct 208 LRVGDAYNLP-ANGETYDLVVLHQVL--------HYLDEPMRAVREASSVLAPGGRLVIV 258 Query 159 GLEPY 163 P+ Sbjct 259 DFAPH 263 >gi|91792858|ref|YP_562509.1| peptidoglycan binding domain-containing protein [Shewanella denitrificans OS217] gi|91714860|gb|ABE54786.1| Peptidoglycan-binding domain 1 [Shewanella denitrificans OS217] Length=498 Score = 37.0 bits (84), Expect = 3.7, Method: Compositional matrix adjust. Identities = 22/69 (32%), Positives = 34/69 (50%), Gaps = 6/69 (8%) Query 6 AEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERPWGTVLDAGTGVKSLQ 65 A +P IAA + + D H H + Y+L + S + + HG + G + G K+L Sbjct 184 ASIPAIAARLSLLGDFHGAH-QGYVLTPALESGLKAFQRRHGLKDDGVI-----GPKTLS 237 Query 66 WIQTLPTER 74 W+ LP ER Sbjct 238 WLNQLPIER 246 Lambda K H 0.325 0.141 0.447 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 461491158592 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40