BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0367c
Length=129
Score E
Sequences producing significant alignments: (Bits) Value
gi|15607508|ref|NP_214881.1| hypothetical protein Rv0367c [Mycob... 254 3e-66
gi|340625397|ref|YP_004743849.1| hypothetical protein MCAN_03681... 251 2e-65
gi|118465548|ref|YP_881661.1| hypothetical protein MAV_2469 [Myc... 207 4e-52
gi|254775129|ref|ZP_05216645.1| hypothetical protein MaviaA2_107... 207 4e-52
gi|336457322|gb|EGO36336.1| Protein of unknown function (DUF3423... 206 8e-52
gi|41407875|ref|NP_960711.1| hypothetical protein MAP1777c [Myco... 205 1e-51
gi|342859742|ref|ZP_08716395.1| hypothetical protein MCOL_12713 ... 201 2e-50
gi|254822143|ref|ZP_05227144.1| hypothetical protein MintA_19569... 191 2e-47
gi|108799626|ref|YP_639823.1| hypothetical protein Mmcs_2659 [My... 189 1e-46
gi|118468220|ref|YP_887739.1| hypothetical protein MSMEG_3435 [M... 187 3e-46
gi|145225773|ref|YP_001136451.1| hypothetical protein Mflv_5197 ... 185 2e-45
gi|296165040|ref|ZP_06847595.1| conserved hypothetical protein [... 182 1e-44
gi|120402153|ref|YP_951982.1| hypothetical protein Mvan_1141 [My... 165 2e-39
gi|226362188|ref|YP_002779966.1| hypothetical protein ROP_27740 ... 150 8e-35
gi|111025519|ref|YP_707939.1| hypothetical protein RHA1_ro08737 ... 69.3 2e-10
gi|40787300|gb|AAR90217.1| hypothetical protein PDK3.076 [Rhodoc... 66.6 1e-09
gi|334144812|ref|YP_004538021.1| hypothetical protein PP1Y_Mpl53... 50.8 7e-05
gi|154252813|ref|YP_001413637.1| hypothetical protein Plav_2370 ... 48.5 3e-04
gi|260906266|ref|ZP_05914588.1| hypothetical protein BlinB_13141... 48.5 4e-04
gi|118590086|ref|ZP_01547490.1| hypothetical protein SIAM614_155... 47.0 0.001
gi|83945300|ref|ZP_00957649.1| hypothetical protein OA2633_00995... 46.6 0.001
gi|197105590|ref|YP_002130967.1| hypothetical protein PHZ_c2127 ... 46.6 0.001
gi|335425094|ref|ZP_08554085.1| hypothetical protein SSPSH_20376... 46.2 0.001
gi|83643094|ref|YP_431529.1| hypothetical protein HCH_00187 [Hah... 43.5 0.011
gi|258654626|ref|YP_003203782.1| hypothetical protein Namu_4514 ... 42.7 0.016
gi|84502041|ref|ZP_01000199.1| hypothetical protein OB2597_18177... 40.8 0.075
gi|150377946|ref|YP_001314541.1| hypothetical protein Smed_5915 ... 40.4 0.099
gi|146280155|ref|YP_001170312.1| hypothetical protein Rsph17025_... 39.3 0.17
gi|311694042|gb|ADP96915.1| conserved hypothetical protein [Mari... 37.0 1.0
gi|281211997|gb|EFA86158.1| hypothetical protein PPL_00720 [Poly... 35.8 1.9
gi|253574463|ref|ZP_04851804.1| type II secretion system protein... 35.0 3.7
gi|226701028|gb|ACO72990.1| DRE-binding protein 1a [Zea mays] >g... 34.7 4.7
gi|328886356|emb|CCA59595.1| 3-hydroxybutyryl-CoA dehydrogenase ... 34.7 4.8
gi|301058323|ref|ZP_07199356.1| general secretion pathway protei... 34.7 5.4
gi|121583352|ref|YP_973783.1| hypothetical protein Pnap_4620 [Po... 34.3 5.6
gi|343919698|gb|EGV30441.1| hypothetical protein ThidrDRAFT_2642... 33.9 8.0
gi|149927403|ref|ZP_01915658.1| hypothetical protein LMED105_009... 33.9 8.8
gi|336248930|ref|YP_004592640.1| N-acyl-L-amino acid amidohydrol... 33.5 9.5
>gi|15607508|ref|NP_214881.1| hypothetical protein Rv0367c [Mycobacterium tuberculosis H37Rv]
gi|31791544|ref|NP_854037.1| hypothetical protein Mb0374c [Mycobacterium bovis AF2122/97]
gi|121636280|ref|YP_976503.1| hypothetical protein BCG_0405c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
71 more sequence titles
Length=129
Score = 254 bits (649), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 128/129 (99%), Positives = 129/129 (100%), Gaps = 0/129 (0%)
Query 1 VPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA 60
+PKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA
Sbjct 1 MPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA 60
Query 61 GHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG 120
GHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG
Sbjct 61 GHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG 120
Query 121 TSVVLAATP 129
TSVVLAATP
Sbjct 121 TSVVLAATP 129
>gi|340625397|ref|YP_004743849.1| hypothetical protein MCAN_03681 [Mycobacterium canettii CIPT
140010059]
gi|340003587|emb|CCC42708.1| hypothetical protein MCAN_03681 [Mycobacterium canettii CIPT
140010059]
Length=129
Score = 251 bits (642), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 127/129 (99%), Positives = 128/129 (99%), Gaps = 0/129 (0%)
Query 1 VPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA 60
+PKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA
Sbjct 1 MPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA 60
Query 61 GHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG 120
GHLPM DLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG
Sbjct 61 GHLPMRDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG 120
Query 121 TSVVLAATP 129
TSVVLAATP
Sbjct 121 TSVVLAATP 129
>gi|118465548|ref|YP_881661.1| hypothetical protein MAV_2469 [Mycobacterium avium 104]
gi|118166835|gb|ABK67732.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=140
Score = 207 bits (527), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 104/126 (83%), Positives = 116/126 (93%), Gaps = 0/126 (0%)
Query 1 VPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA 60
+ +AVDRVTRVA+DL+DSAAAEGARQSRSAKQQLDHWARVGRAVS+QHTASRRRVEAALA
Sbjct 12 MAEAVDRVTRVASDLMDSAAAEGARQSRSAKQQLDHWARVGRAVSSQHTASRRRVEAALA 71
Query 61 GHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG 120
G L +LT+EEGVVFNAEISAAI+E L+RTNYG LA QG+TTVALND G+IVEHRPDG
Sbjct 72 GRLSTAELTVEEGVVFNAEISAAIDESLARTNYGATLAGQGVTTVALNDDGEIVEHRPDG 131
Query 121 TSVVLA 126
T+VVLA
Sbjct 132 TAVVLA 137
>gi|254775129|ref|ZP_05216645.1| hypothetical protein MaviaA2_10726 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=129
Score = 207 bits (527), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 104/126 (83%), Positives = 116/126 (93%), Gaps = 0/126 (0%)
Query 1 VPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA 60
+ +AVDRVTRVA+DL+DSAAAEGARQSRSAKQQLDHWARVGRAVS+QHTASRRRVEAALA
Sbjct 1 MAEAVDRVTRVASDLMDSAAAEGARQSRSAKQQLDHWARVGRAVSSQHTASRRRVEAALA 60
Query 61 GHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG 120
G L +LT+EEGVVFNAEISAAI+E L+RTNYG LA QG+TTVALND G+IVEHRPDG
Sbjct 61 GRLSTAELTVEEGVVFNAEISAAIDESLARTNYGATLAGQGVTTVALNDDGEIVEHRPDG 120
Query 121 TSVVLA 126
T+VVLA
Sbjct 121 TAVVLA 126
>gi|336457322|gb|EGO36336.1| Protein of unknown function (DUF3423) [Mycobacterium avium subsp.
paratuberculosis S397]
Length=129
Score = 206 bits (525), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 104/126 (83%), Positives = 115/126 (92%), Gaps = 0/126 (0%)
Query 1 VPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA 60
+ +AVDRVTRVA+DL+DSAAAEGARQSRSAKQQLDHWARVGRAVS+QHTASRRRVEAALA
Sbjct 1 MAEAVDRVTRVASDLMDSAAAEGARQSRSAKQQLDHWARVGRAVSSQHTASRRRVEAALA 60
Query 61 GHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG 120
G L +LT EEGVVFNAEISAAI+E L+RTNYG LA QG+TTVALND G+IVEHRPDG
Sbjct 61 GRLSTAELTAEEGVVFNAEISAAIDESLARTNYGATLAGQGVTTVALNDDGEIVEHRPDG 120
Query 121 TSVVLA 126
T+VVLA
Sbjct 121 TAVVLA 126
>gi|41407875|ref|NP_960711.1| hypothetical protein MAP1777c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41396229|gb|AAS04094.1| hypothetical protein MAP_1777c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=129
Score = 205 bits (522), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 103/126 (82%), Positives = 115/126 (92%), Gaps = 0/126 (0%)
Query 1 VPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA 60
+ +AVDRVTRVA+DL+DSAAAEGARQSRSAKQQLDHWARVGRAVS+QHTASRRRVEAALA
Sbjct 1 MAEAVDRVTRVASDLMDSAAAEGARQSRSAKQQLDHWARVGRAVSSQHTASRRRVEAALA 60
Query 61 GHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG 120
G L ++T EEGVVFNAEISAAI+E L+RTNYG LA QG+TTVALND G+IVEHRPDG
Sbjct 61 GRLSTAEITAEEGVVFNAEISAAIDESLARTNYGATLAGQGVTTVALNDDGEIVEHRPDG 120
Query 121 TSVVLA 126
T+VVLA
Sbjct 121 TAVVLA 126
>gi|342859742|ref|ZP_08716395.1| hypothetical protein MCOL_12713 [Mycobacterium colombiense CECT
3035]
gi|342132874|gb|EGT86094.1| hypothetical protein MCOL_12713 [Mycobacterium colombiense CECT
3035]
Length=129
Score = 201 bits (512), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 101/126 (81%), Positives = 115/126 (92%), Gaps = 0/126 (0%)
Query 1 VPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA 60
+ +A+DRVTRVA+DL+DSAAAEGARQSRSAKQQLDHWARVGRAVS+QHTA RRRVEAALA
Sbjct 1 MAEALDRVTRVASDLMDSAAAEGARQSRSAKQQLDHWARVGRAVSSQHTAPRRRVEAALA 60
Query 61 GHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG 120
G L ++LT+EEGVVFNAEISAAIEE L+RT+YG LA QG+TTVALND G+IVEHRPDG
Sbjct 61 GQLATSELTVEEGVVFNAEISAAIEESLARTHYGATLAGQGVTTVALNDDGEIVEHRPDG 120
Query 121 TSVVLA 126
+VVLA
Sbjct 121 AAVVLA 126
>gi|254822143|ref|ZP_05227144.1| hypothetical protein MintA_19569 [Mycobacterium intracellulare
ATCC 13950]
Length=129
Score = 191 bits (486), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 98/123 (80%), Positives = 108/123 (88%), Gaps = 0/123 (0%)
Query 4 AVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHL 63
A DRVTRVA+DL+DSAA+EGARQSRSAKQQLDHWARVGRAVS+QHTASRRRVEAALAG L
Sbjct 4 APDRVTRVASDLMDSAASEGARQSRSAKQQLDHWARVGRAVSSQHTASRRRVEAALAGQL 63
Query 64 PMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDGTSV 123
+LT+EEGVVFNAEISAAIEE L +YG LA G+TTVALN+ GDIVEHRPDG +V
Sbjct 64 ATGELTVEEGVVFNAEISAAIEESLVHADYGATLAGHGVTTVALNEDGDIVEHRPDGAAV 123
Query 124 VLA 126
VLA
Sbjct 124 VLA 126
>gi|108799626|ref|YP_639823.1| hypothetical protein Mmcs_2659 [Mycobacterium sp. MCS]
gi|119868737|ref|YP_938689.1| hypothetical protein Mkms_2704 [Mycobacterium sp. KMS]
gi|126435269|ref|YP_001070960.1| hypothetical protein Mjls_2689 [Mycobacterium sp. JLS]
gi|108770045|gb|ABG08767.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119694826|gb|ABL91899.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126235069|gb|ABN98469.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=130
Score = 189 bits (480), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 98/124 (80%), Positives = 110/124 (89%), Gaps = 0/124 (0%)
Query 6 DRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPM 65
DRVTRVAADL+DSAA EGARQSRSAKQQLDHWARVGRAVS+ HTA+RRRVEAALAG +
Sbjct 6 DRVTRVAADLIDSAAVEGARQSRSAKQQLDHWARVGRAVSSHHTAARRRVEAALAGVAGL 65
Query 66 TDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDGTSVVL 125
L EEGVVFNAEISAAIEERL+ +YG++LAA+GITTVAL+DAG IV++RPDGTSVVL
Sbjct 66 DTLNREEGVVFNAEISAAIEERLAGADYGELLAARGITTVALDDAGRIVQYRPDGTSVVL 125
Query 126 AATP 129
TP
Sbjct 126 DDTP 129
>gi|118468220|ref|YP_887739.1| hypothetical protein MSMEG_3435 [Mycobacterium smegmatis str.
MC2 155]
gi|118169507|gb|ABK70403.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=130
Score = 187 bits (476), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/121 (80%), Positives = 108/121 (90%), Gaps = 0/121 (0%)
Query 6 DRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPM 65
DRVTR AADLV+SAAAEGARQSRSAKQQLDHWARVGRAVS+QHTA+RRRVEAALAG L +
Sbjct 6 DRVTRFAADLVESAAAEGARQSRSAKQQLDHWARVGRAVSSQHTAARRRVEAALAGDLAL 65
Query 66 TDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDGTSVVL 125
DLT EEGVVFNAEISAAIEE L+ T+YG VLA +G+TTVAL+D G IV ++PDGT+ VL
Sbjct 66 RDLTPEEGVVFNAEISAAIEENLAHTDYGQVLAGRGVTTVALDDDGAIVRYQPDGTTTVL 125
Query 126 A 126
A
Sbjct 126 A 126
>gi|145225773|ref|YP_001136451.1| hypothetical protein Mflv_5197 [Mycobacterium gilvum PYR-GCK]
gi|315446134|ref|YP_004079013.1| hypothetical protein Mspyr1_46280 [Mycobacterium sp. Spyr1]
gi|145218259|gb|ABP47663.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315264437|gb|ADU01179.1| hypothetical protein Mspyr1_46280 [Mycobacterium sp. Spyr1]
Length=129
Score = 185 bits (470), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 92/120 (77%), Positives = 110/120 (92%), Gaps = 0/120 (0%)
Query 6 DRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPM 65
DRVTR+AADL+DSAAAEGARQSRSAKQQLDHWARVGRAVS+QH+ +RR+VEAALAG +P+
Sbjct 6 DRVTRIAADLMDSAAAEGARQSRSAKQQLDHWARVGRAVSSQHSVARRKVEAALAGDVPL 65
Query 66 TDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDGTSVVL 125
DLT EEGVVFNAEISAAI+ERL R +YG VLAA+G+TTVAL++ G+IV++ PDG+SV L
Sbjct 66 RDLTDEEGVVFNAEISAAIQERLVRADYGAVLAARGVTTVALDEDGEIVQYAPDGSSVRL 125
>gi|296165040|ref|ZP_06847595.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295899688|gb|EFG79139.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=129
Score = 182 bits (463), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 93/120 (78%), Positives = 107/120 (90%), Gaps = 0/120 (0%)
Query 6 DRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPM 65
DRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVS+ TASRRR+EAALAG L
Sbjct 6 DRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSSHQTASRRRIEAALAGDLDT 65
Query 66 TDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDGTSVVL 125
L+ +EG+VFNAEISAAIEE L+ +YGD+L+A+GITTVALND G+IVE+RPDGT+ V+
Sbjct 66 GQLSDDEGLVFNAEISAAIEESLATAHYGDMLSARGITTVALNDDGEIVEYRPDGTTSVV 125
>gi|120402153|ref|YP_951982.1| hypothetical protein Mvan_1141 [Mycobacterium vanbaalenii PYR-1]
gi|119954971|gb|ABM11976.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=129
Score = 165 bits (417), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 83/120 (70%), Positives = 105/120 (88%), Gaps = 0/120 (0%)
Query 6 DRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPM 65
D+VTRVAADL+DSAA EGAR+S SA+QQLDHWARVGRAVS+QH+ +RR+VEAALAG +
Sbjct 6 DQVTRVAADLMDSAATEGARRSWSAEQQLDHWARVGRAVSSQHSVARRKVEAALAGDVHT 65
Query 66 TDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDGTSVVL 125
+L+ EEGVVFNAEISAAI+ERL+ +YG VLA +GITTVAL+D G+IV+++PDG++ L
Sbjct 66 RELSDEEGVVFNAEISAAIQERLASADYGAVLATRGITTVALDDDGEIVQYQPDGSATPL 125
>gi|226362188|ref|YP_002779966.1| hypothetical protein ROP_27740 [Rhodococcus opacus B4]
gi|226240673|dbj|BAH51021.1| hypothetical protein [Rhodococcus opacus B4]
Length=125
Score = 150 bits (378), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 71/125 (57%), Positives = 101/125 (81%), Gaps = 0/125 (0%)
Query 1 VPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA 60
+ K D+VTR ++DLVD+A+ EG R++RSA+QQL+HWARVGR VSNQ +RRRVEAAL
Sbjct 1 MKKIADKVTRFSSDLVDAASTEGERENRSARQQLEHWARVGREVSNQRHVARRRVEAALT 60
Query 61 GHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDG 120
G +P+++L++EEGVVFNAEISAA+EE L+ N+ A +G++TVAL++ G +V++ PDG
Sbjct 61 GRVPLSELSVEEGVVFNAEISAALEESLATGNHVAERAGRGLSTVALDEQGRVVKYLPDG 120
Query 121 TSVVL 125
T ++L
Sbjct 121 TQILL 125
>gi|111025519|ref|YP_707939.1| hypothetical protein RHA1_ro08737 [Rhodococcus jostii RHA1]
gi|110824498|gb|ABG99781.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=160
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/108 (40%), Positives = 55/108 (51%), Gaps = 0/108 (0%)
Query 9 TRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDL 68
TR+ +L SA G R SRSA QQ+ HWAR+GR + H+ S R V L G +L
Sbjct 43 TRIDNELYASAKLVGGRMSRSAAQQIAHWARIGRELEASHSVSYRDVADVLDGRRDYDEL 102
Query 69 TLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEH 116
T E V AE + I ER N + A G + V L+ G+IV H
Sbjct 103 TDREQAVVRAEWTERITERREGLNLAEQFAHSGRSYVELDQHGNIVRH 150
>gi|40787300|gb|AAR90217.1| hypothetical protein PDK3.076 [Rhodococcus sp. DK17]
Length=353
Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/106 (40%), Positives = 54/106 (51%), Gaps = 0/106 (0%)
Query 9 TRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDL 68
TR+ +L SA G R SRSA QQ+ HWAR+GR + H+ S R V L G +L
Sbjct 43 TRIDNELYASAKLVGGRMSRSAAQQIAHWARIGRELEASHSVSYRDVADVLDGRRDYDEL 102
Query 69 TLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIV 114
T E V AE + I ER N + A G + V L+ G+IV
Sbjct 103 TDREQAVVRAEWTERITERREGLNLAEQFAHSGRSYVELDQHGNIV 148
>gi|334144812|ref|YP_004538021.1| hypothetical protein PP1Y_Mpl535 [Novosphingobium sp. PP1Y]
gi|333936695|emb|CCA90054.1| conserved hypothetical protein [Novosphingobium sp. PP1Y]
Length=118
Score = 50.8 bits (120), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/108 (33%), Positives = 51/108 (48%), Gaps = 0/108 (0%)
Query 10 RVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDLT 69
++ D++ E QSRS Q+ HW R+GRA+ R+ AALAG + TDLT
Sbjct 6 KLGDDIMKIVRRESELQSRSIAGQIAHWVRIGRAIEKSGNFDHARITAALAGDIQTTDLT 65
Query 70 LEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHR 117
EE V+ +E+ S N Q V L+ AG++V +
Sbjct 66 DEEKDVWLDSFIEKMEQPGSDENAFFARRRQYGLGVGLDAAGNVVREK 113
>gi|154252813|ref|YP_001413637.1| hypothetical protein Plav_2370 [Parvibaculum lavamentivorans
DS-1]
gi|154156763|gb|ABS63980.1| conserved hypothetical protein [Parvibaculum lavamentivorans
DS-1]
Length=127
Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/67 (38%), Positives = 36/67 (54%), Gaps = 0/67 (0%)
Query 10 RVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDLT 69
++A D++ E QSRS Q+ HW R+GRA+ R+ AAL+G + DLT
Sbjct 15 KLADDIMKIVRRESELQSRSVSGQVAHWVRIGRAIEKSGNFDYARITAALSGEIGTVDLT 74
Query 70 LEEGVVF 76
EE V+
Sbjct 75 CEEKDVW 81
>gi|260906266|ref|ZP_05914588.1| hypothetical protein BlinB_13141 [Brevibacterium linens BL2]
Length=110
Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/113 (32%), Positives = 56/113 (50%), Gaps = 10/113 (8%)
Query 11 VAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDLTL 70
+ AD+ +SA A SR+ QQ+ HWAR+GR + + T + R++ LAG L
Sbjct 1 MPADVYESAVAAAKAASRTVPQQIAHWARIGREMESSPTVNHRQITQVLAGTSSYDSLAE 60
Query 71 EEGVVFNAEISAAIEERLSRT-----NYGDVLAAQGITTVALNDAGDIVEHRP 118
E + + A EER +RT +Y + G L++ G++V HRP
Sbjct 61 REQAI----VREAWEER-TRTLRKGLDYAAGFDSAGEEYSELDEDGNLVVHRP 108
>gi|118590086|ref|ZP_01547490.1| hypothetical protein SIAM614_15515 [Stappia aggregata IAM 12614]
gi|118437583|gb|EAV44220.1| hypothetical protein SIAM614_15515 [Stappia aggregata IAM 12614]
Length=127
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/67 (36%), Positives = 35/67 (53%), Gaps = 0/67 (0%)
Query 11 VAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDLTL 70
+ D++ E Q+RS +Q+ W R+GR + + RV AALAG L TDLT
Sbjct 12 LCDDVMSLVCCEAELQNRSVSEQITLWLRIGRVIEKSGAFDQARVSAALAGELQTTDLTA 71
Query 71 EEGVVFN 77
E V++
Sbjct 72 LEKAVWS 78
>gi|83945300|ref|ZP_00957649.1| hypothetical protein OA2633_00995 [Oceanicaulis alexandrii HTCC2633]
gi|83851470|gb|EAP89326.1| hypothetical protein OA2633_00995 [Oceanicaulis alexandrii HTCC2633]
Length=118
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/114 (31%), Positives = 54/114 (48%), Gaps = 8/114 (7%)
Query 10 RVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDLT 69
+++ D++ E RQSRS Q+ HW R+GRA+ R+ A LAG L+
Sbjct 6 KLSDDIMKLVRTESERQSRSIAGQIAHWVRIGRAIETSGNFDHARINAVLAGEAGPNTLS 65
Query 70 LEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITT----VALNDAGDIVEHRPD 119
EE V+ SAA L+ + G+ +G V L++ G++V PD
Sbjct 66 DEEHDVWLDAFSAA----LAEPSDGEEAFFEGRRQLGRGVGLDENGELVRETPD 115
>gi|197105590|ref|YP_002130967.1| hypothetical protein PHZ_c2127 [Phenylobacterium zucineum HLK1]
gi|196479010|gb|ACG78538.1| hypothetical protein PHZ_c2127 [Phenylobacterium zucineum HLK1]
Length=122
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/71 (41%), Positives = 38/71 (54%), Gaps = 0/71 (0%)
Query 15 LVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDLTLEEGV 74
LVD+A E RS Q++HWA +GRA+ S RV AALAG L + DL+ E
Sbjct 14 LVDAAREEAELFHRSLSGQIEHWATLGRALETAQGVSLDRVRAALAGGLKIEDLSDVEQD 73
Query 75 VFNAEISAAIE 85
F A + A +
Sbjct 74 AFFANLGEAFD 84
>gi|335425094|ref|ZP_08554085.1| hypothetical protein SSPSH_20376 [Salinisphaera shabanensis E1L3A]
gi|334886770|gb|EGM25117.1| hypothetical protein SSPSH_20376 [Salinisphaera shabanensis E1L3A]
Length=111
Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/69 (34%), Positives = 39/69 (57%), Gaps = 0/69 (0%)
Query 10 RVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDLT 69
RV+ +L +++ AE RS Q+++WAR+GRA+ + + AL +P+ DL+
Sbjct 7 RVSEELSNASKAESRLMHRSQAGQIEYWARIGRAIEQSGQFDYQHIARALKAEIPVDDLS 66
Query 70 LEEGVVFNA 78
E VF+A
Sbjct 67 AYEKPVFDA 75
>gi|83643094|ref|YP_431529.1| hypothetical protein HCH_00187 [Hahella chejuensis KCTC 2396]
gi|83631137|gb|ABC27104.1| conserved hypothetical protein [Hahella chejuensis KCTC 2396]
Length=113
Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/75 (30%), Positives = 42/75 (56%), Gaps = 4/75 (5%)
Query 1 VPKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALA 60
+P++V R+ L+DSA E RS + Q++HWA++G+ V S R+ + L+
Sbjct 1 MPQSV----RIDDFLIDSARREAKGAHRSVQGQIEHWAKIGQMVERSGVLSYERIRSFLS 56
Query 61 GHLPMTDLTLEEGVV 75
G + + +L +E ++
Sbjct 57 GEIQIDNLNNDERLM 71
>gi|258654626|ref|YP_003203782.1| hypothetical protein Namu_4514 [Nakamurella multipartita DSM
44233]
gi|258557851|gb|ACV80793.1| hypothetical protein Namu_4514 [Nakamurella multipartita DSM
44233]
Length=118
Score = 42.7 bits (99), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 36/106 (34%), Positives = 47/106 (45%), Gaps = 0/106 (0%)
Query 9 TRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDL 68
TR+ DL ++A A SRS QQ+ HWARVGR + S R V+ LAG P L
Sbjct 6 TRLPDDLYEAARRAAAVASRSTAQQIAHWARVGRELEASPDVSIREVQRVLAGLGPYASL 65
Query 69 TLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIV 114
V AE I + + N+ A G T + + G V
Sbjct 66 NEGGQAVVRAEWDERIADGIGELNFAAEFTAAGDTWIVGDGKGGAV 111
>gi|84502041|ref|ZP_01000199.1| hypothetical protein OB2597_18177 [Oceanicola batsensis HTCC2597]
gi|84390036|gb|EAQ02670.1| hypothetical protein OB2597_18177 [Oceanicola batsensis HTCC2597]
Length=118
Score = 40.8 bits (94), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 20/51 (40%), Positives = 28/51 (55%), Gaps = 0/51 (0%)
Query 26 QSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDLTLEEGVVF 76
SRS Q+ HW ++GRA+ + + R+ AAL G L T L EE V +
Sbjct 22 HSRSVAGQITHWLKIGRAIEHSGSFDYARITAALEGRLDTTQLGAEEEVAW 72
>gi|150377946|ref|YP_001314541.1| hypothetical protein Smed_5915 [Sinorhizobium medicae WSM419]
gi|150032493|gb|ABR64608.1| hypothetical protein Smed_5915 [Sinorhizobium medicae WSM419]
Length=118
Score = 40.4 bits (93), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 23/77 (30%), Positives = 38/77 (50%), Gaps = 0/77 (0%)
Query 10 RVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDLT 69
++A D++ E QSRS Q+ HW ++GRA+ R++ AL G L T+L
Sbjct 6 KLADDVMSLVRREAELQSRSVAGQIAHWIKIGRAIERSSAFDYSRIKQALEGRLDTTELK 65
Query 70 LEEGVVFNAEISAAIEE 86
E + E++ + E
Sbjct 66 EGEEAAWLDELTNKMAE 82
>gi|146280155|ref|YP_001170312.1| hypothetical protein Rsph17025_4156 [Rhodobacter sphaeroides
ATCC 17025]
gi|145558396|gb|ABP73007.1| hypothetical protein Rsph17025_4156 [Rhodobacter sphaeroides
ATCC 17025]
Length=118
Score = 39.3 bits (90), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 35/108 (33%), Positives = 51/108 (48%), Gaps = 6/108 (5%)
Query 10 RVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDLT 69
++A D++ E SRS Q+ HW R+G+A+ RV AAL G L +L
Sbjct 6 KLADDVMAQVRREAELHSRSVAGQITHWLRLGQAIEQSGAYDHARVTAALEGRLDTVELG 65
Query 70 LEEGVVFNAEISAAIEE--RLSRTNYGDVLAAQGIT-TVALNDAGDIV 114
EE + + I A E+ R SRT Q + V L+ AG++V
Sbjct 66 EEEEIAW---IDAFTEKMSRPSRTEQAFFAKRQRLGRGVGLDAAGNLV 110
>gi|311694042|gb|ADP96915.1| conserved hypothetical protein [Marinobacter adhaerens HP15]
Length=137
Score = 37.0 bits (84), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 19/41 (47%), Positives = 27/41 (66%), Gaps = 0/41 (0%)
Query 10 RVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTA 50
R+ LV AAAEGA RS +Q+++WA +GRAV+ +A
Sbjct 6 RLDDSLVRHAAAEGAVNRRSTPKQIEYWAEIGRAVAGDVSA 46
>gi|281211997|gb|EFA86158.1| hypothetical protein PPL_00720 [Polysphondylium pallidum PN500]
Length=1208
Score = 35.8 bits (81), Expect = 1.9, Method: Composition-based stats.
Identities = 23/74 (32%), Positives = 38/74 (52%), Gaps = 2/74 (2%)
Query 22 EGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAALAGHLPMTDLTLEEGVVFNAEIS 81
E + R QL+ W R+ +V++ T R + L G LP++ + LE G FN+ I
Sbjct 541 ESIKTLRGVSGQLNEWVRIPDSVTSL-TFGRHFNQPILKGMLPVSLIYLEFGYHFNSTIH 599
Query 82 A-AIEERLSRTNYG 94
++ +RL N+G
Sbjct 600 PHSLPDRLEVLNFG 613
>gi|253574463|ref|ZP_04851804.1| type II secretion system protein E [Paenibacillus sp. oral taxon
786 str. D14]
gi|251846168|gb|EES74175.1| type II secretion system protein E [Paenibacillus sp. oral taxon
786 str. D14]
Length=559
Score = 35.0 bits (79), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 17/41 (42%), Positives = 27/41 (66%), Gaps = 1/41 (2%)
Query 65 MTDLTLEEGVVFNAEISAAIEERL-SRTNYGDVLAAQGITT 104
+ +L LE G++ ++ AA+EE+ +R GDVL AQG+ T
Sbjct 8 LGELLLESGIITEQQLQAALEEQQRTRKKLGDVLLAQGVLT 48
>gi|226701028|gb|ACO72990.1| DRE-binding protein 1a [Zea mays]
gi|238007086|gb|ACR34578.1| unknown [Zea mays]
Length=367
Score = 34.7 bits (78), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 20/65 (31%), Positives = 33/65 (51%), Gaps = 8/65 (12%)
Query 63 LPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQGITTVALNDAGDIVEHRPDGTS 122
+PM L+ G + AE SA + +G + G T+ALN+ +V+ +P G S
Sbjct 9 MPMQPPALQPGRAYGAEGSAVV--------HGSIRTVAGGPTLALNECQILVQQKPQGDS 60
Query 123 VVLAA 127
+LA+
Sbjct 61 RLLAS 65
>gi|328886356|emb|CCA59595.1| 3-hydroxybutyryl-CoA dehydrogenase ; 3-hydroxyacyl-CoA dehydrogenase
[Streptomyces venezuelae ATCC 10712]
Length=593
Score = 34.7 bits (78), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 27/71 (39%), Positives = 33/71 (47%), Gaps = 2/71 (2%)
Query 2 PKAVDRVTRVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAVSNQHTASRRRVEAA--L 59
P+AVD VTR+A DL A G R A L + A+ ASR ++AA L
Sbjct 163 PRAVDAVTRLAQDLGKEPVAVGDRAGFIADGLLFGYLNQAAAMYEAKYASREDIDAAMKL 222
Query 60 AGHLPMTDLTL 70
LPM L L
Sbjct 223 GCGLPMGPLAL 233
>gi|301058323|ref|ZP_07199356.1| general secretion pathway protein E family protein [delta proteobacterium
NaphS2]
gi|300447559|gb|EFK11291.1| general secretion pathway protein E family protein [delta proteobacterium
NaphS2]
Length=649
Score = 34.7 bits (78), Expect = 5.4, Method: Composition-based stats.
Identities = 17/63 (27%), Positives = 31/63 (50%), Gaps = 0/63 (0%)
Query 42 RAVSNQHTASRRRVEAALAGHLPMTDLTLEEGVVFNAEISAAIEERLSRTNYGDVLAAQG 101
R +S QH ++++ A G+ P+ + +E + ++ +E R N GD+L G
Sbjct 19 RLISEQHLIEAQKIQNAEDGYKPIGQVLVEMEAITRNQLDLVLERFNKRANLGDILLRSG 78
Query 102 ITT 104
I T
Sbjct 79 IIT 81
>gi|121583352|ref|YP_973783.1| hypothetical protein Pnap_4620 [Polaromonas naphthalenivorans
CJ2]
gi|120596606|gb|ABM40041.1| conserved hypothetical protein [Polaromonas naphthalenivorans
CJ2]
Length=74
Score = 34.3 bits (77), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 14/35 (40%), Positives = 25/35 (72%), Gaps = 0/35 (0%)
Query 10 RVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAV 44
+++ +LV A A A + RS +Q+++WAR+G+AV
Sbjct 8 KLSDELVQDAKAVAAAEHRSVPKQIEYWARIGKAV 42
>gi|343919698|gb|EGV30441.1| hypothetical protein ThidrDRAFT_2642 [Thiorhodococcus drewsii
AZ1]
Length=124
Score = 33.9 bits (76), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 23/67 (35%), Positives = 36/67 (54%), Gaps = 3/67 (4%)
Query 16 VDSAAAEGARQSRSAKQQLDHWARVGRAVSN-QHTASRRRVEAALAGHL--PMTDLTLEE 72
+ +AA G R RS +Q+++WA +GR VS H + + A LA P+T L+
Sbjct 1 MQAAAVTGERFHRSTAEQIEYWASIGRQVSQLLHPDALLSITAGLARVRVEPVTTAPLDP 60
Query 73 GVVFNAE 79
VFN++
Sbjct 61 NEVFNSQ 67
>gi|149927403|ref|ZP_01915658.1| hypothetical protein LMED105_00917 [Limnobacter sp. MED105]
gi|149823895|gb|EDM83120.1| hypothetical protein LMED105_00917 [Limnobacter sp. MED105]
Length=136
Score = 33.9 bits (76), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 13/35 (38%), Positives = 23/35 (66%), Gaps = 0/35 (0%)
Query 10 RVAADLVDSAAAEGARQSRSAKQQLDHWARVGRAV 44
R+ +DL+ +A G RSA +Q+++WA +G+ V
Sbjct 9 RIQSDLMSNATVLGKLNHRSAAEQIEYWASIGQKV 43
>gi|336248930|ref|YP_004592640.1| N-acyl-L-amino acid amidohydrolase; aminoacylase [Enterobacter
aerogenes KCTC 2190]
gi|334734986|gb|AEG97361.1| N-acyl-L-amino acid amidohydrolase; aminoacylase [Enterobacter
aerogenes KCTC 2190]
Length=393
Score = 33.5 bits (75), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 17/55 (31%), Positives = 29/55 (53%), Gaps = 1/55 (1%)
Query 37 WARVGRAVSNQHTASRRRVEAALAGHLPMTDLTLEEGVVFNAEISAAIEERLSRT 91
W + G AV N H A+ A+A H P L L++ +F +E ++ +E++ T
Sbjct 293 WQQ-GYAVGNNHDATNHIAREAIARHFPAGTLQLQDKALFGSEDFSSYQEKIPGT 346
Lambda K H
0.314 0.126 0.345
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128283502052
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40