BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3922c
Length=120
Score E
Sequences producing significant alignments: (Bits) Value
gi|15611058|ref|NP_218439.1| hypothetical protein Rv3922c [Mycob... 246 7e-64
gi|308232612|ref|ZP_07664133.1| hemolysin [Mycobacterium tubercu... 207 4e-52
gi|254818676|ref|ZP_05223677.1| hypothetical protein MintA_02069... 153 7e-36
gi|118620077|ref|YP_908409.1| hypothetical protein MUL_5075 [Myc... 153 7e-36
gi|240168406|ref|ZP_04747065.1| hypothetical protein MkanA1_0378... 150 8e-35
gi|336459813|gb|EGO38727.1| conserved hypothetical protein YidD ... 149 1e-34
gi|41410446|ref|NP_963282.1| hypothetical protein MAP4348c [Myco... 149 1e-34
gi|118465987|ref|YP_884421.1| hypothetical protein MAV_5311 [Myc... 147 4e-34
gi|342862350|ref|ZP_08718991.1| hypothetical protein MCOL_25788 ... 137 4e-31
gi|108802370|ref|YP_642567.1| hypothetical protein Mmcs_5411 [My... 133 1e-29
gi|118469284|ref|YP_891136.1| hypothetical protein MSMEG_6944 [M... 130 9e-29
gi|145221424|ref|YP_001132102.1| hypothetical protein Mflv_0830 ... 126 1e-27
gi|120407005|ref|YP_956834.1| hypothetical protein Mvan_6076 [My... 117 4e-25
gi|312142025|ref|YP_004009361.1| hypothetical protein REQ_47440 ... 102 1e-20
gi|226309517|ref|YP_002769479.1| hypothetical protein RER_60320 ... 102 1e-20
gi|296141905|ref|YP_003649148.1| hypothetical protein Tpau_4241 ... 100 9e-20
gi|226362892|ref|YP_002780672.1| hypothetical protein ROP_34800 ... 99.4 2e-19
gi|111020652|ref|YP_703624.1| hypothetical protein RHA1_ro03663 ... 99.4 2e-19
gi|262204660|ref|YP_003275868.1| hypothetical protein Gbro_4862 ... 96.7 9e-19
gi|291005727|ref|ZP_06563700.1| hypothetical protein SeryN2_1449... 95.5 2e-18
gi|134103815|ref|YP_001109476.1| hypothetical protein SACE_7395 ... 95.1 3e-18
gi|54027645|ref|YP_121887.1| hypothetical protein nfa56710 [Noca... 94.4 4e-18
gi|256381074|ref|YP_003104734.1| hypothetical protein Amir_7098 ... 92.0 2e-17
gi|205374603|ref|ZP_03227398.1| hypothetical protein Bcoam_16106... 92.0 3e-17
gi|158333932|ref|YP_001515104.1| hypothetical protein AM1_0746 [... 91.7 3e-17
gi|196250028|ref|ZP_03148723.1| protein of unknown function DUF3... 91.7 3e-17
gi|257065516|ref|YP_003145188.1| conserved hypothetical protein ... 90.9 6e-17
gi|254302383|ref|ZP_04969741.1| possible alpha-hemolysin [Fusoba... 90.5 7e-17
gi|289523249|ref|ZP_06440103.1| alpha-hemolysin [Anaerobaculum h... 90.1 9e-17
gi|343928732|ref|ZP_08768177.1| hypothetical protein GOALK_120_0... 90.1 1e-16
gi|302880139|ref|YP_003848703.1| hypothetical protein Galf_2947 ... 90.1 1e-16
gi|237743157|ref|ZP_04573638.1| alpha-hemolysin [Fusobacterium s... 89.7 1e-16
gi|119475234|ref|ZP_01615587.1| hypothetical protein GP2143_1548... 88.6 3e-16
gi|158341482|ref|YP_001522647.1| hypothetical protein AM1_G0153 ... 88.6 3e-16
gi|329894826|ref|ZP_08270626.1| YidD [gamma proteobacterium IMCC... 88.6 3e-16
gi|335039144|ref|ZP_08532326.1| UPF0161 protein yidD [Caldalkali... 88.2 3e-16
gi|289524496|ref|ZP_06441350.1| putative alpha-hemolysin [Anaero... 88.2 3e-16
gi|15615392|ref|NP_243695.1| hypothetical protein BH2828 [Bacill... 88.2 4e-16
gi|86160774|ref|YP_467559.1| hypothetical protein Adeh_4359 [Ana... 88.2 4e-16
gi|19703355|ref|NP_602917.1| alpha-hemolysin [Fusobacterium nucl... 88.2 4e-16
gi|260424965|ref|ZP_05779066.1| putative toxin-antitoxin system,... 88.2 4e-16
gi|269792318|ref|YP_003317222.1| hypothetical protein Taci_0704 ... 87.8 4e-16
gi|239828120|ref|YP_002950744.1| hypothetical protein GWCH70_279... 87.8 5e-16
gi|342214934|ref|ZP_08707604.1| hypothetical protein HMPREF9200_... 87.4 6e-16
gi|56752481|ref|YP_173182.1| hypothetical protein syc2472_c [Syn... 87.4 6e-16
gi|269216521|ref|ZP_06160375.1| YidD family protein [Slackia exi... 87.4 7e-16
gi|172035662|ref|YP_001802163.1| hypothetical protein cce_0746 [... 87.0 8e-16
gi|326383906|ref|ZP_08205590.1| hypothetical protein SCNU_13273 ... 87.0 8e-16
gi|86607211|ref|YP_475974.1| hypothetical protein CYA_2594 [Syne... 87.0 9e-16
gi|197124876|ref|YP_002136827.1| hypothetical protein AnaeK_4496... 87.0 9e-16
>gi|15611058|ref|NP_218439.1| hypothetical protein Rv3922c [Mycobacterium tuberculosis H37Rv]
gi|15843556|ref|NP_338593.1| hypothetical protein MT4040.1 [Mycobacterium tuberculosis CDC1551]
gi|31795095|ref|NP_857588.1| hypothetical protein Mb3953c [Mycobacterium bovis AF2122/97]
64 more sequence titles
Length=120
Score = 246 bits (628), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 119/120 (99%), Positives = 120/120 (100%), Gaps = 0/120 (0%)
Query 1 VSLSRQSCGRVVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYG 60
+SLSRQSCGRVVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYG
Sbjct 1 MSLSRQSCGRVVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYG 60
Query 61 LLRGSWLTMIRLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV 120
LLRGSWLTMIRLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV
Sbjct 61 LLRGSWLTMIRLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV 120
>gi|308232612|ref|ZP_07664133.1| hemolysin [Mycobacterium tuberculosis SUMu001]
gi|308369278|ref|ZP_07666709.1| hemolysin [Mycobacterium tuberculosis SUMu002]
gi|308371503|ref|ZP_07667185.1| hemolysin [Mycobacterium tuberculosis SUMu003]
17 more sequence titles
Length=98
Score = 207 bits (527), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 97/98 (99%), Positives = 98/98 (100%), Gaps = 0/98 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
+IFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG
Sbjct 1 MIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 60
Query 83 WDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV 120
WDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV
Sbjct 61 WDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV 98
>gi|254818676|ref|ZP_05223677.1| hypothetical protein MintA_02069 [Mycobacterium intracellulare
ATCC 13950]
Length=116
Score = 153 bits (387), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 74/116 (64%), Positives = 86/116 (75%), Gaps = 4/116 (3%)
Query 5 RQSCGRVVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRG 64
R + G + GR+ ARGLIF+IQ+YRHM+SPLRPA+CRFVPTCSQYAVDAL EYGL+RG
Sbjct 4 RSAVGALTHTVGRSVARGLIFLIQLYRHMVSPLRPATCRFVPTCSQYAVDALNEYGLIRG 63
Query 65 SWLTMIRLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV 120
SWL +RLAKCGPWH GGWDPIP+ CQ D A+D + RGE ESFV
Sbjct 64 SWLAAVRLAKCGPWHEGGWDPIPD----RPGCQGDCQDASDARVVRATRGESESFV 115
>gi|118620077|ref|YP_908409.1| hypothetical protein MUL_5075 [Mycobacterium ulcerans Agy99]
gi|183985456|ref|YP_001853747.1| hypothetical protein MMAR_5578 [Mycobacterium marinum M]
gi|118572187|gb|ABL06938.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
gi|183178782|gb|ACC43892.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=102
Score = 153 bits (387), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 74/110 (68%), Positives = 90/110 (82%), Gaps = 8/110 (7%)
Query 11 VVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMI 70
+VR +GR +ARGLI+VI++YRHM+SPLRPASCRF+PTCSQYAVDAL+EYGL+RGSWLT+
Sbjct 1 MVRTSGRLAARGLIYVIELYRHMVSPLRPASCRFIPTCSQYAVDALSEYGLIRGSWLTVA 60
Query 71 RLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV 120
RLAKCGPWH+GGWDPIPE + C+ D G +PA K+GE E FV
Sbjct 61 RLAKCGPWHQGGWDPIPE----RQGCRADEAGC----SPAVKQGESEFFV 102
>gi|240168406|ref|ZP_04747065.1| hypothetical protein MkanA1_03787 [Mycobacterium kansasii ATCC
12478]
Length=103
Score = 150 bits (378), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 77/110 (70%), Positives = 85/110 (78%), Gaps = 7/110 (6%)
Query 11 VVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMI 70
+V+ TGR +AR LIFVI +YRHM+SPLRPASCRFVPTCSQYAV ALTEYGL+RGSWLT+
Sbjct 1 MVKATGRWTARALIFVINLYRHMVSPLRPASCRFVPTCSQYAVAALTEYGLIRGSWLTVA 60
Query 71 RLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV 120
RLAKCGPWHRGGWDPIPE C D PA K+GERESFV
Sbjct 61 RLAKCGPWHRGGWDPIPERQRYSDCCAVGGD------LPA-KQGERESFV 103
>gi|336459813|gb|EGO38727.1| conserved hypothetical protein YidD [Mycobacterium avium subsp.
paratuberculosis S397]
Length=126
Score = 149 bits (377), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/112 (65%), Positives = 83/112 (75%), Gaps = 4/112 (3%)
Query 9 GRVVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLT 68
G +R GR ARGLIF+IQ+YRHM+SPLRPA+CRFVPTCSQYAVDAL EYGL+RGSWL
Sbjct 19 GAAIRGAGRTVARGLIFLIQLYRHMVSPLRPATCRFVPTCSQYAVDALDEYGLIRGSWLA 78
Query 69 MIRLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV 120
RLAKCGPWH+GGWDPIPE C+ + A+D W + RGE S V
Sbjct 79 AARLAKCGPWHQGGWDPIPE----RPGCRVNCQDASDAWAVRATRGESGSLV 126
>gi|41410446|ref|NP_963282.1| hypothetical protein MAP4348c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|14286069|sp|Q9L7M0.1|Y4348_MYCPA RecName: Full=UPF0161 protein MAP_4348c
gi|6969276|gb|AAF33697.1| unknown [Mycobacterium avium subsp. paratuberculosis]
gi|41399280|gb|AAS06898.1| hypothetical protein MAP_4348c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=115
Score = 149 bits (376), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/112 (65%), Positives = 83/112 (75%), Gaps = 4/112 (3%)
Query 9 GRVVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLT 68
G +R GR ARGLIF+IQ+YRHM+SPLRPA+CRFVPTCSQYAVDAL EYGL+RGSWL
Sbjct 8 GAAIRGAGRTVARGLIFLIQLYRHMVSPLRPATCRFVPTCSQYAVDALDEYGLIRGSWLA 67
Query 69 MIRLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV 120
RLAKCGPWH+GGWDPIPE C+ + A+D W + RGE S V
Sbjct 68 AARLAKCGPWHQGGWDPIPE----RPGCRVNCQDASDAWAVRATRGESGSLV 115
>gi|118465987|ref|YP_884421.1| hypothetical protein MAV_5311 [Mycobacterium avium 104]
gi|254777660|ref|ZP_05219176.1| hypothetical protein MaviaA2_23726 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|166227311|sp|A0QND3.1|Y5311_MYCA1 RecName: Full=UPF0161 protein MAV_5311
gi|118167274|gb|ABK68171.1| conserved hypothetical protein, putative [Mycobacterium avium
104]
Length=115
Score = 147 bits (372), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 71/112 (64%), Positives = 82/112 (74%), Gaps = 4/112 (3%)
Query 9 GRVVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLT 68
G +R GR ARGLIF+IQ+YRHM+SPLRPA+CRFVPTCSQYAVDAL EYGL+RGSWL
Sbjct 8 GAAIRGAGRTVARGLIFLIQLYRHMVSPLRPATCRFVPTCSQYAVDALDEYGLIRGSWLA 67
Query 69 MIRLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV 120
RL KCGPWH+GGWDPIPE C+ + A+D W + RGE S V
Sbjct 68 AARLTKCGPWHQGGWDPIPE----RPGCRVNCQDASDAWAVRATRGESGSLV 115
>gi|342862350|ref|ZP_08718991.1| hypothetical protein MCOL_25788 [Mycobacterium colombiense CECT
3035]
gi|342130207|gb|EGT83535.1| hypothetical protein MCOL_25788 [Mycobacterium colombiense CECT
3035]
Length=95
Score = 137 bits (346), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 66/98 (68%), Positives = 76/98 (78%), Gaps = 4/98 (4%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
+IF+IQ+YRHM+SPLRPA+CRFVPTCSQYAVDAL EYGL+RGSWL RLAKCGPWH GG
Sbjct 1 MIFLIQLYRHMVSPLRPATCRFVPTCSQYAVDALNEYGLIRGSWLAAARLAKCGPWHEGG 60
Query 83 WDPIPEGLTTGRSCQTDVDGANDDWNPASKRGERESFV 120
WDPIP+ SCQ + A+D + RGE ESFV
Sbjct 61 WDPIPD----RPSCQGNCQDASDARVVRATRGESESFV 94
>gi|108802370|ref|YP_642567.1| hypothetical protein Mmcs_5411 [Mycobacterium sp. MCS]
gi|119866063|ref|YP_936015.1| hypothetical protein Mkms_0006 [Mycobacterium sp. KMS]
gi|126438350|ref|YP_001074041.1| hypothetical protein Mjls_5787 [Mycobacterium sp. JLS]
gi|108772789|gb|ABG11511.1| protein of unknown function DUF37 [Mycobacterium sp. MCS]
gi|119692152|gb|ABL89225.1| protein of unknown function DUF37 [Mycobacterium sp. KMS]
gi|126238150|gb|ABO01551.1| protein of unknown function DUF37 [Mycobacterium sp. JLS]
Length=108
Score = 133 bits (334), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 59/96 (62%), Positives = 74/96 (78%), Gaps = 0/96 (0%)
Query 18 ASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGP 77
++A+ +I+VIQ+YRHM+SPLR SCRF PTCSQYAVDALTEYGL+RG WL+++RLAKCGP
Sbjct 10 SAAKAVIYVIQLYRHMISPLRYPSCRFTPTCSQYAVDALTEYGLVRGGWLSLVRLAKCGP 69
Query 78 WHRGGWDPIPEGLTTGRSCQTDVDGANDDWNPASKR 113
WHRGGWDPIPE + D GA+ +P +
Sbjct 70 WHRGGWDPIPERPDAASAPPEDRSGADFTADPVRSK 105
>gi|118469284|ref|YP_891136.1| hypothetical protein MSMEG_6944 [Mycobacterium smegmatis str.
MC2 155]
gi|118170571|gb|ABK71467.1| conserved hypothetical protein, putative [Mycobacterium smegmatis
str. MC2 155]
Length=115
Score = 130 bits (326), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 64/114 (57%), Positives = 78/114 (69%), Gaps = 7/114 (6%)
Query 13 RVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRL 72
R G + R +I+VIQ+YRH +SPLR +CRF+PTCSQYAVDALTEYGL+RG+WL +RL
Sbjct 3 RRIGVRAVRAVIYVIQLYRHTISPLRLPTCRFMPTCSQYAVDALTEYGLIRGTWLATVRL 62
Query 73 AKCGPWHRGGWDPIPE------GLTTGRSCQTDVDGANDDWNPASKRGERESFV 120
KCGPWH+GGWDPIPE G S +T N W +KRGE +S V
Sbjct 63 LKCGPWHQGGWDPIPERCAHETETVAGVSAET-AAAENPVWETPAKRGESKSRV 115
>gi|145221424|ref|YP_001132102.1| hypothetical protein Mflv_0830 [Mycobacterium gilvum PYR-GCK]
gi|315446824|ref|YP_004079703.1| hypothetical protein Mspyr1_53440 [Mycobacterium sp. Spyr1]
gi|145213910|gb|ABP43314.1| protein of unknown function DUF37 [Mycobacterium gilvum PYR-GCK]
gi|315265127|gb|ADU01869.1| conserved hypothetical protein TIGR00278 [Mycobacterium sp. Spyr1]
Length=115
Score = 126 bits (316), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 63/115 (55%), Positives = 76/115 (67%), Gaps = 8/115 (6%)
Query 14 VTGR-ASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRL 72
++GR A R +++VIQ+YRH +SPLR +CRF PTCSQYAVDALTE+G RGSWL +IRL
Sbjct 1 MSGRSAVVRAVVYVIQLYRHTISPLRLPTCRFTPTCSQYAVDALTEFGFFRGSWLALIRL 60
Query 73 AKCGPWHRGGWDPIPE------GLTTGRSCQTDV-DGANDDWNPASKRGERESFV 120
KCGPWHRGGWDPIPE G SC +V D A+ P + S V
Sbjct 61 LKCGPWHRGGWDPIPERTSHSDATADGASCAAEVCDTADHRVAPVQQGKSHTSVV 115
>gi|120407005|ref|YP_956834.1| hypothetical protein Mvan_6076 [Mycobacterium vanbaalenii PYR-1]
gi|119959823|gb|ABM16828.1| protein of unknown function DUF37 [Mycobacterium vanbaalenii
PYR-1]
Length=115
Score = 117 bits (294), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 53/90 (59%), Positives = 67/90 (75%), Gaps = 6/90 (6%)
Query 14 VTGRASA-RGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRL 72
+T R+ A R ++++IQ+YRH +SPLR +CRF PTCSQYAVDALTE+GL RG WL ++RL
Sbjct 1 MTARSRAVRAVVYLIQLYRHTISPLRLPTCRFTPTCSQYAVDALTEFGLFRGGWLALVRL 60
Query 73 AKCGPWHRGGWDPIPEGLTTGRSCQTDVDG 102
KCGPWH GGWDPIP+ R +D G
Sbjct 61 LKCGPWHNGGWDPIPD-----RESHSDATG 85
>gi|312142025|ref|YP_004009361.1| hypothetical protein REQ_47440 [Rhodococcus equi 103S]
gi|325677524|ref|ZP_08157188.1| alpha-hemolysin [Rhodococcus equi ATCC 33707]
gi|311891364|emb|CBH50685.1| conserved hypothetical protein [Rhodococcus equi 103S]
gi|325551771|gb|EGD21469.1| alpha-hemolysin [Rhodococcus equi ATCC 33707]
Length=112
Score = 102 bits (255), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 44/69 (64%), Positives = 57/69 (83%), Gaps = 0/69 (0%)
Query 20 ARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWH 79
AR LIF+I++YR +SPLR +CRF PTCS+YAVDA+ +G ++GS LT++RLAKC PWH
Sbjct 32 ARVLIFLIELYRTYVSPLRLPTCRFTPTCSEYAVDAMRTHGAIKGSALTVVRLAKCAPWH 91
Query 80 RGGWDPIPE 88
GGWDP+PE
Sbjct 92 PGGWDPVPE 100
>gi|226309517|ref|YP_002769479.1| hypothetical protein RER_60320 [Rhodococcus erythropolis PR4]
gi|226188636|dbj|BAH36740.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=126
Score = 102 bits (255), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 44/77 (58%), Positives = 59/77 (77%), Gaps = 0/77 (0%)
Query 11 VVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMI 70
V++ RA A LIF I++YR +SPLR +CRF PTCS+YAV+AL +G ++G +LT++
Sbjct 20 VLKAVRRAPATVLIFCIELYRTYVSPLRMPTCRFTPTCSEYAVEALRTHGAIKGLFLTVV 79
Query 71 RLAKCGPWHRGGWDPIP 87
RLAKC PWH GGWDP+P
Sbjct 80 RLAKCAPWHSGGWDPVP 96
>gi|296141905|ref|YP_003649148.1| hypothetical protein Tpau_4241 [Tsukamurella paurometabola DSM
20162]
gi|296030039|gb|ADG80809.1| protein of unknown function DUF37 [Tsukamurella paurometabola
DSM 20162]
Length=74
Score = 100 bits (248), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 43/68 (64%), Positives = 57/68 (84%), Gaps = 0/68 (0%)
Query 21 RGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHR 80
R LI ++Q YR +SP+RP +CRF+PTCS+YAV+AL+ +G LRG +L+ IRL KCGPWHR
Sbjct 7 RTLIGLVQWYRTWISPMRPPTCRFMPTCSEYAVEALSRHGALRGVYLSTIRLLKCGPWHR 66
Query 81 GGWDPIPE 88
GGWDP+P+
Sbjct 67 GGWDPVPQ 74
>gi|226362892|ref|YP_002780672.1| hypothetical protein ROP_34800 [Rhodococcus opacus B4]
gi|254765056|sp|C1B7S4.1|Y3480_RHOOB RecName: Full=UPF0161 protein ROP_34800
gi|226241379|dbj|BAH51727.1| hypothetical protein [Rhodococcus opacus B4]
Length=121
Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/86 (53%), Positives = 61/86 (71%), Gaps = 4/86 (4%)
Query 20 ARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWH 79
AR LIF I++YR +SPLR +CRF+PTCS+YAV++L +G ++G LT++RLAKC PWH
Sbjct 31 ARTLIFFIELYRTYVSPLRMPTCRFMPTCSEYAVESLRTHGTIKGLLLTVVRLAKCAPWH 90
Query 80 RGGWDPIP----EGLTTGRSCQTDVD 101
GGWDP+P + R C +VD
Sbjct 91 PGGWDPVPARHDRHAGSRRCCPANVD 116
>gi|111020652|ref|YP_703624.1| hypothetical protein RHA1_ro03663 [Rhodococcus jostii RHA1]
gi|116256234|sp|Q0SAH0.1|Y3663_RHOSR RecName: Full=UPF0161 protein RHA1_ro03663
gi|110820182|gb|ABG95466.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=121
Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 41/68 (61%), Positives = 57/68 (84%), Gaps = 0/68 (0%)
Query 20 ARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWH 79
AR LIF I++YR +SPLR +CRF+PTCS+YAV++L +G+++G +LT++RLAKC PWH
Sbjct 31 ARTLIFFIELYRTYVSPLRMPTCRFMPTCSEYAVESLRTHGVIKGLFLTVVRLAKCAPWH 90
Query 80 RGGWDPIP 87
GGWDP+P
Sbjct 91 PGGWDPVP 98
>gi|262204660|ref|YP_003275868.1| hypothetical protein Gbro_4862 [Gordonia bronchialis DSM 43247]
gi|262088007|gb|ACY23975.1| protein of unknown function DUF37 [Gordonia bronchialis DSM 43247]
Length=115
Score = 96.7 bits (239), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 47/93 (51%), Positives = 61/93 (66%), Gaps = 0/93 (0%)
Query 10 RVVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTM 69
RV+ R LIFVI++YR +SP+R +CRF PTCS YAV+AL +G G WL++
Sbjct 22 RVLTTVRLLPRRTLIFVIELYRTWISPMRLPTCRFEPTCSGYAVEALERHGFRYGGWLSV 81
Query 70 IRLAKCGPWHRGGWDPIPEGLTTGRSCQTDVDG 102
IRL KCGPWH+ G+DP+PE R T+V G
Sbjct 82 IRLLKCGPWHKPGYDPVPEDGFRWRRPPTEVRG 114
>gi|291005727|ref|ZP_06563700.1| hypothetical protein SeryN2_14499 [Saccharopolyspora erythraea
NRRL 2338]
Length=102
Score = 95.5 bits (236), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/73 (59%), Positives = 54/73 (74%), Gaps = 0/73 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
L+ + VYR ++SPL P SCRF P+CS YAV+ALT +G LRG WLT RL +CGPWH GG
Sbjct 21 LLVPVHVYRKVISPLLPPSCRFYPSCSAYAVEALTVHGALRGGWLTARRLLRCGPWHPGG 80
Query 83 WDPIPEGLTTGRS 95
DP+P T+GR+
Sbjct 81 LDPVPPRRTSGRA 93
>gi|134103815|ref|YP_001109476.1| hypothetical protein SACE_7395 [Saccharopolyspora erythraea NRRL
2338]
gi|133916438|emb|CAM06551.1| hypothetical protein SACE_7395 [Saccharopolyspora erythraea NRRL
2338]
Length=83
Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 43/73 (59%), Positives = 54/73 (74%), Gaps = 0/73 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
L+ + VYR ++SPL P SCRF P+CS YAV+ALT +G LRG WLT RL +CGPWH GG
Sbjct 2 LLVPVHVYRKVISPLLPPSCRFYPSCSAYAVEALTVHGALRGGWLTARRLLRCGPWHPGG 61
Query 83 WDPIPEGLTTGRS 95
DP+P T+GR+
Sbjct 62 LDPVPPRRTSGRA 74
>gi|54027645|ref|YP_121887.1| hypothetical protein nfa56710 [Nocardia farcinica IFM 10152]
gi|81372755|sp|Q5YMR8.1|Y5671_NOCFA RecName: Full=UPF0161 protein NFA_56710
gi|54019153|dbj|BAD60523.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=118
Score = 94.4 bits (233), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 43/72 (60%), Positives = 52/72 (73%), Gaps = 0/72 (0%)
Query 17 RASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCG 76
R AR LIF+I++YR +SP R CRF PTCS+YAV AL GL+ G LT +RLAKC
Sbjct 10 RLPARALIFLIELYRTYVSPTRMPVCRFTPTCSEYAVTALRTRGLVVGLGLTAVRLAKCA 69
Query 77 PWHRGGWDPIPE 88
PWH GGWDP+P+
Sbjct 70 PWHPGGWDPVPQ 81
>gi|256381074|ref|YP_003104734.1| hypothetical protein Amir_7098 [Actinosynnema mirum DSM 43827]
gi|255925377|gb|ACU40888.1| protein of unknown function DUF37 [Actinosynnema mirum DSM 43827]
Length=89
Score = 92.0 bits (227), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 41/71 (58%), Positives = 52/71 (74%), Gaps = 0/71 (0%)
Query 17 RASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCG 76
R + R L+ I+ Y+ +SP P +CRF PTCS YAV+ALT +G LRGSWLT+ RL +CG
Sbjct 8 RPAVRVLVLPIRFYQRFISPYLPPTCRFHPTCSSYAVEALTVHGALRGSWLTVRRLGRCG 67
Query 77 PWHRGGWDPIP 87
PWH GG DP+P
Sbjct 68 PWHPGGLDPVP 78
>gi|205374603|ref|ZP_03227398.1| hypothetical protein Bcoam_16106 [Bacillus coahuilensis m4-4]
Length=77
Score = 92.0 bits (227), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 34/69 (50%), Positives = 56/69 (82%), Gaps = 0/69 (0%)
Query 20 ARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWH 79
++ IF+I++Y+ ++SPL+P SCRF PTCS Y V+A+ ++G ++G WLT+ R+++C P+H
Sbjct 3 SKLFIFLIKIYQRVISPLKPPSCRFYPTCSHYGVEAIQKHGAVKGGWLTIKRISRCHPFH 62
Query 80 RGGWDPIPE 88
GG+DP+PE
Sbjct 63 PGGFDPVPE 71
>gi|158333932|ref|YP_001515104.1| hypothetical protein AM1_0746 [Acaryochloris marina MBIC11017]
gi|158304173|gb|ABW25790.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length=97
Score = 91.7 bits (226), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 41/93 (45%), Positives = 61/93 (66%), Gaps = 0/93 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
LI +IQVYR ++SPL P SCRF PTCS YA++A++ +G ++GSWL R+ +C P+H GG
Sbjct 5 LIGMIQVYRRLISPLFPPSCRFHPTCSAYAIEAISTWGSVKGSWLAAKRICRCNPFHPGG 64
Query 83 WDPIPEGLTTGRSCQTDVDGANDDWNPASKRGE 115
+DP+P T S ++ + D + +S E
Sbjct 65 YDPVPTVATLSDSAHSETVDSEVDQDNSSHGSE 97
>gi|196250028|ref|ZP_03148723.1| protein of unknown function DUF37 [Geobacillus sp. G11MC16]
gi|196210542|gb|EDY05306.1| protein of unknown function DUF37 [Geobacillus sp. G11MC16]
Length=95
Score = 91.7 bits (226), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 36/69 (53%), Positives = 53/69 (77%), Gaps = 0/69 (0%)
Query 20 ARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWH 79
A+ LI +I+ Y+ LSPL+P +CRF PTCS Y ++A+T +G ++G WLT+ R+ KC P+H
Sbjct 5 AKLLIMLIRFYQRFLSPLKPPTCRFYPTCSHYGIEAVTRFGAIKGGWLTVKRILKCHPFH 64
Query 80 RGGWDPIPE 88
GG+DP+PE
Sbjct 65 PGGFDPVPE 73
>gi|257065516|ref|YP_003145188.1| conserved hypothetical protein TIGR00278 [Slackia heliotrinireducens
DSM 20476]
gi|256793169|gb|ACV23839.1| conserved hypothetical protein TIGR00278 [Slackia heliotrinireducens
DSM 20476]
Length=77
Score = 90.9 bits (224), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 36/70 (52%), Positives = 52/70 (75%), Gaps = 0/70 (0%)
Query 24 IFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGGW 83
I +I+ YR +SPL P+ CR+ PTCSQY + A+ +G L+GSWLT+ R+++C PWH GG+
Sbjct 7 ILLIRFYRAAISPLFPSCCRYTPTCSQYGIIAIKRFGFLKGSWLTIKRISRCHPWHEGGY 66
Query 84 DPIPEGLTTG 93
DP+PE + G
Sbjct 67 DPVPEKDSEG 76
>gi|254302383|ref|ZP_04969741.1| possible alpha-hemolysin [Fusobacterium nucleatum subsp. polymorphum
ATCC 10953]
gi|148322575|gb|EDK87825.1| possible alpha-hemolysin [Fusobacterium nucleatum subsp. polymorphum
ATCC 10953]
Length=82
Score = 90.5 bits (223), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 34/66 (52%), Positives = 51/66 (78%), Gaps = 0/66 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
LI +I+ Y+ +SP+ PA CRF PTCSQY ++A+ EYG ++G++L + R+ KC P+H GG
Sbjct 5 LILLIRFYQKFISPMFPAKCRFYPTCSQYTLEAIKEYGAMKGTYLGIKRILKCHPFHEGG 64
Query 83 WDPIPE 88
+DPIP+
Sbjct 65 YDPIPK 70
>gi|289523249|ref|ZP_06440103.1| alpha-hemolysin [Anaerobaculum hydrogeniformans ATCC BAA-1850]
gi|289503792|gb|EFD24956.1| alpha-hemolysin [Anaerobaculum hydrogeniformans ATCC BAA-1850]
Length=90
Score = 90.1 bits (222), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 39/71 (55%), Positives = 55/71 (78%), Gaps = 0/71 (0%)
Query 17 RASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCG 76
+A R LI I++Y+ LSPL +CRF P+CSQYA++A+ ++G +RGSWLT+ RL +CG
Sbjct 13 KAFKRVLILSIRLYQIFLSPLLGRNCRFYPSCSQYALEAIEQWGAIRGSWLTVKRLCRCG 72
Query 77 PWHRGGWDPIP 87
PWH GG+DP+P
Sbjct 73 PWHPGGYDPVP 83
>gi|343928732|ref|ZP_08768177.1| hypothetical protein GOALK_120_01600 [Gordonia alkanivorans NBRC
16433]
gi|343761481|dbj|GAA15103.1| hypothetical protein GOALK_120_01600 [Gordonia alkanivorans NBRC
16433]
Length=118
Score = 90.1 bits (222), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 41/76 (54%), Positives = 54/76 (72%), Gaps = 0/76 (0%)
Query 13 RVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRL 72
RV R +IF+I++YR +SPLR +CRF PTCS YAV+AL +G L GS L +RL
Sbjct 3 RVAHLLPRRTVIFLIELYRTWVSPLRLPTCRFEPTCSGYAVEALERHGFLYGSLLATVRL 62
Query 73 AKCGPWHRGGWDPIPE 88
KCGPWH+ G+DP+P+
Sbjct 63 LKCGPWHKPGYDPVPD 78
>gi|302880139|ref|YP_003848703.1| hypothetical protein Galf_2947 [Gallionella capsiferriformans
ES-2]
gi|302582928|gb|ADL56939.1| protein of unknown function DUF37 [Gallionella capsiferriformans
ES-2]
Length=69
Score = 90.1 bits (222), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 38/67 (57%), Positives = 53/67 (80%), Gaps = 0/67 (0%)
Query 21 RGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHR 80
R LI VI Y++++SP+RP +CRF P+CS YA DALT +GL+ GSWL++ R+A+C PW+
Sbjct 3 RILIKVIHGYQYLISPMRPPTCRFTPSCSHYACDALTRHGLITGSWLSIKRIARCNPWNP 62
Query 81 GGWDPIP 87
GG+DP P
Sbjct 63 GGYDPAP 69
>gi|237743157|ref|ZP_04573638.1| alpha-hemolysin [Fusobacterium sp. 7_1]
gi|260495363|ref|ZP_05815490.1| conserved hypothetical protein [Fusobacterium sp. 3_1_33]
gi|289765523|ref|ZP_06524901.1| alpha-hemolysin [Fusobacterium sp. D11]
8 more sequence titles
Length=82
Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 51/66 (78%), Gaps = 0/66 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
LI +I+ Y+ +SP+ PA CRF PTCSQY ++A+ EYG ++G++L + R+ KC P+H GG
Sbjct 5 LILLIRFYQKFISPMFPAKCRFYPTCSQYTLEAIKEYGAIKGTYLGIKRILKCHPFHEGG 64
Query 83 WDPIPE 88
+DP+P+
Sbjct 65 YDPVPK 70
>gi|119475234|ref|ZP_01615587.1| hypothetical protein GP2143_15481 [marine gamma proteobacterium
HTCC2143]
gi|119451437|gb|EAW32670.1| hypothetical protein GP2143_15481 [marine gamma proteobacterium
HTCC2143]
Length=81
Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 36/68 (53%), Positives = 50/68 (74%), Gaps = 0/68 (0%)
Query 21 RGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHR 80
R I IQ YR+++SPL CRF P+CS YA+ A+ +YG+ +GSWLT+ RL +C P+H
Sbjct 3 RLFISFIQCYRYLISPLMANHCRFHPSCSNYAIQAIEQYGVFKGSWLTVKRLLRCHPFHP 62
Query 81 GGWDPIPE 88
GG+DP+PE
Sbjct 63 GGYDPVPE 70
>gi|158341482|ref|YP_001522647.1| hypothetical protein AM1_G0153 [Acaryochloris marina MBIC11017]
gi|158311723|gb|ABW33333.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length=99
Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 39/84 (47%), Positives = 58/84 (70%), Gaps = 1/84 (1%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
LI +IQ YR +SPL SCRF PTCS YA++A++ +G+L+GSWL + R+++C P+H GG
Sbjct 5 LIGLIQFYRRFISPLFLPSCRFQPTCSAYAIEAISTWGVLKGSWLALKRISRCHPFHPGG 64
Query 83 WDPIPEGLTTGRSCQTDVDGANDD 106
+DP+P S Q D + A+ +
Sbjct 65 YDPVPTA-AVKPSAQIDSESADPN 87
>gi|329894826|ref|ZP_08270626.1| YidD [gamma proteobacterium IMCC3088]
gi|328922720|gb|EGG30054.1| YidD [gamma proteobacterium IMCC3088]
Length=83
Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 40/70 (58%), Positives = 51/70 (73%), Gaps = 0/70 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
LI I++YR +LSP +CRF PTCS+YA++AL E+G LRGS LT RL KC PWH GG
Sbjct 11 LISFIKLYRLLLSPFLGRNCRFYPTCSEYAIEALHEHGALRGSILTAQRLGKCHPWHDGG 70
Query 83 WDPIPEGLTT 92
+DP+P+ T
Sbjct 71 YDPVPKTQQT 80
>gi|335039144|ref|ZP_08532326.1| UPF0161 protein yidD [Caldalkalibacillus thermarum TA2.A1]
gi|334180954|gb|EGL83537.1| UPF0161 protein yidD [Caldalkalibacillus thermarum TA2.A1]
Length=70
Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 36/65 (56%), Positives = 50/65 (77%), Gaps = 0/65 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
++ +I+ Y+ +SPLRP SCRFVPTCS YA A +YGL +GS+L + R+ KC P+HRGG
Sbjct 6 ILALIRFYQKWISPLRPPSCRFVPTCSHYAYQAFQKYGLFKGSYLVLKRMMKCHPFHRGG 65
Query 83 WDPIP 87
+DP+P
Sbjct 66 YDPLP 70
>gi|289524496|ref|ZP_06441350.1| putative alpha-hemolysin [Anaerobaculum hydrogeniformans ATCC
BAA-1850]
gi|289502265|gb|EFD23429.1| putative alpha-hemolysin [Anaerobaculum hydrogeniformans ATCC
BAA-1850]
Length=76
Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 38/67 (57%), Positives = 53/67 (80%), Gaps = 0/67 (0%)
Query 21 RGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHR 80
R LI I++Y+ LSPL +CRF P+CSQYA++A+ ++G +RGSWLT+ RL +CGPWH
Sbjct 3 RMLILSIRLYQIFLSPLLGRNCRFYPSCSQYALEAIEQWGAIRGSWLTVKRLCRCGPWHP 62
Query 81 GGWDPIP 87
GG+DP+P
Sbjct 63 GGYDPVP 69
>gi|15615392|ref|NP_243695.1| hypothetical protein BH2828 [Bacillus halodurans C-125]
gi|14286088|sp|Q9K921.1|Y2828_BACHD RecName: Full=UPF0161 protein BH2828
gi|10175450|dbj|BAB06548.1| alpha-hemolysin [Bacillus halodurans C-125]
Length=75
Score = 88.2 bits (217), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 35/69 (51%), Positives = 51/69 (74%), Gaps = 0/69 (0%)
Query 20 ARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWH 79
R I I++Y+ +SPL P +CRF PTCS Y ++A+ +G+L+G WLT+ R++KC P+H
Sbjct 2 KRFFIGCIRLYQKFISPLTPPTCRFYPTCSHYGIEAIQRFGVLKGGWLTIKRISKCHPFH 61
Query 80 RGGWDPIPE 88
RGG DP+PE
Sbjct 62 RGGIDPVPE 70
>gi|86160774|ref|YP_467559.1| hypothetical protein Adeh_4359 [Anaeromyxobacter dehalogenans
2CP-C]
gi|116256245|sp|Q2IHR6.1|Y4359_ANADE RecName: Full=UPF0161 protein Adeh_4359
gi|85777285|gb|ABC84122.1| protein of unknown function DUF37 [Anaeromyxobacter dehalogenans
2CP-C]
Length=76
Score = 88.2 bits (217), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 40/74 (55%), Positives = 55/74 (75%), Gaps = 1/74 (1%)
Query 21 RGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHR 80
R L+ ++++Y+ ++SPL P +CRF P+CS YAV AL +G LRGSWLT+ RL +C P+H
Sbjct 4 RALVLLVRIYQRLVSPLLPPACRFYPSCSAYAVTALQRHGALRGSWLTVRRLCRCHPFHP 63
Query 81 GGWDPIPEGLTTGR 94
GG DP+PE LT R
Sbjct 64 GGVDPVPE-LTPKR 76
>gi|19703355|ref|NP_602917.1| alpha-hemolysin [Fusobacterium nucleatum subsp. nucleatum ATCC
25586]
gi|25009580|sp|Q8RHA5.1|Y003_FUSNN RecName: Full=UPF0161 protein FN0003
gi|19713417|gb|AAL94216.1| hypothetical cytosolic protein [Fusobacterium nucleatum subsp.
nucleatum ATCC 25586]
Length=82
Score = 88.2 bits (217), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 32/66 (49%), Positives = 51/66 (78%), Gaps = 0/66 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
LI +I+ Y+ +SP+ PA CRF PTCSQY ++A+ E+G ++G++L + R+ KC P+H GG
Sbjct 5 LILLIRFYQKFISPMFPAKCRFYPTCSQYTLEAIKEHGTIKGTYLAIRRILKCHPFHEGG 64
Query 83 WDPIPE 88
+DP+P+
Sbjct 65 YDPVPK 70
>gi|260424965|ref|ZP_05779066.1| putative toxin-antitoxin system, toxin component [Dialister invisus
DSM 15470]
gi|260403760|gb|EEW97307.1| putative toxin-antitoxin system, toxin component [Dialister invisus
DSM 15470]
Length=70
Score = 88.2 bits (217), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 35/65 (54%), Positives = 50/65 (77%), Gaps = 0/65 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
LI +I+ YR +SPL+P CRF PTCS YA++A+ +YG L+G WL + R+ KC P+H+GG
Sbjct 6 LIVLIRFYRKYVSPLKPPCCRFTPTCSTYALEAVEKYGALKGGWLALKRILKCHPFHKGG 65
Query 83 WDPIP 87
+DP+P
Sbjct 66 YDPVP 70
>gi|269792318|ref|YP_003317222.1| hypothetical protein Taci_0704 [Thermanaerovibrio acidaminovorans
DSM 6589]
gi|269099953|gb|ACZ18940.1| protein of unknown function DUF37 [Thermanaerovibrio acidaminovorans
DSM 6589]
Length=75
Score = 87.8 bits (216), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 38/65 (59%), Positives = 50/65 (77%), Gaps = 0/65 (0%)
Query 24 IFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGGW 83
+F I+VY+ LSPL +CRF PTCSQYA +A++ +G+LRG WL+ RL KCGPWH GG+
Sbjct 10 LFSIRVYQIFLSPLLGRNCRFHPTCSQYAFEAISRFGVLRGIWLSAKRLVKCGPWHPGGY 69
Query 84 DPIPE 88
DP+P
Sbjct 70 DPVPH 74
>gi|239828120|ref|YP_002950744.1| hypothetical protein GWCH70_2790 [Geobacillus sp. WCH70]
gi|259646903|sp|C5D6T9.1|Y2790_GEOSW RecName: Full=UPF0161 protein GWCH70_2790
gi|239808413|gb|ACS25478.1| protein of unknown function DUF37 [Geobacillus sp. WCH70]
Length=77
Score = 87.8 bits (216), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 35/73 (48%), Positives = 52/73 (72%), Gaps = 0/73 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
LI I+ Y+ +SPL+P +CRF PTCS Y ++A+ +G ++G WLT+ R+ KC P+H GG
Sbjct 5 LISFIRFYQIFISPLKPPTCRFYPTCSHYGLEAVKRFGAIKGGWLTIKRILKCHPFHPGG 64
Query 83 WDPIPEGLTTGRS 95
+DP+PE G+S
Sbjct 65 FDPVPEKQENGKS 77
>gi|342214934|ref|ZP_08707604.1| hypothetical protein HMPREF9200_0812 [Veillonella sp. oral taxon
780 str. F0422]
gi|341590236|gb|EGS33482.1| hypothetical protein HMPREF9200_0812 [Veillonella sp. oral taxon
780 str. F0422]
Length=75
Score = 87.4 bits (215), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 36/65 (56%), Positives = 51/65 (79%), Gaps = 0/65 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
LI ++Q YR+ LSPL+P++CRF PTCS Y + AL YG L+GS+L + R+ KC P+H+GG
Sbjct 11 LIGLVQFYRYALSPLKPSTCRFYPTCSMYMLQALRRYGPLKGSYLGIKRILKCHPFHKGG 70
Query 83 WDPIP 87
+DP+P
Sbjct 71 YDPLP 75
>gi|56752481|ref|YP_173182.1| hypothetical protein syc2472_c [Synechococcus elongatus PCC 6301]
gi|81300295|ref|YP_400503.1| hypothetical protein Synpcc7942_1486 [Synechococcus elongatus
PCC 7942]
gi|81561212|sp|Q5MZ58.1|Y2472_SYNP6 RecName: Full=UPF0161 protein syc2472_c
gi|116256137|sp|Q31N53.1|Y1486_SYNE7 RecName: Full=UPF0161 protein Synpcc7942_1486
gi|56687440|dbj|BAD80662.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169176|gb|ABB57516.1| Protein of unknown function DUF37 [Synechococcus elongatus PCC
7942]
Length=82
Score = 87.4 bits (215), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 37/66 (57%), Positives = 51/66 (78%), Gaps = 0/66 (0%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
L+ +IQ YR +SPL PASCRF PTCSQY ++A+ +G L+GSWLT+ R+ +C P+H GG
Sbjct 5 LLALIQFYRRWISPLTPASCRFYPTCSQYGLEAIDRFGPLKGSWLTLCRILRCHPFHPGG 64
Query 83 WDPIPE 88
+DP+P
Sbjct 65 YDPVPP 70
>gi|269216521|ref|ZP_06160375.1| YidD family protein [Slackia exigua ATCC 700122]
gi|269130050|gb|EEZ61132.1| YidD family protein [Slackia exigua ATCC 700122]
Length=96
Score = 87.4 bits (215), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 36/78 (47%), Positives = 54/78 (70%), Gaps = 0/78 (0%)
Query 11 VVRVTGRASARGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMI 70
++R A+G +F+I YR +SPL P+ CRF PTCS+Y + A+ +G L+GS LT+
Sbjct 7 LLRYIKGIPAQGALFLITFYRAAISPLFPSCCRFTPTCSEYGLTAIRRFGFLKGSLLTLK 66
Query 71 RLAKCGPWHRGGWDPIPE 88
R+++C P H GG+DP+PE
Sbjct 67 RISRCHPLHAGGYDPVPE 84
>gi|172035662|ref|YP_001802163.1| hypothetical protein cce_0746 [Cyanothece sp. ATCC 51142]
gi|254806602|sp|B1WR39.1|Y746_CYAA5 RecName: Full=UPF0161 protein cce_0746
gi|171697116|gb|ACB50097.1| DUF37-containing protein [Cyanothece sp. ATCC 51142]
Length=74
Score = 87.0 bits (214), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 36/67 (54%), Positives = 52/67 (78%), Gaps = 0/67 (0%)
Query 21 RGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHR 80
+ LI +I+ YR ++SPL P SCRF PTCSQY ++A+ ++G LRGSWL + R+ +C P+H
Sbjct 3 KTLILLIKGYRRLISPLFPPSCRFQPTCSQYTLEAIEKFGALRGSWLGLRRILRCHPFHP 62
Query 81 GGWDPIP 87
GG+DP+P
Sbjct 63 GGYDPVP 69
>gi|326383906|ref|ZP_08205590.1| hypothetical protein SCNU_13273 [Gordonia neofelifaecis NRRL
B-59395]
gi|326197365|gb|EGD54555.1| hypothetical protein SCNU_13273 [Gordonia neofelifaecis NRRL
B-59395]
Length=105
Score = 87.0 bits (214), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 59/94 (63%), Gaps = 4/94 (4%)
Query 23 LIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRGG 82
LIFVI++YR +SP R +CRF PTCS YAV+AL +G + GS L RL KCGPWH+ G
Sbjct 15 LIFVIELYRTWISPTRMPTCRFEPTCSGYAVEALQRHGFVWGSVLATWRLLKCGPWHKPG 74
Query 83 WDPIPE-GLTTGRSCQTDVDGANDDWNPASKRGE 115
+DP+PE G +T GA P S RG+
Sbjct 75 YDPVPELGPVEWLRSKTSNSGATA---PTSDRGK 105
>gi|86607211|ref|YP_475974.1| hypothetical protein CYA_2594 [Synechococcus sp. JA-3-3Ab]
gi|116256210|sp|Q2JQD1.1|Y2594_SYNJA RecName: Full=UPF0161 protein CYA_2594
gi|86555753|gb|ABD00711.1| conserved hypothetical protein TIGR00278 [Synechococcus sp. JA-3-3Ab]
Length=100
Score = 87.0 bits (214), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 36/68 (53%), Positives = 52/68 (77%), Gaps = 0/68 (0%)
Query 21 RGLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHR 80
RGL+ +I+ Y+ +SPL P +CR+ PTCSQY ++A+ YG +RGSWL + RL +C PWH
Sbjct 10 RGLVALIRGYQVAISPLLPPACRYYPTCSQYTLEAVRRYGAIRGSWLGIRRLCRCHPWHP 69
Query 81 GGWDPIPE 88
GG+DP+P+
Sbjct 70 GGYDPVPD 77
>gi|197124876|ref|YP_002136827.1| hypothetical protein AnaeK_4496 [Anaeromyxobacter sp. K]
gi|220919594|ref|YP_002494898.1| hypothetical protein A2cp1_4515 [Anaeromyxobacter dehalogenans
2CP-1]
gi|226706194|sp|B4UKG2.1|Y4496_ANASK RecName: Full=UPF0161 protein AnaeK_4496
gi|254803922|sp|B8JDK6.1|Y4515_ANAD2 RecName: Full=UPF0161 protein A2cp1_4515
gi|196174725|gb|ACG75698.1| protein of unknown function DUF37 [Anaeromyxobacter sp. K]
gi|219957448|gb|ACL67832.1| protein of unknown function DUF37 [Anaeromyxobacter dehalogenans
2CP-1]
Length=76
Score = 87.0 bits (214), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 39/73 (54%), Positives = 54/73 (74%), Gaps = 1/73 (1%)
Query 22 GLIFVIQVYRHMLSPLRPASCRFVPTCSQYAVDALTEYGLLRGSWLTMIRLAKCGPWHRG 81
L+ ++++Y+ ++SPL P +CRF P+CS YAV AL +G LRGSWLT+ RL +C P+H G
Sbjct 5 ALVLLVRIYQRLVSPLLPPACRFYPSCSAYAVTALQRHGALRGSWLTVRRLCRCHPFHAG 64
Query 82 GWDPIPEGLTTGR 94
G DP+PE LT R
Sbjct 65 GVDPVPE-LTPKR 76
Lambda K H
0.323 0.137 0.455
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128530997826
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40