BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0968
Length=98
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value
gi|15840394|ref|NP_335431.1| hypothetical protein MT0996 [Mycoba... 192 2e-47
gi|15608108|ref|NP_215483.1| hypothetical protein Rv0968 [Mycoba... 191 4e-47
gi|340625980|ref|YP_004744432.1| hypothetical protein MCAN_09721... 188 2e-46
gi|296164289|ref|ZP_06846875.1| conserved hypothetical protein [... 123 1e-26
gi|240171334|ref|ZP_04749993.1| hypothetical protein MkanA1_1862... 121 4e-26
gi|296164642|ref|ZP_06847208.1| conserved hypothetical protein [... 117 6e-25
gi|183981452|ref|YP_001849743.1| hypothetical protein MMAR_1430 ... 114 4e-24
gi|296165052|ref|ZP_06847606.1| conserved hypothetical protein [... 103 7e-21
gi|118467605|ref|YP_890281.1| hypothetical protein MSMEG_6059 [M... 97.1 8e-19
gi|15842859|ref|NP_337896.1| hypothetical protein MT3369 [Mycoba... 91.3 5e-17
gi|15610405|ref|NP_217786.1| hypothetical protein Rv3269 [Mycoba... 90.5 8e-17
gi|342861473|ref|ZP_08718120.1| hypothetical protein MCOL_21411 ... 89.7 1e-16
gi|308232401|ref|ZP_07415935.2| hypothetical protein TMAG_02722 ... 89.4 1e-16
gi|41409481|ref|NP_962317.1| hypothetical protein MAP3383 [Mycob... 89.0 2e-16
gi|118463473|ref|YP_883375.1| hypothetical protein MAV_4234 [Myc... 88.6 3e-16
gi|254821358|ref|ZP_05226359.1| hypothetical protein MintA_15582... 85.1 3e-15
gi|333991660|ref|YP_004524274.1| hypothetical protein JDM601_302... 84.7 4e-15
gi|226359852|ref|YP_002777630.1| hypothetical protein ROP_04380 ... 84.3 5e-15
gi|240172498|ref|ZP_04751157.1| hypothetical protein MkanA1_2450... 84.0 6e-15
gi|183982550|ref|YP_001850841.1| hypothetical protein MMAR_2537 ... 80.9 6e-14
gi|118618085|ref|YP_906417.1| hypothetical protein MUL_2615 [Myc... 78.6 3e-13
gi|296168935|ref|ZP_06850604.1| conserved hypothetical protein [... 78.6 3e-13
gi|15827316|ref|NP_301579.1| hypothetical protein ML0748 [Mycoba... 77.8 5e-13
gi|340627004|ref|YP_004745456.1| hypothetical protein MCAN_20131... 77.0 8e-13
gi|183984843|ref|YP_001853134.1| hypothetical protein MMAR_4875 ... 73.2 1e-11
gi|183982159|ref|YP_001850450.1| hypothetical protein MMAR_2146 ... 69.3 2e-10
gi|296165061|ref|ZP_06847615.1| conserved hypothetical protein [... 64.7 4e-09
gi|15609130|ref|NP_216509.1| hypothetical protein Rv1993c [Mycob... 63.5 9e-09
gi|289570093|ref|ZP_06450320.1| conserved hypothetical protein [... 62.4 2e-08
gi|167970478|ref|ZP_02552755.1| hypothetical protein MtubH3_2159... 62.0 3e-08
gi|296164259|ref|ZP_06846848.1| conserved hypothetical protein [... 61.2 5e-08
gi|118616275|ref|YP_904607.1| hypothetical protein MUL_0424 [Myc... 59.7 1e-07
gi|333919412|ref|YP_004492993.1| hypothetical protein AS9A_1744 ... 56.6 1e-06
gi|289749496|ref|ZP_06508874.1| conserved hypothetical protein [... 55.8 2e-06
gi|325000600|ref|ZP_08121712.1| hypothetical protein PseP1_17612... 55.1 4e-06
gi|296164386|ref|ZP_06846962.1| conserved hypothetical protein [... 43.1 0.013
gi|258653982|ref|YP_003203138.1| hypothetical protein Namu_3853 ... 37.4 0.71
gi|86139077|ref|ZP_01057648.1| UvrABC system protein C [Roseobac... 35.8 1.9
gi|147792395|emb|CAN70278.1| hypothetical protein VITISV_015612 ... 35.4 3.1
gi|221485569|gb|EEE23850.1| conserved hypothetical protein [Toxo... 35.0 3.8
gi|83942317|ref|ZP_00954778.1| hypothetical protein EE36_14792 [... 34.3 6.0
gi|83953536|ref|ZP_00962257.1| hypothetical protein NAS141_04913... 33.9 7.8
gi|254466793|ref|ZP_05080204.1| excinuclease ABC, C subunit [Rho... 33.9 8.9
gi|152964259|ref|YP_001360043.1| hypothetical protein Krad_0289 ... 33.9 9.2
gi|186685628|ref|YP_001868824.1| secretion protein HlyD [Nostoc ... 33.5 10.0
>gi|15840394|ref|NP_335431.1| hypothetical protein MT0996 [Mycobacterium tuberculosis CDC1551]
gi|13880562|gb|AAK45245.1| hypothetical protein MT0996 [Mycobacterium tuberculosis CDC1551]
Length=118
Score = 192 bits (487), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 98/98 (100%), Positives = 98/98 (100%), Gaps = 0/98 (0%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE
Sbjct 21 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 80
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH
Sbjct 81 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 118
>gi|15608108|ref|NP_215483.1| hypothetical protein Rv0968 [Mycobacterium tuberculosis H37Rv]
gi|31792157|ref|NP_854650.1| hypothetical protein Mb0993 [Mycobacterium bovis AF2122/97]
gi|121636894|ref|YP_977117.1| hypothetical protein BCG_1022 [Mycobacterium bovis BCG str. Pasteur
1173P2]
68 more sequence titles
Length=98
Score = 191 bits (484), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 98/98 (100%), Positives = 98/98 (100%), Gaps = 0/98 (0%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE
Sbjct 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH
Sbjct 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
>gi|340625980|ref|YP_004744432.1| hypothetical protein MCAN_09721 [Mycobacterium canettii CIPT
140010059]
gi|340004170|emb|CCC43309.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=98
Score = 188 bits (478), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 97/98 (99%), Positives = 98/98 (100%), Gaps = 0/98 (0%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
MVWHGFLAKAVPTVVTGAVGVAAYEALRK+VVKAPLRAATVSVAAWGIRLAREAERKAGE
Sbjct 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKVVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH
Sbjct 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
>gi|296164289|ref|ZP_06846875.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295900351|gb|EFG79771.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=113
Score = 123 bits (308), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/94 (72%), Positives = 71/94 (76%), Gaps = 0/94 (0%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
MV HGFL KA PTV+TG VGVAAYEAL K+ KAPLR ATV AWGIR+ REAER AG
Sbjct 18 MVLHGFLVKAAPTVLTGVVGVAAYEALCKVAGKAPLRKATVIATAWGIRVVREAERTAGA 77
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDD 94
SAEQARL ADV+AEA ERAGEE PL V S D
Sbjct 78 SAEQARLTVADVVAEAKERAGEEAGPLTVVDSGD 111
>gi|240171334|ref|ZP_04749993.1| hypothetical protein MkanA1_18626 [Mycobacterium kansasii ATCC
12478]
Length=99
Score = 121 bits (303), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 62/95 (66%), Positives = 70/95 (74%), Gaps = 0/95 (0%)
Query 3 WHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGESA 62
WHG LAK+VPT+VTG VG AAYEALRK VK PLR ATV+ AWG+R R AERKA +S+
Sbjct 4 WHGLLAKSVPTLVTGVVGAAAYEALRKTAVKVPLREATVATTAWGLRGLRTAERKAQQSS 63
Query 63 EQARLMFADVLAEASERAGEEVPPLAVAGSDDGHD 97
EQARL ADV+AEA ER GE+V VAG HD
Sbjct 64 EQARLTLADVIAEAKERIGEDVSLSQVAGGCHDHD 98
>gi|296164642|ref|ZP_06847208.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295899950|gb|EFG79390.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=90
Score = 117 bits (293), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 65/91 (72%), Positives = 71/91 (79%), Gaps = 4/91 (4%)
Query 4 HGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGESAE 63
HGF+AKA+PTV+TG VGVAAYEAL KAP R ATV AWGIR+AREAERKAG SAE
Sbjct 2 HGFVAKAMPTVMTGVVGVAAYEAL----AKAPWRKATVVATAWGIRVAREAERKAGRSAE 57
Query 64 QARLMFADVLAEASERAGEEVPPLAVAGSDD 94
QARL ADV+AEA ERAG E PL VAGS +
Sbjct 58 QARLTVADVMAEARERAGSEAAPLTVAGSGE 88
>gi|183981452|ref|YP_001849743.1| hypothetical protein MMAR_1430 [Mycobacterium marinum M]
gi|183174778|gb|ACC39888.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=98
Score = 114 bits (286), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 64/98 (66%), Positives = 72/98 (74%), Gaps = 0/98 (0%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M W GF+ K V TG VGV AYEALRK + KAPLR A+V+ AWG+R+ REAERKAG+
Sbjct 1 MAWQGFVVKGAHAVGTGVVGVVAYEALRKTLAKAPLRKASVATTAWGLRVVREAERKAGQ 60
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
SAEQARL ADV+AEA ERAGEEV PL V S HDH
Sbjct 61 SAEQARLTVADVMAEAKERAGEEVAPLTVVDSGSDHDH 98
>gi|296165052|ref|ZP_06847606.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295899584|gb|EFG79036.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=139
Score = 103 bits (258), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 64/97 (66%), Positives = 70/97 (73%), Gaps = 0/97 (0%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M WHGFL KA PTVVTG VGVAAYEA+R V K PLR ATV+V AWGIR AREA+R A
Sbjct 42 MAWHGFLMKAAPTVVTGVVGVAAYEAVRTAVAKVPLRTATVAVTAWGIRAAREAQRTAET 101
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHD 97
+ EQ RL ADV+AEA ERAGE+ P V DGHD
Sbjct 102 NTEQIRLTVADVVAEARERAGEDTEPAIVTSPGDGHD 138
>gi|118467605|ref|YP_890281.1| hypothetical protein MSMEG_6059 [Mycobacterium smegmatis str.
MC2 155]
gi|118168892|gb|ABK69788.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=94
Score = 97.1 bits (240), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 62/98 (64%), Positives = 67/98 (69%), Gaps = 6/98 (6%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
MVWHG L KA TVVTGAVGVAAYE LRK V KAP+R A V+ A +R AR+AE
Sbjct 1 MVWHGLLVKAATTVVTGAVGVAAYEGLRKAVAKAPVREAAVATTALALRGARKAE----V 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
AE ARL ADV+AEA ER GEEVPP A D GHDH
Sbjct 57 GAESARLKVADVMAEARERIGEEVPPPAA--GDAGHDH 92
>gi|15842859|ref|NP_337896.1| hypothetical protein MT3369 [Mycobacterium tuberculosis CDC1551]
gi|13883189|gb|AAK47710.1| hypothetical protein MT3369 [Mycobacterium tuberculosis CDC1551]
Length=145
Score = 91.3 bits (225), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 54/98 (56%), Positives = 62/98 (64%), Gaps = 5/98 (5%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M FLAKA TV+TG GV AYE L+K KAPLR VS AA G+R R+AE E
Sbjct 53 MAIQVFLAKATTTVITGLAGVTAYEILKKAAAKAPLRQTAVSAAALGLRGTRKAE----E 108
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
+AE ARL ADV+AEA ER GEE P A++ D HDH
Sbjct 109 AAESARLKVADVMAEARERIGEESPTPAISDLHD-HDH 145
>gi|15610405|ref|NP_217786.1| hypothetical protein Rv3269 [Mycobacterium tuberculosis H37Rv]
gi|31794449|ref|NP_856942.1| hypothetical protein Mb3297 [Mycobacterium bovis AF2122/97]
gi|121639158|ref|YP_979382.1| hypothetical protein BCG_3298 [Mycobacterium bovis BCG str. Pasteur
1173P2]
65 more sequence titles
Length=93
Score = 90.5 bits (223), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 54/98 (56%), Positives = 61/98 (63%), Gaps = 5/98 (5%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M FLAKA TV+TG GV AYE L+K KAPLR VS AA G+R RKA E
Sbjct 1 MAIQVFLAKATTTVITGLAGVTAYEILKKAAAKAPLRQTAVSAAALGLR----GTRKAEE 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
+AE ARL ADV+AEA ER GEE P A++ D HDH
Sbjct 57 AAESARLKVADVMAEARERIGEESPTPAISDLHD-HDH 93
>gi|342861473|ref|ZP_08718120.1| hypothetical protein MCOL_21411 [Mycobacterium colombiense CECT
3035]
gi|342130962|gb|EGT84251.1| hypothetical protein MCOL_21411 [Mycobacterium colombiense CECT
3035]
Length=93
Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/98 (56%), Positives = 63/98 (65%), Gaps = 5/98 (5%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M +G LAKA TVVTG VGV AYE +RK V KAPL V A G+R R+AE E
Sbjct 1 MAVYGLLAKAAGTVVTGLVGVTAYEVVRKAVAKAPLHETAVKGAELGLRGTRKAE----E 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
+AE ARL ADV+AEA ER GEE P ++A + D HDH
Sbjct 57 AAESARLRLADVMAEARERIGEEAPTPSIADTHD-HDH 93
>gi|308232401|ref|ZP_07415935.2| hypothetical protein TMAG_02722 [Mycobacterium tuberculosis SUMu001]
gi|308370210|ref|ZP_07420657.2| hypothetical protein TMBG_01964 [Mycobacterium tuberculosis SUMu002]
gi|308373690|ref|ZP_07433333.2| hypothetical protein TMEG_03666 [Mycobacterium tuberculosis SUMu005]
9 more sequence titles
Length=89
Score = 89.4 bits (220), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/93 (57%), Positives = 61/93 (66%), Gaps = 5/93 (5%)
Query 6 FLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGESAEQA 65
FLAKA TV+TG GV AYE L+K KAPLR VS AA G+R R+AE E+AE A
Sbjct 2 FLAKATTTVITGLAGVTAYEILKKAAAKAPLRQTAVSAAALGLRGTRKAE----EAAESA 57
Query 66 RLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
RL ADV+AEA ER GEE P A++ D HDH
Sbjct 58 RLKVADVMAEARERIGEESPTPAISDLHD-HDH 89
>gi|41409481|ref|NP_962317.1| hypothetical protein MAP3383 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41398312|gb|AAS05933.1| hypothetical protein MAP_3383 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336459654|gb|EGO38589.1| Protein of unknown function (DUF1490) [Mycobacterium avium subsp.
paratuberculosis S397]
Length=93
Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/98 (56%), Positives = 62/98 (64%), Gaps = 7/98 (7%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M +G LAKA TVVTG VGV AYE +RK V KAPL V A G+R R+AE E
Sbjct 1 MAVYGLLAKAAGTVVTGLVGVTAYEVVRKAVAKAPLHETAVKGAELGLRGTRKAE----E 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
+AE ARL ADV+AEA ER GEE P ++A D HDH
Sbjct 57 AAESARLKLADVMAEARERIGEETPTPSIA---DTHDH 91
>gi|118463473|ref|YP_883375.1| hypothetical protein MAV_4234 [Mycobacterium avium 104]
gi|254776669|ref|ZP_05218185.1| hypothetical protein MaviaA2_18651 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|118164760|gb|ABK65657.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=93
Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/98 (56%), Positives = 62/98 (64%), Gaps = 7/98 (7%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M +G LAKA TVVTG VGV AYE +RK V KAPL V A G+R R+AE E
Sbjct 1 MAVYGLLAKAAGTVVTGLVGVTAYEVVRKAVAKAPLHETAVKGAELGLRGTRKAE----E 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
+AE ARL ADV+AEA ER GEE P ++A D HDH
Sbjct 57 AAESARLKLADVMAEARERIGEEAPTPSIA---DTHDH 91
>gi|254821358|ref|ZP_05226359.1| hypothetical protein MintA_15582 [Mycobacterium intracellulare
ATCC 13950]
Length=93
Score = 85.1 bits (209), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/98 (55%), Positives = 61/98 (63%), Gaps = 5/98 (5%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M +G LAKA TVVTG VGV AYE +RK V KAPL V A G+R R AE E
Sbjct 1 MAVYGLLAKAAGTVVTGLVGVTAYEVVRKAVAKAPLHETAVKGAELGLRGTRRAE----E 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
+AE ARL ADV+AEA ER GEE P ++A D H+H
Sbjct 57 AAESARLRLADVMAEARERIGEEAPTPSIAEPHD-HEH 93
>gi|333991660|ref|YP_004524274.1| hypothetical protein JDM601_3020 [Mycobacterium sp. JDM601]
gi|333487628|gb|AEF37020.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=93
Score = 84.7 bits (208), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/98 (56%), Positives = 60/98 (62%), Gaps = 5/98 (5%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M G LAKA TV TG VGV+AYE +RK + KAPL A V W +R R AE E
Sbjct 1 MAVQGLLAKAATTVFTGLVGVSAYEVVRKALEKAPLHEAAVIATEWSLRGTRRAE----E 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
AE ARL ADV+AEA ER GEE P AVA + D HDH
Sbjct 57 VAESARLKVADVVAEARERIGEEATPPAVAVAHD-HDH 93
>gi|226359852|ref|YP_002777630.1| hypothetical protein ROP_04380 [Rhodococcus opacus B4]
gi|226238337|dbj|BAH48685.1| hypothetical protein [Rhodococcus opacus B4]
Length=94
Score = 84.3 bits (207), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 56/98 (58%), Positives = 63/98 (65%), Gaps = 4/98 (4%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M H L KA TVVTGAVGVAAY A RK+V KAPL A V+ AWG+R AR+AE E
Sbjct 1 MALHILLVKAASTVVTGAVGVAAYNAARKVVAKAPLHEAAVTATAWGLRGARKAE----E 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
AE ARL +DV+AEA R GEEVPP GH+H
Sbjct 57 GAESARLKVSDVVAEARGRIGEEVPPPPDPEIGHGHNH 94
>gi|240172498|ref|ZP_04751157.1| hypothetical protein MkanA1_24500 [Mycobacterium kansasii ATCC
12478]
Length=93
Score = 84.0 bits (206), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 50/98 (52%), Positives = 59/98 (61%), Gaps = 5/98 (5%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M AKA TV+TG GV AYE L+K+ KAPL VS A G+R R+AE E
Sbjct 1 MAVQAIFAKAATTVITGLAGVTAYEVLKKVAAKAPLHQTAVSAAELGLRGTRKAE----E 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
+AE ARL +DV+AEA ER GEE P AV + D HDH
Sbjct 57 AAESARLKISDVMAEARERVGEEAPTPAVGHAHD-HDH 93
>gi|183982550|ref|YP_001850841.1| hypothetical protein MMAR_2537 [Mycobacterium marinum M]
gi|183175876|gb|ACC40986.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=101
Score = 80.9 bits (198), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/85 (57%), Positives = 55/85 (65%), Gaps = 4/85 (4%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M W KAV TVVTGAVGVAAYE V+ P R ATV A G+R R RK E
Sbjct 1 MAWQVLAGKAVHTVVTGAVGVAAYEVF----VRVPWRKATVGATALGLRAGRTTGRKTKE 56
Query 61 SAEQARLMFADVLAEASERAGEEVP 85
+AE+A+L ADVLAEA+ER GE+VP
Sbjct 57 AAERAQLAVADVLAEAAERIGEQVP 81
>gi|118618085|ref|YP_906417.1| hypothetical protein MUL_2615 [Mycobacterium ulcerans Agy99]
gi|183981294|ref|YP_001849585.1| hypothetical protein MMAR_1272 [Mycobacterium marinum M]
gi|118570195|gb|ABL04946.1| conserved protein [Mycobacterium ulcerans Agy99]
gi|183174620|gb|ACC39730.1| conserved protein [Mycobacterium marinum M]
Length=92
Score = 78.6 bits (192), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 49/98 (50%), Positives = 58/98 (60%), Gaps = 6/98 (6%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
MV +G AK VV G VG AAY+ +RK KAPLR VS A +R R+AE E
Sbjct 1 MVAYGLFAKLGTLVVHGVVGAAAYDVVRKAAKKAPLRQTAVSAAELSLRGTRKAE----E 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
+AE ARL +DV++EA ER GEE P AV D HDH
Sbjct 57 AAESARLKISDVMSEARERIGEEAPTPAVGAHD--HDH 92
>gi|296168935|ref|ZP_06850604.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295896404|gb|EFG76057.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=99
Score = 78.6 bits (192), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/98 (49%), Positives = 59/98 (61%), Gaps = 7/98 (7%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
++ HG KA + TG G AAY+ +RK V KAPL VS A G+R R+AE E
Sbjct 7 VIAHGAFGKAAAWLATGVAGAAAYDLVRKAVAKAPLHETAVSAAELGLRGTRKAE----E 62
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
+AE ARL +DV+AEA ER GEE P AV +GHDH
Sbjct 63 AAESARLKISDVMAEARERVGEEAPTPAVG---NGHDH 97
>gi|15827316|ref|NP_301579.1| hypothetical protein ML0748 [Mycobacterium leprae TN]
gi|221229794|ref|YP_002503210.1| hypothetical protein MLBr_00748 [Mycobacterium leprae Br4923]
gi|13092865|emb|CAC30257.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219932901|emb|CAR70842.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=92
Score = 77.8 bits (190), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 48/95 (51%), Positives = 56/95 (59%), Gaps = 5/95 (5%)
Query 4 HGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGESAE 63
G LAKA V+TG GV AYE LRK V K PL VS G+R +R+AE E+AE
Sbjct 3 QGLLAKAATMVITGLTGVTAYEMLRKAVTKVPLHQIAVSALELGLRGSRKAE----EAAE 58
Query 64 QARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
ARL ADV+AEA ER G+E AV+ HDH
Sbjct 59 SARLKLADVMAEARERIGKETTAPAVSDIHQ-HDH 92
>gi|340627004|ref|YP_004745456.1| hypothetical protein MCAN_20131 [Mycobacterium canettii CIPT
140010059]
gi|340005194|emb|CCC44345.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=90
Score = 77.0 bits (188), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 46/85 (55%), Positives = 55/85 (65%), Gaps = 4/85 (4%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
MV H L KA V+TG VGV+AYE LRK + AP+R A+V+V WG+R R AE
Sbjct 1 MVTHELLVKAAGAVLTGLVGVSAYETLRKALGTAPIRRASVTVMEWGLRGTRRAE----V 56
Query 61 SAEQARLMFADVLAEASERAGEEVP 85
+AE ARL ADV+AEA R GEE P
Sbjct 57 AAESARLTVADVVAEARGRIGEEAP 81
>gi|183984843|ref|YP_001853134.1| hypothetical protein MMAR_4875 [Mycobacterium marinum M]
gi|183178169|gb|ACC43279.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=102
Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/86 (59%), Positives = 58/86 (68%), Gaps = 4/86 (4%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
+WHG KAVP+VVTG VGVA YEAL KAP R+ATV+ WG+R AR ERK +
Sbjct 2 TLWHGLWTKAVPSVVTGVVGVATYEAL----AKAPWRSATVTATVWGLRTARTTERKTKQ 57
Query 61 SAEQARLMFADVLAEASERAGEEVPP 86
+ E+ RL ADVLAEA ER G EV P
Sbjct 58 ATERVRLAAADVLAEAVERVGAEVAP 83
>gi|183982159|ref|YP_001850450.1| hypothetical protein MMAR_2146 [Mycobacterium marinum M]
gi|183175485|gb|ACC40595.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=93
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 52/98 (54%), Positives = 64/98 (66%), Gaps = 5/98 (5%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
M+ HG LAKA TVVTG VGV+AYE LRK + AP+ A V+ G+R R AE+
Sbjct 1 MIVHGLLAKAGATVVTGVVGVSAYELLRKALGSAPVHRAAVATTEVGLRGTRGAEK---- 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
AE ARL +DV+AEA ER GEE PP A+ G+ D H+H
Sbjct 57 VAESARLKVSDVVAEARERIGEEAPPPAI-GNHDQHEH 93
>gi|296165061|ref|ZP_06847615.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295899593|gb|EFG79045.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=85
Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 42/81 (52%), Positives = 50/81 (62%), Gaps = 6/81 (7%)
Query 16 TGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGESAEQARLMFADVLAE 75
TG VGV+AYE LRK V AP+ A V+V WG+R R AE +AE ARL ADV+AE
Sbjct 3 TGLVGVSAYEVLRKAVGTAPVHRAAVTVTEWGLRGTRSAE----VAAESARLKVADVVAE 58
Query 76 ASERAGEEV--PPLAVAGSDD 94
A R G++ PP A A DD
Sbjct 59 ARGRIGDDAPRPPAAKADDDD 79
>gi|15609130|ref|NP_216509.1| hypothetical protein Rv1993c [Mycobacterium tuberculosis H37Rv]
gi|15841475|ref|NP_336512.1| hypothetical protein MT2049 [Mycobacterium tuberculosis CDC1551]
gi|31793173|ref|NP_855666.1| hypothetical protein Mb2016c [Mycobacterium bovis AF2122/97]
64 more sequence titles
Length=90
Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/85 (53%), Positives = 55/85 (65%), Gaps = 4/85 (4%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
MV H L KA V+TG VGV+AYE LRK + AP+R A+V+V WG+R R+A
Sbjct 1 MVTHELLVKAAGAVLTGLVGVSAYETLRKALGTAPIRRASVTVMEWGLR----GTRRAEA 56
Query 61 SAEQARLMFADVLAEASERAGEEVP 85
+AE ARL ADV+AEA R GEE P
Sbjct 57 AAESARLTVADVVAEARGRIGEEAP 81
>gi|289570093|ref|ZP_06450320.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289750575|ref|ZP_06509953.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289754099|ref|ZP_06513477.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289543847|gb|EFD47495.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289691162|gb|EFD58591.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289694686|gb|EFD62115.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
Length=90
Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 59/94 (63%), Gaps = 4/94 (4%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
MV H L KA V+TG VGV+AYE +RK + AP+R A+V+V WG+R R+A
Sbjct 1 MVTHELLVKAAGAVLTGLVGVSAYETVRKALGTAPIRRASVTVMEWGLR----GTRRAEA 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDD 94
+AE ARL ADV+AEA R GEE P A A D+
Sbjct 57 AAESARLTVADVVAEARGRIGEEAPLPAGARVDE 90
>gi|167970478|ref|ZP_02552755.1| hypothetical protein MtubH3_21593 [Mycobacterium tuberculosis
H37Ra]
Length=101
Score = 62.0 bits (149), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/85 (52%), Positives = 55/85 (65%), Gaps = 4/85 (4%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
+V H L KA V+TG VGV+AYE LRK + AP+R A+V+V WG+R R+A
Sbjct 12 VVTHELLVKAAGAVLTGLVGVSAYETLRKALGTAPIRRASVTVMEWGLR----GTRRAEA 67
Query 61 SAEQARLMFADVLAEASERAGEEVP 85
+AE ARL ADV+AEA R GEE P
Sbjct 68 AAESARLTVADVVAEARGRIGEEAP 92
>gi|296164259|ref|ZP_06846848.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295900385|gb|EFG79802.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=93
Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 42/83 (51%), Positives = 54/83 (66%), Gaps = 4/83 (4%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
MV HG LAKA TV TG VGV+AYE +R+ V AP+ A V+ WG+R R+A
Sbjct 1 MVAHGLLAKATRTVFTGLVGVSAYEVVRRAVGNAPVHRAAVTATEWGLR----GTRRAEV 56
Query 61 SAEQARLMFADVLAEASERAGEE 83
+AE ARL ADV+AEA +R G++
Sbjct 57 AAEAARLKVADVVAEARDRIGDD 79
>gi|118616275|ref|YP_904607.1| hypothetical protein MUL_0424 [Mycobacterium ulcerans Agy99]
gi|118568385|gb|ABL03136.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=102
Score = 59.7 bits (143), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/87 (59%), Positives = 60/87 (69%), Gaps = 4/87 (4%)
Query 2 VWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGES 61
+WHG KAVP+VVTG VGVA YEAL KAP R+ATV+ AWG+R AR ER+ ++
Sbjct 3 LWHGLWTKAVPSVVTGVVGVATYEAL----AKAPWRSATVTATAWGLRTARTTERRTKQA 58
Query 62 AEQARLMFADVLAEASERAGEEVPPLA 88
E+ RL ADVLAEA ER G EV P A
Sbjct 59 TERVRLAAADVLAEAVERVGAEVAPPA 85
>gi|333919412|ref|YP_004492993.1| hypothetical protein AS9A_1744 [Amycolicicoccus subflavus DQS3-9A1]
gi|333481633|gb|AEF40193.1| hypothetical protein AS9A_1744 [Amycolicicoccus subflavus DQS3-9A1]
Length=94
Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/96 (50%), Positives = 55/96 (58%), Gaps = 5/96 (5%)
Query 4 HG-FLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGESA 62
HG L KA+ TVVTG VGVAAY R + K P R VS A GI+ AR+AE E A
Sbjct 3 HGVLLGKALGTVVTGVVGVAAYNGARWVAKKTPTREIAVSATALGIKGARKAE----EGA 58
Query 63 EQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
E RL AD++AEA R GEE PP + GH H
Sbjct 59 ENVRLTAADIVAEARGRVGEEAPPPPSHPAGHGHAH 94
>gi|289749496|ref|ZP_06508874.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289690083|gb|EFD57512.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=54
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/30 (97%), Positives = 30/30 (100%), Gaps = 0/30 (0%)
Query 14 VVTGAVGVAAYEALRKMVVKAPLRAATVSV 43
+VTGAVGVAAYEALRKMVVKAPLRAATVSV
Sbjct 1 MVTGAVGVAAYEALRKMVVKAPLRAATVSV 30
>gi|325000600|ref|ZP_08121712.1| hypothetical protein PseP1_17612 [Pseudonocardia sp. P1]
Length=88
Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 46/80 (58%), Gaps = 4/80 (5%)
Query 6 FLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGESAEQA 65
+ KA V +G G AY+ ++++ +R A V+V WG+R AR AE AE+A
Sbjct 1 MVGKAAGLVASGLAGAVAYDGVKRVARSGAVREAAVTVTGWGLRGARAAE----TGAEKA 56
Query 66 RLMFADVLAEASERAGEEVP 85
RL AD+++EA R GEE P
Sbjct 57 RLATADIVSEARGRIGEEAP 76
>gi|296164386|ref|ZP_06846962.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295900266|gb|EFG79696.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=103
Score = 43.1 bits (100), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 38/81 (47%), Positives = 48/81 (60%), Gaps = 4/81 (4%)
Query 2 VWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGES 61
+W G LAKAVPTVVT A + + KAP R TV A G++ R RK ++
Sbjct 4 MWQGLLAKAVPTVVT----GVVGAAAYEALAKAPWRKVTVGATAVGLQATRTTARKTKQA 59
Query 62 AEQARLMFADVLAEASERAGE 82
AE+ARL ADV AEA++R GE
Sbjct 60 AEKARLATADVFAEAADRLGE 80
>gi|258653982|ref|YP_003203138.1| hypothetical protein Namu_3853 [Nakamurella multipartita DSM
44233]
gi|258557207|gb|ACV80149.1| hypothetical protein Namu_3853 [Nakamurella multipartita DSM
44233]
Length=94
Score = 37.4 bits (85), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 32/98 (33%), Positives = 46/98 (47%), Gaps = 4/98 (4%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGE 60
MV A+A +++G G + L++ L A V+V A +R R E
Sbjct 1 MVTGHLAARAAGMLISGVAGAIVVDRLKQRSTGRGLNQAAVAVTALALRGKRRVE----A 56
Query 61 SAEQARLMFADVLAEASERAGEEVPPLAVAGSDDGHDH 98
AE RL DV+A+A E+ GE+ PP A + HDH
Sbjct 57 GAENLRLGAGDVVAQAREKIGEQAPPPAQSAQPHEHDH 94
>gi|86139077|ref|ZP_01057648.1| UvrABC system protein C [Roseobacter sp. MED193]
gi|85824308|gb|EAQ44512.1| UvrABC system protein C [Roseobacter sp. MED193]
Length=643
Score = 35.8 bits (81), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 28/73 (39%), Positives = 39/73 (54%), Gaps = 6/73 (8%)
Query 25 EALRKMVVKAPLRAATVSVAAWGIRLAREA-ERKAGESAEQARLMFADVLAEASERAGEE 83
+A RK+ + PLR + A +R ARE+ R+ ESA QARL+ LAEA G +
Sbjct 363 KAERKVEILVPLRGEKTELVAGAVRNARESLARRMAESATQARLLKG--LAEA---FGLK 417
Query 84 VPPLAVAGSDDGH 96
PP + D+ H
Sbjct 418 APPQRIEVYDNSH 430
>gi|147792395|emb|CAN70278.1| hypothetical protein VITISV_015612 [Vitis vinifera]
Length=853
Score = 35.4 bits (80), Expect = 3.1, Method: Composition-based stats.
Identities = 29/84 (35%), Positives = 41/84 (49%), Gaps = 10/84 (11%)
Query 6 FLAKAVPTVVTGAVGVAAY--EALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGESAE 63
F+AK P V+ GV Y E LRK +V V R+++E ++K+ ES
Sbjct 26 FVAKRKPLVLVVPEGVKGYGWENLRKAIVSVLDFFVQVE------RVSKEKQKKSQESKG 79
Query 64 QAR--LMFADVLAEASERAGEEVP 85
R +ADV+AE R G E+P
Sbjct 80 MYRGEWSYADVVAEKGPRNGVEMP 103
>gi|221485569|gb|EEE23850.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length=2535
Score = 35.0 bits (79), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 26/77 (34%), Positives = 38/77 (50%), Gaps = 10/77 (12%)
Query 19 VGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAER----KAGESAEQARLMFADVLA 74
G+ +ALR +AP S+++ + L RE ER + GE++E+ RLMF +A
Sbjct 1923 FGLMERDALRSFAFEAPFALPVSSLSSPAV-LPREGERPPGDEGGETSEEERLMFRGDVA 1981
Query 75 EA-----SERAGEEVPP 86
A S R G PP
Sbjct 1982 SAKNAKKSNRCGRAPPP 1998
>gi|83942317|ref|ZP_00954778.1| hypothetical protein EE36_14792 [Sulfitobacter sp. EE-36]
gi|83846410|gb|EAP84286.1| hypothetical protein EE36_14792 [Sulfitobacter sp. EE-36]
Length=183
Score = 34.3 bits (77), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 27/104 (26%), Positives = 51/104 (50%), Gaps = 6/104 (5%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAA----TVSVAAWGIRLAREAER 56
M L +P + AV V A + + VV++P+ + +A+G+R+A + E
Sbjct 1 MKLRALLICLLPVLPLAAVPVFAQQQSERRVVQSPILTIDSDRVFNESAFGLRVADDLET 60
Query 57 KAGESAEQARLMFADVLAEASERAGE--EVPPLAVAGSDDGHDH 98
++ E + + RL+ AD+ AE + + ++ P A +G D D
Sbjct 61 QSAEISAENRLIEADLKAEERKLTDQRSKLSPDAFSGLADAFDE 104
>gi|83953536|ref|ZP_00962257.1| hypothetical protein NAS141_04913 [Sulfitobacter sp. NAS-14.1]
gi|83841481|gb|EAP80650.1| hypothetical protein NAS141_04913 [Sulfitobacter sp. NAS-14.1]
Length=183
Score = 33.9 bits (76), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 27/104 (26%), Positives = 50/104 (49%), Gaps = 6/104 (5%)
Query 1 MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAA----TVSVAAWGIRLAREAER 56
M L +P + AV V A + + VV++P+ +A+G+R+A + E
Sbjct 1 MKLRALLICLLPVLPLAAVPVFAQQQSERRVVQSPILTIDSDRVFKESAFGLRVADDVET 60
Query 57 KAGESAEQARLMFADVLAEASERAGE--EVPPLAVAGSDDGHDH 98
++ E + + RL+ AD+ AE + + ++ P A +G D D
Sbjct 61 QSAEISAENRLIEADLKAEERKLTDQRSKLSPDAFSGLADAFDE 104
>gi|254466793|ref|ZP_05080204.1| excinuclease ABC, C subunit [Rhodobacterales bacterium Y4I]
gi|206687701|gb|EDZ48183.1| excinuclease ABC, C subunit [Rhodobacterales bacterium Y4I]
Length=633
Score = 33.9 bits (76), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 26/70 (38%), Positives = 37/70 (53%), Gaps = 6/70 (8%)
Query 28 RKMVVKAPLRAATVSVAAWGIRLAREA-ERKAGESAEQARLMFADVLAEASERAGEEVPP 86
RK+ + P R + A+ +R ARE+ R+ ESA QA+L+ LAEA G E PP
Sbjct 356 RKVELLVPQRGEKTELVAFAVRNARESLARRMAESATQAKLLRG--LAEA---FGLEGPP 410
Query 87 LAVAGSDDGH 96
+ D+ H
Sbjct 411 QRIEVYDNSH 420
>gi|152964259|ref|YP_001360043.1| hypothetical protein Krad_0289 [Kineococcus radiotolerans SRS30216]
gi|151358776|gb|ABS01779.1| hypothetical protein Krad_0289 [Kineococcus radiotolerans SRS30216]
Length=102
Score = 33.9 bits (76), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 25/65 (39%), Positives = 35/65 (54%), Gaps = 8/65 (12%)
Query 36 LRAATVSVAAWGIRLAREAERKAGESAEQARLMFADVLAEASERAGEE--VPPLAVAGSD 93
LR V+ A GI+ +R AE E+ RL D++A+A + GE+ VPP AG D
Sbjct 44 LRRLAVNGAKAGIKASRAAE----TGVERLRLGTGDIVAQAYDELGEQVPVPPTPQAGHD 99
Query 94 DGHDH 98
H+H
Sbjct 100 --HEH 102
>gi|186685628|ref|YP_001868824.1| secretion protein HlyD [Nostoc punctiforme PCC 73102]
gi|186468080|gb|ACC83881.1| secretion protein HlyD family protein [Nostoc punctiforme PCC
73102]
Length=474
Score = 33.5 bits (75), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 19/47 (41%), Positives = 28/47 (60%), Gaps = 2/47 (4%)
Query 2 VWHGFLAKAVPTVVT-GAVGVAAYEALRKMVVKAPLRAATVSVAAWG 47
+W G +A AVP V+ G +G A E LRK+ P+ +T S++A G
Sbjct 22 IWWG-IAVAVPIVIAAGILGTAKIEQLRKLTTSVPVMPSTNSISAVG 67
Lambda K H
0.317 0.128 0.368
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130183417542
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40