BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2009
Length=80
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609146|ref|NP_216525.1| hypothetical protein Rv2009 [Mycoba... 156 1e-36
gi|31793189|ref|NP_855682.1| hypothetical protein Mb2032 [Mycoba... 154 6e-36
gi|167970462|ref|ZP_02552739.1| hypothetical protein MtubH3_2149... 145 1e-33
gi|15608698|ref|NP_216076.1| hypothetical protein Rv1560 [Mycoba... 71.2 5e-11
gi|289761723|ref|ZP_06521101.1| conserved hypothetical protein [... 67.8 5e-10
gi|167967386|ref|ZP_02549663.1| hypothetical protein MtubH3_0485... 67.8 5e-10
gi|333989208|ref|YP_004521822.1| hypothetical protein JDM601_056... 66.2 1e-09
gi|297563900|ref|YP_003682873.1| Protein of unknown function DUF... 62.4 2e-08
gi|284990201|ref|YP_003408755.1| hypothetical protein Gobs_1668 ... 56.6 1e-06
gi|296166367|ref|ZP_06848802.1| toxin-antitoxin system [Mycobact... 53.5 1e-05
gi|336179604|ref|YP_004584979.1| hypothetical protein FsymDg_378... 52.4 2e-05
gi|159184572|ref|NP_354029.2| hypothetical protein Atu1005 [Agro... 44.3 0.007
gi|289760758|ref|ZP_06520136.1| conserved hypothetical protein [... 41.2 0.049
gi|15840036|ref|NP_335073.1| hypothetical protein MT0662.1 [Myco... 41.2 0.051
gi|340625652|ref|YP_004744104.1| hypothetical protein MCAN_06321... 40.8 0.064
gi|88811565|ref|ZP_01126819.1| hypothetical protein NB231_04150 ... 40.8 0.065
gi|167969063|ref|ZP_02551340.1| hypothetical protein MtubH3_1395... 40.4 0.077
gi|118616508|ref|YP_904840.1| hypothetical protein MUL_0716 [Myc... 39.7 0.13
gi|15828026|ref|NP_302289.1| hypothetical protein ML1911A [Mycob... 39.7 0.15
gi|183980986|ref|YP_001849277.1| hypothetical protein MMAR_0966 ... 39.7 0.16
gi|284038009|ref|YP_003387939.1| hypothetical protein Slin_3129 ... 39.7 0.17
gi|254231784|ref|ZP_04925111.1| conserved hypothetical protein [... 39.3 0.18
gi|325292386|ref|YP_004278250.1| hypothetical protein AGROH133_0... 39.3 0.19
gi|320105489|ref|YP_004181079.1| hypothetical protein AciPR4_024... 39.3 0.19
gi|335034130|ref|ZP_08527491.1| hypothetical protein AGRO_1470 [... 38.9 0.24
gi|258592452|emb|CBE68761.1| conserved protein of unknown functi... 38.9 0.26
gi|327190277|gb|EGE57377.1| hypothetical protein RHECNPAF_439009... 37.7 0.50
gi|217979807|ref|YP_002363954.1| hypothetical protein Msil_3709 ... 37.7 0.60
gi|337281344|ref|YP_004620816.1| hypothetical protein Rta_36810 ... 37.4 0.77
gi|115524558|ref|YP_781469.1| hypothetical protein RPE_2551 [Rho... 37.0 0.94
gi|299135986|ref|ZP_07029170.1| Protein of unknown function DUF2... 36.6 1.1
gi|218661252|ref|ZP_03517182.1| hypothetical protein RetlI_17778... 36.6 1.1
gi|313672675|ref|YP_004050786.1| hypothetical protein Calni_0712... 36.6 1.2
gi|304318060|ref|YP_003853205.1| hypothetical protein Tthe_2672 ... 36.6 1.3
gi|345013747|ref|YP_004816101.1| hypothetical protein Strvi_6366... 36.6 1.4
gi|297154825|gb|ADI04537.1| hypothetical protein SBI_01416 [Stre... 36.2 1.5
gi|345302429|ref|YP_004824331.1| hypothetical protein Rhom172_05... 36.2 1.5
gi|206889882|ref|YP_002249459.1| hypothetical protein THEYE_A166... 36.2 1.8
gi|240169407|ref|ZP_04748066.1| hypothetical protein MkanA1_0884... 35.8 2.2
gi|333992735|ref|YP_004525349.1| hypothetical protein JDM601_409... 35.8 2.3
gi|268316119|ref|YP_003289838.1| hypothetical protein Rmar_0549 ... 35.0 3.4
gi|296137154|ref|YP_003644396.1| Protein of unknown function DUF... 34.7 5.0
gi|88813609|ref|ZP_01128840.1| hypothetical protein NB231_12876 ... 34.7 5.2
gi|294341452|emb|CAZ89869.1| hypothetical protein THI_3273 [Thio... 34.7 5.4
gi|163759143|ref|ZP_02166229.1| hypothetical protein HPDFL43_052... 34.3 5.5
gi|77165011|ref|YP_343536.1| hypothetical protein Noc_1521 [Nitr... 34.3 6.0
gi|338533812|ref|YP_004667146.1| excinuclease ABC subunit B [Myx... 33.9 8.1
>gi|15609146|ref|NP_216525.1| hypothetical protein Rv2009 [Mycobacterium tuberculosis H37Rv]
gi|15841491|ref|NP_336528.1| hypothetical protein MT2064.1 [Mycobacterium tuberculosis CDC1551]
gi|148661823|ref|YP_001283346.1| hypothetical protein MRA_2025 [Mycobacterium tuberculosis H37Ra]
44 more sequence titles
Length=80
Score = 156 bits (394), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 79/80 (99%), Positives = 80/80 (100%), Gaps = 0/80 (0%)
Query 1 VYSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFD 60
+YSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFD
Sbjct 1 MYSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFD 60
Query 61 FSNDEIESFSDTDRKLADES 80
FSNDEIESFSDTDRKLADES
Sbjct 61 FSNDEIESFSDTDRKLADES 80
>gi|31793189|ref|NP_855682.1| hypothetical protein Mb2032 [Mycobacterium bovis AF2122/97]
gi|121637893|ref|YP_978116.1| hypothetical protein BCG_2026 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224990387|ref|YP_002645074.1| hypothetical protein JTY_2021 [Mycobacterium bovis BCG str. Tokyo
172]
gi|31618781|emb|CAD96885.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
gi|121493540|emb|CAL72014.1| Conserved hypothetical protein [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224773500|dbj|BAH26306.1| hypothetical protein JTY_2021 [Mycobacterium bovis BCG str. Tokyo
172]
gi|341601930|emb|CCC64604.1| conserved hypothetical protein [Mycobacterium bovis BCG str.
Moreau RDJ]
Length=80
Score = 154 bits (388), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 78/80 (98%), Positives = 80/80 (100%), Gaps = 0/80 (0%)
Query 1 VYSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFD 60
+YSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFD
Sbjct 1 MYSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFD 60
Query 61 FSNDEIESFSDTDRKLADES 80
FS+DEIESFSDTDRKLADES
Sbjct 61 FSDDEIESFSDTDRKLADES 80
>gi|167970462|ref|ZP_02552739.1| hypothetical protein MtubH3_21498 [Mycobacterium tuberculosis
H37Ra]
gi|254551032|ref|ZP_05141479.1| antitoxin [Mycobacterium tuberculosis '98-R604 INH-RIF-EM']
gi|308232003|ref|ZP_07414583.2| antitoxin [Mycobacterium tuberculosis SUMu001]
23 more sequence titles
Length=75
Score = 145 bits (367), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 74/75 (99%), Positives = 75/75 (100%), Gaps = 0/75 (0%)
Query 6 VSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSNDE 65
+SRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSNDE
Sbjct 1 MSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSNDE 60
Query 66 IESFSDTDRKLADES 80
IESFSDTDRKLADES
Sbjct 61 IESFSDTDRKLADES 75
>gi|15608698|ref|NP_216076.1| hypothetical protein Rv1560 [Mycobacterium tuberculosis H37Rv]
gi|15841027|ref|NP_336064.1| hypothetical protein MT1611 [Mycobacterium tuberculosis CDC1551]
gi|31792745|ref|NP_855238.1| hypothetical protein Mb1586 [Mycobacterium bovis AF2122/97]
72 more sequence titles
Length=72
Score = 71.2 bits (173), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 36/68 (53%), Positives = 48/68 (71%), Gaps = 0/68 (0%)
Query 1 VYSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFD 60
+Y +SRTNI+IDDEL A R + L +KR+AVDLALRRLVG PL R+ L L+G G++
Sbjct 1 MYRWCMSRTNIDIDDELAAEVMRRFGLTTKRAAVDLALRRLVGSPLSREFLLGLEGVGWE 60
Query 61 FSNDEIES 68
D++ S
Sbjct 61 GDLDDLRS 68
>gi|289761723|ref|ZP_06521101.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
gi|289709229|gb|EFD73245.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
Length=72
Score = 67.8 bits (164), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 35/68 (52%), Positives = 47/68 (70%), Gaps = 0/68 (0%)
Query 1 VYSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFD 60
+Y +SRTNI+IDDEL A R + L +KR+AVDLALRRLV PL R+ L L+G G++
Sbjct 1 MYRWCMSRTNIDIDDELAAEVMRRFGLTTKRAAVDLALRRLVRSPLSREFLLGLEGVGWE 60
Query 61 FSNDEIES 68
D++ S
Sbjct 61 GDLDDLRS 68
>gi|167967386|ref|ZP_02549663.1| hypothetical protein MtubH3_04852 [Mycobacterium tuberculosis
H37Ra]
gi|254550580|ref|ZP_05141027.1| antitoxin [Mycobacterium tuberculosis '98-R604 INH-RIF-EM']
Length=67
Score = 67.8 bits (164), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 35/63 (56%), Positives = 46/63 (74%), Gaps = 0/63 (0%)
Query 6 VSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSNDE 65
+SRTNI+IDDEL A R + L +KR+AVDLALRRLVG PL R+ L L+G G++ D+
Sbjct 1 MSRTNIDIDDELAAEVMRRFGLTTKRAAVDLALRRLVGSPLSREFLLGLEGVGWEGDLDD 60
Query 66 IES 68
+ S
Sbjct 61 LRS 63
>gi|333989208|ref|YP_004521822.1| hypothetical protein JDM601_0568 [Mycobacterium sp. JDM601]
gi|333485176|gb|AEF34568.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=70
Score = 66.2 bits (160), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/63 (54%), Positives = 45/63 (72%), Gaps = 0/63 (0%)
Query 6 VSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSNDE 65
++RTNIEIDDEL A R + + +K++AVDLALRRLVG PL R+ L L+G G+ DE
Sbjct 1 MARTNIEIDDELTAEVMRRFGVTTKKAAVDLALRRLVGAPLTREFLLGLEGIGWAGDLDE 60
Query 66 IES 68
+ S
Sbjct 61 LRS 63
>gi|297563900|ref|YP_003682873.1| Protein of unknown function DUF2191 [Nocardiopsis dassonvillei
subsp. dassonvillei DSM 43111]
gi|296848349|gb|ADH70367.1| Protein of unknown function DUF2191 [Nocardiopsis dassonvillei
subsp. dassonvillei DSM 43111]
Length=73
Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/55 (53%), Positives = 42/55 (77%), Gaps = 0/55 (0%)
Query 6 VSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFD 60
+SRTNI+IDDELV A +++ +KR AVD+ALRR VG PL ++ L+L+G G++
Sbjct 1 MSRTNIDIDDELVTTAMERFQVSTKREAVDIALRRAVGTPLTKEFLLSLEGIGWE 55
>gi|284990201|ref|YP_003408755.1| hypothetical protein Gobs_1668 [Geodermatophilus obscurus DSM
43160]
gi|284063446|gb|ADB74384.1| Protein of unknown function DUF2191 [Geodermatophilus obscurus
DSM 43160]
Length=81
Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/59 (48%), Positives = 40/59 (68%), Gaps = 0/59 (0%)
Query 1 VYSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGF 59
+Y VSRTNI+IDD+L+A R Y L +K+ AVD ALR++ P+ E A++GSG+
Sbjct 1 MYGRCVSRTNIDIDDDLIAGVMRRYGLATKKDAVDFALRQVSVVPMTAREMHAMRGSGW 59
>gi|296166367|ref|ZP_06848802.1| toxin-antitoxin system [Mycobacterium parascrofulaceum ATCC BAA-614]
gi|295898277|gb|EFG77848.1| toxin-antitoxin system [Mycobacterium parascrofulaceum ATCC BAA-614]
Length=70
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/49 (52%), Positives = 34/49 (70%), Gaps = 0/49 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQG 56
RTNIE++D + Y + +K AVDLALR L G+P+ RDEALA++G
Sbjct 5 RTNIELEDTYIQTIMDRYGVRTKTEAVDLALRHLAGQPMTRDEALAMRG 53
>gi|336179604|ref|YP_004584979.1| hypothetical protein FsymDg_3781 [Frankia symbiont of Datisca
glomerata]
gi|334860584|gb|AEH11058.1| Protein of unknown function DUF2191 [Frankia symbiont of Datisca
glomerata]
Length=102
Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/59 (43%), Positives = 40/59 (68%), Gaps = 0/59 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSNDEI 66
R +I +DDEL+ A R++ ++++R VDLALR L+G R ALA++G+G+D E+
Sbjct 7 RASIAVDDELIEEAMRLFGVNTRREIVDLALRHLIGRADFRRRALAMEGTGWDTDMTEL 65
>gi|159184572|ref|NP_354029.2| hypothetical protein Atu1005 [Agrobacterium tumefaciens str.
C58]
gi|159139876|gb|AAK86814.2| conserved hypothetical protein [Agrobacterium tumefaciens str.
C58]
Length=76
Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/60 (47%), Positives = 40/60 (67%), Gaps = 2/60 (3%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALA-LQGSGFDFSNDEI 66
RTNIE+DD L+A A + L +K++ V+ ALR LV E LGR +AL L+G G+ +E+
Sbjct 2 RTNIELDDALIAEAMEITGLPTKKATVEKALRDLV-ENLGRRKALQELRGIGWKGDLEEV 60
>gi|289760758|ref|ZP_06520136.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
gi|289708264|gb|EFD72280.1| conserved hypothetical protein [Mycobacterium tuberculosis GM
1503]
Length=94
Score = 41.2 bits (95), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 19/40 (48%), Positives = 27/40 (68%), Gaps = 0/40 (0%)
Query 5 VVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGE 44
++ R IE+DD+L+ R YR+ R AV+LALR L+GE
Sbjct 14 MLKRVEIEVDDDLIQKVIRRYRVKGAREAVNLALRTLLGE 53
>gi|15840036|ref|NP_335073.1| hypothetical protein MT0662.1 [Mycobacterium tuberculosis CDC1551]
gi|31791817|ref|NP_854310.1| hypothetical protein Mb0652 [Mycobacterium bovis AF2122/97]
gi|57116763|ref|YP_177629.1| hypothetical protein Rv0634A [Mycobacterium tuberculosis H37Rv]
40 more sequence titles
Length=83
Score = 41.2 bits (95), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 19/40 (48%), Positives = 27/40 (68%), Gaps = 0/40 (0%)
Query 5 VVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGE 44
++ R IE+DD+L+ R YR+ R AV+LALR L+GE
Sbjct 14 MLKRVEIEVDDDLIQKVIRRYRVKGAREAVNLALRTLLGE 53
>gi|340625652|ref|YP_004744104.1| hypothetical protein MCAN_06321 [Mycobacterium canettii CIPT
140010059]
gi|340003842|emb|CCC42972.1| hypothetical protein MCAN_06321 [Mycobacterium canettii CIPT
140010059]
Length=83
Score = 40.8 bits (94), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 19/40 (48%), Positives = 27/40 (68%), Gaps = 0/40 (0%)
Query 5 VVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGE 44
++ R IE+DD+L+ R YR+ R AV+LALR L+GE
Sbjct 14 MLKRVEIEVDDDLIQEVIRRYRVKGAREAVNLALRTLLGE 53
>gi|88811565|ref|ZP_01126819.1| hypothetical protein NB231_04150 [Nitrococcus mobilis Nb-231]
gi|88790956|gb|EAR22069.1| hypothetical protein NB231_04150 [Nitrococcus mobilis Nb-231]
Length=73
Score = 40.8 bits (94), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 24/52 (47%), Positives = 32/52 (62%), Gaps = 1/52 (1%)
Query 6 VSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGS 57
+ RTNIE+DDEL+ A R+ ++ SKR V LAL+ V E R + L L G
Sbjct 1 MMRTNIELDDELITQALRLAKVRSKRELVHLALKEFV-ENHQRKDVLELVGK 51
>gi|167969063|ref|ZP_02551340.1| hypothetical protein MtubH3_13957 [Mycobacterium tuberculosis
H37Ra]
gi|254230967|ref|ZP_04924294.1| hypothetical protein TBCG_00629 [Mycobacterium tuberculosis C]
gi|254549593|ref|ZP_05140040.1| hypothetical protein Mtube_03880 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
30 more sequence titles
Length=70
Score = 40.4 bits (93), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 19/40 (48%), Positives = 27/40 (68%), Gaps = 0/40 (0%)
Query 5 VVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGE 44
++ R IE+DD+L+ R YR+ R AV+LALR L+GE
Sbjct 1 MLKRVEIEVDDDLIQKVIRRYRVKGAREAVNLALRTLLGE 40
>gi|118616508|ref|YP_904840.1| hypothetical protein MUL_0716 [Mycobacterium ulcerans Agy99]
gi|118568618|gb|ABL03369.1| conserved protein [Mycobacterium ulcerans Agy99]
Length=69
Score = 39.7 bits (91), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 27/67 (41%), Positives = 35/67 (53%), Gaps = 11/67 (16%)
Query 5 VVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSND 64
++ + IEIDD+LV A R YRL R V LALR L+ E +G G +D
Sbjct 1 MLKKVEIEIDDDLVQEAIRRYRLAGPREVVHLALRTLLAEA-------GEKGGG----DD 49
Query 65 EIESFSD 71
E + FSD
Sbjct 50 EYDEFSD 56
>gi|15828026|ref|NP_302289.1| hypothetical protein ML1911A [Mycobacterium leprae TN]
gi|221230503|ref|YP_002503919.1| hypothetical protein MLBr_01911A [Mycobacterium leprae Br4923]
gi|25397670|pir||B87148 conserved hypothetical protein ML1911A [imported] - Mycobacterium
leprae
gi|13093579|emb|CAC30866.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219933610|emb|CAR72008.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=71
Score = 39.7 bits (91), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 19/41 (47%), Positives = 27/41 (66%), Gaps = 0/41 (0%)
Query 5 VVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEP 45
++ + IE+DD+LV R Y L +R AV LAL+ L+GEP
Sbjct 1 MLKKVEIEVDDDLVQEVIRRYGLLGRREAVHLALKALLGEP 41
>gi|183980986|ref|YP_001849277.1| hypothetical protein MMAR_0966 [Mycobacterium marinum M]
gi|183174312|gb|ACC39422.1| conserved protein [Mycobacterium marinum M]
Length=69
Score = 39.7 bits (91), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 28/67 (42%), Positives = 35/67 (53%), Gaps = 11/67 (16%)
Query 5 VVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSND 64
++ + IEIDD+LV A R YRL R V LALR L+ E A + G D D
Sbjct 1 MLKKVEIEIDDDLVQEAIRRYRLAGPREVVHLALRTLLAE--------AGENGGGD---D 49
Query 65 EIESFSD 71
E + FSD
Sbjct 50 EYDEFSD 56
>gi|284038009|ref|YP_003387939.1| hypothetical protein Slin_3129 [Spirosoma linguale DSM 74]
gi|283817302|gb|ADB39140.1| Protein of unknown function DUF2191 [Spirosoma linguale DSM 74]
Length=71
Score = 39.7 bits (91), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 34/50 (68%), Gaps = 1/50 (2%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGS 57
RTNI+IDDEL+ A ++ RL +K++ V+LAL++ + E R L+L G
Sbjct 2 RTNIDIDDELIDKALQISRLKTKKAVVELALQQYI-ERQARQNLLSLFGK 50
>gi|254231784|ref|ZP_04925111.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|124600843|gb|EAY59853.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
Length=41
Score = 39.3 bits (90), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 20/37 (55%), Positives = 27/37 (73%), Gaps = 0/37 (0%)
Query 32 SAVDLALRRLVGEPLGRDEALALQGSGFDFSNDEIES 68
+AVDLALRRLVG PL R+ L L+G G++ D++ S
Sbjct 1 AAVDLALRRLVGSPLSREFLLGLEGVGWEGDLDDLRS 37
>gi|325292386|ref|YP_004278250.1| hypothetical protein AGROH133_04985 [Agrobacterium sp. H13-3]
gi|325060239|gb|ADY63930.1| hypothetical protein AGROH133_04985 [Agrobacterium sp. H13-3]
Length=76
Score = 39.3 bits (90), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 25/64 (40%), Positives = 38/64 (60%), Gaps = 0/64 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSNDEIE 67
RTNIE+DD L+A A + L +K++ V+ ALR LV AL+G G++ + DE+
Sbjct 2 RTNIELDDALIAEAMEITGLSTKKATVEKALRDLVRIHRQMRALDALEGMGWEGNLDEMR 61
Query 68 SFSD 71
+ D
Sbjct 62 TDWD 65
>gi|320105489|ref|YP_004181079.1| hypothetical protein AciPR4_0247 [Terriglobus saanensis SP1PR4]
gi|319924010|gb|ADV81085.1| Protein of unknown function DUF2191 [Terriglobus saanensis SP1PR4]
Length=70
Score = 39.3 bits (90), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 19/32 (60%), Positives = 23/32 (72%), Gaps = 0/32 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALR 39
RTNIEIDD L+ R +L +KR+AVD ALR
Sbjct 2 RTNIEIDDALIKQVMRRGKLPTKRAAVDAALR 33
>gi|335034130|ref|ZP_08527491.1| hypothetical protein AGRO_1470 [Agrobacterium sp. ATCC 31749]
gi|333794448|gb|EGL65784.1| hypothetical protein AGRO_1470 [Agrobacterium sp. ATCC 31749]
Length=89
Score = 38.9 bits (89), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 24/59 (41%), Positives = 36/59 (62%), Gaps = 0/59 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSNDEI 66
RTNIE+DD L+A A + L +K++ V+ ALR LV AL+G G++ + DE+
Sbjct 15 RTNIELDDALIAEAMEITGLSTKKATVEKALRDLVRIHRQMRALDALEGMGWEGNLDEM 73
>gi|258592452|emb|CBE68761.1| conserved protein of unknown function [NC10 bacterium 'Dutch
sediment']
Length=66
Score = 38.9 bits (89), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 32/50 (64%), Gaps = 1/50 (2%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGS 57
RTNIE+D+ELV A ++ L +K+ V+ AL+ LV + RD L+ +G
Sbjct 3 RTNIELDEELVNEAMKLTHLKTKKELVNYALKELVRKVKRRD-LLSFEGK 51
>gi|327190277|gb|EGE57377.1| hypothetical protein RHECNPAF_439009 [Rhizobium etli CNPAF512]
Length=71
Score = 37.7 bits (86), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 30/72 (42%), Positives = 41/72 (57%), Gaps = 2/72 (2%)
Query 5 VVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALA-LQGSGFDFSN 63
+V RT I+IDD L+ AA L +K + V+LALR LV E R A+A L G G++
Sbjct 1 MVMRTTIDIDDGLLDAAMIAAGLVTKEATVELALRNLV-ERHRRKNAIADLAGIGWEGEL 59
Query 64 DEIESFSDTDRK 75
DE+ DR+
Sbjct 60 DEMPCDQPDDRR 71
>gi|217979807|ref|YP_002363954.1| hypothetical protein Msil_3709 [Methylocella silvestris BL2]
gi|217505183|gb|ACK52592.1| conserved hypothetical protein [Methylocella silvestris BL2]
Length=70
Score = 37.7 bits (86), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 20/35 (58%), Positives = 25/35 (72%), Gaps = 0/35 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLV 42
RTNIEIDDEL+A A L +K++ V+ ALR LV
Sbjct 2 RTNIEIDDELLAEAMAATGLSTKKATVEEALRALV 36
>gi|337281344|ref|YP_004620816.1| hypothetical protein Rta_36810 [Ramlibacter tataouinensis TTB310]
gi|334732421|gb|AEG94797.1| Conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length=101
Score = 37.4 bits (85), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 24/51 (48%), Positives = 33/51 (65%), Gaps = 1/51 (1%)
Query 7 SRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGS 57
+RTNI +DDELVA A + +K++AV+ ALR V +P LAL+GS
Sbjct 3 TRTNIVLDDELVAQAMARAGVKTKKAAVEAALRAYVRKP-DYSGLLALEGS 52
>gi|115524558|ref|YP_781469.1| hypothetical protein RPE_2551 [Rhodopseudomonas palustris BisA53]
gi|115518505|gb|ABJ06489.1| conserved hypothetical protein [Rhodopseudomonas palustris BisA53]
Length=67
Score = 37.0 bits (84), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 23/47 (49%), Positives = 29/47 (62%), Gaps = 0/47 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALAL 54
RTNIEIDD L+A AQ+ +K+ V+ ALR +V RD ALA
Sbjct 2 RTNIEIDDTLMAEAQKAAGQATKKDTVEQALRLMVRLKKQRDVALAF 48
>gi|299135986|ref|ZP_07029170.1| Protein of unknown function DUF2191 [Acidobacterium sp. MP5ACTX8]
gi|298602110|gb|EFI58264.1| Protein of unknown function DUF2191 [Acidobacterium sp. MP5ACTX8]
Length=67
Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/35 (58%), Positives = 26/35 (75%), Gaps = 0/35 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLV 42
RTNIEIDD+L+A A R +KR+AV+ ALR L+
Sbjct 2 RTNIEIDDQLMAEALRSSGEPTKRAAVEAALRLLI 36
>gi|218661252|ref|ZP_03517182.1| hypothetical protein RetlI_17778 [Rhizobium etli IE4771]
Length=92
Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/64 (44%), Positives = 38/64 (60%), Gaps = 2/64 (3%)
Query 4 GVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALA-LQGSGFDFS 62
GV+ R I+IDD L+ AA L ++ + V+LALR LV E R A+A L G G++
Sbjct 21 GVIMRMTIDIDDGLLDAAMIATGLATREAMVELALRNLV-ERHRRKNAIADLAGLGWEGE 79
Query 63 NDEI 66
DEI
Sbjct 80 LDEI 83
>gi|313672675|ref|YP_004050786.1| hypothetical protein Calni_0712 [Calditerrivibrio nitroreducens
DSM 19672]
gi|312939431|gb|ADR18623.1| Protein of unknown function DUF2191 [Calditerrivibrio nitroreducens
DSM 19672]
Length=66
Score = 36.6 bits (83), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/57 (41%), Positives = 32/57 (57%), Gaps = 2/57 (3%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSND 64
RTNI IDD+L+ A ++ + SK+ V+ ALR V E L R L+G F +D
Sbjct 2 RTNIVIDDKLLEEAMKLSNIKSKKELVNTALREFV-ENLKRKNIKELKGK-IKFKDD 56
>gi|304318060|ref|YP_003853205.1| hypothetical protein Tthe_2672 [Thermoanaerobacterium thermosaccharolyticum
DSM 571]
gi|302779562|gb|ADL70121.1| Protein of unknown function DUF2191 [Thermoanaerobacterium thermosaccharolyticum
DSM 571]
Length=69
Score = 36.6 bits (83), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGS 57
RTNI IDDEL+ A ++ + +K+ V++AL+ L+ E R + L+G
Sbjct 2 RTNIIIDDELIKEALKITGIKTKKEIVNIALKELI-ENHKRKNLMDLKGK 50
>gi|345013747|ref|YP_004816101.1| hypothetical protein Strvi_6366 [Streptomyces violaceusniger
Tu 4113]
gi|344040096|gb|AEM85821.1| Protein of unknown function DUF2191 [Streptomyces violaceusniger
Tu 4113]
Length=70
Score = 36.6 bits (83), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/59 (34%), Positives = 34/59 (58%), Gaps = 0/59 (0%)
Query 6 VSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSND 64
+SRT I++DDE+V A R+Y + +K AV +A+ V L ++ A++ D + D
Sbjct 1 MSRTMIDLDDEMVEHAMRLYGVKTKAKAVRMAMEEAVKRRLRQEGIDAIKSGDLDLTYD 59
>gi|297154825|gb|ADI04537.1| hypothetical protein SBI_01416 [Streptomyces bingchenggensis
BCW-1]
Length=70
Score = 36.2 bits (82), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/59 (34%), Positives = 34/59 (58%), Gaps = 0/59 (0%)
Query 6 VSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSND 64
+SRT I++DDE+V A R+Y + +K AV +A+ V L ++ A++ D + D
Sbjct 1 MSRTMIDLDDEMVEQAMRLYGVKTKAKAVRMAMEEAVKRRLRQEGIDAIKSGDLDLTYD 59
>gi|345302429|ref|YP_004824331.1| hypothetical protein Rhom172_0553 [Rhodothermus marinus SG0.5JP17-172]
gi|345111662|gb|AEN72494.1| Protein of unknown function DUF2191 [Rhodothermus marinus SG0.5JP17-172]
Length=70
Score = 36.2 bits (82), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/35 (55%), Positives = 25/35 (72%), Gaps = 0/35 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLV 42
RT+IEIDDEL+ R+ L +KR AV+L LR L+
Sbjct 2 RTSIEIDDELMRKVLRVTGLKTKREAVELGLRTLL 36
>gi|206889882|ref|YP_002249459.1| hypothetical protein THEYE_A1668 [Thermodesulfovibrio yellowstonii
DSM 11347]
gi|206741820|gb|ACI20877.1| conserved hypothetical protein [Thermodesulfovibrio yellowstonii
DSM 11347]
Length=67
Score = 36.2 bits (82), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/52 (31%), Positives = 33/52 (64%), Gaps = 1/52 (1%)
Query 6 VSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGS 57
+ RTNIE+D++++ A + ++ +K+ ++ A+ LV + L R + L L+G
Sbjct 1 MRRTNIELDEKILKEAMELTKMKTKKDVINFAISELV-KKLKRKKILELEGK 51
>gi|240169407|ref|ZP_04748066.1| hypothetical protein MkanA1_08843 [Mycobacterium kansasii ATCC
12478]
Length=86
Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/65 (44%), Positives = 33/65 (51%), Gaps = 11/65 (16%)
Query 7 SRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSNDEI 66
+ IEIDD+LV A R Y L R AV LALR L LA GSG + +E
Sbjct 20 KKVEIEIDDDLVQEAIRRYGLADAREAVHLALRTL----------LAEGGSG-EADEEEY 68
Query 67 ESFSD 71
+ FSD
Sbjct 69 DEFSD 73
>gi|333992735|ref|YP_004525349.1| hypothetical protein JDM601_4095 [Mycobacterium sp. JDM601]
gi|333488703|gb|AEF38095.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=73
Score = 35.8 bits (81), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/44 (48%), Positives = 28/44 (64%), Gaps = 1/44 (2%)
Query 6 VSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRD 49
+ RT IE+D+ELV AQ + + RS V+ ALRRL+ E G D
Sbjct 1 MKRTTIELDEELVRKAQSVTG-STLRSTVESALRRLIAEAQGED 43
>gi|268316119|ref|YP_003289838.1| hypothetical protein Rmar_0549 [Rhodothermus marinus DSM 4252]
gi|262333653|gb|ACY47450.1| conserved hypothetical protein [Rhodothermus marinus DSM 4252]
Length=70
Score = 35.0 bits (79), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 18/35 (52%), Positives = 25/35 (72%), Gaps = 0/35 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLV 42
RT+IEIDDEL+ R+ L +KR A++L LR L+
Sbjct 2 RTSIEIDDELMRKVLRVTGLKTKREAMELGLRTLL 36
>gi|296137154|ref|YP_003644396.1| Protein of unknown function DUF2191 [Thiomonas intermedia K12]
gi|295797276|gb|ADG32066.1| Protein of unknown function DUF2191 [Thiomonas intermedia K12]
Length=102
Score = 34.7 bits (78), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGS 57
RTNIEIDDEL+ AA R +K+ AV+ L+ L RD LAL+G
Sbjct 2 RTNIEIDDELMDAAMRAGPFKTKKEAVEEGLKLLRRRAAYRD-LLALRGK 50
>gi|88813609|ref|ZP_01128840.1| hypothetical protein NB231_12876 [Nitrococcus mobilis Nb-231]
gi|88789114|gb|EAR20250.1| hypothetical protein NB231_12876 [Nitrococcus mobilis Nb-231]
Length=64
Score = 34.7 bits (78), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 18/35 (52%), Positives = 25/35 (72%), Gaps = 0/35 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLV 42
RTNI IDD+L+ A R+ L +KR V+LAL+ L+
Sbjct 2 RTNIVIDDKLMERALRLTGLKTKREVVELALQTLL 36
>gi|294341452|emb|CAZ89869.1| hypothetical protein THI_3273 [Thiomonas sp. 3As]
Length=102
Score = 34.7 bits (78), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 18/34 (53%), Positives = 23/34 (68%), Gaps = 0/34 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRL 41
RTNIEIDDEL+ AA R +K+ AV+ L+ L
Sbjct 2 RTNIEIDDELMDAAMRAGPFKTKKEAVEEGLKLL 35
>gi|163759143|ref|ZP_02166229.1| hypothetical protein HPDFL43_05245 [Hoeflea phototrophica DFL-43]
gi|162283547|gb|EDQ33832.1| hypothetical protein HPDFL43_05245 [Hoeflea phototrophica DFL-43]
Length=65
Score = 34.3 bits (77), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 27/50 (54%), Gaps = 0/50 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGS 57
RTNIEIDDELV ++ +KR VD ALR + L LQG+
Sbjct 2 RTNIEIDDELVTELMKLTGRKTKRQVVDDALRDHLRRRRAAQAILDLQGT 51
>gi|77165011|ref|YP_343536.1| hypothetical protein Noc_1521 [Nitrosococcus oceani ATCC 19707]
gi|254434343|ref|ZP_05047851.1| hypothetical protein NOC27_1274 [Nitrosococcus oceani AFC27]
gi|76883325|gb|ABA58006.1| conserved hypothetical protein [Nitrosococcus oceani ATCC 19707]
gi|207090676|gb|EDZ67947.1| hypothetical protein NOC27_1274 [Nitrosococcus oceani AFC27]
Length=69
Score = 34.3 bits (77), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 18/42 (43%), Positives = 26/42 (62%), Gaps = 0/42 (0%)
Query 8 RTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRD 49
RTNI++DD+L+ AA R + SK+ V LAL+ V +D
Sbjct 2 RTNIDLDDKLLEAAFRCVSVKSKKELVHLALKEFVEHHQRKD 43
>gi|338533812|ref|YP_004667146.1| excinuclease ABC subunit B [Myxococcus fulvus HW-1]
gi|337259908|gb|AEI66068.1| excinuclease ABC subunit B [Myxococcus fulvus HW-1]
Length=704
Score = 33.9 bits (76), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 17/33 (52%), Positives = 23/33 (70%), Gaps = 0/33 (0%)
Query 17 LVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRD 49
+VA+ +Y L + RS VDLA+R VGE +GRD
Sbjct 137 IVASVSCIYGLGAARSYVDLAVRAAVGEEMGRD 169
Lambda K H
0.314 0.132 0.349
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128850890930
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40