BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0027
Length=105
Score E
Sequences producing significant alignments: (Bits) Value
gi|15607169|ref|NP_214541.1| hypothetical protein Rv0027 [Mycoba... 215 1e-54
gi|340625060|ref|YP_004743512.1| hypothetical protein MCAN_00261... 212 2e-53
gi|183980081|ref|YP_001848372.1| hypothetical protein MMAR_0046 ... 170 7e-41
gi|118615959|ref|YP_904291.1| hypothetical protein MUL_0045 [Myc... 169 1e-40
gi|240172401|ref|ZP_04751060.1| hypothetical protein MkanA1_2400... 163 8e-39
gi|41406137|ref|NP_958973.1| hypothetical protein MAP0039 [Mycob... 161 3e-38
gi|254819081|ref|ZP_05224082.1| hypothetical protein MintA_04098... 159 2e-37
gi|296167098|ref|ZP_06849508.1| conserved hypothetical protein [... 157 5e-37
gi|254773090|ref|ZP_05214606.1| hypothetical protein MaviaA2_001... 151 2e-35
gi|342860245|ref|ZP_08716897.1| hypothetical protein MCOL_15250 ... 150 5e-35
gi|333988682|ref|YP_004521296.1| hypothetical protein JDM601_004... 95.5 2e-18
gi|326428775|gb|EGD74345.1| hypothetical protein PTSG_06355 [Sal... 40.8 0.066
gi|260808191|ref|XP_002598891.1| hypothetical protein BRAFLDRAFT... 38.9 0.27
gi|281200969|gb|EFA75183.1| RhoGAP domain-containing protein [Po... 38.9 0.29
gi|126445495|ref|YP_001062782.1| methyl-accepting chemotaxis pro... 38.5 0.35
gi|119855150|ref|YP_935755.1| hypothetical protein Mkms_5765 [My... 38.5 0.38
gi|260805256|ref|XP_002597503.1| hypothetical protein BRAFLDRAFT... 38.1 0.44
gi|260787082|ref|XP_002588584.1| hypothetical protein BRAFLDRAFT... 37.7 0.56
gi|260817000|ref|XP_002603375.1| hypothetical protein BRAFLDRAFT... 37.7 0.62
gi|183984138|ref|YP_001852429.1| hypothetical protein MMAR_4167 ... 37.4 0.68
gi|189501781|ref|YP_001957498.1| hypothetical protein Aasi_0340 ... 37.0 0.87
gi|260830836|ref|XP_002610366.1| hypothetical protein BRAFLDRAFT... 36.6 1.4
gi|78065258|ref|YP_368027.1| glutathione S-transferase-like prot... 36.2 1.5
gi|339327182|ref|YP_004686875.1| glutathione S-transferase [Cupr... 36.2 1.7
gi|260795164|ref|XP_002592576.1| hypothetical protein BRAFLDRAFT... 36.2 1.7
gi|260788618|ref|XP_002589346.1| hypothetical protein BRAFLDRAFT... 36.2 1.8
gi|194290686|ref|YP_002006593.1| glutathione s-transferase enzym... 35.8 2.0
gi|113869079|ref|YP_727568.1| glutathione S-transferase [Ralston... 35.8 2.1
gi|226198019|ref|ZP_03793591.1| conserved hypothetical protein [... 35.8 2.2
gi|186477296|ref|YP_001858766.1| glutathione S-transferase domai... 35.8 2.5
gi|260816994|ref|XP_002603372.1| hypothetical protein BRAFLDRAFT... 35.4 2.7
gi|296165465|ref|ZP_06848000.1| conserved hypothetical protein [... 35.0 3.3
gi|260811630|ref|XP_002600525.1| hypothetical protein BRAFLDRAFT... 35.0 3.9
gi|189502336|ref|YP_001958053.1| hypothetical protein Aasi_0966 ... 35.0 4.0
gi|134283723|ref|ZP_01770421.1| conserved hypothetical protein [... 35.0 4.0
gi|260791148|ref|XP_002590602.1| hypothetical protein BRAFLDRAFT... 35.0 4.0
gi|260814986|ref|XP_002602194.1| hypothetical protein BRAFLDRAFT... 35.0 4.2
gi|260793721|ref|XP_002591859.1| hypothetical protein BRAFLDRAFT... 34.7 4.4
gi|292899534|ref|YP_003538903.1| CorA-like Mg2+ transporter [Erw... 34.7 4.5
gi|260801839|ref|XP_002595802.1| hypothetical protein BRAFLDRAFT... 34.7 4.5
gi|254183623|ref|ZP_04890215.1| conserved hypothetical protein [... 34.7 5.0
gi|301120464|ref|XP_002907959.1| hypothetical protein PITG_01286... 34.7 5.2
gi|53722285|ref|YP_111270.1| hypothetical protein BPSS1263 [Burk... 34.7 5.2
gi|126457515|ref|YP_001075735.1| hypothetical protein BURPS1106A... 34.7 5.2
gi|254301764|ref|ZP_04969207.1| conserved hypothetical protein [... 34.7 5.5
gi|260811622|ref|XP_002600521.1| hypothetical protein BRAFLDRAFT... 34.3 5.7
gi|260816988|ref|XP_002603369.1| hypothetical protein BRAFLDRAFT... 34.3 5.9
gi|82190152|sp|O57611.1|K1C18_SCYST RecName: Full=Keratin, type ... 34.3 6.1
gi|167906729|ref|ZP_02493934.1| hypothetical protein BpseN_31155... 34.3 6.4
gi|292488335|ref|YP_003531217.1| Zinc transport protein [Erwinia... 34.3 6.8
>gi|15607169|ref|NP_214541.1| hypothetical protein Rv0027 [Mycobacterium tuberculosis H37Rv]
gi|15839401|ref|NP_334438.1| hypothetical protein MT0030 [Mycobacterium tuberculosis CDC1551]
gi|31791204|ref|NP_853697.1| hypothetical protein Mb0028 [Mycobacterium bovis AF2122/97]
80 more sequence titles
Length=105
Score = 215 bits (548), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 105/105 (100%), Positives = 105/105 (100%), Gaps = 0/105 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL
Sbjct 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
Query 61 RKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
RKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR
Sbjct 61 RKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
>gi|340625060|ref|YP_004743512.1| hypothetical protein MCAN_00261 [Mycobacterium canettii CIPT
140010059]
gi|340003250|emb|CCC42367.1| hypothetical protein MCAN_00261 [Mycobacterium canettii CIPT
140010059]
Length=105
Score = 212 bits (539), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 103/105 (99%), Positives = 104/105 (99%), Gaps = 0/105 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
M DRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL
Sbjct 1 MADRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
Query 61 RKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
R+QCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR
Sbjct 61 RRQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
>gi|183980081|ref|YP_001848372.1| hypothetical protein MMAR_0046 [Mycobacterium marinum M]
gi|183173407|gb|ACC38517.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=105
Score = 170 bits (430), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 78/105 (75%), Positives = 95/105 (91%), Gaps = 0/105 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
M DRIHV+PAHLR+AAAHH+QT++YLR+VPSSH AI+ESL+SLGPIFSELRD GR+LLEL
Sbjct 1 MADRIHVEPAHLREAAAHHEQTSEYLRSVPSSHAAIQESLNSLGPIFSELRDAGRDLLEL 60
Query 61 RKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
R+QCY+QQAD+HADIAQNLRTSAA+W+QHE+ A+ G I+D R
Sbjct 61 RRQCYEQQADDHADIAQNLRTSAAVWDQHEQDAAHDFGGIVDNGR 105
>gi|118615959|ref|YP_904291.1| hypothetical protein MUL_0045 [Mycobacterium ulcerans Agy99]
gi|118568069|gb|ABL02820.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=105
Score = 169 bits (428), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 78/105 (75%), Positives = 94/105 (90%), Gaps = 0/105 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
M DRIHV+PAHLR+AAAHH+QT++YLR+VPSSH AI+ESL+SLGPIFSELRD GR+LLEL
Sbjct 1 MADRIHVEPAHLREAAAHHEQTSEYLRSVPSSHAAIQESLNSLGPIFSELRDAGRDLLEL 60
Query 61 RKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
R+QCY+QQAD+HADIAQN RTSAAMW+QHE+ A+ G I+D R
Sbjct 61 RRQCYEQQADDHADIAQNPRTSAAMWDQHEQDAAHDFGGIVDNGR 105
>gi|240172401|ref|ZP_04751060.1| hypothetical protein MkanA1_24003 [Mycobacterium kansasii ATCC
12478]
Length=105
Score = 163 bits (412), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 77/105 (74%), Positives = 92/105 (88%), Gaps = 0/105 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
M DRIHV PAHLR+AAAHHQQT++YLRTVPSSH AI ESL+SLGPIF+ELR GRELLEL
Sbjct 1 MADRIHVVPAHLREAAAHHQQTSEYLRTVPSSHAAILESLESLGPIFAELRGAGRELLEL 60
Query 61 RKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
R+QCY+QQAD+HADIAQ+LRTSAAMWE+H++ A+R + + D R
Sbjct 61 RRQCYEQQADDHADIAQSLRTSAAMWERHDQDAARDISRVFDPDR 105
>gi|41406137|ref|NP_958973.1| hypothetical protein MAP0039 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118464594|ref|YP_879345.1| hypothetical protein MAV_0045 [Mycobacterium avium 104]
gi|41394485|gb|AAS02356.1| hypothetical protein MAP_0039 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118165881|gb|ABK66778.1| conserved hypothetical protein [Mycobacterium avium 104]
gi|336459446|gb|EGO38388.1| Protein of unknown function (DUF2580) [Mycobacterium avium subsp.
paratuberculosis S397]
Length=105
Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 77/105 (74%), Positives = 88/105 (84%), Gaps = 0/105 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
M D IHV PAHLRQAAAHHQ T++YLRTVPSSH AI+ESLDSLGPIFSELRD GRELLEL
Sbjct 1 MADSIHVVPAHLRQAAAHHQDTSEYLRTVPSSHAAIQESLDSLGPIFSELRDAGRELLEL 60
Query 61 RKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
R+QCY+QQA +HAD+A L SAAMWEQHE+ A+ G+I+D R
Sbjct 61 RRQCYEQQAADHADLADQLTVSAAMWEQHEQEAADKFGSIVDRGR 105
>gi|254819081|ref|ZP_05224082.1| hypothetical protein MintA_04098 [Mycobacterium intracellulare
ATCC 13950]
Length=105
Score = 159 bits (401), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 74/105 (71%), Positives = 87/105 (83%), Gaps = 0/105 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
M D IHV PAHLRQAAAHHQ T++YLRTVPSSH AI+ESLDSLGPIF ELRD GRELLEL
Sbjct 1 MADSIHVVPAHLRQAAAHHQDTSEYLRTVPSSHAAIQESLDSLGPIFGELRDAGRELLEL 60
Query 61 RKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
R+QCY+QQA +HA++A L SA MWEQHE+ A+R G+++D R
Sbjct 61 RRQCYEQQAADHANLADQLTVSATMWEQHEQEAARKFGDVVDRGR 105
>gi|296167098|ref|ZP_06849508.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897540|gb|EFG77136.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=105
Score = 157 bits (397), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 74/105 (71%), Positives = 89/105 (85%), Gaps = 0/105 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
M D IHV PAHLRQAAA HQ+T++YLRTVPSSH+AI++SLDSLGP+FSELRD GRELLEL
Sbjct 1 MADSIHVVPAHLRQAAARHQETSEYLRTVPSSHEAIQDSLDSLGPVFSELRDAGRELLEL 60
Query 61 RKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
R+QCY+QQA +HAD+A L SA MWE+HE+ A+R+LG I D R
Sbjct 61 RRQCYEQQAADHADLADKLAASATMWEEHEQDAARNLGGIADRGR 105
>gi|254773090|ref|ZP_05214606.1| hypothetical protein MaviaA2_00191 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=98
Score = 151 bits (382), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 72/97 (75%), Positives = 83/97 (86%), Gaps = 0/97 (0%)
Query 9 PAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLELRKQCYQQQ 68
PAHLRQAAAHHQ T++YLRTVPSSH AI+ESLDSLGPIFSELRD GRELLELR+QCY+QQ
Sbjct 2 PAHLRQAAAHHQDTSEYLRTVPSSHAAIQESLDSLGPIFSELRDAGRELLELRRQCYEQQ 61
Query 69 ADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
A +HAD+A L SAAMWEQHE+ A+ G+I+D R
Sbjct 62 AADHADLADQLTVSAAMWEQHEQEAADKFGSIVDRGR 98
>gi|342860245|ref|ZP_08716897.1| hypothetical protein MCOL_15250 [Mycobacterium colombiense CECT
3035]
gi|342132623|gb|EGT85852.1| hypothetical protein MCOL_15250 [Mycobacterium colombiense CECT
3035]
Length=98
Score = 150 bits (380), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 71/97 (74%), Positives = 82/97 (85%), Gaps = 0/97 (0%)
Query 9 PAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLELRKQCYQQQ 68
PAHLRQAAAHHQ T++YLRTVPSSH AI+ESLDSLGPIFSELRD GRELLELR+QCY+QQ
Sbjct 2 PAHLRQAAAHHQDTSEYLRTVPSSHAAIQESLDSLGPIFSELRDAGRELLELRRQCYEQQ 61
Query 69 ADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDGSR 105
A +HAD+A L SA MWEQHE+ A+ G+I+D R
Sbjct 62 AADHADLADKLTVSATMWEQHEQEAAGKFGDIVDRGR 98
>gi|333988682|ref|YP_004521296.1| hypothetical protein JDM601_0042 [Mycobacterium sp. JDM601]
gi|333484650|gb|AEF34042.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=104
Score = 95.5 bits (236), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/103 (42%), Positives = 64/103 (63%), Gaps = 0/103 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
M + IHV L AA +H++ +DYL TVP+SH+AI+ +L +LGPI+ + R LL+
Sbjct 1 MPENIHVDQGVLTNAAGNHREASDYLATVPASHEAIQTTLSALGPIYGDFRRAAGALLDA 60
Query 61 RKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNIIDG 103
RK CY Q++ HA+++ NL + W QHE A+ S + DG
Sbjct 61 RKNCYDDQSNEHAEMSDNLNLAVTTWNQHEDDAATSFRRLTDG 103
>gi|326428775|gb|EGD74345.1| hypothetical protein PTSG_06355 [Salpingoeca sp. ATCC 50818]
Length=1374
Score = 40.8 bits (94), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 53/100 (53%), Gaps = 10/100 (10%)
Query 4 RIHVQPAHLRQAAAHHQQT-ADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLELRK 62
++H Q ++A A +Q A +TV H ++DSLG +F+E+ D GR +
Sbjct 1020 QVHTQLGQYQRALACLEQAFAIQTQTVGVVHPNAATTVDSLGTVFAEIGDQGRAI----- 1074
Query 63 QCYQQQADNHADIA--QNLRTSAAMWEQHERAASRSLGNI 100
QC++Q + D Q+ T+A ++E + A RS+GN+
Sbjct 1075 QCFEQALNVWLDTVGEQHPNTAATLYEIAQ--AHRSMGNV 1112
>gi|260808191|ref|XP_002598891.1| hypothetical protein BRAFLDRAFT_90083 [Branchiostoma floridae]
gi|229284166|gb|EEN54903.1| hypothetical protein BRAFLDRAFT_90083 [Branchiostoma floridae]
Length=1173
Score = 38.9 bits (89), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 28/87 (33%), Positives = 46/87 (53%), Gaps = 14/87 (16%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
R+A +++QQ+ +T+ ++H I ESL++LG S+L D + + L +RK
Sbjct 1074 RKAISYYQQSLSMKKTIYGDNTAHPDIAESLNNLGSSLSKLGDHKKAIGYFEKSLSMRKT 1133
Query 64 CYQQQADN--HADIAQNLRTSAAMWEQ 88
Y DN H DIAQ+L + W +
Sbjct 1134 IY---GDNTAHPDIAQSLNNLGSSWTE 1157
Score = 38.1 bits (87), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 31/96 (33%), Positives = 49/96 (52%), Gaps = 17/96 (17%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
++A +++QQ+ +T+ ++H I ESL++LG S+L D + + L RK
Sbjct 898 KKAISYYQQSLSMKKTIYGDNTAHPDIAESLNNLGSSLSKLGDHKKAIGYFQQSLSTRKT 957
Query 64 CYQQQADN--HADIAQNLRTSAAMWEQ---HERAAS 94
Y DN H DIAQ+L + W Q H +A S
Sbjct 958 IY---GDNTAHPDIAQSLNNLGSSWTQLGDHRKAIS 990
Score = 37.7 bits (86), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 28/87 (33%), Positives = 45/87 (52%), Gaps = 14/87 (16%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
R+A +++QQ+ +T+ ++H I ESL++LG S+L D + + L RK
Sbjct 986 RKAISYYQQSLSMKKTIYGDNTAHPDIAESLNNLGSSLSKLGDHKKAIGYFQQSLSTRKT 1045
Query 64 CYQQQADN--HADIAQNLRTSAAMWEQ 88
Y DN H DIAQ+L + W +
Sbjct 1046 IY---GDNTAHPDIAQSLNNLGSSWSE 1069
>gi|281200969|gb|EFA75183.1| RhoGAP domain-containing protein [Polysphondylium pallidum PN500]
Length=729
Score = 38.9 bits (89), Expect = 0.29, Method: Composition-based stats.
Identities = 23/76 (31%), Positives = 44/76 (58%), Gaps = 3/76 (3%)
Query 18 HHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLE---LRKQCYQQQADNHAD 74
+ QT + L+ ++ A++E DS+G + +ELRD G+++ E L ++ QQQ D A
Sbjct 344 YSHQTYNKLQEEQENYKALQEEKDSIGELLNELRDNGKQMEEKNKLSEEKLQQQRDEVAQ 403
Query 75 IAQNLRTSAAMWEQHE 90
+ + L S + +++E
Sbjct 404 LKKKLEESNSKTKENE 419
>gi|126445495|ref|YP_001062782.1| methyl-accepting chemotaxis protein [Burkholderia pseudomallei
668]
gi|126224986|gb|ABN88491.1| methyl-accepting chemotaxis protein [Burkholderia pseudomallei
668]
Length=938
Score = 38.5 bits (88), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 34/101 (34%), Positives = 47/101 (47%), Gaps = 18/101 (17%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 496 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 552
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSLGN 99
Q +Q D A A++L A +EQH A R++G
Sbjct 553 AQAANRQQTICDALAHTARDLSAQATAFEQHTAALLRTIGE 593
Score = 35.8 bits (81), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 33/99 (34%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 576 QATAFEQHTAALLRTIGESHSALQATLESRDEQRLATWADSLGSIAAKL---GTEWEQTS 632
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A A++L A +EQH A R++
Sbjct 633 AQAANRQQAICDALAHTARDLSAQATAFEQHTAALLRTM 671
>gi|119855150|ref|YP_935755.1| hypothetical protein Mkms_5765 [Mycobacterium sp. KMS]
gi|119697868|gb|ABL94940.1| hypothetical protein Mkms_5765 [Mycobacterium sp. KMS]
Length=102
Score = 38.5 bits (88), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 23/86 (27%), Positives = 36/86 (42%), Gaps = 0/86 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
MTD IH+ P +R A H AD + + I ++++ GPI + + +L+
Sbjct 1 MTDPIHIDPEVMRTVANQHDDVADQIAPAREASTEILAAVNTFGPIMHQFKSAVSDLMVN 60
Query 61 RKQCYQQQADNHADIAQNLRTSAAMW 86
R H A LR AA +
Sbjct 61 RDAALLHHEHTHRSAAIGLRREAANF 86
>gi|260805256|ref|XP_002597503.1| hypothetical protein BRAFLDRAFT_78939 [Branchiostoma floridae]
gi|229282768|gb|EEN53515.1| hypothetical protein BRAFLDRAFT_78939 [Branchiostoma floridae]
Length=1727
Score = 38.1 bits (87), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 23/83 (28%), Positives = 45/83 (55%), Gaps = 10/83 (12%)
Query 13 RQAAAHHQQTADYLRT---VPSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
R+A ++H+Q+ R+ V +SH I +SL+SLG ++ +L D + + L++++
Sbjct 1041 RKAISYHEQSLQMKRSIHGVDTSHSDIAKSLNSLGTVWRDLGDPRKAIGYHEQSLQMKRN 1100
Query 64 CYQQQADNHADIAQNLRTSAAMW 86
Y + + H D+A +L W
Sbjct 1101 VYGED-NEHLDVANSLNNLGVAW 1122
>gi|260787082|ref|XP_002588584.1| hypothetical protein BRAFLDRAFT_107531 [Branchiostoma floridae]
gi|229273749|gb|EEN44595.1| hypothetical protein BRAFLDRAFT_107531 [Branchiostoma floridae]
Length=1473
Score = 37.7 bits (86), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 28/94 (30%), Positives = 51/94 (55%), Gaps = 13/94 (13%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
R+A + +Q R++ ++H I +SL+SLG +S+L D + + L++R+
Sbjct 1276 RKAVRYCEQALGMRRSIYGEGNAHPDIADSLNSLGTAWSDLEDHRKAISYYEQGLQMRRS 1335
Query 64 CYQQQADNHADIAQNLRTSAAMWEQ---HERAAS 94
Y + H DIAQ+++ A W++ H +AAS
Sbjct 1336 IYGEDT-AHPDIAQSIKNLGAAWKKLGDHRKAAS 1368
Score = 34.3 bits (77), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 23/85 (28%), Positives = 45/85 (53%), Gaps = 10/85 (11%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
R+AA++++Q+ + +R + ++H I SLD+LG SE+ D + + LE+++
Sbjct 1364 RKAASYYEQSLEMMRNIHGKNTAHPDIATSLDNLGNASSEIGDHRKAVSYHEQSLEMKRI 1423
Query 64 CYQQQADNHADIAQNLRTSAAMWEQ 88
Y + H DI ++L W +
Sbjct 1424 IYGENT-AHPDIVKSLNNLRVEWSE 1447
>gi|260817000|ref|XP_002603375.1| hypothetical protein BRAFLDRAFT_80368 [Branchiostoma floridae]
gi|229288694|gb|EEN59386.1| hypothetical protein BRAFLDRAFT_80368 [Branchiostoma floridae]
Length=871
Score = 37.7 bits (86), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 26/97 (27%), Positives = 50/97 (52%), Gaps = 13/97 (13%)
Query 10 AHLRQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LEL 60
+ ++A ++H+Q+ +RT+ ++H I SL++LG + L D + + L++
Sbjct 763 GNCKKAVSYHEQSLQMMRTIYGKNTAHPDIASSLNNLGSAWGILGDYKKAVSYYEQSLQI 822
Query 61 RKQCYQQQADNHADIAQNLRTSAAMWE---QHERAAS 94
R+ Y + H DIA +L +W H++AAS
Sbjct 823 RRTIYGENT-AHPDIASSLNILGIVWRLLGDHQKAAS 858
Score = 35.0 bits (79), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 22/83 (27%), Positives = 45/83 (55%), Gaps = 10/83 (12%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
++A ++H+Q+ +RT+ ++H I SL++LG +S+L D +E+ L++++
Sbjct 678 KKAVSYHEQSLQMIRTIYGENTAHPDIASSLNNLGIAWSDLGDHRKEVSYHEQSLQMKRS 737
Query 64 CYQQQADNHADIAQNLRTSAAMW 86
Y + H +IA +L W
Sbjct 738 VYGETT-AHPNIASSLNNLGNAW 759
>gi|183984138|ref|YP_001852429.1| hypothetical protein MMAR_4167 [Mycobacterium marinum M]
gi|183177464|gb|ACC42574.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=103
Score = 37.4 bits (85), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 25/102 (25%), Positives = 48/102 (48%), Gaps = 1/102 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
MT+ + VQP L A+HH A + + ES+ + +T ++ E
Sbjct 1 MTENLKVQPELLGVLASHHDNAAASATSGTEVTSGLSESVTVTHGSYCSQFNTTLKMYES 60
Query 61 RKQCYQQQADNHA-DIAQNLRTSAAMWEQHERAASRSLGNII 101
+ +N D+A+NLRT+A ++ + + S++LG++
Sbjct 61 TRSALGSSLNNAGIDLAKNLRTAARVYTEADDTWSKALGSLF 102
>gi|189501781|ref|YP_001957498.1| hypothetical protein Aasi_0340 [Candidatus Amoebophilus asiaticus
5a2]
gi|189497222|gb|ACE05769.1| hypothetical protein Aasi_0340 [Candidatus Amoebophilus asiaticus
5a2]
Length=2413
Score = 37.0 bits (84), Expect = 0.87, Method: Composition-based stats.
Identities = 26/95 (28%), Positives = 49/95 (52%), Gaps = 7/95 (7%)
Query 5 IHVQPAHLRQAAAHHQQTADYLRTV-PSSHDAIRESLDSLGPIFSELRDTGRELLELRKQ 63
++ ++A ++QQ+ LR++ S+H I +SLD+LG ++ L G L R+
Sbjct 1218 VYYNSRKFKEALQYYQQSLLTLRSIYTSNHPDIAKSLDNLGTVYKALDRYGEALEYYRQA 1277
Query 64 CYQQQ---ADNHADIAQNLRTSAAMW---EQHERA 92
+Q A H +IA +L +++ EQ++ A
Sbjct 1278 LTMKQDIYAAKHLEIATSLNNMGSLYKNLEQYQEA 1312
>gi|260830836|ref|XP_002610366.1| hypothetical protein BRAFLDRAFT_72426 [Branchiostoma floridae]
gi|229295731|gb|EEN66376.1| hypothetical protein BRAFLDRAFT_72426 [Branchiostoma floridae]
Length=1340
Score = 36.6 bits (83), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 26/85 (31%), Positives = 42/85 (50%), Gaps = 10/85 (11%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
R+ ++H+Q R V + H I SLD+LG +S+L D + + L++R+
Sbjct 1076 RKGISYHEQALQMNRKVYGQSAVHHQIASSLDNLGGGWSQLGDYSKAVSYHEQALQMRRN 1135
Query 64 CYQQQADNHADIAQNLRTSAAMWEQ 88
Y Q+ H DIA +L W+Q
Sbjct 1136 VY-GQSTAHPDIATSLNNLGTAWQQ 1159
>gi|78065258|ref|YP_368027.1| glutathione S-transferase-like protein [Burkholderia sp. 383]
gi|77966003|gb|ABB07383.1| Glutathione S-transferase-like protein [Burkholderia sp. 383]
Length=204
Score = 36.2 bits (82), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/60 (34%), Positives = 33/60 (55%), Gaps = 2/60 (3%)
Query 35 AIRESLDSLGPIFSELRDTGRELLELRKQCYQQQADNHADIAQNLRTSAAMWEQHERAAS 94
I E +D+L P+ + +GRE LE+R C++ D D A +R M ++ +R+AS
Sbjct 66 VICEYVDTLSPVGKLIPPSGRERLEVR--CWEALGDGVIDAAVAIRIELTMRDEAQRSAS 123
>gi|339327182|ref|YP_004686875.1| glutathione S-transferase [Cupriavidus necator N-1]
gi|338167339|gb|AEI78394.1| glutathione S-transferase [Cupriavidus necator N-1]
Length=204
Score = 36.2 bits (82), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/58 (38%), Positives = 31/58 (54%), Gaps = 2/58 (3%)
Query 35 AIRESLDSLGPIFSELRDTGRELLELRKQCYQQQADNHADIAQNLRTSAAMWEQHERA 92
I E +D+L P+ + GRE LE+R C++ AD D A +R A E HER+
Sbjct 65 VIVEYVDTLTPVGRMIPQGGRERLEVR--CWEALADGLLDAALLVRLEATQREPHERS 120
>gi|260795164|ref|XP_002592576.1| hypothetical protein BRAFLDRAFT_68898 [Branchiostoma floridae]
gi|229277797|gb|EEN48587.1| hypothetical protein BRAFLDRAFT_68898 [Branchiostoma floridae]
Length=1682
Score = 36.2 bits (82), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/98 (30%), Positives = 48/98 (49%), Gaps = 21/98 (21%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGRELLELRKQCYQQQ- 68
++A ++++Q+ +T+ ++H I SLD+LG +SEL D + L CY+Q
Sbjct 873 KKAISYYEQSLAMQKTLYGDNTAHPGIVASLDNLGSSWSELGDQEKAL-----SCYEQSL 927
Query 69 -------ADNHAD--IAQNLRTSAAMWEQ---HERAAS 94
DN AD + Q L A W + H++A S
Sbjct 928 TMQKTIYGDNMADPEVVQTLNNLVAAWSELGYHKKALS 965
>gi|260788618|ref|XP_002589346.1| hypothetical protein BRAFLDRAFT_77799 [Branchiostoma floridae]
gi|229274523|gb|EEN45357.1| hypothetical protein BRAFLDRAFT_77799 [Branchiostoma floridae]
Length=1004
Score = 36.2 bits (82), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/85 (33%), Positives = 42/85 (50%), Gaps = 14/85 (16%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTG--RELLELRKQCYQQ 67
R+AA +H+Q+ + RT+ ++H I SL++LG +ELRD G R + +Q Q
Sbjct 642 RKAACYHEQSLNMKRTIYGEDTAHPGIYTSLNNLG---NELRDLGDYRNAVSYYEQSMQM 698
Query 68 ------QADNHADIAQNLRTSAAMW 86
+ H DIAQ L W
Sbjct 699 MQTIHGKGTAHPDIAQLLSNLGVSW 723
>gi|194290686|ref|YP_002006593.1| glutathione s-transferase enzyme with thioredoxin-like domain
[Cupriavidus taiwanensis LMG 19424]
gi|193224521|emb|CAQ70532.1| putative glutathione S-transferase enzyme with thioredoxin-like
domain [Cupriavidus taiwanensis LMG 19424]
Length=204
Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/58 (38%), Positives = 31/58 (54%), Gaps = 2/58 (3%)
Query 35 AIRESLDSLGPIFSELRDTGRELLELRKQCYQQQADNHADIAQNLRTSAAMWEQHERA 92
I E +D+L P+ + GRE LE+R C++ AD D A +R A E HER+
Sbjct 65 VIVEYVDTLTPVSRLIPQGGRERLEVR--CWEALADGLLDAALLVRLEATQREPHERS 120
>gi|113869079|ref|YP_727568.1| glutathione S-transferase [Ralstonia eutropha H16]
gi|113527855|emb|CAJ94200.1| Glutathione S-transferase [Ralstonia eutropha H16]
Length=204
Score = 35.8 bits (81), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/58 (38%), Positives = 31/58 (54%), Gaps = 2/58 (3%)
Query 35 AIRESLDSLGPIFSELRDTGRELLELRKQCYQQQADNHADIAQNLRTSAAMWEQHERA 92
I E +D+L P+ + GRE LE+R C++ AD D A +R A E HER+
Sbjct 65 VIVEYVDTLTPVSRLIPQGGRERLEVR--CWEALADGLLDAALLVRLEATQREPHERS 120
>gi|226198019|ref|ZP_03793591.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan
9]
gi|225929895|gb|EEH25910.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan
9]
Length=938
Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 33/99 (34%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 496 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 552
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A A++L A +EQH A R++
Sbjct 553 AQAANRQQAICDALAHTARDLSAQATAFEQHTAALLRTM 591
Score = 34.3 bits (77), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 32/99 (33%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 576 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 632
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A +++L A +EQH A R++
Sbjct 633 AQAANRQQTICDALAHTSRDLSAQATAFEQHTAALLRAM 671
>gi|186477296|ref|YP_001858766.1| glutathione S-transferase domain-containing protein [Burkholderia
phymatum STM815]
gi|184193755|gb|ACC71720.1| Glutathione S-transferase domain [Burkholderia phymatum STM815]
Length=202
Score = 35.8 bits (81), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 20/60 (34%), Positives = 33/60 (55%), Gaps = 2/60 (3%)
Query 35 AIRESLDSLGPIFSELRDTGRELLELRKQCYQQQADNHADIAQNLRTSAAMWEQHERAAS 94
I E +D+L P+ + +GRE +E+R C++ AD D A +R + E+ R+AS
Sbjct 65 VICEYVDTLSPVCKLVPQSGRERVEVR--CWEALADGIMDAAVLMRLEGVLREEAHRSAS 122
>gi|260816994|ref|XP_002603372.1| hypothetical protein BRAFLDRAFT_80365 [Branchiostoma floridae]
gi|229288691|gb|EEN59383.1| hypothetical protein BRAFLDRAFT_80365 [Branchiostoma floridae]
Length=1185
Score = 35.4 bits (80), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 23/83 (28%), Positives = 43/83 (52%), Gaps = 10/83 (12%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
++A ++H+Q +RT+ ++H I +SL++LG + L D + L L++R+
Sbjct 1036 KKAVSYHEQALQMMRTIYGENTAHPDIAKSLNNLGSAWRNLGDHKKALSYHEQALQMRRT 1095
Query 64 CYQQQADNHADIAQNLRTSAAMW 86
Y + H+DIA +L W
Sbjct 1096 IYGENTA-HSDIASSLNNLGNAW 1117
>gi|296165465|ref|ZP_06848000.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295899180|gb|EFG78651.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=100
Score = 35.0 bits (79), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 20/86 (24%), Positives = 39/86 (46%), Gaps = 0/86 (0%)
Query 1 MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELRDTGRELLEL 60
MT+ I + P L + A +H + + + I ++++LGPI E++ ++L
Sbjct 1 MTEPIRIDPEVLHEVAGNHDEVVNIIEAARERGADIHAAVETLGPIMHEVKSAVGDVLLD 60
Query 61 RKQCYQQQADNHADIAQNLRTSAAMW 86
R + A H LR +A ++
Sbjct 61 RDTALAEHAGRHRFAGDELRRAAHIY 86
>gi|260811630|ref|XP_002600525.1| hypothetical protein BRAFLDRAFT_70102 [Branchiostoma floridae]
gi|229285812|gb|EEN56537.1| hypothetical protein BRAFLDRAFT_70102 [Branchiostoma floridae]
Length=358
Score = 35.0 bits (79), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 25/94 (27%), Positives = 47/94 (50%), Gaps = 13/94 (13%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
++A ++H+Q+ +RT+ ++H I SLD+LG + L D + + L++ +
Sbjct 121 KKAVSYHEQSLQMMRTIYGENTAHSDIASSLDNLGIAWMNLGDHKKAVSYHEQSLQMMRT 180
Query 64 CYQQQADNHADIAQNLRTSAAMWEQ---HERAAS 94
Y + H+DIA +L W H++A S
Sbjct 181 IYGENTA-HSDIASSLDNLGIAWMNLGDHKKAVS 213
>gi|189502336|ref|YP_001958053.1| hypothetical protein Aasi_0966 [Candidatus Amoebophilus asiaticus
5a2]
gi|189497777|gb|ACE06324.1| hypothetical protein Aasi_0966 [Candidatus Amoebophilus asiaticus
5a2]
Length=2145
Score = 35.0 bits (79), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 26/98 (27%), Positives = 51/98 (53%), Gaps = 13/98 (13%)
Query 5 IHVQPAHLRQAAAHHQQTADYLRTVPS-SHDAIRESLDSLGPIFS------ELRDTGREL 57
I+ ++A ++QQ +++ + +H + +S++S+G I+ E +E
Sbjct 1237 IYKTLGQYQKALKYYQQALGMRKSLYTGNHPHVAQSINSVGHIYQILGQYQEALKYYQEA 1296
Query 58 LELRKQCYQQQADNHADIAQNLRTSAAMWE---QHERA 92
LE+RK Y+ NH D+AQ+L + + W+ QH +A
Sbjct 1297 LEMRKALYR---GNHPDVAQSLNSIGSAWKALGQHHKA 1331
>gi|134283723|ref|ZP_01770421.1| conserved hypothetical protein [Burkholderia pseudomallei 305]
gi|134244905|gb|EBA45001.1| conserved hypothetical protein [Burkholderia pseudomallei 305]
Length=938
Score = 35.0 bits (79), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 33/99 (34%), Positives = 45/99 (46%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 576 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 632
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q Q D A A++L A +EQH A R++
Sbjct 633 AQAADHQQAICDALAHTARDLSAQATAFEQHTAALLRAM 671
Score = 34.7 bits (78), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 32/99 (33%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 496 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 552
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A +++L A +EQH A R++
Sbjct 553 AQAANRQQAICDALAHTSRDLSAQATAFEQHTAALLRTM 591
>gi|260791148|ref|XP_002590602.1| hypothetical protein BRAFLDRAFT_83757 [Branchiostoma floridae]
gi|229275797|gb|EEN46613.1| hypothetical protein BRAFLDRAFT_83757 [Branchiostoma floridae]
Length=670
Score = 35.0 bits (79), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 26/72 (37%), Positives = 39/72 (55%), Gaps = 11/72 (15%)
Query 32 SHDAIRESLDSLGPIFSELRD------TGRELLELRKQCYQQQADNHADIAQNLRTSAAM 85
+H +I +LDSLG I E D RE LEL++Q Y ++++ A IAQ L +
Sbjct 225 NHPSISGALDSLGAILVEKGDYHESLRCHREALELKRQMYGEKSN--APIAQTLNNLGNV 282
Query 86 WE---QHERAAS 94
++ +HE A S
Sbjct 283 YKSLGEHETALS 294
>gi|260814986|ref|XP_002602194.1| hypothetical protein BRAFLDRAFT_76881 [Branchiostoma floridae]
gi|229287501|gb|EEN58206.1| hypothetical protein BRAFLDRAFT_76881 [Branchiostoma floridae]
Length=1768
Score = 35.0 bits (79), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 28/95 (30%), Positives = 48/95 (51%), Gaps = 15/95 (15%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
R+A +H+Q+ + R++ ++H AI SL++LG +S L D + + L++ +
Sbjct 1327 RKAVNYHEQSLEMTRSIYGKDTAHSAIAASLNNLGTTWSNLGDHRKAISYHEQALDMTRG 1386
Query 64 CYQQQADN-HADIAQNLRTSAAMWEQ---HERAAS 94
Y + DN H DIA +L W H +A S
Sbjct 1387 IYGE--DNAHPDIACSLNNMGNTWGNLGDHRKAVS 1419
>gi|260793721|ref|XP_002591859.1| hypothetical protein BRAFLDRAFT_89372 [Branchiostoma floridae]
gi|229277071|gb|EEN47870.1| hypothetical protein BRAFLDRAFT_89372 [Branchiostoma floridae]
Length=1612
Score = 34.7 bits (78), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 27/94 (29%), Positives = 47/94 (50%), Gaps = 13/94 (13%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
+ A +++++ ++T+ ++H I SL+SLG +SEL D + L L ++K
Sbjct 1242 KTAISYYERALTMMKTIHEENTAHPDIVSSLNSLGSAWSELDDKEKALSYYEQALSMQKS 1301
Query 64 CYQQQADNHADIAQNLRTSAAMWEQ---HERAAS 94
Y HADIA L ++W + E+A S
Sbjct 1302 LYGDTT-AHADIASTLNHLGSVWSELDDQEKAIS 1334
>gi|292899534|ref|YP_003538903.1| CorA-like Mg2+ transporter [Erwinia amylovora ATCC 49946]
gi|291199382|emb|CBJ46499.1| putative CorA-like Mg2+ transporter [Erwinia amylovora ATCC 49946]
gi|312172474|emb|CBX80731.1| putative Zinc transport protein [Erwinia amylovora ATCC BAA-2158]
Length=327
Score = 34.7 bits (78), Expect = 4.5, Method: Composition-based stats.
Identities = 21/47 (45%), Positives = 28/47 (60%), Gaps = 3/47 (6%)
Query 8 QPAHLRQAAAHHQQTADYLRTVPSSHDAIRESL--DSLGPIFSELRD 52
+P L AHHQ +AD+LR P DA+R++L DS+ P S L D
Sbjct 39 RPCWLHLNYAHHQ-SADWLRLTPLLPDAVRDALGGDSMRPRVSRLGD 84
>gi|260801839|ref|XP_002595802.1| hypothetical protein BRAFLDRAFT_96775 [Branchiostoma floridae]
gi|229281051|gb|EEN51814.1| hypothetical protein BRAFLDRAFT_96775 [Branchiostoma floridae]
Length=1412
Score = 34.7 bits (78), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 27/86 (32%), Positives = 44/86 (52%), Gaps = 13/86 (15%)
Query 13 RQAAAHHQQTADYLRTV--PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQC 64
R+A + +Q+ +T+ ++H I +SL ++G SEL D R L L + K
Sbjct 1260 RKAINYFEQSLTMRKTIYGNTAHPDIAQSLLNMGTSLSELGDKKRALRYYEQSLSMSKSF 1319
Query 65 YQQQADN--HADIAQNLRTSAAMWEQ 88
Y DN H DIA++L+T + W +
Sbjct 1320 Y---GDNTAHPDIAKSLQTLGSYWSK 1342
>gi|254183623|ref|ZP_04890215.1| conserved hypothetical protein [Burkholderia pseudomallei 1655]
gi|184214156|gb|EDU11199.1| conserved hypothetical protein [Burkholderia pseudomallei 1655]
Length=938
Score = 34.7 bits (78), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 32/99 (33%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 496 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 552
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A +++L A +EQH A R++
Sbjct 553 AQAANRQQAICDALAHTSRDLSAQATAFEQHTAALLRTM 591
Score = 34.7 bits (78), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 32/99 (33%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 576 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 632
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A +++L A +EQH A R++
Sbjct 633 AQAANRQQTICDALAHTSRDLSAQATAFEQHTAALLRTM 671
>gi|301120464|ref|XP_002907959.1| hypothetical protein PITG_01286 [Phytophthora infestans T30-4]
gi|262102990|gb|EEY61042.1| hypothetical protein PITG_01286 [Phytophthora infestans T30-4]
Length=375
Score = 34.7 bits (78), Expect = 5.2, Method: Composition-based stats.
Identities = 16/39 (42%), Positives = 24/39 (62%), Gaps = 0/39 (0%)
Query 13 RQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSELR 51
R AA H+ TA R V H+A+++SL+S G + +LR
Sbjct 6 RHVAAPHRATATASREVSVDHEAMKKSLESFGANYDQLR 44
>gi|53722285|ref|YP_111270.1| hypothetical protein BPSS1263 [Burkholderia pseudomallei K96243]
gi|52212699|emb|CAH38730.1| putative membrane protein [Burkholderia pseudomallei K96243]
Length=938
Score = 34.7 bits (78), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 32/99 (33%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 496 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 552
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A +++L A +EQH A R++
Sbjct 553 AQAANRQQAICDALAHTSRDLSAQATAFEQHTAALLRTM 591
Score = 34.3 bits (77), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 32/99 (33%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 576 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 632
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A +++L A +EQH A R++
Sbjct 633 AQAANRQQTICDALAHTSRDLSAQATAFEQHTAALLRAM 671
>gi|126457515|ref|YP_001075735.1| hypothetical protein BURPS1106A_A1701 [Burkholderia pseudomallei
1106a]
gi|242313973|ref|ZP_04812990.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b]
gi|254193564|ref|ZP_04899998.1| conserved hypothetical protein [Burkholderia pseudomallei S13]
gi|126231283|gb|ABN94696.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a]
gi|169650317|gb|EDS83010.1| conserved hypothetical protein [Burkholderia pseudomallei S13]
gi|242137212|gb|EES23615.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b]
Length=938
Score = 34.7 bits (78), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 32/99 (33%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 496 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 552
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A +++L A +EQH A R++
Sbjct 553 AQAANRQQAICDALAHTSRDLSAQATAFEQHTAALLRTM 591
Score = 34.3 bits (77), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 32/99 (33%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 576 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 632
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A +++L A +EQH A R++
Sbjct 633 AQAANRQQTICDALAHTSRDLSAQATAFEQHTAALLRAM 671
>gi|254301764|ref|ZP_04969207.1| conserved hypothetical protein [Burkholderia pseudomallei 406e]
gi|157811748|gb|EDO88918.1| conserved hypothetical protein [Burkholderia pseudomallei 406e]
Length=938
Score = 34.7 bits (78), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 32/99 (33%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 496 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWAQTS 552
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A +++L A +EQH A R++
Sbjct 553 AQAANRQQAICDALAHTSRDLSAQATAFEQHTAALLRAM 591
>gi|260811622|ref|XP_002600521.1| hypothetical protein BRAFLDRAFT_70106 [Branchiostoma floridae]
gi|229285808|gb|EEN56533.1| hypothetical protein BRAFLDRAFT_70106 [Branchiostoma floridae]
Length=957
Score = 34.3 bits (77), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 23/83 (28%), Positives = 42/83 (51%), Gaps = 10/83 (12%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
++A ++H+Q+ RT+ ++H I SLD+LG + +L D + + L++R+
Sbjct 588 KKAVSYHEQSLQMKRTIYGENTAHPDIAASLDNLGLVCMDLGDYKKAVSYHEQSLQMRRT 647
Query 64 CYQQQADNHADIAQNLRTSAAMW 86
Y + H DIA +L W
Sbjct 648 IYGENT-AHPDIAASLNNLGLAW 669
Score = 33.9 bits (76), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 24/83 (29%), Positives = 41/83 (50%), Gaps = 10/83 (12%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGREL------LELRKQ 63
++A +HH+Q+ RT+ ++H I SLD+LG +L D + + L++R+
Sbjct 676 KKAVSHHEQSLQMERTIYGENTAHPDIAASLDNLGLACMDLGDYKKAVSYHEQSLQMRRT 735
Query 64 CYQQQADNHADIAQNLRTSAAMW 86
Y + H DIA +L W
Sbjct 736 IYGENT-AHPDIAASLNNLGLAW 757
>gi|260816988|ref|XP_002603369.1| hypothetical protein BRAFLDRAFT_80362 [Branchiostoma floridae]
gi|229288688|gb|EEN59380.1| hypothetical protein BRAFLDRAFT_80362 [Branchiostoma floridae]
Length=345
Score = 34.3 bits (77), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 24/85 (29%), Positives = 45/85 (53%), Gaps = 10/85 (11%)
Query 13 RQAAAHHQQTADYLRTV---PSSHDAIRESLDSLGPIFSELRDTGRELLELRKQCYQQQ- 68
++A ++H+Q+ +RT+ ++H I SLD+LG + +L D ++ L +Q Q +
Sbjct 196 KKAVSYHEQSLQMMRTIYGENTAHSDIASSLDNLGIAWMDLGDH-KKALSYHEQSLQMER 254
Query 69 ---ADN--HADIAQNLRTSAAMWEQ 88
+N H+DIA +L W+
Sbjct 255 IIHGENTAHSDIASSLNNLGNAWKN 279
>gi|82190152|sp|O57611.1|K1C18_SCYST RecName: Full=Keratin, type I cytoskeletal 18; AltName: Full=Cytokeratin-18;
Short=CK-18; AltName: Full=Keratin-18; Short=K18
gi|2909612|emb|CAA74980.1| type I keratin K18 [Scyliorhinus stellaris]
Length=415
Score = 34.3 bits (77), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 28/76 (37%), Positives = 38/76 (50%), Gaps = 7/76 (9%)
Query 36 IRESLDSLGPIFSELRDTGRELLELRKQCYQQQADNHADIAQ--NLRTSA----AMWEQH 89
I+E LD+ GP F + + L ELRK+ Y DN I Q N R +A WE
Sbjct 99 IKEHLDARGPSFRDWSIYEKPLNELRKEVYDMTVDNARLILQIDNARLAADDFRVKWES- 157
Query 90 ERAASRSLGNIIDGSR 105
E + +S+ N I+G R
Sbjct 158 ELSIRQSVENDINGLR 173
>gi|167906729|ref|ZP_02493934.1| hypothetical protein BpseN_31155 [Burkholderia pseudomallei NCTC
13177]
Length=858
Score = 34.3 bits (77), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 32/99 (33%), Positives = 46/99 (47%), Gaps = 18/99 (18%)
Query 14 QAAAHHQQTADYLRTVPSSHDAIRESL------------DSLGPIFSELRDTGRELLELR 61
QA A Q TA LRT+ SH A++ +L DSLG I ++L G E +
Sbjct 496 QATAFEQHTAALLRTMSESHSALQATLESRDEQRLATWTDSLGSIAAKL---GTEWEQTS 552
Query 62 KQCYQQQ---ADNHADIAQNLRTSAAMWEQHERAASRSL 97
Q +Q D A A+++ A +EQH A R++
Sbjct 553 AQAANRQQAICDALAHTARDISAQATAFEQHTAALLRTM 591
>gi|292488335|ref|YP_003531217.1| Zinc transport protein [Erwinia amylovora CFBP1430]
gi|291553764|emb|CBA20809.1| putative Zinc transport protein [Erwinia amylovora CFBP1430]
Length=361
Score = 34.3 bits (77), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 21/47 (45%), Positives = 28/47 (60%), Gaps = 3/47 (6%)
Query 8 QPAHLRQAAAHHQQTADYLRTVPSSHDAIRESL--DSLGPIFSELRD 52
+P L AHHQ +AD+LR P DA+R++L DS+ P S L D
Sbjct 73 RPCWLHLNYAHHQ-SADWLRLTPLLPDAVRDALGGDSMRPRVSRLGD 118
Lambda K H
0.317 0.127 0.368
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 131458853568
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40