BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0743c
Length=185
Sequences producing significant alignments: Score (Bits) E Value
gi|15607883|ref|NP_215257.1| hypothetical protein Rv0743c [Mycob... 376 8e-103
gi|254231060|ref|ZP_04924387.1| hypothetical protein TBCG_00733 ... 346 8e-94
gi|339293768|gb|AEJ45879.1| hypothetical protein CCDC5079_0689 [... 342 2e-92
gi|167966640|ref|ZP_02548917.1| hypothetical protein MtubH3_0062... 330 6e-89
gi|296128653|ref|YP_003635903.1| hypothetical protein Cfla_0794 ... 160 7e-38
gi|119717556|ref|YP_924521.1| hypothetical protein Noca_3332 [No... 130 1e-28
gi|334336642|ref|YP_004541794.1| hypothetical protein Isova_1125... 129 1e-28
gi|269793604|ref|YP_003313059.1| hypothetical protein Sked_02560... 128 4e-28
gi|120404599|ref|YP_954428.1| hypothetical protein Mvan_3631 [My... 122 3e-26
gi|329915752|ref|ZP_08276304.1| converved hypothetical protein [... 108 4e-22
gi|319760079|ref|YP_004124018.1| hypothetical protein Alide_4571... 107 8e-22
gi|340781927|ref|YP_004748534.1| hypothetical protein Atc_1185 [... 103 1e-20
gi|160897761|ref|YP_001563343.1| hypothetical protein Daci_2320 ... 103 1e-20
gi|294340607|emb|CAZ88997.1| conserved hypothetical protein [Thi... 103 2e-20
gi|226349851|ref|YP_002776964.1| hypothetical protein ROP_pROB02... 102 2e-20
gi|116694305|ref|YP_728516.1| hypothetical protein H16_B0351 [Ra... 101 6e-20
gi|254003145|ref|YP_003052611.1| hypothetical protein Msip34_286... 99.8 2e-19
gi|284046272|ref|YP_003396612.1| hypothetical protein Cwoe_4824 ... 98.2 5e-19
gi|330823505|ref|YP_004386808.1| hypothetical protein Alide2_087... 95.9 2e-18
gi|91786580|ref|YP_547532.1| hypothetical protein Bpro_0676 [Pol... 91.3 5e-17
gi|221064688|ref|ZP_03540793.1| conserved hypothetical protein [... 89.7 2e-16
gi|91791124|ref|YP_552074.1| hypothetical protein Bpro_5319 [Pol... 87.0 1e-15
gi|319796018|ref|YP_004157658.1| hypothetical protein Varpa_5391... 85.9 2e-15
gi|121583161|ref|YP_973602.1| hypothetical protein Pnap_4592 [Po... 80.9 8e-14
gi|339327884|ref|YP_004687576.1| hypothetical protein CNE_BB1p01... 80.1 1e-13
gi|255021997|ref|ZP_05294004.1| hypothetical protein ACA_0468 [A... 73.9 1e-11
gi|330819853|ref|YP_004348715.1| hypothetical protein bgla_2g073... 67.8 7e-10
gi|209965881|ref|YP_002298796.1| hypothetical protein RC1_2603 [... 53.9 9e-06
gi|339628980|ref|YP_004720623.1| hypothetical protein TPY_2720 [... 41.2 0.078
gi|188581816|ref|YP_001925261.1| hypothetical protein Mpop_2569 ... 38.9 0.33
gi|269796955|ref|YP_003316410.1| gluconolactonase [Sanguibacter ... 38.9 0.35
gi|327310946|ref|YP_004337843.1| hypothetical protein TUZN_1051 ... 37.4 0.96
gi|260906751|ref|ZP_05915073.1| hypothetical protein BlinB_15582... 37.4 1.1
gi|218529078|ref|YP_002419894.1| hypothetical protein Mchl_1064 ... 35.8 3.3
gi|126728072|ref|ZP_01743888.1| Probable sodium/sulphate symport... 35.4 3.5
gi|332292830|ref|YP_004431439.1| GCN5-related N-acetyltransferas... 35.4 3.8
gi|301622029|ref|XP_002940344.1| PREDICTED: hypothetical protein... 35.4 4.0
gi|94969064|ref|YP_591112.1| hypothetical protein Acid345_2037 [... 35.0 4.4
>gi|15607883|ref|NP_215257.1| hypothetical protein Rv0743c [Mycobacterium tuberculosis H37Rv]
gi|15840154|ref|NP_335191.1| hypothetical protein MT0769 [Mycobacterium tuberculosis CDC1551]
gi|31791929|ref|NP_854422.1| hypothetical protein Mb0764c [Mycobacterium bovis AF2122/97]
73 more sequence titles
Length=185
Score = 376 bits (965), Expect = 8e-103, Method: Compositional matrix adjust.
Identities = 185/185 (100%), Positives = 185/185 (100%), Gaps = 0/185 (0%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD
Sbjct 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH
Sbjct 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
Query 121 DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWLNYYGA 180
DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWLNYYGA
Sbjct 121 DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWLNYYGA 180
Query 181 GNHSS 185
GNHSS
Sbjct 181 GNHSS 185
>gi|254231060|ref|ZP_04924387.1| hypothetical protein TBCG_00733 [Mycobacterium tuberculosis C]
gi|124600119|gb|EAY59129.1| hypothetical protein TBCG_00733 [Mycobacterium tuberculosis C]
Length=267
Score = 346 bits (888), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 172/172 (100%), Positives = 172/172 (100%), Gaps = 0/172 (0%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD
Sbjct 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH
Sbjct 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
Query 121 DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIA 172
DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIA
Sbjct 121 DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIA 172
>gi|339293768|gb|AEJ45879.1| hypothetical protein CCDC5079_0689 [Mycobacterium tuberculosis
CCDC5079]
gi|339297407|gb|AEJ49517.1| hypothetical protein CCDC5180_0680 [Mycobacterium tuberculosis
CCDC5180]
Length=169
Score = 342 bits (876), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 168/169 (99%), Positives = 169/169 (100%), Gaps = 0/169 (0%)
Query 17 VGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARDKADHVDVAIGEMSDFH 76
+GDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARDKADHVDVAIGEMSDFH
Sbjct 1 MGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARDKADHVDVAIGEMSDFH 60
Query 77 RSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPHDLAVAKLAAGREKDKA 136
RSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPHDLAVAKLAAGREKDKA
Sbjct 61 RSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPHDLAVAKLAAGREKDKA 120
Query 137 FVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWLNYYGAGNHSS 185
FVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWLNYYGAGNHSS
Sbjct 121 FVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWLNYYGAGNHSS 169
>gi|167966640|ref|ZP_02548917.1| hypothetical protein MtubH3_00628 [Mycobacterium tuberculosis
H37Ra]
Length=164
Score = 330 bits (846), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 162/164 (99%), Positives = 163/164 (99%), Gaps = 0/164 (0%)
Query 22 VLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARDKADHVDVAIGEMSDFHRSNGV 81
+LVLGSQSILGSFDENELPPQATASQEADIAFVNDPARDKADHVDVAIGEMSDFHRSNGV
Sbjct 1 MLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARDKADHVDVAIGEMSDFHRSNGV 60
Query 82 YAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPHDLAVAKLAAGREKDKAFVAAL 141
YAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPHDLAVAKLAAGREKDKAFVAAL
Sbjct 61 YAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPHDLAVAKLAAGREKDKAFVAAL 120
Query 142 IRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWLNYYGAGNHSS 185
IRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWLNYYGAG HSS
Sbjct 121 IRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWLNYYGAGKHSS 164
>gi|296128653|ref|YP_003635903.1| hypothetical protein Cfla_0794 [Cellulomonas flavigena DSM 20109]
gi|296020468|gb|ADG73704.1| conserved hypothetical protein [Cellulomonas flavigena DSM 20109]
Length=183
Score = 160 bits (406), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 86/181 (48%), Positives = 117/181 (65%), Gaps = 3/181 (1%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
M R +LAH+LR A + D+L++GSQSILG++DE+ELP +A S EAD+AF+ D A +
Sbjct 1 MKRVELAHILRAASTITSTSDILIVGSQSILGTYDEDELPDEAVGSIEADVAFLGDGAAE 60
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAI-LPNGWRDRLVSWTVESSRPAKPRFLEP 119
KA VD AIGE S FH+ G Y +GV +D + LP GW++R+V+W SS P + LEP
Sbjct 61 KALAVDGAIGEDSGFHQMYGYYGQGVEVDGLVALPEGWQERIVTWQSLSSEPGRALCLEP 120
Query 120 HDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWLNYYG 179
HDLA++KL A REKD FV ALI + LLD V+ R L E + +R+ +W+
Sbjct 121 HDLAISKLVAHREKDLDFVYALIEARLLDPAVLLER--LKATEVARPLARRVESWVRAMA 178
Query 180 A 180
A
Sbjct 179 A 179
>gi|119717556|ref|YP_924521.1| hypothetical protein Noca_3332 [Nocardioides sp. JS614]
gi|119538217|gb|ABL82834.1| conserved hypothetical protein [Nocardioides sp. JS614]
Length=187
Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 78/185 (43%), Positives = 113/185 (62%), Gaps = 10/185 (5%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEAD---IAFVNDP 57
M R QL H +R AC ++ +V+V+GSQ+ILG++DE++LP AT S E D IA N
Sbjct 1 MRRDQLEHAIRTACQIIQQPEVIVVGSQAILGTYDESQLPDAATMSIEVDILPIADTNAE 60
Query 58 ARDKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVS-WTVESSRPA-KPR 115
A AD ++ GE+S F +G +GV + TA+LP+GWRDRLV ++ PA +PR
Sbjct 61 AARLADLIESVAGELSPFEELHGFSIDGVDLQTAVLPDGWRDRLVKVQNANTAAPAGEPR 120
Query 116 F----LEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRI 171
F L+ DL VAKL A R+KD+ FVAAL+++ L+D +I R+ +P + + Q +
Sbjct 121 FTGLCLDKEDLCVAKLVAFRDKDRNFVAALLKANLVDADLIAERLSTVPPKHATAVEQGL 180
Query 172 AAWLN 176
WL
Sbjct 181 -TWLT 184
>gi|334336642|ref|YP_004541794.1| hypothetical protein Isova_1125 [Isoptericola variabilis 225]
gi|334107010|gb|AEG43900.1| hypothetical protein Isova_1125 [Isoptericola variabilis 225]
Length=188
Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 76/178 (43%), Positives = 111/178 (63%), Gaps = 15/178 (8%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
M R QL H +R AC ++ +V+V+GSQ+ILG++DE++LP AT S E DI + P ++
Sbjct 1 MRRDQLEHAIRTACQIIDHTEVIVVGSQAILGTYDESQLPAAATMSVEIDILPIA-PTKE 59
Query 61 K----ADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPA---- 112
+ AD ++ GE+S F +G +GV +DTAILP GWRDRLV V+++ A
Sbjct 60 EVISLADRIEGVAGELSAFEALHGFSIDGVDLDTAILPTGWRDRLVK--VQNANTAAPLG 117
Query 113 KPRF----LEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPR 166
+PRF L+ DL VAKL A REKD+ FVAA++ +GL+D ++ R+ + E R
Sbjct 118 EPRFTGWCLDKEDLCVAKLCAFREKDRNFVAAMLDAGLVDRDLVAVRLQSVSTEYSTR 175
>gi|269793604|ref|YP_003313059.1| hypothetical protein Sked_02560 [Sanguibacter keddieii DSM 10542]
gi|269095789|gb|ACZ20225.1| hypothetical protein Sked_02560 [Sanguibacter keddieii DSM 10542]
Length=187
Score = 128 bits (322), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 81/190 (43%), Positives = 113/190 (60%), Gaps = 14/190 (7%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEAD---IAFVNDP 57
M R QL H +R AC ++G V+V+GSQSILG+FDE LP AT S E D IA ++
Sbjct 1 MRRDQLEHAIRTACQILGHPTVIVVGSQSILGTFDEQRLPAAATMSLEIDILPIATSDEE 60
Query 58 ARDKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPA----K 113
AD ++ GE SDFH +G +GV ++TAILP GWR+RLV+ V++ A
Sbjct 61 TARLADLLEGIAGEWSDFHEMHGFSIDGVDLETAILPAGWRERLVA--VQNLNTAAIGGA 118
Query 114 PRF----LEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQ 169
P+F L+ DL VAKL A REKD FV AL +G +D+ V+++R+ +P+ + R Q
Sbjct 119 PQFTGLCLDKEDLCVAKLCAYREKDLEFVGALADAGFVDLRVVESRLREVPDASAGRAAQ 178
Query 170 RIAAWLNYYG 179
+ W+ G
Sbjct 179 AL-RWVASRG 187
>gi|120404599|ref|YP_954428.1| hypothetical protein Mvan_3631 [Mycobacterium vanbaalenii PYR-1]
gi|119957417|gb|ABM14422.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=193
Score = 122 bits (306), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 76/186 (41%), Positives = 109/186 (59%), Gaps = 10/186 (5%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
M R QL H +R AC + G +V+++GSQ+ILG++ E+ELP AT S E D+ + D + +
Sbjct 1 MRRDQLEHAIRAACQIAGLTEVIIVGSQAILGTYTEDELPFYATRSAEVDVLPIADGSDE 60
Query 61 ---KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVS-WTVESSRPA-KPR 115
AD ++ GE S F +G +GV + T+ LP GWR RLV ++ P+ +P+
Sbjct 61 IARLADEIEGVAGEFSPFAELHGFNIDGVDLQTSALPEGWRGRLVKVQNPNTAAPSGEPQ 120
Query 116 F----LEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRI 171
F L+ DL VAKL A REKD+ FV ALI + L+D VI R+ +PE P +R
Sbjct 121 FIGWCLDKEDLCVAKLCALREKDQNFVDALITANLVDPRVITTRLTTVPEAHRP-AAERA 179
Query 172 AAWLNY 177
A WL +
Sbjct 180 AHWLAH 185
>gi|329915752|ref|ZP_08276304.1| converved hypothetical protein [Oxalobacteraceae bacterium IMCC9480]
gi|327544849|gb|EGF30226.1| converved hypothetical protein [Oxalobacteraceae bacterium IMCC9480]
Length=181
Score = 108 bits (270), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 66/157 (43%), Positives = 93/157 (60%), Gaps = 6/157 (3%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
M + ++ H+LR A A+VGD + +++GSQS+ G + + L + SQE D+ N D
Sbjct 1 MKKSEVEHVLRAAAAIVGDNEFIIIGSQSLHGKYPD--LADEILKSQEVDLLSKNK--MD 56
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESS-RPAKPRFLEP 119
K D ++V IG S FH + G YA+ V TA LP +R+RLV E S K LEP
Sbjct 57 KTDFLNV-IGMDSTFHETFGYYADPVDAHTATLPKHYRNRLVHLKQEGSGVSVKAYCLEP 115
Query 120 HDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARV 156
HDL VAKLAAGR+KD F+ AL+ L++ I+ R+
Sbjct 116 HDLVVAKLAAGRDKDHVFIRALLARKLINAETIRLRL 152
>gi|319760079|ref|YP_004124018.1| hypothetical protein Alide_4571 [Alicycliphilus denitrificans
BC]
gi|317119685|gb|ADV02173.1| hypothetical protein Alide_4571 [Alicycliphilus denitrificans
BC]
Length=190
Score = 107 bits (267), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 68/172 (40%), Positives = 98/172 (57%), Gaps = 6/172 (3%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
M + L H++R A AV +++V+GSQSILGS D P + S EADI +
Sbjct 1 MHKSDLEHIIRAASAVTNQYEIVVVGSQSILGSVDAP--PMECLVSMEADIFVLGH--EQ 56
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
+D +D +GE S FH + G YA+GV T+ILP+GWR+RLV ++ L+
Sbjct 57 LSDLIDGVLGEGSAFHDTFGYYAQGVDSTTSILPDGWRERLVRLQSPNTDGKVGYCLDAT 116
Query 121 DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIA 172
DL +AK A REKD+ F AL+ G++D V +RV +P + D + +RIA
Sbjct 117 DLFLAKCVANREKDREFNLALLVHGIVDATVALSRVESMPIDDDAK--KRIA 166
>gi|340781927|ref|YP_004748534.1| hypothetical protein Atc_1185 [Acidithiobacillus caldus SM-1]
gi|340556080|gb|AEK57834.1| conserved hypothetical protein [Acidithiobacillus caldus SM-1]
Length=205
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/169 (37%), Positives = 98/169 (58%), Gaps = 7/169 (4%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDE-NELPPQA-----TASQEADIAFV 54
MT +QL HL+R + A++GD ++L++GSQSIL + PP+A T S EADI +
Sbjct 8 MTEEQLEHLIRSSGAILGDSEILIIGSQSILPWLRKWAGKPPRAWPGVFTLSTEADIIPI 67
Query 55 NDPARDKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKP 114
++ ++ K+D +D +GE S FH + G +A+GV ++TA P GW+ R E ++
Sbjct 68 DNDSK-KSDLIDGVLGEDSYFHATYGYFAQGVSMETARAPEGWQARCYPLKSERTQGVVG 126
Query 115 RFLEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEET 163
+ P DL +AK AGR KD F+ A+I G+++ + V +P T
Sbjct 127 YCMHPADLFIAKTMAGRPKDGPFLDAMIEHGIVEESTVLHLVPKIPNCT 175
>gi|160897761|ref|YP_001563343.1| hypothetical protein Daci_2320 [Delftia acidovorans SPH-1]
gi|160363345|gb|ABX34958.1| conserved hypothetical protein [Delftia acidovorans SPH-1]
Length=178
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/177 (38%), Positives = 99/177 (56%), Gaps = 6/177 (3%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
M R+ L H++R + V + + +++GSQ+ILGS E P S EADI +N P D
Sbjct 1 MNREDLEHIIRASGDVTNEYEFVIVGSQAILGSIPYPE--PVFKMSAEADIYPLNAP--D 56
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
AD +D +IGE S FH SNG YA+GV DTA+L +GW++RL ++ L+
Sbjct 57 LADRIDGSIGEGSRFHESNGYYAQGVGPDTAVLASGWQNRLHRIQNGNTNDRVGYCLDVL 116
Query 121 DLAVAKLAAGREKDKAFVAALIRSG--LLDVGVIQARVLLLPEETDPRIGQRIAAWL 175
DL ++K AGR+KD+ F AL+ G +D + A + L ++ ++ RI W
Sbjct 117 DLFLSKAQAGRDKDRVFCMALMEHGHVQVDAALKLASSMPLGDDGKRQLRARIQRWF 173
>gi|294340607|emb|CAZ88997.1| conserved hypothetical protein [Thiomonas sp. 3As]
Length=215
Score = 103 bits (256), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/162 (41%), Positives = 96/162 (60%), Gaps = 7/162 (4%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATA-SQEADIAFVNDPAR 59
M R+QL H+LR A A+ +V+GSQSI+G + PP S EADI ++ P
Sbjct 1 MNREQLEHILRAASAITKQGRFIVVGSQSIVGVMPD---PPGVLGYSAEADIYPLDAP-- 55
Query 60 DKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEP 119
+ AD +D +IGE S FH + G YA+GV +TA+LPNGW+ RL + + A L+P
Sbjct 56 ELADLIDGSIGEGSPFHETFGYYAQGVGPETAVLPNGWQYRL-NQVRDPITLADGFCLDP 114
Query 120 HDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPE 161
D+AV+KL A REKDK F+ ++ S L+ ++ R +P+
Sbjct 115 TDMAVSKLVAWREKDKEFLGVMLESKLIHHDELERRASQVPQ 156
>gi|226349851|ref|YP_002776964.1| hypothetical protein ROP_pROB02-00200 [Rhodococcus opacus B4]
gi|226245766|dbj|BAH47033.1| hypothetical protein [Rhodococcus opacus B4]
Length=210
Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/156 (41%), Positives = 92/156 (59%), Gaps = 9/156 (5%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADI---AFVNDP 57
M R++L H++ ACA + + V+V GSQSILG++DE ELP AT S+E D+ + ++ P
Sbjct 1 MNREELEHVIEAACANLDEGQVIVFGSQSILGTYDETELPEYATLSREVDVFPRSGIDAP 60
Query 58 AR----DKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVE-SSRPA 112
A +K ++ +GE S FH S GVY EG+H + LV+ V+ S
Sbjct 61 ASPAVVEKILILNGRLGECSPFHESFGVYVEGIHKGCSGTAEAMGQPLVAVEVQDGSEYG 120
Query 113 KPRF-LEPHDLAVAKLAAGREKDKAFVAALIRSGLL 147
+ F L+P DL +K AGREKD+ FVAAL+ G++
Sbjct 121 RAGFCLDPVDLCASKAIAGREKDRVFVAALVEDGIV 156
>gi|116694305|ref|YP_728516.1| hypothetical protein H16_B0351 [Ralstonia eutropha H16]
gi|113528804|emb|CAJ95151.1| converved hypothetical protein [Ralstonia eutropha H16]
Length=188
Score = 101 bits (251), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 70/177 (40%), Positives = 94/177 (54%), Gaps = 8/177 (4%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQA-TASQEADIAFVNDPAR 59
M R+ L H++R A + + + +V+GSQSILG PP T S EADI +N A
Sbjct 1 MKREDLEHIIRAAADITNEYEFVVVGSQSILGPIPN---PPAVFTMSAEADIYPLN--AT 55
Query 60 DKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEP 119
KAD +D AIGE S FH + G YA+GV +TA LP GW +RL + L+
Sbjct 56 HKADAIDAAIGEGSRFHETYGYYAQGVGPETACLPTGWENRLQRIQTVGTNGRVGYCLDL 115
Query 120 HDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLP--EETDPRIGQRIAAW 174
DL +AK AA R+KD+ F ALI+ G + +RV +P R+ RI W
Sbjct 116 VDLFMAKAAADRDKDRVFCMALIQLGYVLPRTAISRVDDMPIDRAAQGRLRARIKRW 172
>gi|254003145|ref|YP_003052611.1| hypothetical protein Msip34_2860 [Methylovorus glucosetrophus
SIP3-4]
gi|253987228|gb|ACT52084.1| hypothetical protein Msip34_2860 [Methylovorus glucosetrophus
SIP3-4]
Length=256
Score = 99.8 bits (247), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/159 (39%), Positives = 88/159 (56%), Gaps = 9/159 (5%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
M R + HLLR A V+ + +V+GSQSILG + + P + S EAD+ N P ++
Sbjct 1 MRRSDIEHLLRAAGDVLNETAFIVVGSQSILGKYPD--APAELLQSAEADLIAKNKPEQN 58
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
H +GE+S FH G +A+ V +TAILP GW RLV+ ++ L+PH
Sbjct 59 ---HKLEVLGELSPFHDMYGYFADPVDRNTAILPKGWEGRLVNLKTPATNGVTGLCLDPH 115
Query 121 DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLL 159
DL V+K+AAGR+KD + +I L V + RVL L
Sbjct 116 DLFVSKMAAGRDKDLIYCRVMIEHNL----VGKERVLAL 150
>gi|284046272|ref|YP_003396612.1| hypothetical protein Cwoe_4824 [Conexibacter woesei DSM 14684]
gi|283950493|gb|ADB53237.1| conserved hypothetical protein [Conexibacter woesei DSM 14684]
Length=180
Score = 98.2 bits (243), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 62/164 (38%), Positives = 91/164 (56%), Gaps = 8/164 (4%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
M R +L H++R A V +++V+GSQ+ILG+ + P SQEAD+ + P +
Sbjct 1 MKRDELEHVIRAAADVASSDEIVVIGSQAILGAIPDA--PATLLWSQEADVYPLRAP--E 56
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTV----ESSRPAKPRF 116
+A +D A+G+ S FH + G YA GV +TAI P GW+ RLV V R
Sbjct 57 RATAIDGALGDGSQFHATFGYYAHGVGPETAIAPAGWQARLVPVRVRRGPRDEREVVGWC 116
Query 117 LEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLP 160
+EPHD+ +AK AAGRE+D F +R ++ +G + R LP
Sbjct 117 MEPHDVVLAKCAAGRERDWEFAREALRHEVVAIGELCRRATELP 160
>gi|330823505|ref|YP_004386808.1| hypothetical protein Alide2_0879 [Alicycliphilus denitrificans
K601]
gi|329308877|gb|AEB83292.1| hypothetical protein Alide2_0879 [Alicycliphilus denitrificans
K601]
Length=181
Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/160 (38%), Positives = 91/160 (57%), Gaps = 4/160 (2%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
MTR++L H++R + + + +++GSQSILG+ E T S EADI + P +
Sbjct 1 MTREELEHIIRASGDITDQYEFVIVGSQSILGAVPRPE--DVFTVSMEADIYPLQAP--E 56
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
AD +D AIGE S FH + G YA+GV +TA LP GW R+ +++ L+
Sbjct 57 LADRIDGAIGEGSQFHETYGYYAQGVGPETACLPAGWMQRVHRIQNRNTQDRIGYCLDVL 116
Query 121 DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLP 160
DL +AK+ A REKD+ F AL++ G +++ A V +P
Sbjct 117 DLFLAKVVAAREKDREFCIALLQYGYVNLEAALALVDNMP 156
>gi|91786580|ref|YP_547532.1| hypothetical protein Bpro_0676 [Polaromonas sp. JS666]
gi|91695805|gb|ABE42634.1| conserved hypothetical protein [Polaromonas sp. JS666]
Length=186
Score = 91.3 bits (225), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 59/160 (37%), Positives = 87/160 (55%), Gaps = 4/160 (2%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
MTR++L H++R A + + +++GSQSILG+ E P T S EADI + P +
Sbjct 1 MTREELEHIIRAAADITDYYEFIIIGSQSILGAVPHPE--PVFTVSMEADIYPKDAP--E 56
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
A+ +D AIGE S F + G YA+GV +TA LP W R ++ L+
Sbjct 57 LAEKIDGAIGEGSHFQDTFGYYAQGVGPETATLPAEWLSRAHKVQNANTNGRIGYCLDLA 116
Query 121 DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLP 160
DL ++K AGREKD+ F AL++ G + + A V +P
Sbjct 117 DLFLSKATAGREKDREFCMALLQYGYVTPAQVLALVSTMP 156
>gi|221064688|ref|ZP_03540793.1| conserved hypothetical protein [Comamonas testosteroni KF-1]
gi|220709711|gb|EED65079.1| conserved hypothetical protein [Comamonas testosteroni KF-1]
Length=177
Score = 89.7 bits (221), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/180 (34%), Positives = 96/180 (54%), Gaps = 13/180 (7%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQA---TASQEADIAFVNDP 57
M R L HL+R + + + D+L++GSQSILG+ +P A S EAD+ P
Sbjct 1 MNRDDLEHLIRVSAEIAQEYDLLIVGSQSILGA-----IPYPAHEFKRSMEADMYPRYAP 55
Query 58 ARDKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFL 117
+KA ++ AIGE S+FH+++G YA+ V T +P GW +RL ++ L
Sbjct 56 --EKATKIEGAIGEASEFHKTHGYYAQEVDPSTFTVPLGWEERLCKIQNANTDSKIGWCL 113
Query 118 EPHDLAVAKLAAGREKDKAFVAALIRSGLL---DVGVIQARVLLLPEETDPRIGQRIAAW 174
DL ++K AAGR+KD+ F A++R + + + ++ + EE R+ +RI W
Sbjct 114 SLVDLFLSKAAAGRDKDREFCQAMLRHRYVIASEALELVPAMMTMDEEERERLAKRIRRW 173
>gi|91791124|ref|YP_552074.1| hypothetical protein Bpro_5319 [Polaromonas sp. JS666]
gi|91701005|gb|ABE47176.1| hypothetical protein Bpro_5319 [Polaromonas sp. JS666]
Length=194
Score = 87.0 bits (214), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/156 (34%), Positives = 85/156 (55%), Gaps = 5/156 (3%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
M L + + A + G D +V+GS SILG E+P T S + D DP R
Sbjct 1 MNLDALFAMFKEARTLSGHTDFVVIGSLSILGLEQSFEIPDSMTMSNDIDCYTQADPGR- 59
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
D VD A+GE S +H+ +G + + V + LP+GWRDRL+ + R FL+P+
Sbjct 60 IFDVVD-ALGENSPYHKKSGFFLDAVSPELPSLPDGWRDRLIKVECDGLRAW---FLDPN 115
Query 121 DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARV 156
D A++K A G +D+ ++ A I +G++ + ++ AR+
Sbjct 116 DAALSKYARGEPRDRRWIQAGILAGVVSMPIVMARI 151
>gi|319796018|ref|YP_004157658.1| hypothetical protein Varpa_5391 [Variovorax paradoxus EPS]
gi|315598481|gb|ADU39547.1| hypothetical protein Varpa_5391 [Variovorax paradoxus EPS]
Length=178
Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/176 (35%), Positives = 91/176 (52%), Gaps = 11/176 (6%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQAT-ASQEADIAFVNDPAR 59
M +L H+LR + A+ + +V+GSQ++L E PP+A S+E D+ P R
Sbjct 1 MNLDELQHVLRASAAISKENSFVVVGSQAVLLLL---EHPPEALLVSREIDLYPALHPER 57
Query 60 DKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEP 119
AD +D AIG S FH + G +A+GV +TA++P W +R + P
Sbjct 58 --ADLIDGAIGMHSSFHETFGYFADGVGPETAVMPADWMNRASLHYIGDITAICPDL--- 112
Query 120 HDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWL 175
HDL V+K AGREKD FV L++ GL+ + R+ LL P ++A W+
Sbjct 113 HDLVVSKCVAGREKDADFVRELLKHGLVSAETLTERIGLLDAAKYPL--PQLAVWV 166
>gi|121583161|ref|YP_973602.1| hypothetical protein Pnap_4592 [Polaromonas naphthalenivorans
CJ2]
gi|120596423|gb|ABM39860.1| conserved hypothetical protein [Polaromonas naphthalenivorans
CJ2]
Length=192
Score = 80.9 bits (198), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/155 (36%), Positives = 79/155 (51%), Gaps = 5/155 (3%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
M L L A A+ G D +V+GS S+LG + ++P T S +AD DP R
Sbjct 1 MNLHALFRLFAEAKALSGHQDYVVIGSLSVLGLEESFDIPETMTMSVDADCYTKADPGR- 59
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
V A+GE S FH +G Y + V LP GW +RL+ E R FLEP
Sbjct 60 -IFDVVKALGENSPFHIEHGFYLDAVSPHLPSLPAGWENRLIKVEREGLRIW---FLEPS 115
Query 121 DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQAR 155
D A++K A G +D+ ++ A I SG++ + V+ +R
Sbjct 116 DAALSKYARGEPRDQRWIRAGILSGVVSIPVVNSR 150
>gi|339327884|ref|YP_004687576.1| hypothetical protein CNE_BB1p01110 [Cupriavidus necator N-1]
gi|338170485|gb|AEI81538.1| hypothetical protein CNE_BB1p01110 [Cupriavidus necator N-1]
Length=146
Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/131 (42%), Positives = 70/131 (54%), Gaps = 4/131 (3%)
Query 46 SQEADIAFVNDPARDKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWT 105
S EADI +N A DKAD +D AIGE S FH + G YA+GV +TA LP GW+ RL
Sbjct 2 SAEADIYPLN--AIDKADAIDAAIGEGSRFHETYGYYAQGVGPETACLPAGWQRRLQRIQ 59
Query 106 VESSRPAKPRFLEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEE--T 163
+ L+ DL +AK A R+KD+ F ALI+ G + +RV +P E
Sbjct 60 TADTNGRVGYCLDVVDLFMAKAVAARDKDRVFCMALIQYGYVSPRAALSRVEDMPIEKAA 119
Query 164 DPRIGQRIAAW 174
R+ RI W
Sbjct 120 QGRLRARIKRW 130
>gi|255021997|ref|ZP_05294004.1| hypothetical protein ACA_0468 [Acidithiobacillus caldus ATCC
51756]
gi|254968565|gb|EET26120.1| hypothetical protein ACA_0468 [Acidithiobacillus caldus ATCC
51756]
Length=151
Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/120 (35%), Positives = 67/120 (56%), Gaps = 1/120 (0%)
Query 44 TASQEADIAFVNDPARDKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVS 103
T S EADI +++ ++ K+D +D +GE S FH + G +A+GV ++TA P GW+ R
Sbjct 3 TLSTEADIIPIDNDSK-KSDLIDGVLGEDSYFHATYGYFAQGVSMETARAPEGWQARCYP 61
Query 104 WTVESSRPAKPRFLEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEET 163
E ++ + P DL +AK AGR KD F+ A+I G+++ + V +P T
Sbjct 62 LKSERTQGVVGYCMHPADLFIAKTMAGRPKDGPFLDAMIEHGIVEESTVLHLVPKIPNCT 121
>gi|330819853|ref|YP_004348715.1| hypothetical protein bgla_2g07340 [Burkholderia gladioli BSR3]
gi|327371848|gb|AEA63203.1| hypothetical protein bgla_2g07340 [Burkholderia gladioli BSR3]
Length=259
Score = 67.8 bits (164), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 49/156 (32%), Positives = 73/156 (47%), Gaps = 7/156 (4%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEADIAFVNDPARD 60
M + L +L A V + L+ GS S+LG + ++P + S + D + DP R
Sbjct 1 MHLEHLRRILAEAHKVSHHTEYLIAGSLSVLGV--KADIPDAMSLSIDVDFYPLRDPER- 57
Query 61 KADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPH 120
A + +GE S FH G Y + +H + LP WR R+V K FLE +
Sbjct 58 -AGEIARTLGEGSAFHTQYGYYLDPIHPELPTLPRSWRSRIVEHDFGD---VKAMFLEVN 113
Query 121 DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARV 156
D AV+K G E D ++ A + +LD+ I RV
Sbjct 114 DTAVSKYTRGAENDLRWIEAGYDAKILDLEAIAVRV 149
>gi|209965881|ref|YP_002298796.1| hypothetical protein RC1_2603 [Rhodospirillum centenum SW]
gi|209959347|gb|ACI99983.1| hypothetical protein RC1_2603 [Rhodospirillum centenum SW]
Length=195
Score = 53.9 bits (128), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 51/181 (29%), Positives = 79/181 (44%), Gaps = 8/181 (4%)
Query 2 TRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENEL----PPQATASQEADIAFVNDP 57
TR L +R G V V+GSQS+L + + P+ A ++
Sbjct 8 TRLDLERAVRALAVHFGTDRVFVIGSQSVLLGWPDAPFALRNSPEIDAYPANAGEWLKTS 67
Query 58 ARDKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFL 117
+ ++ ++ GE S FH ++G Y +GV TA L W DR V V+ + R +
Sbjct 68 GIEASEEINALFGEGSQFHIAHGFYIDGVDETTAKLAPDWLDRAVVLDVDRPGGGRVRAI 127
Query 118 EPH--DLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQARVLLLPEETDPRIGQRIAAWL 175
P D+ V+KL EKD+ ++ R LD+ + LLL D I R A+L
Sbjct 128 APSTVDIIVSKLHRLAEKDRTYIRECNRVRPLDIPCTKR--LLLSSGPDSAILARALAFL 185
Query 176 N 176
+
Sbjct 186 D 186
>gi|339628980|ref|YP_004720623.1| hypothetical protein TPY_2720 [Sulfobacillus acidophilus TPY]
gi|339286769|gb|AEJ40880.1| hypothetical protein TPY_2720 [Sulfobacillus acidophilus TPY]
Length=179
Score = 41.2 bits (95), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 38/143 (27%), Positives = 65/143 (46%), Gaps = 17/143 (11%)
Query 26 GSQSILGSFDENEL-PPQATASQ----EADIAFVNDPARDKADHVD-------VAIGEMS 73
G +L +FD++ L PP A A EA +AF + D+ D +D +A+ +
Sbjct 8 GIYQVLKTFDDHPLWPPDAKAVMIVVGEAALAFYHATTIDETDDLDGILWASSIAVEYVL 67
Query 74 DFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPHDLAVAKLAAGREK 133
S G+ H+ LP W +R + W+ + + +L P+D ++KL G +
Sbjct 68 RMAESTGISFRAGHV--MWLPWDWNER-IQWSGWDFQHLQVGWLYPYDWVISKLGRGLDH 124
Query 134 DKAFVAALIRSGLLDVGVIQARV 156
D A + + S LD ++ RV
Sbjct 125 DAAHIMRMAPS--LDPEMMYHRV 145
>gi|188581816|ref|YP_001925261.1| hypothetical protein Mpop_2569 [Methylobacterium populi BJ001]
gi|179345314|gb|ACB80726.1| hypothetical protein Mpop_2569 [Methylobacterium populi BJ001]
Length=181
Score = 38.9 bits (89), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 30/98 (31%), Positives = 44/98 (45%), Gaps = 7/98 (7%)
Query 8 HLLRRACAVVGDVDVLVL---GSQSILGSFDENELPPQATASQEADIAFVNDPARDKADH 64
H R VVG +LV ++ S + + P A A + A +D + ++
Sbjct 23 HFKARTVVVVGSQGILVGWPGAPVTMCMSPEIDAYPANARAWEAAQ----DDDLAEASEE 78
Query 65 VDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDRLV 102
+ V GE S FH ++G Y +GV TA LP W R V
Sbjct 79 ISVIFGEGSHFHTAHGFYIDGVDDRTARLPPSWPSRAV 116
>gi|269796955|ref|YP_003316410.1| gluconolactonase [Sanguibacter keddieii DSM 10542]
gi|269099140|gb|ACZ23576.1| gluconolactonase [Sanguibacter keddieii DSM 10542]
Length=280
Score = 38.9 bits (89), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 19/48 (40%), Positives = 29/48 (61%), Gaps = 3/48 (6%)
Query 46 SQEADIAFVNDPARDKADHVDVAIGEMSD---FHRSNGVYAEGVHIDT 90
S E D+A+ ND A D DV GE++ FH S+G +A+G+ +D+
Sbjct 150 SPEGDLAYYNDTATGTTDVFDVVDGELTGRRVFHSSDGTHADGLTVDS 197
>gi|327310946|ref|YP_004337843.1| hypothetical protein TUZN_1051 [Thermoproteus uzoniensis 768-20]
gi|326947425|gb|AEA12531.1| hypothetical protein TUZN_1051 [Thermoproteus uzoniensis 768-20]
Length=173
Score = 37.4 bits (85), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 31/83 (38%), Positives = 41/83 (50%), Gaps = 6/83 (7%)
Query 77 RSNGVY--AEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPHDLAVAKLAAGREKD 134
R G+Y AEGVH+D P D VS E+ P P DLA+ KLA+G KD
Sbjct 75 RKWGLYVDAEGVHVDINYAPLILDDEFVSRCREAEGLLIP---SPEDLAILKLASGERKD 131
Query 135 KAFVAALIRSGLLDVGVIQARVL 157
+ L+R LD+ ++ R L
Sbjct 132 IDDLKKLLRLP-LDLSYLRRRAL 153
>gi|260906751|ref|ZP_05915073.1| hypothetical protein BlinB_15582 [Brevibacterium linens BL2]
Length=31
Score = 37.4 bits (85), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 16/24 (67%), Positives = 19/24 (80%), Gaps = 0/24 (0%)
Query 1 MTRQQLAHLLRRACAVVGDVDVLV 24
M RQ+LAH+LR AC + GD DVLV
Sbjct 1 MNRQELAHILRAACRITGDQDVLV 24
>gi|218529078|ref|YP_002419894.1| hypothetical protein Mchl_1064 [Methylobacterium chloromethanicum
CM4]
gi|240141915|ref|YP_002966423.1| hypothetical protein MexAM1_META2p0164 [Methylobacterium extorquens
AM1]
gi|218521381|gb|ACK81966.1| conserved hypothetical protein [Methylobacterium chloromethanicum
CM4]
gi|240011857|gb|ACS43082.1| conserved hypothetical protein [Methylobacterium extorquens AM1]
Length=192
Score = 35.8 bits (81), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 22/64 (35%), Positives = 29/64 (46%), Gaps = 3/64 (4%)
Query 94 PNGWRDRLVSWTVESSRPAKPRFLEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVGVIQ 153
P+ W + W+V R + P DL V K+ DK + AL SGLLD I+
Sbjct 98 PDCWDNAWEGWSVGR---IDVRVVSPTDLCVPKVGRWPGNDKEDICALAASGLLDADTIE 154
Query 154 ARVL 157
R L
Sbjct 155 QRCL 158
>gi|126728072|ref|ZP_01743888.1| Probable sodium/sulphate symporter [Sagittula stellata E-37]
gi|126711037|gb|EBA10087.1| Probable sodium/sulphate symporter [Sagittula stellata E-37]
Length=595
Score = 35.4 bits (80), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 25/79 (32%), Positives = 44/79 (56%), Gaps = 14/79 (17%)
Query 13 ACAVVGDVDVLVLGSQSILGSFDENELPPQA---TASQEADIAFVND-PARDKADHVDVA 68
A AVVG + +++LG + LP +A + +++A+ AF+++ R HV A
Sbjct 190 AVAVVGGLTMMILG---------KVLLPDRAQKESGTEDAETAFLSEITVRSAYPHVGTA 240
Query 69 IGEMSDFHRSNGVYAEGVH 87
+G+++DF RS GV G+
Sbjct 241 LGKIADFQRS-GVRVTGIR 258
>gi|332292830|ref|YP_004431439.1| GCN5-related N-acetyltransferase [Krokinobacter diaphorus 4H-3-7-5]
gi|332170916|gb|AEE20171.1| GCN5-related N-acetyltransferase [Krokinobacter sp. 4H-3-7-5]
Length=169
Score = 35.4 bits (80), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 13/45 (29%), Positives = 26/45 (58%), Gaps = 0/45 (0%)
Query 73 SDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFL 117
++F NG+ E ++IDT L G+ RL+ +++ +R K ++
Sbjct 81 TEFQEPNGLEIERIYIDTTYLRKGYGKRLIDFSISKARQLKKNYI 125
>gi|301622029|ref|XP_002940344.1| PREDICTED: hypothetical protein LOC100494670 [Xenopus (Silurana)
tropicalis]
Length=2316
Score = 35.4 bits (80), Expect = 4.0, Method: Composition-based stats.
Identities = 16/48 (34%), Positives = 25/48 (53%), Gaps = 0/48 (0%)
Query 84 EGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPHDLAVAKLAAGR 131
+GV I T LP G + L W +E+ +P P EP D + ++G+
Sbjct 642 KGVWIITVTLPTGTIEYLGKWIIETEKPTLPELYEPGDWIIQLGSSGQ 689
>gi|94969064|ref|YP_591112.1| hypothetical protein Acid345_2037 [Candidatus Koribacter versatilis
Ellin345]
gi|94551114|gb|ABF41038.1| hypothetical protein Acid345_2037 [Candidatus Koribacter versatilis
Ellin345]
Length=198
Score = 35.0 bits (79), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 22/83 (27%), Positives = 39/83 (47%), Gaps = 1/83 (1%)
Query 73 SDFHRSNGVYAEGVHIDTAILPNGWRDRLVSWTVESSRPAKPRFLEPHDLAVAKLAAGRE 132
S HR + +Y + V + A P + DRL + + + +PHDL + K+ E
Sbjct 76 SPLHRKHRIYLQLVTVIEAY-PEEYEDRLSEMFPGALKHLRLLAPDPHDLVLMKVGRNSE 134
Query 133 KDKAFVAALIRSGLLDVGVIQAR 155
+D+ + L R GL+ ++ R
Sbjct 135 RDREGIKFLARKGLITSTELRTR 157
Lambda K H
0.319 0.135 0.397
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 174138591420
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
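
Note on reading the scores: the bit scores in this report can be related to the raw scores shown in parentheses through the gapped Lambda and K values listed above. The following is a minimal sketch of that standard Karlin-Altschul conversion, not the program's own code. The function names are hypothetical, and the constants are copied from the footer of this report. Because the printed Lambda and K are rounded, and because the search used compositional matrix adjustment, the results should only approximate the reported values, with E-values drifting more than bit scores.

```python
import math

# Gapped statistical parameters and search space, copied from the report footer.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 174138591420


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value: effective search space times 2^(-bit score)."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores taken from the first and last alignments in this report.
    for raw in (965, 79):
        bits = bit_score(raw)
        print(f"raw={raw} bits={bits:.1f} E~{expect_value(bits):.1e}")
```

For the first hit (raw score 965) this gives a bit score close to the reported 376, and for the last hit (raw score 79) close to the reported 35.0.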