BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3180c
Length=144
Sequences producing significant alignments: Score (Bits) E Value
gi|15610316|ref|NP_217696.1| hypothetical protein Rv3180c [Mycob... 279 1e-73
gi|145223605|ref|YP_001134283.1| hypothetical protein Mflv_3018 ... 261 2e-68
gi|183983163|ref|YP_001851454.1| hypothetical protein MMAR_3166 ... 256 1e-66
gi|118618785|ref|YP_907117.1| hypothetical protein MUL_3491 [Myc... 236 1e-60
gi|289751861|ref|ZP_06511239.1| hypothetical alanine rich protei... 201 2e-50
gi|289763364|ref|ZP_06522742.1| hypothetical alanine rich protei... 184 4e-45
gi|183983173|ref|YP_001851464.1| hypothetical protein MMAR_3183 ... 142 2e-32
gi|334338127|ref|YP_004543279.1| PilT protein domain protein [Is... 93.2 1e-17
gi|269957800|ref|YP_003327589.1| hypothetical protein Xcel_3025 ... 89.4 2e-16
gi|229821890|ref|YP_002883416.1| hypothetical protein Bcav_3412 ... 82.8 2e-14
gi|145223797|ref|YP_001134475.1| PilT domain-containing protein ... 77.0 8e-13
gi|260892184|ref|YP_003238281.1| PilT protein domain protein [Am... 73.6 1e-11
gi|333992046|ref|YP_004524660.1| hypothetical protein JDM601_340... 72.0 3e-11
gi|312198893|ref|YP_004018954.1| PilT protein domain protein [Fr... 66.6 1e-09
gi|94972184|ref|YP_594224.1| hypothetical protein Dgeo_2717 [Dei... 64.3 5e-09
gi|258592459|emb|CBE68768.1| conserved protein of unknown functi... 63.5 1e-08
gi|284992148|ref|YP_003410702.1| PilT domain-containing protein ... 63.5 1e-08
gi|88812276|ref|ZP_01127527.1| hypothetical protein NB231_02713 ... 63.2 1e-08
gi|169830417|ref|YP_001716399.1| hypothetical protein Daud_0206 ... 59.3 2e-07
gi|78224268|ref|YP_386015.1| hypothetical protein Gmet_3076 [Geo... 57.8 5e-07
gi|344198594|ref|YP_004782920.1| hypothetical protein Acife_0369... 55.8 2e-06
gi|301064228|ref|ZP_07204671.1| toxin-antitoxin system, toxin co... 53.5 1e-05
gi|87124594|ref|ZP_01080443.1| hypothetical protein RS9917_13310... 52.4 2e-05
gi|198282236|ref|YP_002218557.1| hypothetical protein Lferr_0088... 52.4 3e-05
gi|83591030|ref|YP_431039.1| hypothetical protein Moth_2207 [Moo... 51.6 4e-05
gi|46255278|ref|YP_006190.1| hypothetical protein TT_P0209 [Ther... 51.2 4e-05
gi|116747855|ref|YP_844542.1| hypothetical protein Sfum_0407 [Sy... 51.2 4e-05
gi|284041682|ref|YP_003392022.1| hypothetical protein Cwoe_0211 ... 49.7 1e-04
gi|258593454|emb|CBE69793.1| conserved protein of unknown functi... 48.9 2e-04
gi|218296688|ref|ZP_03497406.1| conserved hypothetical protein [... 48.5 3e-04
gi|340783792|ref|YP_004750398.1| hypothetical protein Atc_m167 [... 48.1 4e-04
gi|344342234|ref|ZP_08773132.1| PilT protein domain protein [Thi... 46.6 0.001
gi|158521913|ref|YP_001529783.1| hypothetical protein Dole_1902 ... 46.2 0.002
gi|337277681|ref|YP_004617152.1| hypothetical protein Rta_00720 ... 45.4 0.003
gi|297618046|ref|YP_003703205.1| PilT protein domain-containing ... 45.1 0.003
gi|124516076|gb|EAY57585.1| conserved hypothetical protein [Lept... 43.9 0.009
gi|337286921|ref|YP_004626394.1| hypothetical protein Thein_1569... 43.5 0.011
gi|334118678|ref|ZP_08492766.1| hypothetical protein MicvaDRAFT_... 43.1 0.015
gi|75911188|ref|YP_325484.1| hypothetical protein Ava_4992 [Anab... 42.7 0.017
gi|186680922|ref|YP_001864118.1| hypothetical protein Npun_F0394... 42.7 0.018
gi|87303098|ref|ZP_01085896.1| hypothetical protein WH5701_06326... 42.0 0.032
gi|15805687|ref|NP_294383.1| hypothetical protein DR_0660 [Deino... 40.8 0.061
gi|166366200|ref|YP_001658473.1| hypothetical protein MAE_34590 ... 40.8 0.063
gi|336176937|ref|YP_004582312.1| hypothetical protein FsymDg_088... 40.4 0.081
gi|334337553|ref|YP_004542705.1| PilT protein domain protein [Is... 40.4 0.088
gi|94265149|ref|ZP_01288913.1| hypothetical protein MldDRAFT_438... 40.4 0.088
gi|333967724|gb|AEG34488.1| hypothetical protein Ththe16_2108 [T... 40.0 0.10
gi|55978318|ref|YP_145374.1| hypothetical protein TTHB135 [Therm... 40.0 0.11
gi|328953359|ref|YP_004370693.1| hypothetical protein Desac_1665... 40.0 0.11
gi|86741564|ref|YP_481964.1| PilT protein-like protein [Frankia ... 40.0 0.11
>gi|15610316|ref|NP_217696.1| hypothetical protein Rv3180c [Mycobacterium tuberculosis H37Rv]
gi|15842760|ref|NP_337797.1| hypothetical protein MT3271 [Mycobacterium tuberculosis CDC1551]
gi|31794358|ref|NP_856851.1| hypothetical protein Mb3206c [Mycobacterium bovis AF2122/97]
77 more sequence titles
Length=144
Score = 279 bits (713), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 143/144 (99%), Positives = 144/144 (100%), Gaps = 0/144 (0%)
Query 1 VPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES 60
+PLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES
Sbjct 1 MPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES 60
Query 61 ELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVA 120
ELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVA
Sbjct 61 ELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVA 120
Query 121 VWDRRLHTGAHAAGCRVAPAQLDP 144
VWDRRLHTGAHAAGCRVAPAQLDP
Sbjct 121 VWDRRLHTGAHAAGCRVAPAQLDP 144
>gi|145223605|ref|YP_001134283.1| hypothetical protein Mflv_3018 [Mycobacterium gilvum PYR-GCK]
gi|145216091|gb|ABP45495.1| conserved hypothetical alanine rich protein [Mycobacterium gilvum
PYR-GCK]
Length=144
Score = 261 bits (668), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 137/144 (96%), Positives = 143/144 (99%), Gaps = 0/144 (0%)
Query 1 VPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES 60
+PLVYFDASAFVKLLTTETGSS+A+ALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES
Sbjct 1 MPLVYFDASAFVKLLTTETGSSVAAALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES 60
Query 61 ELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVA 120
ELA+AERDWEDFWAATRP+ELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGL+VA
Sbjct 61 ELAEAERDWEDFWAATRPIELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLIVA 120
Query 121 VWDRRLHTGAHAAGCRVAPAQLDP 144
VWDRRLHTGA AAGCRVAPAQLDP
Sbjct 121 VWDRRLHTGARAAGCRVAPAQLDP 144
>gi|183983163|ref|YP_001851454.1| hypothetical protein MMAR_3166 [Mycobacterium marinum M]
gi|183176489|gb|ACC41599.1| conserved hypothetical alanine rich protein [Mycobacterium marinum
M]
Length=144
Score = 256 bits (653), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 135/144 (94%), Positives = 140/144 (98%), Gaps = 0/144 (0%)
Query 1 VPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES 60
+PLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES
Sbjct 1 MPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES 60
Query 61 ELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVA 120
ELADAERDWEDFWAATRP+ELTATVEQHAG LARAHALR ADAVHLASALAVG+PGL+VA
Sbjct 61 ELADAERDWEDFWAATRPIELTATVEQHAGRLARAHALREADAVHLASALAVGEPGLIVA 120
Query 121 VWDRRLHTGAHAAGCRVAPAQLDP 144
VWDRRLH GA AAGCR+APAQLDP
Sbjct 121 VWDRRLHAGAQAAGCRLAPAQLDP 144
>gi|118618785|ref|YP_907117.1| hypothetical protein MUL_3491 [Mycobacterium ulcerans Agy99]
gi|118570895|gb|ABL05646.1| conserved hypothetical alanine rich protein [Mycobacterium ulcerans
Agy99]
Length=144
Score = 236 bits (601), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 125/144 (87%), Positives = 131/144 (91%), Gaps = 0/144 (0%)
Query 1 VPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES 60
+PLVYFDASAFVKLLTTETG SLAS WDGCDAALS+RLAYPEVRAALAAAARNHDLTES
Sbjct 1 MPLVYFDASAFVKLLTTETGGSLASVPWDGCDAALSARLAYPEVRAALAAAARNHDLTES 60
Query 61 ELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVA 120
ELA+AERDWEDFW ATRPVE T TVE HA HLARAHALRGA AVH+ASALAVG PGL++A
Sbjct 61 ELAEAERDWEDFWTATRPVEHTTTVEHHADHLARAHALRGAQAVHMASALAVGAPGLIIA 120
Query 121 VWDRRLHTGAHAAGCRVAPAQLDP 144
WDRRLHTGA AA CRVAPAQLDP
Sbjct 121 AWDRRLHTGAQAARCRVAPAQLDP 144
>gi|289751861|ref|ZP_06511239.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
T92]
gi|289692448|gb|EFD59877.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
T92]
Length=126
Score = 201 bits (512), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 114/126 (91%), Positives = 116/126 (93%), Gaps = 5/126 (3%)
Query 1 VPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES 60
VPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES
Sbjct 3 VPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES 62
Query 61 ELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGD---PGL 117
ELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADA +S +G PGL
Sbjct 63 ELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAG--SSGQRIGSRRTPGL 120
Query 118 VVAVWD 123
VVAVWD
Sbjct 121 VVAVWD 126
>gi|289763364|ref|ZP_06522742.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
GM 1503]
gi|289710870|gb|EFD74886.1| hypothetical alanine rich protein [Mycobacterium tuberculosis
GM 1503]
Length=123
Score = 184 bits (466), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 105/123 (86%), Positives = 109/123 (89%), Gaps = 4/123 (3%)
Query 22 SLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELADAERDWEDFWAATRPVEL 81
S+ +A C AA + P+ AALAAAARNHDLTESELADAERDWEDFWAATRPVEL
Sbjct 5 SMGTAATPHCPAAWPT----PKSAAALAAAARNHDLTESELADAERDWEDFWAATRPVEL 60
Query 82 TATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVWDRRLHTGAHAAGCRVAPAQ 141
TATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVWDRRLHTGAHAAGCRVAPAQ
Sbjct 61 TATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVWDRRLHTGAHAAGCRVAPAQ 120
Query 142 LDP 144
LDP
Sbjct 121 LDP 123
>gi|183983173|ref|YP_001851464.1| hypothetical protein MMAR_3183 [Mycobacterium marinum M]
gi|183176499|gb|ACC41609.1| hypothetical alanine rich protein [Mycobacterium marinum M]
Length=188
Score = 142 bits (357), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/89 (81%), Positives = 75/89 (85%), Gaps = 0/89 (0%)
Query 56 DLTESELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDP 115
DLTESELA+AERDWEDFWAATRPVE T TVE HA HLARAHALRGA AVHLASALAVG P
Sbjct 50 DLTESELAEAERDWEDFWAATRPVEHTTTVEHHADHLARAHALRGAQAVHLASALAVGAP 109
Query 116 GLVVAVWDRRLHTGAHAAGCRVAPAQLDP 144
GL++A WDRRLHTGA A CRVAP P
Sbjct 110 GLIIAAWDRRLHTGAQAPRCRVAPRPTRP 138
>gi|334338127|ref|YP_004543279.1| PilT protein domain protein [Isoptericola variabilis 225]
gi|334108495|gb|AEG45385.1| PilT protein domain protein [Isoptericola variabilis 225]
Length=141
Score = 93.2 bits (230), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 62/139 (45%), Positives = 72/139 (52%), Gaps = 2/139 (1%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
LVY DASA VKL E GS LASALW+ D ++SRLA EVRA LAA R + +
Sbjct 3 LVYLDASALVKLCVPEPGSELASALWNRADVVVTSRLADAEVRAVLAAGERAGVIDAATR 62
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHA--LRGADAVHLASALAVGDPGLVVA 120
WE W VELTA V A + + LR DAVH+ASAL V P VV
Sbjct 63 ERGLATWERLWPTMHVVELTAEVSARAAEVLATSSVPLRAGDAVHVASALVVAHPDTVVG 122
Query 121 VWDRRLHTGAHAAGCRVAP 139
WD + A + RV P
Sbjct 123 AWDEHVAGAARSRHLRVLP 141
>gi|269957800|ref|YP_003327589.1| hypothetical protein Xcel_3025 [Xylanimonas cellulosilytica DSM
15894]
gi|269306481|gb|ACZ32031.1| hypothetical protein Xcel_3025 [Xylanimonas cellulosilytica DSM
15894]
Length=141
Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/139 (47%), Positives = 79/139 (57%), Gaps = 2/139 (1%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
+VYFDASA VKL+ E+GS LASAL++ D A++SR+A EVRAALAA R L +
Sbjct 3 VVYFDASALVKLVVAESGSELASALYNRADVAVTSRIADVEVRAALAAGVRAGLLDAAAH 62
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHA--LRGADAVHLASALAVGDPGLVVA 120
A A WE W VE+ V A L A LR DAVH+ASAL V P VVA
Sbjct 63 ATAVTAWERLWPTLAVVEVGDQVSHTAAALLAAGTVPLRADDAVHVASALTVAHPETVVA 122
Query 121 VWDRRLHTGAHAAGCRVAP 139
WD ++ + A A V P
Sbjct 123 AWDDQVASAARAQSLIVLP 141
>gi|229821890|ref|YP_002883416.1| hypothetical protein Bcav_3412 [Beutenbergia cavernae DSM 12333]
gi|229567803|gb|ACQ81654.1| conserved hypothetical protein [Beutenbergia cavernae DSM 12333]
Length=139
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/136 (41%), Positives = 73/136 (54%), Gaps = 0/136 (0%)
Query 4 VYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELA 63
V D SA ++L+ E G L ALW+ D+ ++SRLA E+RA L A R L+E+
Sbjct 4 VVMDTSALLRLVHPEPGHDLVCALWNRADSVVASRLADAELRAVLEAGRRTGHLSEAARD 63
Query 64 DAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVWD 123
A W + + R VE+T + AG LA H +R DAV LA AL + V+AVWD
Sbjct 64 AALERWAECRDSVRVVEVTPELADTAGDLALRHGIRAGDAVVLAGALLLAPVDPVLAVWD 123
Query 124 RRLHTGAHAAGCRVAP 139
RL A + G RV P
Sbjct 124 GRLADAARSEGLRVLP 139
>gi|145223797|ref|YP_001134475.1| PilT domain-containing protein [Mycobacterium gilvum PYR-GCK]
gi|145216283|gb|ABP45687.1| PilT protein domain protein [Mycobacterium gilvum PYR-GCK]
Length=235
Score = 77.0 bits (188), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 52/124 (42%), Positives = 62/124 (50%), Gaps = 0/124 (0%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
+ Y D SAFV LL E S WD DA +SSRL Y E AALA A R LTE E
Sbjct 89 IGYLDTSAFVPLLIDEPASVACRRFWDDADAIVSSRLLYVETAAALAQAGRIGRLTEGEH 148
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVW 122
A R W+ +E+ + A LA +LRG DAVH ASA + D + A
Sbjct 149 LQARRRLGQMWSEMDVIEVDEQIVTRAADLAHRLSLRGYDAVHAASAEQLDDDDVAAASG 208
Query 123 DRRL 126
D+RL
Sbjct 209 DQRL 212
>gi|260892184|ref|YP_003238281.1| PilT protein domain protein [Ammonifex degensii KC4]
gi|260864325|gb|ACX51431.1| PilT protein domain protein [Ammonifex degensii KC4]
Length=151
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/140 (36%), Positives = 69/140 (50%), Gaps = 3/140 (2%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
+ Y D SA VKL E GS + L D +S++AYPE RAALA R+ L E +
Sbjct 2 ICYLDTSALVKLYVREPGSEMVRKLVDEASVVATSKVAYPEARAALARGFRDGLLEEKDY 61
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDP---GLVV 119
++ W +E++ ++ AG LA H LRG DA+HLA+AL + +V
Sbjct 62 RQVVVALQNDWPRYLVLEVSDSLAWLAGELAEKHRLRGFDAIHLAAALTLKTQVKGRVVA 121
Query 120 AVWDRRLHTGAHAAGCRVAP 139
A +D RL A V P
Sbjct 122 ACFDDRLWEALCAVDLEVVP 141
>gi|333992046|ref|YP_004524660.1| hypothetical protein JDM601_3407 [Mycobacterium sp. JDM601]
gi|333488015|gb|AEF37407.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=142
Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 49/122 (41%), Positives = 60/122 (50%), Gaps = 0/122 (0%)
Query 5 YFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELAD 64
Y +AFV LL E S+ WD D +SSRL Y E AALA A R +T+ +
Sbjct 4 YLGTAAFVPLLIEEQTSAACRRFWDDADVVVSSRLLYVETAAALAQAYRMGRMTQGQHRQ 63
Query 65 AERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVWDR 124
+ R ++ W VE V A LA +LRG DAVH ASA + D LV A DR
Sbjct 64 SRRRLDEMWLEIDIVEADDQVINRAADLAYRLSLRGYDAVHCASAAQLADDMLVAASGDR 123
Query 125 RL 126
L
Sbjct 124 GL 125
>gi|312198893|ref|YP_004018954.1| PilT protein domain protein [Frankia sp. EuI1c]
gi|311230229|gb|ADP83084.1| PilT protein domain protein [Frankia sp. EuI1c]
Length=146
Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/132 (41%), Positives = 67/132 (51%), Gaps = 0/132 (0%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
+ YFD SAFV LL E GS+ A +WD D +SSRL + E AALA A R L+ S
Sbjct 2 ICYFDTSAFVPLLVDEPGSATAIRIWDAADRVVSSRLLHVEAAAALAQANRLGKLSGSAH 61
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVW 122
A D +A + +T + A LAR ALR DA+H A+A + LVVA
Sbjct 62 QAALLRLNDIYAEFDLLPITDGLVSRAATLARQLALRAFDAMHCAAAQLLASDDLVVASG 121
Query 123 DRRLHTGAHAAG 134
DR+L G
Sbjct 122 DRKLLAACRTLG 133
>gi|94972184|ref|YP_594224.1| hypothetical protein Dgeo_2717 [Deinococcus geothermalis DSM
11300]
gi|94554235|gb|ABF44150.1| PIN domain, predicted nuclease, component of toxin-antitoxin
system [Deinococcus geothermalis DSM 11300]
Length=140
Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 46/113 (41%), Positives = 57/113 (51%), Gaps = 1/113 (0%)
Query 1 VPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTES 60
+ + Y D+SAF KL E G AL + +AY EVR LA LTE
Sbjct 1 MTVAYLDSSAFAKLYLDEPGREAVEALVGETGRVAACAIAYAEVRGVLARYLHQGRLTEE 60
Query 61 ELADAERDWEDFWAATRPVELTATVEQHAGHLARAHA-LRGADAVHLASALAV 112
E A +E W T V+LT + + AG L RAHA LR DA+HLA+AL V
Sbjct 61 EYEGANEAFEADWGTTNVVDLTPALLRLAGDLLRAHAELRAMDALHLAAALEV 113
>gi|258592459|emb|CBE68768.1| conserved protein of unknown function [NC10 bacterium 'Dutch
sediment']
Length=147
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/136 (34%), Positives = 64/136 (48%), Gaps = 3/136 (2%)
Query 5 YFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELAD 64
Y D SA +K E GS L +L ++++AY EV A L R L++ +
Sbjct 3 YLDTSALIKRFVAEKGSPLVQSLVKREGPIATAKIAYAEVYAGLTRKLREGHLSDVQYGL 62
Query 65 AERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGD---PGLVVAV 121
A R +E W A VEL + A L R H L+G DAVHLASA+++ + + A
Sbjct 63 AYRQFEADWQAYIRVELHDDILFLARDLIRQHPLKGFDAVHLASAISLKNALGEDITFAA 122
Query 122 WDRRLHTGAHAAGCRV 137
D RL A A +
Sbjct 123 ADERLLRAAEAEDLNI 138
>gi|284992148|ref|YP_003410702.1| PilT domain-containing protein [Geodermatophilus obscurus DSM
43160]
gi|284065393|gb|ADB76331.1| PilT domain-containing protein [Geodermatophilus obscurus DSM
43160]
Length=141
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/139 (36%), Positives = 68/139 (49%), Gaps = 0/139 (0%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
+ YFD SA V LL E G+ ++ ++ ++ + R+ + E AALA A+R LT
Sbjct 2 IAYFDTSAVVPLLVEEAGTDVSLRVFLQAESVATVRMTFAETSAALARASRLRRLTADAH 61
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVW 122
A E WA +++ + + AG LAR HALRG DAVH A+AL V V
Sbjct 62 DRALAGLESVWAQMDVLDVDDGLVRAAGVLARDHALRGYDAVHCAAALRVTSGTTVALAG 121
Query 123 DRRLHTGAHAAGCRVAPAQ 141
DR L G +V Q
Sbjct 122 DRDLLAAWQREGLQVLDTQ 140
>gi|88812276|ref|ZP_01127527.1| hypothetical protein NB231_02713 [Nitrococcus mobilis Nb-231]
gi|88790527|gb|EAR21643.1| hypothetical protein NB231_02713 [Nitrococcus mobilis Nb-231]
Length=143
Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/135 (35%), Positives = 66/135 (49%), Gaps = 3/135 (2%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
+VY D SAF+KL E GS L D A + + Y E+ AA A A R LT++E
Sbjct 2 IVYLDTSAFLKLYLEEEGSKATRQLVDAAVAVCTHVITYAEMCAAFAQAVRMQRLTDAEW 61
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAV---GDPGLVV 119
+ +E W A + + + + + AG LA LRG D+VHLA+A V +
Sbjct 62 THQKDCFEADWNALQVLFIDEPLVRRAGKLAEGFRLRGFDSVHLAAAERVWRQAPDNFQL 121
Query 120 AVWDRRLHTGAHAAG 134
A +D RL + A G
Sbjct 122 AAFDVRLVSAACTLG 136
>gi|169830417|ref|YP_001716399.1| hypothetical protein Daud_0206 [Candidatus Desulforudis audaxviator
MP104C]
gi|169637261|gb|ACA58767.1| conserved hypothetical protein [Candidatus Desulforudis audaxviator
MP104C]
Length=154
Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/140 (38%), Positives = 68/140 (49%), Gaps = 3/140 (2%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
++Y D SA VKL E GS + L +S++AY E RAALA A R+ L +
Sbjct 2 ILYLDTSALVKLYIREEGSEVTQRLLAASSVVATSKVAYAEARAALARAYRDSILDNKKY 61
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVG---DPGLVV 119
A + D W VE++ + AG LA H+LRG DA+HLAS L V L+
Sbjct 62 TLAVSAFRDDWDRYFAVEVSDMLIGFAGDLAEKHSLRGFDAIHLASILTVKRQVKSPLLA 121
Query 120 AVWDRRLHTGAHAAGCRVAP 139
A WD RL G V P
Sbjct 122 ACWDARLWDAIRTCGIDVIP 141
>gi|78224268|ref|YP_386015.1| hypothetical protein Gmet_3076 [Geobacter metallireducens GS-15]
gi|78195523|gb|ABB33290.1| conserved hypothetical protein [Geobacter metallireducens GS-15]
Length=142
Score = 57.8 bits (138), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 48/143 (34%), Positives = 67/143 (47%), Gaps = 21/143 (14%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
++Y D S+ VKL E S + +A + R+AYPE+ L+A R H+ +
Sbjct 2 ILYLDTSSLVKLYVEEVCSDTVRQWVESAEAVATCRVAYPEM---LSALTRRHNRGDLPR 58
Query 63 ADAE-------RDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASA-LAVGD 114
D E +WE F A E AGHL R + LRG DAVHLA+A L D
Sbjct 59 EDCEVVAECFAGEWEHFVALDFD-------EIEAGHLVRKYGLRGFDAVHLAAAKLLSND 111
Query 115 PGLV---VAVWDRRLHTGAHAAG 134
G + + +D +L+ A A G
Sbjct 112 CGAIEVAFSSFDNKLNGAAEAEG 134
>gi|344198594|ref|YP_004782920.1| hypothetical protein Acife_0369 [Acidithiobacillus ferrivorans
SS3]
gi|343774038|gb|AEM46594.1| hypothetical protein Acife_0369 [Acidithiobacillus ferrivorans
SS3]
Length=137
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/137 (35%), Positives = 65/137 (48%), Gaps = 8/137 (5%)
Query 4 VYFDASAFVKLLTTETGSSLASALWDGCDAALS---SRLAYPEVRAALAAAARNHDLTES 60
VYFD+SAF K E G+ A +W C+ A S +A PE+ +A R LT++
Sbjct 3 VYFDSSAFAKRYIDEVGTD-AVLMW--CERASELALSVIAIPELISAFCRLQRERRLTDA 59
Query 61 ELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVA 120
+ + +R A + T V QHA + H LRG DA+HL +ALA + A
Sbjct 60 QYQEIKRALMSDIADALLCDTTPQVIQHAVNALENHTLRGMDAIHLGAALACTAEVFISA 119
Query 121 VWDRRLHTGAHAAGCRV 137
D R A A G +V
Sbjct 120 --DARQCRAAQAFGLQV 134
>gi|301064228|ref|ZP_07204671.1| toxin-antitoxin system, toxin component, PIN family [delta proteobacterium
NaphS2]
gi|300441673|gb|EFK05995.1| toxin-antitoxin system, toxin component, PIN family [delta proteobacterium
NaphS2]
Length=140
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/130 (36%), Positives = 62/130 (48%), Gaps = 3/130 (2%)
Query 5 YFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELAD 64
Y D S+ VKL E S +A + +S +AY E RAA A R ++ E
Sbjct 4 YLDTSSLVKLYVEEDRSMEVAAFVKDSEITATSLVAYAEARAAFARRFREGAFSDDEYQR 63
Query 65 AERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDP---GLVVAV 121
+ +ED W + LT AG LA HALRG DA+HLASAL + +V +
Sbjct 64 LKSFFEDDWTRYLVLNLTRECVGQAGELAEKHALRGFDAIHLASALILQSELSSPVVFSC 123
Query 122 WDRRLHTGAH 131
+D RL T +
Sbjct 124 FDDRLLTASR 133
>gi|87124594|ref|ZP_01080443.1| hypothetical protein RS9917_13310 [Synechococcus sp. RS9917]
gi|86168166|gb|EAQ69424.1| hypothetical protein RS9917_13310 [Synechococcus sp. RS9917]
Length=141
Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/140 (30%), Positives = 63/140 (45%), Gaps = 3/140 (2%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
+++ D SA +KL E S + R+ + E AALA R ++ L
Sbjct 2 ILFCDTSALLKLFIDEQDSESMIKARSASEGIAVCRITWAESMAALAQRTRCKGANQAGL 61
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPG---LVV 119
A A +E W ++T ++ + AG + A ALRG D+V LA+A + + L
Sbjct 62 AQARSMFEQAWPGFAIADVTQSLVEKAGVFSEAFALRGYDSVQLAAAHQLHEQFALPLTF 121
Query 120 AVWDRRLHTGAHAAGCRVAP 139
A +DRRL+ A V P
Sbjct 122 ACFDRRLNQAAKLLKLVVLP 141
>gi|198282236|ref|YP_002218557.1| hypothetical protein Lferr_0088 [Acidithiobacillus ferrooxidans
ATCC 53993]
gi|218666107|ref|YP_002424598.1| hypothetical protein AFE_0086 [Acidithiobacillus ferrooxidans
ATCC 23270]
gi|198246757|gb|ACH82350.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans
ATCC 53993]
gi|218518320|gb|ACK78906.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans
ATCC 23270]
gi|339834710|gb|EGQ62453.1| hypothetical protein GGI1_13017 [Acidithiobacillus sp. GGI-221]
Length=137
Score = 52.4 bits (124), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/135 (36%), Positives = 65/135 (49%), Gaps = 4/135 (2%)
Query 4 VYFDASAFVKLLTTETGSSLASALWDGCDAALS-SRLAYPEVRAALAAAARNHDLTESEL 62
VYFD+SAF K ETG++ A W G + L+ S +A PE+ +A R LT+++
Sbjct 3 VYFDSSAFAKRYIDETGTADVLA-WCGRASELALSVIAVPELISAFRRLQREGRLTDAQY 61
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVW 122
+R A + T V QHA H LRG DA+HL +ALA + A
Sbjct 62 QIIKRALMLDIADALICDTTPQVIQHAVKALENHTLRGMDAIHLGAALACTAEVFISA-- 119
Query 123 DRRLHTGAHAAGCRV 137
D R A A G +V
Sbjct 120 DARQCRAAEAFGLQV 134
>gi|83591030|ref|YP_431039.1| hypothetical protein Moth_2207 [Moorella thermoacetica ATCC 39073]
gi|83573944|gb|ABC20496.1| hypothetical protein Moth_2207 [Moorella thermoacetica ATCC 39073]
Length=147
Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/141 (33%), Positives = 64/141 (46%), Gaps = 3/141 (2%)
Query 5 YFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELAD 64
Y D SA VKL E G+ L + ++AY E RAALA R L ++
Sbjct 4 YLDTSALVKLYIYEEGTPEVKELAANSLIVATCKIAYAEARAALARGHRERALDDAVYTQ 63
Query 65 AERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDP---GLVVAV 121
A +D W +E++ + AG LA H LRG DAVHLA+ L + + VA
Sbjct 64 AVTALKDDWRNYFAIEVSDALIDKAGELAERHQLRGFDAVHLAAVLMLKQQVKDNITVAC 123
Query 122 WDRRLHTGAHAAGCRVAPAQL 142
WD++ A + P +L
Sbjct 124 WDKKFWQALKANNFTLLPEEL 144
>gi|46255278|ref|YP_006190.1| hypothetical protein TT_P0209 [Thermus thermophilus HB27]
gi|46198127|gb|AAS82537.1| hypothetical protein TT_P0209 [Thermus thermophilus HB27]
Length=156
Score = 51.2 bits (121), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 47/145 (33%), Positives = 64/145 (45%), Gaps = 9/145 (6%)
Query 4 VYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELA 63
++ + S +K+L E S LA + A +S A PE L A R+ LT
Sbjct 3 LFLETSGLLKVLLREDLSDLAREAFSQAQAHAASAFALPEAVGVLHAMQRDGRLTRPLYR 62
Query 64 DAERDWEDFWAATRPVELTATVEQH--AGHLARAHALRGADAVHLASALAVGD--PGLVV 119
A R+ + W LT T+E H A L H L+GADAVHL AL + P +
Sbjct 63 KALRELYELWEYLE--VLTPTLEGHMRAARLCERHPLKGADAVHLEGALFLKGLYPDTAL 120
Query 120 AVWDRRLHTGAHAAG---CRVAPAQ 141
+DR L+ A G RV P +
Sbjct 121 LTFDRTLYRAAKKEGLTVVRVPPLE 145
>gi|116747855|ref|YP_844542.1| hypothetical protein Sfum_0407 [Syntrophobacter fumaroxidans
MPOB]
gi|116696919|gb|ABK16107.1| hypothetical protein Sfum_0407 [Syntrophobacter fumaroxidans
MPOB]
Length=165
Score = 51.2 bits (121), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/148 (31%), Positives = 60/148 (41%), Gaps = 24/148 (16%)
Query 4 VYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARN--------- 54
V+FD SAFVK E G+ + D D + + PE+ + L R
Sbjct 31 VFFDTSAFVKRYVEEPGTEKVLEICDKADQLVLCVICLPEMISTLNRLVREGRLQNDEYR 90
Query 55 --HDLTESELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAV 112
DL E+ DAE + LT V ++ LR DA+HL AL V
Sbjct 91 KLRDLVLEEIEDAEICF-----------LTPEVVAQTIKCLESNVLRAMDALHLGCALVV 139
Query 113 GDPGLVVAVWDRRLHTGAHAAGCRVAPA 140
+P L V+ DRR A AG +V A
Sbjct 140 -EPDLFVS-SDRRQLEAARRAGLKVMEA 165
>gi|284041682|ref|YP_003392022.1| hypothetical protein Cwoe_0211 [Conexibacter woesei DSM 14684]
gi|283945903|gb|ADB48647.1| conserved hypothetical protein [Conexibacter woesei DSM 14684]
Length=141
Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/137 (36%), Positives = 58/137 (43%), Gaps = 1/137 (0%)
Query 4 VYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELA 63
+Y D SA VKLL TE GS A +SS L Y E+ +ALA LT
Sbjct 3 LYLDTSALVKLLVTEPGSEAVRAEAAAATELVSSHLTYVEIHSALARMHAGGRLTRRVHR 62
Query 64 DAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLV-VAVW 122
+ W V A V A LA H LRG DA+ LASA+ + G A W
Sbjct 63 RQLDAFVRMWEDVVVVPADAPVIDRAAALAERHVLRGFDALQLASAVELLGAGPARFASW 122
Query 123 DRRLHTGAHAAGCRVAP 139
D RL+ A + P
Sbjct 123 DERLNAAAARERLELIP 139
>gi|258593454|emb|CBE69793.1| conserved protein of unknown function [NC10 bacterium 'Dutch
sediment']
Length=145
Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/143 (35%), Positives = 69/143 (49%), Gaps = 3/143 (2%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
+ Y DASA VK E S+ L D ++ ++ EV AA+A A R LT E
Sbjct 2 IAYLDASALVKRYVAEADSAEVGELIDQAAVVGTAIISRAEVAAAMAKAVRMALLTREEG 61
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGD---PGLVV 119
A + + W + +++T + A LA ++LRG DA HLASAL D + V
Sbjct 62 VSALQFFSGEWESLIRLQMTEVLVSRAASLAWDYSLRGYDATHLASALFWRDMLGESVTV 121
Query 120 AVWDRRLHTGAHAAGCRVAPAQL 142
A +DR+L A A G V P L
Sbjct 122 ATYDRQLWNAAQATGLTVWPRSL 144
>gi|218296688|ref|ZP_03497406.1| conserved hypothetical protein [Thermus aquaticus Y51MC23]
gi|218243001|gb|EED09534.1| conserved hypothetical protein [Thermus aquaticus Y51MC23]
Length=150
Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/146 (32%), Positives = 67/146 (46%), Gaps = 7/146 (4%)
Query 4 VYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELA 63
+ + S +KLL E S LA + +A +S A PE L A R+ L+
Sbjct 3 LLLETSGLLKLLLREEFSPLAQEAFSRAEALAASAFALPEAVGVLHAMRRDGRLSRIGYR 62
Query 64 DAERDWEDFWAATRPVELTATVEQH--AGHLARAHALRGADAVHLASALAVGD--PGLVV 119
A ++ W LT T+E H A L H L+GADA+HL +AL + + +V+
Sbjct 63 KALKELYGLWDYLE--VLTPTLEGHVRAAQLCEKHPLKGADALHLVAALYLREFRQEVVL 120
Query 120 AVWDRRLHTGAHAAGCRVAPAQ-LDP 144
+DR L+ A G V P L+P
Sbjct 121 LTFDRTLYHAAKREGLAVVPVPALEP 146
>gi|340783792|ref|YP_004750398.1| hypothetical protein Atc_m167 [Acidithiobacillus caldus SM-1]
gi|340557945|gb|AEK59698.1| conserved hypothetical protein [Acidithiobacillus caldus SM-1]
Length=137
Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/108 (34%), Positives = 52/108 (49%), Gaps = 0/108 (0%)
Query 4 VYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELA 63
VYFD+SAF K ETG++ A + S +A PE+ +A R LT+++
Sbjct 3 VYFDSSAFAKRYIDETGTADVLAWCERASELALSVIAVPELISAFCRLQREGRLTDAQYQ 62
Query 64 DAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALA 111
+R A + T V QHA + LRG DA+HL +A+A
Sbjct 63 IIKRALMLDIADALICDTTPQVIQHAVKALENYTLRGMDAIHLGAAIA 110
>gi|344342234|ref|ZP_08773132.1| PilT protein domain protein [Thiocapsa marina 5811]
gi|343797870|gb|EGV15846.1| PilT protein domain protein [Thiocapsa marina 5811]
Length=137
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/138 (33%), Positives = 62/138 (45%), Gaps = 10/138 (7%)
Query 4 VYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELA 63
V+FD+SAFVK E G+ A D S +A PE+ +A R + E+
Sbjct 3 VFFDSSAFVKRYVRENGTEAVLAWCDRAGEIGLSGIALPEIVSAFCRLRREGKIDETRYR 62
Query 64 DAER----DWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVV 119
+ D ED AA +LT V + + LRG DA+H+ SALA+ +
Sbjct 63 QLKSLLLTDIED--AAI--CDLTPEVLAQSIVCLENNLLRGMDAIHIGSALALKADIFIT 118
Query 120 AVWDRRLHTGAHAAGCRV 137
A D+R A AG RV
Sbjct 119 A--DQRQGDAASRAGLRV 134
>gi|158521913|ref|YP_001529783.1| hypothetical protein Dole_1902 [Desulfococcus oleovorans Hxd3]
gi|158510739|gb|ABW67706.1| conserved hypothetical protein [Desulfococcus oleovorans Hxd3]
Length=137
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/135 (28%), Positives = 57/135 (43%), Gaps = 2/135 (1%)
Query 5 YFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELAD 64
+FD+SAF K E GS L + S + PE+ +AL R L+ +
Sbjct 4 FFDSSAFAKRYIEEKGSQLVDDICYKATEICLSVICVPEIISALNRRLREKCLSHQDYIT 63
Query 65 AERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVWDR 124
++ + LT V + + L + LR DA+H+A ALA V + D
Sbjct 64 IKQHLSGDVRDAVIINLTPEVIRMSTELLESSPLRAMDAIHVACALAWKAELFVSS--DN 121
Query 125 RLHTGAHAAGCRVAP 139
R + A AG ++ P
Sbjct 122 RQLSAAKKAGLKIKP 136
>gi|337277681|ref|YP_004617152.1| hypothetical protein Rta_00720 [Ramlibacter tataouinensis TTB310]
gi|334728757|gb|AEG91133.1| Hypothetical protein Rta_00720 [Ramlibacter tataouinensis TTB310]
Length=139
Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/137 (31%), Positives = 62/137 (46%), Gaps = 10/137 (7%)
Query 4 VYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARN----HDLTE 59
V FD SA +K E G +L +++ E+ +AL R+ +L
Sbjct 3 VLFDTSALLKRYLPEPGREALLSLMGQARPVVAAPNCKVELYSALNRVRRDTGASDELYR 62
Query 60 SELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVV 119
A+ ER++ DF V +T +E+ A A LR DA+H+ +ALA G V
Sbjct 63 QTCAEVERNFGDFNV----VPMTGVLERAAIRALEAAPLRAGDALHVGAALAAGVDLFVT 118
Query 120 AVWDRRLHTGAHAAGCR 136
A DRR + GA A G +
Sbjct 119 A--DRRQYQGALATGLK 133
>gi|297618046|ref|YP_003703205.1| PilT protein domain-containing protein [Syntrophothermus lipocalidus
DSM 12680]
gi|297145883|gb|ADI02640.1| PilT protein domain protein [Syntrophothermus lipocalidus DSM
12680]
Length=146
Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/140 (31%), Positives = 63/140 (45%), Gaps = 3/140 (2%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESEL 62
++Y D SA VKL E GS D +S++AY E RAA A A R L E
Sbjct 2 ILYLDTSALVKLYVREAGSETVRTFVDSASLVATSKVAYAEARAAFARAFREGVLGEEGY 61
Query 63 ADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASAL---AVGDPGLVV 119
+ W + ++ ++ AG LA + LRG D++HLASA+ + +
Sbjct 62 LQVVASLQSDWPRYLTLAVSDSLVWLAGELAERYRLRGFDSIHLASAMTLKGMAKSPVRA 121
Query 120 AVWDRRLHTGAHAAGCRVAP 139
+D RL ++G V P
Sbjct 122 VCFDARLWDAFLSSGFEVVP 141
>gi|124516076|gb|EAY57585.1| conserved hypothetical protein [Leptospirillum rubarum]
gi|206603762|gb|EDZ40242.1| Conserved protein of unknown function [Leptospirillum sp. Group
II '5-way CG']
Length=156
Score = 43.9 bits (102), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/138 (26%), Positives = 58/138 (43%), Gaps = 4/138 (2%)
Query 4 VYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELA 63
+Y D+SA VK E + D +S +AY EV +A + R+ ++ ++
Sbjct 5 LYLDSSAIVKFYVHEPHYQNVRQWAESADILATSEIAYTEVVSAFSQKVRSEEIGWDYVS 64
Query 64 DAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALA----VGDPGLVV 119
W V +E L+ LR D+VHL +A+A + + ++
Sbjct 65 QVLPKLNQLWFGRLAVVRIDPLEAAQYVLSPTFPLRAMDSVHLHAAIAFRSQMPEYSVMF 124
Query 120 AVWDRRLHTGAHAAGCRV 137
+DRRL A+ G RV
Sbjct 125 CSFDRRLIQAANTYGFRV 142
>gi|337286921|ref|YP_004626394.1| hypothetical protein Thein_1569 [Thermodesulfatator indicus DSM
15286]
gi|335359749|gb|AEH45430.1| hypothetical protein Thein_1569 [Thermodesulfatator indicus DSM
15286]
Length=163
Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 29/103 (29%), Positives = 52/103 (51%), Gaps = 2/103 (1%)
Query 6 FDASAFVKLLTTETGS-SLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELAD 64
F VKL E S ++ A+++ + + +AY EV+AA A R ++E L +
Sbjct 32 FRYFCLVKLYVKEEHSLTVEQAVFEA-EIVATHLIAYVEVQAAFARLFREGVISEEILEN 90
Query 65 AERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLA 107
+ D++ W + L ++ + A A+A AL+ D++HLA
Sbjct 91 IQEDFKKDWPHYMKIGLNQSLLERASDFAKAFALKAYDSIHLA 133
>gi|334118678|ref|ZP_08492766.1| hypothetical protein MicvaDRAFT_5260 [Microcoleus vaginatus FGP-2]
gi|333458908|gb|EGK87523.1| hypothetical protein MicvaDRAFT_5260 [Microcoleus vaginatus FGP-2]
Length=156
Score = 43.1 bits (100), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/111 (34%), Positives = 56/111 (51%), Gaps = 7/111 (6%)
Query 5 YFDASAFVKLLTTETGSSLASALWD---GCDAALSSRLAYPEVRAALAAAARNHDLTESE 61
+ D SA VK E GS ++ D D A+S ++ + EV +A A R+ L+ +E
Sbjct 6 FLDTSALVKRYVPEIGSEWILSITDPARDNDLAIS-QITWVEVHSAFARRLRDRSLS-AE 63
Query 62 LAD--AERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASAL 110
D ++ EDF R +++ T+ + A L H LR D+V LASAL
Sbjct 64 RFDLIGQKVREDFENEYRVIDVDQTLIETATALVMQHPLRAYDSVQLASAL 114
>gi|75911188|ref|YP_325484.1| hypothetical protein Ava_4992 [Anabaena variabilis ATCC 29413]
gi|75704913|gb|ABA24589.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length=154
Score = 42.7 bits (99), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 40/124 (33%), Positives = 59/124 (48%), Gaps = 16/124 (12%)
Query 4 VYF-DASAFVKLLTTETGSSLASALWDGCDAALSSRLAYP-----EVRAALAAAARNHDL 57
+YF D+SA VK E GSS L++ ALS+ + E+ AA+ +R +
Sbjct 3 IYFIDSSALVKRYVNEIGSSWVLGLFE---PALSNEVFIAAITGVEIVAAVTRRSRGGSI 59
Query 58 TESELADAE----RDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVG 113
+ DA+ + +D + VE+T V A LA + LRG DA LA+ LAV
Sbjct 60 S---FVDAKLVCNQFRKDLQTEYQVVEITENVIISAMSLAETYGLRGYDATQLATGLAVN 116
Query 114 DPGL 117
G+
Sbjct 117 ALGI 120
>gi|186680922|ref|YP_001864118.1| hypothetical protein Npun_F0394 [Nostoc punctiforme PCC 73102]
gi|186463374|gb|ACC79175.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length=154
Score = 42.7 bits (99), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 37/145 (26%), Positives = 63/145 (44%), Gaps = 11/145 (7%)
Query 4 VYF-DASAFVKLLTTETGSSLASALWDGC--DAALSSRLAYPEVRAALAAAARNHDLTES 60
+YF D+SA VK +ETGS+ L+D + + + E+ AA+ +R ++ +
Sbjct 3 IYFIDSSALVKRYISETGSAWVLELFDPTLNNEVFIAAITSVEIIAAITRRSRGGSISIT 62
Query 61 ELADAERDWE-DFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAV------- 112
+ ++ D + VE+T V L+ + LRG DA+ LA AV
Sbjct 63 DATITRNQFKRDLQKDYQIVEITENVINSGIVLSETYGLRGYDAIQLAVGRAVNSICIAN 122
Query 113 GDPGLVVAVWDRRLHTGAHAAGCRV 137
G P + D L+ + G +
Sbjct 123 GLPSITFVSADNELNAAVGSEGLMI 147
>gi|87303098|ref|ZP_01085896.1| hypothetical protein WH5701_06326 [Synechococcus sp. WH 5701]
gi|87282265|gb|EAQ74225.1| hypothetical protein WH5701_06326 [Synechococcus sp. WH 5701]
Length=140
Score = 42.0 bits (97), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 47/144 (33%), Positives = 68/144 (48%), Gaps = 12/144 (8%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGC-DAALSSRLAYPEVRAALAAAARNHDLTESE 61
++Y D S V LLTTE S A ++ C D +SS E +AL R H L++
Sbjct 1 MIYLDTSVVVALLTTEERSPQALNWFEQCRDTLISSDWLITETHSALGIKQRRHGLSQDA 60
Query 62 LADAERDWEDFW---AATRPVELTATVEQHAGHLAR--AHALRGADAVHLASALAVGDPG 116
+ A +E A RP++ + + A L + A LR +DA+HLA AL
Sbjct 61 RSAATVQFERLLQGGAELRPLDRSRF--RQAAELLQDPALDLRASDALHLAVALHSRCSQ 118
Query 117 LVVAVWDRRLHTGAHAAGCRVAPA 140
L A +D R+ A A G ++PA
Sbjct 119 L--ASFDGRMQQAATALG--LSPA 138
>gi|15805687|ref|NP_294383.1| hypothetical protein DR_0660 [Deinococcus radiodurans R1]
gi|6458363|gb|AAF10239.1|AE001923_6 conserved hypothetical protein [Deinococcus radiodurans R1]
Length=172
Score = 40.8 bits (94), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 47/147 (32%), Positives = 63/147 (43%), Gaps = 18/147 (12%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAA-------AARNH 55
L+Y D SA +++ T E + + Y E AALA + R H
Sbjct 34 LLYLDTSALIRIYTQEPDYQHVIQEKQQSSGVICHEITYVEALAALAGRRARRLLSVRQH 93
Query 56 DLTESELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDP 115
L + + DW F R V + + Q A LA+AH LR DAVHLA+A AV
Sbjct 94 QLAVTAFQN---DWPTF----RHVSIDQQLLQDAAALAQAHTLRAYDAVHLAAAQAVSPL 146
Query 116 GLVVAVWDRRLHTGAHAAGCRVAPAQL 142
GL +D L T A +V P Q+
Sbjct 147 GLQFMTFDTHLRTVAE----QVLPGQV 169
>gi|166366200|ref|YP_001658473.1| hypothetical protein MAE_34590 [Microcystis aeruginosa NIES-843]
gi|166088573|dbj|BAG03281.1| hypothetical protein MAE_34590 [Microcystis aeruginosa NIES-843]
Length=155
Score = 40.8 bits (94), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 39/145 (27%), Positives = 66/145 (46%), Gaps = 16/145 (11%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRL-----AYPEVRAALAAAARNHDL 57
L + D+SA VK +ETGS+ L+ AAL++ + A E+ AA+ +R +
Sbjct 3 LYFLDSSALVKRYISETGSAWVLGLFA---AALNNEIFIAAIAKVEIVAAITRRSRTGSI 59
Query 58 TESE-LADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAV---- 112
+ ++ A + +D + +E+T ++ LA + LRG DA+ LA AV
Sbjct 60 SVTDATAIVHQLRKDSLKDYQVIEITESIINSGMVLAETYGLRGYDAIQLAVGCAVNTLC 119
Query 113 ---GDPGLVVAVWDRRLHTGAHAAG 134
G P + D L+ + G
Sbjct 120 LASGLPSITFVSADNELNVAVISEG 144
>gi|336176937|ref|YP_004582312.1| hypothetical protein FsymDg_0881 [Frankia symbiont of Datisca
glomerata]
gi|334857917|gb|AEH08391.1| hypothetical protein FsymDg_0881 [Frankia symbiont of Datisca
glomerata]
Length=132
Score = 40.4 bits (93), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 43/138 (32%), Positives = 62/138 (45%), Gaps = 26/138 (18%)
Query 3 LVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALA----------AAA 52
++Y D+ A VKL+ TE +S A LS+ YP+V + LA A A
Sbjct 1 MIYLDSCALVKLVVTEAETS-------ALRAFLSAHAGYPQVTSLLARTEVVRAVRRATA 53
Query 53 RNHDLTESELADAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAV 112
+ DL ++ + R D RP+ AG +A +R DA+HLA+A +
Sbjct 54 DDADLYKAAVTMLNR--LDHIILDRPIL------DDAGAVADP-LVRTLDAIHLAAARRL 104
Query 113 GDPGLVVAVWDRRLHTGA 130
GD +DRRL T A
Sbjct 105 GDSLTAFVTYDRRLATAA 122
>gi|334337553|ref|YP_004542705.1| PilT protein domain protein [Isoptericola variabilis 225]
gi|334107921|gb|AEG44811.1| PilT protein domain protein [Isoptericola variabilis 225]
Length=149
Score = 40.4 bits (93), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 49/134 (37%), Positives = 61/134 (46%), Gaps = 12/134 (8%)
Query 5 YFDASAFVKLLTTETGSSLASALWDGCDAAL-SSRLAYPEVRAALAAAARNHDLTESELA 63
Y D SA VKL+ E S+ D D L SS LA E+ A+ AA H +T +
Sbjct 3 YLDTSALVKLVVREPESAALKKWVDANDDRLVSSDLARTELLRAVRRAAPEHAVTARAVL 62
Query 64 DAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVWD 123
DA D A T TA E A HL +R DA+HLA+AL +GD + +D
Sbjct 63 DAV----DLLALT-----TADFEA-AAHLD-PDIVRSLDALHLATALRLGDELESMVTYD 111
Query 124 RRLHTGAHAAGCRV 137
RL A G V
Sbjct 112 VRLADAARYHGVAV 125
>gi|94265149|ref|ZP_01288913.1| hypothetical protein MldDRAFT_4382 [delta proteobacterium MLMS-1]
gi|93454388|gb|EAT04689.1| hypothetical protein MldDRAFT_4382 [delta proteobacterium MLMS-1]
Length=161
Score = 40.4 bits (93), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 38/132 (29%), Positives = 58/132 (44%), Gaps = 10/132 (7%)
Query 5 YFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELAD 64
+ D SA VK+ E G+ L+ G + S LA E+++A+ R +L E+ L
Sbjct 20 FLDTSALVKIYHREVGTDFCLNLYTGQAHLIISELARVELQSAVFRRYREKELNETALKA 79
Query 65 AERDWE-DFWAATRPVELTATVEQHAGHLARAHA----LRGADAVHLASALAV---GDPG 116
+E D + + ++V A L HA LR D++ LA+ L G G
Sbjct 80 VLEKFESDCEERYEVLHIASSVYDEACKLLSRHAEMYGLRTLDSLQLATFLNYCEKGQDG 139
Query 117 LVVAVWDRRLHT 128
V A DR+ T
Sbjct 140 FVCA--DRKFTT 149
>gi|333967724|gb|AEG34488.1| hypothetical protein Ththe16_2108 [Thermus thermophilus SG0.5JP17-16]
Length=158
Score = 40.0 bits (92), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 30/73 (42%), Positives = 35/73 (48%), Gaps = 0/73 (0%)
Query 36 SSRLAYPEVRAALAAAARNHDLTESELADAERDWEDFWAATRPVELTATVEQHAGHLARA 95
+S LAY E A A R +T A + W A V L+ V Q AG LA
Sbjct 47 ASHLAYVETLATFHALRRGRAITSRRQASLSAAFRADWPAFLRVPLSPPVVQLAGALAEE 106
Query 96 HALRGADAVHLAS 108
H LRGADA+ LAS
Sbjct 107 HPLRGADALQLAS 119
>gi|55978318|ref|YP_145374.1| hypothetical protein TTHB135 [Thermus thermophilus HB8]
gi|55773491|dbj|BAD71931.1| conserved hypothetical protein [Thermus thermophilus HB8]
Length=148
Score = 40.0 bits (92), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 44/126 (35%), Positives = 53/126 (43%), Gaps = 5/126 (3%)
Query 3 LVYFDASAFVKLL-TTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESE 61
L Y D SA VK E G+ L+ A L+S LA E +A R LT E
Sbjct 3 LPYLDTSALVKRYDPEEPGAEEVRTLFTEVRAVLTSSLAVVEAVSAFRIKERQGVLTPEE 62
Query 62 LADAERDWEDFWAAT-RPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVA 120
+ A E A R V V + A L H LR DA+HLA+AL V V
Sbjct 63 VRLAVEALEAHAALQYRLVPPKPPVLREAKRLLLRHKLRAYDALHLATALVVAR---VAG 119
Query 121 VWDRRL 126
V R+L
Sbjct 120 VEPRKL 125
>gi|328953359|ref|YP_004370693.1| hypothetical protein Desac_1665 [Desulfobacca acetoxidans DSM
11109]
gi|328453683|gb|AEB09512.1| hypothetical protein Desac_1665 [Desulfobacca acetoxidans DSM
11109]
Length=148
Score = 40.0 bits (92), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 37/115 (33%), Positives = 53/115 (47%), Gaps = 13/115 (11%)
Query 5 YFDASAFVKLLTTETGSSLASALWDGCDAALS-SRLAYPEVRAALAAAAR----NHDLTE 59
+FD SA VKL E G+ L D + S L++ E+ +ALA R + D+ E
Sbjct 6 FFDTSALVKLYHQEKGTEALEHLISAADTCIVISDLSFIEITSALATKVRMGLIDRDVFE 65
Query 60 SELADAERDWEDFWAATRPVELTATVEQHAGHLAR----AHALRGADAVHLASAL 110
+ L RD +A +E+ V+ A L + LR DA+ LASAL
Sbjct 66 AVLDCFIRD----FAGYEIIEVDHAVKMQAADLLKTIVVTRRLRTLDALQLASAL 116
>gi|86741564|ref|YP_481964.1| PilT protein-like protein [Frankia sp. CcI3]
gi|86568426|gb|ABD12235.1| PilT protein-like [Frankia sp. CcI3]
Length=129
Score = 40.0 bits (92), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 46/135 (35%), Positives = 59/135 (44%), Gaps = 12/135 (8%)
Query 5 YFDASAFVKLLTTETGS-SLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELA 63
Y D+SA +KL E G+ +L S L A+SS L EV AL R D
Sbjct 3 YLDSSALMKLTHPERGTRALRSWLAVRPGVAVSSALVMLEVTRAL----RRSD------P 52
Query 64 DAERDWEDFWAATRPVELTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVWD 123
A D + V + + A L LR DA+HLA+AL + P LV +D
Sbjct 53 AALPRIPDVLSRITLVPIDQPIMVSAAALTDP-LLRSLDALHLATALRLDAPSLVFVSYD 111
Query 124 RRLHTGAHAAGCRVA 138
+RL T A G VA
Sbjct 112 KRLSTAAAQEGLTVA 126
Lambda K H
0.319 0.129 0.395
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128702269584
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40