BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1670
Length=115
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608808|ref|NP_216186.1| hypothetical protein Rv1670 [Mycoba... 235 1e-60
gi|15841125|ref|NP_336162.1| hypothetical protein MT1708 [Mycoba... 231 2e-59
gi|308380144|ref|ZP_07488847.2| hypothetical protein TMKG_02176 ... 227 5e-58
gi|183980887|ref|YP_001849178.1| hypothetical protein MMAR_0866 ... 174 5e-42
gi|118616426|ref|YP_904758.1| hypothetical protein MUL_0619 [Myc... 168 3e-40
gi|240171059|ref|ZP_04749718.1| hypothetical protein MkanA1_1723... 159 1e-37
gi|240171268|ref|ZP_04749927.1| hypothetical protein MkanA1_1828... 158 3e-37
gi|331695133|ref|YP_004331372.1| hypothetical protein Psed_1275 ... 137 8e-31
gi|163848744|ref|YP_001636788.1| hypothetical protein Caur_3205 ... 122 2e-26
gi|298250385|ref|ZP_06974189.1| protein of unknown function DUF4... 119 1e-25
gi|256394250|ref|YP_003115814.1| hypothetical protein Caci_5114 ... 118 3e-25
gi|148656685|ref|YP_001276890.1| hypothetical protein RoseRS_256... 117 6e-25
gi|156741834|ref|YP_001431963.1| hypothetical protein Rcas_1854 ... 117 7e-25
gi|311894179|dbj|BAJ26587.1| hypothetical protein KSE_07470 [Kit... 116 1e-24
gi|225874959|ref|YP_002756418.1| hypothetical protein ACP_3423 [... 115 2e-24
gi|299132724|ref|ZP_07025919.1| protein of unknown function DUF4... 115 2e-24
gi|340616876|ref|YP_004735329.1| hypothetical protein zobellia_8... 114 6e-24
gi|219847448|ref|YP_002461881.1| hypothetical protein Cagg_0506 ... 114 7e-24
gi|284035248|ref|YP_003385178.1| hypothetical protein Slin_0315 ... 112 1e-23
gi|269794972|ref|YP_003314427.1| hypothetical protein Sked_16580... 112 1e-23
gi|322436361|ref|YP_004218573.1| hypothetical protein AciX9_2769... 112 1e-23
gi|118576088|ref|YP_875831.1| hypothetical protein CENSYa_0895 [... 112 2e-23
gi|325108351|ref|YP_004269419.1| hypothetical protein Plabr_1787... 112 2e-23
gi|16330323|ref|NP_441051.1| hypothetical protein ssl3291 [Synec... 112 2e-23
gi|82697914|gb|ABB88993.1| uncharacterized protein [uncultured c... 112 3e-23
gi|331695129|ref|YP_004331368.1| hypothetical protein Psed_1271 ... 112 3e-23
gi|161528290|ref|YP_001582116.1| hypothetical protein Nmar_0782 ... 111 3e-23
gi|295695889|ref|YP_003589127.1| hypothetical protein Btus_1256 ... 111 4e-23
gi|186685694|ref|YP_001868890.1| hypothetical protein Npun_R5645... 111 4e-23
gi|220909350|ref|YP_002484661.1| hypothetical protein Cyan7425_3... 111 4e-23
gi|75908874|ref|YP_323170.1| hypothetical protein Ava_2662 [Anab... 111 4e-23
gi|218438387|ref|YP_002376716.1| hypothetical protein PCC7424_14... 110 6e-23
gi|320102914|ref|YP_004178505.1| hypothetical protein Isop_1371 ... 110 7e-23
gi|119715879|ref|YP_922844.1| hypothetical protein Noca_1644 [No... 110 8e-23
gi|327405812|ref|YP_004346650.1| hypothetical protein Fluta_3846... 110 8e-23
gi|282897139|ref|ZP_06305141.1| Protein of unknown function DUF4... 109 1e-22
gi|17227556|ref|NP_484104.1| hypothetical protein asl0060 [Nosto... 109 1e-22
gi|325964761|ref|YP_004242667.1| hypothetical protein Asphe3_343... 109 2e-22
gi|119716305|ref|YP_923270.1| hypothetical protein Noca_2074 [No... 109 2e-22
gi|54023893|ref|YP_118135.1| hypothetical protein nfa19250 [Noca... 108 3e-22
gi|334109370|ref|ZP_08483650.1| protein of unknown function DUF4... 108 4e-22
gi|217979304|ref|YP_002363451.1| hypothetical protein Msil_3182 ... 108 4e-22
gi|119509540|ref|ZP_01628687.1| hypothetical protein N9414_08240... 107 4e-22
gi|329765038|ref|ZP_08256623.1| hypothetical protein Nlim_0380 [... 107 4e-22
gi|340344765|ref|ZP_08667897.1| DUF427 domain containing protein... 107 5e-22
gi|86607571|ref|YP_476333.1| hypothetical protein CYB_0069 [Syne... 107 6e-22
gi|91774189|ref|YP_566881.1| hypothetical protein Mbur_2270 [Met... 107 6e-22
gi|299138374|ref|ZP_07031553.1| protein of unknown function DUF4... 107 7e-22
gi|298489802|ref|YP_003719979.1| hypothetical protein Aazo_0252 ... 107 7e-22
gi|302879141|ref|YP_003847705.1| hypothetical protein Galf_1935 ... 107 8e-22
>gi|15608808|ref|NP_216186.1| hypothetical protein Rv1670 [Mycobacterium tuberculosis H37Rv]
gi|31792856|ref|NP_855349.1| hypothetical protein Mb1697 [Mycobacterium bovis AF2122/97]
gi|121637577|ref|YP_977800.1| hypothetical protein BCG_1708 [Mycobacterium bovis BCG str. Pasteur
1173P2]
73 more sequence titles
Length=115
Score = 235 bits (600), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 115/115 (100%), Positives = 115/115 (100%), Gaps = 0/115 (0%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG
Sbjct 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESRHGLARRVVAWLGK 115
PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESRHGLARRVVAWLGK
Sbjct 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESRHGLARRVVAWLGK 115
>gi|15841125|ref|NP_336162.1| hypothetical protein MT1708 [Mycobacterium tuberculosis CDC1551]
gi|254231864|ref|ZP_04925191.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|13881343|gb|AAK45976.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
gi|124600923|gb|EAY59933.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|323719759|gb|EGB28873.1| hypothetical protein TMMG_00919 [Mycobacterium tuberculosis CDC1551A]
Length=115
Score = 231 bits (590), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 114/115 (99%), Positives = 114/115 (99%), Gaps = 0/115 (0%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG
Sbjct 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESRHGLARRVVAWLGK 115
PYGPVNPDAAWYYRRPSPLARRIKNHVAF HGVTVEGESESRHGLARRVVAWLGK
Sbjct 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFRHGVTVEGESESRHGLARRVVAWLGK 115
>gi|308380144|ref|ZP_07488847.2| hypothetical protein TMKG_02176 [Mycobacterium tuberculosis SUMu011]
gi|308362426|gb|EFP51277.1| hypothetical protein TMKG_02176 [Mycobacterium tuberculosis SUMu011]
Length=111
Score = 227 bits (578), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 110/111 (99%), Positives = 111/111 (100%), Gaps = 0/111 (0%)
Query 5 VWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGPYGP 64
+WNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGPYGP
Sbjct 1 MWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGPYGP 60
Query 65 VNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESRHGLARRVVAWLGK 115
VNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESRHGLARRVVAWLGK
Sbjct 61 VNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESRHGLARRVVAWLGK 111
>gi|183980887|ref|YP_001849178.1| hypothetical protein MMAR_0866 [Mycobacterium marinum M]
gi|183174213|gb|ACC39323.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=115
Score = 174 bits (440), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 88/115 (77%), Positives = 94/115 (82%), Gaps = 4/115 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
MIRAVWNGTVLAE PRT RVEGNHYFPPES+HRE+L ES TTS+CPWKG+AHYY VVVD
Sbjct 1 MIRAVWNGTVLAETPRTARVEGNHYFPPESVHREYLRESSTTSLCPWKGVAHYYTVVVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESR-HGLARRVVAWLG 114
G NPDAAW+Y RPSPLARRIKNHVAFW+GV VEGE E GL RV WLG
Sbjct 60 --GRTNPDAAWFYPRPSPLARRIKNHVAFWNGVDVEGEPEGEATGLVSRVAGWLG 112
>gi|118616426|ref|YP_904758.1| hypothetical protein MUL_0619 [Mycobacterium ulcerans Agy99]
gi|118568536|gb|ABL03287.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=115
Score = 168 bits (425), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 86/115 (75%), Positives = 92/115 (80%), Gaps = 4/115 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
MIRAVWNGTVLAE PRT RVEGNHYFPPES+HR +L ES TTS+CPWKG+AHYY VVVD
Sbjct 1 MIRAVWNGTVLAETPRTARVEGNHYFPPESVHRVYLRESTTTSLCPWKGVAHYYTVVVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESR-HGLARRVVAWLG 114
G NPDAAW+Y RPSPLA RIKNHVAFW+GV VEGE E GL RV WLG
Sbjct 60 --GRTNPDAAWFYPRPSPLACRIKNHVAFWNGVDVEGEPEGEATGLVSRVAGWLG 112
>gi|240171059|ref|ZP_04749718.1| hypothetical protein MkanA1_17239 [Mycobacterium kansasii ATCC
12478]
Length=219
Score = 159 bits (403), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 77/97 (80%), Positives = 84/97 (87%), Gaps = 3/97 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
MIRAVWNG VLAEAP+T+R+EGN YFPPESL REHL++S T SICPWKGLAHYY V V+
Sbjct 1 MIRAVWNGAVLAEAPQTIRLEGNDYFPPESLRREHLVDSRTKSICPWKGLAHYYTVSVN- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEG 97
G VNPDAAWYY RPSPLARRIKNHVAFW+GV VEG
Sbjct 60 --GDVNPDAAWYYPRPSPLARRIKNHVAFWNGVRVEG 94
>gi|240171268|ref|ZP_04749927.1| hypothetical protein MkanA1_18286 [Mycobacterium kansasii ATCC
12478]
Length=97
Score = 158 bits (399), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 76/98 (78%), Positives = 83/98 (85%), Gaps = 3/98 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
MIRAVWNGTVLAE P+TVRVEGNHYFPPES+HRE+LI+S TTSICPWKG+A YY VVVD
Sbjct 1 MIRAVWNGTVLAEVPQTVRVEGNHYFPPESVHREYLIDSATTSICPWKGVASYYTVVVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGE 98
G NPDAAWYY PSP A IKNHVAFW GVT+EG+
Sbjct 60 --GQTNPDAAWYYPDPSPAASDIKNHVAFWRGVTIEGQ 95
>gi|331695133|ref|YP_004331372.1| hypothetical protein Psed_1275 [Pseudonocardia dioxanivorans
CB1190]
gi|326949822|gb|AEA23519.1| protein of unknown function DUF427 [Pseudonocardia dioxanivorans
CB1190]
Length=115
Score = 137 bits (344), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 68/102 (67%), Positives = 77/102 (76%), Gaps = 3/102 (2%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M+RAVWNG VLAEAPRTVRVEGNHYFPP+S+H E+ ES T ++CPWKGLA YYN+ D
Sbjct 1 MLRAVWNGIVLAEAPRTVRVEGNHYFPPDSVHTEYFAESRTKTLCPWKGLASYYNLTSD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGESESR 102
D AWYY PSPLARRIKNHVAF+ V +EGE E R
Sbjct 60 --RGSELDVAWYYPHPSPLARRIKNHVAFYPAVRIEGEPEPR 99
>gi|163848744|ref|YP_001636788.1| hypothetical protein Caur_3205 [Chloroflexus aurantiacus J-10-fl]
gi|222526690|ref|YP_002571161.1| hypothetical protein Chy400_3460 [Chloroflexus sp. Y-400-fl]
gi|163670033|gb|ABY36399.1| protein of unknown function DUF427 [Chloroflexus aurantiacus
J-10-fl]
gi|222450569|gb|ACM54835.1| protein of unknown function DUF427 [Chloroflexus sp. Y-400-fl]
Length=94
Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 59/96 (62%), Positives = 73/96 (77%), Gaps = 3/96 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M RAVWNGTV+AE+P T+ VEGNHYFPPES+ RE+L +S T ++CPWKG A YY+VVV+
Sbjct 1 MPRAVWNGTVIAESPTTIVVEGNHYFPPESVRREYLRDSQTHTVCPWKGTASYYDVVVN- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G VN DAAWYY P AR+I ++AFW GV +E
Sbjct 60 --GQVNRDAAWYYPTPKEAARQIAGYIAFWRGVRIE 93
>gi|298250385|ref|ZP_06974189.1| protein of unknown function DUF427 [Ktedonobacter racemifer DSM
44963]
gi|297548389|gb|EFH82256.1| protein of unknown function DUF427 [Ktedonobacter racemifer DSM
44963]
Length=95
Score = 119 bits (299), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 56/98 (58%), Positives = 70/98 (72%), Gaps = 3/98 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M++A WNG +LAE+ + V VEGN YFP +++ RE+L ES T + CPWKG A YY+V VD
Sbjct 1 MVKATWNGVMLAESDKCVVVEGNQYFPLDAVKREYLKESDTHTTCPWKGDASYYSVEVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEGE 98
G N DAAWYY P A++IK+HVAFW GVTVE E
Sbjct 60 --GKTNKDAAWYYPTPKDAAKQIKDHVAFWRGVTVEVE 95
>gi|256394250|ref|YP_003115814.1| hypothetical protein Caci_5114 [Catenulispora acidiphila DSM
44928]
gi|256360476|gb|ACU73973.1| protein of unknown function DUF427 [Catenulispora acidiphila
DSM 44928]
Length=98
Score = 118 bits (295), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 58/95 (62%), Positives = 66/95 (70%), Gaps = 3/95 (3%)
Query 2 IRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGP 61
+RAVWNG VLAE+ TV VEGNHYFP ESL+ E S T S+CPWKGLA YY V VD
Sbjct 3 MRAVWNGQVLAESDDTVVVEGNHYFPAESLNGEFFSASETHSVCPWKGLASYYTVTVD-- 60
Query 62 YGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G N DAAWYY +P A +++HVAFW GV VE
Sbjct 61 -GEANSDAAWYYPQPKAEAEHVRDHVAFWRGVKVE 94
>gi|148656685|ref|YP_001276890.1| hypothetical protein RoseRS_2564 [Roseiflexus sp. RS-1]
gi|148568795|gb|ABQ90940.1| protein of unknown function DUF427 [Roseiflexus sp. RS-1]
Length=94
Score = 117 bits (293), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 56/97 (58%), Positives = 70/97 (73%), Gaps = 3/97 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M RA+WNG V+AE+ T+ VEGNHYFPPES+ +E+L ES T ++C WKG+A YY+VVV+
Sbjct 1 MARAIWNGVVIAESDATIVVEGNHYFPPESVKQEYLRESATHTVCSWKGVASYYDVVVN- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEG 97
VN DAAWYY P ARRI ++AFW GV VE
Sbjct 60 --DAVNRDAAWYYPEPKEGARRIAGYIAFWRGVKVEA 94
>gi|156741834|ref|YP_001431963.1| hypothetical protein Rcas_1854 [Roseiflexus castenholzii DSM
13941]
gi|156233162|gb|ABU57945.1| protein of unknown function DUF427 [Roseiflexus castenholzii
DSM 13941]
Length=94
Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 56/97 (58%), Positives = 69/97 (72%), Gaps = 3/97 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M RAVWNG V+AE+ T+ VEGNHYFPPES+ E+L ES T ++C WKG+A YY++VV+
Sbjct 1 MARAVWNGVVIAESDATIVVEGNHYFPPESVKHEYLRESATHTVCSWKGVASYYDIVVN- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEG 97
VN DAAWYY P ARRI ++AFW GV VE
Sbjct 60 --DAVNRDAAWYYPEPKEAARRIAGYIAFWRGVKVEA 94
>gi|311894179|dbj|BAJ26587.1| hypothetical protein KSE_07470 [Kitasatospora setae KM-6054]
Length=96
Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 56/95 (59%), Positives = 67/95 (71%), Gaps = 3/95 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M+RAVWNG VLAE+ TV VEGNHYFPPE+L+RE + S TT++C WKG AHY+ V V
Sbjct 1 MVRAVWNGVVLAESADTVVVEGNHYFPPEALNREFFVPSGTTTVCSWKGTAHYWTVAVGD 60
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
NPDAAWYY P P A +++ VAFW GV V
Sbjct 61 AE---NPDAAWYYPEPKPEAEHVRDRVAFWRGVDV 92
>gi|225874959|ref|YP_002756418.1| hypothetical protein ACP_3423 [Acidobacterium capsulatum ATCC
51196]
gi|225791963|gb|ACO32053.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
51196]
Length=94
Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 54/96 (57%), Positives = 67/96 (70%), Gaps = 3/96 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +AVWNG V+AE+ + VEGN YFP +++RE+L S TTS CPWKG A YY +++D
Sbjct 1 MAKAVWNGKVIAESEKFETVEGNLYFPESTVNREYLRPSSTTSTCPWKGQARYYTILID- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G NPDAAWYY P P AR IK H+AFW GV +E
Sbjct 60 --GQENPDAAWYYPDPKPAARNIKGHIAFWRGVEIE 93
>gi|299132724|ref|ZP_07025919.1| protein of unknown function DUF427 [Afipia sp. 1NLS2]
gi|298592861|gb|EFI53061.1| protein of unknown function DUF427 [Afipia sp. 1NLS2]
Length=95
Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 57/96 (60%), Positives = 66/96 (69%), Gaps = 3/96 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
MI+A+WNG V+AE+ RT VEGNHYFP ESL E S T ++CPWKG+A YY + VD
Sbjct 1 MIKAIWNGRVIAESDRTEIVEGNHYFPAESLRAEFFRPSETHTVCPWKGVASYYTLEVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G NPDAAWYY P P AR+I VAFW GV VE
Sbjct 60 --GKSNPDAAWYYPDPKPAARKIAGRVAFWKGVGVE 93
>gi|340616876|ref|YP_004735329.1| hypothetical protein zobellia_873 [Zobellia galactanivorans]
gi|339731673|emb|CAZ94938.1| Conserved hypothetical protein [Zobellia galactanivorans]
Length=93
Score = 114 bits (284), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 54/95 (57%), Positives = 67/95 (71%), Gaps = 3/95 (3%)
Query 2 IRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGP 61
++A+WNGTV+AE+ T+ VE NHYFP ES+ +E+ S T + CPWKG AHYY + VD
Sbjct 1 MKAIWNGTVIAESDDTLVVENNHYFPAESIKKEYFKASNTHTTCPWKGQAHYYTLSVD-- 58
Query 62 YGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G NPDAAW+Y S LA+ IK VAFW GVTVE
Sbjct 59 -GKENPDAAWFYPEVSELAKGIKGRVAFWRGVTVE 92
>gi|219847448|ref|YP_002461881.1| hypothetical protein Cagg_0506 [Chloroflexus aggregans DSM 9485]
gi|219541707|gb|ACL23445.1| protein of unknown function DUF427 [Chloroflexus aggregans DSM
9485]
Length=94
Score = 114 bits (284), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 53/97 (55%), Positives = 69/97 (72%), Gaps = 3/97 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M RA+WNG ++AE+ T+ VEGNHYFPP+S+ RE+L ES T ++C WKG A YY+VVV+
Sbjct 1 MPRAIWNGAIIAESDATIVVEGNHYFPPDSVRREYLRESQTHTVCSWKGTASYYDVVVN- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEG 97
G VN D AWYY P A++I ++AFW GV VE
Sbjct 60 --GEVNRDTAWYYPAPKDAAKQIAGYIAFWRGVKVEA 94
>gi|284035248|ref|YP_003385178.1| hypothetical protein Slin_0315 [Spirosoma linguale DSM 74]
gi|283814541|gb|ADB36379.1| protein of unknown function DUF427 [Spirosoma linguale DSM 74]
Length=93
Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 50/94 (54%), Positives = 65/94 (70%), Gaps = 3/94 (3%)
Query 2 IRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGP 61
++A+WNG +AE+ TV VE NHYFP +S+ + L++S T + CPWKGLA YY + VD
Sbjct 1 MKAIWNGETIAESDDTVIVENNHYFPKDSVKAKFLVDSETHTTCPWKGLASYYTLTVD-- 58
Query 62 YGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
G NPDAAWYY P P A +I+N VAFW GV +
Sbjct 59 -GKTNPDAAWYYPEPKPAASQIQNRVAFWKGVQI 91
>gi|269794972|ref|YP_003314427.1| hypothetical protein Sked_16580 [Sanguibacter keddieii DSM 10542]
gi|269097157|gb|ACZ21593.1| uncharacterized conserved protein [Sanguibacter keddieii DSM
10542]
Length=95
Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 53/95 (56%), Positives = 64/95 (68%), Gaps = 3/95 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M RA+WNGTV+AE+ T VEGN YFPP+S+ R+ + TT++CPWKG A YY V VDG
Sbjct 1 MYRALWNGTVIAESDATQVVEGNQYFPPDSIVRDLYTSTSTTTVCPWKGTASYYTVTVDG 60
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
N DAAW Y P P A I++HVAFW GV V
Sbjct 61 QQ---NTDAAWTYADPKPDAEHIRDHVAFWKGVEV 92
>gi|322436361|ref|YP_004218573.1| hypothetical protein AciX9_2769 [Acidobacterium sp. MP5ACTX9]
gi|321164088|gb|ADW69793.1| protein of unknown function DUF427 [Acidobacterium sp. MP5ACTX9]
Length=94
Score = 112 bits (281), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 56/95 (59%), Positives = 66/95 (70%), Gaps = 3/95 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +A+WNG VLAE+ T VEGN YFP ES+ RE S TTS CP+KG A YY+++VD
Sbjct 1 MAKAMWNGQVLAESETTQLVEGNVYFPDESVKREFFRPSSTTSSCPFKGQARYYSLIVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
G NPDAAWYY P P AR IK+H+AFW GV V
Sbjct 60 --GQENPDAAWYYPDPKPAARAIKHHIAFWRGVEV 92
>gi|118576088|ref|YP_875831.1| hypothetical protein CENSYa_0895 [Cenarchaeum symbiosum A]
gi|118194609|gb|ABK77527.1| uncharacterized protein conserved in bacteria [Cenarchaeum symbiosum
A]
Length=106
Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 51/96 (54%), Positives = 66/96 (69%), Gaps = 3/96 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
+++AVWNG VLAE+ TV +EGNHYFP +S+ +E L+ES TS C WKG A YY+V +
Sbjct 13 IMKAVWNGVVLAESDDTVVIEGNHYFPADSVKKEFLVESGFTSFCGWKGTASYYSVTAN- 71
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G N D AWYY P+ A +I+ HVAFW GV +E
Sbjct 72 --GKTNSDCAWYYPEPNDAAAKIRGHVAFWKGVEME 105
>gi|325108351|ref|YP_004269419.1| hypothetical protein Plabr_1787 [Planctomyces brasiliensis DSM
5305]
gi|324968619|gb|ADY59397.1| protein of unknown function DUF427 [Planctomyces brasiliensis
DSM 5305]
Length=94
Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/97 (56%), Positives = 65/97 (68%), Gaps = 3/97 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +A WNG +LAE+ +T VEGNHYFP +SL+RE+ +S S CPWKG A YYNVVV+
Sbjct 1 MAKATWNGAILAESEQTEVVEGNHYFPVDSLNREYFSDSDHHSSCPWKGTASYYNVVVE- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEG 97
G N +AAWYY P A IK +VAFW GV VE
Sbjct 60 --GETNQNAAWYYPEPKSAAENIKGYVAFWKGVQVEA 94
>gi|16330323|ref|NP_441051.1| hypothetical protein ssl3291 [Synechocystis sp. PCC 6803]
gi|1652812|dbj|BAA17731.1| ssl3291 [Synechocystis sp. PCC 6803]
gi|339273416|dbj|BAK49903.1| hypothetical protein SYNGTS_1155 [Synechocystis sp. PCC 6803]
Length=94
Score = 112 bits (279), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 53/97 (55%), Positives = 66/97 (69%), Gaps = 3/97 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +A+WNG VLAE+ + VEGN+YFPP+S+H ++ S T ++C WKG A YY VVVD
Sbjct 1 MAKAIWNGVVLAESDQCEMVEGNYYFPPDSIHPQYFQPSNTHTVCGWKGTASYYTVVVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEG 97
G NPDAAWYY P P A IK ++AFW GV VE
Sbjct 60 --GQENPDAAWYYPDPKPRAENIKGYIAFWRGVKVEA 94
>gi|82697914|gb|ABB88993.1| uncharacterized protein [uncultured crenarchaeote]
Length=92
Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 49/95 (52%), Positives = 66/95 (70%), Gaps = 3/95 (3%)
Query 2 IRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGP 61
++AVWN TV+AE+ T+ +EGNHYFPPES+ +E ES T + C WKG A YY++ +D
Sbjct 1 MKAVWNNTVIAESDSTIVIEGNHYFPPESIKKEFFDESSTNTRCGWKGTASYYSIKID-- 58
Query 62 YGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G +N D AWYY PS A +IK ++AFW GV V+
Sbjct 59 -GELNKDCAWYYPEPSEAATKIKGYIAFWKGVEVK 92
>gi|331695129|ref|YP_004331368.1| hypothetical protein Psed_1271 [Pseudonocardia dioxanivorans
CB1190]
gi|326949818|gb|AEA23515.1| protein of unknown function DUF427 [Pseudonocardia dioxanivorans
CB1190]
Length=97
Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 54/95 (57%), Positives = 64/95 (68%), Gaps = 3/95 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
+ +AVW TVLAE+ V VEGN YFP ES+ +EH S TT++CPWKG A Y V VD
Sbjct 2 IYQAVWQDTVLAESDDIVEVEGNAYFPTESVRQEHFRASDTTTVCPWKGTARYLTVTVD- 60
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
G NPDAAW Y +P P A +I+N VAFWHGV V
Sbjct 61 --GQSNPDAAWTYPQPYPAAEKIRNRVAFWHGVQV 93
>gi|161528290|ref|YP_001582116.1| hypothetical protein Nmar_0782 [Nitrosopumilus maritimus SCM1]
gi|160339591|gb|ABX12678.1| protein of unknown function DUF427 [Nitrosopumilus maritimus
SCM1]
Length=92
Score = 111 bits (278), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 48/95 (51%), Positives = 68/95 (72%), Gaps = 3/95 (3%)
Query 2 IRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGP 61
++A+WN TV+AE+ TV VEGNHYFP +S+ +E+ ++ TS C WKG+A+YY+V V+
Sbjct 1 MQAIWNDTVIAESDDTVVVEGNHYFPIDSIKKEYFEKTDMTSFCGWKGMANYYSVTVN-- 58
Query 62 YGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G N D AWYY P+ A +IK+ VAFW+GV V+
Sbjct 59 -GKTNKDCAWYYAEPNDAAMKIKDMVAFWNGVEVK 92
>gi|295695889|ref|YP_003589127.1| hypothetical protein Btus_1256 [Bacillus tusciae DSM 2912]
gi|295411491|gb|ADG05983.1| protein of unknown function DUF427 [Bacillus tusciae DSM 2912]
Length=93
Score = 111 bits (278), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 53/95 (56%), Positives = 68/95 (72%), Gaps = 3/95 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +AVW G VLAE+ +TV V+G YFPPE++ E+LI S TTS+C WKG+A YY+V VD
Sbjct 1 MPKAVWKGRVLAESDQTVEVDGYTYFPPEAVRTEYLIPSETTSVCQWKGVASYYHVKVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
+NPDAAW YR P P A RI+N +AFW G+ +
Sbjct 60 --SDINPDAAWSYRSPKPEAARIQNWIAFWRGIEI 92
>gi|186685694|ref|YP_001868890.1| hypothetical protein Npun_R5645 [Nostoc punctiforme PCC 73102]
gi|186468146|gb|ACC83947.1| protein of unknown function DUF427 [Nostoc punctiforme PCC 73102]
Length=94
Score = 111 bits (277), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 50/97 (52%), Positives = 68/97 (71%), Gaps = 3/97 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +A+WNG VLAE+ RTV VE NHYFP +++++E+ +S T + CPWKG+A YY++ VD
Sbjct 1 MPKAIWNGAVLAESDRTVVVENNHYFPADTINKEYFTDSNTHTTCPWKGVASYYSIEVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEG 97
G +N DAAWYY A+ I+ +VAFW GV VE
Sbjct 60 --GQINKDAAWYYPSTKEKAKNIEGYVAFWKGVKVEA 94
>gi|220909350|ref|YP_002484661.1| hypothetical protein Cyan7425_3986 [Cyanothece sp. PCC 7425]
gi|219865961|gb|ACL46300.1| protein of unknown function DUF427 [Cyanothece sp. PCC 7425]
Length=94
Score = 111 bits (277), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 51/96 (54%), Positives = 68/96 (71%), Gaps = 3/96 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M RA+WNG VLA++ R VEGN YFPP+S+++++ S T + CPWKG+A YY++ VD
Sbjct 1 MPRAIWNGAVLAQSDRCEVVEGNQYFPPDSINQDYFRPSDTHTTCPWKGVASYYSIEVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G VN DAAWYY P A++IK ++AFW GV VE
Sbjct 60 --GQVNKDAAWYYPEPKDAAKQIKGYIAFWKGVKVE 93
>gi|75908874|ref|YP_323170.1| hypothetical protein Ava_2662 [Anabaena variabilis ATCC 29413]
gi|75702599|gb|ABA22275.1| Protein of unknown function DUF427 [Anabaena variabilis ATCC
29413]
Length=95
Score = 111 bits (277), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 52/96 (55%), Positives = 67/96 (70%), Gaps = 3/96 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M RA+WNG VLAE+ TV VEGNHYFP +++++E+ S T + CPWKG+A YY++ VD
Sbjct 1 MPRAIWNGAVLAESDNTVVVEGNHYFPADAINKEYFQASETHTTCPWKGVASYYSIEVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G VN DAAWYY A+ I+ +VAFW GV VE
Sbjct 60 --GQVNKDAAWYYPSTKEKAKNIEGYVAFWKGVKVE 93
>gi|218438387|ref|YP_002376716.1| hypothetical protein PCC7424_1404 [Cyanothece sp. PCC 7424]
gi|218171115|gb|ACK69848.1| protein of unknown function DUF427 [Cyanothece sp. PCC 7424]
Length=94
Score = 110 bits (275), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 51/97 (53%), Positives = 66/97 (69%), Gaps = 3/97 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +A+WNG +LA++ VEGN+YFPP+S+++E+ ES T + CPWKG A YY + VD
Sbjct 1 MPQAIWNGAILAQSDNCEVVEGNYYFPPDSINKEYFKESSTHTSCPWKGQASYYTIEVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEG 97
G N DAAWYY P A IKN+VAFW GVTV+
Sbjct 60 --GQQNKDAAWYYPNPKEKANNIKNYVAFWRGVTVKS 94
>gi|320102914|ref|YP_004178505.1| hypothetical protein Isop_1371 [Isosphaera pallida ATCC 43644]
gi|319750196|gb|ADV61956.1| protein of unknown function DUF427 [Isosphaera pallida ATCC 43644]
Length=93
Score = 110 bits (275), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 52/96 (55%), Positives = 64/96 (67%), Gaps = 3/96 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +AVWNG V+A++ TV VEGNHYFP +S+ RE +S TTS+C WKG A YY+V V+
Sbjct 1 MPQAVWNGEVIAQSDDTVVVEGNHYFPADSIRREFFSDSATTSVCGWKGTARYYSVTVN- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G NPD AW Y P P A +IK +AFW GV V
Sbjct 60 --GQTNPDCAWSYPDPKPAASQIKGRIAFWKGVKVS 93
>gi|119715879|ref|YP_922844.1| hypothetical protein Noca_1644 [Nocardioides sp. JS614]
gi|119536540|gb|ABL81157.1| protein of unknown function DUF427 [Nocardioides sp. JS614]
Length=106
Score = 110 bits (275), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 56/94 (60%), Positives = 69/94 (74%), Gaps = 3/94 (3%)
Query 2 IRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGP 61
++AV++G V+AE+ T+ VEGNHYFP ESL E L + T +ICPWKGLA YY+V VD
Sbjct 3 MQAVFHGVVIAESDDTIVVEGNHYFPEESLSHEFLTPTGTHTICPWKGLASYYDVGVD-- 60
Query 62 YGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
G + +AAWYY RPSPLARR+K VAFW GV V
Sbjct 61 -GVTSANAAWYYPRPSPLARRVKGRVAFWRGVEV 93
>gi|327405812|ref|YP_004346650.1| hypothetical protein Fluta_3846 [Fluviicola taffensis DSM 16823]
gi|327321320|gb|AEA45812.1| protein of unknown function DUF427 [Fluviicola taffensis DSM
16823]
Length=95
Score = 110 bits (274), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 49/95 (52%), Positives = 66/95 (70%), Gaps = 3/95 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M RA+W T++AE+ +TV +EGNHYFP ES+ +E+ ESPT + CPWKG+A YY + V+
Sbjct 1 MKRAIWKNTIIAESDKTVIIEGNHYFPVESIKKEYFKESPTHTTCPWKGIASYYTIEVNK 60
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
N DAAW+Y + S LA+ I+ VAFW GV V
Sbjct 61 DQ---NKDAAWFYPKTSELAKSIEGKVAFWKGVQV 92
>gi|282897139|ref|ZP_06305141.1| Protein of unknown function DUF427 [Raphidiopsis brookii D9]
gi|281197791|gb|EFA72685.1| Protein of unknown function DUF427 [Raphidiopsis brookii D9]
Length=93
Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 51/96 (54%), Positives = 68/96 (71%), Gaps = 3/96 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +A+WNG VLAE+ + VEGN+YFPP++++R++ ES T + CPWKG+A YYN+ VD
Sbjct 1 MPKALWNGAVLAESDKGQVVEGNYYFPPDAINRQYFQESNTHTTCPWKGVASYYNIQVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G VN DAAWYY A+ I+ +VAFW GV VE
Sbjct 60 --GQVNKDAAWYYPATKDKAKHIEGYVAFWRGVKVE 93
>gi|17227556|ref|NP_484104.1| hypothetical protein asl0060 [Nostoc sp. PCC 7120]
gi|17135038|dbj|BAB77584.1| asl0060 [Nostoc sp. PCC 7120]
Length=95
Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 50/96 (53%), Positives = 67/96 (70%), Gaps = 3/96 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +A+WNG VLAE+ TV VEGNHYFP +++++++ S T + CPWKG+A YY++ VD
Sbjct 1 MPKAIWNGAVLAESDNTVVVEGNHYFPADAINKQYFQASETHTTCPWKGVASYYSIEVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G VN DAAWYY A+ I+ +VAFW GV VE
Sbjct 60 --GQVNKDAAWYYPSAKEKAKNIEGYVAFWKGVKVE 93
>gi|325964761|ref|YP_004242667.1| hypothetical protein Asphe3_34300 [Arthrobacter phenanthrenivorans
Sphe3]
gi|323470848|gb|ADX74533.1| uncharacterized conserved protein [Arthrobacter phenanthrenivorans
Sphe3]
Length=94
Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 50/96 (53%), Positives = 70/96 (73%), Gaps = 3/96 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M+RA+WNG V+AE+ TV VEGNHYFP ++++ +L +S S+CPWKG A Y+ + VD
Sbjct 1 MVRAIWNGKVIAESTDTVVVEGNHYFPRDAVNAGYLRDSDMYSVCPWKGRASYHTLEVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G +N DAAW+Y P P AR+I++ +AFW GVT+E
Sbjct 60 --GKLNVDAAWFYPEPKPRARQIEDRIAFWRGVTIE 93
>gi|119716305|ref|YP_923270.1| hypothetical protein Noca_2074 [Nocardioides sp. JS614]
gi|119536966|gb|ABL81583.1| protein of unknown function DUF427 [Nocardioides sp. JS614]
Length=102
Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 54/94 (58%), Positives = 66/94 (71%), Gaps = 3/94 (3%)
Query 2 IRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGP 61
++AV+NG V+AE+ TV +EGNHYFP ES+ E ++ S T S+CPWKG A YY+VVVD
Sbjct 3 MQAVFNGAVIAESESTVVLEGNHYFPEESVRHEFIVPSDTHSLCPWKGRASYYSVVVD-- 60
Query 62 YGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
G V PD+AWYY PSP A I VAFW GV V
Sbjct 61 -GAVAPDSAWYYPSPSPAAEEIGGRVAFWQGVRV 93
>gi|54023893|ref|YP_118135.1| hypothetical protein nfa19250 [Nocardia farcinica IFM 10152]
gi|54015401|dbj|BAD56771.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=96
Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 54/95 (57%), Positives = 63/95 (67%), Gaps = 3/95 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
+RA WNGTVLAE+ TV VEGNHYFP ++ E+ S T ++CPWKG A YY V VDG
Sbjct 2 TVRAEWNGTVLAESDDTVVVEGNHYFPVSAIRTEYFAPSDTHTVCPWKGTASYYTVRVDG 61
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
NPDAAWYY P A +IK+ VAFW GV V
Sbjct 62 SE---NPDAAWYYPAPKDAAAQIKDRVAFWRGVDV 93
>gi|334109370|ref|ZP_08483650.1| protein of unknown function DUF427 [Methylomicrobium album BG8]
gi|333600806|gb|EGL05622.1| protein of unknown function DUF427 [Methylomicrobium album BG8]
Length=93
Score = 108 bits (269), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 48/94 (52%), Positives = 65/94 (70%), Gaps = 3/94 (3%)
Query 2 IRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGP 61
++A+WN V+A++ ++VEGN YFPP S+H E L ES T ++CPWKG A Y ++VV+
Sbjct 1 MKALWNNAVIADSDEVIKVEGNCYFPPASVHTEFLRESATHTVCPWKGQASYCDIVVN-- 58
Query 62 YGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
G NPDAAWYY P A IKN++AFW GV +
Sbjct 59 -GATNPDAAWYYPEPKEKASSIKNYIAFWKGVEI 91
>gi|217979304|ref|YP_002363451.1| hypothetical protein Msil_3182 [Methylocella silvestris BL2]
gi|217504680|gb|ACK52089.1| protein of unknown function DUF427 [Methylocella silvestris BL2]
Length=93
Score = 108 bits (269), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 50/95 (53%), Positives = 63/95 (67%), Gaps = 3/95 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +A+WNG +A+ TV VEGNHYFP E + + L S TT++CPWKG+AHYY +V G
Sbjct 1 MPQAIWNGVAIAKTDDTVIVEGNHYFPKEDVPAQFLKPSATTTMCPWKGVAHYYTIVAGG 60
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
NPDAAWYY P P A IK+ +AFW GV +
Sbjct 61 AE---NPDAAWYYPDPKPAAAEIKDRIAFWKGVKI 92
>gi|119509540|ref|ZP_01628687.1| hypothetical protein N9414_08240 [Nodularia spumigena CCY9414]
gi|119465729|gb|EAW46619.1| hypothetical protein N9414_08240 [Nodularia spumigena CCY9414]
Length=94
Score = 107 bits (268), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 49/97 (51%), Positives = 64/97 (66%), Gaps = 3/97 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +A+WNGTVLAE+ T+ VE NHYFP +++ +E+ ES T + CPWKG+A YY + VD
Sbjct 1 MPKAIWNGTVLAESDNTIVVENNHYFPADAIKKEYFQESNTHTTCPWKGVASYYTIEVD- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVEG 97
G N DAAWYY A+ + +VAFW GV VE
Sbjct 60 --GQTNKDAAWYYPSTKERAKNFEGYVAFWKGVKVEA 94
>gi|329765038|ref|ZP_08256623.1| hypothetical protein Nlim_0380 [Candidatus Nitrosoarchaeum limnia
SFB1]
gi|329138474|gb|EGG42725.1| hypothetical protein Nlim_0380 [Candidatus Nitrosoarchaeum limnia
SFB1]
Length=92
Score = 107 bits (268), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 44/95 (47%), Positives = 68/95 (72%), Gaps = 3/95 (3%)
Query 2 IRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGP 61
++A+WN T++AE+ TV VEGN+YFP +S+ +E+ ++ T+ C WKG+A+YY+V ++
Sbjct 1 MKAIWNDTIIAESDDTVVVEGNYYFPFDSIKQEYFKKTDLTTFCGWKGMANYYSVTIN-- 58
Query 62 YGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G N D AWYY P+ A++IK VAFW+GV V+
Sbjct 59 -GKTNKDCAWYYAEPNDAAKKIKGRVAFWNGVLVK 92
>gi|340344765|ref|ZP_08667897.1| DUF427 domain containing protein [Nitrosopumilus sp. MY1]
gi|339519906|gb|EGP93629.1| DUF427 domain containing protein [Nitrosopumilus sp. MY1]
Length=92
Score = 107 bits (268), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 45/95 (48%), Positives = 66/95 (70%), Gaps = 3/95 (3%)
Query 2 IRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGP 61
++A+WN V+AE+ TV VEGNHYFP +S+ E+ ++ T+ C WKG+A+YY+V ++
Sbjct 1 MKAIWNNVVIAESDDTVVVEGNHYFPIDSIKSEYFKKTDLTTFCGWKGMANYYSVTIN-- 58
Query 62 YGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G N D AWYY P+ A++IK VAFW+GV V+
Sbjct 59 -GKTNKDCAWYYAEPNDAAKKIKGMVAFWNGVDVK 92
>gi|86607571|ref|YP_476333.1| hypothetical protein CYB_0069 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86556113|gb|ABD01070.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length=94
Score = 107 bits (267), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 55/96 (58%), Positives = 63/96 (66%), Gaps = 3/96 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M RA+WNG V+AE R VEGN YFPPESL RE+ S T + CPWKG A YY +VV+
Sbjct 1 MPRAIWNGVVVAETERYETVEGNIYFPPESLRREYFQPSSTHTTCPWKGEASYYTLVVN- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G N DAAWYY P P A I+ +VAFW GV VE
Sbjct 60 --GQENKDAAWYYPDPKPAAANIRGYVAFWKGVKVE 93
>gi|91774189|ref|YP_566881.1| hypothetical protein Mbur_2270 [Methanococcoides burtonii DSM
6242]
gi|91713204|gb|ABE53131.1| Protein of unknown function with DUF427 [Methanococcoides burtonii
DSM 6242]
Length=94
Score = 107 bits (267), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 52/95 (55%), Positives = 63/95 (67%), Gaps = 3/95 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M A WNG VLAE+ +EGN YFPPES+++E+L ES T S CPWKGLAHY+++VV+
Sbjct 1 MAIAKWNGVVLAESESVKEIEGNLYFPPESVNKEYLRESDTRSTCPWKGLAHYFDIVVN- 59
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
G N DAAW Y P A+ I H AFW GV V
Sbjct 60 --GEDNNDAAWSYPEPKEAAKEITGHFAFWKGVEV 92
>gi|299138374|ref|ZP_07031553.1| protein of unknown function DUF427 [Acidobacterium sp. MP5ACTX8]
gi|298599620|gb|EFI55779.1| protein of unknown function DUF427 [Acidobacterium sp. MP5ACTX8]
Length=95
Score = 107 bits (266), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 51/95 (54%), Positives = 65/95 (69%), Gaps = 3/95 (3%)
Query 1 MIRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDG 60
M +A+WNG LAE+ + VEG+ YFP E+++R L S T+S CPWKG A YY V+VDG
Sbjct 1 MAKAIWNGQTLAESEKIETVEGSIYFPEETVNRAFLKPSSTSSNCPWKGRARYYTVLVDG 60
Query 61 PYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTV 95
NPDAAWYY P P A+ +K+HVAFW GV +
Sbjct 61 QE---NPDAAWYYPDPKPAAKTVKHHVAFWRGVEI 92
>gi|298489802|ref|YP_003719979.1| hypothetical protein Aazo_0252 ['Nostoc azollae' 0708]
gi|298231720|gb|ADI62856.1| protein of unknown function DUF427 ['Nostoc azollae' 0708]
Length=93
Score = 107 bits (266), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 46/95 (49%), Positives = 66/95 (70%), Gaps = 3/95 (3%)
Query 2 IRAVWNGTVLAEAPRTVRVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVVDGP 61
++A WNG +LAE+ +TV VEGNHYFP +++++++ +S T + C WKG+A YYN+ VD
Sbjct 1 MKATWNGAILAESNKTVVVEGNHYFPADTINKQYFRDSDTHTTCFWKGVASYYNIEVD-- 58
Query 62 YGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
G VN DAAWYY A+ I+ ++ FW GV VE
Sbjct 59 -GQVNKDAAWYYPSTKEKAKNIEGYIGFWKGVKVE 92
>gi|302879141|ref|YP_003847705.1| hypothetical protein Galf_1935 [Gallionella capsiferriformans
ES-2]
gi|302581930|gb|ADL55941.1| protein of unknown function DUF427 [Gallionella capsiferriformans
ES-2]
Length=96
Score = 107 bits (266), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 56/98 (58%), Positives = 66/98 (68%), Gaps = 5/98 (5%)
Query 1 MIRAVWNGTVLAEAPRTV--RVEGNHYFPPESLHREHLIESPTTSICPWKGLAHYYNVVV 58
M +A+WN V+AEAP + RVEGN YFP ++ RE L S T + C WKG A YYNVVV
Sbjct 1 MPKAIWNNQVIAEAPSSDIHRVEGNIYFPLSAVKREFLQASETHTHCVWKGEASYYNVVV 60
Query 59 DGPYGPVNPDAAWYYRRPSPLARRIKNHVAFWHGVTVE 96
DG N DAAWYY P+ +A RIKNHVAFW G+ VE
Sbjct 61 DGK---ANKDAAWYYPEPTAMAARIKNHVAFWRGIKVE 95
Lambda K H
0.320 0.137 0.464
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 131043835296
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40