BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2331
Length=128
Score E
Sequences producing significant alignments: (Bits) Value
gi|15841834|ref|NP_336871.1| hypothetical protein MT2393 [Mycoba... 259 1e-67
gi|15609468|ref|NP_216847.1| hypothetical protein Rv2331 [Mycoba... 256 7e-67
gi|340627341|ref|YP_004745793.1| hypothetical protein MCAN_23621... 249 9e-65
gi|183983619|ref|YP_001851910.1| formate dehydrogenase H FdhF [M... 60.8 6e-08
gi|118617136|ref|YP_905468.1| formate dehydrogenase H FdhF [Myco... 60.8 7e-08
gi|240170977|ref|ZP_04749636.1| formate dehydrogenase H FdhF [My... 58.9 2e-07
gi|254819360|ref|ZP_05224361.1| FdhF [Mycobacterium intracellula... 54.3 5e-06
gi|342857452|ref|ZP_08714108.1| FdhF [Mycobacterium colombiense ... 53.1 1e-05
gi|254774796|ref|ZP_05216312.1| FdhF [Mycobacterium avium subsp.... 53.1 1e-05
gi|336461719|gb|EGO40581.1| sulfite reductase, alpha subunit (fl... 53.1 1e-05
gi|118465789|ref|YP_881291.1| FdhF protein [Mycobacterium avium ... 53.1 1e-05
gi|41408202|ref|NP_961038.1| FdhF [Mycobacterium avium subsp. pa... 53.1 1e-05
gi|296166333|ref|ZP_06848770.1| sulfite reductase [NADPH] hemopr... 51.6 3e-05
gi|229489436|ref|ZP_04383299.1| molybdopterin oxidoreductase Fe4... 51.2 4e-05
gi|226305479|ref|YP_002765439.1| nitrate/sulfite reductase [Rhod... 50.8 7e-05
gi|182440026|ref|YP_001827745.1| putative assimilatory nitrate r... 50.8 7e-05
gi|326780694|ref|ZP_08239959.1| Nitrate reductase., NADPH--hemop... 50.4 8e-05
gi|182678935|ref|YP_001833081.1| molybdopterin oxidoreductase [B... 50.1 1e-04
gi|239986218|ref|ZP_04706882.1| putative nitrate/sulfite reducta... 50.1 1e-04
gi|226362872|ref|YP_002780652.1| nitrate/sulfite reductase [Rhod... 48.9 2e-04
gi|302543846|ref|ZP_07296188.1| putative assimilatory nitrate re... 48.5 3e-04
gi|107026217|ref|YP_623728.1| molybdopterin oxidoreductase [Burk... 48.5 3e-04
gi|254248308|ref|ZP_04941628.1| Anaerobic dehydrogenases, typica... 48.5 4e-04
gi|333920712|ref|YP_004494293.1| putative nitrate/sulfite reduct... 48.1 4e-04
gi|78062196|ref|YP_372104.1| sulfite reductase (NADPH) alpha sub... 47.8 5e-04
gi|111020629|ref|YP_703601.1| bifunctional reductase [Rhodococcu... 47.8 6e-04
gi|134293723|ref|YP_001117459.1| sulfite reductase (NADPH) alpha... 47.4 6e-04
gi|91774673|ref|YP_544429.1| sulfite reductase (NADPH) alpha sub... 47.4 7e-04
gi|170738150|ref|YP_001779410.1| molybdopterin oxidoreductase [B... 47.4 7e-04
gi|340778147|ref|ZP_08698090.1| molybdopterin oxidoreductase [Ac... 47.4 7e-04
gi|167584203|ref|ZP_02376591.1| molybdopterin oxidoreductase [Bu... 47.4 8e-04
gi|187923974|ref|YP_001895616.1| molybdopterin oxidoreductase [B... 47.0 8e-04
gi|254254127|ref|ZP_04947444.1| Anaerobic dehydrogenase [Burkhol... 46.6 0.001
gi|111224696|ref|YP_715490.1| nitrate reductase [Frankia alni AC... 46.6 0.001
gi|311899408|dbj|BAJ31816.1| putative nitrate/sulfite reductase ... 46.2 0.001
gi|115358684|ref|YP_775822.1| molybdopterin oxidoreductase [Burk... 46.2 0.002
gi|339486279|ref|YP_004700807.1| molybdopterin oxidoreductase [P... 45.8 0.002
gi|172063419|ref|YP_001811070.1| molybdopterin oxidoreductase [B... 45.8 0.002
gi|167572965|ref|ZP_02365839.1| nitrate reductase [Burkholderia ... 45.8 0.002
gi|307729730|ref|YP_003906954.1| molybdopterin oxidoreductase [B... 45.8 0.002
gi|169629524|ref|YP_001703173.1| oxidoreductase [Mycobacterium a... 45.8 0.002
gi|343925197|ref|ZP_08764724.1| putative nitrate/sulfite reducta... 45.8 0.002
gi|302554769|ref|ZP_07307111.1| formate dehydrogenase, alpha sub... 45.8 0.002
gi|296166335|ref|ZP_06848771.1| bifunctional reductase [Mycobact... 45.8 0.002
gi|167615918|ref|ZP_02384553.1| nitrate reductase [Burkholderia ... 45.4 0.002
gi|167565877|ref|ZP_02358793.1| nitrate reductase, putative [Bur... 45.4 0.002
gi|83716994|ref|YP_439369.1| nitrate reductase [Burkholderia tha... 45.4 0.002
gi|296162575|ref|ZP_06845364.1| molybdopterin oxidoreductase [Bu... 45.4 0.002
gi|29828872|ref|NP_823506.1| assimilatory nitrate reductase larg... 45.4 0.003
gi|330993629|ref|ZP_08317563.1| Nitrate reductase [Gluconacetoba... 45.4 0.003
>gi|15841834|ref|NP_336871.1| hypothetical protein MT2393 [Mycobacterium tuberculosis CDC1551]
gi|13882098|gb|AAK46685.1| hypothetical protein MT2393 [Mycobacterium tuberculosis CDC1551]
Length=195
Score = 259 bits (662), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/128 (100%), Positives = 128/128 (100%), Gaps = 0/128 (0%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSRPCSLIGQPNTMGDEMGYLGPGL 60
VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSRPCSLIGQPNTMGDEMGYLGPGL
Sbjct 68 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSRPCSLIGQPNTMGDEMGYLGPGL 127
Query 61 AGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT 120
AGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT
Sbjct 128 AGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT 187
Query 121 GYNFRGVP 128
GYNFRGVP
Sbjct 188 GYNFRGVP 195
>gi|15609468|ref|NP_216847.1| hypothetical protein Rv2331 [Mycobacterium tuberculosis H37Rv]
gi|31793514|ref|NP_856007.1| hypothetical protein Mb2358 [Mycobacterium bovis AF2122/97]
gi|121638217|ref|YP_978441.1| hypothetical protein BCG_2352 [Mycobacterium bovis BCG str. Pasteur
1173P2]
78 more sequence titles
Length=128
Score = 256 bits (654), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 127/128 (99%), Positives = 128/128 (100%), Gaps = 0/128 (0%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSRPCSLIGQPNTMGDEMGYLGPGL 60
+PPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSRPCSLIGQPNTMGDEMGYLGPGL
Sbjct 1 MPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSRPCSLIGQPNTMGDEMGYLGPGL 60
Query 61 AGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT 120
AGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT
Sbjct 61 AGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT 120
Query 121 GYNFRGVP 128
GYNFRGVP
Sbjct 121 GYNFRGVP 128
>gi|340627341|ref|YP_004745793.1| hypothetical protein MCAN_23621 [Mycobacterium canettii CIPT
140010059]
gi|340005531|emb|CCC44693.1| hypothetical protein MCAN_23621 [Mycobacterium canettii CIPT
140010059]
Length=128
Score = 249 bits (636), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 125/128 (98%), Positives = 126/128 (99%), Gaps = 0/128 (0%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSRPCSLIGQPNTMGDEMGYLGPGL 60
+PPVFLPQIGRLTP AVGEAIGIAADDIPMAARWIGSRP SLIGQPNTMGDEMGYLGPGL
Sbjct 1 MPPVFLPQIGRLTPAAVGEAIGIAADDIPMAARWIGSRPGSLIGQPNTMGDEMGYLGPGL 60
Query 61 AGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT 120
AGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT
Sbjct 61 AGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT 120
Query 121 GYNFRGVP 128
GYNFRGVP
Sbjct 121 GYNFRGVP 128
>gi|183983619|ref|YP_001851910.1| formate dehydrogenase H FdhF [Mycobacterium marinum M]
gi|183176945|gb|ACC42055.1| formate dehydrogenase H FdhF [Mycobacterium marinum M]
Length=1402
Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 32/52 (62%), Positives = 41/52 (79%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT--GYNFRGVP 128
++ PV ASVDE +SVLKP++PRL+SIS K PGEV+LTV+ YNF+GVP
Sbjct 1169 AQSPVRASVDEWMSVLKPIQPRLYSISSSPKEHPGEVHLTVSPVRYNFQGVP 1220
Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/107 (40%), Positives = 45/107 (43%), Gaps = 44/107 (41%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG------------------------ 36
V P FL Q TPD V E GI ++DI AARWIG
Sbjct 260 VMPSFLEQ---FTPDRVSEITGIPSEDIRTAARWIGTAANWMSCWTMGLNQSTHGTWNTN 316
Query 37 ----------------SRPCSLIGQPNTMG-DEMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 317 ALCNLHLATGAICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 363
>gi|118617136|ref|YP_905468.1| formate dehydrogenase H FdhF [Mycobacterium ulcerans Agy99]
gi|118569246|gb|ABL03997.1| formate dehydrogenase H FdhF [Mycobacterium ulcerans Agy99]
Length=1402
Score = 60.8 bits (146), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/52 (62%), Positives = 41/52 (79%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT--GYNFRGVP 128
++ PV ASVDE +SVLKP++PRL+SIS K PGEV+LTV+ YNF+GVP
Sbjct 1169 AQSPVRASVDEWMSVLKPIQPRLYSISSSPKEHPGEVHLTVSPVRYNFQGVP 1220
Score = 55.1 bits (131), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/107 (40%), Positives = 45/107 (43%), Gaps = 44/107 (41%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG------------------------ 36
V P FL Q TPD V E GI ++DI AARWIG
Sbjct 260 VMPSFLEQ---FTPDRVSEITGIPSEDIRTAARWIGTAANWMSCWTMGLNQSTHGTWNTN 316
Query 37 ----------------SRPCSLIGQPNTMG-DEMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 317 ALCNLHLATGAICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 363
>gi|240170977|ref|ZP_04749636.1| formate dehydrogenase H FdhF [Mycobacterium kansasii ATCC 12478]
Length=1400
Score = 58.9 bits (141), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/52 (58%), Positives = 40/52 (77%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT--GYNFRGVP 128
++ PV A+ DE L+VLKP++PRL+SIS K PGEV+LTV+ YNF+GVP
Sbjct 1167 AQAPVTATADEWLAVLKPLQPRLYSISSSPKQHPGEVHLTVSPVRYNFQGVP 1218
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/107 (38%), Positives = 43/107 (41%), Gaps = 44/107 (41%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSR---------------------- 38
V P FL Q TP+ V E GI +DI AARWIG
Sbjct 265 VMPSFLEQ---FTPEVVSEITGIPVEDIRTAARWIGEAANWMSCWTMGLNQSTHGTWNTN 321
Query 39 ------------------PCSLIGQPNTMG-DEMGYLGPGLAGQRCV 66
P SL GQPN MG EMGY+GPGL GQR V
Sbjct 322 ALCNLHLATGAICKPGNGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 368
>gi|254819360|ref|ZP_05224361.1| FdhF [Mycobacterium intracellulare ATCC 13950]
Length=395
Score = 54.3 bits (129), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 31/52 (60%), Positives = 39/52 (75%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT--GYNFRGVP 128
++LPVIAS E L VLK ++PRL+SIS K PGEV+LTV+ YNF+GVP
Sbjct 162 AQLPVIASAHEWLRVLKRLQPRLYSISSSPKECPGEVHLTVSPVRYNFQGVP 213
>gi|342857452|ref|ZP_08714108.1| FdhF [Mycobacterium colombiense CECT 3035]
gi|342134785|gb|EGT87951.1| FdhF [Mycobacterium colombiense CECT 3035]
Length=1394
Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/52 (58%), Positives = 38/52 (74%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT--GYNFRGVP 128
S+LPV AS E L VLK ++PRL+SIS K PGEV+LTV+ +NF+GVP
Sbjct 1161 SKLPVSASAHEWLRVLKRLQPRLYSISSSPKACPGEVHLTVSPVRFNFQGVP 1212
Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/32 (72%), Positives = 24/32 (75%), Gaps = 1/32 (3%)
Query 36 GSRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
GS P SL GQPN MG EMGY+GPGL GQR V
Sbjct 312 GSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 343
>gi|254774796|ref|ZP_05216312.1| FdhF [Mycobacterium avium subsp. avium ATCC 25291]
Length=1411
Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/52 (58%), Positives = 38/52 (74%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT--GYNFRGVP 128
++LPV AS E L VLK ++PRL+SIS K PGEV+LTV+ YNF+GVP
Sbjct 1178 AQLPVSASAHEWLRVLKRLQPRLYSISSSPKACPGEVHLTVSPVRYNFQGVP 1229
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/107 (37%), Positives = 42/107 (40%), Gaps = 44/107 (41%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG------------------------ 36
V P FL Q TPD V E GI DI AA+ +G
Sbjct 259 VMPSFLEQ---YTPDRVSEITGIPEQDIRAAAQLVGEADNFMSCWTMGLNQSTHGTWNTN 315
Query 37 ----------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 316 AICNLHLATGAICKPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 362
>gi|336461719|gb|EGO40581.1| sulfite reductase, alpha subunit (flavoprotein) [Mycobacterium
avium subsp. paratuberculosis S397]
Length=1416
Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/52 (58%), Positives = 38/52 (74%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT--GYNFRGVP 128
++LPV AS E L VLK ++PRL+SIS K PGEV+LTV+ YNF+GVP
Sbjct 1183 AQLPVSASAHEWLRVLKRLQPRLYSISSSPKACPGEVHLTVSPVRYNFQGVP 1234
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/107 (38%), Positives = 42/107 (40%), Gaps = 44/107 (41%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG------------------------ 36
V P FL Q TPD V E GI DI AA+ IG
Sbjct 264 VMPSFLEQ---YTPDRVSEITGIPEQDIRAAAQLIGEADNFMSCWTMGLNQSTHGTWNTN 320
Query 37 ----------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 321 AICNLHLATGAICKPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 367
>gi|118465789|ref|YP_881291.1| FdhF protein [Mycobacterium avium 104]
gi|118167076|gb|ABK67973.1| FdhF protein [Mycobacterium avium 104]
Length=1405
Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/52 (58%), Positives = 38/52 (74%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT--GYNFRGVP 128
++LPV AS E L VLK ++PRL+SIS K PGEV+LTV+ YNF+GVP
Sbjct 1172 AQLPVSASAHEWLRVLKRLQPRLYSISSSPKACPGEVHLTVSPVRYNFQGVP 1223
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/107 (38%), Positives = 42/107 (40%), Gaps = 44/107 (41%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG------------------------ 36
V P FL Q TPD V E GI DI AA+ IG
Sbjct 253 VMPSFLEQ---YTPDRVSEITGIPEQDIRAAAQLIGEADNFMSCWTMGLNQSTHGTWNTN 309
Query 37 ----------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 310 AICNLHLATGAICKPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 356
>gi|41408202|ref|NP_961038.1| FdhF [Mycobacterium avium subsp. paratuberculosis K-10]
gi|41396557|gb|AAS04421.1| FdhF [Mycobacterium avium subsp. paratuberculosis K-10]
Length=1411
Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/52 (58%), Positives = 38/52 (74%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT--GYNFRGVP 128
++LPV AS E L VLK ++PRL+SIS K PGEV+LTV+ YNF+GVP
Sbjct 1178 AQLPVSASAHEWLRVLKRLQPRLYSISSSPKACPGEVHLTVSPVRYNFQGVP 1229
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/107 (38%), Positives = 42/107 (40%), Gaps = 44/107 (41%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG------------------------ 36
V P FL Q TPD V E GI DI AA+ IG
Sbjct 259 VMPSFLEQ---YTPDRVSEITGIPEQDIRAAAQLIGEADNFMSCWTMGLNQSTHGTWNTN 315
Query 37 ----------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 316 AICNLHLATGAICKPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 362
>gi|296166333|ref|ZP_06848770.1| sulfite reductase [NADPH] hemoprotein alpha subunit [Mycobacterium
parascrofulaceum ATCC BAA-614]
gi|295898342|gb|EFG77911.1| sulfite reductase [NADPH] hemoprotein alpha subunit [Mycobacterium
parascrofulaceum ATCC BAA-614]
Length=371
Score = 51.6 bits (122), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/52 (56%), Positives = 37/52 (72%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT--GYNFRGVP 128
++ PV AS E L VLK ++PRL+SIS K PGEV+LTV+ YNF+GVP
Sbjct 138 AQHPVSASAHEWLRVLKRLQPRLYSISSSPKENPGEVHLTVSPVRYNFKGVP 189
>gi|229489436|ref|ZP_04383299.1| molybdopterin oxidoreductase Fe4S4 domain protein [Rhodococcus
erythropolis SK121]
gi|229323533|gb|EEN89291.1| molybdopterin oxidoreductase Fe4S4 domain protein [Rhodococcus
erythropolis SK121]
Length=1394
Score = 51.2 bits (121), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/52 (56%), Positives = 36/52 (70%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVTG--YNFRGVP 128
++ PV AS DE LS+LKP++PRL+SIS K P EV LTV+ YN GVP
Sbjct 1161 AQSPVRASTDEWLSLLKPIQPRLYSISSSPKENPREVQLTVSAVRYNVAGVP 1212
Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/107 (37%), Positives = 43/107 (41%), Gaps = 44/107 (41%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG------------------------ 36
V P FL + TP+ V E GI DI AA+WIG
Sbjct 262 VMPSFLAE---YTPEKVSEITGIPEADIRTAAQWIGEAANWMSCWTMGLNQSTHGTWNTN 318
Query 37 ----------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 319 AICNLHLATGAICKPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRAV 365
>gi|226305479|ref|YP_002765439.1| nitrate/sulfite reductase [Rhodococcus erythropolis PR4]
gi|226184596|dbj|BAH32700.1| putative nitrate/sulfite reductase [Rhodococcus erythropolis
PR4]
Length=1398
Score = 50.8 bits (120), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/52 (56%), Positives = 36/52 (70%), Gaps = 2/52 (3%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVTG--YNFRGVP 128
++ PV AS DE LS+LKP++PRL+SIS K P EV LTV+ YN GVP
Sbjct 1165 AQSPVRASTDEWLSLLKPLQPRLYSISSSPKENPREVQLTVSAVRYNVAGVP 1216
Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/107 (37%), Positives = 43/107 (41%), Gaps = 44/107 (41%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG------------------------ 36
V P FL + TP+ V E GI DI AA+WIG
Sbjct 266 VMPSFLAE---YTPEKVSEITGIPEADIRTAAQWIGEAANWMSCWTMGLNQSTHGTWNTN 322
Query 37 ----------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 323 AICNLHLATGAICKPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRAV 369
>gi|182440026|ref|YP_001827745.1| putative assimilatory nitrate reductase large subunit [Streptomyces
griseus subsp. griseus NBRC 13350]
gi|178468542|dbj|BAG23062.1| putative assimilatory nitrate reductase large subunit [Streptomyces
griseus subsp. griseus NBRC 13350]
Length=1419
Score = 50.8 bits (120), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/94 (41%), Positives = 38/94 (41%), Gaps = 41/94 (43%)
Query 14 PDAVGEAIGIAADDIPMAARWIG------------------------------------- 36
P AV E GI DDI AARWIG
Sbjct 277 PAAVAEITGIPEDDIRQAARWIGEAGEWMSCWTMGLNQSTHGTWNTNAIVNLHLATGAIC 336
Query 37 ---SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGYLGPGL GQR V
Sbjct 337 RPGSGPFSLTGQPNAMGGREMGYLGPGLPGQRSV 370
>gi|326780694|ref|ZP_08239959.1| Nitrate reductase., NADPH--hemoprotein reductase [Streptomyces
cf. griseus XylebKG-1]
gi|326661027|gb|EGE45873.1| Nitrate reductase., NADPH--hemoprotein reductase [Streptomyces
griseus XylebKG-1]
Length=1419
Score = 50.4 bits (119), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 38/94 (41%), Positives = 38/94 (41%), Gaps = 41/94 (43%)
Query 14 PDAVGEAIGIAADDIPMAARWIG------------------------------------- 36
P AV E GI DDI AARWIG
Sbjct 277 PAAVAEITGIPEDDIRQAARWIGEAGEWMSCWTMGLNQSTHGTWNTNAIVNLHLATGAIC 336
Query 37 ---SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGYLGPGL GQR V
Sbjct 337 RPGSGPFSLTGQPNAMGGREMGYLGPGLPGQRSV 370
>gi|182678935|ref|YP_001833081.1| molybdopterin oxidoreductase [Beijerinckia indica subsp. indica
ATCC 9039]
gi|182634818|gb|ACB95592.1| molybdopterin oxidoreductase [Beijerinckia indica subsp. indica
ATCC 9039]
Length=1313
Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/113 (36%), Positives = 46/113 (41%), Gaps = 45/113 (39%)
Query 9 IGRLTPDAVGEAIGIAADDIPMAARWI--------------------------------- 35
+ TP+ V E G+A DDI AARWI
Sbjct 237 LAAYTPERVVELTGLARDDIETAARWIAEAQAWMSLWTMGLNQSVMGTWHTNALCNLHLA 296
Query 36 -------GSRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV----DRLVMGASRS 76
G+ P SL GQPN MG EMGY+GPGL GQR V DR+ A S
Sbjct 297 TGAICRPGAGPFSLTGQPNAMGGREMGYMGPGLPGQRVVTVAEDRVFAEAQWS 349
>gi|239986218|ref|ZP_04706882.1| putative nitrate/sulfite reductase [Streptomyces roseosporus
NRRL 11379]
gi|291443163|ref|ZP_06582553.1| assimilatory nitrate reductase large subunit [Streptomyces roseosporus
NRRL 15998]
gi|291346110|gb|EFE73014.1| assimilatory nitrate reductase large subunit [Streptomyces roseosporus
NRRL 15998]
Length=1395
Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/94 (40%), Positives = 38/94 (41%), Gaps = 41/94 (43%)
Query 14 PDAVGEAIGIAADDIPMAARWIG------------------------------------- 36
P AV E GI DDI AARWIG
Sbjct 277 PAAVAEITGIPEDDIRQAARWIGEAGEWMSCWTMGLNQSTHGTWNTNALINLHLATGAIC 336
Query 37 ---SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 337 RPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 370
>gi|226362872|ref|YP_002780652.1| nitrate/sulfite reductase [Rhodococcus opacus B4]
gi|226241359|dbj|BAH51707.1| putative nitrate/sulfite reductase [Rhodococcus opacus B4]
Length=1383
Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/107 (36%), Positives = 42/107 (40%), Gaps = 44/107 (41%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSR---------------------- 38
V P FL + TP+ V E GI DI AA+WIG
Sbjct 268 VMPEFLEE---YTPEKVAEITGIPESDIRQAAQWIGEAENWMSCWTMGLNQSTHGTWNTN 324
Query 39 ------------------PCSLIGQPNTMG-DEMGYLGPGLAGQRCV 66
P SL GQPN MG EMGY+GPGL GQR V
Sbjct 325 AICNLHLATGAICRPGNGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 371
Score = 40.8 bits (94), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 25/55 (46%), Positives = 32/55 (59%), Gaps = 2/55 (3%)
Query 74 SRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVTG--YNFRG 126
S ++ V A +E L VLK ++PRL+SIS K PGEV LTV+ YN G
Sbjct 1145 SMDVLAQYEVDADAEEWLGVLKRLQPRLYSISSSPKVDPGEVQLTVSAVRYNHEG 1199
>gi|302543846|ref|ZP_07296188.1| putative assimilatory nitrate reductase/sulfite reductase [Streptomyces
hygroscopicus ATCC 53653]
gi|302461464|gb|EFL24557.1| putative assimilatory nitrate reductase/sulfite reductase [Streptomyces
himastatinicus ATCC 53653]
Length=1383
Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/105 (38%), Positives = 41/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL P+ V E GI DDI AARWIG
Sbjct 254 PAFLDD---YPPERVAEITGIPEDDIREAARWIGESGEWMSCWTMGLNQSTHGTWNTNAL 310
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 311 VNLHLATGAICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 355
>gi|107026217|ref|YP_623728.1| molybdopterin oxidoreductase [Burkholderia cenocepacia AU 1054]
gi|116692598|ref|YP_838131.1| molybdopterin oxidoreductase [Burkholderia cenocepacia HI2424]
gi|105895591|gb|ABF78755.1| sulfite reductase (NADPH) alpha subunit [Burkholderia cenocepacia
AU 1054]
gi|116650598|gb|ABK11238.1| sulfite reductase (NADPH) alpha subunit [Burkholderia cenocepacia
HI2424]
Length=1395
Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/105 (37%), Positives = 42/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL TP+ V E G+A DI AA+WIG
Sbjct 254 PAFLAD---YTPERVAEITGLAEADIRTAAQWIGDAQEWMSCWTMGLNQSTHGVWNTNAI 310
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 311 CNLHLATGAICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 355
Score = 35.4 bits (80), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 27/86 (32%), Positives = 40/86 (47%), Gaps = 14/86 (16%)
Query 41 SLIGQPNTMGDEMGYLGPGLAGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPR 100
SL+G+ D G L L GQ+ D L PV S E + +LK ++PR
Sbjct 1138 SLLGE-----DRKGDLKQWLWGQQLADVL---------HEYPVELSGAELVGMLKRMQPR 1183
Query 101 LHSISFIFKGRPGEVYLTVTGYNFRG 126
L+SI+ GE++LTV+ +
Sbjct 1184 LYSIASSPSAHEGEIHLTVSAVRYHN 1209
>gi|254248308|ref|ZP_04941628.1| Anaerobic dehydrogenases, typically selenocysteine-containing
[Burkholderia cenocepacia PC184]
gi|124874809|gb|EAY64799.1| Anaerobic dehydrogenases, typically selenocysteine-containing
[Burkholderia cenocepacia PC184]
Length=792
Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/105 (37%), Positives = 42/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL TP+ V E G+A DI AA+WIG
Sbjct 254 PAFLAD---YTPERVAEITGLAEADIRTAAQWIGDAQEWMSCWTMGLNQSTHGVWNTNAI 310
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 311 CNLHLATGKICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 355
>gi|333920712|ref|YP_004494293.1| putative nitrate/sulfite reductase [Amycolicicoccus subflavus
DQS3-9A1]
gi|333482933|gb|AEF41493.1| Putative nitrate/sulfite reductase [Amycolicicoccus subflavus
DQS3-9A1]
Length=1391
Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/105 (37%), Positives = 40/105 (39%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL TPD V E G+ DI AA WIG
Sbjct 273 PEFLAD---YTPDTVSEITGLDEADIRTAATWIGEANNWMTLWTMGLNQSTHGTWNTNAI 329
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 330 CNLHLATGAICKPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRAV 374
Score = 39.3 bits (90), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 26/56 (47%), Positives = 32/56 (58%), Gaps = 1/56 (1%)
Query 66 VDRLVMGASRSTCSR-LPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT 120
+DR + G R PV A V E L VLK ++PRL+SIS K P EV LTV+
Sbjct 1143 LDRFLWGKQAVDVIREFPVSADVIEWLDVLKRLQPRLYSISSSPKVSPDEVQLTVS 1198
>gi|78062196|ref|YP_372104.1| sulfite reductase (NADPH) alpha subunit [Burkholderia sp. 383]
gi|77970081|gb|ABB11460.1| sulfite reductase (NADPH) alpha subunit [Burkholderia sp. 383]
Length=1395
Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/105 (36%), Positives = 42/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL TP+ V E G+A D+ AA+WIG
Sbjct 254 PAFLAD---YTPERVAEITGLAEADLRTAAQWIGDAQEWMSCWTMGLNQSTHGVWNTNAI 310
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 311 CNLHLATGRICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 355
Score = 36.2 bits (82), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 26/79 (33%), Positives = 38/79 (49%), Gaps = 10/79 (12%)
Query 49 MGDE-MGYLGPGLAGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFI 107
+GDE G L L GQ+ D L PV S E + +LK ++PRL+SI+
Sbjct 1140 LGDERKGDLKQWLWGQQLADVL---------HEFPVELSGTELVGMLKRMQPRLYSIASS 1190
Query 108 FKGRPGEVYLTVTGYNFRG 126
GE++LTV+ +
Sbjct 1191 PSAHQGEIHLTVSAVRYHN 1209
>gi|111020629|ref|YP_703601.1| bifunctional reductase [Rhodococcus jostii RHA1]
gi|110820159|gb|ABG95443.1| possible bifunctional reductase [Rhodococcus jostii RHA1]
Length=1378
Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/107 (36%), Positives = 43/107 (41%), Gaps = 44/107 (41%)
Query 1 VPPVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG------------------------ 36
V P FL + TP+ V E GI +I AA+WIG
Sbjct 263 VMPEFLEE---YTPEKVAEITGIPESEIRRAAQWIGEAENWMSCWTMGLNQSTHGTWNTN 319
Query 37 ----------------SRPCSLIGQPNTMG-DEMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 320 AICNLHLATGAICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 366
Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 26/46 (57%), Positives = 30/46 (66%), Gaps = 2/46 (4%)
Query 83 VIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVTG--YNFRG 126
V A DE LSVLK ++PRL+SIS K PGEV LTV+ YN G
Sbjct 1149 VDADADEWLSVLKRLQPRLYSISSSPKVDPGEVQLTVSAVRYNHEG 1194
>gi|134293723|ref|YP_001117459.1| sulfite reductase (NADPH) alpha subunit [Burkholderia vietnamiensis
G4]
gi|134136880|gb|ABO57994.1| sulfite reductase (NADPH) alpha subunit [Burkholderia vietnamiensis
G4]
Length=1402
Score = 47.4 bits (111), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/105 (36%), Positives = 42/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL TP+ V E G+A DD+ AA+WIG
Sbjct 254 PAFLAD---YTPERVAEITGLAPDDLRTAAQWIGDAPEWTSCWTMGLNQSTHGVWNTNAI 310
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+G GL GQR V
Sbjct 311 CNLHLATGTICRPGSGPFSLTGQPNAMGGREMGYMGAGLPGQRSV 355
>gi|91774673|ref|YP_544429.1| sulfite reductase (NADPH) alpha subunit [Methylobacillus flagellatus
KT]
gi|91708660|gb|ABE48588.1| sulfite reductase (NADPH) alpha subunit [Methylobacillus flagellatus
KT]
Length=1383
Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 38/105 (37%), Positives = 41/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL +P+AV E GI DI AA WIG
Sbjct 253 PEFLED---YSPEAVSEITGIPEADIRQAAEWIGKASNWMSCWTMGLNQSTHGTWHTNAI 309
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 310 CNLHLATGAICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 354
Score = 37.0 bits (84), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 26/69 (38%), Positives = 36/69 (53%), Gaps = 9/69 (13%)
Query 52 EMGYLGPGLAGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGR 111
+ G L P L G++ VD L P+ V E L VLK ++PRL+SIS K
Sbjct 1136 KQGELKPWLWGKQPVDLL---------HAYPIKVEVSEWLGVLKRLQPRLYSISSSPKTV 1186
Query 112 PGEVYLTVT 120
+V+LTV+
Sbjct 1187 TDQVHLTVS 1195
>gi|170738150|ref|YP_001779410.1| molybdopterin oxidoreductase [Burkholderia cenocepacia MC0-3]
gi|169820338|gb|ACA94920.1| molybdopterin oxidoreductase [Burkholderia cenocepacia MC0-3]
Length=1395
Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/105 (36%), Positives = 42/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL TP+ V + G+A DI AA+WIG
Sbjct 254 PAFLAD---YTPERVADITGLAEADIRTAAQWIGDAQEWMSCWTMGLNQSTHGVWNTNAI 310
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 311 CNLHLATGKICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 355
Score = 35.4 bits (80), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 27/86 (32%), Positives = 40/86 (47%), Gaps = 14/86 (16%)
Query 41 SLIGQPNTMGDEMGYLGPGLAGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPR 100
SL+G+ D G L L GQ+ D L PV S E + +LK ++PR
Sbjct 1138 SLLGE-----DRKGDLKQWLWGQQLADVL---------HEYPVELSGAELVGMLKRMQPR 1183
Query 101 LHSISFIFKGRPGEVYLTVTGYNFRG 126
L+SI+ GE++LTV+ +
Sbjct 1184 LYSIASSPSAHEGEIHLTVSAVRYHN 1209
>gi|340778147|ref|ZP_08698090.1| molybdopterin oxidoreductase [Acetobacter aceti NBRC 14818]
Length=1317
Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/92 (37%), Positives = 38/92 (42%), Gaps = 41/92 (44%)
Query 14 PDAVGEAIGIAADDIPMAARWI-------------------------------------- 35
P+ V E G+ A+DI AARWI
Sbjct 262 PETVAEITGLPAEDIVTAARWIAEAGEWMSLWTMGLNQSITGTWNTNAVCNLHLATGTIC 321
Query 36 --GSRPCSLIGQPNTMGD-EMGYLGPGLAGQR 64
GS P SL GQPN MG EMGY+GPGL GQR
Sbjct 322 RPGSGPFSLTGQPNAMGGREMGYMGPGLPGQR 353
>gi|167584203|ref|ZP_02376591.1| molybdopterin oxidoreductase [Burkholderia ubonensis Bu]
Length=792
Score = 47.4 bits (111), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/105 (36%), Positives = 42/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL TP+ V G+A DI +AA+WIG
Sbjct 254 PAFLAD---YTPEKVAAITGLAEADIRLAAQWIGDAPEWMSCWTMGLNQSTHGVWNTNAI 310
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 311 CNLHLATGKLCRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 355
>gi|187923974|ref|YP_001895616.1| molybdopterin oxidoreductase [Burkholderia phytofirmans PsJN]
gi|187715168|gb|ACD16392.1| molybdopterin oxidoreductase [Burkholderia phytofirmans PsJN]
Length=1403
Score = 47.0 bits (110), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/105 (37%), Positives = 43/105 (41%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAAR-------WI-------------------- 35
P FL TP+ V + G+A +DI AAR WI
Sbjct 254 PAFLAD---YTPENVAQITGLAVEDIRQAARMIGTAKEWISCWTMGLNQSTHGTWHTNAI 310
Query 36 -------------GSRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
GS P SL GQPN MG EMGY+GPGL GQR V
Sbjct 311 CNLHLATGKICRRGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 355
>gi|254254127|ref|ZP_04947444.1| Anaerobic dehydrogenase [Burkholderia dolosa AUO158]
gi|124898772|gb|EAY70615.1| Anaerobic dehydrogenase [Burkholderia dolosa AUO158]
Length=820
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/105 (37%), Positives = 41/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL TP V E G+A DI AA+WIG
Sbjct 254 PAFLAD---YTPARVAEITGLAEADIRTAAQWIGDAPEWTSCWTMGLNQSTHGVWNTNAL 310
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 311 CNLHLATGKICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 355
>gi|111224696|ref|YP_715490.1| nitrate reductase [Frankia alni ACN14a]
gi|111152228|emb|CAJ63957.2| Nitrate reductase (partial match) [Frankia alni ACN14a]
Length=1448
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/105 (37%), Positives = 41/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSR------------------------ 38
P FL PD V E GI A DI AA+WIG+
Sbjct 235 PAFLAD---YPPDRVAEITGIPAADIRRAAQWIGAAKAWMSLWTMGLNQSTHGTWNTNAL 291
Query 39 ----------------PCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
P SL GQPN MG EMGY+GPGL GQR V
Sbjct 292 CNLHLATGAICRPGAGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 336
Score = 37.0 bits (84), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/48 (44%), Positives = 28/48 (59%), Gaps = 0/48 (0%)
Query 79 SRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVTGYNFRG 126
+ LPV A+ E VL ++PRL+SIS P EV+LTV+ F G
Sbjct 1208 AELPVRATAAEWAGVLSRLQPRLYSISSSPLAHPDEVHLTVSVVRFDG 1255
>gi|311899408|dbj|BAJ31816.1| putative nitrate/sulfite reductase [Kitasatospora setae KM-6054]
Length=1353
Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/47 (58%), Positives = 29/47 (62%), Gaps = 4/47 (8%)
Query 36 GSRPCSLIGQPNTMGD-EMGYLGPGLAGQRCVDRLVMGASRSTCSRL 81
GS P SL GQPN MG EMGY+GPGL GQR V + A R RL
Sbjct 339 GSGPFSLTGQPNAMGGREMGYMGPGLPGQRTV---LSAADREFTERL 382
>gi|115358684|ref|YP_775822.1| molybdopterin oxidoreductase [Burkholderia ambifaria AMMD]
gi|115283972|gb|ABI89488.1| sulfite reductase (NADPH) alpha subunit [Burkholderia ambifaria
AMMD]
Length=1397
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/105 (36%), Positives = 41/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL TP+ V E G+A DI AA+WIG
Sbjct 254 PAFLAD---YTPERVAEITGLAEADIRTAAQWIGDAQEWMSCWTMGLNQSTHGVWNTNAI 310
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+G GL GQR V
Sbjct 311 CNLHLATGTICRPGSGPFSLTGQPNAMGGREMGYMGAGLPGQRSV 355
Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/76 (32%), Positives = 36/76 (48%), Gaps = 9/76 (11%)
Query 51 DEMGYLGPGLAGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKG 110
D L L GQ+ D L PV S E +++LK ++PRL+SI+
Sbjct 1145 DRKADLKQWLWGQQLADVL---------HEFPVELSGAELVAMLKRLQPRLYSIASSPSA 1195
Query 111 RPGEVYLTVTGYNFRG 126
PGE++LTV+ +
Sbjct 1196 HPGEIHLTVSAVRYHN 1211
>gi|339486279|ref|YP_004700807.1| molybdopterin oxidoreductase [Pseudomonas putida S16]
gi|338837122|gb|AEJ11927.1| molybdopterin oxidoreductase [Pseudomonas putida S16]
Length=1341
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/103 (36%), Positives = 39/103 (38%), Gaps = 44/103 (42%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIGSR------------------------ 38
P FL TPD V G+A DI AA WIGS
Sbjct 254 PAFLDDY---TPDHVAAITGLAEADIIKAAEWIGSARNWMSCWTMGLNQSIHGTWHTNAI 310
Query 39 ----------------PCSLIGQPNTMGD-EMGYLGPGLAGQR 64
P SL GQPN MG EMGY+GPGL GQR
Sbjct 311 CNLHLATGAICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQR 353
>gi|172063419|ref|YP_001811070.1| molybdopterin oxidoreductase [Burkholderia ambifaria MC40-6]
gi|171995936|gb|ACB66854.1| molybdopterin oxidoreductase [Burkholderia ambifaria MC40-6]
Length=1397
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/105 (36%), Positives = 41/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------- 36
P FL TP+ V E G+A DI AA+WIG
Sbjct 254 PAFLAD---YTPERVAEITGLAEADIRTAAQWIGDAQEWMSCWTMGLNQSTHGVWNTNAI 310
Query 37 --------------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+G GL GQR V
Sbjct 311 CNLHLATGTICRPGSGPFSLTGQPNAMGGREMGYMGAGLPGQRSV 355
Score = 37.0 bits (84), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/76 (32%), Positives = 36/76 (48%), Gaps = 9/76 (11%)
Query 51 DEMGYLGPGLAGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKG 110
D L L GQ+ D L PV S E +++LK ++PRL+SI+
Sbjct 1145 DRKADLKQWLWGQQLADVL---------HEFPVELSGAELVAMLKRLQPRLYSIASSPSA 1195
Query 111 RPGEVYLTVTGYNFRG 126
PGE++LTV+ +
Sbjct 1196 HPGEIHLTVSAVRYHN 1211
>gi|167572965|ref|ZP_02365839.1| nitrate reductase [Burkholderia oklahomensis C6786]
Length=1412
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/93 (37%), Positives = 38/93 (41%), Gaps = 41/93 (44%)
Query 13 TPDAVGEAIGIAADDIPMAARWIG------------------------------------ 36
TP+ V GIA DI +AA+WIG
Sbjct 261 TPEKVAAITGIAEADIRLAAQWIGDAREWTSCWTMGLNQSTHGTWNTNAICNLHLATGRI 320
Query 37 ----SRPCSLIGQPNTMGD-EMGYLGPGLAGQR 64
S P SL GQPN MG EMGY+GPGL GQR
Sbjct 321 CRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQR 353
Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 36/126 (29%), Positives = 56/126 (45%), Gaps = 12/126 (9%)
Query 3 PVFLPQIGRL-TPDAVGEAIGIAADDIPMAARWIGSRPCSLIGQPNTMGDE-MGYLGPGL 60
PV + +G L DA+ + + I P A +I SR + +GD+ L L
Sbjct 1111 PVTVAGVGELRVADALAKHLDITRPH-PDALAFIASRSRAGGALKRLLGDDRKADLKNWL 1169
Query 61 AGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVT 120
GQ+ D L PV S E + +LK ++PRL+SI+ GE++LTV+
Sbjct 1170 WGQQLADVL---------HEFPVELSAAELVGMLKRLQPRLYSIASSPNAHRGEIHLTVS 1220
Query 121 GYNFRG 126
+
Sbjct 1221 AVRYNN 1226
>gi|307729730|ref|YP_003906954.1| molybdopterin oxidoreductase [Burkholderia sp. CCGE1003]
gi|307584265|gb|ADN57663.1| molybdopterin oxidoreductase [Burkholderia sp. CCGE1003]
Length=1416
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/105 (36%), Positives = 41/105 (40%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIP-------MAARWI-------------------- 35
P FL TP+ V G+A DDI MA W+
Sbjct 254 PAFLAD---YTPEKVAHITGLAVDDIRRAAQMIGMAHEWMSCWTMGLNQSTHGTWHTNAI 310
Query 36 -------------GSRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
GS P SL GQPN MG EMGY+GPGL GQR V
Sbjct 311 CNLHLATGKICRRGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 355
Score = 38.9 bits (89), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 37/125 (30%), Positives = 55/125 (44%), Gaps = 11/125 (8%)
Query 3 PVFLPQIGRLT-PDAVGEAIGIAADDIPMAARWIGSRPCSLIGQPNTMGDEMGYLGPGLA 61
PV + G + DA+G+ IA + P A +I SR + + + L L
Sbjct 1116 PVHVAGAGEMRLADALGKHYEIARPN-PDALAFIASRSSNGALRELLAPERKADLKQWLW 1174
Query 62 GQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVTG 121
GQ+ D L PV + E +LK ++PRL+SI+ K PGEV+LTV
Sbjct 1175 GQQLADVL---------HEFPVKLTAAELTGMLKRLQPRLYSIASSPKAHPGEVHLTVAA 1225
Query 122 YNFRG 126
+
Sbjct 1226 VRYNN 1230
>gi|169629524|ref|YP_001703173.1| oxidoreductase [Mycobacterium abscessus ATCC 19977]
gi|169241491|emb|CAM62519.1| Probable oxidoreductase [Mycobacterium abscessus]
Length=1257
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/94 (37%), Positives = 37/94 (40%), Gaps = 41/94 (43%)
Query 14 PDAVGEAIGIAADDIPMAARWIG------------------------------------- 36
P V G+A +DI AARWIG
Sbjct 247 PAVVAAITGLAEEDIRTAARWIGEAREWMTLWTMGLNQSTHGTWNTNAICNLHLATGAIC 306
Query 37 ---SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR V
Sbjct 307 RPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 340
>gi|343925197|ref|ZP_08764724.1| putative nitrate/sulfite reductase [Gordonia alkanivorans NBRC
16433]
gi|343764794|dbj|GAA11650.1| putative nitrate/sulfite reductase [Gordonia alkanivorans NBRC
16433]
Length=1362
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/33 (70%), Positives = 25/33 (76%), Gaps = 1/33 (3%)
Query 35 IGSRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
+GS P SL GQPN MG EMGY+GPGL GQR V
Sbjct 318 LGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 350
Score = 38.9 bits (89), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 24/48 (50%), Positives = 29/48 (61%), Gaps = 2/48 (4%)
Query 82 PVIASVDERLSVLKPVRPRLHSISFIFKGRPGEVYLTVTG--YNFRGV 127
P+ A VDE L+VLKP+ PR +SIS P EV LTV YN G+
Sbjct 1132 PIHADVDEWLAVLKPLAPRSYSISSSPLESPDEVQLTVNAVRYNRCGI 1179
>gi|302554769|ref|ZP_07307111.1| formate dehydrogenase, alpha subunit [Streptomyces viridochromogenes
DSM 40736]
gi|302472387|gb|EFL35480.1| formate dehydrogenase, alpha subunit [Streptomyces viridochromogenes
DSM 40736]
Length=899
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/119 (34%), Positives = 48/119 (41%), Gaps = 46/119 (38%)
Query 6 LPQ-IGRLTPDAVGEAIGIAADDIPMAARWIGSR-------------------------- 38
LP+ + TP AV E G+A DD+ AAR IG+
Sbjct 266 LPEFLADYTPAAVAEITGLAEDDLREAARLIGAAGRRWMSCWTMGLNQSTHGTWNTNALV 325
Query 39 ---------------PCSLIGQPNTMGD-EMGYLGPGLAGQRCVDRLVMGASRSTCSRL 81
P SL GQPN MG EMGY+GPGL GQR L++ RS L
Sbjct 326 NLHLATGAICRPGAGPFSLTGQPNAMGGREMGYMGPGLPGQRS---LLVAEERSFTEEL 381
>gi|296166335|ref|ZP_06848771.1| bifunctional reductase [Mycobacterium parascrofulaceum ATCC BAA-614]
gi|295898298|gb|EFG77868.1| bifunctional reductase [Mycobacterium parascrofulaceum ATCC BAA-614]
Length=990
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/32 (75%), Positives = 24/32 (75%), Gaps = 1/32 (3%)
Query 36 GSRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
GS P SL GQPN MG EMGYLGPGL GQR V
Sbjct 312 GSGPFSLTGQPNAMGGREMGYLGPGLPGQRSV 343
>gi|167615918|ref|ZP_02384553.1| nitrate reductase [Burkholderia thailandensis Bt4]
Length=1422
Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/99 (34%), Positives = 39/99 (40%), Gaps = 41/99 (41%)
Query 9 IGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------------- 36
+G TP+ V G+ DI +AA+WIG
Sbjct 257 LGDYTPEKVAAITGLTEADIRLAAQWIGDAREWTSCWTMGLNQSTHGTWNTNAICNLHLA 316
Query 37 --------SRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
S P SL GQPN MG EMGY+GPGL GQR
Sbjct 317 TGKICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSA 355
>gi|167565877|ref|ZP_02358793.1| nitrate reductase, putative [Burkholderia oklahomensis EO147]
Length=801
Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/93 (37%), Positives = 38/93 (41%), Gaps = 41/93 (44%)
Query 13 TPDAVGEAIGIAADDIPMAARWIG------------------------------------ 36
TP+ V GIA DI +AA+WIG
Sbjct 261 TPEKVAAITGIAEADIRLAAQWIGDAREWTSCWTMGLNQSTHGTWNTNAICNLHLATGRI 320
Query 37 ----SRPCSLIGQPNTMGD-EMGYLGPGLAGQR 64
S P SL GQPN MG EMGY+GPGL GQR
Sbjct 321 CRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQR 353
>gi|83716994|ref|YP_439369.1| nitrate reductase [Burkholderia thailandensis E264]
gi|257142490|ref|ZP_05590752.1| nitrate reductase [Burkholderia thailandensis E264]
gi|83650819|gb|ABC34883.1| nitrate reductase [Burkholderia thailandensis E264]
Length=1427
Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/97 (35%), Positives = 39/97 (41%), Gaps = 41/97 (42%)
Query 9 IGRLTPDAVGEAIGIAADDIPMAARWIG-------------------------------- 36
+G TP+ V G+ DI +AA+WIG
Sbjct 257 LGDYTPEKVAAITGLTEADIRLAAQWIGDAREWTSCWTMGLNQSTHGTWNTNAICNLHLA 316
Query 37 --------SRPCSLIGQPNTMGD-EMGYLGPGLAGQR 64
S P SL GQPN MG EMGY+GPGL GQR
Sbjct 317 TGKICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQR 353
>gi|296162575|ref|ZP_06845364.1| molybdopterin oxidoreductase [Burkholderia sp. Ch1-1]
gi|295887199|gb|EFG67028.1| molybdopterin oxidoreductase [Burkholderia sp. Ch1-1]
Length=1401
Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/105 (36%), Positives = 43/105 (41%), Gaps = 44/105 (41%)
Query 3 PVFLPQIGRLTPDAVGEAIGIAADDIPMAAR-------WI-------------------- 35
P FL TP+ V + G+A +DI AA+ WI
Sbjct 254 PAFLAD---YTPEKVAQITGLAVEDIRAAAQMIGCANEWISCWTMGLNQSTHGTWHTNAI 310
Query 36 -------------GSRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
GS P SL GQPN MG EMGY+GPGL GQR V
Sbjct 311 CNLHLATGKICRRGSGPFSLTGQPNAMGGREMGYMGPGLPGQRSV 355
Score = 36.2 bits (82), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/74 (33%), Positives = 34/74 (46%), Gaps = 9/74 (12%)
Query 51 DEMGYLGPGLAGQRCVDRLVMGASRSTCSRLPVIASVDERLSVLKPVRPRLHSISFIFKG 110
D L L GQ+ D L PV + E +LK ++PRL+SI+ K
Sbjct 1149 DRKADLKQWLWGQQIADVL---------HEFPVSLTASELTGMLKRLQPRLYSIASSPKA 1199
Query 111 RPGEVYLTVTGYNF 124
GEV+LTV+ +
Sbjct 1200 HAGEVHLTVSAVRY 1213
>gi|29828872|ref|NP_823506.1| assimilatory nitrate reductase large subunit [Streptomyces avermitilis
MA-4680]
gi|29605977|dbj|BAC70041.1| putative assimilatory nitrate reductase large subunit [Streptomyces
avermitilis MA-4680]
Length=1352
Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/32 (72%), Positives = 24/32 (75%), Gaps = 1/32 (3%)
Query 36 GSRPCSLIGQPNTMGD-EMGYLGPGLAGQRCV 66
GS P SL GQPN MG EMGY+GPGL GQR V
Sbjct 339 GSGPLSLTGQPNAMGGREMGYMGPGLPGQRSV 370
>gi|330993629|ref|ZP_08317563.1| Nitrate reductase [Gluconacetobacter sp. SXCC-1]
gi|329759203|gb|EGG75713.1| Nitrate reductase [Gluconacetobacter sp. SXCC-1]
Length=874
Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/99 (35%), Positives = 38/99 (39%), Gaps = 41/99 (41%)
Query 7 PQIGRLTPDAVGEAIGIAADDIPMAARWI------------------------------- 35
P + P V E G+ +DI AARWI
Sbjct 255 PFLDEYAPQTVAEITGLPVEDILTAARWIAEAGEWMSLWTMGLNQSIAGTWNTNALCNLH 314
Query 36 ---------GSRPCSLIGQPNTMGD-EMGYLGPGLAGQR 64
GS P SL GQPN MG EMGY+GPGL GQR
Sbjct 315 LATGAICRPGSGPFSLTGQPNAMGGREMGYMGPGLPGQR 353
Lambda K H
0.323 0.143 0.444
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128801298864
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40