BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3589
Length=304
Score E
Sequences producing significant alignments: (Bits) Value
gi|289755717|ref|ZP_06515095.1| adenine glycosylase MutY [Mycoba... 607 9e-172
gi|15610725|ref|NP_218106.1| adenine glycosylase MutY [Mycobacte... 607 1e-171
gi|298527068|ref|ZP_07014477.1| adenine glycosylase mutY [Mycoba... 604 6e-171
gi|289572241|ref|ZP_06452468.1| adenine glycosylase mutY [Mycoba... 604 6e-171
gi|289759747|ref|ZP_06519125.1| adenine glycosylase MutY [Mycoba... 596 1e-168
gi|340628555|ref|YP_004747007.1| putative adenine glycosylase MU... 590 7e-167
gi|167967107|ref|ZP_02549384.1| adenine glycosylase mutY [Mycoba... 590 9e-167
gi|308369168|ref|ZP_07416810.2| adenine glycosylase mutY [Mycoba... 586 1e-165
gi|339296408|gb|AEJ48519.1| adenine glycosylase MutY [Mycobacter... 581 4e-164
gi|240172175|ref|ZP_04750834.1| MutY [Mycobacterium kansasii ATC... 462 2e-128
gi|339300009|gb|AEJ52119.1| adenine glycosylase MutY [Mycobacter... 461 4e-128
gi|296166645|ref|ZP_06849071.1| probable A/G-specific adenine gl... 455 4e-126
gi|41406567|ref|NP_959403.1| MutY [Mycobacterium avium subsp. pa... 453 2e-125
gi|118466018|ref|YP_879844.1| A/G-specific adenine glycosylase [... 449 3e-124
gi|336458463|gb|EGO37436.1| A/G-specific DNA glycosylase [Mycoba... 448 4e-124
gi|342862427|ref|ZP_08719067.1| adenine glycosylase MutY [Mycoba... 448 6e-124
gi|118619343|ref|YP_907675.1| adenine glycosylase MutY [Mycobact... 444 8e-123
gi|120406294|ref|YP_956123.1| HhH-GPD family protein [Mycobacter... 429 3e-118
gi|183985057|ref|YP_001853348.1| adenine glycosylase MutY [Mycob... 428 5e-118
gi|15828031|ref|NP_302294.1| DNA glycosylase [Mycobacterium lepr... 423 2e-116
gi|108801709|ref|YP_641906.1| HhH-GPD domain-containing protein ... 417 1e-114
gi|254823229|ref|ZP_05228230.1| MutY [Mycobacterium intracellula... 407 1e-111
gi|145222030|ref|YP_001132708.1| HhH-GPD family protein [Mycobac... 406 2e-111
gi|315446228|ref|YP_004079107.1| A/G-specific DNA glycosylase [M... 405 5e-111
gi|333992521|ref|YP_004525135.1| adenine glycosylase MutY [Mycob... 382 3e-104
gi|229494157|ref|ZP_04387920.1| base excision DNA repair protein... 377 1e-102
gi|169627661|ref|YP_001701310.1| adenine glycosylase MutY [Mycob... 377 1e-102
gi|118470417|ref|YP_890305.1| base excision DNA repair protein, ... 375 3e-102
gi|226304159|ref|YP_002764117.1| adenine glycosylase [Rhodococcu... 375 4e-102
gi|333918024|ref|YP_004491605.1| putative adenine glycosylase [A... 371 6e-101
gi|325675006|ref|ZP_08154693.1| A/G-specific adenine DNA glycosy... 370 1e-100
gi|312138090|ref|YP_004005426.1| hhh-gpd family DNA repair prote... 369 4e-100
gi|54022393|ref|YP_116635.1| putative A/G-specific adenine glyco... 368 5e-100
gi|326383086|ref|ZP_08204775.1| HhH-GPD family protein [Gordonia... 366 3e-99
gi|226363775|ref|YP_002781557.1| adenine glycosylase [Rhodococcu... 361 1e-97
gi|111021420|ref|YP_704392.1| A/G-specific adenine DNA glycosyla... 359 3e-97
gi|343925913|ref|ZP_08765428.1| adenine glycosylase [Gordonia al... 358 5e-97
gi|262200959|ref|YP_003272167.1| HhH-GPD family protein [Gordoni... 354 8e-96
gi|296138329|ref|YP_003645572.1| HhH-GPD family protein [Tsukamu... 347 1e-93
gi|300790508|ref|YP_003770799.1| A/G-specific adenine glycosylas... 346 3e-93
gi|340532193|gb|AEK47398.1| A/G-specific adenine glycosylase [Am... 345 4e-93
gi|134097045|ref|YP_001102706.1| A/G-specific adenine glycosylas... 340 1e-91
gi|331699823|ref|YP_004336062.1| HhH-GPD family protein [Pseudon... 340 2e-91
gi|302530747|ref|ZP_07283089.1| A/G-specific adenine glycosylase... 335 4e-90
gi|256374498|ref|YP_003098158.1| HhH-GPD family protein [Actinos... 329 4e-88
gi|324997777|ref|ZP_08118889.1| A/G-specific adenine glycosylase... 324 8e-87
gi|257057486|ref|YP_003135318.1| A/G-specific DNA glycosylase [S... 324 9e-87
gi|269128666|ref|YP_003302036.1| HhH-GPD family protein [Thermom... 304 1e-80
gi|72163274|ref|YP_290931.1| HhH-GPD:Iron-sulfur cluster loop [T... 300 2e-79
gi|284989237|ref|YP_003407791.1| HhH-GPD family protein [Geoderm... 297 2e-78
>gi|289755717|ref|ZP_06515095.1| adenine glycosylase MutY [Mycobacterium tuberculosis EAS054]
gi|289696304|gb|EFD63733.1| adenine glycosylase MutY [Mycobacterium tuberculosis EAS054]
gi|323717783|gb|EGB26981.1| adenine glycosylase mutY [Mycobacterium tuberculosis CDC1551A]
Length=307
Score = 607 bits (1564), Expect = 9e-172, Method: Compositional matrix adjust.
Identities = 304/304 (100%), Positives = 304/304 (100%), Gaps = 0/304 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL
Sbjct 4 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 63
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE
Sbjct 64 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 123
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA
Sbjct 124 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 183
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG
Sbjct 184 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 243
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP
Sbjct 244 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 303
Query 301 GEGF 304
GEGF
Sbjct 304 GEGF 307
>gi|15610725|ref|NP_218106.1| adenine glycosylase MutY [Mycobacterium tuberculosis H37Rv]
gi|15843202|ref|NP_338239.1| A/G-specific adenine glycosylase, putative [Mycobacterium tuberculosis
CDC1551]
gi|31794766|ref|NP_857259.1| putative adenine glycosylase [Mycobacterium bovis AF2122/97]
48 more sequence titles
Length=304
Score = 607 bits (1564), Expect = 1e-171, Method: Compositional matrix adjust.
Identities = 304/304 (100%), Positives = 304/304 (100%), Gaps = 0/304 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL
Sbjct 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE
Sbjct 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA
Sbjct 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG
Sbjct 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP
Sbjct 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
Query 301 GEGF 304
GEGF
Sbjct 301 GEGF 304
>gi|298527068|ref|ZP_07014477.1| adenine glycosylase mutY [Mycobacterium tuberculosis 94_M4241A]
gi|298496862|gb|EFI32156.1| adenine glycosylase mutY [Mycobacterium tuberculosis 94_M4241A]
Length=304
Score = 604 bits (1557), Expect = 6e-171, Method: Compositional matrix adjust.
Identities = 303/304 (99%), Positives = 303/304 (99%), Gaps = 0/304 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL
Sbjct 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE
Sbjct 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA
Sbjct 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLPHR TAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG
Sbjct 181 LLPHRATAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP
Sbjct 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
Query 301 GEGF 304
GEGF
Sbjct 301 GEGF 304
>gi|289572241|ref|ZP_06452468.1| adenine glycosylase mutY [Mycobacterium tuberculosis K85]
gi|339633588|ref|YP_004725230.1| adenine glycosylase MUTY [Mycobacterium africanum GM041182]
gi|289536672|gb|EFD41250.1| adenine glycosylase mutY [Mycobacterium tuberculosis K85]
gi|339332944|emb|CCC28671.1| putative adenine glycosylase MUTY [Mycobacterium africanum GM041182]
Length=304
Score = 604 bits (1557), Expect = 6e-171, Method: Compositional matrix adjust.
Identities = 303/304 (99%), Positives = 303/304 (99%), Gaps = 0/304 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL
Sbjct 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE
Sbjct 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA
Sbjct 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQ YTG
Sbjct 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQDYTG 240
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP
Sbjct 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
Query 301 GEGF 304
GEGF
Sbjct 301 GEGF 304
>gi|289759747|ref|ZP_06519125.1| adenine glycosylase MutY [Mycobacterium tuberculosis T85]
gi|289715311|gb|EFD79323.1| adenine glycosylase MutY [Mycobacterium tuberculosis T85]
Length=306
Score = 596 bits (1536), Expect = 1e-168, Method: Compositional matrix adjust.
Identities = 299/300 (99%), Positives = 300/300 (100%), Gaps = 0/300 (0%)
Query 5 LPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWP 64
+PEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWP
Sbjct 1 MPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWP 60
Query 65 DWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVT 124
DWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVT
Sbjct 61 DWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVT 120
Query 125 LPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPH 184
LPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPH
Sbjct 121 LPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPH 180
Query 185 RETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQ 244
RETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQ
Sbjct 181 RETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQ 240
Query 245 VRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEGF 304
VRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEGF
Sbjct 241 VRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEGF 300
>gi|340628555|ref|YP_004747007.1| putative adenine glycosylase MUTY [Mycobacterium canettii CIPT
140010059]
gi|340006745|emb|CCC45933.1| putative adenine glycosylase MUTY [Mycobacterium canettii CIPT
140010059]
Length=304
Score = 590 bits (1522), Expect = 7e-167, Method: Compositional matrix adjust.
Identities = 294/304 (97%), Positives = 296/304 (98%), Gaps = 0/304 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTP ARVL
Sbjct 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPVARVL 60
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE
Sbjct 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA
Sbjct 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG
Sbjct 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRLLDVLR +FP T+AELDVAWLTDT QRDRALESLLAD LVTRT DGRFAL
Sbjct 241 TDRQVRGRLLDVLRTKDFPATQAELDVAWLTDTQQRDRALESLLADGLVTRTADGRFALA 300
Query 301 GEGF 304
GEGF
Sbjct 301 GEGF 304
>gi|167967107|ref|ZP_02549384.1| adenine glycosylase mutY [Mycobacterium tuberculosis H37Ra]
Length=467
Score = 590 bits (1521), Expect = 9e-167, Method: Compositional matrix adjust.
Identities = 294/304 (97%), Positives = 295/304 (98%), Gaps = 0/304 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL
Sbjct 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE
Sbjct 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA
Sbjct 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG
Sbjct 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTR P
Sbjct 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRRSMAGSRCP 300
Query 301 GEGF 304
+GF
Sbjct 301 AKGF 304
>gi|308369168|ref|ZP_07416810.2| adenine glycosylase mutY [Mycobacterium tuberculosis SUMu002]
gi|308371393|ref|ZP_07424818.2| adenine glycosylase mutY [Mycobacterium tuberculosis SUMu003]
gi|308372589|ref|ZP_07429183.2| adenine glycosylase mutY [Mycobacterium tuberculosis SUMu004]
9 more sequence titles
Length=295
Score = 586 bits (1511), Expect = 1e-165, Method: Compositional matrix adjust.
Identities = 294/295 (99%), Positives = 295/295 (100%), Gaps = 0/295 (0%)
Query 10 VTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRR 69
+TGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRR
Sbjct 1 MTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRR 60
Query 70 WPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVG 129
WPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVG
Sbjct 61 WPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVG 120
Query 130 SYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAP 189
SYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAP
Sbjct 121 SYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAP 180
Query 190 EFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRL 249
EFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRL
Sbjct 181 EFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRL 240
Query 250 LDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEGF 304
LDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEGF
Sbjct 241 LDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEGF 295
>gi|339296408|gb|AEJ48519.1| adenine glycosylase MutY [Mycobacterium tuberculosis CCDC5079]
Length=295
Score = 581 bits (1498), Expect = 4e-164, Method: Compositional matrix adjust.
Identities = 292/295 (99%), Positives = 293/295 (99%), Gaps = 0/295 (0%)
Query 10 VTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRR 69
+TGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRR
Sbjct 1 MTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRR 60
Query 70 WPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVG 129
WPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVG
Sbjct 61 WPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVG 120
Query 130 SYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAP 189
SY ARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAP
Sbjct 121 SYPARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAP 180
Query 190 EFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRL 249
EFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRL
Sbjct 181 EFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRL 240
Query 250 LDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEGF 304
LDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALV RTVDGRFALPGEGF
Sbjct 241 LDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVIRTVDGRFALPGEGF 295
>gi|240172175|ref|ZP_04750834.1| MutY [Mycobacterium kansasii ATCC 12478]
Length=303
Score = 462 bits (1190), Expect = 2e-128, Method: Compositional matrix adjust.
Identities = 241/302 (80%), Positives = 253/302 (84%), Gaps = 0/302 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
M ILP VT P+HI NLLAWY+RS RDLPWR P V WQILVSEFMLQQTP +RVL
Sbjct 1 MSRILPYQPVTAPQHIPSANLLAWYERSRRDLPWRRPDVGAWQILVSEFMLQQTPVSRVL 60
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
+IWPDWVRRWPT SATA A+ ADVLRAWGKLGYPRRAKRLHECAT+IARDH DVVPDD+E
Sbjct 61 SIWPDWVRRWPTASATAAATAADVLRAWGKLGYPRRAKRLHECATIIARDHGDVVPDDVE 120
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
LV LPGVGSYTARAVACFAY QRVPVVDTNVRRVVARAVHGRADAGA S RDHADVLA
Sbjct 121 TLVKLPGVGSYTARAVACFAYGQRVPVVDTNVRRVVARAVHGRADAGASSAARDHADVLA 180
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLP+ A FSVALMELGATVCTAR PRCG CPL CAWR AGYP + GP RR Q Y G
Sbjct 181 LLPNSAVAKHFSVALMELGATVCTARAPRCGSCPLSDCAWRTAGYPAATGPARRVQTYAG 240
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRLLDVLR + PV+RAELDVAWLTD AQRDRAL+SLL D LVTRT DGRFAL
Sbjct 241 TDRQVRGRLLDVLRGNDSPVSRAELDVAWLTDVAQRDRALDSLLVDGLVTRTPDGRFALA 300
Query 301 GE 302
GE
Sbjct 301 GE 302
>gi|339300009|gb|AEJ52119.1| adenine glycosylase MutY [Mycobacterium tuberculosis CCDC5180]
Length=263
Score = 461 bits (1187), Expect = 4e-128, Method: Compositional matrix adjust.
Identities = 233/233 (100%), Positives = 233/233 (100%), Gaps = 0/233 (0%)
Query 72 TPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSY 131
TPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSY
Sbjct 31 TPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSY 90
Query 132 TARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEF 191
TARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEF
Sbjct 91 TARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEF 150
Query 192 SVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLD 251
SVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLD
Sbjct 151 SVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLD 210
Query 252 VLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEGF 304
VLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEGF
Sbjct 211 VLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEGF 263
>gi|296166645|ref|ZP_06849071.1| probable A/G-specific adenine glycosylase [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897980|gb|EFG77560.1| probable A/G-specific adenine glycosylase [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=306
Score = 455 bits (1170), Expect = 4e-126, Method: Compositional matrix adjust.
Identities = 244/303 (81%), Positives = 259/303 (86%), Gaps = 0/303 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
M I+P V GP IS TNLL WY S RDLPWREPGVS WQILVSEFMLQQTP +RVL
Sbjct 4 MADIMPHSPVNGPERISVTNLLDWYDGSRRDLPWREPGVSAWQILVSEFMLQQTPVSRVL 63
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
IW +WVRRWPTPSATA A+ ADVLRAWGKLGYPRRAKRLHECATVIARDH+DVVPDD++
Sbjct 64 PIWSEWVRRWPTPSATAAATAADVLRAWGKLGYPRRAKRLHECATVIARDHHDVVPDDVD 123
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
L+TLPG+G YTARA+ACFAYRQ VPVVDTNVRRVVARAV G AD G S RDHADV A
Sbjct 124 TLLTLPGIGGYTARAIACFAYRQPVPVVDTNVRRVVARAVRGLADGGPASAARDHADVAA 183
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLP TAP+FSVALMELGATVCTAR PRCGLCPL CAWR AGYPP+ GPPRR QAY G
Sbjct 184 LLPSNGTAPKFSVALMELGATVCTARAPRCGLCPLPACAWRDAGYPPAQGPPRRAQAYAG 243
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRLLDVLRA + PV+RAELDVAWLTDTAQRDRAL+SLLAD LVTRTVDGRFAL
Sbjct 244 TDRQVRGRLLDVLRANDAPVSRAELDVAWLTDTAQRDRALDSLLADGLVTRTVDGRFALL 303
Query 301 GEG 303
GEG
Sbjct 304 GEG 306
>gi|41406567|ref|NP_959403.1| MutY [Mycobacterium avium subsp. paratuberculosis K-10]
gi|41394916|gb|AAS02786.1| MutY [Mycobacterium avium subsp. paratuberculosis K-10]
Length=303
Score = 453 bits (1165), Expect = 2e-125, Method: Compositional matrix adjust.
Identities = 242/302 (81%), Positives = 258/302 (86%), Gaps = 0/302 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
M I+P+P G IS T+LL WY+ + RDLPWR PGVS WQILVSEFMLQQTP +RVL
Sbjct 1 MADIMPQPPADGSPLISVTDLLEWYRVARRDLPWRAPGVSAWQILVSEFMLQQTPVSRVL 60
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
IWPDWVRRWPTPSATA AS ADVLRAWGKLGYPRRAKRLHECATVIARDH DVVPDD++
Sbjct 61 PIWPDWVRRWPTPSATAAASAADVLRAWGKLGYPRRAKRLHECATVIARDHGDVVPDDVD 120
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
L+TLPGVG YTARAVACFAYR+ VPVVDTNVRRVVARAVHG+ADAGAPS RDHADV A
Sbjct 121 TLLTLPGVGGYTARAVACFAYRRPVPVVDTNVRRVVARAVHGQADAGAPSAGRDHADVAA 180
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLP +APEFSVALMELGATVCTAR PRCGLCPL CAWR AG+PP+ GP RR Q Y G
Sbjct 181 LLPGDGSAPEFSVALMELGATVCTARAPRCGLCPLRRCAWREAGHPPATGPARRVQTYAG 240
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRLLDVLR + PVTRAELDVAWLTDTAQRDRAL SLLAD LVT+T DGRFAL
Sbjct 241 TDRQVRGRLLDVLRGNDSPVTRAELDVAWLTDTAQRDRALYSLLADGLVTQTTDGRFALA 300
Query 301 GE 302
GE
Sbjct 301 GE 302
>gi|118466018|ref|YP_879844.1| A/G-specific adenine glycosylase [Mycobacterium avium 104]
gi|118167305|gb|ABK68202.1| putative A/G-specific adenine glycosylase [Mycobacterium avium
104]
Length=303
Score = 449 bits (1154), Expect = 3e-124, Method: Compositional matrix adjust.
Identities = 241/302 (80%), Positives = 256/302 (85%), Gaps = 0/302 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
M I+P+P G IS T+LL WY+ + RDLPWR PGVS WQILVSEFMLQQTP +RVL
Sbjct 1 MADIMPQPPAVGSPLISVTDLLEWYRVARRDLPWRAPGVSAWQILVSEFMLQQTPVSRVL 60
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
IWPDWVRRWPTPSATA AS ADVLRAWGKLGYPRRAKRLHECATVIARDH DVVP DI+
Sbjct 61 PIWPDWVRRWPTPSATAAASAADVLRAWGKLGYPRRAKRLHECATVIARDHGDVVPRDID 120
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
L+TLPGVG YTARAVACFAYR+ VPVVDTNVRRVVARAVHG+ADAGAPS RDHADV A
Sbjct 121 TLLTLPGVGGYTARAVACFAYRRPVPVVDTNVRRVVARAVHGQADAGAPSAGRDHADVAA 180
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLP +APEFSVALMELGATVCTAR PRCGLCPL CAWR AG+PP GP RR Q Y G
Sbjct 181 LLPGDGSAPEFSVALMELGATVCTARAPRCGLCPLRRCAWREAGHPPPTGPARRVQTYAG 240
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRL+DVLR + PVTRAELDVAWLTDTAQRDRAL SLLAD LVT+T DGRFAL
Sbjct 241 TDRQVRGRLMDVLRGNDSPVTRAELDVAWLTDTAQRDRALYSLLADGLVTQTADGRFALA 300
Query 301 GE 302
GE
Sbjct 301 GE 302
>gi|336458463|gb|EGO37436.1| A/G-specific DNA glycosylase [Mycobacterium avium subsp. paratuberculosis
S397]
Length=299
Score = 448 bits (1153), Expect = 4e-124, Method: Compositional matrix adjust.
Identities = 240/298 (81%), Positives = 256/298 (86%), Gaps = 0/298 (0%)
Query 5 LPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWP 64
+P+P G IS T+LL WY+ + RDLPWR PGVS WQILVSEFMLQQTP +RVL IWP
Sbjct 1 MPQPPADGSPLISVTDLLEWYRVARRDLPWRAPGVSAWQILVSEFMLQQTPVSRVLPIWP 60
Query 65 DWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVT 124
DWVRRWPTPSATA AS ADVLRAWGKLGYPRRAKRLHECATVIARDH DVVPDD++ L+T
Sbjct 61 DWVRRWPTPSATAAASAADVLRAWGKLGYPRRAKRLHECATVIARDHGDVVPDDVDTLLT 120
Query 125 LPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPH 184
LPGVG YTARAVACFAYR+ VPVVDTNVRRVVARAVHG+ADAGAPS RDHADV ALLP
Sbjct 121 LPGVGGYTARAVACFAYRRPVPVVDTNVRRVVARAVHGQADAGAPSAGRDHADVAALLPG 180
Query 185 RETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQ 244
+APEFSVALMELGATVCTAR PRCGLCPL CAWR AG+PP+ GP RR Q Y GTDRQ
Sbjct 181 DGSAPEFSVALMELGATVCTARAPRCGLCPLRRCAWREAGHPPATGPARRVQTYAGTDRQ 240
Query 245 VRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
VRGRLLDVLR + PVTRAELDVAWLTDTAQRDRAL SLLAD LVT+T DGRFAL GE
Sbjct 241 VRGRLLDVLRGNDSPVTRAELDVAWLTDTAQRDRALYSLLADGLVTQTTDGRFALAGE 298
>gi|342862427|ref|ZP_08719067.1| adenine glycosylase MutY [Mycobacterium colombiense CECT 3035]
gi|342130098|gb|EGT83432.1| adenine glycosylase MutY [Mycobacterium colombiense CECT 3035]
Length=303
Score = 448 bits (1152), Expect = 6e-124, Method: Compositional matrix adjust.
Identities = 244/302 (81%), Positives = 259/302 (86%), Gaps = 0/302 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
M I+P P V GP IS TNL+ WY+ S RDLPWREPGVS WQILVSEFMLQQTP +RVL
Sbjct 1 MADIMPHPPVAGPEFISVTNLVHWYESSRRDLPWREPGVSAWQILVSEFMLQQTPVSRVL 60
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
IW DWVRRWPTPSATA AS ADVLRAWGKLGYPRRAKRLHECATVIARDH DVVPDD++
Sbjct 61 PIWTDWVRRWPTPSATAAASAADVLRAWGKLGYPRRAKRLHECATVIARDHGDVVPDDVD 120
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
L+TLPGVGSYTARA+ACFAY Q VPVVDTNVRRVVAR VHG ADAGAPS RDHADV A
Sbjct 121 TLLTLPGVGSYTARAIACFAYGQSVPVVDTNVRRVVARVVHGLADAGAPSATRDHADVSA 180
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLP ETAP+FS+ALMELGATVCTAR PRCGLCPLD CAWR AG+PP+ GP RR Q Y G
Sbjct 181 LLPADETAPKFSIALMELGATVCTARAPRCGLCPLDRCAWRDAGHPPAQGPARRVQTYAG 240
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRL+DVLRA + PVTRAELD AWLTDTAQRDRAL SLLAD LVT+T DGRFAL
Sbjct 241 TDRQVRGRLMDVLRANDSPVTRAELDAAWLTDTAQRDRALYSLLADGLVTQTRDGRFALA 300
Query 301 GE 302
GE
Sbjct 301 GE 302
>gi|118619343|ref|YP_907675.1| adenine glycosylase MutY [Mycobacterium ulcerans Agy99]
gi|118571453|gb|ABL06204.1| adenine glycosylase MutY [Mycobacterium ulcerans Agy99]
Length=303
Score = 444 bits (1142), Expect = 8e-123, Method: Compositional matrix adjust.
Identities = 243/302 (81%), Positives = 262/302 (87%), Gaps = 0/302 (0%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
MP I+P+PS+T P HI TNL+AWY+R +RDLPWR P VSPWQILVSEFMLQQTP +RVL
Sbjct 1 MPAIVPQPSMTCPEHIPATNLVAWYERCYRDLPWRAPDVSPWQILVSEFMLQQTPVSRVL 60
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
+IWPDWVRRWPT SATATA+ ADVLRAWGKLGYPRRAKRLHECA VIA DH+DVVPDD++
Sbjct 61 SIWPDWVRRWPTASATATATAADVLRAWGKLGYPRRAKRLHECAMVIAGDHDDVVPDDVD 120
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
L+TLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRA+AG+PS RDHA+V A
Sbjct 121 TLLTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRAEAGSPSATRDHAEVSA 180
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLP E AP FSVALMELGATVCTAR PRCG CPL CAWR AGYPP+ GP RR Q Y G
Sbjct 181 LLPDDELAPRFSVALMELGATVCTARAPRCGQCPLAECAWRRAGYPPATGPARRVQTYAG 240
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRGRLLDVLR PVTRA+LDVAWLTDTAQRDRAL SLLAD LVT+T DG FAL
Sbjct 241 TDRQVRGRLLDVLRDNNGPVTRAQLDVAWLTDTAQRDRALYSLLADGLVTQTADGLFALA 300
Query 301 GE 302
GE
Sbjct 301 GE 302
>gi|120406294|ref|YP_956123.1| HhH-GPD family protein [Mycobacterium vanbaalenii PYR-1]
gi|119959112|gb|ABM16117.1| HhH-GPD family protein [Mycobacterium vanbaalenii PYR-1]
Length=290
Score = 429 bits (1103), Expect = 3e-118, Method: Compositional matrix adjust.
Identities = 216/282 (77%), Positives = 233/282 (83%), Gaps = 1/282 (0%)
Query 21 LLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAS 80
LL WY+ + RDLPWR PGVS WQILVSEFMLQQTP ARV IW DW+ RWPTPSATA A
Sbjct 7 LLDWYEHAQRDLPWRRPGVSAWQILVSEFMLQQTPVARVEPIWLDWIARWPTPSATAAAG 66
Query 81 TADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFA 140
AD+LRAWGKLGYPRRAKRLHECATVIA DH+D VPDD+E L+TLPG+G+YTARAVACFA
Sbjct 67 PADILRAWGKLGYPRRAKRLHECATVIATDHDDTVPDDVETLLTLPGIGTYTARAVACFA 126
Query 141 YRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELGA 200
YRQRVPVVDTNVRRVVARAVHG ADAG+PS RD DV LLP +TAP FSVALME GA
Sbjct 127 YRQRVPVVDTNVRRVVARAVHGLADAGSPS-SRDLGDVETLLPADDTAPRFSVALMEFGA 185
Query 201 TVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEFPV 260
TVCTAR PRCG+CPL C WR G+P GP RR Q Y GTDRQ RGRLLDVLR ++ PV
Sbjct 186 TVCTARAPRCGVCPLQRCGWRARGFPAGTGPARRAQKYAGTDRQARGRLLDVLRGSDSPV 245
Query 261 TRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
TRAELDVAW TDTAQRDRAL+SLL D LV +T DGRFAL GE
Sbjct 246 TRAELDVAWTTDTAQRDRALDSLLVDGLVEQTADGRFALCGE 287
>gi|183985057|ref|YP_001853348.1| adenine glycosylase MutY [Mycobacterium marinum M]
gi|183178383|gb|ACC43493.1| adenine glycosylase MutY [Mycobacterium marinum M]
Length=294
Score = 428 bits (1101), Expect = 5e-118, Method: Compositional matrix adjust.
Identities = 236/294 (81%), Positives = 253/294 (87%), Gaps = 0/294 (0%)
Query 10 VTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRR 69
+T P HI TNL+AWY+R RDLPWR P VSPWQILVSEFMLQQTP +RVL+IWPDWVRR
Sbjct 1 MTCPEHIPATNLVAWYERCCRDLPWRAPDVSPWQILVSEFMLQQTPVSRVLSIWPDWVRR 60
Query 70 WPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVG 129
WPT SATATA+ ADVLRAWGKLGYPRRAKRLHECA V+A DH+DVVPDD++ L+TLPGVG
Sbjct 61 WPTASATATATAADVLRAWGKLGYPRRAKRLHECAMVVAGDHDDVVPDDVDTLLTLPGVG 120
Query 130 SYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAP 189
SYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRA+AG+PS RDHA+V ALLP E AP
Sbjct 121 SYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRAEAGSPSATRDHAEVSALLPDDELAP 180
Query 190 EFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRL 249
FSVALMELGATVCTAR PRCG CPL CAWR AGYPP+ GP RR Q Y GTDRQVRGRL
Sbjct 181 RFSVALMELGATVCTARAPRCGQCPLAECAWRRAGYPPATGPARRVQTYAGTDRQVRGRL 240
Query 250 LDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
LDVLR PVTRA+LDVAWLTDTAQRDRAL SLLAD LVT+T DG FAL GE
Sbjct 241 LDVLRDNNGPVTRAQLDVAWLTDTAQRDRALYSLLADGLVTQTADGLFALAGEA 294
>gi|15828031|ref|NP_302294.1| DNA glycosylase [Mycobacterium leprae TN]
gi|221230508|ref|YP_002503924.1| putative DNA glycosylase [Mycobacterium leprae Br4923]
gi|13093584|emb|CAC30875.1| probable DNA glycosylase [Mycobacterium leprae]
gi|219933615|emb|CAR72017.1| probable DNA glycosylase [Mycobacterium leprae Br4923]
Length=297
Score = 423 bits (1087), Expect = 2e-116, Method: Compositional matrix adjust.
Identities = 227/303 (75%), Positives = 257/303 (85%), Gaps = 6/303 (1%)
Query 1 MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVL 60
M HI+P+ IS LL WY++S RDLPWR+PGV+ WQILVSEFMLQQTP ARVL
Sbjct 1 MAHIVPQIK------ISTGGLLDWYRQSCRDLPWRQPGVTAWQILVSEFMLQQTPVARVL 54
Query 61 AIWPDWVRRWPTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIE 120
+IWPDWV+RWPTPS TA AS ADVLRAWGKLGYPRRAKRLHE AT+IAR+H+DVVPDD++
Sbjct 55 SIWPDWVQRWPTPSTTAAASAADVLRAWGKLGYPRRAKRLHESATIIAREHDDVVPDDVD 114
Query 121 ILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLA 180
+L+T+PG+GSYTARAVACFAY Q VPVVDTNVRRVVARAVHGRADAG PS RDHADV +
Sbjct 115 VLLTMPGIGSYTARAVACFAYHQPVPVVDTNVRRVVARAVHGRADAGPPSATRDHADVSS 174
Query 181 LLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG 240
LLP ++ AP+FS+ALMELGA VCTAR+PRCGLCPL+ CAWRHAGYPP+ GP RR Q Y G
Sbjct 175 LLPGKKAAPQFSMALMELGAIVCTARSPRCGLCPLNECAWRHAGYPPAQGPARRAQQYAG 234
Query 241 TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
TDRQVRG LLDVLRA + PVT EL+V WLT++ Q DRAL+SLLAD LVTRT DGRFAL
Sbjct 235 TDRQVRGLLLDVLRAKDSPVTGTELNVTWLTNSIQLDRALDSLLADGLVTRTTDGRFALT 294
Query 301 GEG 303
GEG
Sbjct 295 GEG 297
>gi|108801709|ref|YP_641906.1| HhH-GPD domain-containing protein [Mycobacterium sp. MCS]
gi|119870860|ref|YP_940812.1| HhH-GPD family protein [Mycobacterium sp. KMS]
gi|126437696|ref|YP_001073387.1| HhH-GPD family protein [Mycobacterium sp. JLS]
gi|108772128|gb|ABG10850.1| HhH-GPD [Mycobacterium sp. MCS]
gi|119696949|gb|ABL94022.1| HhH-GPD family protein [Mycobacterium sp. KMS]
gi|126237496|gb|ABO00897.1| HhH-GPD family protein [Mycobacterium sp. JLS]
Length=288
Score = 417 bits (1071), Expect = 1e-114, Method: Compositional matrix adjust.
Identities = 210/288 (73%), Positives = 230/288 (80%), Gaps = 2/288 (0%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
I ++LAWY R RDLPWR PGV+PWQILVSEFMLQQTP ARV IW W+ RWPTPSA
Sbjct 2 IDPADVLAWYDREQRDLPWRRPGVTPWQILVSEFMLQQTPVARVEPIWLSWIERWPTPSA 61
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
TA A ADVLRAWGKLGYPRRAKRLHECATVIA +H D VP D+E+L+TLPG+G+YTARA
Sbjct 62 TAAAGVADVLRAWGKLGYPRRAKRLHECATVIAIEHGDEVPSDVEVLLTLPGIGAYTARA 121
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVAL 195
VACFAY QRVPVVDTNVRRV+ARAVHGRAD +P RD DV LLP AP FSVA+
Sbjct 122 VACFAYGQRVPVVDTNVRRVIARAVHGRAD--SPPSSRDLDDVATLLPEGPEAPRFSVAV 179
Query 196 MELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRA 255
MELGATVCTARTPRCGLCPL C WR GYP + P R+ Q Y GTDRQVRGRLLDVLR
Sbjct 180 MELGATVCTARTPRCGLCPLGTCTWRSLGYPAASAPARKVQRYAGTDRQVRGRLLDVLRG 239
Query 256 AEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
+ PV+R +LDV WLTD AQRDRAL+SLL D LV +T DGRFAL GEG
Sbjct 240 SSSPVSRDQLDVVWLTDPAQRDRALDSLLIDGLVEQTADGRFALAGEG 287
>gi|254823229|ref|ZP_05228230.1| MutY [Mycobacterium intracellulare ATCC 13950]
Length=282
Score = 407 bits (1045), Expect = 1e-111, Method: Compositional matrix adjust.
Identities = 236/282 (84%), Positives = 248/282 (88%), Gaps = 0/282 (0%)
Query 22 LAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAST 81
+AWY+ S RDLPWREPGVS WQILVSEFMLQQTP ARVL IW DWVRRWPTPSATA AS
Sbjct 1 MAWYETSRRDLPWREPGVSAWQILVSEFMLQQTPVARVLPIWSDWVRRWPTPSATAAASA 60
Query 82 ADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFAY 141
ADVLRAWGKLGYPRRAKRLHECATVIARDH+DVVPDD++ L+TLPGVGSYTARAVACFAY
Sbjct 61 ADVLRAWGKLGYPRRAKRLHECATVIARDHDDVVPDDVDTLLTLPGVGSYTARAVACFAY 120
Query 142 RQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELGAT 201
R+RVPVVDTNVRRVVAR VHG ADAGAPS RDHADV ALLP TAP+FSVALMELGAT
Sbjct 121 RRRVPVVDTNVRRVVARVVHGLADAGAPSATRDHADVSALLPDDATAPQFSVALMELGAT 180
Query 202 VCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEFPVT 261
VCTAR PRCGLCPL CAWR AGYP + GPPRR Q Y GTDRQVRGRLLDVLR + P +
Sbjct 181 VCTARAPRCGLCPLGQCAWRRAGYPAAQGPPRRVQTYAGTDRQVRGRLLDVLRDKDSPAS 240
Query 262 RAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
RAELDVAWLTDTAQRDRAL SLLAD LVT+T DGRFAL GEG
Sbjct 241 RAELDVAWLTDTAQRDRALYSLLADGLVTQTRDGRFALAGEG 282
>gi|145222030|ref|YP_001132708.1| HhH-GPD family protein [Mycobacterium gilvum PYR-GCK]
gi|145214516|gb|ABP43920.1| HhH-GPD family protein [Mycobacterium gilvum PYR-GCK]
Length=291
Score = 406 bits (1044), Expect = 2e-111, Method: Compositional matrix adjust.
Identities = 220/288 (77%), Positives = 234/288 (82%), Gaps = 1/288 (0%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
I L+ WY + RDL WR PGVS WQILVSEFMLQQTP ARV IW DWV RWPTPSA
Sbjct 2 IDPNELIRWYGTAQRDLAWRRPGVSAWQILVSEFMLQQTPVARVEPIWLDWVSRWPTPSA 61
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
TA AS AD+LRAWGKLGYPRRAKRLHECATVIA +H DVVPDD++ LVTLPGVG+YTARA
Sbjct 62 TAAASAADILRAWGKLGYPRRAKRLHECATVIATEHGDVVPDDVDTLVTLPGVGTYTARA 121
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVAL 195
VACFAYRQRVPVVDTNVRRVVARAVHG DAG PS RD ADV ALLP +TAP FS+A+
Sbjct 122 VACFAYRQRVPVVDTNVRRVVARAVHGLHDAGPPST-RDLADVAALLPDDDTAPHFSIAV 180
Query 196 MELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRA 255
MELGATVCTAR PRCG+CPL CAWR G+P GP RR Q Y GTDRQVRGRLLDVLR
Sbjct 181 MELGATVCTARAPRCGVCPLTHCAWRSRGFPAGTGPARRVQRYAGTDRQVRGRLLDVLRD 240
Query 256 AEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
PVTRAELDVAW TD AQRDRAL+SLL D LV +T DGRFAL GEG
Sbjct 241 NASPVTRAELDVAWTTDPAQRDRALDSLLVDGLVEQTADGRFALAGEG 288
>gi|315446228|ref|YP_004079107.1| A/G-specific DNA glycosylase [Mycobacterium sp. Spyr1]
gi|315264531|gb|ADU01273.1| A/G-specific DNA glycosylase [Mycobacterium sp. Spyr1]
Length=291
Score = 405 bits (1040), Expect = 5e-111, Method: Compositional matrix adjust.
Identities = 219/288 (77%), Positives = 234/288 (82%), Gaps = 1/288 (0%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
I L+ WY + RDL WR PGVS WQILVSEFMLQQTP ARV IW DWV RWPTPSA
Sbjct 2 IDPNELIRWYATAQRDLAWRRPGVSAWQILVSEFMLQQTPVARVEPIWLDWVARWPTPSA 61
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
TA AS AD+LRAWGKLGYPRRAKRLHECATVIA +H DVVPDD++ L+TLPGVG+YTARA
Sbjct 62 TAAASAADILRAWGKLGYPRRAKRLHECATVIATEHGDVVPDDVDTLLTLPGVGTYTARA 121
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVAL 195
VACFAYRQRVPVVDTNVRRVVARAVHG DAG PS RD ADV ALLP +TAP FS+A+
Sbjct 122 VACFAYRQRVPVVDTNVRRVVARAVHGLHDAGPPST-RDLADVAALLPDDDTAPHFSIAV 180
Query 196 MELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRA 255
MELGATVCTAR PRCG+CPL CAWR G+P GP RR Q Y GTDRQVRGRLLDVLR
Sbjct 181 MELGATVCTARAPRCGVCPLTHCAWRSRGFPAGTGPARRVQRYAGTDRQVRGRLLDVLRD 240
Query 256 AEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
PVTRAELDVAW TD AQRDRAL+SLL D LV +T DGRFAL GEG
Sbjct 241 NASPVTRAELDVAWTTDPAQRDRALDSLLVDGLVEQTADGRFALAGEG 288
>gi|333992521|ref|YP_004525135.1| adenine glycosylase MutY [Mycobacterium sp. JDM601]
gi|333488490|gb|AEF37882.1| adenine glycosylase MutY [Mycobacterium sp. JDM601]
Length=299
Score = 382 bits (982), Expect = 3e-104, Method: Compositional matrix adjust.
Identities = 212/284 (75%), Positives = 225/284 (80%), Gaps = 1/284 (0%)
Query 20 NLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATA 79
+LL W+ + RDLPWR GVSPWQILVSEFMLQQTP ARVL IW DWV RWPTPSATA A
Sbjct 7 DLLGWFDSAERDLPWRAVGVSPWQILVSEFMLQQTPVARVLPIWTDWVARWPTPSATAAA 66
Query 80 STADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACF 139
+ ADVLRAWGKLGYPRRAKRLHECA IARD DVVPDD+E L LPG+GSYTARAVACF
Sbjct 67 TPADVLRAWGKLGYPRRAKRLHECAVTIARDFADVVPDDVETLEQLPGIGSYTARAVACF 126
Query 140 AYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELG 199
AYR+ VPVVDTNVRRVVARAVHGRADA PS RDHADV ALLP A FS ALMELG
Sbjct 127 AYRKPVPVVDTNVRRVVARAVHGRADAAGPSAKRDHADVAALLPDAARAARFSAALMELG 186
Query 200 ATVCTARTPRCGLCPL-DWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEF 258
A VCTARTPRC CPL + C WR AG P +DGP R Q Y GTDRQVRGRLLDVLR EF
Sbjct 187 ALVCTARTPRCDSCPLREHCRWRLAGSPSADGPRRPAQRYAGTDRQVRGRLLDVLRDNEF 246
Query 259 PVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
VTRA+LDVAW D QR+RAL SLL D LV + DGRFALPG+
Sbjct 247 SVTRADLDVAWPVDEVQRERALASLLDDGLVEQLADGRFALPGQ 290
>gi|229494157|ref|ZP_04387920.1| base excision DNA repair protein, HhH-GPD family [Rhodococcus
erythropolis SK121]
gi|229318519|gb|EEN84377.1| base excision DNA repair protein, HhH-GPD family [Rhodococcus
erythropolis SK121]
Length=293
Score = 377 bits (969), Expect = 1e-102, Method: Compositional matrix adjust.
Identities = 189/289 (66%), Positives = 217/289 (76%), Gaps = 1/289 (0%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
+ + LL WY + RDLPWR GVS WQIL+SE MLQQTP RV IW +WV RWP PSA
Sbjct 3 VESSVLLKWYSETARDLPWRREGVSAWQILMSEIMLQQTPVVRVAPIWEEWVERWPVPSA 62
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
A +S A+VLRAWGKLGYPRRA RLHECA V+A +H DVVP+D++ L++LPG+GSYTARA
Sbjct 63 MAASSQAEVLRAWGKLGYPRRALRLHECAGVLAAEHGDVVPEDVDTLLSLPGIGSYTARA 122
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPH-RETAPEFSVA 194
VACFAY QRVPVVDTNVRRVVARAVHG A+ G PS RD ADV ALLP R A +S A
Sbjct 123 VACFAYGQRVPVVDTNVRRVVARAVHGNAEPGNPSNTRDLADVAALLPRTRARAATYSAA 182
Query 195 LMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLR 254
LMELGA +CTARTP C CPL CAW +G P GPP++ Q + GTDRQVRG+L+ VLR
Sbjct 183 LMELGALICTARTPNCENCPLPNCAWVASGKPAHTGPPKKVQKFAGTDRQVRGKLMAVLR 242
Query 255 AAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
+ PV RA+LD+ W TD QRDRAL SLL D LV +T DG FAL GEG
Sbjct 243 ESHRPVERAQLDIVWPTDPGQRDRALHSLLVDGLVEQTADGLFALAGEG 291
>gi|169627661|ref|YP_001701310.1| adenine glycosylase MutY [Mycobacterium abscessus ATCC 19977]
gi|169239628|emb|CAM60656.1| Probable adenine glycosylase (MutY) [Mycobacterium abscessus]
Length=280
Score = 377 bits (969), Expect = 1e-102, Method: Compositional matrix adjust.
Identities = 192/282 (69%), Positives = 217/282 (77%), Gaps = 3/282 (1%)
Query 22 LAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAST 81
+ W+ + RDLPWR P +PW IL+SE MLQQTP +RV +W +WV RWP PSA A S
Sbjct 1 MGWFDVAERDLPWRRPEATPWHILISEVMLQQTPVSRVEPVWREWVARWPVPSAMAKTSV 60
Query 82 ADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFAY 141
A+VLRAWGKLGYPRRA RLHECATV+ARD++D VP D+E L+TLPGVG+YTARA+ACF Y
Sbjct 61 AEVLRAWGKLGYPRRAMRLHECATVLARDYDDQVPGDVETLLTLPGVGAYTARAIACFGY 120
Query 142 RQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELGAT 201
QRVPVVDTNVRRV+AR VHG AD+ APS RD D ALLP E FS ALMELGA
Sbjct 121 GQRVPVVDTNVRRVIARVVHGVADS-APSA-RDLRDAEALLPT-ENGARFSAALMELGAL 177
Query 202 VCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEFPVT 261
VCTARTP+C +CPL CAW AG PP D P R Q Y GTDRQVRGRLLDVLR + PVT
Sbjct 178 VCTARTPQCPMCPLSSCAWHEAGQPPLDAPARPVQKYAGTDRQVRGRLLDVLRGSSIPVT 237
Query 262 RAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
RA+LDVAWL+D QRDRAL+SLL D LV +T DGRFAL GEG
Sbjct 238 RAQLDVAWLSDITQRDRALDSLLVDGLVEQTADGRFALAGEG 279
>gi|118470417|ref|YP_890305.1| base excision DNA repair protein, HhH-GPD family protein [Mycobacterium
smegmatis str. MC2 155]
gi|118171704|gb|ABK72600.1| base excision DNA repair protein, HhH-GPD family protein [Mycobacterium
smegmatis str. MC2 155]
Length=293
Score = 375 bits (964), Expect = 3e-102, Method: Compositional matrix adjust.
Identities = 202/288 (71%), Positives = 227/288 (79%), Gaps = 3/288 (1%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
IS LL+WY + RDLPWR PGVS WQILVSEFMLQQTP +RV IW W+ RWPT SA
Sbjct 3 ISPVELLSWYDHARRDLPWRRPGVSAWQILVSEFMLQQTPVSRVEPIWSAWIERWPTASA 62
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
TA A A+VLRAWGKLGYPRRAKRLHECA VIA +++DVVP D++ L+TLPG+G+YTARA
Sbjct 63 TAAAGPAEVLRAWGKLGYPRRAKRLHECAVVIASEYDDVVPRDVDTLLTLPGIGAYTARA 122
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVAL 195
VACFAY+ VPVVDTNVRRVV RAVHG AD AP+ RD V ALLP TAP FS AL
Sbjct 123 VACFAYQASVPVVDTNVRRVVTRAVHGAAD--APASTRDLDMVAALLPPDTTAPTFSAAL 180
Query 196 MELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRA 255
MELGATVCTAR+PRCG+CPL C WR AG+ P+ RR Q Y GTDRQVRG+LLDVLR
Sbjct 181 MELGATVCTARSPRCGICPLSHCRWRSAGF-PAGTVARRVQRYAGTDRQVRGKLLDVLRD 239
Query 256 AEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
+ PVTRA+LDV WL+D AQRDRAL+SLL D LV +T DGRFAL GEG
Sbjct 240 STTPVTRAQLDVVWLSDPAQRDRALDSLLVDGLVEQTADGRFALAGEG 287
>gi|226304159|ref|YP_002764117.1| adenine glycosylase [Rhodococcus erythropolis PR4]
gi|226183274|dbj|BAH31378.1| putative adenine glycosylase [Rhodococcus erythropolis PR4]
Length=297
Score = 375 bits (963), Expect = 4e-102, Method: Compositional matrix adjust.
Identities = 189/289 (66%), Positives = 217/289 (76%), Gaps = 1/289 (0%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
+ + LL WY + RDLPWR GVS WQIL+SE MLQQTP RV IW +WV RWP PSA
Sbjct 7 VESSVLLKWYSETARDLPWRREGVSAWQILMSEIMLQQTPVVRVAPIWEEWVERWPVPSA 66
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
A +S A+VLRAWGKLGYPRRA RLHECA V+A +H DVVP+D++ L++LPG+GSYTARA
Sbjct 67 MAASSQAEVLRAWGKLGYPRRALRLHECAGVLAAEHGDVVPEDVDTLLSLPGIGSYTARA 126
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPH-RETAPEFSVA 194
VACFAY QRVPVVDTNVRRVVARAVHG A+ G PS RD ADV ALLP R A +S A
Sbjct 127 VACFAYGQRVPVVDTNVRRVVARAVHGNAEPGNPSNTRDLADVAALLPRTRARAATYSAA 186
Query 195 LMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLR 254
LMELGA +CTARTP C CPL CAW +G P GPP++ Q + GTDRQVRG+L+ VLR
Sbjct 187 LMELGALICTARTPNCENCPLPNCAWVASGKPAHTGPPKKVQKFAGTDRQVRGKLMAVLR 246
Query 255 AAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
+ PV RA+LD+ W TD QRDRAL SLL D LV +T DG FAL GEG
Sbjct 247 DSHRPVERAQLDIVWPTDPGQRDRALHSLLVDGLVEQTDDGLFALAGEG 295
>gi|333918024|ref|YP_004491605.1| putative adenine glycosylase [Amycolicicoccus subflavus DQS3-9A1]
gi|333480245|gb|AEF38805.1| Putative adenine glycosylase [Amycolicicoccus subflavus DQS3-9A1]
Length=297
Score = 371 bits (953), Expect = 6e-101, Method: Compositional matrix adjust.
Identities = 187/288 (65%), Positives = 217/288 (76%), Gaps = 1/288 (0%)
Query 15 HISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPS 74
+I L+ W++ S RDLPWR PGV+ WQIL+SE MLQQTP ARV +W +WV+RWP PS
Sbjct 9 NIDAEVLVTWFRESARDLPWRRPGVTGWQILMSEIMLQQTPVARVAPVWEEWVKRWPRPS 68
Query 75 ATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTAR 134
+ A S A+V+RAWGKLGYPRRA RLHECA+ +A+ ++D VP D+EIL+TLPGVG YTAR
Sbjct 69 SLAAVSPAEVVRAWGKLGYPRRALRLHECASHLAQQYDDEVPADVEILLTLPGVGDYTAR 128
Query 135 AVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVA 194
AVACFAY Q VPVVDTNVRRV+ARAVHGRA+ G+P+ RD D ALLP R A FS A
Sbjct 129 AVACFAYGQSVPVVDTNVRRVIARAVHGRAEPGSPAAKRDLNDASALLPVRNGA-AFSAA 187
Query 195 LMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLR 254
LMELGATVCTAR+P+C +CP+ CAW AG P DGP RR Q Y GTDRQVRGR+LDVLR
Sbjct 188 LMELGATVCTARSPQCAVCPVTACAWLQAGRPAHDGPRRRTQRYAGTDRQVRGRILDVLR 247
Query 255 AAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
A PV R +D WL D QRDRAL SLL D LV +T G FAL GE
Sbjct 248 GANGPVDRGRIDAVWLNDPTQRDRALGSLLEDGLVEQTAQGLFALAGE 295
>gi|325675006|ref|ZP_08154693.1| A/G-specific adenine DNA glycosylase [Rhodococcus equi ATCC 33707]
gi|325554592|gb|EGD24267.1| A/G-specific adenine DNA glycosylase [Rhodococcus equi ATCC 33707]
Length=306
Score = 370 bits (950), Expect = 1e-100, Method: Compositional matrix adjust.
Identities = 194/284 (69%), Positives = 219/284 (78%), Gaps = 1/284 (0%)
Query 21 LLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAS 80
LL WY RDLPWR GV+ W IL+SE MLQQTP RV IW +WVRRWP PS A +S
Sbjct 21 LLRWYDEQARDLPWRRDGVTAWHILMSEIMLQQTPVVRVAPIWEEWVRRWPVPSLMAASS 80
Query 81 TADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFA 140
ADVLRAWGKLGYPRRA RLHECA V+AR+H+DVVP D+++L++LPG+G YTARAVACFA
Sbjct 81 QADVLRAWGKLGYPRRALRLHECAGVLAREHDDVVPADVDVLLSLPGIGDYTARAVACFA 140
Query 141 YRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPH-RETAPEFSVALMELG 199
Y QRVPVVDTNVRRVVARAVHGRAD G PS RD ADV ALLP RE A FS ALMELG
Sbjct 141 YGQRVPVVDTNVRRVVARAVHGRADQGNPSAKRDMADVDALLPRTRERAARFSAALMELG 200
Query 200 ATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEFP 259
AT+CTARTP C CPL CAW AG P G PR+ Q + GTDRQVRG+L+DVLR + P
Sbjct 201 ATICTARTPDCANCPLPRCAWVEAGRPAYTGEPRKVQKFAGTDRQVRGKLMDVLRGSAHP 260
Query 260 VTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
V RA+LD+ WL+D QRDRAL+SLL D L+ +T DG FAL GEG
Sbjct 261 VERAQLDLVWLSDPGQRDRALDSLLVDGLIEQTEDGMFALAGEG 304
>gi|312138090|ref|YP_004005426.1| hhh-gpd family DNA repair protein [Rhodococcus equi 103S]
gi|311887429|emb|CBH46741.1| HhH-GPD family DNA repair protein [Rhodococcus equi 103S]
Length=296
Score = 369 bits (947), Expect = 4e-100, Method: Compositional matrix adjust.
Identities = 193/289 (67%), Positives = 219/289 (76%), Gaps = 1/289 (0%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
+ LL WY RDLPWR GV+ W IL+SE MLQQTP RV IW +WVRRWP PS
Sbjct 6 VDGAALLRWYDEQARDLPWRRDGVTAWHILMSEIMLQQTPVVRVAPIWEEWVRRWPVPSL 65
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
A +S ADVLRAWGKLGYPRRA RLHECA V+A +H+DVVP D+++L++LPG+G YTARA
Sbjct 66 MAASSQADVLRAWGKLGYPRRALRLHECAGVLASEHDDVVPADVDVLLSLPGIGDYTARA 125
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPH-RETAPEFSVA 194
VACFAY QRVPVVDTNVRRVVARAVHGRAD G PS RD ADV ALLP RE A FS A
Sbjct 126 VACFAYGQRVPVVDTNVRRVVARAVHGRADQGNPSAKRDMADVDALLPRTRERAARFSAA 185
Query 195 LMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLR 254
LMELGAT+CTARTP C CPL CAW AG P G PR+ Q + GTDRQVRG+L+DVLR
Sbjct 186 LMELGATICTARTPDCANCPLPRCAWVEAGRPAYTGEPRKVQKFAGTDRQVRGKLMDVLR 245
Query 255 AAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
+ PV RA+LD+ WL+D QRDRAL+SLL D L+ +T DG FAL GEG
Sbjct 246 GSAHPVERAQLDLVWLSDPGQRDRALDSLLVDGLIEQTEDGMFALAGEG 294
>gi|54022393|ref|YP_116635.1| putative A/G-specific adenine glycosylase [Nocardia farcinica
IFM 10152]
gi|54013901|dbj|BAD55271.1| putative A/G-specific adenine glycosylase [Nocardia farcinica
IFM 10152]
Length=295
Score = 368 bits (945), Expect = 5e-100, Method: Compositional matrix adjust.
Identities = 196/289 (68%), Positives = 218/289 (76%), Gaps = 3/289 (1%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
I LL WY + RDLPWR PGV+ WQIL+SE MLQQTP ARV IW +WVRRWP PSA
Sbjct 7 IDHDALLDWYAAAARDLPWRRPGVTAWQILMSEIMLQQTPVARVQPIWLEWVRRWPVPSA 66
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
A +S A+VLRAWGKLGYPRRA RLHECA V+A +H D VP D+++L+ LPG+G+YTARA
Sbjct 67 MAASSQAEVLRAWGKLGYPRRALRLHECAGVLAAEHGDEVPADVDVLLGLPGIGAYTARA 126
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHR-ETAPEFSVA 194
VACFAY RVPVVDTNVRRVVARAVHGRA+AG PS RD A+ ALLP + E A FS A
Sbjct 127 VACFAYGIRVPVVDTNVRRVVARAVHGRAEAGNPSA-RDLAETEALLPAQVEPAARFSAA 185
Query 195 LMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLR 254
LMELGATVCTAR P CG CPL CAW AG P S+ RR Q Y GTDRQ RGRLLDVLR
Sbjct 186 LMELGATVCTARNPDCGRCPLPRCAWVSAGRPASE-VVRRTQKYEGTDRQARGRLLDVLR 244
Query 255 AAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
AA PV R LD+AW D QRDRAL+SLL D L+ +T DG FAL GEG
Sbjct 245 AASGPVERVRLDLAWTRDPGQRDRALDSLLVDGLIEQTADGLFALAGEG 293
>gi|326383086|ref|ZP_08204775.1| HhH-GPD family protein [Gordonia neofelifaecis NRRL B-59395]
gi|326198222|gb|EGD55407.1| HhH-GPD family protein [Gordonia neofelifaecis NRRL B-59395]
Length=288
Score = 366 bits (939), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 189/283 (67%), Positives = 209/283 (74%), Gaps = 2/283 (0%)
Query 21 LLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAS 80
LL W+ R+ R LPWREPG+S WQIL+SE MLQQTP ARV+ W WV RWP PSA A +
Sbjct 6 LLDWFARARRPLPWREPGISGWQILISEIMLQQTPVARVVGPWQTWVERWPVPSAMAAET 65
Query 81 TADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFA 140
T +V+RAWGKLGYPRRA RLHECA V+A +H+D VPDD++ L+ LPG+G YTARAVACFA
Sbjct 66 TGEVVRAWGKLGYPRRAMRLHECARVLAAEHDDAVPDDVDTLLGLPGIGDYTARAVACFA 125
Query 141 YRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELGA 200
Y Q VPVVDTNVRRV+ARAVHG G PS RD D LLP ETAPEFS ALMELGA
Sbjct 126 YGQSVPVVDTNVRRVIARAVHGTQQPGNPS-KRDLVDARQLLPDDETAPEFSAALMELGA 184
Query 201 TVCTARTPRCGLCPL-DWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEFP 259
VCTAR+P C CPL D C W G P DGPPRR Q Y GTDRQVRG LLDVLR P
Sbjct 185 LVCTARSPLCDACPLVDTCRWVALGKPAHDGPPRRTQKYDGTDRQVRGLLLDVLRGRTGP 244
Query 260 VTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
V RA LD+AW DTAQRDRAL+SLL D L+ T DG F L GE
Sbjct 245 VERARLDLAWTRDTAQRDRALDSLLVDGLIEITSDGLFCLAGE 287
>gi|226363775|ref|YP_002781557.1| adenine glycosylase [Rhodococcus opacus B4]
gi|226242264|dbj|BAH52612.1| putative adenine glycosylase [Rhodococcus opacus B4]
Length=301
Score = 361 bits (926), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 185/289 (65%), Positives = 209/289 (73%), Gaps = 1/289 (0%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
+ + L+ WY+ RDLPWR GV+ W IL+SE MLQQTP RV IW +WV+RWP PS
Sbjct 12 VDSSALINWYEAQARDLPWRRDGVTAWHILMSEIMLQQTPVVRVAPIWEEWVQRWPVPSR 71
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
A +S ADVLRAWGKLGYPRRA RLHECA V+A +H DVVP D++ L+ LPG+G+YTARA
Sbjct 72 MAASSQADVLRAWGKLGYPRRALRLHECAGVLAAEHGDVVPSDVDTLLGLPGIGAYTARA 131
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPH-RETAPEFSVA 194
VACFAY QRVPVVDTNVRRVVARAVHG A+ G PS RD ADV LLP R A FS A
Sbjct 132 VACFAYGQRVPVVDTNVRRVVARAVHGSAEPGNPSTTRDLADVSTLLPRTRARAATFSAA 191
Query 195 LMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLR 254
LMELGATVCTAR+P C CPL CAW AG P G R+ Q + GTDRQVRG+L+ VLR
Sbjct 192 LMELGATVCTARSPECTRCPLPNCAWVDAGRPAHTGARRKVQKFAGTDRQVRGKLMAVLR 251
Query 255 AAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
+ PV R LD WL D QRDRAL SLL D LV +T DG FAL GEG
Sbjct 252 ESTAPVERVRLDQVWLDDPGQRDRALHSLLVDGLVEQTDDGLFALAGEG 300
>gi|111021420|ref|YP_704392.1| A/G-specific adenine DNA glycosylase [Rhodococcus jostii RHA1]
gi|110820950|gb|ABG96234.1| probable A/G-specific adenine DNA glycosylase [Rhodococcus jostii
RHA1]
Length=326
Score = 359 bits (922), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 186/289 (65%), Positives = 207/289 (72%), Gaps = 1/289 (0%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
+ L+ WY RDLPWR GV+ W IL+SE MLQQTP RV IW +WVRRWP PS
Sbjct 37 VDSAALINWYDVQARDLPWRRDGVTAWHILMSEIMLQQTPVVRVAPIWEEWVRRWPVPSR 96
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
A +S ADVLRAWGKLGYPRRA RLHECA V+A +H DVVP D++ L+ LPG+G+YTARA
Sbjct 97 MAASSQADVLRAWGKLGYPRRALRLHECAGVLAAEHGDVVPSDVDTLLGLPGIGAYTARA 156
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPH-RETAPEFSVA 194
VACFAY QRVPVVDTNVRRVVARAVHG A+ G PS RD ADV ALLP R A FS A
Sbjct 157 VACFAYGQRVPVVDTNVRRVVARAVHGSAEPGNPSTTRDLADVSALLPRTRARAATFSAA 216
Query 195 LMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLR 254
LMELGATVCTAR+P C CPL CAW AG P G R+ Q + GTDRQVRG+L+ VLR
Sbjct 217 LMELGATVCTARSPECARCPLPSCAWVDAGRPAHTGERRKVQKFAGTDRQVRGKLMAVLR 276
Query 255 AAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
+ PV R LD W D QRDRAL SLL D LV +T DG FAL GEG
Sbjct 277 ESSSPVERVRLDQVWPGDPGQRDRALHSLLIDGLVEQTDDGLFALAGEG 325
>gi|343925913|ref|ZP_08765428.1| adenine glycosylase [Gordonia alkanivorans NBRC 16433]
gi|343764264|dbj|GAA12354.1| adenine glycosylase [Gordonia alkanivorans NBRC 16433]
Length=290
Score = 358 bits (919), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 185/283 (66%), Positives = 205/283 (73%), Gaps = 2/283 (0%)
Query 21 LLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAS 80
L+ W+ RDLPWR+P + WQILVSE MLQQTP +RVL W WV RWP PSA A
Sbjct 10 LIGWFGTHERDLPWRDPACTGWQILVSEIMLQQTPVSRVLEPWRMWVERWPVPSAMAVEP 69
Query 81 TADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFA 140
+VLRAWGKLGYPRRA RLHEC+ V+ARDH D VPDD+E ++TLPG+G YTARAVACFA
Sbjct 70 AGEVLRAWGKLGYPRRALRLHECSKVLARDHGDRVPDDVETMLTLPGIGDYTARAVACFA 129
Query 141 YRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELGA 200
Y Q VPVVDTNVRRV+ARAVHGR G PS RD D ALLP R AP FS ALMELGA
Sbjct 130 YGQAVPVVDTNVRRVIARAVHGREQPGNPS-RRDLDDAEALLP-RSGAPRFSAALMELGA 187
Query 201 TVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEFPV 260
VCTAR P+C CPL C W G P GPPR+ Q + GTDRQVRG LLDVLR V
Sbjct 188 LVCTARNPKCDNCPLVDCTWVRRGRPAHTGPPRKAQKFAGTDRQVRGLLLDVLRGTTGSV 247
Query 261 TRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEG 303
RA+LDV W +DTAQRDRAL+SLL D LV T DGR+ L GEG
Sbjct 248 DRAKLDVVWTSDTAQRDRALDSLLVDGLVEITTDGRYCLAGEG 290
>gi|262200959|ref|YP_003272167.1| HhH-GPD family protein [Gordonia bronchialis DSM 43247]
gi|262084306|gb|ACY20274.1| HhH-GPD family protein [Gordonia bronchialis DSM 43247]
Length=300
Score = 354 bits (909), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 182/288 (64%), Positives = 208/288 (73%), Gaps = 7/288 (2%)
Query 21 LLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAS 80
+L W+ RDLPWR+P S W IL+SE MLQQTP +RV+ W +WVRRWP PSA A
Sbjct 10 ILGWFDVHERDLPWRDPACSAWHILISEIMLQQTPVSRVIDPWREWVRRWPVPSAMADTP 69
Query 81 TADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFA 140
+VLRAWGKLGYPRRA RLHECA V+AR+H+D VP D+E +++LPG+G YTARAVACFA
Sbjct 70 AGEVLRAWGKLGYPRRALRLHECAKVLAREHDDRVPADVETMLSLPGIGDYTARAVACFA 129
Query 141 YRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLP------HRETAPEFSVA 194
Y Q VPVVDTNVRRV+ARAVHGR G P D AD ALLP + TAP +S A
Sbjct 130 YGQAVPVVDTNVRRVIARAVHGREQPGNPGRA-DLADAQALLPRASSGGYAPTAPRYSAA 188
Query 195 LMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLR 254
LMELGA VCTAR PRCG CP+ C W G PP GPPR+ Q + GTDRQVRG LLDVLR
Sbjct 189 LMELGALVCTARNPRCGDCPVPDCTWVRLGRPPHSGPPRKVQKFAGTDRQVRGLLLDVLR 248
Query 255 AAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
+ PV R LD+AW TDTAQRDRAL+SLL D LV T DGR+ L GE
Sbjct 249 GTDAPVNRGRLDLAWTTDTAQRDRALDSLLVDGLVEITDDGRYCLAGE 296
>gi|296138329|ref|YP_003645572.1| HhH-GPD family protein [Tsukamurella paurometabola DSM 20162]
gi|296026463|gb|ADG77233.1| HhH-GPD family protein [Tsukamurella paurometabola DSM 20162]
Length=303
Score = 347 bits (890), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 186/296 (63%), Positives = 217/296 (74%), Gaps = 6/296 (2%)
Query 12 GPRHISD-TNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRW 70
GP+ SD T++L W+ R+ RDLPWR GV+PWQIL+SE MLQQTP RV A+W +WVRRW
Sbjct 8 GPKIDSDFTSVLHWFDRTERDLPWRADGVTPWQILMSEVMLQQTPVVRVEAMWREWVRRW 67
Query 71 PTPSATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGS 130
PTP+ A A+ AD +RAWGKLGYPRRA RLH CA I +H VVP+ +E L+ LPG+G
Sbjct 68 PTPADLAAATGADAVRAWGKLGYPRRAMRLHACAQAIVAEHAGVVPETVEELLALPGIGD 127
Query 131 YTARAVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHA---DVLALLPHRET 187
YTARAVACFA+ Q VPVVD NVRRV+ARAV G +DA APS RD A +VL +P +
Sbjct 128 YTARAVACFAFGQDVPVVDINVRRVLARAVAGDSDAAAPSARRDLAAATEVLETVPAQRR 187
Query 188 APEFSVALMELGATVCTARTPRCGLCPLD-WCAWRHAGYPPSDGPPRRGQAYTGTDRQVR 246
P S ALMELGA VCTAR+PRCG CP+ CAW AG P PR+ Q Y GTDRQ R
Sbjct 188 -PRLSAALMELGALVCTARSPRCGECPVHAGCAWLAAGAPEGTVAPRKVQKYEGTDRQAR 246
Query 247 GRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
GRLLDVLRAA+ V RA LD+AW DTAQRDRAL+SLL D L+ +T DGRFAL GE
Sbjct 247 GRLLDVLRAADGAVPRAALDLAWERDTAQRDRALDSLLLDGLIEQTDDGRFALSGE 302
>gi|300790508|ref|YP_003770799.1| A/G-specific adenine glycosylase [Amycolatopsis mediterranei
U32]
gi|299800022|gb|ADJ50397.1| A/G-specific adenine glycosylase [Amycolatopsis mediterranei
U32]
Length=291
Score = 346 bits (887), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 180/284 (64%), Positives = 208/284 (74%), Gaps = 3/284 (1%)
Query 21 LLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAS 80
LL W+ RDLPWREP S W +LVSE MLQQTP RV IW +W+ RWP PSA A +S
Sbjct 8 LLDWFSAHGRDLPWREPTCSAWGVLVSEIMLQQTPVVRVQPIWLEWMARWPVPSALAASS 67
Query 81 TADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFA 140
+V+RAWGKLGYPRRA RLHE A VIA++H DVVP D++ L+ LPG+G+YTARAVA FA
Sbjct 68 QGEVVRAWGKLGYPRRALRLHEAAGVIAKEHGDVVPSDVDTLLALPGIGAYTARAVAAFA 127
Query 141 YRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRET-APEFSVALMELG 199
Y +R PVVDTNVRRVVARAVHG DAG S RD ADV ALLP + A FS A+MELG
Sbjct 128 YGRRAPVVDTNVRRVVARAVHGAGDAGPASNTRDMADVEALLPAEDAPAARFSAAIMELG 187
Query 200 ATVCTARTPRCGLCPL-DWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEF 258
A +CTAR P+C CP+ D CAW+HAG P DGP + Q Y GTDR VRGRLLDVLR +E
Sbjct 188 ALICTARAPKCADCPIYDECAWQHAGRPEYDGPAKPVQRYAGTDRYVRGRLLDVLRGSEG 247
Query 259 PVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
PV +A LD+ W D QRDR L+SLL D L+ +T DG FALPGE
Sbjct 248 PVDKARLDLVW-HDGGQRDRCLDSLLVDGLLEQTRDGLFALPGE 290
>gi|340532193|gb|AEK47398.1| A/G-specific adenine glycosylase [Amycolatopsis mediterranei
S699]
Length=289
Score = 345 bits (886), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 180/284 (64%), Positives = 208/284 (74%), Gaps = 3/284 (1%)
Query 21 LLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAS 80
LL W+ RDLPWREP S W +LVSE MLQQTP RV IW +W+ RWP PSA A +S
Sbjct 6 LLDWFSAHGRDLPWREPTCSAWGVLVSEIMLQQTPVVRVQPIWLEWMARWPVPSALAASS 65
Query 81 TADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFA 140
+V+RAWGKLGYPRRA RLHE A VIA++H DVVP D++ L+ LPG+G+YTARAVA FA
Sbjct 66 QGEVVRAWGKLGYPRRALRLHEAAGVIAKEHGDVVPSDVDTLLALPGIGAYTARAVAAFA 125
Query 141 YRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRET-APEFSVALMELG 199
Y +R PVVDTNVRRVVARAVHG DAG S RD ADV ALLP + A FS A+MELG
Sbjct 126 YGRRAPVVDTNVRRVVARAVHGAGDAGPASNTRDMADVEALLPAEDAPAARFSAAIMELG 185
Query 200 ATVCTARTPRCGLCPL-DWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEF 258
A +CTAR P+C CP+ D CAW+HAG P DGP + Q Y GTDR VRGRLLDVLR +E
Sbjct 186 ALICTARAPKCADCPIYDECAWQHAGRPEYDGPAKPVQRYAGTDRYVRGRLLDVLRGSEG 245
Query 259 PVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
PV +A LD+ W D QRDR L+SLL D L+ +T DG FALPGE
Sbjct 246 PVDKARLDLVW-HDGGQRDRCLDSLLVDGLLEQTRDGLFALPGE 288
>gi|134097045|ref|YP_001102706.1| A/G-specific adenine glycosylase [Saccharopolyspora erythraea
NRRL 2338]
gi|291007014|ref|ZP_06564987.1| A/G-specific adenine glycosylase [Saccharopolyspora erythraea
NRRL 2338]
gi|133909668|emb|CAL99780.1| A/G-specific adenine glycosylase [Saccharopolyspora erythraea
NRRL 2338]
Length=302
Score = 340 bits (873), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 177/289 (62%), Positives = 207/289 (72%), Gaps = 3/289 (1%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
++ L+ W+ + R LPWR PG + W +LVSE MLQQTP ARV IW +W+ RWP PS
Sbjct 14 LNPVELIDWFAATARPLPWRAPGTTGWGVLVSETMLQQTPVARVQPIWEEWMARWPRPSD 73
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
A A A+VLRAWGKLGYPRRA RLHE A IA +H DVVP D++ L+ LPG+G+YTARA
Sbjct 74 LAAAGQAEVLRAWGKLGYPRRALRLHEAAGTIAAEHGDVVPSDVDTLLALPGIGAYTARA 133
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRET-APEFSVA 194
VA FAY +R PVVDTNVRRVVARAVHG DAG PS RD ADV ALLP + A S A
Sbjct 134 VAAFAYGRRAPVVDTNVRRVVARAVHGAGDAGPPSTRRDLADVEALLPDTDAEAARLSAA 193
Query 195 LMELGATVCTARTPRCGLCPLDW-CAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVL 253
LMELG VCT R+P C CP+ CAW+HAG P GPP++ Q + GTDRQVRG+LLDVL
Sbjct 194 LMELGQVVCTVRSPACETCPIAHDCAWQHAGRPAYAGPPKKVQKFAGTDRQVRGKLLDVL 253
Query 254 RAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
R + PVTR +LD W D QRDR L+SLL D L+ +T DGRFALPGE
Sbjct 254 RGTDGPVTRDQLDAVW-ADAGQRDRCLDSLLVDGLLEQTDDGRFALPGE 301
>gi|331699823|ref|YP_004336062.1| HhH-GPD family protein [Pseudonocardia dioxanivorans CB1190]
gi|326954512|gb|AEA28209.1| HhH-GPD family protein [Pseudonocardia dioxanivorans CB1190]
Length=294
Score = 340 bits (871), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 186/283 (66%), Positives = 205/283 (73%), Gaps = 3/283 (1%)
Query 20 NLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATA 79
L+ WY + RDLPWR PG +PW ++VSEFMLQQTP ARV +W W+ WPTPSA A A
Sbjct 12 TLVPWYAVAARDLPWRRPGTTPWGVMVSEFMLQQTPVARVEPVWLRWMADWPTPSALAAA 71
Query 80 STADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACF 139
ADVLRAWGKLGYPRRA RLHE A IA H+D+VP D++ L LPGVGSYTARAVA F
Sbjct 72 PRADVLRAWGKLGYPRRALRLHEAAATIATVHDDIVPADVDALEALPGVGSYTARAVAAF 131
Query 140 AYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRET-APEFSVALMEL 198
Y +R VVDTNVRRVVARAVHG DAG V D ADV ALLP E A FS+A+MEL
Sbjct 132 GYGRRCAVVDTNVRRVVARAVHGAGDAGPARVRADLADVEALLPADEAEAATFSIAMMEL 191
Query 199 GATVCTARTPRCGLCPL-DWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAE 257
GATVCTARTPRC CP+ + CAW AG P DGP R QA+ GTDRQVRGRLLDVLR A
Sbjct 192 GATVCTARTPRCAACPVRERCAWLAAGAPTYDGPRRAPQAFAGTDRQVRGRLLDVLRGAT 251
Query 258 FPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
PV RA LD AW D AQRDR L+SLL D L + DGRFALP
Sbjct 252 GPVDRAALDRAW-ADAAQRDRCLDSLLVDGLAEQLDDGRFALP 293
>gi|302530747|ref|ZP_07283089.1| A/G-specific adenine glycosylase [Streptomyces sp. AA4]
gi|302439642|gb|EFL11458.1| A/G-specific adenine glycosylase [Streptomyces sp. AA4]
Length=291
Score = 335 bits (860), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 177/288 (62%), Positives = 209/288 (73%), Gaps = 4/288 (1%)
Query 18 DTN-LLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSAT 76
DT+ L+ W+ RDLPWREP S W +LVSE MLQQTP +RV IW +W+ RWP PSA
Sbjct 4 DTDVLIEWFADVGRDLPWREPDCSAWGVLVSEIMLQQTPVSRVQPIWLEWMARWPVPSAL 63
Query 77 ATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAV 136
A +T +V+RAWGKLGYPRRA RLH ATVIA++H DVVP D++ L+ LPG+G+YTARAV
Sbjct 64 AAETTGEVVRAWGKLGYPRRALRLHAAATVIAQEHGDVVPSDVDTLLALPGIGAYTARAV 123
Query 137 ACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRET-APEFSVAL 195
A FAY +R PVVDTNVRRVVARAVHG DAG S RD ADV ALLP + A + S AL
Sbjct 124 AAFAYGKRAPVVDTNVRRVVARAVHGAGDAGPASNTRDMADVEALLPAEDAPAAKLSAAL 183
Query 196 MELGATVCTARTPRCGLCPL-DWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLR 254
MELGA +CTAR+P+C CP+ CAW+ G P GP + Q + GTDRQVRG LLDVLR
Sbjct 184 MELGALICTARSPKCADCPIYAECAWQLNGRPEYAGPAKPVQKFAGTDRQVRGLLLDVLR 243
Query 255 AAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
AE PV +A LDV W + QRDR L+SLL D L+ +T DG FALPGE
Sbjct 244 GAEGPVEKARLDVVW-HEAGQRDRCLDSLLVDGLLEQTRDGLFALPGE 290
>gi|256374498|ref|YP_003098158.1| HhH-GPD family protein [Actinosynnema mirum DSM 43827]
gi|255918801|gb|ACU34312.1| HhH-GPD family protein [Actinosynnema mirum DSM 43827]
Length=329
Score = 329 bits (843), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 177/290 (62%), Positives = 203/290 (70%), Gaps = 3/290 (1%)
Query 15 HISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPS 74
H+ + L W+ + RDLPWR+P + W +LVSE MLQQTP ARV IW W+ +WP PS
Sbjct 39 HLPPSVLNTWFAATARDLPWRDPECTAWGVLVSEIMLQQTPVARVEPIWRVWLDKWPRPS 98
Query 75 ATATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTAR 134
A AS +VLR WGKLGYPRRA RLH A +A +H+DVVPDD+E L+ LPG+G+YTAR
Sbjct 99 DMAAASQGEVLRMWGKLGYPRRALRLHAAAQAVAAEHDDVVPDDVETLLALPGIGAYTAR 158
Query 135 AVACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRE-TAPEFSV 193
AVA FAY +R PVVDTNVRRVVARAVHG DAG PS RD DV ALLP E +A +S
Sbjct 159 AVAAFAYGRRCPVVDTNVRRVVARAVHGAGDAGPPSTTRDLRDVEALLPEDEASAATYSA 218
Query 194 ALMELGATVCTARTPRCGLCP-LDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDV 252
ALMELGA VCTARTPRC CP L C W+ G P DGP + Q + GTDRQVRGRLLDV
Sbjct 219 ALMELGALVCTARTPRCSACPVLGSCQWQRNGRPAYDGPAKAVQKFAGTDRQVRGRLLDV 278
Query 253 LRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
LR PV + LD AW +D QRDR L SLL D LV +T G FALPGE
Sbjct 279 LRGTSEPVAKEVLDRAW-SDAGQRDRCLHSLLVDGLVEQTAAGLFALPGE 327
>gi|324997777|ref|ZP_08118889.1| A/G-specific adenine glycosylase [Pseudonocardia sp. P1]
Length=287
Score = 324 bits (831), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 176/280 (63%), Positives = 195/280 (70%), Gaps = 5/280 (1%)
Query 24 WYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATASTAD 83
W++ RDLPWR P +PW +LVSEFMLQQTP ARV IW DW+ RWP PS+ A A ++
Sbjct 9 WFRAHARDLPWRRPETTPWGVLVSEFMLQQTPVARVEPIWLDWMERWPLPSSLAAAPRSE 68
Query 84 VLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFAYRQ 143
VLRAWGKLGYPRRA RLH+ A IA H D VPDD+E L LPG+GSYTARAV F Y
Sbjct 69 VLRAWGKLGYPRRAIRLHDAAAAIAGQHGDAVPDDVEALEALPGIGSYTARAVVAFGYGL 128
Query 144 RVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLP--HRETAPEFSVALMELGAT 201
R PVVDTNVRRVVARAVHG+ DAG D ADV ALLP H E A S LMELGA
Sbjct 129 RAPVVDTNVRRVVARAVHGQGDAGPARTRADLADVDALLPGGHAEAA-VVSAGLMELGAV 187
Query 202 VCTARTPRCGLCPL-DWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEFPV 260
VCTAR+PRC CP+ CAW AG P DGP + Q + GTDRQVRG+L+DVLR A PV
Sbjct 188 VCTARSPRCADCPVAHTCAWVAAGKPAYDGPRKPVQGFAGTDRQVRGKLMDVLRTAPGPV 247
Query 261 TRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
RA LD W TD QRDR L SLL D L +T DGRFALP
Sbjct 248 ERAALDAVW-TDAGQRDRCLHSLLVDGLAEQTDDGRFALP 286
>gi|257057486|ref|YP_003135318.1| A/G-specific DNA glycosylase [Saccharomonospora viridis DSM 43017]
gi|256587358|gb|ACU98491.1| A/G-specific DNA glycosylase [Saccharomonospora viridis DSM 43017]
Length=293
Score = 324 bits (831), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 175/291 (61%), Positives = 199/291 (69%), Gaps = 5/291 (1%)
Query 16 ISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSA 75
I LL W+ + RDLPWR +PW +LVSE MLQQTP ARVL +W W+ RWP P+
Sbjct 3 IDPQTLLDWFDANARDLPWRHADCTPWGVLVSEIMLQQTPVARVLPVWRQWMERWPKPAD 62
Query 76 TATASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARA 135
A AS +VLRAWGKLGYPRRA RLH A I +H VP D++ L++LPG+G+YTARA
Sbjct 63 LAAASQGEVLRAWGKLGYPRRALRLHTAANTIVAEHGGEVPADVDTLLSLPGIGAYTARA 122
Query 136 VACFAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALL---PHRETAPEFS 192
VA FAY +R PVVDTNVRRVVARAVHG A+AG PS RD DV ALL P A FS
Sbjct 123 VAAFAYGRRAPVVDTNVRRVVARAVHGAAEAGPPSTKRDLDDVEALLPDGPDEARAARFS 182
Query 193 VALMELGATVCTARTPRCGLCPLDW-CAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLD 251
ALMELGA VC AR PRC CPL CAW+ AG P GP R Q Y+GTDR VRG LLD
Sbjct 183 AALMELGALVCIARKPRCDDCPLFADCAWQKAGRPAYTGPTRPTQRYSGTDRHVRGLLLD 242
Query 252 VLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGE 302
VLR PV +A+LD+AW QRD L SLL+D LV +T DGRFALPGE
Sbjct 243 VLRGTTGPVEKAQLDLAW-PKAGQRDACLASLLSDGLVEQTADGRFALPGE 292
>gi|269128666|ref|YP_003302036.1| HhH-GPD family protein [Thermomonospora curvata DSM 43183]
gi|268313624|gb|ACY99998.1| HhH-GPD family protein [Thermomonospora curvata DSM 43183]
Length=296
Score = 304 bits (778), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 166/282 (59%), Positives = 191/282 (68%), Gaps = 3/282 (1%)
Query 21 LLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAS 80
+L WY RDLPWR P +PW ILVSE MLQQTP ARVL IW W+ RWPTP+A A
Sbjct 15 ILGWYAEHARDLPWRAPDATPWGILVSEVMLQQTPVARVLPIWRRWMERWPTPAALAAEP 74
Query 81 TADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFA 140
+ + +RAWG+LGYPRRA RLH CA I H VP E L LPG+G+YTA AVA FA
Sbjct 75 SGEAVRAWGRLGYPRRALRLHACAVAITERHGGQVPSSYEALRELPGIGAYTAAAVASFA 134
Query 141 YRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRE-TAPEFSVALMELG 199
YRQR V+DTNVRRV+AR + G + A +LLPH TA +SVA+MELG
Sbjct 135 YRQRHAVLDTNVRRVLARLIGGVEYPPRSQTAAEVALAESLLPHDAPTAARWSVAIMELG 194
Query 200 ATVCTARTPRCGLCP-LDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEF 258
A VCTAR PRC CP L CAW+ AG P DGPPRRGQAY GTDRQ RGRLL VLR A+
Sbjct 195 ALVCTARNPRCVDCPVLAECAWQRAGRPAYDGPPRRGQAYAGTDRQCRGRLLAVLREADG 254
Query 259 PVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
PV +A LDV W +D QR+RAL+ L+AD LV DGR+ALP
Sbjct 255 PVEKAALDVVW-SDPIQRERALDGLIADGLVDPLEDGRYALP 295
>gi|72163274|ref|YP_290931.1| HhH-GPD:Iron-sulfur cluster loop [Thermobifida fusca YX]
gi|71917006|gb|AAZ56908.1| HhH-GPD:Iron-sulfur cluster loop [Thermobifida fusca YX]
Length=291
Score = 300 bits (769), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 162/285 (57%), Positives = 192/285 (68%), Gaps = 3/285 (1%)
Query 19 TNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATAT 78
T +LAWY + RDLPWR V+PW +LVSE MLQQTP ARVL W W+ RWPTP A
Sbjct 8 TAVLAWYDANARDLPWRSAEVTPWGVLVSEVMLQQTPVARVLPAWQAWMERWPTPKDLAA 67
Query 79 ASTADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVAC 138
S + +R WG+LGYPRRA RLH CATVI H VPD + L+ LPGVG+YTA AVA
Sbjct 68 DSAGEAVRMWGRLGYPRRALRLHACATVIVERHGGTVPDSYDELLALPGVGAYTAAAVAS 127
Query 139 FAYRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHR-ETAPEFSVALME 197
FA++QR V+DTNVRRV+ R V+GR ++ +LLP A + VA+ME
Sbjct 128 FAFQQRHAVLDTNVRRVLERLVNGRQYPPRTPTKAEYRLAESLLPEEPAVAARWGVAVME 187
Query 198 LGATVCTARTPRCGLCP-LDWCAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAA 256
LGA VCTAR+PRCG+CP +D CAW AG PP DGPPRRGQ Y GTDRQVRGRLL VLR A
Sbjct 188 LGALVCTARSPRCGVCPVVDQCAWVLAGCPPHDGPPRRGQRYAGTDRQVRGRLLAVLREA 247
Query 257 EFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPG 301
PV ++ LD W D QR+RAL+SL+AD LV G +ALP
Sbjct 248 SGPVPKSMLDAVW-DDAVQRERALDSLVADGLVDPLETGEYALPN 291
>gi|284989237|ref|YP_003407791.1| HhH-GPD family protein [Geodermatophilus obscurus DSM 43160]
gi|284062482|gb|ADB73420.1| HhH-GPD family protein [Geodermatophilus obscurus DSM 43160]
Length=300
Score = 297 bits (760), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 175/282 (63%), Positives = 198/282 (71%), Gaps = 4/282 (1%)
Query 21 LLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAS 80
L+ W+ + RDLPWR PG PW +LVSE MLQQTP ARV +W W+ RWPTP+A A AS
Sbjct 19 LVDWFADAARDLPWRRPGTDPWAVLVSEVMLQQTPVARVEPVWRQWMGRWPTPAALAAAS 78
Query 81 TADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFA 140
A+V+RAWGKLGYPRRA RL E A + H VVP D+ L LPG+G+YTARAVACF
Sbjct 79 PAEVIRAWGKLGYPRRALRLRETAVALTERHGGVVPADVAALEALPGIGTYTARAVACFG 138
Query 141 YRQRVPVVDTNVRRVVARAVHGRADAGAPSVPRDHADVLALLPHRET-APEFSVALMELG 199
+ Q PVVDTNVRRVVAR VHGRA+A + D DV AL P A FSVA+MELG
Sbjct 139 HGQPQPVVDTNVRRVVARLVHGRAEAAP-ARAADLTDVAALAPADPGRAARFSVAVMELG 197
Query 200 ATVCTARTPRCGLCPLDW-CAWRHAGYPPSDGPPRRGQAYTGTDRQVRGRLLDVLRAAEF 258
A VC ARTPRC CP+ CAWR AG P DGPPRR Q + GTDRQVRGRLLDVLR+AE
Sbjct 198 ALVCVARTPRCAACPVRTDCAWRLAGSPAPDGPPRRVQKFAGTDRQVRGRLLDVLRSAEG 257
Query 259 PVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALP 300
PV +LD AW D QR R L+SLLAD LV +T DGRF LP
Sbjct 258 PVAAPQLDEAW-ADAVQRSRCLDSLLADGLVEQTADGRFTLP 298
Lambda K H
0.322 0.136 0.445
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 511987440690
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40