BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3074
Length=424
Sequences producing significant alignments: Score (Bits) E Value
gi|339295909|gb|AEJ48020.1| hypothetical protein CCDC5079_2830 [... 840 0.0
gi|15610211|ref|NP_217590.1| hypothetical protein Rv3074 [Mycoba... 839 0.0
gi|254365702|ref|ZP_04981747.1| conserved hypothetical protein [... 838 0.0
gi|167969679|ref|ZP_02551956.1| hypothetical protein MtubH3_1730... 838 0.0
gi|340628065|ref|YP_004746517.1| hypothetical protein MCAN_31001... 836 0.0
gi|289755191|ref|ZP_06514569.1| LOW QUALITY PROTEIN: conserved h... 694 0.0
gi|183981600|ref|YP_001849891.1| hypothetical protein MMAR_1584 ... 665 0.0
gi|342860971|ref|ZP_08717620.1| hypothetical protein MCOL_18907 ... 615 5e-174
gi|41409249|ref|NP_962085.1| hypothetical protein MAP3151 [Mycob... 606 2e-171
gi|118464996|ref|YP_883136.1| hypothetical protein MAV_3980 [Myc... 595 4e-168
gi|296169248|ref|ZP_06850901.1| HNH nuclease [Mycobacterium para... 568 8e-160
gi|254776396|ref|ZP_05217912.1| hypothetical protein MaviaA2_172... 566 3e-159
gi|298526546|ref|ZP_07013955.1| conserved hypothetical protein [... 559 4e-157
gi|254550391|ref|ZP_05140838.1| hypothetical protein Mtube_08010... 556 2e-156
gi|323720131|gb|EGB29235.1| hypothetical protein TMMG_02079 [Myc... 555 6e-156
gi|308375447|ref|ZP_07443945.2| hypothetical protein TMGG_01948 ... 554 1e-155
gi|15608518|ref|NP_215894.1| hypothetical protein Rv1378c [Mycob... 554 1e-155
gi|31792572|ref|NP_855065.1| hypothetical protein Mb1413c [Mycob... 553 1e-155
gi|289554832|ref|ZP_06444042.1| conserved hypothetical protein [... 553 1e-155
gi|121637308|ref|YP_977531.1| hypothetical protein BCG_1439c [My... 553 2e-155
gi|308371953|ref|ZP_07667276.1| hypothetical protein TMDG_03491 ... 552 3e-155
gi|289574045|ref|ZP_06454272.1| conserved hypothetical protein [... 552 3e-155
gi|340626392|ref|YP_004744844.1| hypothetical protein MCAN_13941... 552 4e-155
gi|339631445|ref|YP_004723087.1| hypothetical protein MAF_14000 ... 551 6e-155
gi|15840836|ref|NP_335873.1| hypothetical protein MT1422 [Mycoba... 551 6e-155
gi|108798576|ref|YP_638773.1| hypothetical protein Mmcs_1606 [My... 543 2e-152
gi|289571279|ref|ZP_06451506.1| LOW QUALITY PROTEIN: conserved h... 534 1e-149
gi|126432903|ref|YP_001068594.1| hypothetical protein Mjls_0290 ... 523 3e-146
gi|254821166|ref|ZP_05226167.1| hypothetical protein MintA_14612... 521 8e-146
gi|289553135|ref|ZP_06442345.1| conserved hypothetical protein [... 515 4e-144
gi|240168682|ref|ZP_04747341.1| hypothetical protein MkanA1_0517... 509 2e-142
gi|108797279|ref|YP_637476.1| hypothetical protein Mmcs_0299 [My... 502 4e-140
gi|108801662|ref|YP_641859.1| hypothetical protein Mmcs_4699 [My... 500 2e-139
gi|126437648|ref|YP_001073339.1| hypothetical protein Mjls_5084 ... 499 3e-139
gi|118469447|ref|YP_890248.1| hypothetical protein MSMEG_6025 [M... 496 2e-138
gi|307083945|ref|ZP_07493058.1| hypothetical protein TMLG_03064 ... 490 1e-136
gi|120405676|ref|YP_955505.1| hypothetical protein Mvan_4725 [My... 477 2e-132
gi|120406245|ref|YP_956074.1| hypothetical protein Mvan_5297 [My... 462 6e-128
gi|315446192|ref|YP_004079071.1| hypothetical protein Mspyr1_469... 457 2e-126
gi|315444664|ref|YP_004077543.1| hypothetical protein Mspyr1_309... 454 1e-125
gi|145224332|ref|YP_001135010.1| HNH nuclease [Mycobacterium gil... 454 2e-125
gi|145223173|ref|YP_001133851.1| hypothetical protein Mflv_2586 ... 453 3e-125
gi|120404582|ref|YP_954411.1| hypothetical protein Mvan_3614 [My... 453 3e-125
gi|169628995|ref|YP_001702644.1| hypothetical protein MAB_1907 [... 440 2e-121
gi|145223484|ref|YP_001134162.1| HNH nuclease [Mycobacterium gil... 440 2e-121
gi|119963190|ref|YP_946206.1| hypothetical protein AAur_0393 [Ar... 399 4e-109
gi|226361477|ref|YP_002779255.1| hypothetical protein ROP_20630 ... 390 2e-106
gi|343924565|ref|ZP_08764113.1| hypothetical protein GOALK_017_0... 384 1e-104
gi|111019339|ref|YP_702311.1| hypothetical protein RHA1_ro02347 ... 384 2e-104
gi|119962228|ref|YP_946315.1| HNH endonuclease domain-containing... 380 3e-103
>gi|339295909|gb|AEJ48020.1| hypothetical protein CCDC5079_2830 [Mycobacterium tuberculosis
CCDC5079]
Length=432
Score = 840 bits (2169), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/424 (100%), Positives = 424/424 (100%), Gaps = 0/424 (0%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR
Sbjct 9 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 68
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL
Sbjct 69 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 128
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP
Sbjct 129 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 188
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI
Sbjct 189 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 248
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA
Sbjct 249 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 308
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE
Sbjct 309 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 368
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 420
RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA
Sbjct 369 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 428
Query 421 RYAA 424
RYAA
Sbjct 429 RYAA 432
>gi|15610211|ref|NP_217590.1| hypothetical protein Rv3074 [Mycobacterium tuberculosis H37Rv]
gi|15842644|ref|NP_337681.1| hypothetical protein MT3159 [Mycobacterium tuberculosis CDC1551]
gi|31794253|ref|NP_856746.1| hypothetical protein Mb3101 [Mycobacterium bovis AF2122/97]
62 more sequence titles
Length=424
Score = 839 bits (2167), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/424 (100%), Positives = 424/424 (100%), Gaps = 0/424 (0%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR
Sbjct 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL
Sbjct 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP
Sbjct 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI
Sbjct 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA
Sbjct 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE
Sbjct 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 420
RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA
Sbjct 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 420
Query 421 RYAA 424
RYAA
Sbjct 421 RYAA 424
>gi|254365702|ref|ZP_04981747.1| conserved hypothetical protein [Mycobacterium tuberculosis str.
Haarlem]
gi|134151215|gb|EBA43260.1| conserved hypothetical protein [Mycobacterium tuberculosis str.
Haarlem]
Length=432
Score = 838 bits (2165), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/424 (99%), Positives = 424/424 (100%), Gaps = 0/424 (0%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
+FETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR
Sbjct 9 IFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 68
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL
Sbjct 69 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 128
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP
Sbjct 129 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 188
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI
Sbjct 189 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 248
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA
Sbjct 249 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 308
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE
Sbjct 309 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 368
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 420
RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA
Sbjct 369 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 428
Query 421 RYAA 424
RYAA
Sbjct 429 RYAA 432
>gi|167969679|ref|ZP_02551956.1| hypothetical protein MtubH3_17308 [Mycobacterium tuberculosis
H37Ra]
Length=424
Score = 838 bits (2164), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/424 (99%), Positives = 424/424 (100%), Gaps = 0/424 (0%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR
Sbjct 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL
Sbjct 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP
Sbjct 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI
Sbjct 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
AVNLVMSD+TLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA
Sbjct 241 AVNLVMSDDTLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE
Sbjct 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 420
RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA
Sbjct 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 420
Query 421 RYAA 424
RYAA
Sbjct 421 RYAA 424
>gi|340628065|ref|YP_004746517.1| hypothetical protein MCAN_31001 [Mycobacterium canettii CIPT
140010059]
gi|340006255|emb|CCC45431.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=424
Score = 836 bits (2159), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/424 (99%), Positives = 423/424 (99%), Gaps = 0/424 (0%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MF+TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR
Sbjct 1 MFDTLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL
Sbjct 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP
Sbjct 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRD AVPTPI
Sbjct 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDTAVPTPI 240
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA
Sbjct 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE
Sbjct 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 420
RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA
Sbjct 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 420
Query 421 RYAA 424
RYAA
Sbjct 421 RYAA 424
>gi|289755191|ref|ZP_06514569.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis EAS054]
gi|289695778|gb|EFD63207.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis EAS054]
Length=407
Score = 694 bits (1790), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/345 (99%), Positives = 345/345 (100%), Gaps = 0/345 (0%)
Query 80 SRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRALDAELCGDPGDL 139
+RHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRALDAELCGDPGDL
Sbjct 63 ARHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRALDAELCGDPGDL 122
Query 140 EGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVS 199
EGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVS
Sbjct 123 EGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVS 182
Query 200 VYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAANTPA 259
VYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAANTPA
Sbjct 183 VYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAANTPA 242
Query 260 QLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESRARLFPRGLAAFI 319
QLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESRARLFPRGLAAFI
Sbjct 243 QLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESRARLFPRGLAAFI 302
Query 320 ELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPGWRVSTSVDE 379
ELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPGWRVSTSVDE
Sbjct 303 ELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPGWRVSTSVDE 362
Query 380 NHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALARYAA 424
NHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALARYAA
Sbjct 363 NHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALARYAA 407
>gi|183981600|ref|YP_001849891.1| hypothetical protein MMAR_1584 [Mycobacterium marinum M]
gi|183174926|gb|ACC40036.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=424
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/424 (85%), Positives = 389/424 (92%), Gaps = 0/424 (0%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE+L A+DP A++AAL+ERIAELE +KSAAAAGQARAAAA+D ARRAAE AAGVPAARR
Sbjct 1 MFESLAAVDPAADQAALVERIAELETVKSAAAAGQARAAAALDTARRAAESAAGVPAARR 60
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRGLA+EIALARRDSPARGSRHLGFAKALV+EMPHTLAAL+CG LSEWRATLIVRESACL
Sbjct 61 GRGLANEIALARRDSPARGSRHLGFAKALVHEMPHTLAALECGLLSEWRATLIVRESACL 120
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
D+ RR LDAELCGDP LEGMGDARV AAA+AIAYRLDP AVVDRAA AE +RTVTIRP
Sbjct 121 DIEHRRELDAELCGDPSGLEGMGDARVAAAAKAIAYRLDPHAVVDRAAKAERERTVTIRP 180
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
APDTMTYLTALLPVAQG+SVYAAL RAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI
Sbjct 181 APDTMTYLTALLPVAQGISVYAALRRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
AVNLV++DE+LLGA + PA +CGYGPIPAAVARTMVA AV D RSRATLRRLYAHP+AGA
Sbjct 241 AVNLVLTDESLLGADSAPADVCGYGPIPAAVARTMVADAVADGRSRATLRRLYAHPRAGA 300
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHA PWA GGPT+A NGLG+CE
Sbjct 301 LVAMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHARPWAKGGPTTADNGLGSCE 360
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 420
RCNYAK+A GWRV TS+ E HTHTAEF TPTG+ +RSGAPP + VT SELEVR+GI+LA
Sbjct 361 RCNYAKEAYGWRVETSMHETHTHTAEFTTPTGTSYRSGAPPRVQTVTASELEVRVGISLA 420
Query 421 RYAA 424
R+AA
Sbjct 421 RHAA 424
>gi|342860971|ref|ZP_08717620.1| hypothetical protein MCOL_18907 [Mycobacterium colombiense CECT
3035]
gi|342131415|gb|EGT84685.1| hypothetical protein MCOL_18907 [Mycobacterium colombiense CECT
3035]
Length=424
Score = 615 bits (1586), Expect = 5e-174, Method: Compositional matrix adjust.
Identities = 330/424 (78%), Positives = 366/424 (87%), Gaps = 0/424 (0%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE L +DP A+E+ALIERIAELE KSAAAAGQARAAA +DA RRA E AAGVPA RR
Sbjct 1 MFEELATVDPAADESALIERIAELETAKSAAAAGQARAAAELDALRRATEAAAGVPATRR 60
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRG+A E+ALARRD+P+RG RHLGFA ALV+EMPHTLAAL+CGALSEWRATLIVRESACL
Sbjct 61 GRGVAGEVALARRDAPSRGGRHLGFATALVHEMPHTLAALECGALSEWRATLIVRESACL 120
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
DV DRRALDAELC D L G+GDARV AAA+AIAYRLDP AVV+RAA AEN+RTVTIRP
Sbjct 121 DVEDRRALDAELCADLASLSGLGDARVAAAAKAIAYRLDPHAVVERAAKAENERTVTIRP 180
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
APDTMTYLTALLPVAQGVSVYAAL R ADTR DGR RGQVMADTLVER+TGR A+VP P
Sbjct 181 APDTMTYLTALLPVAQGVSVYAALRREADTRGDGRPRGQVMADTLVERITGRSASVPVPT 240
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
AVNLV+SDETLLG + PA + GYGP+PA VAR MVA AVTD+RSRATLRRLYAHP+AGA
Sbjct 241 AVNLVLSDETLLGGDDAPATISGYGPVPAPVARAMVAGAVTDRRSRATLRRLYAHPRAGA 300
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MESRAR+FPRGLA FI LRDQ CRTPYCDAP+RHRDHA PWA GG T+A NGLG CE
Sbjct 301 LVAMESRARIFPRGLAEFIGLRDQSCRTPYCDAPVRHRDHAKPWARGGRTTADNGLGLCE 360
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 420
CNY K+ GWRVST+VDENHTHTA F TPTG +RS APP + +TVSE+E+RIG++LA
Sbjct 361 ACNYVKENAGWRVSTAVDENHTHTALFTTPTGKSYRSTAPPPVLRITVSEVELRIGVSLA 420
Query 421 RYAA 424
R+AA
Sbjct 421 RHAA 424
>gi|41409249|ref|NP_962085.1| hypothetical protein MAP3151 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41398069|gb|AAS05699.1| hypothetical protein MAP_3151 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336459196|gb|EGO38141.1| hypothetical protein MAPs_04970 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=428
Score = 606 bits (1563), Expect = 2e-171, Method: Compositional matrix adjust.
Identities = 341/424 (81%), Positives = 370/424 (88%), Gaps = 0/424 (0%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
+FE+L A+DP A E+ALIERI+ELE LK AAAA QARAAAA+DAARRAAE AAGVPAARR
Sbjct 5 VFESLMAVDPAAGESALIERISELEMLKCAAAAAQARAAAALDAARRAAEAAAGVPAARR 64
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRG+ASE+ALARRDSPARG RHLGFAKALV+EMPHTLAAL+CGALSEWRATLIVRESACL
Sbjct 65 GRGVASEVALARRDSPARGGRHLGFAKALVHEMPHTLAALECGALSEWRATLIVRESACL 124
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
D DRRALDAELC DP L GMGDARV AAA+AIAYRLDP AVV+RAA AENDRTVTIRP
Sbjct 125 DAEDRRALDAELCADPAGLSGMGDARVAAAAKAIAYRLDPHAVVERAAKAENDRTVTIRP 184
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
APDTMTYLTALLPVAQGVSVYAAL R ADTR DGR RGQVMADTLVERVTGR A VPTP+
Sbjct 185 APDTMTYLTALLPVAQGVSVYAALRREADTRGDGRPRGQVMADTLVERVTGRRATVPTPV 244
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
AVNLV+SDETLLG A+ P ++ GYGPIPAAVAR MVA+A D RSRATLRRLYAHP+AGA
Sbjct 245 AVNLVLSDETLLGGADAPGEISGYGPIPAAVARRMVANAAADPRSRATLRRLYAHPRAGA 304
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MESRARLFP+GLA FI LRDQRCRTPYCDAPIRHRDHA PWADGG TSA NGLG CE
Sbjct 305 LVAMESRARLFPQGLARFIGLRDQRCRTPYCDAPIRHRDHAQPWADGGATSAGNGLGLCE 364
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALA 420
CNY K+ PGW VS VDE HTHTA F TPTG +RS APP PA+T+S +EVR+ +A A
Sbjct 365 HCNYVKETPGWTVSAGVDETHTHTALFTTPTGQTYRSTAPPRAPAITMSTVEVRVTVAFA 424
Query 421 RYAA 424
R+AA
Sbjct 425 RHAA 428
>gi|118464996|ref|YP_883136.1| hypothetical protein MAV_3980 [Mycobacterium avium 104]
gi|118166283|gb|ABK67180.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=419
Score = 595 bits (1535), Expect = 4e-168, Method: Compositional matrix adjust.
Identities = 336/418 (81%), Positives = 364/418 (88%), Gaps = 0/418 (0%)
Query 7 AIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARRGRGLAS 66
A+DP A E+ALIERI+ELE LK AAAA QARAAAA+DAARRAAE AAGVPAARRGRG+AS
Sbjct 2 AVDPAAGESALIERISELETLKCAAAAAQARAAAALDAARRAAEAAAGVPAARRGRGVAS 61
Query 67 EIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRR 126
E+ALARRDSPARG RHLGFAKALV+EMPHTLAAL+CGALSEWRATLIVRESACLD DRR
Sbjct 62 EVALARRDSPARGGRHLGFAKALVHEMPHTLAALECGALSEWRATLIVRESACLDAEDRR 121
Query 127 ALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMT 186
ALDAELC DP L GMGDARV AAA+AIAYRLDP AVV+RAA AENDRTVTIRPAPDTMT
Sbjct 122 ALDAELCADPAGLSGMGDARVAAAAKAIAYRLDPHAVVERAAKAENDRTVTIRPAPDTMT 181
Query 187 YLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVM 246
YLTALLPVAQGVSVYAAL R ADTR DGR RGQVMADTLVERVTGR + VPTP+AVNLV+
Sbjct 182 YLTALLPVAQGVSVYAALRREADTRGDGRPRGQVMADTLVERVTGRRSTVPTPVAVNLVL 241
Query 247 SDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMES 306
SDETLLG A+ P ++ GYGPIPAAVAR MVA+A D RSRATLRRLYAHP+AGALV+MES
Sbjct 242 SDETLLGGADAPGEISGYGPIPAAVARRMVANAAADPRSRATLRRLYAHPRAGALVAMES 301
Query 307 RARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAK 366
RARLFP+GLA FI LRDQRCRTPYCDAPIRHRDHA PWADGG TSA NGLG CE CNY K
Sbjct 302 RARLFPQGLARFIGLRDQRCRTPYCDAPIRHRDHAQPWADGGATSAGNGLGLCEHCNYVK 361
Query 367 QAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALARYAA 424
+ GW VS VDE HTHTA F TPTG +RS APP PA+T+S +EVR+ +A AR+AA
Sbjct 362 ETSGWTVSAGVDETHTHTALFTTPTGQTYRSTAPPRAPAITMSTVEVRVAVAFARHAA 419
>gi|296169248|ref|ZP_06850901.1| HNH nuclease [Mycobacterium parascrofulaceum ATCC BAA-614]
gi|295896146|gb|EFG75813.1| HNH nuclease [Mycobacterium parascrofulaceum ATCC BAA-614]
Length=358
Score = 568 bits (1463), Expect = 8e-160, Method: Compositional matrix adjust.
Identities = 288/358 (81%), Positives = 316/358 (89%), Gaps = 0/358 (0%)
Query 67 EIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRR 126
E+ALARRDSPARG RHLGFAKALV EMPHTLAAL+ GALSEWRATLIVRESACLDV DRR
Sbjct 1 EVALARRDSPARGGRHLGFAKALVCEMPHTLAALERGALSEWRATLIVRESACLDVEDRR 60
Query 127 ALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMT 186
ALDAELC DP L GMGDARV AAA+AIAYRLDP AVV+RAA AE +RTVTIRPAPDTMT
Sbjct 61 ALDAELCADPASLSGMGDARVAAAAKAIAYRLDPHAVVERAARAEQERTVTIRPAPDTMT 120
Query 187 YLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVM 246
YLTALLPVAQGVSVYAAL R ADTR DGRSRGQVMADTLVERVTGR AAVP+PIAVNLV+
Sbjct 121 YLTALLPVAQGVSVYAALRREADTRGDGRSRGQVMADTLVERVTGRSAAVPSPIAVNLVL 180
Query 247 SDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMES 306
SD+TLLG + PA + GYGPIPAAVAR MV SAV D+RSRATLRRLYAHP+ GALV+MES
Sbjct 181 SDQTLLGGDHAPADIAGYGPIPAAVARAMVGSAVADRRSRATLRRLYAHPRTGALVAMES 240
Query 307 RARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAK 366
R+R+FPRGLAAFI LRDQRCRTPYCDAP+RHRDHA PWADGG T+A NGLG CE CNY K
Sbjct 241 RSRIFPRGLAAFIGLRDQRCRTPYCDAPVRHRDHAQPWADGGATTADNGLGLCENCNYVK 300
Query 367 QAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALARYAA 424
++ GWRV+T+VDEN THTA F TPTG +RS APP P +TVSELE+R+G++LAR+AA
Sbjct 301 ESAGWRVTTTVDENRTHTALFTTPTGKSYRSAAPPRAPTITVSELEIRVGVSLARHAA 358
>gi|254776396|ref|ZP_05217912.1| hypothetical protein MaviaA2_17254 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=361
Score = 566 bits (1458), Expect = 3e-159, Method: Compositional matrix adjust.
Identities = 289/361 (81%), Positives = 313/361 (87%), Gaps = 0/361 (0%)
Query 64 LASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVA 123
+ASE+ALARRDSPARG RHLGFAKALV+EMPHTLAAL+CGALSEWRATLIVRESACLD
Sbjct 1 MASEVALARRDSPARGGRHLGFAKALVHEMPHTLAALECGALSEWRATLIVRESACLDAE 60
Query 124 DRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPD 183
DRRALDAELC DP L GMGDARV AAA+AIAYRLDP AVV+RAA AENDRTVTIRPAPD
Sbjct 61 DRRALDAELCADPAGLSGMGDARVAAAAKAIAYRLDPHAVVERAAKAENDRTVTIRPAPD 120
Query 184 TMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVN 243
TMTYLTALLPVAQGVSVYAAL R ADTR DGR RGQVMADTLVERVTGR A VPTP+AVN
Sbjct 121 TMTYLTALLPVAQGVSVYAALRREADTRGDGRPRGQVMADTLVERVTGRRATVPTPVAVN 180
Query 244 LVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVS 303
LV+SDETLLG A+ P ++ GYGPIPAAVAR MVA+A D RSRATLRRLYAHP+AGALV+
Sbjct 181 LVLSDETLLGGADAPGEISGYGPIPAAVARRMVANAAADPRSRATLRRLYAHPRAGALVA 240
Query 304 MESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCN 363
MESRARLFP+GLA FI LRDQRCRTPYCDAPIRHRDHA PWADGG TSA NGLG CE CN
Sbjct 241 MESRARLFPQGLARFIGLRDQRCRTPYCDAPIRHRDHAQPWADGGATSAGNGLGLCEHCN 300
Query 364 YAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALARYA 423
Y K+ GW VS VDE H HTA F TPTG +RS APP PA+T+S++EVR+ +A AR+A
Sbjct 301 YVKETAGWTVSAGVDETHIHTALFTTPTGQTYRSTAPPRAPAITMSKVEVRVAVAFARHA 360
Query 424 A 424
A
Sbjct 361 A 361
>gi|298526546|ref|ZP_07013955.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|298496340|gb|EFI31634.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
Length=296
Score = 559 bits (1440), Expect = 4e-157, Method: Compositional matrix adjust.
Identities = 279/289 (97%), Positives = 280/289 (97%), Gaps = 0/289 (0%)
Query 136 PGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTALLPVA 195
P    G     VVAAARAIA+RLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTALLPVA
Sbjct 8 PATWRGWAMRGVVAAARAIAHRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTALLPVA 67
Query 196 QGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAA 255
QGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAA
Sbjct 68 QGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAA 127
Query 256 NTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESRARLFPRGL 315
NTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESRARLFPRGL
Sbjct 128 NTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESRARLFPRGL 187
Query 316 AAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPGWRVST 375
AAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPGWRVST
Sbjct 188 AAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPGWRVST 247
Query 376 SVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALARYAA 424
SVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALARYAA
Sbjct 248 SVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSELEVRIGIALARYAA 296
>gi|254550391|ref|ZP_05140838.1| hypothetical protein Mtube_08010 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|308231826|ref|ZP_07663944.1| hypothetical protein TMAG_03382 [Mycobacterium tuberculosis SUMu001]
gi|308369785|ref|ZP_07666807.1| hypothetical protein TMBG_01206 [Mycobacterium tuberculosis SUMu002]
11 more sequence titles
Length=429
Score = 556 bits (1434), Expect = 2e-156, Method: Compositional matrix adjust.
Identities = 312/424 (74%), Positives = 347/424 (82%), Gaps = 6/424 (1%)
Query 1 MFE--TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAA 58
MF+ +L +D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AGVPA
Sbjct 1 MFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGVPAR 60
Query 59 RRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESA 118
RRGRG+ASE+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESA
Sbjct 61 RRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESA 120
Query 119 CLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTI 178
CLDV DRRALDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTI
Sbjct 121 CLDVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTI 180
Query 179 RPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPT 238
RPAPDTMT++TALLPVA+GVSVYAAL RAADT D R+RGQVMADTLVERVTG+ A
Sbjct 181 RPAPDTMTWVTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVERVTGQPAEAAQ 240
Query 239 PIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQA 298
P+AVNLV+SDETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++
Sbjct 241 PVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRS 300
Query 299 GALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGT 358
GALV+MESRAR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+
Sbjct 301 GALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGS 360
Query 359 CERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRI 415
CERCNY K+APGWRVST DE HTAEF TPTG + APP LP + VS++E RI
Sbjct 361 CERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARI 419
Query 416 GIAL 419
G+AL
Sbjct 420 GVAL 423
>gi|323720131|gb|EGB29235.1| hypothetical protein TMMG_02079 [Mycobacterium tuberculosis CDC1551A]
Length=429
Score = 555 bits (1430), Expect = 6e-156, Method: Compositional matrix adjust.
Identities = 311/424 (74%), Positives = 347/424 (82%), Gaps = 6/424 (1%)
Query 1 MFE--TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAA 58
MF+ +L +D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AGVPA
Sbjct 1 MFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGVPAR 60
Query 59 RRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESA 118
RRGRG+ASE+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESA
Sbjct 61 RRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESA 120
Query 119 CLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTI 178
CLDV DRRALDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTI
Sbjct 121 CLDVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTI 180
Query 179 RPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPT 238
RPAPDTMT++TALLPVA+GVSVYAAL RAA+T D R+RGQVMADTLVERVTG+ A
Sbjct 181 RPAPDTMTWVTALLPVARGVSVYAALKRAAETTFDDRTRGQVMADTLVERVTGQPAEAAQ 240
Query 239 PIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQA 298
P+AVNLV+SDETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++
Sbjct 241 PVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRS 300
Query 299 GALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGT 358
GALV+MESRAR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+
Sbjct 301 GALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGS 360
Query 359 CERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRI 415
CERCNY K+APGWRVST DE HTAEF TPTG + APP LP + VS++E RI
Sbjct 361 CERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARI 419
Query 416 GIAL 419
G+AL
Sbjct 420 GVAL 423
>gi|308375447|ref|ZP_07443945.2| hypothetical protein TMGG_01948 [Mycobacterium tuberculosis SUMu007]
gi|308346301|gb|EFP35152.1| hypothetical protein TMGG_01948 [Mycobacterium tuberculosis SUMu007]
Length=503
Score = 554 bits (1428), Expect = 1e-155, Method: Compositional matrix adjust.
Identities = 312/424 (74%), Positives = 347/424 (82%), Gaps = 6/424 (1%)
Query 1 MFE--TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAA 58
MF+ +L +D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AGVPA
Sbjct 75 MFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGVPAR 134
Query 59 RRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESA 118
RRGRG+ASE+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESA
Sbjct 135 RRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESA 194
Query 119 CLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTI 178
CLDV DRRALDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTI
Sbjct 195 CLDVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTI 254
Query 179 RPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPT 238
RPAPDTMT++TALLPVA+GVSVYAAL RAADT D R+RGQVMADTLVERVTG+ A
Sbjct 255 RPAPDTMTWVTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVERVTGQPAEAAQ 314
Query 239 PIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQA 298
P+AVNLV+SDETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++
Sbjct 315 PVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRS 374
Query 299 GALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGT 358
GALV+MESRAR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+
Sbjct 375 GALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGS 434
Query 359 CERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRI 415
CERCNY K+APGWRVST DE HTAEF TPTG + APP LP + VS++E RI
Sbjct 435 CERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARI 493
Query 416 GIAL 419
G+AL
Sbjct 494 GVAL 497
>gi|15608518|ref|NP_215894.1| hypothetical protein Rv1378c [Mycobacterium tuberculosis H37Rv]
gi|148661169|ref|YP_001282692.1| hypothetical protein MRA_1387 [Mycobacterium tuberculosis H37Ra]
gi|167968430|ref|ZP_02550707.1| hypothetical protein MtubH3_10496 [Mycobacterium tuberculosis
H37Ra]
gi|1621260|emb|CAB02639.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium tuberculosis H37Rv]
gi|148505321|gb|ABQ73130.1| hypothetical protein MRA_1387 [Mycobacterium tuberculosis H37Ra]
Length=475
Score = 554 bits (1427), Expect = 1e-155, Method: Compositional matrix adjust.
Identities = 312/424 (74%), Positives = 347/424 (82%), Gaps = 6/424 (1%)
Query 1 MFE--TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAA 58
MF+ +L +D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AGVPA
Sbjct 47 MFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGVPAR 106
Query 59 RRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESA 118
RRGRG+ASE+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESA
Sbjct 107 RRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESA 166
Query 119 CLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTI 178
CLDV DRRALDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTI
Sbjct 167 CLDVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTI 226
Query 179 RPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPT 238
RPAPDTMT++TALLPVA+GVSVYAAL RAADT D R+RGQVMADTLVERVTG+ A
Sbjct 227 RPAPDTMTWVTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVERVTGQPAEAAQ 286
Query 239 PIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQA 298
P+AVNLV+SDETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++
Sbjct 287 PVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRS 346
Query 299 GALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGT 358
GALV+MESRAR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+
Sbjct 347 GALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGS 406
Query 359 CERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRI 415
CERCNY K+APGWRVST DE HTAEF TPTG + APP LP + VS++E RI
Sbjct 407 CERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARI 465
Query 416 GIAL 419
G+AL
Sbjct 466 GVAL 469
>gi|31792572|ref|NP_855065.1| hypothetical protein Mb1413c [Mycobacterium bovis AF2122/97]
gi|31618161|emb|CAD94274.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
Length=475
Score = 553 bits (1426), Expect = 1e-155, Method: Compositional matrix adjust.
Identities = 312/424 (74%), Positives = 347/424 (82%), Gaps = 6/424 (1%)
Query 1 MFE--TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAA 58
MF+ +L +D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AGVPA
Sbjct 47 MFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGVPAR 106
Query 59 RRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESA 118
RRGRG+ASE+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESA
Sbjct 107 RRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESA 166
Query 119 CLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTI 178
CLDV DRRALDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTI
Sbjct 167 CLDVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTI 226
Query 179 RPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPT 238
RPAPDTMT++TALLPVA+GVSVYAAL RAADT D R+RGQVMADTLVERVTG+ A
Sbjct 227 RPAPDTMTWVTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVERVTGQPAEAAQ 286
Query 239 PIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQA 298
P+AVNLV+SDETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++
Sbjct 287 PVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRS 346
Query 299 GALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGT 358
GALV+MESRAR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+
Sbjct 347 GALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGS 406
Query 359 CERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRI 415
CERCNY K+APGWRVST DE HTAEF TPTG + APP LP + VS++E RI
Sbjct 407 CERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARI 465
Query 416 GIAL 419
G+AL
Sbjct 466 GVAL 469
>gi|289554832|ref|ZP_06444042.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
gi|289439464|gb|EFD21957.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
Length=443
Score = 553 bits (1426), Expect = 1e-155, Method: Compositional matrix adjust.
Identities = 312/424 (74%), Positives = 347/424 (82%), Gaps = 6/424 (1%)
Query 1 MFE--TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAA 58
MF+ +L +D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AGVPA
Sbjct 15 MFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGVPAR 74
Query 59 RRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESA 118
RRGRG+ASE+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESA
Sbjct 75 RRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESA 134
Query 119 CLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTI 178
CLDV DRRALDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTI
Sbjct 135 CLDVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTI 194
Query 179 RPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPT 238
RPAPDTMT++TALLPVA+GVSVYAAL RAADT D R+RGQVMADTLVERVTG+ A
Sbjct 195 RPAPDTMTWVTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVERVTGQPAEAAQ 254
Query 239 PIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQA 298
P+AVNLV+SDETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++
Sbjct 255 PVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRS 314
Query 299 GALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGT 358
GALV+MESRAR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+
Sbjct 315 GALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGS 374
Query 359 CERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRI 415
CERCNY K+APGWRVST DE HTAEF TPTG + APP LP + VS++E RI
Sbjct 375 CERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARI 433
Query 416 GIAL 419
G+AL
Sbjct 434 GVAL 437
>gi|121637308|ref|YP_977531.1| hypothetical protein BCG_1439c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|148822599|ref|YP_001287352.1| hypothetical protein TBFG_11407 [Mycobacterium tuberculosis F11]
gi|224989783|ref|YP_002644470.1| hypothetical protein JTY_1414 [Mycobacterium bovis BCG str. Tokyo
172]
29 more sequence titles
Length=475
Score = 553 bits (1426), Expect = 2e-155, Method: Compositional matrix adjust.
Identities = 312/424 (74%), Positives = 347/424 (82%), Gaps = 6/424 (1%)
Query 1 MFE--TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAA 58
MF+ +L +D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AGVPA
Sbjct 47 MFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGVPAR 106
Query 59 RRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESA 118
RRGRG+ASE+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESA
Sbjct 107 RRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESA 166
Query 119 CLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTI 178
CLDV DRRALDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTI
Sbjct 167 CLDVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTI 226
Query 179 RPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPT 238
RPAPDTMT++TALLPVA+GVSVYAAL RAADT D R+RGQVMADTLVERVTG+ A
Sbjct 227 RPAPDTMTWVTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVERVTGQPAEAAQ 286
Query 239 PIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQA 298
P+AVNLV+SDETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++
Sbjct 287 PVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRS 346
Query 299 GALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGT 358
GALV+MESRAR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+
Sbjct 347 GALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGS 406
Query 359 CERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRI 415
CERCNY K+APGWRVST DE HTAEF TPTG + APP LP + VS++E RI
Sbjct 407 CERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARI 465
Query 416 GIAL 419
G+AL
Sbjct 466 GVAL 469
>gi|308371953|ref|ZP_07667276.1| hypothetical protein TMDG_03491 [Mycobacterium tuberculosis SUMu004]
gi|308377714|ref|ZP_07668578.1| hypothetical protein TMIG_03060 [Mycobacterium tuberculosis SUMu009]
gi|308380062|ref|ZP_07669111.1| hypothetical protein TMKG_01882 [Mycobacterium tuberculosis SUMu011]
gi|308334956|gb|EFP23807.1| hypothetical protein TMDG_03491 [Mycobacterium tuberculosis SUMu004]
gi|308354872|gb|EFP43723.1| hypothetical protein TMIG_03060 [Mycobacterium tuberculosis SUMu009]
gi|308362760|gb|EFP51611.1| hypothetical protein TMKG_01882 [Mycobacterium tuberculosis SUMu011]
Length=420
Score = 552 bits (1423), Expect = 3e-155, Method: Compositional matrix adjust.
Identities = 309/415 (75%), Positives = 342/415 (83%), Gaps = 4/415 (0%)
Query 8 IDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARRGRGLASE 67
+D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AGVPA RRGRG+ASE
Sbjct 1 MDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGVPARRRGRGVASE 60
Query 68 IALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRA 127
+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESACLDV DRRA
Sbjct 61 VALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESACLDVEDRRA 120
Query 128 LDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTY 187
LDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTIRPAPDTMT+
Sbjct 121 LDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTIRPAPDTMTW 180
Query 188 LTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMS 247
+TALLPVA+GVSVYAAL RAADT D R+RGQVMADTLVERVTG+ A P+AVNLV+S
Sbjct 181 VTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVERVTGQPAEAAQPVAVNLVLS 240
Query 248 DETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESR 307
DETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++GALV+MESR
Sbjct 241 DETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRSGALVAMESR 300
Query 308 ARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQ 367
AR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+CERCNY K+
Sbjct 301 ARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGSCERCNYVKE 360
Query 368 APGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRIGIAL 419
APGWRVST DE HTAEF TPTG + APP LP + VS++E RIG+AL
Sbjct 361 APGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARIGVAL 414
>gi|289574045|ref|ZP_06454272.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|289538476|gb|EFD43054.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
Length=475
Score = 552 bits (1423), Expect = 3e-155, Method: Compositional matrix adjust.
Identities = 311/424 (74%), Positives = 347/424 (82%), Gaps = 6/424 (1%)
Query 1 MFE--TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAA 58
MF+ +L +D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AG+PA
Sbjct 47 MFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGLPAR 106
Query 59 RRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESA 118
RRGRG+ASE+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESA
Sbjct 107 RRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESA 166
Query 119 CLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTI 178
CLDV DRRALDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTI
Sbjct 167 CLDVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTI 226
Query 179 RPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPT 238
RPAPDTMT++TALLPVA+GVSVYAAL RAADT D R+RGQVMADTLVERVTG+ A
Sbjct 227 RPAPDTMTWVTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVERVTGQPAEAAQ 286
Query 239 PIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQA 298
P+AVNLV+SDETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++
Sbjct 287 PVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRS 346
Query 299 GALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGT 358
GALV+MESRAR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+
Sbjct 347 GALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGS 406
Query 359 CERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRI 415
CERCNY K+APGWRVST DE HTAEF TPTG + APP LP + VS++E RI
Sbjct 407 CERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARI 465
Query 416 GIAL 419
G+AL
Sbjct 466 GVAL 469
>gi|340626392|ref|YP_004744844.1| hypothetical protein MCAN_13941 [Mycobacterium canettii CIPT
140010059]
gi|340004582|emb|CCC43726.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=475
Score = 552 bits (1422), Expect = 4e-155, Method: Compositional matrix adjust.
Identities = 310/419 (74%), Positives = 344/419 (83%), Gaps = 4/419 (0%)
Query 4 TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARRGRG 63
+L +D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AGVPA RRGRG
Sbjct 52 SLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGVPARRRGRG 111
Query 64 LASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVA 123
+ASE+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESACLDV
Sbjct 112 VASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESACLDVE 171
Query 124 DRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPD 183
DRRALDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTIRPAPD
Sbjct 172 DRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTIRPAPD 231
Query 184 TMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVN 243
TMT++TALLPVA+GVSVYAAL RAADT D R+RGQVMADTLVERVTG+ A P+AVN
Sbjct 232 TMTWVTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVERVTGQPAEAAQPVAVN 291
Query 244 LVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVS 303
LV+SDETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++GALV+
Sbjct 292 LVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRSGALVA 351
Query 304 MESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCN 363
MESRAR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+CERCN
Sbjct 352 MESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGSCERCN 411
Query 364 YAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRIGIAL 419
Y K+APGWRVST DE HTAEF TPTG + APP LP + VS++E RIG+AL
Sbjct 412 YVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARIGVAL 469
>gi|339631445|ref|YP_004723087.1| hypothetical protein MAF_14000 [Mycobacterium africanum GM041182]
gi|339330801|emb|CCC26472.1| conserved hypothetical protein [Mycobacterium africanum GM041182]
Length=475
Score = 551 bits (1421), Expect = 6e-155, Method: Compositional matrix adjust.
Identities = 311/424 (74%), Positives = 347/424 (82%), Gaps = 6/424 (1%)
Query 1 MFE--TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAA 58
MF+ +L +D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AGVPA
Sbjct 47 MFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGVPAR 106
Query 59 RRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESA 118
RRGRG+ASE+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESA
Sbjct 107 RRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESA 166
Query 119 CLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTI 178
CL+V DRRALDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTI
Sbjct 167 CLEVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTI 226
Query 179 RPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPT 238
RPAPDTMT++TALLPVA+GVSVYAAL RAADT D R+RGQVMADTLVERVTG+ A
Sbjct 227 RPAPDTMTWVTALLPVARGVSVYAALKRAADTTFDDRTRGQVMADTLVERVTGQPAEAAQ 286
Query 239 PIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQA 298
P+AVNLV+SDETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++
Sbjct 287 PVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRS 346
Query 299 GALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGT 358
GALV+MESRAR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+
Sbjct 347 GALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGS 406
Query 359 CERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRI 415
CERCNY K+APGWRVST DE HTAEF TPTG + APP LP + VS++E RI
Sbjct 407 CERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARI 465
Query 416 GIAL 419
G+AL
Sbjct 466 GVAL 469
>gi|15840836|ref|NP_335873.1| hypothetical protein MT1422 [Mycobacterium tuberculosis CDC1551]
gi|13881034|gb|AAK45687.1| hypothetical protein MT1422 [Mycobacterium tuberculosis CDC1551]
Length=475
Score = 551 bits (1421), Expect = 6e-155, Method: Compositional matrix adjust.
Identities = 311/424 (74%), Positives = 347/424 (82%), Gaps = 6/424 (1%)
Query 1 MFE--TLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAA 58
MF+ +L +D +EA+L RIAELER+KSAAAAGQARAAAA+D RR E AGVPA
Sbjct 47 MFDLVSLAGVDSRDDEASLTARIAELERVKSAAAAGQARAAAALDKLRRCNEADAGVPAR 106
Query 59 RRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESA 118
RRGRG+ASE+ALARRDSPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESA
Sbjct 107 RRGRGVASEVALARRDSPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESA 166
Query 119 CLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTI 178
CLDV DRRALDAELC D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTI
Sbjct 167 CLDVEDRRALDAELCADMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTI 226
Query 179 RPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPT 238
RPAPDTMT++TALLPVA+GVSVYAAL RAA+T D R+RGQVMADTLVERVTG+ A
Sbjct 227 RPAPDTMTWVTALLPVARGVSVYAALKRAAETTFDDRTRGQVMADTLVERVTGQPAEAAQ 286
Query 239 PIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQA 298
P+AVNLV+SDETLL PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++
Sbjct 287 PVAVNLVLSDETLLAGDRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRS 346
Query 299 GALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGT 358
GALV+MESRAR FP+GLAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+
Sbjct 347 GALVAMESRARRFPKGLAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGS 406
Query 359 CERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRI 415
CERCNY K+APGWRVST DE HTAEF TPTG + APP LP + VS++E RI
Sbjct 407 CERCNYVKEAPGWRVSTDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARI 465
Query 416 GIAL 419
G+AL
Sbjct 466 GVAL 469
>gi|108798576|ref|YP_638773.1| hypothetical protein Mmcs_1606 [Mycobacterium sp. MCS]
gi|119867676|ref|YP_937628.1| hypothetical protein Mkms_1631 [Mycobacterium sp. KMS]
gi|108768995|gb|ABG07717.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119693765|gb|ABL90838.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=430
Score = 543 bits (1399), Expect = 2e-152, Method: Compositional matrix adjust.
Identities = 295/414 (72%), Positives = 335/414 (81%), Gaps = 2/414 (0%)
Query 11 DAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARRGRGLASEIAL 70
+A E ALIERIA LER KSAAAA QARA A +D RRAAE AAGVPA +RGRGLASE+AL
Sbjct 13 EASETALIERIAALERAKSAAAAAQARATALLDEKRRAAEAAAGVPANKRGRGLASEVAL 72
Query 71 ARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRALDA 130
AR D P +G RHLGFA+ALV+EMPHTLAAL+CGALSEWRATLIVRESACL V DRR LDA
Sbjct 73 ARHDCPNKGGRHLGFARALVHEMPHTLAALECGALSEWRATLIVRESACLSVEDRRMLDA 132
Query 131 ELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTA 190
ELC D LEG+GD R+ A A+ IAYRLDPQAVVDRAA A ++RTVT RPAPDTMTY+TA
Sbjct 133 ELCRDVSRLEGLGDKRIEAEAKRIAYRLDPQAVVDRAAKAASERTVTCRPAPDTMTYVTA 192
Query 191 LLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDET 250
LLPVAQGV+VYAAL R+ADT D RSRGQVMADTLVERVTG A V P+AVNLV++DE
Sbjct 193 LLPVAQGVAVYAALKRSADTTFDDRSRGQVMADTLVERVTGCPAEVAVPVAVNLVITDEA 252
Query 251 LLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESRARL 310
LLG PA + GYGP+PAAV R +V +AVTD+RS+ATLRRLY P++GALV+MESR+R
Sbjct 253 LLGGDPEPAVISGYGPVPAAVGRRLVDAAVTDKRSKATLRRLYRRPRSGALVAMESRSRC 312
Query 311 FPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPG 370
FP+GLAAFI+LRDQ CRTPYCDAPIRH DHA P GGPTSA NGLG C CNYAK+APG
Sbjct 313 FPKGLAAFIDLRDQTCRTPYCDAPIRHHDHARPHRAGGPTSAANGLGECAACNYAKEAPG 372
Query 371 WRVSTSVDENHTHTAEFITPTGSRHRSGAPPH--LPAVTVSELEVRIGIALARY 422
WRV+TS D H A F TPTG+++ S APP + V +S+LEVRIGI LA +
Sbjct 373 WRVATSCDAEGRHRATFTTPTGTQYHSTAPPSPGMTVVNLSDLEVRIGIELAAF 426
>gi|289571279|ref|ZP_06451506.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T17]
gi|289545033|gb|EFD48681.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T17]
Length=263
Score = 534 bits (1376), Expect = 1e-149, Method: Compositional matrix adjust.
Identities = 263/263 (100%), Positives = 263/263 (100%), Gaps = 0/263 (0%)
Query 162 AVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVM 221
AVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVM
Sbjct 1 AVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVM 60
Query 222 ADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVT 281
ADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVT
Sbjct 61 ADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVT 120
Query 282 DQRSRATLRRLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHA 341
DQRSRATLRRLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHA
Sbjct 121 DQRSRATLRRLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHA 180
Query 342 HPWADGGPTSAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPP 401
HPWADGGPTSAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPP
Sbjct 181 HPWADGGPTSAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPP 240
Query 402 HLPAVTVSELEVRIGIALARYAA 424
HLPAVTVSELEVRIGIALARYAA
Sbjct 241 HLPAVTVSELEVRIGIALARYAA 263
>gi|126432903|ref|YP_001068594.1| hypothetical protein Mjls_0290 [Mycobacterium sp. JLS]
gi|126232703|gb|ABN96103.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=411
Score = 523 bits (1346), Expect = 3e-146, Method: Compositional matrix adjust.
Identities = 285/393 (73%), Positives = 327/393 (84%), Gaps = 0/393 (0%)
Query 8 IDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARRGRGLASE 67
+D + +EAALIERIA LER KSAAAA QARA A +D RRAAE AAGVPA +RGRGLASE
Sbjct 2 VDTEIDEAALIERIAALERAKSAAAAAQARATALLDEKRRAAEAAAGVPANKRGRGLASE 61
Query 68 IALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRA 127
+ALAR D P +G RHLGFA+ LV+EMPHTLAAL+CGALSEWRATLIVRESACL V DRR+
Sbjct 62 VALARHDCPNKGGRHLGFARVLVHEMPHTLAALECGALSEWRATLIVRESACLSVEDRRS 121
Query 128 LDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTY 187
LD ELC D LEG+GD R+ A A+ IAYRLDPQAVVDRAA A ++RTVT RPAPDTMTY
Sbjct 122 LDTELCRDVSSLEGLGDKRIEAEAKKIAYRLDPQAVVDRAARAASERTVTCRPAPDTMTY 181
Query 188 LTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMS 247
+TALLPVAQGV+VYAAL R+ADT D RSRGQVMADTLVERVTG A V P+AVNLV++
Sbjct 182 VTALLPVAQGVAVYAALKRSADTTFDDRSRGQVMADTLVERVTGCPAEVAVPVAVNLVIT 241
Query 248 DETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESR 307
DE LLG PA + GYGP+PAAV R +V +AVTD+RS+ATLRRLY P++GALV+MESR
Sbjct 242 DEALLGGDPEPAVISGYGPVPAAVGRRLVDAAVTDKRSKATLRRLYRRPRSGALVAMESR 301
Query 308 ARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQ 367
AR FP+GLAAFI+LRDQ CRTPYC+APIRH DHA P GGPTSA NGLG CE+CNYAK+
Sbjct 302 ARCFPKGLAAFIDLRDQTCRTPYCNAPIRHHDHARPHRAGGPTSAANGLGECEQCNYAKE 361
Query 368 APGWRVSTSVDENHTHTAEFITPTGSRHRSGAP 400
APGW+V+ ++DE THTAEF TPTG+ +RS AP
Sbjct 362 APGWQVTAAIDETGTHTAEFTTPTGAVYRSTAP 394
>gi|254821166|ref|ZP_05226167.1| hypothetical protein MintA_14612 [Mycobacterium intracellulare
ATCC 13950]
Length=335
Score = 521 bits (1342), Expect = 8e-146, Method: Compositional matrix adjust.
Identities = 261/335 (78%), Positives = 293/335 (88%), Gaps = 0/335 (0%)
Query 90 VYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRALDAELCGDPGDLEGMGDARVVA 149
++EMPHTLAAL+CGALSEWRATLIVRESACLDV DRRALDAE+C DP L GMGDARV A
Sbjct 1 MHEMPHTLAALECGALSEWRATLIVRESACLDVEDRRALDAEMCADPSSLSGMGDARVAA 60
Query 150 AARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTRAAD 209
AA+AIAYRLDP A+V+RAA AE RTVTIRPAPDTM+Y+TALLPVAQGVSVYA L R AD
Sbjct 61 AAKAIAYRLDPHAIVERAAKAEEGRTVTIRPAPDTMSYVTALLPVAQGVSVYATLRREAD 120
Query 210 TRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAANTPAQLCGYGPIPA 269
T DGR+RGQVMADTLVERVTGR A VPTP+AVNL ++DETLLG + PA + GYGPIPA
Sbjct 121 TCGDGRTRGQVMADTLVERVTGRSATVPTPVAVNLALTDETLLGGDDAPADVAGYGPIPA 180
Query 270 AVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTP 329
+VAR+MVA A D+RSRATLRRLY HPQ+GALV+MESRARLFP+GLAAFI LRDQ CRTP
Sbjct 181 SVARSMVAEAAADRRSRATLRRLYTHPQSGALVAMESRARLFPQGLAAFIGLRDQHCRTP 240
Query 330 YCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFIT 389
YCDAPIRHRDHA PWADGGPT+A NGLG CE+CNY K+ GWRVSTSVDENHTHTA F T
Sbjct 241 YCDAPIRHRDHAQPWADGGPTTAGNGLGLCEQCNYVKENAGWRVSTSVDENHTHTALFTT 300
Query 390 PTGSRHRSGAPPHLPAVTVSELEVRIGIALARYAA 424
PTG+ +RS APP P +T+S+LEVR+G+ALAR+AA
Sbjct 301 PTGTTYRSTAPPRGPTITMSKLEVRVGVALARHAA 335
>gi|289553135|ref|ZP_06442345.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
gi|289437767|gb|EFD20260.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
605]
Length=296
Score = 515 bits (1327), Expect = 4e-144, Method: Compositional matrix adjust.
Identities = 253/254 (99%), Positives = 254/254 (100%), Gaps = 0/254 (0%)
Query 171 ENDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVT 230
+NDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVT
Sbjct 43 KNDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVT 102
Query 231 GRDAAVPTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLR 290
GRDAAVPTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLR
Sbjct 103 GRDAAVPTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLR 162
Query 291 RLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPT 350
RLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPT
Sbjct 163 RLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPT 222
Query 351 SAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSE 410
SAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSE
Sbjct 223 SAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSE 282
Query 411 LEVRIGIALARYAA 424
LEVRIGIALARYAA
Sbjct 283 LEVRIGIALARYAA 296
>gi|240168682|ref|ZP_04747341.1| hypothetical protein MkanA1_05175 [Mycobacterium kansasii ATCC
12478]
Length=333
Score = 509 bits (1312), Expect = 2e-142, Method: Compositional matrix adjust.
Identities = 270/333 (82%), Positives = 298/333 (90%), Gaps = 1/333 (0%)
Query 92 EMPHTLAALDCGALSEWRATLIVRESACLDVADRRALDAELCGDPGDLEGMGDARVVAAA 151
EMPHTLAAL+ GALSEWRATLIVRESACLDVADRR LDAELCGDP +L+G+GDARV AAA
Sbjct 2 EMPHTLAALERGALSEWRATLIVRESACLDVADRRTLDAELCGDPANLDGLGDARVAAAA 61
Query 152 RAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTRAADTR 211
+AIA RLDP AV DRAA A +R VTIRPAPDTM+Y+TALLPVAQGVSVYAAL R AD
Sbjct 62 KAIACRLDPHAVADRAATAAEERRVTIRPAPDTMSYVTALLPVAQGVSVYAALCREADAC 121
Query 212 CDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAV 271
DGR RGQVMADTLVERVTGR A VP PIAVNLV+SDETLLGA + PA +CGYGPIPAAV
Sbjct 122 RDGRPRGQVMADTLVERVTGRAATVPAPIAVNLVLSDETLLGADSAPADVCGYGPIPAAV 181
Query 272 ARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYC 331
AR MVA V D RSRATLRRLYAHP++GALV+MESR+RLFPRGLAAFIELRDQRCRTPYC
Sbjct 182 ARAMVADTVADPRSRATLRRLYAHPRSGALVAMESRSRLFPRGLAAFIELRDQRCRTPYC 241
Query 332 DAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPT 391
DAPIRHRDHA PWA+GG T+A+NGLG+CERCNYAKQAPGW+V+T+ DENHTHTAEF TPT
Sbjct 242 DAPIRHRDHARPWAEGGATTANNGLGSCERCNYAKQAPGWQVTTN-DENHTHTAEFTTPT 300
Query 392 GSRHRSGAPPHLPAVTVSELEVRIGIALARYAA 424
G R+RSGAPP +P +TVS++EVRIGIALAR+AA
Sbjct 301 GKRYRSGAPPRIPPITVSDVEVRIGIALARHAA 333
>gi|108797279|ref|YP_637476.1| hypothetical protein Mmcs_0299 [Mycobacterium sp. MCS]
gi|119866364|ref|YP_936316.1| hypothetical protein Mkms_0309 [Mycobacterium sp. KMS]
gi|108767698|gb|ABG06420.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119692453|gb|ABL89526.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=377
Score = 502 bits (1293), Expect = 4e-140, Method: Compositional matrix adjust.
Identities = 261/359 (73%), Positives = 300/359 (84%), Gaps = 0/359 (0%)
Query 42 VDAARRAAEGAAGVPAARRGRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALD 101
+D RRAAE AAGVPA +RGRGLASE+ALAR D P +G RHLGFA+ LV+EMPHTLAAL+
Sbjct 2 LDEKRRAAEAAAGVPANKRGRGLASEVALARHDCPNKGGRHLGFARVLVHEMPHTLAALE 61
Query 102 CGALSEWRATLIVRESACLDVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQ 161
CGALSEWRATLIVRESACL V DRR+LD ELC D LEG+GD R+ A A+ IAYRLDPQ
Sbjct 62 CGALSEWRATLIVRESACLSVEDRRSLDTELCRDVSSLEGLGDKRIEAEAKKIAYRLDPQ 121
Query 162 AVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVM 221
AVVDRAA A ++RTVT RPAPDTMTY+TALLPVAQGV+VYAAL R+ADT D RSRGQVM
Sbjct 122 AVVDRAARAASERTVTCRPAPDTMTYVTALLPVAQGVAVYAALKRSADTTFDDRSRGQVM 181
Query 222 ADTLVERVTGRDAAVPTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVT 281
ADTLVERVTG A V P+AVNLV++DE LLG PA + GYGP+PAAV R +V +AVT
Sbjct 182 ADTLVERVTGCPAEVAVPVAVNLVITDEALLGGDPEPAVISGYGPVPAAVGRRLVDAAVT 241
Query 282 DQRSRATLRRLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHA 341
D+RS+ATLRRLY P++GALV+MESRAR FP+GLAAFI+LRDQ CRTPYC+APIRH DHA
Sbjct 242 DKRSKATLRRLYRRPRSGALVAMESRARCFPKGLAAFIDLRDQTCRTPYCNAPIRHHDHA 301
Query 342 HPWADGGPTSAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAP 400
P GGPTSA NGLG CE+CNYAK+APGW+V+ ++DE THTAEF TPTG+ +RS AP
Sbjct 302 RPHRAGGPTSAANGLGECEQCNYAKEAPGWQVTAAIDETGTHTAEFTTPTGAVYRSTAP 360
>gi|108801662|ref|YP_641859.1| hypothetical protein Mmcs_4699 [Mycobacterium sp. MCS]
gi|119870813|ref|YP_940765.1| hypothetical protein Mkms_4785 [Mycobacterium sp. KMS]
gi|108772081|gb|ABG10803.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119696902|gb|ABL93975.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=434
Score = 500 bits (1287), Expect = 2e-139, Method: Compositional matrix adjust.
Identities = 267/427 (63%), Positives = 323/427 (76%), Gaps = 5/427 (1%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE++ +DPDA EA L ++ +LERLKS+AAA QARA A A RRA E A+G+P +R
Sbjct 9 MFESVFDVDPDAGEAELRAQVEQLERLKSSAAAAQARATALWAAKRRATEAASGMPKRKR 68
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRGLA+E+ALAR+D+P G+ HLG A+ALV+EMPHTLAAL+CGALSEWRATLIV++SACL
Sbjct 69 GRGLATEVALARKDAPVCGNTHLGMARALVHEMPHTLAALECGALSEWRATLIVKQSACL 128
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
V RR LDAELC D L+G G+ R+ A A+ I RLD AVV R+A A DR VTIRP
Sbjct 129 SVEHRRHLDAELCADVSKLDGWGNRRIEAEAKKITTRLDAAAVVARSAKAAGDRCVTIRP 188
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
AP TM Y+TALLPVAQGV+VYAAL R ADT D RSRGQVMADTLVERVTGR A P P+
Sbjct 189 APGTMAYVTALLPVAQGVAVYAALKREADTTFDDRSRGQVMADTLVERVTGRPAEKPVPV 248
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
++NL ++D TL+G + P GYGP+PA V R +V+ AV D+ ++ATLRRLY HP +G
Sbjct 249 SLNLALADTTLVGDDDEPGWCEGYGPVPAGVVRALVSDAVADEAAKATLRRLYRHPASGQ 308
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MES+AR FP+GLAAFI++RDQ CRTPYC+APIRH DHA P GGPTSA NGLG CE
Sbjct 309 LVAMESKARTFPKGLAAFIDIRDQTCRTPYCNAPIRHHDHAEPHRQGGPTSARNGLGECE 368
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTV---SELEVRIGI 417
CNYAK+APGW V+TS D+ H AEF TPTG+ +RS APP LP V S +E + I
Sbjct 369 GCNYAKEAPGWTVTTS-DDGGEHRAEFRTPTGATYRSTAPP-LPGPPVYARSIIEGGLSI 426
Query 418 ALARYAA 424
+ R+AA
Sbjct 427 DIVRFAA 433
>gi|126437648|ref|YP_001073339.1| hypothetical protein Mjls_5084 [Mycobacterium sp. JLS]
gi|126237448|gb|ABO00849.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=434
Score = 499 bits (1286), Expect = 3e-139, Method: Compositional matrix adjust.
Identities = 267/427 (63%), Positives = 321/427 (76%), Gaps = 5/427 (1%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE+ +DPDA EA L ++ +LERLKS+AAA QARA A A RRA E A+G+P +R
Sbjct 9 MFESAFDVDPDAGEAELRAQVEQLERLKSSAAAAQARATALWAAKRRATEAASGMPKRKR 68
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRGLA+E+ALAR+D+P G+ HLG A+ALV+EMPHTLAAL+CGALSEWRATLIV++SACL
Sbjct 69 GRGLATEVALARKDAPVCGNTHLGMARALVHEMPHTLAALECGALSEWRATLIVKQSACL 128
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
V RR LDAELC D L+G G+ R+ A A+ I RLD AVV R+A A DR VTIRP
Sbjct 129 SVEHRRHLDAELCADVSKLDGWGNRRIEAEAKKITTRLDAAAVVARSAKAAGDRCVTIRP 188
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
APDTM Y+TALLPVAQGV+VYAAL R ADT D RSRGQVMADTLVERVTGR A P P+
Sbjct 189 APDTMAYVTALLPVAQGVAVYAALKREADTTFDDRSRGQVMADTLVERVTGRPAEKPVPV 248
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
++NL ++D TL+G + P GYGP+PA V R +V AV D+ ++ATLRRLY HP +G
Sbjct 249 SLNLALADTTLVGDDDEPGWCEGYGPVPAGVVRALVGDAVADEAAKATLRRLYRHPASGQ 308
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MES+AR FP+GLAAFI++RDQ CRTPYC+APIRH DHA P GGPTSA NGLG CE
Sbjct 309 LVAMESKARTFPKGLAAFIDIRDQTCRTPYCNAPIRHHDHAEPHRQGGPTSARNGLGECE 368
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTV---SELEVRIGI 417
CNYAK+ PGW V+TS D+ H AEF TPTG+ +RS APP LP V S +E + I
Sbjct 369 GCNYAKETPGWTVTTS-DDGGEHRAEFRTPTGATYRSTAPP-LPGPPVYARSIIEGGLSI 426
Query 418 ALARYAA 424
+ R+AA
Sbjct 427 DIVRFAA 433
>gi|118469447|ref|YP_890248.1| hypothetical protein MSMEG_6025 [Mycobacterium smegmatis str.
MC2 155]
gi|118170734|gb|ABK71630.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=438
Score = 496 bits (1278), Expect = 2e-138, Method: Compositional matrix adjust.
Identities = 268/428 (63%), Positives = 318/428 (75%), Gaps = 5/428 (1%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE+L IDP A EA L +RI + ER+KSAAAA QA A+ + RRAAE AAG+PA+RR
Sbjct 11 MFESLFDIDPQASEAELRDRIEQFERMKSAAAAAQATASVLWEQKRRAAEAAAGIPASRR 70
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRGLA+E+ALARR+SP G RHLG A ALV E+PHTLAAL CGALSEWRATLI RESACL
Sbjct 71 GRGLATEVALARRESPNAGGRHLGLAHALVDELPHTLAALRCGALSEWRATLIARESACL 130
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
RR LDAEL DP +G GDARV A A+ IA RL+ AV+DR+A AE DR VTIRP
Sbjct 131 SPELRRELDAELSADPSRFDGWGDARVAAEAKKIACRLNIDAVLDRSAKAEKDRRVTIRP 190
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
APD MT LT LLP+ QGV+VYAAL +AA D R+RGQVMADTLVERVTGR A P P+
Sbjct 191 APDAMTKLTVLLPLKQGVAVYAALHQAAMVNADDRNRGQVMADTLVERVTGRPAHAPVPV 250
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
+NLVM+D TL G + P + GYGP+PA VAR MVA A D+ ++A +RRL+ HP++G
Sbjct 251 NLNLVMADTTLFGEDDQPGWVQGYGPVPAEVARRMVADATLDENTKAAVRRLFRHPKSGQ 310
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MESR+R+FP+GLA FI LRDQ CRTPYCDA IRHRDHA P DGGPT+AHNGLGTCE
Sbjct 311 LVAMESRSRIFPKGLATFIGLRDQTCRTPYCDALIRHRDHAVPHHDGGPTTAHNGLGTCE 370
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPA----VTVSELEVRIG 416
CNYAK+APGW V + + H AE++TPTG+ +RS APP LP T+S +E +
Sbjct 371 ACNYAKEAPGWSVIITETSDGEHIAEYVTPTGAVYRSTAPP-LPGRPVRHTLSLVETGLT 429
Query 417 IALARYAA 424
I L + A
Sbjct 430 IDLVTFDA 437
>gi|307083945|ref|ZP_07493058.1| hypothetical protein TMLG_03064 [Mycobacterium tuberculosis SUMu012]
gi|308366414|gb|EFP55265.1| hypothetical protein TMLG_03064 [Mycobacterium tuberculosis SUMu012]
Length=353
Score = 490 bits (1262), Expect = 1e-136, Method: Compositional matrix adjust.
Identities = 261/348 (75%), Positives = 287/348 (83%), Gaps = 4/348 (1%)
Query 75 SPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRALDAELCG 134
SPARG RHLGFAKALVYEMPHTLAAL+ G LSEWRATLIVRESACLDV DRRALDAELC
Sbjct 1 SPARGGRHLGFAKALVYEMPHTLAALEVGRLSEWRATLIVRESACLDVEDRRALDAELCA 60
Query 135 DPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTALLPV 194
D L+GMGDAR+ AAARAIAYRLD QAVV+RAA AE +RTVTIRPAPDTMT++TALLPV
Sbjct 61 DMSALDGMGDARIAAAARAIAYRLDAQAVVERAARAETERTVTIRPAPDTMTWVTALLPV 120
Query 195 AQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDETLLGA 254
A+GVSVYAAL RAADT D R+RGQVMADTLVERVTG+ A P+AVNLV+SDETLL
Sbjct 121 ARGVSVYAALKRAADTTFDDRTRGQVMADTLVERVTGQPAEAAQPVAVNLVLSDETLLAG 180
Query 255 ANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMESRARLFPRG 314
PA + GYGPIPAAVAR +V AV D RSRATLRRLY HP++GALV+MESRAR FP+G
Sbjct 181 DRAPAVVDGYGPIPAAVARNLVRDAVADTRSRATLRRLYRHPRSGALVAMESRARRFPKG 240
Query 315 LAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAPGWRVS 374
LAAFI LRDQRCR PYCDAPIRHRDHA P GGPT+A NGLG+CERCNY K+APGWRVS
Sbjct 241 LAAFIGLRDQRCRMPYCDAPIRHRDHAQPHHRGGPTTATNGLGSCERCNYVKEAPGWRVS 300
Query 375 TSVDENHTHTAEFITPTGSRHRSGAPPHLPA---VTVSELEVRIGIAL 419
T DE HTAEF TPTG + APP LP + VS++E RIG+AL
Sbjct 301 TDTDETGRHTAEFTTPTGMYYHCTAPP-LPGPLEIDVSQVEARIGVAL 347
>gi|120405676|ref|YP_955505.1| hypothetical protein Mvan_4725 [Mycobacterium vanbaalenii PYR-1]
gi|119958494|gb|ABM15499.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=426
Score = 477 bits (1227), Expect = 2e-132, Method: Compositional matrix adjust.
Identities = 251/401 (63%), Positives = 299/401 (75%), Gaps = 1/401 (0%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE++ +D A +A L + LERLKS AAA QARA A A R+ AE +AGV A R
Sbjct 1 MFESMFDVDETASQAELRAAVERLERLKSQAAAAQARATALWAAKRQLAEESAGVRAHHR 60
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
G+GLASE+ALARRD+P +G++HLGFAKALV EMPHTLAAL+ G LSEWRA LIVRESACL
Sbjct 61 GKGLASEVALARRDAPVKGNQHLGFAKALVNEMPHTLAALESGVLSEWRANLIVRESACL 120
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
V RR LDAELC D LEG G+ R+ A A+ IA RLD A+V+RA A R V+ RP
Sbjct 121 SVEHRRQLDAELCADASGLEGWGNRRIEAEAKKIAARLDAAALVERARKAPEQRCVSCRP 180
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
APD M Y+T LLP+ GV+VYAA RAADT DGR+R QVMADT+ ERVTGR AA P +
Sbjct 181 APDNMLYVTVLLPMTHGVAVYAACNRAADTTFDGRTRDQVMADTIYERVTGRSAAEPVSV 240
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
A+NLVM+D TL G P GYGP+PAAV R +V+ AVTD +++ATLRRLY HP++G
Sbjct 241 ALNLVMADTTLFGDDEAPGWAQGYGPVPAAVVRDLVSDAVTDAKAKATLRRLYRHPKSGQ 300
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MESR+ LFP+GLA FI LRDQ CRTPYCDAPIRH DHA P +GGPTSA NGLG C+
Sbjct 301 LVAMESRSWLFPKGLATFIGLRDQSCRTPYCDAPIRHHDHAVPDREGGPTSALNGLGECQ 360
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPP 401
CNYAK+APGW V+T+ D + HTA F TPTG+++ S APP
Sbjct 361 ACNYAKEAPGWHVTTT-DTDGEHTATFCTPTGAKYFSIAPP 400
>gi|120406245|ref|YP_956074.1| hypothetical protein Mvan_5297 [Mycobacterium vanbaalenii PYR-1]
gi|119959063|gb|ABM16068.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=430
Score = 462 bits (1188), Expect = 6e-128, Method: Compositional matrix adjust.
Identities = 260/431 (61%), Positives = 304/431 (71%), Gaps = 9/431 (2%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE+L +D A EA L + ERLKS AAA QARA A A RRAAE A GVPAARR
Sbjct 1 MFESLFDVDVGASEAELRAAVERCERLKSQAAAAQARATALWAAKRRAAEQARGVPAARR 60
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
G+GLASE+ALAR D+P G+RHLGFA ALV EMPHTLAAL+CGALSEWRATLIVRESACL
Sbjct 61 GKGLASEVALARHDAPVLGNRHLGFATALVEEMPHTLAALECGALSEWRATLIVRESACL 120
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
V RRALD ELCGD L G G+ RV A A+ IA RLD A+V+R+ A D VT RP
Sbjct 121 SVEHRRALDEELCGDVSRLAGWGNKRVEAEAKTIAARLDAAAIVERSEKAAADCAVTCRP 180
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
AP+ M Y+T LP+ +GV +YAA RAADT DGR RGQVM DT+ ERVTGR A P P+
Sbjct 181 APNNMVYVTLRLPLTRGVGIYAACKRAADTTFDGRPRGQVMTDTVYERVTGRPADTPVPV 240
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVAS---AVTDQRSRATLRRLYAHPQ 297
A+NLVM+D TL G + L GYGP+PA RT++ + A + +ATLRRLY HP
Sbjct 241 ALNLVMADTTLAGDDDELGWLDGYGPVPAGFCRTLIGNADDAEAEAEVKATLRRLYRHPT 300
Query 298 AGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLG 357
+G LV+MESRAR+FP+GL ++ RD+ CRTPYCDAPIRH DHA P GGPTSA NGLG
Sbjct 301 SGQLVAMESRARIFPKGLGMLLQRRDRTCRTPYCDAPIRHHDHATPDRQGGPTSALNGLG 360
Query 358 TCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTV----SELEV 413
CE CNYAK+APGW V+T D + HTA+F TPTG+ +RS APP LP V S LE
Sbjct 361 ECEACNYAKEAPGWTVTTG-DVDGAHTADFETPTGAVYRSTAPP-LPGPPVRRRLSLLEG 418
Query 414 RIGIALARYAA 424
R+ I L + A
Sbjct 419 RLSIDLVTFDA 429
>gi|315446192|ref|YP_004079071.1| hypothetical protein Mspyr1_46930 [Mycobacterium sp. Spyr1]
gi|315264495|gb|ADU01237.1| hypothetical protein Mspyr1_46930 [Mycobacterium sp. Spyr1]
Length=430
Score = 457 bits (1176), Expect = 2e-126, Method: Compositional matrix adjust.
Identities = 258/426 (61%), Positives = 302/426 (71%), Gaps = 6/426 (1%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE+L ID A +A L + ERLKSAAAA QARA A A R AAE AAGVP RR
Sbjct 1 MFESLFDIDEGASQAELRAVVERCERLKSAAAAAQARATALWAAKRAAAEEAAGVPVRRR 60
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
G+GLASE+ALARRD+P +G +HLGFAKALV+EMPHT AAL+CGALSEWRATLIVRESACL
Sbjct 61 GKGLASEVALARRDAPVKGGQHLGFAKALVHEMPHTWAALECGALSEWRATLIVRESACL 120
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
V DRR LD ELC D LEG G+ RV A A+ IA RLD AVV+RA A V+ RP
Sbjct 121 SVEDRRRLDEELCADVSTLEGWGNKRVEAEAKKIAARLDVAAVVERADKAAAQARVSCRP 180
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
AP+ M Y T LLP+AQG+ +YAAL ADT DGRSRGQVM DT ER+TGR A P+
Sbjct 181 APNGMVYFTVLLPLAQGIGMYAALKHHADTTFDGRSRGQVMTDTAFERITGRSAGTAVPV 240
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
A+NLVM+D TL G + P L GYGP+PA R + AV D+ +RATLRRLY HP++G
Sbjct 241 ALNLVMADTTLAGDDDCPGWLDGYGPVPAGFCRALTGDAVADKGARATLRRLYRHPRSGQ 300
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MESRAR+FP+GLA I+ RDQ CRTPYCDAPIRH DHA P GG TSA NGLG C
Sbjct 301 LVAMESRARIFPKGLATLIDRRDQTCRTPYCDAPIRHHDHATPDRAGGKTSAENGLGECA 360
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTV----SELEVRIG 416
CNYAK+APGW+VS ++ +HTA ++TPTG+ H S AP LP V S+ E R+
Sbjct 361 ACNYAKEAPGWQVSAGT-QDGSHTARWVTPTGAVHYSIAPT-LPGPPVRRRISDTEGRLS 418
Query 417 IALARY 422
I L +
Sbjct 419 IDLITF 424
>gi|315444664|ref|YP_004077543.1| hypothetical protein Mspyr1_30910 [Mycobacterium sp. Spyr1]
gi|315262967|gb|ADT99708.1| hypothetical protein Mspyr1_30910 [Mycobacterium sp. Spyr1]
Length=426
Score = 454 bits (1169), Expect = 1e-125, Method: Compositional matrix adjust.
Identities = 254/427 (60%), Positives = 300/427 (71%), Gaps = 5/427 (1%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE+L D +A E L + + E LKS AAA QAR A A R AAE AAG+ +R
Sbjct 1 MFESLFDYDTEASEKELRVLVEQYEALKSRAAAAQARVTALWAAKRAAAERAAGIGTRKR 60
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
G+GLASEIALAR D+P +G++HLGFA ALV+EMPHTLAAL+CG LSE+RATLIVRESACL
Sbjct 61 GKGLASEIALARHDAPVKGNQHLGFANALVHEMPHTLAALECGVLSEYRATLIVRESACL 120
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
RR LD ELC DP L G G+ RV A A+ I RLD AVV+R+A AE DR VT RP
Sbjct 121 STEHRRRLDEELCSDPSKLAGWGNNRVEAEAKRITARLDAAAVVERSAKAEQDRCVTTRP 180
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
AP+ M Y+T LLPVAQGV +YAAL RAADT D RSRGQVMADT R+TG+ A P +
Sbjct 181 APNCMVYVTVLLPVAQGVGMYAALKRAADTTFDERSRGQVMADTAYARITGKVATQPVSV 240
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
++NLVM+D TL G PA L GYGP+PA A + AV D+ +ATLRRLY HP +G
Sbjct 241 SLNLVMADTTLAGDDTEPAWLDGYGPVPAGFACKLTGDAVADEDVKATLRRLYRHPGSGQ 300
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MESR+R FP+GLAAFI +RD+ CRTPYC+APIRH DHA P DGG TSA NGLG CE
Sbjct 301 LVAMESRSRAFPKGLAAFIGIRDRTCRTPYCNAPIRHHDHATPDRDGGRTSAVNGLGLCE 360
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSE---LEVRIGI 417
CNYAK+APGW V+TSV H AEF+TPT + + S APP LP +S +E R I
Sbjct 361 ACNYAKEAPGWTVTTSVRAGE-HRAEFVTPTHATYYSIAPP-LPGTRISRRSIVEDRFSI 418
Query 418 ALARYAA 424
L + A
Sbjct 419 DLVTFEA 425
>gi|145224332|ref|YP_001135010.1| HNH nuclease [Mycobacterium gilvum PYR-GCK]
gi|145216818|gb|ABP46222.1| HNH nuclease [Mycobacterium gilvum PYR-GCK]
Length=436
Score = 454 bits (1167), Expect = 2e-125, Method: Compositional matrix adjust.
Identities = 254/427 (60%), Positives = 300/427 (71%), Gaps = 5/427 (1%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE+L D +A E L + + E LKS AAA QAR A A R AAE AAG+ +R
Sbjct 11 MFESLFDYDTEASEKELRVLVEQYEALKSRAAAAQARVTALWAAKRAAAERAAGIGTRKR 70
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
G+GLASEIALAR D+P +G++HLGFA ALV+EMPHTLAAL+CG LSE+RATLIVRESACL
Sbjct 71 GKGLASEIALARHDAPVKGNQHLGFANALVHEMPHTLAALECGVLSEYRATLIVRESACL 130
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
RR LD ELC DP L G G+ RV A A+ I RLD AVV+R+A AE DR VT RP
Sbjct 131 SAEHRRRLDEELCSDPSKLAGWGNNRVEAEAKRITARLDAAAVVERSAKAEQDRCVTTRP 190
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
AP+ M Y+T LLPVAQGV +YAAL RAADT D RSRGQVMADT R+TG+ A P +
Sbjct 191 APNCMVYVTVLLPVAQGVGMYAALKRAADTTFDERSRGQVMADTAYARITGKVATQPVSV 250
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
++NLVM+D TL G PA L GYGP+PA A + AV D+ +ATLRRLY HP +G
Sbjct 251 SLNLVMADTTLAGDDTEPAWLDGYGPVPAGFACKLTGDAVADEDVKATLRRLYRHPGSGQ 310
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MESR+R FP+GLAAFI +RD+ CRTPYC+APIRH DHA P DGG TSA NGLG CE
Sbjct 311 LVAMESRSRAFPKGLAAFIGIRDRTCRTPYCNAPIRHHDHATPDRDGGRTSAVNGLGLCE 370
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTVSE---LEVRIGI 417
CNYAK+APGW V+TSV H AEF+TPT + + S APP LP +S +E R I
Sbjct 371 ACNYAKEAPGWTVTTSVRAGE-HRAEFVTPTHATYYSIAPP-LPGTRISRRSIVEDRFSI 428
Query 418 ALARYAA 424
L + A
Sbjct 429 DLVTFEA 435
>gi|145223173|ref|YP_001133851.1| hypothetical protein Mflv_2586 [Mycobacterium gilvum PYR-GCK]
gi|145215659|gb|ABP45063.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=430
Score = 453 bits (1165), Expect = 3e-125, Method: Compositional matrix adjust.
Identities = 257/426 (61%), Positives = 300/426 (71%), Gaps = 6/426 (1%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE+L ID A +A L + ERLKSAAAA QARA A A R AAE AAGVP RR
Sbjct 1 MFESLFDIDEGASQAELRAVVERCERLKSAAAAAQARATALWAAKRAAAEEAAGVPVRRR 60
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
G+GLASE+ALARRD+P +G +HLGFAKAL+ EMPHT AAL+CGALSEWRATLIVRESACL
Sbjct 61 GKGLASEVALARRDAPVKGGQHLGFAKALMGEMPHTWAALECGALSEWRATLIVRESACL 120
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
V DRR LD ELC D EG G+ RV A A+ IA RLD AVV+R A V+ RP
Sbjct 121 SVEDRRRLDEELCADVSRFEGWGNKRVEAEAKKIAARLDVAAVVERVDRAAAQARVSCRP 180
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
AP+ M Y T LLP+AQG+ +YAAL ADT DGRSRGQVMADT ER+TGR A P+
Sbjct 181 APNGMVYFTVLLPLAQGIGMYAALKHHADTTFDGRSRGQVMADTAFERITGRSAGTAVPV 240
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
A+NLVM+D TL G + P L GYGP+PA R + AV D+ +RATLRRLY HP++G
Sbjct 241 ALNLVMADTTLAGDDDCPGWLDGYGPVPAGFCRALTGDAVADKGARATLRRLYRHPRSGQ 300
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MESRARLFP+GLA I+ RDQ CRTPYCDAPIRH DHA P GG TSA NGLG C
Sbjct 301 LVAMESRARLFPKGLATLIDRRDQTCRTPYCDAPIRHHDHATPDRAGGKTSAENGLGECA 360
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTV----SELEVRIG 416
CNYAK+APGW+VS ++ +HTA ++TPTG+ H S AP LP V S+ E R+
Sbjct 361 ACNYAKEAPGWQVSAGT-QDGSHTARWVTPTGAVHYSIAPA-LPGPPVRRRISDTEGRLS 418
Query 417 IALARY 422
I L +
Sbjct 419 IDLITF 424
>gi|120404582|ref|YP_954411.1| hypothetical protein Mvan_3614 [Mycobacterium vanbaalenii PYR-1]
gi|119957400|gb|ABM14405.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=466
Score = 453 bits (1165), Expect = 3e-125, Method: Compositional matrix adjust.
Identities = 259/440 (59%), Positives = 302/440 (69%), Gaps = 18/440 (4%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE+L + A EA L + ERLKS AAA QARA A A RRAAE A GVPAA+R
Sbjct 28 MFESLFDVGVGASEAELRAAVERCERLKSQAAAAQARATALWAAKRRAAEQAGGVPAAKR 87
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
G+GLASE+ALAR D+P G+RHLGFA+ALV EMPHTLAAL+CGALSEWRATLIVRESACL
Sbjct 88 GKGLASEVALARHDAPVLGNRHLGFAQALVEEMPHTLAALECGALSEWRATLIVRESACL 147
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
V RRALD ELCGD L G G+ R+ A A+ IA RLD AVVD A D VT RP
Sbjct 148 SVEHRRALDEELCGDVSRLAGWGNKRIEAEAKTIAARLDAAAVVDHTEKAAADCAVTCRP 207
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
AP+ M Y+T LP+ +GV +YAA RAADT DGR RGQVM DT+ ERVTGR A P P+
Sbjct 208 APNNMVYVTLRLPLTRGVGIYAACKRAADTTFDGRPRGQVMTDTVYERVTGRPADTPVPV 267
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAV------------TDQRSRAT 288
A+NLVM+D TL G + L GYGP+PA R ++ +AV D +AT
Sbjct 268 ALNLVMADTTLAGDDDELGWLDGYGPVPAGFCRALIGNAVADAHAHADADAEVDAEVKAT 327
Query 289 LRRLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGG 348
LRRLY HP +G LV+MESRAR+FP+GL ++ RD+ CRTPYCDAPIRH DHA P GG
Sbjct 328 LRRLYRHPTSGQLVAMESRARIFPKGLGMLLQRRDRTCRTPYCDAPIRHHDHATPDRAGG 387
Query 349 PTSAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPPHLPAVTV 408
PTSA NGLG CE CNYAK+APGW V+T D + HTA+F TPTG+ +RS APP LP V
Sbjct 388 PTSALNGLGECEACNYAKEAPGWTVTTG-DVDGAHTADFETPTGAVYRSTAPP-LPGPPV 445
Query 409 ----SELEVRIGIALARYAA 424
S LE R+ I L + A
Sbjct 446 RRRLSLLEGRLSIDLVTFDA 465
>gi|169628995|ref|YP_001702644.1| hypothetical protein MAB_1907 [Mycobacterium abscessus ATCC 19977]
gi|169240962|emb|CAM61990.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=398
Score = 440 bits (1132), Expect = 2e-121, Method: Compositional matrix adjust.
Identities = 242/395 (62%), Positives = 293/395 (75%), Gaps = 3/395 (0%)
Query 8 IDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARRGRGLASE 67
+DP A L++RIAELER+K+AAAA QA AA +D ARR E A GVP R+G G+A+E
Sbjct 4 LDPATRAAQLVDRIAELERVKAAAAAEQAHAAVLLDRARREEEAANGVPRRRQGAGVATE 63
Query 68 IALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRA 127
IALAR+DSPARGSRHLGFAKALV EMPHTLAAL+CGALSEWRAT++VRE+A L V DR+
Sbjct 64 IALARQDSPARGSRHLGFAKALVNEMPHTLAALECGALSEWRATILVRETAYLAVEDRQK 123
Query 128 LDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTY 187
+D E+C + L G+GD R+ A A+ +AYRLD QAVV RA AE++R V++RPAPDTMTY
Sbjct 124 IDVEMCAETSRLRGLGDTRLAAEAKRLAYRLDAQAVVRRARRAESERRVSLRPAPDTMTY 183
Query 188 LTALLPVAQGVSVYAALTRAADTRCDG-RSRGQVMADTLVERVTGRDAAVPTPIAVNLVM 246
LTALLPV QGV+VYAAL R AD D R +GQ+MADTLVERVTG AA P+ VN+ +
Sbjct 184 LTALLPVKQGVAVYAALKRTADASMDPVRGQGQIMADTLVERVTGVSAAAAVPVGVNITV 243
Query 247 SDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGALVSMES 306
SDE LLG + PA + G+G +PAAVAR +++ AV+ + ++ +RRLY GALV ES
Sbjct 244 SDEALLGGGDEPATITGHGLVPAAVARRLISEAVSAE-AKVLVRRLYRRCTTGALVKAES 302
Query 307 RARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAK 366
R+RLFPRGLA I+LRDQ CRTPYCDAPIRH DH GGPT+ NG G CERCNY K
Sbjct 303 RSRLFPRGLAELIDLRDQTCRTPYCDAPIRHHDHVVGSMRGGPTALDNGQGLCERCNYVK 362
Query 367 QAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPP 401
+ GW V+ V HT EF TPTG+ +RS APP
Sbjct 363 ETAGWNVA-PVPGADRHTVEFTTPTGTAYRSTAPP 396
>gi|145223484|ref|YP_001134162.1| HNH nuclease [Mycobacterium gilvum PYR-GCK]
gi|145215970|gb|ABP45374.1| HNH nuclease [Mycobacterium gilvum PYR-GCK]
Length=426
Score = 440 bits (1131), Expect = 2e-121, Method: Compositional matrix adjust.
Identities = 249/425 (59%), Positives = 297/425 (70%), Gaps = 8/425 (1%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
MFE +D A +A L + E ERLKS AAA QARA A R AAE AAGVPA+RR
Sbjct 1 MFE----VDEGASQAELRAVVEECERLKSRAAAAQARATVLWAAQRAAAERAAGVPASRR 56
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
G+GLASE+ALARR++P +G++HLGFAKAL+ EMPHTLAAL+ G LSEWRATLIVRESACL
Sbjct 57 GKGLASEVALARREAPVKGNQHLGFAKALMQEMPHTLAALEGGVLSEWRATLIVRESACL 116
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
V RR LDAELCGDP + G G+ RV A A+ IA RLD AVV+R A DR V+ RP
Sbjct 117 SVEHRRQLDAELCGDPSRVAGWGNKRVEAEAKRIAARLDVAAVVERNDRAVKDRCVSTRP 176
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRCDGRSRGQVMADTLVERVTGRDAAVPTPI 240
AP M Y+T L+P+AQG+ +YAAL R + DGRS GQVM DT ER+TGR AA P P+
Sbjct 177 APHNMVYVTLLMPLAQGIGMYAALRRHGEMTVDGRSLGQVMTDTAFERITGRSAAAPVPV 236
Query 241 AVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTDQRSRATLRRLYAHPQAGA 300
+NLVM+D TL G + P L GYGP+PA R +V AV D +A LRRLY HP++G
Sbjct 237 ELNLVMADTTLAGDDDCPGWLDGYGPVPAGFCRGLVGDAVADAEPKAALRRLYRHPRSGQ 296
Query 301 LVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCE 360
LV+MES AR+FP+GLA I+ RDQ CRTPYCDAPIRH DHA P GG TSA NGLG C
Sbjct 297 LVAMESSARVFPKGLATLIDRRDQTCRTPYCDAPIRHHDHAVPDRAGGQTSADNGLGACA 356
Query 361 RCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAP--PHLPAV-TVSELEVRIGI 417
CNYAK+APGW+V T+ E+ HTA ++TPTG+ H S AP P P +S E R+ I
Sbjct 357 GCNYAKEAPGWKV-TAGGEDGVHTARWVTPTGAVHYSIAPTLPGPPVYRRISSTESRLSI 415
Query 418 ALARY 422
L +
Sbjct 416 DLITF 420
>gi|119963190|ref|YP_946206.1| hypothetical protein AAur_0393 [Arthrobacter aurescens TC1]
gi|119950049|gb|ABM08960.1| conserved hypothetical protein [Arthrobacter aurescens TC1]
Length=481
Score = 399 bits (1026), Expect = 4e-109, Method: Compositional matrix adjust.
Identities = 222/407 (55%), Positives = 278/407 (69%), Gaps = 20/407 (4%)
Query 12 AEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARRGRGLASEIALA 71
A+ +ALI+ + +LE LKSA +A QAR A A D A+RA + AGVPAA RG G+A+++ALA
Sbjct 67 ADSSALIDELRDLEDLKSAISARQARVAVAFDLAQRAEQAQAGVPAAERGMGVAAQVALA 126
Query 72 RRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRALDAE 131
RR+SP +GSR GFAKALV EMP T+AAL+ G L+EWRATL+V+E+ACL V DR A+D E
Sbjct 127 RRESPNKGSRLFGFAKALVTEMPRTMAALESGQLNEWRATLLVKETACLSVEDRAAVDEE 186
Query 132 LCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTAL 191
L D G +G GD ++AAA+A AYR DP++V RA+ A +RTV++RPAPDTMTYLTAL
Sbjct 187 LAPDAGTFDGTGDKAIIAAAKAAAYRRDPRSVAQRASRAATERTVSLRPAPDTMTYLTAL 246
Query 192 LPVAQGVSVYAALTRAAD---TRCDGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSD 248
LPVAQGV+VYAALTR AD + D R+RGQVMADTL ERVTG + V I +NLVM+D
Sbjct 247 LPVAQGVAVYAALTRTADSVRSSGDARTRGQVMADTLTERVTGTSSGV-AGINLNLVMTD 305
Query 249 ETLLGAANTPAQLCGYGPIPAAVARTMVA---SAVTDQ---------RS--RATLRRLYA 294
TL PA+L GYG +PA AR ++ S DQ RS R LRRLY
Sbjct 306 RTLFQGDPEPARLEGYGIVPAEWARALLVEEQSGYEDQLHWNLESADRSELRVLLRRLYT 365
Query 295 HPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHN 354
P++G L++M+S+AR FP+ L FI +RD CRTPYCDAPIRH DH PW G T +N
Sbjct 366 APRSGELLTMDSKARFFPQKLRRFIHIRDNTCRTPYCDAPIRHIDHVIPWHSEGSTHLNN 425
Query 355 GLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPP 401
G G C CN+ K+ PGW + + HT TPTG ++S APP
Sbjct 426 GAGLCGACNHTKENPGWTAKSMPGD--VHTIRVSTPTGHSYKSKAPP 470
>gi|226361477|ref|YP_002779255.1| hypothetical protein ROP_20630 [Rhodococcus opacus B4]
gi|226239962|dbj|BAH50310.1| hypothetical protein [Rhodococcus opacus B4]
Length=447
Score = 390 bits (1002), Expect = 2e-106, Method: Compositional matrix adjust.
Identities = 221/412 (54%), Positives = 272/412 (67%), Gaps = 11/412 (2%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
L +D D ++A I+ + LE +K+A A QAR A +DA+ RA+ VP ++R
Sbjct 9 FLAELPKLDRDIDDATRIDVLRALEEVKAACAGVQARVTADLDASIRASRADRDVPVSQR 68
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRG+A+++ALARRDSP RG RHLG A AL +EMPHTLA L+ G LSEWRAT++VRE+ACL
Sbjct 69 GRGIANQVALARRDSPFRGGRHLGMATALAHEMPHTLALLERGLLSEWRATILVRETACL 128
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
DR ALD LCGDP L+G+GD V A RA A +D +AVV RA A +DR VT RP
Sbjct 129 TREDRTALDYLLCGDPATLDGLGDQAVCAKVRAAAAEVDAEAVVRRARKAVSDRRVTSRP 188
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRC---DGRSRGQVMADTLVERVTG-RDAAV 236
APDTM Y++ALLPVA+GV+V+A L R AD+ D R+R Q+MAD LV RVTG A
Sbjct 189 APDTMAYVSALLPVAEGVAVHATLARDADSILAAGDDRTRSQIMADLLVSRVTGTHHTAT 248
Query 237 PTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAVTD-------QRSRATL 289
PI VNLV+SD LL + PA + GYGP+PA +AR +A AV D +R L
Sbjct 249 TPPITVNLVISDRALLDGGSEPAHVQGYGPVPADLARHWIAEAVQDAIDPNTGNATRVNL 308
Query 290 RRLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGP 349
RRLYA+P +GAL + ES+AR FP GLA I+LRD+ CRTP+CDAPIRH DH GGP
Sbjct 309 RRLYANPASGALTATESQARCFPAGLARLIDLRDRTCRTPWCDAPIRHHDHIQSREFGGP 368
Query 350 TSAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRSGAPP 401
T+AHNG G C CNYAKQ GW + H E TPTG R+RS APP
Sbjct 369 TTAHNGAGLCAACNYAKQGAGWNATPRQRSGGLHHIEIYTPTGHRYRSTAPP 420
>gi|343924565|ref|ZP_08764113.1| hypothetical protein GOALK_017_00230 [Gordonia alkanivorans NBRC
16433]
gi|343765500|dbj|GAA11039.1| hypothetical protein GOALK_017_00230 [Gordonia alkanivorans NBRC
16433]
Length=419
Score = 384 bits (987), Expect = 1e-104, Method: Compositional matrix adjust.
Identities = 208/392 (54%), Positives = 268/392 (69%), Gaps = 4/392 (1%)
Query 13 EEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARRGRGLASEIALAR 72
+EA + R+ LE +KSA A Q R A +D R E A VP RRGRGLA+E+ LAR
Sbjct 6 DEAESVRRLTLLEEIKSACTAVQVRETANLDRLRADDEAARNVPQKRRGRGLAAEVGLAR 65
Query 73 RDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRALDAEL 132
+ SP +GS++LGFA+AL +EMPHT+AAL G L+EWRAT++VRE+A L R +D +
Sbjct 66 KVSPKKGSQYLGFARALAHEMPHTMAALTDGVLTEWRATILVRETAFLTRESREEIDRRV 125
Query 133 CGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTALL 192
CGD +L G+ D + + A+ AY L+P AVV R A AE DR V++RPAPD MT L+ALL
Sbjct 126 CGDRAELVGVSDREIESTAKRHAYELEPAAVVARKAKAEKDRRVSVRPAPDLMTQLSALL 185
Query 193 PVAQGVSVYAALTRAADTRC--DGRSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDET 250
P+ QG+SVYA+L + AD+ D R+ Q+MADTLVERVTGR AA P P++V+++MSDE
Sbjct 186 PMTQGISVYASLKKHADSILGADDRTHAQIMADTLVERVTGRSAAEPVPVSVDMIMSDEC 245
Query 251 LLGAANTPAQLCGYGPIPAAVARTMVASAVT-DQRSRATLRRLYAHPQAGALVSMESRAR 309
L G ++ A++ GYGPIPAAVAR +VA++V+ D + +T+RR+YA P GALV+MES++R
Sbjct 246 LFGLNDSAAEVKGYGPIPAAVARELVAASVSPDGETASTMRRIYARPSDGALVAMESKSR 305
Query 310 LFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAP 369
FP LA F+ LRDQRCRTPYC API DHA P GPTS N G C N AK+A
Sbjct 306 TFPAALAHFVRLRDQRCRTPYCGAPIAEIDHAKPHRHDGPTSEANADGICVTHNRAKEAD 365
Query 370 GWRVSTSVDENHTHTAEFITPTGSRHRSGAPP 401
GW S + + ITPTG RHRS APP
Sbjct 366 GWGYSVRM-AGDIRVIDVITPTGGRHRSTAPP 396
>gi|111019339|ref|YP_702311.1| hypothetical protein RHA1_ro02347 [Rhodococcus jostii RHA1]
gi|110818869|gb|ABG94153.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=573
Score = 384 bits (986), Expect = 2e-104, Method: Compositional matrix adjust.
Identities = 219/408 (54%), Positives = 266/408 (66%), Gaps = 11/408 (2%)
Query 1 MFETLTAIDPDAEEAALIERIAELERLKSAAAAGQARAAAAVDAARRAAEGAAGVPAARR 60
L +D D ++A I+ + LE +K+A A QAR A +DA+ R A VP A R
Sbjct 135 FLAELPKLDLDIDDATRIDVLRTLEEVKAACAGVQARVTADLDASIRTARADRQVPVAHR 194
Query 61 GRGLASEIALARRDSPARGSRHLGFAKALVYEMPHTLAALDCGALSEWRATLIVRESACL 120
GRG+A+++ALARRDSP RG RHLG A ALV+EMP TLA L+ G LSEWRAT++VRE+ACL
Sbjct 195 GRGIANQVALARRDSPFRGGRHLGMATALVHEMPRTLALLERGVLSEWRATILVRETACL 254
Query 121 DVADRRALDAELCGDPGDLEGMGDARVVAAARAIAYRLDPQAVVDRAANAENDRTVTIRP 180
DR ALD LC DP L+G+GD V A RA A +D +A+V RA A +DR VT RP
Sbjct 255 TREDRTALDYLLCADPATLDGLGDQAVCAKVRAAAAEVDAEAMVRRARKAVSDRRVTSRP 314
Query 181 APDTMTYLTALLPVAQGVSVYAALTRAADTRC---DGRSRGQVMADTLVERVTGR-DAAV 236
APDTM Y++ALLPVAQGV+V+A LTR AD+ D R+R Q+MAD LV RVTG A
Sbjct 315 APDTMAYVSALLPVAQGVAVHATLTRDADSILAAGDERTRSQIMADLLVSRVTGAPHTAT 374
Query 237 PTPIAVNLVMSDETLLGAANTPAQLCGYGPIPAAVARTMVASAV-------TDQRSRATL 289
PI VNLV+SD LL + PA + GYGP+PAA+A + AV T +R TL
Sbjct 375 APPITVNLVISDRALLDRGSEPAYVQGYGPVPAALAGHWIHEAVQATIDPETGNEARVTL 434
Query 290 RRLYAHPQAGALVSMESRARLFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGP 349
RRLYA+P +GAL + ES+AR FP GLA I+LRD+ CRTP+CDAPIRH DH P GP
Sbjct 435 RRLYANPHSGALTATESQARRFPAGLARMIDLRDRTCRTPWCDAPIRHHDHIQPREYEGP 494
Query 350 TSAHNGLGTCERCNYAKQAPGWRVSTSVDENHTHTAEFITPTGSRHRS 397
T+AHNG G C CNYAKQ GW H E TPTG R+RS
Sbjct 495 TTAHNGAGLCAACNYAKQGAGWNARPHQLPGGLHKIEICTPTGHRYRS 542
>gi|119962228|ref|YP_946315.1| HNH endonuclease domain-containing protein [Arthrobacter aurescens
TC1]
gi|119949087|gb|ABM07998.1| putative HNH endonuclease domain protein [Arthrobacter aurescens
TC1]
Length=398
Score = 380 bits (975), Expect = 3e-103, Method: Compositional matrix adjust.
Identities = 216/401 (54%), Positives = 268/401 (67%), Gaps = 23/401 (5%)
Query 27 LKSAAAAGQARAAAAVDAARRAAEGAAGVPAARRGRGLASEIALARRDSPARGSRHLGFA 86
+KSA A QAR A A D A+RA + AGVPA+ RGRG+ +++ALARR+SP RG R LG A
Sbjct 1 MKSAITALQARVAVAFDLAQRAEQAEAGVPASERGRGVGAQVALARRESPNRGGRLLGLA 60
Query 87 KALVYEMPHTLAALDCGALSEWRATLIVRESACLDVADRRALDAELCGDPGDLEGMGDAR 146
KALV EMP TLAAL G L+EWRATL+V+E+ACL DR A+D EL D G +G GD
Sbjct 61 KALVTEMPRTLAALQSGYLNEWRATLLVKETACLSAEDRCAVDEELAPDAGTFDGKGDKA 120
Query 147 VVAAARAIAYRLDPQAVVDRAANAENDRTVTIRPAPDTMTYLTALLPVAQGVSVYAALTR 206
++AAA+A AYR DP++VV RA+ A +RTV++RPAPDTM+YLTALLPVAQGV+VY ALT+
Sbjct 121 IIAAAKAAAYRRDPRSVVGRASRAAAERTVSLRPAPDTMSYLTALLPVAQGVAVYKALTQ 180
Query 207 AADT-RCDG--------------RSRGQVMADTLVERVTGRDAAVPTPIAVNLVMSDETL 251
AAD+ R G R RGQ+MADTLVER+TG + + I ++LVM+D TL
Sbjct 181 AADSARSSGKAKSFEDVGSGRVVRPRGQIMADTLVERITGTPGGI-SGITIDLVMTDRTL 239
Query 252 LGAANTPAQLCGYGPIPAAVARTMVA--SAVTDQRSRATLRRLYAHPQAGALVSMESRAR 309
+ PA+L GYG +PA ART+V + D+ LRRLY P G L++ +S+AR
Sbjct 240 FQGDSEPARLQGYGVVPAEWARTVVGEEQSARDREFSVWLRRLYTAPATGDLLATDSKAR 299
Query 310 LFPRGLAAFIELRDQRCRTPYCDAPIRHRDHAHPWADGGPTSAHNGLGTCERCNYAKQAP 369
LF L FIE RD CRTPYCDAPIRH DH PW GG T+ NG G CE CN+ K+ P
Sbjct 300 LFSGRLRRFIETRDDSCRTPYCDAPIRHIDHVIPWHSGGQTNLGNGAGLCEACNHTKENP 359
Query 370 GWRVSTSVDENHTHTAEFITPTGSRHRSGAPP---HLPAVT 407
GW ST + HT E TPTG ++S APP H P+ T
Sbjct 360 GW--STRAVDADVHTLEISTPTGHTYQSKAPPLPGHRPSRT 398
Lambda K H
0.319 0.131 0.386
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 862149077424
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40