BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1576c
Length=473
Sequences producing significant alignments: Score (Bits) E Value
gi|15608714|ref|NP_216092.1| phiRV1 phage protein [Mycobacterium... 955 0.0
gi|289443029|ref|ZP_06432773.1| phi phage protein [Mycobacterium... 952 0.0
gi|289447185|ref|ZP_06436929.1| phiRv1 phage protein [Mycobacter... 949 0.0
gi|31792762|ref|NP_855255.1| phiRV1 phage protein [Mycobacterium... 947 0.0
gi|289753663|ref|ZP_06513041.1| phiRV1 phage protein [Mycobacter... 944 0.0
gi|289444191|ref|ZP_06433935.1| phi phage protein [Mycobacterium... 814 0.0
gi|15842190|ref|NP_337227.1| hypothetical protein MT2727 [Mycoba... 813 0.0
gi|15609787|ref|NP_217166.1| phiRv2 prophage protein [Mycobacter... 811 0.0
gi|306805298|ref|ZP_07441966.1| phage capsid family protein [Myc... 755 0.0
gi|308372556|ref|ZP_07429056.2| phage capsid family protein [Myc... 746 0.0
gi|167966951|ref|ZP_02549228.1| putative phiRv1 phage protein [M... 716 0.0
gi|307084155|ref|ZP_07493268.1| phage capsid family protein [Myc... 714 0.0
gi|289758778|ref|ZP_06518156.1| phiRv2 prophage protein [Mycobac... 660 0.0
gi|308405926|ref|ZP_07494458.2| phage capsid family protein [Myc... 609 3e-172
gi|289448305|ref|ZP_06438049.1| LOW QUALITY PROTEIN: phiRv2 phag... 547 2e-153
gi|289569612|ref|ZP_06449839.1| hypothetical protein TBJG_04301 ... 490 3e-136
gi|240172573|ref|ZP_04751232.1| phiRv2 prophage protein [Mycobac... 469 6e-130
gi|289570824|ref|ZP_06451051.1| conserved hypothetical protein [... 380 4e-103
gi|289751273|ref|ZP_06510651.1| phiRv2 phage protein [Mycobacter... 344 2e-92
gi|226307463|ref|YP_002767423.1| hypothetical protein RER_39760 ... 341 1e-91
gi|307084156|ref|ZP_07493269.1| hypothetical protein TMLG_00562 ... 320 3e-85
gi|289751274|ref|ZP_06510652.1| phiRv1 phage protein [Mycobacter... 301 1e-79
gi|15843075|ref|NP_338112.1| hypothetical protein MT3573.12 [Myc... 289 6e-76
gi|15843074|ref|NP_338111.1| hypothetical protein MT3573.11 [Myc... 287 2e-75
gi|307085346|ref|ZP_07494459.1| hypothetical protein TMLG_04087 ... 266 4e-69
gi|290959236|ref|YP_003490418.1| phage capsid protein [Streptomy... 256 7e-66
gi|120405315|ref|YP_955144.1| phage major capsid protein, HK97 [... 241 3e-61
gi|206599551|ref|YP_002241990.1| gp7 [Mycobacterium phage Brujit... 238 1e-60
gi|29566114|ref|NP_817683.1| gp6 [Mycobacterium phage Che9c] >gi... 233 5e-59
gi|306804415|ref|ZP_07441083.1| hypothetical protein TMHG_01848 ... 217 5e-54
gi|317125799|ref|YP_004099911.1| hypothetical protein Intca_2682... 172 1e-40
gi|306805297|ref|ZP_07441965.1| hypothetical protein TMHG_04002 ... 129 8e-28
gi|15843072|ref|NP_338109.1| hypothetical protein MT3573.9 [Myco... 123 6e-26
gi|304390287|ref|ZP_07372240.1| HK97 family phage major capsid p... 99.4 1e-18
gi|306817330|ref|ZP_07451075.1| HK97 family phage major capsid p... 96.7 8e-18
gi|227875043|ref|ZP_03993188.1| HK97 family phage major capsid p... 96.7 8e-18
gi|298346363|ref|YP_003719050.1| HK97 family phage major capsid ... 95.1 2e-17
gi|146277402|ref|YP_001167561.1| HK97 family phage major capsid ... 94.0 5e-17
gi|281418278|ref|ZP_06249298.1| phage major capsid protein, HK97... 93.6 6e-17
gi|125974135|ref|YP_001038045.1| HK97 family phage major capsid ... 93.6 7e-17
gi|315654943|ref|ZP_07907848.1| HK97 family major capsid protein... 92.8 1e-16
gi|315656914|ref|ZP_07909801.1| HK97 family prophage LambdaSa04 ... 92.8 1e-16
gi|110634245|ref|YP_674453.1| HK97 family phage major capsid pro... 92.4 1e-16
gi|256003557|ref|ZP_05428547.1| phage major capsid protein, HK97... 92.0 2e-16
gi|304316282|ref|YP_003851427.1| phage major capsid protein, HK9... 92.0 2e-16
gi|331085733|ref|ZP_08334816.1| HK97 family phage major capsid p... 91.3 3e-16
gi|167039899|ref|YP_001662884.1| HK97 family phage major capsid ... 91.3 3e-16
gi|336435240|ref|ZP_08614957.1| HK97 family phage major capsid p... 91.3 3e-16
gi|153955258|ref|YP_001396023.1| Phage major capsid protein [Clo... 90.9 4e-16
gi|302386148|ref|YP_003821970.1| phage major capsid protein, HK9... 90.9 4e-16
>gi|15608714|ref|NP_216092.1| phiRV1 phage protein [Mycobacterium tuberculosis H37Rv]
gi|148661371|ref|YP_001282894.1| putative phiRv1 phage protein [Mycobacterium tuberculosis H37Ra]
gi|254366078|ref|ZP_04982123.1| possible phiRV1 phage protein [Mycobacterium tuberculosis str.
Haarlem]
22 more sequence titles
Length=473
Score = 955 bits (2469), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/473 (100%), Positives = 473/473 (100%), Gaps = 0/473 (0%)
Query 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE
Sbjct 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
Query 61 EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL
Sbjct 61 EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
Query 121 CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ 180
CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ
Sbjct 121 CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ 180
Query 181 RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY 240
RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY
Sbjct 181 RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY 240
Query 241 SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAF 300
SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAF
Sbjct 241 SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAF 300
Query 301 VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST 360
VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST
Sbjct 301 VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST 360
Query 361 INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL 420
INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL
Sbjct 361 INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL 420
Query 421 IGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
IGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA
Sbjct 421 IGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
>gi|289443029|ref|ZP_06432773.1| phi phage protein [Mycobacterium tuberculosis T46]
gi|289415948|gb|EFD13188.1| phi phage protein [Mycobacterium tuberculosis T46]
Length=473
Score = 952 bits (2461), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/473 (99%), Positives = 472/473 (99%), Gaps = 0/473 (0%)
Query 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE
Sbjct 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
Query 61 EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL
Sbjct 61 EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
Query 121 CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ 180
CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ
Sbjct 121 CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ 180
Query 181 RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY 240
RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY
Sbjct 181 RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY 240
Query 241 SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAF 300
SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAF
Sbjct 241 SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAF 300
Query 301 VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST 360
VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST
Sbjct 301 VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST 360
Query 361 INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL 420
INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL
Sbjct 361 INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL 420
Query 421 IGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
I DRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA
Sbjct 421 IVDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
>gi|289447185|ref|ZP_06436929.1| phiRv1 phage protein [Mycobacterium tuberculosis CPHL_A]
gi|289420143|gb|EFD17344.1| phiRv1 phage protein [Mycobacterium tuberculosis CPHL_A]
Length=473
Score = 949 bits (2454), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/473 (99%), Positives = 471/473 (99%), Gaps = 0/473 (0%)
Query 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE
Sbjct 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
Query 61 EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL
Sbjct 61 EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
Query 121 CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ 180
CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ
Sbjct 121 CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ 180
Query 181 RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY 240
RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY
Sbjct 181 RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY 240
Query 241 SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAF 300
SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAA F
Sbjct 241 SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAVF 300
Query 301 VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST 360
VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST
Sbjct 301 VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST 360
Query 361 INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL 420
INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL
Sbjct 361 INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL 420
Query 421 IGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
I DRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA
Sbjct 421 IVDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
>gi|31792762|ref|NP_855255.1| phiRV1 phage protein [Mycobacterium bovis AF2122/97]
gi|31618352|emb|CAD96270.1| Probable phiRV1 phage protein [Mycobacterium bovis AF2122/97]
Length=473
Score = 947 bits (2449), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 470/473 (99%), Positives = 470/473 (99%), Gaps = 0/473 (0%)
Query 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE
Sbjct 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
Query 61 EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
E LRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL
Sbjct 61 EELRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
Query 121 CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ 180
CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ
Sbjct 121 CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQ 180
Query 181 RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY 240
RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY
Sbjct 181 RAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWY 240
Query 241 SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAF 300
SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQ AAF
Sbjct 241 SEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQTAAF 300
Query 301 VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST 360
VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST
Sbjct 301 VNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLST 360
Query 361 INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL 420
INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL
Sbjct 361 INTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL 420
Query 421 IGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
I DRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA
Sbjct 421 IVDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
>gi|289753663|ref|ZP_06513041.1| phiRV1 phage protein [Mycobacterium tuberculosis EAS054]
gi|289694250|gb|EFD61679.1| phiRV1 phage protein [Mycobacterium tuberculosis EAS054]
Length=479
Score = 944 bits (2441), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/470 (99%), Positives = 469/470 (99%), Gaps = 0/470 (0%)
Query 4 FDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAEEAL 63
FDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAEEAL
Sbjct 10 FDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAEEAL 69
Query 64 RRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETLCRT 123
RRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETLCRT
Sbjct 70 RRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETLCRT 129
Query 124 GPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAM 183
GPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAM
Sbjct 130 GPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAM 189
Query 184 GLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEA 243
GLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEA
Sbjct 190 GLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEA 249
Query 244 QEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNG 303
QEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNG
Sbjct 250 QEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNG 309
Query 304 SGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINT 363
SGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINT
Sbjct 310 SGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINT 369
Query 364 LRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIGD 423
LRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLI D
Sbjct 370 LRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIVD 429
Query 424 RVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
RVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA
Sbjct 430 RVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 479
>gi|289444191|ref|ZP_06433935.1| phi phage protein [Mycobacterium tuberculosis T46]
gi|289746451|ref|ZP_06505829.1| phiRv2 prophage protein [Mycobacterium tuberculosis 02_1987]
gi|294994255|ref|ZP_06799946.1| phiRv2 phage protein [Mycobacterium tuberculosis 210]
7 more sequence titles
Length=479
Score = 814 bits (2102), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/468 (90%), Positives = 442/468 (95%), Gaps = 0/468 (0%)
Query 6 DIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAEEALRR 65
DIK LSL ETR AAKQLLDSV GDLTG+ AQRFQALTRHAEELRAEQRRRGREAEEALRR
Sbjct 12 DIKQLSLDETRSAAKQLLDSVEGDLTGDVAQRFQALTRHAEELRAEQRRRGREAEEALRR 71
Query 66 YRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGP 125
RAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLD CVRDGLMSSRAAE AETLCRTGP
Sbjct 72 CRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDVCVRDGLMSSRAAEAAETLCRTGP 131
Query 126 PQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGL 185
PQSTSWAQRWLAATG+RDYLGAFVKRVSNPVAGHT WTDREAAAWREAAAVAAEQRAMGL
Sbjct 132 PQSTSWAQRWLAATGNRDYLGAFVKRVSNPVAGHTTWTDREAAAWREAAAVAAEQRAMGL 191
Query 186 VDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQE 245
VDT GGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSE+WRGVTSEGAEA WYSEAQE
Sbjct 192 VDTAGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEVWRGVTSEGAEAHWYSEAQE 251
Query 246 VSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSG 305
VSDDSP LAQPAVP+YRGSCWIPFS+E+EGDAA FV E+G++LADSVEQLQAAAFV+GSG
Sbjct 252 VSDDSPTLAQPAVPSYRGSCWIPFSLEIEGDAAGFVAEVGRVLADSVEQLQAAAFVSGSG 311
Query 306 NGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTLR 365
NGEPTGFVSALTGT+D V GAG+EA+VAADVYALQSALPPRFQ+++AFAANLSTIN LR
Sbjct 312 NGEPTGFVSALTGTADYTVTGAGTEAVVAADVYALQSALPPRFQSNSAFAANLSTINVLR 371
Query 366 QAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRV 425
QAET+NGALKFPSLH SPPMLAGK + EVS+MDTVD+AVTATN+PLVLGDWKQF+I DRV
Sbjct 372 QAETANGALKFPSLHASPPMLAGKHIWEVSNMDTVDAAVTATNYPLVLGDWKQFIITDRV 431
Query 426 GSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
GS VELVPH+FG NRRPTGQRGFF WFRVGSDVLV NAFRVLKV+TTA
Sbjct 432 GSTVELVPHVFGGNRRPTGQRGFFCWFRVGSDVLVDNAFRVLKVQTTA 479
>gi|15842190|ref|NP_337227.1| hypothetical protein MT2727 [Mycobacterium tuberculosis CDC1551]
gi|148823841|ref|YP_001288595.1| phiRv2 prophage protein [Mycobacterium tuberculosis F11]
gi|253798268|ref|YP_003031269.1| phiRv2 phage protein [Mycobacterium tuberculosis KZN 1435]
36 more sequence titles
Length=479
Score = 813 bits (2100), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/468 (90%), Positives = 442/468 (95%), Gaps = 0/468 (0%)
Query 6 DIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAEEALRR 65
DIK LSL ETR AAKQLLDSV GDLTG+ AQRFQALTRHAEELRAEQRRRGREAEEALRR
Sbjct 12 DIKQLSLDETRSAAKQLLDSVEGDLTGDVAQRFQALTRHAEELRAEQRRRGREAEEALRR 71
Query 66 YRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGP 125
RAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLD CVRDGLMSSRAAE AETLCRTGP
Sbjct 72 CRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDVCVRDGLMSSRAAEAAETLCRTGP 131
Query 126 PQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGL 185
PQSTSWAQRWLAATG+RDYLGAFVKRVSNPVAGHT WTDREAAAWREAAAVAAEQRAMGL
Sbjct 132 PQSTSWAQRWLAATGNRDYLGAFVKRVSNPVAGHTTWTDREAAAWREAAAVAAEQRAMGL 191
Query 186 VDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQE 245
VDT GGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSE+WRGVTSEGAEA WYSEAQE
Sbjct 192 VDTAGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEVWRGVTSEGAEAHWYSEAQE 251
Query 246 VSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSG 305
VSDDSP LAQPAVP+YRGSCWIPFS+E+EGDAA FV E+G++LADSVEQLQAAAFV+GSG
Sbjct 252 VSDDSPTLAQPAVPSYRGSCWIPFSLEIEGDAAGFVAEVGRVLADSVEQLQAAAFVSGSG 311
Query 306 NGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTLR 365
NGEPTGFVSALTGT+D V GAG+EA+VAADVYALQSALPPRFQ+++AFAANLSTIN LR
Sbjct 312 NGEPTGFVSALTGTADYTVTGAGTEAVVAADVYALQSALPPRFQSNSAFAANLSTINVLR 371
Query 366 QAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRV 425
QAET+NGALKFPSLH SPPMLAGK + EVS+MDTVD+AVTATN+PLVLGDWKQF+I DRV
Sbjct 372 QAETANGALKFPSLHASPPMLAGKHIWEVSNMDTVDAAVTATNYPLVLGDWKQFIITDRV 431
Query 426 GSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
GS VELVPH+FG NRRPTGQRGFF WFRVGSDVLV NAFRVLKV+TTA
Sbjct 432 GSTVELVPHVFGGNRRPTGQRGFFCWFRVGSDVLVDNAFRVLKVQTTA 479
>gi|15609787|ref|NP_217166.1| phiRv2 prophage protein [Mycobacterium tuberculosis H37Rv]
gi|148662492|ref|YP_001284015.1| putative phiRv2 prophage protein [Mycobacterium tuberculosis
H37Ra]
gi|167967166|ref|ZP_02549443.1| putative phiRv2 prophage protein [Mycobacterium tuberculosis
H37Ra]
gi|1550691|emb|CAB02329.1| POSSIBLE phiRv2 PROPHAGE PROTEIN [Mycobacterium tuberculosis
H37Rv]
gi|148506644|gb|ABQ74453.1| putative phiRv2 prophage protein [Mycobacterium tuberculosis
H37Ra]
Length=479
Score = 811 bits (2096), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/468 (90%), Positives = 441/468 (95%), Gaps = 0/468 (0%)
Query 6 DIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAEEALRR 65
DIK LSL ETR AAKQLLDSV GDLTG+ AQRFQALTRHAEELRAEQRRRGREAEEALRR
Sbjct 12 DIKQLSLDETRSAAKQLLDSVEGDLTGDVAQRFQALTRHAEELRAEQRRRGREAEEALRR 71
Query 66 YRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGP 125
RAGELRVVPGAPTGGDDGDAPPGNSLRD AFRTLD CVRDGLMSSRAAE AETLCRTGP
Sbjct 72 CRAGELRVVPGAPTGGDDGDAPPGNSLRDIAFRTLDVCVRDGLMSSRAAEAAETLCRTGP 131
Query 126 PQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGL 185
PQSTSWAQRWLAATG+RDYLGAFVKRVSNPVAGHT WTDREAAAWREAAAVAAEQRAMGL
Sbjct 132 PQSTSWAQRWLAATGNRDYLGAFVKRVSNPVAGHTTWTDREAAAWREAAAVAAEQRAMGL 191
Query 186 VDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQE 245
VDT GGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSE+WRGVTSEGAEA WYSEAQE
Sbjct 192 VDTAGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEVWRGVTSEGAEAHWYSEAQE 251
Query 246 VSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSG 305
VSDDSP LAQPAVP+YRGSCWIPFS+E+EGDAA FV E+G++LADSVEQLQAAAFV+GSG
Sbjct 252 VSDDSPTLAQPAVPSYRGSCWIPFSLEIEGDAAGFVAEVGRVLADSVEQLQAAAFVSGSG 311
Query 306 NGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTLR 365
NGEPTGFVSALTGT+D V GAG+EA+VAADVYALQSALPPRFQ+++AFAANLSTIN LR
Sbjct 312 NGEPTGFVSALTGTADYTVTGAGTEAVVAADVYALQSALPPRFQSNSAFAANLSTINVLR 371
Query 366 QAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRV 425
QAET+NGALKFPSLH SPPMLAGK + EVS+MDTVD+AVTATN+PLVLGDWKQF+I DRV
Sbjct 372 QAETANGALKFPSLHASPPMLAGKHIWEVSNMDTVDAAVTATNYPLVLGDWKQFIITDRV 431
Query 426 GSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
GS VELVPH+FG NRRPTGQRGFF WFRVGSDVLV NAFRVLKV+TTA
Sbjct 432 GSTVELVPHVFGGNRRPTGQRGFFCWFRVGSDVLVDNAFRVLKVQTTA 479
>gi|306805298|ref|ZP_07441966.1| phage capsid family protein [Mycobacterium tuberculosis SUMu008]
gi|308348142|gb|EFP36993.1| phage capsid family protein [Mycobacterium tuberculosis SUMu008]
Length=373
Score = 755 bits (1949), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/373 (100%), Positives = 373/373 (100%), Gaps = 0/373 (0%)
Query 101 DSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHT 160
DSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHT
Sbjct 1 DSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHT 60
Query 161 VWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVV 220
VWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVV
Sbjct 61 VWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVV 120
Query 221 QTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASF 280
QTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASF
Sbjct 121 QTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASF 180
Query 281 VGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYAL 340
VGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYAL
Sbjct 181 VGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYAL 240
Query 341 QSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTV 400
QSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTV
Sbjct 241 QSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTV 300
Query 401 DSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLV 460
DSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLV
Sbjct 301 DSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLV 360
Query 461 RNAFRVLKVETTA 473
RNAFRVLKVETTA
Sbjct 361 RNAFRVLKVETTA 373
>gi|308372556|ref|ZP_07429056.2| phage capsid family protein [Mycobacterium tuberculosis SUMu004]
gi|308332841|gb|EFP21692.1| phage capsid family protein [Mycobacterium tuberculosis SUMu004]
Length=370
Score = 746 bits (1926), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/370 (99%), Positives = 370/370 (100%), Gaps = 0/370 (0%)
Query 104 VRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWT 163
+RDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWT
Sbjct 1 MRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWT 60
Query 164 DREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTT 223
DREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTT
Sbjct 61 DREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTT 120
Query 224 SEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE 283
SEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE
Sbjct 121 SEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE 180
Query 284 IGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSA 343
IGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSA
Sbjct 181 IGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSA 240
Query 344 LPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSA 403
LPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSA
Sbjct 241 LPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSA 300
Query 404 VTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNA 463
VTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNA
Sbjct 301 VTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNA 360
Query 464 FRVLKVETTA 473
FRVLKVETTA
Sbjct 361 FRVLKVETTA 370
>gi|167966951|ref|ZP_02549228.1| putative phiRv1 phage protein [Mycobacterium tuberculosis H37Ra]
Length=354
Score = 716 bits (1849), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/354 (99%), Positives = 354/354 (100%), Gaps = 0/354 (0%)
Query 120 LCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAE 179
+CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAE
Sbjct 1 MCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAE 60
Query 180 QRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARW 239
QRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARW
Sbjct 61 QRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARW 120
Query 240 YSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAA 299
YSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAA
Sbjct 121 YSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAA 180
Query 300 FVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLS 359
FVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLS
Sbjct 181 FVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLS 240
Query 360 TINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQF 419
TINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQF
Sbjct 241 TINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQF 300
Query 420 LIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
LIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA
Sbjct 301 LIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 354
>gi|307084155|ref|ZP_07493268.1| phage capsid family protein [Mycobacterium tuberculosis SUMu012]
gi|308366219|gb|EFP55070.1| phage capsid family protein [Mycobacterium tuberculosis SUMu012]
Length=392
Score = 714 bits (1842), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/385 (98%), Positives = 376/385 (98%), Gaps = 5/385 (1%)
Query 89 GNSLRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAF 148
G+ + T F CVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAF
Sbjct 13 GHRVSHTGF-----CVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAF 67
Query 149 VKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDG 208
VKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDG
Sbjct 68 VKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDG 127
Query 209 STNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIP 268
STNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIP
Sbjct 128 STNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIP 187
Query 269 FSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAG 328
FSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAG
Sbjct 188 FSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAG 247
Query 329 SEAIVAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAG 388
SEAIVAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAG
Sbjct 248 SEAIVAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAG 307
Query 389 KSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGF 448
KSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGF
Sbjct 308 KSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGF 367
Query 449 FAWFRVGSDVLVRNAFRVLKVETTA 473
FAWFRVGSDVLVRNAFRVLKVETTA
Sbjct 368 FAWFRVGSDVLVRNAFRVLKVETTA 392
>gi|289758778|ref|ZP_06518156.1| phiRv2 prophage protein [Mycobacterium tuberculosis T85]
gi|289714342|gb|EFD78354.1| phiRv2 prophage protein [Mycobacterium tuberculosis T85]
Length=382
Score = 660 bits (1704), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/382 (89%), Positives = 362/382 (95%), Gaps = 0/382 (0%)
Query 92 LRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKR 151
+RDTAFRTLD CVRDGLMSSRAAE AETLCRTGPPQSTSWAQRWLAATG+RDYLGAFVKR
Sbjct 1 MRDTAFRTLDVCVRDGLMSSRAAEAAETLCRTGPPQSTSWAQRWLAATGNRDYLGAFVKR 60
Query 152 VSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTN 211
VSNPVAGHT WTDREAAAWREAAAVAAEQRAMGLVDT GGFLIPAALDPAILLSGDGSTN
Sbjct 61 VSNPVAGHTTWTDREAAAWREAAAVAAEQRAMGLVDTAGGFLIPAALDPAILLSGDGSTN 120
Query 212 PIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSI 271
PIRQVARVVQTTSE+WRGVTSEGAEA WYSEAQEVSDDSP LAQPAVP+YRGSCWIPFS+
Sbjct 121 PIRQVARVVQTTSEVWRGVTSEGAEAHWYSEAQEVSDDSPTLAQPAVPSYRGSCWIPFSL 180
Query 272 ELEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEA 331
E+EGDAA FV E+G++LADSVEQLQAAAFV+GSGNGEPTGFVSALTGT+D V GAG+EA
Sbjct 181 EIEGDAAGFVAEVGRVLADSVEQLQAAAFVSGSGNGEPTGFVSALTGTADYTVTGAGTEA 240
Query 332 IVAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSV 391
+VAADVYALQSALPPRFQ+++AFAANLSTIN LRQAET+NGALKFPSLH SPPMLAGK +
Sbjct 241 VVAADVYALQSALPPRFQSNSAFAANLSTINVLRQAETANGALKFPSLHASPPMLAGKHI 300
Query 392 LEVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAW 451
EVS+MDTVD+AVTATN+PLVLGDWKQF+I DRVGS VELVPH+FG NRRPTGQRGFF W
Sbjct 301 WEVSNMDTVDAAVTATNYPLVLGDWKQFIITDRVGSTVELVPHVFGGNRRPTGQRGFFCW 360
Query 452 FRVGSDVLVRNAFRVLKVETTA 473
FRVGSDVLV NAFRVLKV+TTA
Sbjct 361 FRVGSDVLVDNAFRVLKVQTTA 382
>gi|308405926|ref|ZP_07494458.2| phage capsid family protein [Mycobacterium tuberculosis SUMu012]
gi|308365115|gb|EFP53966.1| phage capsid family protein [Mycobacterium tuberculosis SUMu012]
Length=354
Score = 609 bits (1571), Expect = 3e-172, Method: Compositional matrix adjust.
Identities = 312/354 (89%), Positives = 336/354 (95%), Gaps = 0/354 (0%)
Query 120 LCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAE 179
+CRTGPPQSTSWAQRWLAATG+RDYLGAFVKRVSNPVAGHT WTDREAAAWREAAAVAAE
Sbjct 1 MCRTGPPQSTSWAQRWLAATGNRDYLGAFVKRVSNPVAGHTTWTDREAAAWREAAAVAAE 60
Query 180 QRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARW 239
QRAMGLVDT GGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSE+WRGVTSEGAEA W
Sbjct 61 QRAMGLVDTAGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEVWRGVTSEGAEAHW 120
Query 240 YSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAA 299
YSEAQEVSDDSP LAQPAVP+YRGSCWIPFS+E+EGDAA FV E+G++LADSVEQLQAAA
Sbjct 121 YSEAQEVSDDSPTLAQPAVPSYRGSCWIPFSLEIEGDAAGFVAEVGRVLADSVEQLQAAA 180
Query 300 FVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLS 359
FV+GSGNGEPTGFVSALTGT+D V GAG+EA+VAADVYALQSALPPRFQ+++AFAANLS
Sbjct 181 FVSGSGNGEPTGFVSALTGTADYTVTGAGTEAVVAADVYALQSALPPRFQSNSAFAANLS 240
Query 360 TINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQF 419
TIN LRQAET+NGALKFPSLH SPPMLAGK + EVS+MDTVD+AVTATN+PLVLGDWKQF
Sbjct 241 TINVLRQAETANGALKFPSLHASPPMLAGKHIWEVSNMDTVDAAVTATNYPLVLGDWKQF 300
Query 420 LIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
+I DRVGS VELVPH+FG NRRPTGQRGFF WFRVGSDVLV NAFRVLKV+TTA
Sbjct 301 IITDRVGSTVELVPHVFGGNRRPTGQRGFFCWFRVGSDVLVDNAFRVLKVQTTA 354
>gi|289448305|ref|ZP_06438049.1| LOW QUALITY PROTEIN: phiRv2 phage protein [Mycobacterium tuberculosis
CPHL_A]
gi|289421263|gb|EFD18464.1| LOW QUALITY PROTEIN: phiRv2 phage protein [Mycobacterium tuberculosis
CPHL_A]
Length=366
Score = 547 bits (1409), Expect = 2e-153, Method: Compositional matrix adjust.
Identities = 293/356 (83%), Positives = 319/356 (90%), Gaps = 3/356 (0%)
Query 120 LCRTGPPQSTSWAQRWLAATGSRDYLGAFVKR--VSNPVAGHTVWTDREAAAWREAAAVA 177
LCRTGPPQS + LA + L V++ NPVAGHT WTDREAAAWREAAAVA
Sbjct 12 LCRTGPPQS-NLVGAALAGGHRQPRLPGGVRQEGFRNPVAGHTTWTDREAAAWREAAAVA 70
Query 178 AEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEA 237
AEQRAMGLVDT GGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSE+WRGVTSEGAEA
Sbjct 71 AEQRAMGLVDTAGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEVWRGVTSEGAEA 130
Query 238 RWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQA 297
WYSEAQEVSDDSP LAQPAVP+YRGSCWIPFS+E+EGDAA FV E+G++LADSVEQLQA
Sbjct 131 HWYSEAQEVSDDSPTLAQPAVPSYRGSCWIPFSLEIEGDAAGFVAEVGRVLADSVEQLQA 190
Query 298 AAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAAN 357
AAFV+GSGNGEPTGFVSALTGT+D V GAG+EA+VAADVYALQSALPPRFQ+++AFAAN
Sbjct 191 AAFVSGSGNGEPTGFVSALTGTADYTVTGAGTEAVVAADVYALQSALPPRFQSNSAFAAN 250
Query 358 LSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWK 417
LSTIN LRQAET+NGALKFPSLH SPPMLAGK + EVS+MDTVD+AVTATN+PLVLGDWK
Sbjct 251 LSTINVLRQAETANGALKFPSLHASPPMLAGKHIWEVSNMDTVDAAVTATNYPLVLGDWK 310
Query 418 QFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
QF+I DRVGS VELVPH+FG NRRPTGQRGFF WFRVGSDVLV NAFRVLKV+TTA
Sbjct 311 QFIITDRVGSTVELVPHVFGGNRRPTGQRGFFCWFRVGSDVLVDNAFRVLKVQTTA 366
>gi|289569612|ref|ZP_06449839.1| hypothetical protein TBJG_04301 [Mycobacterium tuberculosis T17]
gi|289543366|gb|EFD47014.1| hypothetical protein TBJG_04301 [Mycobacterium tuberculosis T17]
Length=266
Score = 490 bits (1261), Expect = 3e-136, Method: Compositional matrix adjust.
Identities = 244/246 (99%), Positives = 246/246 (100%), Gaps = 0/246 (0%)
Query 92 LRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKR 151
+RDTAFRTLDSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKR
Sbjct 1 MRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKR 60
Query 152 VSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTN 211
VSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTN
Sbjct 61 VSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTN 120
Query 212 PIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSI 271
PIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSI
Sbjct 121 PIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSI 180
Query 272 ELEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEA 331
ELEGDAASFVGEIGKILADSVEQLQAAAFV+GSGNGEPTGFVSALTGTSDQVVVGAGSEA
Sbjct 181 ELEGDAASFVGEIGKILADSVEQLQAAAFVSGSGNGEPTGFVSALTGTSDQVVVGAGSEA 240
Query 332 IVAADV 337
IVAADV
Sbjct 241 IVAADV 246
>gi|240172573|ref|ZP_04751232.1| phiRv2 prophage protein [Mycobacterium kansasii ATCC 12478]
Length=486
Score = 469 bits (1206), Expect = 6e-130, Method: Compositional matrix adjust.
Identities = 245/487 (51%), Positives = 325/487 (67%), Gaps = 15/487 (3%)
Query 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQ----RRRG 56
M + +I ++ + R AA+QLLDS GDLTG AA+RFQALT HAE+LR Q RR
Sbjct 1 MKKMTEIDFTTVEQCRAAAQQLLDSTDGDLTGPAAERFQALTLHAEQLRERQAQRDRRHA 60
Query 57 REAEEALRRYRAGELRVVPGA----PTGGD-----DGDAPPGNSLRDTAFRTLDSCVRDG 107
+ +R ++GELR GA G+ D D P + RD+A RT++ + G
Sbjct 61 TDLAAMVRGLQSGELRTEGGANGMHTLNGEQRSQYDEDRPAPDRQRDSAMRTIERSHKAG 120
Query 108 LMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREA 167
L+++ AE AE L +GP + SWA RW+A TG Y AF K V +P GH +T E
Sbjct 121 LLAAGGAEVAERLVGSGPAPARSWAARWIAETGCEKYREAFSKLVLDPQRGHLQFTPAEG 180
Query 168 AAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIW 227
A+R A+ AEQRAM L D GGFL+P LDP +LLS DGS NP+ +++RV+QT S++W
Sbjct 181 EAFRRVTALQAEQRAMSLTDAAGGFLVPFELDPTVLLSSDGSNNPLMKISRVIQTVSDVW 240
Query 228 RGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKI 287
GVTSEG A W E+ E +D SP L QPA+P+ + S ++PFS+EL+GDA + + E+G++
Sbjct 241 HGVTSEGVVAEWLPESSEAADASPTLTQPAIPSCKASVFVPFSVELQGDATTLMQELGRL 300
Query 288 LADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPR 347
L D +QL A AF GSG G+PTG +SAL G S VV G GSEA+ A+D+Y +QS LPPR
Sbjct 301 LQDGADQLLATAFTTGSGTGQPTGIISALAGGSS-VVTGDGSEALAASDIYKVQSMLPPR 359
Query 348 FQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVLEVSHMD-TVDSAVTA 406
FQ A++ ANLS +NT+RQ ET+NGAL+FP L SPP L G+++ E S+MD ++++A T
Sbjct 360 FQPRASWNANLSILNTIRQFETTNGALRFPELSTSPPKLLGRNIYENSNMDGSLNTAATE 419
Query 407 TNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRV 466
TNH L+ GD+ QF I R GS +EL+PHL G NRRPTG+RG + W RVGSDVLV NAFR+
Sbjct 420 TNHVLLYGDFSQFAITMRTGSSLELIPHLVGANRRPTGERGAWLWMRVGSDVLVDNAFRL 479
Query 467 LKVETTA 473
L V T+A
Sbjct 480 LNVPTSA 486
>gi|289570824|ref|ZP_06451051.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289544578|gb|EFD48226.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
Length=216
Score = 380 bits (975), Expect = 4e-103, Method: Compositional matrix adjust.
Identities = 180/216 (84%), Positives = 202/216 (94%), Gaps = 0/216 (0%)
Query 258 VPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALT 317
+P+YRGSCWIPFS+E+EGDAA FV E+G++LADSVEQLQAAAFV+GSGNGEPTGFVSALT
Sbjct 1 MPSYRGSCWIPFSLEIEGDAAGFVAEVGRVLADSVEQLQAAAFVSGSGNGEPTGFVSALT 60
Query 318 GTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFP 377
GT+D V GAG+EA+VAADVYALQSALPPRFQ+++AFAANLSTIN LRQAET+NGALKFP
Sbjct 61 GTADYTVTGAGTEAVVAADVYALQSALPPRFQSNSAFAANLSTINVLRQAETANGALKFP 120
Query 378 SLHDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFG 437
SLH SPPMLAGK + EVS+MDTVD+AVTATN+PLVLGDWKQF+I DRVGS VELVPH+FG
Sbjct 121 SLHASPPMLAGKHIWEVSNMDTVDAAVTATNYPLVLGDWKQFIITDRVGSTVELVPHVFG 180
Query 438 PNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
NRRPTGQRGFF WFRVGSDVLV NAFRVLKV+TTA
Sbjct 181 GNRRPTGQRGFFCWFRVGSDVLVDNAFRVLKVQTTA 216
>gi|289751273|ref|ZP_06510651.1| phiRv2 phage protein [Mycobacterium tuberculosis T92]
gi|289691860|gb|EFD59289.1| phiRv2 phage protein [Mycobacterium tuberculosis T92]
Length=202
Score = 344 bits (882), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 166/201 (83%), Positives = 185/201 (93%), Gaps = 0/201 (0%)
Query 273 LEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAI 332
+EG A FV E+G++LADSVEQLQAAAFV+GSGNGEPTGFVSALTGT+D V GAG+EA+
Sbjct 2 IEGATAGFVAEVGRVLADSVEQLQAAAFVSGSGNGEPTGFVSALTGTADYTVTGAGTEAV 61
Query 333 VAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVL 392
VAADVYALQSALPPRFQ+++AFAANLSTIN LRQAET+NGALKFPSLH SPPMLAGK +
Sbjct 62 VAADVYALQSALPPRFQSNSAFAANLSTINVLRQAETANGALKFPSLHASPPMLAGKHIW 121
Query 393 EVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWF 452
EVS+MDTVD+AVTATN+PLVLGDWKQF+I DRVGS VELVPH+FG NRRPTGQRGFF WF
Sbjct 122 EVSNMDTVDAAVTATNYPLVLGDWKQFIITDRVGSTVELVPHVFGGNRRPTGQRGFFCWF 181
Query 453 RVGSDVLVRNAFRVLKVETTA 473
RVGSDVLV NAFRVLKV+TTA
Sbjct 182 RVGSDVLVDNAFRVLKVQTTA 202
>gi|226307463|ref|YP_002767423.1| hypothetical protein RER_39760 [Rhodococcus erythropolis PR4]
gi|226186580|dbj|BAH34684.1| hypothetical protein RER_39760 [Rhodococcus erythropolis PR4]
Length=473
Score = 341 bits (875), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 196/465 (43%), Positives = 283/465 (61%), Gaps = 27/465 (5%)
Query 16 RDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELR--AEQRRRGREAEEALRRYRAGELRV 73
R A +L + + +LT + ++RF +L E ++ EQ R RE EA AG
Sbjct 29 RTEATELTERI--ELTADDSERFDSLADDLEYIKRALEQHSRLRELVEA-GSIEAGASFG 85
Query 74 VPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQ 133
V GA T D + +RD A R ++ + G ++ AA +E L T S A
Sbjct 86 VGGASTHKDS------DPVRDQALRNIERAHKAGRLTESAATLSEHLVGT-----DSVAA 134
Query 134 RWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGG-- 191
R A TGS Y AF K V++P GH +WT E A+R+A + GL++ GG
Sbjct 135 RLAATTGSDAYRSAFAKLVTDPQRGHMLWTPDEGQAYRDA------DKVRGLIEGSGGTG 188
Query 192 -FLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDS 250
L+P LDP+++L+ GS +P+R+++RVVQT S W GV+S G + W +E + D +
Sbjct 189 KHLVPWDLDPSVILTNAGSVSPLREISRVVQTNSNAWNGVSSAGVTSDWTAETAQAPDGT 248
Query 251 PALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNGEPT 310
P L +P ++ + W+PFSIELE D + E+ K+L DS QL+ AF GSG+G+PT
Sbjct 249 PTLVPEPIPVHKAASWVPFSIELEQDGLHLLAELQKLLVDSAVQLENTAFATGSGSGQPT 308
Query 311 GFVSALTGTSDQVVV-GAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTLRQAET 369
G ++AL V+V G G+EA+V+ADVYALQ+AL R+QA+A+FA NL+ +NT+RQ ET
Sbjct 309 GLITALVAAGGSVIVPGTGTEALVSADVYALQNALGSRWQANASFAGNLAVLNTIRQFET 368
Query 370 SNGALKFPSLHDSPPMLAGKSVLEVSHMD-TVDSAVTATNHPLVLGDWKQFLIGDRVGSM 428
+NGALKFPS + P L + + E+S MD +++A T +N+ LV GD++ F+I DRVG+
Sbjct 369 TNGALKFPSAQNVPASLLSRPLHEISGMDGVINAAATESNYSLVYGDFQNFVIVDRVGTT 428
Query 429 VELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
VELVPHL G N RPTG+RG + + RVGSDV+ AF++L++ TTA
Sbjct 429 VELVPHLMGANGRPTGERGLYMFRRVGSDVVNPAAFKLLRINTTA 473
>gi|307084156|ref|ZP_07493269.1| hypothetical protein TMLG_00562 [Mycobacterium tuberculosis SUMu012]
gi|308366211|gb|EFP55062.1| hypothetical protein TMLG_00562 [Mycobacterium tuberculosis SUMu012]
Length=159
Score = 320 bits (820), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 158/158 (100%), Positives = 158/158 (100%), Gaps = 0/158 (0%)
Query 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE
Sbjct 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
Query 61 EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL
Sbjct 61 EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
Query 121 CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAG 158
CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAG
Sbjct 121 CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAG 158
>gi|289751274|ref|ZP_06510652.1| phiRv1 phage protein [Mycobacterium tuberculosis T92]
gi|289691861|gb|EFD59290.1| phiRv1 phage protein [Mycobacterium tuberculosis T92]
Length=175
Score = 301 bits (772), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 149/157 (95%), Positives = 151/157 (97%), Gaps = 0/157 (0%)
Query 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE
Sbjct 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
Query 61 EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL
Sbjct 61 EALRRYRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETL 120
Query 121 CRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVA 157
CRTGPPQSTSWAQRWLA TGSRDY+ FV R+S P A
Sbjct 121 CRTGPPQSTSWAQRWLAGTGSRDYMDPFVTRISGPAA 157
>gi|15843075|ref|NP_338112.1| hypothetical protein MT3573.12 [Mycobacterium tuberculosis CDC1551]
gi|13883420|gb|AAK47926.1| hypothetical protein MT3573.12 [Mycobacterium tuberculosis CDC1551]
Length=141
Score = 289 bits (740), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 140/141 (99%), Positives = 141/141 (100%), Gaps = 0/141 (0%)
Query 333 VAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVL 392
+AADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVL
Sbjct 1 MAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDSPPMLAGKSVL 60
Query 393 EVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWF 452
EVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWF
Sbjct 61 EVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWF 120
Query 453 RVGSDVLVRNAFRVLKVETTA 473
RVGSDVLVRNAFRVLKVETTA
Sbjct 121 RVGSDVLVRNAFRVLKVETTA 141
>gi|15843074|ref|NP_338111.1| hypothetical protein MT3573.11 [Mycobacterium tuberculosis CDC1551]
gi|13883419|gb|AAK47925.1| hypothetical protein MT3573.11 [Mycobacterium tuberculosis CDC1551]
Length=224
Score = 287 bits (735), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 141/145 (98%), Positives = 143/145 (99%), Gaps = 0/145 (0%)
Query 92 LRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKR 151
+RDTAFRTLDSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKR
Sbjct 1 MRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKR 60
Query 152 VSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTN 211
VSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTN
Sbjct 61 VSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTN 120
Query 212 PIRQVARVVQTTSEIWRGVTSEGAE 236
PIRQVARVVQTTSEIWRGVTSE +
Sbjct 121 PIRQVARVVQTTSEIWRGVTSEAPK 145
>gi|307085346|ref|ZP_07494459.1| hypothetical protein TMLG_04087 [Mycobacterium tuberculosis SUMu012]
gi|308365111|gb|EFP53962.1| hypothetical protein TMLG_04087 [Mycobacterium tuberculosis SUMu012]
Length=177
Score = 266 bits (681), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 134/153 (88%), Positives = 137/153 (90%), Gaps = 0/153 (0%)
Query 6 DIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAEEALRR 65
DIK LSL ETR AAKQLLDSV GDLTG+ AQRFQALTRHAEELRAEQRRRGREAEEALRR
Sbjct 12 DIKQLSLDETRSAAKQLLDSVEGDLTGDVAQRFQALTRHAEELRAEQRRRGREAEEALRR 71
Query 66 YRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGP 125
RAGELRVVPGAPTGGDDGDAPPGNSLRD AFRTLD CVRDGLMSSRAAE AETLCRTGP
Sbjct 72 CRAGELRVVPGAPTGGDDGDAPPGNSLRDIAFRTLDVCVRDGLMSSRAAEAAETLCRTGP 131
Query 126 PQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAG 158
PQSTSWAQRWLAATG+RDYLGAF + P G
Sbjct 132 PQSTSWAQRWLAATGNRDYLGAFGQEGFEPCCG 164
>gi|290959236|ref|YP_003490418.1| phage capsid protein [Streptomyces scabiei 87.22]
gi|260648762|emb|CBG71875.1| putative phage capsid protein [Streptomyces scabiei 87.22]
Length=493
Score = 256 bits (654), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 146/337 (44%), Positives = 207/337 (62%), Gaps = 15/337 (4%)
Query 134 RWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFL 193
R AT S +Y+ A+ K GH V + + A +RAM L D+ GG+L
Sbjct 170 RMCLATSSPEYMRAWSKLARG--KGHMVTPEEQQAL----------ERAMSLTDSAGGYL 217
Query 194 IPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPAL 253
+P LDP I+++ +GS N IRQVAR V T +IW GV+S RW +EA E SD++P L
Sbjct 218 VPFQLDPTIIITANGSINQIRQVARQVVATGDIWNGVSSGSVSWRWAAEASEASDNAPTL 277
Query 254 AQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNGEPTGFV 313
AQP VP Y+ ++P SIE DA + E+G++LA + L+AAA GSG+G+PTG V
Sbjct 278 AQPTVPVYKADGFVPISIEAMDDAENVTTEVGRLLAFGKDTLEAAALATGSGSGQPTGIV 337
Query 314 SALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGA 373
+ALTGTS +V ++ + DVY + +ALP R++ +AA+ AN N +RQ ++S G
Sbjct 338 TALTGTS-SIVTSTTTDTFASGDVYKVDTALPGRYRPNAAWLANRGIYNAVRQFDSSGGT 396
Query 374 LKFPSL-HDSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELV 432
+ + D PPML G+ LE MD V +A A N+ +V GD+ ++I DR+G +E +
Sbjct 397 NLWERIGADVPPMLLGRKALESEDMDGVVTAA-AENYVMVYGDFDNYVIADRIGMSIEFL 455
Query 433 PHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKV 469
PHL G NRRPTGQRG++AW+RVG+D + AFR+L V
Sbjct 456 PHLVGANRRPTGQRGWYAWYRVGADSVNDGAFRMLNV 492
>gi|120405315|ref|YP_955144.1| phage major capsid protein, HK97 [Mycobacterium vanbaalenii PYR-1]
gi|119958133|gb|ABM15138.1| phage major capsid protein, HK97 [Mycobacterium vanbaalenii PYR-1]
Length=389
Score = 241 bits (614), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 143/338 (43%), Positives = 196/338 (58%), Gaps = 23/338 (6%)
Query 138 ATGSRDYLGAFVKRV---SNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFLI 194
AT S DY AF K + NP TV + RE V A QRAM L D QGGFL+
Sbjct 68 ATTSPDYSRAFTKMIRSRGNP----TVLSGRE---------VQAYQRAMSLTDNQGGFLV 114
Query 195 PAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALA 254
P LDP I+L+ +GS N +RQ++RVVQ T + W GVTS G W EA EVSDDSP L
Sbjct 115 PMQLDPTIILTANGSFNQVRQISRVVQATGKSWTGVTSAGVSGSWDGEAVEVSDDSPELQ 174
Query 255 QPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVS 314
QP +P ++ W+ FS EL+ DAA +I K++A + ++ AF GSG G+P G ++
Sbjct 175 QPEIPVHKLQIWVEFSHELQHDAAGLADDIAKMIAFEKDVKESIAFATGSGVGQPRGVIT 234
Query 315 ALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGAL 374
AL G SD VV A ++ A DV+ L LP R+ +A++ A+ + +RQ +T+ GA
Sbjct 235 ALMG-SDSVVNSAVTDTFAAGDVHNLDGDLPQRYAFNASWLAHRKIYSKIRQFDTNGGAS 293
Query 375 KFPSLHDS-PPMLAGKSVLEVSHMDTVDSAVT--ATNHPLVLGDWKQFLIGDRVGSMVEL 431
+ L + L G+ M DS++T NH L GD++ F+I DR+G+ +
Sbjct 294 LWGQLAEGRKSELLGRPDYVAEAM---DSSITNGQDNHVLAFGDFQNFVIADRLGTTLSY 350
Query 432 VPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKV 469
+P+L GPN RP G+ G+ AW RVGSDV+ AFR+L V
Sbjct 351 IPNLMGPNGRPVGKAGWHAWIRVGSDVVNPGAFRLLNV 388
>gi|206599551|ref|YP_002241990.1| gp7 [Mycobacterium phage Brujita]
gi|206282700|gb|ACI06221.1| gp7 [Mycobacterium phage Brujita]
gi|302858444|gb|ADL71191.1| gp7 [Mycobacterium phage island3]
Length=515
Score = 238 bits (608), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 139/367 (38%), Positives = 210/367 (58%), Gaps = 14/367 (3%)
Query 110 SSRAAETAETLCRTGPPQSTSWAQRWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAA 169
S + E A + + ++ A++ L T S Y+ A+ K NP + ++ E A
Sbjct 160 SDKVREAATKIIERFDDKHSTLARQCLL-TSSPAYMRAWSKMARNPHGA--ILSEDEKRA 216
Query 170 WREAAAVAAEQRAMGLVDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRG 229
E RAMGL D+ GG+L+P LDPA++++ +GS N IR AR V T + W G
Sbjct 217 LNEV-------RAMGLTDSDGGYLVPFQLDPAVIVTSNGSLNDIRMFARQVVATGDKWNG 269
Query 230 VTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILA 289
VTS + W +E +EVSDD+P QP +P + ++P SIE D A+ + +LA
Sbjct 270 VTSAAVQWSWDAEFEEVSDDAPTFGQPDIPIKKAQGFVPISIEALADEANVTQTVATLLA 329
Query 290 DSVEQLQAAAFVNGSGNG-EPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRF 348
+ ++L+A + GSG G EPTG V+AL GT+ ++ A +E ADVY + L R
Sbjct 330 EGKDELEAVTLITGSGQGNEPTGIVTALAGTAAEIAP-ATAETFAIADVYGVYEQLAARH 388
Query 349 QASAAFAANLSTINTLRQAETSNGALKFPSL-HDSPPMLAGKSVLEVSHMD-TVDSAVTA 406
+ A+ AN N +RQ +T GA + ++ + P L G+ V E MD T D TA
Sbjct 389 RKRGAWLANNLIYNKIRQFDTQGGAGLWETIGNGEPSQLLGRPVGEAEAMDATWDGTATA 448
Query 407 TNHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRV 466
N+ L+ G+++ ++I DR+G VE +PHLFG ++RPTGQRG++A+ R+G+DV+ NAFR+
Sbjct 449 DNYVLLYGNFQNYVIADRIGMTVEFIPHLFGSSQRPTGQRGWYAYCRMGADVVNPNAFRL 508
Query 467 LKVETTA 473
L VET +
Sbjct 509 LNVETAS 515
>gi|29566114|ref|NP_817683.1| gp6 [Mycobacterium phage Che9c]
gi|29424839|gb|AAN12566.1| gp6 [Mycobacterium phage Che9c]
Length=543
Score = 233 bits (594), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 132/343 (39%), Positives = 197/343 (58%), Gaps = 13/343 (3%)
Query 134 RWLAATGSRDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGGFL 193
R AT S YL A+ K NP A + T+ E A E RAMGL GG+L
Sbjct 211 RQCLATSSPAYLRAWSKMARNPHAA--ILTEEEKRAINEV-------RAMGLTKADGGYL 261
Query 194 IPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPAL 253
+P LDP ++++ +GS N IR+ AR V T ++W GV+S + W +E +EVSDDSP
Sbjct 262 VPFQLDPTVIITSNGSLNDIRRFARQVVATGDVWHGVSSAAVQWSWDAEFEEVSDDSPEF 321
Query 254 AQPAVPNYRGSCWIPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNG-EPTGF 312
QP +P + ++P SIE D A+ + + A+ ++L+A G+G G +PTG
Sbjct 322 GQPEIPVKKAQGFVPISIEALQDEANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGI 381
Query 313 VSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNG 372
V+AL GT+ ++ +E ADVYA+ L R + A+ AN N +RQ +T G
Sbjct 382 VTALAGTAAEIAP-VTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGG 440
Query 373 ALKFPSL-HDSPPMLAGKSVLEVSHMD-TVDSAVTATNHPLVLGDWKQFLIGDRVGSMVE 430
A + ++ + P L G+ V E MD +++ +A N L+ G+++ ++I DR+G VE
Sbjct 441 AGLWTTIGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTVE 500
Query 431 LVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
+PHLFG NRRP G RG+FA++R+G+DV+ NAFR+L VET +
Sbjct 501 FIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543
>gi|306804415|ref|ZP_07441083.1| hypothetical protein TMHG_01848 [Mycobacterium tuberculosis SUMu008]
gi|308348977|gb|EFP37828.1| hypothetical protein TMHG_01848 [Mycobacterium tuberculosis SUMu008]
Length=129
Score = 217 bits (552), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 109/118 (93%), Positives = 110/118 (94%), Gaps = 0/118 (0%)
Query 6 DIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAEEALRR 65
DIK LSL ETR AAKQLLDSV GDLTG+ AQRFQALTRHAEELRAEQRRRGREAEEALRR
Sbjct 12 DIKQLSLDETRSAAKQLLDSVEGDLTGDVAQRFQALTRHAEELRAEQRRRGREAEEALRR 71
Query 66 YRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDSCVRDGLMSSRAAETAETLCRT 123
RAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLD CVRDGLMSSRAAE AETLCRT
Sbjct 72 CRAGELRVVPGAPTGGDDGDAPPGNSLRDTAFRTLDVCVRDGLMSSRAAEAAETLCRT 129
>gi|317125799|ref|YP_004099911.1| hypothetical protein Intca_2682 [Intrasporangium calvum DSM 43043]
gi|315589887|gb|ADU49184.1| hypothetical protein Intca_2682 [Intrasporangium calvum DSM 43043]
Length=528
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 138/445 (32%), Positives = 209/445 (47%), Gaps = 35/445 (7%)
Query 47 ELRAEQRRRGREAEEALRRYRAGELRVVPGAPTGGDDGDAPPG-------------NSLR 93
ELR R R A+ + R R G LR A G D DA G +R
Sbjct 94 ELRDRVTRHQRLAK--VLRDRPGTLR---AAYHGLADDDASGGTFDAWTDVARMSDQQVR 148
Query 94 DTAFRTLDSCVRDGLMSSRAAETAETLCRT-----GPPQSTSWAQRWLAATGSRDYLGAF 148
D A R L++ RD +S+ A + L RT P + R + T + Y AF
Sbjct 149 DVALRGLEARERD--LSADQAARVDRLVRTVRTEENPNYDGAALARRIILTENEHYRSAF 206
Query 149 VKRVSNPVAGHTVWTDREAAAWREAAAV-AAEQRAMGL-VDTQGGFLIPAALDPAILLSG 206
+ +S P H + ++ E A R +E RAMG GG+ +P +DP+++++
Sbjct 207 RRVMSTP---HPLLSEPEIQALRAFQDFEKSELRAMGEGTGAAGGYGVPVFIDPSVIMTA 263
Query 207 DGSTNPIRQVARVVQTTSEIWRGVTSEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCW 266
GS N + +VV+ + +W+GV+S G + +E VSDDSP L QP V + +
Sbjct 264 QGSGNVFLDLCKVVEVNTNVWKGVSSAGVSWSFDAEGATVSDDSPTLDQPVVNVFTARGF 323
Query 267 IPFSIELEGDAASFVGEIGKILADSVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVG 326
+PFSIE+ D F E+ ++LA ++L F GSG GEP G V+AL V+
Sbjct 324 VPFSIEVGQDYPGFASEMAELLASGYDELLVDKFTRGSGTGEPQGIVTALDADPTAEVLL 383
Query 327 AGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNG--ALKFPSLHDSPP 384
+ + ADVY + + LP RF+ +++ + N +RQ T+ +
Sbjct 384 GTAGTLALADVYNVWAKLPQRFRRRSSWMGAVEINNKIRQLGTAANFHGTTVDLTAGAAD 443
Query 385 MLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELVPHLFG-PNRRPT 443
+L + E +M T + ++GD+ ++I R G VELVP LF N RPT
Sbjct 444 VLMNRQWYETPYMTD--LTTTTHTNVAIVGDFSNYVIARRSGLNVELVPTLFDVTNNRPT 501
Query 444 GQRGFFAWFRVGSDVLVRNAFRVLK 468
GQRG+FA+ R+G + FR+L
Sbjct 502 GQRGWFAYARIGGGSANNSGFRLLN 526
>gi|306805297|ref|ZP_07441965.1| hypothetical protein TMHG_04002 [Mycobacterium tuberculosis SUMu008]
gi|308348167|gb|EFP37018.1| hypothetical protein TMHG_04002 [Mycobacterium tuberculosis SUMu008]
Length=65
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 65/65 (100%), Positives = 65/65 (100%), Gaps = 0/65 (0%)
Query 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE
Sbjct 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
Query 61 EALRR 65
EALRR
Sbjct 61 EALRR 65
>gi|15843072|ref|NP_338109.1| hypothetical protein MT3573.9 [Mycobacterium tuberculosis CDC1551]
gi|13883417|gb|AAK47923.1| hypothetical protein MT3573.9 [Mycobacterium tuberculosis CDC1551]
Length=68
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 62/62 (100%), Positives = 62/62 (100%), Gaps = 0/62 (0%)
Query 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE
Sbjct 1 MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRAEQRRRGREAE 60
Query 61 EA 62
EA
Sbjct 61 EA 62
>gi|304390287|ref|ZP_07372240.1| HK97 family phage major capsid protein [Mobiluncus curtisii subsp.
curtisii ATCC 35241]
gi|304326043|gb|EFL93288.1| HK97 family phage major capsid protein [Mobiluncus curtisii subsp.
curtisii ATCC 35241]
Length=404
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 82/286 (29%), Positives = 135/286 (48%), Gaps = 14/286 (4%)
Query 186 VDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQ 244
VDT+GG+L+P + L+S N +R +A+V+QTTS + V S A W E +
Sbjct 123 VDTEGGYLVPDEFE-RTLISSLEDQNIMRSLAKVIQTTSGDRKIPVVSTHGTAGWLDEGK 181
Query 245 EVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE-IGKILADSVEQLQAAAFVNG 303
++ A Q + ++ ++ S EL DAA V + + A + + AF+ G
Sbjct 182 PYTESDEAFTQVTLSAFKLGTFLKISEELLNDAAFNVEQYLAAEFARRIGAAEEEAFLTG 241
Query 304 SGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINT 363
G G+PTG +A G V G S+ I A ++ L L ++ +A + N ST+ T
Sbjct 242 DGKGKPTGIFAATGGGEKAVTTGKASD-ITADELIDLHYGLRAPYRKNAVWLMNDSTVKT 300
Query 364 LRQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLI 421
+R+ + NG + P+L +P ++ G+ V H T + A + GD + I
Sbjct 301 IRKLKDGNGQYLWQPALTAGTPDLVLGRPV----HTSTFVPEIKAGASTVAFGDLSYYWI 356
Query 422 GDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVL 467
DR G + + LF TGQ GF A R+ +++ A ++L
Sbjct 357 ADRQGRSFKRLNELF----VTTGQVGFLASQRLDGKLVLPEAVKLL 398
>gi|306817330|ref|ZP_07451075.1| HK97 family phage major capsid protein [Mobiluncus mulieris ATCC
35239]
gi|304649771|gb|EFM47051.1| HK97 family phage major capsid protein [Mobiluncus mulieris ATCC
35239]
Length=405
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 81/292 (28%), Positives = 138/292 (48%), Gaps = 14/292 (4%)
Query 186 VDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQ 244
VDT+GG+L+P + L+S N +R +A+V+QTTS + V S A W E +
Sbjct 124 VDTEGGYLVPDEFE-RTLISSLEDQNIMRGLAKVIQTTSGDRKIPVVSTHGTAGWLDEGK 182
Query 245 EVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE-IGKILADSVEQLQAAAFVNG 303
++ Q + ++ ++ S EL D+A V + + A + + AF+ G
Sbjct 183 PYTESDETFTQVTLSAFKLGTFLKISEELLNDSAFNVEQYLAAEFARRIGAAEEEAFLTG 242
Query 304 SGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINT 363
G G+PTG +A G V G ++ I A ++ L AL ++ +A + N ST+ T
Sbjct 243 DGKGKPTGIFTASGGGEKAVTTGKATD-ITADELIDLHYALRGPYRKNAVWLMNDSTVKT 301
Query 364 LRQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLI 421
+R+ + NG + P+L +P ++ G+ V H T + A + GD + I
Sbjct 302 IRKLKDGNGQYLWQPALTAGTPDLVLGRPV----HTSTFVPEIKAGASTVAFGDLSYYWI 357
Query 422 GDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
DR G + + LF TGQ GF A R+ +++ A ++L + +A
Sbjct 358 ADRQGRSFKRLNELFA----TTGQVGFLASQRLDGKLVLPEAVKLLTQKASA 405
>gi|227875043|ref|ZP_03993188.1| HK97 family phage major capsid protein [Mobiluncus mulieris ATCC
35243]
gi|227844321|gb|EEJ54485.1| HK97 family phage major capsid protein [Mobiluncus mulieris ATCC
35243]
Length=409
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 81/292 (28%), Positives = 138/292 (48%), Gaps = 14/292 (4%)
Query 186 VDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQ 244
VDT+GG+L+P + L+S N +R +A+V+QTTS + V S A W E +
Sbjct 128 VDTEGGYLVPDEFE-RTLISSLEDQNIMRGLAKVIQTTSGDRKIPVVSTHGTAGWLDEGK 186
Query 245 EVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE-IGKILADSVEQLQAAAFVNG 303
++ Q + ++ ++ S EL D+A V + + A + + AF+ G
Sbjct 187 PYTESDETFTQVTLSAFKLGTFLKISEELLNDSAFNVEQYLAAEFARRIGAAEEEAFLTG 246
Query 304 SGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINT 363
G G+PTG +A G V G ++ I A ++ L AL ++ +A + N ST+ T
Sbjct 247 DGKGKPTGIFTASGGGEKAVTTGKATD-ITADELIDLHYALRGPYRKNAVWLMNDSTVKT 305
Query 364 LRQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLI 421
+R+ + NG + P+L +P ++ G+ V H T + A + GD + I
Sbjct 306 IRKLKDGNGQYLWQPALTAGTPDLVLGRPV----HTSTFVPEIKAGASTVAFGDLSYYWI 361
Query 422 GDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA 473
DR G + + LF TGQ GF A R+ +++ A ++L + +A
Sbjct 362 ADRQGRSFKRLNELFA----TTGQVGFLASQRLDGKLVLPEAVKLLTQKASA 409
>gi|298346363|ref|YP_003719050.1| HK97 family phage major capsid protein [Mobiluncus curtisii ATCC
43063]
gi|298236424|gb|ADI67556.1| HK97 family phage major capsid protein [Mobiluncus curtisii ATCC
43063]
Length=406
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/290 (29%), Positives = 138/290 (48%), Gaps = 14/290 (4%)
Query 186 VDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQ 244
VD++GG+L+P + ++ S N +R +A+V+QTTS + V S A W E +
Sbjct 125 VDSEGGYLVPDEFERTLVQSL-ADQNIMRSLAKVIQTTSGDRKIPVVSTHGTATWLDEGK 183
Query 245 EVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE-IGKILADSVEQLQAAAFVNG 303
S+ A Q ++ Y+ ++ S EL DAA V + + A + + AF+ G
Sbjct 184 PYSESDEAFTQISLSAYKLGTFLKISEELLNDAAFNVEQYLASEFARRIGAAEEEAFLIG 243
Query 304 SGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINT 363
G G+PTG + TG +D V + I A ++ L +L ++A A + N +T+ T
Sbjct 244 DGKGKPTGIFNP-TGGADLGVTTSKPTDINADELIDLHYSLRAPYRARAVWMMNDATVKT 302
Query 364 LRQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLI 421
+R+ + NG + P+L +P M+ G+ V H + A + GD + I
Sbjct 303 VRKLKDGNGQYLWQPALTAGTPDMILGRPV----HTSVFVPELKAGARTVAFGDLGFYWI 358
Query 422 GDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVET 471
DR G + + LF TGQ GF A R+ +++ A +VL +T
Sbjct 359 ADRQGRSFKRLNELFA----TTGQIGFLASQRLDGKLVLPEAIKVLTQKT 404
>gi|146277402|ref|YP_001167561.1| HK97 family phage major capsid protein [Rhodobacter sphaeroides
ATCC 17025]
gi|145555643|gb|ABP70256.1| phage major capsid protein, HK97 family [Rhodobacter sphaeroides
ATCC 17025]
Length=385
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 95/302 (32%), Positives = 141/302 (47%), Gaps = 19/302 (6%)
Query 174 AAVAAEQRAMGLV-DTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTS-EIWRGVT 231
AA A E +A+ + D QGG+L PA + + +P+R VA V QT S I
Sbjct 94 AAPADELKALNVSSDPQGGYLAPAEMSTE-FIRDLVEFSPVRAVASVRQTGSPSIIYPAR 152
Query 232 SEGAEARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFV-GEIGKILAD 290
+ ARW EAQ P Q V + ++ S +L D+A E+ LA+
Sbjct 153 TGITNARWKGEAQAQEGSEPGFGQAEVVVKEVNTFVDISNQLLADSAGQAEAEVRMALAE 212
Query 291 SVEQLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAAD-VYALQSALPPRFQ 349
Q + AAFV+G G EP GF++ G + V +G+ A + AD + L ALP ++
Sbjct 213 DFGQKEGAAFVSGDGILEPAGFMTH-AGIAHTV---SGAAAGITADALVKLLYALPATYR 268
Query 350 ASAAFAANLSTINTLRQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTAT 407
A+A N +T+ +R + +G + PS P L G+ V+E+ M V+ A
Sbjct 269 GRGAWAMNGTTLGAVRLLKDGDGRFLWQPSYQAGQPETLLGRPVVEMVDMPDVE----AG 324
Query 408 NHPLVLGDWKQFLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVL 467
P++ GDW + I DR+ V + P++ R G A RVG VL FR L
Sbjct 325 AFPIIYGDWSGYRIVDRIALSVLVNPYI----RATEGITRIHATRRVGGRVLQAAKFRKL 380
Query 468 KV 469
K+
Sbjct 381 KI 382
>gi|281418278|ref|ZP_06249298.1| phage major capsid protein, HK97 family [Clostridium thermocellum
JW20]
gi|281409680|gb|EFB39938.1| phage major capsid protein, HK97 family [Clostridium thermocellum
JW20]
Length=400
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 79/290 (28%), Positives = 140/290 (49%), Gaps = 14/290 (4%)
Query 187 DTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQE 245
DT+GG+L+P + L+ N RQ+A V+ T+S + V + A W E +
Sbjct 121 DTEGGYLVPDDFERT-LVEALEEENIFRQIANVITTSSGDKKIPVVASKGTASWVDEEGQ 179
Query 246 VSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE-IGKILADSVEQLQAAAFVNGS 304
+ + + AQ ++ Y+ + I S EL D+ + + I K A + + AF G
Sbjct 180 IPESDDSFAQVSIGAYKLATMIKVSEELLNDSVFNLEQYIAKEFARRIGAKEEEAFFIGD 239
Query 305 GNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTL 364
G+G+PTG + A G + V A + AI ++ L +L ++ +A F N STI +
Sbjct 240 GSGKPTGIL-ADNGGGEIGVTAASATAITLDEIMDLFYSLKSPYRRNAVFIMNDSTIKAI 298
Query 365 RQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIG 422
R+ + +NG + PS+ +P + + V + M A+ A +V GD+ + +
Sbjct 299 RKLKDNNGQYLWQPSVTAGTPDTILNRPVKTSAFM----PAIAAGAKTIVFGDFSYYWVA 354
Query 423 DRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETT 472
DR G + + + L+ TGQ GF A RV +++ A ++L+ ++T
Sbjct 355 DRQGRIFKRLNELYA----ATGQVGFMATQRVDGKLVLAEAVKILQQKST 400
>gi|125974135|ref|YP_001038045.1| HK97 family phage major capsid protein [Clostridium thermocellum
ATCC 27405]
gi|125714360|gb|ABN52852.1| phage major capsid protein, HK97 family [Clostridium thermocellum
ATCC 27405]
Length=400
Score = 93.6 bits (231), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 79/290 (28%), Positives = 140/290 (49%), Gaps = 14/290 (4%)
Query 187 DTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQE 245
DT+GG+L+P + L+ N RQ+A V+ T+S + V + A W E +
Sbjct 121 DTEGGYLVPDDFERT-LVEALEEENIFRQIANVITTSSGDKKIPVVASKGTASWVDEEGQ 179
Query 246 VSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE-IGKILADSVEQLQAAAFVNGS 304
+ + + AQ ++ Y+ + I S EL D+ + + I K A + + AF G
Sbjct 180 IPESDDSFAQVSIGAYKLATMIKVSEELLNDSVFNLEQYIAKEFARRIGAKEEEAFFIGD 239
Query 305 GNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTL 364
G+G+PTG + A G + V A + AI ++ L +L ++ +A F N STI +
Sbjct 240 GSGKPTGIL-ADNGGGEIGVTAASATAITLDEIMDLFYSLKSPYRRNAVFIMNDSTIKAI 298
Query 365 RQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIG 422
R+ + +NG + PS+ +P + + V + M A+ A +V GD+ + +
Sbjct 299 RKLKDNNGQYLWQPSVTAGTPDTILNRPVKTSAFM----PAIAAGAKTIVFGDFSYYWVA 354
Query 423 DRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETT 472
DR G + + + L+ TGQ GF A RV +++ A ++L+ ++T
Sbjct 355 DRQGRVFKRLNELYA----ATGQVGFMATQRVDGKLVLSEAVKILQQKST 400
>gi|315654943|ref|ZP_07907848.1| HK97 family major capsid protein [Mobiluncus curtisii ATCC 51333]
gi|315490904|gb|EFU80524.1| HK97 family major capsid protein [Mobiluncus curtisii ATCC 51333]
Length=405
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 82/290 (29%), Positives = 138/290 (48%), Gaps = 14/290 (4%)
Query 186 VDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQ 244
VD++GG+L+P + ++ S N +R +A+V+QTTS + V S A W E +
Sbjct 124 VDSEGGYLVPDEFERTLVQSL-ADQNIMRTLAKVIQTTSGDRKIPVVSTHGTATWLDEGK 182
Query 245 EVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE-IGKILADSVEQLQAAAFVNG 303
S+ A Q ++ ++ ++ S EL DAA V + + A + + AF+ G
Sbjct 183 PYSESDEAFTQISLSAFKLGTFLKISEELLNDAAFNVEQYLASEFARRIGAAEEEAFLVG 242
Query 304 SGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINT 363
G G+PTG + TG +D V A I A ++ L +L ++A A + N +T+ T
Sbjct 243 DGKGKPTGIFNP-TGGADLGVTSAKPTDISADELIDLHYSLRSPYRARAVWLMNDATVKT 301
Query 364 LRQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLI 421
+R+ + NG + P+L +P M+ G+ V + + A + GD + I
Sbjct 302 VRKLKDGNGQYLWQPALTAGTPDMILGRPV----YTSVFAPELKAGARTVAFGDLGFYWI 357
Query 422 GDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVET 471
DR G + + LF TGQ GF A R+ +++ A +VL +T
Sbjct 358 ADRQGRSFKRLNELFA----TTGQIGFLASQRLDGKLVLPEAIKVLTQKT 403
>gi|315656914|ref|ZP_07909801.1| HK97 family prophage LambdaSa04 [Mobiluncus curtisii subsp. holmesii
ATCC 35242]
gi|315492869|gb|EFU82473.1| HK97 family prophage LambdaSa04 [Mobiluncus curtisii subsp. holmesii
ATCC 35242]
Length=405
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 78/286 (28%), Positives = 134/286 (47%), Gaps = 14/286 (4%)
Query 186 VDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQ 244
VDT+GG+L+P + L+S N +R +A+V+QTTS + V S A W E +
Sbjct 124 VDTEGGYLVPDEFE-RTLISSLEDQNIMRSLAKVIQTTSGDRKIPVVSTHGTAGWLDEGK 182
Query 245 EVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE-IGKILADSVEQLQAAAFVNG 303
++ + Q + ++ ++ S EL D+A V + + A + + AF+ G
Sbjct 183 PYTESDESFTQVTLSAFKLGTFLKISEELLNDSAFNVEQYLAAEFARRIGAAEEEAFLTG 242
Query 304 SGNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINT 363
G +PTG +A G V G ++ I A ++ L AL ++ +A + N ST+ T
Sbjct 243 DGKNKPTGIFAATGGGEKAVTTGKATD-ITADELIDLHYALRAPYRKNAVWLMNDSTVKT 301
Query 364 LRQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLI 421
+R+ + N + P+L +P ++ G+ V H T + A + GD + I
Sbjct 302 VRKLKDGNDQYLWQPALTAGTPDLVLGRPV----HTSTFVPEIKAGASTVAFGDLSYYWI 357
Query 422 GDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVL 467
DR G + + LF TGQ GF A R+ +++ A ++L
Sbjct 358 ADRQGRSFKRLNELFA----TTGQVGFLASQRLDGKLVLPEAVKLL 399
>gi|110634245|ref|YP_674453.1| HK97 family phage major capsid protein [Mesorhizobium sp. BNC1]
gi|110285229|gb|ABG63288.1| phage major capsid protein, HK97 family [Chelativorans sp. BNC1]
Length=389
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/300 (30%), Positives = 143/300 (48%), Gaps = 17/300 (5%)
Query 177 AAEQRAMGL-VDTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEGA 235
A EQRA+ + D GGFL+P A +L +P+RQ ARV+ R G
Sbjct 97 ADEQRALTVSTDAAGGFLVPDNF-VAEMLRNVVQFSPVRQYARVMNVAGANVRMPKRTGT 155
Query 236 -EARWYSEAQEVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAA-SFVGEIGKILADSVE 293
A W +E + + PA + + + +C++ S +L D+A + E+ A+
Sbjct 156 MTAAWVAETGDRASTQPAYGEVELTPFEAACYVDISNQLLEDSAFNLESELAFDAAEEFG 215
Query 294 QLQAAAFVNGSGNGEPTGFVSALTGTSDQVVVGAGSEAIVAAD-VYALQSALPPRFQASA 352
+L++ AFV G G G+P G + A TG + V A + AD + L L P ++ +A
Sbjct 216 RLESVAFVAGDGTGKPKGIL-ADTGIATVVSGNASTLGTAPADKLIDLLYKLAPAYRRNA 274
Query 353 AFAANLSTINTLRQAETSNGALKF-PSLHD-SPPMLAGKSVLEVSHMDTVDSAVTATNHP 410
+A N +T+ +R+ + S G + P + + P + G+ V E+ M VTA P
Sbjct 275 TWALNSTTLALVRKLKDSQGNFLWQPGIANGQPETILGRPVAEMPDMPD----VTADALP 330
Query 411 LVLGDWKQ-FLIGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKV 469
+++GD++Q + I DRV V P+ GQ F RVG V+ AF+ LK+
Sbjct 331 ILIGDFQQGYRIVDRVSLAVLRDPYTMASK----GQTRFHMRRRVGGGVVKAEAFKALKI 386
>gi|256003557|ref|ZP_05428547.1| phage major capsid protein, HK97 family [Clostridium thermocellum
DSM 2360]
gi|255992581|gb|EEU02673.1| phage major capsid protein, HK97 family [Clostridium thermocellum
DSM 2360]
gi|316941378|gb|ADU75412.1| phage major capsid protein, HK97 family [Clostridium thermocellum
DSM 1313]
Length=400
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/286 (28%), Positives = 137/286 (48%), Gaps = 14/286 (4%)
Query 187 DTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQE 245
DT+GG+L+P + L+ N RQ+A V+ T+S + V + A W E +
Sbjct 121 DTEGGYLVPDDFERT-LVEALEEENIFRQIANVISTSSGDKKIPVVASKGTASWVDEEGQ 179
Query 246 VSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE-IGKILADSVEQLQAAAFVNGS 304
+ + + AQ ++ Y+ + I S EL D+ + + I K A + + AF G
Sbjct 180 IPESDDSFAQVSIGAYKLATMIKVSEELLNDSVFNLEQYIAKEFARRIGAKEEEAFFIGD 239
Query 305 GNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTL 364
G+G+PTG + A G + V A + AI ++ L +L ++ +A F N STI +
Sbjct 240 GSGKPTGIL-ADNGGGEIGVTAASATAITLDEIMDLFYSLKSPYRRNAVFIMNDSTIKAI 298
Query 365 RQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIG 422
R+ + +NG + PS+ +P + + V + M A+ A +V GD+ + +
Sbjct 299 RKLKDNNGQYLWQPSVTAGTPDTILNRPVKTSAFM----PAIAAGAKTIVFGDFSYYWVA 354
Query 423 DRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLK 468
DR G + + + L+ TGQ GF A RV +++ A ++L+
Sbjct 355 DRQGRVFKRLNELYA----ATGQVGFMATQRVDGKLVLSEAVKILQ 396
>gi|304316282|ref|YP_003851427.1| phage major capsid protein, HK97 family [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
gi|332983325|ref|YP_004464766.1| phage major capsid protein, HK97 family [Mahella australiensis
50-1 BON]
gi|302777784|gb|ADL68343.1| phage major capsid protein, HK97 family [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
gi|332701003|gb|AEE97944.1| phage major capsid protein, HK97 family [Mahella australiensis
50-1 BON]
Length=400
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/286 (28%), Positives = 137/286 (48%), Gaps = 14/286 (4%)
Query 187 DTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQE 245
DT+GG+L+P + L+ N RQ+A V+ T+S + V + A W E +
Sbjct 121 DTEGGYLVPDDFERT-LVEALEEENIFRQIANVITTSSGDKKIPVVASKGTASWVDEEGQ 179
Query 246 VSDDSPALAQPAVPNYRGSCWIPFSIELEGDAASFVGE-IGKILADSVEQLQAAAFVNGS 304
+ + + AQ ++ Y+ + I S EL D+ + + I K A + + AF G
Sbjct 180 IPESDDSFAQVSIGAYKLATMIKVSEELLNDSVFNLEQYIAKEFARRIGAKEEEAFFIGD 239
Query 305 GNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTL 364
G+G+PTG + A G + V A + AI ++ L +L ++ +A F N STI +
Sbjct 240 GSGKPTGIL-ADNGGGEIGVTAASATAITLDEIMDLFYSLKSPYRRNAVFIMNDSTIKAI 298
Query 365 RQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIG 422
R+ + +NG + PS+ +P + + V + M A+ A +V GD+ + +
Sbjct 299 RKLKDNNGQYLWQPSVTAGTPDTILNRPVKTSAFM----PAIAAGAKTIVFGDFSYYWVA 354
Query 423 DRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLK 468
DR G + + + L+ TGQ GF A RV +++ A ++L+
Sbjct 355 DRQGRVFKRLNELYA----ATGQVGFMATQRVDGKLVLSEAVKILQ 396
>gi|331085733|ref|ZP_08334816.1| HK97 family phage major capsid protein [Lachnospiraceae bacterium
9_1_43BFAA]
gi|330406656|gb|EGG86161.1| HK97 family phage major capsid protein [Lachnospiraceae bacterium
9_1_43BFAA]
Length=400
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 78/290 (27%), Positives = 127/290 (44%), Gaps = 15/290 (5%)
Query 187 DTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQE 245
D++GG+L+P + L+ N R +A V+QT+S + + + EA+W E
Sbjct 121 DSEGGYLVPDEYERT-LVEALEEENFFRSLATVIQTSSGDRKIPIVASKGEAKWIDEEAA 179
Query 246 VSDDSPALAQPAVPNYRGSCWIPFSIELEGDAA-SFVGEIGKILADSVEQLQAAAFVNGS 304
+ + Q ++ Y+ + I S EL D + I K + + AF G
Sbjct 180 YPESDDSFGQISISAYKVATMIKVSDELLNDNVFNLEAYISKEFGRRIGTKEEEAFFTGD 239
Query 305 GNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTL 364
G G+PTG +A G SD V S I DV L +L ++ A + N ST+ L
Sbjct 240 GKGKPTGIFNATGGASDGVTTAGAS--ITFDDVMDLFYSLRSPYRKKAVWMLNDSTVKAL 297
Query 365 RQAETSNGALKF-PSLHDS-PPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIG 422
R+ + NG + PS+ P M+ + S + + A + GD+ + I
Sbjct 298 RKLKDGNGNYIWQPSVQAGVPDMILNRPYFTSSFV----PEIAAGQKIMAFGDFSYYWIA 353
Query 423 DRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETT 472
DR G + + LF TGQ GF A RV +++ A + +K++ T
Sbjct 354 DRQGRSFKRLNELFA----ATGQVGFLASQRVDGKLILPEAVKTMKLKET 399
>gi|167039899|ref|YP_001662884.1| HK97 family phage major capsid protein [Thermoanaerobacter sp.
X514]
gi|300915364|ref|ZP_07132678.1| phage major capsid protein, HK97 family [Thermoanaerobacter sp.
X561]
gi|307724777|ref|YP_003904528.1| phage major capsid protein, HK97 family [Thermoanaerobacter sp.
X513]
gi|166854139|gb|ABY92548.1| phage major capsid protein, HK97 family [Thermoanaerobacter sp.
X514]
gi|300888640|gb|EFK83788.1| phage major capsid protein, HK97 family [Thermoanaerobacter sp.
X561]
gi|307581838|gb|ADN55237.1| phage major capsid protein, HK97 family [Thermoanaerobacter sp.
X513]
Length=399
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 78/288 (28%), Positives = 138/288 (48%), Gaps = 18/288 (6%)
Query 187 DTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR--GVTSEGAEARWYSEAQ 244
D++GG+L+P + ++ + + N R++A+++QT+S + V ++G A W E +
Sbjct 121 DSEGGYLVPDEFERTLVQTLE-EENVFRKLAKIIQTSSGDRKIPVVVTKGTAA-WLDEGE 178
Query 245 EVSDDSPALAQPAVPNYRGSCWIPFSIELEGDAA-SFVGEIGKILADSVEQLQAAAFVNG 303
E + Q ++ Y+ I S EL D+ I A + + AF+ G
Sbjct 179 EFDESDSVFGQTSIGAYKLGTMIKVSDELLNDSVFDLENYISTEFARRIGAKEEEAFLVG 238
Query 304 SGNGEPTGFVSALTGTSDQVVVGAGS-EAIVAADVYALQSALPPRFQASAAFAANLSTIN 362
G+G+PTG +A G Q+ V AGS AI A ++ L +L ++ +A F N +T+
Sbjct 239 DGDGKPTGIFNATGGA--QLGVTAGSATAITADEIIDLVYSLKAPYRKNAVFLMNDATVK 296
Query 363 TLRQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFL 420
+R+ + G + PSL +P L + V ++ T+++ + GD+ +
Sbjct 297 AIRKLKDGQGQYLWQPSLTAGTPDTLLNRPVYTSAYAPTIEAGA----KTIAFGDFGYYW 352
Query 421 IGDRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLK 468
I DR G + + LF TGQ GF A RV +++ A +VL+
Sbjct 353 IADRQGRSFKRLNELFA----TTGQVGFLASQRVDGKLILPEAIKVLQ 396
>gi|336435240|ref|ZP_08614957.1| HK97 family phage major capsid protein [Lachnospiraceae bacterium
1_4_56FAA]
gi|336001631|gb|EGN31767.1| HK97 family phage major capsid protein [Lachnospiraceae bacterium
1_4_56FAA]
Length=400
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 78/290 (27%), Positives = 127/290 (44%), Gaps = 15/290 (5%)
Query 187 DTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQE 245
D++GG+L+P + L+ N R +A V+QT+S + + + EA+W E
Sbjct 121 DSEGGYLVPDEYERT-LVEALEEENFFRSLATVIQTSSGDRKIPIVASKGEAKWIDEEAA 179
Query 246 VSDDSPALAQPAVPNYRGSCWIPFSIELEGDAA-SFVGEIGKILADSVEQLQAAAFVNGS 304
+ + Q ++ Y+ + I S EL D + I K + + AF G
Sbjct 180 YPESDDSFGQISISAYKVATMIKVSDELLNDNVFNLEAYISKEFGRRIGTKEEEAFFTGD 239
Query 305 GNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTL 364
G G+PTG +A G SD V S I DV L +L ++ A + N ST+ L
Sbjct 240 GKGKPTGIFNATGGASDGVTTAGAS--ITFDDVMDLFYSLRSPYRKKAVWMLNDSTVKAL 297
Query 365 RQAETSNGALKF-PSLHDS-PPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIG 422
R+ + NG + PS+ P M+ + S + + A + GD+ + I
Sbjct 298 RKLKDGNGNYIWQPSVQAGVPDMILNRPYFTSSFV----PEIAAGQKIMAFGDFSYYWIA 353
Query 423 DRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETT 472
DR G + + LF TGQ GF A RV +++ A + +K++ T
Sbjct 354 DRQGRSFKRLNELFA----ATGQVGFLASQRVDGKLILPEAVKTMKLKET 399
>gi|153955258|ref|YP_001396023.1| Phage major capsid protein [Clostridium kluyveri DSM 555]
gi|219855683|ref|YP_002472805.1| hypothetical protein CKR_2340 [Clostridium kluyveri NBRC 12016]
gi|146348116|gb|EDK34652.1| Phage major capsid protein [Clostridium kluyveri DSM 555]
gi|219569407|dbj|BAH07391.1| hypothetical protein [Clostridium kluyveri NBRC 12016]
Length=401
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 75/286 (27%), Positives = 135/286 (48%), Gaps = 14/286 (4%)
Query 187 DTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQE 245
D++GG+L+P + L+ N R +A V+ T+S + V + A W E
Sbjct 123 DSEGGYLVPDEFERT-LVEALEEENIFRSLANVINTSSGDRKIPVVATKGTASWVDEEGT 181
Query 246 VSDDSPALAQPAVPNYRGSCWIPFSIELEGDAA-SFVGEIGKILADSVEQLQAAAFVNGS 304
+ D + Q ++ Y+ + I S EL D+ + I K A + + AF G
Sbjct 182 IPDSDDSFGQVSIGAYKLATMIKVSEELLNDSVFNLEAYISKEFARRIGNKEEEAFFTGD 241
Query 305 GNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTL 364
G+G+PTG +++ TG + V AG+ AI +V L +L ++ A F N +T+ +
Sbjct 242 GSGKPTGILAS-TGGAQIGVTTAGATAITMDEVLDLFYSLKAPYRNKAVFVMNDATVKAI 300
Query 365 RQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIG 422
R+ + G + PSL +P + + + ++M T+ +A + + GD+ + +
Sbjct 301 RKLKDGQGQYLWQPSLQAGTPDTILNRPLYTSAYMPTIAAAAKS----IAFGDFSYYWVA 356
Query 423 DRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLK 468
DR G + + + L+ TGQ GF A RV +++ A +VL+
Sbjct 357 DRQGRVFKRLNELYA----VTGQVGFVATQRVDGKLILPEAIKVLQ 398
>gi|302386148|ref|YP_003821970.1| phage major capsid protein, HK97 family [Clostridium saccharolyticum
WM1]
gi|302196776|gb|ADL04347.1| phage major capsid protein, HK97 family [Clostridium saccharolyticum
WM1]
Length=402
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 74/285 (26%), Positives = 132/285 (47%), Gaps = 15/285 (5%)
Query 187 DTQGGFLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWR-GVTSEGAEARWYSEAQE 245
D++GG+L+P + L+ G N R +A ++QT+S + V + EA W E +
Sbjct 121 DSEGGYLVPDEFEQT-LVQGLEEENVFRTLATIIQTSSGDRKIPVVATKGEASWVDEEGQ 179
Query 246 VSDDSPALAQPAVPNYRGSCWIPFSIELEGDAA-SFVGEIGKILADSVEQLQAAAFVNGS 304
+ + + Q ++ Y+ + I S EL D+ + I + + + AF+ G
Sbjct 180 IPESDDSFGQVSIAAYKVATMIKVSDELLNDSVFNMEAYISNEFSRRIGAKEEEAFLVGD 239
Query 305 GNGEPTGFVSALTGTSDQVVVGAGSEAIVAADVYALQSALPPRFQASAAFAANLSTINTL 364
G G+PTG +++ G S+ V S I DV L ++ ++ + F N ST+ L
Sbjct 240 GKGKPTGIFNSVGGASEGVTTATVS--ITFDDVMDLFYSVKSPYRKKSTFVMNDSTVKAL 297
Query 365 RQAETSNGALKF-PSLH-DSPPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIG 422
R+ + +NG + PS+ P + + V+ ++ A+T + GD+K + I
Sbjct 298 RKLKDNNGTYIWQPSVQAGQPDTVLNRPVVTSAYA----PAITTGGKVIAFGDFKYYWIA 353
Query 423 DRVGSMVELVPHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVL 467
DR G + + LF TGQ GF +V +++ A +VL
Sbjct 354 DRQGRSFKRLNELFA----ATGQVGFLGSQKVDGKLILPEAVKVL 394
Lambda      K        H
   0.316    0.130    0.383
Gapped
Lambda      K        H
   0.267   0.0410    0.140
Effective search space used: 1003872181620
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
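The footer values above are enough to relate the raw scores, bit scores, and Expect values reported in the alignment headers. The sketch below uses the standard Karlin-Altschul relations, bits = (lambda * S - ln K) / ln 2 and E = N * 2^(-bits), with the gapped Lambda and K and the effective search space copied from this report. The function names are illustrative only and are not part of BLAST. It also assumes that the footer parameters apply unchanged to each hit; compositional matrix adjustment can rescale scores per subject, so small deviations from the printed values are possible.

```python
import math

# Gapped statistics and search space, copied from the report footer.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_SEARCH_SPACE = 1003872181620


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores are the parenthesized numbers in the "Score =" lines.
    for raw in (231, 229, 224):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E={expect_value(bits):.1e}")
```

For the raw score of 231 reported for the Clostridium thermocellum ATCC 27405 hit, this should come out close to the printed 93.6 bits and an Expect value on the order of 7e-17.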