BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3361c
Length=183
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610497|ref|NP_217878.1| hypothetical protein Rv3361c [Mycob... 362 1e-98
gi|71042648|pdb|2BM4|A Chain A, The Structure Of Mfpa (Rv3361c, ... 362 1e-98
gi|118618271|ref|YP_906603.1| hypothetical protein MUL_2846 [Myc... 299 1e-79
gi|183981191|ref|YP_001849482.1| hypothetical protein MMAR_1169 ... 297 5e-79
gi|342861574|ref|ZP_08718221.1| hypothetical protein MCOL_21916 ... 293 1e-77
gi|240169067|ref|ZP_04747726.1| hypothetical protein MkanA1_0712... 289 2e-76
gi|254822557|ref|ZP_05227558.1| hypothetical protein MintA_21689... 282 2e-74
gi|296168828|ref|ZP_06850504.1| pentapeptide repeat family prote... 280 9e-74
gi|254776755|ref|ZP_05218271.1| hypothetical protein MaviaA2_191... 273 7e-72
gi|118463434|ref|YP_883463.1| hypothetical protein MAV_4327 [Myc... 269 1e-70
gi|118470978|ref|YP_886018.1| hypothetical protein MSMEG_1641 [M... 251 3e-65
gi|120402548|ref|YP_952377.1| pentapeptide repeat-containing pro... 239 1e-61
gi|108798164|ref|YP_638361.1| pentapeptide repeat-containing pro... 239 2e-61
gi|169630778|ref|YP_001704427.1| hypothetical protein MAB_3699c ... 228 4e-58
gi|145225461|ref|YP_001136139.1| pentapeptide repeat-containing ... 224 4e-57
gi|312140786|ref|YP_004008122.1| hypothetical protein REQ_34480 ... 216 9e-55
gi|312140974|ref|YP_004008310.1| hypothetical protein REQ_36430 ... 214 5e-54
gi|325675389|ref|ZP_08155073.1| pentapeptide repeat family prote... 214 5e-54
gi|226365701|ref|YP_002783484.1| hypothetical protein ROP_62920 ... 212 2e-53
gi|229489413|ref|ZP_04383276.1| pentapeptide repeat protein [Rho... 211 4e-53
gi|226305482|ref|YP_002765442.1| hypothetical protein RER_19950 ... 211 4e-53
gi|111023196|ref|YP_706168.1| hypothetical protein RHA1_ro06233 ... 210 6e-53
gi|41409566|ref|NP_962402.1| hypothetical protein MAP3468c [Myco... 181 5e-44
gi|134103085|ref|YP_001108746.1| pentapeptide repeat-containing ... 179 1e-43
gi|291003972|ref|ZP_06561945.1| pentapeptide repeat-containing p... 178 2e-43
gi|258651510|ref|YP_003200666.1| pentapeptide repeat-containing ... 172 3e-41
gi|331699063|ref|YP_004335302.1| pentapeptide repeat-containing ... 166 1e-39
gi|336459778|gb|EGO38693.1| hypothetical protein MAPs_00210 [Myc... 156 2e-36
gi|256380480|ref|YP_003104140.1| pentapeptide repeat-containing ... 153 1e-35
gi|302524130|ref|ZP_07276472.1| pentapeptide repeat-containing p... 149 2e-34
gi|300782748|ref|YP_003763039.1| pentapeptide repeat-containing ... 147 6e-34
gi|284992789|ref|YP_003411343.1| pentapeptide repeat-containing ... 132 1e-29
gi|229818699|ref|YP_002880225.1| pentapeptide repeat-containing ... 122 2e-26
gi|333371278|ref|ZP_08463236.1| pentapeptide repeat domain prote... 122 3e-26
gi|302865634|ref|YP_003834271.1| pentapeptide repeat-containing ... 115 3e-24
gi|229084855|ref|ZP_04217111.1| Pentapeptide repeat protein [Bac... 115 3e-24
gi|330466022|ref|YP_004403765.1| pentapeptide repeat-containing ... 112 2e-23
gi|228990899|ref|ZP_04150863.1| Pentapeptide repeat protein [Bac... 112 2e-23
gi|145593641|ref|YP_001157938.1| pentapeptide repeat-containing ... 112 2e-23
gi|308071304|ref|YP_003872909.1| hypothetical protein PPE_04612 ... 112 3e-23
gi|42781015|ref|NP_978262.1| hypothetical protein BCE_1946 [Baci... 110 1e-22
gi|229166768|ref|ZP_04294518.1| Pentapeptide repeat protein [Bac... 110 1e-22
gi|229029598|ref|ZP_04185677.1| Pentapeptide repeat protein [Bac... 109 2e-22
gi|118477320|ref|YP_894471.1| hypothetical protein BALH_1637 [Ba... 108 2e-22
gi|229160860|ref|ZP_04288850.1| Pentapeptide repeat protein [Bac... 108 3e-22
gi|238063106|ref|ZP_04607815.1| pentapeptide repeat-containing p... 108 3e-22
gi|163939702|ref|YP_001644586.1| pentapeptide repeat-containing ... 108 4e-22
gi|301053431|ref|YP_003791642.1| hypothetical protein BACI_c1847... 108 4e-22
gi|300783407|ref|YP_003763698.1| pentapeptide repeat-containing ... 108 5e-22
gi|310644538|ref|YP_003949297.1| pentapeptide repeat protein [Pa... 108 5e-22
>gi|15610497|ref|NP_217878.1| hypothetical protein Rv3361c [Mycobacterium tuberculosis H37Rv]
gi|15842957|ref|NP_337994.1| pentapeptide repeat-containing protein [Mycobacterium tuberculosis
CDC1551]
gi|31794544|ref|NP_857037.1| hypothetical protein Mb3396c [Mycobacterium bovis AF2122/97]
82 more sequence titles
Length=183
Score = 362 bits (930), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 182/183 (99%), Positives = 183/183 (100%), Gaps = 0/183 (0%)
Query 1 LQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW 60
+QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW
Sbjct 1 MQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW 60
Query 61 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR 120
HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR
Sbjct 61 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR 120
Query 121 KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL
Sbjct 121 KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
Query 181 AGG 183
AGG
Sbjct 181 AGG 183
>gi|71042648|pdb|2BM4|A Chain A, The Structure Of Mfpa (Rv3361c, C2 Crystal Form). The
Pentapeptide Repeat Protein From Mycobacterium Tuberculosis
Folds As A Right-Handed Quadrilateral Beta- Helix.
gi|71042649|pdb|2BM4|B Chain B, The Structure Of Mfpa (Rv3361c, C2 Crystal Form). The
Pentapeptide Repeat Protein From Mycobacterium Tuberculosis
Folds As A Right-Handed Quadrilateral Beta- Helix.
gi|71042650|pdb|2BM5|A Chain A, The Structure Of Mfpa (Rv3361c, P21 Crystal Form). The
Pentapeptide Repeat Protein From Mycobacterium Tuberculosis
Folds As A Right-Handed Quadrilateral Beta- Helix.
gi|71042651|pdb|2BM5|B Chain B, The Structure Of Mfpa (Rv3361c, P21 Crystal Form). The
Pentapeptide Repeat Protein From Mycobacterium Tuberculosis
Folds As A Right-Handed Quadrilateral Beta- Helix.
gi|71042652|pdb|2BM6|A Chain A, The Structure Of Mfpa (Rv3361c, C2221 Crystal Form).
The Pentapeptide Repeat Protein From Mycobacterium Tuberculosis
Folds As A Right-Handed Quadrilateral Beta- Helix.
gi|71042653|pdb|2BM7|A Chain A, The Structure Of Mfpa (Rv3361c, P3221 Crystal Form).
The Pentapeptide Repeat Protein From Mycobacterium Tuberculosis
Folds As A Right-Handed Quadrilateral Beta- Helix.
gi|71042654|pdb|2BM7|B Chain B, The Structure Of Mfpa (Rv3361c, P3221 Crystal Form).
The Pentapeptide Repeat Protein From Mycobacterium Tuberculosis
Folds As A Right-Handed Quadrilateral Beta- Helix.
gi|71042655|pdb|2BM7|C Chain C, The Structure Of Mfpa (Rv3361c, P3221 Crystal Form).
The Pentapeptide Repeat Protein From Mycobacterium Tuberculosis
Folds As A Right-Handed Quadrilateral Beta- Helix.
Length=186
Score = 362 bits (929), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 182/183 (99%), Positives = 183/183 (100%), Gaps = 0/183 (0%)
Query 1 LQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW 60
+QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW
Sbjct 4 MQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW 63
Query 61 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR 120
HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR
Sbjct 64 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR 123
Query 121 KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL
Sbjct 124 KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 183
Query 181 AGG 183
AGG
Sbjct 184 AGG 186
>gi|118618271|ref|YP_906603.1| hypothetical protein MUL_2846 [Mycobacterium ulcerans Agy99]
gi|118570381|gb|ABL05132.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=182
Score = 299 bits (766), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 144/180 (80%), Positives = 162/180 (90%), Gaps = 0/180 (0%)
Query 1 LQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW 60
++ WVDCEFTGRDFRDEDLSRL TER +FSEC+F GVNL ES+HRGSAFRNC+FERTTLW
Sbjct 1 MEHWVDCEFTGRDFRDEDLSRLRTERVVFSECNFGGVNLTESEHRGSAFRNCSFERTTLW 60
Query 61 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR 120
HSTFAQCSMLGSVFV+CR+RPL LD+VDFTLAVLGGNDLRG++L+GCRLRE SLV+TDLR
Sbjct 61 HSTFAQCSMLGSVFVSCRMRPLVLDEVDFTLAVLGGNDLRGVDLSGCRLREASLVETDLR 120
Query 121 KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
K VLRGADL GART G +LDDADLRGA DP LWR+ASL GAR+DV QA++FA AHGL L
Sbjct 121 KSVLRGADLRGARTNGTKLDDADLRGANPDPSLWRSASLAGARIDVPQALSFALAHGLRL 180
>gi|183981191|ref|YP_001849482.1| hypothetical protein MMAR_1169 [Mycobacterium marinum M]
gi|183174517|gb|ACC39627.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=182
Score = 297 bits (760), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 143/180 (80%), Positives = 162/180 (90%), Gaps = 0/180 (0%)
Query 1 LQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW 60
++ WVDCEFT RDFRDEDLSRL TER +FSEC+F GVNL ES+HRGSAFRNC+FERTTLW
Sbjct 1 MEHWVDCEFTDRDFRDEDLSRLRTERVVFSECNFGGVNLTESEHRGSAFRNCSFERTTLW 60
Query 61 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR 120
HSTFAQCSMLGSVFV+CR+RPL LD+VDFTLAVLGGNDLRG++L+GCRLRE SLV+TDLR
Sbjct 61 HSTFAQCSMLGSVFVSCRMRPLVLDEVDFTLAVLGGNDLRGVDLSGCRLREASLVETDLR 120
Query 121 KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
K VLRGADL GART G +LDDADLRGA +DP LWR+ASL GAR+DV QA++FA AHGL L
Sbjct 121 KSVLRGADLRGARTNGTKLDDADLRGANLDPSLWRSASLAGARIDVPQALSFALAHGLRL 180
>gi|342861574|ref|ZP_08718221.1| hypothetical protein MCOL_21916 [Mycobacterium colombiense CECT
3035]
gi|342131063|gb|EGT84352.1| hypothetical protein MCOL_21916 [Mycobacterium colombiense CECT
3035]
Length=187
Score = 293 bits (749), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 146/181 (81%), Positives = 159/181 (88%), Gaps = 0/181 (0%)
Query 3 QWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWHS 62
+WVD EF G DF D+DLSRL TER +F+EC+F G NLAESQHRGSAFRNCTF RT+LWHS
Sbjct 4 RWVDQEFEGHDFTDDDLSRLQTERVVFTECNFGGANLAESQHRGSAFRNCTFRRTSLWHS 63
Query 63 TFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRKC 122
TFAQCSMLGSVFV CRLRP+T D+VDFTLAVLGG DLRG++L+GCRLRETSLV+ DLRK
Sbjct 64 TFAQCSMLGSVFVQCRLRPITFDEVDFTLAVLGGIDLRGVDLSGCRLRETSLVEADLRKA 123
Query 123 VLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLAG 182
VLRGADL GARTTG RLDDADLRG+TVDP LW TASL GARVDVDQAVAFA AHGL L G
Sbjct 124 VLRGADLRGARTTGTRLDDADLRGSTVDPTLWTTASLAGARVDVDQAVAFALAHGLRLDG 183
Query 183 G 183
G
Sbjct 184 G 184
>gi|240169067|ref|ZP_04747726.1| hypothetical protein MkanA1_07124 [Mycobacterium kansasii ATCC
12478]
Length=178
Score = 289 bits (739), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 147/174 (85%), Positives = 155/174 (90%), Gaps = 0/174 (0%)
Query 5 VDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWHSTF 64
+DCEF RDFRDEDLSRL TER +FS CDFSGVNLAESQHRGSAFRNCTFERT LWHSTF
Sbjct 1 MDCEFDCRDFRDEDLSRLCTERVVFSGCDFSGVNLAESQHRGSAFRNCTFERTALWHSTF 60
Query 65 AQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRKCVL 124
QCS+LGSVFV CRLRPLT DDVDFTLAVL GNDLRG +L+GCRLRETSLV+ DLRK VL
Sbjct 61 QQCSLLGSVFVGCRLRPLTFDDVDFTLAVLAGNDLRGADLSGCRLRETSLVEADLRKAVL 120
Query 125 RGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGL 178
RGADLSGARTTGARLD ADLRGATVDP LW TASL GAR+DV QA+AFA AHGL
Sbjct 121 RGADLSGARTTGARLDGADLRGATVDPSLWTTASLTGARIDVPQALAFALAHGL 174
>gi|254822557|ref|ZP_05227558.1| hypothetical protein MintA_21689 [Mycobacterium intracellulare
ATCC 13950]
Length=189
Score = 282 bits (721), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 140/182 (77%), Positives = 155/182 (86%), Gaps = 0/182 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++WVD EF G DF DEDLSRL TER +F+EC+FSG NLAESQHRGSAFRNC+F+RT+LWH
Sbjct 3 EEWVDREFDGHDFTDEDLSRLRTERTVFTECNFSGANLAESQHRGSAFRNCSFQRTSLWH 62
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S+FAQCSMLGSVFV CRLRP+T D+VDFTLAVL G DLRG++ +GCRLRE SLV+ DLRK
Sbjct 63 SSFAQCSMLGSVFVQCRLRPITFDEVDFTLAVLAGIDLRGVDFSGCRLREASLVEADLRK 122
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLA 181
VLRGADL GART GARLD ADLRG T DP LW TASL GARVDVDQ VAFA AHGL L
Sbjct 123 AVLRGADLRGARTAGARLDGADLRGTTADPGLWTTASLAGARVDVDQVVAFALAHGLRLD 182
Query 182 GG 183
GG
Sbjct 183 GG 184
>gi|296168828|ref|ZP_06850504.1| pentapeptide repeat family protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295896503|gb|EFG76151.1| pentapeptide repeat family protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=183
Score = 280 bits (715), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 139/181 (77%), Positives = 153/181 (85%), Gaps = 0/181 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++W D F G DF DEDL+RL TERA+F CDF G NLAES+HRGSAFRNCTF RT+LWH
Sbjct 3 ERWADRHFEGHDFTDEDLTRLDTERAVFDGCDFGGANLAESRHRGSAFRNCTFRRTSLWH 62
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F QCSMLGSVFV CR+RP++ D+VDFTLAVLGGNDLRG++L+GCRLRE SLV+ DLRK
Sbjct 63 SAFEQCSMLGSVFVQCRMRPISFDEVDFTLAVLGGNDLRGVDLSGCRLREASLVEADLRK 122
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLA 181
VLRGADLSGAR TGARLDDADLRGA VDP LWRTASL GARVDV QAVAFA A GL L
Sbjct 123 AVLRGADLSGARATGARLDDADLRGAVVDPSLWRTASLAGARVDVGQAVAFAVAQGLRLD 182
Query 182 G 182
G
Sbjct 183 G 183
>gi|254776755|ref|ZP_05218271.1| hypothetical protein MaviaA2_19111 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=182
Score = 273 bits (698), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 137/182 (76%), Positives = 150/182 (83%), Gaps = 0/182 (0%)
Query 1 LQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW 60
+ WVD EF DF DEDL L TER +F+EC+FSG NLAES+HR SAFRNCTF RT+LW
Sbjct 1 MTAWVDREFERHDFTDEDLVGLSTERVVFTECNFSGANLAESRHRASAFRNCTFRRTSLW 60
Query 61 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR 120
HSTF QC+MLGSVF CRLRP+T D+VDFTLAVLGGNDLRG++L+GCRLRETSLV+ DLR
Sbjct 61 HSTFEQCTMLGSVFEQCRLRPVTFDEVDFTLAVLGGNDLRGVDLSGCRLRETSLVEADLR 120
Query 121 KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
K VLRGADL GART G RLDDADLRG DP LW TASL GARVDVDQAVAFA AHGL L
Sbjct 121 KAVLRGADLRGARTAGTRLDDADLRGGAADPALWTTASLAGARVDVDQAVAFALAHGLRL 180
Query 181 AG 182
G
Sbjct 181 DG 182
>gi|118463434|ref|YP_883463.1| hypothetical protein MAV_4327 [Mycobacterium avium 104]
gi|118164721|gb|ABK65618.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=185
Score = 269 bits (687), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 137/183 (75%), Positives = 150/183 (82%), Gaps = 0/183 (0%)
Query 1 LQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW 60
+ WVD EF DF DEDL L TER +F+EC+FSG NLAES+HR SAFRNCTF RT+LW
Sbjct 1 MTAWVDREFERHDFTDEDLVGLSTERVVFTECNFSGANLAESRHRASAFRNCTFRRTSLW 60
Query 61 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR 120
HSTF QC+MLGSVF CRLRP+T D+VDFTLAVLGGNDLRG++L+GCRLR TSLV+ DLR
Sbjct 61 HSTFEQCTMLGSVFEQCRLRPVTFDEVDFTLAVLGGNDLRGVDLSGCRLRVTSLVEADLR 120
Query 121 KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
K VLRGADL GARTTG RLDDADL G DP LW TASL GARVDVDQAVAFA AHGL L
Sbjct 121 KAVLRGADLRGARTTGTRLDDADLCGGAADPALWTTASLAGARVDVDQAVAFALAHGLRL 180
Query 181 AGG 183
GG
Sbjct 181 DGG 183
>gi|118470978|ref|YP_886018.1| hypothetical protein MSMEG_1641 [Mycobacterium smegmatis str.
MC2 155]
gi|118172265|gb|ABK73161.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=191
Score = 251 bits (641), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 119/180 (67%), Positives = 146/180 (82%), Gaps = 0/180 (0%)
Query 4 WVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWHST 63
W D EF GRDFRDEDLSR+ TER +F+ECDFSGV+L+ES+H GSAFRNCTF R+T+WHST
Sbjct 12 WADEEFAGRDFRDEDLSRIRTERVVFTECDFSGVDLSESEHHGSAFRNCTFRRSTIWHST 71
Query 64 FAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRKCV 123
F CS+LGSVF CR+RP+T + DFTLAVLGG DLR ++L+ CRLRE SLV DLRK V
Sbjct 72 FTNCSLLGSVFTECRIRPVTFVECDFTLAVLGGCDLRAVDLSDCRLREVSLVGADLRKAV 131
Query 124 LRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLAGG 183
LR ADL+G+R ARL++ADLRG VDP W TA + GA++D++QA+A+AAAHGL + GG
Sbjct 132 LRRADLTGSRVQDARLEEADLRGTRVDPTFWTTAKVRGAKIDIEQALAYAAAHGLAVHGG 191
>gi|120402548|ref|YP_952377.1| pentapeptide repeat-containing protein [Mycobacterium vanbaalenii
PYR-1]
gi|119955366|gb|ABM12371.1| pentapeptide repeat protein [Mycobacterium vanbaalenii PYR-1]
Length=183
Score = 239 bits (611), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 119/177 (68%), Positives = 140/177 (80%), Gaps = 0/177 (0%)
Query 6 DCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWHSTFA 65
D E+ G DFRD+DLSR+ TER +F+ECDF+G + ++S+H GSAFRNC F RT+LWHSTF
Sbjct 7 DREYGGHDFRDQDLSRVRTERVVFTECDFTGADFSDSEHTGSAFRNCIFRRTSLWHSTFR 66
Query 66 QCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRKCVLR 125
CS LGS F CRLRPLTL +VDFTLAVL G DLR +L+ CRLRETSLV TDLR+ VL
Sbjct 67 HCSFLGSTFTECRLRPLTLVEVDFTLAVLAGVDLRKTDLSDCRLRETSLVGTDLREAVLT 126
Query 126 GADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLAG 182
ADLSGAR ARL++ADLRGA VDP W TA L GA+VD+DQA+AFAAAHGL + G
Sbjct 127 RADLSGARVQDARLENADLRGARVDPTFWTTAKLRGAKVDIDQAIAFAAAHGLDIGG 183
>gi|108798164|ref|YP_638361.1| pentapeptide repeat-containing protein [Mycobacterium sp. MCS]
gi|119867260|ref|YP_937212.1| pentapeptide repeat-containing protein [Mycobacterium sp. KMS]
gi|126433823|ref|YP_001069514.1| pentapeptide repeat-containing protein [Mycobacterium sp. JLS]
gi|108768583|gb|ABG07305.1| pentapeptide repeat [Mycobacterium sp. MCS]
gi|119693349|gb|ABL90422.1| pentapeptide repeat protein [Mycobacterium sp. KMS]
gi|126233623|gb|ABN97023.1| pentapeptide repeat protein [Mycobacterium sp. JLS]
Length=183
Score = 239 bits (609), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 118/183 (65%), Positives = 142/183 (78%), Gaps = 0/183 (0%)
Query 1 LQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW 60
+ +W D EF GRDFRDEDLSRL TER +F CDFSGV+++ES+H GSAFRNC F R +LW
Sbjct 1 MAEWNDEEFIGRDFRDEDLSRLRTERVVFDGCDFSGVDMSESEHVGSAFRNCVFRRASLW 60
Query 61 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR 120
HSTF CSMLGSVF CRLRPLTL +VD TLAVLGG DLR ++L+ CRLRE LV DLR
Sbjct 61 HSTFRNCSMLGSVFTECRLRPLTLVEVDLTLAVLGGCDLRKVDLSDCRLREAGLVGADLR 120
Query 121 KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
+ VL+ ADL GAR R + ADLRGA +D LW TA++ GAR+D++QA+A+AAAHGL +
Sbjct 121 EAVLQRADLRGARVQNTRFEGADLRGARIDATLWTTAAVRGARIDIEQALAYAAAHGLDV 180
Query 181 AGG 183
GG
Sbjct 181 HGG 183
>gi|169630778|ref|YP_001704427.1| hypothetical protein MAB_3699c [Mycobacterium abscessus ATCC
19977]
gi|169242745|emb|CAM63773.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=184
Score = 228 bits (580), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 109/182 (60%), Positives = 131/182 (72%), Gaps = 0/182 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
+ W D E T F DED LHTER +F+ECDFSG NL ES H GSAFRNCTF RT+LWH
Sbjct 3 EHWTDREITAETFYDEDFRELHTERVVFTECDFSGANLTESLHVGSAFRNCTFRRTSLWH 62
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F QCS+LGS F CR+RP + DFTL+ L G DLR ++L+ CR RE +LV D+RK
Sbjct 63 SEFRQCSLLGSTFTDCRVRPSKFTETDFTLSSLAGLDLREMDLSDCRFREANLVGADMRK 122
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLA 181
L GAD +GART +LD ADLRGA +DP LW TA+L+ A+VD+ QA+AFAAAHGL +
Sbjct 123 ANLHGADFTGARTQNLKLDGADLRGARIDPTLWTTAALITAKVDLPQAIAFAAAHGLDVH 182
Query 182 GG 183
GG
Sbjct 183 GG 184
>gi|145225461|ref|YP_001136139.1| pentapeptide repeat-containing protein [Mycobacterium gilvum
PYR-GCK]
gi|315445814|ref|YP_004078693.1| low-complexity protein [Mycobacterium sp. Spyr1]
gi|145217947|gb|ABP47351.1| pentapeptide repeat protein [Mycobacterium gilvum PYR-GCK]
gi|315264117|gb|ADU00859.1| uncharacterized low-complexity protein [Mycobacterium sp. Spyr1]
Length=186
Score = 224 bits (571), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 116/178 (66%), Positives = 134/178 (76%), Gaps = 0/178 (0%)
Query 5 VDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWHSTF 64
D EF G DF D DLS L TER +++ECDF+G +L++S H GSAFRNCTF RT+LWHSTF
Sbjct 9 ADREFDGHDFCDADLSGLRTERVVYTECDFTGADLSDSDHTGSAFRNCTFRRTSLWHSTF 68
Query 65 AQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRKCVL 124
CS LGS F CRLRPLTL +VDFTLAVL G DLR +L+ CRLRETSLV TDLR+ VL
Sbjct 69 RHCSFLGSTFTECRLRPLTLVEVDFTLAVLAGVDLRKTDLSDCRLRETSLVGTDLREAVL 128
Query 125 RGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLAG 182
ADL+GAR A+ D ADLRGA VDP W TA L A+VD+ QA+AFAAAHGL + G
Sbjct 129 AHADLTGARVQDAKFDGADLRGARVDPTFWTTARLRAAKVDLPQALAFAAAHGLDIGG 186
>gi|312140786|ref|YP_004008122.1| hypothetical protein REQ_34480 [Rhodococcus equi 103S]
gi|325675574|ref|ZP_08155258.1| pentapeptide repeat family protein [Rhodococcus equi ATCC 33707]
gi|311890125|emb|CBH49443.1| conserved hypothetical protein [Rhodococcus equi 103S]
gi|325553545|gb|EGD23223.1| pentapeptide repeat family protein [Rhodococcus equi ATCC 33707]
Length=204
Score = 216 bits (551), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 104/179 (59%), Positives = 130/179 (73%), Gaps = 0/179 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++WV +T FRD DLS L TE +F+ECDF+G +L +S H GSAFR+C F RTT+WH
Sbjct 24 EKWVRQHYTACSFRDADLSELDTEFVVFTECDFTGADLTDSHHHGSAFRSCMFARTTMWH 83
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S+F CSMLGS+FV C++RPL +++VDFTLA +GG DLRGL+ T CR RE +LV DLR
Sbjct 84 SSFRSCSMLGSIFVECQMRPLVVEEVDFTLASMGGADLRGLDFTDCRFREANLVQADLRG 143
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
VLR +LSGAR RL+ ADLRGA VDP LW A L RV++ QAVA+A AHGL +
Sbjct 144 AVLRSVNLSGARVEAIRLEGADLRGAHVDPGLWTAAKLDKTRVELTQAVAYAVAHGLTV 202
>gi|312140974|ref|YP_004008310.1| hypothetical protein REQ_36430 [Rhodococcus equi 103S]
gi|311890313|emb|CBH49631.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length=209
Score = 214 bits (545), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 109/181 (61%), Positives = 135/181 (75%), Gaps = 0/181 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++W + G FRD DLS L TE F++CDF+G +L SQH GSAFRNC FE T +W
Sbjct 28 ERWQHRSYVGCLFRDTDLSGLVTESVTFTDCDFTGADLTGSQHSGSAFRNCHFEYTRMWD 87
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S+F S LGSV R+RPLTL++VDFTLA LG DLRG++L+GCRLRE +LV DLRK
Sbjct 88 SSFRHSSFLGSVIRDSRIRPLTLEEVDFTLASLGEIDLRGVDLSGCRLREANLVKADLRK 147
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLA 181
+LRGADL+GART RLDDADL GA VDP LW +A L GAR++++QAV++AAAHGL +A
Sbjct 148 AILRGADLTGARTGDLRLDDADLEGARVDPSLWTSAKLGGARIEMNQAVSYAAAHGLRVA 207
Query 182 G 182
G
Sbjct 208 G 208
>gi|325675389|ref|ZP_08155073.1| pentapeptide repeat family protein [Rhodococcus equi ATCC 33707]
gi|325553360|gb|EGD23038.1| pentapeptide repeat family protein [Rhodococcus equi ATCC 33707]
Length=215
Score = 214 bits (545), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 109/181 (61%), Positives = 133/181 (74%), Gaps = 0/181 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
+ W + G FRD DLS L TE F++CDF+G +L SQH GSAFRNC FE T +W
Sbjct 34 ESWQHRSYVGCLFRDTDLSGLVTESVTFTDCDFTGADLTGSQHSGSAFRNCYFEYTRMWD 93
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S+F S LGSV R+RPLTL++VDFTLA LG DLRG++L+GCRLRE +LV DLRK
Sbjct 94 SSFRHSSFLGSVIRDSRIRPLTLEEVDFTLASLGEIDLRGVDLSGCRLREANLVKADLRK 153
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLA 181
+LRGADL+GART RLDDADL GA VDP LW +A L GAR++++QAV +AAAHGL +A
Sbjct 154 AILRGADLTGARTGDLRLDDADLEGARVDPSLWTSAKLGGARIEMNQAVTYAAAHGLRVA 213
Query 182 G 182
G
Sbjct 214 G 214
>gi|226365701|ref|YP_002783484.1| hypothetical protein ROP_62920 [Rhodococcus opacus B4]
gi|226244191|dbj|BAH54539.1| hypothetical protein [Rhodococcus opacus B4]
Length=201
Score = 212 bits (539), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/179 (60%), Positives = 130/179 (73%), Gaps = 0/179 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
Q W +T +FRD DLS L TE +F+ECDF+G +LAES H G+AFR+C+F RTTLWH
Sbjct 20 QNWQRRHYTKCNFRDADLSELRTESVIFTECDFTGADLAESHHVGTAFRSCSFTRTTLWH 79
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F CS LGS F CRLRP+ D+ DFTL LGG DLRGL+ T CR RE +LV TDLR+
Sbjct 80 SEFRNCSFLGSEFDNCRLRPMVFDECDFTLVSLGGADLRGLDFTDCRFREANLVRTDLRR 139
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
VLR ADL GART GA+LD ADLRGA VD LW ASL A++++ QA+A+A A+GL +
Sbjct 140 AVLRSADLFGARTGGAKLDGADLRGAHVDANLWTAASLDKAQIELTQAIAYATANGLVV 198
>gi|229489413|ref|ZP_04383276.1| pentapeptide repeat protein [Rhodococcus erythropolis SK121]
gi|229323510|gb|EEN89268.1| pentapeptide repeat protein [Rhodococcus erythropolis SK121]
Length=206
Score = 211 bits (537), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 102/179 (57%), Positives = 133/179 (75%), Gaps = 0/179 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
Q+W FT +FRD DL+ L TE +F++CDF+G +L ES H G+AFR+C F RT+LWH
Sbjct 25 QKWRQRTFTNCNFRDADLTGLTTESVVFTDCDFTGTDLGESVHTGTAFRSCNFARTSLWH 84
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
STF CS+LGS F CR+RPLTLD+VDF+L LGG DLR ++ T CR RE +LV D+R
Sbjct 85 STFRNCSLLGSTFDGCRIRPLTLDEVDFSLTSLGGADLRKIDFTSCRFREANLVRADMRG 144
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
VL ADLSGART G +L+ ADLRGA +DP LW +AS+ A++++ QAVA+A+AHGL +
Sbjct 145 AVLASADLSGARTGGLKLEGADLRGARIDPSLWTSASVGNAQIELMQAVAYASAHGLVV 203
>gi|226305482|ref|YP_002765442.1| hypothetical protein RER_19950 [Rhodococcus erythropolis PR4]
gi|226184599|dbj|BAH32703.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length=203
Score = 211 bits (537), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 102/179 (57%), Positives = 133/179 (75%), Gaps = 0/179 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
Q+W FT +FRD DL+ L TE +F++CDF+G +L ES H G+AFR+C F RT+LWH
Sbjct 22 QKWRQRTFTNCNFRDADLTGLTTESVVFTDCDFTGTDLGESVHTGTAFRSCNFARTSLWH 81
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
STF CS+LGS F CR+RPLTLD+VDF+L LGG DLR ++ T CR RE +LV D+R
Sbjct 82 STFRNCSLLGSTFDGCRIRPLTLDEVDFSLTSLGGADLRKIDFTSCRFREANLVRADMRG 141
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
VL ADLSGART G +L+ ADLRGA +DP LW +AS+ A++++ QAVA+A+AHGL +
Sbjct 142 AVLASADLSGARTGGLKLEGADLRGARIDPSLWTSASVGNAQIELMQAVAYASAHGLVV 200
>gi|111023196|ref|YP_706168.1| hypothetical protein RHA1_ro06233 [Rhodococcus jostii RHA1]
gi|110822726|gb|ABG98010.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=201
Score = 210 bits (535), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 106/179 (60%), Positives = 129/179 (73%), Gaps = 0/179 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
Q W +T +FRD DLS L TE +F+ECDF+G +LAES H G+AFR+C+F RTTLWH
Sbjct 20 QNWRRRHYTKCNFRDADLSELRTESVIFTECDFTGADLAESHHVGTAFRSCSFTRTTLWH 79
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F CS LGS F CRLRP+ D+ DFTLA LGG DLRGL+ T CR RE +LV TDLR+
Sbjct 80 SEFRNCSFLGSEFDNCRLRPMVFDECDFTLASLGGADLRGLDFTDCRFREANLVRTDLRR 139
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
VLR ADL GART GA+ D ADLRGA VD LW SL A++++ QA+A+A A+GL +
Sbjct 140 AVLRSADLFGARTGGAKFDGADLRGAHVDANLWTAVSLDKAQIELTQAIAYATANGLVV 198
>gi|41409566|ref|NP_962402.1| hypothetical protein MAP3468c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41398397|gb|AAS06018.1| hypothetical protein MAP_3468c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=137
Score = 181 bits (458), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 100/182 (55%), Positives = 109/182 (60%), Gaps = 45/182 (24%)
Query 1 LQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW 60
+ WVD EF DF DEDL L TER +F+EC+FSG NLAES+HR SAFRNCTF RT+LW
Sbjct 1 MTAWVDREFERHDFTDEDLVGLSTERVVFTECNFSGANLAESRHRASAFRNCTFRRTSLW 60
Query 61 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLR 120
HSTF QC+MLGSVF CRLRP+T D+VDFTLAVLGGNDLR
Sbjct 61 HSTFEQCTMLGSVFEQCRLRPVTFDEVDFTLAVLGGNDLR-------------------- 100
Query 121 KCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
G DP LW TASL GARVDVDQAVAFA A GL L
Sbjct 101 -------------------------GGAADPALWTTASLAGARVDVDQAVAFALARGLRL 135
Query 181 AG 182
G
Sbjct 136 DG 137
>gi|134103085|ref|YP_001108746.1| pentapeptide repeat-containing protein [Saccharopolyspora erythraea
NRRL 2338]
gi|133915708|emb|CAM05821.1| pentapeptide repeat family protein [Saccharopolyspora erythraea
NRRL 2338]
Length=196
Score = 179 bits (455), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 99/177 (56%), Positives = 119/177 (68%), Gaps = 0/177 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
Q+W F DF D DL L T +F+EC F+G +L ES HR +AFR+C FERT L H
Sbjct 17 QEWDGRSFERCDFTDADLRGLRTTSCVFTECTFTGTDLGESVHRATAFRSCRFERTVLQH 76
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
ST C+MLGS FV CR RPLT+ + D TL +G +DLRG NL+G R RE +L + DLR+
Sbjct 77 STVEGCTMLGSGFVDCRFRPLTVRETDMTLVGMGRSDLRGTNLSGIRFREANLGECDLRE 136
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGL 178
C LR ADLSGAR G RL++ADLRGA +D A L GARVD A+AFAAAHGL
Sbjct 137 CDLREADLSGARLLGTRLEEADLRGARIDADGLVQAVLRGARVDSMTALAFAAAHGL 193
>gi|291003972|ref|ZP_06561945.1| pentapeptide repeat-containing protein [Saccharopolyspora erythraea
NRRL 2338]
Length=229
Score = 178 bits (452), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 99/177 (56%), Positives = 119/177 (68%), Gaps = 0/177 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
Q+W F DF D DL L T +F+EC F+G +L ES HR +AFR+C FERT L H
Sbjct 50 QEWDGRSFERCDFTDADLRGLRTTSCVFTECTFTGTDLGESVHRATAFRSCRFERTVLQH 109
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
ST C+MLGS FV CR RPLT+ + D TL +G +DLRG NL+G R RE +L + DLR+
Sbjct 110 STVEGCTMLGSGFVDCRFRPLTVRETDMTLVGMGRSDLRGTNLSGIRFREANLGECDLRE 169
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGL 178
C LR ADLSGAR G RL++ADLRGA +D A L GARVD A+AFAAAHGL
Sbjct 170 CDLREADLSGARLLGTRLEEADLRGARIDADGLVQAVLRGARVDSMTALAFAAAHGL 226
>gi|258651510|ref|YP_003200666.1| pentapeptide repeat-containing protein [Nakamurella multipartita
DSM 44233]
gi|258554735|gb|ACV77677.1| pentapeptide repeat protein [Nakamurella multipartita DSM 44233]
Length=206
Score = 172 bits (435), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/179 (52%), Positives = 112/179 (63%), Gaps = 0/179 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
Q+ D FTG DF + L +HT R F+ C F G L +S HR ++F +CTFER L
Sbjct 23 QELADITFTGCDFTEAALVGVHTRRVTFTNCRFRGTELYDSTHRSTSFASCTFERAALHG 82
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
+ C + GS F CR RPLT+ D D TL L G +L G++L+G RLRE +LV DL
Sbjct 83 MSLHGCRLTGSSFTDCRTRPLTIRDCDLTLVSLAGANLAGVDLSGLRLREANLVRADLSG 142
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
C LRGADL+GAR L ADLRGA VD LW A L GARVD+DQAV FAAAHGL +
Sbjct 143 CDLRGADLTGARAERLNLTGADLRGARVDAGLWVAAVLTGARVDIDQAVLFAAAHGLAI 201
>gi|331699063|ref|YP_004335302.1| pentapeptide repeat-containing protein [Pseudonocardia dioxanivorans
CB1190]
gi|326953752|gb|AEA27449.1| pentapeptide repeat protein [Pseudonocardia dioxanivorans CB1190]
Length=205
Score = 166 bits (421), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/180 (52%), Positives = 117/180 (65%), Gaps = 5/180 (2%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
+++V C F+G D R L T+ F ECDF G ++ +S+HRGSA R C+F+ TL
Sbjct 31 RRFVRCSFSGSDLRG-----LRTDVCTFDECDFRGADMGDSEHRGSALRTCSFQGVTLLG 85
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
STF CS+LGS + R+RP++ D D TL LG DLRG +L+G RLRE +LV+TDLRK
Sbjct 86 STFRGCSLLGSTLLDARMRPISFVDCDLTLVSLGRADLRGTDLSGLRLREANLVETDLRK 145
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLA 181
L GADLSGAR GARL+ ADLRGA +D A L GA VD AV FA AHGL ++
Sbjct 146 ADLHGADLSGARLRGARLEGADLRGARIDADGLVQARLEGATVDAATAVRFAVAHGLIIS 205
>gi|336459778|gb|EGO38693.1| hypothetical protein MAPs_00210 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=114
Score = 156 bits (394), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 74/101 (74%), Positives = 83/101 (83%), Gaps = 0/101 (0%)
Query 1 LQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLW 60
+ WVD EF DF DEDL L TER +F+EC+FSG NLAES+HR SAFRNCTF RT+LW
Sbjct 1 MTAWVDREFERHDFTDEDLVGLSTERVVFTECNFSGANLAESRHRASAFRNCTFRRTSLW 60
Query 61 HSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRG 101
HSTF QC+MLGSVF CRLRP+T D+VDFTLAVLGGNDLRG
Sbjct 61 HSTFEQCTMLGSVFEQCRLRPVTFDEVDFTLAVLGGNDLRG 101
>gi|256380480|ref|YP_003104140.1| pentapeptide repeat-containing protein [Actinosynnema mirum DSM
43827]
gi|255924783|gb|ACU40294.1| pentapeptide repeat protein [Actinosynnema mirum DSM 43827]
Length=204
Score = 153 bits (386), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/177 (49%), Positives = 110/177 (63%), Gaps = 0/177 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
Q W F G DF + D+ L T F++C F+ +L +S+H SAFR+C F+R L
Sbjct 24 QVWERKSFVGCDFSEADMRGLVTRGCTFTDCKFTRTDLGDSRHTTSAFRSCRFDRAVLGG 83
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
+ F CS+LGS+FV C +RP+T+ + D TL L G L +L G R+RE +L +L
Sbjct 84 AEFTSCSLLGSMFVDCSMRPITITETDLTLVSLSGAVLPKASLAGLRMREANLESANLTG 143
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGL 178
LRGADLSGAR TG +L DADLRGA +D L GA+VD+D AVAFAAAHGL
Sbjct 144 ADLRGADLSGARLTGTKLVDADLRGARLDANGLVQGVLRGAKVDLDTAVAFAAAHGL 200
>gi|302524130|ref|ZP_07276472.1| pentapeptide repeat-containing protein [Streptomyces sp. AA4]
gi|302433025|gb|EFL04841.1| pentapeptide repeat-containing protein [Streptomyces sp. AA4]
Length=195
Score = 149 bits (376), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 86/179 (49%), Positives = 110/179 (62%), Gaps = 0/179 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
Q W +F G DF + DL L T F CDF+ V+LA S+H SAFR+CTF+R+ L
Sbjct 16 QWWEKRQFAGCDFTEADLRDLRTRGCTFDNCDFTKVDLAGSRHDASAFRSCTFDRSVLAD 75
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S ++ CS+LGS FV CR + L + D +LA LR L+L+G R+RE +L + DL
Sbjct 76 SRWSSCSLLGSSFVDCRYTGIALTECDLSLASFARGRLRKLDLSGLRMREVNLNEADLTD 135
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
LRG DL+GAR G + ADLRGA VD A L GA VDV+ AVAFAAA+GL +
Sbjct 136 SDLRGTDLAGARMIGTKFPGADLRGAVVDANGLVQADLRGAYVDVELAVAFAAANGLVV 194
>gi|300782748|ref|YP_003763039.1| pentapeptide repeat-containing protein [Amycolatopsis mediterranei
U32]
gi|299792262|gb|ADJ42637.1| pentapeptide repeat-containing protein [Amycolatopsis mediterranei
U32]
gi|340524122|gb|AEK39327.1| pentapeptide repeat-containing protein [Amycolatopsis mediterranei
S699]
Length=195
Score = 147 bits (371), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 86/179 (49%), Positives = 110/179 (62%), Gaps = 0/179 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
Q W FTG DF DLS L T F +C FS +L+ S+H SAFR+CTF+RT L
Sbjct 16 QWWEKRRFTGCDFTGADLSGLRTRGCTFDDCVFSRADLSRSRHDASAFRSCTFDRTVLAE 75
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S + CS+LG+ F + L + D +LA L LR L L+G RLRE +L++ DL
Sbjct 76 SRWTACSLLGTSFTDSGFGGIALTECDLSLASLAKARLRKLGLSGLRLREANLMEADLTG 135
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
LRG+DL+GAR GA+L+ ADLRGA +D A L GA VDV+ A+AFAAAHGL +
Sbjct 136 ADLRGSDLTGARLQGAKLEGADLRGARLDANALVQADLRGAEVDVETAIAFAAAHGLVI 194
>gi|284992789|ref|YP_003411343.1| pentapeptide repeat-containing protein [Geodermatophilus obscurus
DSM 43160]
gi|284066034|gb|ADB76972.1| pentapeptide repeat protein [Geodermatophilus obscurus DSM 43160]
Length=218
Score = 132 bits (333), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/169 (46%), Positives = 97/169 (58%), Gaps = 0/169 (0%)
Query 9 FTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWHSTFAQCS 68
FT F D L L T R +F C +GV + ++H G+AF +C F+R L+ T+ C
Sbjct 45 FTRCRFEDAGLEELVTRRCVFDSCVLTGVRMGGARHLGTAFLSCRFDRARLFDVTWDGCK 104
Query 69 MLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRKCVLRGAD 128
+ GS F +LRP+T D D++ L G DL GL L G R RE L DLR+C L GAD
Sbjct 105 LTGSQFPGAQLRPMTATDTDWSWTSLRGTDLSGLVLAGQRFREADLTGADLRECDLTGAD 164
Query 129 LSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHG 177
L AR GA+L ADLRGA+ D V WR L G R+D+ QAV A A G
Sbjct 165 LDRARLQGAQLRGADLRGASTDAVDWRAFELTGVRLDLVQAVQVARAQG 213
>gi|229818699|ref|YP_002880225.1| pentapeptide repeat-containing protein [Beutenbergia cavernae
DSM 12333]
gi|229564612|gb|ACQ78463.1| pentapeptide repeat-containing protein [Beutenbergia cavernae
DSM 12333]
Length=205
Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/173 (40%), Positives = 94/173 (55%), Gaps = 0/173 (0%)
Query 8 EFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWHSTFAQC 67
++ G F D DL+ T +FSEC FS V S H +AF NCTF R + +TF C
Sbjct 31 QYAGVRFVDVDLTEASTRGTVFSECVFSNVAFNVSHHASTAFVNCTFRRCNFFDATFTGC 90
Query 68 SMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRKCVLRGA 127
++G++F C + +D D++ A G DL G+ TG RLRE+ L + V G
Sbjct 91 KLVGAMFDGCSFGIMKVDRGDWSFAGFPGADLEGVEFTGVRLRESDLTHARCARSVFAGC 150
Query 128 DLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCL 180
DLSG+ GA DADLRG+ + + R +L GA + DQA+A AA GL +
Sbjct 151 DLSGSWLHGADFTDADLRGSALGEIDPRVVTLRGATITADQAIAIAAGLGLVV 203
>gi|333371278|ref|ZP_08463236.1| pentapeptide repeat domain protein [Desmospora sp. 8437]
gi|332976397|gb|EGK13247.1| pentapeptide repeat domain protein [Desmospora sp. 8437]
Length=199
Score = 122 bits (305), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 71/175 (41%), Positives = 92/175 (53%), Gaps = 0/175 (0%)
Query 3 QWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWHS 62
+W C F +FR+ L T ++F CDF+G L S H G+AF NC F T L+ S
Sbjct 21 EWEGCRFVRCNFREAVLKEWVTRSSVFEACDFTGAKLNASHHEGTAFLNCRFRGTDLYVS 80
Query 63 TFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRKC 122
FA C M GS F R+ +T+ D++L L DL G L G RE L D
Sbjct 81 RFASCKMTGSGFEEARMEGMTISGGDWSLTRLVYQDLSGFQLGGIHFREADLYRCDFTGA 140
Query 123 VLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHG 177
LR ADLS A GA L ADLR A ++ + W+ +L G R+D+ QAVA A + G
Sbjct 141 DLRRADLSYATLDGACLKGADLREAIMEGIPWKELNLEGTRIDMAQAVALAQSLG 195
>gi|302865634|ref|YP_003834271.1| pentapeptide repeat-containing protein [Micromonospora aurantiaca
ATCC 27029]
gi|315502181|ref|YP_004081068.1| pentapeptide repeat protein [Micromonospora sp. L5]
gi|302568493|gb|ADL44695.1| pentapeptide repeat protein [Micromonospora aurantiaca ATCC 27029]
gi|315408800|gb|ADU06917.1| pentapeptide repeat protein [Micromonospora sp. L5]
Length=197
Score = 115 bits (288), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/177 (38%), Positives = 94/177 (54%), Gaps = 0/177 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++ D F +F DL+ + A+F++C F V S+H SAF C+F + L+
Sbjct 18 EELADRHFVRCEFFHVDLTEAVSRGAVFTDCVFGNVAFNASRHTDSAFTRCSFSKCNLFE 77
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
+ F C ++GS F C LRPL +D D++ L G DLRG +T R+RE L DL
Sbjct 78 AEFTGCKLVGSTFDRCDLRPLRVDRGDWSFVTLAGADLRGARITDVRMREVDLTGADLTG 137
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGL 178
L G DLSGA+ A+L ADLRG+ + + GAR+D +QAV A A G
Sbjct 138 ATLTGVDLSGAQLHAAKLIRADLRGSDLSALDPTAVQRSGARIDAEQAVMLAQALGF 194
>gi|229084855|ref|ZP_04217111.1| Pentapeptide repeat protein [Bacillus cereus Rock3-44]
gi|228698470|gb|EEL51199.1| Pentapeptide repeat protein [Bacillus cereus Rock3-44]
Length=201
Score = 115 bits (288), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/176 (40%), Positives = 92/176 (53%), Gaps = 0/176 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++ +C F FR D S + E F ECDF+G S H+GS F NC F L+
Sbjct 22 EELKNCTFIKCRFRGVDASEVVAENCNFIECDFTGALFNASIHQGSTFANCKFSGANLFV 81
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F +C M GS F L +T+ D++ L +L L G RL E L + +L K
Sbjct 82 SKFEECKMTGSDFEEANLDGITIVSGDWSYTNLRFANLSKQKLKGIRLVEADLYECNLEK 141
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHG 177
LR ADL+G + A+L ADLRGA VD + +RT L R+D+ QAVA A +G
Sbjct 142 ADLRNADLTGVQLGKAKLTGADLRGAVVDRIDFRTFDLKNVRLDITQAVAVARCYG 197
>gi|330466022|ref|YP_004403765.1| pentapeptide repeat-containing protein [Verrucosispora maris
AB-18-032]
gi|328808993|gb|AEB43165.1| pentapeptide repeat protein [Verrucosispora maris AB-18-032]
Length=198
Score = 112 bits (281), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/177 (41%), Positives = 90/177 (51%), Gaps = 5/177 (2%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
+ +V C DF DL+ + A F C F V S+H SAF CTF R L+
Sbjct 23 RHYVHC-----DFHRVDLTEATSRGASFVGCTFGDVRFNVSRHTDSAFTRCTFIRCNLFE 77
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
+ F C +GS F C LRPLT+ D++ L G DLRG TG R+RET L DL
Sbjct 78 AEFTGCKFIGSTFDRCDLRPLTVTGGDWSFVTLAGADLRGARFTGVRMRETDLAGADLTG 137
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGL 178
+ G+DLS A+ G RL ADLRG+ + + G VDV QA+A A A G
Sbjct 138 ATVSGSDLSDAQWRGTRLTRADLRGSDLTGLDPTAVQRTGTIVDVGQAMALAQALGF 194
>gi|228990899|ref|ZP_04150863.1| Pentapeptide repeat protein [Bacillus pseudomycoides DSM 12442]
gi|228996973|ref|ZP_04156606.1| Pentapeptide repeat protein [Bacillus mycoides Rock3-17]
gi|229007881|ref|ZP_04165452.1| Pentapeptide repeat protein [Bacillus mycoides Rock1-4]
gi|228753386|gb|EEM02853.1| Pentapeptide repeat protein [Bacillus mycoides Rock1-4]
gi|228762852|gb|EEM11766.1| Pentapeptide repeat protein [Bacillus mycoides Rock3-17]
gi|228768836|gb|EEM17435.1| Pentapeptide repeat protein [Bacillus pseudomycoides DSM 12442]
Length=208
Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/176 (40%), Positives = 92/176 (53%), Gaps = 0/176 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++ +C F FR D S + TE F ECDF+G S H+GS F NC F L+
Sbjct 29 EELKNCTFIKCRFRGVDASEIVTENCNFIECDFTGALFNASIHQGSTFANCKFSGANLFV 88
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F +C M GS F L +T+ D++ L +L L G RL E L + +L K
Sbjct 89 SKFEECKMTGSDFEEANLDGITIVFGDWSYTNLRFANLSKQKLKGIRLVEADLCECNLEK 148
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHG 177
LR DL+GA+ A+L ADLRGA VD + +R L R+D+ QAVA A +G
Sbjct 149 ADLRDVDLTGAQLGKAKLIGADLRGAVVDRIDFRAFDLQNVRLDITQAVAVARCYG 204
>gi|145593641|ref|YP_001157938.1| pentapeptide repeat-containing protein [Salinispora tropica CNB-440]
gi|145302978|gb|ABP53560.1| pentapeptide repeat protein [Salinispora tropica CNB-440]
Length=197
Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/180 (38%), Positives = 93/180 (52%), Gaps = 6/180 (3%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++ D + G F DL+ T +F+EC F V+ S+H +AF C F R +
Sbjct 18 EEIADRHYVGCHFERADLTEATTRGVLFTECTFGNVSFNASRHVNTAFTRCVFRRCNFFA 77
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
+ F C ++GS F C LRPLT+ D++ L G DLRG + R+RE L+ DL
Sbjct 78 AEFTGCKLVGSTFDQCDLRPLTIVGGDWSFVALPGADLRGARVVDVRMREADLIRADLSG 137
Query 122 CVLRGADLSGARTTGARLDDADLRGA---TVDPVLWRTASLVGARVDVDQAVAFAAAHGL 178
+ G DLSGA+ A+L ADLRG+ +DP+ A GA V +QAV A A G
Sbjct 138 ATVTGVDLSGAQLQHAKLSRADLRGSDLTDLDPIEVERA---GAIVSAEQAVVLAQALGF 194
>gi|308071304|ref|YP_003872909.1| hypothetical protein PPE_04612 [Paenibacillus polymyxa E681]
gi|305860583|gb|ADM72371.1| Uncharacterized low-complexity protein [Paenibacillus polymyxa
E681]
Length=202
Score = 112 bits (280), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 64/180 (36%), Positives = 97/180 (54%), Gaps = 0/180 (0%)
Query 3 QWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWHS 62
+W +CEFT FR +++ E F +CDF+G L S ++ SAF NC F L+
Sbjct 23 EWKNCEFTRCRFRGVEMNESLVEGCTFVDCDFTGAILNASHYKESAFTNCLFTSANLFVV 82
Query 63 TFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRKC 122
F C ++GS F + +T+ D++ L DLR +L R +E L + +L K
Sbjct 83 RFDNCKLVGSDFAGANMDGITITGGDWSYTNLRHADLRKQDLRKVRFKEADLSECNLEKA 142
Query 123 VLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLAG 182
LR ADL+ R A L DLRGA +D V ++ + GA++D++QAVA A ++G + G
Sbjct 143 DLREADLTRIRLHKAHLQGTDLRGAKMDGVNFKDLDITGAKLDIEQAVAVARSYGAKVEG 202
>gi|42781015|ref|NP_978262.1| hypothetical protein BCE_1946 [Bacillus cereus ATCC 10987]
gi|42736936|gb|AAS40870.1| conserved hypothetical protein [Bacillus cereus ATCC 10987]
Length=201
Score = 110 bits (275), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/176 (37%), Positives = 93/176 (53%), Gaps = 0/176 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++ +C F FR D S + TE F ECDF+G S H+G+ F NC F L+
Sbjct 22 EELKNCTFIKCRFRGIDASEISTENCNFIECDFTGALFNASIHQGTTFANCKFVGANLFV 81
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F +C M GS F L +T+ D++ L +L L G RL E L + +L K
Sbjct 82 SKFEECKMTGSDFEEANLDGITIISGDWSYTNLRFANLSKQMLKGIRLMEADLYECNLEK 141
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHG 177
LR ADL+GA+ A+L A+L+GA VD + + + L ++D+ QAVA A +G
Sbjct 142 ADLREADLTGAQLGKAKLSSANLKGAIVDRIDFTSFDLKNVKLDIAQAVAVARCYG 197
>gi|229166768|ref|ZP_04294518.1| Pentapeptide repeat protein [Bacillus cereus AH621]
gi|228616765|gb|EEK73840.1| Pentapeptide repeat protein [Bacillus cereus AH621]
Length=201
Score = 110 bits (274), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 67/176 (39%), Positives = 92/176 (53%), Gaps = 0/176 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++ +C F FR D S + TE F ECDF+G S H+G+ F NC F L+
Sbjct 22 EELKNCTFIKCRFRGIDASEVFTENCNFIECDFTGTLFNASIHQGTTFANCRFFGANLFV 81
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F +C M GS F L +T+ D++ L +L L G RL E L + +L K
Sbjct 82 SKFEECKMTGSDFEEANLDGITIISGDWSYTNLRFANLSKQMLKGIRLIEADLYECNLEK 141
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHG 177
LR ADL+GA+ +L ADLRGA VD V ++ L ++D+ QAVA A +G
Sbjct 142 ADLREADLTGAQLGKVKLSGADLRGAVVDRVDFKAFDLKNVKLDIAQAVAVARCYG 197
>gi|229029598|ref|ZP_04185677.1| Pentapeptide repeat protein [Bacillus cereus AH1271]
gi|228731720|gb|EEL82623.1| Pentapeptide repeat protein [Bacillus cereus AH1271]
Length=201
Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/176 (37%), Positives = 94/176 (54%), Gaps = 0/176 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++ +C F FR D S + TE F ECDF+G S H+G+ F NC F L+
Sbjct 22 EELKNCTFIKCRFRGIDASEVSTENCNFIECDFTGALFNASIHQGTTFANCKFVGANLFV 81
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F +C M GS F L +T+ D++ L +L L G RL E L + +L K
Sbjct 82 SKFEECKMTGSDFEEANLDGITIISGDWSYTNLRFANLSKQMLKGIRLIEADLYECNLEK 141
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHG 177
LR ADL+GA+ A+L A+L+GA VD + + + +L ++D+ QAVA A +G
Sbjct 142 ADLREADLTGAQLGKAKLSGANLKGAIVDRIDFTSFNLKNVKLDIAQAVAVARCYG 197
>gi|118477320|ref|YP_894471.1| hypothetical protein BALH_1637 [Bacillus thuringiensis str. Al
Hakam]
gi|196045161|ref|ZP_03112394.1| conserved hypothetical protein [Bacillus cereus 03BB108]
gi|225863764|ref|YP_002749142.1| pentapeptide repeat protein [Bacillus cereus 03BB102]
gi|229184092|ref|ZP_04311303.1| Pentapeptide repeat protein [Bacillus cereus BGSC 6E1]
gi|118416545|gb|ABK84964.1| conserved hypothetical protein [Bacillus thuringiensis str. Al
Hakam]
gi|196024163|gb|EDX62837.1| conserved hypothetical protein [Bacillus cereus 03BB108]
gi|225789989|gb|ACO30206.1| pentapeptide repeat protein [Bacillus cereus 03BB102]
gi|228599381|gb|EEK56990.1| Pentapeptide repeat protein [Bacillus cereus BGSC 6E1]
Length=201
Score = 108 bits (271), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/176 (38%), Positives = 93/176 (53%), Gaps = 0/176 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++ +C F FR D S + TE F ECDF+G S H+G+ F NC F L+
Sbjct 22 EELKNCTFIKCCFRGIDASEVSTENCNFIECDFTGALFNASIHQGTTFANCKFVGANLFV 81
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F +C M GS F L +T+ D++ L +L L G RL E L + +L K
Sbjct 82 SKFEECKMTGSDFEEANLDGITIISGDWSYTNLRFANLSKQMLKGIRLIEADLYECNLEK 141
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHG 177
LR ADL+GA+ A+L A+LRGA VD + + + L ++D+ QAVA A +G
Sbjct 142 ADLREADLTGAQLGKAKLSGANLRGAIVDRIDFTSFDLKNVKLDIAQAVAVARCYG 197
>gi|229160860|ref|ZP_04288850.1| Pentapeptide repeat protein [Bacillus cereus R309803]
gi|228622597|gb|EEK79433.1| Pentapeptide repeat protein [Bacillus cereus R309803]
Length=201
Score = 108 bits (271), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/176 (37%), Positives = 93/176 (53%), Gaps = 0/176 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++ +C F FR D S + TE F ECDF+G S H+G+ F NC F L+
Sbjct 22 EELKNCTFIKCRFRGIDASEVSTENCNFIECDFTGALFNASIHKGTTFANCKFVGANLFV 81
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F +C M GS F L +T+ D++ L +L L G RL E L + +L+K
Sbjct 82 SKFEECKMTGSDFEEANLDGITIISGDWSYTNLRFANLSKQMLKGIRLIEADLYECNLKK 141
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHG 177
LR ADL+GA+ A+L A L+GA VD + + + L ++D+ QAVA A +G
Sbjct 142 ADLREADLTGAQLGKAKLSGAHLKGAIVDRIDFTSFDLKNVKLDIAQAVAVARCYG 197
>gi|238063106|ref|ZP_04607815.1| pentapeptide repeat-containing protein [Micromonospora sp. ATCC
39149]
gi|237884917|gb|EEP73745.1| pentapeptide repeat-containing protein [Micromonospora sp. ATCC
39149]
Length=197
Score = 108 bits (271), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 70/180 (39%), Positives = 96/180 (54%), Gaps = 11/180 (6%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
+ +V CEF DL+ + A+F+EC F V S+H SAF C F+R L+
Sbjct 23 RHFVRCEFF-----HVDLTEAVSRGAVFTECVFGNVAFNASRHADSAFTRCVFKRCNLFE 77
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
+ F C ++GS F C LRPL + D++ L G DLRG+ LT R+RE L DL
Sbjct 78 AEFTGCKLVGSSFEQCGLRPLRVVGGDWSFVALPGADLRGVCLTDVRMREVDLTGADLTD 137
Query 122 CVLRGADLSGARTTGARLDDADLRG---ATVDPVLWRTASLVGARVDVDQAVAFAAAHGL 178
+ G DLSGA G+ L ADLRG + +DP + + A GA +D +QA+ A A G
Sbjct 138 ATVTGVDLSGALLHGSLLSRADLRGSDLSALDPTVVQRA---GALIDTEQAMQLARALGF 194
>gi|163939702|ref|YP_001644586.1| pentapeptide repeat-containing protein [Bacillus weihenstephanensis
KBAB4]
gi|229059553|ref|ZP_04196934.1| Pentapeptide repeat protein [Bacillus cereus AH603]
gi|229132730|ref|ZP_04261576.1| Pentapeptide repeat protein [Bacillus cereus BDRD-ST196]
gi|163861899|gb|ABY42958.1| pentapeptide repeat protein [Bacillus weihenstephanensis KBAB4]
gi|228650740|gb|EEL06729.1| Pentapeptide repeat protein [Bacillus cereus BDRD-ST196]
gi|228719757|gb|EEL71352.1| Pentapeptide repeat protein [Bacillus cereus AH603]
Length=201
Score = 108 bits (269), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 66/176 (38%), Positives = 92/176 (53%), Gaps = 0/176 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++ +C F FR D S + T+ F ECDF+G S H+G+ F NC F L+
Sbjct 22 EELKNCTFIKCRFRGIDASEVFTKNCNFIECDFTGTLFNASIHQGTTFANCRFFGANLFV 81
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F +C M GS F L +T+ D++ L +L L G RL E L + +L K
Sbjct 82 SKFEECKMTGSDFEEANLDGITIISGDWSYTNLRFANLSKQMLKGIRLIEADLYECNLEK 141
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHG 177
LR ADL+GA+ +L ADLRGA VD V ++ L ++D+ QAVA A +G
Sbjct 142 ADLREADLTGAQLGKVKLSGADLRGAVVDRVDFKAFDLKNVKLDIAQAVAVARCYG 197
>gi|301053431|ref|YP_003791642.1| hypothetical protein BACI_c18470 [Bacillus cereus biovar anthracis
str. CI]
gi|300375600|gb|ADK04504.1| conserved hypothetical protein [Bacillus cereus biovar anthracis
str. CI]
Length=201
Score = 108 bits (269), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/176 (37%), Positives = 93/176 (53%), Gaps = 0/176 (0%)
Query 2 QQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWH 61
++ +C F FR D S + TE F ECDF+G S H+G+ F NC F T L+
Sbjct 22 EELKNCTFIKCCFRGIDASEISTENCNFIECDFTGALFNASIHQGTTFANCKFVGTNLFV 81
Query 62 STFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRK 121
S F +C M GS F L +T+ D++ L +L L G RL E L + +L K
Sbjct 82 SKFEECKMTGSDFEEANLDGITIISGDWSYTNLRFANLSKQMLKGIRLVEADLYECNLEK 141
Query 122 CVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHG 177
LR ADL+ A+ A+L A+L+GA VD + + + L ++D+ QAVA A +G
Sbjct 142 ADLREADLTSAQLGKAKLSGANLKGAIVDRIDFTSFDLKNVKLDIAQAVAVARCYG 197
>gi|300783407|ref|YP_003763698.1| pentapeptide repeat-containing protein [Amycolatopsis mediterranei
U32]
gi|299792921|gb|ADJ43296.1| pentapeptide repeat-containing protein [Amycolatopsis mediterranei
U32]
gi|340524793|gb|AEK39998.1| pentapeptide repeat-containing protein [Amycolatopsis mediterranei
S699]
Length=202
Score = 108 bits (269), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 88/180 (49%), Gaps = 5/180 (2%)
Query 4 WVDCEFTGR-----DFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTT 58
W E TGR +F + DLS T ++F+ C F V S+H SAF C F+R
Sbjct 19 WYGEEITGRHYVRCEFHEADLSEAVTRNSVFTGCVFGNVRFNASRHTDSAFTGCAFKRCN 78
Query 59 LWHSTFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTD 118
+ + F C ++G+ F C LRPL + D++ A L G DLR + G R+RE L +
Sbjct 79 FFDAEFTGCKLVGATFTECELRPLRVTGGDWSFAGLAGADLRAVTFQGVRMREADLTGAN 138
Query 119 LRKCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGL 178
V DLSGA +L A+LRG+ + + A L GA V +QA + GL
Sbjct 139 CAGAVFADVDLSGAMLHAVKLPKAELRGSDLSALDPLNAELAGAIVSPEQAAVLVTSLGL 198
>gi|310644538|ref|YP_003949297.1| pentapeptide repeat protein [Paenibacillus polymyxa SC2]
gi|309249489|gb|ADO59056.1| Pentapeptide repeat protein [Paenibacillus polymyxa SC2]
Length=202
Score = 108 bits (269), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 95/180 (53%), Gaps = 0/180 (0%)
Query 3 QWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWHS 62
+W +CEFT FR +++ E F +CDF+G L S ++ SAF NC F L+
Sbjct 23 EWRNCEFTRCRFRGVEMNESLVEDCTFVDCDFTGAILNASHYKESAFTNCLFTSANLFVV 82
Query 63 TFAQCSMLGSVFVACRLRPLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRKC 122
F C ++GS F + +T+ D++ L DLR +L R +E L + L K
Sbjct 83 RFDNCKLVGSDFAGANMDGITITGGDWSYTNLRHADLRKQDLRKIRFKEADLSECHLEKA 142
Query 123 VLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLVGARVDVDQAVAFAAAHGLCLAG 182
LR ADL+ R A L DLRGA +D V ++ + GA++D++ AVA A ++G + G
Sbjct 143 DLREADLTRVRLHKAHLQGTDLRGAKMDGVNFKDLDITGAKLDIELAVAVARSYGAKIDG 202
Lambda K H
0.325 0.137 0.430
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 167689013960
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40