BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1222
Length=154
Score E
Sequences producing significant alignments: (Bits) Value
gi|148661009|ref|YP_001282532.1| hypothetical protein MRA_1231 [... 309 7e-83
gi|15840666|ref|NP_335703.1| hypothetical protein MT1260 [Mycoba... 308 2e-82
gi|148822437|ref|YP_001287191.1| hypothetical protein TBFG_11246... 307 3e-82
gi|339297849|gb|AEJ49959.1| hypothetical protein CCDC5180_1122 [... 306 6e-82
gi|15608362|ref|NP_215738.1| hypothetical protein Rv1222 [Mycoba... 306 9e-82
gi|323720280|gb|EGB29378.1| hypothetical protein TMMG_01916 [Myc... 305 2e-81
gi|340626235|ref|YP_004744687.1| hypothetical protein MCAN_12361... 295 2e-78
gi|308231781|ref|ZP_07663929.1| hypothetical protein TMAG_01853 ... 287 4e-76
gi|15827527|ref|NP_301790.1| hypothetical protein ML1077 [Mycoba... 197 4e-49
gi|296170126|ref|ZP_06851725.1| conserved hypothetical protein [... 196 8e-49
gi|118619620|ref|YP_907952.1| hypothetical protein MUL_4518 [Myc... 196 9e-49
gi|118463820|ref|YP_880609.1| RNA polymerase sigma-70 factor [My... 192 2e-47
gi|2062635|gb|AAC45220.1| unknown [Mycobacterium avium] 190 6e-47
gi|41408654|ref|NP_961490.1| hypothetical protein MAP2556c [Myco... 190 7e-47
gi|342862105|ref|ZP_08718748.1| RNA polymerase sigma-70 factor [... 189 8e-47
gi|240168998|ref|ZP_04747657.1| hypothetical protein MkanA1_0677... 189 1e-46
gi|254822083|ref|ZP_05227084.1| RNA polymerase sigma-70 factor [... 186 1e-45
gi|254774243|ref|ZP_05215759.1| hypothetical protein MaviaA2_061... 181 3e-44
gi|118470197|ref|YP_889321.1| hypothetical protein MSMEG_5071 [M... 177 4e-43
gi|2062638|gb|AAC45222.1| unknown [Mycobacterium smegmatis] 177 4e-43
gi|108800957|ref|YP_641154.1| hypothetical protein Mmcs_3993 [My... 177 5e-43
gi|126436582|ref|YP_001072273.1| hypothetical protein Mjls_4007 ... 175 2e-42
gi|333989836|ref|YP_004522450.1| hypothetical protein JDM601_119... 171 4e-41
gi|120405447|ref|YP_955276.1| hypothetical protein Mvan_4495 [My... 158 2e-37
gi|315443257|ref|YP_004076136.1| hypothetical protein Mspyr1_163... 153 1e-35
gi|145222789|ref|YP_001133467.1| hypothetical protein Mflv_2201 ... 150 7e-35
gi|169628454|ref|YP_001702103.1| hypothetical protein MAB_1363 [... 141 3e-32
gi|226365447|ref|YP_002783230.1| hypothetical protein ROP_60380 ... 113 1e-23
gi|111022941|ref|YP_705913.1| hypothetical protein RHA1_ro05978 ... 111 5e-23
gi|312138876|ref|YP_004006212.1| hypothetical protein REQ_14470 ... 107 7e-22
gi|226307644|ref|YP_002767604.1| hypothetical protein RER_41570 ... 100 9e-20
gi|54026704|ref|YP_120946.1| hypothetical protein nfa47300 [Noca... 99.0 2e-19
gi|229493853|ref|ZP_04387626.1| RNA polymerase sigma-70 factor [... 97.8 4e-19
gi|296138796|ref|YP_003646039.1| hypothetical protein Tpau_1068 ... 97.1 9e-19
gi|343928011|ref|ZP_08767476.1| hypothetical protein GOALK_100_0... 95.1 3e-18
gi|333921003|ref|YP_004494584.1| hypothetical protein AS9A_3343 ... 89.0 2e-16
gi|296394493|ref|YP_003659377.1| hypothetical protein Srot_2091 ... 73.6 9e-12
gi|317508222|ref|ZP_07965902.1| agrin [Segniliparus rugosus ATCC... 70.1 9e-11
gi|237785245|ref|YP_002905950.1| anti-sigma factor [Corynebacter... 68.9 2e-10
gi|38233590|ref|NP_939357.1| hypothetical protein DIP0995 [Coryn... 65.5 3e-09
gi|227487902|ref|ZP_03918218.1| conserved hypothetical protein [... 61.6 4e-08
gi|325002582|ref|ZP_08123694.1| hypothetical protein PseP1_27637... 61.6 4e-08
gi|262201799|ref|YP_003273007.1| hypothetical protein Gbro_1858 ... 61.6 4e-08
gi|300858210|ref|YP_003783193.1| anti-sigma factor [Corynebacter... 60.1 1e-07
gi|291453975|ref|ZP_06593365.1| conserved hypothetical protein [... 59.3 2e-07
gi|326382037|ref|ZP_08203730.1| hypothetical protein SCNU_03797 ... 58.9 3e-07
gi|337290464|ref|YP_004629485.1| anti-sigma factor [Corynebacter... 58.9 3e-07
gi|258652368|ref|YP_003201524.1| hypothetical protein Namu_2157 ... 58.5 3e-07
gi|227548419|ref|ZP_03978468.1| conserved hypothetical protein [... 57.8 6e-07
gi|331694894|ref|YP_004331133.1| hypothetical protein Psed_1029 ... 57.4 8e-07
>gi|148661009|ref|YP_001282532.1| hypothetical protein MRA_1231 [Mycobacterium tuberculosis H37Ra]
gi|167967896|ref|ZP_02550173.1| hypothetical protein MtubH3_07623 [Mycobacterium tuberculosis
H37Ra]
gi|253799734|ref|YP_003032735.1| RNA polymerase sigma-70 factor [Mycobacterium tuberculosis KZN
1435]
18 more sequence titles
Length=219
Score = 309 bits (792), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 154/154 (100%), Positives = 154/154 (100%), Gaps = 0/154 (0%)
Query 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH
Sbjct 66 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 125
Query 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 120
LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG
Sbjct 126 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 185
Query 121 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 154
GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR
Sbjct 186 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 219
>gi|15840666|ref|NP_335703.1| hypothetical protein MT1260 [Mycobacterium tuberculosis CDC1551]
gi|13880852|gb|AAK45517.1| hypothetical protein MT1260 [Mycobacterium tuberculosis CDC1551]
Length=219
Score = 308 bits (789), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 153/154 (99%), Positives = 153/154 (99%), Gaps = 0/154 (0%)
Query 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
MADPGSVGHVFRRAFSWLPAQF SQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH
Sbjct 66 MADPGSVGHVFRRAFSWLPAQFTSQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 125
Query 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 120
LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG
Sbjct 126 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 185
Query 121 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 154
GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR
Sbjct 186 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 219
>gi|148822437|ref|YP_001287191.1| hypothetical protein TBFG_11246 [Mycobacterium tuberculosis F11]
gi|254231484|ref|ZP_04924811.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|2062625|gb|AAC45269.1| unknown [Mycobacterium tuberculosis H37Rv]
gi|124600543|gb|EAY59553.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|148720964|gb|ABR05589.1| conserved hypothetical protein [Mycobacterium tuberculosis F11]
Length=193
Score = 307 bits (787), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 154/154 (100%), Positives = 154/154 (100%), Gaps = 0/154 (0%)
Query 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH
Sbjct 40 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 99
Query 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 120
LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG
Sbjct 100 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 159
Query 121 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 154
GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR
Sbjct 160 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 193
>gi|339297849|gb|AEJ49959.1| hypothetical protein CCDC5180_1122 [Mycobacterium tuberculosis
CCDC5180]
Length=156
Score = 306 bits (784), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 154/154 (100%), Positives = 154/154 (100%), Gaps = 0/154 (0%)
Query 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH
Sbjct 3 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 62
Query 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 120
LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG
Sbjct 63 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 122
Query 121 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 154
GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR
Sbjct 123 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 156
>gi|15608362|ref|NP_215738.1| hypothetical protein Rv1222 [Mycobacterium tuberculosis H37Rv]
gi|31792415|ref|NP_854908.1| hypothetical protein Mb1254 [Mycobacterium bovis AF2122/97]
gi|121637151|ref|YP_977374.1| hypothetical protein BCG_1282 [Mycobacterium bovis BCG str. Pasteur
1173P2]
46 more sequence titles
Length=154
Score = 306 bits (783), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 154/154 (100%), Positives = 154/154 (100%), Gaps = 0/154 (0%)
Query 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH
Sbjct 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
Query 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 120
LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG
Sbjct 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 120
Query 121 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 154
GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR
Sbjct 121 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 154
>gi|323720280|gb|EGB29378.1| hypothetical protein TMMG_01916 [Mycobacterium tuberculosis CDC1551A]
Length=154
Score = 305 bits (781), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 153/154 (99%), Positives = 153/154 (99%), Gaps = 0/154 (0%)
Query 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
MADPGSVGHVFRRAFSWLPAQF SQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH
Sbjct 1 MADPGSVGHVFRRAFSWLPAQFTSQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
Query 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 120
LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG
Sbjct 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 120
Query 121 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 154
GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR
Sbjct 121 GSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR 154
>gi|340626235|ref|YP_004744687.1| hypothetical protein MCAN_12361 [Mycobacterium canettii CIPT
140010059]
gi|340004425|emb|CCC43568.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059]
Length=148
Score = 295 bits (755), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 148/148 (100%), Positives = 148/148 (100%), Gaps = 0/148 (0%)
Query 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH
Sbjct 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
Query 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 120
LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG
Sbjct 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 120
Query 121 GSSQGPPDGAAAGFGDRFADGDGGNRGR 148
GSSQGPPDGAAAGFGDRFADGDGGNRGR
Sbjct 121 GSSQGPPDGAAAGFGDRFADGDGGNRGR 148
>gi|308231781|ref|ZP_07663929.1| hypothetical protein TMAG_01853 [Mycobacterium tuberculosis SUMu001]
gi|308216094|gb|EFO75493.1| hypothetical protein TMAG_01853 [Mycobacterium tuberculosis SUMu001]
gi|339294209|gb|AEJ46320.1| hypothetical protein CCDC5079_1130 [Mycobacterium tuberculosis
CCDC5079]
Length=145
Score = 287 bits (734), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 144/145 (99%), Positives = 145/145 (100%), Gaps = 0/145 (0%)
Query 10 VFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSL 69
+FRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSL
Sbjct 1 MFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSL 60
Query 70 CAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSGGSSQGPPDG 129
CAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSGGSSQGPPDG
Sbjct 61 CAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSGGSSQGPPDG 120
Query 130 AAAGFGDRFADGDGGNRGRQSRVRR 154
AAAGFGDRFADGDGGNRGRQSRVRR
Sbjct 121 AAAGFGDRFADGDGGNRGRQSRVRR 145
>gi|15827527|ref|NP_301790.1| hypothetical protein ML1077 [Mycobacterium leprae TN]
gi|221230004|ref|YP_002503420.1| hypothetical protein MLBr_01077 [Mycobacterium leprae Br4923]
gi|13093077|emb|CAC31458.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219933111|emb|CAR71172.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=139
Score = 197 bits (502), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 101/142 (72%), Positives = 110/142 (78%), Gaps = 8/142 (5%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
G VFRR FSWLPAQFASQ+DAPVGAPR+F STEHLS+EAIAAFVDGELRMNAHLRAAHH+
Sbjct 6 GQVFRRVFSWLPAQFASQNDAPVGAPRRFGSTEHLSVEAIAAFVDGELRMNAHLRAAHHI 65
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSGGSSQGPP 127
SLCAQCAAEVDDQSR RAALRDSHPIRIPSTL GLL+ IPRC P+ S S S
Sbjct 66 SLCAQCAAEVDDQSRTRAALRDSHPIRIPSTLFGLLTAIPRCSPDYTSPVSEPFSE---- 121
Query 128 DGAAAGFGDRFADGDGGNRGRQ 149
DRF DG +G++
Sbjct 122 ----GSVSDRFVDGVAREQGKR 139
>gi|296170126|ref|ZP_06851725.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295895228|gb|EFG74941.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=132
Score = 196 bits (499), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 106/147 (73%), Positives = 112/147 (77%), Gaps = 20/147 (13%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
GHVFRRAFSWLPAQFASQSDAPVGAPRQF STEHLS+EAIAAFVDGELRMNAHLRAAHHL
Sbjct 6 GHVFRRAFSWLPAQFASQSDAPVGAPRQFGSTEHLSVEAIAAFVDGELRMNAHLRAAHHL 65
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSGGSSQGPP 127
SLC QCAAEVDDQSRARAALRDSHPIRIPSTLLG+L+EIP P
Sbjct 66 SLCPQCAAEVDDQSRARAALRDSHPIRIPSTLLGMLAEIP----------------YESP 109
Query 128 DGAAAGFGDRFADGDGGNRGRQSRVRR 154
D + +RFAD D R+ R RR
Sbjct 110 DDSTPRASERFADPD----AREQRKRR 132
>gi|118619620|ref|YP_907952.1| hypothetical protein MUL_4518 [Mycobacterium ulcerans Agy99]
gi|183984187|ref|YP_001852478.1| hypothetical protein MMAR_4215 [Mycobacterium marinum M]
gi|118571730|gb|ABL06481.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
gi|183177513|gb|ACC42623.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=134
Score = 196 bits (498), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 95/113 (85%), Positives = 104/113 (93%), Gaps = 0/113 (0%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
GHVFRRAFSWLPAQFASQSDAPVGAPR+FRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL
Sbjct 5 GHVFRRAFSWLPAQFASQSDAPVGAPRRFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 64
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSG 120
S+C QCAAEV+D +RARAALRDSHPIRIP+ LLG+LSEIP P EG + ++G
Sbjct 65 SMCPQCAAEVEDHTRARAALRDSHPIRIPTALLGMLSEIPHHPSEGAPEDTTG 117
>gi|118463820|ref|YP_880609.1| RNA polymerase sigma-70 factor [Mycobacterium avium 104]
gi|118165107|gb|ABK66004.1| RNA polymerase sigma-70 factor [Mycobacterium avium 104]
Length=424
Score = 192 bits (487), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 94/107 (88%), Positives = 99/107 (93%), Gaps = 0/107 (0%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
GHVFRRAFSWLPAQFASQSDAPVGAPRQF STEHLS+EAIAAFVDGELRMNAHLRAAHHL
Sbjct 303 GHVFRRAFSWLPAQFASQSDAPVGAPRQFGSTEHLSVEAIAAFVDGELRMNAHLRAAHHL 362
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGP 114
S CAQCAAEVDDQSRARAALRDS PIRIP+ LLG+L+EIP P+ P
Sbjct 363 SQCAQCAAEVDDQSRARAALRDSRPIRIPANLLGMLAEIPYESPDAP 409
>gi|2062635|gb|AAC45220.1| unknown [Mycobacterium avium]
Length=133
Score = 190 bits (483), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 94/107 (88%), Positives = 99/107 (93%), Gaps = 0/107 (0%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
GHVFRRAFSWLPAQFASQSDAPVGAPRQF STEHLS+EAIAAFVDGELRMNAHLRAAHHL
Sbjct 12 GHVFRRAFSWLPAQFASQSDAPVGAPRQFGSTEHLSVEAIAAFVDGELRMNAHLRAAHHL 71
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGP 114
S CAQCAAEVDDQSRARAALRDS PIRIP+ LLG+L+EIP P+ P
Sbjct 72 SQCAQCAAEVDDQSRARAALRDSRPIRIPANLLGMLAEIPYESPDAP 118
>gi|41408654|ref|NP_961490.1| hypothetical protein MAP2556c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41397012|gb|AAS04873.1| hypothetical protein MAP_2556c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336458557|gb|EGO37524.1| hypothetical protein MAPs_11470 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=133
Score = 190 bits (482), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 94/107 (88%), Positives = 99/107 (93%), Gaps = 0/107 (0%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
GHVFRRAFSWLPAQFASQSDAPVGAPRQF STEHLS+EAIAAFVDGELRMNAHLRAAHHL
Sbjct 12 GHVFRRAFSWLPAQFASQSDAPVGAPRQFGSTEHLSVEAIAAFVDGELRMNAHLRAAHHL 71
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGP 114
S CAQCAAEVDDQSRARAALRDS PIRIP+ LLG+L+EIP P+ P
Sbjct 72 SQCAQCAAEVDDQSRARAALRDSRPIRIPANLLGMLAEIPYESPDTP 118
>gi|342862105|ref|ZP_08718748.1| RNA polymerase sigma-70 factor [Mycobacterium colombiense CECT
3035]
gi|342130409|gb|EGT83724.1| RNA polymerase sigma-70 factor [Mycobacterium colombiense CECT
3035]
Length=138
Score = 189 bits (481), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 95/112 (85%), Positives = 102/112 (92%), Gaps = 1/112 (0%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
GHVFRRAFSWLPAQFASQSDAPVGAPRQF STEHLS+EAIAAFVDGELRMNAHLRAAHHL
Sbjct 12 GHVFRRAFSWLPAQFASQSDAPVGAPRQFGSTEHLSVEAIAAFVDGELRMNAHLRAAHHL 71
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIP-RCPPEGPSKGS 118
SLCAQCA EV+DQSRARAALRDS PIRIPSTLLG+L++IP P + P+ S
Sbjct 72 SLCAQCAGEVEDQSRARAALRDSRPIRIPSTLLGMLADIPYESPDDSPTHAS 123
>gi|240168998|ref|ZP_04747657.1| hypothetical protein MkanA1_06779 [Mycobacterium kansasii ATCC
12478]
Length=130
Score = 189 bits (480), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 95/103 (93%), Positives = 96/103 (94%), Gaps = 0/103 (0%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
GHVFRRAFSWLP QFASQSDAPVGAPR+FRSTEHLSIEAIAAFVDGEL MNAHLRAAHHL
Sbjct 5 GHVFRRAFSWLPTQFASQSDAPVGAPRRFRSTEHLSIEAIAAFVDGELTMNAHLRAAHHL 64
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCP 110
SLC QCAAEVDD SRARAALRDS PIRIPSTLLGLLSEIP P
Sbjct 65 SLCPQCAAEVDDHSRARAALRDSRPIRIPSTLLGLLSEIPHQP 107
>gi|254822083|ref|ZP_05227084.1| RNA polymerase sigma-70 factor [Mycobacterium intracellulare
ATCC 13950]
Length=125
Score = 186 bits (472), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 90/98 (92%), Positives = 95/98 (97%), Gaps = 0/98 (0%)
Query 10 VFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSL 69
+FRRAFSWLPAQFASQSDAPVGAPRQF STEHLS+EAIAAFVDGELRMNAHLRAAHHLS+
Sbjct 1 MFRRAFSWLPAQFASQSDAPVGAPRQFGSTEHLSVEAIAAFVDGELRMNAHLRAAHHLSM 60
Query 70 CAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIP 107
C QCAAEVDDQSR RAALRDSHPIRIPSTLLG+L+EIP
Sbjct 61 CPQCAAEVDDQSRTRAALRDSHPIRIPSTLLGMLAEIP 98
>gi|254774243|ref|ZP_05215759.1| hypothetical protein MaviaA2_06180 [Mycobacterium avium subsp.
avium ATCC 25291]
Length=120
Score = 181 bits (459), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 90/105 (86%), Positives = 96/105 (92%), Gaps = 0/105 (0%)
Query 10 VFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSL 69
+FRRAFSWLPAQFASQSDAPVGAPRQF STEHLS+EAIA FVDGELRMNAHLRAAHHLS
Sbjct 1 MFRRAFSWLPAQFASQSDAPVGAPRQFGSTEHLSVEAIAPFVDGELRMNAHLRAAHHLSQ 60
Query 70 CAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGP 114
CAQCAAEVDDQSRARAALRDS PIRIP+ LLG+L+EIP P+ P
Sbjct 61 CAQCAAEVDDQSRARAALRDSRPIRIPANLLGMLAEIPYESPDTP 105
>gi|118470197|ref|YP_889321.1| hypothetical protein MSMEG_5071 [Mycobacterium smegmatis str.
MC2 155]
gi|118171484|gb|ABK72380.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=132
Score = 177 bits (450), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 90/112 (81%), Positives = 100/112 (90%), Gaps = 3/112 (2%)
Query 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
MADPG HVFRRAFSWLP+QFASQSDAPVGAPRQF STEHLS+EAIAAFVDGELRM+AH
Sbjct 1 MADPG---HVFRRAFSWLPSQFASQSDAPVGAPRQFGSTEHLSVEAIAAFVDGELRMSAH 57
Query 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPE 112
LRAAHHLSLC +CAAEVD QS+AR ALR+S PI IP++LLG+LS+IP PE
Sbjct 58 LRAAHHLSLCPECAAEVDAQSQARTALRESCPIAIPNSLLGMLSQIPHRTPE 109
>gi|2062638|gb|AAC45222.1| unknown [Mycobacterium smegmatis]
Length=145
Score = 177 bits (449), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 90/112 (81%), Positives = 100/112 (90%), Gaps = 3/112 (2%)
Query 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
MADPG HVFRRAFSWLP+QFASQSDAPVGAPRQF STEHLS+EAIAAFVDGELRM+AH
Sbjct 14 MADPG---HVFRRAFSWLPSQFASQSDAPVGAPRQFGSTEHLSVEAIAAFVDGELRMSAH 70
Query 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPE 112
LRAAHHLSLC +CAAEVD QS+AR ALR+S PI IP++LLG+LS+IP PE
Sbjct 71 LRAAHHLSLCPECAAEVDAQSQARTALRESCPIAIPNSLLGMLSQIPHRTPE 122
>gi|108800957|ref|YP_641154.1| hypothetical protein Mmcs_3993 [Mycobacterium sp. MCS]
gi|119870097|ref|YP_940049.1| hypothetical protein Mkms_4067 [Mycobacterium sp. KMS]
gi|108771376|gb|ABG10098.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119696186|gb|ABL93259.1| conserved hypothetical protein [Mycobacterium sp. KMS]
Length=132
Score = 177 bits (449), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 89/116 (77%), Positives = 99/116 (86%), Gaps = 3/116 (2%)
Query 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
M DPG HVFRRAFSWLPAQFASQS+APVGAPRQF STEHLSIEA+AAFVDGEL M AH
Sbjct 1 MVDPG---HVFRRAFSWLPAQFASQSNAPVGAPRQFGSTEHLSIEAVAAFVDGELSMTAH 57
Query 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSK 116
+RAA HLSLC QCAAEVD QS+AR+ALRDS PI IP+TLLG+LS+IP+ P P +
Sbjct 58 MRAASHLSLCPQCAAEVDAQSQARSALRDSQPIAIPNTLLGMLSQIPQHMPHSPVE 113
>gi|126436582|ref|YP_001072273.1| hypothetical protein Mjls_4007 [Mycobacterium sp. JLS]
gi|126236382|gb|ABN99782.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=132
Score = 175 bits (444), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 87/111 (79%), Positives = 97/111 (88%), Gaps = 3/111 (2%)
Query 1 MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAH 60
M DPG HVFRRAFSWLPAQFASQS+APVGAPRQF STEHLSIEA+AAFVDGEL M AH
Sbjct 1 MVDPG---HVFRRAFSWLPAQFASQSNAPVGAPRQFGSTEHLSIEAVAAFVDGELSMTAH 57
Query 61 LRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPRCPP 111
+RAA+H+SLC QCAAEVD QS+AR ALRDS PI IP+TLLG+LS+IP+ P
Sbjct 58 MRAANHMSLCLQCAAEVDAQSQARTALRDSQPIAIPNTLLGMLSQIPQHMP 108
>gi|333989836|ref|YP_004522450.1| hypothetical protein JDM601_1196 [Mycobacterium sp. JDM601]
gi|333485804|gb|AEF35196.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=135
Score = 171 bits (432), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 84/99 (85%), Positives = 91/99 (92%), Gaps = 0/99 (0%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
G VFRRAFSWLP+QFASQSDAPVGAPRQF STEHL EA+AA+VDGELRMNAHLRAAHHL
Sbjct 5 GGVFRRAFSWLPSQFASQSDAPVGAPRQFGSTEHLCSEAVAAYVDGELRMNAHLRAAHHL 64
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEI 106
SLC+ CAAEV+ Q RAR+ALRDSHPIRIPS LLGLLS+I
Sbjct 65 SLCSDCAAEVEYQGRARSALRDSHPIRIPSALLGLLSQI 103
>gi|120405447|ref|YP_955276.1| hypothetical protein Mvan_4495 [Mycobacterium vanbaalenii PYR-1]
gi|119958265|gb|ABM15270.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=132
Score = 158 bits (400), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/119 (70%), Positives = 94/119 (79%), Gaps = 2/119 (1%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
GH FRRAFSWLP+Q ASQSD PVG PRQF STEHLSIEA+AAFVDGELRM+AHLRAAHHL
Sbjct 6 GHAFRRAFSWLPSQLASQSDDPVG-PRQFGSTEHLSIEAVAAFVDGELRMSAHLRAAHHL 64
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPR-CPPEGPSKGSSGGSSQG 125
SLC +CA EVD Q +AR ALR+S P+ +PS+LLGLLS+IP P E P S + G
Sbjct 65 SLCPECALEVDAQRQAREALRESRPVAMPSSLLGLLSQIPNHTPVEAPEPADSPQLADG 123
>gi|315443257|ref|YP_004076136.1| hypothetical protein Mspyr1_16340 [Mycobacterium sp. Spyr1]
gi|315261560|gb|ADT98301.1| hypothetical protein Mspyr1_16340 [Mycobacterium sp. Spyr1]
Length=128
Score = 153 bits (386), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/121 (71%), Positives = 92/121 (77%), Gaps = 6/121 (4%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
GH FRRAFSWLP SQSDAPVG PRQF STEHLS EAIAAF DGELRM AHLRAAHHL
Sbjct 6 GHAFRRAFSWLP----SQSDAPVG-PRQFGSTEHLSTEAIAAFADGELRMTAHLRAAHHL 60
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIP-RCPPEGPSKGSSGGSSQGP 126
SLCA+CA EVD Q +AR ALRDS P+ IPS+LLGLLS+IP R P + S + GP
Sbjct 61 SLCAECAQEVDAQRQAREALRDSCPVDIPSSLLGLLSQIPNRTPLDAQEPSESPQLADGP 120
Query 127 P 127
P
Sbjct 121 P 121
>gi|145222789|ref|YP_001133467.1| hypothetical protein Mflv_2201 [Mycobacterium gilvum PYR-GCK]
gi|145215275|gb|ABP44679.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
Length=133
Score = 150 bits (379), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 83/121 (69%), Positives = 92/121 (77%), Gaps = 6/121 (4%)
Query 8 GHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHL 67
GH FRRAFSWLP SQSD+PVG PRQF STEHLS EAIAAF DGELRM AHLRAAHHL
Sbjct 11 GHAFRRAFSWLP----SQSDSPVG-PRQFGSTEHLSTEAIAAFADGELRMTAHLRAAHHL 65
Query 68 SLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIP-RCPPEGPSKGSSGGSSQGP 126
SLC++CA EVD Q +AR ALRDS P+ IPS+LLGLLS+IP R P + S + GP
Sbjct 66 SLCSECAQEVDAQRQAREALRDSCPVDIPSSLLGLLSQIPNRTPVDAQEPPESPQLADGP 125
Query 127 P 127
P
Sbjct 126 P 126
>gi|169628454|ref|YP_001702103.1| hypothetical protein MAB_1363 [Mycobacterium abscessus ATCC 19977]
gi|169240421|emb|CAM61449.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=147
Score = 141 bits (355), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 71/93 (77%), Positives = 80/93 (87%), Gaps = 0/93 (0%)
Query 18 LPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEV 77
LP+ FASQS APVGAPRQF STEHLS EAIAAFVDGELRM+AHLRAAHHLS+CA+CA E+
Sbjct 20 LPSPFASQSGAPVGAPRQFGSTEHLSTEAIAAFVDGELRMSAHLRAAHHLSMCAECALEI 79
Query 78 DDQSRARAALRDSHPIRIPSTLLGLLSEIPRCP 110
D Q +AR ALRDS IR+P +LLGLLS+IP P
Sbjct 80 DAQRQARTALRDSGAIRVPGSLLGLLSQIPHIP 112
>gi|226365447|ref|YP_002783230.1| hypothetical protein ROP_60380 [Rhodococcus opacus B4]
gi|226243937|dbj|BAH54285.1| hypothetical protein [Rhodococcus opacus B4]
Length=120
Score = 113 bits (282), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 58/87 (67%), Positives = 68/87 (79%), Gaps = 0/87 (0%)
Query 32 APRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSH 91
PRQF STEHL+ EAIAAFVDGELRM+A+LRAAHHLS+CA+CA EVD Q +AR ALR S
Sbjct 6 VPRQFGSTEHLASEAIAAFVDGELRMSAYLRAAHHLSICAECAFEVDSQQQARRALRRSG 65
Query 92 PIRIPSTLLGLLSEIPRCPPEGPSKGS 118
+ +PS LLGLLS+IP C P+ S
Sbjct 66 DVAMPSGLLGLLSQIPSCNQGDPADKS 92
>gi|111022941|ref|YP_705913.1| hypothetical protein RHA1_ro05978 [Rhodococcus jostii RHA1]
gi|110822471|gb|ABG97755.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=120
Score = 111 bits (277), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 55/78 (71%), Positives = 65/78 (84%), Gaps = 0/78 (0%)
Query 32 APRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSH 91
PRQF STEHL+ EAIAA+VDGELRM+A+LRAAHHLS+CA+CA EVD Q +AR ALR S
Sbjct 6 VPRQFGSTEHLASEAIAAYVDGELRMSAYLRAAHHLSICAECAFEVDSQQQARRALRRSG 65
Query 92 PIRIPSTLLGLLSEIPRC 109
+ +PS LLGLLS+IP C
Sbjct 66 DVAMPSGLLGLLSQIPSC 83
>gi|312138876|ref|YP_004006212.1| hypothetical protein REQ_14470 [Rhodococcus equi 103S]
gi|325676419|ref|ZP_08156097.1| hypothetical protein HMPREF0724_13880 [Rhodococcus equi ATCC
33707]
gi|311888215|emb|CBH47527.1| conserved hypothetical protein [Rhodococcus equi 103S]
gi|325552597|gb|EGD22281.1| hypothetical protein HMPREF0724_13880 [Rhodococcus equi ATCC
33707]
Length=125
Score = 107 bits (267), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 54/83 (66%), Positives = 67/83 (81%), Gaps = 1/83 (1%)
Query 33 PRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRD-SH 91
PRQF STEHL+ EAIA++VDGELRMNA+LRA+ HL+LC CAAEV+ Q +AR ALR +
Sbjct 12 PRQFGSTEHLASEAIASYVDGELRMNAYLRASQHLALCPDCAAEVEAQQQARIALRRAAS 71
Query 92 PIRIPSTLLGLLSEIPRCPPEGP 114
+ +PS+LLGLLS+IPRC P P
Sbjct 72 EVSMPSSLLGLLSQIPRCHPAEP 94
>gi|226307644|ref|YP_002767604.1| hypothetical protein RER_41570 [Rhodococcus erythropolis PR4]
gi|226186761|dbj|BAH34865.1| hypothetical protein RER_41570 [Rhodococcus erythropolis PR4]
Length=127
Score = 100 bits (248), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 58/100 (58%), Positives = 74/100 (74%), Gaps = 2/100 (2%)
Query 32 APRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSH 91
A RQF STEHL+ EAIAA+VDGELRM A+LRA+HH+S+CA+CAA VD Q +AR ALR S
Sbjct 14 AHRQFGSTEHLASEAIAAYVDGELRMQAYLRASHHISICAECAAAVDAQQQARGALRRSG 73
Query 92 PIRIPSTLLGLLSEIPRC--PPEGPSKGSSGGSSQGPPDG 129
+ +P +L+GLLS+IP C P GP+ ++ GS P G
Sbjct 74 EMTMPLSLVGLLSQIPSCNSPTTGPNSENADGSVGNQPAG 113
>gi|54026704|ref|YP_120946.1| hypothetical protein nfa47300 [Nocardia farcinica IFM 10152]
gi|54018212|dbj|BAD59582.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=159
Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/88 (57%), Positives = 63/88 (72%), Gaps = 0/88 (0%)
Query 31 GAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDS 90
G P +F TEHL+ EA+ A+VDGELRMNA+LRAAHH+S+C +CAAEV+ Q +AR ALR S
Sbjct 40 GRPPRFAPTEHLASEAVVAYVDGELRMNAYLRAAHHISVCPECAAEVEAQQQARIALRQS 99
Query 91 HPIRIPSTLLGLLSEIPRCPPEGPSKGS 118
PI +P +L LS IP GP + S
Sbjct 100 GPIAVPRSLHDSLSRIPLAELPGPVENS 127
>gi|229493853|ref|ZP_04387626.1| RNA polymerase sigma-70 factor [Rhodococcus erythropolis SK121]
gi|229319240|gb|EEN85088.1| RNA polymerase sigma-70 factor [Rhodococcus erythropolis SK121]
Length=119
Score = 97.8 bits (242), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 57/100 (57%), Positives = 73/100 (73%), Gaps = 2/100 (2%)
Query 32 APRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSH 91
A RQF STEHL+ EAIAA+VDGELRM A+LRA+HH+S+CA+CAA VD Q +AR ALR S
Sbjct 6 AHRQFGSTEHLASEAIAAYVDGELRMQAYLRASHHISICAECAAAVDAQQQARGALRRSG 65
Query 92 PIRIPSTLLGLLSEIPRC--PPEGPSKGSSGGSSQGPPDG 129
+ +P +L+GLLS+IP C P GP+ ++ S P G
Sbjct 66 EMTMPLSLVGLLSQIPSCNSPTTGPNSENADSSVGNQPAG 105
>gi|296138796|ref|YP_003646039.1| hypothetical protein Tpau_1068 [Tsukamurella paurometabola DSM
20162]
gi|296026930|gb|ADG77700.1| conserved hypothetical protein [Tsukamurella paurometabola DSM
20162]
Length=129
Score = 97.1 bits (240), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 52/105 (50%), Positives = 66/105 (63%), Gaps = 16/105 (15%)
Query 11 FRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLC 70
F+RAF+ P +F+S TEHL+ EA+ AFVDGELRMNAHLRA H++ C
Sbjct 8 FKRAFARRPGEFSS--------------TEHLAFEAVVAFVDGELRMNAHLRAGTHIAQC 53
Query 71 AQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIPR--CPPEG 113
CAAEVD Q + R LR+S I +P+ LLG L++IP C P G
Sbjct 54 PMCAAEVDAQRQVRNTLRESGEISVPNRLLGQLAQIPTECCKPGG 98
>gi|343928011|ref|ZP_08767476.1| hypothetical protein GOALK_100_00150 [Gordonia alkanivorans NBRC
16433]
gi|343762019|dbj|GAA14402.1| hypothetical protein GOALK_100_00150 [Gordonia alkanivorans NBRC
16433]
Length=173
Score = 95.1 bits (235), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 52/99 (53%), Positives = 67/99 (68%), Gaps = 5/99 (5%)
Query 14 AFSWLPAQFASQSDAPVGAP-----RQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLS 68
A SW PA + + AP R+F TEHL+ EA+AAFVDGEL M+AH RA+HHL+
Sbjct 31 ADSWTPAPSSFLPNRGYRAPGTAGGRRFAPTEHLAPEAVAAFVDGELGMSAHARASHHLA 90
Query 69 LCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIP 107
LC +C A VD QS AR LR+S + +P++LLG LS+IP
Sbjct 91 LCPECVAAVDAQSLARTRLRESGQVSVPASLLGALSQIP 129
>gi|333921003|ref|YP_004494584.1| hypothetical protein AS9A_3343 [Amycolicicoccus subflavus DQS3-9A1]
gi|333483224|gb|AEF41784.1| hypothetical protein AS9A_3343 [Amycolicicoccus subflavus DQS3-9A1]
Length=116
Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/78 (59%), Positives = 60/78 (77%), Gaps = 2/78 (2%)
Query 32 APRQ--FRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRD 89
APRQ FR+TEHL+ EAIAA+VDGEL M+A+LRA HLS+C +C +V Q +AR+ALR
Sbjct 14 APRQRAFRATEHLAHEAIAAYVDGELPMSAYLRAGAHLSMCDECRDQVSAQIQARSALRQ 73
Query 90 SHPIRIPSTLLGLLSEIP 107
S P+ +P +LL LS+IP
Sbjct 74 SGPVGVPESLLSALSQIP 91
>gi|296394493|ref|YP_003659377.1| hypothetical protein Srot_2091 [Segniliparus rotundus DSM 44985]
gi|296181640|gb|ADG98546.1| hypothetical protein Srot_2091 [Segniliparus rotundus DSM 44985]
Length=125
Score = 73.6 bits (179), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 35/76 (47%), Positives = 50/76 (66%), Gaps = 0/76 (0%)
Query 32 APRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSH 91
AP+ F S +H+S EA+AA+ DG+L A RA H +C +C+ ++ Q +ARAALR S
Sbjct 27 APKNFWSVDHISFEAVAAYADGKLGEKASARAREHFQMCPECSEQLQAQMQARAALRHSP 86
Query 92 PIRIPSTLLGLLSEIP 107
+++PS LLG L IP
Sbjct 87 RVQVPSELLGTLCAIP 102
>gi|317508222|ref|ZP_07965902.1| agrin [Segniliparus rugosus ATCC BAA-974]
gi|316253397|gb|EFV12787.1| agrin [Segniliparus rugosus ATCC BAA-974]
Length=106
Score = 70.1 bits (170), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 34/76 (45%), Positives = 50/76 (66%), Gaps = 0/76 (0%)
Query 32 APRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSH 91
AP+ F S +H+S EA+AA+ DG+L A +RA H C +C+ E+ Q +ARAALR +
Sbjct 8 APKAFWSVDHVSFEAVAAYADGKLGEKASVRAREHFQACPECSDELQAQLQARAALRQAG 67
Query 92 PIRIPSTLLGLLSEIP 107
+++P+ LLG L IP
Sbjct 68 CVQVPADLLGALCAIP 83
>gi|237785245|ref|YP_002905950.1| anti-sigma factor [Corynebacterium kroppenstedtii DSM 44385]
gi|237758157|gb|ACR17407.1| anti-sigma factor [Corynebacterium kroppenstedtii DSM 44385]
Length=184
Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/113 (39%), Positives = 57/113 (51%), Gaps = 3/113 (2%)
Query 33 PRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRD--S 90
PRQF S EHLS EA+AAFVDGE+ A R HL C +C +V Q A +R+ +
Sbjct 7 PRQFSSIEHLSEEAVAAFVDGEMPPRAQRRVLRHLVHCEECRRDVKAQRDAAQRMREAAN 66
Query 91 HPIRIPSTLLGLLSEIP-RCPPEGPSKGSSGGSSQGPPDGAAAGFGDRFADGD 142
P+ + + LL L+ IP C P+ PS + G A A G D D
Sbjct 67 EPVHMSTELLHKLAAIPTNCDPQNPSGSPAKTEHHGQGKKAQATGGGHDTDSD 119
>gi|38233590|ref|NP_939357.1| hypothetical protein DIP0995 [Corynebacterium diphtheriae NCTC
13129]
gi|38199850|emb|CAE49513.1| Conserved hypothetical protein [Corynebacterium diphtheriae]
Length=137
Score = 65.5 bits (158), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/103 (42%), Positives = 53/103 (52%), Gaps = 2/103 (1%)
Query 28 APVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAAL 87
+P R F S EHL+ EA+AAFVD EL A RA HL CA+C E+ Q RA L
Sbjct 6 SPKNKVRHFASVEHLNPEAVAAFVDNELSPAAAHRAKIHLVHCAECREEIHRQRRAADRL 65
Query 88 RD--SHPIRIPSTLLGLLSEIPRCPPEGPSKGSSGGSSQGPPD 128
RD + +R S L+ L I C P+GP+ S Q D
Sbjct 66 RDGNNSDMRPSSDLIAKLQSIAACCPDGPTAEEVPSSPQSLLD 108
>gi|227487902|ref|ZP_03918218.1| conserved hypothetical protein [Corynebacterium glucuronolyticum
ATCC 51867]
gi|227542541|ref|ZP_03972590.1| conserved hypothetical protein [Corynebacterium glucuronolyticum
ATCC 51866]
gi|227092108|gb|EEI27420.1| conserved hypothetical protein [Corynebacterium glucuronolyticum
ATCC 51867]
gi|227181739|gb|EEI62711.1| conserved hypothetical protein [Corynebacterium glucuronolyticum
ATCC 51866]
Length=125
Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 36/85 (43%), Positives = 49/85 (58%), Gaps = 3/85 (3%)
Query 36 FRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDS--HPI 93
F S EHLS EA+A +VDGEL + A RA HL C+ C EV +Q A L+ + I
Sbjct 13 FSSVEHLSAEAVAGYVDGELTLKAQKRARAHLLHCSICRKEVREQREASLTLKQETRNDI 72
Query 94 RIPSTLLGLLSEI-PRCPPEGPSKG 117
+PS+L+ L+ + P EGP+ G
Sbjct 73 HVPSSLVAKLASMNPDTCEEGPAAG 97
>gi|325002582|ref|ZP_08123694.1| hypothetical protein PseP1_27637 [Pseudonocardia sp. P1]
Length=84
Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/67 (48%), Positives = 42/67 (63%), Gaps = 0/67 (0%)
Query 41 HLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLL 100
HL++EA+ A+VD EL H RA HL C CAAEV +Q RAR+ALR + +P +L+
Sbjct 18 HLTLEAVVAYVDDELARGPHDRATRHLGHCPDCAAEVAEQRRARSALRGADAPTLPPSLM 77
Query 101 GLLSEIP 107
L IP
Sbjct 78 SALRSIP 84
>gi|262201799|ref|YP_003273007.1| hypothetical protein Gbro_1858 [Gordonia bronchialis DSM 43247]
gi|262085146|gb|ACY21114.1| hypothetical protein Gbro_1858 [Gordonia bronchialis DSM 43247]
Length=91
Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/51 (55%), Positives = 37/51 (73%), Gaps = 0/51 (0%)
Query 57 MNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTLLGLLSEIP 107
M AH+RA HHL+LC +C A VD Q+ ARA LR+S + IP +LL L++IP
Sbjct 1 MTAHMRATHHLALCPECVAAVDAQTSARARLRESGRVSIPDSLLSQLTQIP 51
>gi|300858210|ref|YP_003783193.1| anti-sigma factor [Corynebacterium pseudotuberculosis FRC41]
gi|300685664|gb|ADK28586.1| anti-sigma factor [Corynebacterium pseudotuberculosis FRC41]
gi|302205932|gb|ADL10274.1| Anti-sigma factor [Corynebacterium pseudotuberculosis C231]
gi|302330488|gb|ADL20682.1| Anti-sigma factor [Corynebacterium pseudotuberculosis 1002]
gi|308276167|gb|ADO26066.1| anti-sigma factor [Corynebacterium pseudotuberculosis I19]
gi|341824602|gb|AEK92123.1| Anti-sigma factor [Corynebacterium pseudotuberculosis PAT10]
Length=143
Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/91 (46%), Positives = 49/91 (54%), Gaps = 2/91 (2%)
Query 27 DAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAA 86
++ V R F S EHL+ EA+AA VD EL A RA HL C +C EVD Q RA
Sbjct 7 NSKVEKNRHFASVEHLNPEAVAALVDDELSSVAAHRAKIHLVHCKECRDEVDRQRRAADR 66
Query 87 LRDS--HPIRIPSTLLGLLSEIPRCPPEGPS 115
LR S +R S LL L+ I PEGP+
Sbjct 67 LRGSSCSEMRASSDLLDKLNGIAHSCPEGPN 97
>gi|291453975|ref|ZP_06593365.1| conserved hypothetical protein [Streptomyces albus J1074]
gi|291356924|gb|EFE83826.1| conserved hypothetical protein [Streptomyces albus J1074]
Length=299
Score = 59.3 bits (142), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/96 (40%), Positives = 51/96 (54%), Gaps = 2/96 (2%)
Query 40 EHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPSTL 99
+HL + +AA VDGEL A R HL+ C +C AE D Q R ++ + P +
Sbjct 11 QHLG-DRLAALVDGELGHEARERVLAHLATCCKCKAEADAQRRLKSVFATAAPPPPSESF 69
Query 100 LGLLSEIPRCPPEGPSKGSSGGSSQGPPDGAAAGFG 135
L L +P PEGP+ GS G GPP G++A FG
Sbjct 70 LARLQGLPAAGPEGPT-GSGSGFGAGPPFGSSADFG 104
>gi|326382037|ref|ZP_08203730.1| hypothetical protein SCNU_03797 [Gordonia neofelifaecis NRRL
B-59395]
gi|326199463|gb|EGD56644.1| hypothetical protein SCNU_03797 [Gordonia neofelifaecis NRRL
B-59395]
Length=122
Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/96 (41%), Positives = 55/96 (58%), Gaps = 1/96 (1%)
Query 13 RAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQ 72
R WLPA + + P F ST+HL+ EA+ A+VD EL A RA HL++C
Sbjct 10 RGSRWLPAASSITPNPGYRRPTGFASTQHLNPEAVVAYVDNELTAQAAARADAHLAMCPD 69
Query 73 CAAEVDDQSRARAALRD-SHPIRIPSTLLGLLSEIP 107
CA EV Q+RAR+ L+ + + +P +L LS+IP
Sbjct 70 CAREVTAQARARSMLQTCQNDLSVPDSLRAQLSQIP 105
>gi|337290464|ref|YP_004629485.1| anti-sigma factor [Corynebacterium ulcerans BR-AD22]
gi|334696577|gb|AEG81374.1| anti-sigma factor [Corynebacterium ulcerans 809]
gi|334698770|gb|AEG83566.1| anti-sigma factor [Corynebacterium ulcerans BR-AD22]
Length=143
Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 41/88 (47%), Positives = 46/88 (53%), Gaps = 2/88 (2%)
Query 30 VGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRD 89
V R F S EHL+ EA+AA VD EL A RA HL C +C EVD Q RA LR
Sbjct 10 VEKIRHFASVEHLNPEAVAALVDDELSSVAAHRAKIHLVHCKECRDEVDRQRRAADRLRG 69
Query 90 S--HPIRIPSTLLGLLSEIPRCPPEGPS 115
S +R S LL L I PEGP+
Sbjct 70 STHSEMRASSDLLAKLQGIAHSCPEGPN 97
>gi|258652368|ref|YP_003201524.1| hypothetical protein Namu_2157 [Nakamurella multipartita DSM
44233]
gi|258555593|gb|ACV78535.1| hypothetical protein Namu_2157 [Nakamurella multipartita DSM
44233]
Length=176
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/69 (43%), Positives = 45/69 (66%), Gaps = 0/69 (0%)
Query 38 STEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIPS 97
S +HL+++A+ A+ DGE+ + A+ RAA H++ C QC AEV Q AR+ LR + +P+
Sbjct 5 SVDHLTLDAVVAYADGEMPLVAYQRAAAHVARCPQCDAEVRAQLVARSWLRSAETPAMPT 64
Query 98 TLLGLLSEI 106
+LL L I
Sbjct 65 SLLDTLRSI 73
>gi|227548419|ref|ZP_03978468.1| conserved hypothetical protein [Corynebacterium lipophiloflavum
DSM 44291]
gi|227079463|gb|EEI17426.1| conserved hypothetical protein [Corynebacterium lipophiloflavum
DSM 44291]
Length=124
Score = 57.8 bits (138), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 38/82 (47%), Positives = 50/82 (61%), Gaps = 6/82 (7%)
Query 32 APRQ---FRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALR 88
APR+ F STEHLS EA+AAF D EL +A RA H+ LC +C AEV+ Q A LR
Sbjct 9 APRKKKGFFSTEHLSPEAVAAFADQELSESALHRARVHVVLCEECRAEVNHQRAAAEHLR 68
Query 89 DSH---PIRIPSTLLGLLSEIP 107
+ +R P +L+ L+E+P
Sbjct 69 CCNADDSVRAPRSLVQKLAEMP 90
>gi|331694894|ref|YP_004331133.1| hypothetical protein Psed_1029 [Pseudonocardia dioxanivorans
CB1190]
gi|326949583|gb|AEA23280.1| hypothetical protein Psed_1029 [Pseudonocardia dioxanivorans
CB1190]
Length=236
Score = 57.4 bits (137), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 31/79 (40%), Positives = 45/79 (57%), Gaps = 6/79 (7%)
Query 41 HLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARAALRDSHPIRIP---- 96
HL+++AI AF DGEL AH RA HL+ C +CA EV +Q +AR LR + +P
Sbjct 19 HLTLDAIVAFTDGELSAGAHARATAHLAHCPECAEEVVEQDQARLLLRSASAPAMPSSLL 78
Query 97 --STLLGLLSEIPRCPPEG 113
+ + +++P PP G
Sbjct 79 SSLRSIPMDADLPDEPPAG 97
Lambda K H
0.318 0.134 0.408
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 131222683000
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40