BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0190
Length=96
Score E
Sequences producing significant alignments: (Bits) Value
gi|15607331|ref|NP_214704.1| hypothetical protein Rv0190 [Mycoba... 194 3e-48
gi|118616813|ref|YP_905145.1| hypothetical protein MUL_1083 [Myc... 179 2e-43
gi|296167409|ref|ZP_06849811.1| protein of hypothetical function... 179 2e-43
gi|254821526|ref|ZP_05226527.1| hypothetical protein MintA_16452... 177 7e-43
gi|342859351|ref|ZP_08716005.1| hypothetical protein MCOL_10743 ... 176 7e-43
gi|240172070|ref|ZP_04750729.1| hypothetical protein MkanA1_2233... 174 3e-42
gi|41409730|ref|NP_962566.1| hypothetical protein MAP3632 [Mycob... 172 1e-41
gi|108797128|ref|YP_637325.1| hypothetical protein Mmcs_0148 [My... 161 3e-38
gi|126432751|ref|YP_001068442.1| hypothetical protein Mjls_0138 ... 159 9e-38
gi|15828408|ref|NP_302671.1| hypothetical protein ML2609 [Mycoba... 158 2e-37
gi|296165802|ref|ZP_06848307.1| protein of hypothetical function... 157 6e-37
gi|118467640|ref|YP_884645.1| hypothetical protein MSMEG_0230 [M... 157 6e-37
gi|120401203|ref|YP_951032.1| hypothetical protein Mvan_0175 [My... 155 1e-36
gi|333988798|ref|YP_004521412.1| hypothetical protein JDM601_015... 155 2e-36
gi|169631616|ref|YP_001705265.1| hypothetical protein MAB_4542c ... 152 2e-35
gi|183981464|ref|YP_001849755.1| hypothetical protein MMAR_1447 ... 150 7e-35
gi|41409844|ref|NP_962680.1| hypothetical protein MAP3746 [Mycob... 149 1e-34
gi|240169432|ref|ZP_04748091.1| hypothetical protein MkanA1_0897... 147 5e-34
gi|86742724|ref|YP_483124.1| hypothetical protein Francci3_4045 ... 143 1e-32
gi|256394327|ref|YP_003115891.1| hypothetical protein Caci_5191 ... 142 2e-32
gi|288923949|ref|ZP_06418024.1| protein of unknown function DUF1... 140 5e-32
gi|158312530|ref|YP_001505038.1| hypothetical protein Franean1_0... 140 6e-32
gi|111225747|ref|YP_716541.1| hypothetical protein FRAAL6408 [Fr... 140 8e-32
gi|327309913|ref|YP_004336811.1| hypothetical protein Psed_6873 ... 139 1e-31
gi|145221087|ref|YP_001131765.1| hypothetical protein Mflv_0484 ... 139 2e-31
gi|256379649|ref|YP_003103309.1| hypothetical protein Amir_5648 ... 138 2e-31
gi|324999869|ref|ZP_08120981.1| hypothetical protein PseP1_13921... 137 6e-31
gi|317123378|ref|YP_004097490.1| hypothetical protein Intca_0205... 137 6e-31
gi|325003143|ref|ZP_08124255.1| hypothetical protein PseP1_30450... 136 1e-30
gi|258651962|ref|YP_003201118.1| hypothetical protein Namu_1739 ... 136 1e-30
gi|119855094|ref|YP_935699.1| hypothetical protein Mkms_5706 [My... 136 1e-30
gi|323358964|ref|YP_004225360.1| hypothetical protein MTES_2516 ... 135 2e-30
gi|312194420|ref|YP_004014481.1| hypothetical protein FraEuI1c_0... 135 3e-30
gi|337766578|emb|CCB75289.1| conserved protein of unknown functi... 134 4e-30
gi|126437036|ref|YP_001072727.1| hypothetical protein Mjls_4465 ... 133 7e-30
gi|333919705|ref|YP_004493286.1| hypothetical protein AS9A_2037 ... 133 7e-30
gi|284029033|ref|YP_003378964.1| hypothetical protein Kfla_1061 ... 133 9e-30
gi|226349474|ref|YP_002776588.1| hypothetical protein ROP_pROB01... 133 9e-30
gi|54022809|ref|YP_117051.1| hypothetical protein nfa8420 [Nocar... 132 1e-29
gi|297562195|ref|YP_003681169.1| hypothetical protein Ndas_3257 ... 132 1e-29
gi|120404632|ref|YP_954461.1| hypothetical protein Mvan_3668 [My... 132 2e-29
gi|257057621|ref|YP_003135453.1| hypothetical protein Svir_36760... 130 6e-29
gi|284992005|ref|YP_003410559.1| hypothetical protein Gobs_3602 ... 130 6e-29
gi|108798425|ref|YP_638622.1| hypothetical protein Mmcs_1454 [My... 130 7e-29
gi|297158509|gb|ADI08221.1| hypothetical protein SBI_05101 [Stre... 130 8e-29
gi|84495528|ref|ZP_00994647.1| hypothetical protein JNB_12019 [J... 130 9e-29
gi|170781089|ref|YP_001709421.1| hypothetical protein CMS_0652 [... 129 2e-28
gi|284029493|ref|YP_003379424.1| hypothetical protein Kfla_1527 ... 129 2e-28
gi|111020527|ref|YP_703499.1| hypothetical protein RHA1_ro03538 ... 129 2e-28
gi|296271355|ref|YP_003653987.1| hypothetical protein Tbis_3404 ... 128 2e-28
>gi|15607331|ref|NP_214704.1| hypothetical protein Rv0190 [Mycobacterium tuberculosis H37Rv]
gi|15839569|ref|NP_334606.1| hypothetical protein MT0200 [Mycobacterium tuberculosis CDC1551]
gi|31791368|ref|NP_853861.1| hypothetical protein Mb0196 [Mycobacterium bovis AF2122/97]
79 more sequence titles
Length=96
Score = 194 bits (494), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 96/96 (100%), Positives = 96/96 (100%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL
Sbjct 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS
Sbjct 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
>gi|118616813|ref|YP_905145.1| hypothetical protein MUL_1083 [Mycobacterium ulcerans Agy99]
gi|183980464|ref|YP_001848755.1| hypothetical protein MMAR_0433 [Mycobacterium marinum M]
gi|118568923|gb|ABL03674.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
gi|183173790|gb|ACC38900.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=96
Score = 179 bits (453), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 87/96 (91%), Positives = 92/96 (96%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
MT AHGY+QQK+NYAKRLRR+EGQVRGIARMI+EDKYCIDVLTQISAV SALRSVALNLL
Sbjct 1 MTTAHGYSQQKENYAKRLRRIEGQVRGIARMIDEDKYCIDVLTQISAVNSALRSVALNLL 60
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
DEHL+HCVTRAVAEGG ADGKLAEASAAIARLVRS
Sbjct 61 DEHLNHCVTRAVAEGGTDADGKLAEASAAIARLVRS 96
>gi|296167409|ref|ZP_06849811.1| protein of hypothetical function DUF156 [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295897353|gb|EFG76957.1| protein of hypothetical function DUF156 [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=96
Score = 179 bits (453), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/96 (92%), Positives = 91/96 (95%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
MT AHGY+QQKDNYAKRLRR+EGQVRGIARMIEEDKYCIDVLTQISAV SALRSVALNLL
Sbjct 1 MTNAHGYSQQKDNYAKRLRRIEGQVRGIARMIEEDKYCIDVLTQISAVNSALRSVALNLL 60
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
DEHL+HCVTRAVAEGG AD KLAEASAAIARLVRS
Sbjct 61 DEHLNHCVTRAVAEGGDDADQKLAEASAAIARLVRS 96
>gi|254821526|ref|ZP_05226527.1| hypothetical protein MintA_16452 [Mycobacterium intracellulare
ATCC 13950]
Length=96
Score = 177 bits (448), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 87/96 (91%), Positives = 90/96 (94%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
MT HGY+QQKDNYAKRLRR+EGQVRGIARMIEEDKYCIDVLTQISAV SALRSVALNLL
Sbjct 1 MTDEHGYSQQKDNYAKRLRRIEGQVRGIARMIEEDKYCIDVLTQISAVNSALRSVALNLL 60
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
DEHLSHCVTRAVAEGG AD KL+EASAAIARLVRS
Sbjct 61 DEHLSHCVTRAVAEGGDEADTKLSEASAAIARLVRS 96
>gi|342859351|ref|ZP_08716005.1| hypothetical protein MCOL_10743 [Mycobacterium colombiense CECT
3035]
gi|342133592|gb|EGT86795.1| hypothetical protein MCOL_10743 [Mycobacterium colombiense CECT
3035]
Length=96
Score = 176 bits (447), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 86/96 (90%), Positives = 90/96 (94%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
MT HGY+QQKDNYAKRLRR+EGQVRGIARMIEEDKYCIDVLTQ+SAV SALRSVALNLL
Sbjct 1 MTNDHGYSQQKDNYAKRLRRIEGQVRGIARMIEEDKYCIDVLTQVSAVNSALRSVALNLL 60
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
DEHL+HCVTRAVAEGG AD KLAEASAAIARLVRS
Sbjct 61 DEHLNHCVTRAVAEGGDEADNKLAEASAAIARLVRS 96
>gi|240172070|ref|ZP_04750729.1| hypothetical protein MkanA1_22330 [Mycobacterium kansasii ATCC
12478]
Length=96
Score = 174 bits (442), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 84/96 (88%), Positives = 90/96 (94%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
MT +GY+QQKDNYAKRLRR+EGQVRGIARMIEEDKYCID+LTQISA+ SALRSVALNLL
Sbjct 1 MTTDYGYSQQKDNYAKRLRRIEGQVRGIARMIEEDKYCIDILTQISAINSALRSVALNLL 60
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
DEHL+HCVTRAVAEGG AD KLAEASAAIARLVRS
Sbjct 61 DEHLNHCVTRAVAEGGTEADSKLAEASAAIARLVRS 96
>gi|41409730|ref|NP_962566.1| hypothetical protein MAP3632 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118466419|ref|YP_884109.1| hypothetical protein MAV_4988 [Mycobacterium avium 104]
gi|254777428|ref|ZP_05218944.1| hypothetical protein MaviaA2_22556 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|41398562|gb|AAS06182.1| hypothetical protein MAP_3632 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118167706|gb|ABK68603.1| conserved hypothetical protein [Mycobacterium avium 104]
gi|336460062|gb|EGO38971.1| hypothetical protein MAPs_44440 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=96
Score = 172 bits (436), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 86/96 (90%), Positives = 88/96 (92%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
MT HGY+QQKDNYAKRLRR+EGQVRGIARMIEEDKYCIDVLTQISAV SALRSVALNLL
Sbjct 1 MTNEHGYSQQKDNYAKRLRRIEGQVRGIARMIEEDKYCIDVLTQISAVNSALRSVALNLL 60
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
DEHL HCVTRAVA GG AD KLAEASAAIARLVRS
Sbjct 61 DEHLGHCVTRAVAGGGDDADEKLAEASAAIARLVRS 96
>gi|108797128|ref|YP_637325.1| hypothetical protein Mmcs_0148 [Mycobacterium sp. MCS]
gi|119866213|ref|YP_936165.1| hypothetical protein Mkms_0157 [Mycobacterium sp. KMS]
gi|108767547|gb|ABG06269.1| protein of unknown function DUF156 [Mycobacterium sp. MCS]
gi|119692302|gb|ABL89375.1| protein of unknown function DUF156 [Mycobacterium sp. KMS]
Length=103
Score = 161 bits (407), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 79/95 (84%), Positives = 86/95 (91%), Gaps = 0/95 (0%)
Query 2 TAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLD 61
A HGY+ QK+NYAKRLRR+EGQVRGIA+MIE+DKYCIDVLTQISAV SAL+SVAL LLD
Sbjct 9 VAVHGYSAQKENYAKRLRRIEGQVRGIAKMIEDDKYCIDVLTQISAVNSALQSVALGLLD 68
Query 62 EHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
EHL HCVT+AVAEGG AD KLAEASAAIARLVRS
Sbjct 69 EHLGHCVTQAVAEGGEQADAKLAEASAAIARLVRS 103
>gi|126432751|ref|YP_001068442.1| hypothetical protein Mjls_0138 [Mycobacterium sp. JLS]
gi|126232551|gb|ABN95951.1| protein of unknown function DUF156 [Mycobacterium sp. JLS]
Length=103
Score = 159 bits (403), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 79/95 (84%), Positives = 86/95 (91%), Gaps = 0/95 (0%)
Query 2 TAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLD 61
A HGY+ QK+NYAKRLRR+EGQVRGIA+MIE+DKYCIDVLTQISAV SAL+SVAL LLD
Sbjct 9 VAVHGYSAQKENYAKRLRRIEGQVRGIAKMIEDDKYCIDVLTQISAVNSALQSVALGLLD 68
Query 62 EHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
EHL HCVT+AVAEGG AD KLAEASAAIARLVRS
Sbjct 69 EHLGHCVTQAVAEGGEQADVKLAEASAAIARLVRS 103
>gi|15828408|ref|NP_302671.1| hypothetical protein ML2609 [Mycobacterium leprae TN]
gi|221230885|ref|YP_002504301.1| hypothetical protein MLBr_02609 [Mycobacterium leprae Br4923]
gi|2104595|emb|CAB08799.1| unknown [Mycobacterium leprae]
gi|13093838|emb|CAC32141.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219933992|emb|CAR72709.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=135
Score = 158 bits (400), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 76/92 (83%), Positives = 86/92 (94%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
+GY+QQK NYAKRLRR+EGQVRGIARMI+EDKYCID+LTQISAV++ALRSVALNLLDEHL
Sbjct 44 YGYSQQKGNYAKRLRRIEGQVRGIARMIDEDKYCIDILTQISAVSNALRSVALNLLDEHL 103
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+CV+RAVAEGG A+ K AEASAAIARLVRS
Sbjct 104 EYCVSRAVAEGGSEAEDKFAEASAAIARLVRS 135
>gi|296165802|ref|ZP_06848307.1| protein of hypothetical function DUF156 [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898845|gb|EFG78346.1| protein of hypothetical function DUF156 [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=96
Score = 157 bits (397), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 77/96 (81%), Positives = 85/96 (89%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
MT GY+ QKDNYAKRLRR+EGQVRGIA+MI++DKYCIDVLTQISA +SALRSVALNLL
Sbjct 1 MTTPLGYSAQKDNYAKRLRRIEGQVRGIAKMIDDDKYCIDVLTQISAASSALRSVALNLL 60
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+EHL HC+TRA AEGG A KLAEASAAIARLVRS
Sbjct 61 NEHLDHCLTRAAAEGGDQAHAKLAEASAAIARLVRS 96
>gi|118467640|ref|YP_884645.1| hypothetical protein MSMEG_0230 [Mycobacterium smegmatis str.
MC2 155]
gi|118168927|gb|ABK69823.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=101
Score = 157 bits (396), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 76/95 (80%), Positives = 86/95 (91%), Gaps = 0/95 (0%)
Query 2 TAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLD 61
+ HGY+ QK+NYAKRLRR+EGQVRGIA+MI+EDKYCIDVLTQISAV SAL+SVAL LLD
Sbjct 7 SGVHGYSAQKENYAKRLRRIEGQVRGIAKMIDEDKYCIDVLTQISAVNSALQSVALGLLD 66
Query 62 EHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
EHL HCV++AVAEGG A+ KLAEASAAIARLVRS
Sbjct 67 EHLGHCVSQAVAEGGQEAEAKLAEASAAIARLVRS 101
>gi|120401203|ref|YP_951032.1| hypothetical protein Mvan_0175 [Mycobacterium vanbaalenii PYR-1]
gi|119954021|gb|ABM11026.1| protein of unknown function DUF156 [Mycobacterium vanbaalenii
PYR-1]
Length=145
Score = 155 bits (393), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 76/93 (82%), Positives = 83/93 (90%), Gaps = 0/93 (0%)
Query 4 AHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEH 63
AHGY+ QK NYAKRLRR+EGQVRGIA+MI+EDKYCID+LTQISAV SAL+SVAL LLDEH
Sbjct 53 AHGYSGQKANYAKRLRRIEGQVRGIAKMIDEDKYCIDILTQISAVNSALQSVALGLLDEH 112
Query 64 LSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
LSHCV+ AV GG AD KLAEASAAIARLVRS
Sbjct 113 LSHCVSHAVTVGGSEADAKLAEASAAIARLVRS 145
>gi|333988798|ref|YP_004521412.1| hypothetical protein JDM601_0158 [Mycobacterium sp. JDM601]
gi|333484766|gb|AEF34158.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=97
Score = 155 bits (391), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 77/95 (82%), Positives = 84/95 (89%), Gaps = 0/95 (0%)
Query 2 TAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLD 61
T A GY+ +KDNYAKRLRR+EGQVRGIA+MI+EDKYCIDVLTQISAV SALRSVALNLLD
Sbjct 3 TDAPGYSPRKDNYAKRLRRIEGQVRGIAKMIDEDKYCIDVLTQISAVNSALRSVALNLLD 62
Query 62 EHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
EHL+HCV+ AV GG AD KL EASAAIARLVRS
Sbjct 63 EHLAHCVSGAVTAGGDEADAKLTEASAAIARLVRS 97
>gi|169631616|ref|YP_001705265.1| hypothetical protein MAB_4542c [Mycobacterium abscessus ATCC
19977]
gi|169243583|emb|CAM64611.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=100
Score = 152 bits (383), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 75/95 (79%), Positives = 82/95 (87%), Gaps = 0/95 (0%)
Query 2 TAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLD 61
+ HGY+ +KD+YAKRL R+EGQVRGIA+MI+EDKYCIDVLTQISA SALRSVAL LLD
Sbjct 6 STGHGYSLKKDDYAKRLLRIEGQVRGIAKMIDEDKYCIDVLTQISAAQSALRSVALGLLD 65
Query 62 EHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
EHL HCVT AVA GG AD KLAEASAAIARLVRS
Sbjct 66 EHLGHCVTNAVASGGADADEKLAEASAAIARLVRS 100
>gi|183981464|ref|YP_001849755.1| hypothetical protein MMAR_1447 [Mycobacterium marinum M]
gi|183174790|gb|ACC39900.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=96
Score = 150 bits (379), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 72/96 (75%), Positives = 83/96 (87%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
M A +GY KDNYAKRLRR+EGQVRGIA+MI++DKYCIDVLTQISA +S+LR+VALNLL
Sbjct 1 MEAPYGYVAHKDNYAKRLRRIEGQVRGIAKMIDDDKYCIDVLTQISAASSSLRAVALNLL 60
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
DEHL HCV++A+ EGG A KL EASAAIARLVRS
Sbjct 61 DEHLDHCVSQAITEGGDEAQVKLTEASAAIARLVRS 96
>gi|41409844|ref|NP_962680.1| hypothetical protein MAP3746 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41398676|gb|AAS06296.1| hypothetical protein MAP_3746 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|336460147|gb|EGO39052.1| hypothetical protein MAPs_43270 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=104
Score = 149 bits (377), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/92 (80%), Positives = 80/92 (87%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
+GY KD YAKRLRRVEGQ+RGIA+MIEEDKYCIDVLTQISA +ALRSVALNLLDEHL
Sbjct 13 YGYVAHKDGYAKRLRRVEGQIRGIAKMIEEDKYCIDVLTQISAANNALRSVALNLLDEHL 72
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HCV+ A+AEGG A KL+EASAAIARLVRS
Sbjct 73 DHCVSSALAEGGDEAQAKLSEASAAIARLVRS 104
>gi|240169432|ref|ZP_04748091.1| hypothetical protein MkanA1_08974 [Mycobacterium kansasii ATCC
12478]
Length=101
Score = 147 bits (371), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 69/96 (72%), Positives = 84/96 (88%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
M + +GY KD+YAKRLRR+EGQ+RGIA+MI++DKYCID+LTQISA +SALRSVALNLL
Sbjct 6 MESTYGYVAHKDSYAKRLRRIEGQIRGIAKMIDDDKYCIDILTQISAASSALRSVALNLL 65
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
D+HL CV++A+ +GG AD KLAEASAAIARLVRS
Sbjct 66 DDHLDSCVSQAITQGGDEADVKLAEASAAIARLVRS 101
>gi|86742724|ref|YP_483124.1| hypothetical protein Francci3_4045 [Frankia sp. CcI3]
gi|86569586|gb|ABD13395.1| protein of unknown function DUF156 [Frankia sp. CcI3]
Length=93
Score = 143 bits (360), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 68/92 (74%), Positives = 78/92 (85%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGYT KD+Y RLRR+EGQVRG+ RMI EDKYCID+LTQ+SA T AL+SVAL LL++HL
Sbjct 2 HGYTNTKDDYTARLRRIEGQVRGLQRMIAEDKYCIDILTQVSATTRALQSVALGLLEDHL 61
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+HCVT A AEGGP AD K+ EASAAIARLVRS
Sbjct 62 THCVTHAAAEGGPVADEKIREASAAIARLVRS 93
>gi|256394327|ref|YP_003115891.1| hypothetical protein Caci_5191 [Catenulispora acidiphila DSM
44928]
gi|256360553|gb|ACU74050.1| protein of unknown function DUF156 [Catenulispora acidiphila
DSM 44928]
Length=93
Score = 142 bits (357), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 66/92 (72%), Positives = 80/92 (87%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGYT +KD+Y KRLRR+EGQVRG+ RM++EDKYCID+LTQISAVT AL+SVAL LL+EH+
Sbjct 2 HGYTDRKDDYTKRLRRIEGQVRGLQRMVDEDKYCIDILTQISAVTKALQSVALGLLEEHV 61
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+HCV A+A GGP AD K+ EA+ AIARLVRS
Sbjct 62 AHCVADALAVGGPEADAKVKEANEAIARLVRS 93
>gi|288923949|ref|ZP_06418024.1| protein of unknown function DUF156 [Frankia sp. EUN1f]
gi|288344710|gb|EFC79164.1| protein of unknown function DUF156 [Frankia sp. EUN1f]
Length=118
Score = 140 bits (354), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 67/92 (73%), Positives = 77/92 (84%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGYT KD Y RLRR+EGQVRG+ RMI EDKYCID+LTQ+SA T AL+SVAL LL +HL
Sbjct 27 HGYTSGKDEYTARLRRIEGQVRGLQRMIAEDKYCIDILTQVSAATRALQSVALGLLSDHL 86
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+HCV +AVA+GGP AD KL EA+AAIARLVRS
Sbjct 87 AHCVAQAVADGGPAADEKLREATAAIARLVRS 118
>gi|158312530|ref|YP_001505038.1| hypothetical protein Franean1_0673 [Frankia sp. EAN1pec]
gi|158107935|gb|ABW10132.1| protein of unknown function DUF156 [Frankia sp. EAN1pec]
Length=152
Score = 140 bits (353), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 65/92 (71%), Positives = 76/92 (83%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY KD Y RLRR+EGQVRG+ RM+ EDKYCID+LTQ+SA T AL+SVAL LL +HL
Sbjct 61 HGYANSKDEYTARLRRIEGQVRGLQRMVAEDKYCIDILTQVSAATRALQSVALGLLSDHL 120
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+HCV +AVAEGGP AD K+ EA+AAIARLVRS
Sbjct 121 AHCVAQAVAEGGPAADEKVREATAAIARLVRS 152
>gi|111225747|ref|YP_716541.1| hypothetical protein FRAAL6408 [Frankia alni ACN14a]
gi|111153279|emb|CAJ65031.1| conserved hypothetical protein [Frankia alni ACN14a]
Length=95
Score = 140 bits (352), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 67/92 (73%), Positives = 76/92 (83%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGYT KD Y RLRR+EGQVRG+ RMI EDKYCID+LTQ+SA T AL+SVAL LL++HL
Sbjct 4 HGYTNTKDEYTARLRRIEGQVRGLQRMIAEDKYCIDILTQVSAATRALQSVALGLLEDHL 63
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+HCV A AEGGP AD KL EASAAIARL+RS
Sbjct 64 AHCVAHAAAEGGPVADEKLREASAAIARLIRS 95
>gi|327309913|ref|YP_004336811.1| hypothetical protein Psed_6873 [Pseudonocardia dioxanivorans
CB1190]
gi|326955248|gb|AEA28944.1| protein of unknown function DUF156 [Pseudonocardia dioxanivorans
CB1190]
Length=93
Score = 139 bits (350), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 65/92 (71%), Positives = 78/92 (85%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY KDN+ KR+RR+EGQVRGI++MIE DKYCIDVLTQ++AV AL +VAL LLD+HL
Sbjct 2 HGYADNKDNHLKRMRRIEGQVRGISKMIESDKYCIDVLTQVAAVNKALEAVALGLLDDHL 61
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HCV+ AVAEGG AD K++EASAAIARLV+S
Sbjct 62 KHCVSDAVAEGGQVADQKISEASAAIARLVKS 93
>gi|145221087|ref|YP_001131765.1| hypothetical protein Mflv_0484 [Mycobacterium gilvum PYR-GCK]
gi|315441949|ref|YP_004074828.1| hypothetical protein Mspyr1_02770 [Mycobacterium sp. Spyr1]
gi|145213573|gb|ABP42977.1| protein of unknown function DUF156 [Mycobacterium gilvum PYR-GCK]
gi|315260252|gb|ADT96993.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1]
Length=104
Score = 139 bits (349), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/92 (83%), Positives = 83/92 (91%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY+ QK+NYAKRLRR+EGQVRGIA+MI+EDKYCIDVLTQISAV SAL+SVAL LLDEHL
Sbjct 13 HGYSAQKENYAKRLRRIEGQVRGIAKMIDEDKYCIDVLTQISAVNSALQSVALGLLDEHL 72
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HCV+ AVA GG AD KLAEASAAIARLVRS
Sbjct 73 GHCVSHAVAAGGADADAKLAEASAAIARLVRS 104
>gi|256379649|ref|YP_003103309.1| hypothetical protein Amir_5648 [Actinosynnema mirum DSM 43827]
gi|255923952|gb|ACU39463.1| protein of unknown function DUF156 [Actinosynnema mirum DSM 43827]
Length=93
Score = 138 bits (348), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 66/92 (72%), Positives = 78/92 (85%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGYT+ KD Y KRLRR+EGQVRG+ RM+E D+YCIDVLTQISA T AL++V+L LLDEHL
Sbjct 2 HGYTEDKDAYLKRLRRIEGQVRGLQRMVENDEYCIDVLTQISAATKALQAVSLGLLDEHL 61
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HCV +A A+GG A+ K+AEASAAIARLVRS
Sbjct 62 KHCVAQAAAQGGEVAEEKIAEASAAIARLVRS 93
>gi|324999869|ref|ZP_08120981.1| hypothetical protein PseP1_13921 [Pseudonocardia sp. P1]
Length=93
Score = 137 bits (345), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 66/92 (72%), Positives = 74/92 (81%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY KD + R+RR+EGQVRGI+RMIE DKYCIDVLTQ++AV AL +VAL LLDEHL
Sbjct 2 HGYADNKDAHVNRMRRIEGQVRGISRMIESDKYCIDVLTQVAAVNKALEAVALGLLDEHL 61
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HCV A AEGGP A+ KL EASAAIARLVRS
Sbjct 62 KHCVADAAAEGGPVAEQKLEEASAAIARLVRS 93
>gi|317123378|ref|YP_004097490.1| hypothetical protein Intca_0205 [Intrasporangium calvum DSM 43043]
gi|315587466|gb|ADU46763.1| protein of unknown function DUF156 [Intrasporangium calvum DSM
43043]
Length=103
Score = 137 bits (345), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 64/92 (70%), Positives = 78/92 (85%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY QKD+Y +RLRR+EGQ RG+ RM++E++YCID+LTQISA+T AL+SVAL LLDEH+
Sbjct 12 HGYIHQKDDYLRRLRRIEGQSRGLQRMVDEEQYCIDILTQISAMTKALQSVALGLLDEHI 71
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+HCV AV EGGP A+ KL EAS AIARLVRS
Sbjct 72 AHCVVDAVREGGPDAELKLKEASDAIARLVRS 103
>gi|325003143|ref|ZP_08124255.1| hypothetical protein PseP1_30450 [Pseudonocardia sp. P1]
Length=94
Score = 136 bits (343), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 64/92 (70%), Positives = 74/92 (81%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY KD++ KR+RR+EGQVRGI +MIE DKYCID+LTQ+SAV AL +VAL LLDEHL
Sbjct 3 HGYADSKDSHIKRMRRIEGQVRGITKMIESDKYCIDILTQVSAVNKALEAVALGLLDEHL 62
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HCV A EGGP A+ K+ EASAAIARLVRS
Sbjct 63 KHCVADAATEGGPVAEQKIQEASAAIARLVRS 94
>gi|258651962|ref|YP_003201118.1| hypothetical protein Namu_1739 [Nakamurella multipartita DSM
44233]
gi|258555187|gb|ACV78129.1| protein of unknown function DUF156 [Nakamurella multipartita
DSM 44233]
Length=103
Score = 136 bits (342), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 67/103 (66%), Positives = 80/103 (78%), Gaps = 7/103 (6%)
Query 1 MTAA-------HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALR 53
MTAA HGY +KD+Y KRLRR+EGQ RG+ +M+E++KYCID+LTQ+SA+T AL+
Sbjct 1 MTAASSEHAGPHGYISRKDDYLKRLRRIEGQARGLQKMVEDEKYCIDILTQVSAMTKALQ 60
Query 54 SVALNLLDEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
SVAL LLDEHLSHCV A GGP AD K+ EAS AIARLVRS
Sbjct 61 SVALGLLDEHLSHCVVHAAQAGGPEADEKIREASEAIARLVRS 103
>gi|119855094|ref|YP_935699.1| hypothetical protein Mkms_5706 [Mycobacterium sp. KMS]
gi|145225900|ref|YP_001136554.1| hypothetical protein Mflv_5303 [Mycobacterium gilvum PYR-GCK]
gi|119697812|gb|ABL94884.1| protein of unknown function DUF156 [Mycobacterium sp. KMS]
gi|145218363|gb|ABP47766.1| protein of unknown function DUF156 [Mycobacterium gilvum PYR-GCK]
Length=97
Score = 136 bits (342), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 68/95 (72%), Positives = 78/95 (83%), Gaps = 0/95 (0%)
Query 2 TAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLD 61
TA GYT KD + KRLRR+EGQVRG+ARM++ED YCIDVLTQISA T AL SVAL+LLD
Sbjct 3 TATPGYTASKDAHLKRLRRIEGQVRGLARMVDEDAYCIDVLTQISAATKALESVALSLLD 62
Query 62 EHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
EHLSHCV A+ +GG A+ K+ EASAAIARLVRS
Sbjct 63 EHLSHCVRDAIHQGGDMAEQKINEASAAIARLVRS 97
>gi|323358964|ref|YP_004225360.1| hypothetical protein MTES_2516 [Microbacterium testaceum StLB037]
gi|323275335|dbj|BAJ75480.1| uncharacterized protein conserved in bacteria [Microbacterium
testaceum StLB037]
Length=93
Score = 135 bits (340), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 64/92 (70%), Positives = 77/92 (84%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY KD+ KRLRR EGQVRGIARM+++D+YCID+LTQ+SAVT AL +VAL+LLD+HL
Sbjct 2 HGYVGHKDDLLKRLRRAEGQVRGIARMVDDDQYCIDILTQVSAVTKALENVALSLLDDHL 61
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
SHCV A A+GGP A+ KL EA+ AIARLVRS
Sbjct 62 SHCVAEAAAQGGPVAEEKLKEANQAIARLVRS 93
>gi|312194420|ref|YP_004014481.1| hypothetical protein FraEuI1c_0529 [Frankia sp. EuI1c]
gi|311225756|gb|ADP78611.1| protein of unknown function DUF156 [Frankia sp. EuI1c]
Length=143
Score = 135 bits (339), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 63/92 (69%), Positives = 77/92 (84%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY+ +++Y RLRRVEGQVRG+ RM+ +DKYCID+LTQ++AVT AL+SVAL LL++HL
Sbjct 52 HGYSHHREDYLARLRRVEGQVRGLQRMVSDDKYCIDILTQVAAVTRALQSVALGLLEDHL 111
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HCV A AEGGP A+ KL EASAAIARLVRS
Sbjct 112 GHCVADAAAEGGPVAEAKLREASAAIARLVRS 143
>gi|337766578|emb|CCB75289.1| conserved protein of unknown function [Streptomyces cattleya
NRRL 8057]
Length=121
Score = 134 bits (337), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 61/92 (67%), Positives = 77/92 (84%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY++QKD + KRLRR+EGQ+RG+ RM++ED YCID+LTQ+SA T AL+S AL LL+EHL
Sbjct 30 HGYSKQKDQHLKRLRRIEGQIRGLQRMVDEDVYCIDILTQVSAGTKALQSFALQLLEEHL 89
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HCVT A GGP AD K+AEA+AAIARL+R+
Sbjct 90 RHCVTAAAVHGGPEADAKVAEATAAIARLLRT 121
>gi|126437036|ref|YP_001072727.1| hypothetical protein Mjls_4465 [Mycobacterium sp. JLS]
gi|126437115|ref|YP_001072806.1| hypothetical protein Mjls_4548 [Mycobacterium sp. JLS]
gi|126236836|gb|ABO00237.1| protein of unknown function DUF156 [Mycobacterium sp. JLS]
gi|126236915|gb|ABO00316.1| protein of unknown function DUF156 [Mycobacterium sp. JLS]
Length=109
Score = 133 bits (335), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 64/95 (68%), Positives = 73/95 (77%), Gaps = 0/95 (0%)
Query 2 TAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLD 61
T HGY KD Y KRL+R+EGQ RGI+RMIEE++YCID+LTQ A+T AL+ VAL LLD
Sbjct 15 TTHHGYITDKDKYLKRLKRIEGQARGISRMIEEERYCIDILTQTDALTKALQGVALALLD 74
Query 62 EHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+HL HCV A A GGP AD KL EAS AIARLVRS
Sbjct 75 DHLRHCVRDAAATGGPAADAKLTEASEAIARLVRS 109
>gi|333919705|ref|YP_004493286.1| hypothetical protein AS9A_2037 [Amycolicicoccus subflavus DQS3-9A1]
gi|333481926|gb|AEF40486.1| hypothetical protein AS9A_2037 [Amycolicicoccus subflavus DQS3-9A1]
Length=105
Score = 133 bits (335), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 61/92 (67%), Positives = 75/92 (82%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY KD Y KRLRR+EGQ RG+ RM+E++KYCID+LTQ+SA+ SAL++VAL LLD+H+
Sbjct 14 HGYITSKDAYLKRLRRIEGQARGLQRMVEDEKYCIDILTQVSAMKSALQAVALGLLDDHM 73
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+HCV AVAEGGP A+ KL E S AI RLVRS
Sbjct 74 AHCVAEAVAEGGPEAEAKLKEVSDAITRLVRS 105
>gi|284029033|ref|YP_003378964.1| hypothetical protein Kfla_1061 [Kribbella flavida DSM 17836]
gi|283808326|gb|ADB30165.1| protein of unknown function DUF156 [Kribbella flavida DSM 17836]
Length=102
Score = 133 bits (334), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 61/92 (67%), Positives = 75/92 (82%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGYT +K ++ KRLRR+EGQ+RG+ RM+EEDKYCID+LTQ+SA T AL+S +L LL+EHL
Sbjct 11 HGYTAEKSSHLKRLRRIEGQIRGLQRMVEEDKYCIDILTQVSAATKALQSFSLELLEEHL 70
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
SHCV A GGP A+ K+ EAS AIARLVRS
Sbjct 71 SHCVVEAAQRGGPEAEEKVREASDAIARLVRS 102
>gi|226349474|ref|YP_002776588.1| hypothetical protein ROP_pROB01-02370 [Rhodococcus opacus B4]
gi|226245389|dbj|BAH55736.1| hypothetical protein [Rhodococcus opacus B4]
Length=114
Score = 133 bits (334), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 60/94 (64%), Positives = 77/94 (82%), Gaps = 0/94 (0%)
Query 3 AAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDE 62
AAHGY +KD Y KRL+R+EGQ RG+ RM+EEDKYCID+LTQ+SA+T AL++VA+ LL++
Sbjct 21 AAHGYITEKDAYLKRLKRIEGQARGLQRMVEEDKYCIDILTQVSAMTKALQAVAMGLLED 80
Query 63 HLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
H+SHCV A GGP A+ K+ EA+ AIARLVRS
Sbjct 81 HISHCVVDAAVAGGPEAEAKIKEATDAIARLVRS 114
>gi|54022809|ref|YP_117051.1| hypothetical protein nfa8420 [Nocardia farcinica IFM 10152]
gi|54014317|dbj|BAD55687.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=106
Score = 132 bits (333), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 60/94 (64%), Positives = 77/94 (82%), Gaps = 0/94 (0%)
Query 3 AAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDE 62
AAHGY KD+Y KRLRR+EGQ RG+ RM+EE+KYCID+LTQ+SA+T AL++VA+ LL++
Sbjct 13 AAHGYITSKDDYLKRLRRIEGQARGLQRMVEEEKYCIDILTQVSAMTKALQAVAMGLLED 72
Query 63 HLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
H+SHCV A GGP A+ K+ EA+ AIARLVRS
Sbjct 73 HISHCVVDAAVAGGPEAEAKIKEATDAIARLVRS 106
>gi|297562195|ref|YP_003681169.1| hypothetical protein Ndas_3257 [Nocardiopsis dassonvillei subsp.
dassonvillei DSM 43111]
gi|296846643|gb|ADH68663.1| protein of unknown function DUF156 [Nocardiopsis dassonvillei
subsp. dassonvillei DSM 43111]
Length=96
Score = 132 bits (333), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 61/96 (64%), Positives = 75/96 (79%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
M A HGY+ K ++ RLRR+EGQ+RG+ RM+E+D+YCID+LTQ SA ALRS AL++L
Sbjct 1 MAATHGYSNDKQSHINRLRRIEGQIRGLQRMVEDDQYCIDILTQTSAANKALRSFALSML 60
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
DEHL HCV+ AVA GG AD K+ EAS AIARLVRS
Sbjct 61 DEHLKHCVSNAVANGGEEADEKVREASEAIARLVRS 96
>gi|120404632|ref|YP_954461.1| hypothetical protein Mvan_3668 [Mycobacterium vanbaalenii PYR-1]
gi|119957450|gb|ABM14455.1| protein of unknown function DUF156 [Mycobacterium vanbaalenii
PYR-1]
Length=97
Score = 132 bits (331), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 67/95 (71%), Positives = 76/95 (80%), Gaps = 0/95 (0%)
Query 2 TAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLD 61
T GY KD + KRLRR+EGQVRG+ARM++ED YCIDVLTQISA T AL SVAL+LLD
Sbjct 3 TKTPGYAASKDAHLKRLRRIEGQVRGLARMVDEDAYCIDVLTQISAATKALESVALSLLD 62
Query 62 EHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
EHLSHCV AV +GG A+ K+ EASAAIARLVRS
Sbjct 63 EHLSHCVRDAVHQGGDIAEQKIDEASAAIARLVRS 97
>gi|257057621|ref|YP_003135453.1| hypothetical protein Svir_36760 [Saccharomonospora viridis DSM
43017]
gi|256587493|gb|ACU98626.1| uncharacterized conserved protein [Saccharomonospora viridis
DSM 43017]
Length=93
Score = 130 bits (327), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 62/91 (69%), Positives = 75/91 (83%), Gaps = 0/91 (0%)
Query 6 GYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHLS 65
GYT KD Y KRLRR+EGQ+RG+ RM+E D+YCIDVLTQI+A T AL++V+L LL+EHL
Sbjct 3 GYTSDKDAYLKRLRRIEGQIRGLQRMVENDEYCIDVLTQIAAATKALQAVSLGLLEEHLR 62
Query 66 HCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HCV A++EGG AD K+ EASAAIARLVRS
Sbjct 63 HCVAEAISEGGERADEKVREASAAIARLVRS 93
>gi|284992005|ref|YP_003410559.1| hypothetical protein Gobs_3602 [Geodermatophilus obscurus DSM
43160]
gi|284065250|gb|ADB76188.1| protein of unknown function DUF156 [Geodermatophilus obscurus
DSM 43160]
Length=100
Score = 130 bits (327), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 61/92 (67%), Positives = 76/92 (83%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY +KD+Y KRLRR+EGQ RG+ RM+E++KYCID+LTQ+SA+T AL+SVAL L++EHL
Sbjct 9 HGYIHRKDDYLKRLRRIEGQARGLQRMVEDEKYCIDILTQVSAMTKALQSVALGLVEEHL 68
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
SHCV +A GG AD K+ EAS AIARLVRS
Sbjct 69 SHCVVQAAQAGGREADEKVREASEAIARLVRS 100
>gi|108798425|ref|YP_638622.1| hypothetical protein Mmcs_1454 [Mycobacterium sp. MCS]
gi|119867522|ref|YP_937474.1| hypothetical protein Mkms_1472 [Mycobacterium sp. KMS]
gi|108768844|gb|ABG07566.1| protein of unknown function DUF156 [Mycobacterium sp. MCS]
gi|119693611|gb|ABL90684.1| protein of unknown function DUF156 [Mycobacterium sp. KMS]
Length=109
Score = 130 bits (327), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 62/95 (66%), Positives = 73/95 (77%), Gaps = 0/95 (0%)
Query 2 TAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLD 61
+ HGY K+ Y KRL+R+EGQ RGI+RMIEE++YCID+LTQ A+T AL+ VAL LLD
Sbjct 15 STPHGYITDKEKYLKRLKRIEGQARGISRMIEEERYCIDILTQADALTKALQGVALALLD 74
Query 62 EHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+HL HCV A A GGP AD KL EAS AIARLVRS
Sbjct 75 DHLRHCVLDAAAVGGPAADNKLTEASEAIARLVRS 109
>gi|297158509|gb|ADI08221.1| hypothetical protein SBI_05101 [Streptomyces bingchenggensis
BCW-1]
Length=128
Score = 130 bits (326), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 59/92 (65%), Positives = 74/92 (81%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY +QKD + KRLRR+EGQ+RG+ RM+EED YCID+LTQ+SA T AL+S AL LL+EHL
Sbjct 37 HGYAKQKDQHLKRLRRIEGQIRGLQRMVEEDVYCIDILTQVSASTKALQSFALQLLEEHL 96
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HCV A +GG D K+AEA+AAIARL+R+
Sbjct 97 RHCVAAAAVKGGEEVDAKVAEATAAIARLLRT 128
>gi|84495528|ref|ZP_00994647.1| hypothetical protein JNB_12019 [Janibacter sp. HTCC2649]
gi|84385021|gb|EAQ00901.1| hypothetical protein JNB_12019 [Janibacter sp. HTCC2649]
Length=102
Score = 130 bits (326), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 62/94 (66%), Positives = 76/94 (81%), Gaps = 2/94 (2%)
Query 5 HGY--TQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDE 62
H Y + KD+Y KRLRR+EGQ RGI +M++E+KYCID+LTQISA+T AL+SVAL LLDE
Sbjct 9 HSYIASGHKDDYLKRLRRIEGQARGIQKMVDEEKYCIDILTQISAMTKALQSVALGLLDE 68
Query 63 HLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
H++HCV A +GGP AD KL EAS AIARLV+S
Sbjct 69 HMNHCVADAARKGGPEADAKLKEASEAIARLVKS 102
>gi|170781089|ref|YP_001709421.1| hypothetical protein CMS_0652 [Clavibacter michiganensis subsp.
sepedonicus]
gi|169155657|emb|CAQ00775.1| conserved hypothetical protein [Clavibacter michiganensis subsp.
sepedonicus]
Length=96
Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 64/96 (67%), Positives = 75/96 (79%), Gaps = 0/96 (0%)
Query 1 MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLL 60
M A GY++ KD+ KRLRR EGQVRGI RM+E D YCIDVLTQ+SAVT A+ +VAL LL
Sbjct 1 MAAMVGYSEGKDDILKRLRRAEGQVRGIERMVESDTYCIDVLTQVSAVTRAMETVALKLL 60
Query 61 DEHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
D+HL+HC+ A EGG AD K+ EASAAIARLVRS
Sbjct 61 DDHLAHCLAEAAREGGQVADDKVREASAAIARLVRS 96
>gi|284029493|ref|YP_003379424.1| hypothetical protein Kfla_1527 [Kribbella flavida DSM 17836]
gi|283808786|gb|ADB30625.1| protein of unknown function DUF156 [Kribbella flavida DSM 17836]
Length=100
Score = 129 bits (323), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 61/94 (65%), Positives = 75/94 (80%), Gaps = 0/94 (0%)
Query 3 AAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDE 62
A HGY K+ Y +RL+R+EGQ RG+ RM+++++YCID+LTQISA+T AL SVAL LLD+
Sbjct 7 APHGYIGDKEAYLRRLKRIEGQARGLQRMVDDEQYCIDILTQISAMTKALESVALGLLDD 66
Query 63 HLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HL+HCV A A GGP AD KL EAS AIARLVRS
Sbjct 67 HLAHCVVDAAAAGGPEADRKLKEASEAIARLVRS 100
>gi|111020527|ref|YP_703499.1| hypothetical protein RHA1_ro03538 [Rhodococcus jostii RHA1]
gi|110820057|gb|ABG95341.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=109
Score = 129 bits (323), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 59/95 (63%), Positives = 75/95 (79%), Gaps = 0/95 (0%)
Query 2 TAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLD 61
T HGY KD+Y KRLRR+EGQ RG+ RM+E++KYCID+LTQ+SA+T AL++VAL LL+
Sbjct 15 TTGHGYISAKDDYLKRLRRIEGQARGLQRMVEDEKYCIDILTQVSAMTKALQAVALGLLE 74
Query 62 EHLSHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
+H+SHCV A GG AD K+ EA+ AIARLVRS
Sbjct 75 DHISHCVVDAAVAGGTEADAKIKEATDAIARLVRS 109
>gi|296271355|ref|YP_003653987.1| hypothetical protein Tbis_3404 [Thermobispora bispora DSM 43833]
gi|296094142|gb|ADG90094.1| protein of unknown function DUF156 [Thermobispora bispora DSM
43833]
Length=93
Score = 128 bits (322), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 59/92 (65%), Positives = 76/92 (83%), Gaps = 0/92 (0%)
Query 5 HGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHL 64
HGY+ K Y RLRR+EGQVRG+ RM+E+D YCID+LTQ+SAVT AL+SVAL LL++H+
Sbjct 2 HGYSADKQAYLVRLRRIEGQVRGLQRMVEQDAYCIDILTQVSAVTRALQSVALGLLEDHI 61
Query 65 SHCVTRAVAEGGPGADGKLAEASAAIARLVRS 96
HCV A+++GGP A+ K+ EA+AAIARLVRS
Sbjct 62 GHCVAEAISKGGPQAEEKVKEAAAAIARLVRS 93
Lambda K H
0.318 0.130 0.357
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 131127635258
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40