BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2091c
Length=244
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609228|ref|NP_216607.1| hypothetical protein Rv2091c [Mycob... 475 3e-132
gi|340627104|ref|YP_004745556.1| hypothetical protein MCAN_21181... 472 2e-131
gi|240172772|ref|ZP_04751431.1| hypothetical protein MkanA1_2589... 323 1e-86
gi|183983076|ref|YP_001851367.1| hypothetical protein MMAR_3077 ... 277 1e-72
gi|118617833|ref|YP_906165.1| hypothetical protein MUL_2323 [Myc... 276 2e-72
gi|342859795|ref|ZP_08716448.1| hypothetical protein MCOL_12988 ... 268 4e-70
gi|296165172|ref|ZP_06847720.1| conserved hypothetical protein [... 268 5e-70
gi|221230172|ref|YP_002503588.1| hypothetical protein MLBr_01334... 249 2e-64
gi|15827695|ref|NP_301958.1| hypothetical protein ML1334 [Mycoba... 249 3e-64
gi|333990753|ref|YP_004523367.1| hypothetical protein JDM601_211... 233 1e-59
gi|41407924|ref|NP_960760.1| hypothetical protein MAP1826c [Myco... 228 6e-58
gi|336462031|gb|EGO40879.1| hypothetical protein MAPs_24760 [Myc... 227 1e-57
gi|118462890|ref|YP_881610.1| hypothetical protein MAV_2412 [Myc... 208 6e-52
gi|254819634|ref|ZP_05224635.1| hypothetical protein MintA_06914... 189 2e-46
gi|169629276|ref|YP_001702925.1| hypothetical protein MAB_2190 [... 178 5e-43
gi|108799444|ref|YP_639641.1| hypothetical protein Mmcs_2477 [My... 166 4e-39
gi|145223683|ref|YP_001134361.1| hypothetical protein Mflv_3096 ... 160 1e-37
gi|118473777|ref|YP_888175.1| hypothetical protein MSMEG_3884 [M... 157 2e-36
gi|120404411|ref|YP_954240.1| hypothetical protein Mvan_3439 [My... 154 1e-35
gi|296164334|ref|ZP_06846918.1| conserved hypothetical protein [... 124 9e-27
gi|296164330|ref|ZP_06846914.1| conserved hypothetical protein [... 109 3e-22
gi|343927929|ref|ZP_08767395.1| hypothetical protein GOALK_099_0... 92.4 5e-17
gi|226359986|ref|YP_002777764.1| hypothetical protein ROP_05720 ... 92.0 6e-17
gi|111017853|ref|YP_700825.1| hypothetical protein RHA1_ro00834 ... 91.7 1e-16
gi|333992672|ref|YP_004525286.1| hypothetical protein JDM601_403... 89.4 4e-16
gi|118469274|ref|YP_889150.1| hypothetical protein MSMEG_4894 [M... 87.4 1e-15
gi|326382290|ref|ZP_08203982.1| hypothetical protein SCNU_05091 ... 84.3 2e-14
gi|296394175|ref|YP_003659059.1| hypothetical protein Srot_1768 ... 79.3 4e-13
gi|325676817|ref|ZP_08156490.1| hypothetical protein HMPREF0724_... 79.0 6e-13
gi|257056195|ref|YP_003134027.1| hypothetical protein Svir_21920... 78.6 8e-13
gi|312139732|ref|YP_004007068.1| membrane protein [Rhodococcus e... 78.6 8e-13
gi|226306652|ref|YP_002766612.1| hypothetical protein RER_31650 ... 77.0 2e-12
gi|229493548|ref|ZP_04387333.1| conserved hypothetical protein [... 77.0 2e-12
gi|262202397|ref|YP_003273605.1| hypothetical protein Gbro_2471 ... 75.5 6e-12
gi|54025130|ref|YP_119372.1| hypothetical protein nfa31610 [Noca... 73.9 2e-11
gi|333919687|ref|YP_004493268.1| hypothetical protein AS9A_2019 ... 71.6 1e-10
gi|317507054|ref|ZP_07964818.1| hypothetical protein HMPREF9336_... 69.3 4e-10
gi|258653123|ref|YP_003202279.1| hypothetical protein Namu_2952 ... 65.9 6e-09
gi|331697088|ref|YP_004333327.1| hypothetical protein Psed_3284 ... 64.3 2e-08
gi|302524039|ref|ZP_07276381.1| predicted protein [Streptomyces ... 63.2 3e-08
gi|134098835|ref|YP_001104496.1| hypothetical protein SACE_2266 ... 60.8 2e-07
gi|291005807|ref|ZP_06563780.1| hypothetical protein SeryN2_1490... 60.5 2e-07
gi|256376381|ref|YP_003100041.1| hypothetical protein Amir_2255 ... 60.1 3e-07
gi|300782655|ref|YP_003762946.1| hypothetical protein AMED_0724 ... 55.5 7e-06
gi|302527221|ref|ZP_07279563.1| predicted protein [Streptomyces ... 54.3 2e-05
gi|330806033|ref|XP_003290979.1| hypothetical protein DICPUDRAFT... 49.3 5e-04
gi|281208312|gb|EFA82490.1| hypothetical protein PPL_04915 [Poly... 47.0 0.002
gi|227874619|ref|ZP_03992782.1| conserved hypothetical protein [... 47.0 0.002
gi|72016099|ref|XP_786229.1| PREDICTED: similar to RNA-binding p... 41.6 0.12
gi|256376356|ref|YP_003100016.1| hypothetical protein Amir_2230 ... 40.8 0.17
>gi|15609228|ref|NP_216607.1| hypothetical protein Rv2091c [Mycobacterium tuberculosis H37Rv]
gi|15841582|ref|NP_336619.1| hypothetical protein MT2152 [Mycobacterium tuberculosis CDC1551]
gi|31793274|ref|NP_855767.1| hypothetical protein Mb2118c [Mycobacterium bovis AF2122/97]
80 more sequence titles
Length=244
Score = 475 bits (1222), Expect = 3e-132, Method: Compositional matrix adjust.
Identities = 244/244 (100%), Positives = 244/244 (100%), Gaps = 0/244 (0%)
Query 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTPQYQQPADPA 60
MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTPQYQQPADPA
Sbjct 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTPQYQQPADPA 60
Query 61 YPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPGQYPG 120
YPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPGQYPG
Sbjct 61 YPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPGQYPG 120
Query 121 QYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQ 180
QYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQ
Sbjct 121 QYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQ 180
Query 181 QVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEV 240
QVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEV
Sbjct 181 QVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEV 240
Query 241 GRPQ 244
GRPQ
Sbjct 241 GRPQ 244
>gi|340627104|ref|YP_004745556.1| hypothetical protein MCAN_21181 [Mycobacterium canettii CIPT
140010059]
gi|340005294|emb|CCC44450.1| putative membrane protein [Mycobacterium canettii CIPT 140010059]
Length=244
Score = 472 bits (1215), Expect = 2e-131, Method: Compositional matrix adjust.
Identities = 243/244 (99%), Positives = 243/244 (99%), Gaps = 0/244 (0%)
Query 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTPQYQQPADPA 60
MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTPQYQQPADPA
Sbjct 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTPQYQQPADPA 60
Query 61 YPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPGQYPG 120
YPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQ GQYAPPGQYPG
Sbjct 61 YPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQTGQYAPPGQYPG 120
Query 121 QYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQ 180
QYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQ
Sbjct 121 QYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQ 180
Query 181 QVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEV 240
QVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEV
Sbjct 181 QVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEV 240
Query 241 GRPQ 244
GRPQ
Sbjct 241 GRPQ 244
>gi|240172772|ref|ZP_04751431.1| hypothetical protein MkanA1_25892 [Mycobacterium kansasii ATCC
12478]
Length=260
Score = 323 bits (829), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 198/263 (76%), Positives = 212/263 (81%), Gaps = 22/263 (8%)
Query 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTP----QYQQP 56
MSGPQGSDPRQ WQPPGQG DHSSDPT+A G PWQQQPTQ+ATWQ PAYTP QYQQP
Sbjct 1 MSGPQGSDPRQSWQPPGQG-DHSSDPTMA-GSPWQQQPTQDATWQTPAYTPAEYPQYQQP 58
Query 57 ADPAYPQQYPQPTPGYAQPE------QFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPG 110
DPAYPQQYPQ P Y QP+ QFGA P Q G PGQYGQ QPGQYGQP QYGQPG
Sbjct 59 VDPAYPQQYPQSAPAYGQPDFGSQATQFGA-PPQFGQPGQYGQAAQPGQYGQPAQYGQPG 117
Query 111 QY-AP--------PGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWA 161
QY AP PGQYPGQY PY Q+ + SKRS+A+IGGV+ V+ VL IG VL++GFW
Sbjct 118 QYGAPGQPGQYGQPGQYPGQYAPYDQAAKSSKRSMAIIGGVVGVIVVLVIGIVLVVGFWQ 177
Query 162 PGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSID 221
PGFFVTTKLDV KA +GVQQ+LTDET GYGAKNVKDVKCNNG +PTVKKGATFECTVSID
Sbjct 178 PGFFVTTKLDVNKANSGVQQILTDETNGYGAKNVKDVKCNNGVNPTVKKGATFECTVSID 237
Query 222 GTSKRVTVTFQDNKGTYEVGRPQ 244
G SK VTVTFQDNKGTYEVGRPQ
Sbjct 238 GASKHVTVTFQDNKGTYEVGRPQ 260
>gi|183983076|ref|YP_001851367.1| hypothetical protein MMAR_3077 [Mycobacterium marinum M]
gi|183176402|gb|ACC41512.1| conserved membrane protein [Mycobacterium marinum M]
Length=254
Score = 277 bits (709), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 179/257 (70%), Positives = 198/257 (78%), Gaps = 16/257 (6%)
Query 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTP----QYQQP 56
MSGPQG DPRQ WQPPGQG DHSSDPT+AAG PWQQ P QE +WQ P+Y QYQQP
Sbjct 1 MSGPQGPDPRQSWQPPGQGGDHSSDPTMAAGSPWQQ-PGQEGSWQPPSYPAADYQQYQQP 59
Query 57 ADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPG 116
+PAYPQQYP PGY Q + FG+QPTQ G P Q+GQ Q GQ GQ GQ GQ GQ PG
Sbjct 60 VEPAYPQQYPA-APGYGQSD-FGSQPTQFGAPPQFGQPGQYGQPGQYGQPGQYGQPGQPG 117
Query 117 QYPGQ---------YGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVT 167
Q + PY QSGQGSKRS+A+IGGV+ +A+LF+ +L+LGFW PGFFVT
Sbjct 118 QPGQYGQPGQYPGQFSPYDQSGQGSKRSMALIGGVVGGIALLFVVVILVLGFWVPGFFVT 177
Query 168 TKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRV 227
TKLDV KA AGVQQVLTDET GYGAKNVKDVKCN+G+DPTVKKGATF+CTVSIDG SK V
Sbjct 178 TKLDVSKANAGVQQVLTDETNGYGAKNVKDVKCNDGTDPTVKKGATFDCTVSIDGASKHV 237
Query 228 TVTFQDNKGTYEVGRPQ 244
TVTFQDNKGTYEVGRPQ
Sbjct 238 TVTFQDNKGTYEVGRPQ 254
>gi|118617833|ref|YP_906165.1| hypothetical protein MUL_2323 [Mycobacterium ulcerans Agy99]
gi|118569943|gb|ABL04694.1| conserved membrane protein [Mycobacterium ulcerans Agy99]
Length=254
Score = 276 bits (705), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 178/257 (70%), Positives = 198/257 (78%), Gaps = 16/257 (6%)
Query 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTP----QYQQP 56
MSGPQG DPRQ WQPPGQG DHSSDPT+AAG PWQQ P QE +WQ P+Y QYQQP
Sbjct 1 MSGPQGPDPRQSWQPPGQGGDHSSDPTMAAGSPWQQ-PGQEGSWQPPSYPAADYQQYQQP 59
Query 57 ADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPG 116
+PAYPQQYP PGY Q + FG+QPT+ G P Q+GQ Q GQ GQ GQ GQ GQ PG
Sbjct 60 VEPAYPQQYPA-APGYGQSD-FGSQPTRFGAPPQFGQPGQYGQPGQYGQPGQYGQPGQPG 117
Query 117 QYPGQ---------YGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVT 167
Q + PY QSGQGSKRS+A+IGGV+ +A+LF+ +L+LGFW PGFFVT
Sbjct 118 QPGQYGQPGQYPGQFSPYDQSGQGSKRSMALIGGVVGGIALLFVVVILVLGFWVPGFFVT 177
Query 168 TKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRV 227
TKLDV KA AGVQQVLTDET GYGAKNVKDVKCN+G+DPTVKKGATF+CTVSIDG SK V
Sbjct 178 TKLDVSKANAGVQQVLTDETNGYGAKNVKDVKCNDGTDPTVKKGATFDCTVSIDGASKHV 237
Query 228 TVTFQDNKGTYEVGRPQ 244
TVTFQDNKGTYEVGRPQ
Sbjct 238 TVTFQDNKGTYEVGRPQ 254
>gi|342859795|ref|ZP_08716448.1| hypothetical protein MCOL_12988 [Mycobacterium colombiense CECT
3035]
gi|342132927|gb|EGT86147.1| hypothetical protein MCOL_12988 [Mycobacterium colombiense CECT
3035]
Length=254
Score = 268 bits (686), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 174/257 (68%), Positives = 191/257 (75%), Gaps = 16/257 (6%)
Query 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTP-----QYQQ 55
MSGPQGSD + GQG+DHSS+PT A W +Q QE +WQAPAYTP QYQQ
Sbjct 1 MSGPQGSDQQWQPPGQGQGSDHSSEPTQVAPT-WNEQSNQETSWQAPAYTPPADYPQYQQ 59
Query 56 PADPAYPQQYPQPTPGYAQPEQFG-----AQPTQLGVPGQYGQ---YQQPGQYGQPGQYG 107
PA+ AYPQQYPQ PGYAQP Q+G AQP Q PGQYGQ Y QPGQYGQPGQY
Sbjct 60 PAEQAYPQQYPQQQPGYAQPGQYGQPGQYAQPGQYAPPGQYGQPGQYPQPGQYGQPGQYP 119
Query 108 QPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVT 167
QPGQY PGQYP QY Y Q G+ KR V +I GV +A L I VL+LGFW PGFFVT
Sbjct 120 QPGQYGQPGQYPQQYPGYEQPGK--KRPVGLIAGVGGAIAFLLIAVVLVLGFWQPGFFVT 177
Query 168 TKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRV 227
TKLDV KAQAGVQQVL+DET GYGAKNV++VKCN+G +PTVKKGATF+C VSIDG K V
Sbjct 178 TKLDVNKAQAGVQQVLSDETNGYGAKNVQNVKCNHGQNPTVKKGATFDCDVSIDGAPKHV 237
Query 228 TVTFQDNKGTYEVGRPQ 244
TVTFQDNKGTYEVGRPQ
Sbjct 238 TVTFQDNKGTYEVGRPQ 254
>gi|296165172|ref|ZP_06847720.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295899480|gb|EFG78938.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=258
Score = 268 bits (685), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 181/263 (69%), Positives = 196/263 (75%), Gaps = 24/263 (9%)
Query 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTP-----QYQQ 55
MSGPQG P Q WQPPGQG+D S++PT G PWQQQP QEATWQAPAYTP QYQQ
Sbjct 1 MSGPQG--PEQQWQPPGQGSDQSAEPT-QVGSPWQQQPGQEATWQAPAYTPPAEYPQYQQ 57
Query 56 PADPAYPQQYPQPT----------PG-YAQPEQFGAQPTQLGVPGQY---GQYQQPGQYG 101
PA+ AYPQQYPQ PG YAQP Q+G QP Q PGQY GQY QPGQY
Sbjct 58 PAEQAYPQQYPQQPGYGQPGQYAQPGQYAQPGQYG-QPGQYAQPGQYAQPGQYGQPGQYA 116
Query 102 QPGQYGQPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWA 161
QPGQYGQPGQY PGQYP QY PY Q KR V +I GV +A+L I VL+LGFW
Sbjct 117 QPGQYGQPGQYGQPGQYPQQYPPYDQQ-PAKKRPVGLIAGVGGAIALLLIAVVLVLGFWQ 175
Query 162 PGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSID 221
PGFFVTTKLDV KAQAGVQQVL+DET GYGAKNV++VKCN+G +PTVKKGATF+C VSID
Sbjct 176 PGFFVTTKLDVNKAQAGVQQVLSDETNGYGAKNVQNVKCNHGQNPTVKKGATFDCDVSID 235
Query 222 GTSKRVTVTFQDNKGTYEVGRPQ 244
G K VTVTFQDNKGTYEVGRPQ
Sbjct 236 GAPKHVTVTFQDNKGTYEVGRPQ 258
>gi|221230172|ref|YP_002503588.1| hypothetical protein MLBr_01334 [Mycobacterium leprae Br4923]
gi|219933279|emb|CAR71429.1| possible conserved membrane protein [Mycobacterium leprae Br4923]
Length=242
Score = 249 bits (636), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 158/250 (64%), Positives = 178/250 (72%), Gaps = 14/250 (5%)
Query 1 MSGPQGSDPRQPWQPPGQGA-DHSSDPTVAAGYPWQQQP-TQEATWQAPAYTP----QYQ 54
MS PQGSDP + WQ PG+G +HSSD A PWQQQP TQ++TW PAY Y
Sbjct 1 MSEPQGSDPGKQWQSPGEGVENHSSDQPTQAASPWQQQPSTQDSTWHPPAYASPECYNYP 60
Query 55 QPADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAP 114
Q +P YP QYP TPGY QP FGAQ +Q G+PGQY Q G PGQYG PGQY P
Sbjct 61 QLTEPVYPHQYPSATPGYGQPGYFGAQFSQCGIPGQYPQS------GSPGQYGPPGQYGP 114
Query 115 PGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIK 174
PGQY Q+ PY Q G +K VA+IG + V+ VL A+L+ GF P + VTTKLDV K
Sbjct 115 PGQYSQQFQPYEQPG--TKGFVALIGSIAGVIGVLIFAAILVTGFLWPAWLVTTKLDVNK 172
Query 175 AQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDN 234
AQA VQQVLTDET GYGAKNVKDVKCNNG+DPTVKKG TF+C+VSIDG KRVTVTFQD+
Sbjct 173 AQASVQQVLTDETNGYGAKNVKDVKCNNGADPTVKKGDTFDCSVSIDGMQKRVTVTFQDD 232
Query 235 KGTYEVGRPQ 244
KGTYEVGRPQ
Sbjct 233 KGTYEVGRPQ 242
>gi|15827695|ref|NP_301958.1| hypothetical protein ML1334 [Mycobacterium leprae TN]
gi|4200283|emb|CAA22944.1| hypothetical protein MLCB2533.30 [Mycobacterium leprae]
gi|13093246|emb|CAC31715.1| possible conserved membrane protein [Mycobacterium leprae]
Length=272
Score = 249 bits (635), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 162/274 (60%), Positives = 182/274 (67%), Gaps = 32/274 (11%)
Query 1 MSGPQGSDPRQPWQPPGQGA-DHSSDPTVAAGYPWQQQP-TQEATWQAPAYTP----QYQ 54
MS PQGSDP + WQ PG+G +HSSD A PWQQQP TQ++TW PAY Y
Sbjct 1 MSEPQGSDPGKQWQSPGEGVENHSSDQPTQAASPWQQQPSTQDSTWHPPAYASPECYNYP 60
Query 55 QPADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAP 114
Q +P YP QYP TPGY QP FGAQ +Q G+PGQY Q PGQYG PGQYG PGQY P
Sbjct 61 QLTEPVYPHQYPSATPGYGQPGYFGAQFSQCGIPGQYPQSGSPGQYGSPGQYGPPGQYGP 120
Query 115 PGQYPG------------------------QYGPYGQSGQGSKRSVAVIGGVIAVMAVLF 150
PGQY Q+ PY Q G +K VA+IG + V+ VL
Sbjct 121 PGQYGPPGQYGPPGQYGPPGQYGPPGQYSQQFQPYEQPG--TKGFVALIGSIAGVIGVLI 178
Query 151 IGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKK 210
A+L+ GF P + VTTKLDV KAQA VQQVLTDET GYGAKNVKDVKCNNG+DPTVKK
Sbjct 179 FAAILVTGFLWPAWLVTTKLDVNKAQASVQQVLTDETNGYGAKNVKDVKCNNGADPTVKK 238
Query 211 GATFECTVSIDGTSKRVTVTFQDNKGTYEVGRPQ 244
G TF+C+VSIDG KRVTVTFQD+KGTYEVGRPQ
Sbjct 239 GDTFDCSVSIDGMQKRVTVTFQDDKGTYEVGRPQ 272
>gi|333990753|ref|YP_004523367.1| hypothetical protein JDM601_2113 [Mycobacterium sp. JDM601]
gi|333486721|gb|AEF36113.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=239
Score = 233 bits (595), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 134/214 (63%), Positives = 149/214 (70%), Gaps = 20/214 (9%)
Query 41 EATWQAPAYTPQYQQPADPAYPQQYPQP--TPGYAQPEQFG-----AQPTQLGVPGQYGQ 93
E WQ PAYTP YPQ YP PGY QPEQ+G QPTQ G PG YGQ
Sbjct 36 EQQWQPPAYTPS-------QYPQSYPSAPQYPGYPQPEQYGQPGGYGQPTQYGQPGGYGQ 88
Query 94 ---YQQPGQYGQPGQYGQPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLF 150
Y QPGQYGQPGQYGQPGQY GQYP Q G +GQ G G KR A++G V+A +
Sbjct 89 PTQYGQPGQYGQPGQYGQPGQY---GQYPQQPGQFGQPGSGGKRPTALVGTVLAACLGIA 145
Query 151 IGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKK 210
VL+LGFW PGFFVTTKLD+ KAQ GVQQ+L D+T GYGAKNVKDV+CNNG P VK+
Sbjct 146 ALVVLVLGFWKPGFFVTTKLDIDKAQQGVQQILADQTNGYGAKNVKDVRCNNGQSPVVKQ 205
Query 211 GATFECTVSIDGTSKRVTVTFQDNKGTYEVGRPQ 244
G TF C VSIDGT ++VTVTFQDN GTYEVGRP+
Sbjct 206 GDTFNCEVSIDGTKRQVTVTFQDNSGTYEVGRPK 239
>gi|41407924|ref|NP_960760.1| hypothetical protein MAP1826c [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|41396278|gb|AAS04143.1| hypothetical protein MAP_1826c [Mycobacterium avium subsp. paratuberculosis
K-10]
Length=263
Score = 228 bits (581), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 165/268 (62%), Positives = 177/268 (67%), Gaps = 39/268 (14%)
Query 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTP-----QYQQ 55
MSGPQGSD + GQG+DHSSDPT A WQ TQEA+WQAPAYTP QYQQ
Sbjct 11 MSGPQGSDQQWQPPGQGQGSDHSSDPTQVAP-TWQDHATQEASWQAPAYTPPAEYPQYQQ 69
Query 56 PADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPP 115
P + AYPQ YPQ GYAQP GQYGQY QPGQYGQPGQY QPGQY P
Sbjct 70 PGEQAYPQPYPQQQ-GYAQP-------------GQYGQYGQPGQYGQPGQYAQPGQYGQP 115
Query 116 GQYPGQYGPYGQSGQ-------------------GSKRSVAVIGGVIAVMAVLFIGAVLI 156
GQYP G KR V +I GV +A+L I VL+
Sbjct 116 GQYPQPGQYPQPGQYGQYGQPGQYPQQYQPYEQPGKKRPVGLIAGVAGGIALLLIAVVLV 175
Query 157 LGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFEC 216
LGFW PGFFVTTKLDV KAQAGVQQVL+DET GYGAKNVKDVKCN+G +PTVKKGATF+C
Sbjct 176 LGFWQPGFFVTTKLDVNKAQAGVQQVLSDETNGYGAKNVKDVKCNHGQNPTVKKGATFDC 235
Query 217 TVSIDGTSKRVTVTFQDNKGTYEVGRPQ 244
VSIDG K VTVTFQDNKGTYEVGRPQ
Sbjct 236 DVSIDGAPKHVTVTFQDNKGTYEVGRPQ 263
>gi|336462031|gb|EGO40879.1| hypothetical protein MAPs_24760 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=253
Score = 227 bits (579), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 165/268 (62%), Positives = 177/268 (67%), Gaps = 39/268 (14%)
Query 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTP-----QYQQ 55
MSGPQGSD + GQG+DHSSDPT A WQ TQEA+WQAPAYTP QYQQ
Sbjct 1 MSGPQGSDQQWQPPGQGQGSDHSSDPTQVAP-TWQDHATQEASWQAPAYTPPAEYPQYQQ 59
Query 56 PADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPP 115
P + AYPQ YPQ GYAQP GQYGQY QPGQYGQPGQY QPGQY P
Sbjct 60 PGEQAYPQPYPQQQ-GYAQP-------------GQYGQYGQPGQYGQPGQYAQPGQYGQP 105
Query 116 GQYPGQYGPYGQSGQ-------------------GSKRSVAVIGGVIAVMAVLFIGAVLI 156
GQYP G KR V +I GV +A+L I VL+
Sbjct 106 GQYPQPGQYPQPGQYGQYGQPGQYPQQYQPYEQPGKKRPVGLIAGVAGGIALLLIAVVLV 165
Query 157 LGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFEC 216
LGFW PGFFVTTKLDV KAQAGVQQVL+DET GYGAKNVKDVKCN+G +PTVKKGATF+C
Sbjct 166 LGFWQPGFFVTTKLDVNKAQAGVQQVLSDETNGYGAKNVKDVKCNHGQNPTVKKGATFDC 225
Query 217 TVSIDGTSKRVTVTFQDNKGTYEVGRPQ 244
VSIDG K VTVTFQDNKGTYEVGRPQ
Sbjct 226 DVSIDGAPKHVTVTFQDNKGTYEVGRPQ 253
>gi|118462890|ref|YP_881610.1| hypothetical protein MAV_2412 [Mycobacterium avium 104]
gi|254775081|ref|ZP_05216597.1| hypothetical protein MaviaA2_10466 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|118164177|gb|ABK65074.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=225
Score = 208 bits (529), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 146/235 (63%), Positives = 156/235 (67%), Gaps = 38/235 (16%)
Query 34 WQQQPTQEATWQAPAYTP-----QYQQPADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVP 88
WQ TQEA+WQAPAYTP QYQQP + AYPQ YPQ GYAQP
Sbjct 5 WQDHATQEASWQAPAYTPPAEYPQYQQPGEQAYPQPYPQQQ-GYAQP------------- 50
Query 89 GQYGQYQQPGQYGQPGQYGQPGQYAPPGQYPGQYGPYGQSGQ------------------ 130
GQYGQY QPGQYGQPGQY QPGQY PGQYP
Sbjct 51 GQYGQYGQPGQYGQPGQYAQPGQYPQPGQYPQPGQYPQPGQYGQYGQPGQYPQQYQPYEQ 110
Query 131 -GSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTG 189
G KR V +I GV +A+L I VL+LGFW PGFFVTTKLDV KAQAGVQQVL+DET G
Sbjct 111 PGKKRPVGLIAGVAGGIALLLIAVVLVLGFWQPGFFVTTKLDVNKAQAGVQQVLSDETNG 170
Query 190 YGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVGRPQ 244
YGAKNVKDVKCN+G +PTVKKGATF+C VSIDG K VTVTFQDNKGTYEVGRPQ
Sbjct 171 YGAKNVKDVKCNHGQNPTVKKGATFDCDVSIDGAPKHVTVTFQDNKGTYEVGRPQ 225
>gi|254819634|ref|ZP_05224635.1| hypothetical protein MintA_06914 [Mycobacterium intracellulare
ATCC 13950]
Length=253
Score = 189 bits (481), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 145/290 (50%), Positives = 159/290 (55%), Gaps = 83/290 (28%)
Query 1 MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTPQYQQPADPA 60
MSGPQGSD + GQG+DHSSDPT A W Q TQEA+WQAPAYTP
Sbjct 1 MSGPQGSDQQWQPPGQGQGSDHSSDPTQVAP-SWHDQQTQEASWQAPAYTP--------- 50
Query 61 YPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPGQY-- 118
P +Y QYQQP + P QY Q YA PGQY
Sbjct 51 ---------------------------PAEYPQYQQPAEQAYPQQYPQQPGYAQPGQYGQ 83
Query 119 PGQYGPYGQSGQ--------------------------------------------GSKR 134
PGQY GQ GQ G KR
Sbjct 84 PGQYAQPGQYGQPGQYPQPGQYGQPGQYPPPGQYGQYGQYGQPGQYPPQYPPYEQPGKKR 143
Query 135 SVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKN 194
V +I GV +A+L + VL+LGFW PGFFVTTKLDV KAQAGVQQVL+DET GYGAKN
Sbjct 144 PVGLIAGVGGGIALLLVVVVLVLGFWQPGFFVTTKLDVSKAQAGVQQVLSDETNGYGAKN 203
Query 195 VKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVGRPQ 244
VKDVKCN+G +PTVKKGATF+C VSIDG K VTVTFQDNKGTYEVGRPQ
Sbjct 204 VKDVKCNHGQNPTVKKGATFDCDVSIDGAPKHVTVTFQDNKGTYEVGRPQ 253
>gi|169629276|ref|YP_001702925.1| hypothetical protein MAB_2190 [Mycobacterium abscessus ATCC 19977]
gi|169241243|emb|CAM62271.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=271
Score = 178 bits (452), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 122/251 (49%), Positives = 149/251 (60%), Gaps = 23/251 (9%)
Query 1 MSGPQGSDPRQPWQP-PGQGAD-----HSSDPTVAAGYPWQQQPTQEATWQAPAYTPQYQ 54
MSGPQG++P W P PGQG D SSDP WQQ P Q P Y
Sbjct 37 MSGPQGTEPGGQWAPQPGQGQDPWQTPQSSDPAQQGEQGWQQ-PAYGQGQQYPQYQQPGF 95
Query 55 QPADPAYPQQY-PQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYA 113
P Q+ QP G+ +P QFG Q PGQYGQP GQP
Sbjct 96 PGGGFGDPSQFGQQPGQGFGEPSQFGQPGAYGQQGQYGQQGAYPGQYGQP--QGQP---- 149
Query 114 PPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVI 173
+P Q+G Q S +++A+IGG++A VL + +L+ F APG+ VTT LDV
Sbjct 150 ----FP-QFG--AGKPQRSTKTIAIIGGIVAAAVVLIV--LLVTAFLAPGWAVTTTLDVN 200
Query 174 KAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQD 233
KAQ GV ++LTDET GYGAKNVKDVKCN+G++P VKKGATF C VSIDGT ++VTVTFQ+
Sbjct 201 KAQDGVTKILTDETNGYGAKNVKDVKCNDGNNPEVKKGATFTCDVSIDGTKRQVTVTFQN 260
Query 234 NKGTYEVGRPQ 244
+KGTYEVGRP+
Sbjct 261 DKGTYEVGRPK 271
>gi|108799444|ref|YP_639641.1| hypothetical protein Mmcs_2477 [Mycobacterium sp. MCS]
gi|119868557|ref|YP_938509.1| hypothetical protein Mkms_2522 [Mycobacterium sp. KMS]
gi|126435098|ref|YP_001070789.1| hypothetical protein Mjls_2514 [Mycobacterium sp. JLS]
gi|108769863|gb|ABG08585.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119694646|gb|ABL91719.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126234898|gb|ABN98298.1| hypothetical protein Mjls_2514 [Mycobacterium sp. JLS]
Length=262
Score = 166 bits (419), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 88/138 (64%), Positives = 104/138 (76%), Gaps = 11/138 (7%)
Query 107 GQPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFV 166
GQ G + PPG S GSKRS+ VIGG++ + +L + VL+LGFW PGFFV
Sbjct 136 GQYGSFPPPG-----------SEDGSKRSLGVIGGILGALVLLIVIVVLVLGFWKPGFFV 184
Query 167 TTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKR 226
TTKLDV AQ+GVQQ+LTDE+ GYGAKNV+DVKCNNG P VKKGATF+C VSIDGT +
Sbjct 185 TTKLDVNAAQSGVQQILTDESNGYGAKNVRDVKCNNGESPEVKKGATFDCEVSIDGTKRE 244
Query 227 VTVTFQDNKGTYEVGRPQ 244
VTVTFQD+ GTYEVGRP+
Sbjct 245 VTVTFQDDDGTYEVGRPR 262
>gi|145223683|ref|YP_001134361.1| hypothetical protein Mflv_3096 [Mycobacterium gilvum PYR-GCK]
gi|315444010|ref|YP_004076889.1| hypothetical protein Mspyr1_24120 [Mycobacterium sp. Spyr1]
gi|145216169|gb|ABP45573.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315262313|gb|ADT99054.1| hypothetical protein Mspyr1_24120 [Mycobacterium sp. Spyr1]
Length=255
Score = 160 bits (406), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 85/127 (67%), Positives = 97/127 (77%), Gaps = 5/127 (3%)
Query 118 YPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQA 177
YPGQ G +GSKRS+AVIGGVI + + + VL+ GFW PGFFVTTKLDV AQ
Sbjct 134 YPGQGGD-----EGSKRSMAVIGGVIGALVAVILAVVLVTGFWKPGFFVTTKLDVDAAQT 188
Query 178 GVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGT 237
GV+Q+LTDET GYGAKNV DV CN G PTVKKG TF C VSIDGT ++VTVTFQD+ GT
Sbjct 189 GVRQILTDETNGYGAKNVNDVTCNGGVSPTVKKGDTFTCEVSIDGTKRQVTVTFQDDDGT 248
Query 238 YEVGRPQ 244
YEVGRP+
Sbjct 249 YEVGRPR 255
>gi|118473777|ref|YP_888175.1| hypothetical protein MSMEG_3884 [Mycobacterium smegmatis str.
MC2 155]
gi|118175064|gb|ABK75960.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=287
Score = 157 bits (396), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/125 (64%), Positives = 95/125 (76%), Gaps = 4/125 (3%)
Query 122 YGPYG--QSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGV 179
+GPY S GSK+S+ + GG+ A VL + AVL FW PG+ VTT+LD+ KAQ GV
Sbjct 165 FGPYTPPGSADGSKKSLLLFGGLGAAALVLIVLAVLF--FWVPGWAVTTELDIDKAQTGV 222
Query 180 QQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYE 239
QQ+LTDET GYGAKNVKDVKCN G +P VKKG TF C VSIDGT ++VTVTFQ++ GTYE
Sbjct 223 QQILTDETNGYGAKNVKDVKCNGGKNPVVKKGDTFTCEVSIDGTKRQVTVTFQNDDGTYE 282
Query 240 VGRPQ 244
VGRP+
Sbjct 283 VGRPK 287
>gi|120404411|ref|YP_954240.1| hypothetical protein Mvan_3439 [Mycobacterium vanbaalenii PYR-1]
gi|119957229|gb|ABM14234.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=264
Score = 154 bits (389), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/117 (74%), Positives = 98/117 (84%), Gaps = 0/117 (0%)
Query 128 SGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDET 187
S +GSKRS+AVIGGV+ V+A + I AVL+LGFW PGFFVTTKLDV AQ GV+Q+LTDET
Sbjct 148 SEEGSKRSLAVIGGVVGVLAAVVIAAVLVLGFWKPGFFVTTKLDVNAAQNGVKQILTDET 207
Query 188 TGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVGRPQ 244
GYGA NV DVKCNNG PTVKKG TF C VSIDGT ++VTVTFQD+ GTYEVGRP+
Sbjct 208 NGYGATNVNDVKCNNGVSPTVKKGDTFNCEVSIDGTKRQVTVTFQDDNGTYEVGRPK 264
>gi|296164334|ref|ZP_06846918.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295900316|gb|EFG79738.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=106
Score = 124 bits (312), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 58/99 (59%), Positives = 74/99 (75%), Gaps = 0/99 (0%)
Query 146 MAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSD 205
+AVL G L++G P T KLDV AQA +Q+VLTD+ TGYG KNV DVKCN+G++
Sbjct 8 LAVLVGGIALVIGLSQPDLLSTKKLDVQAAQAEIQRVLTDDITGYGDKNVADVKCNDGAN 67
Query 206 PTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVGRPQ 244
TVK+G++F C VS+DGT++RVTVTF DN G +EVGRP
Sbjct 68 VTVKQGSSFTCQVSVDGTARRVTVTFLDNNGNFEVGRPD 106
>gi|296164330|ref|ZP_06846914.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295900312|gb|EFG79734.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=107
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 54/89 (61%), Positives = 63/89 (71%), Gaps = 1/89 (1%)
Query 154 VLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGAT 213
VLI GFWAPGFFVT +LDV AQAGV VL+D GYGAK+V DV CN+G +PTV KGAT
Sbjct 15 VLITGFWAPGFFVTRQLDVRAAQAGVAHVLSDPA-GYGAKDVSDVICNDGRNPTVTKGAT 73
Query 214 FECTVSIDGTSKRVTVTFQDNKGTYEVGR 242
F C +ID + VTF D+ G YE+ R
Sbjct 74 FTCQATIDHIKHQFVVTFTDDAGNYEISR 102
>gi|343927929|ref|ZP_08767395.1| hypothetical protein GOALK_099_00610 [Gordonia alkanivorans NBRC
16433]
gi|343762152|dbj|GAA14321.1| hypothetical protein GOALK_099_00610 [Gordonia alkanivorans NBRC
16433]
Length=292
Score = 92.4 bits (228), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 95/247 (39%), Positives = 120/247 (49%), Gaps = 52/247 (21%)
Query 33 PWQQQPTQEATWQAPAYTPQYQQPADPAY------------PQQYPQP--TPGYAQPEQ- 77
P++Q P Q QAP Y P Q P PA P QY QP P Y P Q
Sbjct 61 PYRQNPYQ----QAPGYAPGAQAPGQPAQNQPTQVGYPGAQPGQYGQPGGQPAYGAPNQG 116
Query 78 ----------FGAQP--TQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPGQYPGQYGPY 125
+G QP Q G GQYG GQYG PGQYGQ Y+ P Q PG YG
Sbjct 117 APNYGGPAQPYGQQPDPAQYGA-GQYGA----GQYGAPGQYGQQ-PYSQPQQTPGAYGQN 170
Query 126 GQSGQGSKRSVAVIGGVIAVMAVLFIGA---------VLILGFWAPGFFVTTKLDVIKAQ 176
G +A + +LF G +LI F APG+ LD AQ
Sbjct 171 QFGGPDQFSGMATPKSNKTLRTLLFAGGGALIVIAAIILITAFVAPGW-APKNLDQQAAQ 229
Query 177 AGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKG 236
GVQQ+LT++ Y A +V +VKC +G VK+G++F C+V++ G ++VTVTF D++G
Sbjct 230 DGVQQILTED---YQASDVSNVKCPSGQ--RVKEGSSFTCSVTVGGQDQQVTVTFIDDEG 284
Query 237 TYEVGRP 243
YEV RP
Sbjct 285 RYEVSRP 291
>gi|226359986|ref|YP_002777764.1| hypothetical protein ROP_05720 [Rhodococcus opacus B4]
gi|226238471|dbj|BAH48819.1| hypothetical membrane protein [Rhodococcus opacus B4]
Length=231
Score = 92.0 bits (227), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 49/113 (44%), Positives = 73/113 (65%), Gaps = 10/113 (8%)
Query 131 GSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGY 190
G + +IGGV+A++ + +G L+L F F T LD A+ GV++++TD Y
Sbjct 128 GKSKLPLIIGGVVALLVI--VGIALVLAFT---VFGTDTLDQKAAEDGVERIVTDS---Y 179
Query 191 GAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVGRP 243
GA+NV V C +G + VKKG +F C++++DG K+VT+TF D+KGTYEV RP
Sbjct 180 GAENVSGVTCPSGQE--VKKGNSFTCSLTVDGEKKQVTLTFTDDKGTYEVSRP 230
>gi|111017853|ref|YP_700825.1| hypothetical protein RHA1_ro00834 [Rhodococcus jostii RHA1]
gi|110817383|gb|ABG92667.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length=229
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/113 (44%), Positives = 72/113 (64%), Gaps = 10/113 (8%)
Query 131 GSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGY 190
G + +IGGV+A++ + +G L+L F F T LD A+ GV++++TD Y
Sbjct 126 GKSKLPLIIGGVVALLVI--VGIALVLAF---TVFGTDTLDQKAAEDGVERIVTDS---Y 177
Query 191 GAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVGRP 243
GA+NV V C +D VKKG +F C++++DG K+VT+TF D+KGTYEV RP
Sbjct 178 GAENVSGVTCP--ADQEVKKGNSFTCSLTVDGEKKQVTLTFTDDKGTYEVSRP 228
>gi|333992672|ref|YP_004525286.1| hypothetical protein JDM601_4032 [Mycobacterium sp. JDM601]
gi|333488640|gb|AEF38032.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=107
Score = 89.4 bits (220), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/77 (54%), Positives = 54/77 (71%), Gaps = 0/77 (0%)
Query 168 TKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRV 227
+LDV AQ GV +VLTD GYG +V DV+CNNG +P V+KG++F CTV ++G +RV
Sbjct 31 VELDVGHAQDGVWRVLTDPINGYGRDDVTDVRCNNGVNPRVRKGSSFTCTVIVNGVQRRV 90
Query 228 TVTFQDNKGTYEVGRPQ 244
F D+ GTYEV RP+
Sbjct 91 LAEFADDAGTYEVDRPR 107
>gi|118469274|ref|YP_889150.1| hypothetical protein MSMEG_4894 [Mycobacterium smegmatis str.
MC2 155]
gi|118170561|gb|ABK71457.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=194
Score = 87.4 bits (215), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/87 (48%), Positives = 56/87 (65%), Gaps = 0/87 (0%)
Query 158 GFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECT 217
G W G +LDV +A+AGV ++L+D GYGA +V V CNNG +P V+ G+TF C
Sbjct 108 GLWMRGEAGGPRLDVRQAEAGVAEILSDPVYGYGANDVAAVACNNGKNPRVEVGSTFMCA 167
Query 218 VSIDGTSKRVTVTFQDNKGTYEVGRPQ 244
V I+GT ++V V F D+ GTY V P+
Sbjct 168 VDINGTVRQVVVEFTDDDGTYAVDGPR 194
>gi|326382290|ref|ZP_08203982.1| hypothetical protein SCNU_05091 [Gordonia neofelifaecis NRRL
B-59395]
gi|326199020|gb|EGD56202.1| hypothetical protein SCNU_05091 [Gordonia neofelifaecis NRRL
B-59395]
Length=291
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/254 (38%), Positives = 120/254 (48%), Gaps = 47/254 (18%)
Query 26 PTVAAGYPWQQQPTQEATWQAPAYTPQYQQP--ADPAYPQQYPQPTPGYAQPEQFGA--- 80
P A PWQQ+ Q+ T Q QP PAY Q+ P P Y QP Q+GA
Sbjct 47 PAAAQASPWQQE-------QSAGPTEQLGQPPIGAPAYGQETPGAPPQYGQP-QYGAGQP 98
Query 81 QPTQLGVPGQYGQYQQPGQYGQPGQY--GQP--GQYAPPGQYPG--------QYGP--YG 126
P Q P QYGQ Q P Q Y G P GQ P GQ PG Q+G YG
Sbjct 99 APGQFVAPAQYGQPQTPPGAPQQNPYAAGPPAYGQQQPYGQVPGSQTQFGQSQFGQSQYG 158
Query 127 QSGQ----------------GSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKL 170
QSGQ SK+S I G++ + VL + + F P + + L
Sbjct 159 QSGQLSGDPTLNPYAPPQAPASKKSTGKIVGIVVGVLVLIAAVIALTAFVWPKW-ASKDL 217
Query 171 DVIKAQAGVQQVLTD-ETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTV 229
+ Q GV ++LT E GYG K+V V C +G V+ G TF C+V ++G +K VTV
Sbjct 218 NQSAVQDGVTKILTTPEPEGYGLKDVTSVDCPSGQK--VETGHTFTCSVKVNGENKHVTV 275
Query 230 TFQDNKGTYEVGRP 243
T +D+ GTYEV RP
Sbjct 276 TVKDDSGTYEVSRP 289
>gi|296394175|ref|YP_003659059.1| hypothetical protein Srot_1768 [Segniliparus rotundus DSM 44985]
gi|296181322|gb|ADG98228.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=208
Score = 79.3 bits (194), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 61/164 (38%), Positives = 79/164 (49%), Gaps = 22/164 (13%)
Query 88 PGQYGQYQQPGQYGQP-GQYG-------QPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVI 139
P YGQ QQP YGQP YG APPG YG + G G+ +++
Sbjct 52 PSPYGQPQQPSAYGQPPSPYGAPQQPPPYGQPVAPPG-----YGAPPKQGNGATKAL--- 103
Query 140 GGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDET-TGYGAKNVKDV 198
+I + +L G GFW F LD +A V+ VLT GYG N+ DV
Sbjct 104 --IIVPVVLLVAGIAGFAGFWL--FGRQKVLDTEQAAKDVKSVLTAAAPNGYGETNITDV 159
Query 199 KC-NNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVG 241
KC + GSD V+K ATF C++ DG V F ++KG Y+VG
Sbjct 160 KCPDTGSDIKVEKDATFTCSLKKDGKPTTVKAKFSNDKGDYKVG 203
>gi|325676817|ref|ZP_08156490.1| hypothetical protein HMPREF0724_14273 [Rhodococcus equi ATCC
33707]
gi|325552365|gb|EGD22054.1| hypothetical protein HMPREF0724_14273 [Rhodococcus equi ATCC
33707]
Length=278
Score = 79.0 bits (193), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 68/185 (37%), Positives = 94/185 (51%), Gaps = 32/185 (17%)
Query 62 PQQYPQPTPGYAQPEQFGAQPTQLGVPGQ--YGQYQQPGQYGQPGQYGQPG-QYAPPGQY 118
PQ +PQ PG Q G PGQ +G Q G P Q+GQPG Q+APP Q
Sbjct 122 PQGWPQQNPGQ----------QQWGAPGQPQWGPGNQGMAQGYPQQFGQPGNQWAPPQQG 171
Query 119 PGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAG 178
P Q G G + V+ ++ + +G LD AQ G
Sbjct 172 PRQGGGKTMLWVGIGVAAVVVVALVGWLLTSALGGKT--------------LDEQAAQRG 217
Query 179 VQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTY 238
V+Q++T+ YGA++V V C +G V+KGA+FEC++++DG+ K V VTF D++GTY
Sbjct 218 VEQIVTES---YGARSVTGVSCPSGQK--VEKGASFECSLTVDGSPKSVKVTFTDDEGTY 272
Query 239 EVGRP 243
EV RP
Sbjct 273 EVSRP 277
>gi|257056195|ref|YP_003134027.1| hypothetical protein Svir_21920 [Saccharomonospora viridis DSM
43017]
gi|256586067|gb|ACU97200.1| hypothetical protein Svir_21920 [Saccharomonospora viridis DSM
43017]
Length=207
Score = 78.6 bits (192), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 76/206 (37%), Positives = 103/206 (50%), Gaps = 34/206 (16%)
Query 54 QQPADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVPG--------QYGQYQQPGQYGQPGQ 105
Q PA+PA + G+AQP +G QP PG QYGQ P Y Q
Sbjct 16 QPPANPA--------SGGFAQPGGYGQQPAFGQQPGYGQPYGAPQYGQQPAPNPYDQSNP 67
Query 106 YGQP---GQYAPPGQYPGQ-YG---PYG-QSGQGSKRSVAVIGGVIAVMAVLFIGAVLIL 157
YGQ GQ GQ PGQ YG P G QSG+ S + + + IAV VL A ++L
Sbjct 68 YGQQNAYGQQPAFGQQPGQSYGAPPPGGEQSGKKSGKGLWI---GIAVTVVLLAAAGVVL 124
Query 158 GFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECT 217
FW PG +T D + ++ +Q+VL D YG +V+ V C +D V++G TF C
Sbjct 125 -FWKPGLLNSTVFDQERMESDIQRVLQDS---YGL-SVESVSCP--ADQKVEEGHTFTCD 177
Query 218 VSIDGTSKRVTVTFQDNKGTYEVGRP 243
++DG + V +T ++G Y V RP
Sbjct 178 ATVDGQQQSVKITVVSDEGRYTVDRP 203
>gi|312139732|ref|YP_004007068.1| membrane protein [Rhodococcus equi 103S]
gi|311889071|emb|CBH48384.1| putative membrane protein [Rhodococcus equi 103S]
Length=238
Score = 78.6 bits (192), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 68/185 (37%), Positives = 94/185 (51%), Gaps = 32/185 (17%)
Query 62 PQQYPQPTPGYAQPEQFGAQPTQLGVPGQ--YGQYQQPGQYGQPGQYGQPG-QYAPPGQY 118
PQ +PQ PG Q G PGQ +G Q G P Q+GQPG Q+APP Q
Sbjct 82 PQGWPQQNPGQ----------QQWGAPGQPQWGPGNQGMAQGYPQQFGQPGNQWAPPQQG 131
Query 119 PGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAG 178
P Q G G + V+ ++ + +G LD AQ G
Sbjct 132 PRQGGGKTMLWVGIGVAAVVVVALVGWLLTSALGGKT--------------LDEQAAQRG 177
Query 179 VQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTY 238
V+Q++T+ YGA++V V C +G V+KGA+FEC++++DG+ K V VTF D++GTY
Sbjct 178 VEQIVTES---YGARSVTGVSCPSGQK--VEKGASFECSLTVDGSPKSVKVTFTDDEGTY 232
Query 239 EVGRP 243
EV RP
Sbjct 233 EVSRP 237
>gi|226306652|ref|YP_002766612.1| hypothetical protein RER_31650 [Rhodococcus erythropolis PR4]
gi|226185769|dbj|BAH33873.1| hypothetical protein RER_31650 [Rhodococcus erythropolis PR4]
Length=277
Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/81 (51%), Positives = 51/81 (63%), Gaps = 5/81 (6%)
Query 163 GFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDG 222
G F LD AQAGV++++T+ YGAKNV +V+C +D VK G F CT+SI G
Sbjct 201 GVFGKNTLDQSAAQAGVEKIVTES---YGAKNVSNVQC--PADQEVKSGNKFTCTLSIAG 255
Query 223 TSKRVTVTFQDNKGTYEVGRP 243
VTVTF D+ GTYEV RP
Sbjct 256 VKSNVTVTFVDDNGTYEVSRP 276
>gi|229493548|ref|ZP_04387333.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229319509|gb|EEN85345.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=274
Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/81 (51%), Positives = 51/81 (63%), Gaps = 5/81 (6%)
Query 163 GFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDG 222
G F LD AQAGV++++T+ YGAKNV +V+C +D VK G F CT+SI G
Sbjct 198 GVFGKNTLDQSAAQAGVEKIVTES---YGAKNVSNVQCP--ADQEVKSGNKFTCTLSIAG 252
Query 223 TSKRVTVTFQDNKGTYEVGRP 243
VTVTF D+ GTYEV RP
Sbjct 253 VKSNVTVTFVDDNGTYEVSRP 273
>gi|262202397|ref|YP_003273605.1| hypothetical protein Gbro_2471 [Gordonia bronchialis DSM 43247]
gi|262085744|gb|ACY21712.1| hypothetical protein Gbro_2471 [Gordonia bronchialis DSM 43247]
Length=324
Score = 75.5 bits (184), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 49/118 (42%), Positives = 68/118 (58%), Gaps = 10/118 (8%)
Query 126 GQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTD 185
G SG+G +R++ G V+ V A+ + L WAP LD + GV+QVLT
Sbjct 213 GSSGKGLRRAMLAGGVVLFVAAIAVVVTAFWLPGWAP-----KNLDQSAVEDGVKQVLT- 266
Query 186 ETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVGRP 243
T Y A++VKDV C +G +K+GATF CT++ G ++VTVT D+ G YEV RP
Sbjct 267 --TDYQAQDVKDVSCPSGQ--RIKQGATFTCTLNAGGADQQVTVTILDDDGKYEVSRP 320
>gi|54025130|ref|YP_119372.1| hypothetical protein nfa31610 [Nocardia farcinica IFM 10152]
gi|54016638|dbj|BAD58008.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=230
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 42/101 (42%), Positives = 55/101 (55%), Gaps = 8/101 (7%)
Query 144 AVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNG 203
A + VL +G ++ L T KLD Q GV+QVLTD YG +V DV C +G
Sbjct 138 AAVGVLVLGGIVALVLL---LTATDKLDQAAVQDGVKQVLTDS---YGITDVSDVSCPSG 191
Query 204 SDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVGRPQ 244
V+ ATFECT+ + G K+V + + GTYEVGRP
Sbjct 192 Q--KVEVDATFECTLKVSGEQKKVNIKITKDDGTYEVGRPN 230
>gi|333919687|ref|YP_004493268.1| hypothetical protein AS9A_2019 [Amycolicicoccus subflavus DQS3-9A1]
gi|333481908|gb|AEF40468.1| hypothetical protein AS9A_2019 [Amycolicicoccus subflavus DQS3-9A1]
Length=146
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/123 (39%), Positives = 67/123 (55%), Gaps = 10/123 (8%)
Query 123 GPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPG--FFVTTKLDVIKAQAGVQ 180
G G+S K VAV+ +A ++ GAVL+LGF PG V+ D +AGV
Sbjct 31 GRNGESNGSRKTLVAVL---LAGAGLVIAGAVLVLGFVWPGGGLLVSQTFDRGAVEAGVA 87
Query 181 QVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEV 240
Q++TD Y A +V V C ++ V+ G+ F C V++DG +RV VT ++ G YEV
Sbjct 88 QIITDV---YRAGSVGAVDCP--ANQEVRSGSVFTCLVTVDGEPRRVEVTVLNDNGEYEV 142
Query 241 GRP 243
G P
Sbjct 143 GNP 145
>gi|317507054|ref|ZP_07964818.1| hypothetical protein HMPREF9336_01189 [Segniliparus rugosus ATCC
BAA-974]
gi|316254657|gb|EFV13963.1| hypothetical protein HMPREF9336_01189 [Segniliparus rugosus ATCC
BAA-974]
Length=219
Score = 69.3 bits (168), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 53/142 (38%), Positives = 68/142 (48%), Gaps = 16/142 (11%)
Query 102 QPGQYGQPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWA 161
QP YGQP AP G YG Q G + + V + GV ++AVL GFW
Sbjct 87 QPLPYGQPA--APIG-----YGAPPQQGNAALKGVLIGLGVFVLVAVLGFA-----GFWI 134
Query 162 PGFFVTTKLDVIKAQAGVQQVLTDET-TGYGAKNVKDVKC-NNGSDPTVKKGATFECTVS 219
F LD +A V+ VLT GYG N+ DVKC + G D V+K A+F C++
Sbjct 135 --FGRQKVLDTAQAAKDVKGVLTAAAPNGYGETNITDVKCPDQGGDIKVEKDASFTCSLK 192
Query 220 IDGTSKRVTVTFQDNKGTYEVG 241
DG V F + G Y+VG
Sbjct 193 KDGKETTVKAKFTNEDGDYQVG 214
>gi|258653123|ref|YP_003202279.1| hypothetical protein Namu_2952 [Nakamurella multipartita DSM
44233]
gi|258556348|gb|ACV79290.1| hypothetical protein Namu_2952 [Nakamurella multipartita DSM
44233]
Length=334
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/86 (41%), Positives = 46/86 (54%), Gaps = 3/86 (3%)
Query 159 FWAPGFFVTTKLDVIKAQAGVQQVLT-DETTGYGAKNVKDVKCNNGSDPTVKKGATFECT 217
F PGF D + GV VLT + +GYG V DV C +G V+ G +F+C
Sbjct 250 FLWPGFLNKKVFDEKQVAQGVTSVLTGNPPSGYGLTGVTDVTCPSGQ--PVQAGTSFQCD 307
Query 218 VSIDGTSKRVTVTFQDNKGTYEVGRP 243
+ +DG S +VTV QDN G Y+V P
Sbjct 308 LKVDGKSTKVTVNVQDNNGLYQVNPP 333
>gi|331697088|ref|YP_004333327.1| hypothetical protein Psed_3284 [Pseudonocardia dioxanivorans
CB1190]
gi|326951777|gb|AEA25474.1| hypothetical protein Psed_3284 [Pseudonocardia dioxanivorans
CB1190]
Length=683
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/85 (43%), Positives = 48/85 (57%), Gaps = 7/85 (8%)
Query 159 FWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTV 218
FW PGF VT D QAGV++VL TT YG V V C + VK G TF+C+
Sbjct 604 FWKPGFVVTRVFDQTALQAGVEKVL---TTDYG-HQVSAVSCPANRE--VKSGDTFQCSA 657
Query 219 SIDGT-SKRVTVTFQDNKGTYEVGR 242
++DG + +V + ++G YEVGR
Sbjct 658 TVDGEPNTQVPIHVTSSEGNYEVGR 682
>gi|302524039|ref|ZP_07276381.1| predicted protein [Streptomyces sp. AA4]
gi|302432934|gb|EFL04750.1| predicted protein [Streptomyces sp. AA4]
Length=225
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/85 (40%), Positives = 46/85 (55%), Gaps = 5/85 (5%)
Query 160 WAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVS 219
WA G F LD Q GV +VL + YG +VK+V C +G ++ G TF+CTV
Sbjct 146 WAAGVFSGDTLDQKSLQEGVSRVLNEN---YGEPDVKNVVCPSGQ--PIENGTTFDCTVQ 200
Query 220 IDGTSKRVTVTFQDNKGTYEVGRPQ 244
+ G K+VTV ++K Y VG P
Sbjct 201 VGGQPKKVTVRVLNDKPEYSVGAPH 225
>gi|134098835|ref|YP_001104496.1| hypothetical protein SACE_2266 [Saccharopolyspora erythraea NRRL
2338]
gi|133911458|emb|CAM01571.1| hypothetical protein SACE_2266 [Saccharopolyspora erythraea NRRL
2338]
Length=188
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/86 (40%), Positives = 51/86 (60%), Gaps = 5/86 (5%)
Query 159 FWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTV 218
F APG+FV + D + GV+Q L T Y +V V C +G V+ G +F+C V
Sbjct 108 FVAPGWFVRSVFDAESVEKGVRQTLE---TSYRLGDVGPVTCPSGQ--AVQPGHSFDCRV 162
Query 219 SIDGTSKRVTVTFQDNKGTYEVGRPQ 244
+++G + V VT +++KGTYEVG P+
Sbjct 163 TVEGRQQSVKVTVKNDKGTYEVGHPR 188
>gi|291005807|ref|ZP_06563780.1| hypothetical protein SeryN2_14903 [Saccharopolyspora erythraea
NRRL 2338]
Length=190
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/86 (40%), Positives = 51/86 (60%), Gaps = 5/86 (5%)
Query 159 FWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTV 218
F APG+FV + D + GV+Q L T Y +V V C +G V+ G +F+C V
Sbjct 110 FVAPGWFVRSVFDAESVEKGVRQTLE---TSYRLGDVGPVTCPSGQ--AVQPGHSFDCRV 164
Query 219 SIDGTSKRVTVTFQDNKGTYEVGRPQ 244
+++G + V VT +++KGTYEVG P+
Sbjct 165 TVEGRQQSVKVTVKNDKGTYEVGHPR 190
>gi|256376381|ref|YP_003100041.1| hypothetical protein Amir_2255 [Actinosynnema mirum DSM 43827]
gi|255920684|gb|ACU36195.1| hypothetical protein Amir_2255 [Actinosynnema mirum DSM 43827]
Length=228
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 56/153 (37%), Positives = 75/153 (50%), Gaps = 21/153 (13%)
Query 91 YGQYQQPGQYGQPGQYGQPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLF 150
YGQ QPGQYGQPGQ+GQPGQY +K+S +I GV+ + ++
Sbjct 95 YGQPGQPGQYGQPGQFGQPGQYG--------------PPPPAKKSNGLIWGVVVAVVLIA 140
Query 151 IGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKK 210
V I GF PG+F LD Q G+ QVL D+ + K C + V
Sbjct 141 A-VVGIGGFVTPGWFNKKVLDNTSVQNGIVQVLKDDY----KLDAKSASCT--GEHEVTP 193
Query 211 GATFECTVSIDGTSKRVTVTFQDNKGTYEVGRP 243
TFEC V + K+V +T + + G YEVG+P
Sbjct 194 NTTFECDVKVGDADKKVKITIKTSDGEYEVGQP 226
>gi|300782655|ref|YP_003762946.1| hypothetical protein AMED_0724 [Amycolatopsis mediterranei U32]
gi|299792169|gb|ADJ42544.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340524026|gb|AEK39231.1| hypothetical protein RAM_03695 [Amycolatopsis mediterranei S699]
Length=238
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 32/85 (38%), Positives = 45/85 (53%), Gaps = 5/85 (5%)
Query 160 WAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVS 219
W G F L + GV +VL + YG +VKD +C ++ K G TF+CTV+
Sbjct 159 WLLGAFRGDVLAQQSLEDGVVKVLNEN---YGEPDVKDAQCP--ANEAAKNGTTFDCTVT 213
Query 220 IDGTSKRVTVTFQDNKGTYEVGRPQ 244
I G K+VTV +++ YEVG P
Sbjct 214 IGGQQKKVTVRVLNDRPEYEVGAPH 238
>gi|302527221|ref|ZP_07279563.1| predicted protein [Streptomyces sp. AA4]
gi|302436116|gb|EFL07932.1| predicted protein [Streptomyces sp. AA4]
Length=258
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/85 (37%), Positives = 47/85 (56%), Gaps = 5/85 (5%)
Query 159 FWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTV 218
F APGFF T + + Q V ++LT+ Y + V V C ++ V+ GA FECT
Sbjct 176 FVAPGFFKTEVFNNTQMQTDVAKMLTET---YKIQGVSGVTCP--AEQKVEDGAKFECTA 230
Query 219 SIDGTSKRVTVTFQDNKGTYEVGRP 243
+++G ++V +T + N G YEV P
Sbjct 231 TVNGKPQQVPITVKGNDGNYEVSPP 255
>gi|330806033|ref|XP_003290979.1| hypothetical protein DICPUDRAFT_98866 [Dictyostelium purpureum]
gi|325078857|gb|EGC32486.1| hypothetical protein DICPUDRAFT_98866 [Dictyostelium purpureum]
Length=399
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/76 (53%), Positives = 41/76 (54%), Gaps = 11/76 (14%)
Query 59 PAYPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQ-YAPPGQ 117
P P QY P Y P Q+GA G PGQYG QQPGQYG PGQYG P Q Y PGQ
Sbjct 52 PQQPGQYGAPQQPYGAPGQYGAPQQPYGAPGQYGAPQQPGQYGAPGQYGAPQQPYGAPGQ 111
Query 118 Y----------PGQYG 123
Y PGQYG
Sbjct 112 YGAPQQPQYGAPGQYG 127
Score = 48.5 bits (114), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 58/169 (35%), Positives = 74/169 (44%), Gaps = 38/169 (22%)
Query 70 PGYAQPEQFGA-QPTQLGVPGQYGQYQQPGQYGQPGQYG--QPGQYAPPGQY----PGQY 122
P Y P Q+GA QP Q G PGQYG QQP YG PGQYG QP QY PGQY P QY
Sbjct 118 PQYGAPGQYGAPQPPQYGAPGQYGAPQQP--YGAPGQYGAPQPPQYGAPGQYGAPQPPQY 175
Query 123 GPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFWAPGFFVTTKLDVIKAQAGVQQV 182
G Q+ G +A G+ V V F D ++A+
Sbjct 176 GAPPQA-HGKPAPMAPPSGMGGVTLVKF-------------------FDHVRARYARDYT 215
Query 183 LTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTF 231
L + +G + ++ ++ V K A F C+ DG +TV F
Sbjct 216 LIIDKSGSMSGSLWKQ-----AEAAVAKIAPFACSADPDG----ITVYF 255
Score = 35.4 bits (80), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 35/68 (52%), Positives = 38/68 (56%), Gaps = 5/68 (7%)
Query 52 QYQQPADPAY--PQQYPQPTPGYAQPEQFGA-QPTQLGVPGQYGQYQQPGQYGQPGQ-YG 107
QY P P Y P QY P Y P Q+GA QP Q G PGQYG QP QYG P Q +G
Sbjct 125 QYGAPQPPQYGAPGQYGAPQQPYGAPGQYGAPQPPQYGAPGQYGA-PQPPQYGAPPQAHG 183
Query 108 QPGQYAPP 115
+P APP
Sbjct 184 KPAPMAPP 191
>gi|281208312|gb|EFA82490.1| hypothetical protein PPL_04915 [Polysphondylium pallidum PN500]
Length=460
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/80 (58%), Positives = 47/80 (59%), Gaps = 13/80 (16%)
Query 52 QYQQPADPAYPQ--QYPQPTPG-YAQPEQFGAQ-PTQLGVPGQYGQYQQPGQYGQPGQYG 107
QY QP AYP QY QP PG Y P Q+G Q P PGQYGQ Q PGQYGQP
Sbjct 168 QYGQPQPGAYPPPGQYGQPQPGAYPPPGQYGQQQPGAYPPPGQYGQPQAPGQYGQP---- 223
Query 108 QPGQYAPPGQY-----PGQY 122
QPG Y PPGQY PGQY
Sbjct 224 QPGAYPPPGQYGQPAAPGQY 243
Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 41/75 (55%), Positives = 42/75 (56%), Gaps = 14/75 (18%)
Query 66 PQPTPGY-AQPEQFGAQPTQLGVPGQYGQ-----YQQPGQYGQPGQYGQPGQYAPPGQY- 118
P PTPG A P Q+G P PGQYGQ Y PGQYGQP QPG Y PPGQY
Sbjct 143 PYPTPGQPAAPGQYGKPPAGAPAPGQYGQPQPGAYPPPGQYGQP----QPGAYPPPGQYG 198
Query 119 ---PGQYGPYGQSGQ 130
PG Y P GQ GQ
Sbjct 199 QQQPGAYPPPGQYGQ 213
Score = 39.7 bits (91), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 52/101 (52%), Positives = 53/101 (53%), Gaps = 25/101 (24%)
Query 48 AYTP--QYQQPADPAYPQQYPQPTPG-YAQPEQFGAQPTQLGVPGQYGQYQQPGQYGQP- 103
AY P QY QP P QY QP PG Y P Q+G PGQYGQ Q PGQYGQP
Sbjct 92 AYPPPGQYGQPQAPG---QYGQPQPGAYPPPGQYG----HPQAPGQYGQPQAPGQYGQPY 144
Query 104 ---GQYGQPGQY------AP-PGQY----PGQYGPYGQSGQ 130
GQ PGQY AP PGQY PG Y P GQ GQ
Sbjct 145 PTPGQPAAPGQYGKPPAGAPAPGQYGQPQPGAYPPPGQYGQ 185
>gi|227874619|ref|ZP_03992782.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35243]
gi|227844828|gb|EEJ54974.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35243]
Length=490
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/43 (63%), Positives = 32/43 (75%), Gaps = 1/43 (2%)
Query 81 QPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPGQYPGQYG 123
QP Q G PGQ GQ +QPG++GQ GQ GQPGQ PGQ PG++G
Sbjct 269 QPNQPGQPGQLGQTEQPGRFGQTGQPGQPGQTEQPGQ-PGRFG 310
>gi|72016099|ref|XP_786229.1| PREDICTED: similar to RNA-binding protein involved in epigenetic
programming of developmental genome rearrangements [Strongylocentrotus
purpuratus]
gi|115956992|ref|XP_001194435.1| PREDICTED: similar to RNA-binding protein involved in epigenetic
programming of developmental genome rearrangements [Strongylocentrotus
purpuratus]
Length=1584
Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 27/49 (56%), Positives = 30/49 (62%), Gaps = 1/49 (2%)
Query 81 QPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPGQYPGQYGPYGQSG 129
QP G PGQ G + PGQ G G GQPG + PPGQ PG +GP GQ G
Sbjct 581 QPGSHGPPGQPGSHGPPGQPGSHGPPGQPGSHGPPGQ-PGSHGPPGQPG 628
Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 27/49 (56%), Positives = 30/49 (62%), Gaps = 1/49 (2%)
Query 81 QPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPGQYPGQYGPYGQSG 129
QP G PGQ G + PGQ G G GQPG + PPGQ PG +GP GQ G
Sbjct 608 QPGSHGPPGQPGSHGPPGQPGSHGPPGQPGSHGPPGQ-PGSHGPPGQPG 655
>gi|256376356|ref|YP_003100016.1| hypothetical protein Amir_2230 [Actinosynnema mirum DSM 43827]
gi|255920659|gb|ACU36170.1| hypothetical protein Amir_2230 [Actinosynnema mirum DSM 43827]
Length=319
Score = 40.8 bits (94), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 25/42 (60%), Positives = 29/42 (70%), Gaps = 6/42 (14%)
Query 88 PGQYGQ---YQQPGQYGQPGQY---GQPGQYAPPGQYPGQYG 123
PGQ+GQ ++Q GQYGQPGQY GQPGQY PGQ+G
Sbjct 80 PGQFGQPGPFEQAGQYGQPGQYAQHGQPGQYGRQPGMPGQFG 121
Lambda K H
0.312 0.134 0.421
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 340053351120
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40