BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0879c Length=91 Score E Sequences producing significant alignments: (Bits) Value gi|15608019|ref|NP_215394.1| transmembrane protein [Mycobacteriu... 174 5e-42 gi|298524371|ref|ZP_07011780.1| predicted protein [Mycobacterium... 118 4e-25 gi|15828149|ref|NP_302412.1| hypothetical protein ML2141 [Mycoba... 109 1e-22 gi|118616145|ref|YP_904477.1| transmembrane protein [Mycobacteri... 109 1e-22 gi|183984620|ref|YP_001852911.1| transmembrane protein [Mycobact... 108 3e-22 gi|240171960|ref|ZP_04750619.1| hypothetical protein MkanA1_2178... 94.4 5e-18 gi|336461241|gb|EGO40116.1| Protein of unknown function (DUF2530... 94.4 5e-18 gi|41406916|ref|NP_959752.1| hypothetical protein MAP0818c [Myco... 94.0 6e-18 gi|120405986|ref|YP_955815.1| hypothetical protein Mvan_5037 [My... 93.6 8e-18 gi|108801432|ref|YP_641629.1| hypothetical protein Mmcs_4469 [My... 89.7 1e-16 gi|118473017|ref|YP_889922.1| hypothetical protein MSMEG_5689 [M... 88.6 3e-16 gi|342860491|ref|ZP_08717142.1| hypothetical protein MCOL_16491 ... 83.6 9e-15 gi|296169644|ref|ZP_06851262.1| hypothetical protein HMPREF0591_... 82.0 3e-14 gi|333989446|ref|YP_004522060.1| hypothetical protein JDM601_080... 80.9 7e-14 gi|169627977|ref|YP_001701626.1| hypothetical protein MAB_0878c ... 80.1 1e-13 gi|145222305|ref|YP_001132983.1| hypothetical protein Mflv_1714 ... 77.8 4e-13 gi|331694478|ref|YP_004330717.1| hypothetical protein Psed_0600 ... 60.1 1e-07 gi|302530546|ref|ZP_07282888.1| predicted protein [Streptomyces ... 59.7 1e-07 gi|300790290|ref|YP_003770581.1| hypothetical protein AMED_8483 ... 52.8 2e-05 gi|54022615|ref|YP_116857.1| hypothetical protein nfa6480 [Nocar... 48.9 2e-04 gi|317509208|ref|ZP_07966831.1| K1C25 keratin [Segniliparus rugo... 48.9 3e-04 gi|256374640|ref|YP_003098300.1| hypothetical protein Amir_0487 ... 47.4 7e-04 gi|319949872|ref|ZP_08023881.1| hypothetical protein ES5_10282 [... 47.0 9e-04 gi|343928224|ref|ZP_08767679.1| hypothetical protein GOALK_110_0... 46.2 0.002 gi|134097219|ref|YP_001102880.1| hypothetical protein SACE_0609 ... 45.8 0.002 gi|258654835|ref|YP_003203991.1| hypothetical protein Namu_4726 ... 44.7 0.004 gi|237785053|ref|YP_002905758.1| hypothetical protein ckrop_0438... 43.5 0.011 gi|254823195|ref|ZP_05228196.1| hypothetical protein MintA_24919... 42.7 0.019 gi|296395227|ref|YP_003660111.1| hypothetical protein Srot_2849 ... 42.4 0.025 gi|324999197|ref|ZP_08120309.1| hypothetical protein PseP1_10551... 40.8 0.062 gi|296138564|ref|YP_003645807.1| hypothetical protein Tpau_0832 ... 38.5 0.30 gi|262201230|ref|YP_003272438.1| hypothetical protein Gbro_1250 ... 38.5 0.31 gi|226308114|ref|YP_002768074.1| hypothetical protein RER_46270 ... 33.9 7.5 >gi|15608019|ref|NP_215394.1| transmembrane protein [Mycobacterium tuberculosis H37Rv] gi|15840293|ref|NP_335330.1| hypothetical protein MT0902 [Mycobacterium tuberculosis CDC1551] gi|31792067|ref|NP_854560.1| transmembrane protein [Mycobacterium bovis AF2122/97] 78 more sequence titlesLength=91 Score = 174 bits (440), Expect = 5e-42, Method: Compositional matrix adjust. Identities = 91/91 (100%), Positives = 91/91 (100%), Gaps = 0/91 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL Sbjct 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 Query 61 LGTTIFVWQLAAARRGARGAQAGLETYLDPK 91 LGTTIFVWQLAAARRGARGAQAGLETYLDPK Sbjct 61 LGTTIFVWQLAAARRGARGAQAGLETYLDPK 91 >gi|298524371|ref|ZP_07011780.1| predicted protein [Mycobacterium tuberculosis 94_M4241A] gi|298494165|gb|EFI29459.1| predicted protein [Mycobacterium tuberculosis 94_M4241A] Length=84 Score = 118 bits (295), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 62/62 (100%), Positives = 62/62 (100%), Gaps = 0/62 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL Sbjct 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 Query 61 LG 62 LG Sbjct 61 LG 62 >gi|15828149|ref|NP_302412.1| hypothetical protein ML2141 [Mycobacterium leprae TN] gi|221230626|ref|YP_002504042.1| hypothetical protein MLBr_02141 [Mycobacterium leprae Br4923] gi|2440101|emb|CAB16670.1| hypothetical protein MLCB57.30c [Mycobacterium leprae] gi|13093703|emb|CAC31096.1| conserved hypothetical protein [Mycobacterium leprae] gi|219933733|emb|CAR72238.1| conserved hypothetical protein [Mycobacterium leprae Br4923] Length=91 Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 70/91 (77%), Positives = 75/91 (83%), Gaps = 0/91 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 M VE + REPPPLP +LLEVWPVI VGALAWL+A VAAFVV L SWRPV +AGL GL Sbjct 1 MLVEPDRSREPPPLPSMLLEVWPVIMVGALAWLIAVVAAFVVSSLQSWRPVALAGLVVGL 60 Query 61 LGTTIFVWQLAAARRGARGAQAGLETYLDPK 91 GT IFVWQLAAARRGARGAQ GLETYLDP+ Sbjct 61 FGTGIFVWQLAAARRGARGAQGGLETYLDPR 91 >gi|118616145|ref|YP_904477.1| transmembrane protein [Mycobacterium ulcerans Agy99] gi|118568255|gb|ABL03006.1| conserved transmembrane protein [Mycobacterium ulcerans Agy99] Length=93 Score = 109 bits (272), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 71/91 (79%), Positives = 76/91 (84%), Gaps = 0/91 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 MS+E Q EPPPLP VLLEVWPVI VGALAWL AA+AAF+VP L WRPVTVAGL GL Sbjct 1 MSIEPDQSPEPPPLPAVLLEVWPVILVGALAWLSAAMAAFLVPSLQGWRPVTVAGLVVGL 60 Query 61 LGTTIFVWQLAAARRGARGAQAGLETYLDPK 91 GT +FVWQLAAARRGARGAQAGLETYLD + Sbjct 61 FGTCLFVWQLAAARRGARGAQAGLETYLDQQ 91 >gi|183984620|ref|YP_001852911.1| transmembrane protein [Mycobacterium marinum M] gi|183177946|gb|ACC43056.1| conserved transmembrane protein [Mycobacterium marinum M] Length=95 Score = 108 bits (270), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 71/91 (79%), Positives = 76/91 (84%), Gaps = 0/91 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 MS+E Q EPPPLP VLLEVWPVI VGALAWL AA+AAF+VP L WRPVTVAGL GL Sbjct 1 MSIEPDQSPEPPPLPAVLLEVWPVILVGALAWLGAAMAAFLVPSLQGWRPVTVAGLVVGL 60 Query 61 LGTTIFVWQLAAARRGARGAQAGLETYLDPK 91 GT +FVWQLAAARRGARGAQAGLETYLD + Sbjct 61 FGTCLFVWQLAAARRGARGAQAGLETYLDQQ 91 >gi|240171960|ref|ZP_04750619.1| hypothetical protein MkanA1_21780 [Mycobacterium kansasii ATCC 12478] Length=91 Score = 94.4 bits (233), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 72/91 (80%), Positives = 75/91 (83%), Gaps = 0/91 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 MS E EPPPLP LLEVWPVIAVGAL W+ AAVAAF VP L +WRPVTVAGLA GL Sbjct 1 MSDEPDHNLEPPPLPAALLEVWPVIAVGALGWVAAAVAAFAVPSLQTWRPVTVAGLAVGL 60 Query 61 LGTTIFVWQLAAARRGARGAQAGLETYLDPK 91 LGT IFVWQLAAARRGARGAQAGLETYLD + Sbjct 61 LGTGIFVWQLAAARRGARGAQAGLETYLDQQ 91 >gi|336461241|gb|EGO40116.1| Protein of unknown function (DUF2530) [Mycobacterium avium subsp. paratuberculosis S397] Length=95 Score = 94.4 bits (233), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 59/89 (67%), Positives = 68/89 (77%), Gaps = 0/89 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 MS E+ R PPLP LL VWP IA+GAL WL A AAF+VPGL WRPVT+AGL G+ Sbjct 5 MSTESGPNRTAPPLPAALLRVWPFIALGALGWLAAVGAAFLVPGLHCWRPVTLAGLGVGV 64 Query 61 LGTTIFVWQLAAARRGARGAQAGLETYLD 89 LGT+IF+ QLAAARRGARGAQ GLE +L+ Sbjct 65 LGTSIFLLQLAAARRGARGAQTGLENFLE 93 >gi|41406916|ref|NP_959752.1| hypothetical protein MAP0818c [Mycobacterium avium subsp. paratuberculosis K-10] gi|118465635|ref|YP_880267.1| hypothetical protein MAV_1008 [Mycobacterium avium 104] gi|254773893|ref|ZP_05215409.1| hypothetical protein MaviaA2_04352 [Mycobacterium avium subsp. avium ATCC 25291] gi|41395266|gb|AAS03135.1| hypothetical protein MAP_0818c [Mycobacterium avium subsp. paratuberculosis K-10] gi|118166922|gb|ABK67819.1| conserved hypothetical protein [Mycobacterium avium 104] Length=91 Score = 94.0 bits (232), Expect = 6e-18, Method: Compositional matrix adjust. Identities = 59/89 (67%), Positives = 68/89 (77%), Gaps = 0/89 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 MS E+ R PPLP LL VWP IA+GAL WL A AAF+VPGL WRPVT+AGL G+ Sbjct 1 MSTESGPNRTAPPLPAALLRVWPFIALGALGWLAAVGAAFLVPGLHCWRPVTLAGLGVGV 60 Query 61 LGTTIFVWQLAAARRGARGAQAGLETYLD 89 LGT+IF+ QLAAARRGARGAQ GLE +L+ Sbjct 61 LGTSIFLLQLAAARRGARGAQTGLENFLE 89 >gi|120405986|ref|YP_955815.1| hypothetical protein Mvan_5037 [Mycobacterium vanbaalenii PYR-1] gi|119958804|gb|ABM15809.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1] Length=83 Score = 93.6 bits (231), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 48/82 (59%), Positives = 55/82 (68%), Gaps = 0/82 (0%) Query 3 VENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGLLG 62 ++ +PPPLP LL WPVI V W++A V AF V L WRPVTVAGLA G LG Sbjct 1 MDTDSDPQPPPLPASLLNPWPVIVVITCGWVIATVLAFTVAALHQWRPVTVAGLAVGALG 60 Query 63 TTIFVWQLAAARRGARGAQAGL 84 TTIF+WQ A RRG RGAQ+GL Sbjct 61 TTIFLWQRHAVRRGHRGAQSGL 82 >gi|108801432|ref|YP_641629.1| hypothetical protein Mmcs_4469 [Mycobacterium sp. MCS] gi|119870585|ref|YP_940537.1| hypothetical protein Mkms_4556 [Mycobacterium sp. KMS] gi|126437417|ref|YP_001073108.1| hypothetical protein Mjls_4852 [Mycobacterium sp. JLS] gi|108771851|gb|ABG10573.1| conserved hypothetical protein [Mycobacterium sp. MCS] gi|119696674|gb|ABL93747.1| conserved hypothetical protein [Mycobacterium sp. KMS] gi|126237217|gb|ABO00618.1| hypothetical protein Mjls_4852 [Mycobacterium sp. JLS] Length=91 Score = 89.7 bits (221), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 48/83 (58%), Positives = 56/83 (68%), Gaps = 0/83 (0%) Query 3 VENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGLLG 62 S+ +PPPLP LL+ PVIAV A WLVA V AF V L WRP T+AGL G+ G Sbjct 9 TSESESPQPPPLPAGLLDPRPVIAVIAGVWLVATVLAFTVSTLHEWRPFTIAGLGIGVFG 68 Query 63 TTIFVWQLAAARRGARGAQAGLE 85 T+IF+WQ A RRGARGAQ+GL Sbjct 69 TSIFLWQRHAVRRGARGAQSGLR 91 >gi|118473017|ref|YP_889922.1| hypothetical protein MSMEG_5689 [Mycobacterium smegmatis str. MC2 155] gi|118174304|gb|ABK75200.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length=89 Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 46/74 (63%), Positives = 55/74 (75%), Gaps = 0/74 (0%) Query 11 PPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGLLGTTIFVWQL 70 PP LP LL+ PVI AL WLVA +AAF +P L +WRP TVAGL G+LGT+IF+WQ Sbjct 15 PPQLPRALLDPRPVIIAIALGWLVATIAAFTLPDLHAWRPYTVAGLGVGVLGTSIFLWQR 74 Query 71 AAARRGARGAQAGL 84 A RRG+RGAQ+GL Sbjct 75 RAVRRGSRGAQSGL 88 >gi|342860491|ref|ZP_08717142.1| hypothetical protein MCOL_16491 [Mycobacterium colombiense CECT 3035] gi|342132146|gb|EGT85387.1| hypothetical protein MCOL_16491 [Mycobacterium colombiense CECT 3035] Length=91 Score = 83.6 bits (205), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 60/91 (66%), Positives = 68/91 (75%), Gaps = 0/91 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 MS E + R PPLP LL VWP IA+GAL W+ A AAF+VP L WRPV +AGL G+ Sbjct 1 MSAEPGESRTAPPLPAALLHVWPFIALGALGWVAAVAAAFLVPALQGWRPVALAGLGVGV 60 Query 61 LGTTIFVWQLAAARRGARGAQAGLETYLDPK 91 LGT+IF+ QLAAARRGARGAQAGLE YLD K Sbjct 61 LGTSIFLLQLAAARRGARGAQAGLENYLDHK 91 >gi|296169644|ref|ZP_06851262.1| hypothetical protein HMPREF0591_4703 [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295895641|gb|EFG75337.1| hypothetical protein HMPREF0591_4703 [Mycobacterium parascrofulaceum ATCC BAA-614] Length=94 Score = 82.0 bits (201), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 65/89 (74%), Positives = 69/89 (78%), Gaps = 0/89 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 MSVE Q RE PPLPP LL WP IA+GAL WL A AAF+VP L SWRPVT+AGL G Sbjct 5 MSVEPGQNREAPPLPPALLNAWPFIALGALGWLAAVAAAFLVPALQSWRPVTLAGLGVGA 64 Query 61 LGTTIFVWQLAAARRGARGAQAGLETYLD 89 LGT+IFV QLAAARRGARGAQAGLE Y D Sbjct 65 LGTSIFVLQLAAARRGARGAQAGLENYFD 93 >gi|333989446|ref|YP_004522060.1| hypothetical protein JDM601_0806 [Mycobacterium sp. JDM601] gi|333485414|gb|AEF34806.1| hypothetical protein JDM601_0806 [Mycobacterium sp. JDM601] Length=74 Score = 80.9 bits (198), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 45/72 (63%), Positives = 51/72 (71%), Gaps = 0/72 (0%) Query 14 LPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGLLGTTIFVWQLAAA 73 +P LL WPVI GA + A +AAF +P L SWRPV+VAGL G LGTTIF+ Q AA Sbjct 1 MPTALLRAWPVIGAGAAGFGCATIAAFAIPALESWRPVSVAGLGVGALGTTIFLLQRTAA 60 Query 74 RRGARGAQAGLE 85 RRGARGAQ GLE Sbjct 61 RRGARGAQTGLE 72 >gi|169627977|ref|YP_001701626.1| hypothetical protein MAB_0878c [Mycobacterium abscessus ATCC 19977] gi|169239944|emb|CAM60972.1| Conserved hypothetical protein [Mycobacterium abscessus] Length=81 Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 40/63 (64%), Positives = 47/63 (75%), Gaps = 0/63 (0%) Query 22 WPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGLLGTTIFVWQLAAARRGARGAQ 81 WPV+ G + WL+A V AF VP SWRP+TVAGL TG LGTTI + Q+ AARRG+RGAQ Sbjct 19 WPVVVAGTVLWLIANVLAFTVPAFESWRPITVAGLGTGALGTTIVLLQVRAARRGSRGAQ 78 Query 82 AGL 84 GL Sbjct 79 TGL 81 >gi|145222305|ref|YP_001132983.1| hypothetical protein Mflv_1714 [Mycobacterium gilvum PYR-GCK] gi|315442739|ref|YP_004075618.1| hypothetical protein Mspyr1_11000 [Mycobacterium sp. Spyr1] gi|145214791|gb|ABP44195.1| hypothetical protein Mflv_1714 [Mycobacterium gilvum PYR-GCK] gi|315261042|gb|ADT97783.1| Protein of unknown function (DUF2530) [Mycobacterium sp. Spyr1] Length=83 Score = 77.8 bits (190), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 38/62 (62%), Positives = 45/62 (73%), Gaps = 0/62 (0%) Query 24 VIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGLLGTTIFVWQLAAARRGARGAQAG 83 VI V W+VAA+ AF V GL WRP+TVAGL G+LGT+IF+WQ +ARRG RGAQ G Sbjct 22 VIVVITCGWVVAALLAFTVDGLHDWRPITVAGLGVGVLGTSIFLWQRHSARRGHRGAQRG 81 Query 84 LE 85 L Sbjct 82 LN 83 >gi|331694478|ref|YP_004330717.1| hypothetical protein Psed_0600 [Pseudonocardia dioxanivorans CB1190] gi|326949167|gb|AEA22864.1| Protein of unknown function DUF2530 [Pseudonocardia dioxanivorans CB1190] Length=96 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 37/87 (43%), Positives = 48/87 (56%), Gaps = 5/87 (5%) Query 4 ENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVA---AFVVPG--LASWRPVTVAGLAT 58 + ++ PP LPP L ++ + +VG L WL AV A +V G L W VAG+ Sbjct 10 DTDVMQNPPELPPRLSDIGVIASVGTLLWLAGAVGLLVAHIVGGRPLDLWFGTCVAGVVL 69 Query 59 GLLGTTIFVWQLAAARRGARGAQAGLE 85 G +G IF WQ +AARRG R AQ GL Sbjct 70 GGIGYAIFRWQRSAARRGTRTAQQGLH 96 >gi|302530546|ref|ZP_07282888.1| predicted protein [Streptomyces sp. AA4] gi|302439441|gb|EFL11257.1| predicted protein [Streptomyces sp. AA4] Length=88 Score = 59.7 bits (143), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 33/74 (45%), Positives = 42/74 (57%), Gaps = 1/74 (1%) Query 8 IREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGLLGTTIFV 67 +R P LP L ++ PV+ VG WLVA V F G W +AG+A G +G I Sbjct 14 LRHTPDLPKRLTDLTPVVIVGTALWLVALVVLFFTSG-GVWLQTCIAGIALGFIGFGIIF 72 Query 68 WQLAAARRGARGAQ 81 WQ AAARRG++ AQ Sbjct 73 WQRAAARRGSKSAQ 86 >gi|300790290|ref|YP_003770581.1| hypothetical protein AMED_8483 [Amycolatopsis mediterranei U32] gi|299799804|gb|ADJ50179.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] gi|340531971|gb|AEK47176.1| hypothetical protein RAM_43545 [Amycolatopsis mediterranei S699] Length=75 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 32/75 (43%), Positives = 41/75 (55%), Gaps = 3/75 (4%) Query 8 IREPPPLPPVLLEVWPVIAVGALAWLVAAVAAF-VVPGLASWRPVTVAGLATGLLGTTIF 66 +R P LP L ++ PV+ VG W VA V F GL W ++G A G +G I Sbjct 1 MRHTPELPKRLTDLTPVVIVGTSIWAVALVVLFFTTSGL--WVQTALSGFALGFVGLAII 58 Query 67 VWQLAAARRGARGAQ 81 WQ AAARRG++ AQ Sbjct 59 AWQRAAARRGSKSAQ 73 >gi|54022615|ref|YP_116857.1| hypothetical protein nfa6480 [Nocardia farcinica IFM 10152] gi|54014123|dbj|BAD55493.1| hypothetical protein [Nocardia farcinica IFM 10152] Length=79 Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 40/76 (53%), Positives = 47/76 (62%), Gaps = 5/76 (6%) Query 12 PPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASW---RPVTVAGLATGLLGTTIFVW 68 P +PP L + PV+AVG L WLVA V +V G W RPV V GL GL+G IF+ Sbjct 6 PQIPPRLTDPRPVLAVGTLLWLVALVVVWV--GGPRWEAARPVCVMGLVVGLIGLVIFLV 63 Query 69 QLAAARRGARGAQAGL 84 Q AARRG +GAQ GL Sbjct 64 QRRAARRGDKGAQTGL 79 >gi|317509208|ref|ZP_07966831.1| K1C25 keratin [Segniliparus rugosus ATCC BAA-974] gi|316252508|gb|EFV11955.1| K1C25 keratin [Segniliparus rugosus ATCC BAA-974] Length=85 Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 32/85 (38%), Positives = 43/85 (51%), Gaps = 0/85 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 M+ EN+Q +P L + PV+ V + WLV + + A W +AGL G Sbjct 1 MANENTQRAKPVALAAKWRDPIPVVVVVSALWLVGLAVSLISGHWAGWGQACLAGLVVGA 60 Query 61 LGTTIFVWQLAAARRGARGAQAGLE 85 LG +F+ Q AARRG R AQ GL Sbjct 61 LGFALFLSQRRAARRGDRTAQRGLS 85 >gi|256374640|ref|YP_003098300.1| hypothetical protein Amir_0487 [Actinosynnema mirum DSM 43827] gi|255918943|gb|ACU34454.1| hypothetical protein Amir_0487 [Actinosynnema mirum DSM 43827] Length=81 Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 33/79 (42%), Positives = 44/79 (56%), Gaps = 12/79 (15%) Query 9 REPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVV--PGLASWRPVTVAGLATGLLGTTIF 66 R P+PP I VG+ WLVA V ++ P + W AG A G++G IF Sbjct 13 RLTDPVPP--------IVVGSALWLVALVVTLLIGAPAVVVW--TCAAGGALGVIGFGIF 62 Query 67 VWQLAAARRGARGAQAGLE 85 WQ +AARRG+R AQ+GL+ Sbjct 63 AWQRSAARRGSRTAQSGLQ 81 >gi|319949872|ref|ZP_08023881.1| hypothetical protein ES5_10282 [Dietzia cinnamea P4] gi|319436462|gb|EFV91573.1| hypothetical protein ES5_10282 [Dietzia cinnamea P4] Length=89 Score = 47.0 bits (110), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 31/90 (35%), Positives = 48/90 (54%), Gaps = 2/90 (2%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVA-GLATG 59 M+ E R PP+P LL+ PV+ +G LAW +A +A ++ + V A G+ G Sbjct 1 MTDEAKDTR-VPPIPTYLLDPRPVVILGTLAWTLALLAHMLMTDVDMRTVVLCAVGIGVG 59 Query 60 LLGTTIFVWQLAAARRGARGAQAGLETYLD 89 +G+ ++ Q A RG+ GAQ GL+ D Sbjct 60 AVGSLVYALQRRAVLRGSAGAQRGLDFDRD 89 >gi|343928224|ref|ZP_08767679.1| hypothetical protein GOALK_110_00650 [Gordonia alkanivorans NBRC 16433] gi|343761922|dbj|GAA14605.1| hypothetical protein GOALK_110_00650 [Gordonia alkanivorans NBRC 16433] Length=82 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. Identities = 41/78 (53%), Positives = 48/78 (62%), Gaps = 3/78 (3%) Query 10 EPPPLPPVLLEVWPVIAVGALAWLVAAVAAFV--VPGLASWRPVTVAGLATGLLGTTIFV 67 + P LP VL PVI VG +AWLVA V V V G + R + + GL GLLGTTI + Sbjct 6 DVPELPRVLRAPEPVIVVGMVAWLVATVVVAVNDVGGGNALR-ICLVGLGVGLLGTTIVL 64 Query 68 WQLAAARRGARGAQAGLE 85 Q A RRGARGAQ GL+ Sbjct 65 VQRAGVRRGARGAQEGLD 82 >gi|134097219|ref|YP_001102880.1| hypothetical protein SACE_0609 [Saccharopolyspora erythraea NRRL 2338] gi|291005323|ref|ZP_06563296.1| hypothetical protein SeryN2_12457 [Saccharopolyspora erythraea NRRL 2338] gi|133909842|emb|CAL99954.1| hypothetical protein SACE_0609 [Saccharopolyspora erythraea NRRL 2338] Length=95 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 31/79 (40%), Positives = 41/79 (52%), Gaps = 4/79 (5%) Query 9 REPPPLPPVLLEVWPVIAVGALAWLVAAVA-AFVVPG-LASWRPVTVAGLATGLLGTTIF 66 R P LP L E PV+ G + W AV A PG + W + G G++G +F Sbjct 15 RHVPALPRRLAEPTPVVLAGTVLWCAGAVVFAVTDPGSIQFW--TSAVGCGLGVIGYGVF 72 Query 67 VWQLAAARRGARGAQAGLE 85 WQ +A+RRG+RGA GL Sbjct 73 RWQRSASRRGSRGAWKGLS 91 >gi|258654835|ref|YP_003203991.1| hypothetical protein Namu_4726 [Nakamurella multipartita DSM 44233] gi|258558060|gb|ACV81002.1| hypothetical protein Namu_4726 [Nakamurella multipartita DSM 44233] Length=99 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 32/77 (42%), Positives = 39/77 (51%), Gaps = 9/77 (11%) Query 21 VWPVIAVGALAWLVAAVAA-FVVPGLAS-----WRPVTVAGLATGLLGTTIFVWQLAAAR 74 VW V G + WLVA V F +P L S W +AG TGLLG +++ WQ AAR Sbjct 25 VWHVAVPGTIIWLVAFVVLLFFIPQLQSNDAMIWLWTCLAGFLTGLLGLSVYGWQRRAAR 84 Query 75 RGARGAQAGLETYLDPK 91 G R A + LD K Sbjct 85 LGKRSAN---QMALDEK 98 >gi|237785053|ref|YP_002905758.1| hypothetical protein ckrop_0438 [Corynebacterium kroppenstedtii DSM 44385] gi|237757965|gb|ACR17215.1| hypothetical protein ckrop_0438 [Corynebacterium kroppenstedtii DSM 44385] Length=120 Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust. Identities = 27/77 (36%), Positives = 41/77 (54%), Gaps = 5/77 (6%) Query 12 PPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLAS---WRPVTVAGLATGLLGTTIFVW 68 P LP L + P + VG + W++ AV + G W ++ G TG+LG +++W Sbjct 46 PRLPRALEDPRPALIVGIIVWVLGAVCFGLTRGWTDQLVW--TSIIGAVTGVLGYGVYMW 103 Query 69 QLAAARRGARGAQAGLE 85 Q A RRGA+ Q GL+ Sbjct 104 QREAVRRGAQWVQDGLD 120 >gi|254823195|ref|ZP_05228196.1| hypothetical protein MintA_24919 [Mycobacterium intracellulare ATCC 13950] Length=95 Score = 42.7 bits (99), Expect = 0.019, Method: Compositional matrix adjust. Identities = 64/91 (71%), Positives = 70/91 (77%), Gaps = 0/91 (0%) Query 1 MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGL 60 MS E R PPLP VL+ W IA+GAL W+ AA AAF+VP LASWRPVT+AGL GL Sbjct 5 MSAEPDHHRALPPLPAVLVHAWAFIALGALGWVAAAAAAFLVPALASWRPVTLAGLGVGL 64 Query 61 LGTTIFVWQLAAARRGARGAQAGLETYLDPK 91 LGTTIF+ QLAAARRGARGAQAGLE YLD K Sbjct 65 LGTTIFLLQLAAARRGARGAQAGLENYLDEK 95 >gi|296395227|ref|YP_003660111.1| hypothetical protein Srot_2849 [Segniliparus rotundus DSM 44985] gi|296182374|gb|ADG99280.1| Protein of unknown function DUF2530 [Segniliparus rotundus DSM 44985] Length=85 Score = 42.4 bits (98), Expect = 0.025, Method: Compositional matrix adjust. Identities = 25/63 (40%), Positives = 32/63 (51%), Gaps = 0/63 (0%) Query 23 PVIAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGLLGTTIFVWQLAAARRGARGAQA 82 P + ++AWLV + A W +AGL G LG +F+ Q AARRG R AQ Sbjct 23 PFVGALSVAWLVGLAVSLATGHWAGWGQTCLAGLVVGALGFALFLSQRRAARRGDRTAQR 82 Query 83 GLE 85 GL Sbjct 83 GLR 85 >gi|324999197|ref|ZP_08120309.1| hypothetical protein PseP1_10551 [Pseudonocardia sp. P1] Length=93 Score = 40.8 bits (94), Expect = 0.062, Method: Compositional matrix adjust. Identities = 34/75 (46%), Positives = 38/75 (51%), Gaps = 10/75 (13%) Query 15 PPVLLE---VWPV-IAVGALAWLVAAVAAFVVPGLASWRPVTVAGLATGLLGTTIFVWQL 70 PPVLL +W V V L+WLV P A +AGL GL G IF Q Sbjct 23 PPVLLGGMLLWAVATGVVVLSWLVGG------PWDAGSFWTCIAGLGVGLFGAAIFTAQR 76 Query 71 AAARRGARGAQAGLE 85 AAARRG R AQ G+E Sbjct 77 AAARRGDRSAQRGIE 91 >gi|296138564|ref|YP_003645807.1| hypothetical protein Tpau_0832 [Tsukamurella paurometabola DSM 20162] gi|296026698|gb|ADG77468.1| conserved hypothetical protein [Tsukamurella paurometabola DSM 20162] Length=89 Score = 38.5 bits (88), Expect = 0.30, Method: Compositional matrix adjust. Identities = 28/75 (38%), Positives = 37/75 (50%), Gaps = 3/75 (4%) Query 12 PPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGL--ASWRPVTVAGLATGLLGTTIFVWQ 69 P LP L E PV+ G + + ++ V + A+W V G+ GLLG + Q Sbjct 15 PELPRSLTEPQPVVIAGTVLFAISLVITLIARDTFGAAW-IVCACGVGVGLLGAAVLTAQ 73 Query 70 LAAARRGARGAQAGL 84 AA RG RGAQ GL Sbjct 74 RRAALRGDRGAQRGL 88 >gi|262201230|ref|YP_003272438.1| hypothetical protein Gbro_1250 [Gordonia bronchialis DSM 43247] gi|262084577|gb|ACY20545.1| hypothetical protein Gbro_1250 [Gordonia bronchialis DSM 43247] Length=82 Score = 38.5 bits (88), Expect = 0.31, Method: Compositional matrix adjust. Identities = 35/75 (47%), Positives = 42/75 (56%), Gaps = 1/75 (1%) Query 12 PPLPPVLLEVWPVIAVGALAWLVAAVAAFVVP-GLASWRPVTVAGLATGLLGTTIFVWQL 70 P LP +L PVI +G LAW VA + ++ G V + GL GLLGTTI Q Sbjct 8 PELPRLLRAPEPVIVIGMLAWAVATLVVWLADVGGDRALQVCLVGLGVGLLGTTIVTVQR 67 Query 71 AAARRGARGAQAGLE 85 A RRG RGAQ GL+ Sbjct 68 AGVRRGDRGAQEGLD 82 >gi|226308114|ref|YP_002768074.1| hypothetical protein RER_46270 [Rhodococcus erythropolis PR4] gi|226187231|dbj|BAH35335.1| hypothetical protein RER_46270 [Rhodococcus erythropolis PR4] Length=85 Score = 33.9 bits (76), Expect = 7.5, Method: Compositional matrix adjust. Identities = 22/62 (36%), Positives = 31/62 (50%), Gaps = 1/62 (1%) Query 23 PVIAVGALAWLVAAVAAFVVPGL-ASWRPVTVAGLATGLLGTTIFVWQLAAARRGARGAQ 81 P + +G W+VA + V + P+ AG+ G LG +F Q AA RG +GAQ Sbjct 19 PALIIGTALWVVATIVVAVSGDRWSDALPICYAGIGVGFLGLALFSIQRRAAVRGDKGAQ 78 Query 82 AG 83 G Sbjct 79 QG 80 Lambda K H 0.321 0.136 0.430 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 128725229700 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40