BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0098 Length=183 Score E Sequences producing significant alignments: (Bits) Value gi|15607240|ref|NP_214612.1| hypothetical protein Rv0098 [Mycoba... 380 5e-104 gi|340625134|ref|YP_004743586.1| hypothetical protein MCAN_01011... 378 2e-103 gi|308231460|ref|ZP_07412517.2| hypothetical protein TMAG_01230 ... 353 4e-96 gi|183980291|ref|YP_001848582.1| hypothetical protein MMAR_0259 ... 300 9e-80 gi|118619895|ref|YP_908227.1| hypothetical protein MUL_4855 [Myc... 295 2e-78 gi|15828081|ref|NP_302344.1| hypothetical protein ML1993 [Mycoba... 252 1e-65 gi|169627763|ref|YP_001701412.1| hypothetical protein MAB_0660 [... 246 1e-63 gi|317506387|ref|ZP_07964196.1| hypothetical protein HMPREF9336_... 234 6e-60 gi|297153929|gb|ADI03641.1| hypothetical protein SBI_00520 [Stre... 125 2e-27 gi|329940137|ref|ZP_08289419.1| hypothetical protein SGM_4911 [S... 119 3e-25 gi|29827147|ref|NP_821781.1| hypothetical protein SAV_606 [Strep... 110 9e-23 gi|335058601|gb|AEH26491.1| hypothetical protein [uncultured Aci... 109 2e-22 gi|302527904|ref|ZP_07280246.1| conserved hypothetical protein [... 105 4e-21 gi|226973312|gb|ACO94459.1| putative NRPS accessory protein [Str... 102 2e-20 gi|302547444|ref|ZP_07299786.1| conserved hypothetical protein [... 101 5e-20 gi|226973354|gb|ACO94487.1| putative NRPS accessory protein [Str... 99.4 2e-19 gi|261855558|ref|YP_003262841.1| hypothetical protein Hneap_0956... 99.0 3e-19 gi|262199175|ref|YP_003270384.1| hypothetical protein Hoch_6016 ... 92.8 2e-17 gi|124007605|ref|ZP_01692309.1| conserved hypothetical protein [... 92.8 2e-17 gi|115374318|ref|ZP_01461602.1| conserved hypothetical protein [... 91.7 4e-17 gi|274321839|ref|NP_001162084.1| UL16-binding protein 17 [Bos ta... 35.8 2.7 gi|312216588|emb|CBX96538.1| similar to DEAD/DEAH box helicase [... 35.4 4.0 gi|281337283|gb|EFB12867.1| hypothetical protein PANDA_022448 [A... 35.0 5.4 gi|301792859|ref|XP_002931396.1| PREDICTED: disintegrin and meta... 34.7 6.5 gi|291410863|ref|XP_002721720.1| PREDICTED: ADAM metallopeptidas... 34.3 8.9 >gi|15607240|ref|NP_214612.1| hypothetical protein Rv0098 [Mycobacterium tuberculosis H37Rv] gi|15839478|ref|NP_334515.1| hypothetical protein MT0107 [Mycobacterium tuberculosis CDC1551] gi|31791276|ref|NP_853769.1| hypothetical protein Mb0101 [Mycobacterium bovis AF2122/97] 69 more sequence titlesLength=183 Score = 380 bits (975), Expect = 5e-104, Method: Compositional matrix adjust. Identities = 183/183 (100%), Positives = 183/183 (100%), Gaps = 0/183 (0%) Query 1 MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFT 60 MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFT Sbjct 1 MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFT 60 Query 61 IGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLI 120 IGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLI Sbjct 61 IGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLI 120 Query 121 RKASSRFRKPLNPQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAAL 180 RKASSRFRKPLNPQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAAL Sbjct 121 RKASSRFRKPLNPQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAAL 180 Query 181 NIP 183 NIP Sbjct 181 NIP 183 >gi|340625134|ref|YP_004743586.1| hypothetical protein MCAN_01011 [Mycobacterium canettii CIPT 140010059] gi|340003324|emb|CCC42442.1| conserved hypothetical protein [Mycobacterium canettii CIPT 140010059] Length=183 Score = 378 bits (971), Expect = 2e-103, Method: Compositional matrix adjust. Identities = 182/183 (99%), Positives = 183/183 (100%), Gaps = 0/183 (0%) Query 1 MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFT 60 MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFT Sbjct 1 MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFT 60 Query 61 IGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLI 120 IGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLI Sbjct 61 IGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLI 120 Query 121 RKASSRFRKPLNPQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAAL 180 RKASSRFRKPLNPQKFSARLLCRDL+VIERTWRYLKVPCVIEFWDENGGAASGEIELAAL Sbjct 121 RKASSRFRKPLNPQKFSARLLCRDLRVIERTWRYLKVPCVIEFWDENGGAASGEIELAAL 180 Query 181 NIP 183 NIP Sbjct 181 NIP 183 >gi|308231460|ref|ZP_07412517.2| hypothetical protein TMAG_01230 [Mycobacterium tuberculosis SUMu001] gi|308369302|ref|ZP_07417262.2| hypothetical protein TMBG_02566 [Mycobacterium tuberculosis SUMu002] gi|308371584|ref|ZP_07425403.2| hypothetical protein TMDG_01567 [Mycobacterium tuberculosis SUMu004] 9 more sequence titles Length=172 Score = 353 bits (907), Expect = 4e-96, Method: Compositional matrix adjust. Identities = 171/172 (99%), Positives = 172/172 (100%), Gaps = 0/172 (0%) Query 12 VLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFTIGESAYIRSTG 71 +LASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFTIGESAYIRSTG Sbjct 1 MLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFTIGESAYIRSTG 60 Query 72 HFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPL 131 HFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPL Sbjct 61 HFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPL 120 Query 132 NPQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAALNIP 183 NPQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAALNIP Sbjct 121 NPQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAALNIP 172 >gi|183980291|ref|YP_001848582.1| hypothetical protein MMAR_0259 [Mycobacterium marinum M] gi|183173617|gb|ACC38727.1| conserved hypothetical protein [Mycobacterium marinum M] Length=184 Score = 300 bits (767), Expect = 9e-80, Method: Compositional matrix adjust. Identities = 143/184 (78%), Positives = 160/184 (87%), Gaps = 1/184 (0%) Query 1 MSHTDLT-PCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNF 59 MS TDLT P + S+GT IAE+LLARVLEPYS KGCRYL+DA+Y A +DSVLAYGNF Sbjct 1 MSTTDLTSPAHAAVESAGTTAIAEDLLARVLEPYSYKGCRYLLDARYHADDDSVLAYGNF 60 Query 60 TIGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSML 119 TI ESAYIRSTGHFNAVELILCFNQLAYSAFAPAV NEEI LRGWS++DY QHQL+SM Sbjct 61 TISESAYIRSTGHFNAVELILCFNQLAYSAFAPAVANEEIPQLRGWSLEDYFQHQLASMF 120 Query 120 IRKASSRFRKPLNPQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAA 179 IR +SSRF +P+NP KFSARL CR+LQV++RTWRYL VPC IEFWDE+GGAASGE+ELAA Sbjct 121 IRNSSSRFNRPINPAKFSARLQCRNLQVVQRTWRYLLVPCAIEFWDEDGGAASGEVELAA 180 Query 180 LNIP 183 LNIP Sbjct 181 LNIP 184 >gi|118619895|ref|YP_908227.1| hypothetical protein MUL_4855 [Mycobacterium ulcerans Agy99] gi|118572005|gb|ABL06756.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99] Length=184 Score = 295 bits (755), Expect = 2e-78, Method: Compositional matrix adjust. Identities = 141/184 (77%), Positives = 158/184 (86%), Gaps = 1/184 (0%) Query 1 MSHTDLT-PCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNF 59 MS TDLT P + S+GT IAE+LLARVLEPYS KGCRYL+DA+Y A +DSVLAYGNF Sbjct 1 MSTTDLTSPAHAAVESAGTTAIAEDLLARVLEPYSYKGCRYLLDARYHADDDSVLAYGNF 60 Query 60 TIGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSML 119 TI ESAYIRSTGHFNAVELILCFNQLAYSAFAPAV NEEI LRGWS++DY QHQL+SM Sbjct 61 TISESAYIRSTGHFNAVELILCFNQLAYSAFAPAVANEEIPQLRGWSLEDYFQHQLASMF 120 Query 120 IRKASSRFRKPLNPQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAA 179 IR +SSRF +P+N KFS RL CR+LQV++RTWRYL VPC IEFWDE+GGAASGE+ELAA Sbjct 121 IRNSSSRFNRPINTAKFSGRLQCRNLQVVQRTWRYLLVPCAIEFWDEDGGAASGEVELAA 180 Query 180 LNIP 183 LNIP Sbjct 181 LNIP 184 >gi|15828081|ref|NP_302344.1| hypothetical protein ML1993 [Mycobacterium leprae TN] gi|221230558|ref|YP_002503974.1| hypothetical protein MLBr_01993 [Mycobacterium leprae Br4923] gi|13093635|emb|CAC30948.1| conserved hypothetical protein [Mycobacterium leprae] gi|219933665|emb|CAR72090.1| conserved hypothetical protein [Mycobacterium leprae Br4923] Length=183 Score = 252 bits (644), Expect = 1e-65, Method: Compositional matrix adjust. Identities = 118/180 (66%), Positives = 145/180 (81%), Gaps = 1/180 (0%) Query 5 DLTPCTRVLA-SSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFTIGE 63 +L P + A S+ T P+AE+LL +LEPYS KGCRYLIDAQY A+ DSV AYGNF I E Sbjct 4 ELAPSLQNAAESTNTFPMAEDLLGSILEPYSYKGCRYLIDAQYRASPDSVFAYGNFGIEE 63 Query 64 SAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKA 123 SAYIR TGHFNAVEL+LCFNQL YSA+A +V+N++I LRGWSI DYC++QLS +LI+ Sbjct 64 SAYIRGTGHFNAVELMLCFNQLGYSAYAQSVVNKDISALRGWSIADYCRNQLSGILIKNT 123 Query 124 SSRFRKPLNPQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAALNIP 183 SSRF+K +NPQKFSARL DL+++ERTWRYL++ IEFWD+NGG+A GE E+A LNIP Sbjct 124 SSRFKKLINPQKFSARLHVYDLRIVERTWRYLQLSNTIEFWDDNGGSAIGEFEVAILNIP 183 >gi|169627763|ref|YP_001701412.1| hypothetical protein MAB_0660 [Mycobacterium abscessus ATCC 19977] gi|169239730|emb|CAM60758.1| Conserved hypothetical protein [Mycobacterium abscessus] Length=184 Score = 246 bits (627), Expect = 1e-63, Method: Compositional matrix adjust. Identities = 110/171 (65%), Positives = 139/171 (82%), Gaps = 0/171 (0%) Query 13 LASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFTIGESAYIRSTGH 72 L S+G VP++ LL+RVLEPYS KGCRYL DA+Y T D+V A GNF I ESAYIRSTGH Sbjct 14 LPSAGMVPMSAALLSRVLEPYSYKGCRYLNDAEYGITSDAVHAQGNFAIDESAYIRSTGH 73 Query 73 FNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLN 132 FNAVEL+LCFNQLAY AFA V+N++I RGWSIDDYC++QL+SMLI+ SSRFR+ +N Sbjct 74 FNAVELVLCFNQLAYCAFAQGVVNDDIWAFRGWSIDDYCENQLASMLIKSTSSRFRRLVN 133 Query 133 PQKFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAALNIP 183 P++FSARL RD +++R+WRYL+ IEFWD++GG+A+GE ++A LNIP Sbjct 134 PRQFSARLQARDFHIVDRSWRYLQFHSTIEFWDQDGGSAAGEFDIAVLNIP 184 >gi|317506387|ref|ZP_07964196.1| hypothetical protein HMPREF9336_00566 [Segniliparus rugosus ATCC BAA-974] gi|316255316|gb|EFV14577.1| hypothetical protein HMPREF9336_00566 [Segniliparus rugosus ATCC BAA-974] Length=193 Score = 234 bits (596), Expect = 6e-60, Method: Compositional matrix adjust. Identities = 105/167 (63%), Positives = 132/167 (80%), Gaps = 0/167 (0%) Query 17 GTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFTIGESAYIRSTGHFNAV 76 GT PI EELLA+VLEPYS KGCRYL+DA Y +D V A +F I ESAYIRSTGHFNAV Sbjct 16 GTAPIPEELLAKVLEPYSHKGCRYLVDADYGIADDVVHARASFGIEESAYIRSTGHFNAV 75 Query 77 ELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKF 136 EL+LCFNQL YSAFA +NE+I GWSIDDYC++QL+ MLI+ ASSRF++ ++ ++F Sbjct 76 ELVLCFNQLTYSAFAQGCVNEDIDAFLGWSIDDYCRNQLAGMLIKSASSRFKRQIDARRF 135 Query 137 SARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAALNIP 183 SARL D +++ERTWRYL+ V+EFWD++GG A+GE E+A LN+P Sbjct 136 SARLRAHDFRIVERTWRYLQFQSVVEFWDQDGGLAAGEFEIAVLNLP 182 >gi|297153929|gb|ADI03641.1| hypothetical protein SBI_00520 [Streptomyces bingchenggensis BCW-1] Length=188 Score = 125 bits (315), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 72/177 (41%), Positives = 97/177 (55%), Gaps = 11/177 (6%) Query 15 SSGTV---PIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFTIGESAYIRSTG 71 ++GTV P L RVL PY CRYL A+ ++V A G F IGES YI TG Sbjct 10 TAGTVTAHPDDPALRERVLRPYRTN-CRYLGPAEVRLENEAVTASGTFRIGESCYIDDTG 68 Query 72 HFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPL 131 HFNAVE +CFNQ+AY A N V W++DDY Q QLS +LI + +SRF + + Sbjct 69 HFNAVEFNICFNQIAYYLIAKCAQNRLPPVFAAWTLDDYWQRQLSDILITRFTSRFHRAI 128 Query 132 NPQKFSARLLCRDLQVIERTWRYLKVPCVI-----EFWDENGGAASGEIELAALNIP 183 Q+F L D ++ R+ + P + FWD+ GG +GEI++A N+P Sbjct 129 RGQEFRGELSFTDFKL--RSGENGRAPLALIESECRFWDKGGGRCTGEIQIALTNLP 183 >gi|329940137|ref|ZP_08289419.1| hypothetical protein SGM_4911 [Streptomyces griseoaurantiacus M045] gi|329300963|gb|EGG44859.1| hypothetical protein SGM_4911 [Streptomyces griseoaurantiacus M045] Length=175 Score = 119 bits (297), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 63/162 (39%), Positives = 91/162 (57%), Gaps = 3/162 (1%) Query 24 ELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFTIGESAYIRSTGHFNAVELILCFN 83 ELL VL PY RYL +A+ + D+V A G F I ES YI TGHFNAVE +C+N Sbjct 11 ELLDGVLRPYR-DHSRYLREAEVTGRPDTVSARGRFGIAESCYIDDTGHFNAVEFNICYN 69 Query 84 QLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKFSARLLCR 143 QL Y A V + W++ D+ + QLS +LI + SRFR+ +NP+ F ++ + Sbjct 70 QLGYYLIAKCVEDRLFEAFSTWTMADFWERQLSDVLIYRLGSRFRRMINPRAFEGEVVFQ 129 Query 144 DLQVIERTWR--YLKVPCVIEFWDENGGAASGEIELAALNIP 183 + +V R R + + FWD+ GG+A G ++LA +P Sbjct 130 EPRVSVRPGRPAVMSMETTCSFWDDEGGSADGTVKLAFTRLP 171 >gi|29827147|ref|NP_821781.1| hypothetical protein SAV_606 [Streptomyces avermitilis MA-4680] gi|15824220|dbj|BAB69378.1| hypothetical protein [Streptomyces avermitilis] gi|29604245|dbj|BAC68316.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length=172 Score = 110 bits (275), Expect = 9e-23, Method: Compositional matrix adjust. Identities = 62/170 (37%), Positives = 92/170 (55%), Gaps = 10/170 (5%) Query 20 PIAEELLARVLEPYSCKGCRYLIDAQYSATEDS-VLAYGNFTIGESAYIRSTGHFNAVEL 78 P E+LLARVL PY C+YL A + ++ +A F I ES YI TGH N+VE+ Sbjct 7 PTDEDLLARVLVPYK-DHCKYLRSAVVTESDTGRAVARCEFAIPESCYIDDTGHLNSVEV 65 Query 79 ILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKFSA 138 +C+NQ+ Y A +V + W++DD+ +HQL +LI + +S FR+P+NP+ FS Sbjct 66 NICYNQMMYYLVAKSVKEGLLAGFESWTLDDFWKHQLPDILIARFASNFRRPVNPRAFSG 125 Query 139 RLLCRDLQVIER-----TWRYLKVPCVIEFWDENGGAASGEIELAALNIP 183 + + Q + R +L +WD + G GE LA +NIP Sbjct 126 EM---EFQSVTRRAPAGGIPFLHAETAYRYWDADSGRCDGEAVLAFVNIP 172 >gi|335058601|gb|AEH26491.1| hypothetical protein [uncultured Acidobacteria bacterium A3] Length=181 Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 67/166 (41%), Positives = 91/166 (55%), Gaps = 7/166 (4%) Query 19 VPIAEELLARVLEPYSCKGCRYLIDA--QYSATEDSVLA-YGNFTIGESAYIRSTGHFNA 75 V + + LA+VL PY C+YL A +Y+ T D+V A G F+I ES YI STGHFNA Sbjct 16 VVVDPQTLAQVLAPYK-PHCQYLRTAVVEYAGTADTVAAATGRFSIPESCYIASTGHFNA 74 Query 76 VELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQK 135 VE +C+NQL Y A + + +R GW+ D+ + QLS LI + S F KPL Sbjct 75 VEFNICYNQLTYCLLAKCIEDNALRCFDGWTPADFRRRQLSDFLIARFFSSFHKPLAGLS 134 Query 136 FSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAALN 181 F R+ + V R ++K C F D+ GGAA G +A +N Sbjct 135 FEGRVAVVKIAV-RRDQTFMKTIC--RFDDDRGGAAEGGATIAIVN 177 >gi|302527904|ref|ZP_07280246.1| conserved hypothetical protein [Streptomyces sp. AA4] gi|302436799|gb|EFL08615.1| conserved hypothetical protein [Streptomyces sp. AA4] Length=185 Score = 105 bits (261), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 67/188 (36%), Positives = 97/188 (52%), Gaps = 17/188 (9%) Query 4 TDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSA-----TEDSVLAYGN 58 TD P R++ ELL RVL Y + CRYL A+ + + + + G Sbjct 3 TDEHPAGRLVR----FDTDHELLDRVLRVYR-EHCRYLKFAETTVPHGPESGEGISVAGE 57 Query 59 FTIGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSM 118 F I ES YI TGHFNAVE +C+NQ+AY A AV + V W+IDD+ + QL ++ Sbjct 58 FAIPESCYIDDTGHFNAVEFNICYNQMAYYLIAKAVQDGLHPVFARWTIDDFWRLQLGNI 117 Query 119 LIRKASSRFRKPLNPQKFSARLLCRDLQVIERTWRY-----LKVPCVIEFWDENGGAASG 173 LI S F++ + + F R D+ +++ R+ L+ C EFWDE GG G Sbjct 118 LITDLKSTFKRQMRGRMFEGRTTVVDVVRLDQVERWDPLIVLRTKC--EFWDETGGNCRG 175 Query 174 EIELAALN 181 E+++A N Sbjct 176 EVKIAISN 183 >gi|226973312|gb|ACO94459.1| putative NRPS accessory protein [Streptomyces sp. DSM 21069] Length=187 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 60/174 (35%), Positives = 85/174 (49%), Gaps = 12/174 (6%) Query 20 PIAEELLARVLEPYSCKGCRYLIDAQYSATED-----SVLAYGNFTIGESAYIRSTGHFN 74 P +ELL RVL PY K C YL A ++ D ++A F I ES YI TGHFN Sbjct 14 PTDDELLHRVLTPYRAKRCEYLTSATVTSEGDPRDGGRLIATCTFEIPESCYIDDTGHFN 73 Query 75 AVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQ 134 +VE +CFNQ+AY A +V + GW+I+ + QL+ + I S FR + + Sbjct 74 SVEFNICFNQMAYYLLAMSVRESLVEPFSGWTIEQFWTRQLADVFITDFKSSFRSAMQGR 133 Query 135 KFSARLLCRDL-----QVIERTWRYLKVPCVIEFWDENGGAASGEIELAALNIP 183 +F+ + D+ + L+ C + DE GG + GEI A N P Sbjct 134 RFTGEIEIIDIAEWDANDLRDALVILRTKC--HYADEQGGESHGEITAAVTNPP 185 >gi|302547444|ref|ZP_07299786.1| conserved hypothetical protein [Streptomyces hygroscopicus ATCC 53653] gi|302465062|gb|EFL28155.1| conserved hypothetical protein [Streptomyces himastatinicus ATCC 53653] Length=187 Score = 101 bits (251), Expect = 5e-20, Method: Compositional matrix adjust. Identities = 62/174 (36%), Positives = 84/174 (49%), Gaps = 12/174 (6%) Query 20 PIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVL-----AYGNFTIGESAYIRSTGHFN 74 P EELL RVL PY K C YL A + D + A F I ES YI TGHFN Sbjct 14 PTDEELLHRVLNPYRAKRCEYLTSAAVTVAGDPLDGGRLGATCTFLIPESCYIDDTGHFN 73 Query 75 AVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQ 134 +VE +CFNQ+AY A +V + W+++ + QL+ + I S FR ++ + Sbjct 74 SVEFNICFNQMAYYLMAKSVKESLVEPFSRWTLEQFWTRQLADVFITDFKSTFRTAMSGR 133 Query 135 KFSARLLCRDLQVIERTWRYLKVPCVI-----EFWDENGGAASGEIELAALNIP 183 F + D+ + T L+ VI +WDE GG + GEI A N P Sbjct 134 HFRGEIEIADIAEWDAT--DLRDALVILRSRCRYWDEQGGESHGEIAAAITNPP 185 >gi|226973354|gb|ACO94487.1| putative NRPS accessory protein [Streptomyces sp. MP39-85] Length=187 Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 61/171 (36%), Positives = 84/171 (50%), Gaps = 12/171 (7%) Query 23 EELLARVLEPYSCKGCRYLIDAQYSATED-----SVLAYGNFTIGESAYIRSTGHFNAVE 77 EELL RVL PY K C YL A ++ D + A F I ES YI TGHFN+VE Sbjct 17 EELLHRVLTPYRAKRCEYLTSATVTSEGDPRDGGRLSAACTFEIPESCYIDDTGHFNSVE 76 Query 78 LILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKFS 137 +CFNQ+AY A +V + W++D + QL+ + I S FRK + ++F Sbjct 77 FNICFNQMAYYLMAKSVKESLVEPFSRWTLDQFWTRQLADVFITDFKSTFRKAMRGRRFK 136 Query 138 ARLLCRDLQVIERTWRYLKVPCVI-----EFWDENGGAASGEIELAALNIP 183 + D + E L+ VI + DE+GG + GE+ A N P Sbjct 137 GEIEIVD--IAEWDANDLREALVILRTRCHYADEHGGESHGEVTAAVTNPP 185 >gi|261855558|ref|YP_003262841.1| hypothetical protein Hneap_0956 [Halothiobacillus neapolitanus c2] gi|261836027|gb|ACX95794.1| conserved hypothetical protein [Halothiobacillus neapolitanus c2] Length=179 Score = 99.0 bits (245), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 55/161 (35%), Positives = 83/161 (52%), Gaps = 5/161 (3%) Query 25 LLARVLEPYSCKGCRYLIDAQY----SATEDSVLAYGNFTIGESAYIRSTGHFNAVELIL 80 L RVL Y RYL A + + ++++G F++ ES YI STGHFN+VE + Sbjct 19 LRDRVLSVY-FDNTRYLKSANIFLDDNGAKSRIVSHGKFSVPESCYIESTGHFNSVEFNI 77 Query 81 CFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKFSARL 140 C+NQ+ Y A A A E + W + + QL +LI +SRF KP+N FS Sbjct 78 CYNQIMYDAIAHATKYELVPQFCAWGTEGFFSRQLPDILITSFNSRFTKPVNSVDFSGEF 137 Query 141 LCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAALN 181 +++ + +L + F+D+ G A GE++LA LN Sbjct 138 TIHTMRLHNKGQPFLFIKTGCRFFDDEQGEAVGEVDLAILN 178 >gi|262199175|ref|YP_003270384.1| hypothetical protein Hoch_6016 [Haliangium ochraceum DSM 14365] gi|262082522|gb|ACY18491.1| hypothetical protein Hoch_6016 [Haliangium ochraceum DSM 14365] Length=205 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 60/171 (36%), Positives = 85/171 (50%), Gaps = 26/171 (15%) Query 19 VPIAEELLARVLEPYSCKGCRYL--IDAQYS-ATEDSVL-----AYGNFTIGESAYIRST 70 V + E L +L PY C+YL +D Y + + VL A G F+IGES YI T Sbjct 20 VTVDERFLEHILTPYMTP-CKYLKSVDISYDPSASNGVLPAKLRARGRFSIGESCYIEDT 78 Query 71 GHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKP 130 GHFNAVE I+CFNQ+ Y A A+ +R + W +D+ + QL +LI A S FR+ Sbjct 79 GHFNAVEFIMCFNQIGYVLLAHALQAGLVRGVPAWGREDFAERQLPDVLIVNAKSTFRRM 138 Query 131 LNPQKFSARLLCRDLQVIERTWR-------YLKVPCVIEFWDENGGAASGE 174 ++ + F V E W +L + F+D GG ++GE Sbjct 139 IDARDF----------VGEAVWTRASARGDHLFGSFDVSFYDAAGGRSTGE 179 >gi|124007605|ref|ZP_01692309.1| conserved hypothetical protein [Microscilla marina ATCC 23134] gi|123986903|gb|EAY26668.1| conserved hypothetical protein [Microscilla marina ATCC 23134] Length=179 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 57/165 (35%), Positives = 85/165 (52%), Gaps = 8/165 (4%) Query 21 IAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLA----YGNFTIGESAYIRSTGHFNAV 76 I + + +V++PY YL A++ LA G F I ES YI TGHFNAV Sbjct 18 IEDGFVEKVMKPYRAN-TAYLQSAEFVQQPGQGLAGLTMEGVFNIPESCYIDDTGHFNAV 76 Query 77 ELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKF 136 E +C+NQL Y + N I L + D + + QLS++LI K +S ++KP+NP+KF Sbjct 77 EYNICYNQLGYVFLGHCIKNGLIPDLASYDEDTFFEKQLSNILIVKIASNYKKPINPKKF 136 Query 137 SARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGEIELAALN 181 + + +L C F+D+N G + GE+ LA L+ Sbjct 137 YGTYGVKSTTYKSKCI-FLSTYC--NFYDDNEGKSQGEVMLAILH 178 >gi|115374318|ref|ZP_01461602.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1] gi|310818960|ref|YP_003951318.1| hypothetical protein STAUR_1687 [Stigmatella aurantiaca DW4/3-1] gi|115368628|gb|EAU67579.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1] gi|309392032|gb|ADO69491.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length=190 Score = 91.7 bits (226), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 60/160 (38%), Positives = 84/160 (53%), Gaps = 15/160 (9%) Query 25 LLARVLEPYSCKGCRYLIDAQYS----------ATEDSVLAYGNFTIGESAYIRSTGHFN 74 LL +VL PY CRYL+ A + + A G F+I ES YI TGHFN Sbjct 18 LLQQVLIPYKPH-CRYLLRAHLEHEGASTPEGRSRGNGAAAQGEFSIPESCYIDDTGHFN 76 Query 75 AVELILCFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQ 134 AVE +C+NQLAY A + + + L WSI++Y + QLS LI + SS FRK + Sbjct 77 AVEFNICYNQLAYFLLA-SCAHHRLEPLAAWSIEEYRRRQLSDFLIVQFSSSFRKQMKAA 135 Query 135 KFSARLLCRDLQVIERTWRYLKVPCVIEFWDENGGAASGE 174 F+ R+ R + ++ ++K C F D +GG + GE Sbjct 136 HFTGRVELRKVMGRGKSL-FIKTYC--RFEDGHGGLSEGE 172 >gi|274321839|ref|NP_001162084.1| UL16-binding protein 17 [Bos taurus] gi|114540273|gb|ABI75236.1| ULBP17 [Bos taurus] Length=315 Score = 35.8 bits (81), Expect = 2.7, Method: Compositional matrix adjust. Identities = 22/88 (25%), Positives = 41/88 (47%), Gaps = 16/88 (18%) Query 92 PAVLNEEIRVLRGW--------SIDDYCQHQLSSMLIRKASSRFRKPLNPQKFSARLLCR 143 P+VL EE++ ++ W I D+ + + + K P +P+ AR+ CR Sbjct 75 PSVLGEEVKTIKAWETQIETLRDIRDWIKDHMHDFTMEK-----HMPRDPRTLQARMTCR 129 Query 144 --DLQVIERTWRY-LKVPCVIEFWDENG 168 D + + +W++ L ++ F ENG Sbjct 130 CEDDRHVSGSWKFGLNGVMILHFDSENG 157 >gi|312216588|emb|CBX96538.1| similar to DEAD/DEAH box helicase [Leptosphaeria maculans] Length=1386 Score = 35.4 bits (80), Expect = 4.0, Method: Compositional matrix adjust. Identities = 22/90 (25%), Positives = 39/90 (44%), Gaps = 0/90 (0%) Query 41 LIDAQYSATEDSVLAYGNFTIGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIR 100 L +Q ++ + +AY N + +IR FNA L++ Q+ ++ +E Sbjct 1271 LFSSQTFSSTATFIAYFNKMMTSKVFIRDITPFNAYSLLMFGGQIQVDTLGRGLIVDEWI 1330 Query 101 VLRGWSIDDYCQHQLSSMLIRKASSRFRKP 130 LRGW+ +L ML R ++P Sbjct 1331 RLRGWARIGVLVSRLRGMLDRVLDGMVKEP 1360 >gi|281337283|gb|EFB12867.1| hypothetical protein PANDA_022448 [Ailuropoda melanoleuca] Length=393 Score = 35.0 bits (79), Expect = 5.4, Method: Compositional matrix adjust. Identities = 15/49 (31%), Positives = 30/49 (62%), Gaps = 0/49 (0%) Query 58 NFTIGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWS 106 ++T+ S ++ +T VE ++ FN + ++ + PA+L+ IRVL W+ Sbjct 152 HYTVSHSLFLLNTNITRIVENVVIFNNILHTIYKPALLDVNIRVLCIWN 200 >gi|301792859|ref|XP_002931396.1| PREDICTED: disintegrin and metalloproteinase domain-containing protein 20-like, partial [Ailuropoda melanoleuca] Length=560 Score = 34.7 bits (78), Expect = 6.5, Method: Compositional matrix adjust. Identities = 15/49 (31%), Positives = 30/49 (62%), Gaps = 0/49 (0%) Query 58 NFTIGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWS 106 ++T+ S ++ +T VE ++ FN + ++ + PA+L+ IRVL W+ Sbjct 211 HYTVSHSLFLLNTNITRIVENVVIFNNILHTIYKPALLDVNIRVLCIWN 259 >gi|291410863|ref|XP_002721720.1| PREDICTED: ADAM metallopeptidase domain 21 preproprotein-like [Oryctolagus cuniculus] Length=747 Score = 34.3 bits (77), Expect = 8.9, Method: Compositional matrix adjust. Identities = 17/51 (34%), Positives = 26/51 (51%), Gaps = 0/51 (0%) Query 58 NFTIGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIRVLRGWSID 108 ++T+ S + +E + N + +S + PA LN IRVL WSID Sbjct 211 HYTVSSSLRNMNKNETQIIENVFIMNNILHSIYRPASLNAAIRVLCIWSID 261 Lambda K H 0.322 0.136 0.416 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 167689013960 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40