BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv2784c Length=171 Score E Sequences producing significant alignments: (Bits) Value gi|15609921|ref|NP_217300.1| lipoprotein LppU [Mycobacterium tub... 351 2e-95 gi|340627786|ref|YP_004746238.1| putative lipoprotein LPPU [Myco... 347 3e-94 gi|289758911|ref|ZP_06518289.1| lipoprotein lppU [Mycobacterium ... 277 5e-73 gi|240171315|ref|ZP_04749974.1| lipoprotein LppU [Mycobacterium ... 257 4e-67 gi|118617703|ref|YP_906035.1| lipoprotein LppU [Mycobacterium ul... 242 2e-62 gi|118473707|ref|YP_886992.1| LppU protein [Mycobacterium smegma... 182 2e-44 gi|169630189|ref|YP_001703838.1| lipoprotein LppU [Mycobacterium... 142 1e-32 gi|312138653|ref|YP_004005989.1| lipoprotein [Rhodococcus equi 1... 111 4e-23 gi|325676278|ref|ZP_08155957.1| hypothetical protein HMPREF0724_... 111 4e-23 gi|226364577|ref|YP_002782359.1| hypothetical protein ROP_51670 ... 104 4e-21 gi|111022073|ref|YP_705045.1| hypothetical protein RHA1_ro05106 ... 101 4e-20 gi|54027610|ref|YP_121852.1| hypothetical protein nfa56360 [Noca... 99.4 2e-19 gi|226308464|ref|YP_002768424.1| hypothetical protein RER_49770 ... 96.3 1e-18 gi|229489096|ref|ZP_04382962.1| conserved hypothetical protein [... 95.5 3e-18 gi|169627879|ref|YP_001701528.1| hypothetical protein MAB_0778 [... 65.1 3e-09 gi|169630064|ref|YP_001703713.1| putative liporotein LppU [Mycob... 62.8 2e-08 gi|256377885|ref|YP_003101545.1| hypothetical protein Amir_3820 ... 42.4 0.024 gi|134097160|ref|YP_001102821.1| hypothetical protein SACE_0549 ... 38.9 0.30 gi|310795722|gb|EFQ31183.1| glycolipid anchored surface protein ... 37.7 0.53 gi|300789923|ref|YP_003770214.1| hypothetical protein AMED_8109 ... 37.4 0.81 gi|311899454|dbj|BAJ31862.1| hypothetical protein KSE_60960 [Kit... 37.0 1.1 gi|54022601|ref|YP_116843.1| hypothetical protein nfa6340 [Nocar... 36.6 1.2 gi|223937295|ref|ZP_03629201.1| Mammalian cell entry related dom... 36.6 1.5 gi|324500344|gb|ADY40164.1| Plexin-2 [Ascaris suum] 34.7 5.4 gi|256374607|ref|YP_003098267.1| hypothetical protein Amir_0454 ... 33.9 8.1 >gi|15609921|ref|NP_217300.1| lipoprotein LppU [Mycobacterium tuberculosis H37Rv] gi|15842322|ref|NP_337359.1| hypothetical protein MT2854 [Mycobacterium tuberculosis CDC1551] gi|31793960|ref|NP_856453.1| lipoprotein LppU [Mycobacterium bovis AF2122/97] 76 more sequence titlesLength=171 Score = 351 bits (901), Expect = 2e-95, Method: Compositional matrix adjust. Identities = 171/171 (100%), Positives = 171/171 (100%), Gaps = 0/171 (0%) Query 1 MRAWLAAATTALFVVATGCSSATNVAELKVGDCVKLAGTPDRPQATKAECGSPASNFKVV 60 MRAWLAAATTALFVVATGCSSATNVAELKVGDCVKLAGTPDRPQATKAECGSPASNFKVV Sbjct 1 MRAWLAAATTALFVVATGCSSATNVAELKVGDCVKLAGTPDRPQATKAECGSPASNFKVV 60 Query 61 AVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDD 120 AVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDD Sbjct 61 AVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDD 120 Query 121 ASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRFAVCVEDVTGGPRS 171 ASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRFAVCVEDVTGGPRS Sbjct 121 ASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRFAVCVEDVTGGPRS 171 >gi|340627786|ref|YP_004746238.1| putative lipoprotein LPPU [Mycobacterium canettii CIPT 140010059] gi|340005976|emb|CCC45143.1| putative lipoprotein LPPU [Mycobacterium canettii CIPT 140010059] Length=171 Score = 347 bits (891), Expect = 3e-94, Method: Compositional matrix adjust. Identities = 168/171 (99%), Positives = 170/171 (99%), Gaps = 0/171 (0%) Query 1 MRAWLAAATTALFVVATGCSSATNVAELKVGDCVKLAGTPDRPQATKAECGSPASNFKVV 60 MRAWLAAATTALFVVATGCS+ATNVAELKVGDCVKLAGTPDRPQATKAECGSPASNFKVV Sbjct 1 MRAWLAAATTALFVVATGCSAATNVAELKVGDCVKLAGTPDRPQATKAECGSPASNFKVV 60 Query 61 AVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDD 120 AVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDD Sbjct 61 AVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDD 120 Query 121 ASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRFAVCVEDVTGGPRS 171 ASVPHRQRATQILKDLD+PVSVDQCASGVGYVYTQRRF VCVEDVTGGPRS Sbjct 121 ASVPHRQRATQILKDLDAPVSVDQCASGVGYVYTQRRFVVCVEDVTGGPRS 171 >gi|289758911|ref|ZP_06518289.1| lipoprotein lppU [Mycobacterium tuberculosis T85] gi|289714475|gb|EFD78487.1| lipoprotein lppU [Mycobacterium tuberculosis T85] Length=133 Score = 277 bits (708), Expect = 5e-73, Method: Compositional matrix adjust. Identities = 133/133 (100%), Positives = 133/133 (100%), Gaps = 0/133 (0%) Query 39 TPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWV 98 TPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWV Sbjct 1 TPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWV 60 Query 99 IGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRF 158 IGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRF Sbjct 61 IGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRF 120 Query 159 AVCVEDVTGGPRS 171 AVCVEDVTGGPRS Sbjct 121 AVCVEDVTGGPRS 133 >gi|240171315|ref|ZP_04749974.1| lipoprotein LppU [Mycobacterium kansasii ATCC 12478] Length=172 Score = 257 bits (657), Expect = 4e-67, Method: Compositional matrix adjust. Identities = 122/173 (71%), Positives = 144/173 (84%), Gaps = 3/173 (1%) Query 1 MRAWLAAATTALFVVATGCSSATNVAELKVGDCVKLAGTPDRPQATKAECGSPASNFKVV 60 MRA LA+ A + GCSSATN+ +LKVGDC+KL GTPDRPQATKA CGSP SNFKVV Sbjct 1 MRA-LASVVVAASGIVLGCSSATNLVDLKVGDCLKLGGTPDRPQATKAACGSPDSNFKVV 59 Query 61 AVVQE--DHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTDPFRVDC 118 AVV+ + +CPADVDS+YSM N+ +G +T+CLD+DWV+GGCMSVDP H T+PFRVDC Sbjct 60 AVVKPGGERTQCPADVDSSYSMHNSLSGENSTLCLDVDWVVGGCMSVDPAHKTEPFRVDC 119 Query 119 DDASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRFAVCVEDVTGGPRS 171 +DAS PHRQRATQIL++L+ PV+ DQCASGVGY YTQRRFAVCVEDVT GPR+ Sbjct 120 NDASAPHRQRATQILENLEPPVTADQCASGVGYTYTQRRFAVCVEDVTNGPRT 172 >gi|118617703|ref|YP_906035.1| lipoprotein LppU [Mycobacterium ulcerans Agy99] gi|183981936|ref|YP_001850227.1| lipoprotein LppU [Mycobacterium marinum M] gi|118569813|gb|ABL04564.1| lipoprotein LppU [Mycobacterium ulcerans Agy99] gi|183175262|gb|ACC40372.1| lipoprotein LppU [Mycobacterium marinum M] Length=177 Score = 242 bits (617), Expect = 2e-62, Method: Compositional matrix adjust. Identities = 119/177 (68%), Positives = 136/177 (77%), Gaps = 6/177 (3%) Query 1 MRAWLAAATTALFV----VATGCSSATNVAELKVGDCVKLAGTPDRPQATKAECGSPASN 56 MRA A AL V V GCSS T A+L VGDC+KLAG PDRPQATKA CGS SN Sbjct 1 MRALFLAVLMALAVPASGVLVGCSSTTKAADLAVGDCLKLAGPPDRPQATKAACGSEDSN 60 Query 57 FKVVAVVQE--DHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTDPF 114 FKVVAV ++ DH ECPADVDS+YS RN G+ +T+CLD+DWV+G CMSVDP H TDPF Sbjct 61 FKVVAVAKDGTDHTECPADVDSSYSSRNVLGGANSTLCLDVDWVLGSCMSVDPDHKTDPF 120 Query 115 RVDCDDASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRFAVCVEDVTGGPRS 171 RV C+DAS PHRQRATQIL+D+ SPV+VDQCASGVGY YT+RRF VCVEDV G ++ Sbjct 121 RVGCNDASAPHRQRATQILQDVASPVTVDQCASGVGYTYTERRFVVCVEDVGGSSQT 177 >gi|118473707|ref|YP_886992.1| LppU protein [Mycobacterium smegmatis str. MC2 155] gi|118174994|gb|ABK75890.1| LppU protein [Mycobacterium smegmatis str. MC2 155] Length=135 Score = 182 bits (461), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 81/138 (59%), Positives = 109/138 (79%), Gaps = 4/138 (2%) Query 28 LKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNGS 87 ++ GDC++L GT D+P+AT+AECGS SN+KVV V D A CPADVDS Y++ + F G Sbjct 1 MQAGDCLELGGTFDQPEATRAECGSKKSNYKVVQTVA-DSARCPADVDSYYTLSSRFGGE 59 Query 88 TNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCAS 147 T+T+C+DIDWV+GGCM+VDP +NTDP+RVDC DA PHRQR T++L+ + +P DQCA+ Sbjct 60 THTVCMDIDWVVGGCMNVDPENNTDPYRVDCSDAGAPHRQRVTEVLEGISNP---DQCAT 116 Query 148 GVGYVYTQRRFAVCVEDV 165 G+GY Y +R+F VCVE+V Sbjct 117 GLGYAYDERQFTVCVENV 134 >gi|169630189|ref|YP_001703838.1| lipoprotein LppU [Mycobacterium abscessus ATCC 19977] gi|169242156|emb|CAM63184.1| Possible lipoprotein LppU [Mycobacterium abscessus] Length=168 Score = 142 bits (359), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 74/165 (45%), Positives = 98/165 (60%), Gaps = 8/165 (4%) Query 5 LAAATTALFVVA----TGCSSATNVAELKVGDCVKLAGTPDRPQATKAECGSPASNFKVV 60 L AA+ L V +GCS+A L VGDCV L+G+ R + K CGSP SNFKV Sbjct 7 LGAASIVLLTVGVALLSGCSAAAASDGLAVGDCVNLSGSDQRAKMVKEPCGSPTSNFKVF 66 Query 61 AVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDD 120 A D A+CP D DS+Y + F + +CLDIDWV+GGCM V + DP RVDC+D Sbjct 67 AKAATD-ADCPRDADSSYYAKRGFGRKSQALCLDIDWVVGGCMDVPDKWDGDPVRVDCND 125 Query 121 ASVPHRQRATQILKDLDSPVSVDQCASGVGYVYTQRRFAVCVEDV 165 +++R TQIL+ + + D C +G+GY Y R F VCVE++ Sbjct 126 PRAQNKKRVTQILQQVS---TADDCITGLGYPYVDRNFTVCVEEL 167 >gi|312138653|ref|YP_004005989.1| lipoprotein [Rhodococcus equi 103S] gi|311887992|emb|CBH47304.1| putative lipoprotein [Rhodococcus equi 103S] Length=205 Score = 111 bits (277), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 58/139 (42%), Positives = 86/139 (62%), Gaps = 6/139 (4%) Query 27 ELKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNG 86 ++++GDCV+L GT D +A CGS SN+KVV + +A+C +DVD Y + Sbjct 70 DVEIGDCVRLGGTADAATIDEAVCGSDKSNYKVVGKAAK-NAQCASDVDQVY-YETRWGN 127 Query 87 STNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCA 146 +CLDIDWV+GGCMS+ + +P RV+CDD P +RA ++++ + V V+QC+ Sbjct 128 ERGALCLDIDWVMGGCMSLPDGDDDEPQRVECDDPYAPGIERAIEVIEGV---VDVEQCS 184 Query 147 SGVGYVYTQRRFAVCVEDV 165 G GYV+ +R F VC E V Sbjct 185 EG-GYVHDEREFTVCTETV 202 >gi|325676278|ref|ZP_08155957.1| hypothetical protein HMPREF0724_13740 [Rhodococcus equi ATCC 33707] gi|325552839|gb|EGD22522.1| hypothetical protein HMPREF0724_13740 [Rhodococcus equi ATCC 33707] Length=204 Score = 111 bits (277), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 58/139 (42%), Positives = 86/139 (62%), Gaps = 6/139 (4%) Query 27 ELKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNG 86 ++++GDCV+L GT D +A CGS SN+KVV + +A+C +DVD Y + Sbjct 69 DVEIGDCVRLGGTADAATIDEAVCGSDKSNYKVVGKAAK-NAQCASDVDQVY-YETRWGN 126 Query 87 STNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCA 146 +CLDIDWV+GGCMS+ + +P RV+CDD P +RA ++++ + V V+QC+ Sbjct 127 ERGALCLDIDWVMGGCMSLPDGDDDEPQRVECDDPYAPGIERAIEVIEGV---VDVEQCS 183 Query 147 SGVGYVYTQRRFAVCVEDV 165 G GYV+ +R F VC E V Sbjct 184 EG-GYVHDEREFTVCTETV 201 >gi|226364577|ref|YP_002782359.1| hypothetical protein ROP_51670 [Rhodococcus opacus B4] gi|226243066|dbj|BAH53414.1| hypothetical protein [Rhodococcus opacus B4] Length=198 Score = 104 bits (260), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 58/141 (42%), Positives = 85/141 (61%), Gaps = 11/141 (7%) Query 27 ELKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNG 86 E+ +G+CVKL GT + KA CGSP SN+KV+A + +++C +D DS Y G Sbjct 67 EVAIGECVKLGGTVTDAEIDKAVCGSPDSNYKVIAKAAK-NSQCISDADSYY--YETLGG 123 Query 87 -STNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQC 145 ICLD+DWVIGGCM V DP R++C+D + + T+I++ SVD C Sbjct 124 IEQGAICLDVDWVIGGCMDV---GGEDPARIECNDTTAVDGVKVTEIVQ---GAASVDSC 177 Query 146 A-SGVGYVYTQRRFAVCVEDV 165 + S GY Y++R+F VCV+++ Sbjct 178 STSSNGYEYSERKFVVCVDEL 198 >gi|111022073|ref|YP_705045.1| hypothetical protein RHA1_ro05106 [Rhodococcus jostii RHA1] gi|110821603|gb|ABG96887.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length=230 Score = 101 bits (252), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 58/141 (42%), Positives = 82/141 (59%), Gaps = 11/141 (7%) Query 27 ELKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNG 86 E+ +G+CVKL GT + KA CGS SN+KV+A + +++C +D DS Y G Sbjct 99 EVAIGECVKLGGTVSDAEIDKAVCGSADSNYKVIAKAAK-NSQCISDADSYY--YETLGG 155 Query 87 -STNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQC 145 ICLD+DWVIGGCM V DP R+DC D + + T+I++ SVD C Sbjct 156 IEQGAICLDVDWVIGGCMDV---GGEDPARIDCGDTTAVDGVKVTEIVQ---GATSVDSC 209 Query 146 A-SGVGYVYTQRRFAVCVEDV 165 + S GY Y +R+F VCV+++ Sbjct 210 STSSNGYEYPERKFVVCVDEL 230 >gi|54027610|ref|YP_121852.1| hypothetical protein nfa56360 [Nocardia farcinica IFM 10152] gi|54019118|dbj|BAD60488.1| hypothetical protein [Nocardia farcinica IFM 10152] Length=201 Score = 99.4 bits (246), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 57/140 (41%), Positives = 76/140 (55%), Gaps = 11/140 (7%) Query 27 ELKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNG 86 E +GDCV L GT KA CGS ASN+K++A + CP+D D+ Y+ NG Sbjct 72 EAGIGDCVTLGGTTMNATIEKASCGSRASNYKIIAKTAT-SSSCPSDRDNYYA--ETLNG 128 Query 87 -STNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQC 145 CLDIDWV+GGCM V DP R+DC + + R T+I + C Sbjct 129 IEQGAYCLDIDWVVGGCMDV---GGDDPKRIDCTERGL-QGVRVTEI---AEGASDAGAC 181 Query 146 ASGVGYVYTQRRFAVCVEDV 165 SG+G+ Y +RRF VCVE++ Sbjct 182 GSGLGFEYPERRFVVCVEEL 201 >gi|226308464|ref|YP_002768424.1| hypothetical protein RER_49770 [Rhodococcus erythropolis PR4] gi|226187581|dbj|BAH35685.1| hypothetical protein RER_49770 [Rhodococcus erythropolis PR4] Length=205 Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 54/135 (40%), Positives = 79/135 (59%), Gaps = 8/135 (5%) Query 30 VGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNGSTN 89 +GDCVKL GT + A+CGS SN+KVVA V +C +DVDS Y A + Sbjct 74 IGDCVKLGGTTTAAEIDNADCGSKDSNYKVVAKVPTSD-QCASDVDSYYYETLAGD-EQG 131 Query 90 TICLDIDWVIGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCASGV 149 +CLD+DWV+GGCM + + +P R++C D S + +IL++ S+D+C SG Sbjct 132 AVCLDVDWVVGGCMDLGSGMD-EPARIECSDTSGTNVVEVVEILQN---STSIDECGSGA 187 Query 150 --GYVYTQRRFAVCV 162 G+ + +R+F VCV Sbjct 188 DSGFEHPERKFTVCV 202 >gi|229489096|ref|ZP_04382962.1| conserved hypothetical protein [Rhodococcus erythropolis SK121] gi|229324600|gb|EEN90355.1| conserved hypothetical protein [Rhodococcus erythropolis SK121] Length=194 Score = 95.5 bits (236), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 54/135 (40%), Positives = 78/135 (58%), Gaps = 8/135 (5%) Query 30 VGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNGSTN 89 +GDCVKL GT + A+CGS SN+KVVA V C +DVDS Y A + Sbjct 63 IGDCVKLGGTTTAAEIDNADCGSKDSNYKVVAKVPTSDL-CASDVDSYYYETLAGD-EQG 120 Query 90 TICLDIDWVIGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCASGV 149 +CLD+DWV+GGCM + + +P R++C D S + +IL++ S+D+C SG Sbjct 121 AVCLDVDWVVGGCMDLGSGMD-EPARIECSDTSGTNVVEVVEILQN---STSIDECGSGA 176 Query 150 --GYVYTQRRFAVCV 162 G+ + +R+F VCV Sbjct 177 DSGFEHPERKFTVCV 191 >gi|169627879|ref|YP_001701528.1| hypothetical protein MAB_0778 [Mycobacterium abscessus ATCC 19977] gi|169239846|emb|CAM60874.1| Conserved hypothetical protein (lipoprotein LppU?) [Mycobacterium abscessus] Length=193 Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 46/140 (33%), Positives = 66/140 (48%), Gaps = 16/140 (11%) Query 27 ELKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNG 86 E +G CV L G P K +C S +N++V+ V +C D D R + G Sbjct 64 EAPIGACVYLKGKPGSVTLNKVDCDSQDANYRVIQRVGFPD-QCVNDAD-----RRFYLG 117 Query 87 STN---TICLDIDWVIGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVD 143 S T C+D W GC+SV P R +CDD ++P+R+R IL + + Sbjct 118 SPQGEWTACMDYAWTSEGCISVAPDKVV---RAECDDKNLPNRERPITILFNT---IDTS 171 Query 144 QCASGVGYVYTQRRFAVCVE 163 +C G G+ + RRF VC E Sbjct 172 RCLFG-GFAHPVRRFTVCTE 190 >gi|169630064|ref|YP_001703713.1| putative liporotein LppU [Mycobacterium abscessus ATCC 19977] gi|169242031|emb|CAM63059.1| Putative liporotein LppU [Mycobacterium abscessus] Length=187 Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 48/147 (33%), Positives = 69/147 (47%), Gaps = 13/147 (8%) Query 18 GCSSATNVAELKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDST 77 G +A AE VG CV L G T +CGS + +++V V EC D D + Sbjct 50 GSLTANGQAEAPVGGCVNLGGELVNASLTVVDCGSDRNTYRIVQRVNIPQ-EC-GDTDRS 107 Query 78 YSMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTDPF-RVDCDDASVPHRQRATQILKDL 136 Y + G T CLD+ W C+S+ P +V C D + P R + +I+ D Sbjct 108 YYHNSEATGQY-TACLDLAWAKDSCISLG-----QPVAKVVCTDTNAPKRIKPLKIILDT 161 Query 137 DSPVSVDQCASGVGYVYTQRRFAVCVE 163 +++ C SG GY + QR+F VC E Sbjct 162 ---TTLEGCPSG-GYKHPQRKFTVCTE 184 >gi|256377885|ref|YP_003101545.1| hypothetical protein Amir_3820 [Actinosynnema mirum DSM 43827] gi|255922188|gb|ACU37699.1| hypothetical protein Amir_3820 [Actinosynnema mirum DSM 43827] Length=287 Score = 42.4 bits (98), Expect = 0.024, Method: Compositional matrix adjust. Identities = 33/118 (28%), Positives = 50/118 (43%), Gaps = 16/118 (13%) Query 26 AELKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAV-------VQEDHAECPADVDSTY 78 A +K G CV T+ C +P +N+ V V + + H + + DS Sbjct 147 ATIKAGMCVHATEAGGELSMTERGCEAPDANYTVGKVSTIKCDILDQSHYQVLTETDS-- 204 Query 79 SMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTD--PFRVDCDDASVPHRQRATQILK 134 F T CL + V+G C + DP + + P V C D SVP R + T ++K Sbjct 205 -----FTKFTTNFCLVPNLVVGKCYNADPDVSVEAYPTPVSCTDTSVPERVKLTSVVK 257 >gi|134097160|ref|YP_001102821.1| hypothetical protein SACE_0549 [Saccharopolyspora erythraea NRRL 2338] gi|291005383|ref|ZP_06563356.1| hypothetical protein SeryN2_12757 [Saccharopolyspora erythraea NRRL 2338] gi|133909783|emb|CAL99895.1| hypothetical protein SACE_0549 [Saccharopolyspora erythraea NRRL 2338] Length=245 Score = 38.9 bits (89), Expect = 0.30, Method: Compositional matrix adjust. Identities = 26/78 (34%), Positives = 42/78 (54%), Gaps = 10/78 (12%) Query 28 LKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQ-EDHAECPADVDSTYSMRNAFNG 86 ++VGDCV L+ T+ CGS S++++V V ED CP + +TYS Sbjct 116 IEVGDCVALS-VQSGGSLTEEPCGSADSDYEIVEVKSGEDKNGCPDNYSNTYS------- 167 Query 87 STNTICLDIDWVIGGCMS 104 +T C+ +D V+G C++ Sbjct 168 -GDTYCMVLDVVVGDCLT 184 >gi|310795722|gb|EFQ31183.1| glycolipid anchored surface protein [Glomerella graminicola M1.001] Length=479 Score = 37.7 bits (86), Expect = 0.53, Method: Compositional matrix adjust. Identities = 25/58 (44%), Positives = 32/58 (56%), Gaps = 6/58 (10%) Query 95 IDWVIGGCMSVDPTHNTDPFRVDCD----DASVPHRQRATQI-LKDLDSPVSVDQCAS 147 +D+ IGG DPTHN DP D D DA+V R A I + +LD ++ D CAS Sbjct 41 VDYQIGGSAGYDPTHNRDPLS-DGDVCLRDAAVLQRLGANAIRVYNLDPNLNHDACAS 97 >gi|300789923|ref|YP_003770214.1| hypothetical protein AMED_8109 [Amycolatopsis mediterranei U32] gi|299799437|gb|ADJ49812.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] gi|340531594|gb|AEK46799.1| hypothetical protein RAM_41660 [Amycolatopsis mediterranei S699] Length=236 Score = 37.4 bits (85), Expect = 0.81, Method: Compositional matrix adjust. Identities = 24/96 (25%), Positives = 39/96 (41%), Gaps = 1/96 (1%) Query 26 AELKVGDCVKLAG-TPDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAF 84 A GDC+ + T KA+C P +N K+ + CP D+ Y + Sbjct 97 ATANAGDCLTITEFTQGGDDPAKADCNDPKANVKIAKKLDTSSENCPGGSDAGYDTYSVS 156 Query 85 NGSTNTICLDIDWVIGGCMSVDPTHNTDPFRVDCDD 120 S+ +CL I+ G C++ + +V C D Sbjct 157 GRSSYKLCLMINAKQGDCLANFTSQTKGYLKVPCTD 192 >gi|311899454|dbj|BAJ31862.1| hypothetical protein KSE_60960 [Kitasatospora setae KM-6054] Length=138 Score = 37.0 bits (84), Expect = 1.1, Method: Compositional matrix adjust. Identities = 32/91 (36%), Positives = 42/91 (47%), Gaps = 9/91 (9%) Query 7 AATTALFVVATGCSSATNVAELKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQ-E 65 A ALFV A S A K GDC AG+ ++P + +CGS + F V+ VV Sbjct 49 AIVAALFVAAY--FSRDTPAAAKAGDCAHNAGSEEKPDVSLVDCGSADAEFTVLKVVHGA 106 Query 66 DHAEC---PADVDSTYSMRNAFNGSTNTICL 93 D EC PA V + R + S +CL Sbjct 107 DEKECETEPALVATYVETRRS---SVLVLCL 134 >gi|54022601|ref|YP_116843.1| hypothetical protein nfa6340 [Nocardia farcinica IFM 10152] gi|54014109|dbj|BAD55479.1| hypothetical protein [Nocardia farcinica IFM 10152] Length=185 Score = 36.6 bits (83), Expect = 1.2, Method: Compositional matrix adjust. Identities = 38/145 (27%), Positives = 69/145 (48%), Gaps = 21/145 (14%) Query 17 TGCS-----SATNVAELKVGDCVKLAGTPDRPQATKAE---CGSPASNFKVVAVVQEDHA 68 +GCS + ++VA+ KVGDC+ + T + P AT+ E C SP + +KV E Sbjct 37 SGCSVIDDATKSDVAKTKVGDCINI--TDNSPTATEGEPIDCSSPKAVYKVHQTFDE-AT 93 Query 69 ECPADVDSTYSMRNAFNGSTNTICLDIDWVIGGCMSVDPTHNTDPFR-VDCDDASVPHRQ 127 +C ++ ++Y+ + +G T +CL ++ C + +T P+ VDC + Sbjct 94 QCASNEYTSYTEQLP-SGGTTFMCLAPNFAQDNCYN---DVSTSPYMWVDCSSTEATFK- 148 Query 128 RATQILKDLDSPVSVDQCASGVGYV 152 +L+ +D C SG ++ Sbjct 149 ----VLQRIDGQTDELLCESGDEFL 169 >gi|223937295|ref|ZP_03629201.1| Mammalian cell entry related domain protein [bacterium Ellin514] gi|223894080|gb|EEF60535.1| Mammalian cell entry related domain protein [bacterium Ellin514] Length=325 Score = 36.6 bits (83), Expect = 1.5, Method: Compositional matrix adjust. Identities = 22/64 (35%), Positives = 33/64 (52%), Gaps = 4/64 (6%) Query 24 NVAELKVGDCVKLAGTPDRPQATKAECGSPASNFKVVAVVQED---HAECPADVDSTYSM 80 N+ +LKVGD VK+AG P Q K + + + +VV + +D H + A + T M Sbjct 45 NIQDLKVGDAVKMAGVP-VGQVEKIQLATNEAKVEVVLRLNKDTPVHTDSKATIKFTGLM 103 Query 81 RNAF 84 N F Sbjct 104 GNYF 107 >gi|324500344|gb|ADY40164.1| Plexin-2 [Ascaris suum] Length=1792 Score = 34.7 bits (78), Expect = 5.4, Method: Compositional matrix adjust. Identities = 18/42 (43%), Positives = 22/42 (53%), Gaps = 0/42 (0%) Query 109 HNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCASGVG 150 +N D RV D +PH R T+ L P+S D CA GVG Sbjct 306 YNIDRCRVGTDTVGLPHIGRDTKCLNKSHLPLSEDTCAMGVG 347 >gi|256374607|ref|YP_003098267.1| hypothetical protein Amir_0454 [Actinosynnema mirum DSM 43827] gi|255918910|gb|ACU34421.1| hypothetical protein Amir_0454 [Actinosynnema mirum DSM 43827] Length=176 Score = 33.9 bits (76), Expect = 8.1, Method: Compositional matrix adjust. Identities = 32/133 (25%), Positives = 52/133 (40%), Gaps = 11/133 (8%) Query 31 GDCVKLAGT-PDRPQATKAECGSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNGSTN 89 G+CV++ G D T +C + +NF+V VV A CP + Y+ A ++ Sbjct 49 GECVRITGADGDSLAVTPDDCDADLANFRVGKVVDGADAPCPE--EGVYT--EARGQGSS 104 Query 90 TICLDIDWVIGGCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCASGV 149 T+CL + V G C D + C + ++ K ++ C G Sbjct 105 TLCLLPNMVEGACYGPDDRGFGGLVKSAC------AGEATIKVTKVIEGSTDTSGCPDGA 158 Query 150 GYVYTQRRFAVCV 162 G Y + CV Sbjct 159 GMSYPEPPITFCV 171 Lambda K H 0.318 0.131 0.407 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 136720389372 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40