BLASTP 2.2.25+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters

Query= Rv3378c
Length=296

                                                                     Score     E
Sequences producing significant alignments:                          (Bits)  Value

gi|15610514|ref|NP_217895.1| hypothetical protein Rv3378c [Mycob...  617     6e-175
gi|121639304|ref|YP_979528.1| hypothetical protein BCG_3449c [My...  616     1e-174
gi|340628359|ref|YP_004746811.1| hypothetical protein MCAN_34041...  615     2e-174
gi|289747200|ref|ZP_06506578.1| conserved hypothetical protein [...  614     5e-174
gi|330801871|ref|XP_003288946.1| hypothetical protein DICPUDRAFT...  156     4e-36
gi|281209687|gb|EFA83855.1| hypothetical protein PPL_02925 [Poly...  148     8e-34
gi|66810337|ref|XP_638892.1| hypothetical protein DDB_G0283885 [...  143     3e-32
gi|66820362|ref|XP_643805.1| hypothetical protein DDB_G0275279 [...  128     9e-28
gi|66810339|ref|XP_638893.1| hypothetical protein DDB_G0283887 [...  127     2e-27
gi|330801869|ref|XP_003288945.1| hypothetical protein DICPUDRAFT...  124     1e-26
gi|281204970|gb|EFA79164.1| hypothetical protein PPL_07989 [Poly...  119     5e-25
gi|328870186|gb|EGG18561.1| hypothetical protein DFA_04055 [Dict...  119     6e-25
gi|159898667|ref|YP_001544914.1| hypothetical protein Haur_2146 ...  72.4    8e-11
gi|309799105|ref|ZP_07693358.1| conserved hypothetical protein [...  42.4    0.087
gi|322391239|ref|ZP_08064711.1| efflux ABC superfamily ATP bindi...  40.0    0.41
gi|306830221|ref|ZP_07463404.1| efflux ABC superfamily ATP bindi...  39.7    0.51
gi|322378279|ref|ZP_08052761.1| efflux ABC transporter, permease...  38.5    1.3
gi|330804377|ref|XP_003290172.1| hypothetical protein DICPUDRAFT...  36.2    6.7

>gi|15610514|ref|NP_217895.1| hypothetical protein Rv3378c [Mycobacterium tuberculosis H37Rv]
 gi|15842973|ref|NP_338010.1| hypothetical protein MT3488 [Mycobacterium tuberculosis CDC1551]
 gi|31794560|ref|NP_857053.1| hypothetical protein Mb3412c [Mycobacterium bovis AF2122/97]
Length=296

 Score = 617 bits (1592), Expect = 6e-175, Method: Compositional matrix adjust.
 Identities = 296/296 (100%), Positives = 296/296 (100%), Gaps = 0/296 (0%)

Query  1    MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI  60
            MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI
Sbjct  1    MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI  60

Query  61   RILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF  120
            RILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF
Sbjct  61   RILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF  120

Query  121  YGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETH  180
            YGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETH
Sbjct  121  YGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETH  180

Query  181  GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL  240
            GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL
Sbjct  181  GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL  240

Query  241  RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG  296
            RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG
Sbjct  241  RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG  296

>gi|121639304|ref|YP_979528.1| hypothetical protein BCG_3449c [Mycobacterium bovis BCG str.
Pasteur 1173P2] gi|224991801|ref|YP_002646490.1| hypothetical protein JTY_3449 [Mycobacterium bovis BCG str. Tokyo 172] gi|121494952|emb|CAL73438.1| Hypothetical protein BCG_3449c [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224774916|dbj|BAH27722.1| hypothetical protein JTY_3449 [Mycobacterium bovis BCG str. Tokyo 172] gi|341603329|emb|CCC66010.1| hypothetical protein BCGM_3417c [Mycobacterium bovis BCG str. Moreau RDJ] Length=296 Score = 616 bits (1589), Expect = 1e-174, Method: Compositional matrix adjust. Identities = 295/296 (99%), Positives = 295/296 (99%), Gaps = 0/296 (0%) Query 1 MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI 60 MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI Sbjct 1 MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI 60 Query 61 RILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF 120 RILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF Sbjct 61 RILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF 120 Query 121 YGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETH 180 YGDYKKRLPSTAQGAA VKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETH Sbjct 121 YGDYKKRLPSTAQGAAAVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETH 180 Query 181 GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL 240 GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL Sbjct 181 GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL 240 Query 241 RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG 296 RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG Sbjct 241 RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG 296 >gi|340628359|ref|YP_004746811.1| hypothetical protein MCAN_34041 [Mycobacterium canettii CIPT 140010059] gi|340006549|emb|CCC45735.1| hypothetical protein MCAN_34041 [Mycobacterium canettii CIPT 140010059] Length=296 Score = 615 bits (1587), Expect = 2e-174, Method: Compositional matrix adjust. 
Identities = 295/296 (99%), Positives = 296/296 (100%), Gaps = 0/296 (0%) Query 1 MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI 60 MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI Sbjct 1 MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI 60 Query 61 RILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF 120 RILKMLF+HGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF Sbjct 61 RILKMLFDHGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF 120 Query 121 YGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETH 180 YGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETH Sbjct 121 YGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETH 180 Query 181 GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL 240 GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL Sbjct 181 GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL 240 Query 241 RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG 296 RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG Sbjct 241 RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG 296 >gi|289747200|ref|ZP_06506578.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289687728|gb|EFD55216.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] Length=296 Score = 614 bits (1583), Expect = 5e-174, Method: Compositional matrix adjust. 
Identities = 295/296 (99%), Positives = 295/296 (99%), Gaps = 0/296 (0%) Query 1 MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI 60 MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI Sbjct 1 MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSI 60 Query 61 RILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF 120 RILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF Sbjct 61 RILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLF 120 Query 121 YGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETH 180 YGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVF NDAAESVAQFSISWNETH Sbjct 121 YGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFCNDAAESVAQFSISWNETH 180 Query 181 GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL 240 GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL Sbjct 181 GKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL 240 Query 241 RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG 296 RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG Sbjct 241 RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG 296 >gi|330801871|ref|XP_003288946.1| hypothetical protein DICPUDRAFT_79732 [Dictyostelium purpureum] gi|325080977|gb|EGC34510.1| hypothetical protein DICPUDRAFT_79732 [Dictyostelium purpureum] Length=374 Score = 156 bits (394), Expect = 4e-36, Method: Compositional matrix adjust. 
Identities = 90/302 (30%), Positives = 164/302 (55%), Gaps = 22/302 (7%) Query 7 KEFLDLPLVSVAEIV--RCRGPKVSVFPFDGTRRWFHLECNP-------------QYDDY 51 +EF L ++ I+ R + V+ +DGTRR + +E YD Y Sbjct 17 QEFNKLGDSDISNIIKNRLKNCNTMVYAYDGTRRSYLIENTNFNSTNDVEKETLIDYDQY 76 Query 52 QQAALRQSIRILKMLFEHGIETVISPIFSDDLLDRGDRYI---VQALEGMALLANDEEIL 108 + A+++ + L M+F+HGI+T+I P++ L +RG Y+ ++ L G+ L ++EE++ Sbjct 77 CKTAIKKLLYDLVMMFKHGIKTIIYPMWFCTLEERGPEYLPKFIKYLRGLNELLDNEELV 136 Query 109 SFYKEHEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAES 168 YKE+ + V+FYG+Y++ L ++K F+++ T ++T H + FG + +E+ Sbjct 137 QLYKENGIRVIFYGEYRELL-ERGNDLILLKKFEEIAELTKNHTNHTILFGTTIKEPSET 195 Query 169 VAQFSISWNETHGKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSS--GKTSLY 226 + +IS+ + PT+++++E YYG VD +IGF RFST P+L S G LY Sbjct 196 IINNTISFYTKNQYKPTKKDLVENYYGLQVDDVSFYIGFDRFSTDGRPILLSDKGNEDLY 255 Query 227 FTVAPSYYMTETTLRRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGC 286 +TV+P Y T+ R+IL+D ++ R +Y D + ++++ Y + + G+G Sbjct 256 YTVSPHSYFTKNNFRKILFDKLFCRSNVNAKEYKLKVID-VELMKDFYESNSTSIMGIGS 314 Query 287 VH 288 V+ Sbjct 315 VN 316 >gi|281209687|gb|EFA83855.1| hypothetical protein PPL_02925 [Polysphondylium pallidum PN500] Length=369 Score = 148 bits (374), Expect = 8e-34, Method: Compositional matrix adjust. 
Identities = 98/308 (32%), Positives = 151/308 (50%), Gaps = 27/308 (8%) Query 8 EFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHL--ECNPQ-----------YDDYQQA 54 FL+ + ++EIVR K VF +DGTRR HL E N ++DY Sbjct 2 NFLNKSKIEISEIVRTSKTKTLVFAYDGTRR-SHLINEINKNKGTDDSLIEINWNDYSSK 60 Query 55 ALRQSIRILKMLFEHGIETVISPIFSDDLLDRGDRY---IVQALEGMALLANDEEILSFY 111 + ++ I + M+ +HGI TVI P++ L RG Y ++ L G+ L D ++ + Sbjct 61 SFKKMIDLTIMMMKHGIHTVIYPMWFPTLGKRGPEYYPKFIKYLWGLNCLITDSRLMDIF 120 Query 112 KEHEVHVLFYGDYKK--RLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESV 169 + ++FYG++++ R+ + + +++S L T T H L FG E + Sbjct 121 LSLGIRIVFYGEWREFCRIGNDEELENLMES---LMSKTKHCTNHLLLFGTNITSTTEII 177 Query 170 AQFSISWNETHGKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSS--GKTSLYF 227 ++ SI + + H K P++ E+IE YYG VD D++IGF RF T P + S G +LYF Sbjct 178 SKLSIDYFQIHNKLPSKNELIEQYYGVPVDSVDLYIGFDRFCTDGRPPIISEEGSENLYF 237 Query 228 TVAPSYYMTETTLRRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCV 287 TV+P Y + R IL+DHIY R DY +D + ++ Y A G G V Sbjct 238 TVSPHSYFNKKQFRSILFDHIYARSVVNSKDYELKKSDII-LMNEFYNANSMSTLGCGNV 296 Query 288 HDG--IWF 293 WF Sbjct 297 QKNGYYWF 304 >gi|66810337|ref|XP_638892.1| hypothetical protein DDB_G0283885 [Dictyostelium discoideum AX4] gi|60467535|gb|EAL65557.1| hypothetical protein DDB_G0283885 [Dictyostelium discoideum AX4] Length=528 Score = 143 bits (361), Expect = 3e-32, Method: Compositional matrix adjust. 
Identities = 89/320 (28%), Positives = 158/320 (50%), Gaps = 35/320 (10%) Query 7 KEFLDLPLVSVAEIVRCR--GPKVSVFPFDGTRRWFHLE--------------------- 43 +EF L +++I+ R V+ +DGTRR + +E Sbjct 11 QEFNKLTDNEISKIINSRLNNCNTMVYAYDGTRRSYLIENTISKLQTNGIHNNKCKFTGK 70 Query 44 ---CNPQYDDYQQAALRQSIRILKMLFEHGIETVISPIFSDDLLDRGDRYI---VQALEG 97 YDDY + A+ + + L M+F+HGI+T++ P++ L DRG Y+ ++ L G Sbjct 71 DEKTTIDYDDYCKTAISKLLFDLVMMFKHGIKTIVYPMWFCTLEDRGPEYLPKFIKYLSG 130 Query 98 MALLANDEEILSFYKEHEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLC 157 + L +E ++ YKE + V+FYG+Y K L ++++F+ + T N H + Sbjct 131 LKALLENETLVKLYKECGIRVIFYGEYIKLL-ERGNDPILLETFNKIMELTKDNISHTIL 189 Query 158 FGVFGNDAAESVAQFSISWNETHGKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPL 217 FG + ++++ + SI + E + PT+ ++I+ YYG VD+ ++GF RFST P+ Sbjct 190 FGTTIQEPSQTIIENSIDFFEKYNYRPTKNQLIKKYYGVDVDQVSFYLGFDRFSTDGRPI 249 Query 218 LSS--GKTSLYFTVAPSYYMTETTLRRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYR 275 S G LY+T++P Y ++ R++L+D +Y R +Y D + +++ Y Sbjct 250 YISDKGNEDLYYTISPHSYFSKINFRKVLFDKLYCRSNTNAKEYELKLTD-IEMMKEFYE 308 Query 276 AQPDRVFGVGCV--HDGIWF 293 V G+G V H W+ Sbjct 309 NNSTNVMGLGNVNPHGNYWY 328 >gi|66820362|ref|XP_643805.1| hypothetical protein DDB_G0275279 [Dictyostelium discoideum AX4] gi|60471966|gb|EAL69920.1| hypothetical protein DDB_G0275279 [Dictyostelium discoideum AX4] Length=322 Score = 128 bits (322), Expect = 9e-28, Method: Compositional matrix adjust. 
Identities = 79/290 (28%), Positives = 149/290 (52%), Gaps = 22/290 (7%) Query 17 VAEIV--RCRGPKVSVFPFDGTRRWFHLECNPQYD----DYQQAALRQSIRILKMLF--- 67 ++EIV + G V+ FDG+ F L N + D D A+ ++ I K+L+ Sbjct 16 ISEIVIEKLSGNNTIVYAFDGST--FKLNSNNENDMENIDQCSTAMAPNVSINKLLYDLV 73 Query 68 ---EHGIETVISPIFSDDLLDRGDRYI---VQALEGMALLANDEEILSFYKEHEVHVLFY 121 +HGI+T+ P++ D + D+ Y+ +Q L+G++ L +E+++ YKE + V+FY Sbjct 74 MMCQHGIKTICVPMWCDKIEDKSSDYLSYFIQYLQGLSELLENEQLVKMYKETNIRVIFY 133 Query 122 GDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETHG 181 GD+K L ++ F+ + T +NT H + G + +E++ IS+ +G Sbjct 134 GDFKLLLKH-CNALELLNKFELIMEQTKNNTNHTILLGTNIEEPSETIINNIISFYNLNG 192 Query 182 K-PPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLL--SSGKTSLYFTVAPSYYMTET 238 PT ++I+ YYG VD+ +++G +F+T P+L G LY+++ Y+++ Sbjct 193 NVKPTSIDLIKQYYGVMVDQVSLYLGSHKFTTQGRPILICDKGNEDLYYSIGSHEYLSKN 252 Query 239 TLRRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVH 288 R++L+D ++ R +Y D + +++ Y + V GVG V+ Sbjct 253 GFRKVLFDKLFCRKVANAKEYQLKIHD-IKMMKQFYLNNCENVMGVGNVN 301 >gi|66810339|ref|XP_638893.1| hypothetical protein DDB_G0283887 [Dictyostelium discoideum AX4] gi|60467536|gb|EAL65558.1| hypothetical protein DDB_G0283887 [Dictyostelium discoideum AX4] Length=495 Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust. 
Identities = 69/249 (28%), Positives = 137/249 (56%), Gaps = 10/249 (4%) Query 48 YDDYQQAALRQSIRILKMLFEHGIETVISPIFSDDLLDRGDRYI---VQALEGMA-LLAN 103 Y++Y + A+ + + M+F+HGI+ ++ P++ L RG Y+ +Q L G++ LL Sbjct 106 YNEYSKTAVHNFLYLSIMMFQHGIKNIVYPMWFCTLEKRGPEYLPKFIQYLWGLSKLLDP 165 Query 104 DEEILSFYKEHEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGN 163 + + ++E+ V ++FYG+YKK L ++ F+++ T +N+ L G Sbjct 166 NYDFFKLFQENGVRIIFYGEYKKLL-ERGNDNELLSKFEEIMDKTKNNSNKILLLGTNIE 224 Query 164 DAAESVAQFSISWNETHGKPPTRREIIEGYYG--EYVDKADMFIGFGRFSTFDFPLLSS- 220 + ++++ ++S+ + G+ PT+ ++I+ YYG +D ++GF RFST P+L S Sbjct 225 EPSQTIINNTLSFYKKFGREPTKNDLIQHYYGVNTQIDDVSFYLGFDRFSTDGRPILISD 284 Query 221 -GKTSLYFTVAPSYYMTETTLRRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPD 279 G LY+TV+P +++ R++LYDHIY R +Y + + + +++ Y + Sbjct 285 KGAEDLYYTVSPHSFLSTNGFRKVLYDHIYQRTITNAKEY-ELKVNDIEMMKKFYENNSN 343 Query 280 RVFGVGCVH 288 + G+G V+ Sbjct 344 NIMGIGNVN 352 >gi|330801869|ref|XP_003288945.1| hypothetical protein DICPUDRAFT_34855 [Dictyostelium purpureum] gi|325080976|gb|EGC34509.1| hypothetical protein DICPUDRAFT_34855 [Dictyostelium purpureum] Length=508 Score = 124 bits (312), Expect = 1e-26, Method: Compositional matrix adjust. 
Identities = 73/254 (29%), Positives = 142/254 (56%), Gaps = 10/254 (3%) Query 48 YDDYQQAALRQSIRILKMLFEHGIETVISPIFSDDLLDRGDRYI---VQALEGMALLAND 104 Y +Y + A+ + + + +F+HGI+T++ P++ L RG Y+ +Q L G++ L D Sbjct 87 YLEYSKTAIHKFLNLSITMFQHGIKTIVYPMWFCTLEKRGPEYLPKFIQYLWGLSALLED 146 Query 105 EEILSFYKEHEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGND 164 ++ Y E + V+FYG+YKK L + A+++ F+ + T +N + L G + Sbjct 147 PSLVQQYYESGIKVVFYGEYKKLL-ARVNDRALLEKFEKIMELTKNNNKKLLLLGTNIEE 205 Query 165 AAESVAQFSISWNETHGKPPTRREIIEGYYGE-YVDKADMFIGFGRFSTFDFPLLSS--G 221 ++++ ++S+ + GK PT++++++ YYG+ + +IGF RFST P+L S G Sbjct 206 PSQTIINNTLSYFKKFGKEPTKKDLVKEYYGDSNIQDVSFYIGFDRFSTDGRPILISENG 265 Query 222 KTSLYFTVAPSYYMTETTLRRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRV 281 LY++V+P + T R++LYDH+Y R +Y +D + ++++ Y + ++ Sbjct 266 DEDLYYSVSPHSFFTTEHFRKVLYDHLYQRSCVNAKEYELKISD-VEMMKDFYESNAGQI 324 Query 282 FGVGCVHD--GIWF 293 GVGC+ + W+ Sbjct 325 MGVGCIQEQGNYWY 338 >gi|281204970|gb|EFA79164.1| hypothetical protein PPL_07989 [Polysphondylium pallidum PN500] Length=787 Score = 119 bits (299), Expect = 5e-25, Method: Compositional matrix adjust. 
Identities = 80/287 (28%), Positives = 144/287 (51%), Gaps = 27/287 (9%) Query 7 KEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSIRILKML 66 +EFL+L + ++++V G K + Y Q A+R+ + L M+ Sbjct 2 EEFLNLSNIEISKLVSESGNKTML--------------------YSQNAIRKLLDHLLMI 41 Query 67 FEHGIETVISPIFSDDLLDRGDRYI---VQALEGMALLANDEEILSFYKEHEVHVLFYGD 123 FEHGI TVI P++ L RG Y+ + L+G+ L + +L Y + + ++FYG+ Sbjct 42 FEHGISTVIYPMWFYTLEMRGPEYVPKFIGYLQGLKSLLLEPLLLQAYMKAGIRIIFYGE 101 Query 124 YKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSISWNETHGKP 183 +++ L +++ F+ + T +NT+ + FG D + + SI++ + + + Sbjct 102 FRELL-MRENDTKLIEVFERIMEITKNNTKKVVLFGTNIQDPSTLIIDKSINFFKKNNRE 160 Query 184 PTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLL--SSGKTSLYFTVAPSYYMTETTLR 241 PT+ E+I+ YYG V++ + GF RFST P+L G LYF+V+P + T+ LR Sbjct 161 PTKSELIKEYYGVEVEEVSFYFGFDRFSTDGRPILLCDKGNEDLYFSVSPHSFFTQKQLR 220 Query 242 RILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVH 288 ++L+DH++ R +Y D + +++ Y GVG V Sbjct 221 KVLFDHLFCRSVANAKEYQLKVID-VEIMKTFYTMNTGNTMGVGEVQ 266 >gi|328870186|gb|EGG18561.1| hypothetical protein DFA_04055 [Dictyostelium fasciculatum] Length=357 Score = 119 bits (298), Expect = 6e-25, Method: Compositional matrix adjust. 
Identities = 80/297 (27%), Positives = 142/297 (48%), Gaps = 21/297 (7%) Query 10 LDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHL-------------ECNPQYDDYQQAAL 56 + L +A +VR G K VF +DGTRR HL + + ++DY + A Sbjct 4 MSLETDEIAAMVRKSGTKSMVFAYDGTRR-SHLIQEVSKTEGPDSEKLSIDWNDYSKNAF 62 Query 57 RQSIRILKMLFEHGIETVISPIFSDDLLDRGDRY---IVQALEGMALLANDEEILSFYKE 113 ++ + I ++F HG++ + P++ L RG Y + + G+ L +D + Y+ Sbjct 63 KKMLEISVLMFAHGLQEITYPMWFPTLGKRGKEYTPKFISYMWGLNTLYSDPYLREKYEA 122 Query 114 HEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFS 173 + ++FYG++++ L + + + + + + T++ L FG + A +A + Sbjct 123 DGIRIIFYGEWRE-LCRLGEDPELERLLEKIQEDSKHRTKNVLLFGTNISSPATVMANLA 181 Query 174 ISWNETHGKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSS--GKTSLYFTVAP 231 I + + K PTR E+I YYG + DM++GF RF T P + S G +LYFTV+P Sbjct 182 IDHYKKYNKTPTREEMIMDYYGYPLSDVDMYVGFDRFVTDGRPPIISENGNENLYFTVSP 241 Query 232 SYYMTETTLRRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVH 288 + + LR IL+DH++ R +Y D + + + Y GVG + Sbjct 242 HSFFNISVLRSILFDHLFNRTVANTKEYDLTRLD-IKSMHSFYSKNEKTALGVGNIQ 297 >gi|159898667|ref|YP_001544914.1| hypothetical protein Haur_2146 [Herpetosiphon aurantiacus DSM 785] gi|159891706|gb|ABX04786.1| hypothetical protein Haur_2146 [Herpetosiphon aurantiacus DSM 785] Length=289 Score = 72.4 bits (176), Expect = 8e-11, Method: Compositional matrix adjust. 
Identities = 63/250 (26%), Positives = 112/250 (45%), Gaps = 23/250 (9%) Query 8 EFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHL-ECNPQYDDYQQAALRQSIRILKML 66 EFL PL ++ ++ P VF G+RR L + ++Y + + +Q ++ L++ Sbjct 9 EFLHAPLTTIRQV----APATMVFSSGGSRRKAALANMSAAGEEYARWSHQQLLKCLELF 64 Query 67 FEHGIETVISP-IFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLFYGDYK 125 F HGI+ + P + + + Y + +A A + +L +Y+EH +++ Sbjct 65 FSHGIKHLFLPMLLPNQFQETTPNYREHIEQWVAWGAASQTMLEYYQEH--------NWR 116 Query 126 KRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVF--GNDAAESVAQFSISWNETHGKP 183 RL T + + L + L + V D + + Q + +T K Sbjct 117 VRLLDTQYSPILADAAQRLQQPYDHPDQPTLWWFVVRDSEDPWQIIFQAA---QKTVFK- 172 Query 184 PTRREIIEGYYGEYVDKADMFIGFGRFSTFD--FPLLSSGKTSLYFTVAPSYYMTETTLR 241 TR + IE YGE + A++F+ FG+ P L G+ Y+T P Y ++E R Sbjct 173 -TRSQAIEAIYGEPIPPAELFVSFGKPQVNHDLLPPLLVGELQCYWTQKPGYTLSEEEFR 231 Query 242 RILYDHIYLR 251 +ILYD +LR Sbjct 232 QILYDFAFLR 241 >gi|309799105|ref|ZP_07693358.1| conserved hypothetical protein [Streptococcus infantis SK1302] gi|308117340|gb|EFO54763.1| conserved hypothetical protein [Streptococcus infantis SK1302] Length=244 Score = 42.4 bits (98), Expect = 0.087, Method: Compositional matrix adjust. Identities = 26/72 (37%), Positives = 38/72 (53%), Gaps = 7/72 (9%) Query 123 DYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSIS----WNE 178 D+ + +PS AQ VV ++D +IS SSN E+++ G DA E +S WNE Sbjct 93 DFSQTMPSYAQ---VVSLYEDTSISVSSNEENKVLAGSIYTDAKEQGLTIPMSLLKNWNE 149 Query 179 THGKPPTRREII 190 GK T ++I Sbjct 150 QTGKNLTASDVI 161 >gi|322391239|ref|ZP_08064711.1| efflux ABC superfamily ATP binding cassette transporter, permease protein [Streptococcus peroris ATCC 700780] gi|321145992|gb|EFX41381.1| efflux ABC superfamily ATP binding cassette transporter, permease protein [Streptococcus peroris ATCC 700780] Length=433 Score = 40.0 bits (92), Expect = 0.41, Method: Compositional matrix adjust. 
Identities = 27/81 (34%), Positives = 39/81 (49%), Gaps = 13/81 (16%) Query 114 HEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFS 173 HEV D+ + +PS AQ VV ++D +IS SSN + ++ G D E Sbjct 124 HEV------DFSQSMPSYAQ---VVSLYEDTSISVSSNEKEKVLAGTLYTDVNEQGLTIP 174 Query 174 IS----WNETHGKPPTRREII 190 +S WNE GK T ++I Sbjct 175 MSLLKNWNEQTGKNLTASDVI 195 >gi|306830221|ref|ZP_07463404.1| efflux ABC superfamily ATP binding cassette transporter, permease protein [Streptococcus mitis ATCC 6249] gi|304427588|gb|EFM30685.1| efflux ABC superfamily ATP binding cassette transporter, permease protein [Streptococcus mitis ATCC 6249] Length=433 Score = 39.7 bits (91), Expect = 0.51, Method: Compositional matrix adjust. Identities = 28/81 (35%), Positives = 39/81 (49%), Gaps = 13/81 (16%) Query 114 HEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFS 173 HEV D+ + LPS AQ VV ++D +IS SSN + ++ G D E Sbjct 124 HEV------DFSQALPSYAQ---VVSLYEDTSISVSSNEKEKVLAGSLYTDVNEEGLTIP 174 Query 174 IS----WNETHGKPPTRREII 190 +S WNE GK T ++I Sbjct 175 MSLLKNWNEQTGKDLTASDVI 195 >gi|322378279|ref|ZP_08052761.1| efflux ABC transporter, permease protein [Streptococcus sp. M334] gi|321280781|gb|EFX57799.1| efflux ABC transporter, permease protein [Streptococcus sp. M334] Length=419 Score = 38.5 bits (88), Expect = 1.3, Method: Compositional matrix adjust. Identities = 25/72 (35%), Positives = 35/72 (49%), Gaps = 7/72 (9%) Query 123 DYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGVFGNDAAESVAQFSIS----WNE 178 D + +PS AQ VV ++D +IS SSN + ++ G DA E IS WNE Sbjct 113 DLSQTMPSYAQ---VVSLYEDTSISVSSNEKDKVVAGSLYTDANEQGLTIPISLLKNWNE 169 Query 179 THGKPPTRREII 190 G T ++I Sbjct 170 QTGNNLTATDVI 181 >gi|330804377|ref|XP_003290172.1| hypothetical protein DICPUDRAFT_80916 [Dictyostelium purpureum] gi|325079729|gb|EGC33316.1| hypothetical protein DICPUDRAFT_80916 [Dictyostelium purpureum] Length=2335 Score = 36.2 bits (82), Expect = 6.7, Method: Composition-based stats. 
Identities = 22/77 (29%), Positives = 36/77 (47%), Gaps = 8/77 (10%) Query 77 PIFSDDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLFYGDYKKRLPSTAQGAA 136 P + +D+ +R RY++ L G LL ND++ + + + Y KRLP Sbjct 1599 PNWCNDIKNRNQRYVIVPLPGSNLLPNDDDFWGWITLFDDKGIAYIASNKRLP------- 1651 Query 137 VVKSFDDLTISTSSNTE 153 V S D + +S + N E Sbjct 1652 -VNSLDSIPLSPNGNIE 1667

Lambda      K        H
   0.323    0.140    0.427

Gapped
Lambda      K        H
   0.267   0.0410    0.140

Effective search space used: 492672993632

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
    Posted date:  Sep 5, 2011  4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database:  15,229,318

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
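
Note on the statistics above: the gapped Lambda and K values and the effective search space are the inputs to the standard Karlin-Altschul conversion from a raw alignment score S to a bit score, S' = (Lambda*S - ln K) / ln 2, and to an expected value, E = N * 2^(-S'), where N is the effective search space. The following is a minimal sketch of that arithmetic, not the BLAST implementation. The function and constant names are hypothetical, and because this search used compositional score adjustment, the per-alignment values BLAST reports may differ slightly from what this plain formula gives.

```python
import math

# Values copied from the report footer above (gapped statistics).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 492672993632  # "Effective search space used"


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


# Last hit in the report: raw score 82 (hypothetical helper names, real inputs).
bits = bit_score(82, GAPPED_LAMBDA, GAPPED_K)
evalue = expect_value(bits, SEARCH_SPACE)
print(f"{bits:.1f} bits, E = {evalue:.2g}")
```

For the last hit (raw score 82) this comes out close to the reported 36.2 bits and in the same range as the reported E-value of 6.7; the remaining gap is plausibly due to rounding of the printed parameters and to composition-based adjustment.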