BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3528c Length=237 Score E Sequences producing significant alignments: (Bits) Value gi|15610664|ref|NP_218045.1| hypothetical protein Rv3528c [Mycob... 490 6e-137 gi|298527006|ref|ZP_07014415.1| conserved hypothetical protein [... 488 3e-136 gi|340628492|ref|YP_004746944.1| hypothetical protein MCAN_35391... 484 5e-135 gi|308372669|ref|ZP_07429422.2| LOW QUALITY PROTEIN: hypothetica... 464 3e-129 gi|308232491|ref|ZP_07416216.2| hypothetical protein TMAG_00018 ... 405 2e-111 gi|187761548|dbj|BAG31969.1| putative methyltransferase [Mycobac... 332 3e-89 gi|254819585|ref|ZP_05224586.1| hypothetical protein MintA_06659... 332 4e-89 gi|168479938|dbj|BAG11526.1| putative methyltransferase [Mycobac... 324 6e-87 gi|218778897|ref|YP_002430215.1| hypothetical protein Dalk_1044 ... 224 1e-56 gi|196016354|ref|XP_002118030.1| hypothetical protein TRIADDRAFT... 37.7 1.3 gi|320161130|ref|YP_004174354.1| putative oxidoreductase [Anaero... 37.4 1.7 gi|341875557|gb|EGT31492.1| hypothetical protein CAEBREN_06106 [... 37.0 2.7 gi|296188633|ref|ZP_06857021.1| membrane family protein [Clostri... 36.6 3.4 gi|225320663|dbj|BAH29727.1| UDP-glucose 4-epimerase [Dicyema ja... 36.6 3.5 gi|255524832|ref|ZP_05391782.1| conserved hypothetical protein [... 36.6 3.5 gi|326528779|dbj|BAJ97411.1| predicted protein [Hordeum vulgare ... 35.0 9.8 >gi|15610664|ref|NP_218045.1| hypothetical protein Rv3528c [Mycobacterium tuberculosis H37Rv] gi|15843139|ref|NP_338176.1| hypothetical protein MT3629 [Mycobacterium tuberculosis CDC1551] gi|31794704|ref|NP_857197.1| hypothetical protein Mb3558c [Mycobacterium bovis AF2122/97] 48 more sequence titlesLength=237 Score = 490 bits (1262), Expect = 6e-137, Method: Compositional matrix adjust. Identities = 237/237 (100%), Positives = 237/237 (100%), Gaps = 0/237 (0%) Query 1 MMLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAV 60 MMLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAV Sbjct 1 MMLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAV 60 Query 61 LVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTIL 120 LVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTIL Sbjct 61 LVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTIL 120 Query 121 DMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPD 180 DMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPD Sbjct 121 DMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPD 180 Query 181 HDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ 237 HDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ Sbjct 181 HDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ 237 >gi|298527006|ref|ZP_07014415.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|308369154|ref|ZP_07416747.2| hypothetical protein TMBG_02063 [Mycobacterium tuberculosis SUMu002] gi|308371379|ref|ZP_07424755.2| hypothetical protein TMCG_03651 [Mycobacterium tuberculosis SUMu003] 7 more sequence titles Length=236 Score = 488 bits (1256), Expect = 3e-136, Method: Compositional matrix adjust. Identities = 236/236 (100%), Positives = 236/236 (100%), Gaps = 0/236 (0%) Query 2 MLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAVL 61 MLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAVL Sbjct 1 MLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAVL 60 Query 62 VDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTILD 121 VDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTILD Sbjct 61 VDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTILD 120 Query 122 MYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPDH 181 MYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPDH Sbjct 121 MYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPDH 180 Query 182 DKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ 237 DKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ Sbjct 181 DKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ 236 >gi|340628492|ref|YP_004746944.1| hypothetical protein MCAN_35391 [Mycobacterium canettii CIPT 140010059] gi|340006682|emb|CCC45870.1| hypothetical protein MCAN_35391 [Mycobacterium canettii CIPT 140010059] Length=237 Score = 484 bits (1246), Expect = 5e-135, Method: Compositional matrix adjust. Identities = 233/237 (99%), Positives = 235/237 (99%), Gaps = 0/237 (0%) Query 1 MMLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAV 60 MMLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAV Sbjct 1 MMLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAV 60 Query 61 LVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTIL 120 LVDGRITPTVAARA SYPQLRVIEGNFGD+EIADKVGNVDALFLFDVLLHQVSPDWDTIL Sbjct 61 LVDGRITPTVAARAKSYPQLRVIEGNFGDEEIADKVGNVDALFLFDVLLHQVSPDWDTIL 120 Query 121 DMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPD 180 DMYAKNVRCLLIYNQQW GSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLF+KLDKKHPD Sbjct 121 DMYAKNVRCLLIYNQQWTGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFEKLDKKHPD 180 Query 181 HDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ 237 HDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ Sbjct 181 HDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ 237 >gi|308372669|ref|ZP_07429422.2| LOW QUALITY PROTEIN: hypothetical protein TMEG_00016 [Mycobacterium tuberculosis SUMu005] gi|308340312|gb|EFP29163.1| LOW QUALITY PROTEIN: hypothetical protein TMEG_00016 [Mycobacterium tuberculosis SUMu005] Length=225 Score = 464 bits (1195), Expect = 3e-129, Method: Compositional matrix adjust. Identities = 225/225 (100%), Positives = 225/225 (100%), Gaps = 0/225 (0%) Query 13 LVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAVLVDGRITPTVAA 72 LVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAVLVDGRITPTVAA Sbjct 1 LVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAVLVDGRITPTVAA 60 Query 73 RANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTILDMYAKNVRCLLI 132 RANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTILDMYAKNVRCLLI Sbjct 61 RANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTILDMYAKNVRCLLI 120 Query 133 YNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPDHDKPWRDIPDIW 192 YNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPDHDKPWRDIPDIW Sbjct 121 YNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPDHDKPWRDIPDIW 180 Query 193 QWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ 237 QWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ Sbjct 181 QWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ 225 >gi|308232491|ref|ZP_07416216.2| hypothetical protein TMAG_00018 [Mycobacterium tuberculosis SUMu001] gi|308372573|ref|ZP_07429119.2| hypothetical protein TMDG_01258 [Mycobacterium tuberculosis SUMu004] gi|308376156|ref|ZP_07437821.2| hypothetical protein TMHG_02586 [Mycobacterium tuberculosis SUMu008] 12 more sequence titles Length=196 Score = 405 bits (1042), Expect = 2e-111, Method: Compositional matrix adjust. Identities = 195/196 (99%), Positives = 196/196 (100%), Gaps = 0/196 (0%) Query 42 VEGAYTFRALDKYPVKEAVLVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDA 101 +EGAYTFRALDKYPVKEAVLVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDA Sbjct 1 MEGAYTFRALDKYPVKEAVLVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDA 60 Query 102 LFLFDVLLHQVSPDWDTILDMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPHS 161 LFLFDVLLHQVSPDWDTILDMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPHS Sbjct 61 LFLFDVLLHQVSPDWDTILDMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPHS 120 Query 162 KLNKAYRDLFQKLDKKHPDHDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGF 221 KLNKAYRDLFQKLDKKHPDHDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGF Sbjct 121 KLNKAYRDLFQKLDKKHPDHDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGF 180 Query 222 GWLPNIQNRAFLFARQ 237 GWLPNIQNRAFLFARQ Sbjct 181 GWLPNIQNRAFLFARQ 196 >gi|187761548|dbj|BAG31969.1| putative methyltransferase [Mycobacterium intracellulare] Length=233 Score = 332 bits (851), Expect = 3e-89, Method: Compositional matrix adjust. Identities = 155/223 (70%), Positives = 185/223 (83%), Gaps = 0/223 (0%) Query 15 RGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAVLVDGRITPTVAARA 74 + KI LIDRAFTS I+SFADLGAVW VEGAYTF AL+ + +K+A LVD +TPTV+ARA Sbjct 11 KDKIELIDRAFTSLGIQSFADLGAVWRVEGAYTFHALETHQIKDAALVDLNVTPTVSARA 70 Query 75 NSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTILDMYAKNVRCLLIYN 134 S+PQLR+I GNFGDQ +AD+VGNVDA+FLFDVLLHQVSPDWD IL+MYAK LLIYN Sbjct 71 QSHPQLRLIGGNFGDQAVADQVGNVDAVFLFDVLLHQVSPDWDAILEMYAKQTNSLLIYN 130 Query 135 QQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPDHDKPWRDIPDIWQW 194 QQW GS TVRLLDLGEK YFRNVPHS+ + Y +LF+KL++KHPD D+ WRD P +WQW Sbjct 131 QQWTGSEETVRLLDLGEKEYFRNVPHSRRVEEYENLFEKLNEKHPDMDRTWRDFPGVWQW 190 Query 195 GITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ 237 GITDADLE+K S+LGFKL+YK+DC FG L N +N+AF+F R+ Sbjct 191 GITDADLEAKVSQLGFKLVYKKDCGRFGRLRNFRNQAFIFTRE 233 >gi|254819585|ref|ZP_05224586.1| hypothetical protein MintA_06659 [Mycobacterium intracellulare ATCC 13950] Length=226 Score = 332 bits (850), Expect = 4e-89, Method: Compositional matrix adjust. Identities = 155/223 (70%), Positives = 185/223 (83%), Gaps = 0/223 (0%) Query 15 RGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAVLVDGRITPTVAARA 74 + KI LIDRAFTS I+SFADLGAVW VEGAYTF AL+ + +K+A LVD +TPTV+ARA Sbjct 4 KDKIELIDRAFTSLGIQSFADLGAVWRVEGAYTFHALETHQIKDAALVDLNVTPTVSARA 63 Query 75 NSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTILDMYAKNVRCLLIYN 134 S+PQLR+I GNFGDQ +AD+VGNVDA+FLFDVLLHQVSPDWD IL+MYAK LLIYN Sbjct 64 QSHPQLRLIGGNFGDQAVADQVGNVDAVFLFDVLLHQVSPDWDAILEMYAKQTNSLLIYN 123 Query 135 QQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPDHDKPWRDIPDIWQW 194 QQW GS TVRLLDLGEK YFRNVPHS+ + Y +LF+KL++KHPD D+ WRD P +WQW Sbjct 124 QQWTGSEETVRLLDLGEKEYFRNVPHSRRVEEYENLFEKLNEKHPDMDRTWRDFPGVWQW 183 Query 195 GITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ 237 GITDADLE+K S+LGFKL+YK+DC FG L N +N+AF+F R+ Sbjct 184 GITDADLEAKVSQLGFKLVYKKDCGRFGRLRNFRNQAFIFTRE 226 >gi|168479938|dbj|BAG11526.1| putative methyltransferase [Mycobacterium intracellulare] Length=251 Score = 324 bits (831), Expect = 6e-87, Method: Compositional matrix adjust. Identities = 149/224 (67%), Positives = 182/224 (82%), Gaps = 0/224 (0%) Query 13 LVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAVLVDGRITPTVAA 72 ++R K+++ID AF+S +ESFADLG VWGVEGAYTF ALDK+ +K A LVD +TPTV Sbjct 28 ILRDKLDMIDHAFSSLGVESFADLGGVWGVEGAYTFHALDKHEIKAAALVDTHLTPTVVD 87 Query 73 RANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTILDMYAKNVRCLLI 132 RA SYPQLR+I GNFGDQ +AD+VG+VDA+FLFDVLLHQVSP+WD++L MYAKN R L++ Sbjct 88 RAKSYPQLRLINGNFGDQNVADEVGDVDAIFLFDVLLHQVSPNWDSVLKMYAKNARVLVV 147 Query 133 YNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPDHDKPWRDIPDIW 192 YNQQW GS TVRLLDLGE+ YFRNVPH + K YR+LF+KLD+KHPDHD+ WRD+ IW Sbjct 148 YNQQWTGSDGTVRLLDLGEEEYFRNVPHPRYRKPYRNLFEKLDEKHPDHDRAWRDVHHIW 207 Query 193 QWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFAR 236 QWGITD DLE+ + LGF L YK++C FG L N NRAF+F+R Sbjct 208 QWGITDDDLEAAVARLGFDLKYKKECGRFGRLANFTNRAFIFSR 251 >gi|218778897|ref|YP_002430215.1| hypothetical protein Dalk_1044 [Desulfatibacillum alkenivorans AK-01] gi|218760281|gb|ACL02747.1| conserved hypothetical protein [Desulfatibacillum alkenivorans AK-01] Length=242 Score = 224 bits (570), Expect = 1e-56, Method: Compositional matrix adjust. Identities = 113/239 (48%), Positives = 152/239 (64%), Gaps = 2/239 (0%) Query 1 MMLDRLRQGGYWLVRGKINLIDRAF--TSCRIESFADLGAVWGVEGAYTFRALDKYPVKE 58 M L + + + V K +ID A S SFADLG +W V+G YTF A + + V++ Sbjct 1 MSLYKNLKPAHLTVLDKKEIIDYALGRLSPSPCSFADLGGIWDVDGEYTFHAFENHDVEK 60 Query 59 AVLVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDT 118 A LVD T A+A P L++I+ NFG E+ +K+G VDA+F+FDVLLHQVSPDWD Sbjct 61 AFLVDTDFTDKALAKAEKRPALQIIQDNFGRPEVVEKIGPVDAVFMFDVLLHQVSPDWDR 120 Query 119 ILDMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKH 178 IL+MY++ C +I+NQQW TVRLLDLGEK YF NVPH + Y+ LF +LD H Sbjct 121 ILEMYSRICSCFVIFNQQWTRGDHTVRLLDLGEKEYFANVPHDPEHPNYKGLFDRLDDMH 180 Query 179 PDHDKPWRDIPDIWQWGITDADLESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ 237 P H + RDI +IWQWGI+D DL +K +GF L Y ++C F L + N AF+F+++ Sbjct 181 PQHRRRIRDIHNIWQWGISDKDLIAKMEAMGFGLQYYKNCGQFQKLEHFYNHAFVFSKR 239 >gi|196016354|ref|XP_002118030.1| hypothetical protein TRIADDRAFT_62058 [Trichoplax adhaerens] gi|190579417|gb|EDV19513.1| hypothetical protein TRIADDRAFT_62058 [Trichoplax adhaerens] Length=1314 Score = 37.7 bits (86), Expect = 1.3, Method: Composition-based stats. Identities = 28/108 (26%), Positives = 52/108 (49%), Gaps = 13/108 (12%) Query 90 QEIADKVGNVDALFLFDVL-----LHQVSPDWDTILDMYAKNV----RCLLIYNQQWIGS 140 +EIA+K+ D + + + L + QV DW+ L+ Y K+V RC++ YN S Sbjct 46 REIAEKLK--DEMMIAESLHRYGDIKQVERDWNEALNSYMKSVDIKLRCIVEYNPSIANS 103 Query 141 TTTVRLLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPDHDKPWRDI 188 + ++ + +Y + S L K+ LD+ HPD + + ++ Sbjct 104 YNEIGIIYYDQGNYKEAI--SMLEKSLNIRLSILDRHHPDITRSYNNV 149 >gi|320161130|ref|YP_004174354.1| putative oxidoreductase [Anaerolinea thermophila UNI-1] gi|319994983|dbj|BAJ63754.1| putative oxidoreductase [Anaerolinea thermophila UNI-1] Length=261 Score = 37.4 bits (85), Expect = 1.7, Method: Compositional matrix adjust. Identities = 36/128 (29%), Positives = 60/128 (47%), Gaps = 15/128 (11%) Query 120 LDMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEK---HYFRN-VPHSKLNKAYRDLFQKLD 175 L KN+ L+ + + IG +R +LG +Y RN P ++ R++ +K+ Sbjct 8 LPFLEKNI--LVTGSGRGIGRAIALRFAELGANVVINYHRNETPAQEVANQIREMGRKVL 65 Query 176 KKHPDHDKPWRDIPDIW-----QWGITDADLESKASELGF-KLLYKEDCRGFGWLPNIQN 229 + KP DI ++ +WG D + + AS GF + ++ G+ W N+ Sbjct 66 VIRANLAKP-EDIDLLFDSIEQEWGSLDGFISNAAS--GFNRPALQQKVTGWDWTMNVNA 122 Query 230 RAFLFARQ 237 RAFLFA Q Sbjct 123 RAFLFATQ 130 >gi|341875557|gb|EGT31492.1| hypothetical protein CAEBREN_06106 [Caenorhabditis brenneri] Length=1115 Score = 37.0 bits (84), Expect = 2.7, Method: Compositional matrix adjust. Identities = 19/66 (29%), Positives = 39/66 (60%), Gaps = 3/66 (4%) Query 146 LLDLGEKHYFRNVPHSKLNKAYRDLFQKLDKKHPDH-DKPWRDIPDIWQWGITDADLESK 204 +L+ ++ YF + H K + ++++++L K P +K R++ D+ Q G T DL+++ Sbjct 972 ILENSQQSYF--IDHEKFEELKKEIWKELAKNAPKQLEKKKREVQDVGQNGFTKKDLKNQ 1029 Query 205 ASELGF 210 +LGF Sbjct 1030 LHQLGF 1035 >gi|296188633|ref|ZP_06857021.1| membrane family protein [Clostridium carboxidivorans P7] gi|296046897|gb|EFG86343.1| membrane family protein [Clostridium carboxidivorans P7] Length=291 Score = 36.6 bits (83), Expect = 3.4, Method: Compositional matrix adjust. Identities = 27/75 (36%), Positives = 31/75 (42%), Gaps = 16/75 (21%) Query 4 DRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAVLVD 63 RL +GG WLV G + LI F + I SFA AV+ K LVD Sbjct 32 GRLEEGGNWLVYGIVGLIANFFDTLGIGSFAPTTAVY----------------KFLKLVD 75 Query 64 GRITPTVAARANSYP 78 RI P AN P Sbjct 76 DRIIPGTLNVANCVP 90 >gi|225320663|dbj|BAH29727.1| UDP-glucose 4-epimerase [Dicyema japonicum] Length=341 Score = 36.6 bits (83), Expect = 3.5, Method: Compositional matrix adjust. Identities = 29/98 (30%), Positives = 47/98 (48%), Gaps = 12/98 (12%) Query 83 IEGNFGDQEIADKVGNVDALFLFDVLLHQVSPDWDTILDMYAKNVRCLLIYNQQWIGSTT 142 IEG+ DQEI +K+ + +++F ++H ++VR L Y Q +G Sbjct 56 IEGDINDQEILNKIFSENSIF---SVIHLAGS------KAVGESVRMPLKYYQNNVGGAM 106 Query 143 TVRLLDLGEKHYFRNVPHSKLNKAYRD-LFQKLDKKHP 179 T LL + + H RN S Y D ++ +D+KHP Sbjct 107 T--LLKVMDDHGVRNFIFSSSATVYGDPVYLPIDEKHP 142 >gi|255524832|ref|ZP_05391782.1| conserved hypothetical protein [Clostridium carboxidivorans P7] gi|255511499|gb|EET87789.1| conserved hypothetical protein [Clostridium carboxidivorans P7] Length=266 Score = 36.6 bits (83), Expect = 3.5, Method: Compositional matrix adjust. Identities = 27/75 (36%), Positives = 31/75 (42%), Gaps = 16/75 (21%) Query 4 DRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRALDKYPVKEAVLVD 63 RL +GG WLV G + LI F + I SFA AV+ K LVD Sbjct 7 GRLEEGGNWLVYGIVGLIANFFDTLGIGSFAPTTAVY----------------KFLKLVD 50 Query 64 GRITPTVAARANSYP 78 RI P AN P Sbjct 51 DRIIPGTLNVANCVP 65 >gi|326528779|dbj|BAJ97411.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326534194|dbj|BAJ89447.1| predicted protein [Hordeum vulgare subsp. vulgare] Length=390 Score = 35.0 bits (79), Expect = 9.8, Method: Compositional matrix adjust. Identities = 24/92 (27%), Positives = 48/92 (53%), Gaps = 14/92 (15%) Query 100 DALFLFDVLLHQV-SPDWDTILDMYAKNVRCLLIYNQQWIGSTTTVRLLDLGEKHYFRNV 158 D+ F F V+ H + D+DT L +Y + I + WI +T+++L G Sbjct 308 DSNFYFTVIFHLCKAGDFDTALSVYNE------IAPRNWIPCFSTMKMLVNGL------A 355 Query 159 PHSKLNKAYRDLFQKLDKKHPDHDKPWRDIPD 190 S++++A + + +K+ +K PD D W+++ + Sbjct 356 GSSRIDEA-KGIIEKMKEKFPDRDAGWKEVEE 386 Lambda K H 0.324 0.141 0.453 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 322714888716 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40