BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv0396 Length=130 Score E Sequences producing significant alignments: (Bits) Value gi|167970791|ref|ZP_02553068.1| hypothetical protein MtubH3_2323... 262 1e-68 gi|15607537|ref|NP_214910.1| hypothetical protein Rv0396 [Mycoba... 262 1e-68 gi|298523870|ref|ZP_07011279.1| conserved hypothetical protein [... 258 2e-67 gi|15839778|ref|NP_334815.1| hypothetical protein MT0406 [Mycoba... 227 5e-58 gi|342858699|ref|ZP_08715354.1| hypothetical protein MCOL_07476 ... 180 7e-44 gi|256398018|ref|YP_003119582.1| hypothetical protein Caci_8928 ... 88.6 3e-16 gi|302868600|ref|YP_003837237.1| hypothetical protein Micau_4147... 60.1 1e-07 gi|338174094|ref|YP_004650904.1| hypothetical protein PUV_01000 ... 36.2 1.8 gi|284167513|ref|YP_003405791.1| blue (type 1) copper domain pro... 35.8 1.9 gi|282890911|ref|ZP_06299427.1| hypothetical protein pah_c030o00... 35.8 2.0 gi|154315087|ref|XP_001556867.1| hypothetical protein BC1G_04885... 34.3 5.4 gi|262194785|ref|YP_003265994.1| hypothetical protein Hoch_1549 ... 34.3 6.7 >gi|167970791|ref|ZP_02553068.1| hypothetical protein MtubH3_23230 [Mycobacterium tuberculosis H37Ra] gi|253797318|ref|YP_003030319.1| hypothetical protein TBMG_00396 [Mycobacterium tuberculosis KZN 1435] gi|308231542|ref|ZP_07412827.2| hypothetical protein TMAG_01655 [Mycobacterium tuberculosis SUMu001] 23 more sequence titlesLength=138 Score = 262 bits (670), Expect = 1e-68, Method: Compositional matrix adjust. Identities = 130/130 (100%), Positives = 130/130 (100%), Gaps = 0/130 (0%) Query 1 VRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG 60 VRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG Sbjct 9 VRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG 68 Query 61 SHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPNLVQVEFDDAAMAEAMEEM 120 SHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPNLVQVEFDDAAMAEAMEEM Sbjct 69 SHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPNLVQVEFDDAAMAEAMEEM 128 Query 121 AEPDDDGEDW 130 AEPDDDGEDW Sbjct 129 AEPDDDGEDW 138 >gi|15607537|ref|NP_214910.1| hypothetical protein Rv0396 [Mycobacterium tuberculosis H37Rv] gi|31791572|ref|NP_854065.1| hypothetical protein Mb0402 [Mycobacterium bovis AF2122/97] gi|121636308|ref|YP_976531.1| hypothetical protein BCG_0433 [Mycobacterium bovis BCG str. Pasteur 1173P2] 46 more sequence titles Length=130 Score = 262 bits (669), Expect = 1e-68, Method: Compositional matrix adjust. Identities = 129/130 (99%), Positives = 130/130 (100%), Gaps = 0/130 (0%) Query 1 VRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG 60 +RALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG Sbjct 1 MRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG 60 Query 61 SHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPNLVQVEFDDAAMAEAMEEM 120 SHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPNLVQVEFDDAAMAEAMEEM Sbjct 61 SHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPNLVQVEFDDAAMAEAMEEM 120 Query 121 AEPDDDGEDW 130 AEPDDDGEDW Sbjct 121 AEPDDDGEDW 130 >gi|298523870|ref|ZP_07011279.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|298493664|gb|EFI28958.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] Length=130 Score = 258 bits (660), Expect = 2e-67, Method: Compositional matrix adjust. Identities = 128/130 (99%), Positives = 129/130 (99%), Gaps = 0/130 (0%) Query 1 VRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG 60 +RALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG Sbjct 1 MRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG 60 Query 61 SHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPNLVQVEFDDAAMAEAMEEM 120 SHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPE NLVQVEFDDAAMAEAMEEM Sbjct 61 SHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPELNLVQVEFDDAAMAEAMEEM 120 Query 121 AEPDDDGEDW 130 AEPDDDGEDW Sbjct 121 AEPDDDGEDW 130 >gi|15839778|ref|NP_334815.1| hypothetical protein MT0406 [Mycobacterium tuberculosis CDC1551] gi|13879907|gb|AAK44629.1| hypothetical protein MT0406 [Mycobacterium tuberculosis CDC1551] Length=138 Score = 227 bits (578), Expect = 5e-58, Method: Compositional matrix adjust. Identities = 126/130 (97%), Positives = 126/130 (97%), Gaps = 0/130 (0%) Query 1 VRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG 60 VRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG Sbjct 9 VRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG 68 Query 61 SHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPNLVQVEFDDAAMAEAMEEM 120 SHLEVGFACDDPTLVEEATAFVADVI SE IDTTCAGPE NLVQVEFDDAAMAEAMEEM Sbjct 69 SHLEVGFACDDPTLVEEATAFVADVIXXSEXIDTTCAGPEXNLVQVEFDDAAMAEAMEEM 128 Query 121 AEPDDDGEDW 130 AEPDDDGEDW Sbjct 129 AEPDDDGEDW 138 >gi|342858699|ref|ZP_08715354.1| hypothetical protein MCOL_07476 [Mycobacterium colombiense CECT 3035] gi|342134403|gb|EGT87583.1| hypothetical protein MCOL_07476 [Mycobacterium colombiense CECT 3035] Length=267 Score = 180 bits (456), Expect = 7e-44, Method: Compositional matrix adjust. Identities = 90/130 (70%), Positives = 99/130 (77%), Gaps = 2/130 (1%) Query 1 VRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAG 60 VR GW R+ KPLL+AK+LVLGHL NVY PD+ GEE+LDFEPRTVWWGSANWT Sbjct 140 VRVAGWDRDGNKPLLHAKILVLGHLVHNVYGPDDTCGEEMLDFEPRTVWWGSANWTTMTR 199 Query 61 SHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPNLVQVEFDDAAMAEAMEEM 120 HLE+GFAC DP LV EAT FV DVIAFSEP+D C GPEPNLV VEFDDAAM EA+ E Sbjct 200 KHLELGFACGDPDLVREATEFVTDVIAFSEPVDRVCVGPEPNLVGVEFDDAAMREALAEQ 259 Query 121 AEPDDDGEDW 130 P DGE+W Sbjct 260 EPP--DGEEW 267 >gi|256398018|ref|YP_003119582.1| hypothetical protein Caci_8928 [Catenulispora acidiphila DSM 44928] gi|256364244|gb|ACU77741.1| hypothetical protein Caci_8928 [Catenulispora acidiphila DSM 44928] Length=272 Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 49/129 (38%), Positives = 68/129 (53%), Gaps = 5/129 (3%) Query 1 VRALGWLREDR---KPLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTV 57 +R GW R +P+L+AK+LVLG Y+ D + +VL F P+ W GSANWT Sbjct 145 IRVAGWRRASNGTARPMLHAKMLVLG--VTTYYEDDEMFAGDVLRFHPKLTWMGSANWTQ 202 Query 58 RAGSHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAGPEPNLVQVEFDDAAMAEAM 117 A H+E G D LV A++ ++ FSEP GPEP+LV V +DD A E Sbjct 203 AARHHIEFGVWSSDADLVRHNYAYLLSLLTFSEPRGAATIGPEPDLVSVVWDDDAFREYF 262 Query 118 EEMAEPDDD 126 + + +D Sbjct 263 AKHRDQYED 271 >gi|302868600|ref|YP_003837237.1| hypothetical protein Micau_4147 [Micromonospora aurantiaca ATCC 27029] gi|302571459|gb|ADL47661.1| hypothetical protein Micau_4147 [Micromonospora aurantiaca ATCC 27029] Length=287 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 49/116 (43%), Positives = 67/116 (58%), Gaps = 4/116 (3%) Query 13 PLLNAKLLVLGHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTVRAGSHLEVGFACDDP 72 P+L+AKL +LG L +D D+ +V F P+ +W GSAN+T + LE GF +DP Sbjct 174 PILHAKLALLGRLW--EHDEDDFGPADVTAFTPKRLWIGSANFTASSRRSLEFGFWTEDP 231 Query 73 TLVEEATAFVADVIAFSEPIDTTCAGPEPNLVQVEFDDAAMAEAMEEM--AEPDDD 126 L+ A F+ D+I SE ID P V ++DDAAMAEA E+ A+ DDD Sbjct 232 VLLRGAEEFLLDLIGNSEGIDPDADVHMPERVLPDYDDAAMAEAAAELRFADLDDD 287 >gi|338174094|ref|YP_004650904.1| hypothetical protein PUV_01000 [Parachlamydia acanthamoebae UV7] gi|336478452|emb|CCB85050.1| putative uncharacterized protein [Parachlamydia acanthamoebae UV7] Length=196 Score = 36.2 bits (82), Expect = 1.8, Method: Compositional matrix adjust. Identities = 26/76 (35%), Positives = 39/76 (52%), Gaps = 15/76 (19%) Query 23 GHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTV-----------RAGSHL--EVGFAC 69 GH AL+ +D NGY EE D P T+W G A + ++G L E+G Sbjct 62 GHKALHAFDYHNGYHEEEEDESPATMWLGGAFGAILGATSALLLAPQSGEKLREELGEKY 121 Query 70 DDPTLVEEATAFVADV 85 D+ + E+A +F+A+V Sbjct 122 DE--IREKAESFMANV 135 >gi|284167513|ref|YP_003405791.1| blue (type 1) copper domain protein [Haloterrigena turkmenica DSM 5511] gi|284017168|gb|ADB63118.1| blue (type 1) copper domain protein [Haloterrigena turkmenica DSM 5511] Length=235 Score = 35.8 bits (81), Expect = 1.9, Method: Compositional matrix adjust. Identities = 25/55 (46%), Positives = 33/55 (60%), Gaps = 8/55 (14%) Query 43 FEPRTVW---WGSANWTVRAGSHLEVGFA--CDDPTLV-EEATAFVADVIAFSEP 91 FEP VW GS NWT +GSH + D+P+LV +EATA+ + V+ SEP Sbjct 92 FEPHVVWIEQGGSVNWTNESGSHSTTAYHPDNDEPSLVPDEATAWDSGVL--SEP 144 >gi|282890911|ref|ZP_06299427.1| hypothetical protein pah_c030o009 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281499195|gb|EFB41498.1| hypothetical protein pah_c030o009 [Parachlamydia acanthamoebae str. Hall's coccus] Length=198 Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust. Identities = 26/76 (35%), Positives = 39/76 (52%), Gaps = 15/76 (19%) Query 23 GHLALNVYDPDNGYGEEVLDFEPRTVWWGSANWTV-----------RAGSHL--EVGFAC 69 GH AL+ +D NGY EE D P T+W G A + ++G L E+G Sbjct 64 GHKALHAFDYHNGYHEEEEDESPATMWLGGAFGAILGATSALLLAPQSGEKLREELGEKY 123 Query 70 DDPTLVEEATAFVADV 85 D+ + E+A +F+A+V Sbjct 124 DE--IREKAESFMANV 137 >gi|154315087|ref|XP_001556867.1| hypothetical protein BC1G_04885 [Botryotinia fuckeliana B05.10] gi|150848423|gb|EDN23616.1| hypothetical protein BC1G_04885 [Botryotinia fuckeliana B05.10] Length=424 Score = 34.3 bits (77), Expect = 5.4, Method: Compositional matrix adjust. Identities = 15/28 (54%), Positives = 22/28 (79%), Gaps = 0/28 (0%) Query 90 EPIDTTCAGPEPNLVQVEFDDAAMAEAM 117 EP++T AGPEPN +++ DDAA+ EA+ Sbjct 394 EPMETDKAGPEPNASEIDSDDAALFEAL 421 >gi|262194785|ref|YP_003265994.1| hypothetical protein Hoch_1549 [Haliangium ochraceum DSM 14365] gi|262078132|gb|ACY14101.1| hypothetical protein Hoch_1549 [Haliangium ochraceum DSM 14365] Length=917 Score = 34.3 bits (77), Expect = 6.7, Method: Compositional matrix adjust. Identities = 29/94 (31%), Positives = 42/94 (45%), Gaps = 6/94 (6%) Query 39 EVLDFEPRTVWWGSANWTVRAGSHLEVGFACDDPTLVEEATAFVADVIAFSEPIDTTCAG 98 EV D PRT W + V S V F+ DP+ + T + ++ D AG Sbjct 651 EVSDLAPRTAWLVPGDTRVSRVSATNVRFSSVDPSALLPRTTWATALLTLDSLTDALPAG 710 Query 99 PEPNLVQVEFDDA--AMAEAMEEMAEPD--DDGE 128 L VEF DA + + + E+ +PD +DGE Sbjct 711 SLTRL--VEFPDAHEGLEQGLVEVYQPDAGEDGE 742 Lambda K H 0.316 0.136 0.431 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 127765705240 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40