BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv1289 Length=210 Score E Sequences producing significant alignments: (Bits) Value gi|15608429|ref|NP_215805.1| hypothetical protein Rv1289 [Mycoba... 430 5e-119 gi|308370682|ref|ZP_07422332.2| hypothetical protein TMCG_00919 ... 424 6e-117 gi|183984094|ref|YP_001852385.1| hypothetical protein MMAR_4123 ... 365 3e-99 gi|240169527|ref|ZP_04748186.1| hypothetical protein MkanA1_0945... 351 4e-95 gi|110833708|ref|YP_692567.1| hypothetical protein ABO_0847 [Alc... 63.5 2e-08 gi|54302873|ref|YP_132866.1| hypothetical protein PBPRB1194 [Pho... 53.9 1e-05 gi|94499083|ref|ZP_01305621.1| hypothetical protein RED65_09854 ... 45.4 0.005 gi|163783523|ref|ZP_02178513.1| hypothetical protein HG1285_1463... 37.0 1.9 gi|242777835|ref|XP_002479114.1| hypothetical protein TSTA_09397... 35.8 3.7 gi|344924974|ref|ZP_08778435.1| hypothetical protein COdytL_1005... 35.8 4.6 >gi|15608429|ref|NP_215805.1| hypothetical protein Rv1289 [Mycobacterium tuberculosis H37Rv] gi|15840737|ref|NP_335774.1| hypothetical protein MT1327 [Mycobacterium tuberculosis CDC1551] gi|31792481|ref|NP_854974.1| hypothetical protein Mb1320 [Mycobacterium bovis AF2122/97] 63 more sequence titlesLength=210 Score = 430 bits (1106), Expect = 5e-119, Method: Compositional matrix adjust. Identities = 210/210 (100%), Positives = 210/210 (100%), Gaps = 0/210 (0%) Query 1 MCVSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFRTALRDSLDIY 60 MCVSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFRTALRDSLDIY Sbjct 1 MCVSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFRTALRDSLDIY 60 Query 61 GVMATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHRWLHGHADESSVEFEVSPYVN 120 GVMATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHRWLHGHADESSVEFEVSPYVN Sbjct 61 GVMATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHRWLHGHADESSVEFEVSPYVN 120 Query 121 ASAALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPPDYQLSWYDHVFFISVWWGWQ 180 ASAALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPPDYQLSWYDHVFFISVWWGWQ Sbjct 121 ASAALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPPDYQLSWYDHVFFISVWWGWQ 180 Query 181 DHFREIVNVDRASLVALDFGDLWNGWTPVG 210 DHFREIVNVDRASLVALDFGDLWNGWTPVG Sbjct 181 DHFREIVNVDRASLVALDFGDLWNGWTPVG 210 >gi|308370682|ref|ZP_07422332.2| hypothetical protein TMCG_00919 [Mycobacterium tuberculosis SUMu003] gi|308371931|ref|ZP_07426695.2| hypothetical protein TMDG_01167 [Mycobacterium tuberculosis SUMu004] gi|308373101|ref|ZP_07431002.2| hypothetical protein TMEG_01185 [Mycobacterium tuberculosis SUMu005] 16 more sequence titles Length=208 Score = 424 bits (1089), Expect = 6e-117, Method: Compositional matrix adjust. Identities = 207/208 (99%), Positives = 208/208 (100%), Gaps = 0/208 (0%) Query 3 VSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFRTALRDSLDIYGV 62 +SVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFRTALRDSLDIYGV Sbjct 1 MSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFRTALRDSLDIYGV 60 Query 63 MATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHRWLHGHADESSVEFEVSPYVNAS 122 MATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHRWLHGHADESSVEFEVSPYVNAS Sbjct 61 MATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHRWLHGHADESSVEFEVSPYVNAS 120 Query 123 AALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPPDYQLSWYDHVFFISVWWGWQDH 182 AALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPPDYQLSWYDHVFFISVWWGWQDH Sbjct 121 AALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPPDYQLSWYDHVFFISVWWGWQDH 180 Query 183 FREIVNVDRASLVALDFGDLWNGWTPVG 210 FREIVNVDRASLVALDFGDLWNGWTPVG Sbjct 181 FREIVNVDRASLVALDFGDLWNGWTPVG 208 >gi|183984094|ref|YP_001852385.1| hypothetical protein MMAR_4123 [Mycobacterium marinum M] gi|183177420|gb|ACC42530.1| conserved hypothetical protein [Mycobacterium marinum M] Length=208 Score = 365 bits (936), Expect = 3e-99, Method: Compositional matrix adjust. Identities = 169/208 (82%), Positives = 190/208 (92%), Gaps = 0/208 (0%) Query 3 VSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFRTALRDSLDIYGV 62 ++VG SV QSL+QWDRK+WDVAMLHACNAVD+T RKRYP+LG GTRFR +RD++DIYGV Sbjct 1 MTVGASVKQSLEQWDRKMWDVAMLHACNAVDDTSRKRYPSLGAGTRFRRVIRDAVDIYGV 60 Query 63 MATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHRWLHGHADESSVEFEVSPYVNAS 122 MATPGVDLE TRFPV VRSDL P+ RPDIADVLYGIHRWLHGH +ES+ FEVSPYVN S Sbjct 61 MATPGVDLENTRFPVAVRSDLTPEMRPDIADVLYGIHRWLHGHDEESATGFEVSPYVNGS 120 Query 123 AALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPPDYQLSWYDHVFFISVWWGWQDH 182 AALR+A+DGKIQLPK+AILGLLA+AVFAPENKGEVIPPDYQLSW+DHVFF+S WWGWQDH Sbjct 121 AALRVASDGKIQLPKTAILGLLAIAVFAPENKGEVIPPDYQLSWFDHVFFVSAWWGWQDH 180 Query 183 FREIVNVDRASLVALDFGDLWNGWTPVG 210 FREIVNVDR+SLVALDFG+ W+ W PVG Sbjct 181 FREIVNVDRSSLVALDFGNSWSDWAPVG 208 >gi|240169527|ref|ZP_04748186.1| hypothetical protein MkanA1_09451 [Mycobacterium kansasii ATCC 12478] Length=212 Score = 351 bits (900), Expect = 4e-95, Method: Compositional matrix adjust. Identities = 169/207 (82%), Positives = 185/207 (90%), Gaps = 1/207 (0%) Query 3 VSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFRTALRDSLDIYGV 62 ++VGESVAQSLQQWDRKLWDVAMLHACNAVD T RKRYP LGVGTRFR +RDSLDI+GV Sbjct 1 MTVGESVAQSLQQWDRKLWDVAMLHACNAVDGTARKRYPALGVGTRFRRVIRDSLDIFGV 60 Query 63 MATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHRWLHGHADESSVEFEVSPYVNAS 122 MATPGV+LE+TRFPV VRSDL PDKRPDIADVL+GIHRWLHGH DE FEV+PYVN+ Sbjct 61 MATPGVNLEETRFPVAVRSDL-PDKRPDIADVLFGIHRWLHGHIDEGPDGFEVTPYVNSG 119 Query 123 AALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPPDYQLSWYDHVFFISVWWGWQDH 182 A LR ANDGKIQL K AILG+LAVAVFAPENKGE IP DYQLSW+DHVF+ISVWWGWQDH Sbjct 120 AVLRTANDGKIQLTKFAILGMLAVAVFAPENKGESIPADYQLSWFDHVFYISVWWGWQDH 179 Query 183 FREIVNVDRASLVALDFGDLWNGWTPV 209 FREIVNVD +SLV L+FGD+W+ WTPV Sbjct 180 FREIVNVDPSSLVTLNFGDMWDRWTPV 206 >gi|110833708|ref|YP_692567.1| hypothetical protein ABO_0847 [Alcanivorax borkumensis SK2] gi|110646819|emb|CAL16295.1| hypothetical protein ABO_0847 [Alcanivorax borkumensis SK2] Length=163 Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 46/167 (28%), Positives = 72/167 (44%), Gaps = 19/167 (11%) Query 19 KLWDVAMLHACNAVDETGRKRYPTLGVGTRFRTALRDSLDIYGVMATPGVDLEKTRFPVG 78 K ++ A++H A+D+T +KR P VG R R L D L+I +AT + F V Sbjct 3 KDFEAALVHYFPALDKTAKKRRPAAKVGERIRAFLDDELEIISDIATKNI------FIVN 56 Query 79 VRSDLLPDKRPDIADVLYGIHRWLHGHADESSVEFEVSPYVNASAALRIANDGKIQLPKS 138 P + +Y R H E E+ P +N + + LP S Sbjct 57 CNGVSFP-------EAIYKFGRTSIAH------EGELDPRLNFNNNSGMEIGDTWNLPPS 103 Query 139 AILGLLAVAVFAPENKGEVIPPDYQLSWYDHVFFISVWWGWQDHFRE 185 I GL + APEN E DY+++ ++ F ++ WG + R+ Sbjct 104 FITGLSIAVILAPENTAERFQKDYEVAIHEERFSVNALWGQRQLIRD 150 >gi|54302873|ref|YP_132866.1| hypothetical protein PBPRB1194 [Photobacterium profundum SS9] gi|46916297|emb|CAG23066.1| hypothetical protein PBPRB1194 [Photobacterium profundum SS9] Length=251 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 50/190 (27%), Positives = 75/190 (40%), Gaps = 22/190 (11%) Query 5 VGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFRTALRDSLDIYGVMA 64 V V QS++ ++ A+++ A+D+T ++R P GVG R + L D + +A Sbjct 79 VSRRVLQSIKHLQTDDFEGALVNLFPAIDQTAKRRRPKDGVGKRIKAFLEDEEKLISSIA 138 Query 65 TPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHRWLHGHADESSVEFEVSPYVNASAA 124 T D L D I D LY R H E E N + + Sbjct 139 T---------------GDCLCDG-ISITDALYKFGRTSIAHEGELDPRLE----FNLNGS 178 Query 125 LRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPPDYQLSWYDHVFFISVWWGWQDHFR 184 ++I +D K LP I G+ A + A EN+ E + + F I WG + Sbjct 179 IQIGSD-KWNLPIGYITGMCAAVIVAEENEAENFDDQLAIPLFGKQFPIESLWGNSHQIK 237 Query 185 -EIVNVDRAS 193 I N R S Sbjct 238 THICNEFRNS 247 >gi|94499083|ref|ZP_01305621.1| hypothetical protein RED65_09854 [Oceanobacter sp. RED65] gi|94428715|gb|EAT13687.1| hypothetical protein RED65_09854 [Oceanobacter sp. RED65] Length=184 Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust. Identities = 42/199 (22%), Positives = 86/199 (44%), Gaps = 27/199 (13%) Query 4 SVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLG-VGTRFRTALRDSLDIYGV 62 +VG + +++Q+ + A++ A+D + ++++P VG R++ +R+ + Sbjct 3 AVGCRLDEAIQKISAGDLENALIQVSIAIDVSSKRKWPKQKKVGERYKNFIREHESLIYF 62 Query 63 MATPGVDLEKTRFPVGVRSDLLP--------DKRPDIADVLYGIHRWLHGHADESSVEFE 114 M+ +G++ D P R DIA V Y R H E S Sbjct 63 MS------------LGIKGDTKPLVSFPNPNGDRYDIAHVYYKAVRNGLLHDGEISENLS 110 Query 115 VSPYVNASAALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPPDYQLSWYDHV-FFI 173 V + I DGKI + K ++ L+ + P N E + +Y +++ + + + Sbjct 111 V-----VDENVLIYKDGKITISKMILMALMLAVICEPVNCNERLSKNYTVTFTNDIDINL 165 Query 174 SVWWGWQDHFREIVNVDRA 192 + +WG +D + + D+A Sbjct 166 NDFWGKRDLLYQKIGHDKA 184 >gi|163783523|ref|ZP_02178513.1| hypothetical protein HG1285_14634 [Hydrogenivirga sp. 128-5-R1-1] gi|159881143|gb|EDP74657.1| hypothetical protein HG1285_14634 [Hydrogenivirga sp. 128-5-R1-1] Length=248 Score = 37.0 bits (84), Expect = 1.9, Method: Compositional matrix adjust. Identities = 22/73 (31%), Positives = 33/73 (46%), Gaps = 0/73 (0%) Query 36 GRKRYPTLGVGTRFRTALRDSLDIYGVMATPGVDLEKTRFPVGVRSDLLPDKRPDIADVL 95 GR Y + G R R R ++ V + PG+DL ++ + L+ +R D A L Sbjct 112 GRISYDKVKYGKRLRELKRKAVSGKPVFSKPGIDLTESEKESVLMQKLIALQRGDSAKTL 171 Query 96 YGIHRWLHGHADE 108 Y + L H DE Sbjct 172 YDVALILEEHGDE 184 >gi|242777835|ref|XP_002479114.1| hypothetical protein TSTA_093970 [Talaromyces stipitatus ATCC 10500] gi|218722733|gb|EED22151.1| hypothetical protein TSTA_093970 [Talaromyces stipitatus ATCC 10500] Length=367 Score = 35.8 bits (81), Expect = 3.7, Method: Compositional matrix adjust. Identities = 21/84 (25%), Positives = 39/84 (47%), Gaps = 9/84 (10%) Query 77 VGVRSDLLPDKRPDIADVLYGIHRWLHGHADES---SVEFEVSPYVNASAALRIANDGKI 133 + S L ++ + D+L+G ++WL H E+ S ++E S AL +N+ + Sbjct 136 IFYHSQALLSEKATVNDILFGFYKWLDDHPSETLFLSFQYE------GSTALHASNNAAV 189 Query 134 QLPKSAILGLLAVAVFAPENKGEV 157 QL L + A + + K E+ Sbjct 190 QLQLYEALTMPAARAYFVQTKDEL 213 >gi|344924974|ref|ZP_08778435.1| hypothetical protein COdytL_10056 [Candidatus Odyssella thessalonicensis L13] Length=291 Score = 35.8 bits (81), Expect = 4.6, Method: Compositional matrix adjust. Identities = 20/49 (41%), Positives = 26/49 (54%), Gaps = 4/49 (8%) Query 15 QWDRKLWDVAMLHACNAVDETGRKR-YPTLGVGTRFRTALRDSLDIYGV 62 W + D LH N VD TG+ YPT G RF A+RD+ D+Y + Sbjct 227 NWKSSICD---LHVDNKVDGTGKYCCYPTRGFNRRFEGAIRDAGDLYNL 272 Lambda K H 0.322 0.138 0.451 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 245963417238 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40