BLASTP 2.2.25+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 15,229,318 sequences; 5,219,829,388 total letters Query= Rv3312A Length=103 Score E Sequences producing significant alignments: (Bits) Value gi|15842905|ref|NP_337942.1| hypothetical protein MT3413 [Mycoba... 207 4e-52 gi|31794493|ref|NP_856986.1| secreted protein antigen [Mycobacte... 206 9e-52 gi|183983468|ref|YP_001851759.1| hypothetical protein MMAR_3485 ... 103 9e-21 gi|183984668|ref|YP_001852959.1| hypothetical protein MMAR_4700 ... 82.0 2e-14 gi|169627375|ref|YP_001701024.1| hypothetical protein MAB_0270c ... 53.9 8e-06 gi|118618995|ref|YP_907327.1| hypothetical protein MUL_3736 [Myc... 52.0 3e-05 gi|183983785|ref|YP_001852076.1| hypothetical protein MMAR_3811 ... 50.8 7e-05 gi|317507588|ref|ZP_07965302.1| hypothetical protein HMPREF9336_... 40.0 0.10 gi|183984554|ref|YP_001852845.1| hypothetical protein MMAR_4585 ... 37.7 0.58 gi|296169980|ref|ZP_06851587.1| secreted protein antigen [Mycoba... 37.7 0.60 gi|118619566|ref|YP_907898.1| hypothetical protein MUL_4446 [Myc... 37.4 0.81 gi|296392713|ref|YP_003657597.1| hypothetical protein Srot_0279 ... 37.0 0.96 gi|317507596|ref|ZP_07965310.1| far upstream element-binding pro... 34.7 4.6 gi|320663831|gb|EFX31059.1| putative BigA-like protein [Escheric... 34.7 4.7 gi|291282505|ref|YP_003499323.1| BigA-like protein [Escherichia ... 34.7 4.7 gi|118464215|ref|YP_880490.1| secreted protein antigen [Mycobact... 34.7 4.8 gi|320658999|gb|EFX26622.1| putative BigA-like protein [Escheric... 34.3 6.0 gi|336458734|gb|EGO37694.1| hypothetical protein MAPs_10170 [Myc... 33.9 7.1 gi|169627374|ref|YP_001701023.1| hypothetical protein MAB_0269c ... 33.9 9.2 gi|320190142|gb|EFW64793.1| porin, autotransporter (AT) family [... 33.5 9.3 gi|15831260|ref|NP_310033.1| BigA-like protein [Escherichia coli... 33.5 9.3 >gi|15842905|ref|NP_337942.1| hypothetical protein MT3413 [Mycobacterium tuberculosis CDC1551] gi|13883238|gb|AAK47756.1| hypothetical protein MT3413 [Mycobacterium tuberculosis CDC1551] Length=114 Score = 207 bits (527), Expect = 4e-52, Method: Compositional matrix adjust. Identities = 103/103 (100%), Positives = 103/103 (100%), Gaps = 0/103 (0%) Query 1 MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTC 60 MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTC Sbjct 12 MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTC 71 Query 61 HDDFHRDSDGPDHSRDYPGPILEGPVLDDPGAAPPPPAAGGGA 103 HDDFHRDSDGPDHSRDYPGPILEGPVLDDPGAAPPPPAAGGGA Sbjct 72 HDDFHRDSDGPDHSRDYPGPILEGPVLDDPGAAPPPPAAGGGA 114 >gi|31794493|ref|NP_856986.1| secreted protein antigen [Mycobacterium bovis AF2122/97] gi|57117088|ref|YP_177957.1| secreted protein antigen [Mycobacterium tuberculosis H37Rv] gi|121639236|ref|YP_979460.1| secreted protein antigen [Mycobacterium bovis BCG str. Pasteur 1173P2] 87 more sequence titlesLength=103 Score = 206 bits (524), Expect = 9e-52, Method: Compositional matrix adjust. Identities = 103/103 (100%), Positives = 103/103 (100%), Gaps = 0/103 (0%) Query 1 MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTC 60 MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTC Sbjct 1 MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTC 60 Query 61 HDDFHRDSDGPDHSRDYPGPILEGPVLDDPGAAPPPPAAGGGA 103 HDDFHRDSDGPDHSRDYPGPILEGPVLDDPGAAPPPPAAGGGA Sbjct 61 HDDFHRDSDGPDHSRDYPGPILEGPVLDDPGAAPPPPAAGGGA 103 >gi|183983468|ref|YP_001851759.1| hypothetical protein MMAR_3485 [Mycobacterium marinum M] gi|183176794|gb|ACC41904.1| conserved hypothetical secreted protein [Mycobacterium marinum M] Length=108 Score = 103 bits (257), Expect = 9e-21, Method: Compositional matrix adjust. Identities = 54/94 (58%), Positives = 65/94 (70%), Gaps = 5/94 (5%) Query 1 MYRFACRTLMLAACILAT--GVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPY 58 M+RF +++ A I+A +A G+ A + A AP P YYWCPGQPFDPAWGPNWDP Sbjct 1 MHRFIRLAVLVVAGIIAAVLAMADFGLIANAGAHPAPAPTYYWCPGQPFDPAWGPNWDPT 60 Query 59 TCHDDFHRDSDGPDHSRDY-PG--PILEGPVLDD 89 TCHDD HRD DG DHSRD+ PG P+ E P LD+ Sbjct 61 TCHDDVHRDVDGADHSRDFVPGDLPVDEQPWLDE 94 >gi|183984668|ref|YP_001852959.1| hypothetical protein MMAR_4700 [Mycobacterium marinum M] gi|183177994|gb|ACC43104.1| conserved hypothetical secreted protein [Mycobacterium marinum M] Length=96 Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 42/82 (52%), Positives = 49/82 (60%), Gaps = 2/82 (2%) Query 1 MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVP--DYYWCPGQPFDPAWGPNWDPY 58 M + A A + G+ G+G A++ A P P Y+WCPGQPFDPAWGP WDP Sbjct 1 MSKVARSVAATAIVLTGFGLIGVGAAARAHADDPPWPFVGYHWCPGQPFDPAWGPQWDPT 60 Query 59 TCHDDFHRDSDGPDHSRDYPGP 80 TCHD HRD DG H RDY GP Sbjct 61 TCHDAHHRDMDGTLHDRDYFGP 82 >gi|169627375|ref|YP_001701024.1| hypothetical protein MAB_0270c [Mycobacterium abscessus ATCC 19977] gi|169239342|emb|CAM60370.1| Hypothetical protein MAB_0270c [Mycobacterium abscessus] Length=98 Score = 53.9 bits (128), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 23/41 (57%), Positives = 30/41 (74%), Gaps = 0/41 (0%) Query 39 YYWCPGQPFDPAWGPNWDPYTCHDDFHRDSDGPDHSRDYPG 79 Y+WCPG+ ++P WG NW+ CHDD+HRD DG H RD+ G Sbjct 35 YHWCPGEFWNPIWGFNWEFGECHDDWHRDRDGDWHDRDWHG 75 >gi|118618995|ref|YP_907327.1| hypothetical protein MUL_3736 [Mycobacterium ulcerans Agy99] gi|118571105|gb|ABL05856.1| conserved hypothetical secreted protein [Mycobacterium ulcerans Agy99] Length=98 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 26/53 (50%), Positives = 35/53 (67%), Gaps = 2/53 (3%) Query 13 ACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTCHDDFH 65 A +L G+AG+GV +++AAQ P WCPG +DPAWG NWD CHD++ Sbjct 15 AMVLGLGLAGVGVASEAAAQ--PGAPTQWCPGDFWDPAWGQNWDMGHCHDNWR 65 >gi|183983785|ref|YP_001852076.1| hypothetical protein MMAR_3811 [Mycobacterium marinum M] gi|183177111|gb|ACC42221.1| conserved hypothetical secreted protein [Mycobacterium marinum M] Length=119 Score = 50.8 bits (120), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 25/53 (48%), Positives = 34/53 (65%), Gaps = 2/53 (3%) Query 13 ACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTCHDDFH 65 A +L G+AG+GV +++AAQ P WCPG +DP WG NWD CHD++ Sbjct 13 AMVLGLGLAGVGVASEAAAQ--PGAPTQWCPGDFWDPGWGQNWDMGHCHDNWR 63 >gi|317507588|ref|ZP_07965302.1| hypothetical protein HMPREF9336_01674 [Segniliparus rugosus ATCC BAA-974] gi|316254108|gb|EFV13464.1| hypothetical protein HMPREF9336_01674 [Segniliparus rugosus ATCC BAA-974] Length=106 Score = 40.0 bits (92), Expect = 0.10, Method: Compositional matrix adjust. Identities = 19/33 (58%), Positives = 21/33 (64%), Gaps = 1/33 (3%) Query 35 PVPDYY-WCPGQPFDPAWGPNWDPYTCHDDFHR 66 P PD+Y WCPG +D WG NWD CHDD R Sbjct 39 PAPDHYRWCPGWRWDNRWGRNWDWNRCHDDRFR 71 >gi|183984554|ref|YP_001852845.1| hypothetical protein MMAR_4585 [Mycobacterium marinum M] gi|183177880|gb|ACC42990.1| conserved hypothetical secreted protein [Mycobacterium marinum M] Length=109 Score = 37.7 bits (86), Expect = 0.58, Method: Compositional matrix adjust. Identities = 28/78 (36%), Positives = 35/78 (45%), Gaps = 12/78 (15%) Query 1 MYRFACRTLMLAACILATGVAGLGVGAQSAAQTA------PVPD-----YYWCPGQPFDP 49 M A M+ A +++ GVA G G + A PVP Y WCPG+P P Sbjct 1 MNTTANLKRMITAALVSGGVAVAGFGLTAGTAHAGPGAHGPVPQAPRGPYQWCPGEPV-P 59 Query 50 AWGPNWDPYTCHDDFHRD 67 A G NWD CH + D Sbjct 60 AGGVNWDMNVCHTWYWVD 77 >gi|296169980|ref|ZP_06851587.1| secreted protein antigen [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295895384|gb|EFG75090.1| secreted protein antigen [Mycobacterium parascrofulaceum ATCC BAA-614] Length=121 Score = 37.7 bits (86), Expect = 0.60, Method: Compositional matrix adjust. Identities = 25/55 (46%), Positives = 31/55 (57%), Gaps = 7/55 (12%) Query 11 LAACILATGVAGLG-VGAQSAAQTAPVPDYYWCPGQPFDPAWGP--NWDPYTCHD 62 L ++ G+A G VG AQ AP +WCPG P+DP+WG NWD CHD Sbjct 12 LMGFVVGCGLALFGPVGG--TAQAAPTS--HWCPGNPWDPSWGNVYNWDWNHCHD 62 >gi|118619566|ref|YP_907898.1| hypothetical protein MUL_4446 [Mycobacterium ulcerans Agy99] gi|118571676|gb|ABL06427.1| conserved hypothetical secreted protein [Mycobacterium ulcerans Agy99] Length=109 Score = 37.4 bits (85), Expect = 0.81, Method: Compositional matrix adjust. Identities = 26/69 (38%), Positives = 33/69 (48%), Gaps = 12/69 (17%) Query 10 MLAACILATGVAGLGVGAQSAAQTA------PVPD-----YYWCPGQPFDPAWGPNWDPY 58 M+ A +++ GVA G G + A PVP Y WCPG+P PA G NWD Sbjct 10 MITAALVSGGVAVAGFGLTAGTAHAGPGAHGPVPQAPRGPYQWCPGEPV-PAGGVNWDMN 68 Query 59 TCHDDFHRD 67 CH + D Sbjct 69 VCHTWYWVD 77 >gi|296392713|ref|YP_003657597.1| hypothetical protein Srot_0279 [Segniliparus rotundus DSM 44985] gi|296179860|gb|ADG96766.1| hypothetical protein Srot_0279 [Segniliparus rotundus DSM 44985] Length=109 Score = 37.0 bits (84), Expect = 0.96, Method: Compositional matrix adjust. Identities = 25/54 (47%), Positives = 30/54 (56%), Gaps = 3/54 (5%) Query 11 LAACILATGVAGLGVGAQSAAQTAPVP---DYYWCPGQPFDPAWGPNWDPYTCH 61 + A + AT V G A S A AP P D WCPGQP+D WG N +P +CH Sbjct 4 MTAALFATAVCGAAFLAPSPALAAPAPGHHDKQWCPGQPWDEEWGVNDNPISCH 57 >gi|317507596|ref|ZP_07965310.1| far upstream element-binding protein [Segniliparus rugosus ATCC BAA-974] gi|316254116|gb|EFV13472.1| far upstream element-binding protein [Segniliparus rugosus ATCC BAA-974] Length=111 Score = 34.7 bits (78), Expect = 4.6, Method: Compositional matrix adjust. Identities = 22/55 (40%), Positives = 30/55 (55%), Gaps = 3/55 (5%) Query 7 RTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTCH 61 R+ + +AC++A+ A + A A A D WCPGQP+ WG NWD CH Sbjct 2 RSALGSACLVASCAA---LAALCAPALAAPEDGQWCPGQPWRLDWGVNWDAEHCH 53 >gi|320663831|gb|EFX31059.1| putative BigA-like protein [Escherichia coli O157:H7 str. LSU-61] Length=981 Score = 34.7 bits (78), Expect = 4.7, Method: Composition-based stats. Identities = 22/50 (44%), Positives = 24/50 (48%), Gaps = 6/50 (12%) Query 58 YTCHDD---FHRDSDGPDHSRDYPGPILEG---PVLDDPGAAPPPPAAGG 101 YT DD H +S PD D P P +G PV DD G P PP GG Sbjct 86 YTLSDDDNHHHNNSPVPDDGGDTPVPPDDGGDTPVPDDGGDTPVPPDDGG 135 >gi|291282505|ref|YP_003499323.1| BigA-like protein [Escherichia coli O55:H7 str. CB9615] gi|290762378|gb|ADD56339.1| putative BigA-like protein [Escherichia coli O55:H7 str. CB9615] Length=981 Score = 34.7 bits (78), Expect = 4.7, Method: Composition-based stats. Identities = 22/50 (44%), Positives = 24/50 (48%), Gaps = 6/50 (12%) Query 58 YTCHDD---FHRDSDGPDHSRDYPGPILEG---PVLDDPGAAPPPPAAGG 101 YT DD H +S PD D P P +G PV DD G P PP GG Sbjct 86 YTLSDDDNHHHNNSPVPDDGGDTPVPPDDGGDTPVPDDGGDTPVPPDDGG 135 >gi|118464215|ref|YP_880490.1| secreted protein antigen [Mycobacterium avium 104] gi|118165502|gb|ABK66399.1| secreted protein antigen [Mycobacterium avium 104] Length=127 Score = 34.7 bits (78), Expect = 4.8, Method: Compositional matrix adjust. Identities = 27/66 (41%), Positives = 37/66 (57%), Gaps = 7/66 (10%) Query 7 RTLMLAACILATGVAGLGV-----GAQSAAQTAPVPDYYWCPGQPFDPAWGP--NWDPYT 59 RT AA ++A G + V G +AA AP P +WCPG P++P+WG +WD + Sbjct 6 RTAGWAASVVAGGALAMSVVGLAGGPVAAAAPAPAPTGHWCPGDPWNPSWGNVLDWDWHQ 65 Query 60 CHDDFH 65 CHD H Sbjct 66 CHDWQH 71 >gi|320658999|gb|EFX26622.1| putative BigA-like protein [Escherichia coli O55:H7 str. USDA 5905] Length=991 Score = 34.3 bits (77), Expect = 6.0, Method: Composition-based stats. Identities = 22/50 (44%), Positives = 24/50 (48%), Gaps = 6/50 (12%) Query 58 YTCHDD---FHRDSDGPDHSRDYPGPILEG---PVLDDPGAAPPPPAAGG 101 YT DD H +S PD D P P +G PV DD G P PP GG Sbjct 86 YTLSDDDNHHHNNSPVPDDGGDTPVPPDDGGDTPVPDDGGDTPVPPDDGG 135 >gi|336458734|gb|EGO37694.1| hypothetical protein MAPs_10170 [Mycobacterium avium subsp. paratuberculosis S397] Length=127 Score = 33.9 bits (76), Expect = 7.1, Method: Compositional matrix adjust. Identities = 14/28 (50%), Positives = 20/28 (72%), Gaps = 2/28 (7%) Query 40 YWCPGQPFDPAWGP--NWDPYTCHDDFH 65 +WCPG P++P+WG +WD + CHD H Sbjct 44 HWCPGDPWNPSWGNVLDWDWHQCHDWQH 71 >gi|169627374|ref|YP_001701023.1| hypothetical protein MAB_0269c [Mycobacterium abscessus ATCC 19977] gi|169239341|emb|CAM60369.1| Hypothetical protein MAB_0269c [Mycobacterium abscessus] Length=111 Score = 33.9 bits (76), Expect = 9.2, Method: Compositional matrix adjust. Identities = 17/37 (46%), Positives = 22/37 (60%), Gaps = 0/37 (0%) Query 39 YYWCPGQPFDPAWGPNWDPYTCHDDFHRDSDGPDHSR 75 Y+WCPG+ ++P WG N + CH D D D PD R Sbjct 38 YHWCPGEFWNPIWGFNMNWGECHADGILDRDRPDDWR 74 >gi|320190142|gb|EFW64793.1| porin, autotransporter (AT) family [Escherichia coli O157:H7 str. EC1212] Length=959 Score = 33.5 bits (75), Expect = 9.3, Method: Composition-based stats. Identities = 22/50 (44%), Positives = 24/50 (48%), Gaps = 6/50 (12%) Query 58 YTCHDD---FHRDSDGPDHSRDYPGPILEG---PVLDDPGAAPPPPAAGG 101 YT DD H +S PD D P P +G PV DD G P PP GG Sbjct 86 YTLSDDDNHHHNNSPVPDDGGDTPVPPDDGGDTPVPDDGGDTPVPPDDGG 135 >gi|15831260|ref|NP_310033.1| BigA-like protein [Escherichia coli O157:H7 str. Sakai] gi|195938021|ref|ZP_03083403.1| putative BigA-like protein [Escherichia coli O157:H7 str. EC4024] gi|217329106|ref|ZP_03445186.1| hypothetical protein ESCCO14588_3561 [Escherichia coli O157:H7 str. TW14588] 7 more sequence titles Length=1011 Score = 33.5 bits (75), Expect = 9.3, Method: Composition-based stats. Identities = 22/50 (44%), Positives = 24/50 (48%), Gaps = 6/50 (12%) Query 58 YTCHDD---FHRDSDGPDHSRDYPGPILEG---PVLDDPGAAPPPPAAGG 101 YT DD H +S PD D P P +G PV DD G P PP GG Sbjct 86 YTLSDDDNHHHNNSPVPDDGGDTPVPPDDGGDTPVPDDGGDTPVPPDDGG 135 Lambda K H 0.319 0.143 0.507 Gapped Lambda K H 0.267 0.0410 0.140 Effective search space used: 127822873252 Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects Posted date: Sep 5, 2011 4:36 AM Number of letters in database: 5,219,829,388 Number of sequences in database: 15,229,318 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40