BLASTP 2.2.25+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           15,229,318 sequences; 5,219,829,388 total letters

Query= Rv0395
Length=134

                                                                     Score  E
Sequences producing significant alignments:                          (Bits) Value

gi|167970792|ref|ZP_02553069.1| hypothetical protein MtubH3_2323...  273    5e-72
gi|15607536|ref|NP_214909.1| hypothetical protein Rv0395 [Mycoba...  273    7e-72
gi|15839777|ref|NP_334814.1| hypothetical protein MT0405 [Mycoba...  271    2e-71
gi|323721201|gb|EGB30262.1| hypothetical protein TMMG_03150 [Myc...  265    2e-69
gi|342858699|ref|ZP_08715354.1| hypothetical protein MCOL_07476 ...  186    1e-45
gi|256398018|ref|YP_003119582.1| hypothetical protein Caci_8928 ...  49.7   2e-04
gi|124360420|gb|ABN08430.1| hypothetical protein MtrDRAFT_AC1573...  38.1   0.45
gi|452504|gb|AAA62129.1| rhamnosyl transferase [Pseudomonas aeru...  36.6   1.4
gi|281331816|emb|CBI71031.1| rhamnosyltrasferase-1 [Pseudomonas ...  36.2   1.5
gi|152987698|ref|YP_001347032.1| rhamnosyltransferase chain B [P...  36.2   1.6
gi|281331810|emb|CBI71028.1| rhamnosyltransferase-1 [Pseudomonas...  36.2   1.8
gi|15598674|ref|NP_252168.1| rhamnosyltransferase chain B [Pseud...  36.2   1.8
gi|281331812|emb|CBI71029.1| rhamnosyltransferase-1 [Pseudomonas...  36.2   1.9
gi|254236427|ref|ZP_04929750.1| rhamnosyltransferase chain B [Ps...  36.2   1.9
gi|281331822|emb|CBI71034.1| rhamnosyltransferase-1 [Pseudomonas...  36.2   1.9
gi|218890274|ref|YP_002439138.1| rhamnosyltransferase chain B [P...  35.8   2.1

>gi|167970792|ref|ZP_02553069.1| hypothetical protein MtubH3_23235 [Mycobacterium tuberculosis H37Ra]
Length=146

 Score = 273 bits (699), Expect = 5e-72, Method: Compositional matrix adjust.
 Identities = 134/134 (100%), Positives = 134/134 (100%), Gaps = 0/134 (0%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI
Sbjct  13   MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  72

Query  61   GCVPWLSSEAVAETLLALSVFCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGS  120
            GCVPWLSSEAVAETLLALSVFCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGS
Sbjct  73   GCVPWLSSEAVAETLLALSVFCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGS  132

Query  121  RCSSARGRLCLSMS  134
            RCSSARGRLCLSMS
Sbjct  133  RCSSARGRLCLSMS  146

>gi|15607536|ref|NP_214909.1| hypothetical protein Rv0395 [Mycobacterium tuberculosis H37Rv]
 gi|148660160|ref|YP_001281683.1| hypothetical protein MRA_0401A [Mycobacterium tuberculosis H37Ra]
 gi|307082877|ref|ZP_07491990.1| hypothetical protein TMLG_01817 [Mycobacterium tuberculosis SUMu012]
 gi|1817710|emb|CAB06601.1| HYPOTHETICAL PROTEIN Rv0395 [Mycobacterium tuberculosis H37Rv]
 gi|148504312|gb|ABQ72121.1| hypothetical protein MRA_0401A [Mycobacterium tuberculosis H37Ra]
 gi|308367402|gb|EFP56253.1| hypothetical protein TMLG_01817 [Mycobacterium tuberculosis SUMu012]
Length=134

 Score = 273 bits (698), Expect = 7e-72, Method: Compositional matrix adjust.
 Identities = 134/134 (100%), Positives = 134/134 (100%), Gaps = 0/134 (0%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI
Sbjct  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60

Query  61   GCVPWLSSEAVAETLLALSVFCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGS  120
            GCVPWLSSEAVAETLLALSVFCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGS
Sbjct  61   GCVPWLSSEAVAETLLALSVFCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGS  120

Query  121  RCSSARGRLCLSMS  134
            RCSSARGRLCLSMS
Sbjct  121  RCSSARGRLCLSMS  134

>gi|15839777|ref|NP_334814.1| hypothetical protein MT0405 [Mycobacterium tuberculosis CDC1551]
 gi|31791571|ref|NP_854064.1| hypothetical protein Mb0401 [Mycobacterium bovis AF2122/97]
 gi|121636307|ref|YP_976530.1| hypothetical protein BCG_0432 [Mycobacterium bovis BCG str. Pasteur 1173P2]
 68 more sequence titles
Length=134

 Score = 271 bits (694), Expect = 2e-71, Method: Compositional matrix adjust.
 Identities = 133/134 (99%), Positives = 133/134 (99%), Gaps = 0/134 (0%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI
Sbjct  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60

Query  61   GCVPWLSSEAVAETLLALSVFCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGS  120
            GCVPWLSSEAVAETLLALS FCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGS
Sbjct  61   GCVPWLSSEAVAETLLALSAFCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGS  120

Query  121  RCSSARGRLCLSMS  134
            RCSSARGRLCLSMS
Sbjct  121  RCSSARGRLCLSMS  134

>gi|323721201|gb|EGB30262.1| hypothetical protein TMMG_03150 [Mycobacterium tuberculosis CDC1551A]
 gi|339293448|gb|AEJ45559.1| hypothetical protein CCDC5079_0369 [Mycobacterium tuberculosis CCDC5079]
 gi|339297092|gb|AEJ49202.1| hypothetical protein CCDC5180_0365 [Mycobacterium tuberculosis CCDC5180]
Length=131

 Score = 265 bits (676), Expect = 2e-69, Method: Compositional matrix adjust.
 Identities = 130/131 (99%), Positives = 130/131 (99%), Gaps = 0/131 (0%)

Query  4    MPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAIGCV  63
            MPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAIGCV
Sbjct  1    MPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAIGCV  60

Query  64   PWLSSEAVAETLLALSVFCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGSRCS  123
            PWLSSEAVAETLLALS FCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGSRCS
Sbjct  61   PWLSSEAVAETLLALSAFCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHGSRCS  120

Query  124  SARGRLCLSMS  134
            SARGRLCLSMS
Sbjct  121  SARGRLCLSMS  131

>gi|342858699|ref|ZP_08715354.1| hypothetical protein MCOL_07476 [Mycobacterium colombiense CECT 3035]
 gi|342134403|gb|EGT87583.1| hypothetical protein MCOL_07476 [Mycobacterium colombiense CECT 3035]
Length=267

 Score = 186 bits (471), Expect = 1e-45, Method: Compositional matrix adjust.
 Identities = 85/119 (72%), Positives = 100/119 (85%), Gaps = 0/119 (0%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
             D M LGDYE FR WSGKPRAWGP E+GWRAWFGG++VDGLCEV++E LAV+RRG+PAAI
Sbjct  2    FDEMALGDYEVFRRWSGKPRAWGPHEAGWRAWFGGQVVDGLCEVIEEDLAVKRRGMPAAI  61

Query  61   GCVPWLSSEAVAETLLALSVFCVVIDKGTSFPSRLRNPDKGFPNVALLRLRDMAPSEHG  119
            GCVPW +S+ VA  LL L+ FCVV+DK T FP RLRNP+K  PNV+L+RLRDMAPS+ G
Sbjct  62   GCVPWFTSQPVARRLLDLTAFCVVVDKRTVFPDRLRNPEKALPNVSLVRLRDMAPSDSG  120

>gi|256398018|ref|YP_003119582.1| hypothetical protein Caci_8928 [Catenulispora acidiphila DSM 44928]
 gi|256364244|gb|ACU77741.1| hypothetical protein Caci_8928 [Catenulispora acidiphila DSM 44928]
Length=272

 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
 Identities = 35/95 (37%), Positives = 43/95 (46%), Gaps = 14/95 (14%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAV--------R  52
            MD  PLGD+E        PR WG          G  ++D L   L  H +         R
Sbjct  1    MDMRPLGDHEQL--LGSTPRPWGFGT----VVHGSGVLDDLVAGLARHGSTDWTGEYWRR  54

Query  53   RRGVPAAIGCVPWLSSEAVAETLLALSVFCVVIDK  87
                 AAIGCVPWL+  AVAE L +    C+V+DK
Sbjct  55   LEPAAAAIGCVPWLTDFAVAEALASFDQCCIVVDK  89

>gi|124360420|gb|ABN08430.1| hypothetical protein MtrDRAFT_AC157375g6v1 [Medicago truncatula]
Length=92

 Score = 38.1 bits (87), Expect = 0.45, Method: Compositional matrix adjust.
 Identities = 28/93 (31%), Positives = 43/93 (47%), Gaps = 6/93 (6%)

Query  18   KPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAIGCVPWLSSEAVAETLLA  77
            K ++ G ++ G R  FGG  VDG CE +    A    G   A+ C  W    +  + L+
Sbjct  6    KKKSIGGEKRG-RESFGGHAVDGCCEFI----AAGEEGTLEAVICAAWRIQNSRIQILIP  60

Query  78   LSVFCVVIDKGTSFPSRLRNPDKGFPNVALLRL  110
             S FCVV+   +     +  P   F  +A++RL
Sbjct  61    SCFCVVVLHESDSLMLVMEPTSEF-FLAMMRL  92

>gi|452504|gb|AAA62129.1| rhamnosyl transferase [Pseudomonas aeruginosa]
 gi|218321098|emb|CAV17614.1| rhamnosyl transferase 2 [Pseudomonas aeruginosa]
 gi|310696647|gb|ADP06388.1| RhlB [Pseudomonas aeruginosa]
Length=426

 Score = 36.6 bits (83), Expect = 1.4, Method: Compositional matrix adjust.
 Identities = 23/91 (26%), Positives = 42/91 (47%), Gaps = 17/91 (18%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            + ++PL D  T+R   G PR W P+ S    W   + + G+ E + E+++ +R      +
Sbjct  48   IAFVPLSDELTYRRTMGDPRLWDPKTSFGVLW---QTIAGMIEPVYEYVSAQRHDDIVVV  104

Query  61   GC--------------VPWLSSEAVAETLLA  77
            G               +P+LS++    TLL+
Sbjct  105  GSLWALGARIAHEKYGIPYLSAQVSPSTLLS  135

>gi|281331816|emb|CBI71031.1| rhamnosyltrasferase-1 [Pseudomonas aeruginosa]
 gi|281331818|emb|CBI71032.1| rhamnosyltransferase-1 [Pseudomonas aeruginosa]
Length=426

 Score = 36.2 bits (82), Expect = 1.5, Method: Compositional matrix adjust.
 Identities = 23/91 (26%), Positives = 42/91 (47%), Gaps = 17/91 (18%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            + ++PL D  T+R   G PR W P+ S    W   + + G+ E + E+++ +R      +
Sbjct  48   IAFVPLSDELTYRRTMGDPRLWDPKTSFGVLW---QAIAGMIEPVYEYVSAQRHDDIVVV  104

Query  61   GC--------------VPWLSSEAVAETLLA  77
            G               +P+LS++    TLL+
Sbjct  105  GSLWALGARIAHEKYGIPYLSTQVSPSTLLS  135

>gi|152987698|ref|YP_001347032.1| rhamnosyltransferase chain B [Pseudomonas aeruginosa PA7]
 gi|150962856|gb|ABR84881.1| rhamnosyltransferase chain B [Pseudomonas aeruginosa PA7]
Length=426

 Score = 36.2 bits (82), Expect = 1.6, Method: Compositional matrix adjust.
 Identities = 26/115 (23%), Positives = 48/115 (42%), Gaps = 6/115 (5%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            ++++PL D  T+R   G PR W P+ S    W   + + G+ E + E++  +R      +
Sbjct  48   IEFVPLSDELTYRRTMGDPRLWDPKTSFGVLW---QAIAGMIEPVYEYVCAQRHDDIVVV  104

Query  61   GCVPWLSSEAVAETLLALSVFCVVIDKGTSFPSRL--RNPDKGFPNVALLRLRDM  113
            G + W     +A     +    V +   T   + L   +P    P    L +R +
Sbjct  105  GSL-WALGARIAHEKYGIPYLSVQVSPSTLLSAHLPPVHPRFNVPEQVPLAMRKL  158

>gi|281331810|emb|CBI71028.1| rhamnosyltransferase-1 [Pseudomonas aeruginosa]
 gi|281331814|emb|CBI71030.1| rhamnosyltransferse-1 [Pseudomonas aeruginosa]
Length=426

 Score = 36.2 bits (82), Expect = 1.8, Method: Compositional matrix adjust.
 Identities = 23/91 (26%), Positives = 42/91 (47%), Gaps = 17/91 (18%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            + ++PL D  T+R   G PR W P+ S    W   + + G+ E + E+++ +R      +
Sbjct  48   IAFVPLSDELTYRRTMGDPRLWDPKTSFGVLW---QAIAGMIEPVYEYVSAQRHDDIVVV  104

Query  61   GC--------------VPWLSSEAVAETLLA  77
            G               +P+LS++    TLL+
Sbjct  105  GSLWALGARIAHEKYGIPYLSAQVSPSTLLS  135

>gi|15598674|ref|NP_252168.1| rhamnosyltransferase chain B [Pseudomonas aeruginosa PAO1]
 gi|107103011|ref|ZP_01366929.1| hypothetical protein PaerPA_01004080 [Pseudomonas aeruginosa PACS2]
 gi|116051497|ref|YP_789669.1| rhamnosyltransferase chain B [Pseudomonas aeruginosa UCBPP-PA14]
 12 more sequence titles
Length=426

 Score = 36.2 bits (82), Expect = 1.8, Method: Compositional matrix adjust.
 Identities = 23/91 (26%), Positives = 42/91 (47%), Gaps = 17/91 (18%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            + ++PL D  T+R   G PR W P+ S    W   + + G+ E + E+++ +R      +
Sbjct  48   IAFVPLSDELTYRRTMGDPRLWDPKTSFGVLW---QAIAGMIEPVYEYVSAQRHDDIVVV  104

Query  61   GC--------------VPWLSSEAVAETLLA  77
            G               +P+LS++    TLL+
Sbjct  105  GSLWALGARIAHEKYGIPYLSAQVSPSTLLS  135

>gi|281331812|emb|CBI71029.1| rhamnosyltransferase-1 [Pseudomonas aeruginosa]
Length=426

 Score = 36.2 bits (82), Expect = 1.9, Method: Compositional matrix adjust.
 Identities = 23/91 (26%), Positives = 42/91 (47%), Gaps = 17/91 (18%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            + ++PL D  T+R   G PR W P+ S    W   + + G+ E + E+++ +R      +
Sbjct  48   IAFVPLSDELTYRRTMGDPRLWDPKTSFGVLW---QAIAGMIEPVYEYVSAQRHDDIVVV  104

Query  61   GC--------------VPWLSSEAVAETLLA  77
            G               +P+LS++    TLL+
Sbjct  105  GSLWALGARIAHEKYGIPYLSAQVSPSTLLS  135

>gi|254236427|ref|ZP_04929750.1| rhamnosyltransferase chain B [Pseudomonas aeruginosa C3719]
 gi|126168358|gb|EAZ53869.1| rhamnosyltransferase chain B [Pseudomonas aeruginosa C3719]
Length=427

 Score = 36.2 bits (82), Expect = 1.9, Method: Compositional matrix adjust.
 Identities = 23/91 (26%), Positives = 42/91 (47%), Gaps = 17/91 (18%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            + ++PL D  T+R   G PR W P+ S    W   + + G+ E + E+++ +R      +
Sbjct  48   IAFVPLSDELTYRRTMGDPRLWDPKTSFGVLW---QAIAGMIEPVYEYVSAQRHDDIVVV  104

Query  61   GC--------------VPWLSSEAVAETLLA  77
            G               +P+LS++    TLL+
Sbjct  105  GSLWALGARIAHEKYGIPYLSAQVSPSTLLS  135

>gi|281331822|emb|CBI71034.1| rhamnosyltransferase-1 [Pseudomonas aeruginosa]
Length=426

 Score = 36.2 bits (82), Expect = 1.9, Method: Compositional matrix adjust.
 Identities = 23/91 (26%), Positives = 42/91 (47%), Gaps = 17/91 (18%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            + ++PL D  T+R   G PR W P+ S    W   + + G+ E + E+++ +R      +
Sbjct  48   IAFVPLSDELTYRRTMGDPRLWDPKTSFGVLW---QAIAGMIEPVYEYVSAQRHDDIVVV  104

Query  61   GC--------------VPWLSSEAVAETLLA  77
            G               +P+LS++    TLL+
Sbjct  105  GSLWALGARIAHEKYGIPYLSAQVSPSTLLS  135

>gi|218890274|ref|YP_002439138.1| rhamnosyltransferase chain B [Pseudomonas aeruginosa LESB58]
 gi|218770497|emb|CAW26262.1| rhamnosyltransferase chain B [Pseudomonas aeruginosa LESB58]
Length=426

 Score = 35.8 bits (81), Expect = 2.1, Method: Compositional matrix adjust.
 Identities = 23/91 (26%), Positives = 42/91 (47%), Gaps = 17/91 (18%)

Query  1    MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLAVRRRGVPAAI  60
            + ++PL D  T+R   G PR W P+ S    W   + + G+ E + E+++ +R      +
Sbjct  48   IAFVPLSDELTYRRAMGDPRLWDPKTSFGVLW---QAIAGMIEPVYEYVSAQRHDDIVVV  104

Query  61   GC--------------VPWLSSEAVAETLLA  77
            G               +P+LS++    TLL+
Sbjct  105  GSLWALGARIAHEKYGIPYLSAQVSPSTLLS  135

Lambda      K        H
   0.323    0.138    0.468

Gapped
Lambda      K        H
   0.267    0.0410   0.140

Effective search space used: 129924441710

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
    Posted date: Sep 5, 2011 4:36 AM
  Number of letters in database: 5,219,829,388
  Number of sequences in database: 15,229,318

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
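
The footer values tie the raw scores, bit scores, and E-values in this report together through the standard Karlin-Altschul relations: S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). The sketch below is only an illustration of that arithmetic, not BLAST's own code. The function names are hypothetical, and the constants are the gapped Lambda, K, and effective search space copied from the footer. Because the footer prints Lambda and K to three significant figures and the search used compositional matrix adjustment, the results should be read as approximations of the reported numbers rather than exact reproductions.

```python
import math

# Gapped Karlin-Altschul parameters and search space, copied from the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 129924441710


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to a bit score S' = (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
                 lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Estimate the E-value as E = search_space * 2^(-S')."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    # Raw scores are the parenthesised values on the "Score =" lines,
    # e.g. 699 for the top hit and 81 for the last hit.
    for raw in (699, 471, 81):
        print(f"raw={raw:4d}  bits~{bit_score(raw):7.1f}  E~{expect_value(raw):.1e}")
```

For the top hit (raw score 699) this lands within about one bit of the reported 273 bits and within the same order of magnitude as the reported 5e-72. Hits with Expect values near or above 1, such as the rhamnosyltransferase matches, are the ones this formula marks as plausible chance alignments.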