BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3747
Length=127
                                                                     Score   E
Sequences producing significant alignments:                          (Bits)  Value
gi|15610883|ref|NP_218264.1| hypothetical protein Rv3747 [Mycoba...  247     4e-64
gi|240168504|ref|ZP_04747163.1| hypothetical protein MkanA1_0427...  204     3e-51
gi|183985255|ref|YP_001853546.1| hypothetical protein MMAR_5287 ...  179     8e-44
gi|118619501|ref|YP_907833.1| hypothetical protein MUL_4360 [Myc...  173     6e-42
gi|240168510|ref|ZP_04747169.1| hypothetical protein MkanA1_0430...  149     1e-34
gi|15610884|ref|NP_218265.1| hypothetical protein Rv3748 [Mycoba...  149     2e-34
gi|31794918|ref|NP_857411.1| hypothetical protein Mb3774 [Mycoba...  146     1e-33
gi|183985261|ref|YP_001853552.1| hypothetical protein MMAR_5293 ...  135     3e-30
gi|118619507|ref|YP_907839.1| hypothetical protein MUL_4366 [Myc...  135     3e-30
gi|306801387|ref|ZP_07438055.1| hypothetical protein TMHG_02814 ...  93.6    9e-18
gi|343924097|ref|ZP_08763660.1| hypothetical protein GOALK_002_0...  52.4    2e-05
gi|297841379|ref|XP_002888571.1| hypothetical protein ARALYDRAFT...  36.2    1.5
gi|239830961|ref|ZP_04679290.1| glycosyltransferase 36 [Ochrobac...  35.4    2.7
gi|206590033|emb|CAQ36994.1| lipase protein [Ralstonia solanacea...  35.4    2.9
gi|83745681|ref|ZP_00942739.1| Hypothetical Protein RRSL_04747 [...  34.7    5.2
>gi|15610883|ref|NP_218264.1| hypothetical protein Rv3747 [Mycobacterium tuberculosis H37Rv]
gi|31794917|ref|NP_857410.1| hypothetical protein Mb3773 [Mycobacterium bovis AF2122/97]
gi|121639661|ref|YP_979885.1| hypothetical protein BCG_3806 [Mycobacterium bovis BCG str. Pasteur
1173P2]
74 more sequence titles
Length=127
Score = 247 bits (631), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 126/127 (99%), Positives = 127/127 (100%), Gaps = 0/127 (0%)
Query 1 VILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEM 60
+ILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEM
Sbjct 1 MILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEM 60
Query 61 RPPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVSD 120
RPPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVSD
Sbjct 61 RPPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVSD 120
Query 121 MPATIGF 127
MPATIGF
Sbjct 121 MPATIGF 127
>gi|240168504|ref|ZP_04747163.1| hypothetical protein MkanA1_04277 [Mycobacterium kansasii ATCC
12478]
Length=128
Score = 204 bits (520), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 106/128 (83%), Positives = 113/128 (89%), Gaps = 1/128 (0%)
Query 1 VILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSD-RDITVE 59
+ILTGAFLADAAA VDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQ+EP SSD R + +E
Sbjct 1 MILTGAFLADAAAVVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQSEPGSSDDRQLNIE 60
Query 60 MRPPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVS 119
RPP D E IRL FE PEAAVAEFPGFAFFEIQLRLPV+GRWVLVVT TGAISLPVLVS
Sbjct 61 ARPPADAEAIRLQFEVPEAAVAEFPGFAFFEIQLRLPVDGRWVLVVTADTGAISLPVLVS 120
Query 120 DMPATIGF 127
+MP + GF
Sbjct 121 EMPPSFGF 128
>gi|183985255|ref|YP_001853546.1| hypothetical protein MMAR_5287 [Mycobacterium marinum M]
gi|183178581|gb|ACC43691.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=127
Score = 179 bits (455), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 97/127 (77%), Positives = 110/127 (87%), Gaps = 0/127 (0%)
Query 1 VILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEM 60
+ILTGAFLADAAAAVDNKLNV GGVLSRFAVGPDRLARFVLVVLTQ+ P S+DR +TVE
Sbjct 1 MILTGAFLADAAAAVDNKLNVSGGVLSRFAVGPDRLARFVLVVLTQSNPGSTDRQLTVEA 60
Query 61 RPPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVSD 120
RPP D E RL FE PEAAVAEFPGFAFFEIQLRLP++GRW LV + TG+++LPVLV++
Sbjct 61 RPPADAEATRLQFEVPEAAVAEFPGFAFFEIQLRLPIDGRWELVASSDTGSVTLPVLVTE 120
Query 121 MPATIGF 127
MP +GF
Sbjct 121 MPQQLGF 127
>gi|118619501|ref|YP_907833.1| hypothetical protein MUL_4360 [Mycobacterium ulcerans Agy99]
gi|118571611|gb|ABL06362.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=127
Score = 173 bits (439), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 94/127 (75%), Positives = 107/127 (85%), Gaps = 0/127 (0%)
Query 1 VILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEM 60
+ILTGAFLADAAAAVDNKLNV GGVLSRFAVGPDRL RFVLVVLTQ+ P + R +TVE
Sbjct 1 MILTGAFLADAAAAVDNKLNVSGGVLSRFAVGPDRLTRFVLVVLTQSNPGRTGRQLTVEA 60
Query 61 RPPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVSD 120
RPP D E RL FE PEAAVAEFPGFAFFEIQLRLP++GRW LV + TG+++LPVLV++
Sbjct 61 RPPADAEATRLQFEVPEAAVAEFPGFAFFEIQLRLPIDGRWELVASSDTGSVTLPVLVTE 120
Query 121 MPATIGF 127
MP +GF
Sbjct 121 MPQQLGF 127
>gi|240168510|ref|ZP_04747169.1| hypothetical protein MkanA1_04307 [Mycobacterium kansasii ATCC
12478]
Length=120
Score = 149 bits (377), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 77/118 (66%), Positives = 94/118 (80%), Gaps = 0/118 (0%)
Query 2 ILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEMR 61
++ GAFLA+AA+AVDNKLNV GGVL R+A+ DRLA+F+LVVLTQ E + DR + VE+
Sbjct 1 MIVGAFLAEAASAVDNKLNVSGGVLFRYALDADRLAQFLLVVLTQTETGNPDRRVDVEIW 60
Query 62 PPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVS 119
PPTD EP+ L FE PEAA A GFA F I++ LPV+GRWV+VVTGG GAISLP+LVS
Sbjct 61 PPTDGEPLHLPFELPEAATAAEVGFAIFGIEVTLPVDGRWVIVVTGGAGAISLPLLVS 118
>gi|15610884|ref|NP_218265.1| hypothetical protein Rv3748 [Mycobacterium tuberculosis H37Rv]
gi|148663614|ref|YP_001285137.1| hypothetical protein MRA_3786 [Mycobacterium tuberculosis H37Ra]
gi|148824953|ref|YP_001289707.1| hypothetical protein TBFG_13780 [Mycobacterium tuberculosis F11]
62 more sequence titles
Length=119
Score = 149 bits (375), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/118 (65%), Positives = 92/118 (78%), Gaps = 0/118 (0%)
Query 2 ILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEMR 61
++ GAFLA+AA+ VDNKLNV GGVL RFAV PDR A+F+LVVLTQAE D DR + VE+
Sbjct 1 MIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
Query 62 PPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVS 119
PPT D+ + FE PEAAVA GFA F I++ LPV+GRWVLVVTGG G ISLP++V+
Sbjct 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVT 118
>gi|31794918|ref|NP_857411.1| hypothetical protein Mb3774 [Mycobacterium bovis AF2122/97]
gi|121639662|ref|YP_979886.1| hypothetical protein BCG_3807 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224992158|ref|YP_002646847.1| hypothetical protein JTY_3809 [Mycobacterium bovis BCG str. Tokyo
172]
8 more sequence titles
Length=119
Score = 146 bits (368), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 75/118 (64%), Positives = 91/118 (78%), Gaps = 0/118 (0%)
Query 2 ILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEMR 61
++ GAFLA+AA+ VDNKLNV GGVL RFAV PDR A+F+LVVLTQAE D DR + VE+
Sbjct 1 MIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
Query 62 PPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVS 119
PPT D+ + FE PEAAVA GFA F I++ LPV+GRWVLVVTG G ISLP++V+
Sbjct 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGDAGTISLPLIVT 118
>gi|183985261|ref|YP_001853552.1| hypothetical protein MMAR_5293 [Mycobacterium marinum M]
gi|183178587|gb|ACC43697.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=119
Score = 135 bits (339), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/118 (63%), Positives = 92/118 (78%), Gaps = 0/118 (0%)
Query 2 ILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEMR 61
++ GAF+A+AAAAVDNKLNV GGVL R+ V DR ARF+LVVLTQ E D + I VE+R
Sbjct 1 MIVGAFIAEAAAAVDNKLNVSGGVLYRYWVDTDRAARFLLVVLTQTETDDPHQRIEVEIR 60
Query 62 PPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVS 119
PPTDDEP+ + FE P+AA GFA F +++ LPV+GRWV+VVTGG GAISLP+L+S
Sbjct 61 PPTDDEPLLMGFELPDAATTAEVGFAIFNVEVSLPVDGRWVIVVTGGAGAISLPLLIS 118
>gi|118619507|ref|YP_907839.1| hypothetical protein MUL_4366 [Mycobacterium ulcerans Agy99]
gi|118571617|gb|ABL06368.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=119
Score = 135 bits (339), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/118 (64%), Positives = 92/118 (78%), Gaps = 0/118 (0%)
Query 2 ILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEMR 61
++ GAFLA+AAAAVDNKLNV GGVL R+ V DR ARF+LVVLTQ E D + I VE+R
Sbjct 1 MIVGAFLAEAAAAVDNKLNVSGGVLYRYWVDTDRAARFLLVVLTQTETDDPHQRIEVEIR 60
Query 62 PPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVS 119
PPTDDEP+ + FE P+AA GFA F +++ LPV+GRWV+VVTGG GAISLP+L+S
Sbjct 61 PPTDDEPLLMGFELPDAATTAEVGFAIFNVEVSLPVDGRWVIVVTGGAGAISLPLLIS 118
>gi|306801387|ref|ZP_07438055.1| hypothetical protein TMHG_02814 [Mycobacterium tuberculosis SUMu008]
gi|308351811|gb|EFP40662.1| hypothetical protein TMHG_02814 [Mycobacterium tuberculosis SUMu008]
Length=79
Score = 93.6 bits (231), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 47/78 (61%), Positives = 58/78 (75%), Gaps = 0/78 (0%)
Query 42 VVLTQAEPDSSDRDITVEMRPPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRW 101
+VLTQAE D DR + VE+ PPT D+ + FE PEAAVA GFA F I++ LPV+GRW
Sbjct 1 MVLTQAETDDPDRRVDVEVWPPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRW 60
Query 102 VLVVTGGTGAISLPVLVS 119
VLVVTGG G ISLP++V+
Sbjct 61 VLVVTGGAGTISLPLIVT 78
>gi|343924097|ref|ZP_08763660.1| hypothetical protein GOALK_002_00510 [Gordonia alkanivorans NBRC
16433]
gi|343765902|dbj|GAA10586.1| hypothetical protein GOALK_002_00510 [Gordonia alkanivorans NBRC
16433]
Length=123
Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/122 (38%), Positives = 61/122 (50%), Gaps = 7/122 (5%)
Query 1 VILTGAFLADAAAAVDNKLNVQGGVLSRFAVGP-DRLARFVLVVLTQAEPDSSDRDITVE 59
+I+TGA LA+AA D KL V GGVL+ F P L +L+VL QAE VE
Sbjct 1 MIVTGAMLAEAATVADGKLYVLGGVLTDFWQPPGGYLIETLLIVLIQAEEGDLHNPQFVE 60
Query 60 MRPPT-DDEPIRLNFEAPEAAVA-EFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVL 117
+ T D + PE A A GF F +I V G++V+VV ++SLP+
Sbjct 61 VSITTPDGKSGSSRLPVPEVATAGTRAGFFFHKIGFEAKVPGQYVIVVE----SVSLPIE 116
Query 118 VS 119
V
Sbjct 117 VH 118
>gi|297841379|ref|XP_002888571.1| hypothetical protein ARALYDRAFT_475799 [Arabidopsis lyrata subsp.
lyrata]
gi|297334412|gb|EFH64830.1| hypothetical protein ARALYDRAFT_475799 [Arabidopsis lyrata subsp.
lyrata]
Length=781
Score = 36.2 bits (82), Expect = 1.5, Method: Composition-based stats.
Identities = 23/65 (36%), Positives = 31/65 (48%), Gaps = 0/65 (0%)
Query 3 LTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEMRP 62
LTG FL DA + N+ G+L F+ PD L R VL + + + D I +EM
Sbjct 680 LTGGFLVDAMIEHLEERNISCGLLKSFSAKPDELIRGVLEEVPKWTRINMDEVIGIEMEK 739
Query 63 PTDDE 67
D E
Sbjct 740 WLDPE 744
>gi|239830961|ref|ZP_04679290.1| glycosyltransferase 36 [Ochrobactrum intermedium LMG 3301]
gi|239823228|gb|EEQ94796.1| glycosyltransferase 36 [Ochrobactrum intermedium LMG 3301]
Length=2884
Score = 35.4 bits (80), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 27/77 (36%), Positives = 36/77 (47%), Gaps = 12/77 (15%)
Query 48 EPDSSDRDITVEMRPPTDDEPIRLNFEAPE------AAVAE-----FPGFAFFEIQLRLP 96
EP SSDRD + ++ PP PIR ++ E AAVA PG+ F RL
Sbjct 26 EPKSSDRDASGQI-PPEQPSPIRALYKTEEELGALAAAVARGEDIPLPGYIAFPFDQRLS 84
Query 97 VNGRWVLVVTGGTGAIS 113
NG+ +L + A S
Sbjct 85 ENGKLILHAYRASDAAS 101
>gi|206590033|emb|CAQ36994.1| lipase protein [Ralstonia solanacearum MolK2]
Length=340
Score = 35.4 bits (80), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/77 (33%), Positives = 36/77 (47%), Gaps = 10/77 (12%)
Query 50 DSSDRDITVEMRPPTDDEPIRLNFEAPEAAVAEFPGFAFFE----IQLRLPVNGRWVLVV 105
D S++ I V D +PIRL P A E P F FF I P + R+V +
Sbjct 75 DVSEKTIHV------DGKPIRLTIVRPAGAKGELPAFMFFHGGGWILGDFPTHERFVRDL 128
Query 106 TGGTGAISLPVLVSDMP 122
G+GA+++ V + P
Sbjct 129 VAGSGAVAVFVNYTSSP 145
>gi|83745681|ref|ZP_00942739.1| Hypothetical Protein RRSL_04747 [Ralstonia solanacearum UW551]
gi|207739478|ref|YP_002257871.1| lipase protein [Ralstonia solanacearum IPO1609]
gi|83727758|gb|EAP74878.1| Hypothetical Protein RRSL_04747 [Ralstonia solanacearum UW551]
gi|206592855|emb|CAQ59761.1| lipase protein [Ralstonia solanacearum IPO1609]
Length=339
Score = 34.7 bits (78), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 25/77 (33%), Positives = 36/77 (47%), Gaps = 10/77 (12%)
Query 50 DSSDRDITVEMRPPTDDEPIRLNFEAPEAAVAEFPGFAFFE----IQLRLPVNGRWVLVV 105
D S++ I V D +PIRL P A E P F FF I P + R+V +
Sbjct 74 DVSEKTIHV------DGKPIRLTIVRPAGAKGELPAFMFFHGGGWILGDFPTHERFVRDL 127
Query 106 TGGTGAISLPVLVSDMP 122
G+GA+++ V + P
Sbjct 128 VAGSGAVAVFVNYTPSP 144
Lambda      K        H
   0.322    0.140    0.408
Gapped
Lambda      K        H
   0.267   0.0410    0.140
Effective search space used: 129319095676
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40