BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3748
Length=119
Score E
Sequences producing significant alignments: (Bits) Value
gi|15610884|ref|NP_218265.1| hypothetical protein Rv3748 [Mycoba... 232 1e-59
gi|31794918|ref|NP_857411.1| hypothetical protein Mb3774 [Mycoba... 230 6e-59
gi|240168510|ref|ZP_04747169.1| hypothetical protein MkanA1_0430... 187 3e-46
gi|183985261|ref|YP_001853552.1| hypothetical protein MMAR_5293 ... 159 1e-37
gi|118619507|ref|YP_907839.1| hypothetical protein MUL_4366 [Myc... 159 1e-37
gi|306801387|ref|ZP_07438055.1| hypothetical protein TMHG_02814 ... 156 9e-37
gi|240168504|ref|ZP_04747163.1| hypothetical protein MkanA1_0427... 139 1e-31
gi|15610883|ref|NP_218264.1| hypothetical protein Rv3747 [Mycoba... 136 1e-30
gi|183985255|ref|YP_001853546.1| hypothetical protein MMAR_5287 ... 123 8e-27
gi|118619501|ref|YP_907833.1| hypothetical protein MUL_4360 [Myc... 119 2e-25
gi|343924097|ref|ZP_08763660.1| hypothetical protein GOALK_002_0... 54.7 4e-06
gi|297566166|ref|YP_003685138.1| hypothetical protein Mesil_1752... 38.1 0.49
gi|115360487|ref|YP_777624.1| ATPase domain-containing protein [... 34.3 7.0
>gi|15610884|ref|NP_218265.1| hypothetical protein Rv3748 [Mycobacterium tuberculosis H37Rv]
gi|148663614|ref|YP_001285137.1| hypothetical protein MRA_3786 [Mycobacterium tuberculosis H37Ra]
gi|148824953|ref|YP_001289707.1| hypothetical protein TBFG_13780 [Mycobacterium tuberculosis F11]
62 more sequence titles
Length=119
Score = 232 bits (592), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 118/119 (99%), Positives = 119/119 (100%), Gaps = 0/119 (0%)
Query 1 VIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
+IVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW
Sbjct 1 MIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
Query 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVTG 119
PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVTG
Sbjct 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVTG 119
>gi|31794918|ref|NP_857411.1| hypothetical protein Mb3774 [Mycobacterium bovis AF2122/97]
gi|121639662|ref|YP_979886.1| hypothetical protein BCG_3807 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|224992158|ref|YP_002646847.1| hypothetical protein JTY_3809 [Mycobacterium bovis BCG str. Tokyo
172]
8 more sequence titles
Length=119
Score = 230 bits (586), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 117/119 (99%), Positives = 118/119 (99%), Gaps = 0/119 (0%)
Query 1 VIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
+IVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW
Sbjct 1 MIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
Query 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVTG 119
PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTG AGTISLPLIVTG
Sbjct 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGDAGTISLPLIVTG 119
>gi|240168510|ref|ZP_04747169.1| hypothetical protein MkanA1_04307 [Mycobacterium kansasii ATCC
12478]
Length=120
Score = 187 bits (476), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 94/119 (79%), Positives = 105/119 (89%), Gaps = 0/119 (0%)
Query 1 VIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
+IVGAFLAEAAS VDNKLNVSGGVL+R+A+D DR AQFLLVVLTQ ET +PDRRVDVE+W
Sbjct 1 MIVGAFLAEAASAVDNKLNVSGGVLFRYALDADRLAQFLLVVLTQTETGNPDRRVDVEIW 60
Query 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVTG 119
PPT + H+ FELPEAA AAEVGFAIF IEV LPVDGRWV+VVTGGAG ISLPL+V+G
Sbjct 61 PPTDGEPLHLPFELPEAATAAEVGFAIFGIEVTLPVDGRWVIVVTGGAGAISLPLLVSG 119
>gi|183985261|ref|YP_001853552.1| hypothetical protein MMAR_5293 [Mycobacterium marinum M]
gi|183178587|gb|ACC43697.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=119
Score = 159 bits (402), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 85/119 (72%), Positives = 104/119 (88%), Gaps = 0/119 (0%)
Query 1 VIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
+IVGAF+AEAA+ VDNKLNVSGGVLYR+ VD DR+A+FLLVVLTQ ETDDP +R++VE+
Sbjct 1 MIVGAFIAEAAAAVDNKLNVSGGVLYRYWVDTDRAARFLLVVLTQTETDDPHQRIEVEIR 60
Query 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVTG 119
PPT D+ + FELP+AA AEVGFAIF +EV+LPVDGRWV+VVTGGAG ISLPL+++G
Sbjct 61 PPTDDEPLLMGFELPDAATTAEVGFAIFNVEVSLPVDGRWVIVVTGGAGAISLPLLISG 119
>gi|118619507|ref|YP_907839.1| hypothetical protein MUL_4366 [Mycobacterium ulcerans Agy99]
gi|118571617|gb|ABL06368.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=119
Score = 159 bits (402), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 86/119 (73%), Positives = 104/119 (88%), Gaps = 0/119 (0%)
Query 1 VIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
+IVGAFLAEAA+ VDNKLNVSGGVLYR+ VD DR+A+FLLVVLTQ ETDDP +R++VE+
Sbjct 1 MIVGAFLAEAAAAVDNKLNVSGGVLYRYWVDTDRAARFLLVVLTQTETDDPHQRIEVEIR 60
Query 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVTG 119
PPT D+ + FELP+AA AEVGFAIF +EV+LPVDGRWV+VVTGGAG ISLPL+++G
Sbjct 61 PPTDDEPLLMGFELPDAATTAEVGFAIFNVEVSLPVDGRWVIVVTGGAGAISLPLLISG 119
>gi|306801387|ref|ZP_07438055.1| hypothetical protein TMHG_02814 [Mycobacterium tuberculosis SUMu008]
gi|308351811|gb|EFP40662.1| hypothetical protein TMHG_02814 [Mycobacterium tuberculosis SUMu008]
Length=79
Score = 156 bits (395), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 78/79 (99%), Positives = 79/79 (100%), Gaps = 0/79 (0%)
Query 41 VVLTQAETDDPDRRVDVEVWPPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRW 100
+VLTQAETDDPDRRVDVEVWPPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRW
Sbjct 1 MVLTQAETDDPDRRVDVEVWPPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRW 60
Query 101 VLVVTGGAGTISLPLIVTG 119
VLVVTGGAGTISLPLIVTG
Sbjct 61 VLVVTGGAGTISLPLIVTG 79
>gi|240168504|ref|ZP_04747163.1| hypothetical protein MkanA1_04277 [Mycobacterium kansasii ATCC
12478]
Length=128
Score = 139 bits (350), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/119 (61%), Positives = 92/119 (78%), Gaps = 1/119 (0%)
Query 1 VIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAET-DDPDRRVDVEV 59
++ GAFLA+AA+VVDNKLNV GGVL RFAV PDR A+F+LVVLTQ+E DR++++E
Sbjct 2 ILTGAFLADAAAVVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQSEPGSSDDRQLNIEA 61
Query 60 WPPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVT 118
PP +A ++FE+PEAAVA GFA F I++ LPVDGRWVLVVT G ISLP++V+
Sbjct 62 RPPADAEAIRLQFEVPEAAVAEFPGFAFFEIQLRLPVDGRWVLVVTADTGAISLPVLVS 120
>gi|15610883|ref|NP_218264.1| hypothetical protein Rv3747 [Mycobacterium tuberculosis H37Rv]
gi|31794917|ref|NP_857410.1| hypothetical protein Mb3773 [Mycobacterium bovis AF2122/97]
gi|121639661|ref|YP_979885.1| hypothetical protein BCG_3806 [Mycobacterium bovis BCG str. Pasteur
1173P2]
74 more sequence titles
Length=127
Score = 136 bits (342), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/118 (65%), Positives = 92/118 (78%), Gaps = 0/118 (0%)
Query 1 VIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
++ GAFLA+AA+ VDNKLNV GGVL RFAV PDR A+F+LVVLTQAE D DR + VE+
Sbjct 2 ILTGAFLADAAAAVDNKLNVQGGVLSRFAVGPDRLARFVLVVLTQAEPDSSDRDITVEMR 61
Query 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVT 118
PPT D+ + FE PEAAVA GFA F I++ LPV+GRWVLVVTGG G ISLP++V+
Sbjct 62 PPTDDEPIRLNFEAPEAAVAEFPGFAFFEIQLRLPVNGRWVLVVTGGTGAISLPVLVS 119
>gi|183985255|ref|YP_001853546.1| hypothetical protein MMAR_5287 [Mycobacterium marinum M]
gi|183178581|gb|ACC43691.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=127
Score = 123 bits (309), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 67/118 (57%), Positives = 89/118 (76%), Gaps = 0/118 (0%)
Query 1 VIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
++ GAFLA+AA+ VDNKLNVSGGVL RFAV PDR A+F+LVVLTQ+ DR++ VE
Sbjct 2 ILTGAFLADAAAAVDNKLNVSGGVLSRFAVGPDRLARFVLVVLTQSNPGSTDRQLTVEAR 61
Query 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVT 118
PP +A ++FE+PEAAVA GFA F I++ LP+DGRW LV + G+++LP++VT
Sbjct 62 PPADAEATRLQFEVPEAAVAEFPGFAFFEIQLRLPIDGRWELVASSDTGSVTLPVLVT 119
>gi|118619501|ref|YP_907833.1| hypothetical protein MUL_4360 [Mycobacterium ulcerans Agy99]
gi|118571611|gb|ABL06362.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=127
Score = 119 bits (297), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/118 (56%), Positives = 87/118 (74%), Gaps = 0/118 (0%)
Query 1 VIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
++ GAFLA+AA+ VDNKLNVSGGVL RFAV PDR +F+LVVLTQ+ R++ VE
Sbjct 2 ILTGAFLADAAAAVDNKLNVSGGVLSRFAVGPDRLTRFVLVVLTQSNPGRTGRQLTVEAR 61
Query 61 PPTGDDAHHIEFELPEAAVAAEVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPLIVT 118
PP +A ++FE+PEAAVA GFA F I++ LP+DGRW LV + G+++LP++VT
Sbjct 62 PPADAEATRLQFEVPEAAVAEFPGFAFFEIQLRLPIDGRWELVASSDTGSVTLPVLVT 119
>gi|343924097|ref|ZP_08763660.1| hypothetical protein GOALK_002_00510 [Gordonia alkanivorans NBRC
16433]
gi|343765902|dbj|GAA10586.1| hypothetical protein GOALK_002_00510 [Gordonia alkanivorans NBRC
16433]
Length=123
Score = 54.7 bits (130), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 45/119 (38%), Positives = 61/119 (52%), Gaps = 9/119 (7%)
Query 1 VIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSA-QFLLVVLTQAETDD--PDRRVDV 57
++ GA LAEAA+V D KL V GGVL F P + LL+VL QAE D + V+V
Sbjct 2 IVTGAMLAEAATVADGKLYVLGGVLTDFWQPPGGYLIETLLIVLIQAEEGDLHNPQFVEV 61
Query 58 EVWPPTGDDAHHIEFELPEAAVAA-EVGFAIFRIEVNLPVDGRWVLVVTGGAGTISLPL 115
+ P G +PE A A GF +I V G++V+VV ++SLP+
Sbjct 62 SITTPDGKSGSS-RLPVPEVATAGTRAGFFFHKIGFEAKVPGQYVIVVE----SVSLPI 115
>gi|297566166|ref|YP_003685138.1| hypothetical protein Mesil_1752 [Meiothermus silvanus DSM 9946]
gi|296850615|gb|ADH63630.1| hypothetical protein Mesil_1752 [Meiothermus silvanus DSM 9946]
Length=110
Score = 38.1 bits (87), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 25/80 (32%), Positives = 39/80 (49%), Gaps = 7/80 (8%)
Query 29 AVDPDRSAQFLLVVLTQAETDDPDRRVDVEVWPPTGDDAHHIEFELPEAAVAAEVGFAIF 88
A +P R+A+F L A T P R + ++W P G A+ +E +A + +
Sbjct 11 ACNPGRTARFRRAGLLIALTWTPSREWEAKIWGPNGVTANQLE------GIAKALELDFY 64
Query 89 RIEVNLPVDGR-WVLVVTGG 107
IE L D + +V V+TGG
Sbjct 65 AIESYLSADFQSYVYVITGG 84
>gi|115360487|ref|YP_777624.1| ATPase domain-containing protein [Burkholderia ambifaria AMMD]
gi|115285815|gb|ABI91290.1| ATP-binding region, ATPase domain protein [Burkholderia ambifaria
AMMD]
Length=1685
Score = 34.3 bits (77), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 19/63 (31%), Positives = 30/63 (48%), Gaps = 0/63 (0%)
Query 1 VIVGAFLAEAASVVDNKLNVSGGVLYRFAVDPDRSAQFLLVVLTQAETDDPDRRVDVEVW 60
+ G FL + V + L ++ ++ +D + Q LL L + E+DD DR E
Sbjct 1010 LFAGEFLGDVRDTVTHGLAIARDANFQLVIDALLAQQMLLKQLQEGESDDRDRLAPPEGR 1069
Query 61 PPT 63
PPT
Sbjct 1070 PPT 1072
Lambda K H
0.321 0.140 0.414
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 129033565320
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40