BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3615c
Length=103
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value
gi|15610751|ref|NP_218132.1| hypothetical protein Rv3615c [Mycob... 210 4e-53
gi|289748128|ref|ZP_06507506.1| conserved hypothetical protein [... 197 3e-49
gi|183984138|ref|YP_001852429.1| hypothetical protein MMAR_4167 ... 146 1e-33
gi|240173315|ref|ZP_04751973.1| hypothetical protein MkanA1_2864... 120 6e-26
gi|15827125|ref|NP_301388.1| hypothetical protein ML0406 [Mycoba... 117 6e-25
gi|240169789|ref|ZP_04748448.1| hypothetical protein MkanA1_1077... 99.0 2e-19
gi|240171817|ref|ZP_04750476.1| hypothetical protein MkanA1_2105... 67.0 1e-09
gi|183985411|ref|YP_001853702.1| hypothetical protein MMAR_5440 ... 64.3 5e-09
gi|240168344|ref|ZP_04747003.1| hypothetical protein MkanA1_0346... 62.8 2e-08
gi|183985404|ref|YP_001853695.1| hypothetical protein MMAR_5433 ... 60.5 8e-08
gi|15611001|ref|NP_218382.1| hypothetical protein Rv3865 [Mycoba... 55.8 2e-06
gi|148825073|ref|YP_001289827.1| hypothetical protein TBFG_13900... 55.8 2e-06
gi|183984320|ref|YP_001852611.1| hypothetical protein MMAR_4349 ... 54.3 6e-06
gi|240170254|ref|ZP_04748913.1| hypothetical protein MkanA1_1314... 52.4 2e-05
gi|118619681|ref|YP_908013.1| hypothetical protein MUL_4591 [Myc... 52.0 3e-05
gi|118471720|ref|YP_885134.1| hypothetical protein MSMEG_0728 [M... 37.7 0.60
>gi|15610751|ref|NP_218132.1| hypothetical protein Rv3615c [Mycobacterium tuberculosis H37Rv]
gi|15843225|ref|NP_338262.1| hypothetical protein MT3717 [Mycobacterium tuberculosis CDC1551]
gi|31794791|ref|NP_857284.1| hypothetical protein Mb3645c [Mycobacterium bovis AF2122/97]
80 more sequence titles
Length=103
Score = 210 bits (535), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 103/103 (100%), Positives = 103/103 (100%), Gaps = 0/103 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT
Sbjct 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLFT 103
AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLFT
Sbjct 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLFT 103
>gi|289748128|ref|ZP_06507506.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289688715|gb|EFD56144.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=97
Score = 197 bits (502), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 97/97 (100%), Positives = 97/97 (100%), Gaps = 0/97 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT
Sbjct 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKA 97
AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKA
Sbjct 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKA 97
>gi|183984138|ref|YP_001852429.1| hypothetical protein MMAR_4167 [Mycobacterium marinum M]
gi|183177464|gb|ACC42574.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=103
Score = 146 bits (368), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 67/103 (66%), Positives = 82/103 (80%), Gaps = 0/103 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
MTENL VQPE LGVLASHHDNAA A+SG E +GL ESV +THG YCSQFN TL +Y +
Sbjct 1 MTENLKVQPELLGVLASHHDNAAASATSGTEVTSGLSESVTVTHGSYCSQFNTTLKMYES 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLFT 103
+ALGSSL+ AG+DLAK+LR AA++Y+EAD+ W KA+ LF+
Sbjct 61 TRSALGSSLNNAGIDLAKNLRTAARVYTEADDTWSKALGSLFS 103
>gi|240173315|ref|ZP_04751973.1| hypothetical protein MkanA1_28641 [Mycobacterium kansasii ATCC
12478]
Length=103
Score = 120 bits (301), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 68/101 (68%), Positives = 77/101 (77%), Gaps = 0/101 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
MT L VQPE L VLASH NAA ASSGV A AGL ESVAI+HG YC QFNDTL +Y +
Sbjct 1 MTNILKVQPELLDVLASHQHNAAASASSGVAATAGLAESVAISHGSYCKQFNDTLKMYES 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGL 101
AHNA GSSLH AG+ LAK+LR AA+ Y +ADE WR+AI+ L
Sbjct 61 AHNAFGSSLHAAGIALAKNLRTAARAYLDADETWRQAIESL 101
>gi|15827125|ref|NP_301388.1| hypothetical protein ML0406 [Mycobacterium leprae TN]
gi|221229603|ref|YP_002503019.1| hypothetical protein MLBr_00406 [Mycobacterium leprae Br4923]
gi|17433220|sp|Q49723.1|Y406_MYCLE RecName: Full=EspC protein homolog
gi|466936|gb|AAC43224.1| B1620_C2_214 [Mycobacterium leprae]
gi|2222690|emb|CAB09941.1| hypothetical protein MLCL383.02 [Mycobacterium leprae]
gi|13092673|emb|CAC29914.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219932710|emb|CAR70499.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=106
Score = 117 bits (293), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 58/102 (57%), Positives = 70/102 (69%), Gaps = 0/102 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
M +NLTVQ E L LAS H+N A ASSGV AAAGL +V+ +HG YC+QFNDTL +Y
Sbjct 4 MIDNLTVQSEHLNSLASQHENEAACASSGVSAAAGLANAVSTSHGSYCAQFNDTLKMYED 63
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLF 102
AH LG SLHT G+DLA+ LR+AA +Y +ADE I F
Sbjct 64 AHRTLGESLHTGGIDLARVLRVAAAMYCDADEICGSDIKSAF 105
>gi|240169789|ref|ZP_04748448.1| hypothetical protein MkanA1_10777 [Mycobacterium kansasii ATCC
12478]
Length=105
Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/101 (49%), Positives = 67/101 (67%), Gaps = 0/101 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
M+++L V PE LGVLA+ D AA S EA AG+GE V THG Y S+FN TL +T
Sbjct 1 MSDSLAVNPEFLGVLATAQDQAATYVQSATEAVAGIGEDVETTHGSYTSKFNTTLTALVT 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGL 101
N++G+SL+T ++A +LRIAAK YSEAD+ K ++ +
Sbjct 61 IRNSVGTSLYTLAGEVASNLRIAAKAYSEADDVLAKVVERI 101
>gi|240171817|ref|ZP_04750476.1| hypothetical protein MkanA1_21055 [Mycobacterium kansasii ATCC
12478]
Length=105
Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/99 (37%), Positives = 52/99 (53%), Gaps = 0/99 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
M +L V P+ L +LA DN A+D + ++ G+ E+V+ THG S FN TL +T
Sbjct 1 MLHSLGVSPDYLQLLAGSQDNLAIDIKAATQSVDGISEAVSTTHGSLTSTFNITLAKLVT 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAID 99
+ G L D+A +LRIAA Y + D W I+
Sbjct 61 IRSFTGMGLEKLTTDVATNLRIAAHAYRDTDSDWADLIE 99
>gi|183985411|ref|YP_001853702.1| hypothetical protein MMAR_5440 [Mycobacterium marinum M]
gi|183178737|gb|ACC43847.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=103
Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 38/102 (38%), Positives = 50/102 (50%), Gaps = 0/102 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
MT L V P L VLA H+ + S +G+G V +THG + S FNDTL + T
Sbjct 1 MTGLLNVVPSFLKVLAGMHNEIVGELKSATNVVSGIGSRVQLTHGSFTSNFNDTLVEFET 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLF 102
N+ G+ L LA +L AA Y +DE ID +F
Sbjct 61 TRNSAGTGLQGVTGKLANNLISAAGAYLNSDEGLAGIIDKIF 102
>gi|240168344|ref|ZP_04747003.1| hypothetical protein MkanA1_03467 [Mycobacterium kansasii ATCC
12478]
Length=103
Score = 62.8 bits (151), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/102 (37%), Positives = 49/102 (49%), Gaps = 0/102 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
MT L V P L VLA + A S G+ + V ITHG + S+FNDTL + T
Sbjct 1 MTGILGVVPSFLKVLAGMQNEIAGQLKSATSVVGGVSQRVQITHGSFTSKFNDTLQEFET 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLF 102
N+ G+ L LA +L AA Y +D+ ID +F
Sbjct 61 TRNSTGTGLQGVTSGLANNLISAAGAYLNSDQGLAGVIDKIF 102
>gi|183985404|ref|YP_001853695.1| hypothetical protein MMAR_5433 [Mycobacterium marinum M]
gi|183178730|gb|ACC43840.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=104
Score = 60.5 bits (145), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 34/99 (35%), Positives = 53/99 (54%), Gaps = 0/99 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
M + LT+QP+ + LAS HD A + A AG+G +VA THGP+ S FN+ L+ Y
Sbjct 1 MNDMLTLQPDVVNRLASGHDATATSLRAATAAPAGIGATVAETHGPFTSTFNNALSAYEA 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAID 99
+ G +L L+ +L A Y++ D+ + +D
Sbjct 61 VRASAGRALEGVADGLSTNLTRALAAYTDTDQRGAEILD 99
>gi|15611001|ref|NP_218382.1| hypothetical protein Rv3865 [Mycobacterium tuberculosis H37Rv]
gi|15843497|ref|NP_338534.1| hypothetical protein MT3979 [Mycobacterium tuberculosis CDC1551]
gi|31795039|ref|NP_857532.1| hypothetical protein Mb3895 [Mycobacterium bovis AF2122/97]
79 more sequence titles
Length=103
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/102 (37%), Positives = 51/102 (50%), Gaps = 0/102 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
MT L V P L VLA H+ D + AG+ V +THG + S+FNDTL + T
Sbjct 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLF 102
++ G+ L LA +L AA Y +AD+ ID +F
Sbjct 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIF 102
>gi|148825073|ref|YP_001289827.1| hypothetical protein TBFG_13900 [Mycobacterium tuberculosis F11]
gi|148723600|gb|ABR08225.1| conserved hypothetical protein [Mycobacterium tuberculosis F11]
Length=103
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/102 (37%), Positives = 51/102 (50%), Gaps = 0/102 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
MT L V P L VLA H+ D + AG+ V +THG + S+FNDTL + T
Sbjct 1 MTGFLGVVPSFLKVLAGMHNEIVDDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLF 102
++ G+ L LA +L AA Y +AD+ ID +F
Sbjct 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIF 102
>gi|183984320|ref|YP_001852611.1| hypothetical protein MMAR_4349 [Mycobacterium marinum M]
gi|183177646|gb|ACC42756.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=104
Score = 54.3 bits (129), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/99 (32%), Positives = 48/99 (49%), Gaps = 0/99 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
M E LT+QP+ + L+ HD + A AG+ +VA THG + S+FN+ L+ +
Sbjct 1 MNEILTLQPDVISRLSQGHDATVTGLQAATAAPAGISANVATTHGTFTSEFNNALSEFEA 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAID 99
G +L L+K+L A Y D A + +D
Sbjct 61 VRARAGQALQGVATGLSKNLNKALTAYVNTDRAGAEILD 99
>gi|240170254|ref|ZP_04748913.1| hypothetical protein MkanA1_13148 [Mycobacterium kansasii ATCC
12478]
Length=104
Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/104 (34%), Positives = 47/104 (46%), Gaps = 4/104 (3%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
M E LT++P+ + LA HD + A AG+ VA THG + S+FN+ L Y
Sbjct 1 MNEILTLRPDVINRLAKGHDATITGLHAATVAPAGISSDVATTHGTFTSEFNNALRAYEA 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEAD----EAWRKAIDG 100
G +L L+KSL A Y D E + IDG
Sbjct 61 VRAHAGQALQGVAAGLSKSLNKALTAYVNTDLNSAEILGEQIDG 104
>gi|118619681|ref|YP_908013.1| hypothetical protein MUL_4591 [Mycobacterium ulcerans Agy99]
gi|118571791|gb|ABL06542.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=104
Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/99 (31%), Positives = 47/99 (48%), Gaps = 0/99 (0%)
Query 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
M E LT+QP+ + L+ HD + A AG+ +VA THG + S++N+ L+ +
Sbjct 1 MNEILTLQPDVISRLSQGHDATVTGLQAATAAPAGISANVATTHGTFTSEYNNALSEFEA 60
Query 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAID 99
G +L L+K L A Y D A + +D
Sbjct 61 VRARAGQALQGVATGLSKDLNKALTAYVNTDRAGVEILD 99
>gi|118471720|ref|YP_885134.1| hypothetical protein MSMEG_0728 [Mycobacterium smegmatis str.
MC2 155]
gi|118173007|gb|ABK73903.1| hypothetical protein MSMEG_0728 [Mycobacterium smegmatis str.
MC2 155]
Length=102
Score = 37.7 bits (86), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 25/85 (30%), Positives = 39/85 (46%), Gaps = 0/85 (0%)
Query 15 LASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLTAHNALGSSLHTAGV 74
LA A S +EAA G+G S+ HG C+ N + TA A + ++
Sbjct 9 LAELQGRTAQQILSALEAAQGVGTSMWKNHGMVCAPSNSAVIAAETARRAACTRMNAKSE 68
Query 75 DLAKSLRIAAKIYSEADEAWRKAID 99
+LA+ L +A K+Y D + +D
Sbjct 69 ELAQKLGMARKLYDGTDMQEEQKLD 93
Lambda K H
0.313 0.127 0.361
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 127822873252
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40