BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv3865
Length=103
                                                                     Score   E
Sequences producing significant alignments:                          (Bits)  Value
gi|15611001|ref|NP_218382.1| hypothetical protein Rv3865 [Mycoba...  204     3e-51
gi|148825073|ref|YP_001289827.1| hypothetical protein TBFG_13900...  202     2e-50
gi|240168344|ref|ZP_04747003.1| hypothetical protein MkanA1_0346...  173     6e-42
gi|183985411|ref|YP_001853702.1| hypothetical protein MMAR_5440 ...  166     8e-40
gi|240169789|ref|ZP_04748448.1| hypothetical protein MkanA1_1077...  84.7    4e-15
gi|240171817|ref|ZP_04750476.1| hypothetical protein MkanA1_2105...  84.3    5e-15
gi|183984320|ref|YP_001852611.1| hypothetical protein MMAR_4349 ...  76.6    1e-12
gi|183984138|ref|YP_001852429.1| hypothetical protein MMAR_4167 ...  75.5    2e-12
gi|118619681|ref|YP_908013.1| hypothetical protein MUL_4591 [Myc...  72.0    3e-11
gi|183985404|ref|YP_001853695.1| hypothetical protein MMAR_5433 ...  69.3    2e-10
gi|240170254|ref|ZP_04748913.1| hypothetical protein MkanA1_1314...  66.6    1e-09
gi|240173315|ref|ZP_04751973.1| hypothetical protein MkanA1_2864...  65.9    2e-09
gi|15610751|ref|NP_218132.1| hypothetical protein Rv3615c [Mycob...  63.9    7e-09
gi|289748128|ref|ZP_06507506.1| conserved hypothetical protein [...  58.5    3e-07
gi|15827125|ref|NP_301388.1| hypothetical protein ML0406 [Mycoba...  49.3    2e-04
gi|154292356|ref|XP_001546753.1| hypothetical protein BC1G_14667...  37.0    0.97
gi|118470718|ref|YP_890660.1| hypothetical protein MSMEG_6447 [M...  36.6    1.2
gi|118469912|ref|YP_884474.1| hypothetical protein MSMEG_0056 [M...  34.3    5.5
>gi|15611001|ref|NP_218382.1| hypothetical protein Rv3865 [Mycobacterium tuberculosis H37Rv]
gi|15843497|ref|NP_338534.1| hypothetical protein MT3979 [Mycobacterium tuberculosis CDC1551]
gi|31795039|ref|NP_857532.1| hypothetical protein Mb3895 [Mycobacterium bovis AF2122/97]
79 more sequence titles
Length=103
Score = 204 bits (520), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 103/103 (100%), Positives = 103/103 (100%), Gaps = 0/103 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET
Sbjct 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIFG 103
TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIFG
Sbjct 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIFG 103
>gi|148825073|ref|YP_001289827.1| hypothetical protein TBFG_13900 [Mycobacterium tuberculosis F11]
gi|148723600|gb|ABR08225.1| conserved hypothetical protein [Mycobacterium tuberculosis F11]
Length=103
Score = 202 bits (513), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 102/103 (99%), Positives = 102/103 (99%), Gaps = 0/103 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
MTGFLGVVPSFLKVLAGMHNEIV DIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET
Sbjct 1 MTGFLGVVPSFLKVLAGMHNEIVDDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIFG 103
TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIFG
Sbjct 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIFG 103
>gi|240168344|ref|ZP_04747003.1| hypothetical protein MkanA1_03467 [Mycobacterium kansasii ATCC
12478]
Length=103
Score = 173 bits (439), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 85/103 (83%), Positives = 92/103 (90%), Gaps = 0/103 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
MTG LGVVPSFLKVLAGM NEI G +K AT V G+S RVQ+THGSFTSKFNDTLQEFET
Sbjct 1 MTGILGVVPSFLKVLAGMQNEIAGQLKSATSVVGGVSQRVQITHGSFTSKFNDTLQEFET 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIFG 103
TR+STGTGLQGVTSGLANNL++AAGAYL +D GLAGVIDKIFG
Sbjct 61 TRNSTGTGLQGVTSGLANNLISAAGAYLNSDQGLAGVIDKIFG 103
>gi|183985411|ref|YP_001853702.1| hypothetical protein MMAR_5440 [Mycobacterium marinum M]
gi|183178737|gb|ACC43847.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=103
Score = 166 bits (421), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 81/103 (79%), Positives = 91/103 (89%), Gaps = 0/103 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
MTG L VVPSFLKVLAGMHNEIVG++K AT+ V+GI RVQLTHGSFTS FNDTL EFET
Sbjct 1 MTGLLNVVPSFLKVLAGMHNEIVGELKSATNVVSGIGSRVQLTHGSFTSNFNDTLVEFET 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIFG 103
TR+S GTGLQGVT LANNL++AAGAYL +D+GLAG+IDKIFG
Sbjct 61 TRNSAGTGLQGVTGKLANNLISAAGAYLNSDEGLAGIIDKIFG 103
>gi|240169789|ref|ZP_04748448.1| hypothetical protein MkanA1_10777 [Mycobacterium kansasii ATCC
12478]
Length=105
Score = 84.7 bits (208), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/101 (47%), Positives = 63/101 (63%), Gaps = 0/101 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
M+ L V P FL VLA ++ ++ AT+ VAGI V+ THGS+TSKFN TL T
Sbjct 1 MSDSLAVNPEFLGVLATAQDQAATYVQSATEAVAGIGEDVETTHGSYTSKFNTTLTALVT 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKI 101
R+S GT L + +A+NL AA AY +ADD LA V+++I
Sbjct 61 IRNSVGTSLYTLAGEVASNLRIAAKAYSEADDVLAKVVERI 101
>gi|240171817|ref|ZP_04750476.1| hypothetical protein MkanA1_21055 [Mycobacterium kansasii ATCC
12478]
Length=105
Score = 84.3 bits (207), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 48/100 (48%), Positives = 61/100 (61%), Gaps = 0/100 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
M LGV P +L++LAG + + DIK AT +V GIS V THGS TS FN TL + T
Sbjct 1 MLHSLGVSPDYLQLLAGSQDNLAIDIKAATQSVDGISEAVSTTHGSLTSTFNITLAKLVT 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDK 100
RS TG GL+ +T+ +A NL AA AY D A +I+K
Sbjct 61 IRSFTGMGLEKLTTDVATNLRIAAHAYRDTDSDWADLIEK 100
>gi|183984320|ref|YP_001852611.1| hypothetical protein MMAR_4349 [Mycobacterium marinum M]
gi|183177646|gb|ACC42756.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=104
Score = 76.6 bits (187), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/100 (41%), Positives = 57/100 (57%), Gaps = 0/100 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
M L + P + L+ H+ V ++ AT AGIS V THG+FTS+FN+ L EFE
Sbjct 1 MNEILTLQPDVISRLSQGHDATVTGLQAATAAPAGISANVATTHGTFTSEFNNALSEFEA 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDK 100
R+ G LQGV +GL+ NL A AY+ D A ++D+
Sbjct 61 VRARAGQALQGVATGLSKNLNKALTAYVNTDRAGAEILDQ 100
>gi|183984138|ref|YP_001852429.1| hypothetical protein MMAR_4167 [Mycobacterium marinum M]
gi|183177464|gb|ACC42574.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=103
Score = 75.5 bits (184), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 40/102 (40%), Positives = 56/102 (55%), Gaps = 0/102 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
MT L V P L VLA H+ T+ +G+S V +THGS+ S+FN TL+ +E+
Sbjct 1 MTENLKVQPELLGVLASHHDNAAASATSGTEVTSGLSESVTVTHGSYCSQFNTTLKMYES 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIF 102
TRS+ G+ L LA NL AA Y +ADD + + +F
Sbjct 61 TRSALGSSLNNAGIDLAKNLRTAARVYTEADDTWSKALGSLF 102
>gi|118619681|ref|YP_908013.1| hypothetical protein MUL_4591 [Mycobacterium ulcerans Agy99]
gi|118571791|gb|ABL06542.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=104
Score = 72.0 bits (175), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 56/100 (56%), Gaps = 0/100 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
M L + P + L+ H+ V ++ AT AGIS V THG+FTS++N+ L EFE
Sbjct 1 MNEILTLQPDVISRLSQGHDATVTGLQAATAAPAGISANVATTHGTFTSEYNNALSEFEA 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDK 100
R+ G LQGV +GL+ +L A AY+ D ++D+
Sbjct 61 VRARAGQALQGVATGLSKDLNKALTAYVNTDRAGVEILDQ 100
>gi|183985404|ref|YP_001853695.1| hypothetical protein MMAR_5433 [Mycobacterium marinum M]
gi|183178730|gb|ACC43840.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=104
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 51/100 (51%), Gaps = 0/100 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
M L + P + LA H+ ++ AT AGI V THG FTS FN+ L +E
Sbjct 1 MNDMLTLQPDVVNRLASGHDATATSLRAATAAPAGIGATVAETHGPFTSTFNNALSAYEA 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDK 100
R+S G L+GV GL+ NL A AY D A ++D+
Sbjct 61 VRASAGRALEGVADGLSTNLTRALAAYTDTDQRGAEILDE 100
>gi|240170254|ref|ZP_04748913.1| hypothetical protein MkanA1_13148 [Mycobacterium kansasii ATCC
12478]
Length=104
Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/91 (40%), Positives = 51/91 (57%), Gaps = 0/91 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
M L + P + LA H+ + + AT AGIS V THG+FTS+FN+ L+ +E
Sbjct 1 MNEILTLRPDVINRLAKGHDATITGLHAATVAPAGISSDVATTHGTFTSEFNNALRAYEA 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKAD 91
R+ G LQGV +GL+ +L A AY+ D
Sbjct 61 VRAHAGQALQGVAAGLSKSLNKALTAYVNTD 91
>gi|240173315|ref|ZP_04751973.1| hypothetical protein MkanA1_28641 [Mycobacterium kansasii ATCC
12478]
Length=103
Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/101 (35%), Positives = 51/101 (51%), Gaps = 0/101 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
MT L V P L VLA + AG++ V ++HGS+ +FNDTL+ +E+
Sbjct 1 MTNILKVQPELLDVLASHQHNAAASASSGVAATAGLAESVAISHGSYCKQFNDTLKMYES 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKI 101
++ G+ L LA NL AA AYL AD+ I+ +
Sbjct 61 AHNAFGSSLHAAGIALAKNLRTAARAYLDADETWRQAIESL 101
>gi|15610751|ref|NP_218132.1| hypothetical protein Rv3615c [Mycobacterium tuberculosis H37Rv]
gi|15843225|ref|NP_338262.1| hypothetical protein MT3717 [Mycobacterium tuberculosis CDC1551]
gi|31794791|ref|NP_857284.1| hypothetical protein Mb3645c [Mycobacterium bovis AF2122/97]
80 more sequence titles
Length=103
Score = 63.9 bits (154), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 37/102 (37%), Positives = 51/102 (50%), Gaps = 0/102 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
MT L V P L VLA H+ D + AG+ V +THG + S+FNDTL + T
Sbjct 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIF 102
++ G+ L LA +L AA Y +AD+ ID +F
Sbjct 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDGLF 102
>gi|289748128|ref|ZP_06507506.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289688715|gb|EFD56144.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length=97
Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/94 (37%), Positives = 47/94 (50%), Gaps = 0/94 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
MT L V P L VLA H+ D + AG+ V +THG + S+FNDTL + T
Sbjct 1 MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLT 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGL 94
++ G+ L LA +L AA Y +AD+
Sbjct 61 AHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAW 94
>gi|15827125|ref|NP_301388.1| hypothetical protein ML0406 [Mycobacterium leprae TN]
gi|221229603|ref|YP_002503019.1| hypothetical protein MLBr_00406 [Mycobacterium leprae Br4923]
gi|17433220|sp|Q49723.1|Y406_MYCLE RecName: Full=EspC protein homolog
gi|466936|gb|AAC43224.1| B1620_C2_214 [Mycobacterium leprae]
gi|2222690|emb|CAB09941.1| hypothetical protein MLCL383.02 [Mycobacterium leprae]
gi|13092673|emb|CAC29914.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219932710|emb|CAR70499.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=106
Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/103 (32%), Positives = 42/103 (41%), Gaps = 0/103 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
M L V L LA H AG++ V +HGS+ ++FNDTL+ +E
Sbjct 4 MIDNLTVQSEHLNSLASQHENEAACASSGVSAAAGLANAVSTSHGSYCAQFNDTLKMYED 63
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIFG 103
+ G L LA L AA Y AD+ I FG
Sbjct 64 AHRTLGESLHTGGIDLARVLRVAAAMYCDADEICGSDIKSAFG 106
>gi|154292356|ref|XP_001546753.1| hypothetical protein BC1G_14667 [Botryotinia fuckeliana B05.10]
gi|150846146|gb|EDN21339.1| hypothetical protein BC1G_14667 [Botryotinia fuckeliana B05.10]
Length=3554
Score = 37.0 bits (84), Expect = 0.97, Method: Composition-based stats.
Identities = 25/75 (34%), Positives = 34/75 (46%), Gaps = 4/75 (5%)
Query 19 HNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFETTRSSTGTGLQGVTSGLAN 78
HNE+ DI + V S + H SKF D ++E + GL+G +GLA
Sbjct 1385 HNEVRSDIDGLRNLVDENSNK----HEESLSKFGDLIREHGDLVKDSHDGLKGTIAGLAL 1440
Query 79 NLLAAAGAYLKADDG 93
+A AG DDG
Sbjct 1441 GGIAGAGIMKAVDDG 1455
>gi|118470718|ref|YP_890660.1| hypothetical protein MSMEG_6447 [Mycobacterium smegmatis str.
MC2 155]
gi|118172005|gb|ABK72901.1| hypothetical protein MSMEG_6447 [Mycobacterium smegmatis str.
MC2 155]
Length=107
Score = 36.6 bits (83), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/95 (29%), Positives = 47/95 (50%), Gaps = 0/95 (0%)
Query 5 LGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFETTRSS 64
L V+ ++ L+ +H++ VG+I A ++ + THG ++ N + + R++
Sbjct 9 LTVLTDHIRKLSTVHDKAVGEIDGANRSMVENGTNMWETHGVISALTNWAVADAVEARTA 68
Query 65 TGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVID 99
G L+ V+ L+ L AAA Y D AG ID
Sbjct 69 AGGALRRVSVELSEKLRAAATNYDNTDSTEAGNID 103
>gi|118469912|ref|YP_884474.1| hypothetical protein MSMEG_0056 [Mycobacterium smegmatis str.
MC2 155]
gi|118171199|gb|ABK72095.1| hypothetical protein MSMEG_0056 [Mycobacterium smegmatis str.
MC2 155]
Length=105
Score = 34.3 bits (77), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 25/102 (25%), Positives = 45/102 (45%), Gaps = 0/102 (0%)
Query 1 MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFET 60
M+ L V + L+ L+ ++ AT V G+ ++ THG + ++ +
Sbjct 1 MSDDLRVTTAHLRELSAKQGRAAAELATATAVVDGVDTALRFTHGPISWGTAAAVEAVQH 60
Query 61 TRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDKIF 102
R + GTG+ V+ L L AAG Y + D + +D+
Sbjct 61 ARRAAGTGMVKVSQELETKLDTAAGRYHRTDSTMGDALDETI 102
Lambda K H
0.317 0.135 0.376
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 127822873252
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40