BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1693
Length=58
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608831|ref|NP_216209.1| hypothetical protein Rv1693 [Mycoba... 110 5e-23
gi|240170142|ref|ZP_04748801.1| hypothetical protein MkanA1_1256... 96.3 1e-18
gi|183982513|ref|YP_001850804.1| hypothetical protein MMAR_2498 ... 90.9 6e-17
gi|118617310|ref|YP_905642.1| hypothetical protein MUL_1682 [Myc... 86.7 1e-15
gi|41407498|ref|NP_960334.1| hypothetical protein MAP1400 [Mycob... 85.9 2e-15
gi|254821087|ref|ZP_05226088.1| hypothetical protein MintA_14217... 84.7 4e-15
gi|342860729|ref|ZP_08717379.1| hypothetical protein MCOL_17698 ... 82.4 2e-14
gi|296164593|ref|ZP_06847160.1| conserved hypothetical protein [... 82.4 2e-14
gi|15827704|ref|NP_301967.1| hypothetical protein ML1357 [Mycoba... 80.9 6e-14
gi|333990489|ref|YP_004523103.1| hypothetical protein JDM601_184... 50.1 1e-04
gi|169629444|ref|YP_001703093.1| hypothetical protein MAB_2358 [... 49.3 2e-04
gi|108799918|ref|YP_640115.1| hypothetical protein Mmcs_2952 [My... 47.8 5e-04
gi|145224094|ref|YP_001134772.1| hypothetical protein Mflv_3510 ... 46.6 0.001
gi|120404266|ref|YP_954095.1| hypothetical protein Mvan_3291 [My... 45.1 0.003
gi|118469330|ref|YP_888049.1| hypothetical protein MSMEG_3752 [M... 44.3 0.006
gi|54023963|ref|YP_118205.1| hypothetical protein nfa19950 [Noca... 42.4 0.022
gi|289750249|ref|ZP_06509627.1| LOW QUALITY PROTEIN: conserved h... 36.2 1.8
gi|261420610|ref|YP_003254292.1| nucleotide sugar dehydrogenase ... 35.4 2.8
gi|229494832|ref|ZP_04388586.1| conserved hypothetical protein [... 35.0 3.8
gi|226360084|ref|YP_002777862.1| hypothetical protein ROP_06700 ... 35.0 4.0
gi|224985441|gb|ACN74538.1| hypothetical UDP-N-acetyl-D-mannosam... 33.9 8.9
>gi|15608831|ref|NP_216209.1| hypothetical protein Rv1693 [Mycobacterium tuberculosis H37Rv]
gi|15841150|ref|NP_336187.1| hypothetical protein MT1732 [Mycobacterium tuberculosis CDC1551]
gi|31792879|ref|NP_855372.1| hypothetical protein Mb1719 [Mycobacterium bovis AF2122/97]
47 more sequence titles
Length=58
Score = 110 bits (276), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 58/58 (100%), Positives = 58/58 (100%), Gaps = 0/58 (0%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG 58
MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG
Sbjct 1 MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG 58
>gi|240170142|ref|ZP_04748801.1| hypothetical protein MkanA1_12568 [Mycobacterium kansasii ATCC
12478]
Length=58
Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 50/58 (87%), Positives = 54/58 (94%), Gaps = 0/58 (0%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG 58
MTIDPDQIR+EI+AL+A LPD ADAENGPSLAELE IARRLSEAH+VLL ALESAEKG
Sbjct 1 MTIDPDQIRSEIEALVAQLPDVADAENGPSLAELEVIARRLSEAHDVLLRALESAEKG 58
>gi|183982513|ref|YP_001850804.1| hypothetical protein MMAR_2498 [Mycobacterium marinum M]
gi|183175839|gb|ACC40949.1| conserved hypothetical protein [Mycobacterium marinum M]
Length=58
Score = 90.9 bits (224), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 47/58 (82%), Positives = 51/58 (88%), Gaps = 0/58 (0%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG 58
MTIDPDQIR EI++LLA LP+ AD NGPSLAELE IARRLSEAH+VLL ALESAEKG
Sbjct 1 MTIDPDQIRTEIESLLAQLPEVADGANGPSLAELEDIARRLSEAHDVLLQALESAEKG 58
>gi|118617310|ref|YP_905642.1| hypothetical protein MUL_1682 [Mycobacterium ulcerans Agy99]
gi|118569420|gb|ABL04171.1| conserved hypothetical protein [Mycobacterium ulcerans Agy99]
Length=58
Score = 86.7 bits (213), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/58 (80%), Positives = 50/58 (87%), Gaps = 0/58 (0%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG 58
MTIDPDQIR EI++LLA LP+ AD NG SLAELE IARRLSEAH+VLL ALESAEKG
Sbjct 1 MTIDPDQIRTEIESLLAQLPEVADDANGLSLAELEDIARRLSEAHDVLLQALESAEKG 58
>gi|41407498|ref|NP_960334.1| hypothetical protein MAP1400 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118467047|ref|YP_882262.1| hypothetical protein MAV_3078 [Mycobacterium avium 104]
gi|254775530|ref|ZP_05217046.1| hypothetical protein MaviaA2_12796 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|41395851|gb|AAS03717.1| hypothetical protein MAP_1400 [Mycobacterium avium subsp. paratuberculosis
K-10]
gi|118168334|gb|ABK69231.1| conserved hypothetical protein [Mycobacterium avium 104]
gi|336457852|gb|EGO36846.1| hypothetical protein MAPs_19510 [Mycobacterium avium subsp. paratuberculosis
S397]
Length=61
Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/61 (79%), Positives = 51/61 (84%), Gaps = 3/61 (4%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAEN---GPSLAELEGIARRLSEAHEVLLAALESAEK 57
M+IDPDQIRAEIDALLA LP P D E+ PSL ELE IARRLSEAH+VLLAALESAEK
Sbjct 1 MSIDPDQIRAEIDALLARLPQPGDTEDPDKAPSLDELEEIARRLSEAHDVLLAALESAEK 60
Query 58 G 58
G
Sbjct 61 G 61
>gi|254821087|ref|ZP_05226088.1| hypothetical protein MintA_14217 [Mycobacterium intracellulare
ATCC 13950]
Length=61
Score = 84.7 bits (208), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/61 (79%), Positives = 51/61 (84%), Gaps = 3/61 (4%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAEN---GPSLAELEGIARRLSEAHEVLLAALESAEK 57
M+IDPDQIRAEIDALLA LP P D E+ PSL ELE IARRLSEAH+VLLAALESAEK
Sbjct 1 MSIDPDQIRAEIDALLARLPRPGDQEDPEKAPSLDELEEIARRLSEAHDVLLAALESAEK 60
Query 58 G 58
G
Sbjct 61 G 61
>gi|342860729|ref|ZP_08717379.1| hypothetical protein MCOL_17698 [Mycobacterium colombiense CECT
3035]
gi|342131753|gb|EGT85010.1| hypothetical protein MCOL_17698 [Mycobacterium colombiense CECT
3035]
Length=61
Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/61 (79%), Positives = 50/61 (82%), Gaps = 3/61 (4%)
Query 1 MTIDPDQIRAEIDALLASLP---DPADAENGPSLAELEGIARRLSEAHEVLLAALESAEK 57
MTIDPD+IRAEIDALLA LP D D E PSL ELE IARRLSEAH+VLLAALESAEK
Sbjct 1 MTIDPDEIRAEIDALLARLPQAGDTEDPEQVPSLDELEEIARRLSEAHDVLLAALESAEK 60
Query 58 G 58
G
Sbjct 61 G 61
>gi|296164593|ref|ZP_06847160.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295900012|gb|EFG79451.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=61
Score = 82.4 bits (202), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/61 (76%), Positives = 52/61 (86%), Gaps = 3/61 (4%)
Query 1 MTIDPDQIRAEIDALLASLPDPA---DAENGPSLAELEGIARRLSEAHEVLLAALESAEK 57
MTIDP++IRAEIDALLA LP+P +N PSL+ELE IARRLSEAH+VLLAALESAEK
Sbjct 1 MTIDPEEIRAEIDALLARLPEPGAGDTPDNQPSLSELEEIARRLSEAHDVLLAALESAEK 60
Query 58 G 58
G
Sbjct 61 G 61
>gi|15827704|ref|NP_301967.1| hypothetical protein ML1357 [Mycobacterium leprae TN]
gi|221230181|ref|YP_002503597.1| hypothetical protein MLBr_01357 [Mycobacterium leprae Br4923]
gi|13093255|emb|CAC31738.1| conserved hypothetical protein [Mycobacterium leprae]
gi|219933288|emb|CAR71452.1| conserved hypothetical protein [Mycobacterium leprae Br4923]
Length=61
Score = 80.9 bits (198), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/61 (74%), Positives = 48/61 (79%), Gaps = 3/61 (4%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAEN---GPSLAELEGIARRLSEAHEVLLAALESAEK 57
MTIDPDQIRAEIDALLA LPD AD E+ G SLA+LE +A R SE H VLL ALESAEK
Sbjct 1 MTIDPDQIRAEIDALLAQLPDFADLEDSVSGLSLAQLEEVALRFSEVHSVLLQALESAEK 60
Query 58 G 58
G
Sbjct 61 G 61
>gi|333990489|ref|YP_004523103.1| hypothetical protein JDM601_1849 [Mycobacterium sp. JDM601]
gi|333486457|gb|AEF35849.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=58
Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 38/58 (66%), Gaps = 0/58 (0%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG 58
M + D +R I ALLA LP A+ P+ ++L +A+RL AHEVL+AALES EKG
Sbjct 1 MNSEIDDVRERIAALLAELPGEAELAELPAHSDLNALAQRLEAAHEVLVAALESVEKG 58
>gi|169629444|ref|YP_001703093.1| hypothetical protein MAB_2358 [Mycobacterium abscessus ATCC 19977]
gi|169241411|emb|CAM62439.1| Conserved hypothetical protein [Mycobacterium abscessus]
Length=69
Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/55 (50%), Positives = 38/55 (70%), Gaps = 0/55 (0%)
Query 4 DPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG 58
DPD++R+ + ALL L PA A + S +LE A+ L +AH+VL+ ALESAE+G
Sbjct 15 DPDEVRSRVQALLGQLDSPAGAGDTESDLDLENQAQILEQAHDVLVRALESAEQG 69
>gi|108799918|ref|YP_640115.1| hypothetical protein Mmcs_2952 [Mycobacterium sp. MCS]
gi|119869028|ref|YP_938980.1| hypothetical protein Mkms_2996 [Mycobacterium sp. KMS]
gi|126435546|ref|YP_001071237.1| hypothetical protein Mjls_2967 [Mycobacterium sp. JLS]
gi|108770337|gb|ABG09059.1| conserved hypothetical protein [Mycobacterium sp. MCS]
gi|119695117|gb|ABL92190.1| conserved hypothetical protein [Mycobacterium sp. KMS]
gi|126235346|gb|ABN98746.1| conserved hypothetical protein [Mycobacterium sp. JLS]
Length=61
Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 26/58 (45%), Positives = 39/58 (68%), Gaps = 1/58 (1%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG 58
M+ DPDQ+R + ALLA LP+P D + +++ +A RL EAH++L+ AL + EKG
Sbjct 5 MSNDPDQLRTRVAALLADLPEP-DHAGDLADCDIDTVAARLEEAHDLLVQALAAVEKG 61
>gi|145224094|ref|YP_001134772.1| hypothetical protein Mflv_3510 [Mycobacterium gilvum PYR-GCK]
gi|315444431|ref|YP_004077310.1| hypothetical protein Mspyr1_28490 [Mycobacterium sp. Spyr1]
gi|145216580|gb|ABP45984.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK]
gi|315262734|gb|ADT99475.1| hypothetical protein Mspyr1_28490 [Mycobacterium sp. Spyr1]
Length=51
Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/58 (49%), Positives = 37/58 (64%), Gaps = 7/58 (12%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG 58
M+ DPDQIRA+I LL D S ++L+ +A RL EAH++L+ ALES EKG
Sbjct 1 MSNDPDQIRAQIAELLGERTDL-------SASDLDSVAARLEEAHDLLVRALESVEKG 51
>gi|120404266|ref|YP_954095.1| hypothetical protein Mvan_3291 [Mycobacterium vanbaalenii PYR-1]
gi|119957084|gb|ABM14089.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
Length=54
Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/58 (47%), Positives = 38/58 (66%), Gaps = 8/58 (13%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEKG 58
M+ DPDQIRA++ LL PD + ++L+ +A RL EAH++L+ ALES EKG
Sbjct 5 MSNDPDQIRAQVAELLGQ-PDV-------TASDLDAVAARLEEAHDLLVRALESVEKG 54
>gi|118469330|ref|YP_888049.1| hypothetical protein MSMEG_3752 [Mycobacterium smegmatis str.
MC2 155]
gi|118170617|gb|ABK71513.1| conserved hypothetical protein [Mycobacterium smegmatis str.
MC2 155]
Length=68
Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/60 (42%), Positives = 41/60 (69%), Gaps = 4/60 (6%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAENGPSL--AELEGIARRLSEAHEVLLAALESAEKG 58
MT DPDQ+RA + +LA LPD + +G L +++ +A +L +AH++L+ ALE+ +KG
Sbjct 1 MTTDPDQLRARVAEVLADLPDL--SADGSELVDTDIDLLAAKLEQAHDLLVQALETVDKG 58
>gi|54023963|ref|YP_118205.1| hypothetical protein nfa19950 [Nocardia farcinica IFM 10152]
gi|54015471|dbj|BAD56841.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=88
Score = 42.4 bits (98), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 28/58 (49%), Positives = 37/58 (64%), Gaps = 4/58 (6%)
Query 4 DPDQIRAEIDALLASL----PDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEK 57
DPD+IRAEID LLA L P AE+ + A++ AR L +AHEVL+ AL + +K
Sbjct 30 DPDRIRAEIDELLAELDAGGGRPGPAEHAETGADITRRARILEQAHEVLVQALATVDK 87
>gi|289750249|ref|ZP_06509627.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T92]
gi|289690836|gb|EFD58265.1| LOW QUALITY PROTEIN: conserved hypothetical protein [Mycobacterium
tuberculosis T92]
Length=60
Score = 36.2 bits (82), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/33 (55%), Positives = 22/33 (67%), Gaps = 0/33 (0%)
Query 1 MTIDPDQIRAEIDALLASLPDPADAENGPSLAE 33
MTIDPDQIRA +SLPD + +G SLA+
Sbjct 1 MTIDPDQIRARQSPPTSSLPDQTMSRHGQSLAQ 33
>gi|261420610|ref|YP_003254292.1| nucleotide sugar dehydrogenase [Geobacillus sp. Y412MC61]
gi|319768281|ref|YP_004133782.1| nucleotide sugar dehydrogenase [Geobacillus sp. Y412MC52]
gi|261377067|gb|ACX79810.1| nucleotide sugar dehydrogenase [Geobacillus sp. Y412MC61]
gi|317113147|gb|ADU95639.1| nucleotide sugar dehydrogenase [Geobacillus sp. Y412MC52]
Length=437
Score = 35.4 bits (80), Expect = 2.8, Method: Composition-based stats.
Identities = 18/46 (40%), Positives = 30/46 (66%), Gaps = 1/46 (2%)
Query 10 AEIDALLASLPDPADAENGPSLAELEGIARRLSE-AHEVLLAALES 54
AE+DA+ ++P P DA + P + +E A+ +++ AHE +L LES
Sbjct 88 AEVDAVAIAVPTPLDAHHQPDTSYVENSAKEIAKYAHEGMLVVLES 133
>gi|229494832|ref|ZP_04388586.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
gi|229318270|gb|EEN84137.1| conserved hypothetical protein [Rhodococcus erythropolis SK121]
Length=80
Score = 35.0 bits (79), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 22/60 (37%), Positives = 35/60 (59%), Gaps = 7/60 (11%)
Query 2 TIDPDQIRAEIDALLASL----PDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEK 57
++DP++IR+++D LLA L DP D + + I L +AHEVL+ AL S ++
Sbjct 23 SVDPERIRSDVDGLLARLGGIEQDPNDQHGAGVIPQKAHI---LEQAHEVLVQALASVDR 79
>gi|226360084|ref|YP_002777862.1| hypothetical protein ROP_06700 [Rhodococcus opacus B4]
gi|226238569|dbj|BAH48917.1| hypothetical protein [Rhodococcus opacus B4]
Length=95
Score = 35.0 bits (79), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 23/60 (39%), Positives = 37/60 (62%), Gaps = 7/60 (11%)
Query 2 TIDPDQIRAEIDALLASL----PDPADAENGPSLAELEGIARRLSEAHEVLLAALESAEK 57
++DP++I E+D LL+ L PDP D E+G + + A L +AH+VL+ AL + +K
Sbjct 38 SVDPEKIHTEVDGLLSRLGAVEPDP-DDEHGAGVIPRK--AHLLEKAHDVLVEALATVDK 94
>gi|224985441|gb|ACN74538.1| hypothetical UDP-N-acetyl-D-mannosaminuronate dehydrogenase [Geobacillus
stearothermophilus]
Length=437
Score = 33.9 bits (76), Expect = 8.9, Method: Composition-based stats.
Identities = 17/46 (37%), Positives = 29/46 (64%), Gaps = 1/46 (2%)
Query 10 AEIDALLASLPDPADAENGPSLAELEGIARRLSE-AHEVLLAALES 54
AE+DA+ ++P P D + P + +E A+ +++ AHE +L LES
Sbjct 88 AEVDAVAIAVPTPLDEHHQPDTSYVENSAKEIAKYAHEGMLVVLES 133
Lambda K H
0.309 0.129 0.339
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 128188454310
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40