BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv0691A
Length=62
Score E
Sequences producing significant alignments: (Bits) Value
gi|15840096|ref|NP_335133.1| hypothetical protein MT0719.1 [Myco... 128 2e-28
gi|308370477|ref|ZP_07666942.1| hypothetical protein TMCG_03503 ... 97.4 7e-19
gi|254822796|ref|ZP_05227797.1| hypothetical protein MintA_22894... 54.7 5e-06
gi|254776916|ref|ZP_05218432.1| hypothetical protein MaviaA2_199... 51.6 4e-05
gi|108797972|ref|YP_638169.1| hypothetical protein Mmcs_0999 [My... 50.4 8e-05
gi|118466062|ref|YP_883615.1| hypothetical protein MAV_4480 [Myc... 50.4 9e-05
gi|118473061|ref|YP_885804.1| hypothetical protein MSMEG_1421 [M... 47.8 6e-04
gi|333991990|ref|YP_004524604.1| hypothetical protein JDM601_335... 45.4 0.003
gi|262203644|ref|YP_003274852.1| hypothetical protein Gbro_3780 ... 44.7 0.004
gi|315445990|ref|YP_004078869.1| hypothetical protein Mspyr1_448... 41.6 0.042
gi|326384862|ref|ZP_08206537.1| hypothetical protein SCNU_18052 ... 41.2 0.054
gi|288919786|ref|ZP_06414111.1| hypothetical protein FrEUN1fDRAF... 39.7 0.16
gi|312140961|ref|YP_004008297.1| hypothetical protein REQ_36290 ... 38.9 0.23
gi|331699424|ref|YP_004335663.1| hypothetical protein Psed_5683 ... 38.9 0.23
gi|226305286|ref|YP_002765244.1| hypothetical protein RER_17970 ... 38.9 0.25
gi|54025411|ref|YP_119653.1| hypothetical protein nfa34410 [Noca... 38.5 0.37
gi|284993285|ref|YP_003411840.1| hypothetical protein Gobs_4938 ... 37.7 0.52
gi|111222982|ref|YP_713776.1| hypothetical protein FRAAL3570 [Fr... 37.7 0.66
gi|326329211|ref|ZP_08195538.1| hypothetical protein NBCG_00646 ... 37.0 0.88
gi|269126597|ref|YP_003299967.1| hypothetical protein Tcur_2364 ... 35.8 2.4
gi|134098951|ref|YP_001104612.1| hypothetical protein SACE_2387 ... 35.0 4.1
>gi|15840096|ref|NP_335133.1| hypothetical protein MT0719.1 [Mycobacterium tuberculosis CDC1551]
gi|167967930|ref|ZP_02550207.1| hypothetical protein MtubH3_07793 [Mycobacterium tuberculosis
H37Ra]
gi|254231011|ref|ZP_04924338.1| hypothetical protein TBCG_00682 [Mycobacterium tuberculosis C]
12 more sequence titles
Length=62
Score = 128 bits (322), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 62/62 (100%), Positives = 62/62 (100%), Gaps = 0/62 (0%)
Query 1 MRHHIRPSISALDAILCPDRRIAVETCWRKAIQMDYETDTDTELVTETLVEEVSIDGMCG 60
MRHHIRPSISALDAILCPDRRIAVETCWRKAIQMDYETDTDTELVTETLVEEVSIDGMCG
Sbjct 1 MRHHIRPSISALDAILCPDRRIAVETCWRKAIQMDYETDTDTELVTETLVEEVSIDGMCG 60
Query 61 VY 62
VY
Sbjct 61 VY 62
>gi|308370477|ref|ZP_07666942.1| hypothetical protein TMCG_03503 [Mycobacterium tuberculosis SUMu003]
gi|308373548|ref|ZP_07667604.1| hypothetical protein TMEG_02064 [Mycobacterium tuberculosis SUMu005]
gi|308374074|ref|ZP_07667704.1| hypothetical protein TMFG_03362 [Mycobacterium tuberculosis SUMu006]
gi|308331838|gb|EFP20689.1| hypothetical protein TMCG_03503 [Mycobacterium tuberculosis SUMu003]
gi|308337241|gb|EFP26092.1| hypothetical protein TMEG_02064 [Mycobacterium tuberculosis SUMu005]
gi|308343105|gb|EFP31956.1| hypothetical protein TMFG_03362 [Mycobacterium tuberculosis SUMu006]
Length=47
Score = 97.4 bits (241), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 46/47 (98%), Positives = 47/47 (100%), Gaps = 0/47 (0%)
Query 16 LCPDRRIAVETCWRKAIQMDYETDTDTELVTETLVEEVSIDGMCGVY 62
+CPDRRIAVETCWRKAIQMDYETDTDTELVTETLVEEVSIDGMCGVY
Sbjct 1 MCPDRRIAVETCWRKAIQMDYETDTDTELVTETLVEEVSIDGMCGVY 47
>gi|254822796|ref|ZP_05227797.1| hypothetical protein MintA_22894 [Mycobacterium intracellulare
ATCC 13950]
gi|342861781|ref|ZP_08718426.1| hypothetical protein MCOL_22945 [Mycobacterium colombiense CECT
3035]
gi|342130598|gb|EGT83902.1| hypothetical protein MCOL_22945 [Mycobacterium colombiense CECT
3035]
Length=29
Score = 54.7 bits (130), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/29 (90%), Positives = 29/29 (100%), Gaps = 0/29 (0%)
Query 34 MDYETDTDTELVTETLVEEVSIDGMCGVY 62
MD+ET+TDT+LVTETLVEEVSIDGMCGVY
Sbjct 1 MDHETETDTQLVTETLVEEVSIDGMCGVY 29
>gi|254776916|ref|ZP_05218432.1| hypothetical protein MaviaA2_19926 [Mycobacterium avium subsp.
avium ATCC 25291]
gi|336460677|gb|EGO39568.1| mycofactocin precursor [Mycobacterium avium subsp. paratuberculosis
S397]
Length=29
Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 24/29 (83%), Positives = 28/29 (97%), Gaps = 0/29 (0%)
Query 34 MDYETDTDTELVTETLVEEVSIDGMCGVY 62
M++ET+TD +LVTETLVEEVSIDGMCGVY
Sbjct 1 MEHETETDAQLVTETLVEEVSIDGMCGVY 29
>gi|108797972|ref|YP_638169.1| hypothetical protein Mmcs_0999 [Mycobacterium sp. MCS]
gi|108768391|gb|ABG07113.1| hypothetical protein Mmcs_0999 [Mycobacterium sp. MCS]
Length=52
Score = 50.4 bits (119), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/36 (73%), Positives = 29/36 (81%), Gaps = 2/36 (5%)
Query 29 RKAIQM--DYETDTDTELVTETLVEEVSIDGMCGVY 62
RKA+ M + + DTELVTETLVEEVSIDGMCGVY
Sbjct 17 RKAVLMEPNQHVEADTELVTETLVEEVSIDGMCGVY 52
>gi|118466062|ref|YP_883615.1| hypothetical protein MAV_4480 [Mycobacterium avium 104]
gi|118167349|gb|ABK68246.1| conserved hypothetical protein [Mycobacterium avium 104]
Length=52
Score = 50.4 bits (119), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 23/29 (80%), Positives = 28/29 (97%), Gaps = 0/29 (0%)
Query 34 MDYETDTDTELVTETLVEEVSIDGMCGVY 62
+++ET+TD +LVTETLVEEVSIDGMCGVY
Sbjct 24 VEHETETDAQLVTETLVEEVSIDGMCGVY 52
>gi|118473061|ref|YP_885804.1| hypothetical protein MSMEG_1421 [Mycobacterium smegmatis str.
MC2 155]
gi|118174348|gb|ABK75244.1| hypothetical protein MSMEG_1421 [Mycobacterium smegmatis str.
MC2 155]
Length=52
Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/36 (67%), Positives = 28/36 (78%), Gaps = 2/36 (5%)
Query 29 RKAIQM--DYETDTDTELVTETLVEEVSIDGMCGVY 62
RK + M + + +TELVTETLVEEVSIDGMCGVY
Sbjct 17 RKGVHMEPNQHVEAETELVTETLVEEVSIDGMCGVY 52
>gi|333991990|ref|YP_004524604.1| hypothetical protein JDM601_3350 [Mycobacterium sp. JDM601]
gi|333487958|gb|AEF37350.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length=51
Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/30 (74%), Positives = 23/30 (77%), Gaps = 0/30 (0%)
Query 33 QMDYETDTDTELVTETLVEEVSIDGMCGVY 62
Q E TDT+LV E LVEEVSIDGMCGVY
Sbjct 22 QQQTEASTDTDLVVENLVEEVSIDGMCGVY 51
>gi|262203644|ref|YP_003274852.1| hypothetical protein Gbro_3780 [Gordonia bronchialis DSM 43247]
gi|262086991|gb|ACY22959.1| hypothetical protein Gbro_3780 [Gordonia bronchialis DSM 43247]
Length=43
Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 20/24 (84%), Positives = 22/24 (92%), Gaps = 0/24 (0%)
Query 39 DTDTELVTETLVEEVSIDGMCGVY 62
D DT+LV E+LVEEVSIDGMCGVY
Sbjct 20 DADTDLVGESLVEEVSIDGMCGVY 43
>gi|315445990|ref|YP_004078869.1| hypothetical protein Mspyr1_44800 [Mycobacterium sp. Spyr1]
gi|315264293|gb|ADU01035.1| hypothetical protein Mspyr1_44800 [Mycobacterium sp. Spyr1]
Length=30
Score = 41.6 bits (96), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 19/20 (95%), Positives = 20/20 (100%), Gaps = 0/20 (0%)
Query 43 ELVTETLVEEVSIDGMCGVY 62
ELVTE+LVEEVSIDGMCGVY
Sbjct 11 ELVTESLVEEVSIDGMCGVY 30
>gi|326384862|ref|ZP_08206537.1| hypothetical protein SCNU_18052 [Gordonia neofelifaecis NRRL
B-59395]
gi|326196381|gb|EGD53580.1| hypothetical protein SCNU_18052 [Gordonia neofelifaecis NRRL
B-59395]
Length=33
Score = 41.2 bits (95), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 19/28 (68%), Positives = 21/28 (75%), Gaps = 0/28 (0%)
Query 35 DYETDTDTELVTETLVEEVSIDGMCGVY 62
D T +LV E+LVEEVSIDGMCGVY
Sbjct 6 DQANATQQDLVAESLVEEVSIDGMCGVY 33
>gi|288919786|ref|ZP_06414111.1| hypothetical protein FrEUN1fDRAFT_3808 [Frankia sp. EUN1f]
gi|288348794|gb|EFC83046.1| hypothetical protein FrEUN1fDRAFT_3808 [Frankia sp. EUN1f]
Length=56
Score = 39.7 bits (91), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 18/33 (55%), Positives = 23/33 (70%), Gaps = 0/33 (0%)
Query 30 KAIQMDYETDTDTELVTETLVEEVSIDGMCGVY 62
A Q+ + + LV +TL+EEVSIDGMCGVY
Sbjct 24 PAAQIHHVHGAEAPLVEDTLIEEVSIDGMCGVY 56
>gi|312140961|ref|YP_004008297.1| hypothetical protein REQ_36290 [Rhodococcus equi 103S]
gi|311890300|emb|CBH49618.1| hypothetical protein REQ_36290 [Rhodococcus equi 103S]
Length=38
Score = 38.9 bits (89), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 17/23 (74%), Positives = 20/23 (87%), Gaps = 0/23 (0%)
Query 40 TDTELVTETLVEEVSIDGMCGVY 62
++ LV E+LVEEVSIDGMCGVY
Sbjct 16 NESSLVEESLVEEVSIDGMCGVY 38
>gi|331699424|ref|YP_004335663.1| hypothetical protein Psed_5683 [Pseudonocardia dioxanivorans
CB1190]
gi|326954113|gb|AEA27810.1| hypothetical protein Psed_5683 [Pseudonocardia dioxanivorans
CB1190]
Length=35
Score = 38.9 bits (89), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 18/19 (95%), Positives = 18/19 (95%), Gaps = 0/19 (0%)
Query 44 LVTETLVEEVSIDGMCGVY 62
LV ETLVEEVSIDGMCGVY
Sbjct 17 LVEETLVEEVSIDGMCGVY 35
>gi|226305286|ref|YP_002765244.1| hypothetical protein RER_17970 [Rhodococcus erythropolis PR4]
gi|229490530|ref|ZP_04384368.1| hypothetical protein RHOER0001_5961 [Rhodococcus erythropolis
SK121]
gi|226184401|dbj|BAH32505.1| hypothetical protein RER_17970 [Rhodococcus erythropolis PR4]
gi|229322350|gb|EEN88133.1| hypothetical protein RHOER0001_5961 [Rhodococcus erythropolis
SK121]
Length=55
Score = 38.9 bits (89), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 17/22 (78%), Positives = 20/22 (91%), Gaps = 0/22 (0%)
Query 41 DTELVTETLVEEVSIDGMCGVY 62
+ +LV E+LVEEVSIDGMCGVY
Sbjct 34 ENDLVEESLVEEVSIDGMCGVY 55
>gi|54025411|ref|YP_119653.1| hypothetical protein nfa34410 [Nocardia farcinica IFM 10152]
gi|54016919|dbj|BAD58289.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=154
Score = 38.5 bits (88), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 16/22 (73%), Positives = 20/22 (91%), Gaps = 0/22 (0%)
Query 41 DTELVTETLVEEVSIDGMCGVY 62
++ +V ETL+EEVSIDGMCGVY
Sbjct 133 NSAVVDETLIEEVSIDGMCGVY 154
>gi|284993285|ref|YP_003411840.1| hypothetical protein Gobs_4938 [Geodermatophilus obscurus DSM
43160]
gi|284066531|gb|ADB77469.1| conserved hypothetical protein [Geodermatophilus obscurus DSM
43160]
Length=37
Score = 37.7 bits (86), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 17/19 (90%), Positives = 18/19 (95%), Gaps = 0/19 (0%)
Query 44 LVTETLVEEVSIDGMCGVY 62
LV E+LVEEVSIDGMCGVY
Sbjct 19 LVEESLVEEVSIDGMCGVY 37
>gi|111222982|ref|YP_713776.1| hypothetical protein FRAAL3570 [Frankia alni ACN14a]
gi|111150514|emb|CAJ62213.1| conserved hypothetical protein [Frankia alni ACN14a]
Length=38
Score = 37.7 bits (86), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 18/35 (52%), Positives = 25/35 (72%), Gaps = 1/35 (2%)
Query 29 RKAIQMDYETDTDTE-LVTETLVEEVSIDGMCGVY 62
+++Q+D + E +V + LVEEVSIDGMCGVY
Sbjct 4 EESVQVDQAVEASAEPVVADLLVEEVSIDGMCGVY 38
>gi|326329211|ref|ZP_08195538.1| hypothetical protein NBCG_00646 [Nocardioidaceae bacterium Broad-1]
gi|325953006|gb|EGD45019.1| hypothetical protein NBCG_00646 [Nocardioidaceae bacterium Broad-1]
Length=34
Score = 37.0 bits (84), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 16/24 (67%), Positives = 20/24 (84%), Gaps = 0/24 (0%)
Query 39 DTDTELVTETLVEEVSIDGMCGVY 62
D +++ TE L+EEVSIDGMCGVY
Sbjct 11 DIASDVATEDLIEEVSIDGMCGVY 34
>gi|269126597|ref|YP_003299967.1| hypothetical protein Tcur_2364 [Thermomonospora curvata DSM 43183]
gi|268311555|gb|ACY97929.1| hypothetical protein Tcur_2364 [Thermomonospora curvata DSM 43183]
Length=39
Score = 35.8 bits (81), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 15/20 (75%), Positives = 17/20 (85%), Gaps = 0/20 (0%)
Query 43 ELVTETLVEEVSIDGMCGVY 62
E E+L+EEVSIDGMCGVY
Sbjct 20 EAAVESLIEEVSIDGMCGVY 39
>gi|134098951|ref|YP_001104612.1| hypothetical protein SACE_2387 [Saccharopolyspora erythraea NRRL
2338]
gi|291006811|ref|ZP_06564784.1| hypothetical protein SeryN2_20008 [Saccharopolyspora erythraea
NRRL 2338]
gi|133911574|emb|CAM01687.1| hypothetical protein SACE_2387 [Saccharopolyspora erythraea NRRL
2338]
Length=35
Score = 35.0 bits (79), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 16/18 (89%), Positives = 16/18 (89%), Gaps = 0/18 (0%)
Query 45 VTETLVEEVSIDGMCGVY 62
V E LVEEVSIDGMCGVY
Sbjct 18 VEELLVEEVSIDGMCGVY 35
Lambda K H
0.323 0.135 0.417
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 131656912128
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40