BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1813c
Length=143
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608950|ref|NP_216329.1| hypothetical protein Rv1813c [Mycob... 292 9e-78
gi|3366597|gb|AAC28395.1| putative open reading frame [Mycobacte... 206 1e-51
gi|240173178|ref|ZP_04751836.1| hypothetical protein MkanA1_2794... 152 1e-35
gi|183982546|ref|YP_001850837.1| hypothetical protein MMAR_2533 ... 137 4e-31
gi|296165794|ref|ZP_06848300.1| conserved hypothetical protein [... 137 5e-31
gi|183981448|ref|YP_001849739.1| hypothetical protein MMAR_1426 ... 122 1e-26
gi|240170411|ref|ZP_04749070.1| hypothetical protein MkanA1_1395... 117 7e-25
gi|15840716|ref|NP_335753.1| hypothetical protein MT1307 [Mycoba... 88.6 3e-16
gi|15608409|ref|NP_215785.1| hypothetical protein Rv1269c [Mycob... 87.8 5e-16
gi|254231526|ref|ZP_04924853.1| hypothetical protein TBCG_01250 ... 87.4 6e-16
gi|240173140|ref|ZP_04751798.1| hypothetical protein MkanA1_2775... 87.0 7e-16
gi|183984691|ref|YP_001852982.1| hypothetical protein MMAR_4723 ... 85.5 2e-15
gi|118619215|ref|YP_907547.1| hypothetical protein MUL_4016 [Myc... 84.7 4e-15
gi|183984124|ref|YP_001852415.1| hypothetical protein MMAR_4153 ... 84.7 4e-15
gi|296140994|ref|YP_003648237.1| hypothetical protein Tpau_3313 ... 57.0 8e-07
gi|240173458|ref|ZP_04752116.1| hypothetical protein MkanA1_2936... 55.8 2e-06
gi|262201902|ref|YP_003273110.1| hypothetical protein Gbro_1963 ... 53.5 1e-05
gi|119489310|ref|ZP_01622117.1| hypothetical protein L8106_07641... 53.1 1e-05
gi|294994825|ref|ZP_06800516.1| hypothetical protein Mtub2_10022... 53.1 1e-05
gi|326385281|ref|ZP_08206943.1| hypothetical protein SCNU_20142 ... 52.0 3e-05
gi|296164287|ref|ZP_06846873.1| conserved hypothetical protein [... 51.6 4e-05
gi|54025836|ref|YP_120078.1| hypothetical protein nfa38660 [Noca... 50.8 6e-05
gi|209525998|ref|ZP_03274531.1| conserved hypothetical protein [... 44.7 0.005
gi|284051993|ref|ZP_06382203.1| hypothetical protein AplaP_11031... 42.0 0.033
gi|326383688|ref|ZP_08205373.1| hypothetical protein SCNU_12152 ... 40.4 0.098
gi|54024135|ref|YP_118377.1| hypothetical protein nfa21670 [Noca... 36.2 1.5
gi|158336921|ref|YP_001518096.1| hypothetical protein AM1_3792 [... 35.4 2.7
gi|30424697|ref|NP_780305.1| starch-binding domain-containing pr... 35.0 3.3
gi|110002651|gb|AAI18616.1| Stbd1 protein [Mus musculus] 35.0 4.1
gi|190574895|ref|YP_001972740.1| hypothetical protein Smlt2996 [... 34.7 4.3
gi|269120985|ref|YP_003309162.1| hypothetical protein Sterm_2378... 34.7 5.5
gi|326381446|ref|ZP_08203140.1| hypothetical protein SCNU_00805 ... 34.3 5.5
gi|126653006|ref|ZP_01725146.1| hypothetical protein BB14905_190... 34.3 5.9
gi|170079327|ref|YP_001735965.1| serine/threonine-protein kinase... 34.3 6.2
gi|241205439|ref|YP_002976535.1| hypothetical protein Rleg_2733 ... 33.9 7.3
>gi|15608950|ref|NP_216329.1| hypothetical protein Rv1813c [Mycobacterium tuberculosis H37Rv]
gi|15841283|ref|NP_336320.1| hypothetical protein MT1861 [Mycobacterium tuberculosis CDC1551]
gi|31793002|ref|NP_855495.1| hypothetical protein Mb1843c [Mycobacterium bovis AF2122/97]
82 more sequence titles
Length=143
Score = 292 bits (748), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 143/143 (100%), Positives = 143/143 (100%), Gaps = 0/143 (0%)
Query 1 MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGLPIPPIIHYGA 60
MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGLPIPPIIHYGA
Sbjct 1 MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGLPIPPIIHYGA 60
Query 61 IAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGLT 120
IAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGLT
Sbjct 61 IAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGLT 120
Query 121 RRAAEDDAVNRLEGGRIVNWACN 143
RRAAEDDAVNRLEGGRIVNWACN
Sbjct 121 RRAAEDDAVNRLEGGRIVNWACN 143
>gi|3366597|gb|AAC28395.1| putative open reading frame [Mycobacterium tuberculosis]
Length=124
Score = 206 bits (524), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 101/102 (99%), Positives = 102/102 (100%), Gaps = 0/102 (0%)
Query 1 MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGLPIPPIIHYGA 60
MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGLPIPPIIHYGA
Sbjct 14 MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGLPIPPIIHYGA 73
Query 61 IAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTR 102
IAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFT+
Sbjct 74 IAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTQ 115
>gi|240173178|ref|ZP_04751836.1| hypothetical protein MkanA1_27946 [Mycobacterium kansasii ATCC
12478]
Length=145
Score = 152 bits (385), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 78/142 (55%), Positives = 102/142 (72%), Gaps = 7/142 (4%)
Query 6 RRRTAMAAAGLGAALGLGIL---LVPTVDAHLANGSMSEV-MMSEIAGLPIPPIIHYGAI 61
RRR + AA +GA +GL +L L+P +DAH+ + +SE+ M+ E+ P+PP IHYGAI
Sbjct 5 RRRITLVAATIGATVGLMVLALPLIPPLDAHIDSAVLSEMGMLPEV---PVPPRIHYGAI 61
Query 62 AYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGLTR 121
AYAP+G GK+ + T A+AE+VAL++CG TCKV+ F RCGAVAY+GS Y GG+GLT
Sbjct 62 AYAPTGEWGKSRNYLTLAKAEEVALDQCGLDTCKVLINFKRCGAVAYDGSTYHGGSGLTL 121
Query 122 RAAEDDAVNRLEGGRIVNWACN 143
A DA+NRL GRIVNW CN
Sbjct 122 SDAMADAINRLGAGRIVNWLCN 143
>gi|183982546|ref|YP_001850837.1| hypothetical protein MMAR_2533 [Mycobacterium marinum M]
gi|183175872|gb|ACC40982.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=142
Score = 137 bits (346), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 75/141 (54%), Positives = 95/141 (68%), Gaps = 5/141 (3%)
Query 6 RRRTAMAAAGLGAALGL---GILLVPTVDAHLANGSMSEVMMSEIAGLPIPPIIHYGAIA 62
RRR A+A A +GA GL G+ L ++ A++ MSE M + P+P I+HYGAIA
Sbjct 4 RRRIALATATVGATAGLMFIGLALTGSIGANMDRAVMSE--MGMLPEGPVPLIVHYGAIA 61
Query 63 YAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGLTRR 122
YAP+GA GKA + AEQ AL++CG +CKV+ F RCGAVAYN KYQGG+G T
Sbjct 62 YAPNGAFGKARRFTSRFGAEQAALKQCGLDSCKVLINFNRCGAVAYNNLKYQGGSGWTLS 121
Query 123 AAEDDAVNRLEGGRIVNWACN 143
AA+ DA++RL GG IVNWACN
Sbjct 122 AAQQDAIDRLGGGWIVNWACN 142
>gi|296165794|ref|ZP_06848300.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898848|gb|EFG78348.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=141
Score = 137 bits (345), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 75/144 (53%), Positives = 95/144 (66%), Gaps = 4/144 (2%)
Query 1 MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEV-MMSEIAGLPIPPIIHYG 59
M+ R R +A A +GA GL I +P + AN M E +M E P+PP+I YG
Sbjct 1 MMIKRRYRIGLAVATVGATAGLMIAALPFMPGVGANTLMPETAVMPE---GPVPPVIRYG 57
Query 60 AIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGL 119
A+AYAPSGA G+ T RA QVAL++CG K CK++ + RCGAVAY+G+ Y GG G+
Sbjct 58 AMAYAPSGAWGRTRGYGTRERAIQVALDQCGVKDCKLIVSYQRCGAVAYDGTTYLGGKGV 117
Query 120 TRRAAEDDAVNRLEGGRIVNWACN 143
TR AE+DA+NRL GGRIVNWACN
Sbjct 118 TRSLAEEDAINRLGGGRIVNWACN 141
>gi|183981448|ref|YP_001849739.1| hypothetical protein MMAR_1426 [Mycobacterium marinum M]
gi|183174774|gb|ACC39884.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=136
Score = 122 bits (307), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 76/143 (54%), Positives = 95/143 (67%), Gaps = 8/143 (5%)
Query 1 MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGLPIPPIIHYGA 60
M+TNLRRR A+ L AALGLG+LL+ AHL + S I G + PI +YGA
Sbjct 1 MMTNLRRRAALIVVTLAAALGLGLLLLSPAGAHLYDDS--------ITGRIVAPITYYGA 52
Query 61 IAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGLT 120
IAY P+G +G++W+ RT A+AE AL+ CG + CKV+S F RCGAVA++GS GG G T
Sbjct 53 IAYGPNGVNGRSWNNRTRAQAESSALKLCGVEGCKVLSSFVRCGAVAFDGSARHGGVGRT 112
Query 121 RRAAEDDAVNRLEGGRIVNWACN 143
R+ AEDDA RL GG I WACN
Sbjct 113 RQMAEDDARFRLGGGWIETWACN 135
>gi|240170411|ref|ZP_04749070.1| hypothetical protein MkanA1_13955 [Mycobacterium kansasii ATCC
12478]
Length=125
Score = 117 bits (292), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 62/131 (48%), Positives = 81/131 (62%), Gaps = 8/131 (6%)
Query 14 AGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGLPIPPI-IHYGAIAYAPSGASGKA 72
+GL A + + + AH+ G M+ V +PP ++YGAIAY G++GKA
Sbjct 2 SGLAVAAAMTVTQIHPAGAHIHRGEMTHVS-------NMPPFPVYYGAIAYGHDGSNGKA 54
Query 73 WHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGLTRRAAEDDAVNRL 132
W + A+A+ ALE CG TC VVS FTRCGAVA++G+KY GG G R AAE A+ L
Sbjct 55 WRHLSKAQAKHRALELCGIDTCTVVSVFTRCGAVAHDGAKYHGGYGYNRSAAEAHAMANL 114
Query 133 EGGRIVNWACN 143
GGRIV+WACN
Sbjct 115 GGGRIVDWACN 125
>gi|15840716|ref|NP_335753.1| hypothetical protein MT1307 [Mycobacterium tuberculosis CDC1551]
gi|13880905|gb|AAK45567.1| hypothetical protein MT1307 [Mycobacterium tuberculosis CDC1551]
Length=140
Score = 88.6 bits (218), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 45/86 (53%), Positives = 56/86 (66%), Gaps = 0/86 (0%)
Query 58 YGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGT 117
YGAIAY+ +G+ G++W T A AE A++ CG CKV++ FT CGAVA N YQGG
Sbjct 55 YGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGV 114
Query 118 GLTRRAAEDDAVNRLEGGRIVNWACN 143
G T AA DA+ +L GG I WACN
Sbjct 115 GPTLAAAMKDALTKLGGGYIDTWACN 140
>gi|15608409|ref|NP_215785.1| hypothetical protein Rv1269c [Mycobacterium tuberculosis H37Rv]
gi|31792461|ref|NP_854954.1| hypothetical protein Mb1300c [Mycobacterium bovis AF2122/97]
gi|121637197|ref|YP_977420.1| hypothetical protein BCG_1328c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
43 more sequence titles
Length=124
Score = 87.8 bits (216), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 45/86 (53%), Positives = 56/86 (66%), Gaps = 0/86 (0%)
Query 58 YGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGT 117
YGAIAY+ +G+ G++W T A AE A++ CG CKV++ FT CGAVA N YQGG
Sbjct 39 YGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGV 98
Query 118 GLTRRAAEDDAVNRLEGGRIVNWACN 143
G T AA DA+ +L GG I WACN
Sbjct 99 GPTLAAAMKDALTKLGGGYIDTWACN 124
>gi|254231526|ref|ZP_04924853.1| hypothetical protein TBCG_01250 [Mycobacterium tuberculosis C]
gi|298524772|ref|ZP_07012181.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|308231800|ref|ZP_07413774.2| conserved secreted protein [Mycobacterium tuberculosis SUMu001]
26 more sequence titles
Length=121
Score = 87.4 bits (215), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 45/86 (53%), Positives = 56/86 (66%), Gaps = 0/86 (0%)
Query 58 YGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGT 117
YGAIAY+ +G+ G++W T A AE A++ CG CKV++ FT CGAVA N YQGG
Sbjct 36 YGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGV 95
Query 118 GLTRRAAEDDAVNRLEGGRIVNWACN 143
G T AA DA+ +L GG I WACN
Sbjct 96 GPTLAAAMKDALTKLGGGYIDTWACN 121
>gi|240173140|ref|ZP_04751798.1| hypothetical protein MkanA1_27756 [Mycobacterium kansasii ATCC
12478]
Length=132
Score = 87.0 bits (214), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 43/86 (50%), Positives = 57/86 (67%), Gaps = 0/86 (0%)
Query 58 YGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGT 117
YGAIAY+ +G+ G++W T A AE A++ CG CKV++ FT CGAVA Y+GGT
Sbjct 47 YGAIAYSSNGSWGRSWAYPTKAAAEATAVKSCGYSDCKVLTSFTACGAVAAKDRDYRGGT 106
Query 118 GLTRRAAEDDAVNRLEGGRIVNWACN 143
G AA DA+++L+GG I WACN
Sbjct 107 GPNLSAAMKDALSKLDGGYIDTWACN 132
>gi|183984691|ref|YP_001852982.1| hypothetical protein MMAR_4723 [Mycobacterium marinum M]
gi|183178017|gb|ACC43127.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=155
Score = 85.5 bits (210), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/86 (52%), Positives = 56/86 (66%), Gaps = 0/86 (0%)
Query 58 YGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGT 117
YGAIA A +GA GK W R A+AE AL CG +C V+S FTRCGA+A++G + GG
Sbjct 70 YGAIAVADNGAVGKTWGHRKRAQAEIHALTACGHPSCNVLSVFTRCGAIAHDGQNFHGGL 129
Query 118 GLTRRAAEDDAVNRLEGGRIVNWACN 143
G + +AA DA RL GG ++ ACN
Sbjct 130 GRSHQAAGHDAKARLGGGWVLTSACN 155
>gi|118619215|ref|YP_907547.1| hypothetical protein MUL_4016 [Mycobacterium ulcerans Agy99]
gi|118571325|gb|ABL06076.1| conserved hypothetical secreted protein [Mycobacterium ulcerans
Agy99]
Length=111
Score = 84.7 bits (208), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 43/87 (50%), Positives = 57/87 (66%), Gaps = 0/87 (0%)
Query 57 HYGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGG 116
YGAIAY+ G+ G+A H T A AE A++ CG C+V++ FT CGAVA +G ++GG
Sbjct 25 QYGAIAYSGDGSWGRASHYPTRAAAEATAVKLCGYSDCRVLTTFTACGAVAADGKTFEGG 84
Query 117 TGLTRRAAEDDAVNRLEGGRIVNWACN 143
G T AA DA+++L GG I WACN
Sbjct 85 VGPTLSAAMKDALSKLGGGYIDTWACN 111
>gi|183984124|ref|YP_001852415.1| hypothetical protein MMAR_4153 [Mycobacterium marinum M]
gi|183177450|gb|ACC42560.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=122
Score = 84.7 bits (208), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/87 (51%), Positives = 57/87 (66%), Gaps = 0/87 (0%)
Query 57 HYGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGG 116
YGAIAY+ G+ G+A H T A AE A++ CG CKV++ FT CGAVA +G ++GG
Sbjct 36 QYGAIAYSGDGSWGRASHYPTRAAAEATAVKLCGYSDCKVLTTFTACGAVAADGKTFEGG 95
Query 117 TGLTRRAAEDDAVNRLEGGRIVNWACN 143
G T AA DA+++L GG I WACN
Sbjct 96 VGPTLSAAMKDALSKLGGGYIDTWACN 122
>gi|296140994|ref|YP_003648237.1| hypothetical protein Tpau_3313 [Tsukamurella paurometabola DSM
20162]
gi|296029128|gb|ADG79898.1| conserved putative secreted protein [Tsukamurella paurometabola
DSM 20162]
Length=126
Score = 57.0 bits (136), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 37/90 (42%), Positives = 49/90 (55%), Gaps = 4/90 (4%)
Query 54 PIIHYGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTR-CGAVAYNGSK 112
P + YGAIA +GA G+A A AE+VAL C D C++++ F CGAVA +
Sbjct 35 PGVFYGAIAVGSNGAWGRALDYGNRATAERVALSYC-DGNCRILASFVNGCGAVAKTRTS 93
Query 113 YQGGTGLTRRAAEDDAVNRLEGGRIVNWAC 142
Y G G T A++ A+ GG I WAC
Sbjct 94 YWGNVGDTLGVAQNRALR--NGGYIYTWAC 121
>gi|240173458|ref|ZP_04752116.1| hypothetical protein MkanA1_29366 [Mycobacterium kansasii ATCC
12478]
Length=125
Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/90 (47%), Positives = 49/90 (55%), Gaps = 3/90 (3%)
Query 57 HYGAIAYAPSG-ASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNG-SKYQ 114
+ GAIAY+PSG G+ H + A AE AL CG CKV+ FT CGA+A N +
Sbjct 36 YVGAIAYSPSGKVFGRTKHAPSRAAAESAALGACGYSDCKVLVTFTDCGAIAENSRGDHA 95
Query 115 GGTGLTRRAAEDDAVNRL-EGGRIVNWACN 143
GG G T AAE DA L G I W CN
Sbjct 96 GGYGPTLLAAEQDAAKNLGTSGWIGTWYCN 125
>gi|262201902|ref|YP_003273110.1| hypothetical protein Gbro_1963 [Gordonia bronchialis DSM 43247]
gi|262085249|gb|ACY21217.1| hypothetical protein Gbro_1963 [Gordonia bronchialis DSM 43247]
Length=131
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/143 (33%), Positives = 70/143 (49%), Gaps = 20/143 (13%)
Query 4 NLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGLPIPPIIHYGAIAY 63
+LR+R ++ A L +G L++P+V SE + G +YGA+A
Sbjct 2 SLRKRLSILALTLAGVVG--ALILPSV---------SEPAPAHAYG------YYYGALAL 44
Query 64 APSG-ASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTR-CGAVAYNGSKYQGGTGLTR 121
+ S G+A + A A Q AL CG CKVV+RF CGA+A + S + G G +
Sbjct 45 STSERYVGRALDYDSYAEASQAALRACGYADCKVVTRFANGCGAIAESPSYWGFGNGSSL 104
Query 122 RAAEDDAVNRL-EGGRIVNWACN 143
+A+ +A+ G IV WAC
Sbjct 105 YSAQSEALYYSGSGAEIVYWACT 127
>gi|119489310|ref|ZP_01622117.1| hypothetical protein L8106_07641 [Lyngbya sp. PCC 8106]
gi|119454784|gb|EAW35929.1| hypothetical protein L8106_07641 [Lyngbya sp. PCC 8106]
Length=123
Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/109 (41%), Positives = 55/109 (51%), Gaps = 7/109 (6%)
Query 40 SEVMMSEIAGLPIPPIIHYGAIAYAPSG-ASGKAWHQRTPARAEQVALEKCGDKTCKVVS 98
+E+++S IA P YGAIA P G G A+ + +AEQ ALE+CG+ C+V
Sbjct 14 TEILVSPIASAQ--PSDSYGAIAITPDGQVWGYAYDYPSREQAEQRALEECGESNCQVQV 71
Query 99 RFTR-CGAVAYNGS-KYQGGTGLTRRAAEDDAVNRLEGG--RIVNWACN 143
F CGAVA N K TR+ AE AV G RI WAC
Sbjct 72 WFKNACGAVAKNEEGKLGWAWADTRKQAEASAVAACGTGTCRIETWACT 120
>gi|294994825|ref|ZP_06800516.1| hypothetical protein Mtub2_10022 [Mycobacterium tuberculosis
210]
Length=48
Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/48 (55%), Positives = 31/48 (65%), Gaps = 0/48 (0%)
Query 96 VVSRFTRCGAVAYNGSKYQGGTGLTRRAAEDDAVNRLEGGRIVNWACN 143
+++ FT CGAVA N YQGG G T AA DA+ +L GG I WACN
Sbjct 1 MLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDTWACN 48
>gi|326385281|ref|ZP_08206943.1| hypothetical protein SCNU_20142 [Gordonia neofelifaecis NRRL
B-59395]
gi|326195990|gb|EGD53202.1| hypothetical protein SCNU_20142 [Gordonia neofelifaecis NRRL
B-59395]
Length=134
Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/89 (39%), Positives = 47/89 (53%), Gaps = 2/89 (2%)
Query 57 HYGAIAYAPS-GASGKAWHQRTPARAEQVALEKCGDKTCKVVSRF-TRCGAVAYNGSKYQ 114
+YGAIA +PS GA+G+A + A AL CG C+VV + CGA+A + S +
Sbjct 42 YYGAIALSPSTGATGRALDYPDYSSASNAALSWCGYSDCQVVVQMRNACGAIAKSSSYWG 101
Query 115 GGTGLTRRAAEDDAVNRLEGGRIVNWACN 143
G AE +A+ GG I +WAC
Sbjct 102 YAWGADLYTAESNALYYSGGGYIHDWACT 130
>gi|296164287|ref|ZP_06846873.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295900349|gb|EFG79769.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=61
Score = 51.6 bits (122), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 35/92 (39%), Positives = 44/92 (48%), Gaps = 36/92 (39%)
Query 52 IPPIIHYGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGS 111
+P ++ YGAIAYAPSGA G++W R P +A A NG
Sbjct 6 VPFVMRYGAIAYAPSGAWGRSW--RYPNQA-------------------------AANGY 38
Query 112 KYQGGTGLTRRAAEDDAVNRLEGGRIVNWACN 143
+Q DDA+NRL GG+IVNW CN
Sbjct 39 THQ---------IADDALNRLGGGKIVNWVCN 61
>gi|54025836|ref|YP_120078.1| hypothetical protein nfa38660 [Nocardia farcinica IFM 10152]
gi|54017344|dbj|BAD58714.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=112
Score = 50.8 bits (120), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 31/86 (37%), Positives = 49/86 (57%), Gaps = 0/86 (0%)
Query 57 HYGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGG 116
+YGAIA + SGA G A + + + AEQ A++ CG +VS CG +A + +++
Sbjct 21 YYGAIATSRSGAYGIANNYGSFSDAEQAAVDACGAGCRVLVSWSNGCGVLASSNTQWSAA 80
Query 117 TGLTRRAAEDDAVNRLEGGRIVNWAC 142
+ AA A++RL GG +V+W C
Sbjct 81 ARSSYTAARSAALSRLSGGWVVDWRC 106
>gi|209525998|ref|ZP_03274531.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|209493524|gb|EDZ93846.1| conserved hypothetical protein [Arthrospira maxima CS-328]
Length=126
Score = 44.7 bits (104), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/115 (30%), Positives = 56/115 (49%), Gaps = 10/115 (8%)
Query 38 SMSEVMMSEIAGLPIPPII-----HYGAIAYAPSGASGKAWHQRTP--ARAEQVALEKCG 90
S++ +++ + G+ I+ HYGAIA + + + + P A+A++ ALE CG
Sbjct 6 SLTGLILIALEGITTGAIVAQNRDHYGAIATSTTNPAIWGYSHDYPTLAQAQRYALEYCG 65
Query 91 DKTCKVVSRFTR-CGAVAYNGSKYQGGTGLTRRAAEDDAVNRLEGG--RIVNWAC 142
C++ F CGA+A NGS + R AE ++ G +I WAC
Sbjct 66 QADCQIRVWFKNGCGAIATNGSNIGSAWAVNRAEAEARSIVACGQGDCKIEVWAC 120
>gi|284051993|ref|ZP_06382203.1| hypothetical protein AplaP_11031 [Arthrospira platensis str.
Paraca]
gi|291568823|dbj|BAI91095.1| hypothetical protein [Arthrospira platensis NIES-39]
Length=126
Score = 42.0 bits (97), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 33/115 (29%), Positives = 56/115 (49%), Gaps = 10/115 (8%)
Query 38 SMSEVMMSEIAGLPIPPII-----HYGAIAYAPSGASGKAWHQRTP--ARAEQVALEKCG 90
S++ +++ + G+ I+ +YGAIA + + + + P A+A++ ALE CG
Sbjct 6 SLTGLILIALEGITTGAILAQNRDNYGAIATSTTNPAQWGYSHDYPTLAQAQRYALEYCG 65
Query 91 DKTCKVVSRFTR-CGAVAYNGSKYQGGTGLTRRAAEDDAVNRLEGG--RIVNWAC 142
C++ F CGA+A NGS + R AE ++ G +I WAC
Sbjct 66 QADCQIRVWFKNGCGAIATNGSNIGSAWSVNRAEAEARSIVACGQGDCKIQVWAC 120
>gi|326383688|ref|ZP_08205373.1| hypothetical protein SCNU_12152 [Gordonia neofelifaecis NRRL
B-59395]
gi|326197452|gb|EGD54641.1| hypothetical protein SCNU_12152 [Gordonia neofelifaecis NRRL
B-59395]
Length=298
Score = 40.4 bits (93), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 32/94 (35%), Positives = 48/94 (52%), Gaps = 11/94 (11%)
Query 52 IPPII-------HYGAIAYA-PSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRF-TR 102
+PP + +YG+IA + +G G + + T A A+ KCG TC+ V RF
Sbjct 190 VPPAVPTTSSTTYYGSIAISRTTGDIGYSINNLTEESAVSAAMSKCGASTCETVLRFWNA 249
Query 103 CGAVAYNGSKYQGGTGL--TRRAAEDDAVNRLEG 134
CGAVA + G G TR+ A D A+ +++G
Sbjct 250 CGAVAQSQENLYWGWGWAATRQGAIDTAIGQVKG 283
>gi|54024135|ref|YP_118377.1| hypothetical protein nfa21670 [Nocardia farcinica IFM 10152]
gi|54015643|dbj|BAD57013.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=138
Score = 36.2 bits (82), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/49 (41%), Positives = 27/49 (56%), Gaps = 1/49 (2%)
Query 85 ALEKCGDKTCKVVSRF-TRCGAVAYNGSKYQGGTGLTRRAAEDDAVNRL 132
AL++CG C +V +F CGAVA G++ G TR AE A+ L
Sbjct 62 ALQECGVDNCSIVVQFRNACGAVAVRGNEVAWAGGYTRVEAEQSALAEL 110
>gi|158336921|ref|YP_001518096.1| hypothetical protein AM1_3792 [Acaryochloris marina MBIC11017]
gi|158307162|gb|ABW28779.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length=133
Score = 35.4 bits (80), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 34/130 (27%), Positives = 55/130 (43%), Gaps = 21/130 (16%)
Query 34 LANGSMSEVMMSEIAGLPIPPII----------HYGAIAYAP-SGASGKAWHQRTPARAE 82
+ + +++V+M +P ++ +YGAIAY+ +G+ G ++ T A+
Sbjct 1 MIHSKLAQVLMVTAFSMPTVSLVAVQPASANGNNYGAIAYSTATGSHGYSYDYSTAQAAQ 60
Query 83 QVALEKC----GDKTCKVVSRFTR-CGAVAYNGSKYQG-GTGLTRRAAEDDAVNRLE--- 133
AL C G CK + F CGA+A G G G+ R AE A+
Sbjct 61 NAALRYCENYSGTGDCKSLVVFQNACGALAQTPDNSAGSGWGVDRPTAESFALQSCRQFG 120
Query 134 -GGRIVNWAC 142
+I W C
Sbjct 121 PNCKITRWVC 130
>gi|30424697|ref|NP_780305.1| starch-binding domain-containing protein 1 [Mus musculus]
gi|81876921|sp|Q8C7E7.1|STBD1_MOUSE RecName: Full=Starch-binding domain-containing protein 1; AltName:
Full=Genethonin-1
gi|26341164|dbj|BAC34244.1| unnamed protein product [Mus musculus]
gi|110002551|gb|AAI18662.1| Starch binding domain 1 [Mus musculus]
gi|148673297|gb|EDL05244.1| DNA segment, Chr 5, ERATO Doi 593, expressed, isoform CRA_b [Mus
musculus]
Length=338
Score = 35.0 bits (79), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 21/57 (37%), Positives = 31/57 (55%), Gaps = 2/57 (3%)
Query 80 RAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGLTRRAAEDDAVNRLEGGR 136
RA+ V+ ++ G + +VVSR + G+V GS L +R DD+ N L GGR
Sbjct 174 RAKAVSQDQAGHEDWEVVSRHSSWGSVGLGGSLEASRLSLNQRM--DDSTNSLVGGR 228
>gi|110002651|gb|AAI18616.1| Stbd1 protein [Mus musculus]
Length=279
Score = 35.0 bits (79), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 21/57 (37%), Positives = 31/57 (55%), Gaps = 2/57 (3%)
Query 80 RAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGLTRRAAEDDAVNRLEGGR 136
RA+ V+ ++ G + +VVSR + G+V GS L +R DD+ N L GGR
Sbjct 115 RAKAVSQDQAGHEDWEVVSRHSSWGSVGLGGSLEASRLSLNQRM--DDSTNSLVGGR 169
>gi|190574895|ref|YP_001972740.1| hypothetical protein Smlt2996 [Stenotrophomonas maltophilia K279a]
gi|190012817|emb|CAQ46446.1| conserved hypothetical exported protein [Stenotrophomonas maltophilia
K279a]
Length=172
Score = 34.7 bits (78), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 27/82 (33%), Positives = 44/82 (54%), Gaps = 11/82 (13%)
Query 58 YGAIAYAPSGASGKAWHQRTPARAEQVALEKC---GDKTCKVV-SRFTRCGAVAY----- 108
+GA+A +P G +G A + A AE+ A+E+C G C VV + + +C AV
Sbjct 68 WGAVASSPGGDAGSATGHQAKASAERQAVERCRQGGATDCTVVFTYYNQCYAVVRAARPD 127
Query 109 NGSKYQGGTGLTRRAAEDDAVN 130
NG ++ TG T+ A++ A+
Sbjct 128 NGMRFN--TGATKEQAQERAIK 147
>gi|269120985|ref|YP_003309162.1| hypothetical protein Sterm_2378 [Sebaldella termitidis ATCC 33386]
gi|268614863|gb|ACZ09231.1| hypothetical protein Sterm_2378 [Sebaldella termitidis ATCC 33386]
Length=251
Score = 34.7 bits (78), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 19/45 (43%), Positives = 23/45 (52%), Gaps = 1/45 (2%)
Query 51 PIPPIIHYGAIAYAPSGASG-KAWHQRTPARAEQVALEKCGDKTC 94
P P I +YG IA P S AW+ R AE AL+ CG +C
Sbjct 43 PGPSITYYGGIAINPHTRSFYSAWNYRNGEEAEAAALKGCGGNSC 87
>gi|326381446|ref|ZP_08203140.1| hypothetical protein SCNU_00805 [Gordonia neofelifaecis NRRL
B-59395]
gi|326199693|gb|EGD56873.1| hypothetical protein SCNU_00805 [Gordonia neofelifaecis NRRL
B-59395]
Length=128
Score = 34.3 bits (77), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 30/90 (34%), Positives = 42/90 (47%), Gaps = 4/90 (4%)
Query 57 HYGAIAYAP-SGASGKAWHQRTPARAEQVALEKCGDKTCKVVSR-FTRCGAVAYNGSKYQ 114
+YGAIA + +G + + A A++ A KCG C+ V R + CGA A N +
Sbjct 34 YYGAIAISQRTGRAAVVVNYHDGASAQRAAARKCGAGDCRWVVRMYKNCGAAAQNPRTRR 93
Query 115 GGTGL--TRRAAEDDAVNRLEGGRIVNWAC 142
G T A+ A N GGR + W C
Sbjct 94 WGWAYAPTLNGAKARARNAAGGGRSIVWGC 123
>gi|126653006|ref|ZP_01725146.1| hypothetical protein BB14905_19015 [Bacillus sp. B14905]
gi|126590225|gb|EAZ84348.1| hypothetical protein BB14905_19015 [Bacillus sp. B14905]
Length=519
Score = 34.3 bits (77), Expect = 5.9, Method: Composition-based stats.
Identities = 13/59 (23%), Positives = 29/59 (50%), Gaps = 0/59 (0%)
Query 34 LANGSMSEVMMSEIAGLPIPPIIHYGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDK 92
L+N S+ E+ ++++ + P ++ + Y WHQ T +R Q+ + + D+
Sbjct 447 LSNQSLLEIRDTDLSAVKAPAVVQENSQLYVQKHVQTTTWHQDTTSRVSQIEMVQTNDE 505
>gi|170079327|ref|YP_001735965.1| serine/threonine-protein kinase [Synechococcus sp. PCC 7002]
gi|169886996|gb|ACB00710.1| serine/threonine-protein kinase [Synechococcus sp. PCC 7002]
Length=538
Score = 34.3 bits (77), Expect = 6.2, Method: Composition-based stats.
Identities = 26/81 (33%), Positives = 38/81 (47%), Gaps = 6/81 (7%)
Query 56 IHYGAIAYA-PSGASGKAWHQRTPARAEQVALEKC----GDKTCKVVSRF-TRCGAVAYN 109
+ +GAIA++ +G G T A AEQ A+E C C+ + F CGA+A
Sbjct 439 VFFGAIAFSEATGEYGYVIDVPTQAEAEQAAVEDCEFFAASGDCQALVWFRNACGAIAMG 498
Query 110 GSKYQGGTGLTRRAAEDDAVN 130
Y G G +AE A++
Sbjct 499 PEAYGSGWGADIESAEAAALD 519
>gi|241205439|ref|YP_002976535.1| hypothetical protein Rleg_2733 [Rhizobium leguminosarum bv. trifolii
WSM1325]
gi|240859329|gb|ACS56996.1| hypothetical protein Rleg_2733 [Rhizobium leguminosarum bv. trifolii
WSM1325]
Length=120
Score = 33.9 bits (76), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 35/113 (31%), Positives = 53/113 (47%), Gaps = 11/113 (9%)
Query 40 SEVMMSEIAGLPIPPIIHYGAIAYAPS-GASGKAWHQRTPARAEQVALEKCGD--KTCKV 96
S +++ +AG + YGAIAY+PS A G ++ AE VA C C++
Sbjct 7 SFAVLTSLAGAALADT--YGAIAYSPSTSAIGWSYAHANRGDAETVARRNCDSSANDCRI 64
Query 97 VSRFTR-CGAVAY-NGSKYQGGTGLTRRAAEDDAVN--RLEGG--RIVNWACN 143
F CGAVA + S + G G R A+ A+ R + G ++ W C+
Sbjct 65 AIWFRNGCGAVAVGHRSGWGSGWGYDGREAQRQAIRSCRKQTGSCHVIRWQCS 117
Lambda K H
0.319 0.133 0.404
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 129250525032
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40