BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1269c
Length=124
Score E
Sequences producing significant alignments: (Bits) Value
gi|15840716|ref|NP_335753.1| hypothetical protein MT1307 [Mycoba... 244 3e-63
gi|15608409|ref|NP_215785.1| hypothetical protein Rv1269c [Mycob... 244 4e-63
gi|254231526|ref|ZP_04924853.1| hypothetical protein TBCG_01250 ... 238 3e-61
gi|240173140|ref|ZP_04751798.1| hypothetical protein MkanA1_2775... 166 1e-39
gi|183984124|ref|YP_001852415.1| hypothetical protein MMAR_4153 ... 157 7e-37
gi|118619215|ref|YP_907547.1| hypothetical protein MUL_4016 [Myc... 156 9e-37
gi|289761421|ref|ZP_06520799.1| conserved secreted protein [Myco... 144 5e-33
gi|183981448|ref|YP_001849739.1| hypothetical protein MMAR_1426 ... 103 1e-20
gi|183982546|ref|YP_001850837.1| hypothetical protein MMAR_2533 ... 101 4e-20
gi|294994825|ref|ZP_06800516.1| hypothetical protein Mtub2_10022... 97.4 6e-19
gi|240173458|ref|ZP_04752116.1| hypothetical protein MkanA1_2936... 96.3 1e-18
gi|240173178|ref|ZP_04751836.1| hypothetical protein MkanA1_2794... 94.0 6e-18
gi|296165794|ref|ZP_06848300.1| conserved hypothetical protein [... 90.5 7e-17
gi|15608950|ref|NP_216329.1| hypothetical protein Rv1813c [Mycob... 89.0 2e-16
gi|183984691|ref|YP_001852982.1| hypothetical protein MMAR_4723 ... 83.2 1e-14
gi|240170411|ref|ZP_04749070.1| hypothetical protein MkanA1_1395... 82.0 2e-14
gi|296140994|ref|YP_003648237.1| hypothetical protein Tpau_3313 ... 82.0 2e-14
gi|326385281|ref|ZP_08206943.1| hypothetical protein SCNU_20142 ... 70.1 1e-10
gi|262201902|ref|YP_003273110.1| hypothetical protein Gbro_1963 ... 69.3 2e-10
gi|119489310|ref|ZP_01622117.1| hypothetical protein L8106_07641... 66.6 1e-09
gi|209525998|ref|ZP_03274531.1| conserved hypothetical protein [... 53.1 1e-05
gi|284051993|ref|ZP_06382203.1| hypothetical protein AplaP_11031... 52.8 2e-05
gi|54025836|ref|YP_120078.1| hypothetical protein nfa38660 [Noca... 49.3 2e-04
gi|170079327|ref|YP_001735965.1| serine/threonine-protein kinase... 48.9 2e-04
gi|158336921|ref|YP_001518096.1| hypothetical protein AM1_3792 [... 48.5 3e-04
gi|3366597|gb|AAC28395.1| putative open reading frame [Mycobacte... 47.0 9e-04
gi|326381446|ref|ZP_08203140.1| hypothetical protein SCNU_00805 ... 45.8 0.002
gi|338975179|ref|ZP_08630534.1| hypothetical protein CSIRO_3644 ... 45.4 0.003
gi|54024135|ref|YP_118377.1| hypothetical protein nfa21670 [Noca... 45.4 0.003
gi|146340297|ref|YP_001205345.1| hypothetical protein BRADO3320 ... 45.4 0.003
gi|27379202|ref|NP_770731.1| hypothetical protein bll4091 [Brady... 45.1 0.003
gi|296164287|ref|ZP_06846873.1| conserved hypothetical protein [... 43.9 0.007
gi|90423150|ref|YP_531520.1| hypothetical protein RPC_1639 [Rhod... 43.5 0.012
gi|209884627|ref|YP_002288484.1| hypothetical protein OCAR_5487 ... 42.7 0.020
gi|299131688|ref|ZP_07024883.1| conserved hypothetical protein [... 40.4 0.098
gi|158336328|ref|YP_001517502.1| serine/threonine protein kinase... 39.7 0.14
gi|338974076|ref|ZP_08629438.1| hypothetical protein CSIRO_2527 ... 39.3 0.17
gi|296393827|ref|YP_003658711.1| hypothetical protein Srot_1416 ... 39.3 0.22
gi|146339862|ref|YP_001204910.1| putative signal peptide [Bradyr... 38.9 0.24
gi|241205439|ref|YP_002976535.1| hypothetical protein Rleg_2733 ... 38.9 0.24
gi|84499865|ref|ZP_00998131.1| 30S ribosomal protein S12 [Oceani... 38.9 0.25
gi|54025524|ref|YP_119766.1| hypothetical protein nfa35540 [Noca... 38.5 0.33
gi|148256612|ref|YP_001241197.1| hypothetical protein BBta_5310 ... 38.5 0.36
gi|86750611|ref|YP_487107.1| peptidase C14, caspase catalytic su... 37.7 0.51
gi|269120983|ref|YP_003309160.1| hypothetical protein Sterm_2376... 37.4 0.80
gi|262201100|ref|YP_003272308.1| hypothetical protein Gbro_1107 ... 37.0 0.90
gi|119868707|ref|YP_938659.1| hypothetical protein Mkms_2674 [My... 36.6 1.3
gi|126435238|ref|YP_001070929.1| hypothetical protein Mjls_2658 ... 36.6 1.4
gi|339502098|ref|YP_004689518.1| hypothetical protein RLO149_c00... 36.2 1.7
gi|91976267|ref|YP_568926.1| hypothetical protein RPD_1789 [Rhod... 35.8 2.4
>gi|15840716|ref|NP_335753.1| hypothetical protein MT1307 [Mycobacterium tuberculosis CDC1551]
gi|13880905|gb|AAK45567.1| hypothetical protein MT1307 [Mycobacterium tuberculosis CDC1551]
Length=140
Score = 244 bits (623), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 124/124 (100%), Positives = 124/124 (100%), Gaps = 0/124 (0%)
Query 1 MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRA 60
MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRA
Sbjct 17 MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRA 76
Query 61 AAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDT 120
AAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDT
Sbjct 77 AAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDT 136
Query 121 WACN 124
WACN
Sbjct 137 WACN 140
>gi|15608409|ref|NP_215785.1| hypothetical protein Rv1269c [Mycobacterium tuberculosis H37Rv]
gi|31792461|ref|NP_854954.1| hypothetical protein Mb1300c [Mycobacterium bovis AF2122/97]
gi|121637197|ref|YP_977420.1| hypothetical protein BCG_1328c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
43 more sequence titles
Length=124
Score = 244 bits (622), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 124/124 (100%), Positives = 124/124 (100%), Gaps = 0/124 (0%)
Query 1 MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRA 60
MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRA
Sbjct 1 MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRA 60
Query 61 AAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDT 120
AAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDT
Sbjct 61 AAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDT 120
Query 121 WACN 124
WACN
Sbjct 121 WACN 124
>gi|254231526|ref|ZP_04924853.1| hypothetical protein TBCG_01250 [Mycobacterium tuberculosis C]
gi|298524772|ref|ZP_07012181.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A]
gi|308231800|ref|ZP_07413774.2| conserved secreted protein [Mycobacterium tuberculosis SUMu001]
26 more sequence titles
Length=121
Score = 238 bits (606), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 121/121 (100%), Positives = 121/121 (100%), Gaps = 0/121 (0%)
Query 4 MITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRAAAE 63
MITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRAAAE
Sbjct 1 MITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRAAAE 60
Query 64 ATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDTWAC 123
ATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDTWAC
Sbjct 61 ATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDTWAC 120
Query 124 N 124
N
Sbjct 121 N 121
>gi|240173140|ref|ZP_04751798.1| hypothetical protein MkanA1_27756 [Mycobacterium kansasii ATCC
12478]
Length=132
Score = 166 bits (419), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 92/132 (70%), Positives = 99/132 (75%), Gaps = 8/132 (6%)
Query 1 MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAA--------DVYGAIAYSGNGSWGR 52
MT M+ RRRFAV V +A AT +A A A D YGAIAYS NGSWGR
Sbjct 1 MTMMVPHRRRFAVTVTRLAVVIATIAAIATTAATVAPAPAARATDSYGAIAYSSNGSWGR 60
Query 53 SWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTK 112
SW YPT+AAAEATAVKSCGYSDCKVLTSFTACGAVAA DR Y+GG GP L+AAMKDAL+K
Sbjct 61 SWAYPTKAAAEATAVKSCGYSDCKVLTSFTACGAVAAKDRDYRGGTGPNLSAAMKDALSK 120
Query 113 LGGGYIDTWACN 124
L GGYIDTWACN
Sbjct 121 LDGGYIDTWACN 132
>gi|183984124|ref|YP_001852415.1| hypothetical protein MMAR_4153 [Mycobacterium marinum M]
gi|183177450|gb|ACC42560.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=122
Score = 157 bits (396), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 96/122 (79%), Positives = 108/122 (89%), Gaps = 1/122 (0%)
Query 3 TMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRAAA 62
T + RRR AVAVA VATA ATT+TLAP A+AAD YGAIAYSG+GSWGR+ YPTRAAA
Sbjct 2 TTNSRRRRLAVAVASVATATATTLTLAPV-ADAADQYGAIAYSGDGSWGRASHYPTRAAA 60
Query 63 EATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDTWA 122
EATAVK CGYSDCKVLT+FTACGAVAA+ + ++GGVGPTL+AAMKDAL+KLGGGYIDTWA
Sbjct 61 EATAVKLCGYSDCKVLTTFTACGAVAADGKTFEGGVGPTLSAAMKDALSKLGGGYIDTWA 120
Query 123 CN 124
CN
Sbjct 121 CN 122
>gi|118619215|ref|YP_907547.1| hypothetical protein MUL_4016 [Mycobacterium ulcerans Agy99]
gi|118571325|gb|ABL06076.1| conserved hypothetical secreted protein [Mycobacterium ulcerans
Agy99]
Length=111
Score = 156 bits (395), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 88/112 (79%), Positives = 102/112 (92%), Gaps = 1/112 (0%)
Query 13 VAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGY 72
+A+A VATA ATT+TLAP A+AAD YGAIAYSG+GSWGR+ YPTRAAAEATAVK CGY
Sbjct 1 MAIASVATATATTLTLAPV-ADAADQYGAIAYSGDGSWGRASHYPTRAAAEATAVKLCGY 59
Query 73 SDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDTWACN 124
SDC+VLT+FTACGAVAA+ + ++GGVGPTL+AAMKDAL+KLGGGYIDTWACN
Sbjct 60 SDCRVLTTFTACGAVAADGKTFEGGVGPTLSAAMKDALSKLGGGYIDTWACN 111
>gi|289761421|ref|ZP_06520799.1| conserved secreted protein [Mycobacterium tuberculosis GM 1503]
gi|289708927|gb|EFD72943.1| conserved secreted protein [Mycobacterium tuberculosis GM 1503]
Length=106
Score = 144 bits (363), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 74/81 (92%), Positives = 75/81 (93%), Gaps = 0/81 (0%)
Query 1 MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRA 60
MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRA
Sbjct 1 MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRA 60
Query 61 AAEATAVKSCGYSDCKVLTSF 81
AAEATAVKSCGYS + F
Sbjct 61 AAEATAVKSCGYSRLQGAHQF 81
>gi|183981448|ref|YP_001849739.1| hypothetical protein MMAR_1426 [Mycobacterium marinum M]
gi|183174774|gb|ACC39884.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=136
Score = 103 bits (256), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/134 (48%), Positives = 77/134 (58%), Gaps = 13/134 (9%)
Query 4 MITLRRRFA-VAVAGVATAAATTVTLAPAPANAAD------------VYGAIAYSGNGSW 50
M LRRR A + V A + L+PA A+ D YGAIAY NG
Sbjct 2 MTNLRRRAALIVVTLAAALGLGLLLLSPAGAHLYDDSITGRIVAPITYYGAIAYGPNGVN 61
Query 51 GRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDAL 110
GRSW+ TRA AE++A+K CG CKVL+SF CGAVA + A GGVG T A DA
Sbjct 62 GRSWNNRTRAQAESSALKLCGVEGCKVLSSFVRCGAVAFDGSARHGGVGRTRQMAEDDAR 121
Query 111 TKLGGGYIDTWACN 124
+LGGG+I+TWACN
Sbjct 122 FRLGGGWIETWACN 135
>gi|183982546|ref|YP_001850837.1| hypothetical protein MMAR_2533 [Mycobacterium marinum M]
gi|183175872|gb|ACC40982.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=142
Score = 101 bits (251), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/142 (43%), Positives = 82/142 (58%), Gaps = 22/142 (15%)
Query 5 ITLRRRFAVAVAGV-ATAAATTVTLAPAPANAADV---------------------YGAI 42
+T RRR A+A A V ATA + LA + A++ YGAI
Sbjct 1 MTRRRRIALATATVGATAGLMFIGLALTGSIGANMDRAVMSEMGMLPEGPVPLIVHYGAI 60
Query 43 AYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTL 102
AY+ NG++G++ + +R AE A+K CG CKVL +F CGAVA N+ YQGG G TL
Sbjct 61 AYAPNGAFGKARRFTSRFGAEQAALKQCGLDSCKVLINFNRCGAVAYNNLKYQGGSGWTL 120
Query 103 AAAMKDALTKLGGGYIDTWACN 124
+AA +DA+ +LGGG+I WACN
Sbjct 121 SAAQQDAIDRLGGGWIVNWACN 142
>gi|294994825|ref|ZP_06800516.1| hypothetical protein Mtub2_10022 [Mycobacterium tuberculosis
210]
Length=48
Score = 97.4 bits (241), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 47/48 (98%), Positives = 48/48 (100%), Gaps = 0/48 (0%)
Query 77 VLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDTWACN 124
+LTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDTWACN
Sbjct 1 MLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDTWACN 48
>gi|240173458|ref|ZP_04752116.1| hypothetical protein MkanA1_29366 [Mycobacterium kansasii ATCC
12478]
Length=125
Score = 96.3 bits (238), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/124 (52%), Positives = 80/124 (65%), Gaps = 4/124 (3%)
Query 4 MITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGS-WGRSWDYPTRAAA 62
+ +RR AV+V A +TLAPA A A GAIAYS +G +GR+ P+RAAA
Sbjct 3 LTAFQRRAAVSVTNFVAVTAMILTLAPA-AEADGYVGAIAYSPSGKVFGRTKHAPSRAAA 61
Query 63 EATAVKSCGYSDCKVLTSFTACGAVAANDRA-YQGGVGPTLAAAMKDALTKLG-GGYIDT 120
E+ A+ +CGYSDCKVL +FT CGA+A N R + GG GPTL AA +DA LG G+I T
Sbjct 62 ESAALGACGYSDCKVLVTFTDCGAIAENSRGDHAGGYGPTLLAAEQDAAKNLGTSGWIGT 121
Query 121 WACN 124
W CN
Sbjct 122 WYCN 125
>gi|240173178|ref|ZP_04751836.1| hypothetical protein MkanA1_27946 [Mycobacterium kansasii ATCC
12478]
Length=145
Score = 94.0 bits (232), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 56/143 (40%), Positives = 70/143 (49%), Gaps = 22/143 (15%)
Query 4 MITLRRRFAVAVAGV-ATAAATTVTLAPAPANAADV---------------------YGA 41
M+T RRR + A + AT + L P A + YGA
Sbjct 1 MMTTRRRITLVAATIGATVGLMVLALPLIPPLDAHIDSAVLSEMGMLPEVPVPPRIHYGA 60
Query 42 IAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPT 101
IAY+ G WG+S +Y T A AE A+ CG CKVL +F CGAVA + Y GG G T
Sbjct 61 IAYAPTGEWGKSRNYLTLAKAEEVALDQCGLDTCKVLINFKRCGAVAYDGSTYHGGSGLT 120
Query 102 LAAAMKDALTKLGGGYIDTWACN 124
L+ AM DA+ +LG G I W CN
Sbjct 121 LSDAMADAINRLGAGRIVNWLCN 143
>gi|296165794|ref|ZP_06848300.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295898848|gb|EFG78348.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=141
Score = 90.5 bits (223), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 55/140 (40%), Positives = 71/140 (51%), Gaps = 19/140 (13%)
Query 4 MITLRRRFAVAVAGVATAAATTV------------TLAP-------APANAADVYGAIAY 44
MI R R +AVA V A + TL P P YGA+AY
Sbjct 2 MIKRRYRIGLAVATVGATAGLMIAALPFMPGVGANTLMPETAVMPEGPVPPVIRYGAMAY 61
Query 45 SGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAA 104
+ +G+WGR+ Y TR A A+ CG DCK++ S+ CGAVA + Y GG G T +
Sbjct 62 APSGAWGRTRGYGTRERAIQVALDQCGVKDCKLIVSYQRCGAVAYDGTTYLGGKGVTRSL 121
Query 105 AMKDALTKLGGGYIDTWACN 124
A +DA+ +LGGG I WACN
Sbjct 122 AEEDAINRLGGGRIVNWACN 141
>gi|15608950|ref|NP_216329.1| hypothetical protein Rv1813c [Mycobacterium tuberculosis H37Rv]
gi|15841283|ref|NP_336320.1| hypothetical protein MT1861 [Mycobacterium tuberculosis CDC1551]
gi|31793002|ref|NP_855495.1| hypothetical protein Mb1843c [Mycobacterium bovis AF2122/97]
82 more sequence titles
Length=143
Score = 89.0 bits (219), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/86 (53%), Positives = 56/86 (66%), Gaps = 0/86 (0%)
Query 39 YGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGV 98
YGAIAY+ +G+ G++W T A AE A++ CG CKV++ FT CGAVA N YQGG
Sbjct 58 YGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGT 117
Query 99 GPTLAAAMKDALTKLGGGYIDTWACN 124
G T AA DA+ +L GG I WACN
Sbjct 118 GLTRRAAEDDAVNRLEGGRIVNWACN 143
>gi|183984691|ref|YP_001852982.1| hypothetical protein MMAR_4723 [Mycobacterium marinum M]
gi|183178017|gb|ACC43127.1| conserved hypothetical secreted protein [Mycobacterium marinum
M]
Length=155
Score = 83.2 bits (204), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/86 (47%), Positives = 58/86 (68%), Gaps = 0/86 (0%)
Query 39 YGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGV 98
YGAIA + NG+ G++W + RA AE A+ +CG+ C VL+ FT CGA+A + + + GG+
Sbjct 70 YGAIAVADNGAVGKTWGHRKRAQAEIHALTACGHPSCNVLSVFTRCGAIAHDGQNFHGGL 129
Query 99 GPTLAAAMKDALTKLGGGYIDTWACN 124
G + AA DA +LGGG++ T ACN
Sbjct 130 GRSHQAAGHDAKARLGGGWVLTSACN 155
>gi|240170411|ref|ZP_04749070.1| hypothetical protein MkanA1_13955 [Mycobacterium kansasii ATCC
12478]
Length=125
Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/125 (41%), Positives = 70/125 (56%), Gaps = 15/125 (12%)
Query 15 VAGVATAAATTVT-LAPAPANAAD--------------VYGAIAYSGNGSWGRSWDYPTR 59
++G+A AAA TVT + PA A+ YGAIAY +GS G++W + ++
Sbjct 1 MSGLAVAAAMTVTQIHPAGAHIHRGEMTHVSNMPPFPVYYGAIAYGHDGSNGKAWRHLSK 60
Query 60 AAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYID 119
A A+ A++ CG C V++ FT CGAVA + Y GG G +AA A+ LGGG I
Sbjct 61 AQAKHRALELCGIDTCTVVSVFTRCGAVAHDGAKYHGGYGYNRSAAEAHAMANLGGGRIV 120
Query 120 TWACN 124
WACN
Sbjct 121 DWACN 125
>gi|296140994|ref|YP_003648237.1| hypothetical protein Tpau_3313 [Tsukamurella paurometabola DSM
20162]
gi|296029128|gb|ADG79898.1| conserved putative secreted protein [Tsukamurella paurometabola
DSM 20162]
Length=126
Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/125 (46%), Positives = 71/125 (57%), Gaps = 12/125 (9%)
Query 5 ITLRRRFA---VAVAGVATAAATTVTLAPAPANAAD--VYGAIAYSGNGSWGRSWDYPTR 59
I+ +R A +AV G+AT +T +AP +NA YGAIA NG+WGR+ DY R
Sbjct 3 ISRSKRIAAGGLAVIGIATGLST---VAPVASNAEPGVFYGAIAVGSNGAWGRALDYGNR 59
Query 60 AAAEATAVKSCGYSDCKVLTSF-TACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYI 118
A AE A+ C +C++L SF CGAVA +Y G VG TL A AL GGYI
Sbjct 60 ATAERVALSYCD-GNCRILASFVNGCGAVAKTRTSYWGNVGDTLGVAQNRALRN--GGYI 116
Query 119 DTWAC 123
TWAC
Sbjct 117 YTWAC 121
>gi|326385281|ref|ZP_08206943.1| hypothetical protein SCNU_20142 [Gordonia neofelifaecis NRRL
B-59395]
gi|326195990|gb|EGD53202.1| hypothetical protein SCNU_20142 [Gordonia neofelifaecis NRRL
B-59395]
Length=134
Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/124 (45%), Positives = 72/124 (59%), Gaps = 5/124 (4%)
Query 5 ITLRRRFAVA---VAGVATAAATTVTLAPAPANAADVYGAIAYS-GNGSWGRSWDYPTRA 60
I++R+R A A AGV A +APAPA+AA+ YGAIA S G+ GR+ DYP +
Sbjct 6 ISMRKRLATAALVTAGVGAALTVPSVVAPAPAHAANYYGAIALSPSTGATGRALDYPDYS 65
Query 61 AAEATAVKSCGYSDCKVLTSF-TACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYID 119
+A A+ CGYSDC+V+ ACGA+A + + G L A +AL GGGYI
Sbjct 66 SASNAALSWCGYSDCQVVVQMRNACGAIAKSSSYWGYAWGADLYTAESNALYYSGGGYIH 125
Query 120 TWAC 123
WAC
Sbjct 126 DWAC 129
>gi|262201902|ref|YP_003273110.1| hypothetical protein Gbro_1963 [Gordonia bronchialis DSM 43247]
gi|262085249|gb|ACY21217.1| hypothetical protein Gbro_1963 [Gordonia bronchialis DSM 43247]
Length=131
Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/126 (40%), Positives = 72/126 (58%), Gaps = 7/126 (5%)
Query 5 ITLRRRF---AVAVAGVATAAATTVTLAPAPANA-ADVYGAIAYSGNGSW-GRSWDYPTR 59
++LR+R A+ +AGV A PAPA+A YGA+A S + + GR+ DY +
Sbjct 1 MSLRKRLSILALTLAGVVGALILPSVSEPAPAHAYGYYYGALALSTSERYVGRALDYDSY 60
Query 60 AAAEATAVKSCGYSDCKVLTSF-TACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGG-Y 117
A A A+++CGY+DCKV+T F CGA+A + + G G +L +A +AL G G
Sbjct 61 AEASQAALRACGYADCKVVTRFANGCGAIAESPSYWGFGNGSSLYSAQSEALYYSGSGAE 120
Query 118 IDTWAC 123
I WAC
Sbjct 121 IVYWAC 126
>gi|119489310|ref|ZP_01622117.1| hypothetical protein L8106_07641 [Lyngbya sp. PCC 8106]
gi|119454784|gb|EAW35929.1| hypothetical protein L8106_07641 [Lyngbya sp. PCC 8106]
Length=123
Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/107 (43%), Positives = 61/107 (58%), Gaps = 6/107 (5%)
Query 23 ATTVTLAP-APANAADVYGAIAYSGNGS-WGRSWDYPTRAAAEATAVKSCGYSDCKVLTS 80
AT + ++P A A +D YGAIA + +G WG ++DYP+R AE A++ CG S+C+V
Sbjct 13 ATEILVSPIASAQPSDSYGAIAITPDGQVWGYAYDYPSREQAEQRALEECGESNCQVQVW 72
Query 81 F-TACGAVAANDRAYQG-GVGPTLAAAMKDALTKLGGG--YIDTWAC 123
F ACGAVA N+ G T A A+ G G I+TWAC
Sbjct 73 FKNACGAVAKNEEGKLGWAWADTRKQAEASAVAACGTGTCRIETWAC 119
>gi|209525998|ref|ZP_03274531.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|209493524|gb|EDZ93846.1| conserved hypothetical protein [Arthrospira maxima CS-328]
Length=126
Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/128 (33%), Positives = 57/128 (45%), Gaps = 13/128 (10%)
Query 1 MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGS--WGRSWDYPT 58
M ++L +A+ G+ T A A D YGAIA S WG S DYPT
Sbjct 1 MIKCLSLTGLILIALEGITTGAIV--------AQNRDHYGAIATSTTNPAIWGYSHDYPT 52
Query 59 RAAAEATAVKSCGYSDCKVLTSF-TACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGG- 116
A A+ A++ CG +DC++ F CGA+A N A A ++ G G
Sbjct 53 LAQAQRYALEYCGQADCQIRVWFKNGCGAIATNGSNIGSAWAVNRAEAEARSIVACGQGD 112
Query 117 -YIDTWAC 123
I+ WAC
Sbjct 113 CKIEVWAC 120
>gi|284051993|ref|ZP_06382203.1| hypothetical protein AplaP_11031 [Arthrospira platensis str.
Paraca]
gi|291568823|dbj|BAI91095.1| hypothetical protein [Arthrospira platensis NIES-39]
Length=126
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/128 (33%), Positives = 56/128 (44%), Gaps = 13/128 (10%)
Query 1 MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNG--SWGRSWDYPT 58
M ++L +A+ G+ T A A D YGAIA S WG S DYPT
Sbjct 1 MIKCLSLTGLILIALEGITTGAIL--------AQNRDNYGAIATSTTNPAQWGYSHDYPT 52
Query 59 RAAAEATAVKSCGYSDCKVLTSF-TACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGG- 116
A A+ A++ CG +DC++ F CGA+A N A A ++ G G
Sbjct 53 LAQAQRYALEYCGQADCQIRVWFKNGCGAIATNGSNIGSAWSVNRAEAEARSIVACGQGD 112
Query 117 -YIDTWAC 123
I WAC
Sbjct 113 CKIQVWAC 120
>gi|54025836|ref|YP_120078.1| hypothetical protein nfa38660 [Nocardia farcinica IFM 10152]
gi|54017344|dbj|BAD58714.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=112
Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/86 (37%), Positives = 53/86 (62%), Gaps = 2/86 (2%)
Query 39 YGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFT-ACGAVAANDRAYQGG 97
YGAIA S +G++G + +Y + + AE AV +CG + C+VL S++ CG +A+++ +
Sbjct 22 YGAIATSRSGAYGIANNYGSFSDAEQAAVDACG-AGCRVLVSWSNGCGVLASSNTQWSAA 80
Query 98 VGPTLAAAMKDALTKLGGGYIDTWAC 123
+ AA AL++L GG++ W C
Sbjct 81 ARSSYTAARSAALSRLSGGWVVDWRC 106
>gi|170079327|ref|YP_001735965.1| serine/threonine-protein kinase [Synechococcus sp. PCC 7002]
gi|169886996|gb|ACB00710.1| serine/threonine-protein kinase [Synechococcus sp. PCC 7002]
Length=538
Score = 48.9 bits (115), Expect = 2e-04, Method: Composition-based stats.
Identities = 32/78 (42%), Positives = 42/78 (54%), Gaps = 6/78 (7%)
Query 39 YGAIAYS-GNGSWGRSWDYPTRAAAEATAVKSCGY----SDCKVLTSF-TACGAVAANDR 92
+GAIA+S G +G D PT+A AE AV+ C + DC+ L F ACGA+A
Sbjct 441 FGAIAFSEATGEYGYVIDVPTQAEAEQAAVEDCEFFAASGDCQALVWFRNACGAIAMGPE 500
Query 93 AYQGGVGPTLAAAMKDAL 110
AY G G + +A AL
Sbjct 501 AYGSGWGADIESAEAAAL 518
>gi|158336921|ref|YP_001518096.1| hypothetical protein AM1_3792 [Acaryochloris marina MBIC11017]
gi|158307162|gb|ABW28779.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length=133
Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/91 (41%), Positives = 48/91 (53%), Gaps = 7/91 (7%)
Query 4 MITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYS-GNGSWGRSWDYPTRAAA 62
MI + + V + + V + PA AN + YGAIAYS GS G S+DY T AA
Sbjct 1 MIHSKLAQVLMVTAFSMPTVSLVAVQPASANGNN-YGAIAYSTATGSHGYSYDYSTAQAA 59
Query 63 EATAVKSC----GYSDCKVLTSF-TACGAVA 88
+ A++ C G DCK L F ACGA+A
Sbjct 60 QNAALRYCENYSGTGDCKSLVVFQNACGALA 90
>gi|3366597|gb|AAC28395.1| putative open reading frame [Mycobacterium tuberculosis]
Length=124
Score = 47.0 bits (110), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 21/44 (48%), Positives = 30/44 (69%), Gaps = 0/44 (0%)
Query 39 YGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFT 82
YGAIAY+ +G+ G++W T A AE A++ CG CKV++ FT
Sbjct 71 YGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRFT 114
>gi|326381446|ref|ZP_08203140.1| hypothetical protein SCNU_00805 [Gordonia neofelifaecis NRRL
B-59395]
gi|326199693|gb|EGD56873.1| hypothetical protein SCNU_00805 [Gordonia neofelifaecis NRRL
B-59395]
Length=128
Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/123 (37%), Positives = 57/123 (47%), Gaps = 4/123 (3%)
Query 5 ITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGN-GSWGRSWDYPTRAAAE 63
+ L + AVA A +AT TT + P A A YGAIA S G +Y A+A+
Sbjct 1 MKLPLKLAVAGAVLATGMTTTAVVVPEAAAATPYYGAIAISQRTGRAAVVVNYHDGASAQ 60
Query 64 ATAVKSCGYSDCK-VLTSFTACGAVAANDRAYQGG--VGPTLAAAMKDALTKLGGGYIDT 120
A + CG DC+ V+ + CGA A N R + G PTL A A GGG
Sbjct 61 RAAARKCGAGDCRWVVRMYKNCGAAAQNPRTRRWGWAYAPTLNGAKARARNAAGGGRSIV 120
Query 121 WAC 123
W C
Sbjct 121 WGC 123
>gi|338975179|ref|ZP_08630534.1| hypothetical protein CSIRO_3644 [Bradyrhizobiaceae bacterium
SG-6C]
gi|338231778|gb|EGP06913.1| hypothetical protein CSIRO_3644 [Bradyrhizobiaceae bacterium
SG-6C]
Length=128
Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/131 (33%), Positives = 68/131 (52%), Gaps = 19/131 (14%)
Query 3 TMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRAAA 62
T+ LR F VA+ +A AT + PA+AA GA+A G++G+++DYP +A
Sbjct 5 TVARLRAFFLVALV---SAFATHFAV---PADAA---GALAIGKCGAYGQAFDYPAEQSA 55
Query 63 EATAVKSCGYSDCKVLTSFTACGAVAANDR----AYQGGVGPTLAAAMKDALTK---LGG 115
A+K C +C +T AC A++ + A+ V P +++A+ +A+ K GG
Sbjct 56 RVAALKQCK-GECSAVTMKRACVALSLDMTKPCGAHGYAVAPRISSALNEAMKKCYAFGG 114
Query 116 G--YIDTWACN 124
I WAC+
Sbjct 115 KECVIRAWACD 125
>gi|54024135|ref|YP_118377.1| hypothetical protein nfa21670 [Nocardia farcinica IFM 10152]
gi|54015643|dbj|BAD57013.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=138
Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/106 (35%), Positives = 53/106 (50%), Gaps = 8/106 (7%)
Query 15 VAGVATAAATTVTLAPAPANAADVYGAIAYS---GNGSWGRSWDYPTRAAAEATAVKSCG 71
V G+ +A TV A A D YG++A G G +YP + A+ A++ CG
Sbjct 8 VLGLIAGSAITVLGAGTALAAEDYYGSLALGLEPGAIIVGSGVNYPDQEGADVRALQECG 67
Query 72 YSDCKVLTSF-TACGAVAA--NDRAYQGGVGPTLAAAMKDALTKLG 114
+C ++ F ACGAVA N+ A+ GG T A + AL +LG
Sbjct 68 VDNCSIVVQFRNACGAVAVRGNEVAWAGGY--TRVEAEQSALAELG 111
>gi|146340297|ref|YP_001205345.1| hypothetical protein BRADO3320 [Bradyrhizobium sp. ORS 278]
gi|146193103|emb|CAL77114.1| hypothetical protein; putative Peptidase C14, Caspase domain
[Bradyrhizobium sp. ORS 278]
Length=473
Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/79 (37%), Positives = 43/79 (55%), Gaps = 3/79 (3%)
Query 39 YGAIAYS-GNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSF--TACGAVAANDRAYQ 95
+G+I Y N WG SW++ +R AA A +CG + C V SF + CGA A +D+++
Sbjct 376 WGSIVYGVDNNKWGMSWNHASRRAAVTDARSNCGSAKCPVELSFYGSHCGAFAISDKSWS 435
Query 96 GGVGPTLAAAMKDALTKLG 114
T+ AA AL + G
Sbjct 436 LVQRDTVKAARDAALDECG 454
>gi|27379202|ref|NP_770731.1| hypothetical protein bll4091 [Bradyrhizobium japonicum USDA 110]
gi|27352353|dbj|BAC49356.1| bll4091 [Bradyrhizobium japonicum USDA 110]
Length=592
Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 33/98 (34%), Positives = 53/98 (55%), Gaps = 7/98 (7%)
Query 1 MTTMIT-LRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGS-WGRSWDYPT 58
+T IT L++R ++A G AT + L + +G+I Y +GS WG SW++ +
Sbjct 459 LTRSITKLQQRISMAPTGEATEG---LLLRMRKMDDLKPWGSIVYGPDGSKWGMSWNHAS 515
Query 59 RAAAEATAVKSCGYSDCKVLTSF--TACGAVAANDRAY 94
R AA A A +C + C + SF CGA A +D+++
Sbjct 516 RRAAVADARGNCAGAKCPIELSFYGKGCGAFAISDKSW 553
>gi|296164287|ref|ZP_06846873.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
gi|295900349|gb|EFG79769.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
ATCC BAA-614]
Length=61
Score = 43.9 bits (102), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/86 (35%), Positives = 36/86 (42%), Gaps = 36/86 (41%)
Query 39 YGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAYQGGV 98
YGAIAY+ +G+WGRSW YP +AAA GY+
Sbjct 12 YGAIAYAPSGAWGRSWRYPNQAAAN-------GYTH------------------------ 40
Query 99 GPTLAAAMKDALTKLGGGYIDTWACN 124
DAL +LGGG I W CN
Sbjct 41 -----QIADDALNRLGGGKIVNWVCN 61
>gi|90423150|ref|YP_531520.1| hypothetical protein RPC_1639 [Rhodopseudomonas palustris BisB18]
gi|90105164|gb|ABD87201.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18]
Length=128
Score = 43.5 bits (101), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 40/116 (35%), Positives = 59/116 (51%), Gaps = 15/116 (12%)
Query 18 VATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKV 77
VA A + + PAPA GA A G++G+++DYP AAA A A K C DC
Sbjct 16 VALLIAQLMLVTPAPAA-----GAFAVGKCGAYGQAFDYPVEAAAIAAARKQC-KGDCTT 69
Query 78 LTSFTACGAVAANDR----AYQGGVGPTLAAAMKDALTK---LGGG--YIDTWACN 124
LT AC A++ + + A+ V P +++++ +A K GG I WAC+
Sbjct 70 LTMRRACAALSIDLKNPCGAHGYAVKPKISSSLNEATRKCYEFGGKECVIRAWACD 125
>gi|209884627|ref|YP_002288484.1| hypothetical protein OCAR_5487 [Oligotropha carboxidovorans OM5]
gi|337741710|ref|YP_004633438.1| hypothetical protein OCA5_c25020 [Oligotropha carboxidovorans
OM5]
gi|209872823|gb|ACI92619.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5]
gi|336095794|gb|AEI03620.1| hypothetical protein OCA4_c25010 [Oligotropha carboxidovorans
OM4]
gi|336099374|gb|AEI07197.1| hypothetical protein OCA5_c25020 [Oligotropha carboxidovorans
OM5]
Length=126
Score = 42.7 bits (99), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 43/133 (33%), Positives = 66/133 (50%), Gaps = 19/133 (14%)
Query 1 MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRA 60
M + +TLR AG+ AA + PA + AA GA+A G++G+++D+ +
Sbjct 1 MRSTLTLR------FAGLFLAALCFSGVLPASSYAA---GALAVGKCGAYGQAFDHARQE 51
Query 61 AAEATAVKSCGYSDCKVLTSFTACGAVAANDR----AYQGGVGPTLAAAMKDALTKL--- 113
AA A+A K C DC V+T AC A + + A+ V P +++A+ A K
Sbjct 52 AAIASARKQCK-GDCTVVTMRRACAAFVVDMKNPCGAFGYAVEPRISSALNAASKKCYAY 110
Query 114 GGG--YIDTWACN 124
GG I WAC+
Sbjct 111 GGQECVIRAWACD 123
>gi|299131688|ref|ZP_07024883.1| conserved hypothetical protein [Afipia sp. 1NLS2]
gi|298591825|gb|EFI52025.1| conserved hypothetical protein [Afipia sp. 1NLS2]
Length=126
Score = 40.4 bits (93), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 40/119 (34%), Positives = 61/119 (52%), Gaps = 13/119 (10%)
Query 15 VAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSD 74
+AG+ AA + AP+ AA GA+A G++G+++ Y + AA A A K C D
Sbjct 9 LAGLFFAAILICGVFAAPSYAA---GALAVGKCGAYGQAFGYKHQNAAIAAARKQCK-GD 64
Query 75 CKVLTSFTACGAVAANDR----AYQGGVGPTLAAAMKDALTKL---GGG--YIDTWACN 124
C V+T AC A+A + + A+ V P +++A+ A K GG I WAC+
Sbjct 65 CTVITMRRACAALAIDMKNPCGAFGYAVEPRISSALNAASKKCYAYGGKECVIRAWACD 123
>gi|158336328|ref|YP_001517502.1| serine/threonine protein kinase [Acaryochloris marina MBIC11017]
gi|158306569|gb|ABW28186.1| serine/threonine protein kinase [Acaryochloris marina MBIC11017]
Length=509
Score = 39.7 bits (91), Expect = 0.14, Method: Composition-based stats.
Identities = 30/95 (32%), Positives = 47/95 (50%), Gaps = 12/95 (12%)
Query 21 AAATTVTLAPAPANAADVYGAIAYS-GNGSWGRSWDYPTRAAAEATAVKSCGY----SDC 75
+ + T +P P + AIAYS G++G YP++AAAE A++ C DC
Sbjct 390 SPSVTPEASPPPTTQ---FAAIAYSIPTGTYGYGLRYPSQAAAEQRALQECNQRVPAQDC 446
Query 76 KVLTSF-TACGAVAANDRAYQGGVGPTLAAAMKDA 109
+ L F ACG++ RA G G + +++A
Sbjct 447 QALFWFKNACGSLV---RATSGAYGWAWSETIQEA 478
>gi|338974076|ref|ZP_08629438.1| hypothetical protein CSIRO_2527 [Bradyrhizobiaceae bacterium
SG-6C]
gi|338232803|gb|EGP07931.1| hypothetical protein CSIRO_2527 [Bradyrhizobiaceae bacterium
SG-6C]
Length=190
Score = 39.3 bits (90), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 25/66 (38%), Positives = 38/66 (58%), Gaps = 5/66 (7%)
Query 27 TLAPA-PANAADVYGAIAYSGNGSWGRSWDYPTRAAAEATAVKSC---GYSDCKVLTSFT 82
+L P PA+ ++ + AI ++ G+WG W +R AEA A+ C G CK+ +S T
Sbjct 69 SLKPKTPASPSEQWAAIGFAARGAWGAVWQKTSRDEAEARALTLCVNNGGRGCKI-SSTT 127
Query 83 ACGAVA 88
CGA+A
Sbjct 128 QCGAMA 133
>gi|296393827|ref|YP_003658711.1| hypothetical protein Srot_1416 [Segniliparus rotundus DSM 44985]
gi|296180974|gb|ADG97880.1| conserved hypothetical protein [Segniliparus rotundus DSM 44985]
Length=122
Score = 39.3 bits (90), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 30/79 (38%), Positives = 45/79 (57%), Gaps = 3/79 (3%)
Query 26 VTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACG 85
+T+ P A A+ +GA+AY G G+ G S +YP ++A+ A SCG + C F CG
Sbjct 15 LTVCPESA-ASGSWGAVAYDGQGNAGSSRNYPDPSSAKGAAEHSCG-ARCGA-KPFRHCG 71
Query 86 AVAANDRAYQGGVGPTLAA 104
AVA ++ +G G +AA
Sbjct 72 AVAFTRQSNRGRHGYGVAA 90
>gi|146339862|ref|YP_001204910.1| putative signal peptide [Bradyrhizobium sp. ORS 278]
gi|146192668|emb|CAL76673.1| conserved hypothetical protein; putative signal peptide [Bradyrhizobium
sp. ORS 278]
Length=115
Score = 38.9 bits (89), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 38/119 (32%), Positives = 57/119 (48%), Gaps = 17/119 (14%)
Query 15 VAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSD 74
+ G A A T ++ PA+AA GA A ++G+++DYP A+A A+K C +
Sbjct 2 IVGAARVATTVIS----PAHAA---GAFAIGQCAAYGQAFDYPAEHLAKAAALKQCK-GE 53
Query 75 CKVLTSFTACGAVAA---NDRAYQG-GVGPTLAAAMKDA---LTKLGGG--YIDTWACN 124
C +T AC A A N G V P ++A++ +A K GG I W C+
Sbjct 54 CTAVTMKKACAAFAVDLTNPCGPHGYAVRPKISASLNEATRECYKYGGKECVIRAWVCD 112
>gi|241205439|ref|YP_002976535.1| hypothetical protein Rleg_2733 [Rhizobium leguminosarum bv. trifolii
WSM1325]
gi|240859329|gb|ACS56996.1| hypothetical protein Rleg_2733 [Rhizobium leguminosarum bv. trifolii
WSM1325]
Length=120
Score = 38.9 bits (89), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 39/112 (35%), Positives = 55/112 (50%), Gaps = 9/112 (8%)
Query 22 AATTVTLAPAPANAADVYGAIAYSGNGS-WGRSWDYPTRAAAEATAVKSCGYS--DCKVL 78
A+ V + A A AD YGAIAYS + S G S+ + R AE A ++C S DC++
Sbjct 6 ASFAVLTSLAGAALADTYGAIAYSPSTSAIGWSYAHANRGDAETVARRNCDSSANDCRIA 65
Query 79 TSF-TACGAVAANDRAYQG-GVGPTLAAAMKDAL----TKLGGGYIDTWACN 124
F CGAVA R+ G G G A + A+ + G ++ W C+
Sbjct 66 IWFRNGCGAVAVGHRSGWGSGWGYDGREAQRQAIRSCRKQTGSCHVIRWQCS 117
>gi|84499865|ref|ZP_00998131.1| 30S ribosomal protein S12 [Oceanicola batsensis HTCC2597]
gi|84391799|gb|EAQ04067.1| 30S ribosomal protein S12 [Oceanicola batsensis HTCC2597]
Length=129
Score = 38.9 bits (89), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 24/65 (37%), Positives = 37/65 (57%), Gaps = 5/65 (7%)
Query 18 VATAAATTVTLAPAPAN-AADVYGAIAYS-GNGSWGRSWDYPTRAAAEATAVKSC---GY 72
+ A + + L AP + +A+ YGAIAYS + +GR+W+Y +R AE A+ C G
Sbjct 2 IRVALISVLALVAAPVSVSAETYGAIAYSFQSDKYGRAWNYGSRQEAEKAAIGFCKAAGG 61
Query 73 SDCKV 77
CK+
Sbjct 62 KRCKI 66
>gi|54025524|ref|YP_119766.1| hypothetical protein nfa35540 [Nocardia farcinica IFM 10152]
gi|54017032|dbj|BAD58402.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length=151
Score = 38.5 bits (88), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 33/84 (40%), Positives = 44/84 (53%), Gaps = 13/84 (15%)
Query 39 YGAIAYSGNGSWGR---SWDYPTRAAAEATAVKSCGYSDCK-VLTSFTACGAVAANDRAY 94
YG +AYS S GR + +P+ AA+A A++ CG DC VL ACGA+A R
Sbjct 43 YGTLAYS--PSTGRAVAAVGHPSPVAADAAAIRECGVYDCDLVLRLVDACGAIA---RGA 97
Query 95 QGGVG----PTLAAAMKDALTKLG 114
G G P+ A A + A+ LG
Sbjct 98 DGRFGWAAAPSRAEAEQAAVASLG 121
>gi|148256612|ref|YP_001241197.1| hypothetical protein BBta_5310 [Bradyrhizobium sp. BTAi1]
gi|146408785|gb|ABQ37291.1| putative exported protein of unknown function [Bradyrhizobium
sp. BTAi1]
Length=116
Score = 38.5 bits (88), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 35/103 (34%), Positives = 51/103 (50%), Gaps = 13/103 (12%)
Query 31 APANAADVYGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAA- 89
+PA+AA GA A ++G+++DYP A+A A+K C +C +T AC A A
Sbjct 15 SPAHAA---GAFAIGRCAAFGQAFDYPAEHLAKAAALKQCK-GECTAVTMKKACAAFAVD 70
Query 90 --NDRAYQG-GVGPTLAAAMKDALT---KLGGG--YIDTWACN 124
N G V P ++A++ DA K GG I W C+
Sbjct 71 LTNPCGPHGYAVRPKISASLNDATRECYKYGGKECVIRAWVCD 113
>gi|86750611|ref|YP_487107.1| peptidase C14, caspase catalytic subunit p20 [Rhodopseudomonas
palustris HaA2]
gi|86573639|gb|ABD08196.1| Peptidase C14, caspase catalytic subunit p20 [Rhodopseudomonas
palustris HaA2]
Length=445
Score = 37.7 bits (86), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 22/53 (42%), Positives = 29/53 (55%), Gaps = 3/53 (5%)
Query 39 YGAIAYS-GNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSF--TACGAVA 88
+G+I Y N WG SW++ +R AA + A C + CKV SF CGA A
Sbjct 348 WGSIVYDPDNEKWGMSWNHASRKAAVSDAGAKCSGAPCKVELSFYGQRCGAFA 400
>gi|269120983|ref|YP_003309160.1| hypothetical protein Sterm_2376 [Sebaldella termitidis ATCC 33386]
gi|268614861|gb|ACZ09229.1| hypothetical protein Sterm_2376 [Sebaldella termitidis ATCC 33386]
Length=220
Score = 37.4 bits (85), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 25/77 (33%), Positives = 40/77 (52%), Gaps = 3/77 (3%)
Query 39 YGAIAYS-GNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVL--TSFTACGAVAANDRAYQ 95
+G+++Y+ G WG+SW++ TR A A + KSCG +C L S A A++
Sbjct 121 WGSLSYNIKTGKWGKSWNHYTRREAVAASEKSCGSPECASLGFQSTYAVMAMSPGKELIV 180
Query 96 GGVGPTLAAAMKDALTK 112
G + A KDA+ +
Sbjct 181 GLSNKNMKDAGKDAIKR 197
>gi|262201100|ref|YP_003272308.1| hypothetical protein Gbro_1107 [Gordonia bronchialis DSM 43247]
gi|262084447|gb|ACY20415.1| conserved hypothetical protein [Gordonia bronchialis DSM 43247]
Length=126
Score = 37.0 bits (84), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 30/90 (34%), Positives = 44/90 (49%), Gaps = 5/90 (5%)
Query 38 VYGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVA-ANDRAYQG 96
+G IA+ +G + DYP++ +A A CG S C + +CGA+A NDR +
Sbjct 33 NWGVIAFDNSGRTSYAVDYPSQRSAINAAKARCG-SHCGYFAFYNSCGAIAYTNDRRHWH 91
Query 97 GVG--PTLAAAMKDALTKLG-GGYIDTWAC 123
T AA A ++L GY+ WAC
Sbjct 92 WASGYRTPQAAQNAARSELPYRGYVARWAC 121
>gi|119868707|ref|YP_938659.1| hypothetical protein Mkms_2674 [Mycobacterium sp. KMS]
gi|119694796|gb|ABL91869.1| hypothetical protein Mkms_2674 [Mycobacterium sp. KMS]
Length=125
Score = 36.6 bits (83), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/110 (35%), Positives = 51/110 (47%), Gaps = 6/110 (5%)
Query 4 MITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGN-GSWGRSWDYPTRAAA 62
M+ LRR ++ T AP A +GAIA S + G G + Y + A
Sbjct 1 MVALRRLLVTSLVAGCLGGGAAFT---APTAGALGWGAIAVSPDTGKVGYTKGYNSAIEA 57
Query 63 EATAVKSCGYSDCKVLTSFT-ACGAVA-ANDRAYQGGVGPTLAAAMKDAL 110
E A+ C DC+ + +FT CGAVA A D + GVG A A + AL
Sbjct 58 ERAAIGLCSARDCRAVVNFTKGCGAVAQAPDAWWAPGVGRDSAGAQRMAL 107
>gi|126435238|ref|YP_001070929.1| hypothetical protein Mjls_2658 [Mycobacterium sp. JLS]
gi|126235038|gb|ABN98438.1| hypothetical protein Mjls_2658 [Mycobacterium sp. JLS]
Length=133
Score = 36.6 bits (83), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/110 (35%), Positives = 51/110 (47%), Gaps = 6/110 (5%)
Query 4 MITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGN-GSWGRSWDYPTRAAA 62
M+ LRR ++ T AP A +GAIA S + G G + Y + A
Sbjct 9 MVALRRLLVTSLVAGCLGGGAAFT---APTAGALGWGAIAVSPDTGKVGYTKGYNSAIEA 65
Query 63 EATAVKSCGYSDCKVLTSFT-ACGAVA-ANDRAYQGGVGPTLAAAMKDAL 110
E A+ C DC+ + +FT CGAVA A D + GVG A A + AL
Sbjct 66 ERAAIGLCSARDCRAVVNFTKGCGAVAQAPDAWWAPGVGRDSAGAQRMAL 115
>gi|339502098|ref|YP_004689518.1| hypothetical protein RLO149_c005270 [Roseobacter litoralis Och
149]
gi|338756091|gb|AEI92555.1| hypothetical protein RLO149_c005270 [Roseobacter litoralis Och
149]
Length=209
Score = 36.2 bits (82), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 21/45 (47%), Positives = 24/45 (54%), Gaps = 0/45 (0%)
Query 39 YGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTA 83
YGA A S GSWG SW + R+ A ATA+ C S VL A
Sbjct 142 YGAFAISRVGSWGYSWSFDERSEALATALLQCEASTKGVLADLNA 186
>gi|91976267|ref|YP_568926.1| hypothetical protein RPD_1789 [Rhodopseudomonas palustris BisB5]
gi|91682723|gb|ABE39025.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5]
Length=125
Score = 35.8 bits (81), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 34/99 (35%), Positives = 51/99 (52%), Gaps = 10/99 (10%)
Query 35 AADVYGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTACGAVAANDRAY 94
+A+ GA A G++G+++DYP AAA A A K C DC +T AC A+A +
Sbjct 25 SAEAAGAFAVGKCGAYGKAFDYPAEAAAIAAARKQC-RGDCTTITMRRACAALAIDLLNP 83
Query 95 QG----GVGPTLAAAMKDALTKL---GGG--YIDTWACN 124
G V P +++++ +A K GG I WAC+
Sbjct 84 CGPHGYAVEPKISSSLNEATRKCYEYGGKECVIRAWACD 122
Lambda K H
0.318 0.127 0.394
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 130872486112
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40