BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv2019
Length=138
Score E
Sequences producing significant alignments: (Bits) Value
gi|15609156|ref|NP_216535.1| hypothetical protein Rv2019 [Mycoba... 283 5e-75
gi|289574696|ref|ZP_06454923.1| conserved hypothetical protein [... 280 4e-74
gi|340627031|ref|YP_004745483.1| hypothetical protein MCAN_20421... 257 3e-67
gi|148823235|ref|YP_001287989.1| hypothetical protein TBFG_12054... 239 1e-61
gi|253798926|ref|YP_003031927.1| hypothetical protein TBMG_01964... 208 2e-52
gi|119716256|ref|YP_923221.1| hypothetical protein Noca_2024 [No... 80.9 5e-14
gi|254416801|ref|ZP_05030550.1| hypothetical protein MC7420_1576... 62.4 2e-08
gi|307152992|ref|YP_003888376.1| hypothetical protein Cyan7822_3... 60.8 6e-08
gi|120406340|ref|YP_956169.1| hypothetical protein Mvan_5392 [My... 57.0 8e-07
gi|75909457|ref|YP_323753.1| hypothetical protein Ava_3250 [Anab... 53.5 1e-05
gi|269976998|ref|ZP_06183972.1| putative toxin-antitoxin system,... 53.5 1e-05
gi|257060983|ref|YP_003138871.1| hypothetical protein Cyan8802_3... 52.0 3e-05
gi|218247680|ref|YP_002373051.1| hypothetical protein PCC8801_28... 50.8 7e-05
gi|168704109|ref|ZP_02736386.1| hypothetical protein GobsU_31534... 48.9 2e-04
gi|306819378|ref|ZP_07453086.1| PIN family toxin-antitoxin syste... 48.5 3e-04
gi|227876102|ref|ZP_03994221.1| hypothetical protein HMPREF0577_... 47.8 6e-04
gi|307701612|ref|ZP_07638628.1| conserved hypothetical protein [... 47.4 7e-04
gi|269955272|ref|YP_003325061.1| hypothetical protein Xcel_0464 ... 45.1 0.004
gi|334120498|ref|ZP_08494578.1| hypothetical protein MicvaDRAFT_... 43.5 0.010
gi|336176928|ref|YP_004582303.1| hypothetical protein FsymDg_087... 42.7 0.018
gi|17232673|ref|NP_489221.1| hypothetical protein alr5181 [Nosto... 39.3 0.20
gi|226327991|ref|ZP_03803509.1| hypothetical protein PROPEN_0188... 38.5 0.34
gi|158315766|ref|YP_001508274.1| hypothetical protein Franean1_3... 37.7 0.50
gi|159029752|emb|CAO87830.1| unnamed protein product [Microcysti... 37.7 0.54
gi|194759278|ref|XP_001961876.1| GF15195 [Drosophila ananassae] ... 37.4 0.83
gi|334127107|ref|ZP_08501037.1| delta-aminovaleramide aminohydro... 35.0 3.3
>gi|15609156|ref|NP_216535.1| hypothetical protein Rv2019 [Mycobacterium tuberculosis H37Rv]
gi|15841502|ref|NP_336539.1| hypothetical protein MT2075 [Mycobacterium tuberculosis CDC1551]
gi|31793199|ref|NP_855692.1| hypothetical protein Mb2042 [Mycobacterium bovis AF2122/97]
37 more sequence titles
Length=138
Score = 283 bits (725), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 137/138 (99%), Positives = 138/138 (100%), Gaps = 0/138 (0%)
Query 1 LQPDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMT 60
+QPDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMT
Sbjct 1 MQPDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMT 60
Query 61 AECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAAR 120
AECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAAR
Sbjct 61 AECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAAR 120
Query 121 FPGPFIYTVHPSKIVRVL 138
FPGPFIYTVHPSKIVRVL
Sbjct 121 FPGPFIYTVHPSKIVRVL 138
>gi|289574696|ref|ZP_06454923.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339632056|ref|YP_004723698.1| hypothetical protein MAF_20310 [Mycobacterium africanum GM041182]
gi|289539127|gb|EFD43705.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|339331412|emb|CCC27101.1| hypothetical protein MAF_20310 [Mycobacterium africanum GM041182]
Length=138
Score = 280 bits (716), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 136/138 (99%), Positives = 137/138 (99%), Gaps = 0/138 (0%)
Query 1 LQPDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMT 60
+ PDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMT
Sbjct 1 MPPDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMT 60
Query 61 AECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAAR 120
AECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAAR
Sbjct 61 AECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAAR 120
Query 121 FPGPFIYTVHPSKIVRVL 138
FPGPFIYTVHPSKIVRVL
Sbjct 121 FPGPFIYTVHPSKIVRVL 138
>gi|340627031|ref|YP_004745483.1| hypothetical protein MCAN_20421 [Mycobacterium canettii CIPT
140010059]
gi|340005221|emb|CCC44374.1| hypothetical protein MCAN_20421 [Mycobacterium canettii CIPT
140010059]
Length=139
Score = 257 bits (657), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 133/138 (97%), Positives = 137/138 (99%), Gaps = 0/138 (0%)
Query 1 LQPDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMT 60
+QPD +LLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDH+WIAMT
Sbjct 1 MQPDPHLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHEWIAMT 60
Query 61 AECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAAR 120
AECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAA+ARAAR
Sbjct 61 AECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAMARAAR 120
Query 121 FPGPFIYTVHPSKIVRVL 138
FPGPFIYTVHPSKIVRVL
Sbjct 121 FPGPFIYTVHPSKIVRVL 138
>gi|148823235|ref|YP_001287989.1| hypothetical protein TBFG_12054 [Mycobacterium tuberculosis F11]
gi|308232008|ref|ZP_07414593.2| hypothetical protein TMAG_02201 [Mycobacterium tuberculosis SUMu001]
gi|308369595|ref|ZP_07666757.1| hypothetical protein TMBG_00559 [Mycobacterium tuberculosis SUMu002]
26 more sequence titles
Length=116
Score = 239 bits (609), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 115/116 (99%), Positives = 116/116 (100%), Gaps = 0/116 (0%)
Query 23 VQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERR 82
+QVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERR
Sbjct 1 MQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERR 60
Query 83 TVLDTGARLFCVPRADILAEQVAARYIASLAAIARAARFPGPFIYTVHPSKIVRVL 138
TVLDTGARLFCVPRADILAEQVAARYIASLAAIARAARFPGPFIYTVHPSKIVRVL
Sbjct 61 TVLDTGARLFCVPRADILAEQVAARYIASLAAIARAARFPGPFIYTVHPSKIVRVL 116
>gi|253798926|ref|YP_003031927.1| hypothetical protein TBMG_01964 [Mycobacterium tuberculosis KZN
1435]
gi|254232190|ref|ZP_04925517.1| hypothetical protein TBCG_01972 [Mycobacterium tuberculosis C]
gi|124601249|gb|EAY60259.1| hypothetical protein TBCG_01972 [Mycobacterium tuberculosis C]
gi|253320429|gb|ACT25032.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
1435]
Length=100
Score = 208 bits (529), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 100/100 (100%), Positives = 100/100 (100%), Gaps = 0/100 (0%)
Query 39 MREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRAD 98
MREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRAD
Sbjct 1 MREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRAD 60
Query 99 ILAEQVAARYIASLAAIARAARFPGPFIYTVHPSKIVRVL 138
ILAEQVAARYIASLAAIARAARFPGPFIYTVHPSKIVRVL
Sbjct 61 ILAEQVAARYIASLAAIARAARFPGPFIYTVHPSKIVRVL 100
>gi|119716256|ref|YP_923221.1| hypothetical protein Noca_2024 [Nocardioides sp. JS614]
gi|119536917|gb|ABL81534.1| conserved hypothetical protein [Nocardioides sp. JS614]
Length=133
Score = 80.9 bits (198), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 40/103 (39%), Positives = 58/103 (57%), Gaps = 0/103 (0%)
Query 13 HIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDA 72
H FVDRSLG QVP LLR G+ L T+ EHYG + V+D +W+ + + W KD
Sbjct 5 HFFVDRSLGRRQVPDLLRADGWDLATLAEHYGIPADEDVTDVEWLRLAGQHAWPVLMKDE 64
Query 73 NIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAI 115
IR + ER +L+ G R FC+ ++ ++A +I +AI
Sbjct 65 RIRYRSAERAALLEHGVRAFCLTSGNLTGREMAELFIGHRSAI 107
>gi|254416801|ref|ZP_05030550.1| hypothetical protein MC7420_1576 [Microcoleus chthonoplastes
PCC 7420]
gi|196176347|gb|EDX71362.1| hypothetical protein MC7420_1576 [Microcoleus chthonoplastes
PCC 7420]
Length=114
Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/112 (30%), Positives = 54/112 (49%), Gaps = 4/112 (3%)
Query 6 NLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGW 65
NL F+DR LG + ++L++ G + +H+ Q+ D WI E GW
Sbjct 2 NLKPKTTVFFIDRCLGKHPILEMLKETGVSVEIHDDHF----PQNTFDQDWIPQVGEWGW 57
Query 66 IGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIAR 117
I KDA I RN +ER+ V G R+F + + E+ A + +L ++ +
Sbjct 58 IILTKDARIARNTLERQAVARAGIRMFTLASKKLSGEETAIAFRDALPSMLK 109
>gi|307152992|ref|YP_003888376.1| hypothetical protein Cyan7822_3147 [Cyanothece sp. PCC 7822]
gi|306983220|gb|ADN15101.1| conserved hypothetical protein [Cyanothece sp. PCC 7822]
Length=146
Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 34/118 (29%), Positives = 57/118 (49%), Gaps = 5/118 (4%)
Query 14 IFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDAN 73
F+DR LG + + LR AG + +H+ E AQ V W+ + GW+ KDA
Sbjct 9 FFIDRCLGKKHIAEALRKAGITVEIHDDHF-ENNAQDVE---WLPEVGKKGWVVLTKDAK 64
Query 74 IRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAAR-FPGPFIYTVH 130
I +N E+ V G ++F + ++ +A ++A++ + R P PFI V+
Sbjct 65 IGKNNTEQLAVAYAGIKMFALASQNLSGPDMAQIFLATIVKMQEFVRQNPAPFIAKVY 122
>gi|120406340|ref|YP_956169.1| hypothetical protein Mvan_5392 [Mycobacterium vanbaalenii PYR-1]
gi|119959158|gb|ABM16163.1| hypothetical protein Mvan_5392 [Mycobacterium vanbaalenii PYR-1]
Length=132
Score = 57.0 bits (136), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 36/124 (30%), Positives = 59/124 (48%), Gaps = 2/124 (1%)
Query 14 IFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDAN 73
F+DRSLG +V L + G+R+ + +H+ AQ + D +W+ GW HKD
Sbjct 9 FFLDRSLGK-RVAAGLTERGWRIHRVVDHFA-NDAQDIPDEEWMRYGLVRGWFPLHKDGR 66
Query 74 IRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAARFPGPFIYTVHPSK 133
IR ERR +++ A +F + + +++ AR+ A+ I R G + V
Sbjct 67 IRGTEAERRPLIEFNAPMFYLDNQQLRIDEMVARFHAAQRQIYTRTRVAGAACFAVSAHG 126
Query 134 IVRV 137
I R
Sbjct 127 IRRT 130
>gi|75909457|ref|YP_323753.1| hypothetical protein Ava_3250 [Anabaena variabilis ATCC 29413]
gi|75703182|gb|ABA22858.1| hypothetical protein Ava_3250 [Anabaena variabilis ATCC 29413]
Length=144
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/125 (25%), Positives = 60/125 (48%), Gaps = 5/125 (4%)
Query 14 IFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDAN 73
F+DR+LG V LR G + +H+ D +W+ + GW+ KD
Sbjct 9 FFIDRALGKKYVADALRSMGANVEVHADHF----LSDAPDTEWLPEVSRRGWLILTKDDE 64
Query 74 IRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAARF-PGPFIYTVHPS 132
I RN +E+ + + A++F + ++ E++A ++ ++ + + A+ PFI ++
Sbjct 65 IGRNFLEQMAIASSNAKVFVLSLRNLTGEEMADIFVQAVERMEKFAQSNQAPFIAKIYKY 124
Query 133 KIVRV 137
VRV
Sbjct 125 GKVRV 129
>gi|269976998|ref|ZP_06183972.1| putative toxin-antitoxin system, toxin component, PIN family
[Mobiluncus mulieris 28-1]
gi|269934829|gb|EEZ91389.1| putative toxin-antitoxin system, toxin component, PIN family
[Mobiluncus mulieris 28-1]
Length=126
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/118 (27%), Positives = 55/118 (47%), Gaps = 1/118 (0%)
Query 15 FVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANI 74
F+DR L + + + LR+AG + TMR+ +G+ + + V+D +W+ + I KD+ I
Sbjct 7 FIDRCLESPSLIKGLREAGLDVVTMRDVFGD-RGEQVTDLEWLRYCEQHNLIALTKDSRI 65
Query 75 RRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAARFPGPFIYTVHPS 132
RN VER V+ R A + + + R + + GP + + S
Sbjct 66 SRNQVEREAVIANKVRYVRTTAAGLRIDMIQDRILQHRKKLENLINQAGPTFHKIDGS 123
>gi|257060983|ref|YP_003138871.1| hypothetical protein Cyan8802_3198 [Cyanothece sp. PCC 8802]
gi|256591149|gb|ACV02036.1| hypothetical protein Cyan8802_3198 [Cyanothece sp. PCC 8802]
Length=143
Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/117 (24%), Positives = 55/117 (48%), Gaps = 5/117 (4%)
Query 15 FVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANI 74
F+DR LG + + L++ G + +H+ Q+ D W+ + GW+ KDA I
Sbjct 12 FIDRCLGNRIIVETLKNTGITVEIHDDHF----PQNAQDVDWLPEVGKRGWVILTKDAAI 67
Query 75 RRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAAR-FPGPFIYTVH 130
+N++ER V +LF + + + +A ++ ++ + R PFI ++
Sbjct 68 SKNSLERMAVTHARIKLFTLASQSLSGQDMADIFLKAIVKMQEFVRKHDAPFIAKIY 124
>gi|218247680|ref|YP_002373051.1| hypothetical protein PCC8801_2898 [Cyanothece sp. PCC 8801]
gi|218168158|gb|ACK66895.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
Length=143
Score = 50.8 bits (120), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/124 (23%), Positives = 56/124 (46%), Gaps = 5/124 (4%)
Query 15 FVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANI 74
F+DR G + + L++ G + +H+ Q+ D W+ + GW+ KDA I
Sbjct 12 FIDRCWGNRIIVETLKNTGITVEIHDDHF----PQNAQDVDWLPEVGKRGWVILTKDAAI 67
Query 75 RRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAAR-FPGPFIYTVHPSK 133
+N++ER V +LF + + + +A ++ ++ + R PFI ++
Sbjct 68 SKNSLERMAVTHAKIKLFTLASQSLSGQDMADIFLKAIVKMQEFVRKHDAPFIAKIYKDS 127
Query 134 IVRV 137
V +
Sbjct 128 QVEM 131
>gi|168704109|ref|ZP_02736386.1| hypothetical protein GobsU_31534 [Gemmata obscuriglobus UQM 2246]
Length=138
Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/95 (32%), Positives = 44/95 (47%), Gaps = 1/95 (1%)
Query 41 EHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADIL 100
E + + AQ V D WI A GW+ KD NIRR ER L + AR+F + +
Sbjct 29 EIHTDNFAQDVEDTTWIPRVAAKGWVILTKDKNIRRRHGEREATLLSAARMFTLSSGSMR 88
Query 101 AEQVAARYIASLAAIAR-AARFPGPFIYTVHPSKI 134
+A +++ L A+ + A PF+ V I
Sbjct 89 GAVMAEIFVSQLDAMEQLATALAPPFLAVVTQDGI 123
>gi|306819378|ref|ZP_07453086.1| PIN family toxin-antitoxin system [Mobiluncus mulieris ATCC 35239]
gi|304647865|gb|EFM45182.1| PIN family toxin-antitoxin system [Mobiluncus mulieris ATCC 35239]
Length=109
Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/76 (33%), Positives = 42/76 (56%), Gaps = 1/76 (1%)
Query 15 FVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANI 74
F+DR L + + + LR+AG + TMR+ +G+ + + V+D +W+ + I KD+ I
Sbjct 7 FIDRCLESPSLIKGLREAGLDVVTMRDAFGD-RGEQVTDLEWLRYCGQHNLIALTKDSRI 65
Query 75 RRNAVERRTVLDTGAR 90
R VER V+ R
Sbjct 66 SRKQVEREAVIANKVR 81
>gi|227876102|ref|ZP_03994221.1| hypothetical protein HMPREF0577_1522 [Mobiluncus mulieris ATCC
35243]
gi|227843401|gb|EEJ53591.1| hypothetical protein HMPREF0577_1522 [Mobiluncus mulieris ATCC
35243]
Length=109
Score = 47.8 bits (112), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/76 (33%), Positives = 42/76 (56%), Gaps = 1/76 (1%)
Query 15 FVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANI 74
F+DR L + + + L +AG + TMR+ +G+ + + V+D +W+ + I KD+ I
Sbjct 7 FIDRCLESPSLIKGLCEAGLDVVTMRDAFGD-RGEQVTDLEWLRYCGQHNLIALTKDSRI 65
Query 75 RRNAVERRTVLDTGAR 90
RN VER V+ R
Sbjct 66 SRNQVEREAVIANKVR 81
>gi|307701612|ref|ZP_07638628.1| conserved hypothetical protein [Mobiluncus mulieris FB024-16]
gi|307613115|gb|EFN92368.1| conserved hypothetical protein [Mobiluncus mulieris FB024-16]
Length=126
Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/104 (26%), Positives = 47/104 (46%), Gaps = 1/104 (0%)
Query 29 LRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTG 88
LR+AG + TMR+ +G+ + + V+D +W+ + I KD+ I RN VER V+
Sbjct 5 LREAGLDVVTMRDAFGD-RGEQVTDLEWLRYCGQHNLIALTKDSRISRNQVEREAVIANK 63
Query 89 ARLFCVPRADILAEQVAARYIASLAAIARAARFPGPFIYTVHPS 132
R A + + + R + + GP + + S
Sbjct 64 VRYVRTTAAGLRIDMIQDRILQHRKKLENLINQAGPTFHKIDGS 107
>gi|269955272|ref|YP_003325061.1| hypothetical protein Xcel_0464 [Xylanimonas cellulosilytica DSM
15894]
gi|269303953|gb|ACZ29503.1| hypothetical protein Xcel_0464 [Xylanimonas cellulosilytica DSM
15894]
Length=112
Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/91 (30%), Positives = 48/91 (53%), Gaps = 1/91 (1%)
Query 48 AQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAAR 107
+++++D WI +E G + KD I R +E V++ A++F + + I Q+ R
Sbjct 15 SEAIADVDWIREASELGEVILTKDVRIARVPIEAAVVVNCDAKVFAMASSRITGPQMLER 74
Query 108 YIASLAAIARAAR-FPGPFIYTVHPSKIVRV 137
+ AI R AR P PF+ V+ ++I R+
Sbjct 75 LLRHQEAIHRWARQAPPPFVVGVYENRIARL 105
>gi|334120498|ref|ZP_08494578.1| hypothetical protein MicvaDRAFT_2272 [Microcoleus vaginatus FGP-2]
gi|333456476|gb|EGK85108.1| hypothetical protein MicvaDRAFT_2272 [Microcoleus vaginatus FGP-2]
Length=148
Score = 43.5 bits (101), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 34/124 (28%), Positives = 52/124 (42%), Gaps = 6/124 (4%)
Query 14 IFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDAN 73
F+DR LG ++ LR+ G + H+ Q + D W+ E W+ KD
Sbjct 5 FFIDRCLGK-KLADSLRNEGATVEIHDAHF----KQDMKDEDWLREVGERNWVVLTKDKK 59
Query 74 IRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAAR-FPGPFIYTVHPS 132
I +E V RLF D+ VA ++ +L + + R PFI VH S
Sbjct 60 IASRRLELFAVAQGNVRLFAFVDGDVSGVVVAQGFVNALENMQQFIRGNQAPFIAKVHQS 119
Query 133 KIVR 136
+V+
Sbjct 120 GMVK 123
>gi|336176928|ref|YP_004582303.1| hypothetical protein FsymDg_0872 [Frankia symbiont of Datisca
glomerata]
gi|334857908|gb|AEH08382.1| hypothetical protein FsymDg_0872 [Frankia symbiont of Datisca
glomerata]
Length=154
Score = 42.7 bits (99), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 26/79 (33%), Positives = 36/79 (46%), Gaps = 1/79 (1%)
Query 3 PDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQS-VSDHKWIAMTA 61
P + A FVD SLG V LR+ G+ + T+ E +G A S D W+ +
Sbjct 7 PPASYWAKPPEFFVDASLGGRTVTGRLRELGYAVHTLAEVFGSKAAASGQPDTAWLTLVR 66
Query 62 ECGWIGFHKDANIRRNAVE 80
GW+ +DA I R E
Sbjct 67 RTGWVVLARDAAILRRPDE 85
>gi|17232673|ref|NP_489221.1| hypothetical protein alr5181 [Nostoc sp. PCC 7120]
gi|17134320|dbj|BAB76880.1| alr5181 [Nostoc sp. PCC 7120]
Length=99
Score = 39.3 bits (90), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 18/77 (24%), Positives = 36/77 (47%), Gaps = 1/77 (1%)
Query 55 KWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAA 114
+W+ + GW+ KD I N +ER V ++F + E++A + ++
Sbjct 2 EWLPEVGKRGWVVLTKDGKISNNRLERIAVARAQIKMFTFASQSLSGEEMAGILLQAIVP 61
Query 115 IAR-AARFPGPFIYTVH 130
+ R ++ P PFI ++
Sbjct 62 MKRFVSKHPAPFIAKIY 78
>gi|226327991|ref|ZP_03803509.1| hypothetical protein PROPEN_01882 [Proteus penneri ATCC 35198]
gi|225203695|gb|EEG86049.1| hypothetical protein PROPEN_01882 [Proteus penneri ATCC 35198]
Length=165
Score = 38.5 bits (88), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 32/116 (28%), Positives = 47/116 (41%), Gaps = 11/116 (9%)
Query 15 FVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANI 74
F++R LG V L + +R+ YGE + Q + I WI D N+
Sbjct 32 FIERGLGDV-----LISFESEVNNIRQQYGEDEYQVIVPPVDILAEFPVTWI----DKNV 82
Query 75 RRNAVERRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAARFPGPFIYTVH 130
+RN E T+ T PRA + Q R +AA+FP ++TV
Sbjct 83 QRNNTE--TLAKTYLNYLYSPRAQEIITQFNYRVNDKAVMAQKAAQFPTTSLFTVE 136
>gi|158315766|ref|YP_001508274.1| hypothetical protein Franean1_3978 [Frankia sp. EAN1pec]
gi|158111171|gb|ABW13368.1| hypothetical protein Franean1_3978 [Frankia sp. EAN1pec]
Length=249
Score = 37.7 bits (86), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 20/52 (39%), Positives = 30/52 (58%), Gaps = 1/52 (1%)
Query 84 VLDTGARLFCVPRADILAEQVAARYIASLAAIARAARF-PGPFIYTVHPSKI 134
+L +GAR F + R D+ Q+A R I + I R AR PGP+ + VH ++
Sbjct 185 ILASGARCFALARQDLTGCQMAQRLIDNREEILRVARSRPGPYFFHVHQDRV 236
>gi|159029752|emb|CAO87830.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length=75
Score = 37.7 bits (86), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 17/71 (24%), Positives = 36/71 (51%), Gaps = 4/71 (5%)
Query 29 LRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTG 88
+++ G ++ H+ A D +W+ + ++ GWI KD NI RN +E + +
Sbjct 1 MQEIGVKIEFHHAHF----APDTPDTEWLPIVSQRGWIVLTKDVNIGRNILEVQQIARYQ 56
Query 89 ARLFCVPRADI 99
A++F + ++
Sbjct 57 AQVFVLVSGNL 67
>gi|194759278|ref|XP_001961876.1| GF15195 [Drosophila ananassae]
gi|190615573|gb|EDV31097.1| GF15195 [Drosophila ananassae]
Length=392
Score = 37.4 bits (85), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 24/74 (33%), Positives = 38/74 (52%), Gaps = 5/74 (6%)
Query 37 TTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPR 96
T HYG +V++ K+ T +IG H D++ +R + + +LD+G L P
Sbjct 238 TNSSLHYGPLTYTNVTEDKY--WTFHMDFIGIH-DSDTKRCPLGCKAILDSGTSLIAGPA 294
Query 97 ADI--LAEQVAARY 108
ADI L E + A+Y
Sbjct 295 ADIVHLNEAIGAQY 308
>gi|334127107|ref|ZP_08501037.1| delta-aminovaleramide aminohydrolase [Centipeda periodontii DSM
2778]
gi|333390069|gb|EGK61221.1| delta-aminovaleramide aminohydrolase [Centipeda periodontii DSM
2778]
Length=282
Score = 35.0 bits (79), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 19/58 (33%), Positives = 26/58 (45%), Gaps = 0/58 (0%)
Query 38 TMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVP 95
T E E++ HKWIA + + + KDAN+R A GARL +P
Sbjct 4 TKAEDNDESKGGEAMAHKWIAAAVQMDCVPYDKDANLRHAAERIADAASQGARLVVLP 61
Lambda K H
0.327 0.138 0.420
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 127792337190
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40