BLASTP 2.2.25+
Reference:
Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database
search programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Alejandro A. Schäffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
15,229,318 sequences; 5,219,829,388 total letters
Query= Rv1048c
Length=371
Score E
Sequences producing significant alignments: (Bits) Value
gi|15608188|ref|NP_215564.1| hypothetical protein Rv1048c [Mycob... 743 0.0
gi|289744787|ref|ZP_06504165.1| conserved hypothetical protein [... 742 0.0
gi|289749581|ref|ZP_06508959.1| hypothetical protein TBDG_01806 ... 741 0.0
gi|121636977|ref|YP_977200.1| hypothetical protein BCG_1106c [My... 738 0.0
gi|254231338|ref|ZP_04924665.1| hypothetical protein TBCG_01036 ... 685 0.0
gi|167967681|ref|ZP_02549958.1| hypothetical protein MtubH3_0644... 673 0.0
gi|289569030|ref|ZP_06449257.1| predicted protein [Mycobacterium... 238 2e-60
gi|213649545|ref|ZP_03379598.1| hypothetical protein SentesTy_20... 72.0 2e-10
gi|296131143|ref|YP_003638393.1| hypothetical protein Cfla_3315 ... 60.8 3e-07
gi|194292977|ref|YP_002008884.1| hypothetical protein RALTA_B225... 48.9 0.001
gi|71737112|ref|YP_273257.1| hypothetical protein PSPPH_0985 [Ps... 48.9 0.001
gi|225155556|ref|ZP_03724046.1| hypothetical protein ObacDRAFT_8... 47.8 0.003
gi|288926521|ref|ZP_06420440.1| conserved hypothetical protein [... 47.4 0.004
gi|281426338|ref|ZP_06257251.1| conserved hypothetical protein [... 47.4 0.004
gi|271501964|ref|YP_003334990.1| hypothetical protein Dd586_3454... 46.2 0.008
gi|71907835|ref|YP_285422.1| hypothetical protein Daro_2213 [Dec... 45.4 0.016
gi|320325894|gb|EFW81954.1| hypothetical protein PsgB076_04716 [... 45.1 0.019
gi|336126149|ref|YP_004578105.1| hypothetical protein VAA_00992 ... 44.7 0.024
gi|315922842|ref|ZP_07919082.1| conserved hypothetical protein [... 43.1 0.079
gi|171321090|ref|ZP_02910070.1| conserved hypothetical protein [... 42.7 0.099
gi|309813246|ref|ZP_07706967.1| VanW-like protein [Dermacoccus s... 41.2 0.29
gi|121610648|ref|YP_998455.1| hypothetical protein Veis_3722 [Ve... 40.0 0.62
gi|187939938|gb|ACD39074.1| hypothetical protein PACL_0286 [Pseu... 39.3 0.94
gi|326435096|gb|EGD80666.1| dynein [Salpingoeca sp. ATCC 50818] 39.3 0.96
gi|241518234|ref|YP_002978862.1| hypothetical protein Rleg_5495 ... 39.3 1.1
gi|241589734|ref|YP_002979759.1| hypothetical protein Rpic12D_48... 39.3 1.1
gi|330883137|gb|EGH17286.1| hypothetical protein Pgy4_30395 [Pse... 38.9 1.4
gi|271962174|ref|YP_003336370.1| hypothetical protein Sros_0604 ... 38.5 1.7
gi|335033691|ref|ZP_08527056.1| putative ftsK cell division prot... 38.1 2.6
gi|159185366|ref|NP_355689.2| ftsK cell division protein [Agroba... 38.1 2.7
gi|56476846|ref|YP_158435.1| hypothetical protein ebA2502 [Aroma... 37.4 3.7
gi|183980405|ref|YP_001848696.1| acyl-CoA dehydrogenase FadE2 [M... 37.4 3.8
gi|330340412|ref|NP_001179598.2| neuron navigator 3 [Bos taurus]... 36.6 6.5
gi|116254535|ref|YP_770371.1| hypothetical protein pRL100076 [Rh... 36.2 8.9
>gi|15608188|ref|NP_215564.1| hypothetical protein Rv1048c [Mycobacterium tuberculosis H37Rv]
gi|15840479|ref|NP_335516.1| hypothetical protein MT1078 [Mycobacterium tuberculosis CDC1551]
gi|31792239|ref|NP_854732.1| hypothetical protein Mb1077c [Mycobacterium bovis AF2122/97]
46 more sequence titles
Length=371
Score = 743 bits (1919), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/371 (99%), Positives = 371/371 (100%), Gaps = 0/371 (0%)
Query 1 VQASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSALEGAFRSEVRARRVNGP 60
+QASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSALEGAFRSEVRARRVNGP
Sbjct 1 MQASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSALEGAFRSEVRARRVNGP 60
Query 61 HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV 120
HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV
Sbjct 61 HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV 120
Query 121 DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANIAGPTVASV 180
DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANIAGPTVASV
Sbjct 121 DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANIAGPTVASV 180
Query 181 VEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS 240
VEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS
Sbjct 181 VEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS 240
Query 241 ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD 300
ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD
Sbjct 241 ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD 300
Query 301 LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED 360
LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED
Sbjct 301 LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED 360
Query 361 AAEHLREAMTK 371
AAEHLREAMTK
Sbjct 361 AAEHLREAMTK 371
>gi|289744787|ref|ZP_06504165.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289757131|ref|ZP_06516509.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|294995239|ref|ZP_06800930.1| hypothetical protein Mtub2_12192 [Mycobacterium tuberculosis
210]
gi|289685315|gb|EFD52803.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289712695|gb|EFD76707.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|326904725|gb|EGE51658.1| hypothetical protein TBPG_02639 [Mycobacterium tuberculosis W-148]
Length=371
Score = 742 bits (1915), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/371 (99%), Positives = 371/371 (100%), Gaps = 0/371 (0%)
Query 1 VQASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSALEGAFRSEVRARRVNGP 60
+QASDRTWQSNFIRRWYFTETV+YRPLVKYDASMSWDERTVSALEGAFRSEVRARRVNGP
Sbjct 1 MQASDRTWQSNFIRRWYFTETVDYRPLVKYDASMSWDERTVSALEGAFRSEVRARRVNGP 60
Query 61 HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV 120
HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV
Sbjct 61 HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV 120
Query 121 DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANIAGPTVASV 180
DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANIAGPTVASV
Sbjct 121 DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANIAGPTVASV 180
Query 181 VEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS 240
VEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS
Sbjct 181 VEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS 240
Query 241 ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD 300
ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD
Sbjct 241 ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD 300
Query 301 LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED 360
LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED
Sbjct 301 LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED 360
Query 361 AAEHLREAMTK 371
AAEHLREAMTK
Sbjct 361 AAEHLREAMTK 371
>gi|289749581|ref|ZP_06508959.1| hypothetical protein TBDG_01806 [Mycobacterium tuberculosis T92]
gi|289690168|gb|EFD57597.1| hypothetical protein TBDG_01806 [Mycobacterium tuberculosis T92]
Length=371
Score = 741 bits (1913), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/371 (99%), Positives = 370/371 (99%), Gaps = 0/371 (0%)
Query 1 VQASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSALEGAFRSEVRARRVNGP 60
+QASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSALEGAFRSEVRARRVNGP
Sbjct 1 MQASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSALEGAFRSEVRARRVNGP 60
Query 61 HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV 120
HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV
Sbjct 61 HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV 120
Query 121 DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANIAGPTVASV 180
DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANIAGPTVASV
Sbjct 121 DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANIAGPTVASV 180
Query 181 VEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS 240
VEATGLSMGSSAQALK LEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS
Sbjct 181 VEATGLSMGSSAQALKLLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS 240
Query 241 ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD 300
ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD
Sbjct 241 ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD 300
Query 301 LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED 360
LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED
Sbjct 301 LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED 360
Query 361 AAEHLREAMTK 371
AAEHLREAMTK
Sbjct 361 AAEHLREAMTK 371
>gi|121636977|ref|YP_977200.1| hypothetical protein BCG_1106c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224989449|ref|YP_002644136.1| hypothetical protein JTY_1078 [Mycobacterium bovis BCG str. Tokyo
172]
gi|121492624|emb|CAL71093.1| Hypothetical protein BCG_1106c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|224772562|dbj|BAH25368.1| hypothetical protein JTY_1078 [Mycobacterium bovis BCG str. Tokyo
172]
gi|341600993|emb|CCC63665.1| hypothetical protein BCGM_1072c [Mycobacterium bovis BCG str.
Moreau RDJ]
Length=371
Score = 738 bits (1906), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/371 (99%), Positives = 369/371 (99%), Gaps = 0/371 (0%)
Query 1 VQASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSALEGAFRSEVRARRVNGP 60
+QASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSW ERTVSALEGAFRSEVRARRVNGP
Sbjct 1 MQASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWAERTVSALEGAFRSEVRARRVNGP 60
Query 61 HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV 120
HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV
Sbjct 61 HRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILAAPTMSPGARKAAHDAGVGWV 120
Query 121 DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANIAGPTVASV 180
DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANI GPTVASV
Sbjct 121 DESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLANITGPTVASV 180
Query 181 VEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS 240
VEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS
Sbjct 181 VEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPIS 240
Query 241 ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD 300
ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD
Sbjct 241 ISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD 300
Query 301 LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED 360
LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED
Sbjct 301 LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGED 360
Query 361 AAEHLREAMTK 371
AAEHLREAMTK
Sbjct 361 AAEHLREAMTK 371
>gi|254231338|ref|ZP_04924665.1| hypothetical protein TBCG_01036 [Mycobacterium tuberculosis C]
gi|124600397|gb|EAY59407.1| hypothetical protein TBCG_01036 [Mycobacterium tuberculosis C]
gi|339294048|gb|AEJ46159.1| hypothetical protein CCDC5079_0969 [Mycobacterium tuberculosis
CCDC5079]
gi|339297688|gb|AEJ49798.1| hypothetical protein CCDC5180_0961 [Mycobacterium tuberculosis
CCDC5180]
Length=344
Score = 685 bits (1767), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/344 (99%), Positives = 344/344 (100%), Gaps = 0/344 (0%)
Query 28 VKYDASMSWDERTVSALEGAFRSEVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVA 87
+KYDASMSWDERTVSALEGAFRSEVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVA
Sbjct 1 MKYDASMSWDERTVSALEGAFRSEVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVA 60
Query 88 EALHATSRPDILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAP 147
EALHATSRPDILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAP
Sbjct 61 EALHATSRPDILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAP 120
Query 148 PAPLDARIGWRRATLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASA 207
PAPLDARIGWRRATLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASA
Sbjct 121 PAPLDARIGWRRATLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASA 180
Query 208 TARGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIE 267
TARGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIE
Sbjct 181 TARGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIE 240
Query 268 WAATSALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPA 327
WAATSALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPA
Sbjct 241 WAATSALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPA 300
Query 328 CARLTEQNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK 371
CARLTEQNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK
Sbjct 301 CARLTEQNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK 344
>gi|167967681|ref|ZP_02549958.1| hypothetical protein MtubH3_06441 [Mycobacterium tuberculosis
H37Ra]
gi|254550034|ref|ZP_05140481.1| hypothetical protein Mtube_06183 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|308374184|ref|ZP_07435141.2| hypothetical protein TMFG_02871 [Mycobacterium tuberculosis SUMu006]
gi|308398428|ref|ZP_07492700.2| hypothetical protein TMLG_04017 [Mycobacterium tuberculosis SUMu012]
gi|308342745|gb|EFP31596.1| hypothetical protein TMFG_02871 [Mycobacterium tuberculosis SUMu006]
gi|308366718|gb|EFP55569.1| hypothetical protein TMLG_04017 [Mycobacterium tuberculosis SUMu012]
gi|323720480|gb|EGB29564.1| hypothetical protein TMMG_03047 [Mycobacterium tuberculosis CDC1551A]
Length=338
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/338 (100%), Positives = 338/338 (100%), Gaps = 0/338 (0%)
Query 34 MSWDERTVSALEGAFRSEVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVAEALHAT 93
MSWDERTVSALEGAFRSEVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVAEALHAT
Sbjct 1 MSWDERTVSALEGAFRSEVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVAEALHAT 60
Query 94 SRPDILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDA 153
SRPDILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDA
Sbjct 61 SRPDILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDA 120
Query 154 RIGWRRATLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPK 213
RIGWRRATLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPK
Sbjct 121 RIGWRRATLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPK 180
Query 214 SARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSA 273
SARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSA
Sbjct 181 SARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSA 240
Query 274 LSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPACARLTE 333
LSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPACARLTE
Sbjct 241 LSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPACARLTE 300
Query 334 QNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK 371
QNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK
Sbjct 301 QNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK 338
>gi|289569030|ref|ZP_06449257.1| predicted protein [Mycobacterium tuberculosis T17]
gi|289542784|gb|EFD46432.1| predicted protein [Mycobacterium tuberculosis T17]
Length=118
Score = 238 bits (606), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 117/118 (99%), Positives = 118/118 (100%), Gaps = 0/118 (0%)
Query 254 VVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEI 313
+VKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEI
Sbjct 1 MVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEI 60
Query 314 AGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK 371
AGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK
Sbjct 61 AGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK 118
>gi|213649545|ref|ZP_03379598.1| hypothetical protein SentesTy_20912 [Salmonella enterica subsp.
enterica serovar Typhi str. J185]
Length=34
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/34 (98%), Positives = 34/34 (100%), Gaps = 0/34 (0%)
Query 100 AAPTMSPGARKAAHDAGVGWVDESGAADIHYRNT 133
+APTMSPGARKAAHDAGVGWVDESGAADIHYRNT
Sbjct 1 SAPTMSPGARKAAHDAGVGWVDESGAADIHYRNT 34
>gi|296131143|ref|YP_003638393.1| hypothetical protein Cfla_3315 [Cellulomonas flavigena DSM 20109]
gi|296022958|gb|ADG76194.1| hypothetical protein Cfla_3315 [Cellulomonas flavigena DSM 20109]
Length=365
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/289 (27%), Positives = 125/289 (44%), Gaps = 32/289 (11%)
Query 98 ILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDAR--- 154
++ A +SPGAR D G+ W DE G N V+ PP P+ R
Sbjct 85 VVVARHLSPGARTLLDDRGLSWADEEG-------NLRVSAGPVVVAVDGPPPPVTERRAP 137
Query 155 --IGWRRATLAVCEALLANIAGPTVASVVEAT-------GLSMGSSAQALKFLEKNG-HL 204
+ W A+ AV E +L + + +EA G+S S ++ L+ + G
Sbjct 138 TEMSWADASGAVAELILERATATDLHTEIEAVTSIADRLGISAASVSRTLQRFDAIGWTR 197
Query 205 ASATARGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAA 264
S RGP+ R + D A+L ++A + + +++ ++ D + + W A
Sbjct 198 RSGPGRGPQVVRHLADPSAMLSSWAAWSTRRTRRTTVAHALV-TDIEGWLRRLATAWPAG 256
Query 265 GIEWAATSALSASLLAPMQTEIAPMEIYV-PGRSWSDLRRAAMAAGLQEI-AGGRLIL-- 320
WA T +A + AP + ++ +E+Y+ P DL A L + GGR+ L
Sbjct 257 C--WAVTGEAAAQIRAPHLSRVSVVELYIEPDLYDDDLDSLLARAELTPVPTGGRVRLLR 314
Query 321 --RFFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLRE 367
R+ T A L + R+YADL + GVRG++AA LR+
Sbjct 315 ADRYLTTLIAADPGASELPLVPDI---RLYADLLSGGVRGDEAASALRD 360
>gi|194292977|ref|YP_002008884.1| hypothetical protein RALTA_B2255 [Cupriavidus taiwanensis LMG
19424]
gi|193226881|emb|CAQ72832.1| conserved hypothetical protein [Cupriavidus taiwanensis LMG 19424]
Length=371
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 83/331 (26%), Positives = 137/331 (42%), Gaps = 49/331 (14%)
Query 63 DVIVSLDGAEFLVRWLTTGWPRQVAEAL---------HATSRPDILAAPTMSPGARKAAH 113
D+ V+ LV + +PR V +AL H+ +L A ++SPGA++
Sbjct 59 DLHVAGKSIVLLVEARKSVFPRDVRQALWQLKSLQHGHSPDVQHLLIAESLSPGAKELLR 118
Query 114 DAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDAR------IGWRRATLAVCEA 167
+G+ D G+ + + G + I+ PAP A RRA V A
Sbjct 119 AERIGYFDSGGSLFL----PAAGAYVYIDK----PAPKTAEKAVRSLFSGRRAQ--VLHA 168
Query 168 LLANI-AGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLD 226
LL + A V V ++ + + L LE+ L S +GP R + D ALLD
Sbjct 169 LLVHHEAWFGVTEVAARAQVAPSTVSDVLSELERFDWLVS-RGQGPGKERHLRDPGALLD 227
Query 227 AYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEI 286
A+A+ R+P+ V A + + QL D + +A + +A AP + I
Sbjct 228 AWAKQLVTQRAPVLRRYFVPGLKSEALIERLDQLLDTHQVAYAVSYEAAAQRYAPFLSGI 287
Query 287 APMEIYV-----PGRSWSDLRRAAMAAG-----LQEIAGGRLILRFFPTPACARLTEQNL 336
+ + + + + ++LR A++ G ++ + G L+ R QN+
Sbjct 288 SQVRVRLLPTTAAETAMAELRARAVSEGANLAVIEAKSAGELLFR------------QNV 335
Query 337 QGFRSMLWPRVYADLRTAGVRGEDAAEHLRE 367
G +VY DL R +D AEHLR+
Sbjct 336 GGIWLASPVQVYLDLLRGEGRAKDMAEHLRK 366
>gi|71737112|ref|YP_273257.1| hypothetical protein PSPPH_0985 [Pseudomonas syringae pv. phaseolicola
1448A]
gi|71557665|gb|AAZ36876.1| hypothetical protein PSPPH_0985 [Pseudomonas syringae pv. phaseolicola
1448A]
Length=377
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/301 (23%), Positives = 117/301 (39%), Gaps = 48/301 (15%)
Query 91 HATSRPDILAAPTMSPGARKAAHDAGVGWVDESGAADIHYR--------------NTSTG 136
HA +LAA ++SPGAR+ + + D G+ + ++ S G
Sbjct 96 HAYDAVGMLAAGSLSPGAREELKTQNIAYFDLGGSLYLKHKTWLISIDKPSKRLKKYSNG 155
Query 137 TTLVIETKGAPPAPLDARIGWRRATLAVCEALLAN----IAGPTVASVVEATGLSMGSSA 192
+ + +G+ V ALL + + G +A E + +
Sbjct 156 IDIFTDARGS-----------------VVHALLMHANVWLTGAELAEQAETSSYTCSVVL 198
Query 193 QALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTA 252
Q L E ++ GP R++ + LL A+AE + R +P
Sbjct 199 QELTLRE----WVESSGGGPSKRRMLTRPEKLLHAWAEQWQE-RKEKQTKWYTFVENPKH 253
Query 253 GVVKAGQLWDAAGIE--WAATSALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGL 310
+ D I+ WA T A +A+++AP+ T EI VP + ++D R A GL
Sbjct 254 MLADLADRIDDQRIDFPWAFTGATAANVVAPLLTSTEGAEIIVP-KGYAD--RMADVLGL 310
Query 311 QEIAGGRLILRFFPTPA--CARLTEQNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREA 368
+ ++ G + PA R + F + + +Y DL R ++ A+HLRE
Sbjct 311 KSVSKGANVTLIEREPASLLYRYRHSDHPAFFASAYI-LYLDLLDGRGRNKELADHLREQ 369
Query 369 M 369
+
Sbjct 370 L 370
>gi|225155556|ref|ZP_03724046.1| hypothetical protein ObacDRAFT_8851 [Opitutaceae bacterium TAV2]
gi|224803699|gb|EEG21932.1| hypothetical protein ObacDRAFT_8851 [Opitutaceae bacterium TAV2]
Length=346
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 72/334 (22%), Positives = 133/334 (40%), Gaps = 31/334 (9%)
Query 47 AFRSEVRARRVNGPHR--DVIVSLDGAEF---LVRWLTTGWPRQVAEALHATSRPDILAA 101
A + ++ +N PH + ++ LDG +F L L ++ + T++P +
Sbjct 24 AHDTRIKQLALNSPHNPTEALLHLDGRQFRFALKFLLVPTVENLLSAKTNGTTQP-LFVV 82
Query 102 PTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAP-------PAPLDAR 154
P ++P +A W + + AD++ + G L++ G P P +
Sbjct 83 PRLTPAFLQAC------WQNGTSVADLNGQLFLRGPGLLVSLPGLPGRHFRFEQEPHNIF 136
Query 155 IGWRRATLAVCEALLANIAGP-TVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPK 213
+G + + ALL+++ + +++ TG + G ++ + +L + GHL AR
Sbjct 137 VG---KSARIVRALLSDVERTWQQSELIKRTGATSGLVSRIVTYLTRQGHLKKVDARRFH 193
Query 214 SARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSA 273
+V LLD + +A D R + L DP + +A T
Sbjct 194 ----VVSPLGLLDVWVQADDFSRRATTYRFAALNNDPVRLARTIRNVLAHDSSPFAFTQW 249
Query 274 LSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPACARLTE 333
++A L P E A + +YV DL GLQ + + P
Sbjct 250 IAAWLRHPY-AEPAIVSLYVQQLPTQDL---LDQLGLQPVNEAGRVWFHLPNDEGVFQEC 305
Query 334 QNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLRE 367
+ +Q + ++Y DL G+RG + A+ LRE
Sbjct 306 RFVQDLPLVTDAQIYLDLLKTGLRGPEQAKALRE 339
>gi|288926521|ref|ZP_06420440.1| conserved hypothetical protein [Prevotella buccae D17]
gi|288336733|gb|EFC75100.1| conserved hypothetical protein [Prevotella buccae D17]
Length=334
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 52/203 (26%), Positives = 78/203 (39%), Gaps = 25/203 (12%)
Query 96 PDILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIET----KGAPPAPL 151
P +L A + PG AG+ +VD G I Y + G L+ + + AP A
Sbjct 78 PILLIARYVQPGVYNILRTAGINFVDTVGNYQILY---TKGKKLIFQLSHTGEKAPIALN 134
Query 152 DARIGWRRATLAVCEALLANI--AGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATA 209
A ++ A L V LL ++ G T + E +S+G+ L LE A
Sbjct 135 KAYPIFQEAGLKVIFYLLQDVDNVGKTFREIKEQCDVSLGTIKNVLDELE-----ARKFV 189
Query 210 RGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAA----G 265
K R++ D+ LLD + E + P + +RD ++ WD G
Sbjct 190 LTTKRKRILKDKRRLLDLWVENYHHVLKPKLLVKHFAFRDE-----QSKTQWDKIVLPEG 244
Query 266 IEWAATSALSA--SLLAPMQTEI 286
I W A L P + EI
Sbjct 245 ICWGGECAAYQVNGYLTPQKFEI 267
>gi|281426338|ref|ZP_06257251.1| conserved hypothetical protein [Prevotella oris F0302]
gi|281399516|gb|EFB30347.1| conserved hypothetical protein [Prevotella oris F0302]
Length=334
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/205 (27%), Positives = 80/205 (40%), Gaps = 25/205 (12%)
Query 94 SRPDILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIET----KGAPPA 149
+ P +L A + PG AG+ +VD G I Y + G L+ + + AP A
Sbjct 76 NTPILLIAQYVQPGVYSILRTAGINFVDTVGNYQILY---TKGKKLIFQLSHTGEKAPIA 132
Query 150 PLDARIGWRRATLAVCEALLANI--AGPTVASVVEATGLSMGSSAQALKFLEKNGHLASA 207
A ++ A L V LL ++ G T + E +S+G+ L LE A
Sbjct 133 LNKAYPIFQEAGLKVIFYLLQDVDNVGKTFREIKEQCDVSLGTIKNVLDELE-----ARK 187
Query 208 TARGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAA--- 264
K R++ D+ LLD + E + P + +RD + KA WD
Sbjct 188 FVLTTKRKRILKDKRRLLDLWVENYHHVLKPKLLVKHFAFRDEQS---KAQ--WDKIVLP 242
Query 265 -GIEWAATSALSA--SLLAPMQTEI 286
GI W A L P + EI
Sbjct 243 EGICWGGECAAYQVNGYLTPQKFEI 267
>gi|271501964|ref|YP_003334990.1| hypothetical protein Dd586_3454 [Dickeya dadantii Ech586]
gi|270345519|gb|ACZ78284.1| Protein of unknown function DUF2186 [Dickeya dadantii Ech586]
Length=374
Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 83/361 (23%), Positives = 140/361 (39%), Gaps = 46/361 (12%)
Query 41 VSALEGAFRSEVRARRVNGPHRDVIVSLDGA----------EFLVRWLTTGWPRQVAEAL 90
V AL AF SE + R V V LDG + V +PR + A+
Sbjct 21 VEALAEAFGSEASVLDASHELRGVGVELDGMIVIKAPGKTLQVFVEVKREVYPRDLRNAV 80
Query 91 HATSRP------------DILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTT 138
+ R +LAA +SPGA++ + + + G+ + + G
Sbjct 81 YQLHRGIDETRHRHEAIIGLLAAGVLSPGAKQELREQNIASFELGGSLYLKH----DGWL 136
Query 139 LVIETKGAPPAPLDARIG-WRRATLAVCEALLAN----IAGPTVASVVEATGLSMGSSAQ 193
+ IE I + A +V ALL N + G +A E + + Q
Sbjct 137 INIEKPSHRTKKNTQGIDLFTEARESVIHALLMNSHGWLTGTELAEQAETSPYTCSLVLQ 196
Query 194 ALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAG 253
L E +T GP R++ LLDA++E + + S +P
Sbjct 197 ELTLRE----WVESTGGGPSKRRMLTRPGKLLDAWSEQWQERKEKKS-KWYTFVENPKDM 251
Query 254 VVKAGQLWDAAGIE--WAATSALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQ 311
+ + D ++ WA T A +A++ AP+ T EI VP + +++ R A GL+
Sbjct 252 LSHLAERIDRQKVDYPWAFTGAAAANVYAPLLTSTEGAEIIVP-KGYAE--RMADVLGLK 308
Query 312 EIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPR---VYADLRTAGVRGEDAAEHLREA 368
++ G + PA L +++ + + +Y DL R ++ AEH+RE
Sbjct 309 PVSKGANVTLIEREPAS--LLYRDMHSDHPIFFASPYILYLDLLDGRGRNKELAEHIRER 366
Query 369 M 369
+
Sbjct 367 L 367
>gi|71907835|ref|YP_285422.1| hypothetical protein Daro_2213 [Dechloromonas aromatica RCB]
gi|71847456|gb|AAZ46952.1| hypothetical protein Daro_2213 [Dechloromonas aromatica RCB]
Length=372
Score = 45.4 bits (106), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 81/323 (26%), Positives = 135/323 (42%), Gaps = 33/323 (10%)
Query 63 DVIVSLDGAEFLVRWLTTGWPRQVAEAL-HATSRPD--------ILAAPTMSPGARKAAH 113
D+ V+ LV + +PR V +AL S P +L A ++SPGA++
Sbjct 60 DLHVAGKSIVLLVEAKKSVYPRDVRQALWQLKSTPHSRQAEVQHLLMAESLSPGAKELLR 119
Query 114 DAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAP------LDARIGWRRATLAVCEA 167
+G+ D G+ + ++G + I+ P P + + RRA V A
Sbjct 120 AERIGYFDSGGSLFL----PTSGVYVYIDK----PVPKTLEKSVRSLFSGRRAQ--VLYA 169
Query 168 LLANIAG-PTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLD 226
LL N V V E ++ +++ L LE+ L S +GP R + + ALLD
Sbjct 170 LLVNREEWFGVTEVAERAQVAPSTASDVLGELERFDWLVS-RGQGPSKERHLREPGALLD 228
Query 227 AYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEI 286
A+A+ +P+ V A + + +++DA + +A + +A AP + I
Sbjct 229 AWAKQITMQPAPVLRRYFVPGLKSDALIERLSKIFDAHQVPYAVSYEAAAQRYAPFLSAI 288
Query 287 APMEI-YVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPACARLT-EQNLQGFRSMLW 344
+ + + +P S AAMA + L T + L QN+ G
Sbjct 289 SQVRVRLLPSTSAE----AAMAELDARVVNEGANLAVIETKSAEELLFRQNVGGVWLASP 344
Query 345 PRVYADLRTAGVRGEDAAEHLRE 367
+VY DL R ++ AEHLR+
Sbjct 345 VQVYLDLLRGEGRSKEMAEHLRK 367
>gi|320325894|gb|EFW81954.1| hypothetical protein PsgB076_04716 [Pseudomonas syringae pv.
glycinea str. B076]
gi|320330302|gb|EFW86286.1| hypothetical protein PsgRace4_09287 [Pseudomonas syringae pv.
glycinea str. race 4]
Length=377
Score = 45.1 bits (105), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 67/301 (23%), Positives = 116/301 (39%), Gaps = 48/301 (15%)
Query 91 HATSRPDILAAPTMSPGARKAAHDAGVGWVDESGAADIHYR--------------NTSTG 136
HA +LAA ++S GAR+ + + D G+ + ++ S G
Sbjct 96 HAYDAVGMLAAGSLSLGAREELKTQNIAYFDLGGSLYLKHKTWLISIDKPSKRLKKYSNG 155
Query 137 TTLVIETKGAPPAPLDARIGWRRATLAVCEALLAN----IAGPTVASVVEATGLSMGSSA 192
+ + +G+ V ALL + + G +A E + +
Sbjct 156 IDIFTDARGS-----------------VVHALLMHANVWLTGAELAEQAETSSYTCSVVL 198
Query 193 QALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTA 252
Q L E ++ GP R++ + LL A+AE + R +P
Sbjct 199 QELTLRE----WVESSGGGPSKRRMLTRPEKLLHAWAEQWQE-RKEKQTKWYTFVENPKH 253
Query 253 GVVKAGQLWDAAGIE--WAATSALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGL 310
+ D I+ WA T A +A+++AP+ T EI VP + ++D R A GL
Sbjct 254 MLADLADRIDDQRIDFPWAFTGATAANVVAPLLTSTEGAEIIVP-KGYAD--RMADVLGL 310
Query 311 QEIAGGRLILRFFPTPA--CARLTEQNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREA 368
+ ++ G + PA R + F + + +Y DL R ++ A+HLRE
Sbjct 311 KSVSKGANVTLIEREPASLLYRYRHSDHPAFFASAYI-LYLDLLDGRGRNKELADHLREQ 369
Query 369 M 369
+
Sbjct 370 L 370
>gi|336126149|ref|YP_004578105.1| hypothetical protein VAA_00992 [Vibrio anguillarum 775]
gi|335343866|gb|AEH35148.1| hypothetical protein VAA_00992 [Vibrio anguillarum 775]
Length=329
Score = 44.7 bits (104), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 28/116 (25%), Positives = 53/116 (46%), Gaps = 13/116 (11%)
Query 97 DILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIG 156
DIL +S R+ HD+ + + D+SG + TG + G P P+
Sbjct 80 DILFCNHLSDYLRQLCHDSNINYADDSGNVRV-----MTGDICIF--IGNRP-PIKQNKS 131
Query 157 WRRATLAVCEALLA-----NIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASA 207
+ T+ + + L A ++ T A++ +S+G +A+K+L +N H+A +
Sbjct 132 HQFMTIGIMKCLFALFAEKDLINETYANIASKADISVGMVTKAMKYLIENNHIAKS 187
>gi|315922842|ref|ZP_07919082.1| conserved hypothetical protein [Bacteroides sp. D2]
gi|313696717|gb|EFS33552.1| conserved hypothetical protein [Bacteroides sp. D2]
Length=331
Score = 43.1 bits (100), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 34/141 (25%), Positives = 63/141 (45%), Gaps = 10/141 (7%)
Query 114 DAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGWRRATLAVCEALLA--N 171
D + WVD++G DI + N T V+ KG+ + A A++ + L +
Sbjct 96 DNHISWVDKAGNCDIRHENL---TMKVVGQKGSAETKVTATGKINEASMKLILFFLQHPD 152
Query 172 IAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEA 231
+ + E G S+G+ +A L+ N +LA T +G R I R+ L++ + +
Sbjct 153 TINLSYREIQEKVGYSLGTITKAFDLLKANNYLAQ-TEKG----RKIAMREELIEWWQQQ 207
Query 232 ADKLRSPISISTGVLWRDPTA 252
++ P + + +R P A
Sbjct 208 YNEFLKPKLLVNRMAFRSPEA 228
>gi|171321090|ref|ZP_02910070.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5]
gi|171093647|gb|EDT38804.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5]
Length=374
Score = 42.7 bits (99), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 85/331 (26%), Positives = 135/331 (41%), Gaps = 54/331 (16%)
Query 71 AEFLVRWLTTGWPRQVAEAL-------HATSRPD----ILAAPTMSPGARKAAHDAGVGW 119
A+ V L +PR + EA+ A+ RP ++AA ++SPGAR GVG+
Sbjct 64 ADIAVETLRHAYPRDIREAIWRLDEYKLASERPQDLLTMVAAESLSPGARDMLRKRGVGY 123
Query 120 VDESGAADIHYRNTSTGT---TLVIETKGAPPAPLDARIGWRRATLAVCEALLAN----I 172
+ +G + +RN L K DAR V ALL + I
Sbjct 124 FERNGNLFLRWRNWFINIERPELASARKAVTALFTDARE-------MVVHALLEHRNEWI 176
Query 173 AGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPKSARLIVDRDALLDAYAEAA 232
G +A + +++ + + L+ LE+ S A RLI R LLDA+AE
Sbjct 177 TGGDLAYMTKSSSY---TCSVVLQELERREWCESVGAGRTIRRRLIKPRQ-LLDAWAEHW 232
Query 233 DKLRSP------ISISTGVLWRDPTAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEI 286
K + + VL + + K+G +D +A T +A+L AP+ T +
Sbjct 233 AKRKEQRTRWYSFADRPEVLLTHLSYKLSKSGVPFD-----YAFTGTAAANLYAPLLTSV 287
Query 287 APMEIYV-PGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWP 345
EI V PG + L + + A L+ R A L ++L+G R +P
Sbjct 288 DTAEIIVSPGHA-EQLAKTLQLKPADKGANVTLVER-----TGASLLFRDLRGVRPDEYP 341
Query 346 R-------VYADLRTAGVRGEDAAEHLREAM 369
+Y DL R ++ A+H+ E +
Sbjct 342 SYFASPFILYLDLLDGRGRNKELAQHVLERL 372
>gi|309813246|ref|ZP_07706967.1| VanW-like protein [Dermacoccus sp. Ellin185]
gi|308432842|gb|EFP56753.1| VanW-like protein [Dermacoccus sp. Ellin185]
Length=579
Score = 41.2 bits (95), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 32/107 (30%), Positives = 54/107 (51%), Gaps = 12/107 (11%)
Query 217 LIVDRDALLDAYAEAADKLRSPISISTGVLWRD--PTAGVVKAGQLWDAAGIEWAATSAL 274
+ VD DALL E + +++ I+ V+W+D P+ KAGQ DA+ ++ +AL
Sbjct 265 IKVDADALLAHVLERSTDMQNDIATDAKVVWKDGKPSVQPGKAGQQIDASKVQSVVAAAL 324
Query 275 SASLLA-----PMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGG 316
+ + +A PMQ ++ +I V S L +MA ++ GG
Sbjct 325 TGNHVANLPMKPMQPQVTEKDINV-----SSLPTTSMAHFESKLPGG 366
>gi|121610648|ref|YP_998455.1| hypothetical protein Veis_3722 [Verminephrobacter eiseniae EF01-2]
gi|121555288|gb|ABM59437.1| conserved hypothetical protein [Verminephrobacter eiseniae EF01-2]
Length=363
Score = 40.0 bits (92), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 69/285 (25%), Positives = 115/285 (41%), Gaps = 36/285 (12%)
Query 98 ILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDARI-- 155
+L A ++SPGA++ VG+ D G+ + ++ G L I+ PP L +
Sbjct 95 LLVAESISPGAKELLRSERVGYYDSGGSLYL----SAPGAYLYIDK--PPPKALAKSVRT 148
Query 156 --GWRRATLAVCEALLANIAG-PTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGP 212
RRA V ALL V + + +S +++Q L LE+ L A +GP
Sbjct 149 LFTGRRAQ--VLHALLVQHQNWFGVTELAQQATVSPATASQVLTELERFDWLV-ARGQGP 205
Query 213 KSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIEWAATS 272
R + + ALLDA+A+ +R V + Q++DA +++A +
Sbjct 206 GKERHLREPAALLDAWAKQLATIRPSPVRRYYVPGTKADTLATRIDQVFDAHEVQYAISH 265
Query 273 ALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPACARLT 332
+A AP + ++ + + V + + A G L R A +
Sbjct 266 EAAAQRYAPFISHVSQVRVRV------------LIGANADAAIGDLDARVVNEGANLGVI 313
Query 333 EQNLQG---FRSM---LW----PRVYADLRTAGVRGEDAAEHLRE 367
E G FR LW ++Y DL R ++ AEH R+
Sbjct 314 EAKSSGELLFREQIDGLWLASPIQIYLDLLRGEGRSKEMAEHFRK 358
>gi|187939938|gb|ACD39074.1| hypothetical protein PACL_0286 [Pseudomonas aeruginosa]
Length=369
Score = 39.3 bits (90), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 72/292 (25%), Positives = 115/292 (40%), Gaps = 41/292 (14%)
Query 98 ILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGW 157
++AA +SPGA+ D G + + SG+ +H + + IE A + +
Sbjct 96 LIAAAHLSPGAKSTLKDKGYAFFERSGSLFLH----TEKMLINIECPSTSSARSHSIDLF 151
Query 158 RRATLAVCEALLAN----IAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPK 213
A LL N +AG A + E S + + L+ LE+ S A G
Sbjct 152 TEARERAVHGLLKNASQWMAG---ADLAEQARTSTYTCSVVLQELERREWCESQGA-GRT 207
Query 214 SARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIE--WAAT 271
R +V LLD +AE + R P + + +L +G++ WA T
Sbjct 208 KRRRLVQPGKLLDEWAEHWRQ-RDVNRSRWYTFVEHPRLLIDRLSELVKESGVDFPWAFT 266
Query 272 SALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPACARL 331
A +A++ AP+ T + EI VP G I G L ++ P + +
Sbjct 267 GAAAANIYAPLLTHVDSAEIIVP-------------PGYATILGTTLNMKPAPKGSNVTI 313
Query 332 TEQ---NLQGFRS--------MLWPRV-YADLRTAGVRGEDAAEHLREAMTK 371
E+ +LQ FR P + Y DL R ++ A HLRE + +
Sbjct 314 VERGGASLQ-FRECSPLHSPYFASPYIQYLDLLDGRGRNKELAIHLRERLEQ 364
>gi|326435096|gb|EGD80666.1| dynein [Salpingoeca sp. ATCC 50818]
Length=4272
Score = 39.3 bits (90), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 28/92 (31%), Positives = 43/92 (47%), Gaps = 3/92 (3%)
Query 2 QASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSALEGAFRSEVRARRVNGPH 61
QAS+ T Q N + + F+ L+ M TV +LEG + + +
Sbjct 839 QASEITGQFNRMAQVLFSRPNGIEELMDQREFMKTVPETVKSLEGNIKRTLAEWDLLEAF 898
Query 62 RDVIVSLDGAEFLVRWLTTGWPRQVAEALHAT 93
VSL ++F +RW GWP+++AE LH T
Sbjct 899 N---VSLSDSDFALRWDVYGWPKKIAEQLHNT 927
>gi|241518234|ref|YP_002978862.1| hypothetical protein Rleg_5495 [Rhizobium leguminosarum bv. trifolii
WSM1325]
gi|240862647|gb|ACS60311.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii
WSM1325]
Length=360
Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 48/212 (23%), Positives = 90/212 (43%), Gaps = 16/212 (7%)
Query 98 ILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDAR--- 154
+L A T+SPGAR + VG+ D SG+ + S V+ K P + AR
Sbjct 91 LLMANTISPGARALLREEKVGYFDRSGSLYL-----SADNLFVLVEK--PASKQQARSLN 143
Query 155 ---IGWRRATLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARG 211
+G R L G V + E +S +++Q L LE+ ++S A G
Sbjct 144 NLFVGSRAQALHAVWTFKDQWFG--VHELAERASVSPTTASQVLIDLERREWVSSKGA-G 200
Query 212 PKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIEWAAT 271
P R++ + ALLD ++ ++ S + + + + ++ D G+ + +
Sbjct 201 PSKERILSNPRALLDEWSSYVASIKPKPLRSYYMRMTNIDEAIHEIDRICDETGVRYEIS 260
Query 272 SALSASLLAPMQTEIAPMEIYVPGRSWSDLRR 303
++ + AP ++I+ + + S LR+
Sbjct 261 GLMAGQIHAPHLSKISQIHCRIDHGGESLLRK 292
>gi|241589734|ref|YP_002979759.1| hypothetical protein Rpic12D_4871 [Ralstonia pickettii 12D]
gi|240868446|gb|ACS66105.1| conserved hypothetical protein [Ralstonia pickettii 12D]
Length=351
Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 55/210 (27%), Positives = 88/210 (42%), Gaps = 15/210 (7%)
Query 98 ILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGW 157
++AA +SPGAR+ + G+ + + +G ++H R + +E +A +
Sbjct 83 MVAAEALSPGAREQLRNRGIAYFERNG--NLHLRRHNWW--FDVERPPLSTTKREASTLF 138
Query 158 RRATLAVCEALLAN----IAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARGPK 213
A V ALL + + G +A + ++ S + + L+ LE+ S+ A
Sbjct 139 TDAREMVVHALLMHRGEWLTGSELARISQS---SQYTCSLVLQDLERREWCESSGAGRTL 195
Query 214 SARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIE--WAAT 271
RL R LLDA+AE K R V +P + D AG WA T
Sbjct 196 RRRLTQPRQ-LLDAWAEQWTK-RKEHRHRYYVFTPNPKNLIFDLSAQADKAGTNFPWAFT 253
Query 272 SALSASLLAPMQTEIAPMEIYVPGRSWSDL 301
+A+ AP+ T + EI VP DL
Sbjct 254 GTAAANDFAPLLTSVDTAEIIVPPNHTQDL 283
>gi|330883137|gb|EGH17286.1| hypothetical protein Pgy4_30395 [Pseudomonas syringae pv. glycinea
str. race 4]
Length=178
Score = 38.9 bits (89), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 43/163 (27%), Positives = 71/163 (44%), Gaps = 9/163 (5%)
Query 211 GPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIE--W 268
GP R++ + LL A+AE + R +P + D I+ W
Sbjct 14 GPSKRRMLTRPEKLLHAWAEQWQE-RKEKQTKWYTFVENPKHMLADLADRIDDQRIDFPW 72
Query 269 AATSALSASLLAPMQTEIAPMEIYVPGRSWSDLRRAAMAAGLQEIAGGRLILRFFPTPA- 327
A T A +A+++AP+ T EI VP + ++D R A GL+ ++ G + PA
Sbjct 73 AFTGATAANVVAPLLTSTEGAEIIVP-KGYAD--RMADVLGLKSVSKGANVTLIEREPAS 129
Query 328 -CARLTEQNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAM 369
R + F + + +Y DL R ++ A+HLRE +
Sbjct 130 LLYRYRHSDHPAFFASAYI-LYLDLLDGRGRNKELADHLREQL 171
>gi|271962174|ref|YP_003336370.1| hypothetical protein Sros_0604 [Streptosporangium roseum DSM
43021]
gi|270505349|gb|ACZ83627.1| hypothetical protein Sros_0604 [Streptosporangium roseum DSM
43021]
Length=359
Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 78/290 (27%), Positives = 118/290 (41%), Gaps = 45/290 (15%)
Query 98 ILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIGW 157
++ A +SP RK D+G+ ++D +G +IH + G L + +GA P W
Sbjct 92 VVVARYLSPPVRKQLSDSGLSYIDTTG--NIHLNVSRPG--LYVTDRGAERDP------W 141
Query 158 R-----RATL------AVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLAS 206
R R TL V ALL + T+ +V+ +S GS+ + ++FLE LA+
Sbjct 142 RGPGRPRGTLKGAPAAKVVRALLDHDRSWTIRQLVDFADVSTGSTYRVIEFLESE-DLAT 200
Query 207 ATARGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGI 266
+ G A I D ALL ++E +R+ S W P G+ + + +
Sbjct 201 RNSAG---AVAIPDWVALLRRWSEDYGFVRN----SRVTRWIAPR-GLSNLMERGAGSAV 252
Query 267 EWAATSALSASLLAPMQTEIAPMEIYVPGRSWSDLR-RAAMAAGLQEI----AGGRLILR 321
++A T L+A AP Y P RS A A L ++ AG ++L
Sbjct 253 QYAVTGTLAAVEWAP----------YAPARSAMIYTANAEQVAQLWDLRPADAGANVMLA 302
Query 322 FFPTPACARLTEQNLQGFRSMLWPRVYADLRTAGVRGEDAAEHLREAMTK 371
T Q G +V DL T R AE L E MT+
Sbjct 303 EPQIDVVFTRTLQAASGLTIAAPAQVAVDLMTGPGRSPSEAEALIEWMTR 352
>gi|335033691|ref|ZP_08527056.1| putative ftsK cell division protein [Agrobacterium sp. ATCC 31749]
gi|333794982|gb|EGL66314.1| putative ftsK cell division protein [Agrobacterium sp. ATCC 31749]
Length=891
Score = 38.1 bits (87), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 21/67 (32%), Positives = 36/67 (54%), Gaps = 2/67 (2%)
Query 40 TVSALEGAFR--SEVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPD 97
TV +E ++ S++ R ++G + V +LD E L R + TG+ RQ EA++ T D
Sbjct 606 TVREMEERYKKMSKIGVRNIDGFNSRVQQALDKGEILTRTVQTGFDRQTGEAMYETEEFD 665
Query 98 ILAAPTM 104
+ P +
Sbjct 666 LKPLPYI 672
>gi|159185366|ref|NP_355689.2| ftsK cell division protein [Agrobacterium tumefaciens str. C58]
gi|159140617|gb|AAK88474.2| putative ftsK cell division protein [Agrobacterium tumefaciens
str. C58]
Length=891
Score = 38.1 bits (87), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 21/67 (32%), Positives = 36/67 (54%), Gaps = 2/67 (2%)
Query 40 TVSALEGAFR--SEVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPD 97
TV +E ++ S++ R ++G + V +LD E L R + TG+ RQ EA++ T D
Sbjct 606 TVREMEERYKKMSKIGVRNIDGFNSRVQQALDKGEILTRTVQTGFDRQTGEAMYETEEFD 665
Query 98 ILAAPTM 104
+ P +
Sbjct 666 LKPLPYI 672
>gi|56476846|ref|YP_158435.1| hypothetical protein ebA2502 [Aromatoleum aromaticum EbN1]
gi|56312889|emb|CAI07534.1| hypothetical protein ebA2502 [Aromatoleum aromaticum EbN1]
Length=469
Score = 37.4 bits (85), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 49/201 (25%), Positives = 80/201 (40%), Gaps = 30/201 (14%)
Query 43 ALEGAFRSEVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVAEAL---------HAT 93
A+ G + ++ AR + G R V+V + + G PR AL A
Sbjct 145 AVSGRWEPDLIARLLVGGRRHVLV--------CEYKSNGQPRYARSALLELRDYVEQRAP 196
Query 94 SRPDILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDA 153
+ AP +SP R+ + GVG++D G A I + G + PA
Sbjct 197 QATPVFMAPYISPAVRQLCEEKGVGYLDLEGNARIAF-----GGVFIERMVADKPAAEQR 251
Query 154 RIG--WRRATLAVCEALLANIAGP-TVASVVEATGLSMGSSAQALKFLEKNGHLASATAR 210
+ +R + V A+L VA + E +G+S+G + + G + AR
Sbjct 252 ELKSLFRPKSAQVLRAMLREPGRAWRVAELSEISGVSLGHVSNV-----RAGLIDREWAR 306
Query 211 GPKSARLIVDRDALLDAYAEA 231
L+ DALLDA+ ++
Sbjct 307 ASDEGLLLSQPDALLDAWRDS 327
>gi|183980405|ref|YP_001848696.1| acyl-CoA dehydrogenase FadE2 [Mycobacterium marinum M]
gi|183173731|gb|ACC38841.1| acyl-CoA dehydrogenase FadE2 [Mycobacterium marinum M]
Length=416
Score = 37.4 bits (85), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 28/107 (27%), Positives = 48/107 (45%), Gaps = 13/107 (12%)
Query 63 DVIVSLDGAEFLV---RWLTTGW--PRQVAEALHATSRPD--------ILAAPTMSPGAR 109
+ +S DGA++++ +W T+G PR + + PD ++ PT +PG
Sbjct 155 ETTISRDGADYIINGRKWWTSGAADPRCKILIVMGRTNPDAAAHQQQSMILVPTDTPGVT 214
Query 110 KAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDARIG 156
A GW D+ G ++ Y N T ++ +G A AR+G
Sbjct 215 IARSTPVFGWQDQHGHCEVVYDNVRVPATNLLGEEGTGFAIAQARLG 261
>gi|330340412|ref|NP_001179598.2| neuron navigator 3 [Bos taurus]
gi|297474353|ref|XP_002687219.1| PREDICTED: neuron navigator 3 [Bos taurus]
gi|296488026|gb|DAA30139.1| neuron navigator 3 [Bos taurus]
Length=2363
Score = 36.6 bits (83), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 24/73 (33%), Positives = 38/73 (53%), Gaps = 7/73 (9%)
Query 143 TKGAPPAPLDARIGWRRATLAVCEALL-------ANIAGPTVASVVEATGLSMGSSAQAL 195
TKG+P I +A+ + C A L A++AG +V VV+ +G SMG+ A L
Sbjct 557 TKGSPSQSFPKPITTEKASTSSCPAPLEGRETSHASLAGSSVGLVVQGSGQSMGNGAVQL 616
Query 196 KFLEKNGHLASAT 208
+++ H +AT
Sbjct 617 PQQQQHSHPNTAT 629
>gi|116254535|ref|YP_770371.1| hypothetical protein pRL100076 [Rhizobium leguminosarum bv. viciae
3841]
gi|115259183|emb|CAK10301.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length=360
Score = 36.2 bits (82), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 47/212 (23%), Positives = 88/212 (42%), Gaps = 16/212 (7%)
Query 98 ILAAPTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAPLDAR--- 154
+L A T+SPGAR +G+ D SG+ + S V+ K P + AR
Sbjct 91 LLMADTISPGARALLRQEKIGYFDRSGSLYL-----SADNLFVLVEK--PASKQQARSLN 143
Query 155 ---IGWRRATLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEKNGHLASATARG 211
+G R L G V + E +S +++Q L LE+ ++S A G
Sbjct 144 NLFVGSRAKALHAVWTFKDQWFG--VHELAERASVSPTTASQVLIELERREWVSSKGA-G 200
Query 212 PKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDPTAGVVKAGQLWDAAGIEWAAT 271
P R++ + ALLD ++ ++ S + + + + ++ D G + +
Sbjct 201 PSKERILSNPRALLDEWSSYVASIKPKPLRSYYMRMTNIDEAIREIDRICDETGARYEIS 260
Query 272 SALSASLLAPMQTEIAPMEIYVPGRSWSDLRR 303
++ + AP ++I+ + + S LR+
Sbjct 261 GLMAGQIHAPHLSKISQIHCRIDHGGESLLRK 292
Lambda K H
0.318 0.130 0.396
Gapped
Lambda K H
0.267 0.0410 0.140
Effective search space used: 706673976500
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
Posted date: Sep 5, 2011 4:36 AM
Number of letters in database: 5,219,829,388
Number of sequences in database: 15,229,318
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40