ENH: add global optimizer Shuffled Complex Evolution (SCE) to SciPy.optimize. #18436

mcuntz · 2023-05-06T22:58:43Z

What does this implement/fix?

This adds the global optimizer Shuffled Complex Evolution (SCE) to scipy.optimize:

Duan, Sorooshian and Gupta (1992) Effective and efficient global optimization for conceptual rainfall-runoff models, Water Resour Res 28, 1015-1031, doi: 10.1029/91WR02985

SCE is very popular in the hydrologic community and had performed well in Andrea Gavana's Global Optimization Benchmarks (https://infinity77.net/global_optimization/).

Additional information

The current implementation has some nice features that are missing from other optimizers in scipy.optimize:

It can write out restart files so that one can continue an optimisation if the original run was interrupted. This happens, for example, on compute clusters where a job has an allocated run time.
Parameters can be sampled in different ways (open and closed intervals, logarithmic). Sampling a parameter with a normal distribution if it can vary orders of magnitude gives suboptimal solutions. It is much better to sample the parameter in log-space (e.g. Mai, J Hydrolo 2023, 10.1016/j.jhydrol.2023.129414).
It is also practical that one can pass args and kwargs to the objective function.

The practical coding (function, class, OptimizeResult) follows closely the implementation of the differential evolution code in scipy.optimize.

andyfaff · 2023-05-07T02:29:25Z

Thank you for this contribution @mcuntz. When introducing functionality of this kind we ask that you bring it up for discussion on the scipy-dev mailing list first, this is to ensure that it has greater visibility in the project. It also reduces work, because the maintainer team can advise on how to go about the PR - we've had misguided PRs be submitted that were not going to be merged (for one reason or another), which is a waste of time for the author.

Before adding new global optimisers we ask that the functionality goes through the benchmark process first. I see from the changeset that you've modified the global optimiser benchmark. Can you run sce against the benchmarks and report back with the stats please? Please do 100 repeats.

The review process for adding a new minimiser can be drawn out. This is because ongoing maintenance cost needs to be low, which means that code-style, structure, etc, needs to be really good. If you feel as if the review is never ending please don't get down heartened - 1200 lines of code takes a while to get through.

We'll probably have to come up with a name other than sce, purely because it's not obvious what it is.

Lastly, there is code here that has been committed by accident, namely the PROPACK and boost files

mcuntz · 2023-05-09T20:29:01Z

Thanks @andyfaff for the encouraging words.

I renamed it from sce to shuffled_complex_evolution.

I ran the global benchmark suite. That was not obvious given that the docs and readme are out of date. I had to set export SCIPY_XSLOW=1 on the command line, otherwise it would not work. I finally ran python dev.py bench -t optimize.BenchGlobal. I (try to) attach the output to this comment.
BenchGlobal.txt
While investigating why shuffled_complex_evolution did not work with the function Deb03, I found that its bounds are wrong. They were set to self._bounds = list(zip([-1.0] * self.N, [1.0] * self.N)). But there is x**0.75 in the function so that does not work with x<0. Setting self._bounds = list(zip([0.0] * self.N, [1.0] * self.N)) works.
I also came across that the testsuite stopped at Mishra10. The output was of type int64 and json did not like that to dump that. So setting newres.max_obj = np.max(funs).astype(float) (same for min_obj) solved that.

The PROPACK and boost_math must have been update when I rebased with scipy. I am sorry but I do not know how to remove these again. (I cannot simply delete them, isn't it?) git submodule update --init given in forums did not work. I also tried git submodule deinit -f . ; git submodule update --init (which did a lot of things) but to no avail. It check out the new version again, e.g. 109a814... for boost_math:

+++ b/scipy/_lib/boost_math
@@ -1 +1 @@
-Subproject commit 298a243ccd3639b6eaa59bcdab7ab9d5f008fb36
+Subproject commit 109a814e89f77ff8a3fc8f0391f6b35a12640669

Lastly, I will write an e-mail to the scipy-dev mailing list now.

dschmitz89 · 2023-05-09T21:06:59Z

First of all, thanks for this promising PR!

I ran the global benchmark suite. That was not obvious given that the docs and readme are out of date. I had to set export SCIPY_XSLOW=1 on the command line, otherwise it would not work. I finally ran python dev.py bench -t optimize.BenchGlobal. I (try to) attach the output to this comment. BenchGlobal.txt While investigating why shuffled_complex_evolution did not work with the function Deb03, I found that its bounds are wrong. They were set to self._bounds = list(zip([-1.0] * self.N, [1.0] * self.N)). But there is x**0.75 in the function so that does not work with x<0. Setting self._bounds = list(zip([0.0] * self.N, [1.0] * self.N)) works. I also came across that the testsuite stopped at Mishra10. The output was of type int64 and json did not like that to dump that. So setting newres.max_obj = np.max(funs).astype(float) (same for min_obj) solved that.

Another thanks for fixing those issues in the testsuite. It's quite encouraging that benchmarks run succesfully.

To make the benchmark output more easy to digest, could you reuse the script we used for benchmarking DIRECT here? With some plotting on top, we should get plots like these on top of the DIRECT PR. That said, the plots could also be created from the benchmark output text file.

The PROPACK and boost_math must have been update when I rebased with scipy. I am sorry but I do not know how to remove these again. (I cannot simply delete them, isn't it?) git submodule update --init given in forums did not work. I also tried git submodule deinit -f . ; git submodule update --init (which did a lot of things) but to no avail. It check out the new version again, e.g. 109a814... for boost_math:
+++ b/scipy/_lib/boost_math
@@ -1 +1 @@
-Subproject commit 298a243ccd3639b6eaa59bcdab7ab9d5f008fb36
+Subproject commit 109a814e89f77ff8a3fc8f0391f6b35a12640669

I ran into that multiple times as well. Merging main and then using git submodule update --init usually fixed it for me.

rkern · 2023-05-09T22:24:42Z

I would probably drop the restart facility for the first PR (or release). Restarting is something that, in principle, could be added onto many, if not all, of the optimizers. If we're going to add such capabilities to one, I think it behooves us to at least think about the interface such that can be reused for those others ones, too, at least in terms of the call signature. We don't want to get into the position of having slightly different conventions and argument names for each one. We could definitely add the restart facility just to this one at first, but we should first think about a design that would be likely to work for all of them.

For example, I would probably enforce the use of only one file. Exposing two filenames in the signature is a quirk of this particular implementation that wouldn't be needed for other optimizers. For that matter, there isn't a strict need for 2 files even for this optimizer; you can add non-.npy files like JSON-formatted files to an .npz just fine.

andyfaff · 2023-05-09T22:28:49Z

@rkern, thanks for the comments. During the review process we'll definitely get to that. At the moment I'd like to know if you think we should add the minimiser to the global optimizer stable. There's a post on scipy-dev that is testing the waters for this.

rkern · 2023-05-09T23:32:27Z

Yeah, I mentioned it just because it was listed as a feature over the existing solvers. I'd propose setting those aside for evaluation purposes.

Looking at the benchmark results, I'm not seeing a strong standout role for it given SHGO. Maybe it shines against SHGO on the ND=50 problems, though.

mcuntz · 2023-05-21T21:03:36Z

@dschmitz89 I could not find the script that you used for DIRECT. Here my script to plot mean_nfev and nsuccess (i.e. nsuccess/ntrials*100) from global-bench-results.json:

#!/usr/bin/env python
import json
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

# read output from
#     export SCIPY_XSLOW=1 ; python dev.py bench -t optimize.BenchGlobal
# in subdirectory benchmarks/
ifile = 'global-bench-results.json'
with open(ifile, 'r') as f:
    res = json.load(f)

# pandas DataFrame with global optimizer as column 'go'
# and function as column 'func'
res2 = {k: pd.DataFrame(v).T.reset_index(names=['go']) for k, v in res.items()}
df = pd.concat(res2, axis=0).reset_index(level=0, names='func')

# mean per optimizer
dfm = df.groupby('go').mean()
dfm.reset_index(inplace=True, names='go')

# plot bar chart
ngo = dfm.shape[0]
xticks = np.arange(ngo)

fig = plt.figure()

metric1 = 'mean_nfev'
ax = fig.subplots()
b1 = ax.bar(xticks, height=dfm[metric1], width=-0.4, align='edge',
            tick_label=dfm['go'], label=metric1)
ax.set_ylabel(metric1)

metric2 = 'nsuccess'
ax2 = plt.twinx()
b2 = ax2.bar(xticks, height=dfm[metric2] / dfm['ntrials'] * 100., width=0.4,
             align='edge', color='red', label=metric2)
ax2.set_ylabel(metric2 + ' (%)')

plt.legend([b1, b2], [metric1, metric2], loc='upper center')

fig.savefig('global-bench-results.png')

giving:

One can see that SCE has few function evaluations with a rather high success rate. However, SHGO has even less function evaluations.
@rkern So I looked for high-dimensional problems. There is only one 9- and one 10-dimensional problem in the benchmarks.
print(df[df['ndim'] == 9].groupby('go').mean()) gives

          mean_nfev  ndim  nfail  nsuccess  ntrials
go                                                                                                                   
DA      18382.5   9.0   67.0      33.0    100.0
DE       1514.5   9.0   91.0       9.0    100.0
DIRECT  1778.0   9.0    1.0       0.0      1.0
SCE      1225.1   9.0   84.0      16.0    100.0
SHGO     1434.0   9.0    1.0       0.0      1.0
basinh.  5452.3   9.0  100.0       0.0    100.0

and print(df[df['ndim'] == 10].groupby('go').mean()) gives

           mean_nfev ndim  nfail  nsuccess  ntrials
go                                                                                                                  
DA      20284.92  10.0    0.0     100.0    100.0
DE      6194.80  10.0    0.0     100.0    100.0
DIRECT  534.00  10.0    1.0       0.0      1.0
SCE     1086.64  10.0    0.0     100.0    100.0
SHGO    1240.00  10.0    0.0       1.0      1.0
basinh. 14402.86  10.0    0.0     100.0    100.0

(Note that SHGO and DIRECT are no stochastic optimizers (according to the benchmark) and make only one run per benchmark function.) SCE needs less function evaluations than SHGO in this case. But SHGO also fails for ndim=9 so it might stop from some criteria.

I also rebased with scipy main and did git submodule update --init but the PROPACK files are still there :-(

rgommers · 2023-05-22T08:58:08Z

I pushed an update that removed the PROPACK and boost-math submodule updates.

rgommers · 2023-06-11T19:35:14Z

My impression is that the benchmark results do make a case for inclusion of the SCE solver in scipy.optimize. It improves on DIRECT overall, and has a higher success rate than SHGO.

dschmitz89

A first quick review.

I cannot really review the math in detail but the API is usually the bigger discussion point.

dschmitz89 · 2023-07-09T09:57:07Z

benchmarks/benchmarks/go_benchmark_functions/go_funcs_D.py

+            factor1 = 1.0 - (abs(num / den)) ** 5
+            factor2 = 2 + (x[0] - 7.0) ** 2 + 2 * (x[1] - 7.0) ** 2


Are these and other similar changes to the benchmark functions necessary? Often an explicit double is used in power operations like here to convert the output to doubles. That is especially important for scipy's optimizers that are implemented in C or Fortran. An objective returning integers crashed them.

dschmitz89 · 2023-07-09T09:57:43Z

benchmarks/benchmarks/go_benchmark_functions/go_funcs_M.py

+        # self.global_optimum = [[-9.99378322, -9.99918927]]
+        # self.fglob = -0.19990562


Suggested change

# self.global_optimum = [[-9.99378322, -9.99918927]]

# self.fglob = -0.19990562

Good finding, the new optimum.

dschmitz89 · 2023-07-09T10:00:25Z

scipy/optimize/tests/test__shuffled_complex_evolution.py

+        assert_raises(TypeError, shuffled_complex_evolution, func, x0, bounds)
+        bounds = [(-1, 1), (-1, 1)]
+        assert_raises(ValueError, shuffled_complex_evolution, func, x0,
+                      bounds, sampling='unknown')
+        # test correct bool string
+        assert_raises(ValueError, _strtobool, 'Ja')
+        # test no initial population found
+        func = deb03
+        x0 = [-0.5, -0.5]
+        bounds = [(-1, 0), (-1, 0.)]  # should be (0, 1) to work
+        assert_raises(ValueError, shuffled_complex_evolution, func, x0,
+                      bounds, printit=1)


Please test that the correct error messages are shown using

with pytest.raises(ValueError, match="error messsage")

Other necessary tests include objectives that:

return NaN

return inf

dschmitz89 · 2023-07-09T10:02:14Z