TST introducing the random_seed fixture #22749

Merged
merged 21 commits into scikit-learn:main from the random-seed-fixture branch
Mar 14, 2022

Conversation

ogrisel
Member

@ogrisel ogrisel commented Mar 10, 2022

Closes #13913

This is a new fixture, similar to what is being developed in #22690, that will make it possible to ensure that (some of) our tests do not rely on a specific value of the random seed, while giving full control to reproduce CI runs locally or to run the tests with a specific seed of one's choice.
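
For illustration, here is a minimal sketch of what such an env-var-driven fixture could look like. It is an assumption-laden sketch rather than the code of this PR: only the fixture name random_seed and the SKLEARN_TESTS_GLOBAL_RANDOM_SEED variable come from the PR, the seed range 0-99 mirrors the runs below, and the default used when the variable is not set is discussed later in this thread.

# Illustrative sketch only -- not the implementation merged in this PR.
import os

import pytest


def _parse_seed_spec(value):
    """Map the SKLEARN_TESTS_GLOBAL_RANDOM_SEED value to a list of seeds."""
    if value == "all":
        return list(range(100))  # run every admissible seed
    if "-" in value:
        start, stop = value.split("-")
        return list(range(int(start), int(stop) + 1))  # e.g. "0-2" -> [0, 1, 2]
    return [int(value)]  # a single explicit seed, e.g. "42"


# "42" is only a placeholder default here; the actual default behavior is
# discussed further down this thread.
_SEEDS = _parse_seed_spec(os.environ.get("SKLEARN_TESTS_GLOBAL_RANDOM_SEED", "42"))


@pytest.fixture(params=_SEEDS)
def random_seed(request):
    """Parametrize a requesting test over the selected seed(s)."""
    return request.param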

The docstring of the fixture should be quite explicit. I tested it locally and it seems to work as expected:

dev ❯ SKLEARN_TESTS_GLOBAL_RANDOM_SEED='0-2' pytest -v -k "test_kmeans_elkan_results and 0.01-dense-normal" sklearn/cluster/tests/
======================================================================================== test session starts =========================================================================================
platform darwin -- Python 3.9.7, pytest-6.2.5, py-1.10.0, pluggy-0.13.1 -- /Users/ogrisel/mambaforge/envs/dev/bin/python
cachedir: .pytest_cache
To reproduce this test run, set the following environment variable:
    SKLEARN_TESTS_GLOBAL_RANDOM_SEED="0-2"
rootdir: /Users/ogrisel/code/scikit-learn, configfile: setup.cfg
plugins: anyio-3.3.0, xdist-2.3.0, timeout-1.4.2, forked-1.3.0, cov-3.0.0
collected 548 items / 545 deselected / 3 selected                                                                                                                                                    

sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[0-0.01-dense-normal] PASSED                                                                                                   [ 33%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[1-0.01-dense-normal] PASSED                                                                                                   [ 66%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[2-0.01-dense-normal] PASSED                                                                                                   [100%]

================================================================================= 3 passed, 545 deselected in 0.25s ==================================================================================

dev ❯ SKLEARN_TESTS_GLOBAL_RANDOM_SEED='42' pytest -v -k "test_kmeans_elkan_results and 0.01-dense-normal" sklearn/cluster/tests/
======================================================================================== test session starts =========================================================================================
platform darwin -- Python 3.9.7, pytest-6.2.5, py-1.10.0, pluggy-0.13.1 -- /Users/ogrisel/mambaforge/envs/dev/bin/python
cachedir: .pytest_cache
To reproduce this test run, set the following environment variable:
    SKLEARN_TESTS_GLOAL_RANDOM_SEED="42"
rootdir: /Users/ogrisel/code/scikit-learn, configfile: setup.cfg
plugins: anyio-3.3.0, xdist-2.3.0, timeout-1.4.2, forked-1.3.0, cov-3.0.0
collected 516 items / 515 deselected / 1 selected                                                                                                                                                    

sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[42-0.01-dense-normal] PASSED                                                                                                  [100%]

================================================================================= 1 passed, 515 deselected in 0.19s ==================================================================================

dev ❯ pytest -v -k "test_kmeans_elkan_results and 0.01-dense-normal" sklearn/cluster/tests/
======================================================================================== test session starts =========================================================================================
platform darwin -- Python 3.9.7, pytest-6.2.5, py-1.10.0, pluggy-0.13.1 -- /Users/ogrisel/mambaforge/envs/dev/bin/python
cachedir: .pytest_cache
To reproduce this test run, set the following environment variable:
    SKLEARN_TESTS_GLOAL_RANDOM_SEED="94"
rootdir: /Users/ogrisel/code/scikit-learn, configfile: setup.cfg
plugins: anyio-3.3.0, xdist-2.3.0, timeout-1.4.2, forked-1.3.0, cov-3.0.0
collected 516 items / 515 deselected / 1 selected                                                                                                                                                    

sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[94-0.01-dense-normal] PASSED                                                                                                  [100%]

================================================================================= 1 passed, 515 deselected in 0.22s ==================================================================================

dev ❯ SKLEARN_TESTS_GLOBAL_RANDOM_SEED='all' pytest -v -k "test_kmeans_elkan_results and 0.01-dense-normal" sklearn/cluster/tests/
======================================================================================== test session starts =========================================================================================
platform darwin -- Python 3.9.7, pytest-6.2.5, py-1.10.0, pluggy-0.13.1 -- /Users/ogrisel/mambaforge/envs/dev/bin/python
cachedir: .pytest_cache
To reproduce this test run, set the following environment variable:
    SKLEARN_TESTS_GLOAL_RANDOM_SEED="all"
rootdir: /Users/ogrisel/code/scikit-learn, configfile: setup.cfg
plugins: anyio-3.3.0, xdist-2.3.0, timeout-1.4.2, forked-1.3.0, cov-3.0.0
collected 2100 items / 2000 deselected / 100 selected                                                                                                                                                

sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[0-0.01-dense-normal] PASSED                                                                                                   [  1%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[1-0.01-dense-normal] PASSED                                                                                                   [  2%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[2-0.01-dense-normal] PASSED                                                                                                   [  3%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[3-0.01-dense-normal] PASSED                                                                                                   [  4%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[4-0.01-dense-normal] PASSED                                                                                                   [  5%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[5-0.01-dense-normal] PASSED                                                                                                   [  6%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[6-0.01-dense-normal] PASSED                                                                                                   [  7%]
[...]

TODO:

  • find out if we can make the tests pass even if the plugin is not installed, or maybe force the activation of the plugin in sklearn/conftest.py (see the sketch after this list);
  • document the new environment variable;
  • decide and implement the renaming of the fixture based on the discussion in #22749 (comment).
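
Regarding the first item, two standard pytest mechanisms could make the fixture available without relying on an externally installed plugin. The sketch below assumes the fixture lives in sklearn/tests/random_seed.py; it only illustrates the options, not necessarily what this PR ends up doing.

# Option 1: register the module as a local plugin from the rootdir conftest.py.
pytest_plugins = ["sklearn.tests.random_seed"]

# Option 2: re-export the fixture from sklearn/conftest.py so that pytest
# discovers it in the conftest namespace (alternative to option 1):
# from sklearn.tests.random_seed import random_seed  # noqa: F401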

@ogrisel ogrisel changed the title from "TST introduce the random_seed fixture" to "TST introducing the random_seed fixture" on Mar 10, 2022
@ogrisel
Member Author

ogrisel commented Mar 10, 2022

If we agree that this is a good idea, I will open a meta-issue to progressively convert existing tests to use that fixture.

Each time, that will require running the test locally with:

SKLEARN_TESTS_GLOBAL_RANDOM_SEED="all" pytest -v -k test_your_test_name

If all seeds pass, then fine: it's an easy PR.

If that's not the case, it will reveal that the test is too brittle to use this fixture. This will require investigating whether the test can be made more robust, for instance by changing some hyper-parameters or increasing the size of a dataset.

If it's still too hard to make the test pass for any admissible seed, then we might want to keep it with a fixed seed, add a comment acknowledging that the test is quite sensitive to the specific choice of the seed, and move on to the next test.
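
For illustration, a converted test takes the fixture as an argument and derives all of its randomness from the provided seed. The example below is made up for the sketch (the dataset, estimator and assertion are not taken from scikit-learn's test suite); the point is that the assertion has to hold for every admissible seed, not just one magic value.

import numpy as np

from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import adjusted_rand_score


def test_kmeans_recovers_well_separated_blobs(random_seed):
    # All randomness is derived from the fixture-provided seed.
    centers = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
    X, y = make_blobs(
        n_samples=300, centers=centers, cluster_std=1.0, random_state=random_seed
    )
    km = KMeans(n_clusters=3, n_init=10, random_state=random_seed).fit(X)
    # The clusters are 10 standard deviations apart, so any seed should recover them.
    assert adjusted_rand_score(y, km.labels_) > 0.95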

@ogrisel
Member Author

ogrisel commented Mar 10, 2022

Hum, we have a bad interaction between the use of pytest-xdist and this new fixture:

Different tests were collected between gw0 and gw1. The difference is:

https://dev.azure.com/scikit-learn/scikit-learn/_build/results?buildId=39225&view=logs&j=dde5042c-7464-5d47-9507-31bdd2ee0a3a&t=4bd2dad8-62b3-5bf9-08a5-a9880c530c94&l=205

Instead of picking a seed completely at random, I will have to make it depend on the year and the day of the year, for instance, so that all xdist workers collect the same parametrization.

@ogrisel
Member Author

ogrisel commented Mar 10, 2022

I think I chose the right day to write this PR:

>>> from random import Random
>>> from datetime import datetime
>>> RANDOM_SEED_RANGE = list(range(100))
>>> rng = Random(int(datetime.now().strftime("%Y%j")))
>>> rng.choice(RANDOM_SEED_RANGE)
42

Member

@jjerphan jjerphan left a comment

Let's pytest.fixture all the tests!

Member

@jjerphan jjerphan left a comment

LGTM if the CI passes.

@jeremiedbb
Member

Great, the report header shows up on Azure! But now CircleCI breaks :'(

@ogrisel
Member Author

ogrisel commented Mar 11, 2022

Indeed, I was afraid this might happen. It will also break when people run the tests from outside of the source folder, for instance when testing the installed wheels in [cd build] runs.

@ogrisel
Member Author

ogrisel commented Mar 11, 2022

I have edited the description of the PR with a TODO list of things to finalize before considering the merge of this PR.

@thomasjpfan
Member

I think the random seed should be fixed by default. With a non-fixed seed, tests in PRs can fail randomly because a different seed is used.

@ogrisel
Member Author

ogrisel commented Mar 11, 2022

I think the random seed should be fixed by default. With a non-fixed seed, tests in PRs can fail randomly because a different seed is used.

This is the goal of this PR: to make sure that, over time, our CI explores all the seeds that we are expected to support, so that regressions or test code updates do not reintroduce seed-sensitivity by mistake without us realizing it.

If that happens, it's easy to read the CI log to identify the test and run it locally with the SKLEARN_TESTS_GLOBAL_RANDOM_SEED="all" variable to make it seed-insensitive again.

@jeremiedbb
Member

I would not want PRs to have failing tests because of a poor random seed that is unrelated to the PR itself.

That should never happen since we will add the fixture to a test iff the test passes when setting "all". Then, if a PR does not touch the test or the code that this test covers, it won't fail on any seed.

@ogrisel
Member Author

ogrisel commented Mar 11, 2022

I would not want PRs to have failing tests because of a poor random seed that is unrelated to the PR itself. That feels like a poor contributor experience.

Indeed that's a valid point.

The other issue with a non-fixed seed is when an external packager such as Debian runs our tests and happens to use a seed that fails. A non-fixed seed can also block our own release process, when wheel building fails because of a random seed.

Also a valid point.

TL;DR: I'm advocating that if SKLEARN_TESTS_GLOBAL_RANDOM_SEED is not set, we use SKLEARN_TESTS_GLOBAL_RANDOM_SEED=42 by default.

Let's instead have a dedicated explicit value, SKLEARN_TESTS_GLOBAL_RANDOM_SEED="any", that will pick a random seed in the admissible range.

This will not be the default, but I will configure the CI to use it in the scheduled nightly builds.
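
For concreteness, here is a sketch of the behavior decided above (illustrative only; the parsing in the merged code may differ). The day-based draw mirrors the earlier comment so that all pytest-xdist workers of a given run agree on the chosen seed.

# Sketch of the decided behavior -- not the exact merged code.
import os
from datetime import datetime
from random import Random

RANDOM_SEED_RANGE = list(range(100))  # the admissible seeds


def _selected_seeds():
    value = os.environ.get("SKLEARN_TESTS_GLOBAL_RANDOM_SEED")
    if value is None:
        return [42]  # fixed default: deterministic PR and regular CI builds
    if value == "any":
        # One seed per day, identical across xdist workers of the same run.
        daily_rng = Random(int(datetime.now().strftime("%Y%j")))
        return [daily_rng.choice(RANDOM_SEED_RANGE)]
    if value == "all":
        return RANDOM_SEED_RANGE
    if "-" in value:
        start, stop = value.split("-")
        return list(range(int(start), int(stop) + 1))
    return [int(value)]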

@ogrisel
Member Author

ogrisel commented Mar 11, 2022

That should never happen since we will add the fixture to a test iff the test passes when setting "all". Then, if a PR does not touch the test or the code that this test covers, it won't fail on any seed.

It could happen if a seed-sensitivity regression is silently introduced in main and there are not enough builds to detect it before a contributor's unrelated PR hits the regression for the first time.

@ogrisel
Member Author

ogrisel commented Mar 11, 2022

Done. This is now silent by default and only displays the seed in the pytest report header when the global seed randomization is enabled.

I tested it locally and it works, but it should now be disabled in the PRs.
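
For reference, a guess at how such a conditional report header could be produced with the standard pytest_report_header hook (sketch only; the merged code may differ, and the daily draw again mirrors the earlier comment):

# Sketch: only advertise the seed when it was drawn at random for this run.
import os
from datetime import datetime
from random import Random


def pytest_report_header(config):
    if os.environ.get("SKLEARN_TESTS_GLOBAL_RANDOM_SEED") != "any":
        return None  # silent by default and for explicitly chosen seeds
    seed = Random(int(datetime.now().strftime("%Y%j"))).choice(range(100))
    return [
        "To reproduce this test run, set the following environment variable:",
        f'    SKLEARN_TESTS_GLOBAL_RANDOM_SEED="{seed}"',
    ]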

@ogrisel
Member Author

ogrisel commented Mar 11, 2022

dev ❯ SKLEARN_TESTS_GLOBAL_RANDOM_SEED="any" pytest -v -k "test_kmeans_elkan_results and 1e-08-dense-blobs" --pyargs sklearn.cluster.tests
===================================================================== test session starts =====================================================================
platform darwin -- Python 3.9.7, pytest-7.0.1, pluggy-0.13.1 -- /Users/ogrisel/mambaforge/envs/dev/bin/python
cachedir: .pytest_cache
To reproduce this test run, set the following environment variable:
    SKLEARN_TESTS_GLOBAL_RANDOM_SEED="27"
See: https://scikit-learn.org/dev/computing/parallelism.html#environment-variables
rootdir: /Users/ogrisel/code/scikit-learn, configfile: setup.cfg
plugins: anyio-3.3.0, xdist-2.3.0, timeout-1.4.2, forked-1.3.0, cov-3.0.0
collected 516 items / 515 deselected / 1 selected                                                                                                             

sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[27-1e-08-dense-blobs] PASSED                                                           [100%]

============================================================== 1 passed, 515 deselected in 0.24s ==============================================================
dev ❯ SKLEARN_TESTS_GLOBAL_RANDOM_SEED="0-9" pytest -v -k "test_kmeans_elkan_results and 1e-08-dense-blobs" --pyargs sklearn.cluster.tests
===================================================================== test session starts =====================================================================
platform darwin -- Python 3.9.7, pytest-7.0.1, pluggy-0.13.1 -- /Users/ogrisel/mambaforge/envs/dev/bin/python
cachedir: .pytest_cache
rootdir: /Users/ogrisel/code/scikit-learn, configfile: setup.cfg
plugins: anyio-3.3.0, xdist-2.3.0, timeout-1.4.2, forked-1.3.0, cov-3.0.0
collected 660 items / 650 deselected / 10 selected                                                                                                            

sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[0-1e-08-dense-blobs] PASSED                                                            [ 10%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[1-1e-08-dense-blobs] PASSED                                                            [ 20%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[2-1e-08-dense-blobs] PASSED                                                            [ 30%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[3-1e-08-dense-blobs] PASSED                                                            [ 40%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[4-1e-08-dense-blobs] PASSED                                                            [ 50%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[5-1e-08-dense-blobs] PASSED                                                            [ 60%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[6-1e-08-dense-blobs] PASSED                                                            [ 70%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[7-1e-08-dense-blobs] PASSED                                                            [ 80%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[8-1e-08-dense-blobs] PASSED                                                            [ 90%]
sklearn/cluster/tests/test_k_means.py::test_kmeans_elkan_results[9-1e-08-dense-blobs] PASSED                                                            [100%]

============================================================= 10 passed, 650 deselected in 0.25s ==============================================================

Member

@jeremiedbb jeremiedbb left a comment

LGTM !

thomasjpfan and others added 4 commits March 11, 2022 14:07
@ogrisel
Member Author

ogrisel commented Mar 13, 2022

We could also define SKLEARN_TESTS_GLOBAL_RANDOM_SEED=0-2, SKLEARN_TESTS_GLOBAL_RANDOM_SEED=3-5, and so on in our different CI entries for the regular builds (including PRs). The slowest CI configs would only define a single SKLEARN_TESTS_GLOBAL_RANDOM_SEED=X value, different for each slow CI config, and we would need one set to 42 as well to make sure that the default seed is also tested.

This might be a good trade-off between computational speed and protection against seed-sensitivity.

@ogrisel
Member Author

ogrisel commented Mar 13, 2022

Weird: the import lines of the new fixture are not covered because the pytest coverage plugin kicks in after this plugin has been imported:

https://app.codecov.io/gh/scikit-learn/scikit-learn/compare/22749/diff

:)

Member

@thomasjpfan thomasjpfan left a comment

LGTM

I like the idea of setting different random seeds for different CI configurations.

@ogrisel
Member Author

ogrisel commented Mar 14, 2022

I pushed that. For now, I decided not to use seed ranges, in order to keep the CI build times unchanged.

I will merge if green.

@ogrisel ogrisel merged commit d3429ca into scikit-learn:main Mar 14, 2022
@ogrisel ogrisel deleted the random-seed-fixture branch March 14, 2022 09:41
@ogrisel
Member Author

ogrisel commented Mar 14, 2022

Merged! Thanks all for the reviews and contributions to improve this fixture.

I will open an issue to give guidelines to start using this fixture in the scikit-learn code base.

glemaitre pushed a commit to glemaitre/scikit-learn that referenced this pull request Apr 6, 2022
Co-authored-by: Julien Jerphanion <git@jjerphan.xyz>
Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>
Co-authored-by: Jérémie du Boisberranger <34657725+jeremiedbb@users.noreply.github.com>

Successfully merging this pull request may close these issues.

Use modifiable global random state in tests