CI test windows://python/ray/tests:test_actor_retry is flaky #43845

Open
can-anyscale opened this issue Mar 9, 2024 · 91 comments
Labels
- bug: Something that is supposed to be working; but isn't
- ci-test
- core: Issues that should be addressed in Ray Core
- flaky-tracker: Issue created via Flaky Test Tracker https://flaky-tests.ray.io/
- P2: Important issue, but not time-critical
- ray-test-bot: Issues managed by OSS test policy
- stability
- windows

Comments

@can-anyscale added the bug, ci-test, core, flaky-tracker, ray-test-bot, stability, triage, and weekly-release-blocker labels and removed the weekly-release-blocker label on Mar 9, 2024
@jjyao added the P2 and windows labels and removed the triage label on Mar 11, 2024
@anyscalesam added the P1 label and removed the P2 label on Mar 13, 2024
@anyscalesam added the P2 label and removed the P1 label on Mar 14, 2024
@anyscalesam
Contributor

P2, as it doesn't block the weekly or the release for ray210; @mattip, please continue prioritizing.

@can-anyscale can-anyscale reopened this Mar 21, 2024
@can-anyscale changed the title from "CI test windows://python/ray/tests:test_actor_retry is flaky" to "CI test windows://python/ray/tests:test_actor_retry is consistently_failing" on Jun 14, 2024
@can-anyscale can-anyscale reopened this Jun 14, 2024
@can-anyscale
Collaborator Author

This test is now considered flaky because it has been failing on postmerge for too long. Flaky tests do not run on premerge.

@can-anyscale
Collaborator Author

CI test windows://python/ray/tests:test_actor_retry is flaky. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/5979#01917bc4-e34d-41a5-a162-7534e8c859b8
- https://buildkite.com/ray-project/postmerge/builds/5977#01917bae-bb5c-4412-9383-b4768f0e3da9
- https://buildkite.com/ray-project/postmerge/builds/5967#0191787c-409a-4537-907d-1a241a057aff

DataCaseName-windows://python/ray/tests:test_actor_retry-END
Managed by OSS Test Policy

@can-anyscale
Collaborator Author

CI test windows://python/ray/tests:test_actor_retry is flaky. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/6004#019182c8-b4e8-4b23-8022-581199b8aa3a
- https://buildkite.com/ray-project/postmerge/builds/5990#01917f6d-d651-4cf1-91f4-853bddd9cb5a
- https://buildkite.com/ray-project/postmerge/builds/5986#01917e25-44b2-4925-bb2f-1a9fec1c8e8b

DataCaseName-windows://python/ray/tests:test_actor_retry-END
Managed by OSS Test Policy

@can-anyscale
Collaborator Author

CI test windows://python/ray/tests:test_actor_retry is flaky. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/6022#01919388-7046-4102-a8d5-88d49afec6aa
- https://buildkite.com/ray-project/postmerge/builds/6022#01919286-bc62-45a6-9d01-7727a99ea365
- https://buildkite.com/ray-project/postmerge/builds/6020#0191923d-4946-4eb3-afb7-ba5a07c893d4

DataCaseName-windows://python/ray/tests:test_actor_retry-END
Managed by OSS Test Policy

@can-anyscale
Collaborator Author

CI test windows://python/ray/tests:test_actor_retry is flaky. Recent failures:
- https://buildkite.com/ray-project/postmerge/builds/6053#01919c88-eb1f-49a8-bfa4-84f6f544c90c
- https://buildkite.com/ray-project/postmerge/builds/6048#01919b48-d02f-45a8-b490-a4afd4662ee7
- https://buildkite.com/ray-project/postmerge/builds/6048#01919a9b-f9de-4f9e-8616-51ebd67cefd6

DataCaseName-windows://python/ray/tests:test_actor_retry-END
Managed by OSS Test Policy

aslonnie pushed a commit that referenced this issue Aug 29, 2024
The `test_actor_retry` tests are failing/flaky on Windows. They pass locally. I have not been able to access the CI logs to see what is going wrong. To narrow down the problem (is it an overall timeout? Is one of the tests failing?), we can start by splitting the tests into two files.

Toward solving #43845.
ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this issue Oct 12, 2024
…7188)

The `test_actor_retry` tests are failing/flaky on Windows. They pass locally. I have not been able to access the CI logs to see what is going wrong. To narrow down the problem (is it an overall timeout? Is one of the tests failing?), we can start by splitting the tests into two files.

Toward solving ray-project#43845.

Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>
ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this issue Oct 15, 2024
…7188)

The `test_actor_retry` tests are failing/flaky on Windows. They pass locally. I have not been able to access the CI logs to see what is going wrong. To narrow down the problem (is it an overall timeout? Is one of the tests failing?), we can start by splitting the tests into two files.

Toward solving ray-project#43845.

Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>
@jjyao added the P2 label and removed the P1 label on Oct 30, 2024
@mattip removed their assignment on Dec 31, 2024
4 participants