Fix ExecuTorch CI after landing #6564 #139700

huydhn · 2024-11-05T01:57:16Z

After landing pytorch/executorch#6564, we need to update the pinned ExecuTorch commit on PyTorch to fix the regression on the PyTorch side. The change to .ci/docker/common/install_executorch.sh is needed because that is how the dependencies are now set up on ExecuTorch CI.

pytorch-bot · 2024-11-05T01:57:20Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/139700

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 9f94cea with merge base c92de3b:

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

unstable / linux-jammy-py3-clang12-executorch / test (executorch, 1, 1, lf.linux.2xlarge) (gh)
kernels/quantized/test/test_out_variants.py::TestOutVariants::test_quantize_per_tensor_to_out_variant

This comment was automatically generated by Dr. CI and updates every 15 minutes.

huydhn · 2024-11-05T04:50:48Z

@guangy10 @larryliu0820 After pytorch/executorch#6564, we need this change to fix the regression on the PyTorch side. However, I think that during the gap when PyTorch wasn't installed correctly, some changes have already landed that don't work with ExecuTorch yet. Specifically, I'm referring to this failure: https://github.com/pytorch/pytorch/actions/runs/11676105984/job/32512808907?pr=139700#step:22:7900

ERROR examples/models/llama3_2_vision/preprocess/test_preprocess.py
FAILED kernels/quantized/test/test_out_variants.py::TestOutVariants::test_add_to_out_variant - AttributeError: The underlying op of 'quantized_decomposed.add' has no overload name 'out'
FAILED kernels/quantized/test/test_out_variants.py::TestOutVariants::test_choose_qparams_tensor_to_out_variant - AttributeError: The underlying op of 'quantized_decomposed.choose_qparams' has no overload name 'Tensor_out'
FAILED kernels/quantized/test/test_out_variants.py::TestOutVariants::test_dequantize_per_channel_to_out_variant - AttributeError: The underlying op of 'quantized_decomposed.dequantize_per_channel' has no overload name 'out'
FAILED kernels/quantized/test/test_out_variants.py::TestOutVariants::test_dequantize_per_tensor_tensor_to_out_variant - AttributeError: The underlying op of 'quantized_decomposed.dequantize_per_tensor' has no overload name 'Tensor_out'
FAILED kernels/quantized/test/test_out_variants.py::TestOutVariants::test_dequantize_per_tensor_to_out_variant - AttributeError: The underlying op of 'quantized_decomposed.dequantize_per_tensor' has no overload name 'out'
FAILED kernels/quantized/test/test_out_variants.py::TestOutVariants::test_mixed_linear_to_out_variant - AttributeError: The underlying op of 'quantized_decomposed.mixed_linear' has no overload name 'out'
FAILED kernels/quantized/test/test_out_variants.py::TestOutVariants::test_mixed_mm_to_out_variant - AttributeError: The underlying op of 'quantized_decomposed.mixed_mm' has no overload name 'out'
FAILED kernels/quantized/test/test_out_variants.py::TestOutVariants::test_quantize_per_channel_to_out_variant - AttributeError: The underlying op of 'quantized_decomposed.quantize_per_channel' has no overload name 'out'
FAILED kernels/quantized/test/test_out_variants.py::TestOutVariants::test_quantize_per_tensor_tensor_to_out_variant - AttributeError: The underlying op of 'quantized_decomposed.quantize_per_tensor' has no overload name 'Tensor_out'
FAILED kernels/quantized/test/test_out_variants.py::TestOutVariants::test_quantize_per_tensor_to_out_variant - AttributeError: The underlying op of 'quantized_decomposed.quantize_per_tensor' has no overload name 'out'
FAILED extension/pybindings/test/test_pybindings.py::PybindingsTest::test - RuntimeError: Missing out variants: {'quantized_decomposed::dequantize_per_tensor', 'quantized_decomposed::add', 'quantized_decomposed::quantize_per_tensor'}

Is there a way to ignore these failures so this change can land, then follow up with proper fixes later? I want to land this change as early as possible so that no further incompatible changes slip through.

huydhn · 2024-11-06T17:56:52Z

Chatted with @larryliu0820: let's land this and mark the job as unstable pending a forward fix.

huydhn · 2024-11-06T23:03:03Z

@pytorchbot merge -f 'Land this first and will follow up with ExecuTorch team later on the fix'

pytorchmergebot · 2024-11-06T23:04:27Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

huydhn · 2024-11-06T23:46:00Z

@larryliu0820 I think I understand what's happening now. The failing test, examples/models/llama3_2_vision/preprocess/test_preprocess.py, requires torchtune and torchao and fails without them. That makes sense, since the quantized ops come from torchao. From what I can see, neither repo is currently included in https://github.com/.../blob/main/install_requirements.py, so I assume they are optional dependencies. Given that, I think it's better to skip the test when torchtune and ao are not available and run it only when they are installed. We need that flexibility when running ET tests on PT CI.

pytest -v examples/models/llama3_2_vision/preprocess/test_preprocess.py works locally for me when I have them installed.
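A minimal sketch of the skip-when-missing idea described above, assuming the optional packages import as `torchtune` and `torchao`. The helper `has_modules` and the class `TestPreprocessSketch` are hypothetical names used only for illustration; they are not taken from the ExecuTorch test file, which may be structured differently.

```python
import importlib.util
import unittest


def has_modules(*names: str) -> bool:
    """Return True only if every named top-level module can be located."""
    return all(importlib.util.find_spec(name) is not None for name in names)


# Assumption: the optional dependencies import as `torchtune` and `torchao`.
HAS_OPTIONAL_DEPS = has_modules("torchtune", "torchao")


@unittest.skipUnless(HAS_OPTIONAL_DEPS, "requires torchtune and torchao")
class TestPreprocessSketch(unittest.TestCase):
    """Hypothetical stand-in for a test class that needs the optional deps."""

    def test_optional_deps_importable(self) -> None:
        # Import inside the test so that collection never fails when the
        # packages are absent; the decorator above skips the whole class.
        import torchao  # noqa: F401
        import torchtune  # noqa: F401
```

With a guard like this, the class is reported as skipped rather than as an error on a CI image that lacks the optional packages, and it runs normally once they are installed.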

After landing pytorch/executorch#6564, we need to update the pinned ExecuTorch commit on PyTorch to fix the regression on the PyTorch side. The change to `.ci/docker/common/install_executorch.sh` is needed because that is how the dependencies are now set up on ExecuTorch CI. Pull Request resolved: pytorch#139700 Approved by: https://github.com/larryliu0820, https://github.com/malfet

Fix ExecuTorch CI after landing pytorch#6564

c902668

huydhn added the test-config/executorch label Nov 5, 2024

huydhn requested review from larryliu0820 and guangy10 November 5, 2024 01:57

huydhn requested a review from jeffdaily as a code owner November 5, 2024 01:57

pytorch-bot bot added ciflow/inductor topic: not user facing topic category labels Nov 5, 2024

huydhn added the test-config/default label Nov 5, 2024

Keep executorch job as unstable

4f39888

huydhn requested a review from a team as a code owner November 6, 2024 17:56

huydhn added the ciflow/unstable Run all experimental or flaky jobs on PyTorch unstable workflow label Nov 6, 2024

larryliu0820 approved these changes Nov 6, 2024

View reviewed changes

malfet approved these changes Nov 6, 2024

View reviewed changes

Forgot about get-label-type

9f94cea

pytorchmergebot added merging Merged labels Nov 6, 2024

pytorchmergebot closed this in ed16f28 Nov 6, 2024

pytorchmergebot removed the merging label Nov 6, 2024

huydhn mentioned this pull request Nov 7, 2024

Install torchtune and ao when testing ExecuTorch llama3 #139947

Closed
