LLVM and SPIRV-LLVM-Translator pulldown (WW52 2024) #16484

iclsrc · 2024-12-27T04:44:44Z

LLVM: llvm/llvm-project@2fe30bc
SPIRV-LLVM-Translator: KhronosGroup/SPIRV-LLVM-Translator@b8b1b96

For whatever reason, each ctype test contains its own copy of some identical helper source code. These local helpers were defined with external linkage for no apparent reason. This leads to multiple definition errors when linking these tests together. This change moves each file's local helper code into an anonymous namespace so it has internal linkage. It's notable that the libc test code does not follow the most common norm of gtest-style code where all the `TEST(...)` cases themselves are defined inside an anonymous namespace (along with whatever other local helpers they use); whether libc's tests should follow that usual convention can be addressed holistically in future discussion. The replacement of numerous cut&paste'd copies of identical helper code with sharing the source code in some usual fashion is also left for later cleanup. This change only makes the test code not straightforwardly have multiple definition errors that prevent linking a test executable at all.
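A minimal sketch of the linkage fix described above. The helper name `in_span` and the wrapper `is_ascii_digit` are hypothetical and not taken from the libc tests; the sketch only illustrates that a helper placed in an anonymous namespace has internal linkage, so identical copies in several test files should no longer collide at link time.

```cpp
#include <cassert>

// Hypothetical stand-in for a helper that each ctype test file copies.
// Inside an anonymous namespace it has internal linkage, so another
// translation unit can define an identical in_span without causing a
// multiple-definition error when the tests are linked together.
namespace {

bool in_span(int ch, int lo, int hi) { return lo <= ch && ch <= hi; }

} // namespace

// This function keeps external linkage, much like the TEST(...) cases
// that the commit message says are still defined outside the namespace.
bool is_ascii_digit(int ch) { return in_span(ch, '0', '9'); }
```

Calling `is_ascii_digit('7')` from a test body uses the file-local helper; a second file with its own identical `in_span` in an anonymous namespace would link alongside this one.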

This was added in #117573 but the options were not being rendered correctly due to the missing newline after `::`.

…#119252) Reverts llvm/llvm-project#112277 This broke something on Fuchsia's Mac builders, so there's still something in the CMake that needs to be updated before we reland. Failed build: https://ci.chromium.org/ui/p/fuchsia/builders/toolchain.ci/clang-mac-xarm64/b8729005878443108801/overview

… (#117195)" (#119247) The previous patch https://github.com/llvm/llvm-project/pull/116860/files#diff-e7e06355c973f68f900d2a34a4103dbfa022589c55c59d02870da9365acf7b98L651 seems to have mistakenly overwritten true16 test lines, i.e. `v_fmaak_f16 v5.l, v1.l, v2.l, 0xfe0b` became `v_fmaak_f16 v5, v1, v2, 0xfe0b`. The plan is to revert patches llvm/llvm-project#117195 and llvm/llvm-project#116860 and redo both. This is the revert of patch 117195. The revert of 116860 will be in a separate patch.

Approximates the shadow propagation via OR'ing. Updates the neon_vmul.ll test introduced in llvm/llvm-project#117935
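The OR-based approximation can be illustrated on plain arrays. This is a sketch only: it assumes one shadow word per vector lane in which a set bit marks an uninitialized bit, and `Shadow4` and `propagate_mul_shadow` are hypothetical names. The actual MSan instrumentation operates on LLVM IR and is not reproduced here.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical model: one 32-bit shadow word per lane of a 4-lane vector.
using Shadow4 = std::array<std::uint32_t, 4>;

// Approximate the shadow of a lane-wise multiply by OR'ing the operand
// shadows: a result bit is treated as uninitialized if the corresponding
// bit of either operand is uninitialized.
Shadow4 propagate_mul_shadow(const Shadow4 &a, const Shadow4 &b) {
  Shadow4 out{};
  for (std::size_t i = 0; i < out.size(); ++i)
    out[i] = a[i] | b[i];
  return out;
}
```

The OR is an approximation because multiplication mixes bits across positions, while this model only tracks the bit positions that were poisoned in the inputs.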

…… (#119253) The previous patch https://github.com/llvm/llvm-project/pull/116860/files#diff-e7e06355c973f68f900d2a34a4103dbfa022589c55c59d02870da9365acf7b98L651 seems to have mistakenly overwritten true16 test lines, i.e. `v_fmaak_f16 v5.l, v1.l, v2.l, 0xfe0b` became `v_fmaak_f16 v5, v1, v2, 0xfe0b`. The plan is to revert patches llvm/llvm-project#117195 and llvm/llvm-project#116860 and redo both. This is the revert of patch 116860.

@bogner

…9202) This PR improves the general validity of emitted code between passes by generating `TargetOpcode::PHI` instead of `SPIRV::OpPhi` after Instruction Selection, fixing the generation of OpTypePointer instructions, and using proper virtual register classes. Using `TargetOpcode::PHI` instead of `SPIRV::OpPhi` after Instruction Selection has the benefit of supporting existing optimization passes immediately, as an alternative to disabling the passes that use `MI.isPHI()`. This PR thus makes it possible to revert the actions of llvm/llvm-project#116060 and go back to using the `MachineSink` pass. This PR is a solution to the problem discussed in detail in llvm/llvm-project#110507. It follows the advice from code reviewers of PR #110507 to postpone generation of OpPhi rather than to patch CodeGen. This solution unblocks improvements with respect to expensive checks and keeps them independent of the general discussion about OpPhi vs. G_PHI/PHI. This PR contains numerous small fixes to the validity of emitted code that substantially improve the pass rate with expensive checks. Namely, the test suite with expensive checks set ON now has only 12 failures out of 569 total test cases. FYI @bogner

FWICT, these were the newly added headers for c11.

The functions are not relevant for most sanitizers and only required for MSan to see which regions have been written to. This eliminates a link dependency for all other sanitizers and fixes #59007: while `-lresolv` had been added for the static runtime in 6dce56b, it wasn't added to the shared runtimes. Instead of just moving the interceptors, we adapt them to MSan conventions:
* We don't skip intercepting when `msan_init_is_running` is true, but directly call ENSURE_MSAN_INITED() like most other interceptors. It seems unlikely that these functions are called during initialization.
* We don't unpoison `errno`, because none of the functions is specified to use it.

We do not have CI coverage for Windows/macOS, and we regularly run into problems where changes break the post-commit fullbuild, which is not tested in pre-commit builds. This PR uses GitHub Actions to address such issues.

…nc and gpu.func (#119034) Use `pm.nest` to schedule the pass on nested `func.func` and `gpu.func` in the `gpu.module`. AbstractResult pass is not meant to run on the whole gpu.module at once.

Reverts revert #118517 after (hopefully) fixing builders (llvm/llvm-zorg#328, llvm/llvm-zorg#327) This reverts commit 61bf308.

…13474) Update VOP3dot instructions with true16 and fake16 formats. This patch includes instructions: v_dot2_f16_f16 v_dot2_bf16_bf16

…9263) This is an NFC change. The previous patch llvm/llvm-project#116860 has an issue and is reverted in llvm/llvm-project#119253. Redo the patch here.

Lowering `math.powf` to `llvm.intr.powf` results in `pow(x, 0) = 1`, even for `x = 0`. With the Math dialect expansion patterns, however, `pow(0, 0)` results in `-nan`. This change adds two additional instructions to the lowering to ensure that the `pow(x, 0)` case lowers to `1` regardless of the value of `x`. Resolves llvm/llvm-project#118945.
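The `pow(0, 0)` problem can be reproduced in plain C++. This is a sketch under stated assumptions: `expanded_pow` models the expansion as `exp(y * log(x))`, which may not match the exact Math dialect pattern, and `guarded_pow` assumes the two additional instructions are a comparison and a select. Both function names are hypothetical.

```cpp
#include <cassert>
#include <cmath>

// Assumed model of the expansion: pow(x, y) = exp(y * log(x)).
// For x = 0 and y = 0, log(0) is -inf and 0 * -inf is NaN,
// so the expanded form yields NaN instead of 1.
double expanded_pow(double x, double y) { return std::exp(y * std::log(x)); }

// Guarded variant: one comparison and one select (an assumption about
// the shape of the fix) force pow(x, 0) to 1 for every x.
double guarded_pow(double x, double y) {
  double expanded = expanded_pow(x, y);
  return y == 0.0 ? 1.0 : expanded;
}
```

The guard leaves non-zero exponents untouched, so the expansion is still used for every case other than `y == 0`.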

Move code to prepare the VPlan for the epilogue vector loop to a helper to reduce size and complexity of processLoop.

Looks like we were missing docs for:
- float.h
- wchar.h
- wctype.h

Which AFAICT were added in ISO C99.

Once again, this is a clause on a combined construct that does almost exactly what the loop/compute construct version does, only with some slightly different evaluation rules/sema rules, as it doesn't have to consider the parent, just the 'combined' construct. The two sets of rules for reduction on loop and compute are fine together, so this ensures they are all enforced for this too. The 'gangs' 'num_gangs' 'reduction' diagnostic (Dim>1) had to be applied to num_gangs as well, as it previously wasn't permissible to get in this situation, but we now can.

Directly check VectorizingEpilogue which directly indicates that the epilogue is vectorized.

Our byval call lowering isn't copying the argument. Looks like our SelectionDAG code for byval is different than AArch64 so this may be non-trivial to fix. Reject for now.

This is an NFC change. The previous patch llvm/llvm-project#117195 has an issue and is reverted in llvm/llvm-project#119247. Redo the patch here.

This is an NFC change. Update the MC test for v_add/sub_f16 in true16 format. The MC source change was done by a previous patch and automatically enabled by the t16 pseudo.

Fix 64-bit PowerPC part of llvm/llvm-project#102783.

Update the code that creates induction resume PHIs to also create a resume PHI for the canonical induction during epilogue vectorization. This unifies the code for handling induction resume values and removes the need to manually create a resume PHI and return it during epilogue creation. Overall, it helps to move the code for updating the canonical induction resume value to the place where all other header PHI resume values are updated. This is NFC, modulo the order of the created PHIs.

The acc data clause operations hold an operand named `varPtr`. This was intended to hold a pointer to a variable - where the element type of that pointer specifies the type of the variable. However, for both memref and llvm dialects, this assumption is not true. This is because memref element type for cases like memref<10xf32> is simply f32 and for LLVM, after opaque pointers, the variable type is no longer recoverable. Thus, introduce varType to ensure that appropriate semantics are kept. Both the parser and printer for this new type attribute allow it to not be specified in cases where a dialect's getElementType() applied to `varPtr`'s type has a recoverable type. And more specifically, for FIR, no changes are needed in the MLIR unit tests.

…tructuredBuffer` (#118536) The methods are using existing clang builtins `__builtin_hlsl_buffer_update_counter` and `__builtin_hlsl_resource_getpointer` to update the buffer counter and then load or store the value. Fixes #112968

We have users that target baremetal aarch64.

jsji · 2024-12-27T17:54:37Z

This is ready for review.

Update test after 7954a05 @intel/dpcpp-cfe-reviewers
[SYCL][E2E] Remove unused -fno-sycl-dead-args-optimization @intel/dpcpp-cfe-reviewers
Regen group_load/store after 7954a05 (https://github.com/intel/llvm/pull/16310)
[SYCL][E2E] Update CHECKs to reflect changes to __builtin_COLUMN
These are cherry-picks only and were already reviewed before.

bader · 2024-12-27T18:10:47Z

sycl/test-e2e/Config/kernel_from_file.cpp

@@ -14,7 +14,7 @@
 // RUN: %if linux %{ llvm-link -o=%t_app.bc %t.bc %t_compiler_wrappers.bc %t_asan.bc %} %else %{ llvm-link -o=%t_app.bc %t.bc %t_compiler_wrappers.bc %}
 // >> ---- translate to SPIR-V
 // RUN: llvm-spirv -o %t.spv %t_app.bc
-// RUN: %clangxx -Wno-error=ignored-attributes %sycl_include -DSYCL_DISABLE_FALLBACK_ASSERT %cxx_std_optionc++17 %include_option %t.h %s -o %t.out %sycl_options -fno-sycl-dead-args-optimization -Xclang -verify-ignore-unexpected=note,warning


What do you mean "unused"? According to the comment at line 6, it's required for this test.

I guess it's required for the device compilation because the host part is compiled in non-SYCL mode. -fno-sycl-dead-args-optimization is ignored in non-SYCL mode.
Did I get it right? If so, please update the test comment and commit message.

Updated commit message with test failures. The option is still used in line 9 after the comments, so I don't think we need to update the comments.

bader · 2024-12-27T18:27:16Z

LLVM: llvm/llvm-project@2fe30bc

@jsji, this patch was committed 17 days ago. Do you know the reason for the delay with pulling llvm-project commits?

Fix failures in https://github.com/intel/llvm/actions/runs/12517379530/job/34918397208

```
/__w/llvm/llvm/toolchain/bin//clang++ -Werror -Wno-error=ignored-attributes -isystem /__w/llvm/llvm/toolchain/include -DSYCL_DISABLE_FALLBACK_ASSERT -std=c++17 -include /__w/llvm/llvm/build-e2e/Config/Output/kernel_from_file.cpp.tmp.h /__w/llvm/llvm/llvm/sycl/test-e2e/Config/kernel_from_file.cpp -o /__w/llvm/llvm/build-e2e/Config/Output/kernel_from_file.cpp.tmp.out -lsycl -I/__w/llvm/llvm/toolchain/include -I/__w/llvm/llvm/toolchain/include/sycl -L/__w/llvm/llvm/toolchain/lib -fno-sycl-dead-args-optimization -Xclang -verify-ignore-unexpected=note,warning
clang++: error: argument unused during compilation: '-fno-sycl-dead-args-optimization' [-Werror,-Wunused-command-line-argument]
```

jsji · 2024-12-30T14:34:10Z

@intel/llvm-gatekeepers This is ready for merge, can someone help issue a '/merge'. Thanks!

dm-vodopyanov · 2024-12-30T15:07:43Z

I can type '/merge' but not all checks were completed successfully.
@jsji, @intel/dpcpp-devops-reviewers,
IGC DEV CI Containers / Build and Push IGC Dev Docker Images (Intel Drivers Ubuntu 24.04 Docker image with dev IGC, ubunt... (pull_request) failed because /usr/local/lib/libopencl-clang.so.14* is missing. Is it safe to merge, or should the issue be addressed before merging?

sarnex · 2024-12-30T15:11:57Z

We can ignore the dev igc issue, the current upstream IGC repo has been in a bad state for a long time.

sarnex · 2024-12-30T15:12:03Z

/merge

bb-sycl · 2024-12-30T15:12:28Z

Mon 30 Dec 2024 03:12:27 PM UTC --- Start to merge the commit into sycl branch. It will take several minutes.

bb-sycl · 2024-12-30T15:12:29Z

Mon 30 Dec 2024 03:12:28 PM UTC --- Merge failed with error: Please check whether the PR is mergeable

jsji · 2024-12-30T15:35:16Z

Mon 30 Dec 2024 03:12:28 PM UTC --- Merge failed with error: Please check whether the PR is mergeable

@DoyleLi The bot is failing AGAIN... Can you check what went wrong this time. Thanks.

frobtech and others added 30 commits December 9, 2024 11:18

[clang][docs] fix rendering of $-prefixed options (#119249)

511e84f

This was added in #117573 but the options were not being rendered correctly due to the missing newline after `::`.

[msan] Support NEON vector multiplication instructions (#117944)

3b74abd

Approximates the shadow propagation via OR'ing. Updates the neon_vmul.ll test introduced in llvm/llvm-project#117935

[libc][docs] add c11 threads and uchar (#119250)

a4e2927

FWICT, these were the newly added headers for c11.

[TargetLowering] Return Align from getByValTypeAlignment (NFC) (#119233)

e55c167

[libc] add multi-platform pre-commit github actions (#119104)

f15cc6f

We do not have CI coverage for Windows/macOS, and we regularly run into problems where changes break the post-commit fullbuild, which is not tested in pre-commit builds. This PR uses GitHub Actions to address such issues.

[flang][cuda] Change how abstract result pass is scheduled on func.fu…

1d4b5c1

…nc and gpu.func (#119034) Use `pm.nest` to schedule the pass on nested `func.func` and `gpu.func` in the `gpu.module`. AbstractResult pass is not meant to run on the whole gpu.module at once.

Revert "Revert "[mlir python] Add nanobind support (#119232)

392622d

Reverts revert #118517 after (hopefully) fixing builders (llvm/llvm-zorg#328, llvm/llvm-zorg#327) This reverts commit 61bf308.

[AMDGPU][MC][True16] VOP3dot instruction update for true16/fake16 (#1…

b9b46de

…13474) Update VOP3dot instructions with true16 and fake16 formats. This patch includes instructions: v_dot2_f16_f16 v_dot2_bf16_bf16

[AMDGPU][True16][MC] redo update vop2 mc test with update script (#11…

8471541

…9263) This is an NFC change. The previous patch llvm/llvm-project#116860 has an issue and is reverted in llvm/llvm-project#119253. Redo the patch here.

[LV] Move code to prepare VPlan for epilogue vector loop to helper (NFC)

4fd8dbc

Move code to prepare the VPlan for the epilogue vector loop to a helper to reduce size and complexity of processLoop.

[libc][docs] add missing c99 docs (#119239)

429f0f1

Looks like we were missing docs for:
- float.h
- wchar.h
- wctype.h

Which AFAICT were added in ISO C99.

[flang] Lower CSHIFT to hlfir.cshift operation. (#118917)

44cd8f0

[VPlan] Directly check VectorizingEpilogue in ::executePlan (NFC).

adfe54f

Directly check VectorizingEpilogue which directly indicates that the epilogue is vectorized.

[RISCV][GISel] Fallback in LowerCall for byval arguments. (#119251)

82f4ebf

Our byval call lowering isn't copying the argument. Looks like our SelectionDAG code for byval is different than AArch64 so this may be non-trivial to fix. Reject for now.

[AMDGPU][True16][MC] redo "remove duplication in VOP2 test" (#119274)

342fa15

This is an NFC change. The previous patch llvm/llvm-project#117195 has an issue and is reverted in llvm/llvm-project#119247. Redo the patch here.

[AMDGPU][True16][MC] test update for v_add/sub_f16 in true16 (#118926)

cbed714

This is an NFC change. Update the MC test for v_add/sub_f16 in true16 format. The MC source change was done by a previous patch and automatically enabled by the t16 pseudo.

[PowerPC] Update data layout aligment of i128 to 16 (#118004)

a13ec9c

Fix 64-bit PowerPC part of llvm/llvm-project#102783.

[Clang][CodeGen] Remove extraneous dot prefixes [NFC] (#119275)

d74c73f

[libc] Support baremetal libc on aarch64 (#118691)

2e8ce30

We have users that target baremetal aarch64.

jsji had a problem deploying to WindowsCILock December 27, 2024 15:27 — with GitHub Actions Error

jsji self-assigned this Dec 27, 2024

jsji had a problem deploying to WindowsCILock December 27, 2024 15:27 — with GitHub Actions Error

jsji temporarily deployed to WindowsCILock December 27, 2024 15:52 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock December 27, 2024 15:53 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock December 27, 2024 17:03 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock December 27, 2024 17:36 — with GitHub Actions Inactive

bader reviewed Dec 27, 2024

jsji force-pushed the llvmspirv_pulldown branch from 3fee707 to e5c326e Compare December 27, 2024 19:15

jsji had a problem deploying to WindowsCILock December 27, 2024 19:15 — with GitHub Actions Error

jsji had a problem deploying to WindowsCILock December 27, 2024 19:16 — with GitHub Actions Error

jsji force-pushed the llvmspirv_pulldown branch from e5c326e to f20908c Compare December 27, 2024 19:26

jsji temporarily deployed to WindowsCILock December 27, 2024 19:27 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock December 27, 2024 19:28 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock December 27, 2024 20:22 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock December 27, 2024 20:44 — with GitHub Actions Inactive

Fznamznon approved these changes Dec 30, 2024

bb-sycl approved these changes Dec 30, 2024

sarnex merged commit 31a339e into sycl Dec 30, 2024
37 of 38 checks passed

jsji deleted the llvmspirv_pulldown branch December 30, 2024 15:35
