LLVM and SPIRV-LLVM-Translator pulldown (WW35) #4388

vmaksimo · 2021-08-23T07:57:10Z

LLVM: llvm/llvm-project@955b91c19c00
SPIRV-LLVM-Translator: KhronosGroup/SPIRV-LLVM-Translator@e9671a5

Require debug build for CodeGen/X86/fsafdo_test2.ll since it checks for messages only printed in debug mode. Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D108364

This was probably bugging more than is reasonable, but it makes merging changes in this file slightly less annoying to have the trailing comma here. I only noticed this because Rust is currently carrying a patch to this file and it kept making life a little difficult.

…-it-will-be-removed-from-clang Android enables zero initialisation globally by default, but also allows subprojects to override with different option. Clang complains the above flag being unused in this case. Instead of adding a 75 char long -no-* flag, don't warn unused argument for this flag. Differential Revision: https://reviews.llvm.org/D108278

This allows the instruction selector to realize that it can directly broadcast the low byte of the memset value, rather than replicating it to a 64-bit GPR before broadcasting. This fixes PR50985. Differential Revision: https://reviews.llvm.org/D108354

Reviewed By: #opaque-pointers, dblaikie Differential Revision: https://reviews.llvm.org/D105711

…oops

Folding a GEP from outside to inside a loop will materialize an add where there wasn't an equivalent operation before. Check the containing loops before making this fold. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D107935

In particular we were dropping volatility, which can lead to unwanted transformations.

Differential Revision: https://reviews.llvm.org/D108225

We still need to tag the llvm.isnan.? intrinsic as vectorizable

__split_buffer_common was entirely unused, and __deque_base_common was unused except for two calls to __throw_out_of_range(), which have been inlined. The usual intent of the __xxx_base_common base classes is to localize where the exception-throwing code is instantiated, however that wasn't the case here because we never explicitly instantiated those base classes in the shared library, unlike what we do for basic_string and vector. Differential Revision: https://reviews.llvm.org/D108384

This is based on the work done to add strtoll and the other strto functions. The atoi functions also were added to stdc and entrypoints.txt. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D108330

In 94d0914, I added support for unrolling of multiple exit loops which have multiple exits reaching the latch. Per reports on the review post commit, I'd missed updating the domtree for one case. This fix addresses that ommission. There's no new test as this is covered by existing tests with expensive verification turned on.

…dditional attributes. Differential Revision: https://reviews.llvm.org/D108338

As reported on https://bugs.llvm.org/show_bug.cgi?id=51020, the guard widening pass doesn't preserve MemorySSA, so it can no longer be scheduled in the same loop pass manager as LICM. However, the loop-schedule.ll test indicates that this is supposed to work. Fix this by preserving MemorySSA if available, as this seems to be trivial in this case (we only need to drop the memory access for the removed guards). Differential Revision: https://reviews.llvm.org/D108386

Alias analysis is unable to disambiguate accesses to the structure fields without it unlike distinct variables. As a result we cannot combine ds_read and ds_write operations in a case of any store in between which always considered clobbering. Differential Revision: https://reviews.llvm.org/D108315

This patch extends the runtime unrolling infrastructure to support unrolling a loop with multiple exiting blocks branching to the same exit block used by the latch. It intentionally does not include a cost model change to enable this functionality unless appropriate force flags are used. This is the prolog companion to D107381. Since this was LGTMed, a problem with DT updating was reported against that patch. I roled in the analogous fix here as it seemed obvious, and not worth re-review. As an aside, our prolog form leaves a lot of potential value on the floor when there is an invariant load or invariant condition in the loop being runtime unrolled. We should probably consider a "required prolog" heuristic. (Alternatively, maybe we should be peeling these cases more aggressively?) Differential Revision: https://reviews.llvm.org/D108262

This patch handles the return key for compound fields like lists and mapping fields. The return key, if not handled by the field will select the next primary element, skipping secondary elements like remove buttons and the like. Differential Revision: https://reviews.llvm.org/D108331

…pecify additional attributes." This reverts commit 95ddc83. Differential Revision: https://reviews.llvm.org/D108396

With unquoted ${CMAKE_CXX_FLAGS}, the REGEX fails when it's empty: ```CMake Error at lib/scudo/standalone/CMakeLists.txt:14 (string): string sub-command REGEX, mode REPLACE needs at least 6 arguments total to command.```

The default legalization of unsupported vector types is to promote the integers in each lane, which leads to extra sign or zero extending and masking when moving data into and out of vectors. Switch our preferred type legalization from the default to vector widening, which keeps the data in the low lanes of the vector rather than in the low bits of each lane. The unused high lanes can be ignored. Half-wide vectors are now loaded from memory into the low 64 bits of the v128 rather than spread out among the lanes. As a result, v128.load64_splat is a much more common operation, so add new patterns to support it. Differential Revision: https://reviews.llvm.org/D107502

vmaksimo · 2021-08-26T09:06:32Z

/merge

Fznamznon · 2021-08-26T09:06:40Z

FYI KhronosGroup/SPIRV-LLVM-Translator#1176 was merged to the translator with a slight change.

bb-sycl · 2021-08-26T09:06:58Z

Thu Aug 26 09:06:57 UTC 2021 --- Merge failed with error: PR is not clean for merge. Please examine approval status or check status before merge.

bader · 2021-08-26T09:10:45Z

FYI KhronosGroup/SPIRV-LLVM-Translator#1176 was merged to the translator with a slight change.

@vmaksimo, please, replace your version with the version merged to Khronos repository.

Signed-off-by: Dmitry Sidorov <dmitry.sidorov@intel.com> Original commit: KhronosGroup/SPIRV-LLVM-Translator@81ebabd

The previous patch has fixed only case with 1-element vectors, however after closer look at the spec it became clear that SPV_INTEL_vector_compute actually allows any number of vector elements (capability VectorAnyINTEL), so this is the proper fix. This is a follow-up change for b431cc8. Original commit: KhronosGroup/SPIRV-LLVM-Translator@e9671a5

CUDA support will be added later in a separate PR.

vmaksimo · 2021-08-31T14:00:09Z

/merge

bb-sycl · 2021-08-31T14:00:37Z

Tue Aug 31 14:00:37 UTC 2021 --- Merge failed with error: PR is not clean for merge. Please examine approval status or check status before merge.

RKSimon and others added 30 commits August 19, 2021 16:48

Fix empty paragraph passed to parameter Wdocumentation warning. NFC.

ff69c65

Fix CodeGen/X86/fsafdo_test2.ll fail in release

9d476f0

Require debug build for CodeGen/X86/fsafdo_test2.ll since it checks for messages only printed in debug mode. Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D108364

Fix unknown parameter Wdocumentation warnings. NFC.

94e1442

[OpaquePtr][Inline] Use byval type instead of pointee type

33d44b7

Reviewed By: #opaque-pointers, dblaikie Differential Revision: https://reviews.llvm.org/D105711

[NFC][InstCombine] Add test for one-use one-index geps in different l…

0f09056

…oops

[CostModel][X86] Add isnan half/float/double costs tests

72ebcd3

AArch64: copy all parts of the mem operand across when combining a store

edab411

In particular we were dropping volatility, which can lead to unwanted transformations.

[libomptarget][nfc] Move lanemask_t type into target_impl.h

6c75ce1

[libc] Add a trivial implementation for bcmp

c8f7989

Differential Revision: https://reviews.llvm.org/D108225

[SLP][X86] Regenerate intrinsic.ll test checks

26ed14f

[SLP][X86] Add llvm.isnan intrinsic test coverage

5fa6039

We still need to tag the llvm.isnan.? intrinsic as vectorizable

[libc] add atoi, atol, and atoll

bad3168

This is based on the work done to add strtoll and the other strto functions. The atoi functions also were added to stdc and entrypoints.txt. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D108330

[lldb][NFC] Remove unused header include

4947f6d

[mlir][Linalg] Allow all build methods of Structured ops to specify a…

95ddc83

…dditional attributes. Differential Revision: https://reviews.llvm.org/D108338

Revert "[mlir][Linalg] Allow all build methods of Structured ops to s…

16ffb28

…pecify additional attributes." This reverts commit 95ddc83. Differential Revision: https://reviews.llvm.org/D108396

[libc][Obvious] Fix llvm_libc_ext.td.

aeee014

[sanitizer] Fix for CMAKE_CXX_FLAGS update

68ab571

With unquoted ${CMAKE_CXX_FLAGS}, the REGEX fails when it's empty: ```CMake Error at lib/scudo/standalone/CMakeLists.txt:14 (string): string sub-command REGEX, mode REPLACE needs at least 6 arguments total to command.```

Move function definition out-of-line to fix the modularized build (NFC)

1e586bc

[openmp] Disable the tests that block CI for amdgpu and host offloading.

ad0f6e1

vmaksimo requested review from AlexeySotkin, bader, DenisBakhvalov, elizabethandrews, hchilama, kbobrovs, kychendev, mdtoguchi, mlychkov, premanandrao and sndmitriev as code owners August 25, 2021 17:15

intel deleted a comment from bb-sycl Aug 26, 2021

MrSidims and others added 3 commits August 26, 2021 12:24

Map llvm.isnan on OpIsNan

739793e

Signed-off-by: Dmitry Sidorov <dmitry.sidorov@intel.com> Original commit: KhronosGroup/SPIRV-LLVM-Translator@81ebabd

Merge remote-tracking branch 'intel_llvm/sycl' into llvmspirv_pulldown

7b0a7db

s-kanaev mentioned this pull request Aug 31, 2021

[SYCL] Implementation of fallback assert #3767

Merged

Mark bfloat16 test as unsupported on CUDA

81f1b5e

CUDA support will be added later in a separate PR.

vmaksimo requested a review from a team as a code owner August 31, 2021 09:34

vmaksimo requested a review from cperkinsintel August 31, 2021 09:34

vladimirlaz approved these changes Aug 31, 2021

View reviewed changes

AlexeySotkin approved these changes Aug 31, 2021

View reviewed changes

romanovvlad mentioned this pull request Aug 31, 2021

[SYCL] Refactor cl::sycl namespace. Part1 #4397

Closed

vmaksimo merged commit a358170 into intel:sycl Aug 31, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LLVM and SPIRV-LLVM-Translator pulldown (WW35) #4388

LLVM and SPIRV-LLVM-Translator pulldown (WW35) #4388

vmaksimo commented Aug 23, 2021 •

edited

Loading

vmaksimo commented Aug 26, 2021

Fznamznon commented Aug 26, 2021

bb-sycl commented Aug 26, 2021

bader commented Aug 26, 2021

vmaksimo commented Aug 31, 2021

bb-sycl commented Aug 31, 2021

LLVM and SPIRV-LLVM-Translator pulldown (WW35) #4388

LLVM and SPIRV-LLVM-Translator pulldown (WW35) #4388

Conversation

vmaksimo commented Aug 23, 2021 • edited Loading

vmaksimo commented Aug 26, 2021

Fznamznon commented Aug 26, 2021

bb-sycl commented Aug 26, 2021

bader commented Aug 26, 2021

vmaksimo commented Aug 31, 2021

bb-sycl commented Aug 31, 2021

vmaksimo commented Aug 23, 2021 •

edited

Loading