Backport LLVM patches to fix X86 partial register stall #20025

yuyichao · 2017-01-14T02:38:58Z

Fix #19976

This improves the performance of pisum on one of my machine by ~5.5x and the assembly looks better.

@nanosoldier runbenchmarks(ALL, vs = ":master")

@mbauman @KristofferC you probably have better idea about the original/other case this comes up.

Keno · 2017-01-14T02:55:16Z

Out of curiosity, which of these commits is the actual fix for our issue?

yuyichao · 2017-01-14T02:58:27Z

The fourth patch is the one that nominally fixes it. I included the fifth one because it claims to have fixed a bug in the fourth one. With only this two commit the LLVM tests fails and the failure looks like real codegen regressions so I included a few more that touches the same tests and the LLVM tests are now passing. I'm not entirely sure which ones fixes the "regression" caused by the fix but from the commit message I suspect it's the third one.

yuyichao · 2017-01-14T03:03:53Z

Counted by the order they are cherry picked (see the [PATCH */5] in the commit message) Apparently I got the order wrong in llvm.mk but there doesn't seem to be too many conflicts.....

So the one that should be fixing this issue is [PATCH 4/5] Avoid false dependencies of undef machine operands (also the biggest one though a big part of it is tests...). The fix for it is [PATCH 5/5] Fixing bug committed in rev. 278321. The one that I think fixes the regression is [PATCH 3/5] ExecutionDepsFix - Fix bug in clearance calculation

nanosoldier · 2017-01-14T06:44:40Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

yuyichao · 2017-01-14T07:10:07Z

@nanosoldier runbenchmarks(ALL, vs = ":master")

Just to check which ones are noise....

nanosoldier · 2017-01-14T11:14:12Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

tkelman · 2017-01-14T11:28:22Z

looks like ["array","growth",("append!",2048)] was noise, the rest were not

Keno · 2017-01-15T06:26:29Z

I've looked into the regression and the problem is that LLVM fails to take into account the clearance from an incoming backedge when choosing which register to use. Should be fixable, I'll try for a patch.

Keno · 2017-01-16T05:35:34Z

Upstream patch is here: https://reviews.llvm.org/D28759

Will push an update to this PR that adds the upstream patch, so we can re-run nanosoldier.

Keno · 2017-01-16T05:40:30Z

@nanosoldier runbenchmarks(ALL, vs = ":master")

nanosoldier · 2017-01-16T09:46:08Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

Keno · 2017-01-16T16:43:00Z

Looks like I did indeed fix the super bad regressions we saw. There's still comprehension_iteration left which I should probably look at. Also, there's some new ones which may or may not be real.
Let's run again to see: @nanosoldier runbenchmarks(ALL, vs = ":master")

nanosoldier · 2017-01-16T20:57:07Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

Keno · 2017-01-16T21:00:46Z

Ok, I'm gonna go with the comprehension one is the only remaining regression. Looking into it.

tkelman · 2017-01-16T21:09:50Z

["scalar","arithmetic",("div","Complex{Float32}","Complex{Float32}")] also looks real and a bit larger?

Keno · 2017-01-16T21:34:45Z

Ah, yes. In any case, the cause seems to be what @yuyichao mentioned to me. Namely, that calls to non-inlined functions don't reset clearance information.

Keno · 2017-01-16T23:51:02Z

https://reviews.llvm.org/D28786 same procedure yesterday.

Keno · 2017-01-16T23:57:07Z

@nanosoldier runbenchmarks(ALL, vs = ":master")

nanosoldier · 2017-01-17T04:06:49Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @jrevels

vchuravy · 2017-01-17T05:25:02Z

Travis fails on 32bit:

julia: /home/travis/build/JuliaLang/julia/deps/srccache/llvm-3.9.1/include/llvm/CodeGen/MachineOperand.h:268: unsigned int llvm::MachineOperand::getReg() const: Assertion `isReg() && "This is not a register operand!"' failed.

Keno · 2017-01-17T18:18:21Z

I was unable to reproduce this locally, but I had an off by one in the patch, which could have easily caused that. Fixed.

Fix #19976

Keno · 2017-01-17T18:20:25Z

Also, I think the last nanosoldier run picked up some improvements on master, so @nanosoldier runbenchmarks(ALL, vs = ":master")

Keno · 2017-01-19T01:43:05Z

@nanosoldier runbenchmarks(ALL, vs = ":master")

nanosoldier · 2017-01-19T05:47:05Z

Your benchmark job has completed - no performance regressions were detected. A full report can be found here. cc @jrevels

vchuravy · 2017-01-19T05:51:56Z

Nanosoldier seems very happy with this 🎉. LGTM if travis is green

martinholters · 2017-01-19T07:15:33Z

The travis failure looks unrelated. Any ideas what that might be?

tkelman · 2017-01-19T13:23:22Z

it's the mac spawn test issue that #20073 is attempting to fix. these patches should be updated to match whatever gets committed upstream once it goes through review there, but lgtm for now

Keno · 2017-01-19T21:43:32Z

Let's wait a few days while upstream review is ongoing. The problem with replacing these patches later is that they won't apply cleanly to people who've had this version applied. Also, FYI, the second commit has been split up into
https://reviews.llvm.org/D28915.

tkelman · 2017-01-24T00:31:54Z

Any news upstream on these?

Keno · 2017-01-24T00:43:05Z

No, but it's only been one business day since I posted that, and some of these folks might not work weekends ;).

yuyichao · 2017-01-30T22:57:06Z

Any news a week later now?

Keno · 2017-01-30T23:22:53Z

Yes, for some reason I'm not getting email notifications on these anymore though.

tkelman · 2017-01-30T23:24:39Z

I had the same happen on phabricator, seems flaky

Keno · 2017-01-31T00:06:21Z

1/3 is now nominally upstream. We'll see if it survives the buildbot onslaught.

KristofferC · 2017-02-08T21:25:14Z

Bump, any news?

tkelman · 2017-02-13T19:12:55Z

@Keno what is the status on these?

Keno · 2017-02-13T20:22:59Z

Making its way through review upstream.

tkelman · 2017-02-13T20:30:43Z

Can we rebase this to reflect the latest state and get that in before alpha, then (later) add additional incremental patches on top of that as the upstream review continues?

tkelman · 2017-02-14T11:08:31Z

getting the latest state of this in would help make the benchmark resilts much more predictable and useful

tkelman · 2017-02-17T08:19:05Z

@Keno can we please just merge the latest state of this patch set now?

Keno · 2017-02-21T04:59:02Z

The latest state of this patch is still very much in flux. We can however, rebase this PR as is and merge it.

tkelman · 2017-02-21T08:47:24Z

one of them was upstreamed, wasn't it? what's closer to what will eventually be upstream, what's in this PR now or the current state of the upstream reviews?

StefanKarpinski · 2017-02-21T14:58:41Z

OS X failure: https://gist.github.com/StefanKarpinski/1663878980f211580b7fb83b93d2e911

tkelman · 2017-02-22T08:15:55Z

been a while since this last ran @nanosoldier runbenchmarks(ALL, vs = ":master")

nanosoldier · 2017-02-22T11:41:33Z

Your benchmark job has completed - no performance regressions were detected. A full report can be found here. cc @jrevels

StefanKarpinski · 2017-02-22T21:36:38Z

Dang, that's a lot of performance improvements. Hard to believe LLVM let something this big regress. Oh wait, no it isn't.

KristofferC · 2017-02-22T21:54:19Z

We got the best LLVM hackers don't we?

vchuravy added this to the 0.6.0 milestone Jan 14, 2017

Keno approved these changes Jan 14, 2017

View reviewed changes

tkelman added the upstream The issue is with an upstream dependency, e.g. LLVM label Jan 14, 2017

Keno force-pushed the yyc/codegen/llvm-stall branch from c1bc99f to 840b10f Compare January 16, 2017 05:39

Keno force-pushed the yyc/codegen/llvm-stall branch from d006cc4 to 780f9c5 Compare January 17, 2017 18:15

yuyichao and others added 2 commits January 17, 2017 13:19

Backport LLVM patches to fix X86 partial register stall

df891e0

Fix #19976

Add LLVM Patch D28759

e1f8faf

Keno force-pushed the yyc/codegen/llvm-stall branch from 780f9c5 to 6d094a3 Compare January 17, 2017 18:19

Keno force-pushed the yyc/codegen/llvm-stall branch from 5386ccb to f4cbc40 Compare January 19, 2017 01:42

tkelman mentioned this pull request Jan 20, 2017

speed up date time parsing #19545

Merged

Merge branch 'master' into yyc/codegen/llvm-stall

e1b88f7

tkelman merged commit f2b155a into master Feb 23, 2017

tkelman deleted the yyc/codegen/llvm-stall branch February 23, 2017 17:24

Backport LLVM patches to fix X86 partial register stall #20025

Backport LLVM patches to fix X86 partial register stall #20025

Conversation

yuyichao commented Jan 14, 2017

Keno commented Jan 14, 2017

yuyichao commented Jan 14, 2017

yuyichao commented Jan 14, 2017

nanosoldier commented Jan 14, 2017

yuyichao commented Jan 14, 2017

nanosoldier commented Jan 14, 2017

tkelman commented Jan 14, 2017

Keno commented Jan 15, 2017

Keno commented Jan 16, 2017

Keno commented Jan 16, 2017

nanosoldier commented Jan 16, 2017

Keno commented Jan 16, 2017

nanosoldier commented Jan 16, 2017

Keno commented Jan 16, 2017

tkelman commented Jan 16, 2017

Keno commented Jan 16, 2017

Keno commented Jan 16, 2017

Keno commented Jan 16, 2017

nanosoldier commented Jan 17, 2017

vchuravy commented Jan 17, 2017

Keno commented Jan 17, 2017

Keno commented Jan 17, 2017

Keno commented Jan 19, 2017

nanosoldier commented Jan 19, 2017

vchuravy commented Jan 19, 2017

martinholters commented Jan 19, 2017

tkelman commented Jan 19, 2017

Keno commented Jan 19, 2017

tkelman commented Jan 24, 2017

Keno commented Jan 24, 2017 • edited Loading

yuyichao commented Jan 30, 2017

Keno commented Jan 30, 2017

tkelman commented Jan 30, 2017

Keno commented Jan 31, 2017

KristofferC commented Feb 8, 2017

tkelman commented Feb 13, 2017

Keno commented Feb 13, 2017

tkelman commented Feb 13, 2017 • edited Loading

tkelman commented Feb 14, 2017

tkelman commented Feb 17, 2017 • edited Loading

Keno commented Feb 21, 2017

tkelman commented Feb 21, 2017

StefanKarpinski commented Feb 21, 2017

tkelman commented Feb 22, 2017

nanosoldier commented Feb 22, 2017

StefanKarpinski commented Feb 22, 2017

KristofferC commented Feb 22, 2017

Keno commented Jan 24, 2017 •

edited

Loading

tkelman commented Feb 13, 2017 •

edited

Loading

tkelman commented Feb 17, 2017 •

edited

Loading