-
-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
optimizer: enable SROA of mutable φ-nodes #43505
Open
aviatesk
wants to merge
1
commit into
avi/multisroa
Choose a base branch
from
avi/mutablephi
base: avi/multisroa
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
aviatesk
added
needs nanosoldier run
This PR should have benchmarks run on it
compiler:optimizer
Optimization passes (mostly in base/compiler/ssair/)
labels
Dec 21, 2021
aviatesk
force-pushed
the
avi/multisroa
branch
from
December 22, 2021 08:21
dec65e1
to
738df81
Compare
aviatesk
force-pushed
the
avi/mutablephi
branch
from
December 22, 2021 08:21
73fbd3d
to
1b8be01
Compare
aviatesk
force-pushed
the
avi/multisroa
branch
from
December 22, 2021 09:10
738df81
to
dd6d086
Compare
aviatesk
force-pushed
the
avi/mutablephi
branch
from
December 22, 2021 09:10
1b8be01
to
d2cf634
Compare
aviatesk
force-pushed
the
avi/multisroa
branch
from
December 23, 2021 05:57
dd6d086
to
250059f
Compare
aviatesk
force-pushed
the
avi/mutablephi
branch
2 times, most recently
from
December 23, 2021 09:07
c19c4e4
to
2ddf09f
Compare
aviatesk
force-pushed
the
avi/multisroa
branch
2 times, most recently
from
January 6, 2022 03:44
0908ab4
to
e1e502e
Compare
aviatesk
force-pushed
the
avi/mutablephi
branch
from
January 6, 2022 03:44
2ddf09f
to
29fd2ac
Compare
aviatesk
force-pushed
the
avi/multisroa
branch
from
January 8, 2022 04:56
e1e502e
to
a9c6daf
Compare
This commit allows elimination of mutable φ-node (and its predecessor mutables allocations). As an contrived example, it allows this `mutable_ϕ_elim(::String, ::Vector{String})` to run without any allocations at all: ```julia function mutable_ϕ_elim(x, xs) r = Ref(x) for x in xs r = Ref(x) end return r[] end let xs = String[string(gensym()) for _ in 1:100] mutable_ϕ_elim("init", xs) @test @allocated(mutable_ϕ_elim("init", xs)) == 0 end ``` This mutable ϕ-node elimination is still limited though. Most notably, the current implementation doesn't work if a mutable allocation forms multiple ϕ-nodes, since we check allocation eliminability (i.e. escapability) by counting usages counts and thus it's hard to reason about multiple ϕ-nodes at a time. For example, currently mutable allocations involved in cases like below will still not be eliminated: ```julia code_typed((Bool,String,String),) do cond, x, y if cond ϕ2 = ϕ1 = Ref(x) else ϕ2 = ϕ1 = Ref(y) end ϕ1[], ϕ2[] end \# more realistic example mutable struct Point{T} x::T y::T end add(a::Point, b::Point) = Point(a.x + b.x, a.y + b.y) function compute(a::Point{ComplexF64}, b::Point{ComplexF64}) for i in 0:(100000000-1) a = add(add(a, b), b) end a.x, a.y end ``` I'd say this limitation should be addressed by first introducing a better abstraction for reasoning escape information. More specifically, I'd like introduce EscapeAnalysis.jl into Julia base first, and then gradually adapt it to improve our SROA pass, since EA will allow us to reason about all escape information imposed on whatever object more easily and should help us get rid of the complexities of our current SROA implementation. For now, I'd like to get in this enhancement even though it has the limitation elaborated above, as far as this commit doesn't introduce latency problem (which is unlikely).
aviatesk
force-pushed
the
avi/mutablephi
branch
from
January 8, 2022 04:57
29fd2ac
to
bf97c29
Compare
@aviatesk what's the status of this PR? It's 3 years old and actually targets another branch of your, |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
compiler:optimizer
Optimization passes (mostly in base/compiler/ssair/)
needs nanosoldier run
This PR should have benchmarks run on it
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This commit allows elimination of mutable φ-node (and its predecessor mutable allocations).
As an contrived example, it allows this
mutable_ϕ_elim(::String, ::Vector{String})
to run without any allocations at all:
This mutable ϕ-node elimination is still limited though.
Most notably, the current implementation doesn't work if a mutable
allocation forms multiple ϕ-nodes, since we check allocation eliminability
(i.e. escapability) by counting usages by def-chain walking and thus
it's hard to reason about multiple ϕ-nodes at a time.
For example, currently mutable allocations involved in cases like below
will still not be eliminated:
I'd say this limitation should be addressed by first introducing a better
abstraction for reasoning escape information. More specifically, I'd like
introduce EscapeAnalysis.jl into Julia base first, and then gradually
adapt it to improve our SROA pass, since EA will allow us to reason about
all escape information imposed on whatever object more easily and should
help us get rid of the complexities of our current SROA implementation.
For now, I'd like to get in this enhancement even though it has the
limitation elaborated above, as far as this commit doesn't introduce
latency problem (which I believe is unlikely).