-
Notifications
You must be signed in to change notification settings - Fork 326
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Performance] Faster CatFrames.unfolding
with padding="same"
#2407
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2407
Note: Links to docs will display an error until the docs builds have been completed. ❌ 8 New Failures, 8 Unrelated FailuresAs of commit 86f4348 with merge base e82a69f (): NEW FAILURES - The following jobs have failed:
FLAKY - The following jobs failed but were likely due to flakiness present on trunk:
BROKEN TRUNK - The following jobs failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Amazing, I'm so grateful for this!
Can you just comment on the 0: indexing?
for sample_idx, num_repeats in enumerate(num_repeats_per_sample): | ||
if num_repeats > 0: | ||
res_slice = res_flat_series_flat_batch[sample_idx] | ||
res_slice[:, 0:num_repeats] = res_slice[ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we need the 0: here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh right, :num_repeats
will work, without the 0. I'll update it
862ea28
to
86f4348
Compare
Description
Improves the performance of
CatFrames.unfolding
withpadding="same"
. Running the performance measurement script in the issue description of #2406 shows a speedup of ~70x compared to the previous implementation:Motivation and Context
close #2406
Types of changes
What types of changes does your code introduce? Remove all that do not apply:
Checklist
Go over all the following points, and put an
x
in all the boxes that apply.If you are unsure about any of these, don't hesitate to ask. We are here to help!