Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI, BugFix] Py3.8 for old deps #2568

Merged
merged 5 commits into from
Nov 15, 2024
Merged

[CI, BugFix] Py3.8 for old deps #2568

merged 5 commits into from
Nov 15, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 15, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Nov 15, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2568

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

❌ 19 New Failures, 5 Unrelated Failures

As of commit 4a09f82 with merge base 9f8f77c (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens added a commit that referenced this pull request Nov 15, 2024
ghstack-source-id: 14b518d6f7f94c938a0312fb6e98bb6e22cb4c4f
Pull Request resolved: #2568
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 15, 2024
@vmoens vmoens added bug Something isn't working CI Has to do with CI setup (e.g. wheels & builds, tests...) labels Nov 15, 2024
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 15, 2024
ghstack-source-id: 204f2184e944713ba62c63b51ec4522712e94ac9
Pull Request resolved: #2568
Copy link

github-actions bot commented Nov 15, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}4$. Worsened: $\large\color{#d91a1a}4$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.4294s 0.4269s 2.3424 Ops/s 2.2382 Ops/s $\color{#35bf28}+4.65\%$
test_transformed 0.5961s 0.5950s 1.6808 Ops/s 1.6216 Ops/s $\color{#35bf28}+3.65\%$
test_serial 1.3420s 1.3380s 0.7474 Ops/s 0.7371 Ops/s $\color{#35bf28}+1.39\%$
test_parallel 1.3767s 1.2887s 0.7760 Ops/s 0.7692 Ops/s $\color{#35bf28}+0.88\%$
test_step_mdp_speed[True-True-True-True-True] 0.3739ms 26.7691μs 37.3565 KOps/s 37.7896 KOps/s $\color{#d91a1a}-1.15\%$
test_step_mdp_speed[True-True-True-True-False] 53.3510μs 15.7254μs 63.5914 KOps/s 65.0170 KOps/s $\color{#d91a1a}-2.19\%$
test_step_mdp_speed[True-True-True-False-True] 44.7440μs 15.2496μs 65.5756 KOps/s 66.2509 KOps/s $\color{#d91a1a}-1.02\%$
test_step_mdp_speed[True-True-True-False-False] 50.1150μs 8.9540μs 111.6815 KOps/s 114.0346 KOps/s $\color{#d91a1a}-2.06\%$
test_step_mdp_speed[True-True-False-True-True] 63.4990μs 28.7808μs 34.7454 KOps/s 35.3038 KOps/s $\color{#d91a1a}-1.58\%$
test_step_mdp_speed[True-True-False-True-False] 68.4890μs 17.6227μs 56.7449 KOps/s 58.0805 KOps/s $\color{#d91a1a}-2.30\%$
test_step_mdp_speed[True-True-False-False-True] 47.7000μs 16.9618μs 58.9560 KOps/s 60.7308 KOps/s $\color{#d91a1a}-2.92\%$
test_step_mdp_speed[True-True-False-False-False] 64.0500μs 10.5490μs 94.7957 KOps/s 96.5192 KOps/s $\color{#d91a1a}-1.79\%$
test_step_mdp_speed[True-False-True-True-True] 64.3910μs 30.3432μs 32.9563 KOps/s 33.6540 KOps/s $\color{#d91a1a}-2.07\%$
test_step_mdp_speed[True-False-True-True-False] 55.6440μs 19.1291μs 52.2764 KOps/s 53.4995 KOps/s $\color{#d91a1a}-2.29\%$
test_step_mdp_speed[True-False-True-False-True] 69.9320μs 16.9092μs 59.1393 KOps/s 59.5909 KOps/s $\color{#d91a1a}-0.76\%$
test_step_mdp_speed[True-False-True-False-False] 35.9870μs 10.6425μs 93.9632 KOps/s 96.7606 KOps/s $\color{#d91a1a}-2.89\%$
test_step_mdp_speed[True-False-False-True-True] 0.1210ms 31.4418μs 31.8048 KOps/s 32.2397 KOps/s $\color{#d91a1a}-1.35\%$
test_step_mdp_speed[True-False-False-True-False] 50.6550μs 20.0934μs 49.7676 KOps/s 49.9873 KOps/s $\color{#d91a1a}-0.44\%$
test_step_mdp_speed[True-False-False-False-True] 41.9790μs 18.0550μs 55.3864 KOps/s 55.3961 KOps/s $\color{#d91a1a}-0.02\%$
test_step_mdp_speed[True-False-False-False-False] 38.0910μs 11.9556μs 83.6427 KOps/s 83.6498 KOps/s $-0.01\%$
test_step_mdp_speed[False-True-True-True-True] 60.1440μs 30.3588μs 32.9394 KOps/s 33.4812 KOps/s $\color{#d91a1a}-1.62\%$
test_step_mdp_speed[False-True-True-True-False] 53.1000μs 19.1790μs 52.1404 KOps/s 53.6569 KOps/s $\color{#d91a1a}-2.83\%$
test_step_mdp_speed[False-True-True-False-True] 43.6120μs 19.3496μs 51.6806 KOps/s 52.8093 KOps/s $\color{#d91a1a}-2.14\%$
test_step_mdp_speed[False-True-True-False-False] 40.7960μs 11.9499μs 83.6824 KOps/s 86.3170 KOps/s $\color{#d91a1a}-3.05\%$
test_step_mdp_speed[False-True-False-True-True] 71.9750μs 31.8067μs 31.4399 KOps/s 31.6634 KOps/s $\color{#d91a1a}-0.71\%$
test_step_mdp_speed[False-True-False-True-False] 51.0960μs 20.6968μs 48.3167 KOps/s 49.8983 KOps/s $\color{#d91a1a}-3.17\%$
test_step_mdp_speed[False-True-False-False-True] 2.9804ms 21.0462μs 47.5145 KOps/s 48.6045 KOps/s $\color{#d91a1a}-2.24\%$
test_step_mdp_speed[False-True-False-False-False] 42.3600μs 13.3222μs 75.0626 KOps/s 75.3480 KOps/s $\color{#d91a1a}-0.38\%$
test_step_mdp_speed[False-False-True-True-True] 0.1386ms 33.5138μs 29.8385 KOps/s 30.6422 KOps/s $\color{#d91a1a}-2.62\%$
test_step_mdp_speed[False-False-True-True-False] 47.7090μs 22.2756μs 44.8922 KOps/s 46.1251 KOps/s $\color{#d91a1a}-2.67\%$
test_step_mdp_speed[False-False-True-False-True] 51.8870μs 20.9292μs 47.7802 KOps/s 49.2511 KOps/s $\color{#d91a1a}-2.99\%$
test_step_mdp_speed[False-False-True-False-False] 44.0420μs 13.3858μs 74.7060 KOps/s 76.3869 KOps/s $\color{#d91a1a}-2.20\%$
test_step_mdp_speed[False-False-False-True-True] 75.1910μs 34.3708μs 29.0945 KOps/s 29.3755 KOps/s $\color{#d91a1a}-0.96\%$
test_step_mdp_speed[False-False-False-True-False] 54.3420μs 23.6938μs 42.2051 KOps/s 43.2489 KOps/s $\color{#d91a1a}-2.41\%$
test_step_mdp_speed[False-False-False-False-True] 50.6150μs 22.0537μs 45.3439 KOps/s 45.8739 KOps/s $\color{#d91a1a}-1.16\%$
test_step_mdp_speed[False-False-False-False-False] 96.4110μs 14.8994μs 67.1167 KOps/s 68.7503 KOps/s $\color{#d91a1a}-2.38\%$
test_values[generalized_advantage_estimate-True-True] 10.4450ms 9.8490ms 101.5331 Ops/s 100.4087 Ops/s $\color{#35bf28}+1.12\%$
test_values[vec_generalized_advantage_estimate-True-True] 38.3475ms 35.7546ms 27.9685 Ops/s 28.1104 Ops/s $\color{#d91a1a}-0.50\%$
test_values[td0_return_estimate-False-False] 0.2444ms 0.1659ms 6.0276 KOps/s 5.9255 KOps/s $\color{#35bf28}+1.72\%$
test_values[td1_return_estimate-False-False] 28.5066ms 24.6256ms 40.6082 Ops/s 40.2253 Ops/s $\color{#35bf28}+0.95\%$
test_values[vec_td1_return_estimate-False-False] 38.3573ms 36.0430ms 27.7446 Ops/s 27.6914 Ops/s $\color{#35bf28}+0.19\%$
test_values[td_lambda_return_estimate-True-False] 39.1872ms 35.4023ms 28.2468 Ops/s 28.0236 Ops/s $\color{#35bf28}+0.80\%$
test_values[vec_td_lambda_return_estimate-True-False] 37.9909ms 35.9471ms 27.8187 Ops/s 28.0520 Ops/s $\color{#d91a1a}-0.83\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.8605ms 8.6397ms 115.7441 Ops/s 112.3911 Ops/s $\color{#35bf28}+2.98\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.3548ms 1.8657ms 535.9775 Ops/s 518.8649 Ops/s $\color{#35bf28}+3.30\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.4742ms 0.3641ms 2.7461 KOps/s 2.7315 KOps/s $\color{#35bf28}+0.54\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 43.9304ms 42.8140ms 23.3569 Ops/s 22.1449 Ops/s $\textbf{\color{#35bf28}+5.47\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.7912ms 3.0302ms 330.0149 Ops/s 329.4279 Ops/s $\color{#35bf28}+0.18\%$
test_dqn_speed[False-None] 6.3222ms 1.3377ms 747.5568 Ops/s 745.1753 Ops/s $\color{#35bf28}+0.32\%$
test_dqn_speed[False-backward] 1.8652ms 1.8106ms 552.3099 Ops/s 546.2863 Ops/s $\color{#35bf28}+1.10\%$
test_dqn_speed[True-None] 1.1758ms 0.4742ms 2.1088 KOps/s 2.1378 KOps/s $\color{#d91a1a}-1.36\%$
test_dqn_speed[True-backward] 0.9389ms 0.8952ms 1.1171 KOps/s 1.0860 KOps/s $\color{#35bf28}+2.86\%$
test_dqn_speed[reduce-overhead-None] 1.3050ms 0.4731ms 2.1138 KOps/s 2.1011 KOps/s $\color{#35bf28}+0.60\%$
test_dqn_speed[reduce-overhead-backward] 0.9790ms 0.9024ms 1.1082 KOps/s 1.1176 KOps/s $\color{#d91a1a}-0.85\%$
test_ddpg_speed[False-None] 2.9225ms 2.7431ms 364.5445 Ops/s 358.2605 Ops/s $\color{#35bf28}+1.75\%$
test_ddpg_speed[False-backward] 4.2802ms 3.8926ms 256.8984 Ops/s 254.4293 Ops/s $\color{#35bf28}+0.97\%$
test_ddpg_speed[True-None] 1.2730ms 1.0101ms 990.0366 Ops/s 982.6582 Ops/s $\color{#35bf28}+0.75\%$
test_ddpg_speed[True-backward] 1.9853ms 1.9092ms 523.7754 Ops/s 433.9360 Ops/s $\textbf{\color{#35bf28}+20.70\%}$
test_ddpg_speed[reduce-overhead-None] 1.4008ms 1.0175ms 982.8229 Ops/s 979.1888 Ops/s $\color{#35bf28}+0.37\%$
test_ddpg_speed[reduce-overhead-backward] 1.9721ms 1.9087ms 523.9172 Ops/s 522.1975 Ops/s $\color{#35bf28}+0.33\%$
test_sac_speed[False-None] 0.1956s 9.2628ms 107.9589 Ops/s 127.2531 Ops/s $\textbf{\color{#d91a1a}-15.16\%}$
test_sac_speed[False-backward] 11.0727ms 10.4870ms 95.3557 Ops/s 94.2955 Ops/s $\color{#35bf28}+1.12\%$
test_sac_speed[True-None] 2.2834ms 1.8382ms 544.0155 Ops/s 545.0703 Ops/s $\color{#d91a1a}-0.19\%$
test_sac_speed[True-backward] 3.6313ms 3.5365ms 282.7662 Ops/s 285.1473 Ops/s $\color{#d91a1a}-0.84\%$
test_sac_speed[reduce-overhead-None] 2.1701ms 1.8356ms 544.7735 Ops/s 544.9272 Ops/s $\color{#d91a1a}-0.03\%$
test_sac_speed[reduce-overhead-backward] 3.5670ms 3.5013ms 285.6095 Ops/s 284.9042 Ops/s $\color{#35bf28}+0.25\%$
test_redq_speed[False-None] 13.8109ms 12.6777ms 78.8785 Ops/s 77.4934 Ops/s $\color{#35bf28}+1.79\%$
test_redq_speed[False-backward] 24.2801ms 22.0110ms 45.4317 Ops/s 44.8279 Ops/s $\color{#35bf28}+1.35\%$
test_redq_speed[True-None] 6.0945ms 4.7209ms 211.8225 Ops/s 222.2846 Ops/s $\color{#d91a1a}-4.71\%$
test_redq_speed[True-backward] 13.0538ms 11.9081ms 83.9767 Ops/s 82.5184 Ops/s $\color{#35bf28}+1.77\%$
test_redq_speed[reduce-overhead-None] 5.6421ms 4.6407ms 215.4831 Ops/s 215.0855 Ops/s $\color{#35bf28}+0.18\%$
test_redq_speed[reduce-overhead-backward] 13.8052ms 12.0215ms 83.1844 Ops/s 82.5077 Ops/s $\color{#35bf28}+0.82\%$
test_redq_deprec_speed[False-None] 14.3287ms 12.3755ms 80.8047 Ops/s 80.3918 Ops/s $\color{#35bf28}+0.51\%$
test_redq_deprec_speed[False-backward] 21.5745ms 18.6051ms 53.7486 Ops/s 55.1914 Ops/s $\color{#d91a1a}-2.61\%$
test_redq_deprec_speed[True-None] 4.3820ms 3.5597ms 280.9248 Ops/s 278.0622 Ops/s $\color{#35bf28}+1.03\%$
test_redq_deprec_speed[True-backward] 9.4286ms 8.0630ms 124.0226 Ops/s 124.8208 Ops/s $\color{#d91a1a}-0.64\%$
test_redq_deprec_speed[reduce-overhead-None] 4.1222ms 3.5574ms 281.1014 Ops/s 279.4622 Ops/s $\color{#35bf28}+0.59\%$
test_redq_deprec_speed[reduce-overhead-backward] 8.9831ms 8.0619ms 124.0407 Ops/s 121.7187 Ops/s $\color{#35bf28}+1.91\%$
test_td3_speed[False-None] 33.1863ms 7.9184ms 126.2884 Ops/s 129.4665 Ops/s $\color{#d91a1a}-2.45\%$
test_td3_speed[False-backward] 10.6390ms 10.1225ms 98.7901 Ops/s 98.1031 Ops/s $\color{#35bf28}+0.70\%$
test_td3_speed[True-None] 1.9737ms 1.7204ms 581.2491 Ops/s 575.1945 Ops/s $\color{#35bf28}+1.05\%$
test_td3_speed[True-backward] 3.3907ms 3.3162ms 301.5526 Ops/s 302.6838 Ops/s $\color{#d91a1a}-0.37\%$
test_td3_speed[reduce-overhead-None] 1.8355ms 1.7153ms 582.9842 Ops/s 582.1988 Ops/s $\color{#35bf28}+0.13\%$
test_td3_speed[reduce-overhead-backward] 3.6711ms 3.3299ms 300.3069 Ops/s 304.4644 Ops/s $\color{#d91a1a}-1.37\%$
test_cql_speed[False-None] 43.2110ms 36.1732ms 27.6448 Ops/s 28.1010 Ops/s $\color{#d91a1a}-1.62\%$
test_cql_speed[False-backward] 49.1698ms 45.8703ms 21.8006 Ops/s 22.0837 Ops/s $\color{#d91a1a}-1.28\%$
test_cql_speed[True-None] 17.1392ms 15.4706ms 64.6385 Ops/s 64.7026 Ops/s $\color{#d91a1a}-0.10\%$
test_cql_speed[True-backward] 24.2489ms 22.1839ms 45.0777 Ops/s 44.7519 Ops/s $\color{#35bf28}+0.73\%$
test_cql_speed[reduce-overhead-None] 17.4478ms 15.6753ms 63.7945 Ops/s 64.5475 Ops/s $\color{#d91a1a}-1.17\%$
test_cql_speed[reduce-overhead-backward] 23.5580ms 22.1618ms 45.1226 Ops/s 44.8685 Ops/s $\color{#35bf28}+0.57\%$
test_a2c_speed[False-None] 9.9971ms 7.0890ms 141.0641 Ops/s 140.1423 Ops/s $\color{#35bf28}+0.66\%$
test_a2c_speed[False-backward] 14.4276ms 14.0605ms 71.1212 Ops/s 71.1089 Ops/s $\color{#35bf28}+0.02\%$
test_a2c_speed[True-None] 3.6317ms 3.3282ms 300.4624 Ops/s 302.9576 Ops/s $\color{#d91a1a}-0.82\%$
test_a2c_speed[True-backward] 10.0420ms 9.6993ms 103.1006 Ops/s 103.0913 Ops/s $+0.01\%$
test_a2c_speed[reduce-overhead-None] 3.6071ms 3.3036ms 302.6980 Ops/s 301.8344 Ops/s $\color{#35bf28}+0.29\%$
test_a2c_speed[reduce-overhead-backward] 10.6709ms 9.6984ms 103.1093 Ops/s 102.8829 Ops/s $\color{#35bf28}+0.22\%$
test_ppo_speed[False-None] 8.1535ms 7.2868ms 137.2337 Ops/s 136.9201 Ops/s $\color{#35bf28}+0.23\%$
test_ppo_speed[False-backward] 15.1636ms 14.5273ms 68.8359 Ops/s 69.2810 Ops/s $\color{#d91a1a}-0.64\%$
test_ppo_speed[True-None] 4.3664ms 3.6918ms 270.8674 Ops/s 270.7087 Ops/s $\color{#35bf28}+0.06\%$
test_ppo_speed[True-backward] 10.2643ms 9.5821ms 104.3610 Ops/s 104.7644 Ops/s $\color{#d91a1a}-0.39\%$
test_ppo_speed[reduce-overhead-None] 4.3656ms 3.7025ms 270.0860 Ops/s 271.3285 Ops/s $\color{#d91a1a}-0.46\%$
test_ppo_speed[reduce-overhead-backward] 10.2162ms 9.5990ms 104.1776 Ops/s 104.5330 Ops/s $\color{#d91a1a}-0.34\%$
test_reinforce_speed[False-None] 7.4414ms 6.4565ms 154.8820 Ops/s 156.9344 Ops/s $\color{#d91a1a}-1.31\%$
test_reinforce_speed[False-backward] 10.0162ms 9.6494ms 103.6332 Ops/s 104.3316 Ops/s $\color{#d91a1a}-0.67\%$
test_reinforce_speed[True-None] 3.1914ms 2.6508ms 377.2415 Ops/s 377.4491 Ops/s $\color{#d91a1a}-0.06\%$
test_reinforce_speed[True-backward] 9.1731ms 8.5632ms 116.7784 Ops/s 116.5244 Ops/s $\color{#35bf28}+0.22\%$
test_reinforce_speed[reduce-overhead-None] 3.1091ms 2.6492ms 377.4675 Ops/s 377.6587 Ops/s $\color{#d91a1a}-0.05\%$
test_reinforce_speed[reduce-overhead-backward] 9.4233ms 8.5776ms 116.5831 Ops/s 115.8546 Ops/s $\color{#35bf28}+0.63\%$
test_iql_speed[False-None] 34.0983ms 31.9573ms 31.2917 Ops/s 31.3145 Ops/s $\color{#d91a1a}-0.07\%$
test_iql_speed[False-backward] 47.0881ms 44.9428ms 22.2505 Ops/s 22.2959 Ops/s $\color{#d91a1a}-0.20\%$
test_iql_speed[True-None] 12.0357ms 10.5111ms 95.1372 Ops/s 95.1427 Ops/s $-0.01\%$
test_iql_speed[True-backward] 22.8978ms 21.4333ms 46.6563 Ops/s 46.8218 Ops/s $\color{#d91a1a}-0.35\%$
test_iql_speed[reduce-overhead-None] 14.4030ms 10.7880ms 92.6955 Ops/s 94.8403 Ops/s $\color{#d91a1a}-2.26\%$
test_iql_speed[reduce-overhead-backward] 23.6558ms 21.3311ms 46.8799 Ops/s 46.7737 Ops/s $\color{#35bf28}+0.23\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.8512ms 4.8610ms 205.7182 Ops/s 202.9037 Ops/s $\color{#35bf28}+1.39\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.9951ms 0.5058ms 1.9769 KOps/s 1.9912 KOps/s $\color{#d91a1a}-0.72\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8241ms 0.4817ms 2.0760 KOps/s 2.0811 KOps/s $\color{#d91a1a}-0.25\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.8343ms 4.7115ms 212.2486 Ops/s 213.6590 Ops/s $\color{#d91a1a}-0.66\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.9904ms 0.4957ms 2.0172 KOps/s 2.0274 KOps/s $\color{#d91a1a}-0.50\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7035ms 0.4699ms 2.1282 KOps/s 2.1407 KOps/s $\color{#d91a1a}-0.58\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.4738ms 1.6390ms 610.1287 Ops/s 607.8783 Ops/s $\color{#35bf28}+0.37\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.3593ms 1.5883ms 629.5989 Ops/s 630.1814 Ops/s $\color{#d91a1a}-0.09\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.6676ms 4.8105ms 207.8789 Ops/s 208.6005 Ops/s $\color{#d91a1a}-0.35\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.0454ms 0.6417ms 1.5583 KOps/s 1.5723 KOps/s $\color{#d91a1a}-0.89\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.1144ms 0.6244ms 1.6014 KOps/s 1.6445 KOps/s $\color{#d91a1a}-2.62\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 4.9176ms 4.6833ms 213.5249 Ops/s 214.1653 Ops/s $\color{#d91a1a}-0.30\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8691ms 0.5121ms 1.9527 KOps/s 1.9800 KOps/s $\color{#d91a1a}-1.38\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 7.2357ms 0.4891ms 2.0446 KOps/s 2.0822 KOps/s $\color{#d91a1a}-1.81\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.4811ms 4.6212ms 216.3936 Ops/s 219.5464 Ops/s $\color{#d91a1a}-1.44\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.0133ms 0.4988ms 2.0048 KOps/s 2.0018 KOps/s $\color{#35bf28}+0.15\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7660ms 0.4806ms 2.0808 KOps/s 2.1236 KOps/s $\color{#d91a1a}-2.02\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.5962ms 4.7277ms 211.5176 Ops/s 213.1818 Ops/s $\color{#d91a1a}-0.78\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.7619ms 0.6447ms 1.5510 KOps/s 1.5786 KOps/s $\color{#d91a1a}-1.75\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8141ms 0.6098ms 1.6398 KOps/s 1.6053 KOps/s $\color{#35bf28}+2.14\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.4781ms 4.2356ms 236.0915 Ops/s 260.6547 Ops/s $\textbf{\color{#d91a1a}-9.42\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.4264ms 2.3753ms 420.9975 Ops/s 440.2942 Ops/s $\color{#d91a1a}-4.38\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 4.4488ms 1.3663ms 731.9054 Ops/s 792.2109 Ops/s $\textbf{\color{#d91a1a}-7.61\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.3897s 11.9389ms 83.7595 Ops/s 38.3348 Ops/s $\textbf{\color{#35bf28}+118.49\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 3.2026ms 2.0161ms 495.9989 Ops/s 424.3389 Ops/s $\textbf{\color{#35bf28}+16.89\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 8.8158ms 1.3824ms 723.3959 Ops/s 738.8220 Ops/s $\color{#d91a1a}-2.09\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.2001ms 4.4473ms 224.8537 Ops/s 230.4067 Ops/s $\color{#d91a1a}-2.41\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 6.8177ms 2.4695ms 404.9414 Ops/s 394.2942 Ops/s $\color{#35bf28}+2.70\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.1316ms 1.5470ms 646.4020 Ops/s 681.2805 Ops/s $\textbf{\color{#d91a1a}-5.12\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 11.5697ms 11.3277ms 88.2794 Ops/s 91.3534 Ops/s $\color{#d91a1a}-3.36\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 15.8406ms 14.5755ms 68.6083 Ops/s 69.4184 Ops/s $\color{#d91a1a}-1.17\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 21.0411ms 20.1136ms 49.7176 Ops/s 50.3111 Ops/s $\color{#d91a1a}-1.18\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 15.1257ms 14.5000ms 68.9653 Ops/s 68.3334 Ops/s $\color{#35bf28}+0.92\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 22.2789ms 20.5129ms 48.7497 Ops/s 50.7698 Ops/s $\color{#d91a1a}-3.98\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 17.9202ms 15.9871ms 62.5503 Ops/s 63.2402 Ops/s $\color{#d91a1a}-1.09\%$

Copy link

github-actions bot commented Nov 15, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}14$. Worsened: $\large\color{#d91a1a}8$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.7222s 0.7213s 1.3863 Ops/s 1.3491 Ops/s $\color{#35bf28}+2.76\%$
test_transformed 0.9696s 0.9668s 1.0344 Ops/s 1.0388 Ops/s $\color{#d91a1a}-0.43\%$
test_serial 2.0861s 2.0795s 0.4809 Ops/s 0.4781 Ops/s $\color{#35bf28}+0.58\%$
test_parallel 1.9722s 1.9186s 0.5212 Ops/s 0.5004 Ops/s $\color{#35bf28}+4.15\%$
test_step_mdp_speed[True-True-True-True-True] 0.2118ms 36.4269μs 27.4522 KOps/s 27.8540 KOps/s $\color{#d91a1a}-1.44\%$
test_step_mdp_speed[True-True-True-True-False] 54.0610μs 20.4668μs 48.8595 KOps/s 48.4195 KOps/s $\color{#35bf28}+0.91\%$
test_step_mdp_speed[True-True-True-False-True] 49.0110μs 20.0073μs 49.9818 KOps/s 50.4664 KOps/s $\color{#d91a1a}-0.96\%$
test_step_mdp_speed[True-True-True-False-False] 50.5410μs 11.5823μs 86.3384 KOps/s 86.1606 KOps/s $\color{#35bf28}+0.21\%$
test_step_mdp_speed[True-True-False-True-True] 65.9910μs 38.8972μs 25.7088 KOps/s 26.1139 KOps/s $\color{#d91a1a}-1.55\%$
test_step_mdp_speed[True-True-False-True-False] 51.5610μs 22.7200μs 44.0141 KOps/s 44.0894 KOps/s $\color{#d91a1a}-0.17\%$
test_step_mdp_speed[True-True-False-False-True] 51.5410μs 22.1530μs 45.1405 KOps/s 45.9540 KOps/s $\color{#d91a1a}-1.77\%$
test_step_mdp_speed[True-True-False-False-False] 38.5110μs 13.5366μs 73.8738 KOps/s 73.1778 KOps/s $\color{#35bf28}+0.95\%$
test_step_mdp_speed[True-False-True-True-True] 99.4120μs 41.0035μs 24.3882 KOps/s 24.8771 KOps/s $\color{#d91a1a}-1.97\%$
test_step_mdp_speed[True-False-True-True-False] 51.6110μs 24.9077μs 40.1483 KOps/s 41.1690 KOps/s $\color{#d91a1a}-2.48\%$
test_step_mdp_speed[True-False-True-False-True] 53.2600μs 22.4883μs 44.4675 KOps/s 45.7350 KOps/s $\color{#d91a1a}-2.77\%$
test_step_mdp_speed[True-False-True-False-False] 39.4010μs 13.7243μs 72.8636 KOps/s 74.2915 KOps/s $\color{#d91a1a}-1.92\%$
test_step_mdp_speed[True-False-False-True-True] 85.0420μs 42.0142μs 23.8015 KOps/s 24.2677 KOps/s $\color{#d91a1a}-1.92\%$
test_step_mdp_speed[True-False-False-True-False] 52.0700μs 26.5880μs 37.6110 KOps/s 37.8348 KOps/s $\color{#d91a1a}-0.59\%$
test_step_mdp_speed[True-False-False-False-True] 53.1310μs 24.4974μs 40.8206 KOps/s 41.7340 KOps/s $\color{#d91a1a}-2.19\%$
test_step_mdp_speed[True-False-False-False-False] 55.3900μs 15.6182μs 64.0279 KOps/s 67.0986 KOps/s $\color{#d91a1a}-4.58\%$
test_step_mdp_speed[False-True-True-True-True] 78.6320μs 41.2639μs 24.2342 KOps/s 25.2472 KOps/s $\color{#d91a1a}-4.01\%$
test_step_mdp_speed[False-True-True-True-False] 51.6810μs 24.9301μs 40.1122 KOps/s 40.7762 KOps/s $\color{#d91a1a}-1.63\%$
test_step_mdp_speed[False-True-True-False-True] 55.6610μs 25.9995μs 38.4622 KOps/s 39.1924 KOps/s $\color{#d91a1a}-1.86\%$
test_step_mdp_speed[False-True-True-False-False] 42.5910μs 15.5788μs 64.1900 KOps/s 65.3328 KOps/s $\color{#d91a1a}-1.75\%$
test_step_mdp_speed[False-True-False-True-True] 70.7610μs 43.1958μs 23.1504 KOps/s 23.9047 KOps/s $\color{#d91a1a}-3.16\%$
test_step_mdp_speed[False-True-False-True-False] 52.8610μs 26.8675μs 37.2197 KOps/s 37.4288 KOps/s $\color{#d91a1a}-0.56\%$
test_step_mdp_speed[False-True-False-False-True] 3.4031ms 28.1437μs 35.5319 KOps/s 36.1037 KOps/s $\color{#d91a1a}-1.58\%$
test_step_mdp_speed[False-True-False-False-False] 53.8710μs 17.4755μs 57.2231 KOps/s 57.7840 KOps/s $\color{#d91a1a}-0.97\%$
test_step_mdp_speed[False-False-True-True-True] 77.8720μs 44.8881μs 22.2776 KOps/s 22.4929 KOps/s $\color{#d91a1a}-0.96\%$
test_step_mdp_speed[False-False-True-True-False] 57.0310μs 28.9781μs 34.5088 KOps/s 34.7478 KOps/s $\color{#d91a1a}-0.69\%$
test_step_mdp_speed[False-False-True-False-True] 69.5510μs 27.6878μs 36.1169 KOps/s 36.8027 KOps/s $\color{#d91a1a}-1.86\%$
test_step_mdp_speed[False-False-True-False-False] 46.0200μs 17.1794μs 58.2092 KOps/s 57.9660 KOps/s $\color{#35bf28}+0.42\%$
test_step_mdp_speed[False-False-False-True-True] 72.2010μs 46.2640μs 21.6151 KOps/s 21.9369 KOps/s $\color{#d91a1a}-1.47\%$
test_step_mdp_speed[False-False-False-True-False] 69.1810μs 30.5730μs 32.7086 KOps/s 32.7506 KOps/s $\color{#d91a1a}-0.13\%$
test_step_mdp_speed[False-False-False-False-True] 59.8510μs 29.3049μs 34.1240 KOps/s 34.5813 KOps/s $\color{#d91a1a}-1.32\%$
test_step_mdp_speed[False-False-False-False-False] 44.9710μs 19.1306μs 52.2724 KOps/s 52.1567 KOps/s $\color{#35bf28}+0.22\%$
test_values[generalized_advantage_estimate-True-True] 23.6895ms 23.1491ms 43.1982 Ops/s 43.2440 Ops/s $\color{#d91a1a}-0.11\%$
test_values[vec_generalized_advantage_estimate-True-True] 97.1697ms 2.8107ms 355.7823 Ops/s 345.6887 Ops/s $\color{#35bf28}+2.92\%$
test_values[td0_return_estimate-False-False] 84.0710μs 62.1780μs 16.0829 KOps/s 16.4767 KOps/s $\color{#d91a1a}-2.39\%$
test_values[td1_return_estimate-False-False] 52.5252ms 52.1517ms 19.1748 Ops/s 19.2395 Ops/s $\color{#d91a1a}-0.34\%$
test_values[vec_td1_return_estimate-False-False] 1.4306ms 1.0514ms 951.1240 Ops/s 962.3004 Ops/s $\color{#d91a1a}-1.16\%$
test_values[td_lambda_return_estimate-True-False] 84.8852ms 82.4235ms 12.1325 Ops/s 12.1837 Ops/s $\color{#d91a1a}-0.42\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.3154ms 1.0418ms 959.8957 Ops/s 964.7050 Ops/s $\color{#d91a1a}-0.50\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 23.4185ms 23.2493ms 43.0120 Ops/s 42.9173 Ops/s $\color{#35bf28}+0.22\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0081ms 0.6982ms 1.4322 KOps/s 1.4355 KOps/s $\color{#d91a1a}-0.23\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7227ms 0.6283ms 1.5916 KOps/s 1.5954 KOps/s $\color{#d91a1a}-0.24\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.4871ms 1.4364ms 696.1778 Ops/s 698.3166 Ops/s $\color{#d91a1a}-0.31\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.6862ms 0.6425ms 1.5565 KOps/s 1.5588 KOps/s $\color{#d91a1a}-0.14\%$
test_dqn_speed[False-None] 0.1017s 1.4565ms 686.5900 Ops/s 766.2266 Ops/s $\textbf{\color{#d91a1a}-10.39\%}$
test_dqn_speed[False-backward] 1.8565ms 1.7859ms 559.9300 Ops/s 560.3671 Ops/s $\color{#d91a1a}-0.08\%$
test_dqn_speed[True-None] 0.7191ms 0.5509ms 1.8151 KOps/s 1.8360 KOps/s $\color{#d91a1a}-1.14\%$
test_dqn_speed[True-backward] 1.0599ms 0.9992ms 1.0008 KOps/s 999.7595 Ops/s $\color{#35bf28}+0.10\%$
test_dqn_speed[reduce-overhead-None] 0.6932ms 0.5590ms 1.7889 KOps/s 1.8172 KOps/s $\color{#d91a1a}-1.56\%$
test_dqn_speed[reduce-overhead-backward] 1.3732ms 0.9924ms 1.0076 KOps/s 1.0051 KOps/s $\color{#35bf28}+0.25\%$
test_ddpg_speed[False-None] 3.1662ms 2.6221ms 381.3797 Ops/s 379.7441 Ops/s $\color{#35bf28}+0.43\%$
test_ddpg_speed[False-backward] 3.9348ms 3.7946ms 263.5334 Ops/s 262.4709 Ops/s $\color{#35bf28}+0.40\%$
test_ddpg_speed[True-None] 1.2582ms 1.1918ms 839.1013 Ops/s 822.0121 Ops/s $\color{#35bf28}+2.08\%$
test_ddpg_speed[True-backward] 2.2127ms 2.1618ms 462.5858 Ops/s 456.1044 Ops/s $\color{#35bf28}+1.42\%$
test_ddpg_speed[reduce-overhead-None] 1.5715ms 1.2020ms 831.9316 Ops/s 818.9079 Ops/s $\color{#35bf28}+1.59\%$
test_ddpg_speed[reduce-overhead-backward] 2.2159ms 2.1635ms 462.2157 Ops/s 459.9341 Ops/s $\color{#35bf28}+0.50\%$
test_sac_speed[False-None] 8.1102ms 7.2291ms 138.3293 Ops/s 137.2350 Ops/s $\color{#35bf28}+0.80\%$
test_sac_speed[False-backward] 10.8719ms 10.3097ms 96.9961 Ops/s 95.8892 Ops/s $\color{#35bf28}+1.15\%$
test_sac_speed[True-None] 2.3505ms 1.9564ms 511.1460 Ops/s 489.2871 Ops/s $\color{#35bf28}+4.47\%$
test_sac_speed[True-backward] 3.9822ms 3.8596ms 259.0974 Ops/s 255.9096 Ops/s $\color{#35bf28}+1.25\%$
test_sac_speed[reduce-overhead-None] 2.3385ms 1.9676ms 508.2296 Ops/s 504.7182 Ops/s $\color{#35bf28}+0.70\%$
test_sac_speed[reduce-overhead-backward] 3.9519ms 3.8569ms 259.2750 Ops/s 258.1582 Ops/s $\color{#35bf28}+0.43\%$
test_redq_speed[False-None] 15.4486ms 10.9933ms 90.9647 Ops/s 99.6948 Ops/s $\textbf{\color{#d91a1a}-8.76\%}$
test_redq_speed[False-backward] 17.4866ms 16.7505ms 59.6997 Ops/s 58.1462 Ops/s $\color{#35bf28}+2.67\%$
test_redq_speed[True-None] 3.9435ms 3.4827ms 287.1331 Ops/s 291.8742 Ops/s $\color{#d91a1a}-1.62\%$
test_redq_speed[True-backward] 8.8559ms 8.4948ms 117.7186 Ops/s 118.6707 Ops/s $\color{#d91a1a}-0.80\%$
test_redq_speed[reduce-overhead-None] 3.8312ms 3.4709ms 288.1073 Ops/s 289.4726 Ops/s $\color{#d91a1a}-0.47\%$
test_redq_speed[reduce-overhead-backward] 8.8935ms 8.4790ms 117.9378 Ops/s 116.7460 Ops/s $\color{#35bf28}+1.02\%$
test_redq_deprec_speed[False-None] 10.7140ms 10.2351ms 97.7027 Ops/s 97.5543 Ops/s $\color{#35bf28}+0.15\%$
test_redq_deprec_speed[False-backward] 15.3812ms 14.8941ms 67.1405 Ops/s 67.2184 Ops/s $\color{#d91a1a}-0.12\%$
test_redq_deprec_speed[True-None] 3.6086ms 3.1676ms 315.7005 Ops/s 296.7953 Ops/s $\textbf{\color{#35bf28}+6.37\%}$
test_redq_deprec_speed[True-backward] 7.2252ms 6.9410ms 144.0709 Ops/s 142.6402 Ops/s $\color{#35bf28}+1.00\%$
test_redq_deprec_speed[reduce-overhead-None] 3.5163ms 3.1468ms 317.7822 Ops/s 315.0650 Ops/s $\color{#35bf28}+0.86\%$
test_redq_deprec_speed[reduce-overhead-backward] 7.1886ms 6.9605ms 143.6674 Ops/s 146.3398 Ops/s $\color{#d91a1a}-1.83\%$
test_td3_speed[False-None] 7.2542ms 7.2021ms 138.8487 Ops/s 136.8165 Ops/s $\color{#35bf28}+1.49\%$
test_td3_speed[False-backward] 10.4095ms 9.9594ms 100.4078 Ops/s 98.9591 Ops/s $\color{#35bf28}+1.46\%$
test_td3_speed[True-None] 1.8980ms 1.8591ms 537.9056 Ops/s 526.9685 Ops/s $\color{#35bf28}+2.08\%$
test_td3_speed[True-backward] 3.7974ms 3.6489ms 274.0535 Ops/s 272.8409 Ops/s $\color{#35bf28}+0.44\%$
test_td3_speed[reduce-overhead-None] 1.8748ms 1.8472ms 541.3569 Ops/s 527.3334 Ops/s $\color{#35bf28}+2.66\%$
test_td3_speed[reduce-overhead-backward] 3.6923ms 3.6054ms 277.3597 Ops/s 273.2016 Ops/s $\color{#35bf28}+1.52\%$
test_cql_speed[False-None] 27.0576ms 24.3151ms 41.1267 Ops/s 40.5572 Ops/s $\color{#35bf28}+1.40\%$
test_cql_speed[False-backward] 36.4383ms 33.6750ms 29.6957 Ops/s 30.0101 Ops/s $\color{#d91a1a}-1.05\%$
test_cql_speed[True-None] 11.0752ms 10.7332ms 93.1688 Ops/s 93.7467 Ops/s $\color{#d91a1a}-0.62\%$
test_cql_speed[True-backward] 16.8988ms 16.3158ms 61.2903 Ops/s 60.4620 Ops/s $\color{#35bf28}+1.37\%$
test_cql_speed[reduce-overhead-None] 11.0511ms 10.7809ms 92.7563 Ops/s 93.1778 Ops/s $\color{#d91a1a}-0.45\%$
test_cql_speed[reduce-overhead-backward] 17.0047ms 16.4004ms 60.9742 Ops/s 61.3579 Ops/s $\color{#d91a1a}-0.63\%$
test_a2c_speed[False-None] 5.5928ms 5.2042ms 192.1518 Ops/s 186.9218 Ops/s $\color{#35bf28}+2.80\%$
test_a2c_speed[False-backward] 11.8119ms 11.5388ms 86.6639 Ops/s 85.0114 Ops/s $\color{#35bf28}+1.94\%$
test_a2c_speed[True-None] 3.1313ms 3.0002ms 333.3158 Ops/s 326.2377 Ops/s $\color{#35bf28}+2.17\%$
test_a2c_speed[True-backward] 8.5358ms 8.3385ms 119.9256 Ops/s 119.8320 Ops/s $\color{#35bf28}+0.08\%$
test_a2c_speed[reduce-overhead-None] 3.2955ms 3.0026ms 333.0443 Ops/s 322.5577 Ops/s $\color{#35bf28}+3.25\%$
test_a2c_speed[reduce-overhead-backward] 8.4177ms 8.2306ms 121.4973 Ops/s 119.9332 Ops/s $\color{#35bf28}+1.30\%$
test_ppo_speed[False-None] 5.8395ms 5.5063ms 181.6102 Ops/s 177.5686 Ops/s $\color{#35bf28}+2.28\%$
test_ppo_speed[False-backward] 12.2973ms 11.9149ms 83.9287 Ops/s 82.7798 Ops/s $\color{#35bf28}+1.39\%$
test_ppo_speed[True-None] 3.5361ms 3.3680ms 296.9115 Ops/s 289.2242 Ops/s $\color{#35bf28}+2.66\%$
test_ppo_speed[True-backward] 8.4835ms 8.1505ms 122.6915 Ops/s 122.5018 Ops/s $\color{#35bf28}+0.15\%$
test_ppo_speed[reduce-overhead-None] 3.5351ms 3.3796ms 295.8910 Ops/s 288.8813 Ops/s $\color{#35bf28}+2.43\%$
test_ppo_speed[reduce-overhead-backward] 8.5119ms 8.1186ms 123.1747 Ops/s 123.6228 Ops/s $\color{#d91a1a}-0.36\%$
test_reinforce_speed[False-None] 4.8534ms 4.3573ms 229.5012 Ops/s 224.0440 Ops/s $\color{#35bf28}+2.44\%$
test_reinforce_speed[False-backward] 7.7511ms 7.1513ms 139.8342 Ops/s 136.8934 Ops/s $\color{#35bf28}+2.15\%$
test_reinforce_speed[True-None] 2.7208ms 2.2038ms 453.7712 Ops/s 438.7158 Ops/s $\color{#35bf28}+3.43\%$
test_reinforce_speed[True-backward] 7.3097ms 7.0337ms 142.1729 Ops/s 141.4509 Ops/s $\color{#35bf28}+0.51\%$
test_reinforce_speed[reduce-overhead-None] 2.5119ms 2.2009ms 454.3540 Ops/s 444.9113 Ops/s $\color{#35bf28}+2.12\%$
test_reinforce_speed[reduce-overhead-backward] 7.3044ms 7.0019ms 142.8177 Ops/s 143.9174 Ops/s $\color{#d91a1a}-0.76\%$
test_iql_speed[False-None] 19.7127ms 18.8805ms 52.9648 Ops/s 52.7908 Ops/s $\color{#35bf28}+0.33\%$
test_iql_speed[False-backward] 30.0892ms 29.3889ms 34.0265 Ops/s 34.0630 Ops/s $\color{#d91a1a}-0.11\%$
test_iql_speed[True-None] 6.9282ms 6.6032ms 151.4410 Ops/s 149.9505 Ops/s $\color{#35bf28}+0.99\%$
test_iql_speed[True-backward] 15.5502ms 15.1083ms 66.1890 Ops/s 63.9515 Ops/s $\color{#35bf28}+3.50\%$
test_iql_speed[reduce-overhead-None] 7.0500ms 6.6561ms 150.2371 Ops/s 150.3941 Ops/s $\color{#d91a1a}-0.10\%$
test_iql_speed[reduce-overhead-backward] 15.8461ms 15.2115ms 65.7399 Ops/s 66.0476 Ops/s $\color{#d91a1a}-0.47\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.9840ms 6.2253ms 160.6356 Ops/s 159.1407 Ops/s $\color{#35bf28}+0.94\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.5915ms 0.2663ms 3.7551 KOps/s 3.2009 KOps/s $\textbf{\color{#35bf28}+17.31\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.4452ms 0.2435ms 4.1067 KOps/s 3.3763 KOps/s $\textbf{\color{#35bf28}+21.64\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.2115ms 5.9406ms 168.3321 Ops/s 165.4970 Ops/s $\color{#35bf28}+1.71\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.1402ms 0.2572ms 3.8886 KOps/s 3.3125 KOps/s $\textbf{\color{#35bf28}+17.39\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5177ms 0.2536ms 3.9426 KOps/s 3.4195 KOps/s $\textbf{\color{#35bf28}+15.30\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.5935ms 1.1970ms 835.4546 Ops/s 735.3674 Ops/s $\textbf{\color{#35bf28}+13.61\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.3567ms 1.1462ms 872.4155 Ops/s 760.3169 Ops/s $\textbf{\color{#35bf28}+14.74\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.2032ms 6.1183ms 163.4443 Ops/s 160.4321 Ops/s $\color{#35bf28}+1.88\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.7634ms 0.3976ms 2.5151 KOps/s 2.4410 KOps/s $\color{#35bf28}+3.03\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.5796ms 0.3750ms 2.6666 KOps/s 2.2388 KOps/s $\textbf{\color{#35bf28}+19.11\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.0894ms 5.9574ms 167.8583 Ops/s 165.5076 Ops/s $\color{#35bf28}+1.42\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.0079ms 0.3035ms 3.2953 KOps/s 2.9218 KOps/s $\textbf{\color{#35bf28}+12.78\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5419ms 0.3221ms 3.1049 KOps/s 4.1320 KOps/s $\textbf{\color{#d91a1a}-24.86\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 9.7404ms 5.9837ms 167.1208 Ops/s 166.8884 Ops/s $\color{#35bf28}+0.14\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.6622ms 0.2944ms 3.3966 KOps/s 3.7756 KOps/s $\textbf{\color{#d91a1a}-10.04\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5059ms 0.2639ms 3.7886 KOps/s 3.3281 KOps/s $\textbf{\color{#35bf28}+13.84\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.2737ms 6.1228ms 163.3234 Ops/s 162.5219 Ops/s $\color{#35bf28}+0.49\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.9684ms 0.3952ms 2.5301 KOps/s 2.3422 KOps/s $\textbf{\color{#35bf28}+8.02\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6120ms 0.3766ms 2.6553 KOps/s 2.5680 KOps/s $\color{#35bf28}+3.40\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 6.6894ms 5.1169ms 195.4325 Ops/s 196.5740 Ops/s $\color{#d91a1a}-0.58\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 3.8599ms 1.7568ms 569.2247 Ops/s 455.3752 Ops/s $\textbf{\color{#35bf28}+25.00\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 8.8217ms 1.2859ms 777.6672 Ops/s 882.8529 Ops/s $\textbf{\color{#d91a1a}-11.91\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.3952s 13.0382ms 76.6976 Ops/s 196.6023 Ops/s $\textbf{\color{#d91a1a}-60.99\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.9700ms 2.0109ms 497.2905 Ops/s 444.5252 Ops/s $\textbf{\color{#35bf28}+11.87\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 6.9525ms 1.1973ms 835.2061 Ops/s 872.6776 Ops/s $\color{#d91a1a}-4.29\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.0639ms 5.3702ms 186.2141 Ops/s 36.4315 Ops/s $\textbf{\color{#35bf28}+411.13\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 9.7859ms 2.1835ms 457.9731 Ops/s 535.9760 Ops/s $\textbf{\color{#d91a1a}-14.55\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.2373ms 1.3544ms 738.3383 Ops/s 861.3273 Ops/s $\textbf{\color{#d91a1a}-14.28\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.1114ms 12.7377ms 78.5072 Ops/s 76.8372 Ops/s $\color{#35bf28}+2.17\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 17.5429ms 16.3060ms 61.3271 Ops/s 61.8073 Ops/s $\color{#d91a1a}-0.78\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.3292ms 17.9751ms 55.6327 Ops/s 55.9473 Ops/s $\color{#d91a1a}-0.56\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 16.7269ms 16.5294ms 60.4982 Ops/s 59.6276 Ops/s $\color{#35bf28}+1.46\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 20.2892ms 18.4081ms 54.3240 Ops/s 56.6233 Ops/s $\color{#d91a1a}-4.06\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 18.5400ms 18.0021ms 55.5492 Ops/s 56.7745 Ops/s $\color{#d91a1a}-2.16\%$

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 15, 2024
ghstack-source-id: 375507a211ba83aeec553682e6bfeb607c648878
Pull Request resolved: #2568
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 15, 2024
ghstack-source-id: 18abba7bf284e7c939971a73daf165881c4dc816
Pull Request resolved: #2568
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 15, 2024
ghstack-source-id: 13c7923c0e5c8725c12c3bacc6c21b250d9f7457
Pull Request resolved: #2568
@vmoens vmoens merged commit 4a09f82 into gh/vmoens/40/base Nov 15, 2024
19 of 36 checks passed
vmoens added a commit that referenced this pull request Nov 15, 2024
ghstack-source-id: 13c7923c0e5c8725c12c3bacc6c21b250d9f7457
Pull Request resolved: #2568
@vmoens vmoens deleted the gh/vmoens/40/head branch November 15, 2024 12:01
vmoens added a commit that referenced this pull request Nov 15, 2024
ghstack-source-id: 13c7923c0e5c8725c12c3bacc6c21b250d9f7457
Pull Request resolved: #2568

(cherry picked from commit f3275da)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants