Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Versioning] Bump v0.3.0 and fix m1-wheels #1780

Merged
merged 2 commits into from
Jan 8, 2024
Merged

[Versioning] Bump v0.3.0 and fix m1-wheels #1780

merged 2 commits into from
Jan 8, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Jan 8, 2024

No description provided.

Copy link

pytorch-bot bot commented Jan 8, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/1780

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (5 Unrelated Failures)

As of commit 830b464 with merge base 55faae1 (image):

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 8, 2024
@vmoens vmoens added the versioning Versioning change (version number etc) label Jan 8, 2024
Copy link

github-actions bot commented Jan 8, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 89. Improved: $\large\color{#35bf28}6$. Worsened: $\large\color{#d91a1a}3$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_single 64.6215ms 64.0722ms 15.6074 Ops/s 15.1826 Ops/s $\color{#35bf28}+2.80\%$
test_sync 39.7712ms 35.0047ms 28.5676 Ops/s 28.7278 Ops/s $\color{#d91a1a}-0.56\%$
test_async 0.1232s 32.8685ms 30.4242 Ops/s 28.6684 Ops/s $\textbf{\color{#35bf28}+6.12\%}$
test_simple 0.5083s 0.4496s 2.2242 Ops/s 2.1600 Ops/s $\color{#35bf28}+2.97\%$
test_transformed 0.6870s 0.6163s 1.6225 Ops/s 1.5994 Ops/s $\color{#35bf28}+1.44\%$
test_serial 1.4431s 1.3822s 0.7235 Ops/s 0.7023 Ops/s $\color{#35bf28}+3.01\%$
test_parallel 1.4384s 1.3654s 0.7324 Ops/s 0.7346 Ops/s $\color{#d91a1a}-0.30\%$
test_step_mdp_speed[True-True-True-True-True] 0.1700ms 21.6922μs 46.0995 KOps/s 46.0197 KOps/s $\color{#35bf28}+0.17\%$
test_step_mdp_speed[True-True-True-True-False] 64.3020μs 13.3633μs 74.8317 KOps/s 77.0750 KOps/s $\color{#d91a1a}-2.91\%$
test_step_mdp_speed[True-True-True-False-True] 55.1430μs 12.8923μs 77.5654 KOps/s 78.4121 KOps/s $\color{#d91a1a}-1.08\%$
test_step_mdp_speed[True-True-True-False-False] 43.6320μs 7.9037μs 126.5230 KOps/s 129.7248 KOps/s $\color{#d91a1a}-2.47\%$
test_step_mdp_speed[True-True-False-True-True] 55.4530μs 23.2239μs 43.0591 KOps/s 43.3724 KOps/s $\color{#d91a1a}-0.72\%$
test_step_mdp_speed[True-True-False-True-False] 52.8700μs 14.8126μs 67.5103 KOps/s 69.7447 KOps/s $\color{#d91a1a}-3.20\%$
test_step_mdp_speed[True-True-False-False-True] 36.3780μs 14.2088μs 70.3787 KOps/s 71.4837 KOps/s $\color{#d91a1a}-1.55\%$
test_step_mdp_speed[True-True-False-False-False] 42.9500μs 9.1908μs 108.8042 KOps/s 111.7001 KOps/s $\color{#d91a1a}-2.59\%$
test_step_mdp_speed[True-False-True-True-True] 60.9740μs 24.7176μs 40.4569 KOps/s 40.7363 KOps/s $\color{#d91a1a}-0.69\%$
test_step_mdp_speed[True-False-True-True-False] 45.8560μs 16.1264μs 62.0103 KOps/s 63.6755 KOps/s $\color{#d91a1a}-2.62\%$
test_step_mdp_speed[True-False-True-False-True] 52.6990μs 14.2693μs 70.0808 KOps/s 71.2778 KOps/s $\color{#d91a1a}-1.68\%$
test_step_mdp_speed[True-False-True-False-False] 33.1510μs 9.1497μs 109.2935 KOps/s 112.1288 KOps/s $\color{#d91a1a}-2.53\%$
test_step_mdp_speed[True-False-False-True-True] 62.0960μs 26.0265μs 38.4223 KOps/s 39.5924 KOps/s $\color{#d91a1a}-2.96\%$
test_step_mdp_speed[True-False-False-True-False] 49.0320μs 17.2558μs 57.9516 KOps/s 59.3190 KOps/s $\color{#d91a1a}-2.31\%$
test_step_mdp_speed[True-False-False-False-True] 54.9030μs 15.4603μs 64.6817 KOps/s 66.3427 KOps/s $\color{#d91a1a}-2.50\%$
test_step_mdp_speed[True-False-False-False-False] 33.5530μs 10.3895μs 96.2514 KOps/s 99.3703 KOps/s $\color{#d91a1a}-3.14\%$
test_step_mdp_speed[False-True-True-True-True] 57.8690μs 24.6649μs 40.5434 KOps/s 41.4717 KOps/s $\color{#d91a1a}-2.24\%$
test_step_mdp_speed[False-True-True-True-False] 50.4750μs 16.1249μs 62.0161 KOps/s 63.5363 KOps/s $\color{#d91a1a}-2.39\%$
test_step_mdp_speed[False-True-True-False-True] 38.4920μs 16.6664μs 60.0009 KOps/s 61.8154 KOps/s $\color{#d91a1a}-2.94\%$
test_step_mdp_speed[False-True-True-False-False] 43.2210μs 10.4051μs 96.1066 KOps/s 98.5518 KOps/s $\color{#d91a1a}-2.48\%$
test_step_mdp_speed[False-True-False-True-True] 61.3950μs 25.9087μs 38.5970 KOps/s 39.5665 KOps/s $\color{#d91a1a}-2.45\%$
test_step_mdp_speed[False-True-False-True-False] 55.0230μs 17.3340μs 57.6901 KOps/s 58.2966 KOps/s $\color{#d91a1a}-1.04\%$
test_step_mdp_speed[False-True-False-False-True] 44.3030μs 17.9470μs 55.7197 KOps/s 57.4734 KOps/s $\color{#d91a1a}-3.05\%$
test_step_mdp_speed[False-True-False-False-False] 55.5540μs 11.7571μs 85.0547 KOps/s 88.4230 KOps/s $\color{#d91a1a}-3.81\%$
test_step_mdp_speed[False-False-True-True-True] 63.2990μs 27.1873μs 36.7819 KOps/s 37.3256 KOps/s $\color{#d91a1a}-1.46\%$
test_step_mdp_speed[False-False-True-True-False] 45.5660μs 18.5558μs 53.8914 KOps/s 54.1600 KOps/s $\color{#d91a1a}-0.50\%$
test_step_mdp_speed[False-False-True-False-True] 51.3960μs 18.0404μs 55.4311 KOps/s 57.0496 KOps/s $\color{#d91a1a}-2.84\%$
test_step_mdp_speed[False-False-True-False-False] 34.5850μs 11.7598μs 85.0356 KOps/s 87.2443 KOps/s $\color{#d91a1a}-2.53\%$
test_step_mdp_speed[False-False-False-True-True] 64.3000μs 28.0272μs 35.6796 KOps/s 35.7621 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[False-False-False-True-False] 52.7680μs 19.5639μs 51.1145 KOps/s 51.5435 KOps/s $\color{#d91a1a}-0.83\%$
test_step_mdp_speed[False-False-False-False-True] 53.8610μs 18.8770μs 52.9745 KOps/s 53.8770 KOps/s $\color{#d91a1a}-1.68\%$
test_step_mdp_speed[False-False-False-False-False] 45.9060μs 12.8089μs 78.0710 KOps/s 79.4406 KOps/s $\color{#d91a1a}-1.72\%$
test_values[generalized_advantage_estimate-True-True] 16.8099ms 12.1035ms 82.6209 Ops/s 82.3855 Ops/s $\color{#35bf28}+0.29\%$
test_values[vec_generalized_advantage_estimate-True-True] 35.3927ms 28.4877ms 35.1029 Ops/s 35.2265 Ops/s $\color{#d91a1a}-0.35\%$
test_values[td0_return_estimate-False-False] 0.2231ms 0.1753ms 5.7040 KOps/s 5.5128 KOps/s $\color{#35bf28}+3.47\%$
test_values[td1_return_estimate-False-False] 28.8777ms 25.7044ms 38.9038 Ops/s 38.7492 Ops/s $\color{#35bf28}+0.40\%$
test_values[vec_td1_return_estimate-False-False] 35.9685ms 28.3329ms 35.2947 Ops/s 35.2779 Ops/s $\color{#35bf28}+0.05\%$
test_values[td_lambda_return_estimate-True-False] 36.0940ms 35.4185ms 28.2338 Ops/s 27.4209 Ops/s $\color{#35bf28}+2.96\%$
test_values[vec_td_lambda_return_estimate-True-False] 35.4954ms 28.4415ms 35.1599 Ops/s 35.4327 Ops/s $\color{#d91a1a}-0.77\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.3672ms 8.0826ms 123.7222 Ops/s 121.9591 Ops/s $\color{#35bf28}+1.45\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 10.1761ms 1.9580ms 510.7254 Ops/s 510.1717 Ops/s $\color{#35bf28}+0.11\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 9.9304ms 0.4457ms 2.2437 KOps/s 2.3111 KOps/s $\color{#d91a1a}-2.91\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 43.0680ms 40.3378ms 24.7906 Ops/s 25.2182 Ops/s $\color{#d91a1a}-1.70\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 12.0325ms 2.7069ms 369.4312 Ops/s 376.1839 Ops/s $\color{#d91a1a}-1.80\%$
test_dqn_speed 80.6469ms 8.3848ms 119.2639 Ops/s 115.9859 Ops/s $\color{#35bf28}+2.83\%$
test_ddpg_speed 15.6945ms 14.8835ms 67.1884 Ops/s 66.0583 Ops/s $\color{#35bf28}+1.71\%$
test_sac_speed 31.0640ms 30.0201ms 33.3110 Ops/s 32.7335 Ops/s $\color{#35bf28}+1.76\%$
test_redq_speed 44.2579ms 36.4385ms 27.4435 Ops/s 27.0544 Ops/s $\color{#35bf28}+1.44\%$
test_redq_deprec_speed 34.7692ms 26.2757ms 38.0579 Ops/s 35.8535 Ops/s $\textbf{\color{#35bf28}+6.15\%}$
test_td3_speed 29.5733ms 20.9238ms 47.7924 Ops/s 47.1285 Ops/s $\color{#35bf28}+1.41\%$
test_cql_speed 99.4802ms 91.4977ms 10.9292 Ops/s 10.9167 Ops/s $\color{#35bf28}+0.11\%$
test_a2c_speed 28.2658ms 27.5876ms 36.2482 Ops/s 34.8795 Ops/s $\color{#35bf28}+3.92\%$
test_ppo_speed 83.6522ms 30.3489ms 32.9501 Ops/s 34.7034 Ops/s $\textbf{\color{#d91a1a}-5.05\%}$
test_reinforce_speed 33.6982ms 27.0517ms 36.9662 Ops/s 36.5906 Ops/s $\color{#35bf28}+1.03\%$
test_iql_speed 68.4529ms 65.5334ms 15.2594 Ops/s 14.8234 Ops/s $\color{#35bf28}+2.94\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 1.9386ms 1.5004ms 666.4682 Ops/s 669.4624 Ops/s $\color{#d91a1a}-0.45\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 9.1104ms 0.5348ms 1.8700 KOps/s 1.9096 KOps/s $\color{#d91a1a}-2.07\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7984ms 0.5082ms 1.9677 KOps/s 1.9331 KOps/s $\color{#35bf28}+1.79\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 2.1912ms 1.4931ms 669.7418 Ops/s 675.1339 Ops/s $\color{#d91a1a}-0.80\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.1177ms 0.5198ms 1.9238 KOps/s 1.8850 KOps/s $\color{#35bf28}+2.06\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 8.7947ms 0.5095ms 1.9626 KOps/s 1.9752 KOps/s $\color{#d91a1a}-0.64\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 2.3006ms 1.6469ms 607.2004 Ops/s 582.9587 Ops/s $\color{#35bf28}+4.16\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 9.6033ms 0.6709ms 1.4906 KOps/s 1.5030 KOps/s $\color{#d91a1a}-0.83\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 9.0278ms 0.6492ms 1.5402 KOps/s 1.5269 KOps/s $\color{#35bf28}+0.88\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 2.1267ms 1.5244ms 656.0051 Ops/s 557.0089 Ops/s $\textbf{\color{#35bf28}+17.77\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 10.7736ms 0.5388ms 1.8559 KOps/s 1.8526 KOps/s $\color{#35bf28}+0.18\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 6.4939ms 0.5197ms 1.9241 KOps/s 1.9540 KOps/s $\color{#d91a1a}-1.53\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 1.8322ms 1.5127ms 661.0548 Ops/s 687.1684 Ops/s $\color{#d91a1a}-3.80\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 8.8644ms 0.5346ms 1.8705 KOps/s 1.9084 KOps/s $\color{#d91a1a}-1.98\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6377ms 0.5031ms 1.9877 KOps/s 1.8992 KOps/s $\color{#35bf28}+4.66\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 2.5354ms 1.7033ms 587.1044 Ops/s 580.0730 Ops/s $\color{#35bf28}+1.21\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 9.4612ms 0.6848ms 1.4603 KOps/s 1.5012 KOps/s $\color{#d91a1a}-2.73\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 5.1975ms 0.6615ms 1.5116 KOps/s 1.4806 KOps/s $\color{#35bf28}+2.09\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.1248s 17.1600ms 58.2750 Ops/s 57.0695 Ops/s $\color{#35bf28}+2.11\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 14.9221ms 12.4460ms 80.3472 Ops/s 67.9842 Ops/s $\textbf{\color{#35bf28}+18.19\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 4.7385ms 1.6809ms 594.9143 Ops/s 534.5232 Ops/s $\textbf{\color{#35bf28}+11.30\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.1285s 17.4721ms 57.2340 Ops/s 65.9160 Ops/s $\textbf{\color{#d91a1a}-13.17\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 15.6589ms 12.4115ms 80.5707 Ops/s 78.6779 Ops/s $\color{#35bf28}+2.41\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 5.5815ms 1.6229ms 616.1908 Ops/s 594.5145 Ops/s $\color{#35bf28}+3.65\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.1236s 17.3326ms 57.6946 Ops/s 64.4715 Ops/s $\textbf{\color{#d91a1a}-10.51\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 15.2601ms 12.5202ms 79.8708 Ops/s 66.6876 Ops/s $\textbf{\color{#35bf28}+19.77\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.6107ms 1.7245ms 579.8784 Ops/s 553.0424 Ops/s $\color{#35bf28}+4.85\%$

Copy link

github-actions bot commented Jan 8, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 92. Improved: $\large\color{#35bf28}7$. Worsened: $\large\color{#d91a1a}2$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_single 0.1212s 0.1209s 8.2746 Ops/s 7.9924 Ops/s $\color{#35bf28}+3.53\%$
test_sync 0.1782s 0.1101s 9.0833 Ops/s 8.9814 Ops/s $\color{#35bf28}+1.13\%$
test_async 0.2068s 99.6351ms 10.0366 Ops/s 10.0402 Ops/s $\color{#d91a1a}-0.04\%$
test_single_pixels 0.1473s 0.1456s 6.8685 Ops/s 6.7720 Ops/s $\color{#35bf28}+1.43\%$
test_sync_pixels 97.0015ms 95.8889ms 10.4287 Ops/s 10.5225 Ops/s $\color{#d91a1a}-0.89\%$
test_async_pixels 0.2490s 92.9984ms 10.7529 Ops/s 10.7822 Ops/s $\color{#d91a1a}-0.27\%$
test_simple 0.9577s 0.8885s 1.1256 Ops/s 1.1000 Ops/s $\color{#35bf28}+2.32\%$
test_transformed 1.1934s 1.1283s 0.8863 Ops/s 0.8584 Ops/s $\color{#35bf28}+3.24\%$
test_serial 2.5399s 2.4726s 0.4044 Ops/s 0.3844 Ops/s $\textbf{\color{#35bf28}+5.20\%}$
test_parallel 2.5876s 2.5273s 0.3957 Ops/s 0.3961 Ops/s $\color{#d91a1a}-0.12\%$
test_step_mdp_speed[True-True-True-True-True] 0.1828ms 32.4367μs 30.8292 KOps/s 30.3614 KOps/s $\color{#35bf28}+1.54\%$
test_step_mdp_speed[True-True-True-True-False] 0.3287ms 19.4373μs 51.4476 KOps/s 50.4106 KOps/s $\color{#35bf28}+2.06\%$
test_step_mdp_speed[True-True-True-False-True] 76.5710μs 18.7778μs 53.2544 KOps/s 53.1335 KOps/s $\color{#35bf28}+0.23\%$
test_step_mdp_speed[True-True-True-False-False] 31.2700μs 11.0360μs 90.6126 KOps/s 89.1686 KOps/s $\color{#35bf28}+1.62\%$
test_step_mdp_speed[True-True-False-True-True] 60.7910μs 34.2842μs 29.1679 KOps/s 28.5625 KOps/s $\color{#35bf28}+2.12\%$
test_step_mdp_speed[True-True-False-True-False] 47.3610μs 21.2773μs 46.9985 KOps/s 46.8037 KOps/s $\color{#35bf28}+0.42\%$
test_step_mdp_speed[True-True-False-False-True] 87.1910μs 20.3521μs 49.1351 KOps/s 49.0541 KOps/s $\color{#35bf28}+0.17\%$
test_step_mdp_speed[True-True-False-False-False] 37.1500μs 12.9557μs 77.1860 KOps/s 76.3269 KOps/s $\color{#35bf28}+1.13\%$
test_step_mdp_speed[True-False-True-True-True] 64.0610μs 36.6486μs 27.2861 KOps/s 27.4738 KOps/s $\color{#d91a1a}-0.68\%$
test_step_mdp_speed[True-False-True-True-False] 56.3110μs 23.5145μs 42.5270 KOps/s 42.9611 KOps/s $\color{#d91a1a}-1.01\%$
test_step_mdp_speed[True-False-True-False-True] 47.4400μs 20.8557μs 47.9485 KOps/s 48.5648 KOps/s $\color{#d91a1a}-1.27\%$
test_step_mdp_speed[True-False-True-False-False] 84.5210μs 12.9888μs 76.9896 KOps/s 76.5097 KOps/s $\color{#35bf28}+0.63\%$
test_step_mdp_speed[True-False-False-True-True] 62.0510μs 38.2127μs 26.1693 KOps/s 26.1396 KOps/s $\color{#35bf28}+0.11\%$
test_step_mdp_speed[True-False-False-True-False] 50.1910μs 24.7999μs 40.3228 KOps/s 40.0282 KOps/s $\color{#35bf28}+0.74\%$
test_step_mdp_speed[True-False-False-False-True] 63.5400μs 22.1071μs 45.2344 KOps/s 44.9846 KOps/s $\color{#35bf28}+0.56\%$
test_step_mdp_speed[True-False-False-False-False] 34.0610μs 14.7717μs 67.6969 KOps/s 67.2242 KOps/s $\color{#35bf28}+0.70\%$
test_step_mdp_speed[False-True-True-True-True] 0.1628ms 36.3967μs 27.4750 KOps/s 27.1669 KOps/s $\color{#35bf28}+1.13\%$
test_step_mdp_speed[False-True-True-True-False] 51.2610μs 23.1905μs 43.1210 KOps/s 42.5459 KOps/s $\color{#35bf28}+1.35\%$
test_step_mdp_speed[False-True-True-False-True] 44.1600μs 24.8186μs 40.2924 KOps/s 40.4416 KOps/s $\color{#d91a1a}-0.37\%$
test_step_mdp_speed[False-True-True-False-False] 33.8810μs 14.9752μs 66.7772 KOps/s 67.0363 KOps/s $\color{#d91a1a}-0.39\%$
test_step_mdp_speed[False-True-False-True-True] 0.1304ms 38.2992μs 26.1102 KOps/s 25.9128 KOps/s $\color{#35bf28}+0.76\%$
test_step_mdp_speed[False-True-False-True-False] 48.8710μs 25.0372μs 39.9405 KOps/s 39.4481 KOps/s $\color{#35bf28}+1.25\%$
test_step_mdp_speed[False-True-False-False-True] 0.1117ms 26.6602μs 37.5090 KOps/s 37.1101 KOps/s $\color{#35bf28}+1.08\%$
test_step_mdp_speed[False-True-False-False-False] 37.4110μs 16.6740μs 59.9736 KOps/s 60.0682 KOps/s $\color{#d91a1a}-0.16\%$
test_step_mdp_speed[False-False-True-True-True] 64.7910μs 40.5484μs 24.6619 KOps/s 24.5794 KOps/s $\color{#35bf28}+0.34\%$
test_step_mdp_speed[False-False-True-True-False] 0.1646ms 27.2124μs 36.7479 KOps/s 36.9845 KOps/s $\color{#d91a1a}-0.64\%$
test_step_mdp_speed[False-False-True-False-True] 48.7910μs 26.5892μs 37.6092 KOps/s 37.5081 KOps/s $\color{#35bf28}+0.27\%$
test_step_mdp_speed[False-False-True-False-False] 42.8300μs 16.8204μs 59.4517 KOps/s 60.4029 KOps/s $\color{#d91a1a}-1.57\%$
test_step_mdp_speed[False-False-False-True-True] 66.8110μs 41.6121μs 24.0314 KOps/s 24.1885 KOps/s $\color{#d91a1a}-0.65\%$
test_step_mdp_speed[False-False-False-True-False] 93.1810μs 28.6219μs 34.9383 KOps/s 35.0290 KOps/s $\color{#d91a1a}-0.26\%$
test_step_mdp_speed[False-False-False-False-True] 71.9210μs 27.6976μs 36.1042 KOps/s 35.4194 KOps/s $\color{#35bf28}+1.93\%$
test_step_mdp_speed[False-False-False-False-False] 38.7800μs 18.3449μs 54.5110 KOps/s 54.5755 KOps/s $\color{#d91a1a}-0.12\%$
test_values[generalized_advantage_estimate-True-True] 25.5224ms 24.9308ms 40.1110 Ops/s 38.3881 Ops/s $\color{#35bf28}+4.49\%$
test_values[vec_generalized_advantage_estimate-True-True] 85.1843ms 3.2765ms 305.2023 Ops/s 105.4288 Ops/s $\textbf{\color{#35bf28}+189.49\%}$
test_values[td0_return_estimate-False-False] 96.9310μs 63.4068μs 15.7712 KOps/s 15.8478 KOps/s $\color{#d91a1a}-0.48\%$
test_values[td1_return_estimate-False-False] 53.5212ms 53.0637ms 18.8453 Ops/s 17.5923 Ops/s $\textbf{\color{#35bf28}+7.12\%}$
test_values[vec_td1_return_estimate-False-False] 2.0282ms 1.7694ms 565.1682 Ops/s 560.9574 Ops/s $\color{#35bf28}+0.75\%$
test_values[td_lambda_return_estimate-True-False] 87.4754ms 85.2230ms 11.7339 Ops/s 10.9353 Ops/s $\textbf{\color{#35bf28}+7.30\%}$
test_values[vec_td_lambda_return_estimate-True-False] 2.0675ms 1.7631ms 567.1788 Ops/s 563.2142 Ops/s $\color{#35bf28}+0.70\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 23.9241ms 23.6629ms 42.2603 Ops/s 38.7369 Ops/s $\textbf{\color{#35bf28}+9.10\%}$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 0.8585ms 0.7108ms 1.4069 KOps/s 1.3655 KOps/s $\color{#35bf28}+3.03\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7473ms 0.6610ms 1.5130 KOps/s 1.4843 KOps/s $\color{#35bf28}+1.93\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5048ms 1.4599ms 685.0009 Ops/s 673.6238 Ops/s $\color{#35bf28}+1.69\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.9684ms 0.6810ms 1.4685 KOps/s 1.4395 KOps/s $\color{#35bf28}+2.01\%$
test_dqn_speed 13.9170ms 7.2618ms 137.7063 Ops/s 133.8245 Ops/s $\color{#35bf28}+2.90\%$
test_ddpg_speed 14.9920ms 14.1355ms 70.7441 Ops/s 68.5070 Ops/s $\color{#35bf28}+3.27\%$
test_sac_speed 30.1574ms 28.6501ms 34.9039 Ops/s 33.5248 Ops/s $\color{#35bf28}+4.11\%$
test_redq_speed 35.6861ms 34.9612ms 28.6031 Ops/s 28.1943 Ops/s $\color{#35bf28}+1.45\%$
test_redq_deprec_speed 25.2442ms 24.0778ms 41.5319 Ops/s 40.9168 Ops/s $\color{#35bf28}+1.50\%$
test_td3_speed 28.7471ms 19.7580ms 50.6124 Ops/s 49.6198 Ops/s $\color{#35bf28}+2.00\%$
test_cql_speed 84.5105ms 83.5189ms 11.9733 Ops/s 11.7646 Ops/s $\color{#35bf28}+1.77\%$
test_a2c_speed 0.1258s 29.2072ms 34.2382 Ops/s 37.1182 Ops/s $\textbf{\color{#d91a1a}-7.76\%}$
test_ppo_speed 27.8168ms 26.8585ms 37.2322 Ops/s 36.4037 Ops/s $\color{#35bf28}+2.28\%$
test_reinforce_speed 26.7995ms 25.7725ms 38.8011 Ops/s 38.0031 Ops/s $\color{#35bf28}+2.10\%$
test_iql_speed 58.4009ms 57.4271ms 17.4134 Ops/s 17.1304 Ops/s $\color{#35bf28}+1.65\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 2.5797ms 1.9090ms 523.8273 Ops/s 460.3843 Ops/s $\textbf{\color{#35bf28}+13.78\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.0842ms 0.7846ms 1.2746 KOps/s 1.2662 KOps/s $\color{#35bf28}+0.66\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.9867ms 0.7735ms 1.2929 KOps/s 1.2861 KOps/s $\color{#35bf28}+0.53\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 2.4373ms 1.8949ms 527.7445 Ops/s 519.8124 Ops/s $\color{#35bf28}+1.53\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.0015ms 0.7732ms 1.2933 KOps/s 1.2833 KOps/s $\color{#35bf28}+0.78\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.9722ms 0.7632ms 1.3103 KOps/s 1.3028 KOps/s $\color{#35bf28}+0.58\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 2.7954ms 2.1781ms 459.1141 Ops/s 451.8005 Ops/s $\color{#35bf28}+1.62\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 3.4320ms 0.9040ms 1.1062 KOps/s 1.0995 KOps/s $\color{#35bf28}+0.61\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.0746ms 0.8917ms 1.1214 KOps/s 1.1126 KOps/s $\color{#35bf28}+0.79\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 2.5944ms 1.9286ms 518.4987 Ops/s 514.7059 Ops/s $\color{#35bf28}+0.74\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.9100ms 0.7850ms 1.2738 KOps/s 1.2678 KOps/s $\color{#35bf28}+0.48\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 5.0068ms 0.7782ms 1.2850 KOps/s 1.2854 KOps/s $\color{#d91a1a}-0.03\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 2.0370ms 1.9029ms 525.5163 Ops/s 516.6560 Ops/s $\color{#35bf28}+1.71\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9981ms 0.7735ms 1.2928 KOps/s 1.2840 KOps/s $\color{#35bf28}+0.69\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.9998ms 0.7645ms 1.3081 KOps/s 1.2894 KOps/s $\color{#35bf28}+1.45\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 0.1335s 2.5014ms 399.7770 Ops/s 455.5729 Ops/s $\textbf{\color{#d91a1a}-12.25\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.0678ms 0.9018ms 1.1089 KOps/s 1.0983 KOps/s $\color{#35bf28}+0.96\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.0729ms 0.8929ms 1.1200 KOps/s 925.2123 Ops/s $\textbf{\color{#35bf28}+21.05\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.1681s 16.4309ms 60.8608 Ops/s 62.9485 Ops/s $\color{#d91a1a}-3.32\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 15.6680ms 12.7014ms 78.7312 Ops/s 76.4677 Ops/s $\color{#35bf28}+2.96\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 2.6383ms 1.8611ms 537.3105 Ops/s 533.1007 Ops/s $\color{#35bf28}+0.79\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.1239s 17.6386ms 56.6938 Ops/s 55.4451 Ops/s $\color{#35bf28}+2.25\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 15.9094ms 12.7996ms 78.1274 Ops/s 76.9040 Ops/s $\color{#35bf28}+1.59\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 2.6168ms 1.8525ms 539.8140 Ops/s 531.5075 Ops/s $\color{#35bf28}+1.56\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.1268s 17.8145ms 56.1340 Ops/s 55.0016 Ops/s $\color{#35bf28}+2.06\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 16.0623ms 12.9590ms 77.1666 Ops/s 76.1089 Ops/s $\color{#35bf28}+1.39\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.7372ms 2.0155ms 496.1666 Ops/s 480.5835 Ops/s $\color{#35bf28}+3.24\%$

@vmoens vmoens merged commit fd27cb7 into main Jan 8, 2024
59 of 64 checks passed
@vmoens vmoens deleted the fix-m1-wheels branch January 8, 2024 16:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. versioning Versioning change (version number etc)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants