[Feature] Video recording in SOTA examples #2070

Merged
merged 13 commits into from
Apr 23, 2024

@vmoens (Contributor) commented Apr 9, 2024

pytorch-bot bot commented Apr 9, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2070

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 14 Unrelated Failures

As of commit 12dfa65 with merge base df749a3:

NEW FAILURES - The following jobs have failed:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label Apr 9, 2024
@vmoens added the enhancement label Apr 9, 2024

github-actions bot commented Apr 22, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 91. Improved: $\large\color{#35bf28}2$. Worsened: $\large\color{#d91a1a}2$.

Name Max Mean Ops Ops on Repo HEAD Change
test_single 54.6724ms 54.0780ms 18.4918 Ops/s 17.3524 Ops/s $\textbf{\color{#35bf28}+6.57\%}$
test_sync 32.5383ms 29.8536ms 33.4968 Ops/s 33.6637 Ops/s $\color{#d91a1a}-0.50\%$
test_async 54.7777ms 27.2803ms 36.6565 Ops/s 36.7621 Ops/s $\color{#d91a1a}-0.29\%$
test_simple 0.3371s 0.3357s 2.9788 Ops/s 2.9825 Ops/s $\color{#d91a1a}-0.12\%$
test_transformed 0.4819s 0.4804s 2.0817 Ops/s 1.9867 Ops/s $\color{#35bf28}+4.78\%$
test_serial 1.2620s 1.2030s 0.8313 Ops/s 0.8301 Ops/s $\color{#35bf28}+0.14\%$
test_parallel 1.0583s 1.0037s 0.9963 Ops/s 0.9848 Ops/s $\color{#35bf28}+1.16\%$
test_step_mdp_speed[True-True-True-True-True] 0.1903ms 21.5222μs 46.4636 KOps/s 46.4021 KOps/s $\color{#35bf28}+0.13\%$
test_step_mdp_speed[True-True-True-True-False] 63.6990μs 13.1609μs 75.9826 KOps/s 74.6076 KOps/s $\color{#35bf28}+1.84\%$
test_step_mdp_speed[True-True-True-False-True] 39.8450μs 12.5448μs 79.7140 KOps/s 78.7659 KOps/s $\color{#35bf28}+1.20\%$
test_step_mdp_speed[True-True-True-False-False] 48.2110μs 7.5693μs 132.1133 KOps/s 130.9836 KOps/s $\color{#35bf28}+0.86\%$
test_step_mdp_speed[True-True-False-True-True] 52.0680μs 22.9868μs 43.5033 KOps/s 43.6652 KOps/s $\color{#d91a1a}-0.37\%$
test_step_mdp_speed[True-True-False-True-False] 52.3680μs 14.3873μs 69.5056 KOps/s 69.3358 KOps/s $\color{#35bf28}+0.24\%$
test_step_mdp_speed[True-True-False-False-True] 41.9490μs 13.8083μs 72.4204 KOps/s 72.3870 KOps/s $\color{#35bf28}+0.05\%$
test_step_mdp_speed[True-True-False-False-False] 44.7240μs 8.9044μs 112.3044 KOps/s 111.4993 KOps/s $\color{#35bf28}+0.72\%$
test_step_mdp_speed[True-False-True-True-True] 84.4490μs 24.3049μs 41.1440 KOps/s 41.2654 KOps/s $\color{#d91a1a}-0.29\%$
test_step_mdp_speed[True-False-True-True-False] 59.0110μs 15.6990μs 63.6982 KOps/s 62.8822 KOps/s $\color{#35bf28}+1.30\%$
test_step_mdp_speed[True-False-True-False-True] 63.6100μs 13.6726μs 73.1389 KOps/s 72.2575 KOps/s $\color{#35bf28}+1.22\%$
test_step_mdp_speed[True-False-True-False-False] 53.4230μs 8.8225μs 113.3472 KOps/s 112.1066 KOps/s $\color{#35bf28}+1.11\%$
test_step_mdp_speed[True-False-False-True-True] 79.5470μs 25.3245μs 39.4874 KOps/s 39.0636 KOps/s $\color{#35bf28}+1.08\%$
test_step_mdp_speed[True-False-False-True-False] 44.3240μs 17.0031μs 58.8128 KOps/s 58.8136 KOps/s $-0.00\%$
test_step_mdp_speed[True-False-False-False-True] 48.0810μs 14.9300μs 66.9791 KOps/s 66.8566 KOps/s $\color{#35bf28}+0.18\%$
test_step_mdp_speed[True-False-False-False-False] 62.6880μs 10.0565μs 99.4386 KOps/s 99.4777 KOps/s $\color{#d91a1a}-0.04\%$
test_step_mdp_speed[False-True-True-True-True] 60.4530μs 24.1865μs 41.3454 KOps/s 41.3766 KOps/s $\color{#d91a1a}-0.08\%$
test_step_mdp_speed[False-True-True-True-False] 72.3760μs 15.8501μs 63.0912 KOps/s 63.6176 KOps/s $\color{#d91a1a}-0.83\%$
test_step_mdp_speed[False-True-True-False-True] 64.8620μs 16.0369μs 62.3562 KOps/s 62.1542 KOps/s $\color{#35bf28}+0.32\%$
test_step_mdp_speed[False-True-True-False-False] 0.1854ms 10.2928μs 97.1552 KOps/s 97.5189 KOps/s $\color{#d91a1a}-0.37\%$
test_step_mdp_speed[False-True-False-True-True] 87.3640μs 25.8944μs 38.6184 KOps/s 38.8273 KOps/s $\color{#d91a1a}-0.54\%$
test_step_mdp_speed[False-True-False-True-False] 64.2110μs 16.9166μs 59.1134 KOps/s 58.4012 KOps/s $\color{#35bf28}+1.22\%$
test_step_mdp_speed[False-True-False-False-True] 64.7710μs 17.1965μs 58.1515 KOps/s 55.9059 KOps/s $\color{#35bf28}+4.02\%$
test_step_mdp_speed[False-True-False-False-False] 55.5850μs 11.4689μs 87.1922 KOps/s 88.7928 KOps/s $\color{#d91a1a}-1.80\%$
test_step_mdp_speed[False-False-True-True-True] 97.2230μs 26.5676μs 37.6398 KOps/s 37.5233 KOps/s $\color{#35bf28}+0.31\%$
test_step_mdp_speed[False-False-True-True-False] 63.9000μs 18.2587μs 54.7684 KOps/s 55.1267 KOps/s $\color{#d91a1a}-0.65\%$
test_step_mdp_speed[False-False-True-False-True] 0.2314ms 17.2603μs 57.9363 KOps/s 57.5850 KOps/s $\color{#35bf28}+0.61\%$
test_step_mdp_speed[False-False-True-False-False] 77.0350μs 11.2977μs 88.5133 KOps/s 88.6811 KOps/s $\color{#d91a1a}-0.19\%$
test_step_mdp_speed[False-False-False-True-True] 59.5220μs 27.8248μs 35.9392 KOps/s 35.8933 KOps/s $\color{#35bf28}+0.13\%$
test_step_mdp_speed[False-False-False-True-False] 78.2870μs 19.3633μs 51.6442 KOps/s 51.6744 KOps/s $\color{#d91a1a}-0.06\%$
test_step_mdp_speed[False-False-False-False-True] 76.5240μs 18.3087μs 54.6189 KOps/s 54.2907 KOps/s $\color{#35bf28}+0.60\%$
test_step_mdp_speed[False-False-False-False-False] 42.5000μs 12.3779μs 80.7891 KOps/s 79.9886 KOps/s $\color{#35bf28}+1.00\%$
test_values[generalized_advantage_estimate-True-True] 11.3336ms 9.7026ms 103.0654 Ops/s 103.7808 Ops/s $\color{#d91a1a}-0.69\%$
test_values[vec_generalized_advantage_estimate-True-True] 42.4801ms 36.0175ms 27.7643 Ops/s 29.5604 Ops/s $\textbf{\color{#d91a1a}-6.08\%}$
test_values[td0_return_estimate-False-False] 0.2638ms 0.1916ms 5.2194 KOps/s 5.0073 KOps/s $\color{#35bf28}+4.24\%$
test_values[td1_return_estimate-False-False] 24.3137ms 23.8640ms 41.9041 Ops/s 41.6212 Ops/s $\color{#35bf28}+0.68\%$
test_values[vec_td1_return_estimate-False-False] 36.9648ms 35.4673ms 28.1950 Ops/s 29.6001 Ops/s $\color{#d91a1a}-4.75\%$
test_values[td_lambda_return_estimate-True-False] 35.9558ms 34.5321ms 28.9586 Ops/s 28.4523 Ops/s $\color{#35bf28}+1.78\%$
test_values[vec_td_lambda_return_estimate-True-False] 36.4964ms 35.2807ms 28.3441 Ops/s 29.6534 Ops/s $\color{#d91a1a}-4.42\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.4408ms 8.2289ms 121.5228 Ops/s 120.3001 Ops/s $\color{#35bf28}+1.02\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.4911ms 1.9584ms 510.6193 Ops/s 487.5098 Ops/s $\color{#35bf28}+4.74\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.4594ms 0.3553ms 2.8149 KOps/s 2.8016 KOps/s $\color{#35bf28}+0.47\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 50.5952ms 47.4863ms 21.0587 Ops/s 24.1056 Ops/s $\textbf{\color{#d91a1a}-12.64\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.8157ms 3.0497ms 327.8999 Ops/s 324.8995 Ops/s $\color{#35bf28}+0.92\%$
test_dqn_speed 1.9282ms 1.3819ms 723.6309 Ops/s 719.7143 Ops/s $\color{#35bf28}+0.54\%$
test_ddpg_speed 5.4666ms 2.7763ms 360.1854 Ops/s 364.6220 Ops/s $\color{#d91a1a}-1.22\%$
test_sac_speed 10.1006ms 8.4959ms 117.7039 Ops/s 106.3774 Ops/s $\textbf{\color{#35bf28}+10.65\%}$
test_redq_speed 14.4298ms 13.5096ms 74.0216 Ops/s 74.1780 Ops/s $\color{#d91a1a}-0.21\%$
test_redq_deprec_speed 21.7626ms 14.6855ms 68.0943 Ops/s 71.2026 Ops/s $\color{#d91a1a}-4.37\%$
test_td3_speed 9.0112ms 8.4548ms 118.2760 Ops/s 118.4079 Ops/s $\color{#d91a1a}-0.11\%$
test_cql_speed 39.7501ms 37.6767ms 26.5416 Ops/s 26.8436 Ops/s $\color{#d91a1a}-1.13\%$
test_a2c_speed 8.5657ms 7.6188ms 131.2551 Ops/s 127.6045 Ops/s $\color{#35bf28}+2.86\%$
test_ppo_speed 9.6759ms 7.9009ms 126.5681 Ops/s 125.3418 Ops/s $\color{#35bf28}+0.98\%$
test_reinforce_speed 7.5121ms 6.7522ms 148.0996 Ops/s 146.0225 Ops/s $\color{#35bf28}+1.42\%$
test_iql_speed 33.9466ms 33.3923ms 29.9471 Ops/s 29.7571 Ops/s $\color{#35bf28}+0.64\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 3.6136ms 2.3978ms 417.0554 Ops/s 418.7708 Ops/s $\color{#d91a1a}-0.41\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8257ms 0.5184ms 1.9290 KOps/s 1.9079 KOps/s $\color{#35bf28}+1.11\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 3.7759ms 0.5049ms 1.9805 KOps/s 1.9530 KOps/s $\color{#35bf28}+1.41\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 3.4379ms 2.4319ms 411.2051 Ops/s 403.3125 Ops/s $\color{#35bf28}+1.96\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9254ms 0.5095ms 1.9628 KOps/s 1.9567 KOps/s $\color{#35bf28}+0.31\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7190ms 0.4833ms 2.0693 KOps/s 2.0185 KOps/s $\color{#35bf28}+2.52\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.9912ms 1.2480ms 801.3096 Ops/s 803.2258 Ops/s $\color{#d91a1a}-0.24\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.5800ms 1.1796ms 847.7534 Ops/s 853.0031 Ops/s $\color{#d91a1a}-0.62\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 3.5309ms 2.5897ms 386.1446 Ops/s 399.7648 Ops/s $\color{#d91a1a}-3.41\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.5989ms 0.6441ms 1.5525 KOps/s 1.5471 KOps/s $\color{#35bf28}+0.35\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 3.5737ms 0.6078ms 1.6453 KOps/s 1.6449 KOps/s $\color{#35bf28}+0.02\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 3.3435ms 2.3824ms 419.7442 Ops/s 424.3022 Ops/s $\color{#d91a1a}-1.07\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.9954ms 0.5242ms 1.9076 KOps/s 1.8961 KOps/s $\color{#35bf28}+0.61\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 3.8148ms 0.4918ms 2.0333 KOps/s 2.0030 KOps/s $\color{#35bf28}+1.51\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 3.4891ms 2.4169ms 413.7552 Ops/s 411.8249 Ops/s $\color{#35bf28}+0.47\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.1160ms 0.5131ms 1.9490 KOps/s 1.9317 KOps/s $\color{#35bf28}+0.90\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.9142ms 0.4933ms 2.0272 KOps/s 2.0274 KOps/s $-0.01\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 2.7682ms 2.5434ms 393.1755 Ops/s 402.1513 Ops/s $\color{#d91a1a}-2.23\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.7495ms 0.6302ms 1.5868 KOps/s 1.5505 KOps/s $\color{#35bf28}+2.34\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 3.6351ms 0.6138ms 1.6293 KOps/s 1.6262 KOps/s $\color{#35bf28}+0.19\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.1170s 8.1487ms 122.7189 Ops/s 118.4552 Ops/s $\color{#35bf28}+3.60\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 15.4700ms 12.3434ms 81.0152 Ops/s 80.8064 Ops/s $\color{#35bf28}+0.26\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 1.8161ms 1.0514ms 951.1237 Ops/s 947.5467 Ops/s $\color{#35bf28}+0.38\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.1153s 6.0254ms 165.9634 Ops/s 165.8826 Ops/s $\color{#35bf28}+0.05\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 0.1246s 14.6916ms 68.0661 Ops/s 68.3710 Ops/s $\color{#d91a1a}-0.45\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 1.2224ms 1.0282ms 972.5484 Ops/s 929.1974 Ops/s $\color{#35bf28}+4.67\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.1159s 6.3553ms 157.3483 Ops/s 153.5321 Ops/s $\color{#35bf28}+2.49\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 15.8864ms 12.9758ms 77.0666 Ops/s 78.7998 Ops/s $\color{#d91a1a}-2.20\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 1.5269ms 1.3665ms 731.8145 Ops/s 731.7239 Ops/s $\color{#35bf28}+0.01\%$
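
For readers checking the table, the Change column appears to be the relative difference between Ops and Ops on Repo HEAD. The sketch below recomputes it for a few rows copied from the CPU table. The function names and the 5% threshold are assumptions inferred from which rows are bolded and from the Improved/Worsened counts; they are not taken from the benchmark workflow itself.

```python
def relative_change(ops: float, ops_head: float) -> float:
    """Percent change in throughput relative to the repo HEAD measurement."""
    return (ops - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold_pct: float = 5.0) -> str:
    """Label a row; the 5% threshold is an assumption inferred from the table."""
    if change_pct > threshold_pct:
        return "improved"
    if change_pct < -threshold_pct:
        return "worsened"
    return "unchanged"


# (Ops, Ops on Repo HEAD) pairs copied from the CPU table, in Ops/s.
rows = {
    "test_single": (18.4918, 17.3524),
    "test_sync": (33.4968, 33.6637),
    "test_sac_speed": (117.7039, 106.3774),
    "test_gae_speed[vec_generalized_advantage_estimate-True-32-512]": (21.0587, 24.1056),
}

for name, (ops, head) in rows.items():
    change = relative_change(ops, head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Under these assumptions the recomputed values match the reported ones (+6.57%, -0.50%, +10.65%, -12.64%), and only rows beyond the threshold count toward the Improved and Worsened totals.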


github-actions bot commented Apr 22, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 94. Improved: $\large\color{#35bf28}3$. Worsened: $\large\color{#d91a1a}8$.

Name Max Mean Ops Ops on Repo HEAD Change
test_single 0.1002s 99.6138ms 10.0388 Ops/s 9.3184 Ops/s $\textbf{\color{#35bf28}+7.73\%}$
test_sync 88.0597ms 87.0133ms 11.4925 Ops/s 11.4523 Ops/s $\color{#35bf28}+0.35\%$
test_async 0.1655s 72.0201ms 13.8850 Ops/s 13.9831 Ops/s $\color{#d91a1a}-0.70\%$
test_single_pixels 0.1109s 0.1107s 9.0371 Ops/s 8.8973 Ops/s $\color{#35bf28}+1.57\%$
test_sync_pixels 68.2057ms 66.6150ms 15.0116 Ops/s 14.9180 Ops/s $\color{#35bf28}+0.63\%$
test_async_pixels 70.0772ms 61.9274ms 16.1480 Ops/s 16.2543 Ops/s $\color{#d91a1a}-0.65\%$
test_simple 0.6925s 0.6838s 1.4623 Ops/s 1.4460 Ops/s $\color{#35bf28}+1.13\%$
test_transformed 0.8925s 0.8919s 1.1212 Ops/s 1.0904 Ops/s $\color{#35bf28}+2.82\%$
test_serial 2.1594s 2.0954s 0.4772 Ops/s 0.4739 Ops/s $\color{#35bf28}+0.70\%$
test_parallel 1.8375s 1.7894s 0.5589 Ops/s 0.5617 Ops/s $\color{#d91a1a}-0.50\%$
test_step_mdp_speed[True-True-True-True-True] 89.1410μs 33.5091μs 29.8426 KOps/s 30.0264 KOps/s $\color{#d91a1a}-0.61\%$
test_step_mdp_speed[True-True-True-True-False] 37.6600μs 20.0377μs 49.9060 KOps/s 50.1317 KOps/s $\color{#d91a1a}-0.45\%$
test_step_mdp_speed[True-True-True-False-True] 44.8500μs 19.0767μs 52.4201 KOps/s 52.5400 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[True-True-True-False-False] 39.1200μs 11.4055μs 87.6768 KOps/s 87.9588 KOps/s $\color{#d91a1a}-0.32\%$
test_step_mdp_speed[True-True-False-True-True] 57.5400μs 35.8208μs 27.9167 KOps/s 28.4731 KOps/s $\color{#d91a1a}-1.95\%$
test_step_mdp_speed[True-True-False-True-False] 44.5200μs 21.8111μs 45.8483 KOps/s 45.5446 KOps/s $\color{#35bf28}+0.67\%$
test_step_mdp_speed[True-True-False-False-True] 43.5410μs 20.6938μs 48.3236 KOps/s 48.3730 KOps/s $\color{#d91a1a}-0.10\%$
test_step_mdp_speed[True-True-False-False-False] 32.8600μs 13.3343μs 74.9946 KOps/s 75.8218 KOps/s $\color{#d91a1a}-1.09\%$
test_step_mdp_speed[True-False-True-True-True] 54.1120μs 37.2520μs 26.8442 KOps/s 26.8379 KOps/s $\color{#35bf28}+0.02\%$
test_step_mdp_speed[True-False-True-True-False] 45.9000μs 24.1384μs 41.4277 KOps/s 42.4881 KOps/s $\color{#d91a1a}-2.50\%$
test_step_mdp_speed[True-False-True-False-True] 37.1710μs 20.6290μs 48.4754 KOps/s 48.0907 KOps/s $\color{#35bf28}+0.80\%$
test_step_mdp_speed[True-False-True-False-False] 30.2300μs 13.2409μs 75.5234 KOps/s 75.8167 KOps/s $\color{#d91a1a}-0.39\%$
test_step_mdp_speed[True-False-False-True-True] 65.2110μs 39.3878μs 25.3886 KOps/s 25.8417 KOps/s $\color{#d91a1a}-1.75\%$
test_step_mdp_speed[True-False-False-True-False] 0.1285ms 25.8062μs 38.7504 KOps/s 38.6589 KOps/s $\color{#35bf28}+0.24\%$
test_step_mdp_speed[True-False-False-False-True] 43.4610μs 22.6955μs 44.0616 KOps/s 44.3778 KOps/s $\color{#d91a1a}-0.71\%$
test_step_mdp_speed[True-False-False-False-False] 34.1000μs 15.3452μs 65.1671 KOps/s 65.8849 KOps/s $\color{#d91a1a}-1.09\%$
test_step_mdp_speed[False-True-True-True-True] 61.7010μs 37.5550μs 26.6276 KOps/s 27.2562 KOps/s $\color{#d91a1a}-2.31\%$
test_step_mdp_speed[False-True-True-True-False] 47.9700μs 24.7219μs 40.4499 KOps/s 42.4010 KOps/s $\color{#d91a1a}-4.60\%$
test_step_mdp_speed[False-True-True-False-True] 82.0320μs 25.1999μs 39.6827 KOps/s 40.9569 KOps/s $\color{#d91a1a}-3.11\%$
test_step_mdp_speed[False-True-True-False-False] 30.5600μs 15.4457μs 64.7430 KOps/s 66.0894 KOps/s $\color{#d91a1a}-2.04\%$
test_step_mdp_speed[False-True-False-True-True] 70.6620μs 39.9164μs 25.0523 KOps/s 25.2049 KOps/s $\color{#d91a1a}-0.61\%$
test_step_mdp_speed[False-True-False-True-False] 57.9400μs 26.0786μs 38.3456 KOps/s 38.9603 KOps/s $\color{#d91a1a}-1.58\%$
test_step_mdp_speed[False-True-False-False-True] 50.6310μs 26.6303μs 37.5512 KOps/s 37.8543 KOps/s $\color{#d91a1a}-0.80\%$
test_step_mdp_speed[False-True-False-False-False] 36.5620μs 16.9508μs 58.9944 KOps/s 58.4922 KOps/s $\color{#35bf28}+0.86\%$
test_step_mdp_speed[False-False-True-True-True] 65.7500μs 41.0877μs 24.3382 KOps/s 24.5367 KOps/s $\color{#d91a1a}-0.81\%$
test_step_mdp_speed[False-False-True-True-False] 54.0400μs 27.8066μs 35.9627 KOps/s 36.1865 KOps/s $\color{#d91a1a}-0.62\%$
test_step_mdp_speed[False-False-True-False-True] 55.6800μs 26.5671μs 37.6405 KOps/s 37.7345 KOps/s $\color{#d91a1a}-0.25\%$
test_step_mdp_speed[False-False-True-False-False] 31.9300μs 16.9197μs 59.1028 KOps/s 59.1621 KOps/s $\color{#d91a1a}-0.10\%$
test_step_mdp_speed[False-False-False-True-True] 64.2210μs 42.6454μs 23.4492 KOps/s 23.8393 KOps/s $\color{#d91a1a}-1.64\%$
test_step_mdp_speed[False-False-False-True-False] 65.6000μs 29.7082μs 33.6608 KOps/s 34.4913 KOps/s $\color{#d91a1a}-2.41\%$
test_step_mdp_speed[False-False-False-False-True] 54.1200μs 27.9982μs 35.7165 KOps/s 35.9958 KOps/s $\color{#d91a1a}-0.78\%$
test_step_mdp_speed[False-False-False-False-False] 38.8100μs 18.9507μs 52.7685 KOps/s 53.4515 KOps/s $\color{#d91a1a}-1.28\%$
test_values[generalized_advantage_estimate-True-True] 25.0609ms 24.6826ms 40.5143 Ops/s 39.9577 Ops/s $\color{#35bf28}+1.39\%$
test_values[vec_generalized_advantage_estimate-True-True] 82.5864ms 3.2099ms 311.5390 Ops/s 300.4354 Ops/s $\color{#35bf28}+3.70\%$
test_values[td0_return_estimate-False-False] 95.7310μs 65.6320μs 15.2365 KOps/s 15.2630 KOps/s $\color{#d91a1a}-0.17\%$
test_values[td1_return_estimate-False-False] 54.3389ms 52.7972ms 18.9404 Ops/s 18.7133 Ops/s $\color{#35bf28}+1.21\%$
test_values[vec_td1_return_estimate-False-False] 2.0814ms 1.7560ms 569.4714 Ops/s 568.0916 Ops/s $\color{#35bf28}+0.24\%$
test_values[td_lambda_return_estimate-True-False] 84.4897ms 83.6272ms 11.9578 Ops/s 11.7264 Ops/s $\color{#35bf28}+1.97\%$
test_values[vec_td_lambda_return_estimate-True-False] 2.1178ms 1.7540ms 570.1244 Ops/s 568.5204 Ops/s $\color{#35bf28}+0.28\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 23.9292ms 23.5844ms 42.4010 Ops/s 42.2395 Ops/s $\color{#35bf28}+0.38\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 0.8812ms 0.6981ms 1.4324 KOps/s 1.4178 KOps/s $\color{#35bf28}+1.03\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.6898ms 0.6460ms 1.5481 KOps/s 1.5475 KOps/s $\color{#35bf28}+0.04\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5035ms 1.4508ms 689.2641 Ops/s 685.4100 Ops/s $\color{#35bf28}+0.56\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.9375ms 0.6656ms 1.5024 KOps/s 1.4985 KOps/s $\color{#35bf28}+0.26\%$
test_dqn_speed 8.1489ms 1.4740ms 678.4476 Ops/s 683.3314 Ops/s $\color{#d91a1a}-0.71\%$
test_ddpg_speed 3.4966ms 2.7929ms 358.0469 Ops/s 365.9537 Ops/s $\color{#d91a1a}-2.16\%$
test_sac_speed 8.8126ms 8.2392ms 121.3710 Ops/s 120.2565 Ops/s $\color{#35bf28}+0.93\%$
test_redq_speed 0.1010s 11.4632ms 87.2358 Ops/s 94.4896 Ops/s $\textbf{\color{#d91a1a}-7.68\%}$
test_redq_deprec_speed 12.2329ms 11.7304ms 85.2488 Ops/s 85.5294 Ops/s $\color{#d91a1a}-0.33\%$
test_td3_speed 8.2426ms 8.1620ms 122.5193 Ops/s 123.0038 Ops/s $\color{#d91a1a}-0.39\%$
test_cql_speed 26.6289ms 25.9761ms 38.4970 Ops/s 39.5042 Ops/s $\color{#d91a1a}-2.55\%$
test_a2c_speed 6.3934ms 5.8240ms 171.7037 Ops/s 188.3323 Ops/s $\textbf{\color{#d91a1a}-8.83\%}$
test_ppo_speed 6.3271ms 6.0565ms 165.1130 Ops/s 177.1074 Ops/s $\textbf{\color{#d91a1a}-6.77\%}$
test_reinforce_speed 5.5979ms 4.7591ms 210.1245 Ops/s 231.1330 Ops/s $\textbf{\color{#d91a1a}-9.09\%}$
test_iql_speed 20.8798ms 20.3562ms 49.1251 Ops/s 52.5153 Ops/s $\textbf{\color{#d91a1a}-6.46\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 2.9945ms 2.8223ms 354.3269 Ops/s 358.4128 Ops/s $\color{#d91a1a}-1.14\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.1931ms 0.5504ms 1.8167 KOps/s 1.8045 KOps/s $\color{#35bf28}+0.68\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6904ms 0.5285ms 1.8920 KOps/s 1.8929 KOps/s $\color{#d91a1a}-0.05\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 3.0199ms 2.8500ms 350.8794 Ops/s 352.9694 Ops/s $\color{#d91a1a}-0.59\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.2640ms 0.5459ms 1.8320 KOps/s 1.8223 KOps/s $\color{#35bf28}+0.53\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7869ms 0.5247ms 1.9059 KOps/s 1.9091 KOps/s $\color{#d91a1a}-0.17\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 4.0842ms 1.4590ms 685.4066 Ops/s 683.7781 Ops/s $\color{#35bf28}+0.24\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.5531ms 1.3991ms 714.7397 Ops/s 717.8489 Ops/s $\color{#d91a1a}-0.43\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 3.1964ms 2.9706ms 336.6307 Ops/s 342.3028 Ops/s $\color{#d91a1a}-1.66\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.8191ms 0.6775ms 1.4760 KOps/s 1.4708 KOps/s $\color{#35bf28}+0.36\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 4.4767ms 0.6559ms 1.5247 KOps/s 1.5001 KOps/s $\color{#35bf28}+1.64\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 2.9818ms 2.8377ms 352.3973 Ops/s 357.2719 Ops/s $\color{#d91a1a}-1.36\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8207ms 0.5529ms 1.8087 KOps/s 1.7915 KOps/s $\color{#35bf28}+0.96\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7061ms 0.5308ms 1.8839 KOps/s 1.8433 KOps/s $\color{#35bf28}+2.20\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 3.0334ms 2.8688ms 348.5738 Ops/s 352.7838 Ops/s $\color{#d91a1a}-1.19\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.1357s 0.7344ms 1.3616 KOps/s 1.8325 KOps/s $\textbf{\color{#d91a1a}-25.70\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6479ms 0.5202ms 1.9225 KOps/s 1.9127 KOps/s $\color{#35bf28}+0.51\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 3.1422ms 2.9713ms 336.5483 Ops/s 341.7409 Ops/s $\color{#d91a1a}-1.52\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.8056ms 0.6851ms 1.4596 KOps/s 1.4623 KOps/s $\color{#d91a1a}-0.19\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8673ms 0.6656ms 1.5023 KOps/s 1.5021 KOps/s $\color{#35bf28}+0.01\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.1206s 7.2456ms 138.0140 Ops/s 101.8410 Ops/s $\textbf{\color{#35bf28}+35.52\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 0.1311s 17.2332ms 58.0277 Ops/s 67.4076 Ops/s $\textbf{\color{#d91a1a}-13.92\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 1.1963ms 1.1116ms 899.6367 Ops/s 846.5878 Ops/s $\textbf{\color{#35bf28}+6.27\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.1195s 7.2206ms 138.4918 Ops/s 137.6174 Ops/s $\color{#35bf28}+0.64\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 16.8983ms 14.9022ms 67.1041 Ops/s 67.6129 Ops/s $\color{#d91a1a}-0.75\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 2.4282ms 1.2406ms 806.0922 Ops/s 849.6891 Ops/s $\textbf{\color{#d91a1a}-5.13\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.1226s 7.6334ms 131.0024 Ops/s 129.9265 Ops/s $\color{#35bf28}+0.83\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 17.6143ms 15.3347ms 65.2115 Ops/s 65.9468 Ops/s $\color{#d91a1a}-1.12\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.7284ms 1.5551ms 643.0379 Ops/s 657.7854 Ops/s $\color{#d91a1a}-2.24\%$

@vmoens merged commit 6c2e141 into main Apr 23, 2024
35 of 42 checks passed
@vmoens deleted the video-recording-sota branch April 23, 2024 16:17