-
Notifications
You must be signed in to change notification settings - Fork 327
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BugFix] Fix dumps for SamplerWithoutReplacement #2506
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2506
Note: Links to docs will display an error until the docs builds have been completed. ❌ 2 New Failures, 5 Unrelated FailuresAs of commit 764f636 with merge base a27514c (): NEW FAILURES - The following jobs have failed:
BROKEN TRUNK - The following jobs failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Oct 21, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_simple | 0.7262s | 0.7253s | 1.3788 Ops/s | 1.3742 Ops/s | |
test_transformed | 1.0607s | 0.9852s | 1.0150 Ops/s | 1.0339 Ops/s | |
test_serial | 2.1956s | 2.1178s | 0.4722 Ops/s | 0.4761 Ops/s | |
test_parallel | 2.0864s | 2.0012s | 0.4997 Ops/s | 0.5057 Ops/s | |
test_step_mdp_speed[True-True-True-True-True] | 0.2495ms | 39.9335μs | 25.0417 KOps/s | 26.3413 KOps/s | |
test_step_mdp_speed[True-True-True-True-False] | 0.3991ms | 22.9089μs | 43.6512 KOps/s | 43.8540 KOps/s | |
test_step_mdp_speed[True-True-True-False-True] | 59.1210μs | 20.9917μs | 47.6378 KOps/s | 48.7408 KOps/s | |
test_step_mdp_speed[True-True-True-False-False] | 0.4192ms | 12.4528μs | 80.3032 KOps/s | 81.6500 KOps/s | |
test_step_mdp_speed[True-True-False-True-True] | 0.4310ms | 42.3926μs | 23.5890 KOps/s | 24.1859 KOps/s | |
test_step_mdp_speed[True-True-False-True-False] | 0.4080ms | 25.6090μs | 39.0488 KOps/s | 39.5950 KOps/s | |
test_step_mdp_speed[True-True-False-False-True] | 0.1394ms | 23.6448μs | 42.2926 KOps/s | 42.0653 KOps/s | |
test_step_mdp_speed[True-True-False-False-False] | 0.4042ms | 15.1938μs | 65.8164 KOps/s | 66.9839 KOps/s | |
test_step_mdp_speed[True-False-True-True-True] | 0.4341ms | 44.9207μs | 22.2614 KOps/s | 22.3782 KOps/s | |
test_step_mdp_speed[True-False-True-True-False] | 95.3210μs | 28.1554μs | 35.5172 KOps/s | 35.1993 KOps/s | |
test_step_mdp_speed[True-False-True-False-True] | 0.4162ms | 23.9304μs | 41.7879 KOps/s | 42.7099 KOps/s | |
test_step_mdp_speed[True-False-True-False-False] | 0.4043ms | 15.3831μs | 65.0064 KOps/s | 66.4200 KOps/s | |
test_step_mdp_speed[True-False-False-True-True] | 80.9910μs | 46.8961μs | 21.3238 KOps/s | 21.1634 KOps/s | |
test_step_mdp_speed[True-False-False-True-False] | 69.9010μs | 31.0128μs | 32.2447 KOps/s | 31.9024 KOps/s | |
test_step_mdp_speed[True-False-False-False-True] | 0.4129ms | 27.2040μs | 36.7594 KOps/s | 38.1845 KOps/s | |
test_step_mdp_speed[True-False-False-False-False] | 0.4052ms | 18.0288μs | 55.4669 KOps/s | 56.7073 KOps/s | |
test_step_mdp_speed[False-True-True-True-True] | 84.4410μs | 45.1008μs | 22.1726 KOps/s | 22.5207 KOps/s | |
test_step_mdp_speed[False-True-True-True-False] | 0.4361ms | 28.4428μs | 35.1583 KOps/s | 35.4109 KOps/s | |
test_step_mdp_speed[False-True-True-False-True] | 0.4212ms | 29.0095μs | 34.4714 KOps/s | 35.0922 KOps/s | |
test_step_mdp_speed[False-True-True-False-False] | 57.6310μs | 17.5476μs | 56.9878 KOps/s | 56.4527 KOps/s | |
test_step_mdp_speed[False-True-False-True-True] | 87.1410μs | 47.9550μs | 20.8529 KOps/s | 21.2078 KOps/s | |
test_step_mdp_speed[False-True-False-True-False] | 0.4183ms | 31.0410μs | 32.2155 KOps/s | 32.2049 KOps/s | |
test_step_mdp_speed[False-True-False-False-True] | 3.0905ms | 31.5440μs | 31.7017 KOps/s | 32.3078 KOps/s | |
test_step_mdp_speed[False-True-False-False-False] | 59.2910μs | 20.3118μs | 49.2324 KOps/s | 49.0416 KOps/s | |
test_step_mdp_speed[False-False-True-True-True] | 0.4345ms | 50.1918μs | 19.9236 KOps/s | 19.9646 KOps/s | |
test_step_mdp_speed[False-False-True-True-False] | 0.4237ms | 33.7650μs | 29.6165 KOps/s | 29.3152 KOps/s | |
test_step_mdp_speed[False-False-True-False-True] | 0.4218ms | 31.4186μs | 31.8283 KOps/s | 31.7630 KOps/s | |
test_step_mdp_speed[False-False-True-False-False] | 63.6410μs | 20.0644μs | 49.8395 KOps/s | 49.1366 KOps/s | |
test_step_mdp_speed[False-False-False-True-True] | 0.4350ms | 52.4209μs | 19.0763 KOps/s | 19.3575 KOps/s | |
test_step_mdp_speed[False-False-False-True-False] | 0.4166ms | 35.8801μs | 27.8706 KOps/s | 27.2714 KOps/s | |
test_step_mdp_speed[False-False-False-False-True] | 0.4216ms | 33.6816μs | 29.6898 KOps/s | 29.9479 KOps/s | |
test_step_mdp_speed[False-False-False-False-False] | 0.4073ms | 22.5289μs | 44.3874 KOps/s | 43.6722 KOps/s | |
test_values[generalized_advantage_estimate-True-True] | 24.4424ms | 23.9663ms | 41.7252 Ops/s | 41.0566 Ops/s | |
test_values[vec_generalized_advantage_estimate-True-True] | 0.1101s | 3.0796ms | 324.7158 Ops/s | 341.0380 Ops/s | |
test_values[td0_return_estimate-False-False] | 84.5810μs | 64.6604μs | 15.4654 KOps/s | 15.4852 KOps/s | |
test_values[td1_return_estimate-False-False] | 54.8640ms | 54.1940ms | 18.4522 Ops/s | 18.3822 Ops/s | |
test_values[vec_td1_return_estimate-False-False] | 1.3636ms | 1.0572ms | 945.9227 Ops/s | 942.7069 Ops/s | |
test_values[td_lambda_return_estimate-True-False] | 91.1284ms | 89.3497ms | 11.1920 Ops/s | 11.3133 Ops/s | |
test_values[vec_td_lambda_return_estimate-True-False] | 1.3503ms | 1.0701ms | 934.5220 Ops/s | 934.0748 Ops/s | |
test_gae_speed[generalized_advantage_estimate-False-1-512] | 25.6521ms | 25.4117ms | 39.3520 Ops/s | 41.2971 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 1.0369ms | 0.7570ms | 1.3210 KOps/s | 1.3869 KOps/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.7634ms | 0.6678ms | 1.4974 KOps/s | 1.5546 KOps/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 1.5314ms | 1.4759ms | 677.5618 Ops/s | 686.4331 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 0.7821ms | 0.6853ms | 1.4593 KOps/s | 1.5242 KOps/s | |
test_dqn_speed[False-None] | 6.5015ms | 1.3311ms | 751.2540 Ops/s | 749.4011 Ops/s | |
test_dqn_speed[False-backward] | 1.9452ms | 1.8392ms | 543.7004 Ops/s | 557.8391 Ops/s | |
test_dqn_speed[True-None] | 0.9654ms | 0.5538ms | 1.8058 KOps/s | 1.7571 KOps/s | |
test_dqn_speed[True-backward] | 1.0533ms | 0.9937ms | 1.0063 KOps/s | 987.5605 Ops/s | |
test_dqn_speed[reduce-overhead-None] | 1.0000ms | 0.5661ms | 1.7663 KOps/s | 1.7698 KOps/s | |
test_dqn_speed[reduce-overhead-backward] | 1.1671ms | 1.0252ms | 975.3800 Ops/s | 989.2096 Ops/s | |
test_ddpg_speed[False-None] | 3.1366ms | 2.7256ms | 366.8924 Ops/s | 367.0019 Ops/s | |
test_ddpg_speed[False-backward] | 4.1310ms | 3.9605ms | 252.4922 Ops/s | 254.8496 Ops/s | |
test_ddpg_speed[True-None] | 1.6466ms | 1.2290ms | 813.6550 Ops/s | 789.6540 Ops/s | |
test_ddpg_speed[True-backward] | 2.2383ms | 2.2062ms | 453.2710 Ops/s | 412.1440 Ops/s | |
test_ddpg_speed[reduce-overhead-None] | 1.6568ms | 1.2430ms | 804.4916 Ops/s | 800.6073 Ops/s | |
test_ddpg_speed[reduce-overhead-backward] | 2.3020ms | 2.2174ms | 450.9729 Ops/s | 455.3621 Ops/s | |
test_sac_speed[False-None] | 8.0296ms | 7.5543ms | 132.3753 Ops/s | 131.1713 Ops/s | |
test_sac_speed[False-backward] | 11.3034ms | 10.7472ms | 93.0478 Ops/s | 93.6800 Ops/s | |
test_sac_speed[True-None] | 2.2074ms | 2.0254ms | 493.7386 Ops/s | 476.6017 Ops/s | |
test_sac_speed[True-backward] | 4.0345ms | 3.9318ms | 254.3387 Ops/s | 208.5170 Ops/s | |
test_sac_speed[reduce-overhead-None] | 2.5014ms | 2.0503ms | 487.7375 Ops/s | 492.4228 Ops/s | |
test_sac_speed[reduce-overhead-backward] | 4.0181ms | 3.9282ms | 254.5710 Ops/s | 256.3971 Ops/s | |
test_redq_speed[False-None] | 15.5382ms | 10.6870ms | 93.5716 Ops/s | 96.2104 Ops/s | |
test_redq_speed[False-backward] | 18.3205ms | 17.1067ms | 58.4568 Ops/s | 57.6433 Ops/s | |
test_redq_speed[True-None] | 3.8292ms | 3.5867ms | 278.8098 Ops/s | 271.5358 Ops/s | |
test_redq_speed[True-backward] | 8.8325ms | 8.5735ms | 116.6389 Ops/s | 120.5692 Ops/s | |
test_redq_speed[reduce-overhead-None] | 4.0697ms | 3.6027ms | 277.5721 Ops/s | 274.8189 Ops/s | |
test_redq_speed[reduce-overhead-backward] | 10.2361ms | 8.8148ms | 113.4458 Ops/s | 120.0102 Ops/s | |
test_redq_deprec_speed[False-None] | 11.0981ms | 10.4952ms | 95.2817 Ops/s | 95.1504 Ops/s | |
test_redq_deprec_speed[False-backward] | 15.9415ms | 15.3376ms | 65.1994 Ops/s | 66.2784 Ops/s | |
test_redq_deprec_speed[True-None] | 3.6818ms | 3.2751ms | 305.3379 Ops/s | 302.5126 Ops/s | |
test_redq_deprec_speed[True-backward] | 7.4814ms | 7.2878ms | 137.2158 Ops/s | 140.6021 Ops/s | |
test_redq_deprec_speed[reduce-overhead-None] | 3.6083ms | 3.2526ms | 307.4462 Ops/s | 309.0879 Ops/s | |
test_redq_deprec_speed[reduce-overhead-backward] | 7.4381ms | 7.2143ms | 138.6143 Ops/s | 138.4959 Ops/s | |
test_td3_speed[False-None] | 8.0303ms | 7.5529ms | 132.4003 Ops/s | 133.8772 Ops/s | |
test_td3_speed[False-backward] | 10.6053ms | 10.2760ms | 97.3144 Ops/s | 97.3052 Ops/s | |
test_td3_speed[True-None] | 1.9486ms | 1.9091ms | 523.7964 Ops/s | 527.5211 Ops/s | |
test_td3_speed[True-backward] | 3.8326ms | 3.7222ms | 268.6564 Ops/s | 271.2745 Ops/s | |
test_td3_speed[reduce-overhead-None] | 1.9567ms | 1.8991ms | 526.5684 Ops/s | 526.9107 Ops/s | |
test_td3_speed[reduce-overhead-backward] | 3.8355ms | 3.7204ms | 268.7913 Ops/s | 272.3564 Ops/s | |
test_cql_speed[False-None] | 28.2447ms | 25.2492ms | 39.6051 Ops/s | 40.7810 Ops/s | |
test_cql_speed[False-backward] | 0.2844s | 39.7838ms | 25.1358 Ops/s | 29.7668 Ops/s | |
test_cql_speed[True-None] | 11.1644ms | 10.8821ms | 91.8937 Ops/s | 92.9647 Ops/s | |
test_cql_speed[True-backward] | 17.1849ms | 16.7679ms | 59.6378 Ops/s | 58.9637 Ops/s | |
test_cql_speed[reduce-overhead-None] | 11.4156ms | 10.9384ms | 91.4207 Ops/s | 95.0743 Ops/s | |
test_cql_speed[reduce-overhead-backward] | 17.2497ms | 16.6836ms | 59.9393 Ops/s | 62.4056 Ops/s | |
test_a2c_speed[False-None] | 6.4817ms | 5.2962ms | 188.8141 Ops/s | 190.3825 Ops/s | |
test_a2c_speed[False-backward] | 12.0165ms | 11.7265ms | 85.2770 Ops/s | 86.3752 Ops/s | |
test_a2c_speed[True-None] | 3.2973ms | 3.0613ms | 326.6593 Ops/s | 325.8186 Ops/s | |
test_a2c_speed[True-backward] | 8.8538ms | 8.6173ms | 116.0463 Ops/s | 113.2148 Ops/s | |
test_a2c_speed[reduce-overhead-None] | 3.4611ms | 3.0332ms | 329.6849 Ops/s | 328.8046 Ops/s | |
test_a2c_speed[reduce-overhead-backward] | 8.8934ms | 8.5667ms | 116.7309 Ops/s | 119.3452 Ops/s | |
test_ppo_speed[False-None] | 6.1006ms | 5.6696ms | 176.3788 Ops/s | 178.0895 Ops/s | |
test_ppo_speed[False-backward] | 12.6467ms | 12.2293ms | 81.7708 Ops/s | 83.3498 Ops/s | |
test_ppo_speed[True-None] | 3.8003ms | 3.4207ms | 292.3344 Ops/s | 286.7036 Ops/s | |
test_ppo_speed[True-backward] | 8.4150ms | 8.2141ms | 121.7420 Ops/s | 122.2816 Ops/s | |
test_ppo_speed[reduce-overhead-None] | 3.8855ms | 3.4305ms | 291.4999 Ops/s | 289.7089 Ops/s | |
test_ppo_speed[reduce-overhead-backward] | 8.4282ms | 8.2850ms | 120.6996 Ops/s | 121.6675 Ops/s | |
test_reinforce_speed[False-None] | 6.4474ms | 4.4065ms | 226.9357 Ops/s | 224.8235 Ops/s | |
test_reinforce_speed[False-backward] | 7.7237ms | 7.2102ms | 138.6917 Ops/s | 136.9049 Ops/s | |
test_reinforce_speed[True-None] | 2.5996ms | 2.2032ms | 453.8822 Ops/s | 454.3613 Ops/s | |
test_reinforce_speed[True-backward] | 7.5528ms | 7.1340ms | 140.1729 Ops/s | 142.8035 Ops/s | |
test_reinforce_speed[reduce-overhead-None] | 2.6390ms | 2.2166ms | 451.1400 Ops/s | 445.5214 Ops/s | |
test_reinforce_speed[reduce-overhead-backward] | 7.2682ms | 7.0772ms | 141.2997 Ops/s | 141.5791 Ops/s | |
test_iql_speed[False-None] | 23.8216ms | 19.8747ms | 50.3154 Ops/s | 51.6424 Ops/s | |
test_iql_speed[False-backward] | 35.9867ms | 30.2094ms | 33.1023 Ops/s | 33.5315 Ops/s | |
test_iql_speed[True-None] | 7.3159ms | 6.7462ms | 148.2314 Ops/s | 150.1478 Ops/s | |
test_iql_speed[True-backward] | 15.9653ms | 15.5670ms | 64.2385 Ops/s | 64.7614 Ops/s | |
test_iql_speed[reduce-overhead-None] | 7.1632ms | 6.7704ms | 147.7019 Ops/s | 149.1756 Ops/s | |
test_iql_speed[reduce-overhead-backward] | 15.7015ms | 15.3796ms | 65.0213 Ops/s | 64.7068 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 6.5877ms | 6.1950ms | 161.4198 Ops/s | 162.9099 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.8442ms | 0.2367ms | 4.2249 KOps/s | 4.2215 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.7760ms | 0.2635ms | 3.7955 KOps/s | 4.6970 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 6.6765ms | 6.1538ms | 162.5007 Ops/s | 167.8967 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.8586ms | 0.2658ms | 3.7620 KOps/s | 4.3574 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.4560ms | 0.2100ms | 4.7629 KOps/s | 4.7434 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 1.6268ms | 1.1965ms | 835.7472 Ops/s | 789.3910 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 1.3349ms | 1.1267ms | 887.5661 Ops/s | 899.1451 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 6.7439ms | 6.3226ms | 158.1615 Ops/s | 162.4973 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.9451ms | 0.4117ms | 2.4288 KOps/s | 2.2739 KOps/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.7963ms | 0.3553ms | 2.8148 KOps/s | 2.8242 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 6.6375ms | 6.1830ms | 161.7336 Ops/s | 166.3908 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 1.7091ms | 0.2606ms | 3.8375 KOps/s | 4.2856 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.5511ms | 0.3501ms | 2.8562 KOps/s | 4.7553 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 10.3571ms | 6.2653ms | 159.6080 Ops/s | 167.8229 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 1.2532ms | 0.2330ms | 4.2911 KOps/s | 3.2973 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.6458ms | 0.2112ms | 4.7354 KOps/s | 3.5008 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 6.6522ms | 6.2900ms | 158.9828 Ops/s | 157.8852 Ops/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 0.9946ms | 0.4462ms | 2.2411 KOps/s | 2.5765 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.8775ms | 0.4357ms | 2.2953 KOps/s | 2.8042 KOps/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 7.0077ms | 5.2182ms | 191.6367 Ops/s | 191.8782 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 11.1480ms | 2.0476ms | 488.3829 Ops/s | 489.6532 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 2.2188ms | 1.0673ms | 936.9664 Ops/s | 819.0594 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.4186s | 13.5371ms | 73.8710 Ops/s | 189.5371 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 7.7605ms | 1.9871ms | 503.2421 Ops/s | 498.6656 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 8.2505ms | 1.2401ms | 806.3544 Ops/s | 821.8220 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 8.4358ms | 5.4235ms | 184.3831 Ops/s | 184.0285 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 9.3856ms | 2.1964ms | 455.2858 Ops/s | 454.6990 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 7.8890ms | 1.3422ms | 745.0377 Ops/s | 756.2385 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_simple | 0.4346s | 0.4281s | 2.3360 Ops/s | 2.2097 Ops/s | |
test_transformed | 0.7268s | 0.6277s | 1.5931 Ops/s | 1.5721 Ops/s | |
test_serial | 1.4778s | 1.3790s | 0.7252 Ops/s | 0.7091 Ops/s | |
test_parallel | 1.4645s | 1.3516s | 0.7399 Ops/s | 0.7218 Ops/s | |
test_step_mdp_speed[True-True-True-True-True] | 0.2147ms | 28.0623μs | 35.6350 KOps/s | 34.2621 KOps/s | |
test_step_mdp_speed[True-True-True-True-False] | 53.7110μs | 16.7461μs | 59.7155 KOps/s | 57.4688 KOps/s | |
test_step_mdp_speed[True-True-True-False-True] | 67.4970μs | 15.7872μs | 63.3423 KOps/s | 61.2418 KOps/s | |
test_step_mdp_speed[True-True-True-False-False] | 34.5750μs | 9.2398μs | 108.2276 KOps/s | 102.9206 KOps/s | |
test_step_mdp_speed[True-True-False-True-True] | 66.5250μs | 30.3619μs | 32.9361 KOps/s | 31.6141 KOps/s | |
test_step_mdp_speed[True-True-False-True-False] | 60.3830μs | 19.1846μs | 52.1250 KOps/s | 51.2251 KOps/s | |
test_step_mdp_speed[True-True-False-False-True] | 53.0300μs | 17.6758μs | 56.5746 KOps/s | 54.0500 KOps/s | |
test_step_mdp_speed[True-True-False-False-False] | 39.9750μs | 11.3358μs | 88.2160 KOps/s | 83.5952 KOps/s | |
test_step_mdp_speed[True-False-True-True-True] | 75.3610μs | 32.5052μs | 30.7643 KOps/s | 29.6534 KOps/s | |
test_step_mdp_speed[True-False-True-True-False] | 53.6510μs | 21.0757μs | 47.4480 KOps/s | 45.6994 KOps/s | |
test_step_mdp_speed[True-False-True-False-True] | 46.6170μs | 17.9008μs | 55.8635 KOps/s | 54.3400 KOps/s | |
test_step_mdp_speed[True-False-True-False-False] | 0.1368ms | 11.2936μs | 88.5460 KOps/s | 84.2716 KOps/s | |
test_step_mdp_speed[True-False-False-True-True] | 0.1084ms | 34.5934μs | 28.9073 KOps/s | 28.2140 KOps/s | |
test_step_mdp_speed[True-False-False-True-False] | 97.6690μs | 22.5693μs | 44.3080 KOps/s | 42.0665 KOps/s | |
test_step_mdp_speed[True-False-False-False-True] | 0.6124ms | 19.4454μs | 51.4260 KOps/s | 49.4767 KOps/s | |
test_step_mdp_speed[True-False-False-False-False] | 63.9400μs | 13.1407μs | 76.0992 KOps/s | 73.3554 KOps/s | |
test_step_mdp_speed[False-True-True-True-True] | 97.4330μs | 32.2027μs | 31.0533 KOps/s | 29.7348 KOps/s | |
test_step_mdp_speed[False-True-True-True-False] | 85.4510μs | 20.9530μs | 47.7258 KOps/s | 46.4463 KOps/s | |
test_step_mdp_speed[False-True-True-False-True] | 82.6850μs | 21.0834μs | 47.4308 KOps/s | 45.8838 KOps/s | |
test_step_mdp_speed[False-True-True-False-False] | 54.3820μs | 13.2050μs | 75.7288 KOps/s | 74.3056 KOps/s | |
test_step_mdp_speed[False-True-False-True-True] | 98.0340μs | 34.0744μs | 29.3475 KOps/s | 28.3485 KOps/s | |
test_step_mdp_speed[False-True-False-True-False] | 90.4000μs | 22.8499μs | 43.7639 KOps/s | 42.0791 KOps/s | |
test_step_mdp_speed[False-True-False-False-True] | 2.7392ms | 23.0507μs | 43.3827 KOps/s | 41.7767 KOps/s | |
test_step_mdp_speed[False-True-False-False-False] | 50.1950μs | 15.1061μs | 66.1983 KOps/s | 64.5902 KOps/s | |
test_step_mdp_speed[False-False-True-True-True] | 89.1670μs | 36.0900μs | 27.7085 KOps/s | 26.7372 KOps/s | |
test_step_mdp_speed[False-False-True-True-False] | 87.3740μs | 24.6772μs | 40.5232 KOps/s | 38.3669 KOps/s | |
test_step_mdp_speed[False-False-True-False-True] | 86.6430μs | 23.2428μs | 43.0240 KOps/s | 41.7204 KOps/s | |
test_step_mdp_speed[False-False-True-False-False] | 83.7370μs | 14.9540μs | 66.8717 KOps/s | 63.9477 KOps/s | |
test_step_mdp_speed[False-False-False-True-True] | 80.7020μs | 37.9238μs | 26.3687 KOps/s | 25.3902 KOps/s | |
test_step_mdp_speed[False-False-False-True-False] | 95.5890μs | 26.7307μs | 37.4102 KOps/s | 35.6554 KOps/s | |
test_step_mdp_speed[False-False-False-False-True] | 71.4540μs | 24.6069μs | 40.6390 KOps/s | 39.1626 KOps/s | |
test_step_mdp_speed[False-False-False-False-False] | 80.5310μs | 16.8066μs | 59.5003 KOps/s | 57.2406 KOps/s | |
test_values[generalized_advantage_estimate-True-True] | 9.9645ms | 9.7447ms | 102.6200 Ops/s | 100.6518 Ops/s | |
test_values[vec_generalized_advantage_estimate-True-True] | 37.3723ms | 34.1603ms | 29.2738 Ops/s | 27.0520 Ops/s | |
test_values[td0_return_estimate-False-False] | 0.2783ms | 0.1999ms | 5.0029 KOps/s | 5.2009 KOps/s | |
test_values[td1_return_estimate-False-False] | 27.3107ms | 24.3358ms | 41.0917 Ops/s | 41.0761 Ops/s | |
test_values[vec_td1_return_estimate-False-False] | 36.0248ms | 33.9074ms | 29.4921 Ops/s | 26.6919 Ops/s | |
test_values[td_lambda_return_estimate-True-False] | 47.3926ms | 34.7363ms | 28.7883 Ops/s | 28.2955 Ops/s | |
test_values[vec_td_lambda_return_estimate-True-False] | 35.6661ms | 33.8147ms | 29.5729 Ops/s | 26.9812 Ops/s | |
test_gae_speed[generalized_advantage_estimate-False-1-512] | 10.7192ms | 8.3851ms | 119.2588 Ops/s | 117.4082 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 2.4440ms | 2.0228ms | 494.3661 Ops/s | 485.5384 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 1.7859ms | 0.3588ms | 2.7867 KOps/s | 2.7497 KOps/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 46.4565ms | 43.9533ms | 22.7514 Ops/s | 21.5745 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 4.1684ms | 3.2086ms | 311.6666 Ops/s | 309.0716 Ops/s | |
test_dqn_speed[False-None] | 6.2446ms | 1.3673ms | 731.3837 Ops/s | 667.6242 Ops/s | |
test_dqn_speed[False-backward] | 1.9850ms | 1.8448ms | 542.0579 Ops/s | 517.1914 Ops/s | |
test_dqn_speed[True-None] | 0.5855ms | 0.4686ms | 2.1340 KOps/s | 2.0991 KOps/s | |
test_dqn_speed[True-backward] | 1.0216ms | 0.9065ms | 1.1031 KOps/s | 1.0461 KOps/s | |
test_dqn_speed[reduce-overhead-None] | 0.6387ms | 0.4740ms | 2.1097 KOps/s | 2.0628 KOps/s | |
test_dqn_speed[reduce-overhead-backward] | 0.9807ms | 0.8974ms | 1.1143 KOps/s | 1.0915 KOps/s | |
test_ddpg_speed[False-None] | 3.8034ms | 2.8956ms | 345.3478 Ops/s | 332.6856 Ops/s | |
test_ddpg_speed[False-backward] | 5.5001ms | 4.1955ms | 238.3495 Ops/s | 226.4165 Ops/s | |
test_ddpg_speed[True-None] | 1.4935ms | 1.0223ms | 978.2267 Ops/s | 963.7579 Ops/s | |
test_ddpg_speed[True-backward] | 2.0846ms | 1.9601ms | 510.1715 Ops/s | 452.6570 Ops/s | |
test_ddpg_speed[reduce-overhead-None] | 1.2354ms | 1.0089ms | 991.1778 Ops/s | 978.7418 Ops/s | |
test_ddpg_speed[reduce-overhead-backward] | 2.4521ms | 1.9599ms | 510.2308 Ops/s | 491.5707 Ops/s | |
test_sac_speed[False-None] | 9.0913ms | 8.0924ms | 123.5728 Ops/s | 108.1776 Ops/s | |
test_sac_speed[False-backward] | 11.4831ms | 10.9157ms | 91.6114 Ops/s | 80.0415 Ops/s | |
test_sac_speed[True-None] | 2.1760ms | 1.8739ms | 533.6449 Ops/s | 522.7900 Ops/s | |
test_sac_speed[True-backward] | 3.6877ms | 3.6455ms | 274.3134 Ops/s | 247.2640 Ops/s | |
test_sac_speed[reduce-overhead-None] | 2.5375ms | 1.8835ms | 530.9198 Ops/s | 457.7223 Ops/s | |
test_sac_speed[reduce-overhead-backward] | 4.2827ms | 4.0369ms | 247.7134 Ops/s | 233.2938 Ops/s | |
test_redq_speed[False-None] | 15.4633ms | 13.2037ms | 75.7364 Ops/s | 71.5423 Ops/s | |
test_redq_speed[False-backward] | 30.2788ms | 23.2580ms | 42.9959 Ops/s | 41.9321 Ops/s | |
test_redq_speed[True-None] | 6.5996ms | 5.5866ms | 179.0000 Ops/s | 166.5378 Ops/s | |
test_redq_speed[True-backward] | 14.2533ms | 12.9540ms | 77.1965 Ops/s | 74.3891 Ops/s | |
test_redq_speed[reduce-overhead-None] | 8.1880ms | 7.0717ms | 141.4086 Ops/s | 165.0809 Ops/s | |
test_redq_speed[reduce-overhead-backward] | 18.1321ms | 13.4294ms | 74.4633 Ops/s | 74.0750 Ops/s | |
test_redq_deprec_speed[False-None] | 15.4091ms | 13.8607ms | 72.1464 Ops/s | 68.0784 Ops/s | |
test_redq_deprec_speed[False-backward] | 21.3554ms | 19.8659ms | 50.3376 Ops/s | 47.6559 Ops/s | |
test_redq_deprec_speed[True-None] | 4.9882ms | 4.0341ms | 247.8865 Ops/s | 218.6160 Ops/s | |
test_redq_deprec_speed[True-backward] | 12.5508ms | 11.2624ms | 88.7911 Ops/s | 102.9786 Ops/s | |
test_redq_deprec_speed[reduce-overhead-None] | 6.2217ms | 5.5153ms | 181.3125 Ops/s | 207.9626 Ops/s | |
test_redq_deprec_speed[reduce-overhead-backward] | 11.9386ms | 11.3211ms | 88.3306 Ops/s | 101.0481 Ops/s | |
test_td3_speed[False-None] | 13.4810ms | 10.8013ms | 92.5811 Ops/s | 106.0974 Ops/s | |
test_td3_speed[False-backward] | 17.4484ms | 14.4783ms | 69.0691 Ops/s | 80.3913 Ops/s | |
test_td3_speed[True-None] | 4.2792ms | 3.0504ms | 327.8309 Ops/s | 461.2330 Ops/s | |
test_td3_speed[True-backward] | 5.5373ms | 5.1383ms | 194.6185 Ops/s | 224.3424 Ops/s | |
test_td3_speed[reduce-overhead-None] | 3.5651ms | 3.0532ms | 327.5279 Ops/s | 498.6412 Ops/s | |
test_td3_speed[reduce-overhead-backward] | 5.3836ms | 5.1196ms | 195.3290 Ops/s | 231.4346 Ops/s | |
test_cql_speed[False-None] | 48.6023ms | 44.2384ms | 22.6048 Ops/s | 25.6398 Ops/s | |
test_cql_speed[False-backward] | 57.1356ms | 54.7956ms | 18.2496 Ops/s | 20.2841 Ops/s | |
test_cql_speed[True-None] | 20.6271ms | 17.9643ms | 55.6658 Ops/s | 60.6341 Ops/s | |
test_cql_speed[True-backward] | 31.1022ms | 27.9555ms | 35.7712 Ops/s | 41.6749 Ops/s | |
test_cql_speed[reduce-overhead-None] | 19.0904ms | 17.9248ms | 55.7887 Ops/s | 59.9465 Ops/s | |
test_cql_speed[reduce-overhead-backward] | 28.4389ms | 27.1356ms | 36.8520 Ops/s | 40.8813 Ops/s | |
test_a2c_speed[False-None] | 10.5048ms | 8.7463ms | 114.3346 Ops/s | 130.8572 Ops/s | |
test_a2c_speed[False-backward] | 20.2101ms | 18.5037ms | 54.0432 Ops/s | 64.6874 Ops/s | |
test_a2c_speed[True-None] | 5.3277ms | 4.5433ms | 220.1036 Ops/s | 267.3075 Ops/s | |
test_a2c_speed[True-backward] | 13.4439ms | 12.6755ms | 78.8921 Ops/s | 89.3153 Ops/s | |
test_a2c_speed[reduce-overhead-None] | 5.1530ms | 4.4527ms | 224.5834 Ops/s | 264.7370 Ops/s | |
test_a2c_speed[reduce-overhead-backward] | 13.9537ms | 12.7846ms | 78.2191 Ops/s | 91.3866 Ops/s | |
test_ppo_speed[False-None] | 11.2895ms | 9.7925ms | 102.1193 Ops/s | 120.4795 Ops/s | |
test_ppo_speed[False-backward] | 31.8533ms | 19.5606ms | 51.1233 Ops/s | 62.2244 Ops/s | |
test_ppo_speed[True-None] | 5.9677ms | 4.9810ms | 200.7641 Ops/s | 238.6944 Ops/s | |
test_ppo_speed[True-backward] | 13.4147ms | 12.6967ms | 78.7609 Ops/s | 92.5198 Ops/s | |
test_ppo_speed[reduce-overhead-None] | 5.7563ms | 4.9051ms | 203.8690 Ops/s | 260.8226 Ops/s | |
test_ppo_speed[reduce-overhead-backward] | 13.3298ms | 12.4517ms | 80.3102 Ops/s | 94.9575 Ops/s | |
test_reinforce_speed[False-None] | 9.4257ms | 8.2224ms | 121.6196 Ops/s | 138.9074 Ops/s | |
test_reinforce_speed[False-backward] | 13.7511ms | 12.5908ms | 79.4230 Ops/s | 94.1627 Ops/s | |
test_reinforce_speed[True-None] | 4.2642ms | 3.7815ms | 264.4455 Ops/s | 358.4876 Ops/s | |
test_reinforce_speed[True-backward] | 12.9783ms | 11.7113ms | 85.3875 Ops/s | 108.2110 Ops/s | |
test_reinforce_speed[reduce-overhead-None] | 4.5891ms | 3.9043ms | 256.1289 Ops/s | 349.9043 Ops/s | |
test_reinforce_speed[reduce-overhead-backward] | 12.4043ms | 11.3892ms | 87.8025 Ops/s | 108.5160 Ops/s | |
test_iql_speed[False-None] | 38.9308ms | 36.8307ms | 27.1513 Ops/s | 28.7945 Ops/s | |
test_iql_speed[False-backward] | 55.2132ms | 52.4319ms | 19.0724 Ops/s | 20.9888 Ops/s | |
test_iql_speed[True-None] | 13.4485ms | 12.3310ms | 81.0966 Ops/s | 87.3654 Ops/s | |
test_iql_speed[True-backward] | 27.0574ms | 25.8228ms | 38.7254 Ops/s | 42.5001 Ops/s | |
test_iql_speed[reduce-overhead-None] | 13.4507ms | 12.4187ms | 80.5235 Ops/s | 87.2714 Ops/s | |
test_iql_speed[reduce-overhead-backward] | 26.3334ms | 25.5674ms | 39.1124 Ops/s | 41.9993 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 7.6533ms | 6.5662ms | 152.2950 Ops/s | 182.2617 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.8748ms | 0.5390ms | 1.8552 KOps/s | 1.9228 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.8402ms | 0.5125ms | 1.9514 KOps/s | 2.0445 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 7.5605ms | 6.5131ms | 153.5357 Ops/s | 189.6084 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 2.3437ms | 0.5426ms | 1.8431 KOps/s | 1.9740 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.8598ms | 0.5059ms | 1.9766 KOps/s | 2.0646 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 2.4388ms | 1.7611ms | 567.8396 Ops/s | 606.8627 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 2.5725ms | 1.7113ms | 584.3613 Ops/s | 628.4437 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 7.6018ms | 6.8420ms | 146.1571 Ops/s | 181.6562 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.8789ms | 0.6925ms | 1.4441 KOps/s | 1.5204 KOps/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.9548ms | 0.6559ms | 1.5247 KOps/s | 1.5689 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 11.2510ms | 6.5143ms | 153.5086 Ops/s | 189.4154 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 2.1991ms | 0.5500ms | 1.8182 KOps/s | 1.9487 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.7468ms | 0.5220ms | 1.9159 KOps/s | 2.0406 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 9.2009ms | 6.3105ms | 158.4664 Ops/s | 185.4683 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 2.9003ms | 0.5527ms | 1.8094 KOps/s | 1.9622 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.6920ms | 0.5067ms | 1.9737 KOps/s | 2.0594 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 7.3408ms | 6.7686ms | 147.7400 Ops/s | 172.6553 Ops/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 5.6724ms | 0.7213ms | 1.3863 KOps/s | 1.5136 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.9473ms | 0.6616ms | 1.5116 KOps/s | 1.5285 KOps/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 10.8635ms | 6.7548ms | 148.0423 Ops/s | 206.5483 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 12.3740ms | 3.0734ms | 325.3692 Ops/s | 415.8388 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 6.8632ms | 1.5233ms | 656.4659 Ops/s | 758.6844 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.6427s | 19.9219ms | 50.1960 Ops/s | 201.5999 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 8.6428ms | 3.0479ms | 328.0955 Ops/s | 404.4724 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 6.5891ms | 1.6625ms | 601.5045 Ops/s | 744.8158 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 12.0304ms | 7.6049ms | 131.4937 Ops/s | 196.8550 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 12.6606ms | 3.4124ms | 293.0521 Ops/s | 389.4981 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 2.9830ms | 1.8270ms | 547.3502 Ops/s | 636.7671 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.