-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Refactor] make TD.get default to None, like dict #948
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Aug 4, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 44.7220μs | 20.3255μs | 49.1993 KOps/s | 46.3133 KOps/s | |
test_plain_set_stack_nested | 46.5860μs | 20.6640μs | 48.3934 KOps/s | 46.5984 KOps/s | |
test_plain_set_nested_inplace | 67.6170μs | 22.3995μs | 44.6438 KOps/s | 42.2825 KOps/s | |
test_plain_set_stack_nested_inplace | 97.5800μs | 22.4666μs | 44.5105 KOps/s | 42.2769 KOps/s | |
test_items | 21.3100μs | 2.8047μs | 356.5447 KOps/s | 389.0112 KOps/s | |
test_items_nested | 0.4887ms | 0.3405ms | 2.9373 KOps/s | 2.8716 KOps/s | |
test_items_nested_locked | 2.7928ms | 0.3424ms | 2.9202 KOps/s | 2.9778 KOps/s | |
test_items_nested_leaf | 0.1538ms | 83.9307μs | 11.9146 KOps/s | 12.0386 KOps/s | |
test_items_stack_nested | 0.5990ms | 0.3433ms | 2.9133 KOps/s | 2.9544 KOps/s | |
test_items_stack_nested_leaf | 0.1604ms | 85.3844μs | 11.7117 KOps/s | 12.1065 KOps/s | |
test_items_stack_nested_locked | 0.6376ms | 0.3464ms | 2.8865 KOps/s | 2.9440 KOps/s | |
test_keys | 43.6310μs | 3.9538μs | 252.9192 KOps/s | 257.7633 KOps/s | |
test_keys_nested | 0.2612ms | 0.1436ms | 6.9625 KOps/s | 7.0813 KOps/s | |
test_keys_nested_locked | 0.7572ms | 0.1505ms | 6.6439 KOps/s | 6.7646 KOps/s | |
test_keys_nested_leaf | 0.2414ms | 0.1240ms | 8.0675 KOps/s | 8.1371 KOps/s | |
test_keys_stack_nested | 0.2820ms | 0.1504ms | 6.6503 KOps/s | 7.1423 KOps/s | |
test_keys_stack_nested_leaf | 0.2442ms | 0.1240ms | 8.0623 KOps/s | 8.2866 KOps/s | |
test_keys_stack_nested_locked | 0.3146ms | 0.1522ms | 6.5691 KOps/s | 6.8383 KOps/s | |
test_values | 10.4293μs | 1.1893μs | 840.8107 KOps/s | 863.6957 KOps/s | |
test_values_nested | 98.5620μs | 50.5765μs | 19.7720 KOps/s | 19.9465 KOps/s | |
test_values_nested_locked | 0.1033ms | 50.6495μs | 19.7435 KOps/s | 20.1524 KOps/s | |
test_values_nested_leaf | 92.3410μs | 45.1469μs | 22.1499 KOps/s | 22.0795 KOps/s | |
test_values_stack_nested | 99.0330μs | 50.8043μs | 19.6834 KOps/s | 19.3406 KOps/s | |
test_values_stack_nested_leaf | 97.9120μs | 45.5290μs | 21.9640 KOps/s | 22.3393 KOps/s | |
test_values_stack_nested_locked | 98.6830μs | 50.5737μs | 19.7731 KOps/s | 19.2494 KOps/s | |
test_membership | 5.9237μs | 0.7648μs | 1.3075 MOps/s | 1.0714 MOps/s | |
test_membership_nested | 17.4820μs | 2.6209μs | 381.5544 KOps/s | 384.5151 KOps/s | |
test_membership_nested_leaf | 42.1380μs | 2.6527μs | 376.9719 KOps/s | 380.4006 KOps/s | |
test_membership_stacked_nested | 15.4190μs | 2.6069μs | 383.6013 KOps/s | 387.0836 KOps/s | |
test_membership_stacked_nested_leaf | 24.9360μs | 2.6374μs | 379.1581 KOps/s | 383.8027 KOps/s | |
test_membership_nested_last | 22.5220μs | 3.8918μs | 256.9500 KOps/s | 255.8623 KOps/s | |
test_membership_nested_leaf_last | 22.2310μs | 3.8783μs | 257.8474 KOps/s | 254.5605 KOps/s | |
test_membership_stacked_nested_last | 15.3390μs | 3.8511μs | 259.6648 KOps/s | 170.3107 KOps/s | |
test_membership_stacked_nested_leaf_last | 41.8170μs | 3.8779μs | 257.8683 KOps/s | 167.0562 KOps/s | |
test_nested_getleaf | 41.2060μs | 10.6167μs | 94.1908 KOps/s | 93.3095 KOps/s | |
test_nested_get | 49.7920μs | 9.9470μs | 100.5324 KOps/s | 98.9666 KOps/s | |
test_stacked_getleaf | 0.1136ms | 10.5927μs | 94.4044 KOps/s | 94.2600 KOps/s | |
test_stacked_get | 22.2110μs | 10.0378μs | 99.6231 KOps/s | 98.8548 KOps/s | |
test_nested_getitemleaf | 45.5170μs | 11.0586μs | 90.4278 KOps/s | 89.4068 KOps/s | |
test_nested_getitem | 49.6020μs | 10.3163μs | 96.9336 KOps/s | 95.9169 KOps/s | |
test_stacked_getitemleaf | 29.8750μs | 11.1956μs | 89.3209 KOps/s | 88.0127 KOps/s | |
test_stacked_getitem | 49.8120μs | 10.0982μs | 99.0272 KOps/s | 96.8867 KOps/s | |
test_lock_nested | 78.2889ms | 0.5806ms | 1.7222 KOps/s | 1.9833 KOps/s | |
test_lock_stack_nested | 0.7364ms | 0.4804ms | 2.0815 KOps/s | 2.2028 KOps/s | |
test_unlock_nested | 75.9080ms | 0.4955ms | 2.0182 KOps/s | 2.4071 KOps/s | |
test_unlock_stack_nested | 0.8746ms | 0.3934ms | 2.5419 KOps/s | 2.6989 KOps/s | |
test_flatten_speed | 0.1972ms | 0.1035ms | 9.6573 KOps/s | 9.7314 KOps/s | |
test_unflatten_speed | 0.6357ms | 0.4382ms | 2.2821 KOps/s | 2.3082 KOps/s | |
test_common_ops | 1.8586ms | 1.0781ms | 927.5202 Ops/s | 907.0143 Ops/s | |
test_creation | 65.2910μs | 2.0675μs | 483.6674 KOps/s | 501.8439 KOps/s | |
test_creation_empty | 34.4430μs | 15.8861μs | 62.9483 KOps/s | 57.4356 KOps/s | |
test_creation_nested_1 | 0.1545ms | 19.3696μs | 51.6273 KOps/s | 48.6710 KOps/s | |
test_creation_nested_2 | 60.7120μs | 23.2869μs | 42.9427 KOps/s | 39.9266 KOps/s | |
test_clone | 95.2160μs | 17.3925μs | 57.4961 KOps/s | 60.9952 KOps/s | |
test_getitem[int] | 0.8095ms | 17.0342μs | 58.7056 KOps/s | 61.2534 KOps/s | |
test_getitem[slice_int] | 0.1226ms | 31.1715μs | 32.0806 KOps/s | 31.3156 KOps/s | |
test_getitem[range] | 0.1683ms | 58.4897μs | 17.0970 KOps/s | 17.7238 KOps/s | |
test_getitem[tuple] | 0.1195ms | 25.3019μs | 39.5228 KOps/s | 39.0895 KOps/s | |
test_getitem[list] | 0.1779ms | 53.5880μs | 18.6609 KOps/s | 19.0773 KOps/s | |
test_setitem_dim[int] | 67.6250μs | 39.0346μs | 25.6183 KOps/s | 24.1581 KOps/s | |
test_setitem_dim[slice_int] | 96.3690μs | 68.3901μs | 14.6220 KOps/s | 13.6383 KOps/s | |
test_setitem_dim[range] | 0.1831ms | 91.8909μs | 10.8825 KOps/s | 10.8783 KOps/s | |
test_setitem_dim[tuple] | 0.1839ms | 57.0428μs | 17.5307 KOps/s | 16.7262 KOps/s | |
test_setitem | 80.0370μs | 28.4069μs | 35.2027 KOps/s | 34.4043 KOps/s | |
test_set | 77.8850μs | 27.8305μs | 35.9318 KOps/s | 35.5776 KOps/s | |
test_set_shared | 1.3389ms | 0.2154ms | 4.6416 KOps/s | 4.6262 KOps/s | |
test_update | 0.1723ms | 34.1145μs | 29.3131 KOps/s | 28.5609 KOps/s | |
test_update_nested | 0.1295ms | 44.3365μs | 22.5548 KOps/s | 22.1719 KOps/s | |
test_update__nested | 89.0050μs | 35.3050μs | 28.3246 KOps/s | 29.3801 KOps/s | |
test_set_nested | 77.3330μs | 30.4544μs | 32.8360 KOps/s | 32.1632 KOps/s | |
test_set_nested_new | 86.8300μs | 35.3421μs | 28.2949 KOps/s | 25.6355 KOps/s | |
test_select | 0.1552ms | 53.6614μs | 18.6354 KOps/s | 19.1333 KOps/s | |
test_select_nested | 0.1267ms | 59.6191μs | 16.7732 KOps/s | 17.1997 KOps/s | |
test_exclude_nested | 0.3449ms | 77.9689μs | 12.8256 KOps/s | 13.0908 KOps/s | |
test_empty[True] | 1.2326ms | 0.3238ms | 3.0883 KOps/s | 3.1640 KOps/s | |
test_empty[False] | 7.5340μs | 1.2147μs | 823.2610 KOps/s | 856.2467 KOps/s | |
test_unbind_speed | 0.6325ms | 0.3158ms | 3.1666 KOps/s | 3.2132 KOps/s | |
test_unbind_speed_stack0 | 0.5420ms | 0.3137ms | 3.1879 KOps/s | 3.3599 KOps/s | |
test_unbind_speed_stack1 | 82.1477ms | 0.8037ms | 1.2442 KOps/s | 1.4186 KOps/s | |
test_split | 79.1100ms | 2.1821ms | 458.2716 Ops/s | 473.5484 Ops/s | |
test_chunk | 84.2767ms | 2.1992ms | 454.7195 Ops/s | 474.1806 Ops/s | |
test_creation[device0] | 0.2247ms | 0.1172ms | 8.5340 KOps/s | 8.3786 KOps/s | |
test_creation_from_tensor | 3.8371ms | 0.1212ms | 8.2500 KOps/s | 8.3392 KOps/s | |
test_add_one[memmap_tensor0] | 0.2596ms | 7.7662μs | 128.7624 KOps/s | 136.3753 KOps/s | |
test_contiguous[memmap_tensor0] | 26.0380μs | 2.0102μs | 497.4571 KOps/s | 511.9046 KOps/s | |
test_stack[memmap_tensor0] | 50.8440μs | 5.9869μs | 167.0322 KOps/s | 176.5912 KOps/s | |
test_memmaptd_index | 0.6398ms | 0.4222ms | 2.3683 KOps/s | 2.5175 KOps/s | |
test_memmaptd_index_astensor | 0.9132ms | 0.5048ms | 1.9811 KOps/s | 2.0996 KOps/s | |
test_memmaptd_index_op | 1.3748ms | 1.0317ms | 969.3061 Ops/s | 969.8524 Ops/s | |
test_serialize_model | 0.1282s | 0.1148s | 8.7087 Ops/s | 7.7747 Ops/s | |
test_serialize_model_pickle | 0.4480s | 0.4039s | 2.4757 Ops/s | 2.5397 Ops/s | |
test_serialize_weights | 0.1223s | 0.1162s | 8.6088 Ops/s | 8.5213 Ops/s | |
test_serialize_weights_returnearly | 0.1756s | 0.1626s | 6.1497 Ops/s | 6.3592 Ops/s | |
test_serialize_weights_pickle | 0.4571s | 0.4065s | 2.4602 Ops/s | 2.4163 Ops/s | |
test_serialize_weights_filesystem | 0.1516s | 0.1425s | 7.0156 Ops/s | 6.5531 Ops/s | |
test_serialize_model_filesystem | 0.1495s | 0.1459s | 6.8548 Ops/s | 6.8056 Ops/s | |
test_reshape_pytree | 94.8050μs | 40.2794μs | 24.8266 KOps/s | 24.8051 KOps/s | |
test_reshape_td | 0.1338ms | 47.5119μs | 21.0474 KOps/s | 21.1738 KOps/s | |
test_view_pytree | 94.7060μs | 40.1272μs | 24.9207 KOps/s | 25.1903 KOps/s | |
test_view_td | 0.1267ms | 53.1983μs | 18.7976 KOps/s | 18.4552 KOps/s | |
test_unbind_pytree | 86.2590μs | 37.8982μs | 26.3865 KOps/s | 27.4041 KOps/s | |
test_unbind_td | 0.3204ms | 46.4403μs | 21.5330 KOps/s | 21.4818 KOps/s | |
test_split_pytree | 94.3540μs | 40.8499μs | 24.4799 KOps/s | 25.1674 KOps/s | |
test_split_td | 0.2177ms | 59.3786μs | 16.8411 KOps/s | 17.1696 KOps/s | |
test_add_pytree | 0.1041ms | 47.8951μs | 20.8790 KOps/s | 21.9686 KOps/s | |
test_add_td | 0.1807ms | 80.5092μs | 12.4209 KOps/s | 12.1322 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1310ms | 54.6511μs | 18.2979 KOps/s | 18.4146 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3446ms | 0.1945ms | 5.1404 KOps/s | 5.3274 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2063ms | 56.6926μs | 17.6390 KOps/s | 19.0284 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3192ms | 0.1485ms | 6.7355 KOps/s | 6.9543 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 61.5140μs | 20.8671μs | 47.9223 KOps/s | 50.1398 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1447ms | 65.4801μs | 15.2718 KOps/s | 15.8053 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3363ms | 81.3999μs | 12.2850 KOps/s | 12.7859 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1501ms | 72.8725μs | 13.7226 KOps/s | 14.1349 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3504ms | 0.1744ms | 5.7334 KOps/s | 5.8510 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3613ms | 0.1933ms | 5.1722 KOps/s | 5.2465 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 97.6510μs | 38.6349μs | 25.8833 KOps/s | 25.2281 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 1.0529ms | 71.7898μs | 13.9296 KOps/s | 14.1460 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3698ms | 0.1728ms | 5.7882 KOps/s | 5.7541 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6878ms | 0.3001ms | 3.3327 KOps/s | 3.4548 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4287ms | 0.2057ms | 4.8620 KOps/s | 4.8857 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3735ms | 0.1787ms | 5.5959 KOps/s | 5.8176 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.7198ms | 64.7106μs | 15.4534 KOps/s | 15.9017 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 99.1530μs | 40.6556μs | 24.5968 KOps/s | 25.8910 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5412ms | 0.2478ms | 4.0362 KOps/s | 4.2052 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2822ms | 0.1730ms | 5.7819 KOps/s | 5.8809 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1951ms | 0.1075ms | 9.3021 KOps/s | 9.4058 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1104ms | 55.5561μs | 17.9998 KOps/s | 18.0139 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1715ms | 80.7482μs | 12.3842 KOps/s | 12.7377 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1656ms | 71.3678μs | 14.0119 KOps/s | 13.9240 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2697ms | 0.1914ms | 5.2236 KOps/s | 5.3113 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.8678ms | 1.6537ms | 604.6944 Ops/s | 621.7785 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2794ms | 0.1910ms | 5.2348 KOps/s | 5.3702 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.3281ms | 1.1350ms | 881.0272 Ops/s | 923.2316 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.5021ms | 0.4112ms | 2.4317 KOps/s | 2.4637 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.9857ms | 3.7774ms | 264.7313 Ops/s | 261.8057 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 96.5190μs | 32.9516μs | 30.3475 KOps/s | 31.6399 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 1.2591ms | 49.5726μs | 20.1724 KOps/s | 20.9221 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 72.5240μs | 29.2807μs | 34.1522 KOps/s | 36.6535 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1009ms | 31.3864μs | 31.8610 KOps/s | 32.9930 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 84.0460μs | 28.4930μs | 35.0964 KOps/s | 36.5035 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1596ms | 31.8903μs | 31.3575 KOps/s | 33.1593 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1635ms | 73.4165μs | 13.6209 KOps/s | 13.7831 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.6350ms | 28.6549μs | 34.8980 KOps/s | 35.6067 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1387ms | 67.3717μs | 14.8430 KOps/s | 14.6803 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 67.4960μs | 25.3644μs | 39.4253 KOps/s | 41.1316 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2251ms | 68.2512μs | 14.6518 KOps/s | 14.7045 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.2034ms | 25.6351μs | 39.0091 KOps/s | 40.4010 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1383ms | 72.6323μs | 13.7680 KOps/s | 13.8920 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.7745ms | 28.0911μs | 35.5985 KOps/s | 36.0421 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1408ms | 66.8520μs | 14.9584 KOps/s | 14.8647 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.2016ms | 25.2802μs | 39.5566 KOps/s | 40.9648 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1513ms | 67.5007μs | 14.8147 KOps/s | 14.7541 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.5283ms | 24.9505μs | 40.0793 KOps/s | 41.7207 KOps/s | |
test_mod_add[eager] | 98.5420μs | 23.7916μs | 42.0317 KOps/s | 39.5806 KOps/s | |
test_mod_add[compile] | 0.1035ms | 36.6101μs | 27.3149 KOps/s | 27.4537 KOps/s | |
test_mod_add[compile-overhead] | 91.5190μs | 37.5800μs | 26.6099 KOps/s | 27.6708 KOps/s | |
test_mod_wrap[eager] | 0.3991ms | 0.2117ms | 4.7234 KOps/s | 4.7892 KOps/s | |
test_mod_wrap[compile] | 1.5408ms | 0.2323ms | 4.3049 KOps/s | 4.3476 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4307ms | 0.2267ms | 4.4104 KOps/s | 4.4538 KOps/s | |
test_mod_wrap_and_backward[eager] | 14.5199ms | 12.1097ms | 82.5785 Ops/s | 95.0916 Ops/s | |
test_mod_wrap_and_backward[compile] | 14.4054ms | 11.4482ms | 87.3502 Ops/s | 94.4363 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 13.9203ms | 11.3330ms | 88.2375 Ops/s | 91.3610 Ops/s | |
test_seq_add[eager] | 0.1727ms | 84.9149μs | 11.7765 KOps/s | 11.5361 KOps/s | |
test_seq_add[compile] | 0.1485ms | 61.3860μs | 16.2903 KOps/s | 16.7167 KOps/s | |
test_seq_add[compile-overhead] | 0.1406ms | 59.8140μs | 16.7185 KOps/s | 17.2440 KOps/s | |
test_seq_wrap[eager] | 0.5820ms | 0.3707ms | 2.6977 KOps/s | 2.5899 KOps/s | |
test_seq_wrap[compile] | 0.5392ms | 0.2651ms | 3.7719 KOps/s | 3.8541 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4954ms | 0.2659ms | 3.7614 KOps/s | 3.8368 KOps/s | |
test_func_call_runtime[False-eager] | 1.0901ms | 0.5521ms | 1.8113 KOps/s | 1.8266 KOps/s | |
test_func_call_runtime[False-compile] | 0.5856ms | 0.5051ms | 1.9798 KOps/s | 2.0394 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7166ms | 0.5047ms | 1.9815 KOps/s | 2.0324 KOps/s | |
test_func_call_runtime[True-eager] | 1.3180ms | 0.7758ms | 1.2891 KOps/s | 1.3029 KOps/s | |
test_func_call_runtime[True-compile] | 0.6365ms | 0.5213ms | 1.9183 KOps/s | 1.9419 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6133ms | 0.5203ms | 1.9218 KOps/s | 1.9644 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7601ms | 0.5490ms | 1.8216 KOps/s | 1.8604 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6576ms | 0.5057ms | 1.9773 KOps/s | 2.0196 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.9694ms | 0.5143ms | 1.9445 KOps/s | 2.0355 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0870ms | 0.9088ms | 1.1003 KOps/s | 1.1032 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.2895ms | 0.8603ms | 1.1624 KOps/s | 1.1827 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1592ms | 0.8580ms | 1.1656 KOps/s | 1.1854 KOps/s | |
test_distributed | 0.2329ms | 0.1318ms | 7.5876 KOps/s | 7.5042 KOps/s | |
test_tdmodule | 32.3110μs | 15.6702μs | 63.8156 KOps/s | 57.2248 KOps/s | |
test_tdmodule_dispatch | 62.9970μs | 33.5435μs | 29.8121 KOps/s | 27.2204 KOps/s | |
test_tdseq | 36.9680μs | 17.7679μs | 56.2812 KOps/s | 50.6104 KOps/s | |
test_tdseq_dispatch | 58.5380μs | 37.5168μs | 26.6547 KOps/s | 24.5725 KOps/s | |
test_instantiation_functorch | 2.7846ms | 1.7005ms | 588.0561 Ops/s | 621.7762 Ops/s | |
test_instantiation_td | 1.8323ms | 1.1838ms | 844.7149 Ops/s | 846.9882 Ops/s | |
test_exec_functorch | 0.3194ms | 0.1832ms | 5.4594 KOps/s | 5.5664 KOps/s | |
test_exec_functional_call | 0.3867ms | 0.1771ms | 5.6467 KOps/s | 5.9946 KOps/s | |
test_exec_td | 0.3416ms | 0.1792ms | 5.5791 KOps/s | 5.9349 KOps/s | |
test_exec_td_decorator | 0.9130ms | 0.2323ms | 4.3039 KOps/s | 4.5025 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8735ms | 0.5701ms | 1.7542 KOps/s | 1.7356 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7924ms | 0.5642ms | 1.7723 KOps/s | 1.7441 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7027ms | 0.4718ms | 2.1198 KOps/s | 2.0922 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6566ms | 0.4664ms | 2.1439 KOps/s | 2.0924 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.1398ms | 0.6286ms | 1.5908 KOps/s | 1.5601 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8551ms | 0.6252ms | 1.5995 KOps/s | 1.5885 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7422ms | 0.5193ms | 1.9257 KOps/s | 1.9164 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.9484ms | 0.5228ms | 1.9127 KOps/s | 1.9104 KOps/s | |
test_to_module_speed[True] | 1.6584ms | 1.3315ms | 751.0444 Ops/s | 753.8746 Ops/s | |
test_to_module_speed[False] | 1.8604ms | 1.3009ms | 768.6949 Ops/s | 761.2653 Ops/s | |
test_tc_init | 82.9740μs | 41.9442μs | 23.8412 KOps/s | 21.7331 KOps/s | |
test_tc_init_nested | 0.1678ms | 84.7580μs | 11.7983 KOps/s | 10.6907 KOps/s | |
test_tc_first_layer_tensor | 27.5710μs | 1.4539μs | 687.7862 KOps/s | 698.2048 KOps/s | |
test_tc_first_layer_nontensor | 23.7440μs | 4.2764μs | 233.8428 KOps/s | 240.2425 KOps/s | |
test_tc_second_layer_tensor | 23.9450μs | 2.7158μs | 368.2097 KOps/s | 370.4388 KOps/s | |
test_tc_second_layer_nontensor | 24.2650μs | 5.5411μs | 180.4696 KOps/s | 182.4403 KOps/s | |
test_unbind | 0.4325s | 13.2882ms | 75.2545 Ops/s | 73.1566 Ops/s | |
test_full_like | 8.4628ms | 6.7542ms | 148.0551 Ops/s | 133.6461 Ops/s | |
test_zeros_like | 13.9228ms | 6.9743ms | 143.3832 Ops/s | 159.9856 Ops/s | |
test_ones_like | 12.5633ms | 6.9517ms | 143.8506 Ops/s | 134.9084 Ops/s | |
test_clone | 12.8745ms | 8.5392ms | 117.1072 Ops/s | 114.9423 Ops/s | |
test_squeeze | 62.1550μs | 13.0625μs | 76.5547 KOps/s | 80.2696 KOps/s | |
test_unsqueeze | 0.3107ms | 97.1161μs | 10.2970 KOps/s | 10.5507 KOps/s | |
test_split | 0.3311ms | 0.2022ms | 4.9468 KOps/s | 4.7917 KOps/s | |
test_permute | 0.3922ms | 0.2234ms | 4.4761 KOps/s | 4.5699 KOps/s | |
test_stack | 30.6502ms | 23.5470ms | 42.4683 Ops/s | 42.4469 Ops/s | |
test_cat | 28.7311ms | 23.3932ms | 42.7474 Ops/s | 42.6963 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.1578ms | 16.9765μs | 58.9049 KOps/s | 57.2754 KOps/s | |
test_plain_set_stack_nested | 46.9910μs | 17.1008μs | 58.4768 KOps/s | 57.1281 KOps/s | |
test_plain_set_nested_inplace | 43.6110μs | 18.1689μs | 55.0391 KOps/s | 53.4029 KOps/s | |
test_plain_set_stack_nested_inplace | 38.9910μs | 17.9279μs | 55.7789 KOps/s | 54.3317 KOps/s | |
test_items | 21.2200μs | 4.6282μs | 216.0669 KOps/s | 217.7042 KOps/s | |
test_items_nested | 0.4318ms | 0.3773ms | 2.6502 KOps/s | 2.6599 KOps/s | |
test_items_nested_locked | 1.1073ms | 0.3791ms | 2.6377 KOps/s | 2.6349 KOps/s | |
test_items_nested_leaf | 0.1121ms | 84.4747μs | 11.8379 KOps/s | 11.9638 KOps/s | |
test_items_stack_nested | 0.4333ms | 0.3778ms | 2.6469 KOps/s | 2.6692 KOps/s | |
test_items_stack_nested_leaf | 0.1053ms | 84.5800μs | 11.8231 KOps/s | 12.0498 KOps/s | |
test_items_stack_nested_locked | 0.4438ms | 0.3828ms | 2.6123 KOps/s | 2.6492 KOps/s | |
test_keys | 18.1300μs | 4.3457μs | 230.1128 KOps/s | 230.0569 KOps/s | |
test_keys_nested | 92.5520μs | 66.8844μs | 14.9512 KOps/s | 14.8044 KOps/s | |
test_keys_nested_locked | 0.7036ms | 74.3532μs | 13.4493 KOps/s | 14.1445 KOps/s | |
test_keys_nested_leaf | 81.7920μs | 57.0787μs | 17.5197 KOps/s | 17.2120 KOps/s | |
test_keys_stack_nested | 93.5020μs | 66.3329μs | 15.0755 KOps/s | 15.1736 KOps/s | |
test_keys_stack_nested_leaf | 88.1310μs | 59.1508μs | 16.9059 KOps/s | 17.2857 KOps/s | |
test_keys_stack_nested_locked | 0.1028ms | 73.6836μs | 13.5715 KOps/s | 14.0617 KOps/s | |
test_values | 6.4937μs | 1.7716μs | 564.4603 KOps/s | 569.2887 KOps/s | |
test_values_nested | 53.4510μs | 33.6180μs | 29.7460 KOps/s | 29.4242 KOps/s | |
test_values_nested_locked | 51.4110μs | 35.7707μs | 27.9559 KOps/s | 28.0696 KOps/s | |
test_values_nested_leaf | 52.9810μs | 29.9547μs | 33.3838 KOps/s | 33.3143 KOps/s | |
test_values_stack_nested | 52.9210μs | 34.4891μs | 28.9947 KOps/s | 28.8954 KOps/s | |
test_values_stack_nested_leaf | 44.7210μs | 30.4323μs | 32.8599 KOps/s | 32.6068 KOps/s | |
test_values_stack_nested_locked | 51.8410μs | 36.5823μs | 27.3357 KOps/s | 27.5618 KOps/s | |
test_membership | 1.6121μs | 0.5407μs | 1.8494 MOps/s | 1.8422 MOps/s | |
test_membership_nested | 9.6305μs | 1.9516μs | 512.3966 KOps/s | 516.0356 KOps/s | |
test_membership_nested_leaf | 9.9950μs | 1.9502μs | 512.7714 KOps/s | 504.5392 KOps/s | |
test_membership_stacked_nested | 24.4400μs | 2.0266μs | 493.4475 KOps/s | 492.8699 KOps/s | |
test_membership_stacked_nested_leaf | 18.9010μs | 2.0083μs | 497.9367 KOps/s | 499.3267 KOps/s | |
test_membership_nested_last | 16.6200μs | 2.9020μs | 344.5869 KOps/s | 343.6778 KOps/s | |
test_membership_nested_leaf_last | 21.2600μs | 2.8835μs | 346.8040 KOps/s | 341.6704 KOps/s | |
test_membership_stacked_nested_last | 27.4000μs | 3.3320μs | 300.1215 KOps/s | 110.2955 KOps/s | |
test_membership_stacked_nested_leaf_last | 28.2810μs | 3.3438μs | 299.0595 KOps/s | 109.1984 KOps/s | |
test_nested_getleaf | 24.4910μs | 7.7710μs | 128.6837 KOps/s | 126.1762 KOps/s | |
test_nested_get | 22.0600μs | 7.3442μs | 136.1614 KOps/s | 134.5471 KOps/s | |
test_stacked_getleaf | 26.3300μs | 7.8283μs | 127.7422 KOps/s | 125.8765 KOps/s | |
test_stacked_get | 23.9210μs | 7.3506μs | 136.0429 KOps/s | 133.9643 KOps/s | |
test_nested_getitemleaf | 23.9200μs | 8.1668μs | 122.4476 KOps/s | 123.6936 KOps/s | |
test_nested_getitem | 40.8010μs | 7.6698μs | 130.3818 KOps/s | 131.4945 KOps/s | |
test_stacked_getitemleaf | 25.1010μs | 8.2339μs | 121.4494 KOps/s | 122.9865 KOps/s | |
test_stacked_getitem | 22.5500μs | 7.6983μs | 129.8980 KOps/s | 131.6296 KOps/s | |
test_lock_nested | 0.9616ms | 0.4754ms | 2.1034 KOps/s | 2.1012 KOps/s | |
test_lock_stack_nested | 0.4776ms | 0.4339ms | 2.3048 KOps/s | 2.3516 KOps/s | |
test_unlock_nested | 0.8246ms | 0.3992ms | 2.5050 KOps/s | 2.5208 KOps/s | |
test_unlock_stack_nested | 0.4261ms | 0.3549ms | 2.8175 KOps/s | 2.9026 KOps/s | |
test_flatten_speed | 90.0408ms | 0.1186ms | 8.4291 KOps/s | 9.6185 KOps/s | |
test_unflatten_speed | 0.3534ms | 0.2880ms | 3.4716 KOps/s | 3.4686 KOps/s | |
test_common_ops | 1.5441ms | 1.3423ms | 744.9674 Ops/s | 733.6021 Ops/s | |
test_creation | 25.4710μs | 1.6507μs | 605.8012 KOps/s | 613.7691 KOps/s | |
test_creation_empty | 38.8300μs | 17.3532μs | 57.6262 KOps/s | 54.6154 KOps/s | |
test_creation_nested_1 | 41.0910μs | 19.4944μs | 51.2969 KOps/s | 48.8517 KOps/s | |
test_creation_nested_2 | 40.0210μs | 22.4497μs | 44.5440 KOps/s | 43.5444 KOps/s | |
test_clone | 52.8010μs | 31.4568μs | 31.7897 KOps/s | 32.4766 KOps/s | |
test_getitem[int] | 1.1121ms | 17.1712μs | 58.2372 KOps/s | 58.8562 KOps/s | |
test_getitem[slice_int] | 0.1398ms | 29.6554μs | 33.7207 KOps/s | 33.8283 KOps/s | |
test_getitem[range] | 0.2576ms | 0.1162ms | 8.6032 KOps/s | 8.6207 KOps/s | |
test_getitem[tuple] | 0.1445ms | 25.6476μs | 38.9900 KOps/s | 38.8235 KOps/s | |
test_getitem[list] | 0.2249ms | 0.1051ms | 9.5112 KOps/s | 9.5189 KOps/s | |
test_setitem_dim[int] | 72.8920μs | 54.1328μs | 18.4731 KOps/s | 17.5885 KOps/s | |
test_setitem_dim[slice_int] | 0.1019ms | 80.1727μs | 12.4731 KOps/s | 12.2017 KOps/s | |
test_setitem_dim[range] | 0.1710ms | 0.1434ms | 6.9757 KOps/s | 6.8954 KOps/s | |
test_setitem_dim[tuple] | 94.7220μs | 72.4383μs | 13.8049 KOps/s | 13.4686 KOps/s | |
test_setitem | 81.8020μs | 47.7317μs | 20.9504 KOps/s | 20.7178 KOps/s | |
test_set | 66.9710μs | 43.7985μs | 22.8318 KOps/s | 21.2745 KOps/s | |
test_set_shared | 0.4099ms | 54.5723μs | 18.3243 KOps/s | 18.0295 KOps/s | |
test_update | 79.9320μs | 52.9397μs | 18.8894 KOps/s | 18.5934 KOps/s | |
test_update_nested | 88.0420μs | 60.4944μs | 16.5304 KOps/s | 16.2961 KOps/s | |
test_update__nested | 98.3420μs | 62.9497μs | 15.8857 KOps/s | 16.0078 KOps/s | |
test_set_nested | 76.1310μs | 46.7768μs | 21.3781 KOps/s | 21.4173 KOps/s | |
test_set_nested_new | 76.2020μs | 51.0482μs | 19.5893 KOps/s | 18.5966 KOps/s | |
test_select | 87.8220μs | 65.8510μs | 15.1858 KOps/s | 15.1801 KOps/s | |
test_select_nested | 68.4810μs | 50.9305μs | 19.6346 KOps/s | 19.3591 KOps/s | |
test_exclude_nested | 98.5220μs | 71.2919μs | 14.0268 KOps/s | 14.3590 KOps/s | |
test_empty[True] | 0.3486ms | 0.2912ms | 3.4339 KOps/s | 3.4219 KOps/s | |
test_empty[False] | 3.0531μs | 0.8716μs | 1.1473 MOps/s | 1.1663 MOps/s | |
test_to | 45.5510μs | 27.9478μs | 35.7810 KOps/s | 38.5401 KOps/s | |
test_to_nonblocking | 47.9310μs | 26.7909μs | 37.3261 KOps/s | 39.0811 KOps/s | |
test_unbind_speed | 0.3567ms | 0.3081ms | 3.2457 KOps/s | 3.2577 KOps/s | |
test_unbind_speed_stack0 | 0.3483ms | 0.3036ms | 3.2938 KOps/s | 3.3390 KOps/s | |
test_unbind_speed_stack1 | 90.4265ms | 0.7706ms | 1.2976 KOps/s | 1.3001 KOps/s | |
test_split | 92.4321ms | 2.4042ms | 415.9466 Ops/s | 417.9250 Ops/s | |
test_chunk | 2.3509ms | 2.2244ms | 449.5543 Ops/s | 416.1843 Ops/s | |
test_creation[device0] | 0.1592ms | 0.1064ms | 9.4028 KOps/s | 9.5355 KOps/s | |
test_creation_from_tensor | 0.1576ms | 0.1037ms | 9.6456 KOps/s | 9.6517 KOps/s | |
test_add_one[memmap_tensor0] | 64.4820μs | 9.2437μs | 108.1813 KOps/s | 106.0869 KOps/s | |
test_contiguous[memmap_tensor0] | 16.9310μs | 2.2429μs | 445.8528 KOps/s | 447.8576 KOps/s | |
test_stack[memmap_tensor0] | 22.2910μs | 7.0367μs | 142.1111 KOps/s | 142.0698 KOps/s | |
test_memmaptd_index | 1.1895ms | 0.4327ms | 2.3112 KOps/s | 2.3164 KOps/s | |
test_memmaptd_index_astensor | 0.7684ms | 0.5013ms | 1.9948 KOps/s | 1.9925 KOps/s | |
test_memmaptd_index_op | 1.5209ms | 1.0897ms | 917.6587 Ops/s | 916.8321 Ops/s | |
test_serialize_model | 93.1758ms | 88.1959ms | 11.3384 Ops/s | 10.9377 Ops/s | |
test_serialize_model_pickle | 1.3509s | 1.2365s | 0.8087 Ops/s | 0.8083 Ops/s | |
test_serialize_weights | 89.0198ms | 85.9217ms | 11.6385 Ops/s | 10.0035 Ops/s | |
test_serialize_weights_returnearly | 0.1906s | 63.4534ms | 15.7596 Ops/s | 17.9929 Ops/s | |
test_serialize_weights_pickle | 1.3538s | 1.2370s | 0.8084 Ops/s | 0.8039 Ops/s | |
test_reshape_pytree | 72.7910μs | 38.2000μs | 26.1780 KOps/s | 26.7061 KOps/s | |
test_reshape_td | 72.3510μs | 44.7173μs | 22.3627 KOps/s | 22.8876 KOps/s | |
test_view_pytree | 81.5310μs | 38.2903μs | 26.1163 KOps/s | 26.8368 KOps/s | |
test_view_td | 75.1820μs | 50.6488μs | 19.7438 KOps/s | 20.3494 KOps/s | |
test_unbind_pytree | 62.6010μs | 37.3663μs | 26.7621 KOps/s | 26.7782 KOps/s | |
test_unbind_td | 0.4136ms | 47.0647μs | 21.2473 KOps/s | 21.8968 KOps/s | |
test_split_pytree | 92.1020μs | 51.1315μs | 19.5574 KOps/s | 19.0056 KOps/s | |
test_split_td | 91.8803ms | 71.3875μs | 14.0081 KOps/s | 15.9408 KOps/s | |
test_add_pytree | 92.4820μs | 64.8685μs | 15.4158 KOps/s | 15.4714 KOps/s | |
test_add_td | 0.2364ms | 0.1022ms | 9.7800 KOps/s | 9.6169 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4376ms | 0.2192ms | 4.5626 KOps/s | 4.5962 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2685ms | 0.1747ms | 5.7242 KOps/s | 5.7128 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1996ms | 0.1504ms | 6.6483 KOps/s | 6.5658 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2503ms | 0.1951ms | 5.1254 KOps/s | 4.8698 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 58.1710μs | 24.3533μs | 41.0623 KOps/s | 43.7914 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 75.0310μs | 49.1515μs | 20.3453 KOps/s | 20.5713 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1215ms | 73.9992μs | 13.5137 KOps/s | 13.6639 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1909ms | 61.3351μs | 16.3039 KOps/s | 16.7554 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4977ms | 0.3457ms | 2.8928 KOps/s | 2.9365 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3440ms | 0.2305ms | 4.3387 KOps/s | 4.4336 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2177ms | 0.1390ms | 7.1968 KOps/s | 7.3927 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1412ms | 66.6323μs | 15.0077 KOps/s | 15.5799 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3997ms | 0.3377ms | 2.9616 KOps/s | 2.9623 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7302ms | 0.6405ms | 1.5612 KOps/s | 1.4897 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3370ms | 0.2705ms | 3.6970 KOps/s | 3.6511 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3925ms | 0.3405ms | 2.9372 KOps/s | 2.9315 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1573ms | 77.7264μs | 12.8656 KOps/s | 13.2706 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1864ms | 0.1347ms | 7.4231 KOps/s | 7.3483 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6586ms | 0.5426ms | 1.8431 KOps/s | 1.8549 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3882ms | 0.3364ms | 2.9729 KOps/s | 2.9530 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 42.0710μs | 19.6147μs | 50.9821 KOps/s | 53.1888 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 59.1810μs | 32.4912μs | 30.7776 KOps/s | 30.0056 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1079ms | 77.2580μs | 12.9436 KOps/s | 13.1618 KOps/s | |
test_compile_copy_flat[pytree-eager] | 83.5210μs | 60.4384μs | 16.5458 KOps/s | 16.5990 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.4699ms | 0.8551ms | 1.1694 KOps/s | 1.0698 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 4.5452ms | 3.4178ms | 292.5868 Ops/s | 286.1842 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.4402ms | 0.8457ms | 1.1825 KOps/s | 1.0811 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.4349ms | 3.3434ms | 299.0923 Ops/s | 298.1214 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1716ms | 0.1147ms | 8.7197 KOps/s | 8.6002 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2090ms | 62.9410μs | 15.8879 KOps/s | 15.6710 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1516ms | 0.1053ms | 9.4957 KOps/s | 9.3547 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 80.4410μs | 46.0407μs | 21.7199 KOps/s | 21.0803 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1472ms | 0.1051ms | 9.5119 KOps/s | 9.3783 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 75.3810μs | 45.6125μs | 21.9238 KOps/s | 20.3847 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2000ms | 0.1420ms | 7.0411 KOps/s | 6.9904 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1775ms | 26.8175μs | 37.2891 KOps/s | 37.1080 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1981ms | 0.1341ms | 7.4551 KOps/s | 7.4026 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 60.8810μs | 22.7340μs | 43.9871 KOps/s | 43.7273 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1781ms | 0.1343ms | 7.4469 KOps/s | 7.4469 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 56.3210μs | 22.8956μs | 43.6765 KOps/s | 44.4954 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2153ms | 0.1414ms | 7.0707 KOps/s | 7.0341 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4795ms | 26.1655μs | 38.2183 KOps/s | 35.5041 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2662ms | 0.1340ms | 7.4639 KOps/s | 7.4344 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1768ms | 22.5345μs | 44.3764 KOps/s | 44.4304 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1786ms | 0.1338ms | 7.4730 KOps/s | 7.4217 KOps/s | |
test_compile_indexing[int-pytree-eager] | 53.5910μs | 22.5968μs | 44.2541 KOps/s | 44.3856 KOps/s | |
test_mod_add[eager] | 63.7410μs | 34.8800μs | 28.6697 KOps/s | 29.4468 KOps/s | |
test_mod_add[compile] | 0.1129ms | 70.3787μs | 14.2088 KOps/s | 13.3569 KOps/s | |
test_mod_add[compile-overhead] | 0.2628ms | 0.1377ms | 7.2622 KOps/s | 6.3704 KOps/s | |
test_mod_wrap[eager] | 0.3452ms | 0.2586ms | 3.8663 KOps/s | 3.7514 KOps/s | |
test_mod_wrap[compile] | 1.1010ms | 0.3059ms | 3.2690 KOps/s | 3.1281 KOps/s | |
test_mod_wrap[compile-overhead] | 8.1236ms | 4.2734ms | 234.0036 Ops/s | 230.8095 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4790ms | 1.3728ms | 728.4191 Ops/s | 677.9674 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4860ms | 1.3668ms | 731.6195 Ops/s | 680.0652 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3489ms | 0.9187ms | 1.0885 KOps/s | 889.2591 Ops/s | |
test_seq_add[eager] | 0.3069ms | 0.1020ms | 9.8016 KOps/s | 9.5894 KOps/s | |
test_seq_add[compile] | 0.2970ms | 85.5638μs | 11.6872 KOps/s | 11.9971 KOps/s | |
test_seq_add[compile-overhead] | 0.3414ms | 0.1208ms | 8.2815 KOps/s | 8.3352 KOps/s | |
test_seq_wrap[eager] | 0.6195ms | 0.4023ms | 2.4857 KOps/s | 2.3827 KOps/s | |
test_seq_wrap[compile] | 0.5429ms | 0.3260ms | 3.0672 KOps/s | 2.9237 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2747ms | 0.2289ms | 4.3694 KOps/s | 4.2866 KOps/s | |
test_func_call_runtime[False-eager] | 0.9874ms | 0.7498ms | 1.3337 KOps/s | 1.2486 KOps/s | |
test_func_call_runtime[False-compile] | 1.0060ms | 0.8085ms | 1.2369 KOps/s | 1.2194 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5824ms | 0.3711ms | 2.6946 KOps/s | 2.6675 KOps/s | |
test_func_call_runtime[True-eager] | 1.1474ms | 0.9419ms | 1.0617 KOps/s | 1.0450 KOps/s | |
test_func_call_runtime[True-compile] | 1.0706ms | 0.8592ms | 1.1638 KOps/s | 1.1533 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6173ms | 0.4208ms | 2.3765 KOps/s | 2.3686 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9496ms | 0.7466ms | 1.3394 KOps/s | 1.2989 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.0341ms | 0.8170ms | 1.2239 KOps/s | 1.2059 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5892ms | 0.3732ms | 2.6794 KOps/s | 2.6430 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.2584ms | 1.0522ms | 950.3497 Ops/s | 930.4702 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.2355ms | 1.0276ms | 973.1740 Ops/s | 938.1111 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2554ms | 1.0333ms | 967.8158 Ops/s | 961.5262 Ops/s | |
test_distributed | 2.6051ms | 71.5316μs | 13.9798 KOps/s | 13.8827 KOps/s | |
test_tdmodule | 31.3300μs | 16.0299μs | 62.3835 KOps/s | 58.3206 KOps/s | |
test_tdmodule_dispatch | 50.0510μs | 33.2261μs | 30.0969 KOps/s | 28.1803 KOps/s | |
test_tdseq | 33.1700μs | 16.8751μs | 59.2589 KOps/s | 56.6576 KOps/s | |
test_tdseq_dispatch | 52.8110μs | 35.2405μs | 28.3765 KOps/s | 27.0708 KOps/s | |
test_instantiation_functorch | 2.2029ms | 2.0176ms | 495.6323 Ops/s | 486.9631 Ops/s | |
test_instantiation_td | 2.0310ms | 1.3270ms | 753.5641 Ops/s | 753.0713 Ops/s | |
test_exec_functorch | 0.2715ms | 0.2212ms | 4.5211 KOps/s | 4.5209 KOps/s | |
test_exec_functional_call | 0.4202ms | 0.2168ms | 4.6128 KOps/s | 4.3437 KOps/s | |
test_exec_td | 0.2954ms | 0.2262ms | 4.4204 KOps/s | 4.1202 KOps/s | |
test_exec_td_decorator | 1.1661ms | 0.2787ms | 3.5875 KOps/s | 3.3550 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8569ms | 0.6631ms | 1.5080 KOps/s | 1.5090 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.9949ms | 0.6541ms | 1.5288 KOps/s | 1.4814 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7981ms | 0.5885ms | 1.6991 KOps/s | 1.7413 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.7712ms | 0.5746ms | 1.7403 KOps/s | 1.6920 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9052ms | 0.7088ms | 1.4108 KOps/s | 1.3474 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.1443ms | 0.7089ms | 1.4106 KOps/s | 1.3421 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8121ms | 0.6307ms | 1.5856 KOps/s | 1.5393 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8157ms | 0.6193ms | 1.6147 KOps/s | 1.5432 KOps/s | |
test_vmap_transformer_speed[True-True] | 9.0945ms | 8.7604ms | 114.1494 Ops/s | 110.0276 Ops/s | |
test_vmap_transformer_speed[True-False] | 9.1339ms | 8.7228ms | 114.6416 Ops/s | 113.4946 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.9995ms | 8.6670ms | 115.3799 Ops/s | 115.5823 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.9454ms | 8.6248ms | 115.9448 Ops/s | 115.6647 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 21.3443ms | 20.6450ms | 48.4379 Ops/s | 48.4295 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 21.4099ms | 20.6189ms | 48.4993 Ops/s | 48.5113 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 21.3761ms | 20.5710ms | 48.6122 Ops/s | 49.0294 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 21.0948ms | 20.4493ms | 48.9015 Ops/s | 48.7797 Ops/s | |
test_to_module_speed[True] | 2.3941ms | 1.1481ms | 870.9941 Ops/s | 871.7703 Ops/s | |
test_to_module_speed[False] | 1.6056ms | 1.1329ms | 882.6687 Ops/s | 880.1868 Ops/s | |
test_tc_init | 0.2316ms | 40.7266μs | 24.5540 KOps/s | 24.3895 KOps/s | |
test_tc_init_nested | 0.1100ms | 82.8175μs | 12.0747 KOps/s | 11.8535 KOps/s | |
test_tc_first_layer_tensor | 31.7272μs | 0.7844μs | 1.2748 MOps/s | 1.2679 MOps/s | |
test_tc_first_layer_nontensor | 0.1886ms | 2.5487μs | 392.3557 KOps/s | 389.7526 KOps/s | |
test_tc_second_layer_tensor | 41.2843μs | 1.5962μs | 626.4937 KOps/s | 612.0610 KOps/s | |
test_tc_second_layer_nontensor | 0.1901ms | 3.3941μs | 294.6288 KOps/s | 286.7412 KOps/s | |
test_unbind | 0.1844s | 10.5493ms | 94.7926 Ops/s | 68.4038 Ops/s | |
test_full_like | 0.1695s | 0.6590ms | 1.5174 KOps/s | 1.7321 KOps/s | |
test_zeros_like | 0.2690ms | 0.1974ms | 5.0671 KOps/s | 5.0604 KOps/s | |
test_ones_like | 0.3379ms | 0.1974ms | 5.0650 KOps/s | 5.0641 KOps/s | |
test_clone | 0.5954ms | 0.4144ms | 2.4134 KOps/s | 2.4124 KOps/s | |
test_squeeze | 28.4200μs | 10.6891μs | 93.5536 KOps/s | 92.7185 KOps/s | |
test_unsqueeze | 0.2761ms | 83.8440μs | 11.9269 KOps/s | 12.3919 KOps/s | |
test_split | 0.4739ms | 0.1775ms | 5.6343 KOps/s | 5.5620 KOps/s | |
test_permute | 0.2328ms | 0.1951ms | 5.1259 KOps/s | 5.2130 KOps/s | |
test_stack | 1.3845ms | 0.9106ms | 1.0982 KOps/s | 1.1087 KOps/s | |
test_cat | 1.2678ms | 1.2311ms | 812.3070 Ops/s | 812.1332 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
BC-breaking
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
dict.get defaults to None, but TD.get doesn't.
This PR solves this discrepancy.
Note that this is a BC-breaking change.