[Feature] Make ProbabilisticTensorDictSequential account for more than one distribution by vmoens · Pull Request #1114 · pytorch/tensordict · GitHub

[Feature] Make ProbabilisticTensorDictSequential account for more than one distribution #1114


Merged
merged 8 commits into from
Dec 2, 2024

Conversation

vmoens
Collaborator
@vmoens vmoens commented Nov 26, 2024

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Nov 26, 2024
…n one distribution

ghstack-source-id: e00c8dc
Pull Request resolved: #1114
@facebook-github-bot facebook-github-bot added the CLA Signed label Nov 26, 2024
github-actions bot commented Nov 26, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}11$. Worsened: $\large\color{#d91a1a}33$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 37.4690μs 17.9191μs 55.8063 KOps/s 60.4530 KOps/s $\textbf{\color{#d91a1a}-7.69\%}$
test_plain_set_stack_nested 51.7260μs 18.1616μs 55.0613 KOps/s 60.1200 KOps/s $\textbf{\color{#d91a1a}-8.41\%}$
test_plain_set_nested_inplace 53.0290μs 19.3849μs 51.5865 KOps/s 53.7500 KOps/s $\color{#d91a1a}-4.03\%$
test_plain_set_stack_nested_inplace 62.5760μs 19.7311μs 50.6815 KOps/s 53.4434 KOps/s $\textbf{\color{#d91a1a}-5.17\%}$
test_items 20.8680μs 4.0726μs 245.5462 KOps/s 240.5885 KOps/s $\color{#35bf28}+2.06\%$
test_items_nested 0.6102ms 0.4003ms 2.4981 KOps/s 2.5100 KOps/s $\color{#d91a1a}-0.47\%$
test_items_nested_locked 0.5583ms 0.4015ms 2.4908 KOps/s 2.5010 KOps/s $\color{#d91a1a}-0.41\%$
test_items_nested_leaf 0.1519ms 71.3814μs 14.0092 KOps/s 14.0945 KOps/s $\color{#d91a1a}-0.61\%$
test_items_stack_nested 0.5691ms 0.4032ms 2.4803 KOps/s 2.5054 KOps/s $\color{#d91a1a}-1.00\%$
test_items_stack_nested_leaf 0.1291ms 71.3056μs 14.0241 KOps/s 13.7833 KOps/s $\color{#35bf28}+1.75\%$
test_items_stack_nested_locked 0.7453ms 0.4052ms 2.4680 KOps/s 2.4803 KOps/s $\color{#d91a1a}-0.49\%$
test_keys 47.2080μs 3.5072μs 285.1304 KOps/s 211.1447 KOps/s $\textbf{\color{#35bf28}+35.04\%}$
test_keys_nested 0.2287ms 0.1444ms 6.9254 KOps/s 7.0646 KOps/s $\color{#d91a1a}-1.97\%$
test_keys_nested_locked 1.7091ms 0.1500ms 6.6678 KOps/s 6.7340 KOps/s $\color{#d91a1a}-0.98\%$
test_keys_nested_leaf 0.2117ms 0.1256ms 7.9624 KOps/s 8.1380 KOps/s $\color{#d91a1a}-2.16\%$
test_keys_stack_nested 0.2280ms 0.1440ms 6.9454 KOps/s 7.0696 KOps/s $\color{#d91a1a}-1.76\%$
test_keys_stack_nested_leaf 0.1813ms 0.1254ms 7.9721 KOps/s 8.1168 KOps/s $\color{#d91a1a}-1.78\%$
test_keys_stack_nested_locked 0.2425ms 0.1495ms 6.6876 KOps/s 6.6593 KOps/s $\color{#35bf28}+0.42\%$
test_values 8.6060μs 1.0260μs 974.6721 KOps/s 951.3335 KOps/s $\color{#35bf28}+2.45\%$
test_values_nested 0.1115ms 55.5164μs 18.0127 KOps/s 18.0875 KOps/s $\color{#d91a1a}-0.41\%$
test_values_nested_locked 0.1140ms 55.3308μs 18.0731 KOps/s 16.6068 KOps/s $\textbf{\color{#35bf28}+8.83\%}$
test_values_nested_leaf 0.1335ms 60.5095μs 16.5263 KOps/s 16.5714 KOps/s $\color{#d91a1a}-0.27\%$
test_values_stack_nested 0.1068ms 55.3541μs 18.0655 KOps/s 17.6547 KOps/s $\color{#35bf28}+2.33\%$
test_values_stack_nested_leaf 0.1183ms 61.2435μs 16.3283 KOps/s 16.6711 KOps/s $\color{#d91a1a}-2.06\%$
test_values_stack_nested_locked 0.1093ms 55.4125μs 18.0465 KOps/s 18.1941 KOps/s $\color{#d91a1a}-0.81\%$
test_membership 2.9946μs 0.7117μs 1.4050 MOps/s 1.1005 MOps/s $\textbf{\color{#35bf28}+27.67\%}$
test_membership_nested 21.1600μs 2.9090μs 343.7630 KOps/s 333.4704 KOps/s $\color{#35bf28}+3.09\%$
test_membership_nested_leaf 44.6630μs 2.9593μs 337.9130 KOps/s 331.7004 KOps/s $\color{#35bf28}+1.87\%$
test_membership_stacked_nested 25.7670μs 2.8884μs 346.2090 KOps/s 320.9830 KOps/s $\textbf{\color{#35bf28}+7.86\%}$
test_membership_stacked_nested_leaf 23.7450μs 2.9103μs 343.6106 KOps/s 336.6981 KOps/s $\color{#35bf28}+2.05\%$
test_membership_nested_last 50.4440μs 4.2449μs 235.5763 KOps/s 231.8551 KOps/s $\color{#35bf28}+1.60\%$
test_membership_nested_leaf_last 24.3260μs 4.2107μs 237.4928 KOps/s 231.1480 KOps/s $\color{#35bf28}+2.74\%$
test_membership_stacked_nested_last 57.6870μs 4.2149μs 237.2518 KOps/s 230.7567 KOps/s $\color{#35bf28}+2.81\%$
test_membership_stacked_nested_leaf_last 38.0410μs 4.2444μs 235.6058 KOps/s 233.3149 KOps/s $\color{#35bf28}+0.98\%$
test_nested_getleaf 55.4230μs 10.9300μs 91.4909 KOps/s 91.9200 KOps/s $\color{#d91a1a}-0.47\%$
test_nested_get 54.2610μs 10.3542μs 96.5796 KOps/s 97.8202 KOps/s $\color{#d91a1a}-1.27\%$
test_stacked_getleaf 34.7640μs 10.9472μs 91.3479 KOps/s 91.1710 KOps/s $\color{#35bf28}+0.19\%$
test_stacked_get 50.1530μs 10.2752μs 97.3220 KOps/s 97.7904 KOps/s $\color{#d91a1a}-0.48\%$
test_nested_getitemleaf 62.7660μs 11.3285μs 88.2732 KOps/s 89.4957 KOps/s $\color{#d91a1a}-1.37\%$
test_nested_getitem 43.3600μs 10.5330μs 94.9393 KOps/s 95.8081 KOps/s $\color{#d91a1a}-0.91\%$
test_stacked_getitemleaf 53.3290μs 11.2019μs 89.2710 KOps/s 88.8412 KOps/s $\color{#35bf28}+0.48\%$
test_stacked_getitem 38.3410μs 10.5866μs 94.4588 KOps/s 95.4084 KOps/s $\color{#d91a1a}-1.00\%$
test_lock_nested 3.2856ms 0.4532ms 2.2068 KOps/s 2.2137 KOps/s $\color{#d91a1a}-0.31\%$
test_lock_stack_nested 0.6806ms 0.4229ms 2.3649 KOps/s 2.3736 KOps/s $\color{#d91a1a}-0.37\%$
test_unlock_nested 0.7302ms 0.3640ms 2.7469 KOps/s 2.6865 KOps/s $\color{#35bf28}+2.25\%$
test_unlock_stack_nested 0.8728ms 0.3399ms 2.9424 KOps/s 2.9372 KOps/s $\color{#35bf28}+0.18\%$
test_flatten_speed 0.1816ms 95.2715μs 10.4963 KOps/s 10.3133 KOps/s $\color{#35bf28}+1.77\%$
test_unflatten_speed 1.0873ms 0.4989ms 2.0045 KOps/s 2.0409 KOps/s $\color{#d91a1a}-1.79\%$
test_common_ops 1.6208ms 0.8079ms 1.2378 KOps/s 1.3304 KOps/s $\textbf{\color{#d91a1a}-6.96\%}$
test_creation 94.5160μs 2.2171μs 451.0463 KOps/s 480.4217 KOps/s $\textbf{\color{#d91a1a}-6.11\%}$
test_creation_empty 45.8250μs 11.3807μs 87.8678 KOps/s 111.4736 KOps/s $\textbf{\color{#d91a1a}-21.18\%}$
test_creation_nested_1 49.6020μs 14.1437μs 70.7027 KOps/s 85.2093 KOps/s $\textbf{\color{#d91a1a}-17.02\%}$
test_creation_nested_2 54.7220μs 19.0936μs 52.3735 KOps/s 62.2832 KOps/s $\textbf{\color{#d91a1a}-15.91\%}$
test_clone 0.1133ms 13.5095μs 74.0222 KOps/s 74.7172 KOps/s $\color{#d91a1a}-0.93\%$
test_getitem[int] 1.0623ms 12.8391μs 77.8869 KOps/s 78.4085 KOps/s $\color{#d91a1a}-0.67\%$
test_getitem[slice_int] 0.1424ms 25.3815μs 39.3987 KOps/s 39.9834 KOps/s $\color{#d91a1a}-1.46\%$
test_getitem[range] 0.2399ms 50.4549μs 19.8197 KOps/s 20.6314 KOps/s $\color{#d91a1a}-3.93\%$
test_getitem[tuple] 0.1345ms 20.6050μs 48.5318 KOps/s 49.2735 KOps/s $\color{#d91a1a}-1.51\%$
test_getitem[list] 0.3321ms 45.0413μs 22.2019 KOps/s 22.9820 KOps/s $\color{#d91a1a}-3.39\%$
test_setitem_dim[int] 79.9590μs 26.0750μs 38.3510 KOps/s 39.7523 KOps/s $\color{#d91a1a}-3.53\%$
test_setitem_dim[slice_int] 84.1760μs 53.6924μs 18.6246 KOps/s 18.9652 KOps/s $\color{#d91a1a}-1.80\%$
test_setitem_dim[range] 0.1192ms 75.4500μs 13.2538 KOps/s 13.8048 KOps/s $\color{#d91a1a}-3.99\%$
test_setitem_dim[tuple] 80.1890μs 42.3284μs 23.6248 KOps/s 24.3521 KOps/s $\color{#d91a1a}-2.99\%$
test_setitem 0.1598ms 21.1730μs 47.2300 KOps/s 52.5976 KOps/s $\textbf{\color{#d91a1a}-10.21\%}$
test_set 0.1074ms 20.9195μs 47.8023 KOps/s 52.7196 KOps/s $\textbf{\color{#d91a1a}-9.33\%}$
test_set_shared 3.3452ms 0.1753ms 5.7061 KOps/s 5.8410 KOps/s $\color{#d91a1a}-2.31\%$
test_update 0.1537ms 23.6858μs 42.2193 KOps/s 48.6828 KOps/s $\textbf{\color{#d91a1a}-13.28\%}$
test_update_nested 0.1098ms 34.8717μs 28.6766 KOps/s 32.2953 KOps/s $\textbf{\color{#d91a1a}-11.21\%}$
test_update__nested 0.4205ms 33.4172μs 29.9247 KOps/s 30.2455 KOps/s $\color{#d91a1a}-1.06\%$
test_set_nested 0.1010ms 22.9762μs 43.5233 KOps/s 48.2586 KOps/s $\textbf{\color{#d91a1a}-9.81\%}$
test_set_nested_new 0.1013ms 27.6990μs 36.1024 KOps/s 39.5912 KOps/s $\textbf{\color{#d91a1a}-8.81\%}$
test_select 0.2152ms 45.0125μs 22.2161 KOps/s 23.8411 KOps/s $\textbf{\color{#d91a1a}-6.82\%}$
test_select_nested 0.1303ms 61.7460μs 16.1954 KOps/s 16.0302 KOps/s $\color{#35bf28}+1.03\%$
test_exclude_nested 0.1571ms 80.6010μs 12.4068 KOps/s 12.2325 KOps/s $\color{#35bf28}+1.43\%$
test_empty[True] 0.7208ms 0.3922ms 2.5498 KOps/s 2.5277 KOps/s $\color{#35bf28}+0.87\%$
test_empty[False] 10.6673μs 1.2690μs 788.0236 KOps/s 807.6065 KOps/s $\color{#d91a1a}-2.42\%$
test_unbind_speed 0.3512ms 0.2673ms 3.7417 KOps/s 3.6851 KOps/s $\color{#35bf28}+1.54\%$
test_unbind_speed_stack0 0.4500ms 0.2656ms 3.7653 KOps/s 3.7173 KOps/s $\color{#35bf28}+1.29\%$
test_unbind_speed_stack1 0.1063s 0.7900ms 1.2659 KOps/s 1.3902 KOps/s $\textbf{\color{#d91a1a}-8.95\%}$
test_split 2.7049ms 1.5904ms 628.7725 Ops/s 568.8757 Ops/s $\textbf{\color{#35bf28}+10.53\%}$
test_chunk 0.1025s 1.9119ms 523.0427 Ops/s 566.5834 Ops/s $\textbf{\color{#d91a1a}-7.68\%}$
test_consolidate_njt[False-None] 9.0165ms 8.1565ms 122.6023 Ops/s 123.0934 Ops/s $\color{#d91a1a}-0.40\%$
test_creation[device0] 3.3677ms 95.3944μs 10.4828 KOps/s 10.6955 KOps/s $\color{#d91a1a}-1.99\%$
test_creation_from_tensor 0.2781ms 96.1474μs 10.4007 KOps/s 10.6202 KOps/s $\color{#d91a1a}-2.07\%$
test_add_one[memmap_tensor0] 0.1604ms 5.1284μs 194.9927 KOps/s 210.5966 KOps/s $\textbf{\color{#d91a1a}-7.41\%}$
test_contiguous[memmap_tensor0] 18.9160μs 0.5452μs 1.8342 MOps/s 1.9373 MOps/s $\textbf{\color{#d91a1a}-5.32\%}$
test_stack[memmap_tensor0] 40.0950μs 3.5217μs 283.9555 KOps/s 301.3547 KOps/s $\textbf{\color{#d91a1a}-5.77\%}$
test_memmaptd_index 1.0284ms 0.2386ms 4.1910 KOps/s 4.3142 KOps/s $\color{#d91a1a}-2.86\%$
test_memmaptd_index_astensor 0.6552ms 0.3166ms 3.1586 KOps/s 3.2211 KOps/s $\color{#d91a1a}-1.94\%$
test_memmaptd_index_op 0.9691ms 0.5936ms 1.6846 KOps/s 1.8925 KOps/s $\textbf{\color{#d91a1a}-10.98\%}$
test_serialize_model 0.1243s 0.1183s 8.4565 Ops/s 7.2892 Ops/s $\textbf{\color{#35bf28}+16.01\%}$
test_serialize_model_pickle 0.4449s 0.3867s 2.5863 Ops/s 2.5222 Ops/s $\color{#35bf28}+2.54\%$
test_serialize_weights 0.2163s 0.1285s 7.7837 Ops/s 8.6907 Ops/s $\textbf{\color{#d91a1a}-10.44\%}$
test_serialize_weights_returnearly 0.1727s 0.1611s 6.2055 Ops/s 6.4318 Ops/s $\color{#d91a1a}-3.52\%$
test_serialize_weights_pickle 0.5395s 0.4133s 2.4196 Ops/s 2.3853 Ops/s $\color{#35bf28}+1.44\%$
test_serialize_weights_filesystem 0.1526s 0.1455s 6.8741 Ops/s 7.1097 Ops/s $\color{#d91a1a}-3.31\%$
test_serialize_model_filesystem 0.2464s 0.1648s 6.0681 Ops/s 6.5717 Ops/s $\textbf{\color{#d91a1a}-7.66\%}$
test_reshape_pytree 57.5870μs 27.2593μs 36.6848 KOps/s 37.2877 KOps/s $\color{#d91a1a}-1.62\%$
test_reshape_td 65.0310μs 34.1799μs 29.2569 KOps/s 29.3257 KOps/s $\color{#d91a1a}-0.23\%$
test_view_pytree 59.6410μs 26.9751μs 37.0712 KOps/s 37.4536 KOps/s $\color{#d91a1a}-1.02\%$
test_view_td 75.4100μs 38.3446μs 26.0793 KOps/s 26.2070 KOps/s $\color{#d91a1a}-0.49\%$
test_unbind_pytree 64.3290μs 29.8653μs 33.4837 KOps/s 33.9353 KOps/s $\color{#d91a1a}-1.33\%$
test_unbind_td 0.3039ms 39.6677μs 25.2095 KOps/s 25.7181 KOps/s $\color{#d91a1a}-1.98\%$
test_split_pytree 63.9890μs 29.6985μs 33.6718 KOps/s 33.8661 KOps/s $\color{#d91a1a}-0.57\%$
test_split_td 0.2067ms 45.6171μs 21.9216 KOps/s 21.9715 KOps/s $\color{#d91a1a}-0.23\%$
test_add_pytree 81.2400μs 37.2355μs 26.8561 KOps/s 28.4590 KOps/s $\textbf{\color{#d91a1a}-5.63\%}$
test_add_td 0.1148ms 58.7693μs 17.0157 KOps/s 18.7441 KOps/s $\textbf{\color{#d91a1a}-9.22\%}$
test_compile_add_one_nested[tensordict-compile] 0.1607ms 64.2053μs 15.5750 KOps/s 16.3743 KOps/s $\color{#d91a1a}-4.88\%$
test_compile_add_one_nested[tensordict-eager] 0.3950ms 0.1654ms 6.0459 KOps/s 6.2645 KOps/s $\color{#d91a1a}-3.49\%$
test_compile_add_one_nested[pytree-compile] 0.1328ms 46.5382μs 21.4877 KOps/s 21.7196 KOps/s $\color{#d91a1a}-1.07\%$
test_compile_add_one_nested[pytree-eager] 0.2602ms 0.1221ms 8.1919 KOps/s 8.3725 KOps/s $\color{#d91a1a}-2.16\%$
test_compile_copy_nested[tensordict-compile] 69.8100μs 25.8043μs 38.7532 KOps/s 39.3605 KOps/s $\color{#d91a1a}-1.54\%$
test_compile_copy_nested[tensordict-eager] 0.1026ms 54.8479μs 18.2322 KOps/s 18.0373 KOps/s $\color{#35bf28}+1.08\%$
test_compile_copy_nested[pytree-compile] 0.1453ms 79.5278μs 12.5742 KOps/s 12.4505 KOps/s $\color{#35bf28}+0.99\%$
test_compile_copy_nested[pytree-eager] 0.3023ms 68.7026μs 14.5555 KOps/s 14.6383 KOps/s $\color{#d91a1a}-0.57\%$
test_compile_add_one_flat[tensordict-compile] 0.2008ms 0.1053ms 9.5005 KOps/s 9.6469 KOps/s $\color{#d91a1a}-1.52\%$
test_compile_add_one_flat[tensordict-eager] 0.3019ms 0.2037ms 4.9102 KOps/s 5.0775 KOps/s $\color{#d91a1a}-3.30\%$
test_compile_add_one_flat[tensorclass-compile] 0.1079ms 46.5336μs 21.4899 KOps/s 22.0547 KOps/s $\color{#d91a1a}-2.56\%$
test_compile_add_one_flat[tensorclass-eager] 0.5176ms 64.3257μs 15.5459 KOps/s 16.3319 KOps/s $\color{#d91a1a}-4.81\%$
test_compile_add_one_flat[pytree-compile] 0.2122ms 0.1031ms 9.6947 KOps/s 9.8141 KOps/s $\color{#d91a1a}-1.22\%$
test_compile_add_one_flat[pytree-eager] 0.3756ms 0.2021ms 4.9478 KOps/s 4.9565 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_add_self_flat[tensordict-eager] 0.3480ms 0.2145ms 4.6620 KOps/s 4.7702 KOps/s $\color{#d91a1a}-2.27\%$
test_compile_add_self_flat[tensordict-compile] 0.2143ms 0.1053ms 9.4946 KOps/s 9.5689 KOps/s $\color{#d91a1a}-0.78\%$
test_compile_add_self_flat[tensorclass-eager] 0.1324ms 56.9702μs 17.5530 KOps/s 18.3253 KOps/s $\color{#d91a1a}-4.21\%$
test_compile_add_self_flat[tensorclass-compile] 0.1108ms 47.3693μs 21.1107 KOps/s 21.6197 KOps/s $\color{#d91a1a}-2.35\%$
test_compile_add_self_flat[pytree-eager] 0.2751ms 0.1585ms 6.3109 KOps/s 6.2572 KOps/s $\color{#35bf28}+0.86\%$
test_compile_add_self_flat[pytree-compile] 0.1853ms 0.1041ms 9.6049 KOps/s 9.7750 KOps/s $\color{#d91a1a}-1.74\%$
test_compile_copy_flat[tensordict-compile] 65.5010μs 22.0818μs 45.2861 KOps/s 47.6318 KOps/s $\color{#d91a1a}-4.92\%$
test_compile_copy_flat[tensordict-eager] 0.1177ms 58.8536μs 16.9913 KOps/s 17.2759 KOps/s $\color{#d91a1a}-1.65\%$
test_compile_copy_flat[pytree-compile] 0.1576ms 80.8499μs 12.3686 KOps/s 12.2851 KOps/s $\color{#35bf28}+0.68\%$
test_compile_copy_flat[pytree-eager] 0.1576ms 67.9178μs 14.7237 KOps/s 14.3709 KOps/s $\color{#35bf28}+2.46\%$
test_compile_assign_and_add[tensordict-compile] 0.9466ms 0.2077ms 4.8142 KOps/s 4.9155 KOps/s $\color{#d91a1a}-2.06\%$
test_compile_assign_and_add[tensordict-eager] 1.6174ms 1.2937ms 772.9775 Ops/s 788.9275 Ops/s $\color{#d91a1a}-2.02\%$
test_compile_assign_and_add[pytree-compile] 0.2696ms 0.2051ms 4.8762 KOps/s 4.9441 KOps/s $\color{#d91a1a}-1.37\%$
test_compile_assign_and_add[pytree-eager] 1.9825ms 0.7780ms 1.2854 KOps/s 1.2973 KOps/s $\color{#d91a1a}-0.92\%$
test_compile_assign_and_add_stack[compile] 1.0712ms 0.4550ms 2.1980 KOps/s 2.1916 KOps/s $\color{#35bf28}+0.29\%$
test_compile_assign_and_add_stack[eager] 3.0371ms 2.7312ms 366.1328 Ops/s 405.8608 Ops/s $\textbf{\color{#d91a1a}-9.79\%}$
test_compile_indexing[tensor-tensordict-compile] 0.1047ms 37.2276μs 26.8618 KOps/s 27.5552 KOps/s $\color{#d91a1a}-2.52\%$
test_compile_indexing[tensor-tensordict-eager] 0.4895ms 34.2475μs 29.1992 KOps/s 29.1843 KOps/s $\color{#35bf28}+0.05\%$
test_compile_indexing[tensor-tensorclass-compile] 79.2170μs 29.8932μs 33.4524 KOps/s 32.8383 KOps/s $\color{#35bf28}+1.87\%$
test_compile_indexing[tensor-tensorclass-eager] 58.0980μs 23.7158μs 42.1659 KOps/s 41.8450 KOps/s $\color{#35bf28}+0.77\%$
test_compile_indexing[tensor-pytree-compile] 0.1016ms 30.7656μs 32.5039 KOps/s 32.6025 KOps/s $\color{#d91a1a}-0.30\%$
test_compile_indexing[tensor-pytree-eager] 65.7120μs 23.7495μs 42.1062 KOps/s 41.7753 KOps/s $\color{#35bf28}+0.79\%$
test_compile_indexing[slice-tensordict-compile] 0.1160ms 52.9688μs 18.8790 KOps/s 18.8510 KOps/s $\color{#35bf28}+0.15\%$
test_compile_indexing[slice-tensordict-eager] 0.5569ms 21.1010μs 47.3911 KOps/s 47.9884 KOps/s $\color{#d91a1a}-1.24\%$
test_compile_indexing[slice-tensorclass-compile] 0.1115ms 45.0081μs 22.2182 KOps/s 21.6887 KOps/s $\color{#35bf28}+2.44\%$
test_compile_indexing[slice-tensorclass-eager] 71.4530μs 19.2256μs 52.0141 KOps/s 52.1396 KOps/s $\color{#d91a1a}-0.24\%$
test_compile_indexing[slice-pytree-compile] 0.1023ms 45.5771μs 21.9409 KOps/s 21.4696 KOps/s $\color{#35bf28}+2.19\%$
test_compile_indexing[slice-pytree-eager] 0.6586ms 19.2578μs 51.9270 KOps/s 52.2102 KOps/s $\color{#d91a1a}-0.54\%$
test_compile_indexing[int-tensordict-compile] 0.1195ms 53.5007μs 18.6913 KOps/s 18.4950 KOps/s $\color{#35bf28}+1.06\%$
test_compile_indexing[int-tensordict-eager] 0.9009ms 21.2542μs 47.0496 KOps/s 48.1536 KOps/s $\color{#d91a1a}-2.29\%$
test_compile_indexing[int-tensorclass-compile] 91.1390μs 45.0088μs 22.2179 KOps/s 21.3494 KOps/s $\color{#35bf28}+4.07\%$
test_compile_indexing[int-tensorclass-eager] 0.3044ms 19.1141μs 52.3175 KOps/s 52.6697 KOps/s $\color{#d91a1a}-0.67\%$
test_compile_indexing[int-pytree-compile] 0.1078ms 45.6273μs 21.9167 KOps/s 21.2999 KOps/s $\color{#35bf28}+2.90\%$
test_compile_indexing[int-pytree-eager] 0.1423ms 19.1349μs 52.2604 KOps/s 52.3012 KOps/s $\color{#d91a1a}-0.08\%$
test_mod_add[eager] 81.9520μs 35.5461μs 28.1325 KOps/s 29.8662 KOps/s $\textbf{\color{#d91a1a}-5.80\%}$
test_mod_add[compile] 0.1266ms 47.9328μs 20.8625 KOps/s 20.3429 KOps/s $\color{#35bf28}+2.55\%$
test_mod_add[compile-overhead] 90.4780μs 48.3607μs 20.6780 KOps/s 20.6633 KOps/s $\color{#35bf28}+0.07\%$
test_mod_wrap[eager] 0.4395ms 0.2326ms 4.2996 KOps/s 4.4299 KOps/s $\color{#d91a1a}-2.94\%$
test_mod_wrap[compile] 0.9888ms 0.2166ms 4.6170 KOps/s 4.7645 KOps/s $\color{#d91a1a}-3.10\%$
test_mod_wrap[compile-overhead] 0.3200ms 0.2119ms 4.7195 KOps/s 4.7534 KOps/s $\color{#d91a1a}-0.71\%$
test_mod_wrap_and_backward[eager] 15.5471ms 11.5539ms 86.5511 Ops/s 81.4619 Ops/s $\textbf{\color{#35bf28}+6.25\%}$
test_mod_wrap_and_backward[compile] 15.5645ms 13.0100ms 76.8637 Ops/s 77.8761 Ops/s $\color{#d91a1a}-1.30\%$
test_mod_wrap_and_backward[compile-overhead] 18.3744ms 12.9206ms 77.3958 Ops/s 75.3301 Ops/s $\color{#35bf28}+2.74\%$
test_seq_add[eager] 0.2474ms 0.1166ms 8.5781 KOps/s 9.0712 KOps/s $\textbf{\color{#d91a1a}-5.44\%}$
test_seq_add[compile] 0.1371ms 63.8506μs 15.6616 KOps/s 15.9303 KOps/s $\color{#d91a1a}-1.69\%$
test_seq_add[compile-overhead] 0.7999ms 61.5305μs 16.2521 KOps/s 16.1486 KOps/s $\color{#35bf28}+0.64\%$
test_seq_wrap[eager] 0.6457ms 0.4570ms 2.1884 KOps/s 2.2621 KOps/s $\color{#d91a1a}-3.26\%$
test_seq_wrap[compile] 0.3538ms 0.2351ms 4.2535 KOps/s 4.2589 KOps/s $\color{#d91a1a}-0.12\%$
test_seq_wrap[compile-overhead] 0.3743ms 0.2334ms 4.2839 KOps/s 4.2548 KOps/s $\color{#35bf28}+0.68\%$
test_func_call_runtime[False-eager] 0.8231ms 0.5752ms 1.7385 KOps/s 1.7999 KOps/s $\color{#d91a1a}-3.41\%$
test_func_call_runtime[False-compile] 0.7075ms 0.4421ms 2.2618 KOps/s 2.3475 KOps/s $\color{#d91a1a}-3.65\%$
test_func_call_runtime[False-compile-overhead] 0.5591ms 0.4375ms 2.2856 KOps/s 2.3269 KOps/s $\color{#d91a1a}-1.78\%$
test_func_call_runtime[True-eager] 1.8402ms 0.7956ms 1.2569 KOps/s 1.2730 KOps/s $\color{#d91a1a}-1.26\%$
test_func_call_runtime[True-compile] 0.5680ms 0.4746ms 2.1068 KOps/s 2.1369 KOps/s $\color{#d91a1a}-1.41\%$
test_func_call_runtime[True-compile-overhead] 0.7264ms 0.4752ms 2.1046 KOps/s 2.1434 KOps/s $\color{#d91a1a}-1.81\%$
test_func_call_cm_runtime[False-eager] 0.9879ms 0.5738ms 1.7427 KOps/s 1.8251 KOps/s $\color{#d91a1a}-4.51\%$
test_func_call_cm_runtime[False-compile] 0.7971ms 0.4393ms 2.2763 KOps/s 2.3312 KOps/s $\color{#d91a1a}-2.36\%$
test_func_call_cm_runtime[False-compile-overhead] 2.2860ms 0.4406ms 2.2697 KOps/s 2.3335 KOps/s $\color{#d91a1a}-2.73\%$
test_func_call_cm_runtime[True-eager] 1.1084ms 0.9179ms 1.0895 KOps/s 1.1020 KOps/s $\color{#d91a1a}-1.14\%$
test_func_call_cm_runtime[True-compile] 0.6628ms 0.5011ms 1.9955 KOps/s 2.0128 KOps/s $\color{#d91a1a}-0.86\%$
test_func_call_cm_runtime[True-compile-overhead] 0.7484ms 0.5054ms 1.9788 KOps/s 2.0271 KOps/s $\color{#d91a1a}-2.39\%$
test_vmap_func_call_cm_runtime[eager] 2.7492ms 1.9546ms 511.6053 Ops/s 524.8065 Ops/s $\color{#d91a1a}-2.52\%$
test_vmap_func_call_cm_runtime[compile] 0.9051ms 0.5228ms 1.9129 KOps/s 1.8919 KOps/s $\color{#35bf28}+1.11\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.8841ms 0.5311ms 1.8828 KOps/s 1.8975 KOps/s $\color{#d91a1a}-0.77\%$
test_distributed 0.2417ms 0.1265ms 7.9033 KOps/s 7.8155 KOps/s $\color{#35bf28}+1.12\%$
test_tdmodule 0.1016ms 27.5056μs 36.3562 KOps/s 38.5866 KOps/s $\textbf{\color{#d91a1a}-5.78\%}$
test_tdmodule_dispatch 72.1640μs 49.4203μs 20.2346 KOps/s 20.8685 KOps/s $\color{#d91a1a}-3.04\%$
test_tdseq 54.9720μs 26.7326μs 37.4075 KOps/s 38.6496 KOps/s $\color{#d91a1a}-3.21\%$
test_tdseq_dispatch 80.0680μs 51.4797μs 19.4251 KOps/s 20.5199 KOps/s $\textbf{\color{#d91a1a}-5.34\%}$
test_instantiation_functorch 2.3563ms 1.5799ms 632.9348 Ops/s 641.8520 Ops/s $\color{#d91a1a}-1.39\%$
test_exec_functorch 0.4171ms 0.1867ms 5.3548 KOps/s 5.5244 KOps/s $\color{#d91a1a}-3.07\%$
test_exec_functional_call 0.3267ms 0.1800ms 5.5552 KOps/s 5.7343 KOps/s $\color{#d91a1a}-3.12\%$
test_exec_td_decorator 0.4548ms 0.2378ms 4.2053 KOps/s 4.3891 KOps/s $\color{#d91a1a}-4.19\%$
test_vmap_mlp_speed_decorator[True-True] 3.2809ms 0.6689ms 1.4949 KOps/s 1.5055 KOps/s $\color{#d91a1a}-0.71\%$
test_vmap_mlp_speed_decorator[True-False] 0.9024ms 0.6635ms 1.5071 KOps/s 1.5232 KOps/s $\color{#d91a1a}-1.06\%$
test_vmap_mlp_speed_decorator[False-True] 0.8558ms 0.5343ms 1.8716 KOps/s 1.8463 KOps/s $\color{#35bf28}+1.37\%$
test_vmap_mlp_speed_decorator[False-False] 1.0046ms 0.5361ms 1.8654 KOps/s 1.8635 KOps/s $\color{#35bf28}+0.11\%$
test_to_module_speed[True] 1.4578ms 1.2974ms 770.7530 Ops/s 749.0675 Ops/s $\color{#35bf28}+2.89\%$
test_to_module_speed[False] 1.5621ms 1.2688ms 788.1686 Ops/s 768.2497 Ops/s $\color{#35bf28}+2.59\%$
test_tc_init 0.1178ms 48.3218μs 20.6946 KOps/s 22.5460 KOps/s $\textbf{\color{#d91a1a}-8.21\%}$
test_tc_init_nested 0.2050ms 96.7554μs 10.3353 KOps/s 11.0816 KOps/s $\textbf{\color{#d91a1a}-6.73\%}$
test_tc_first_layer_tensor 26.6390μs 1.5468μs 646.4843 KOps/s 675.5937 KOps/s $\color{#d91a1a}-4.31\%$
test_tc_first_layer_nontensor 24.7950μs 4.7372μs 211.0929 KOps/s 214.6859 KOps/s $\color{#d91a1a}-1.67\%$
test_tc_second_layer_tensor 42.2780μs 2.8878μs 346.2817 KOps/s 350.9760 KOps/s $\color{#d91a1a}-1.34\%$
test_tc_second_layer_nontensor 30.2660μs 6.1318μs 163.0846 KOps/s 168.8981 KOps/s $\color{#d91a1a}-3.44\%$
test_unbind 0.2145s 13.8215ms 72.3511 Ops/s 77.9448 Ops/s $\textbf{\color{#d91a1a}-7.18\%}$
test_full_like 8.2832ms 7.2621ms 137.7022 Ops/s 82.5529 Ops/s $\textbf{\color{#35bf28}+66.80\%}$
test_zeros_like 3.9830ms 2.7601ms 362.3106 Ops/s 138.7297 Ops/s $\textbf{\color{#35bf28}+161.16\%}$
test_ones_like 3.8962ms 3.3449ms 298.9628 Ops/s 125.4464 Ops/s $\textbf{\color{#35bf28}+138.32\%}$
test_clone 5.5158ms 4.9805ms 200.7829 Ops/s 105.2850 Ops/s $\textbf{\color{#35bf28}+90.70\%}$
test_squeeze 89.0680μs 12.0025μs 83.3158 KOps/s 83.2009 KOps/s $\color{#35bf28}+0.14\%$
test_unsqueeze 0.1491ms 90.3162μs 11.0722 KOps/s 11.0553 KOps/s $\color{#35bf28}+0.15\%$
test_split 0.4592ms 0.1963ms 5.0946 KOps/s 5.0541 KOps/s $\color{#35bf28}+0.80\%$
test_permute 0.3627ms 0.2236ms 4.4726 KOps/s 4.5035 KOps/s $\color{#d91a1a}-0.69\%$
test_stack 27.8286ms 25.4220ms 39.3360 Ops/s 39.7345 Ops/s $\color{#d91a1a}-1.00\%$
test_cat 27.8287ms 25.3544ms 39.4409 Ops/s 39.8760 Ops/s $\color{#d91a1a}-1.09\%$
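
For readers checking the table by hand: the "Change" column is consistent with the relative difference between the "Ops" and "Ops on Repo HEAD" columns. The sketch below reproduces two rows under that assumption. The function names and the 5% threshold are illustrative guesses and are not taken from the benchmark bot's code. The threshold is inferred only from which rows appear in bold (for example, -5.17% is bold and -4.88% is not).

```python
def relative_change(ops: float, ops_head: float) -> float:
    """Percent change in throughput relative to repo HEAD (same units for both)."""
    return (ops - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold_pct: float = 5.0) -> str:
    """Label a row; threshold_pct is an assumed cutoff, not a documented one."""
    if change_pct > threshold_pct:
        return "improved"
    if change_pct < -threshold_pct:
        return "worsened"
    return "unchanged"


# (Ops, Ops on Repo HEAD) in KOps/s, copied from the CPU table above.
rows = {
    "test_plain_set_nested": (55.8063, 60.4530),
    "test_keys": (285.1304, 211.1447),
    "test_items": (245.5462, 240.5885),
}

changes = {name: round(relative_change(ops, head), 2) for name, (ops, head) in rows.items()}
labels = {name: classify(pct) for name, pct in changes.items()}

for name in rows:
    print(f"{name}: {changes[name]:+.2f}% ({labels[name]})")
```

Both values must be in the same unit before dividing; rows reported in Ops/s or MOps/s need no conversion as long as the two columns of a given row agree.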

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Nov 26, 2024
…n one distribution

ghstack-source-id: 2b89ea6
Pull Request resolved: #1114
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Nov 26, 2024
…n one distribution

ghstack-source-id: b49b545
Pull Request resolved: #1114
github-actions bot commented Nov 26, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}21$. Worsened: $\large\color{#d91a1a}10$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 91.9210μs 10.2942μs 97.1426 KOps/s 99.6123 KOps/s $\color{#d91a1a}-2.48\%$
test_plain_set_stack_nested 33.2700μs 10.3596μs 96.5288 KOps/s 98.7714 KOps/s $\color{#d91a1a}-2.27\%$
test_plain_set_nested_inplace 50.1100μs 11.2708μs 88.7252 KOps/s 91.5348 KOps/s $\color{#d91a1a}-3.07\%$
test_plain_set_stack_nested_inplace 0.1065ms 11.2190μs 89.1341 KOps/s 90.8845 KOps/s $\color{#d91a1a}-1.93\%$
test_items 0.1085ms 2.8417μs 351.9050 KOps/s 345.8692 KOps/s $\color{#35bf28}+1.75\%$
test_items_nested 0.4143ms 0.3525ms 2.8369 KOps/s 2.8150 KOps/s $\color{#35bf28}+0.78\%$
test_items_nested_locked 0.4574ms 0.3573ms 2.7990 KOps/s 2.8125 KOps/s $\color{#d91a1a}-0.48\%$
test_items_nested_leaf 0.1431ms 58.0829μs 17.2168 KOps/s 17.2641 KOps/s $\color{#d91a1a}-0.27\%$
test_items_stack_nested 0.4456ms 0.3488ms 2.8668 KOps/s 2.7667 KOps/s $\color{#35bf28}+3.62\%$
test_items_stack_nested_leaf 0.1552ms 59.0609μs 16.9317 KOps/s 17.1014 KOps/s $\color{#d91a1a}-0.99\%$
test_items_stack_nested_locked 0.4469ms 0.3578ms 2.7946 KOps/s 2.7953 KOps/s $\color{#d91a1a}-0.03\%$
test_keys 0.1063ms 3.4352μs 291.1023 KOps/s 290.7129 KOps/s $\color{#35bf28}+0.13\%$
test_keys_nested 0.2267ms 69.2634μs 14.4376 KOps/s 14.2908 KOps/s $\color{#35bf28}+1.03\%$
test_keys_nested_locked 0.8067ms 74.6648μs 13.3932 KOps/s 13.1439 KOps/s $\color{#35bf28}+1.90\%$
test_keys_nested_leaf 0.1444ms 61.6102μs 16.2311 KOps/s 16.2369 KOps/s $\color{#d91a1a}-0.04\%$
test_keys_stack_nested 0.1601ms 69.5913μs 14.3696 KOps/s 14.0669 KOps/s $\color{#35bf28}+2.15\%$
test_keys_stack_nested_leaf 92.0720μs 60.8444μs 16.4354 KOps/s 16.1259 KOps/s $\color{#35bf28}+1.92\%$
test_keys_stack_nested_locked 0.1183ms 74.9778μs 13.3373 KOps/s 13.0987 KOps/s $\color{#35bf28}+1.82\%$
test_values 13.0518μs 0.8439μs 1.1849 MOps/s 1.1867 MOps/s $\color{#d91a1a}-0.14\%$
test_values_nested 0.1602ms 31.0793μs 32.1757 KOps/s 32.3867 KOps/s $\color{#d91a1a}-0.65\%$
test_values_nested_locked 0.1115ms 32.5137μs 30.7563 KOps/s 30.7476 KOps/s $\color{#35bf28}+0.03\%$
test_values_nested_leaf 0.1173ms 33.6533μs 29.7148 KOps/s 29.8059 KOps/s $\color{#d91a1a}-0.31\%$
test_values_stack_nested 0.1175ms 31.6454μs 31.6002 KOps/s 31.8482 KOps/s $\color{#d91a1a}-0.78\%$
test_values_stack_nested_leaf 75.1410μs 34.0900μs 29.3341 KOps/s 29.5559 KOps/s $\color{#d91a1a}-0.75\%$
test_values_stack_nested_locked 0.1247ms 32.9365μs 30.3614 KOps/s 30.1136 KOps/s $\color{#35bf28}+0.82\%$
test_membership 3.9861μs 0.5052μs 1.9795 MOps/s 1.9706 MOps/s $\color{#35bf28}+0.45\%$
test_membership_nested 62.6410μs 1.9500μs 512.8123 KOps/s 489.3173 KOps/s $\color{#35bf28}+4.80\%$
test_membership_nested_leaf 16.2800μs 1.9885μs 502.9000 KOps/s 515.2183 KOps/s $\color{#d91a1a}-2.39\%$
test_membership_stacked_nested 41.2200μs 2.0068μs 498.2973 KOps/s 486.0713 KOps/s $\color{#35bf28}+2.52\%$
test_membership_stacked_nested_leaf 25.5200μs 2.0541μs 486.8276 KOps/s 493.3299 KOps/s $\color{#d91a1a}-1.32\%$
test_membership_nested_last 98.5220μs 2.8655μs 348.9735 KOps/s 343.8629 KOps/s $\color{#35bf28}+1.49\%$
test_membership_nested_leaf_last 61.1200μs 2.8755μs 347.7689 KOps/s 344.7163 KOps/s $\color{#35bf28}+0.89\%$
test_membership_stacked_nested_last 0.1652ms 2.8806μs 347.1469 KOps/s 341.5226 KOps/s $\color{#35bf28}+1.65\%$
test_membership_stacked_nested_leaf_last 57.6800μs 2.8582μs 349.8678 KOps/s 338.4986 KOps/s $\color{#35bf28}+3.36\%$
test_nested_getleaf 0.1643ms 6.1313μs 163.0987 KOps/s 163.3020 KOps/s $\color{#d91a1a}-0.12\%$
test_nested_get 95.5920μs 5.8518μs 170.8871 KOps/s 172.1732 KOps/s $\color{#d91a1a}-0.75\%$
test_stacked_getleaf 31.9700μs 6.1065μs 163.7607 KOps/s 162.6953 KOps/s $\color{#35bf28}+0.65\%$
test_stacked_get 55.7410μs 5.7964μs 172.5197 KOps/s 171.7724 KOps/s $\color{#35bf28}+0.44\%$
test_nested_getitemleaf 0.1875ms 6.1619μs 162.2873 KOps/s 160.2632 KOps/s $\color{#35bf28}+1.26\%$
test_nested_getitem 37.1700μs 5.8737μs 170.2501 KOps/s 170.1246 KOps/s $\color{#35bf28}+0.07\%$
test_stacked_getitemleaf 0.1967ms 6.1937μs 161.4556 KOps/s 160.0581 KOps/s $\color{#35bf28}+0.87\%$
test_stacked_getitem 0.1825ms 5.8695μs 170.3710 KOps/s 169.5711 KOps/s $\color{#35bf28}+0.47\%$
test_lock_nested 10.6577ms 0.3708ms 2.6970 KOps/s 2.6808 KOps/s $\color{#35bf28}+0.60\%$
test_lock_stack_nested 0.4715ms 0.3322ms 3.0105 KOps/s 2.9815 KOps/s $\color{#35bf28}+0.97\%$
test_unlock_nested 0.6062ms 0.3025ms 3.3053 KOps/s 3.2686 KOps/s $\color{#35bf28}+1.12\%$
test_unlock_stack_nested 0.3943ms 0.2732ms 3.6609 KOps/s 3.6462 KOps/s $\color{#35bf28}+0.40\%$
test_flatten_speed 0.1599ms 74.4430μs 13.4331 KOps/s 13.3899 KOps/s $\color{#35bf28}+0.32\%$
test_unflatten_speed 0.4971ms 0.3077ms 3.2500 KOps/s 3.2968 KOps/s $\color{#d91a1a}-1.42\%$
test_common_ops 1.7063ms 0.5729ms 1.7455 KOps/s 1.7288 KOps/s $\color{#35bf28}+0.96\%$
test_creation 95.6620μs 1.4823μs 674.6211 KOps/s 677.0954 KOps/s $\color{#d91a1a}-0.37\%$
test_creation_empty 0.1044ms 6.7522μs 148.1005 KOps/s 156.9377 KOps/s $\textbf{\color{#d91a1a}-5.63\%}$
test_creation_nested_1 37.0800μs 8.2727μs 120.8789 KOps/s 126.8228 KOps/s $\color{#d91a1a}-4.69\%$
test_creation_nested_2 94.5320μs 10.7629μs 92.9115 KOps/s 96.5289 KOps/s $\color{#d91a1a}-3.75\%$
test_clone 57.8510μs 10.2702μs 97.3689 KOps/s 90.7283 KOps/s $\textbf{\color{#35bf28}+7.32\%}$
test_getitem[int] 75.2110μs 10.7227μs 93.2605 KOps/s 89.5625 KOps/s $\color{#35bf28}+4.13\%$
test_getitem[slice_int] 1.6166ms 20.6730μs 48.3723 KOps/s 46.8509 KOps/s $\color{#35bf28}+3.25\%$
test_getitem[range] 0.1020s 55.3242μs 18.0753 KOps/s 25.6763 KOps/s $\textbf{\color{#d91a1a}-29.60\%}$
test_getitem[tuple] 0.1509ms 17.8924μs 55.8897 KOps/s 53.1055 KOps/s $\textbf{\color{#35bf28}+5.24\%}$
test_getitem[list] 0.2934ms 33.1596μs 30.1572 KOps/s 29.7870 KOps/s $\color{#35bf28}+1.24\%$
test_setitem_dim[int] 31.1610μs 17.8957μs 55.8793 KOps/s 51.7385 KOps/s $\textbf{\color{#35bf28}+8.00\%}$
test_setitem_dim[slice_int] 76.9810μs 37.1428μs 26.9231 KOps/s 25.5573 KOps/s $\textbf{\color{#35bf28}+5.34\%}$
test_setitem_dim[range] 0.1197ms 54.3843μs 18.3877 KOps/s 18.2899 KOps/s $\color{#35bf28}+0.53\%$
test_setitem_dim[tuple] 55.5310μs 31.8223μs 31.4245 KOps/s 30.2340 KOps/s $\color{#35bf28}+3.94\%$
test_setitem 89.0910μs 13.8922μs 71.9827 KOps/s 67.9487 KOps/s $\textbf{\color{#35bf28}+5.94\%}$
test_set 0.1326ms 13.5710μs 73.6864 KOps/s 71.6399 KOps/s $\color{#35bf28}+2.86\%$
test_set_shared 1.5709ms 0.1469ms 6.8058 KOps/s 6.7795 KOps/s $\color{#35bf28}+0.39\%$
test_update 0.4654ms 15.6305μs 63.9773 KOps/s 60.8393 KOps/s $\textbf{\color{#35bf28}+5.16\%}$
test_update_nested 80.3910μs 20.2966μs 49.2694 KOps/s 47.6662 KOps/s $\color{#35bf28}+3.36\%$
test_update__nested 1.1741ms 23.7120μs 42.1728 KOps/s 40.5717 KOps/s $\color{#35bf28}+3.95\%$
test_set_nested 0.1555ms 14.6068μs 68.4611 KOps/s 65.9406 KOps/s $\color{#35bf28}+3.82\%$
test_set_nested_new 81.2110μs 16.6643μs 60.0085 KOps/s 56.6373 KOps/s $\textbf{\color{#35bf28}+5.95\%}$
test_select 0.1184ms 28.2903μs 35.3478 KOps/s 35.0060 KOps/s $\color{#35bf28}+0.98\%$
test_select_nested 0.1255ms 41.8979μs 23.8675 KOps/s 24.2540 KOps/s $\color{#d91a1a}-1.59\%$
test_exclude_nested 0.3774ms 61.7814μs 16.1861 KOps/s 16.5142 KOps/s $\color{#d91a1a}-1.99\%$
test_empty[True] 0.3421ms 0.2727ms 3.6664 KOps/s 3.6336 KOps/s $\color{#35bf28}+0.90\%$
test_empty[False] 7.8541μs 0.7474μs 1.3380 MOps/s 1.3506 MOps/s $\color{#d91a1a}-0.93\%$
test_to 0.1031ms 58.0094μs 17.2386 KOps/s 18.2536 KOps/s $\textbf{\color{#d91a1a}-5.56\%}$
test_to_nonblocking 0.1954ms 45.5245μs 21.9662 KOps/s 22.8986 KOps/s $\color{#d91a1a}-4.07\%$
test_unbind_speed 0.3308ms 0.2272ms 4.4021 KOps/s 4.2438 KOps/s $\color{#35bf28}+3.73\%$
test_unbind_speed_stack0 0.2816ms 0.2301ms 4.3458 KOps/s 4.3208 KOps/s $\color{#35bf28}+0.58\%$
test_unbind_speed_stack1 0.1033s 0.6530ms 1.5314 KOps/s 1.5227 KOps/s $\color{#35bf28}+0.57\%$
test_split 0.1020s 1.5796ms 633.0806 Ops/s 559.7721 Ops/s $\textbf{\color{#35bf28}+13.10\%}$
test_chunk 0.1027s 1.5868ms 630.1870 Ops/s 672.1917 Ops/s $\textbf{\color{#d91a1a}-6.25\%}$
test_consolidate[False-None] 0.1069s 2.8798ms 347.2485 Ops/s 347.0213 Ops/s $\color{#35bf28}+0.07\%$
test_consolidate[default-None] 1.9048ms 1.7488ms 571.8077 Ops/s 595.7732 Ops/s $\color{#d91a1a}-4.02\%$
test_consolidate[reduce-overhead-None] 1.9183ms 1.7641ms 566.8467 Ops/s 585.2423 Ops/s $\color{#d91a1a}-3.14\%$
test_consolidate_njt[False-None] 6.8048ms 6.4463ms 155.1281 Ops/s 154.6559 Ops/s $\color{#35bf28}+0.31\%$
test_to[False-False-None] 1.8520ms 1.6502ms 605.9733 Ops/s 613.2738 Ops/s $\color{#d91a1a}-1.19\%$
test_to[True-False-None] 1.5144ms 1.2851ms 778.1248 Ops/s 771.7080 Ops/s $\color{#35bf28}+0.83\%$
test_to[within-False-None] 4.2455ms 4.0076ms 249.5243 Ops/s 247.5674 Ops/s $\color{#35bf28}+0.79\%$
test_to[True-default-None] 5.3140ms 5.1398ms 194.5615 Ops/s 193.2542 Ops/s $\color{#35bf28}+0.68\%$
test_to_njt[False-False-None] 7.0914ms 6.8934ms 145.0668 Ops/s 142.3275 Ops/s $\color{#35bf28}+1.92\%$
test_to_njt[True-False-None] 5.6321ms 5.3924ms 185.4470 Ops/s 180.0562 Ops/s $\color{#35bf28}+2.99\%$
test_to_njt[within-False-None] 12.0175ms 11.8541ms 84.3592 Ops/s 82.0285 Ops/s $\color{#35bf28}+2.84\%$
test_creation[device0] 0.4569ms 78.2061μs 12.7867 KOps/s 12.3053 KOps/s $\color{#35bf28}+3.91\%$
test_creation_from_tensor 0.5383ms 82.2023μs 12.1651 KOps/s 11.7025 KOps/s $\color{#35bf28}+3.95\%$
test_add_one[memmap_tensor0] 0.2342ms 6.8410μs 146.1779 KOps/s 139.6865 KOps/s $\color{#35bf28}+4.65\%$
test_contiguous[memmap_tensor0] 1.8920μs 0.4075μs 2.4543 MOps/s 2.4104 MOps/s $\color{#35bf28}+1.82\%$
test_stack[memmap_tensor0] 37.9100μs 4.4640μs 224.0145 KOps/s 214.2781 KOps/s $\color{#35bf28}+4.54\%$
test_memmaptd_index 1.9011ms 0.2489ms 4.0180 KOps/s 3.9181 KOps/s $\color{#35bf28}+2.55\%$
test_memmaptd_index_astensor 0.9419ms 0.3069ms 3.2586 KOps/s 3.1977 KOps/s $\color{#35bf28}+1.91\%$
test_memmaptd_index_op 1.0245ms 0.5656ms 1.7681 KOps/s 1.7351 KOps/s $\color{#35bf28}+1.90\%$
test_serialize_model 0.1317s 0.1309s 7.6407 Ops/s 7.6262 Ops/s $\color{#35bf28}+0.19\%$
test_serialize_model_pickle 1.3512s 1.2161s 0.8223 Ops/s 0.8231 Ops/s $\color{#d91a1a}-0.10\%$
test_serialize_weights 0.4763s 0.1803s 5.5477 Ops/s 7.6685 Ops/s $\textbf{\color{#d91a1a}-27.66\%}$
test_serialize_weights_returnearly 0.3737s 54.5890ms 18.3187 Ops/s 14.7180 Ops/s $\textbf{\color{#35bf28}+24.47\%}$
test_serialize_weights_pickle 1.3482s 1.1922s 0.8388 Ops/s 0.8197 Ops/s $\color{#35bf28}+2.33\%$
test_reshape_pytree 0.1646ms 22.1476μs 45.1516 KOps/s 45.2872 KOps/s $\color{#d91a1a}-0.30\%$
test_reshape_td 56.5710μs 26.0790μs 38.3451 KOps/s 37.9635 KOps/s $\color{#35bf28}+1.01\%$
test_view_pytree 0.1797ms 22.1195μs 45.2089 KOps/s 45.4138 KOps/s $\color{#d91a1a}-0.45\%$
test_view_td 0.1216ms 29.4059μs 34.0068 KOps/s 32.5049 KOps/s $\color{#35bf28}+4.62\%$
test_unbind_pytree 0.1720ms 28.0115μs 35.6996 KOps/s 35.4487 KOps/s $\color{#35bf28}+0.71\%$
test_unbind_td 0.7567ms 34.8015μs 28.7344 KOps/s 28.5988 KOps/s $\color{#35bf28}+0.47\%$
test_split_pytree 0.1256ms 29.7320μs 33.6338 KOps/s 33.2255 KOps/s $\color{#35bf28}+1.23\%$
test_split_td 0.9590ms 36.5718μs 27.3435 KOps/s 25.2400 KOps/s $\textbf{\color{#35bf28}+8.33\%}$
test_add_pytree 0.2013ms 33.9915μs 29.4191 KOps/s 28.2866 KOps/s $\color{#35bf28}+4.00\%$
test_add_td 0.4292ms 44.7624μs 22.3402 KOps/s 23.2265 KOps/s $\color{#d91a1a}-3.82\%$
test_compile_add_one_nested[tensordict-compile] 0.2828ms 0.1189ms 8.4117 KOps/s 8.1757 KOps/s $\color{#35bf28}+2.89\%$
test_compile_add_one_nested[tensordict-eager] 0.5369ms 0.1225ms 8.1625 KOps/s 8.1379 KOps/s $\color{#35bf28}+0.30\%$
test_compile_add_one_nested[pytree-compile] 0.2762ms 94.5466μs 10.5768 KOps/s 10.0710 KOps/s $\textbf{\color{#35bf28}+5.02\%}$
test_compile_add_one_nested[pytree-eager] 0.5406ms 0.1454ms 6.8795 KOps/s 6.8601 KOps/s $\color{#35bf28}+0.28\%$
test_compile_copy_nested[tensordict-compile] 0.4240ms 20.9259μs 47.7876 KOps/s 43.9026 KOps/s $\textbf{\color{#35bf28}+8.85\%}$
test_compile_copy_nested[tensordict-eager] 0.5498ms 26.3049μs 38.0158 KOps/s 37.1509 KOps/s $\color{#35bf28}+2.33\%$
test_compile_copy_nested[pytree-compile] 0.3046ms 64.3308μs 15.5446 KOps/s 15.5084 KOps/s $\color{#35bf28}+0.23\%$
test_compile_copy_nested[pytree-eager] 0.4373ms 49.3944μs 20.2452 KOps/s 20.1167 KOps/s $\color{#35bf28}+0.64\%$
test_compile_add_one_flat[tensordict-compile] 0.3154ms 0.1415ms 7.0677 KOps/s 6.9038 KOps/s $\color{#35bf28}+2.37\%$
test_compile_add_one_flat[tensordict-eager] 0.3557ms 0.2064ms 4.8446 KOps/s 4.9256 KOps/s $\color{#d91a1a}-1.65\%$
test_compile_add_one_flat[tensorclass-compile] 0.2499ms 97.7640μs 10.2287 KOps/s 10.2056 KOps/s $\color{#35bf28}+0.23\%$
test_compile_add_one_flat[tensorclass-eager] 0.1976ms 49.8090μs 20.0767 KOps/s 19.1280 KOps/s $\color{#35bf28}+4.96\%$
test_compile_add_one_flat[pytree-compile] 0.2866ms 0.1356ms 7.3734 KOps/s 7.3033 KOps/s $\color{#35bf28}+0.96\%$
test_compile_add_one_flat[pytree-eager] 0.6318ms 0.4751ms 2.1048 KOps/s 2.1506 KOps/s $\color{#d91a1a}-2.13\%$
test_compile_add_self_flat[tensordict-eager] 0.3887ms 0.2468ms 4.0522 KOps/s 4.1121 KOps/s $\color{#d91a1a}-1.46\%$
test_compile_add_self_flat[tensordict-compile] 0.2936ms 0.1431ms 6.9900 KOps/s 6.8498 KOps/s $\color{#35bf28}+2.05\%$
test_compile_add_self_flat[tensorclass-eager] 0.2093ms 60.1453μs 16.6264 KOps/s 16.6229 KOps/s $\color{#35bf28}+0.02\%$
test_compile_add_self_flat[tensorclass-compile] 0.2352ms 98.7413μs 10.1275 KOps/s 10.0560 KOps/s $\color{#35bf28}+0.71\%$
test_compile_add_self_flat[pytree-eager] 0.5740ms 0.4025ms 2.4842 KOps/s 2.5543 KOps/s $\color{#d91a1a}-2.74\%$
test_compile_add_self_flat[pytree-compile] 0.3164ms 0.1347ms 7.4227 KOps/s 7.3082 KOps/s $\color{#35bf28}+1.57\%$
test_compile_copy_flat[tensordict-compile] 0.1465ms 17.5968μs 56.8285 KOps/s 56.5239 KOps/s $\color{#35bf28}+0.54\%$
test_compile_copy_flat[tensordict-eager] 0.4286ms 26.5270μs 37.6975 KOps/s 37.9917 KOps/s $\color{#d91a1a}-0.77\%$
test_compile_copy_flat[pytree-compile] 0.4721ms 69.6209μs 14.3635 KOps/s 14.2789 KOps/s $\color{#35bf28}+0.59\%$
test_compile_copy_flat[pytree-eager] 0.4232ms 50.9887μs 19.6122 KOps/s 19.4665 KOps/s $\color{#35bf28}+0.75\%$
test_compile_assign_and_add[tensordict-compile] 1.6592ms 0.4493ms 2.2255 KOps/s 2.0993 KOps/s $\textbf{\color{#35bf28}+6.01\%}$
test_compile_assign_and_add[tensordict-eager] 2.7664ms 2.5337ms 394.6820 Ops/s 397.7666 Ops/s $\color{#d91a1a}-0.78\%$
test_compile_assign_and_add[pytree-compile] 1.6462ms 0.4428ms 2.2581 KOps/s 2.1938 KOps/s $\color{#35bf28}+2.93\%$
test_compile_assign_and_add[pytree-eager] 2.7573ms 2.5439ms 393.0910 Ops/s 396.8322 Ops/s $\color{#d91a1a}-0.94\%$
test_compile_indexing[tensor-tensordict-compile] 0.5556ms 0.1144ms 8.7447 KOps/s 8.6937 KOps/s $\color{#35bf28}+0.59\%$
test_compile_indexing[tensor-tensordict-eager] 0.5897ms 80.9450μs 12.3541 KOps/s 12.2669 KOps/s $\color{#35bf28}+0.71\%$
test_compile_indexing[tensor-tensorclass-compile] 0.5127ms 0.1083ms 9.2373 KOps/s 9.1442 KOps/s $\color{#35bf28}+1.02\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2553ms 66.4236μs 15.0549 KOps/s 14.4048 KOps/s $\color{#35bf28}+4.51\%$
test_compile_indexing[tensor-pytree-compile] 0.3175ms 0.1091ms 9.1642 KOps/s 9.4592 KOps/s $\color{#d91a1a}-3.12\%$
test_compile_indexing[tensor-pytree-eager] 0.2715ms 70.0009μs 14.2855 KOps/s 15.1991 KOps/s $\textbf{\color{#d91a1a}-6.01\%}$
test_compile_indexing[slice-tensordict-compile] 0.2822ms 0.1007ms 9.9282 KOps/s 9.8064 KOps/s $\color{#35bf28}+1.24\%$
test_compile_indexing[slice-tensordict-eager] 0.2008ms 16.7946μs 59.5429 KOps/s 54.6702 KOps/s $\textbf{\color{#35bf28}+8.91\%}$
test_compile_indexing[slice-tensorclass-compile] 0.2430ms 96.6380μs 10.3479 KOps/s 10.2595 KOps/s $\color{#35bf28}+0.86\%$
test_compile_indexing[slice-tensorclass-eager] 0.1458ms 15.6098μs 64.0623 KOps/s 62.2046 KOps/s $\color{#35bf28}+2.99\%$
test_compile_indexing[slice-pytree-compile] 0.2445ms 97.7196μs 10.2334 KOps/s 10.0712 KOps/s $\color{#35bf28}+1.61\%$
test_compile_indexing[slice-pytree-eager] 0.1482ms 15.6611μs 63.8524 KOps/s 62.3706 KOps/s $\color{#35bf28}+2.38\%$
test_compile_indexing[int-tensordict-compile] 0.2813ms 0.1022ms 9.7857 KOps/s 9.7572 KOps/s $\color{#35bf28}+0.29\%$
test_compile_indexing[int-tensordict-eager] 0.6956ms 16.7912μs 59.5552 KOps/s 53.6087 KOps/s $\textbf{\color{#35bf28}+11.09\%}$
test_compile_indexing[int-tensorclass-compile] 0.2596ms 0.1008ms 9.9193 KOps/s 10.1917 KOps/s $\color{#d91a1a}-2.67\%$
test_compile_indexing[int-tensorclass-eager] 0.1984ms 15.5670μs 64.2383 KOps/s 62.7277 KOps/s $\color{#35bf28}+2.41\%$
test_compile_indexing[int-pytree-compile] 0.2816ms 0.1006ms 9.9414 KOps/s 10.1929 KOps/s $\color{#d91a1a}-2.47\%$
test_compile_indexing[int-pytree-eager] 0.1636ms 15.7196μs 63.6148 KOps/s 62.9656 KOps/s $\color{#35bf28}+1.03\%$
test_mod_add[eager] 0.1894ms 35.7958μs 27.9363 KOps/s 27.3368 KOps/s $\color{#35bf28}+2.19\%$
test_mod_add[compile] 0.2411ms 79.6635μs 12.5528 KOps/s 12.1751 KOps/s $\color{#35bf28}+3.10\%$
test_mod_add[compile-overhead] 0.3321ms 0.1674ms 5.9728 KOps/s 5.6646 KOps/s $\textbf{\color{#35bf28}+5.44\%}$
test_mod_wrap[eager] 0.3978ms 0.2461ms 4.0642 KOps/s 4.0801 KOps/s $\color{#d91a1a}-0.39\%$
test_mod_wrap[compile] 0.6838ms 0.2917ms 3.4288 KOps/s 3.5032 KOps/s $\color{#d91a1a}-2.12\%$
test_mod_wrap[compile-overhead] 7.6153ms 3.8294ms 261.1350 Ops/s 264.9230 Ops/s $\color{#d91a1a}-1.43\%$
test_mod_wrap_and_backward[eager] 1.7141ms 1.3422ms 745.0584 Ops/s 698.3217 Ops/s $\textbf{\color{#35bf28}+6.69\%}$
test_mod_wrap_and_backward[compile] 1.4819ms 1.2541ms 797.3875 Ops/s 733.5779 Ops/s $\textbf{\color{#35bf28}+8.70\%}$
test_mod_wrap_and_backward[compile-overhead] 1.4624ms 0.9459ms 1.0572 KOps/s 860.8678 Ops/s $\textbf{\color{#35bf28}+22.80\%}$
test_seq_add[eager] 0.2657ms 0.1113ms 8.9856 KOps/s 9.2229 KOps/s $\color{#d91a1a}-2.57\%$
test_seq_add[compile] 0.2723ms 89.9617μs 11.1158 KOps/s 11.3720 KOps/s $\color{#d91a1a}-2.25\%$
test_seq_add[compile-overhead] 0.2861ms 0.1285ms 7.7813 KOps/s 7.7566 KOps/s $\color{#35bf28}+0.32\%$
test_seq_wrap[eager] 0.5500ms 0.4071ms 2.4563 KOps/s 2.3818 KOps/s $\color{#35bf28}+3.13\%$
test_seq_wrap[compile] 0.4819ms 0.2968ms 3.3691 KOps/s 3.2574 KOps/s $\color{#35bf28}+3.43\%$
test_seq_wrap[compile-overhead] 0.3960ms 0.2244ms 4.4565 KOps/s 4.3582 KOps/s $\color{#35bf28}+2.26\%$
test_func_call_runtime[False-eager] 0.8918ms 0.7352ms 1.3602 KOps/s 1.3581 KOps/s $\color{#35bf28}+0.15\%$
test_func_call_runtime[False-compile] 0.9253ms 0.7323ms 1.3656 KOps/s 1.3417 KOps/s $\color{#35bf28}+1.78\%$
test_func_call_runtime[False-compile-overhead] 0.5078ms 0.3632ms 2.7531 KOps/s 2.6952 KOps/s $\color{#35bf28}+2.15\%$
test_func_call_runtime[True-eager] 1.0871ms 0.8960ms 1.1161 KOps/s 1.0972 KOps/s $\color{#35bf28}+1.72\%$
test_func_call_runtime[True-compile] 0.9233ms 0.7512ms 1.3313 KOps/s 1.3063 KOps/s $\color{#35bf28}+1.91\%$
test_func_call_runtime[True-compile-overhead] 0.5702ms 0.3925ms 2.5478 KOps/s 2.5578 KOps/s $\color{#d91a1a}-0.39\%$
test_func_call_cm_runtime[False-eager] 0.9058ms 0.7432ms 1.3455 KOps/s 1.3190 KOps/s $\color{#35bf28}+2.00\%$
test_func_call_cm_runtime[False-compile] 1.2448ms 0.7456ms 1.3412 KOps/s 1.3383 KOps/s $\color{#35bf28}+0.22\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5064ms 0.3697ms 2.7048 KOps/s 2.6839 KOps/s $\color{#35bf28}+0.78\%$
test_func_call_cm_runtime[True-eager] 1.1673ms 1.0087ms 991.3948 Ops/s 997.1968 Ops/s $\color{#d91a1a}-0.58\%$
test_func_call_cm_runtime[True-compile] 0.9681ms 0.7973ms 1.2542 KOps/s 1.2460 KOps/s $\color{#35bf28}+0.65\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5459ms 0.4102ms 2.4380 KOps/s 2.3796 KOps/s $\color{#35bf28}+2.45\%$
test_vmap_func_call_cm_runtime[eager] 2.5378ms 2.0723ms 482.5621 Ops/s 477.8999 Ops/s $\color{#35bf28}+0.98\%$
test_vmap_func_call_cm_runtime[compile] 0.9701ms 0.7992ms 1.2512 KOps/s 1.2457 KOps/s $\color{#35bf28}+0.44\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5381ms 0.4160ms 2.4037 KOps/s 2.3987 KOps/s $\color{#35bf28}+0.21\%$
test_distributed 2.7125ms 0.3557ms 2.8117 KOps/s 8.7843 KOps/s $\textbf{\color{#d91a1a}-67.99\%}$
test_tdmodule 59.0910μs 17.9513μs 55.7063 KOps/s 54.6504 KOps/s $\color{#35bf28}+1.93\%$
test_tdmodule_dispatch 75.4610μs 32.2175μs 31.0390 KOps/s 30.8092 KOps/s $\color{#35bf28}+0.75\%$
test_tdseq 37.1000μs 17.7671μs 56.2837 KOps/s 55.1826 KOps/s $\color{#35bf28}+2.00\%$
test_tdseq_dispatch 58.2810μs 34.6220μs 28.8833 KOps/s 29.0791 KOps/s $\color{#d91a1a}-0.67\%$
test_instantiation_functorch 1.6660ms 1.5266ms 655.0591 Ops/s 627.2658 Ops/s $\color{#35bf28}+4.43\%$
test_exec_functorch 0.2818ms 0.1418ms 7.0540 KOps/s 6.7572 KOps/s $\color{#35bf28}+4.39\%$
test_exec_functional_call 0.2757ms 0.1377ms 7.2634 KOps/s 7.2086 KOps/s $\color{#35bf28}+0.76\%$
test_exec_td_decorator 0.3723ms 0.1838ms 5.4404 KOps/s 5.3894 KOps/s $\color{#35bf28}+0.95\%$
test_vmap_mlp_speed_decorator[True-True] 0.8594ms 0.6762ms 1.4790 KOps/s 1.4918 KOps/s $\color{#d91a1a}-0.86\%$
test_vmap_mlp_speed_decorator[True-False] 0.8823ms 0.6835ms 1.4630 KOps/s 1.4921 KOps/s $\color{#d91a1a}-1.95\%$
test_vmap_mlp_speed_decorator[False-True] 0.7807ms 0.6180ms 1.6180 KOps/s 1.7152 KOps/s $\textbf{\color{#d91a1a}-5.67\%}$
test_vmap_mlp_speed_decorator[False-False] 0.7643ms 0.6179ms 1.6185 KOps/s 1.7118 KOps/s $\textbf{\color{#d91a1a}-5.45\%}$
test_vmap_transformer_speed_decorator[True-True] 19.9149ms 19.0325ms 52.5418 Ops/s 53.4063 Ops/s $\color{#d91a1a}-1.62\%$
test_vmap_transformer_speed_decorator[True-False] 19.8581ms 19.1663ms 52.1750 Ops/s 53.2579 Ops/s $\color{#d91a1a}-2.03\%$
test_vmap_transformer_speed_decorator[False-True] 19.8237ms 18.9609ms 52.7402 Ops/s 53.7624 Ops/s $\color{#d91a1a}-1.90\%$
test_vmap_transformer_speed_decorator[False-False] 19.7092ms 19.0160ms 52.5874 Ops/s 53.5694 Ops/s $\color{#d91a1a}-1.83\%$
test_to_module_speed[True] 1.1079ms 0.9308ms 1.0744 KOps/s 1.0691 KOps/s $\color{#35bf28}+0.50\%$
test_to_module_speed[False] 1.2979ms 0.8941ms 1.1184 KOps/s 1.0988 KOps/s $\color{#35bf28}+1.78\%$
test_tc_init 0.1825ms 32.4692μs 30.7984 KOps/s 29.5718 KOps/s $\color{#35bf28}+4.15\%$
test_tc_init_nested 0.1492ms 65.0693μs 15.3682 KOps/s 14.6023 KOps/s $\textbf{\color{#35bf28}+5.25\%}$
test_tc_first_layer_tensor 13.5601μs 0.7049μs 1.4187 MOps/s 1.4326 MOps/s $\color{#d91a1a}-0.97\%$
test_tc_first_layer_nontensor 43.4300μs 2.3208μs 430.8829 KOps/s 436.1841 KOps/s $\color{#d91a1a}-1.22\%$
test_tc_second_layer_tensor 24.6002μs 1.4126μs 707.9330 KOps/s 711.2164 KOps/s $\color{#d91a1a}-0.46\%$
test_tc_second_layer_nontensor 97.4710μs 3.0347μs 329.5231 KOps/s 333.9562 KOps/s $\color{#d91a1a}-1.33\%$
test_unbind 0.2633s 10.5487ms 94.7988 Ops/s 146.2318 Ops/s $\textbf{\color{#d91a1a}-35.17\%}$
test_full_like 11.5988ms 10.2838ms 97.2407 Ops/s 96.7579 Ops/s $\color{#35bf28}+0.50\%$
test_zeros_like 5.1602ms 4.5531ms 219.6329 Ops/s 225.4986 Ops/s $\color{#d91a1a}-2.60\%$
test_ones_like 5.2115ms 4.5657ms 219.0248 Ops/s 220.5542 Ops/s $\color{#d91a1a}-0.69\%$
test_clone 13.1463ms 9.9688ms 100.3130 Ops/s 99.8339 Ops/s $\color{#35bf28}+0.48\%$
test_squeeze 60.3210μs 9.0531μs 110.4593 KOps/s 110.7367 KOps/s $\color{#d91a1a}-0.25\%$
test_unsqueeze 0.1545ms 68.2429μs 14.6535 KOps/s 14.0153 KOps/s $\color{#35bf28}+4.55\%$
test_split 0.3923ms 0.1539ms 6.4957 KOps/s 6.2506 KOps/s $\color{#35bf28}+3.92\%$
test_permute 0.3451ms 0.1732ms 5.7744 KOps/s 5.4542 KOps/s $\textbf{\color{#35bf28}+5.87\%}$
test_stack 53.4670ms 52.3686ms 19.0954 Ops/s 19.1109 Ops/s $\color{#d91a1a}-0.08\%$
test_cat 53.6656ms 52.2434ms 19.1412 Ops/s 19.0762 Ops/s $\color{#35bf28}+0.34\%$

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Nov 27, 2024
…n one distribution

ghstack-source-id: 8be9980
Pull Request resolved: #1114
@vmoens added the enhancement label Nov 28, 2024
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Nov 28, 2024
…n one distribution

ghstack-source-id: e94a3e9
Pull Request resolved: #1114
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Nov 28, 2024
…n one distribution

ghstack-source-id: b4f71f9
Pull Request resolved: #1114
[ghstack-poisoned]
@vmoens merged commit e851a5c into gh/vmoens/37/base Dec 2, 2024
49 of 53 checks passed
@vmoens deleted the gh/vmoens/37/head branch December 2, 2024 11:44
Labels: CLA Signed, enhancement
2 participants