[BugFix] Make min/max tensorclasses be interchangeable with PT equivalent by vmoens · Pull Request #1180 · pytorch/tensordict · GitHub

[BugFix] Make min/max tensorclasses be interchangeable with PT equivalent #1180


Merged
merged 2 commits into from
Jan 13, 2025

vmoens (Collaborator) commented Jan 13, 2025

We want torch.min(tensor, dim=dim) and torch.min(tensordict, dim=dim) to be interchangeable.

Currently, the values field is named vals so that it does not conflict with the method of the same name, and indexing works as it does with a tensordict.

This means that a piece of code where we do torch.min(smth, ...)[idx] will actually index the resulting tensordict instead of picking an element of the returned (values, indices) tuple as it would for a tensor, which is surprising.

I therefore suggest we should just use torch.return_types.<type> instead.
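For reference, a minimal sketch of the tensor-side behaviour that the description wants to match. It uses only a plain tensor and shows nothing about the tensordict implementation in this PR; the input values are made up for illustration.

```python
import torch

# Illustrative input (hypothetical values, not taken from the PR).
x = torch.tensor([[3.0, 1.0, 2.0],
                  [0.5, 4.0, 6.0]])

# For a plain tensor, torch.min with a dim argument returns a named tuple
# (torch.return_types.min) with the fields `values` and `indices`.
result = torch.min(x, dim=1)

# Attribute access and positional indexing refer to the same fields,
# so result[0] is the values tensor, not a slice along the batch dimension.
values_by_name = result.values
values_by_index = result[0]
indices_by_index = result[1]

# Tuple unpacking works too, which tensor-based code often relies on.
vals, idx = result
```

Code written against tensors tends to use `result[0]` or tuple unpacking, which is why a return type with different indexing semantics would not be a drop-in replacement.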

facebook-github-bot added the CLA Signed label Jan 13, 2025
vmoens added the bug, Refactor and BC-breaking labels Jan 13, 2025
github-actions bot commented Jan 13, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}14$. Worsened: $\large\color{#d91a1a}7$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 45.7850μs 21.3639μs 46.8079 KOps/s 46.0753 KOps/s $\color{#35bf28}+1.59\%$
test_plain_set_stack_nested 49.8730μs 21.2407μs 47.0795 KOps/s 46.4171 KOps/s $\color{#35bf28}+1.43\%$
test_plain_set_nested_inplace 65.5220μs 22.9899μs 43.4974 KOps/s 42.8984 KOps/s $\color{#35bf28}+1.40\%$
test_plain_set_stack_nested_inplace 69.3490μs 23.1430μs 43.2096 KOps/s 42.8774 KOps/s $\color{#35bf28}+0.77\%$
test_items 29.4750μs 4.1770μs 239.4089 KOps/s 245.4476 KOps/s $\color{#d91a1a}-2.46\%$
test_items_nested 0.5973ms 0.4018ms 2.4887 KOps/s 2.5113 KOps/s $\color{#d91a1a}-0.90\%$
test_items_nested_locked 0.7242ms 0.4037ms 2.4770 KOps/s 2.5029 KOps/s $\color{#d91a1a}-1.04\%$
test_items_nested_leaf 0.1335ms 76.4656μs 13.0778 KOps/s 12.9204 KOps/s $\color{#35bf28}+1.22\%$
test_items_stack_nested 0.6312ms 0.4069ms 2.4577 KOps/s 2.5160 KOps/s $\color{#d91a1a}-2.32\%$
test_items_stack_nested_leaf 0.1553ms 79.1188μs 12.6392 KOps/s 12.4795 KOps/s $\color{#35bf28}+1.28\%$
test_items_stack_nested_locked 0.5781ms 0.4088ms 2.4465 KOps/s 2.4632 KOps/s $\color{#d91a1a}-0.68\%$
test_keys 28.5540μs 3.5530μs 281.4515 KOps/s 276.9913 KOps/s $\color{#35bf28}+1.61\%$
test_keys_nested 0.2654ms 0.1647ms 6.0718 KOps/s 5.9931 KOps/s $\color{#35bf28}+1.31\%$
test_keys_nested_locked 0.6780ms 0.1717ms 5.8233 KOps/s 5.7867 KOps/s $\color{#35bf28}+0.63\%$
test_keys_nested_leaf 0.2806ms 0.1454ms 6.8777 KOps/s 6.8841 KOps/s $\color{#d91a1a}-0.09\%$
test_keys_stack_nested 0.2771ms 0.1650ms 6.0599 KOps/s 5.9925 KOps/s $\color{#35bf28}+1.12\%$
test_keys_stack_nested_leaf 0.2289ms 0.1434ms 6.9741 KOps/s 7.0661 KOps/s $\color{#d91a1a}-1.30\%$
test_keys_stack_nested_locked 0.3139ms 0.1718ms 5.8213 KOps/s 5.8704 KOps/s $\color{#d91a1a}-0.84\%$
test_values 5.8936μs 1.0460μs 956.0039 KOps/s 973.5178 KOps/s $\color{#d91a1a}-1.80\%$
test_values_nested 0.1046ms 63.2333μs 15.8145 KOps/s 15.4965 KOps/s $\color{#35bf28}+2.05\%$
test_values_nested_locked 0.1073ms 63.4736μs 15.7546 KOps/s 16.3150 KOps/s $\color{#d91a1a}-3.43\%$
test_values_nested_leaf 0.1295ms 71.6078μs 13.9650 KOps/s 14.1737 KOps/s $\color{#d91a1a}-1.47\%$
test_values_stack_nested 0.1089ms 64.3981μs 15.5284 KOps/s 15.6484 KOps/s $\color{#d91a1a}-0.77\%$
test_values_stack_nested_leaf 0.1291ms 72.1273μs 13.8644 KOps/s 14.1175 KOps/s $\color{#d91a1a}-1.79\%$
test_values_stack_nested_locked 0.1445ms 64.1465μs 15.5893 KOps/s 15.7138 KOps/s $\color{#d91a1a}-0.79\%$
test_membership 12.7740μs 0.8645μs 1.1568 MOps/s 1.1570 MOps/s $\color{#d91a1a}-0.02\%$
test_membership_nested 26.8800μs 2.9115μs 343.4639 KOps/s 342.8967 KOps/s $\color{#35bf28}+0.17\%$
test_membership_nested_leaf 22.3110μs 2.9302μs 341.2759 KOps/s 336.9527 KOps/s $\color{#35bf28}+1.28\%$
test_membership_stacked_nested 27.7220μs 2.8594μs 349.7202 KOps/s 336.8424 KOps/s $\color{#35bf28}+3.82\%$
test_membership_stacked_nested_leaf 21.6510μs 2.8966μs 345.2304 KOps/s 337.2191 KOps/s $\color{#35bf28}+2.38\%$
test_membership_nested_last 26.1090μs 4.3122μs 231.8976 KOps/s 227.6000 KOps/s $\color{#35bf28}+1.89\%$
test_membership_nested_leaf_last 29.9760μs 4.3159μs 231.7040 KOps/s 225.3721 KOps/s $\color{#35bf28}+2.81\%$
test_membership_stacked_nested_last 30.6170μs 4.3005μs 232.5297 KOps/s 131.7643 KOps/s $\textbf{\color{#35bf28}+76.47\%}$
test_membership_stacked_nested_leaf_last 27.0610μs 4.3477μs 230.0064 KOps/s 131.8631 KOps/s $\textbf{\color{#35bf28}+74.43\%}$
test_nested_getleaf 30.5370μs 10.7700μs 92.8503 KOps/s 91.3908 KOps/s $\color{#35bf28}+1.60\%$
test_nested_get 42.0090μs 10.5718μs 94.5911 KOps/s 96.6096 KOps/s $\color{#d91a1a}-2.09\%$
test_stacked_getleaf 47.5990μs 10.7553μs 92.9775 KOps/s 93.2439 KOps/s $\color{#d91a1a}-0.29\%$
test_stacked_get 29.5550μs 10.1890μs 98.1446 KOps/s 98.1546 KOps/s $\color{#d91a1a}-0.01\%$
test_nested_getitemleaf 40.5960μs 11.4343μs 87.4559 KOps/s 87.0644 KOps/s $\color{#35bf28}+0.45\%$
test_nested_getitem 45.7150μs 10.8083μs 92.5219 KOps/s 93.3283 KOps/s $\color{#d91a1a}-0.86\%$
test_stacked_getitemleaf 29.1650μs 11.3172μs 88.3612 KOps/s 87.9659 KOps/s $\color{#35bf28}+0.45\%$
test_stacked_getitem 35.6160μs 10.6545μs 93.8571 KOps/s 93.6904 KOps/s $\color{#35bf28}+0.18\%$
test_lock_nested 1.0972ms 0.4499ms 2.2226 KOps/s 1.8068 KOps/s $\textbf{\color{#35bf28}+23.01\%}$
test_lock_stack_nested 0.8225ms 0.4271ms 2.3412 KOps/s 2.3836 KOps/s $\color{#d91a1a}-1.78\%$
test_unlock_nested 0.7279ms 0.3683ms 2.7149 KOps/s 2.6726 KOps/s $\color{#35bf28}+1.58\%$
test_unlock_stack_nested 0.5741ms 0.3442ms 2.9056 KOps/s 3.0053 KOps/s $\color{#d91a1a}-3.32\%$
test_flatten_speed 0.2116ms 0.1009ms 9.9076 KOps/s 9.9908 KOps/s $\color{#d91a1a}-0.83\%$
test_unflatten_speed 0.7669ms 0.5285ms 1.8921 KOps/s 1.9225 KOps/s $\color{#d91a1a}-1.59\%$
test_common_ops 4.5318ms 0.8009ms 1.2487 KOps/s 1.2245 KOps/s $\color{#35bf28}+1.98\%$
test_creation 71.2630μs 2.4768μs 403.7413 KOps/s 406.4321 KOps/s $\color{#d91a1a}-0.66\%$
test_creation_empty 35.0550μs 12.4908μs 80.0586 KOps/s 71.9738 KOps/s $\textbf{\color{#35bf28}+11.23\%}$
test_creation_nested_1 70.4220μs 15.6514μs 63.8919 KOps/s 58.6203 KOps/s $\textbf{\color{#35bf28}+8.99\%}$
test_creation_nested_2 51.3750μs 19.8601μs 50.3522 KOps/s 46.2586 KOps/s $\textbf{\color{#35bf28}+8.85\%}$
test_clone 43.0510μs 13.6334μs 73.3491 KOps/s 75.5267 KOps/s $\color{#d91a1a}-2.88\%$
test_getitem[int] 1.0804ms 12.9286μs 77.3480 KOps/s 78.3318 KOps/s $\color{#d91a1a}-1.26\%$
test_getitem[slice_int] 0.1351ms 24.7017μs 40.4831 KOps/s 42.5545 KOps/s $\color{#d91a1a}-4.87\%$
test_getitem[range] 0.1669ms 49.9120μs 20.0353 KOps/s 21.4338 KOps/s $\textbf{\color{#d91a1a}-6.53\%}$
test_getitem[tuple] 0.1281ms 20.0519μs 49.8707 KOps/s 50.2430 KOps/s $\color{#d91a1a}-0.74\%$
test_getitem[list] 0.1622ms 44.2434μs 22.6022 KOps/s 23.6956 KOps/s $\color{#d91a1a}-4.61\%$
test_setitem_dim[int] 47.2280μs 26.0383μs 38.4049 KOps/s 39.6694 KOps/s $\color{#d91a1a}-3.19\%$
test_setitem_dim[slice_int] 86.3510μs 52.2978μs 19.1212 KOps/s 19.9555 KOps/s $\color{#d91a1a}-4.18\%$
test_setitem_dim[range] 0.1121ms 75.1388μs 13.3087 KOps/s 14.1439 KOps/s $\textbf{\color{#d91a1a}-5.90\%}$
test_setitem_dim[tuple] 82.9750μs 40.9353μs 24.4288 KOps/s 25.1049 KOps/s $\color{#d91a1a}-2.69\%$
test_setitem 0.1887ms 21.1944μs 47.1823 KOps/s 46.3976 KOps/s $\color{#35bf28}+1.69\%$
test_set 0.1240ms 20.5565μs 48.6464 KOps/s 47.8659 KOps/s $\color{#35bf28}+1.63\%$
test_set_shared 8.2634ms 0.1708ms 5.8565 KOps/s 5.8399 KOps/s $\color{#35bf28}+0.28\%$
test_update 0.2166ms 23.7587μs 42.0898 KOps/s 39.9829 KOps/s $\textbf{\color{#35bf28}+5.27\%}$
test_update_nested 0.3291ms 34.2988μs 29.1555 KOps/s 28.0732 KOps/s $\color{#35bf28}+3.86\%$
test_update__nested 0.3898ms 34.1101μs 29.3168 KOps/s 29.7982 KOps/s $\color{#d91a1a}-1.62\%$
test_set_nested 70.2010μs 22.9977μs 43.4826 KOps/s 43.9309 KOps/s $\color{#d91a1a}-1.02\%$
test_set_nested_new 97.0810μs 27.8020μs 35.9687 KOps/s 36.2583 KOps/s $\color{#d91a1a}-0.80\%$
test_select 0.2125ms 44.3351μs 22.5555 KOps/s 22.7027 KOps/s $\color{#d91a1a}-0.65\%$
test_select_nested 0.1233ms 63.4581μs 15.7584 KOps/s 15.7941 KOps/s $\color{#d91a1a}-0.23\%$
test_exclude_nested 0.1644ms 82.4927μs 12.1223 KOps/s 12.2674 KOps/s $\color{#d91a1a}-1.18\%$
test_empty[True] 0.5912ms 0.4108ms 2.4345 KOps/s 2.4547 KOps/s $\color{#d91a1a}-0.83\%$
test_empty[False] 6.6950μs 1.3767μs 726.3597 KOps/s 734.8786 KOps/s $\color{#d91a1a}-1.16\%$
test_unbind_speed 0.3957ms 0.2692ms 3.7145 KOps/s 3.7020 KOps/s $\color{#35bf28}+0.34\%$
test_unbind_speed_stack0 0.4583ms 0.2663ms 3.7556 KOps/s 3.8650 KOps/s $\color{#d91a1a}-2.83\%$
test_unbind_speed_stack1 0.1188s 0.8905ms 1.1230 KOps/s 1.4159 KOps/s $\textbf{\color{#d91a1a}-20.69\%}$
test_split 0.1002s 1.7798ms 561.8526 Ops/s 559.9210 Ops/s $\color{#35bf28}+0.34\%$
test_chunk 0.1005s 1.7832ms 560.7776 Ops/s 560.4632 Ops/s $\color{#35bf28}+0.06\%$
test_consolidate_njt[False-None] 9.3788ms 8.0620ms 124.0383 Ops/s 121.4276 Ops/s $\color{#35bf28}+2.15\%$
test_creation[device0] 0.2150ms 90.5330μs 11.0457 KOps/s 10.8171 KOps/s $\color{#35bf28}+2.11\%$
test_creation_from_tensor 3.1909ms 94.7533μs 10.5537 KOps/s 10.6076 KOps/s $\color{#d91a1a}-0.51\%$
test_add_one[memmap_tensor0] 0.2672ms 4.8548μs 205.9800 KOps/s 214.3878 KOps/s $\color{#d91a1a}-3.92\%$
test_contiguous[memmap_tensor0] 21.3100μs 0.5120μs 1.9530 MOps/s 1.9416 MOps/s $\color{#35bf28}+0.59\%$
test_stack[memmap_tensor0] 61.2740μs 3.4266μs 291.8367 KOps/s 288.2386 KOps/s $\color{#35bf28}+1.25\%$
test_memmaptd_index 1.0071ms 0.2392ms 4.1811 KOps/s 4.1878 KOps/s $\color{#d91a1a}-0.16\%$
test_memmaptd_index_astensor 0.5661ms 0.3216ms 3.1093 KOps/s 3.0974 KOps/s $\color{#35bf28}+0.39\%$
test_memmaptd_index_op 1.1463ms 0.5990ms 1.6694 KOps/s 1.6050 KOps/s $\color{#35bf28}+4.01\%$
test_serialize_model 0.1237s 0.1146s 8.7271 Ops/s 8.5426 Ops/s $\color{#35bf28}+2.16\%$
test_serialize_model_pickle 0.4995s 0.3959s 2.5261 Ops/s 2.4875 Ops/s $\color{#35bf28}+1.55\%$
test_serialize_weights 0.2081s 0.1249s 8.0046 Ops/s 8.8617 Ops/s $\textbf{\color{#d91a1a}-9.67\%}$
test_serialize_weights_returnearly 0.1604s 0.1549s 6.4553 Ops/s 6.2246 Ops/s $\color{#35bf28}+3.71\%$
test_serialize_weights_pickle 1.2503s 0.7471s 1.3385 Ops/s 2.5151 Ops/s $\textbf{\color{#d91a1a}-46.78\%}$
test_serialize_weights_filesystem 0.1431s 0.1391s 7.1873 Ops/s 7.1488 Ops/s $\color{#35bf28}+0.54\%$
test_serialize_model_filesystem 0.1479s 0.1399s 7.1501 Ops/s 5.9776 Ops/s $\textbf{\color{#35bf28}+19.62\%}$
test_reshape_pytree 55.4930μs 26.0215μs 38.4298 KOps/s 37.7058 KOps/s $\color{#35bf28}+1.92\%$
test_reshape_td 70.4910μs 32.7718μs 30.5140 KOps/s 30.0521 KOps/s $\color{#35bf28}+1.54\%$
test_view_pytree 77.7350μs 26.3513μs 37.9488 KOps/s 38.4627 KOps/s $\color{#d91a1a}-1.34\%$
test_view_td 93.2640μs 39.8612μs 25.0870 KOps/s 25.4230 KOps/s $\color{#d91a1a}-1.32\%$
test_unbind_pytree 69.8900μs 29.1113μs 34.3509 KOps/s 34.4759 KOps/s $\color{#d91a1a}-0.36\%$
test_unbind_td 98.7649ms 47.4725μs 21.0648 KOps/s 25.1301 KOps/s $\textbf{\color{#d91a1a}-16.18\%}$
test_split_pytree 61.6550μs 28.7520μs 34.7802 KOps/s 34.6235 KOps/s $\color{#35bf28}+0.45\%$
test_split_td 0.2048ms 44.8743μs 22.2845 KOps/s 22.1116 KOps/s $\color{#35bf28}+0.78\%$
test_add_pytree 92.5320μs 34.5903μs 28.9098 KOps/s 28.6679 KOps/s $\color{#35bf28}+0.84\%$
test_add_td 0.2718ms 58.8812μs 16.9834 KOps/s 16.3835 KOps/s $\color{#35bf28}+3.66\%$
test_compile_add_one_nested[tensordict-compile] 0.1142ms 62.2112μs 16.0743 KOps/s 15.9507 KOps/s $\color{#35bf28}+0.77\%$
test_compile_add_one_nested[tensordict-eager] 1.3545ms 0.1714ms 5.8331 KOps/s 5.6824 KOps/s $\color{#35bf28}+2.65\%$
test_compile_add_one_nested[pytree-compile] 0.1088ms 45.7437μs 21.8609 KOps/s 21.7113 KOps/s $\color{#35bf28}+0.69\%$
test_compile_add_one_nested[pytree-eager] 0.2137ms 0.1182ms 8.4634 KOps/s 8.6017 KOps/s $\color{#d91a1a}-1.61\%$
test_compile_copy_nested[tensordict-compile] 59.6410μs 26.1866μs 38.1875 KOps/s 39.1120 KOps/s $\color{#d91a1a}-2.36\%$
test_compile_copy_nested[tensordict-eager] 0.1143ms 58.9627μs 16.9599 KOps/s 17.0211 KOps/s $\color{#d91a1a}-0.36\%$
test_compile_copy_nested[pytree-compile] 0.2410ms 78.6201μs 12.7194 KOps/s 12.9196 KOps/s $\color{#d91a1a}-1.55\%$
test_compile_copy_nested[pytree-eager] 0.1196ms 66.6408μs 15.0058 KOps/s 15.0824 KOps/s $\color{#d91a1a}-0.51\%$
test_compile_add_one_flat[tensordict-compile] 0.3959ms 0.1048ms 9.5452 KOps/s 9.6229 KOps/s $\color{#d91a1a}-0.81\%$
test_compile_add_one_flat[tensordict-eager] 0.5560ms 0.2140ms 4.6732 KOps/s 4.7369 KOps/s $\color{#d91a1a}-1.34\%$
test_compile_add_one_flat[tensorclass-compile] 0.1029ms 44.6563μs 22.3932 KOps/s 21.5593 KOps/s $\color{#35bf28}+3.87\%$
test_compile_add_one_flat[tensorclass-eager] 0.4862ms 66.5947μs 15.0162 KOps/s 15.1269 KOps/s $\color{#d91a1a}-0.73\%$
test_compile_add_one_flat[pytree-compile] 0.2222ms 0.1034ms 9.6710 KOps/s 9.8940 KOps/s $\color{#d91a1a}-2.25\%$
test_compile_add_one_flat[pytree-eager] 0.3159ms 0.1995ms 5.0118 KOps/s 5.0115 KOps/s $+0.01\%$
test_compile_add_self_flat[tensordict-eager] 0.4834ms 0.2321ms 4.3078 KOps/s 4.3051 KOps/s $\color{#35bf28}+0.06\%$
test_compile_add_self_flat[tensordict-compile] 0.2098ms 0.1068ms 9.3597 KOps/s 9.4971 KOps/s $\color{#d91a1a}-1.45\%$
test_compile_add_self_flat[tensorclass-eager] 0.1811ms 63.7368μs 15.6895 KOps/s 15.5979 KOps/s $\color{#35bf28}+0.59\%$
test_compile_add_self_flat[tensorclass-compile] 0.1055ms 46.8521μs 21.3438 KOps/s 21.5927 KOps/s $\color{#d91a1a}-1.15\%$
test_compile_add_self_flat[pytree-eager] 0.2946ms 0.1586ms 6.3041 KOps/s 6.3433 KOps/s $\color{#d91a1a}-0.62\%$
test_compile_add_self_flat[pytree-compile] 0.1973ms 0.1026ms 9.7424 KOps/s 9.7841 KOps/s $\color{#d91a1a}-0.43\%$
test_compile_copy_flat[tensordict-compile] 69.8500μs 20.7649μs 48.1582 KOps/s 47.6660 KOps/s $\color{#35bf28}+1.03\%$
test_compile_copy_flat[tensordict-eager] 0.1304ms 66.2595μs 15.0922 KOps/s 14.8328 KOps/s $\color{#35bf28}+1.75\%$
test_compile_copy_flat[pytree-compile] 0.1485ms 79.7977μs 12.5317 KOps/s 12.6787 KOps/s $\color{#d91a1a}-1.16\%$
test_compile_copy_flat[pytree-eager] 0.1321ms 67.7628μs 14.7573 KOps/s 14.6666 KOps/s $\color{#35bf28}+0.62\%$
test_compile_assign_and_add[tensordict-compile] 0.3504ms 0.2098ms 4.7675 KOps/s 4.9148 KOps/s $\color{#d91a1a}-3.00\%$
test_compile_assign_and_add[tensordict-eager] 1.4703ms 1.3103ms 763.1936 Ops/s 736.5376 Ops/s $\color{#35bf28}+3.62\%$
test_compile_assign_and_add[pytree-compile] 0.3958ms 0.1996ms 5.0106 KOps/s 4.8766 KOps/s $\color{#35bf28}+2.75\%$
test_compile_assign_and_add[pytree-eager] 0.9804ms 0.7647ms 1.3078 KOps/s 1.3041 KOps/s $\color{#35bf28}+0.28\%$
test_compile_assign_and_add_stack[compile] 0.5178ms 0.4373ms 2.2866 KOps/s 2.2394 KOps/s $\color{#35bf28}+2.11\%$
test_compile_assign_and_add_stack[eager] 2.9871ms 2.7361ms 365.4893 Ops/s 360.2304 Ops/s $\color{#35bf28}+1.46\%$
test_compile_indexing[tensor-tensordict-compile] 78.5660μs 35.9511μs 27.8156 KOps/s 27.8321 KOps/s $\color{#d91a1a}-0.06\%$
test_compile_indexing[tensor-tensordict-eager] 0.5331ms 35.0893μs 28.4987 KOps/s 30.9173 KOps/s $\textbf{\color{#d91a1a}-7.82\%}$
test_compile_indexing[tensor-tensorclass-compile] 86.1200μs 29.4913μs 33.9083 KOps/s 34.6830 KOps/s $\color{#d91a1a}-2.23\%$
test_compile_indexing[tensor-tensorclass-eager] 63.7490μs 23.2226μs 43.0615 KOps/s 44.1295 KOps/s $\color{#d91a1a}-2.42\%$
test_compile_indexing[tensor-pytree-compile] 94.7170μs 30.7227μs 32.5492 KOps/s 33.6001 KOps/s $\color{#d91a1a}-3.13\%$
test_compile_indexing[tensor-pytree-eager] 82.1530μs 22.8781μs 43.7099 KOps/s 44.5734 KOps/s $\color{#d91a1a}-1.94\%$
test_compile_indexing[slice-tensordict-compile] 0.1225ms 52.4362μs 19.0708 KOps/s 19.2288 KOps/s $\color{#d91a1a}-0.82\%$
test_compile_indexing[slice-tensordict-eager] 0.5726ms 20.7276μs 48.2447 KOps/s 48.7753 KOps/s $\color{#d91a1a}-1.09\%$
test_compile_indexing[slice-tensorclass-compile] 0.1083ms 44.6234μs 22.4098 KOps/s 22.9586 KOps/s $\color{#d91a1a}-2.39\%$
test_compile_indexing[slice-tensorclass-eager] 57.0060μs 18.6265μs 53.6869 KOps/s 53.7502 KOps/s $\color{#d91a1a}-0.12\%$
test_compile_indexing[slice-pytree-compile] 0.1233ms 45.0802μs 22.1827 KOps/s 22.2700 KOps/s $\color{#d91a1a}-0.39\%$
test_compile_indexing[slice-pytree-eager] 57.0370μs 18.6225μs 53.6986 KOps/s 53.4201 KOps/s $\color{#35bf28}+0.52\%$
test_compile_indexing[int-tensordict-compile] 0.1440ms 54.2499μs 18.4332 KOps/s 18.7795 KOps/s $\color{#d91a1a}-1.84\%$
test_compile_indexing[int-tensordict-eager] 0.9374ms 20.7098μs 48.2863 KOps/s 49.8212 KOps/s $\color{#d91a1a}-3.08\%$
test_compile_indexing[int-tensorclass-compile] 0.1184ms 44.7384μs 22.3522 KOps/s 22.2553 KOps/s $\color{#35bf28}+0.43\%$
test_compile_indexing[int-tensorclass-eager] 48.9810μs 18.6259μs 53.6885 KOps/s 54.0844 KOps/s $\color{#d91a1a}-0.73\%$
test_compile_indexing[int-pytree-compile] 0.1099ms 44.8198μs 22.3116 KOps/s 22.2180 KOps/s $\color{#35bf28}+0.42\%$
test_compile_indexing[int-pytree-eager] 64.9310μs 18.6523μs 53.6127 KOps/s 54.3562 KOps/s $\color{#d91a1a}-1.37\%$
test_mod_add[eager] 82.3340μs 35.9200μs 27.8396 KOps/s 26.8124 KOps/s $\color{#35bf28}+3.83\%$
test_mod_add[compile] 92.2520μs 47.3071μs 21.1385 KOps/s 20.0013 KOps/s $\textbf{\color{#35bf28}+5.69\%}$
test_mod_add[compile-overhead] 0.1352ms 47.5457μs 21.0324 KOps/s 20.1083 KOps/s $\color{#35bf28}+4.60\%$
test_mod_wrap[eager] 0.4129ms 0.2203ms 4.5400 KOps/s 4.5428 KOps/s $\color{#d91a1a}-0.06\%$
test_mod_wrap[compile] 0.3332ms 0.2001ms 4.9974 KOps/s 4.8556 KOps/s $\color{#35bf28}+2.92\%$
test_mod_wrap[compile-overhead] 0.3211ms 0.1987ms 5.0332 KOps/s 4.8781 KOps/s $\color{#35bf28}+3.18\%$
test_mod_wrap_and_backward[eager] 13.8122ms 11.3188ms 88.3487 Ops/s 87.0884 Ops/s $\color{#35bf28}+1.45\%$
test_mod_wrap_and_backward[compile] 13.7228ms 12.0534ms 82.9639 Ops/s 81.7858 Ops/s $\color{#35bf28}+1.44\%$
test_mod_wrap_and_backward[compile-overhead] 15.9730ms 12.2430ms 81.6794 Ops/s 80.1956 Ops/s $\color{#35bf28}+1.85\%$
test_seq_add[eager] 0.2539ms 0.1176ms 8.5034 KOps/s 8.1509 KOps/s $\color{#35bf28}+4.32\%$
test_seq_add[compile] 0.1807ms 61.6303μs 16.2258 KOps/s 15.7312 KOps/s $\color{#35bf28}+3.14\%$
test_seq_add[compile-overhead] 0.1330ms 58.5407μs 17.0821 KOps/s 16.2294 KOps/s $\textbf{\color{#35bf28}+5.25\%}$
test_seq_wrap[eager] 0.6954ms 0.4398ms 2.2735 KOps/s 2.2165 KOps/s $\color{#35bf28}+2.57\%$
test_seq_wrap[compile] 0.4240ms 0.2207ms 4.5319 KOps/s 4.4145 KOps/s $\color{#35bf28}+2.66\%$
test_seq_wrap[compile-overhead] 0.3971ms 0.2212ms 4.5216 KOps/s 4.4432 KOps/s $\color{#35bf28}+1.76\%$
test_func_call_runtime[False-eager] 0.8882ms 0.5352ms 1.8686 KOps/s 1.9182 KOps/s $\color{#d91a1a}-2.58\%$
test_func_call_runtime[False-compile] 0.5048ms 0.4155ms 2.4068 KOps/s 2.3841 KOps/s $\color{#35bf28}+0.95\%$
test_func_call_runtime[False-compile-overhead] 0.6319ms 0.4144ms 2.4134 KOps/s 2.3914 KOps/s $\color{#35bf28}+0.92\%$
test_func_call_runtime[True-eager] 0.9426ms 0.7477ms 1.3374 KOps/s 1.3440 KOps/s $\color{#d91a1a}-0.49\%$
test_func_call_runtime[True-compile] 0.6060ms 0.4545ms 2.2003 KOps/s 2.1854 KOps/s $\color{#35bf28}+0.68\%$
test_func_call_runtime[True-compile-overhead] 0.8300ms 0.4573ms 2.1869 KOps/s 2.2027 KOps/s $\color{#d91a1a}-0.72\%$
test_func_call_cm_runtime[False-eager] 0.9469ms 0.5369ms 1.8626 KOps/s 1.9065 KOps/s $\color{#d91a1a}-2.30\%$
test_func_call_cm_runtime[False-compile] 0.7840ms 0.4141ms 2.4148 KOps/s 2.3843 KOps/s $\color{#35bf28}+1.28\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5183ms 0.4157ms 2.4054 KOps/s 2.3879 KOps/s $\color{#35bf28}+0.73\%$
test_func_call_cm_runtime[True-eager] 1.4726ms 0.8933ms 1.1195 KOps/s 1.1407 KOps/s $\color{#d91a1a}-1.86\%$
test_func_call_cm_runtime[True-compile] 0.6991ms 0.4803ms 2.0819 KOps/s 2.0616 KOps/s $\color{#35bf28}+0.99\%$
test_func_call_cm_runtime[True-compile-overhead] 0.7705ms 0.4819ms 2.0753 KOps/s 2.0683 KOps/s $\color{#35bf28}+0.34\%$
test_vmap_func_call_cm_runtime[eager] 3.0299ms 1.8983ms 526.7801 Ops/s 535.1799 Ops/s $\color{#d91a1a}-1.57\%$
test_vmap_func_call_cm_runtime[compile] 0.8419ms 0.5034ms 1.9865 KOps/s 1.9347 KOps/s $\color{#35bf28}+2.67\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.8075ms 0.5092ms 1.9639 KOps/s 1.9331 KOps/s $\color{#35bf28}+1.59\%$
test_distributed 0.2803ms 0.1216ms 8.2239 KOps/s 7.8832 KOps/s $\color{#35bf28}+4.32\%$
test_tdmodule 45.5650μs 27.5469μs 36.3017 KOps/s 36.1692 KOps/s $\color{#35bf28}+0.37\%$
test_tdmodule_dispatch 86.9220μs 50.3345μs 19.8671 KOps/s 19.6384 KOps/s $\color{#35bf28}+1.16\%$
test_tdseq 53.4300μs 29.5993μs 33.7845 KOps/s 33.4568 KOps/s $\color{#35bf28}+0.98\%$
test_tdseq_dispatch 0.1161ms 57.5052μs 17.3897 KOps/s 17.8416 KOps/s $\color{#d91a1a}-2.53\%$
test_instantiation_functorch 1.6895ms 1.4948ms 668.9695 Ops/s 664.7355 Ops/s $\color{#35bf28}+0.64\%$
test_exec_functorch 0.3397ms 0.1800ms 5.5548 KOps/s 5.6011 KOps/s $\color{#d91a1a}-0.83\%$
test_exec_functional_call 0.3279ms 0.1731ms 5.7769 KOps/s 5.8513 KOps/s $\color{#d91a1a}-1.27\%$
test_exec_td_decorator 0.4757ms 0.2339ms 4.2751 KOps/s 4.4178 KOps/s $\color{#d91a1a}-3.23\%$
test_vmap_mlp_speed_decorator[True-True] 1.0327ms 0.6532ms 1.5310 KOps/s 1.5119 KOps/s $\color{#35bf28}+1.26\%$
test_vmap_mlp_speed_decorator[True-False] 1.0055ms 0.6520ms 1.5337 KOps/s 1.5227 KOps/s $\color{#35bf28}+0.72\%$
test_vmap_mlp_speed_decorator[False-True] 0.6347ms 0.5234ms 1.9107 KOps/s 1.8932 KOps/s $\color{#35bf28}+0.92\%$
test_vmap_mlp_speed_decorator[False-False] 0.7192ms 0.5261ms 1.9007 KOps/s 1.9161 KOps/s $\color{#d91a1a}-0.80\%$
test_to_module_speed[True] 2.4147ms 1.3607ms 734.9138 Ops/s 748.7631 Ops/s $\color{#d91a1a}-1.85\%$
test_to_module_speed[False] 1.9082ms 1.3024ms 767.8368 Ops/s 771.5199 Ops/s $\color{#d91a1a}-0.48\%$
test_tc_init 88.1440μs 48.3606μs 20.6780 KOps/s 20.4630 KOps/s $\color{#35bf28}+1.05\%$
test_tc_init_nested 0.2041ms 96.6036μs 10.3516 KOps/s 10.0034 KOps/s $\color{#35bf28}+3.48\%$
test_tc_first_layer_tensor 20.4180μs 1.5400μs 649.3356 KOps/s 639.6566 KOps/s $\color{#35bf28}+1.51\%$
test_tc_first_layer_nontensor 29.7150μs 4.7161μs 212.0377 KOps/s 208.4981 KOps/s $\color{#35bf28}+1.70\%$
test_tc_second_layer_tensor 20.8290μs 2.8752μs 347.8041 KOps/s 339.1213 KOps/s $\color{#35bf28}+2.56\%$
test_tc_second_layer_nontensor 33.1720μs 6.1436μs 162.7708 KOps/s 162.0784 KOps/s $\color{#35bf28}+0.43\%$
test_unbind 0.2132s 12.9382ms 77.2904 Ops/s 81.2820 Ops/s $\color{#d91a1a}-4.91\%$
test_full_like 7.5190ms 6.7805ms 147.4810 Ops/s 84.3504 Ops/s $\textbf{\color{#35bf28}+74.84\%}$
test_zeros_like 2.9850ms 2.5399ms 393.7198 Ops/s 141.5302 Ops/s $\textbf{\color{#35bf28}+178.19\%}$
test_ones_like 3.7535ms 3.1233ms 320.1721 Ops/s 130.8286 Ops/s $\textbf{\color{#35bf28}+144.73\%}$
test_clone 5.1514ms 4.6967ms 212.9153 Ops/s 108.4320 Ops/s $\textbf{\color{#35bf28}+96.36\%}$
test_squeeze 57.4870μs 11.9715μs 83.5318 KOps/s 82.3664 KOps/s $\color{#35bf28}+1.41\%$
test_unsqueeze 0.1751ms 90.4587μs 11.0548 KOps/s 10.9748 KOps/s $\color{#35bf28}+0.73\%$
test_split 0.5218ms 0.1928ms 5.1856 KOps/s 5.2888 KOps/s $\color{#d91a1a}-1.95\%$
test_permute 0.2876ms 0.2026ms 4.9357 KOps/s 5.0370 KOps/s $\color{#d91a1a}-2.01\%$
test_stack 29.9871ms 24.2683ms 41.2060 Ops/s 41.4415 Ops/s $\color{#d91a1a}-0.57\%$
test_cat 27.9788ms 23.8683ms 41.8966 Ops/s 42.1395 Ops/s $\color{#d91a1a}-0.58\%$
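
The Change column appears to be the relative difference between the Ops and Ops on Repo HEAD columns. This is inferred from the rows above rather than stated by the benchmark bot, so the helper below (a hypothetical name) is only a sketch for re-deriving the percentages.

```python
def percent_change(ops_pr: float, ops_head: float) -> float:
    """Relative throughput change of the PR versus repo HEAD, in percent.

    Assumes both arguments are expressed in the same unit.
    """
    return (ops_pr / ops_head - 1.0) * 100.0


# (Ops, Ops on Repo HEAD) pairs copied from the CPU table above;
# each pair uses the unit of its own row.
rows = {
    "test_plain_set_nested": (46.8079, 46.0753),
    "test_membership_stacked_nested_last": (232.5297, 131.7643),
    "test_serialize_weights_pickle": (1.3385, 2.5151),
}

for name, (ops_pr, ops_head) in rows.items():
    print(f"{name}: {percent_change(ops_pr, ops_head):+.2f}%")
```

A positive value means the PR branch ran more operations per second than HEAD; single-run differences of a few percent are within what the table itself shows as noise across unrelated tests.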

github-actions bot commented Jan 13, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}23$. Worsened: $\large\color{#d91a1a}7$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 33.1410μs 13.1672μs 75.9460 KOps/s 76.9369 KOps/s $\color{#d91a1a}-1.29\%$
test_plain_set_stack_nested 51.2710μs 13.3996μs 74.6293 KOps/s 76.1106 KOps/s $\color{#d91a1a}-1.95\%$
test_plain_set_nested_inplace 37.8310μs 14.2065μs 70.3904 KOps/s 71.7231 KOps/s $\color{#d91a1a}-1.86\%$
test_plain_set_stack_nested_inplace 47.2710μs 14.2898μs 69.9799 KOps/s 71.0565 KOps/s $\color{#d91a1a}-1.52\%$
test_items 26.3500μs 2.8783μs 347.4283 KOps/s 346.2519 KOps/s $\color{#35bf28}+0.34\%$
test_items_nested 0.4378ms 0.3633ms 2.7523 KOps/s 2.7695 KOps/s $\color{#d91a1a}-0.62\%$
test_items_nested_locked 1.4931ms 0.3667ms 2.7272 KOps/s 2.7918 KOps/s $\color{#d91a1a}-2.31\%$
test_items_nested_leaf 83.7010μs 57.8412μs 17.2887 KOps/s 17.1273 KOps/s $\color{#35bf28}+0.94\%$
test_items_stack_nested 0.4140ms 0.3595ms 2.7819 KOps/s 2.7795 KOps/s $\color{#35bf28}+0.09\%$
test_items_stack_nested_leaf 89.2010μs 59.2008μs 16.8917 KOps/s 16.7981 KOps/s $\color{#35bf28}+0.56\%$
test_items_stack_nested_locked 0.4194ms 0.3642ms 2.7458 KOps/s 2.7542 KOps/s $\color{#d91a1a}-0.30\%$
test_keys 36.6210μs 3.5042μs 285.3697 KOps/s 285.2765 KOps/s $\color{#35bf28}+0.03\%$
test_keys_nested 0.1278ms 86.8878μs 11.5091 KOps/s 11.5424 KOps/s $\color{#d91a1a}-0.29\%$
test_keys_nested_locked 0.7625ms 93.3157μs 10.7163 KOps/s 10.7825 KOps/s $\color{#d91a1a}-0.61\%$
test_keys_nested_leaf 0.1153ms 79.0760μs 12.6461 KOps/s 12.7890 KOps/s $\color{#d91a1a}-1.12\%$
test_keys_stack_nested 0.1270ms 89.3853μs 11.1875 KOps/s 11.3925 KOps/s $\color{#d91a1a}-1.80\%$
test_keys_stack_nested_leaf 0.1256ms 79.7774μs 12.5349 KOps/s 12.6610 KOps/s $\color{#d91a1a}-1.00\%$
test_keys_stack_nested_locked 0.1253ms 94.9889μs 10.5275 KOps/s 10.6881 KOps/s $\color{#d91a1a}-1.50\%$
test_values 4.8600μs 0.8459μs 1.1822 MOps/s 1.1736 MOps/s $\color{#35bf28}+0.73\%$
test_values_nested 71.6320μs 37.9224μs 26.3696 KOps/s 26.4686 KOps/s $\color{#d91a1a}-0.37\%$
test_values_nested_locked 71.6510μs 40.0400μs 24.9750 KOps/s 25.0272 KOps/s $\color{#d91a1a}-0.21\%$
test_values_nested_leaf 73.7810μs 42.0965μs 23.7549 KOps/s 23.6796 KOps/s $\color{#35bf28}+0.32\%$
test_values_stack_nested 64.8010μs 38.8212μs 25.7592 KOps/s 26.0273 KOps/s $\color{#d91a1a}-1.03\%$
test_values_stack_nested_leaf 71.5310μs 42.9355μs 23.2908 KOps/s 23.3644 KOps/s $\color{#d91a1a}-0.32\%$
test_values_stack_nested_locked 72.3520μs 40.9106μs 24.4435 KOps/s 24.8900 KOps/s $\color{#d91a1a}-1.79\%$
test_membership 2.1710μs 0.5530μs 1.8082 MOps/s 1.7899 MOps/s $\color{#35bf28}+1.02\%$
test_membership_nested 21.6155μs 2.0308μs 492.4119 KOps/s 479.0345 KOps/s $\color{#35bf28}+2.79\%$
test_membership_nested_leaf 20.6555μs 2.0534μs 487.0088 KOps/s 484.3623 KOps/s $\color{#35bf28}+0.55\%$
test_membership_stacked_nested 26.2100μs 2.1084μs 474.2938 KOps/s 482.3634 KOps/s $\color{#d91a1a}-1.67\%$
test_membership_stacked_nested_leaf 36.7210μs 2.1254μs 470.4959 KOps/s 486.6219 KOps/s $\color{#d91a1a}-3.31\%$
test_membership_nested_last 36.5300μs 3.0966μs 322.9370 KOps/s 322.6812 KOps/s $\color{#35bf28}+0.08\%$
test_membership_nested_leaf_last 32.4810μs 3.1207μs 320.4440 KOps/s 319.6491 KOps/s $\color{#35bf28}+0.25\%$
test_membership_stacked_nested_last 26.8400μs 3.0990μs 322.6814 KOps/s 277.7125 KOps/s $\textbf{\color{#35bf28}+16.19\%}$
test_membership_stacked_nested_leaf_last 36.2410μs 3.1028μs 322.2869 KOps/s 281.0187 KOps/s $\textbf{\color{#35bf28}+14.69\%}$
test_nested_getleaf 32.7710μs 6.1742μs 161.9651 KOps/s 164.1860 KOps/s $\color{#d91a1a}-1.35\%$
test_nested_get 24.3610μs 5.7794μs 173.0275 KOps/s 172.9115 KOps/s $\color{#35bf28}+0.07\%$
test_stacked_getleaf 33.6110μs 6.1107μs 163.6481 KOps/s 162.3732 KOps/s $\color{#35bf28}+0.79\%$
test_stacked_get 35.6910μs 5.7955μs 172.5488 KOps/s 172.1676 KOps/s $\color{#35bf28}+0.22\%$
test_nested_getitemleaf 40.5810μs 6.3943μs 156.3886 KOps/s 155.2775 KOps/s $\color{#35bf28}+0.72\%$
test_nested_getitem 29.2910μs 6.1505μs 162.5875 KOps/s 160.8702 KOps/s $\color{#35bf28}+1.07\%$
test_stacked_getitemleaf 34.0710μs 6.4002μs 156.2460 KOps/s 155.1966 KOps/s $\color{#35bf28}+0.68\%$
test_stacked_getitem 31.4700μs 6.0872μs 164.2792 KOps/s 163.9538 KOps/s $\color{#35bf28}+0.20\%$
test_lock_nested 0.8327ms 0.3748ms 2.6679 KOps/s 2.5523 KOps/s $\color{#35bf28}+4.53\%$
test_lock_stack_nested 0.4048ms 0.3480ms 2.8734 KOps/s 2.8683 KOps/s $\color{#35bf28}+0.18\%$
test_unlock_nested 0.6748ms 0.3140ms 3.1850 KOps/s 3.0858 KOps/s $\color{#35bf28}+3.21\%$
test_unlock_stack_nested 0.3569ms 0.2865ms 3.4900 KOps/s 3.4939 KOps/s $\color{#d91a1a}-0.11\%$
test_flatten_speed 0.1630ms 73.1338μs 13.6736 KOps/s 13.2863 KOps/s $\color{#35bf28}+2.92\%$
test_unflatten_speed 0.3734ms 0.3215ms 3.1102 KOps/s 3.0976 KOps/s $\color{#35bf28}+0.41\%$
test_common_ops 1.6673ms 0.6403ms 1.5617 KOps/s 1.5261 KOps/s $\color{#35bf28}+2.33\%$
test_creation 99.1320μs 1.8102μs 552.4227 KOps/s 550.3948 KOps/s $\color{#35bf28}+0.37\%$
test_creation_empty 32.1110μs 9.9934μs 100.0658 KOps/s 104.5160 KOps/s $\color{#d91a1a}-4.26\%$
test_creation_nested_1 46.0710μs 11.6253μs 86.0192 KOps/s 90.1190 KOps/s $\color{#d91a1a}-4.55\%$
test_creation_nested_2 44.0000μs 14.4245μs 69.3266 KOps/s 71.7138 KOps/s $\color{#d91a1a}-3.33\%$
test_clone 0.1096ms 10.5363μs 94.9104 KOps/s 88.4457 KOps/s $\textbf{\color{#35bf28}+7.31\%}$
test_getitem[int] 1.4822ms 10.7348μs 93.1546 KOps/s 88.6732 KOps/s $\textbf{\color{#35bf28}+5.05\%}$
test_getitem[slice_int] 0.1127ms 20.9365μs 47.7635 KOps/s 45.5175 KOps/s $\color{#35bf28}+4.93\%$
test_getitem[range] 0.1359ms 37.2819μs 26.8226 KOps/s 25.1222 KOps/s $\textbf{\color{#35bf28}+6.77\%}$
test_getitem[tuple] 0.1145ms 18.4406μs 54.2283 KOps/s 51.9699 KOps/s $\color{#35bf28}+4.35\%$
test_getitem[list] 0.1341ms 34.0078μs 29.4051 KOps/s 28.2716 KOps/s $\color{#35bf28}+4.01\%$
test_setitem_dim[int] 44.3300μs 19.8405μs 50.4020 KOps/s 46.3549 KOps/s $\textbf{\color{#35bf28}+8.73\%}$
test_setitem_dim[slice_int] 63.7510μs 39.8087μs 25.1202 KOps/s 24.6860 KOps/s $\color{#35bf28}+1.76\%$
test_setitem_dim[range] 78.3210μs 54.3439μs 18.4013 KOps/s 16.9360 KOps/s $\textbf{\color{#35bf28}+8.65\%}$
test_setitem_dim[tuple] 62.9110μs 33.7745μs 29.6082 KOps/s 28.3757 KOps/s $\color{#35bf28}+4.34\%$
test_setitem 50.3210μs 15.9101μs 62.8530 KOps/s 59.5161 KOps/s $\textbf{\color{#35bf28}+5.61\%}$
test_set 52.6010μs 15.6011μs 64.0982 KOps/s 60.8434 KOps/s $\textbf{\color{#35bf28}+5.35\%}$
test_set_shared 1.6117ms 0.1532ms 6.5262 KOps/s 6.4133 KOps/s $\color{#35bf28}+1.76\%$
test_update 0.4284ms 19.5149μs 51.2430 KOps/s 50.9373 KOps/s $\color{#35bf28}+0.60\%$
test_update_nested 0.1117ms 24.9031μs 40.1557 KOps/s 39.8117 KOps/s $\color{#35bf28}+0.86\%$
test_update__nested 0.9809ms 25.7858μs 38.7811 KOps/s 37.3463 KOps/s $\color{#35bf28}+3.84\%$
test_set_nested 0.1097ms 16.6857μs 59.9316 KOps/s 56.2684 KOps/s $\textbf{\color{#35bf28}+6.51\%}$
test_set_nested_new 0.1136ms 18.7787μs 53.2518 KOps/s 49.7760 KOps/s $\textbf{\color{#35bf28}+6.98\%}$
test_select 0.1251ms 31.1292μs 32.1242 KOps/s 30.9550 KOps/s $\color{#35bf28}+3.78\%$
test_select_nested 78.1820μs 44.6942μs 22.3743 KOps/s 21.7010 KOps/s $\color{#35bf28}+3.10\%$
test_exclude_nested 94.3810μs 63.3709μs 15.7801 KOps/s 15.4461 KOps/s $\color{#35bf28}+2.16\%$
test_empty[True] 0.3528ms 0.2899ms 3.4490 KOps/s 3.3483 KOps/s $\color{#35bf28}+3.01\%$
test_empty[False] 3.0741μs 0.8264μs 1.2100 MOps/s 1.1917 MOps/s $\color{#35bf28}+1.54\%$
test_to 95.3010μs 58.2210μs 17.1759 KOps/s 16.6799 KOps/s $\color{#35bf28}+2.97\%$
test_to_nonblocking 0.9331ms 49.6532μs 20.1397 KOps/s 19.2121 KOps/s $\color{#35bf28}+4.83\%$
test_unbind_speed 0.2961ms 0.2411ms 4.1472 KOps/s 4.0752 KOps/s $\color{#35bf28}+1.77\%$
test_unbind_speed_stack0 0.3037ms 0.2430ms 4.1151 KOps/s 4.0960 KOps/s $\color{#35bf28}+0.47\%$
test_unbind_speed_stack1 0.7279ms 0.6170ms 1.6206 KOps/s 1.4895 KOps/s $\textbf{\color{#35bf28}+8.81\%}$
test_split 94.8621ms 1.5923ms 628.0334 Ops/s 617.0621 Ops/s $\color{#35bf28}+1.78\%$
test_chunk 96.3398ms 1.5866ms 630.2649 Ops/s 614.0984 Ops/s $\color{#35bf28}+2.63\%$
test_consolidate[False-None] 96.9243ms 2.9654ms 337.2253 Ops/s 331.0742 Ops/s $\color{#35bf28}+1.86\%$
test_consolidate[default-None] 1.7719ms 1.7024ms 587.4111 Ops/s 581.2844 Ops/s $\color{#35bf28}+1.05\%$
test_consolidate[reduce-overhead-None] 1.8402ms 1.7490ms 571.7597 Ops/s 568.4360 Ops/s $\color{#35bf28}+0.58\%$
test_consolidate_njt[False-None] 6.7631ms 6.6679ms 149.9721 Ops/s 147.0207 Ops/s $\color{#35bf28}+2.01\%$
test_to[False-False-None] 1.8657ms 1.7536ms 570.2630 Ops/s 566.6680 Ops/s $\color{#35bf28}+0.63\%$
test_to[True-False-None] 1.6126ms 1.3475ms 742.1368 Ops/s 727.6365 Ops/s $\color{#35bf28}+1.99\%$
test_to[within-False-None] 4.4475ms 4.2274ms 236.5504 Ops/s 236.8649 Ops/s $\color{#d91a1a}-0.13\%$
test_to[True-default-None] 5.6253ms 5.3793ms 185.8963 Ops/s 179.1620 Ops/s $\color{#35bf28}+3.76\%$
test_to_njt[False-False-None] 7.1182ms 6.9478ms 143.9297 Ops/s 137.8951 Ops/s $\color{#35bf28}+4.38\%$
test_to_njt[True-False-None] 5.6984ms 5.4697ms 182.8243 Ops/s 172.1907 Ops/s $\textbf{\color{#35bf28}+6.18\%}$
test_to_njt[within-False-None] 12.4158ms 12.3379ms 81.0512 Ops/s 78.0716 Ops/s $\color{#35bf28}+3.82\%$
test_creation[device0] 0.4482ms 81.4189μs 12.2822 KOps/s 12.2705 KOps/s $\color{#35bf28}+0.10\%$
test_creation_from_tensor 0.5202ms 86.0660μs 11.6190 KOps/s 11.6831 KOps/s $\color{#d91a1a}-0.55\%$
test_add_one[memmap_tensor0] 0.4046ms 6.9545μs 143.7924 KOps/s 139.8314 KOps/s $\color{#35bf28}+2.83\%$
test_contiguous[memmap_tensor0] 2.0075μs 0.4206μs 2.3775 MOps/s 2.3422 MOps/s $\color{#35bf28}+1.50\%$
test_stack[memmap_tensor0] 21.1010μs 4.3498μs 229.8970 KOps/s 217.2023 KOps/s $\textbf{\color{#35bf28}+5.84\%}$
test_memmaptd_index 0.5947ms 0.2555ms 3.9135 KOps/s 3.7571 KOps/s $\color{#35bf28}+4.16\%$
test_memmaptd_index_astensor 0.5790ms 0.3206ms 3.1193 KOps/s 3.0504 KOps/s $\color{#35bf28}+2.26\%$
test_memmaptd_index_op 1.0237ms 0.6276ms 1.5933 KOps/s 1.5815 KOps/s $\color{#35bf28}+0.75\%$
test_serialize_model 0.1324s 0.1314s 7.6082 Ops/s 7.6217 Ops/s $\color{#d91a1a}-0.18\%$
test_serialize_model_pickle 1.3658s 1.2143s 0.8235 Ops/s 0.8408 Ops/s $\color{#d91a1a}-2.06\%$
test_serialize_weights 0.1323s 0.1312s 7.6246 Ops/s 7.6359 Ops/s $\color{#d91a1a}-0.15\%$
test_serialize_weights_returnearly 0.5272s 70.9942ms 14.0857 Ops/s 23.2237 Ops/s $\textbf{\color{#d91a1a}-39.35\%}$
test_serialize_weights_pickle 1.3766s 1.2181s 0.8209 Ops/s 0.8353 Ops/s $\color{#d91a1a}-1.72\%$
test_reshape_pytree 51.8300μs 22.3762μs 44.6903 KOps/s 43.7499 KOps/s $\color{#35bf28}+2.15\%$
test_reshape_td 61.7610μs 26.4250μs 37.8429 KOps/s 35.0596 KOps/s $\textbf{\color{#35bf28}+7.94\%}$
test_view_pytree 60.6010μs 22.5108μs 44.4231 KOps/s 44.5394 KOps/s $\color{#d91a1a}-0.26\%$
test_view_td 82.4210μs 34.4164μs 29.0559 KOps/s 30.3924 KOps/s $\color{#d91a1a}-4.40\%$
test_unbind_pytree 59.7610μs 28.3745μs 35.2430 KOps/s 35.2234 KOps/s $\color{#35bf28}+0.06\%$
test_unbind_td 0.8106ms 37.1267μs 26.9348 KOps/s 26.9896 KOps/s $\color{#d91a1a}-0.20\%$
test_split_pytree 69.8310μs 30.7634μs 32.5061 KOps/s 32.7749 KOps/s $\color{#d91a1a}-0.82\%$
test_split_td 0.9567ms 40.6295μs 24.6126 KOps/s 15.0883 KOps/s $\textbf{\color{#35bf28}+63.12\%}$
test_add_pytree 0.2590ms 35.6268μs 28.0688 KOps/s 28.0633 KOps/s $\color{#35bf28}+0.02\%$
test_add_td 0.1003ms 53.9831μs 18.5243 KOps/s 18.9053 KOps/s $\color{#d91a1a}-2.02\%$
test_compile_add_one_nested[tensordict-compile] 0.1774ms 0.1230ms 8.1311 KOps/s 7.8236 KOps/s $\color{#35bf28}+3.93\%$
test_compile_add_one_nested[tensordict-eager] 0.2270ms 0.1324ms 7.5501 KOps/s 7.4287 KOps/s $\color{#35bf28}+1.64\%$
test_compile_add_one_nested[pytree-compile] 0.2184ms 97.7207μs 10.2332 KOps/s 10.0313 KOps/s $\color{#35bf28}+2.01\%$
test_compile_add_one_nested[pytree-eager] 1.5792ms 0.1504ms 6.6505 KOps/s 6.4394 KOps/s $\color{#35bf28}+3.28\%$
test_compile_copy_nested[tensordict-compile] 53.9310μs 22.3790μs 44.6847 KOps/s 44.9113 KOps/s $\color{#d91a1a}-0.50\%$
test_compile_copy_nested[tensordict-eager] 66.0110μs 30.0107μs 33.3215 KOps/s 33.3736 KOps/s $\color{#d91a1a}-0.16\%$
test_compile_copy_nested[pytree-compile] 0.3080ms 66.3352μs 15.0749 KOps/s 15.1146 KOps/s $\color{#d91a1a}-0.26\%$
test_compile_copy_nested[pytree-eager] 91.2310μs 50.0478μs 19.9809 KOps/s 19.5634 KOps/s $\color{#35bf28}+2.13\%$
test_compile_add_one_flat[tensordict-compile] 0.1933ms 0.1450ms 6.8978 KOps/s 6.8107 KOps/s $\color{#35bf28}+1.28\%$
test_compile_add_one_flat[tensordict-eager] 0.3124ms 0.2227ms 4.4907 KOps/s 4.5379 KOps/s $\color{#d91a1a}-1.04\%$
test_compile_add_one_flat[tensorclass-compile] 0.1553ms 0.1001ms 9.9874 KOps/s 9.8475 KOps/s $\color{#35bf28}+1.42\%$
test_compile_add_one_flat[tensorclass-eager] 0.1133ms 56.2310μs 17.7838 KOps/s 17.0824 KOps/s $\color{#35bf28}+4.11\%$
test_compile_add_one_flat[pytree-compile] 0.2583ms 0.1383ms 7.2285 KOps/s 7.2009 KOps/s $\color{#35bf28}+0.38\%$
test_compile_add_one_flat[pytree-eager] 0.5795ms 0.4815ms 2.0770 KOps/s 1.9817 KOps/s $\color{#35bf28}+4.81\%$
test_compile_add_self_flat[tensordict-eager] 0.3822ms 0.2667ms 3.7494 KOps/s 3.7156 KOps/s $\color{#35bf28}+0.91\%$
test_compile_add_self_flat[tensordict-compile] 0.2000ms 0.1463ms 6.8354 KOps/s 6.8386 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_add_self_flat[tensorclass-eager] 0.1551ms 70.5683μs 14.1707 KOps/s 14.2847 KOps/s $\color{#d91a1a}-0.80\%$
test_compile_add_self_flat[tensorclass-compile] 0.1410ms 0.1021ms 9.7981 KOps/s 9.8422 KOps/s $\color{#d91a1a}-0.45\%$
test_compile_add_self_flat[pytree-eager] 0.4651ms 0.4117ms 2.4289 KOps/s 2.4275 KOps/s $\color{#35bf28}+0.06\%$
test_compile_add_self_flat[pytree-compile] 0.1858ms 0.1383ms 7.2305 KOps/s 7.2668 KOps/s $\color{#d91a1a}-0.50\%$
test_compile_copy_flat[tensordict-compile] 0.1064ms 18.8990μs 52.9129 KOps/s 47.3706 KOps/s $\textbf{\color{#35bf28}+11.70\%}$
test_compile_copy_flat[tensordict-eager] 71.7010μs 32.0694μs 31.1824 KOps/s 32.0372 KOps/s $\color{#d91a1a}-2.67\%$
test_compile_copy_flat[pytree-compile] 0.2017ms 71.3074μs 14.0238 KOps/s 13.9760 KOps/s $\color{#35bf28}+0.34\%$
test_compile_copy_flat[pytree-eager] 0.1247ms 52.1009μs 19.1935 KOps/s 19.0819 KOps/s $\color{#35bf28}+0.59\%$
test_compile_assign_and_add[tensordict-compile] 1.6518ms 0.3968ms 2.5200 KOps/s 2.1989 KOps/s $\textbf{\color{#35bf28}+14.60\%}$
test_compile_assign_and_add[tensordict-eager] 2.7935ms 2.6642ms 375.3425 Ops/s 363.2949 Ops/s $\color{#35bf28}+3.32\%$
test_compile_assign_and_add[pytree-compile] 1.6084ms 0.3833ms 2.6092 KOps/s 2.2473 KOps/s $\textbf{\color{#35bf28}+16.10\%}$
test_compile_assign_and_add[pytree-eager] 2.7611ms 2.6519ms 377.0878 Ops/s 361.3618 Ops/s $\color{#35bf28}+4.35\%$
test_compile_indexing[tensor-tensordict-compile] 0.1627ms 0.1180ms 8.4711 KOps/s 8.5513 KOps/s $\color{#d91a1a}-0.94\%$
test_compile_indexing[tensor-tensordict-eager] 0.5806ms 82.8968μs 12.0632 KOps/s 11.7558 KOps/s $\color{#35bf28}+2.61\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1673ms 0.1106ms 9.0394 KOps/s 8.7708 KOps/s $\color{#35bf28}+3.06\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1209ms 72.6218μs 13.7700 KOps/s 13.3919 KOps/s $\color{#35bf28}+2.82\%$
test_compile_indexing[tensor-pytree-compile] 0.1612ms 0.1129ms 8.8566 KOps/s 8.6791 KOps/s $\color{#35bf28}+2.05\%$
test_compile_indexing[tensor-pytree-eager] 0.1643ms 72.0711μs 13.8752 KOps/s 14.0318 KOps/s $\color{#d91a1a}-1.12\%$
test_compile_indexing[slice-tensordict-compile] 0.1865ms 0.1023ms 9.7735 KOps/s 9.7914 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_indexing[slice-tensordict-eager] 0.1479ms 17.5185μs 57.0825 KOps/s 56.0986 KOps/s $\color{#35bf28}+1.75\%$
test_compile_indexing[slice-tensorclass-compile] 0.1471ms 98.1396μs 10.1896 KOps/s 10.1936 KOps/s $\color{#d91a1a}-0.04\%$
test_compile_indexing[slice-tensorclass-eager] 63.2710μs 15.9058μs 62.8701 KOps/s 61.6795 KOps/s $\color{#35bf28}+1.93\%$
test_compile_indexing[slice-pytree-compile] 0.1547ms 0.1023ms 9.7741 KOps/s 10.1658 KOps/s $\color{#d91a1a}-3.85\%$
test_compile_indexing[slice-pytree-eager] 49.9210μs 15.8695μs 63.0140 KOps/s 61.4844 KOps/s $\color{#35bf28}+2.49\%$
test_compile_indexing[int-tensordict-compile] 0.1579ms 0.1039ms 9.6215 KOps/s 9.7904 KOps/s $\color{#d91a1a}-1.72\%$
test_compile_indexing[int-tensordict-eager] 0.6130ms 16.8658μs 59.2915 KOps/s 55.4049 KOps/s $\textbf{\color{#35bf28}+7.01\%}$
test_compile_indexing[int-tensorclass-compile] 0.1771ms 0.1001ms 9.9942 KOps/s 10.1697 KOps/s $\color{#d91a1a}-1.73\%$
test_compile_indexing[int-tensorclass-eager] 59.8510μs 15.8605μs 63.0496 KOps/s 61.6568 KOps/s $\color{#35bf28}+2.26\%$
test_compile_indexing[int-pytree-compile] 0.1435ms 98.1812μs 10.1852 KOps/s 10.1908 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_indexing[int-pytree-eager] 0.1579ms 16.3954μs 60.9928 KOps/s 61.4958 KOps/s $\color{#d91a1a}-0.82\%$
test_mod_add[eager] 84.2610μs 39.9101μs 25.0563 KOps/s 24.4837 KOps/s $\color{#35bf28}+2.34\%$
test_mod_add[compile] 0.1469ms 81.4805μs 12.2729 KOps/s 12.0834 KOps/s $\color{#35bf28}+1.57\%$
test_mod_add[compile-overhead] 0.3276ms 0.1696ms 5.8947 KOps/s 5.6464 KOps/s $\color{#35bf28}+4.40\%$
test_mod_wrap[eager] 0.3393ms 0.2630ms 3.8021 KOps/s 3.8379 KOps/s $\color{#d91a1a}-0.93\%$
test_mod_wrap[compile] 0.3844ms 0.3009ms 3.3229 KOps/s 3.4163 KOps/s $\color{#d91a1a}-2.73\%$
test_mod_wrap[compile-overhead] 7.5471ms 3.9321ms 254.3149 Ops/s 253.1250 Ops/s $\color{#35bf28}+0.47\%$
test_mod_wrap_and_backward[eager] 1.4860ms 1.3835ms 722.8110 Ops/s 666.5200 Ops/s $\textbf{\color{#35bf28}+8.45\%}$
test_mod_wrap_and_backward[compile] 2.0730ms 1.3896ms 719.6263 Ops/s 767.2417 Ops/s $\textbf{\color{#d91a1a}-6.21\%}$
test_mod_wrap_and_backward[compile-overhead] 1.5399ms 1.0443ms 957.5525 Ops/s 1.0450 KOps/s $\textbf{\color{#d91a1a}-8.37\%}$
test_seq_add[eager] 0.1967ms 0.1295ms 7.7212 KOps/s 8.1714 KOps/s $\textbf{\color{#d91a1a}-5.51\%}$
test_seq_add[compile] 0.1687ms 94.7735μs 10.5515 KOps/s 10.9323 KOps/s $\color{#d91a1a}-3.48\%$
test_seq_add[compile-overhead] 0.1722ms 0.1355ms 7.3775 KOps/s 7.5429 KOps/s $\color{#d91a1a}-2.19\%$
test_seq_wrap[eager] 0.5395ms 0.4446ms 2.2494 KOps/s 2.2690 KOps/s $\color{#d91a1a}-0.86\%$
test_seq_wrap[compile] 0.3937ms 0.3085ms 3.2417 KOps/s 3.2439 KOps/s $\color{#d91a1a}-0.07\%$
test_seq_wrap[compile-overhead] 0.2753ms 0.2280ms 4.3864 KOps/s 4.2525 KOps/s $\color{#35bf28}+3.15\%$
test_func_call_runtime[False-eager] 1.1553ms 0.7477ms 1.3374 KOps/s 1.2403 KOps/s $\textbf{\color{#35bf28}+7.83\%}$
test_func_call_runtime[False-compile] 1.1781ms 0.7557ms 1.3234 KOps/s 1.2968 KOps/s $\color{#35bf28}+2.05\%$
test_func_call_runtime[False-compile-overhead] 0.7831ms 0.3716ms 2.6907 KOps/s 2.6942 KOps/s $\color{#d91a1a}-0.13\%$
test_func_call_runtime[True-eager] 1.3402ms 0.9209ms 1.0859 KOps/s 1.0520 KOps/s $\color{#35bf28}+3.22\%$
test_func_call_runtime[True-compile] 1.1847ms 0.7790ms 1.2837 KOps/s 1.2769 KOps/s $\color{#35bf28}+0.53\%$
test_func_call_runtime[True-compile-overhead] 0.4388ms 0.3904ms 2.5613 KOps/s 2.5467 KOps/s $\color{#35bf28}+0.57\%$
test_func_call_cm_runtime[False-eager] 1.1404ms 0.7446ms 1.3431 KOps/s 1.3125 KOps/s $\color{#35bf28}+2.33\%$
test_func_call_cm_runtime[False-compile] 1.1873ms 0.7563ms 1.3223 KOps/s 1.3066 KOps/s $\color{#35bf28}+1.20\%$
test_func_call_cm_runtime[False-compile-overhead] 0.7860ms 0.3745ms 2.6699 KOps/s 2.6727 KOps/s $\color{#d91a1a}-0.11\%$
test_func_call_cm_runtime[True-eager] 1.4226ms 1.0173ms 983.0230 Ops/s 965.3397 Ops/s $\color{#35bf28}+1.83\%$
test_func_call_cm_runtime[True-compile] 1.2204ms 0.8025ms 1.2460 KOps/s 1.2315 KOps/s $\color{#35bf28}+1.18\%$
test_func_call_cm_runtime[True-compile-overhead] 0.8361ms 0.4199ms 2.3815 KOps/s 2.3810 KOps/s $\color{#35bf28}+0.02\%$
test_vmap_func_call_cm_runtime[eager] 2.5842ms 2.1214ms 471.3965 Ops/s 465.0074 Ops/s $\color{#35bf28}+1.37\%$
test_vmap_func_call_cm_runtime[compile] 1.2282ms 0.8216ms 1.2171 KOps/s 1.1970 KOps/s $\color{#35bf28}+1.68\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4753ms 0.4201ms 2.3803 KOps/s 2.3595 KOps/s $\color{#35bf28}+0.88\%$
test_distributed 3.0645ms 0.1900ms 5.2624 KOps/s 8.5050 KOps/s $\textbf{\color{#d91a1a}-38.13\%}$
test_tdmodule 32.9200μs 21.5719μs 46.3566 KOps/s 44.7893 KOps/s $\color{#35bf28}+3.50\%$
test_tdmodule_dispatch 0.3170ms 39.0432μs 25.6126 KOps/s 26.1912 KOps/s $\color{#d91a1a}-2.21\%$
test_tdseq 44.5410μs 22.3642μs 44.7144 KOps/s 44.8855 KOps/s $\color{#d91a1a}-0.38\%$
test_tdseq_dispatch 64.6510μs 42.0255μs 23.7951 KOps/s 24.1190 KOps/s $\color{#d91a1a}-1.34\%$
test_instantiation_functorch 2.0054ms 1.5903ms 628.8045 Ops/s 617.8167 Ops/s $\color{#35bf28}+1.78\%$
test_exec_functorch 0.2088ms 0.1464ms 6.8299 KOps/s 6.6406 KOps/s $\color{#35bf28}+2.85\%$
test_exec_functional_call 0.5623ms 0.1419ms 7.0472 KOps/s 6.8464 KOps/s $\color{#35bf28}+2.93\%$
test_exec_td_decorator 0.6023ms 0.1899ms 5.2657 KOps/s 5.1062 KOps/s $\color{#35bf28}+3.12\%$
test_vmap_mlp_speed_decorator[True-True] 1.1189ms 0.6984ms 1.4318 KOps/s 1.4128 KOps/s $\color{#35bf28}+1.34\%$
test_vmap_mlp_speed_decorator[True-False] 1.1222ms 0.7038ms 1.4208 KOps/s 1.4188 KOps/s $\color{#35bf28}+0.14\%$
test_vmap_mlp_speed_decorator[False-True] 1.0153ms 0.6026ms 1.6594 KOps/s 1.6276 KOps/s $\color{#35bf28}+1.96\%$
test_vmap_mlp_speed_decorator[False-False] 1.0295ms 0.6046ms 1.6539 KOps/s 1.6238 KOps/s $\color{#35bf28}+1.85\%$
test_vmap_transformer_speed_decorator[True-True] 19.9436ms 19.5727ms 51.0915 Ops/s 51.0358 Ops/s $\color{#35bf28}+0.11\%$
test_vmap_transformer_speed_decorator[True-False] 19.9455ms 19.5600ms 51.1247 Ops/s 50.8979 Ops/s $\color{#35bf28}+0.45\%$
test_vmap_transformer_speed_decorator[False-True] 19.8617ms 19.4811ms 51.3319 Ops/s 51.3254 Ops/s $\color{#35bf28}+0.01\%$
test_vmap_transformer_speed_decorator[False-False] 19.8034ms 19.4444ms 51.4287 Ops/s 51.3091 Ops/s $\color{#35bf28}+0.23\%$
test_to_module_speed[True] 1.4213ms 0.9948ms 1.0053 KOps/s 1.0068 KOps/s $\color{#d91a1a}-0.15\%$
test_to_module_speed[False] 1.5789ms 0.9752ms 1.0255 KOps/s 1.0168 KOps/s $\color{#35bf28}+0.85\%$
test_tc_init 84.2910μs 39.1895μs 25.5170 KOps/s 26.1732 KOps/s $\color{#d91a1a}-2.51\%$
test_tc_init_nested 0.4839ms 78.9615μs 12.6644 KOps/s 12.8657 KOps/s $\color{#d91a1a}-1.56\%$
test_tc_first_layer_tensor 27.0910μs 0.8525μs 1.1730 MOps/s 1.1968 MOps/s $\color{#d91a1a}-1.99\%$
test_tc_first_layer_nontensor 23.8000μs 2.3665μs 422.5627 KOps/s 418.4437 KOps/s $\color{#35bf28}+0.98\%$
test_tc_second_layer_tensor 10.0233μs 1.4948μs 668.9949 KOps/s 667.4538 KOps/s $\color{#35bf28}+0.23\%$
test_tc_second_layer_nontensor 0.4049ms 3.1440μs 318.0619 KOps/s 312.8243 KOps/s $\color{#35bf28}+1.67\%$
test_unbind 0.2252s 10.3118ms 96.9764 Ops/s 140.3213 Ops/s $\textbf{\color{#d91a1a}-30.89\%}$
test_full_like 9.5844ms 9.1338ms 109.4834 Ops/s 105.6893 Ops/s $\color{#35bf28}+3.59\%$
test_zeros_like 9.4861ms 7.2149ms 138.6019 Ops/s 236.2978 Ops/s $\textbf{\color{#d91a1a}-41.34\%}$
test_ones_like 4.9191ms 4.1260ms 242.3636 Ops/s 231.3831 Ops/s $\color{#35bf28}+4.75\%$
test_clone 6.7628ms 6.3624ms 157.1734 Ops/s 109.5405 Ops/s $\textbf{\color{#35bf28}+43.48\%}$
test_squeeze 58.8810μs 9.7983μs 102.0582 KOps/s 105.3883 KOps/s $\color{#d91a1a}-3.16\%$
test_unsqueeze 0.1237ms 74.6836μs 13.3898 KOps/s 13.3095 KOps/s $\color{#35bf28}+0.60\%$
test_split 0.6043ms 0.1677ms 5.9620 KOps/s 6.1494 KOps/s $\color{#d91a1a}-3.05\%$
test_permute 0.5980ms 0.1800ms 5.5566 KOps/s 5.4843 KOps/s $\color{#35bf28}+1.32\%$
test_stack 51.2792ms 50.5913ms 19.7662 Ops/s 19.6976 Ops/s $\color{#35bf28}+0.35\%$
test_cat 51.0398ms 50.7346ms 19.7104 Ops/s 19.7042 Ops/s $\color{#35bf28}+0.03\%$

@vmoens vmoens merged commit 4473485 into main Jan 13, 2025
36 of 44 checks passed
@vmoens vmoens deleted the fix-min-max-api branch January 13, 2025 10:35
Labels: BC-breaking, bug, CLA Signed, Refactor