[BugFix] Fix MISSING check in tensorclass by vmoens · Pull Request #1275 · pytorch/tensordict · GitHub

[BugFix] Fix MISSING check in tensorclass #1275

Merged 3 commits on Mar 24, 2025

@vmoens (Collaborator) commented Mar 24, 2025

No description provided.

@facebook-github-bot added the CLA Signed label Mar 24, 2025
@vmoens added the bug label Mar 24, 2025

github-actions bot commented Mar 24, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}35$. Worsened: $\large\color{#d91a1a}10$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 97.9210μs 20.6689μs 48.3819 KOps/s 45.2015 KOps/s $\textbf{\color{#35bf28}+7.04\%}$
test_plain_set_stack_nested 77.0270μs 20.8560μs 47.9478 KOps/s 45.2010 KOps/s $\textbf{\color{#35bf28}+6.08\%}$
test_plain_set_nested_inplace 56.1950μs 22.8107μs 43.8390 KOps/s 42.1346 KOps/s $\color{#35bf28}+4.05\%$
test_plain_set_stack_nested_inplace 90.5090μs 22.5818μs 44.2834 KOps/s 41.8619 KOps/s $\textbf{\color{#35bf28}+5.78\%}$
test_items 30.1560μs 4.2026μs 237.9467 KOps/s 237.8291 KOps/s $\color{#35bf28}+0.05\%$
test_items_nested 0.5446ms 0.4038ms 2.4763 KOps/s 2.4514 KOps/s $\color{#35bf28}+1.01\%$
test_items_nested_locked 0.8548ms 0.4038ms 2.4766 KOps/s 2.4390 KOps/s $\color{#35bf28}+1.54\%$
test_items_nested_leaf 0.1533ms 76.5137μs 13.0696 KOps/s 13.0977 KOps/s $\color{#d91a1a}-0.21\%$
test_items_stack_nested 0.5857ms 0.4049ms 2.4699 KOps/s 2.4406 KOps/s $\color{#35bf28}+1.20\%$
test_items_stack_nested_leaf 0.1468ms 77.1210μs 12.9666 KOps/s 12.8904 KOps/s $\color{#35bf28}+0.59\%$
test_items_stack_nested_locked 0.6375ms 0.4068ms 2.4583 KOps/s 2.4377 KOps/s $\color{#35bf28}+0.84\%$
test_keys 29.9960μs 3.6806μs 271.6962 KOps/s 289.5612 KOps/s $\textbf{\color{#d91a1a}-6.17\%}$
test_keys_nested 0.2893ms 0.1619ms 6.1755 KOps/s 6.0227 KOps/s $\color{#35bf28}+2.54\%$
test_keys_nested_locked 1.9789ms 0.1689ms 5.9200 KOps/s 5.8398 KOps/s $\color{#35bf28}+1.37\%$
test_keys_nested_leaf 0.2776ms 0.1427ms 7.0090 KOps/s 6.8907 KOps/s $\color{#35bf28}+1.72\%$
test_keys_stack_nested 0.2954ms 0.1633ms 6.1222 KOps/s 6.0028 KOps/s $\color{#35bf28}+1.99\%$
test_keys_stack_nested_leaf 0.2632ms 0.1442ms 6.9348 KOps/s 6.9372 KOps/s $\color{#d91a1a}-0.04\%$
test_keys_stack_nested_locked 0.3016ms 0.1700ms 5.8816 KOps/s 5.7996 KOps/s $\color{#35bf28}+1.41\%$
test_values 9.5740μs 1.0376μs 963.7441 KOps/s 955.6522 KOps/s $\color{#35bf28}+0.85\%$
test_values_nested 0.1562ms 66.5247μs 15.0320 KOps/s 15.0651 KOps/s $\color{#d91a1a}-0.22\%$
test_values_nested_locked 0.1591ms 65.9433μs 15.1645 KOps/s 14.2702 KOps/s $\textbf{\color{#35bf28}+6.27\%}$
test_values_nested_leaf 0.1752ms 74.0118μs 13.5114 KOps/s 13.5440 KOps/s $\color{#d91a1a}-0.24\%$
test_values_stack_nested 0.1748ms 66.0375μs 15.1429 KOps/s 15.0205 KOps/s $\color{#35bf28}+0.81\%$
test_values_stack_nested_leaf 0.1546ms 72.9649μs 13.7052 KOps/s 13.6280 KOps/s $\color{#35bf28}+0.57\%$
test_values_stack_nested_locked 0.1515ms 65.7928μs 15.1992 KOps/s 15.1482 KOps/s $\color{#35bf28}+0.34\%$
test_membership 23.7140μs 0.8670μs 1.1534 MOps/s 1.1246 MOps/s $\color{#35bf28}+2.56\%$
test_membership_nested 49.5730μs 2.9248μs 341.9052 KOps/s 340.3454 KOps/s $\color{#35bf28}+0.46\%$
test_membership_nested_leaf 55.0730μs 2.8876μs 346.3121 KOps/s 331.1532 KOps/s $\color{#35bf28}+4.58\%$
test_membership_stacked_nested 30.2170μs 2.8598μs 349.6702 KOps/s 341.9282 KOps/s $\color{#35bf28}+2.26\%$
test_membership_stacked_nested_leaf 24.9570μs 2.9011μs 344.6933 KOps/s 341.3192 KOps/s $\color{#35bf28}+0.99\%$
test_membership_nested_last 49.8740μs 4.2745μs 233.9449 KOps/s 230.6183 KOps/s $\color{#35bf28}+1.44\%$
test_membership_nested_leaf_last 25.6080μs 4.2943μs 232.8651 KOps/s 229.1624 KOps/s $\color{#35bf28}+1.62\%$
test_membership_stacked_nested_last 52.2870μs 4.2911μs 233.0425 KOps/s 231.7001 KOps/s $\color{#35bf28}+0.58\%$
test_membership_stacked_nested_leaf_last 23.4940μs 4.3339μs 230.7390 KOps/s 230.5065 KOps/s $\color{#35bf28}+0.10\%$
test_nested_getleaf 76.1920μs 17.4796μs 57.2096 KOps/s 57.0861 KOps/s $\color{#35bf28}+0.22\%$
test_nested_get 79.2180μs 16.5274μs 60.5055 KOps/s 59.9402 KOps/s $\color{#35bf28}+0.94\%$
test_stacked_getleaf 68.7990μs 17.2291μs 58.0414 KOps/s 57.4393 KOps/s $\color{#35bf28}+1.05\%$
test_stacked_get 56.1950μs 16.6292μs 60.1353 KOps/s 60.1737 KOps/s $\color{#d91a1a}-0.06\%$
test_nested_getitemleaf 61.6960μs 18.0604μs 55.3698 KOps/s 54.8623 KOps/s $\color{#35bf28}+0.93\%$
test_nested_getitem 70.1420μs 17.1170μs 58.4214 KOps/s 58.4412 KOps/s $\color{#d91a1a}-0.03\%$
test_stacked_getitemleaf 59.8020μs 17.9501μs 55.7099 KOps/s 55.9705 KOps/s $\color{#d91a1a}-0.47\%$
test_stacked_getitem 61.7460μs 17.0351μs 58.7023 KOps/s 58.6838 KOps/s $\color{#35bf28}+0.03\%$
test_lock_nested 0.5729ms 0.4177ms 2.3943 KOps/s 2.4084 KOps/s $\color{#d91a1a}-0.59\%$
test_lock_stack_nested 0.7468ms 0.4255ms 2.3500 KOps/s 2.3351 KOps/s $\color{#35bf28}+0.64\%$
test_unlock_nested 0.4793ms 0.3429ms 2.9166 KOps/s 2.9455 KOps/s $\color{#d91a1a}-0.98\%$
test_unlock_stack_nested 0.4855ms 0.3452ms 2.8969 KOps/s 2.8394 KOps/s $\color{#35bf28}+2.03\%$
test_flatten_speed 0.2058ms 99.5273μs 10.0475 KOps/s 9.9253 KOps/s $\color{#35bf28}+1.23\%$
test_unflatten_speed 0.7978ms 0.5996ms 1.6678 KOps/s 1.6826 KOps/s $\color{#d91a1a}-0.88\%$
test_common_ops 5.4891ms 0.8045ms 1.2431 KOps/s 1.1831 KOps/s $\textbf{\color{#35bf28}+5.07\%}$
test_creation 30.2160μs 2.5023μs 399.6253 KOps/s 394.9098 KOps/s $\color{#35bf28}+1.19\%$
test_creation_empty 62.3460μs 11.6482μs 85.8502 KOps/s 75.7943 KOps/s $\textbf{\color{#35bf28}+13.27\%}$
test_creation_nested_1 45.0050μs 14.6006μs 68.4904 KOps/s 61.8141 KOps/s $\textbf{\color{#35bf28}+10.80\%}$
test_creation_nested_2 50.1740μs 19.2907μs 51.8383 KOps/s 47.7303 KOps/s $\textbf{\color{#35bf28}+8.61\%}$
test_clone 84.2070μs 13.5935μs 73.5644 KOps/s 72.3864 KOps/s $\color{#35bf28}+1.63\%$
test_getitem[int] 0.8127ms 12.9286μs 77.3477 KOps/s 76.0219 KOps/s $\color{#35bf28}+1.74\%$
test_getitem[slice_int] 0.1399ms 25.4231μs 39.3342 KOps/s 39.3544 KOps/s $\color{#d91a1a}-0.05\%$
test_getitem[range] 0.1841ms 50.1779μs 19.9291 KOps/s 19.6291 KOps/s $\color{#35bf28}+1.53\%$
test_getitem[tuple] 0.1580ms 20.6891μs 48.3345 KOps/s 48.3248 KOps/s $\color{#35bf28}+0.02\%$
test_getitem[list] 0.1693ms 45.1809μs 22.1332 KOps/s 21.6361 KOps/s $\color{#35bf28}+2.30\%$
test_setitem_dim[int] 50.8350μs 26.0784μs 38.3459 KOps/s 37.6854 KOps/s $\color{#35bf28}+1.75\%$
test_setitem_dim[slice_int] 81.2320μs 53.2786μs 18.7693 KOps/s 19.3051 KOps/s $\color{#d91a1a}-2.78\%$
test_setitem_dim[range] 0.1277ms 76.5317μs 13.0665 KOps/s 12.9517 KOps/s $\color{#35bf28}+0.89\%$
test_setitem_dim[tuple] 65.1320μs 41.3575μs 24.1794 KOps/s 24.4818 KOps/s $\color{#d91a1a}-1.24\%$
test_setitem 93.8960μs 20.8155μs 48.0411 KOps/s 45.9842 KOps/s $\color{#35bf28}+4.47\%$
test_set 79.9490μs 20.1299μs 49.6774 KOps/s 46.8901 KOps/s $\textbf{\color{#35bf28}+5.94\%}$
test_set_shared 0.3432ms 0.1775ms 5.6328 KOps/s 5.3188 KOps/s $\textbf{\color{#35bf28}+5.90\%}$
test_update 0.1252ms 25.9632μs 38.5161 KOps/s 36.1651 KOps/s $\textbf{\color{#35bf28}+6.50\%}$
test_update_nested 0.1174ms 41.5005μs 24.0961 KOps/s 22.9793 KOps/s $\color{#35bf28}+4.86\%$
test_update__nested 0.5207ms 34.0532μs 29.3658 KOps/s 28.7460 KOps/s $\color{#35bf28}+2.16\%$
test_set_nested 73.1770μs 22.0967μs 45.2556 KOps/s 43.0662 KOps/s $\textbf{\color{#35bf28}+5.08\%}$
test_set_nested_new 0.1223ms 26.9995μs 37.0377 KOps/s 36.1658 KOps/s $\color{#35bf28}+2.41\%$
test_select 0.1204ms 43.8680μs 22.7957 KOps/s 22.2767 KOps/s $\color{#35bf28}+2.33\%$
test_select_nested 0.3019ms 64.3903μs 15.5303 KOps/s 15.5105 KOps/s $\color{#35bf28}+0.13\%$
test_exclude_nested 0.2046ms 79.7106μs 12.5454 KOps/s 12.5574 KOps/s $\color{#d91a1a}-0.10\%$
test_empty[True] 0.7673ms 0.4054ms 2.4667 KOps/s 2.4939 KOps/s $\color{#d91a1a}-1.09\%$
test_empty[False] 12.7538μs 1.3731μs 728.2835 KOps/s 718.7341 KOps/s $\color{#35bf28}+1.33\%$
test_unbind_speed 0.3604ms 0.2738ms 3.6525 KOps/s 3.6070 KOps/s $\color{#35bf28}+1.26\%$
test_unbind_speed_stack0 0.4855ms 0.2711ms 3.6890 KOps/s 3.6169 KOps/s $\color{#35bf28}+1.99\%$
test_unbind_speed_stack1 0.1121s 0.7340ms 1.3625 KOps/s 1.1772 KOps/s $\textbf{\color{#35bf28}+15.74\%}$
test_split 0.1123s 1.7608ms 567.9173 Ops/s 620.2549 Ops/s $\textbf{\color{#d91a1a}-8.44\%}$
test_chunk 0.1042s 1.7455ms 572.9002 Ops/s 553.8195 Ops/s $\color{#35bf28}+3.45\%$
test_consolidate_njt[False-None] 8.5752ms 8.1077ms 123.3393 Ops/s 108.2858 Ops/s $\textbf{\color{#35bf28}+13.90\%}$
test_creation[device0] 0.2179ms 90.6296μs 11.0339 KOps/s 10.7297 KOps/s $\color{#35bf28}+2.84\%$
test_creation_from_tensor 4.3777ms 93.5754μs 10.6866 KOps/s 10.2829 KOps/s $\color{#35bf28}+3.93\%$
test_add_one[memmap_tensor0] 83.1950μs 4.7599μs 210.0895 KOps/s 182.4587 KOps/s $\textbf{\color{#35bf28}+15.14\%}$
test_contiguous[memmap_tensor0] 12.9440μs 0.5045μs 1.9823 MOps/s 1.9877 MOps/s $\color{#d91a1a}-0.27\%$
test_stack[memmap_tensor0] 26.7900μs 3.3933μs 294.6992 KOps/s 275.2778 KOps/s $\textbf{\color{#35bf28}+7.06\%}$
test_memmaptd_index 0.3561ms 0.2320ms 4.3112 KOps/s 4.0794 KOps/s $\textbf{\color{#35bf28}+5.68\%}$
test_memmaptd_index_astensor 0.4797ms 0.3175ms 3.1500 KOps/s 3.0681 KOps/s $\color{#35bf28}+2.67\%$
test_memmaptd_index_op 1.0289ms 0.5759ms 1.7364 KOps/s 1.5645 KOps/s $\textbf{\color{#35bf28}+10.98\%}$
test_serialize_model 0.2157s 0.1328s 7.5303 Ops/s 8.4322 Ops/s $\textbf{\color{#d91a1a}-10.70\%}$
test_serialize_model_pickle 0.4928s 0.3955s 2.5285 Ops/s 2.5361 Ops/s $\color{#d91a1a}-0.30\%$
test_serialize_weights 0.1222s 0.1151s 8.6888 Ops/s 8.7370 Ops/s $\color{#d91a1a}-0.55\%$
test_serialize_weights_returnearly 0.1720s 0.1561s 6.4050 Ops/s 6.4282 Ops/s $\color{#d91a1a}-0.36\%$
test_serialize_weights_pickle 0.4778s 0.3972s 2.5175 Ops/s 2.4501 Ops/s $\color{#35bf28}+2.75\%$
test_serialize_weights_filesystem 0.2545s 0.1627s 6.1447 Ops/s 6.9921 Ops/s $\textbf{\color{#d91a1a}-12.12\%}$
test_serialize_model_filesystem 0.1605s 0.1533s 6.5237 Ops/s 6.3609 Ops/s $\color{#35bf28}+2.56\%$
test_reshape_pytree 59.5510μs 26.2557μs 38.0870 KOps/s 37.2608 KOps/s $\color{#35bf28}+2.22\%$
test_reshape_td 88.7360μs 33.8405μs 29.5504 KOps/s 29.5286 KOps/s $\color{#35bf28}+0.07\%$
test_view_pytree 67.1460μs 26.1507μs 38.2399 KOps/s 37.6274 KOps/s $\color{#35bf28}+1.63\%$
test_view_td 94.4570μs 40.1744μs 24.8915 KOps/s 24.2463 KOps/s $\color{#35bf28}+2.66\%$
test_unbind_pytree 86.4020μs 29.7470μs 33.6168 KOps/s 33.2450 KOps/s $\color{#35bf28}+1.12\%$
test_unbind_td 0.3218ms 40.6885μs 24.5770 KOps/s 24.1218 KOps/s $\color{#35bf28}+1.89\%$
test_split_pytree 82.9850μs 29.1811μs 34.2688 KOps/s 33.6880 KOps/s $\color{#35bf28}+1.72\%$
test_split_td 0.5480ms 46.1117μs 21.6865 KOps/s 21.6494 KOps/s $\color{#35bf28}+0.17\%$
test_add_pytree 86.9530μs 35.3886μs 28.2577 KOps/s 27.3457 KOps/s $\color{#35bf28}+3.33\%$
test_add_td 0.1599ms 56.7064μs 17.6347 KOps/s 16.3801 KOps/s $\textbf{\color{#35bf28}+7.66\%}$
test_compile_add_one_nested[tensordict-compile] 0.1446ms 67.5865μs 14.7959 KOps/s 14.6266 KOps/s $\color{#35bf28}+1.16\%$
test_compile_add_one_nested[tensordict-eager] 0.3610ms 0.1710ms 5.8493 KOps/s 5.8150 KOps/s $\color{#35bf28}+0.59\%$
test_compile_add_one_nested[pytree-compile] 0.1046ms 46.2189μs 21.6362 KOps/s 21.4261 KOps/s $\color{#35bf28}+0.98\%$
test_compile_add_one_nested[pytree-eager] 0.2306ms 0.1165ms 8.5854 KOps/s 8.2174 KOps/s $\color{#35bf28}+4.48\%$
test_compile_copy_nested[tensordict-compile] 82.9850μs 28.2989μs 35.3370 KOps/s 34.8702 KOps/s $\color{#35bf28}+1.34\%$
test_compile_copy_nested[tensordict-eager] 0.1598ms 58.2229μs 17.1754 KOps/s 16.6565 KOps/s $\color{#35bf28}+3.12\%$
test_compile_copy_nested[pytree-compile] 0.1665ms 78.7166μs 12.7038 KOps/s 12.4398 KOps/s $\color{#35bf28}+2.12\%$
test_compile_copy_nested[pytree-eager] 0.1430ms 65.8191μs 15.1931 KOps/s 14.9490 KOps/s $\color{#35bf28}+1.63\%$
test_compile_add_one_flat[tensordict-compile] 0.1857ms 0.1088ms 9.1915 KOps/s 9.2140 KOps/s $\color{#d91a1a}-0.24\%$
test_compile_add_one_flat[tensordict-eager] 0.3926ms 0.2138ms 4.6771 KOps/s 4.6178 KOps/s $\color{#35bf28}+1.28\%$
test_compile_add_one_flat[tensorclass-compile] 0.1160ms 47.4397μs 21.0794 KOps/s 20.5920 KOps/s $\color{#35bf28}+2.37\%$
test_compile_add_one_flat[tensorclass-eager] 0.1755ms 67.2347μs 14.8733 KOps/s 14.7721 KOps/s $\color{#35bf28}+0.68\%$
test_compile_add_one_flat[pytree-compile] 0.1886ms 0.1020ms 9.8003 KOps/s 9.7625 KOps/s $\color{#35bf28}+0.39\%$
test_compile_add_one_flat[pytree-eager] 0.4119ms 0.2018ms 4.9555 KOps/s 4.7596 KOps/s $\color{#35bf28}+4.12\%$
test_compile_add_self_flat[tensordict-eager] 1.4928ms 0.2332ms 4.2879 KOps/s 4.2704 KOps/s $\color{#35bf28}+0.41\%$
test_compile_add_self_flat[tensordict-compile] 0.2684ms 0.1110ms 9.0099 KOps/s 9.2464 KOps/s $\color{#d91a1a}-2.56\%$
test_compile_add_self_flat[tensorclass-eager] 0.2749ms 66.3591μs 15.0695 KOps/s 15.9055 KOps/s $\textbf{\color{#d91a1a}-5.26\%}$
test_compile_add_self_flat[tensorclass-compile] 0.1388ms 49.9981μs 20.0007 KOps/s 20.5442 KOps/s $\color{#d91a1a}-2.65\%$
test_compile_add_self_flat[pytree-eager] 0.2943ms 0.1562ms 6.4011 KOps/s 6.1846 KOps/s $\color{#35bf28}+3.50\%$
test_compile_add_self_flat[pytree-compile] 0.2357ms 0.1015ms 9.8512 KOps/s 9.8664 KOps/s $\color{#d91a1a}-0.15\%$
test_compile_copy_flat[tensordict-compile] 62.5880μs 22.0259μs 45.4011 KOps/s 46.0800 KOps/s $\color{#d91a1a}-1.47\%$
test_compile_copy_flat[tensordict-eager] 0.1605ms 65.9599μs 15.1607 KOps/s 14.9562 KOps/s $\color{#35bf28}+1.37\%$
test_compile_copy_flat[pytree-compile] 0.1756ms 81.7727μs 12.2290 KOps/s 12.4319 KOps/s $\color{#d91a1a}-1.63\%$
test_compile_copy_flat[pytree-eager] 0.1523ms 67.5639μs 14.8008 KOps/s 15.1225 KOps/s $\color{#d91a1a}-2.13\%$
test_compile_assign_and_add[tensordict-compile] 0.4181ms 0.2180ms 4.5876 KOps/s 4.5514 KOps/s $\color{#35bf28}+0.80\%$
test_compile_assign_and_add[tensordict-eager] 2.6940ms 1.3888ms 720.0614 Ops/s 699.3816 Ops/s $\color{#35bf28}+2.96\%$
test_compile_assign_and_add[pytree-compile] 0.3395ms 0.2119ms 4.7191 KOps/s 4.7335 KOps/s $\color{#d91a1a}-0.30\%$
test_compile_assign_and_add[pytree-eager] 1.5652ms 0.8117ms 1.2320 KOps/s 1.1661 KOps/s $\textbf{\color{#35bf28}+5.64\%}$
test_compile_assign_and_add_stack[compile] 0.8552ms 0.4650ms 2.1508 KOps/s 2.1325 KOps/s $\color{#35bf28}+0.85\%$
test_compile_assign_and_add_stack[eager] 2.8719ms 2.6416ms 378.5515 Ops/s 339.9561 Ops/s $\textbf{\color{#35bf28}+11.35\%}$
test_compile_indexing[tensor-tensordict-compile] 0.1108ms 40.0221μs 24.9862 KOps/s 24.9540 KOps/s $\color{#35bf28}+0.13\%$
test_compile_indexing[tensor-tensordict-eager] 0.5872ms 33.1187μs 30.1944 KOps/s 29.6036 KOps/s $\color{#35bf28}+2.00\%$
test_compile_indexing[tensor-tensorclass-compile] 97.7320μs 31.6397μs 31.6059 KOps/s 31.0812 KOps/s $\color{#35bf28}+1.69\%$
test_compile_indexing[tensor-tensorclass-eager] 66.2840μs 22.7357μs 43.9836 KOps/s 43.0067 KOps/s $\color{#35bf28}+2.27\%$
test_compile_indexing[tensor-pytree-compile] 96.0200μs 32.2328μs 31.0243 KOps/s 30.4845 KOps/s $\color{#35bf28}+1.77\%$
test_compile_indexing[tensor-pytree-eager] 83.3960μs 22.7991μs 43.8613 KOps/s 42.8164 KOps/s $\color{#35bf28}+2.44\%$
test_compile_indexing[slice-tensordict-compile] 0.1323ms 53.8801μs 18.5597 KOps/s 18.5932 KOps/s $\color{#d91a1a}-0.18\%$
test_compile_indexing[slice-tensordict-eager] 0.2298s 26.7440μs 37.3916 KOps/s 47.4727 KOps/s $\textbf{\color{#d91a1a}-21.24\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1224ms 46.4588μs 21.5244 KOps/s 20.9125 KOps/s $\color{#35bf28}+2.93\%$
test_compile_indexing[slice-tensorclass-eager] 68.7690μs 18.7771μs 53.2564 KOps/s 52.5804 KOps/s $\color{#35bf28}+1.29\%$
test_compile_indexing[slice-pytree-compile] 0.1133ms 46.5661μs 21.4749 KOps/s 20.5621 KOps/s $\color{#35bf28}+4.44\%$
test_compile_indexing[slice-pytree-eager] 66.4240μs 18.7320μs 53.3847 KOps/s 53.1525 KOps/s $\color{#35bf28}+0.44\%$
test_compile_indexing[int-tensordict-compile] 0.1363ms 54.5068μs 18.3463 KOps/s 17.7758 KOps/s $\color{#35bf28}+3.21\%$
test_compile_indexing[int-tensordict-eager] 0.9478ms 19.9964μs 50.0089 KOps/s 48.0265 KOps/s $\color{#35bf28}+4.13\%$
test_compile_indexing[int-tensorclass-compile] 0.1330ms 47.0744μs 21.2430 KOps/s 20.7340 KOps/s $\color{#35bf28}+2.45\%$
test_compile_indexing[int-tensorclass-eager] 52.7490μs 18.6805μs 53.5318 KOps/s 52.7898 KOps/s $\color{#35bf28}+1.41\%$
test_compile_indexing[int-pytree-compile] 0.1387ms 47.1788μs 21.1960 KOps/s 20.5861 KOps/s $\color{#35bf28}+2.96\%$
test_compile_indexing[int-pytree-eager] 64.1400μs 18.6790μs 53.5360 KOps/s 53.3975 KOps/s $\color{#35bf28}+0.26\%$
test_mod_add[eager] 90.7290μs 37.1383μs 26.9264 KOps/s 26.5488 KOps/s $\color{#35bf28}+1.42\%$
test_mod_add[compile] 0.1423ms 69.1480μs 14.4617 KOps/s 15.0000 KOps/s $\color{#d91a1a}-3.59\%$
test_mod_add[compile-overhead] 0.1462ms 67.1430μs 14.8936 KOps/s 14.7509 KOps/s $\color{#35bf28}+0.97\%$
test_mod_wrap[eager] 0.4068ms 0.2228ms 4.4880 KOps/s 4.3028 KOps/s $\color{#35bf28}+4.30\%$
test_mod_wrap[compile] 2.4201ms 0.2317ms 4.3167 KOps/s 4.2739 KOps/s $\color{#35bf28}+1.00\%$
test_mod_wrap[compile-overhead] 0.4111ms 0.2273ms 4.4002 KOps/s 4.3288 KOps/s $\color{#35bf28}+1.65\%$
test_mod_wrap_and_backward[eager] 18.7932ms 14.0101ms 71.3770 Ops/s 87.4374 Ops/s $\textbf{\color{#d91a1a}-18.37\%}$
test_mod_wrap_and_backward[compile] 15.5789ms 11.8319ms 84.5175 Ops/s 90.3069 Ops/s $\textbf{\color{#d91a1a}-6.41\%}$
test_mod_wrap_and_backward[compile-overhead] 13.8273ms 11.4476ms 87.3545 Ops/s 89.6929 Ops/s $\color{#d91a1a}-2.61\%$
test_seq_add[eager] 0.2231ms 0.1196ms 8.3607 KOps/s 8.1578 KOps/s $\color{#35bf28}+2.49\%$
test_seq_add[compile] 0.1795ms 82.3088μs 12.1494 KOps/s 12.4206 KOps/s $\color{#d91a1a}-2.18\%$
test_seq_add[compile-overhead] 0.1601ms 78.5065μs 12.7378 KOps/s 12.6530 KOps/s $\color{#35bf28}+0.67\%$
test_seq_wrap[eager] 0.6666ms 0.4437ms 2.2539 KOps/s 2.1118 KOps/s $\textbf{\color{#35bf28}+6.73\%}$
test_seq_wrap[compile] 0.3696ms 0.2469ms 4.0507 KOps/s 3.9813 KOps/s $\color{#35bf28}+1.74\%$
test_seq_wrap[compile-overhead] 0.4819ms 0.2481ms 4.0298 KOps/s 3.9918 KOps/s $\color{#35bf28}+0.95\%$
test_func_call_runtime[False-eager] 0.8402ms 0.5416ms 1.8463 KOps/s 1.7961 KOps/s $\color{#35bf28}+2.80\%$
test_func_call_runtime[False-compile] 0.6097ms 0.4465ms 2.2395 KOps/s 2.2223 KOps/s $\color{#35bf28}+0.78\%$
test_func_call_runtime[False-compile-overhead] 0.6456ms 0.4462ms 2.2413 KOps/s 2.2356 KOps/s $\color{#35bf28}+0.26\%$
test_func_call_runtime[True-eager] 1.5396ms 0.7691ms 1.3002 KOps/s 1.3095 KOps/s $\color{#d91a1a}-0.71\%$
test_func_call_runtime[True-compile] 0.8405ms 0.4685ms 2.1346 KOps/s 2.1506 KOps/s $\color{#d91a1a}-0.74\%$
test_func_call_runtime[True-compile-overhead] 0.9853ms 0.4702ms 2.1269 KOps/s 2.1062 KOps/s $\color{#35bf28}+0.98\%$
test_func_call_cm_runtime[False-eager] 0.7741ms 0.5413ms 1.8474 KOps/s 1.8380 KOps/s $\color{#35bf28}+0.51\%$
test_func_call_cm_runtime[False-compile] 0.6457ms 0.4453ms 2.2456 KOps/s 2.2374 KOps/s $\color{#35bf28}+0.37\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5442ms 0.4408ms 2.2684 KOps/s 2.2113 KOps/s $\color{#35bf28}+2.58\%$
test_func_call_cm_runtime[True-eager] 1.0970ms 0.8994ms 1.1118 KOps/s 1.0931 KOps/s $\color{#35bf28}+1.71\%$
test_func_call_cm_runtime[True-compile] 1.0810ms 0.7984ms 1.2524 KOps/s 1.2324 KOps/s $\color{#35bf28}+1.63\%$
test_func_call_cm_runtime[True-compile-overhead] 1.1112ms 0.8069ms 1.2393 KOps/s 1.2220 KOps/s $\color{#35bf28}+1.41\%$
test_vmap_func_call_cm_runtime[eager] 2.6450ms 1.9016ms 525.8671 Ops/s 509.7878 Ops/s $\color{#35bf28}+3.15\%$
test_vmap_func_call_cm_runtime[compile] 0.6706ms 0.5422ms 1.8442 KOps/s 1.8366 KOps/s $\color{#35bf28}+0.41\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.9111ms 0.5398ms 1.8526 KOps/s 1.8329 KOps/s $\color{#35bf28}+1.08\%$
test_distributed 0.2827ms 0.1260ms 7.9379 KOps/s 7.7273 KOps/s $\color{#35bf28}+2.72\%$
test_tdmodule 46.8270μs 27.6853μs 36.1203 KOps/s 34.1607 KOps/s $\textbf{\color{#35bf28}+5.74\%}$
test_tdmodule_dispatch 96.6710μs 50.3792μs 19.8495 KOps/s 18.6375 KOps/s $\textbf{\color{#35bf28}+6.50\%}$
test_tdseq 65.0820μs 29.7773μs 33.5826 KOps/s 31.9849 KOps/s $\color{#35bf28}+5.00\%$
test_tdseq_dispatch 88.2250μs 55.5419μs 18.0044 KOps/s 17.0723 KOps/s $\textbf{\color{#35bf28}+5.46\%}$
test_instantiation_functorch 1.7750ms 1.5300ms 653.6147 Ops/s 636.9904 Ops/s $\color{#35bf28}+2.61\%$
test_exec_functorch 0.3189ms 0.1770ms 5.6482 KOps/s 5.2933 KOps/s $\textbf{\color{#35bf28}+6.71\%}$
test_exec_functional_call 0.3456ms 0.1718ms 5.8200 KOps/s 5.7430 KOps/s $\color{#35bf28}+1.34\%$
test_exec_td_decorator 0.6370ms 0.2394ms 4.1768 KOps/s 4.1884 KOps/s $\color{#d91a1a}-0.28\%$
test_vmap_mlp_speed_decorator[True-True] 0.8593ms 0.6571ms 1.5217 KOps/s 1.4876 KOps/s $\color{#35bf28}+2.29\%$
test_vmap_mlp_speed_decorator[True-False] 0.9323ms 0.6557ms 1.5252 KOps/s 1.4933 KOps/s $\color{#35bf28}+2.14\%$
test_vmap_mlp_speed_decorator[False-True] 0.7630ms 0.5318ms 1.8806 KOps/s 1.8539 KOps/s $\color{#35bf28}+1.44\%$
test_vmap_mlp_speed_decorator[False-False] 0.9007ms 0.5352ms 1.8684 KOps/s 1.8415 KOps/s $\color{#35bf28}+1.46\%$
test_to_module_speed[True] 1.7955ms 1.3249ms 754.7772 Ops/s 740.9540 Ops/s $\color{#35bf28}+1.87\%$
test_to_module_speed[False] 1.8836ms 1.3024ms 767.8322 Ops/s 752.6489 Ops/s $\color{#35bf28}+2.02\%$
test_tc_init 98.7750μs 46.4103μs 21.5469 KOps/s 20.3866 KOps/s $\textbf{\color{#35bf28}+5.69\%}$
test_tc_init_nested 0.1842ms 92.7862μs 10.7775 KOps/s 10.1745 KOps/s $\textbf{\color{#35bf28}+5.93\%}$
test_tc_first_layer_tensor 45.4340μs 1.6833μs 594.0802 KOps/s 554.5528 KOps/s $\textbf{\color{#35bf28}+7.13\%}$
test_tc_first_layer_nontensor 24.2760μs 4.9402μs 202.4214 KOps/s 195.6476 KOps/s $\color{#35bf28}+3.46\%$
test_tc_second_layer_tensor 39.6140μs 3.1712μs 315.3411 KOps/s 294.1941 KOps/s $\textbf{\color{#35bf28}+7.19\%}$
test_tc_second_layer_nontensor 36.1480μs 6.5752μs 152.0860 KOps/s 149.4568 KOps/s $\color{#35bf28}+1.76\%$
test_unbind 0.2601s 14.1132ms 70.8557 Ops/s 60.2425 Ops/s $\textbf{\color{#35bf28}+17.62\%}$
test_full_like 4.0964ms 3.4791ms 287.4320 Ops/s 250.3656 Ops/s $\textbf{\color{#35bf28}+14.80\%}$
test_zeros_like 11.2965ms 4.9001ms 204.0778 Ops/s 290.7855 Ops/s $\textbf{\color{#d91a1a}-29.82\%}$
test_ones_like 5.7759ms 3.6461ms 274.2694 Ops/s 181.9080 Ops/s $\textbf{\color{#35bf28}+50.77\%}$
test_clone 9.8340ms 7.4946ms 133.4294 Ops/s 167.2072 Ops/s $\textbf{\color{#d91a1a}-20.20\%}$
test_squeeze 66.2640μs 12.3745μs 80.8112 KOps/s 77.9249 KOps/s $\color{#35bf28}+3.70\%$
test_unsqueeze 0.2992ms 95.7211μs 10.4470 KOps/s 10.5336 KOps/s $\color{#d91a1a}-0.82\%$
test_split 0.3282ms 0.1971ms 5.0745 KOps/s 5.0008 KOps/s $\color{#35bf28}+1.47\%$
test_permute 0.2929ms 0.2068ms 4.8355 KOps/s 4.9320 KOps/s $\color{#d91a1a}-1.96\%$
test_stack 30.8337ms 25.9944ms 38.4698 Ops/s 35.3187 Ops/s $\textbf{\color{#35bf28}+8.92\%}$
test_cat 31.2377ms 26.0626ms 38.3692 Ops/s 36.4974 Ops/s $\textbf{\color{#35bf28}+5.13\%}$
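
The Change column in the table above appears to be the relative difference between the Ops column (this PR) and the Ops on Repo HEAD column. The following is a minimal sketch of that arithmetic, checked against two rows copied from the table; the helper name `relative_change` is hypothetical and is not part of the benchmark tooling, and both values are assumed to be in the same unit.

```python
def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percent change of the PR throughput relative to repo HEAD.

    Hypothetical helper for illustration; both arguments must use
    the same unit (for example KOps/s).
    """
    return (ops_pr - ops_head) / ops_head * 100.0


# (Ops, Ops on Repo HEAD) pairs copied from the CPU table, in KOps/s.
rows = {
    "test_plain_set_nested": (48.3819, 45.2015),
    "test_keys": (271.6962, 289.5612),
}

for name, (ops_pr, ops_head) in rows.items():
    print(f"{name}: {relative_change(ops_pr, ops_head):+.2f}%")
```

For these two rows the result matches the reported +7.04% and -6.17%. Judging only from the rows shown, entries seem to be bolded once the change exceeds about 5% in either direction, but the exact threshold is not stated in the report.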

Vincent Moens added 2 commits March 24, 2025 16:10
@vmoens force-pushed the fix-tensorclass-missing branch from b0d5c82 to 9ac1806 on March 24, 2025 16:12
@vmoens merged commit cc0f464 into main Mar 24, 2025
20 of 24 checks passed

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}7$. Worsened: $\large\color{#d91a1a}20$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 44.7210μs 13.7171μs 72.9015 KOps/s 73.3867 KOps/s $\color{#d91a1a}-0.66\%$
test_plain_set_stack_nested 41.7100μs 13.9607μs 71.6296 KOps/s 72.6313 KOps/s $\color{#d91a1a}-1.38\%$
test_plain_set_nested_inplace 41.4910μs 14.6956μs 68.0475 KOps/s 67.9308 KOps/s $\color{#35bf28}+0.17\%$
test_plain_set_stack_nested_inplace 40.9600μs 14.9053μs 67.0902 KOps/s 67.6126 KOps/s $\color{#d91a1a}-0.77\%$
test_items 32.3300μs 2.8446μs 351.5460 KOps/s 345.7650 KOps/s $\color{#35bf28}+1.67\%$
test_items_nested 0.4075ms 0.3632ms 2.7537 KOps/s 2.7258 KOps/s $\color{#35bf28}+1.02\%$
test_items_nested_locked 0.4470ms 0.3650ms 2.7400 KOps/s 2.7185 KOps/s $\color{#35bf28}+0.79\%$
test_items_nested_leaf 87.2510μs 60.3186μs 16.5786 KOps/s 16.5390 KOps/s $\color{#35bf28}+0.24\%$
test_items_stack_nested 0.5399ms 0.3643ms 2.7448 KOps/s 2.7597 KOps/s $\color{#d91a1a}-0.54\%$
test_items_stack_nested_leaf 97.0800μs 60.2915μs 16.5861 KOps/s 16.6000 KOps/s $\color{#d91a1a}-0.08\%$
test_items_stack_nested_locked 0.4110ms 0.3670ms 2.7251 KOps/s 2.7549 KOps/s $\color{#d91a1a}-1.08\%$
test_keys 29.2710μs 3.4273μs 291.7745 KOps/s 290.2850 KOps/s $\color{#35bf28}+0.51\%$
test_keys_nested 0.1226ms 88.4499μs 11.3058 KOps/s 11.4676 KOps/s $\color{#d91a1a}-1.41\%$
test_keys_nested_locked 0.7746ms 94.4610μs 10.5864 KOps/s 10.6701 KOps/s $\color{#d91a1a}-0.78\%$
test_keys_nested_leaf 0.1092ms 79.4933μs 12.5797 KOps/s 12.6118 KOps/s $\color{#d91a1a}-0.25\%$
test_keys_stack_nested 0.1121ms 88.9681μs 11.2400 KOps/s 11.4010 KOps/s $\color{#d91a1a}-1.41\%$
test_keys_stack_nested_leaf 0.1205ms 78.8354μs 12.6847 KOps/s 12.7230 KOps/s $\color{#d91a1a}-0.30\%$
test_keys_stack_nested_locked 0.1551ms 94.2524μs 10.6098 KOps/s 10.5999 KOps/s $\color{#35bf28}+0.09\%$
test_values 4.9902μs 0.8520μs 1.1737 MOps/s 1.1773 MOps/s $\color{#d91a1a}-0.31\%$
test_values_nested 59.3200μs 38.6615μs 25.8655 KOps/s 26.1604 KOps/s $\color{#d91a1a}-1.13\%$
test_values_nested_locked 72.1900μs 40.6587μs 24.5950 KOps/s 24.2439 KOps/s $\color{#35bf28}+1.45\%$
test_values_nested_leaf 66.7200μs 43.8754μs 22.7918 KOps/s 23.2910 KOps/s $\color{#d91a1a}-2.14\%$
test_values_stack_nested 73.2910μs 38.4886μs 25.9817 KOps/s 26.1026 KOps/s $\color{#d91a1a}-0.46\%$
test_values_stack_nested_leaf 94.6810μs 43.9047μs 22.7766 KOps/s 23.0700 KOps/s $\color{#d91a1a}-1.27\%$
test_values_stack_nested_locked 76.6410μs 40.8690μs 24.4685 KOps/s 24.1461 KOps/s $\color{#35bf28}+1.34\%$
test_membership 1.7040μs 0.4982μs 2.0073 MOps/s 1.9691 MOps/s $\color{#35bf28}+1.94\%$
test_membership_nested 18.9300μs 2.0203μs 494.9713 KOps/s 492.0500 KOps/s $\color{#35bf28}+0.59\%$
test_membership_nested_leaf 15.2750μs 1.9951μs 501.2351 KOps/s 498.4896 KOps/s $\color{#35bf28}+0.55\%$
test_membership_stacked_nested 39.2800μs 2.1207μs 471.5323 KOps/s 486.7427 KOps/s $\color{#d91a1a}-3.12\%$
test_membership_stacked_nested_leaf 23.2110μs 2.0986μs 476.5116 KOps/s 485.0429 KOps/s $\color{#d91a1a}-1.76\%$
test_membership_nested_last 36.9700μs 3.0948μs 323.1250 KOps/s 329.9771 KOps/s $\color{#d91a1a}-2.08\%$
test_membership_nested_leaf_last 25.1210μs 3.0841μs 324.2481 KOps/s 331.2656 KOps/s $\color{#d91a1a}-2.12\%$
test_membership_stacked_nested_last 28.2600μs 3.0946μs 323.1475 KOps/s 330.6410 KOps/s $\color{#d91a1a}-2.27\%$
test_membership_stacked_nested_leaf_last 30.4500μs 3.0942μs 323.1804 KOps/s 330.2620 KOps/s $\color{#d91a1a}-2.14\%$
test_nested_getleaf 36.9700μs 13.4640μs 74.2721 KOps/s 74.1773 KOps/s $\color{#35bf28}+0.13\%$
test_nested_get 41.5200μs 12.8832μs 77.6206 KOps/s 78.0401 KOps/s $\color{#d91a1a}-0.54\%$
test_stacked_getleaf 44.3210μs 13.5509μs 73.7961 KOps/s 74.5629 KOps/s $\color{#d91a1a}-1.03\%$
test_stacked_get 55.4110μs 12.6903μs 78.8001 KOps/s 78.3036 KOps/s $\color{#35bf28}+0.63\%$
test_nested_getitemleaf 41.0700μs 13.7579μs 72.6853 KOps/s 72.3654 KOps/s $\color{#35bf28}+0.44\%$
test_nested_getitem 36.1100μs 12.9513μs 77.2122 KOps/s 76.4031 KOps/s $\color{#35bf28}+1.06\%$
test_stacked_getitemleaf 40.7900μs 13.8386μs 72.2615 KOps/s 73.1707 KOps/s $\color{#d91a1a}-1.24\%$
test_stacked_getitem 33.9100μs 12.9789μs 77.0484 KOps/s 77.0897 KOps/s $\color{#d91a1a}-0.05\%$
test_lock_nested 7.9925ms 0.3545ms 2.8206 KOps/s 2.9929 KOps/s $\textbf{\color{#d91a1a}-5.76\%}$
test_lock_stack_nested 0.4123ms 0.3553ms 2.8145 KOps/s 2.9061 KOps/s $\color{#d91a1a}-3.15\%$
test_unlock_nested 0.4913ms 0.2904ms 3.4436 KOps/s 3.5895 KOps/s $\color{#d91a1a}-4.06\%$
test_unlock_stack_nested 0.3343ms 0.2936ms 3.4063 KOps/s 3.5503 KOps/s $\color{#d91a1a}-4.06\%$
test_flatten_speed 0.1121ms 77.6988μs 12.8702 KOps/s 12.7945 KOps/s $\color{#35bf28}+0.59\%$
test_unflatten_speed 0.4691ms 0.4029ms 2.4817 KOps/s 2.4956 KOps/s $\color{#d91a1a}-0.56\%$
test_common_ops 0.7908ms 0.6642ms 1.5055 KOps/s 1.5387 KOps/s $\color{#d91a1a}-2.16\%$
test_creation 0.1161ms 1.7358μs 576.0967 KOps/s 566.7282 KOps/s $\color{#35bf28}+1.65\%$
test_creation_empty 33.9400μs 9.9282μs 100.7234 KOps/s 102.7660 KOps/s $\color{#d91a1a}-1.99\%$
test_creation_nested_1 34.7700μs 11.5612μs 86.4960 KOps/s 87.7593 KOps/s $\color{#d91a1a}-1.44\%$
test_creation_nested_2 38.6700μs 14.1916μs 70.4640 KOps/s 70.8564 KOps/s $\color{#d91a1a}-0.55\%$
test_clone 52.3810μs 11.0635μs 90.3870 KOps/s 94.3221 KOps/s $\color{#d91a1a}-4.17\%$
test_getitem[int] 1.2382ms 11.0509μs 90.4907 KOps/s 95.9278 KOps/s $\textbf{\color{#d91a1a}-5.67\%}$
test_getitem[slice_int] 0.1129ms 21.6268μs 46.2390 KOps/s 48.3728 KOps/s $\color{#d91a1a}-4.41\%$
test_getitem[range] 0.1338ms 40.2159μs 24.8658 KOps/s 26.9638 KOps/s $\textbf{\color{#d91a1a}-7.78\%}$
test_getitem[tuple] 0.1080ms 18.6458μs 53.6314 KOps/s 55.4509 KOps/s $\color{#d91a1a}-3.28\%$
test_getitem[list] 0.1520ms 34.5280μs 28.9620 KOps/s 29.4208 KOps/s $\color{#d91a1a}-1.56\%$
test_setitem_dim[int] 40.0300μs 20.9840μs 47.6552 KOps/s 50.6816 KOps/s $\textbf{\color{#d91a1a}-5.97\%}$
test_setitem_dim[slice_int] 60.9010μs 40.0121μs 24.9924 KOps/s 25.6874 KOps/s $\color{#d91a1a}-2.71\%$
test_setitem_dim[range] 79.6300μs 55.4015μs 18.0500 KOps/s 19.1790 KOps/s $\textbf{\color{#d91a1a}-5.89\%}$
test_setitem_dim[tuple] 58.2610μs 33.9737μs 29.4346 KOps/s 30.3454 KOps/s $\color{#d91a1a}-3.00\%$
test_setitem 64.3600μs 16.5405μs 60.4578 KOps/s 61.9124 KOps/s $\color{#d91a1a}-2.35\%$
test_set 56.6210μs 16.0075μs 62.4707 KOps/s 64.8116 KOps/s $\color{#d91a1a}-3.61\%$
test_set_shared 0.5894ms 0.1594ms 6.2751 KOps/s 6.3264 KOps/s $\color{#d91a1a}-0.81\%$
test_update 0.2498ms 21.9030μs 45.6558 KOps/s 47.2965 KOps/s $\color{#d91a1a}-3.47\%$
test_update_nested 78.2110μs 31.2603μs 31.9894 KOps/s 33.5482 KOps/s $\color{#d91a1a}-4.65\%$
test_update__nested 0.5360ms 26.3268μs 37.9841 KOps/s 39.4823 KOps/s $\color{#d91a1a}-3.79\%$
test_set_nested 63.9900μs 17.6107μs 56.7836 KOps/s 59.6416 KOps/s $\color{#d91a1a}-4.79\%$
test_set_nested_new 68.5610μs 19.6026μs 51.0137 KOps/s 52.6213 KOps/s $\color{#d91a1a}-3.06\%$
test_select 83.0510μs 31.3971μs 31.8501 KOps/s 33.0928 KOps/s $\color{#d91a1a}-3.76\%$
test_select_nested 68.4000μs 43.6513μs 22.9088 KOps/s 22.7611 KOps/s $\color{#35bf28}+0.65\%$
test_exclude_nested 0.1007ms 62.9658μs 15.8816 KOps/s 15.7101 KOps/s $\color{#35bf28}+1.09\%$
test_empty[True] 0.3544ms 0.2976ms 3.3597 KOps/s 3.4043 KOps/s $\color{#d91a1a}-1.31\%$
test_empty[False] 3.9620μs 0.8301μs 1.2046 MOps/s 1.2045 MOps/s $+0.01\%$
test_to 85.9510μs 57.5649μs 17.3717 KOps/s 17.2727 KOps/s $\color{#35bf28}+0.57\%$
test_to_nonblocking 89.6810μs 49.9126μs 20.0350 KOps/s 21.0000 KOps/s $\color{#d91a1a}-4.60\%$
test_unbind_speed 0.2836ms 0.2512ms 3.9817 KOps/s 4.2227 KOps/s $\textbf{\color{#d91a1a}-5.71\%}$
test_unbind_speed_stack0 0.2995ms 0.2495ms 4.0074 KOps/s 4.1825 KOps/s $\color{#d91a1a}-4.19\%$
test_unbind_speed_stack1 92.4573ms 0.8330ms 1.2005 KOps/s 1.2246 KOps/s $\color{#d91a1a}-1.97\%$
test_split 95.0917ms 1.6498ms 606.1497 Ops/s 629.6113 Ops/s $\color{#d91a1a}-3.73\%$
test_chunk 94.1307ms 1.6461ms 607.5060 Ops/s 619.5582 Ops/s $\color{#d91a1a}-1.95\%$
test_consolidate[False-None] 3.1127ms 2.7770ms 360.0972 Ops/s 360.3579 Ops/s $\color{#d91a1a}-0.07\%$
test_consolidate[default-None] 2.2158ms 1.7831ms 560.8082 Ops/s 586.4788 Ops/s $\color{#d91a1a}-4.38\%$
test_consolidate[reduce-overhead-None] 1.8840ms 1.8002ms 555.5013 Ops/s 570.7101 Ops/s $\color{#d91a1a}-2.66\%$
test_consolidate_njt[False-None] 7.0865ms 6.7453ms 148.2511 Ops/s 151.6740 Ops/s $\color{#d91a1a}-2.26\%$
test_to[False-False-None] 1.8613ms 1.7818ms 561.2266 Ops/s 573.6022 Ops/s $\color{#d91a1a}-2.16\%$
test_to[True-False-None] 1.6677ms 1.3967ms 715.9933 Ops/s 742.1242 Ops/s $\color{#d91a1a}-3.52\%$
test_to[within-False-None] 4.5834ms 4.3496ms 229.9039 Ops/s 236.5746 Ops/s $\color{#d91a1a}-2.82\%$
test_to[True-default-None] 5.7829ms 5.4750ms 182.6485 Ops/s 185.6718 Ops/s $\color{#d91a1a}-1.63\%$
test_to_njt[False-False-None] 7.3358ms 7.1166ms 140.5159 Ops/s 143.5810 Ops/s $\color{#d91a1a}-2.13\%$
test_to_njt[True-False-None] 5.8735ms 5.6770ms 176.1500 Ops/s 179.0850 Ops/s $\color{#d91a1a}-1.64\%$
test_to_njt[within-False-None] 12.7438ms 12.4195ms 80.5185 Ops/s 83.2038 Ops/s $\color{#d91a1a}-3.23\%$
test_creation[device0] 0.5453ms 81.3466μs 12.2931 KOps/s 12.4241 KOps/s $\color{#d91a1a}-1.05\%$
test_creation_from_tensor 0.4452ms 83.8238μs 11.9298 KOps/s 11.9412 KOps/s $\color{#d91a1a}-0.10\%$
test_add_one[memmap_tensor0] 0.2526ms 7.2153μs 138.5944 KOps/s 148.9788 KOps/s $\textbf{\color{#d91a1a}-6.97\%}$
test_contiguous[memmap_tensor0] 1.9905μs 0.4306μs 2.3225 MOps/s 2.2787 MOps/s $\color{#35bf28}+1.92\%$
test_stack[memmap_tensor0] 36.9110μs 4.9320μs 202.7581 KOps/s 234.7144 KOps/s $\textbf{\color{#d91a1a}-13.61\%}$
test_memmaptd_index 1.4861ms 0.2643ms 3.7838 KOps/s 4.1236 KOps/s $\textbf{\color{#d91a1a}-8.24\%}$
test_memmaptd_index_astensor 0.4765ms 0.3244ms 3.0824 KOps/s 3.2565 KOps/s $\textbf{\color{#d91a1a}-5.35\%}$
test_memmaptd_index_op 0.8935ms 0.6509ms 1.5363 KOps/s 1.6455 KOps/s $\textbf{\color{#d91a1a}-6.63\%}$
test_serialize_model 0.1331s 0.1317s 7.5929 Ops/s 7.6127 Ops/s $\color{#d91a1a}-0.26\%$
test_serialize_model_pickle 1.3509s 1.2136s 0.8240 Ops/s 0.8410 Ops/s $\color{#d91a1a}-2.02\%$
test_serialize_weights 0.1328s 0.1311s 7.6249 Ops/s 7.6426 Ops/s $\color{#d91a1a}-0.23\%$
test_serialize_weights_returnearly 0.5944s 64.3229ms 15.5466 Ops/s 14.2744 Ops/s $\textbf{\color{#35bf28}+8.91\%}$
test_serialize_weights_pickle 1.4067s 1.2309s 0.8124 Ops/s 0.8183 Ops/s $\color{#d91a1a}-0.71\%$
test_reshape_pytree 0.4117ms 22.1625μs 45.1212 KOps/s 44.6738 KOps/s $\color{#35bf28}+1.00\%$
test_reshape_td 60.4400μs 26.4483μs 37.8097 KOps/s 36.7972 KOps/s $\color{#35bf28}+2.75\%$
test_view_pytree 61.9100μs 22.1530μs 45.1407 KOps/s 45.7580 KOps/s $\color{#d91a1a}-1.35\%$
test_view_td 0.4240ms 34.3180μs 29.1393 KOps/s 30.5852 KOps/s $\color{#d91a1a}-4.73\%$
test_unbind_pytree 64.2500μs 28.6710μs 34.8784 KOps/s 35.5705 KOps/s $\color{#d91a1a}-1.95\%$
test_unbind_td 0.8317ms 37.7139μs 26.5154 KOps/s 26.5249 KOps/s $\color{#d91a1a}-0.04\%$
test_split_pytree 72.0400μs 30.3029μs 33.0002 KOps/s 33.4192 KOps/s $\color{#d91a1a}-1.25\%$
test_split_td 0.9753ms 39.6975μs 25.1905 KOps/s 25.1228 KOps/s $\color{#35bf28}+0.27\%$
test_add_pytree 87.6800μs 36.1586μs 27.6560 KOps/s 28.4613 KOps/s $\color{#d91a1a}-2.83\%$
test_add_td 0.4358ms 52.3804μs 19.0911 KOps/s 19.0339 KOps/s $\color{#35bf28}+0.30\%$
test_compile_add_one_nested[tensordict-compile] 0.1841ms 0.1237ms 8.0828 KOps/s 7.8719 KOps/s $\color{#35bf28}+2.68\%$
test_compile_add_one_nested[tensordict-eager] 0.5389ms 0.1336ms 7.4852 KOps/s 7.3773 KOps/s $\color{#35bf28}+1.46\%$
test_compile_add_one_nested[pytree-compile] 0.1322ms 97.3024μs 10.2772 KOps/s 10.2642 KOps/s $\color{#35bf28}+0.13\%$
test_compile_add_one_nested[pytree-eager] 1.3544ms 0.1515ms 6.6009 KOps/s 6.6769 KOps/s $\color{#d91a1a}-1.14\%$
test_compile_copy_nested[tensordict-compile] 0.4245ms 25.4290μs 39.3252 KOps/s 40.3297 KOps/s $\color{#d91a1a}-2.49\%$
test_compile_copy_nested[tensordict-eager] 0.4237ms 29.2702μs 34.1645 KOps/s 33.7001 KOps/s $\color{#35bf28}+1.38\%$
test_compile_copy_nested[pytree-compile] 0.4730ms 63.2524μs 15.8097 KOps/s 15.3771 KOps/s $\color{#35bf28}+2.81\%$
test_compile_copy_nested[pytree-eager] 0.1035ms 48.6625μs 20.5497 KOps/s 20.3829 KOps/s $\color{#35bf28}+0.82\%$
test_compile_add_one_flat[tensordict-compile] 0.1853ms 0.1433ms 6.9766 KOps/s 7.0052 KOps/s $\color{#d91a1a}-0.41\%$
test_compile_add_one_flat[tensordict-eager] 0.3164ms 0.2182ms 4.5832 KOps/s 4.6656 KOps/s $\color{#d91a1a}-1.77\%$
test_compile_add_one_flat[tensorclass-compile] 0.1339ms 98.1874μs 10.1846 KOps/s 10.1881 KOps/s $\color{#d91a1a}-0.03\%$
test_compile_add_one_flat[tensorclass-eager] 0.1382ms 56.2155μs 17.7887 KOps/s 17.8961 KOps/s $\color{#d91a1a}-0.60\%$
test_compile_add_one_flat[pytree-compile] 0.1896ms 0.1372ms 7.2903 KOps/s 7.2666 KOps/s $\color{#35bf28}+0.33\%$
test_compile_add_one_flat[pytree-eager] 0.5831ms 0.4848ms 2.0629 KOps/s 2.0633 KOps/s $\color{#d91a1a}-0.02\%$
test_compile_add_self_flat[tensordict-eager] 0.3900ms 0.2632ms 3.7999 KOps/s 3.8137 KOps/s $\color{#d91a1a}-0.36\%$
test_compile_add_self_flat[tensordict-compile] 0.2400ms 0.1442ms 6.9324 KOps/s 6.9138 KOps/s $\color{#35bf28}+0.27\%$
test_compile_add_self_flat[tensorclass-eager] 0.1830ms 67.9707μs 14.7122 KOps/s 14.6859 KOps/s $\color{#35bf28}+0.18\%$
test_compile_add_self_flat[tensorclass-compile] 0.1396ms 99.1207μs 10.0887 KOps/s 10.0182 KOps/s $\color{#35bf28}+0.70\%$
test_compile_add_self_flat[pytree-eager] 0.5337ms 0.4141ms 2.4150 KOps/s 2.4702 KOps/s $\color{#d91a1a}-2.23\%$
test_compile_add_self_flat[pytree-compile] 0.1867ms 0.1367ms 7.3129 KOps/s 7.3444 KOps/s $\color{#d91a1a}-0.43\%$
test_compile_copy_flat[tensordict-compile] 0.1309ms 20.2473μs 49.3893 KOps/s 47.4887 KOps/s $\color{#35bf28}+4.00\%$
test_compile_copy_flat[tensordict-eager] 57.0500μs 31.1482μs 32.1046 KOps/s 31.9246 KOps/s $\color{#35bf28}+0.56\%$
test_compile_copy_flat[pytree-compile] 0.1070ms 70.0532μs 14.2749 KOps/s 14.1577 KOps/s $\color{#35bf28}+0.83\%$
test_compile_copy_flat[pytree-eager] 80.5210μs 52.1312μs 19.1824 KOps/s 19.0997 KOps/s $\color{#35bf28}+0.43\%$
test_compile_assign_and_add[tensordict-compile] 1.6450ms 0.3949ms 2.5325 KOps/s 2.1937 KOps/s $\textbf{\color{#35bf28}+15.45\%}$
test_compile_assign_and_add[tensordict-eager] 2.9951ms 2.8322ms 353.0809 Ops/s 370.7267 Ops/s $\color{#d91a1a}-4.76\%$
test_compile_assign_and_add[pytree-compile] 1.6119ms 0.3861ms 2.5898 KOps/s 2.2269 KOps/s $\textbf{\color{#35bf28}+16.30\%}$
test_compile_assign_and_add[pytree-eager] 2.8109ms 2.7469ms 364.0423 Ops/s 374.0545 Ops/s $\color{#d91a1a}-2.68\%$
test_compile_indexing[tensor-tensordict-compile] 0.3187ms 0.1158ms 8.6378 KOps/s 8.4794 KOps/s $\color{#35bf28}+1.87\%$
test_compile_indexing[tensor-tensordict-eager] 0.5888ms 84.3167μs 11.8600 KOps/s 12.2026 KOps/s $\color{#d91a1a}-2.81\%$
test_compile_indexing[tensor-tensorclass-compile] 0.3854ms 0.1100ms 9.0934 KOps/s 8.8595 KOps/s $\color{#35bf28}+2.64\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1910ms 70.8185μs 14.1206 KOps/s 13.9262 KOps/s $\color{#35bf28}+1.40\%$
test_compile_indexing[tensor-pytree-compile] 0.1640ms 0.1158ms 8.6345 KOps/s 8.7329 KOps/s $\color{#d91a1a}-1.13\%$
test_compile_indexing[tensor-pytree-eager] 0.1162ms 73.7331μs 13.5624 KOps/s 13.7919 KOps/s $\color{#d91a1a}-1.66\%$
test_compile_indexing[slice-tensordict-compile] 0.1389ms 0.1005ms 9.9456 KOps/s 9.9242 KOps/s $\color{#35bf28}+0.22\%$
test_compile_indexing[slice-tensordict-eager] 0.1554ms 18.3253μs 54.5693 KOps/s 57.1685 KOps/s $\color{#d91a1a}-4.55\%$
test_compile_indexing[slice-tensorclass-compile] 0.1290ms 95.7341μs 10.4456 KOps/s 10.3101 KOps/s $\color{#35bf28}+1.31\%$
test_compile_indexing[slice-tensorclass-eager] 48.0100μs 16.3828μs 61.0398 KOps/s 63.0813 KOps/s $\color{#d91a1a}-3.24\%$
test_compile_indexing[slice-pytree-compile] 0.1448ms 98.6699μs 10.1348 KOps/s 10.2652 KOps/s $\color{#d91a1a}-1.27\%$
test_compile_indexing[slice-pytree-eager] 43.3910μs 16.1983μs 61.7348 KOps/s 64.4243 KOps/s $\color{#d91a1a}-4.17\%$
test_compile_indexing[int-tensordict-compile] 0.1566ms 0.1018ms 9.8203 KOps/s 9.8551 KOps/s $\color{#d91a1a}-0.35\%$
test_compile_indexing[int-tensordict-eager] 0.6149ms 18.0271μs 55.4719 KOps/s 56.9494 KOps/s $\color{#d91a1a}-2.59\%$
test_compile_indexing[int-tensorclass-compile] 0.2310ms 97.7557μs 10.2296 KOps/s 9.9574 KOps/s $\color{#35bf28}+2.73\%$
test_compile_indexing[int-tensorclass-eager] 47.5600μs 16.4281μs 60.8712 KOps/s 64.6617 KOps/s $\textbf{\color{#d91a1a}-5.86\%}$
test_compile_indexing[int-pytree-compile] 0.1391ms 97.3428μs 10.2730 KOps/s 10.0718 KOps/s $\color{#35bf28}+2.00\%$
test_compile_indexing[int-pytree-eager] 0.4083ms 16.3368μs 61.2113 KOps/s 64.5032 KOps/s $\textbf{\color{#d91a1a}-5.10\%}$
test_mod_add[eager] 91.3900μs 43.7671μs 22.8482 KOps/s 23.6676 KOps/s $\color{#d91a1a}-3.46\%$
test_mod_add[compile] 0.1276ms 85.7275μs 11.6649 KOps/s 12.1176 KOps/s $\color{#d91a1a}-3.74\%$
test_mod_add[compile-overhead] 0.3329ms 0.1731ms 5.7778 KOps/s 5.6085 KOps/s $\color{#35bf28}+3.02\%$
test_mod_wrap[eager] 0.3367ms 0.2672ms 3.7429 KOps/s 3.8821 KOps/s $\color{#d91a1a}-3.59\%$
test_mod_wrap[compile] 0.6393ms 0.3001ms 3.3322 KOps/s 3.4602 KOps/s $\color{#d91a1a}-3.70\%$
test_mod_wrap[compile-overhead] 7.0354ms 3.7710ms 265.1841 Ops/s 268.1589 Ops/s $\color{#d91a1a}-1.11\%$
test_mod_wrap_and_backward[eager] 1.6100ms 1.4012ms 713.6980 Ops/s 681.9278 Ops/s $\color{#35bf28}+4.66\%$
test_mod_wrap_and_backward[compile] 1.4253ms 1.3094ms 763.7151 Ops/s 720.3177 Ops/s $\textbf{\color{#35bf28}+6.02\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3962ms 0.9391ms 1.0648 KOps/s 923.7563 Ops/s $\textbf{\color{#35bf28}+15.27\%}$
test_seq_add[eager] 0.1915ms 0.1250ms 8.0012 KOps/s 7.9674 KOps/s $\color{#35bf28}+0.43\%$
test_seq_add[compile] 0.1486ms 91.2870μs 10.9545 KOps/s 10.5659 KOps/s $\color{#35bf28}+3.68\%$
test_seq_add[compile-overhead] 0.1762ms 0.1325ms 7.5470 KOps/s 7.6217 KOps/s $\color{#d91a1a}-0.98\%$
test_seq_wrap[eager] 0.5132ms 0.4465ms 2.2395 KOps/s 2.1770 KOps/s $\color{#35bf28}+2.87\%$
test_seq_wrap[compile] 0.3853ms 0.3125ms 3.2001 KOps/s 3.1595 KOps/s $\color{#35bf28}+1.29\%$
test_seq_wrap[compile-overhead] 0.2809ms 0.2378ms 4.2050 KOps/s 4.3436 KOps/s $\color{#d91a1a}-3.19\%$
test_func_call_runtime[False-eager] 0.8280ms 0.7678ms 1.3024 KOps/s 1.3486 KOps/s $\color{#d91a1a}-3.42\%$
test_func_call_runtime[False-compile] 0.8767ms 0.8021ms 1.2467 KOps/s 1.3383 KOps/s $\textbf{\color{#d91a1a}-6.84\%}$
test_func_call_runtime[False-compile-overhead] 0.4343ms 0.3696ms 2.7056 KOps/s 2.7181 KOps/s $\color{#d91a1a}-0.46\%$
test_func_call_runtime[True-eager] 1.0134ms 0.9280ms 1.0776 KOps/s 1.0955 KOps/s $\color{#d91a1a}-1.63\%$
test_func_call_runtime[True-compile] 0.8962ms 0.7926ms 1.2616 KOps/s 1.2920 KOps/s $\color{#d91a1a}-2.35\%$
test_func_call_runtime[True-compile-overhead] 0.4467ms 0.3924ms 2.5483 KOps/s 2.5097 KOps/s $\color{#35bf28}+1.53\%$
test_func_call_cm_runtime[False-eager] 0.9432ms 0.7497ms 1.3338 KOps/s 1.3447 KOps/s $\color{#d91a1a}-0.81\%$
test_func_call_cm_runtime[False-compile] 0.8358ms 0.7743ms 1.2914 KOps/s 1.3062 KOps/s $\color{#d91a1a}-1.13\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4259ms 0.3745ms 2.6704 KOps/s 2.6393 KOps/s $\color{#35bf28}+1.18\%$
test_func_call_cm_runtime[True-eager] 1.1193ms 1.0311ms 969.8755 Ops/s 972.0787 Ops/s $\color{#d91a1a}-0.23\%$
test_func_call_cm_runtime[True-compile] 1.1335ms 1.0171ms 983.1668 Ops/s 963.3383 Ops/s $\color{#35bf28}+2.06\%$
test_func_call_cm_runtime[True-compile-overhead] 1.1356ms 1.0367ms 964.5977 Ops/s 962.8204 Ops/s $\color{#35bf28}+0.18\%$
test_vmap_func_call_cm_runtime[eager] 2.5661ms 2.1500ms 465.1239 Ops/s 470.9823 Ops/s $\color{#d91a1a}-1.24\%$
test_vmap_func_call_cm_runtime[compile] 0.9659ms 0.8643ms 1.1570 KOps/s 1.1955 KOps/s $\color{#d91a1a}-3.22\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4700ms 0.4235ms 2.3615 KOps/s 2.3709 KOps/s $\color{#d91a1a}-0.40\%$
test_distributed 2.7909ms 0.2422ms 4.1292 KOps/s 8.6675 KOps/s $\textbf{\color{#d91a1a}-52.36\%}$
test_tdmodule 44.6910μs 22.4433μs 44.5567 KOps/s 45.1764 KOps/s $\color{#d91a1a}-1.37\%$
test_tdmodule_dispatch 68.9110μs 40.0476μs 24.9703 KOps/s 25.0068 KOps/s $\color{#d91a1a}-0.15\%$
test_tdseq 44.6500μs 22.7940μs 43.8711 KOps/s 42.9991 KOps/s $\color{#35bf28}+2.03\%$
test_tdseq_dispatch 75.3400μs 42.8215μs 23.3527 KOps/s 22.5665 KOps/s $\color{#35bf28}+3.48\%$
test_instantiation_functorch 1.6446ms 1.5662ms 638.5039 Ops/s 648.7028 Ops/s $\color{#d91a1a}-1.57\%$
test_exec_functorch 0.2324ms 0.1496ms 6.6839 KOps/s 6.9029 KOps/s $\color{#d91a1a}-3.17\%$
test_exec_functional_call 0.1966ms 0.1435ms 6.9702 KOps/s 7.3380 KOps/s $\textbf{\color{#d91a1a}-5.01\%}$
test_exec_td_decorator 0.4021ms 0.1945ms 5.1418 KOps/s 5.3005 KOps/s $\color{#d91a1a}-2.99\%$
test_vmap_mlp_speed_decorator[True-True] 0.8790ms 0.7015ms 1.4255 KOps/s 1.4350 KOps/s $\color{#d91a1a}-0.67\%$
test_vmap_mlp_speed_decorator[True-False] 0.8429ms 0.6993ms 1.4301 KOps/s 1.4284 KOps/s $\color{#35bf28}+0.12\%$
test_vmap_mlp_speed_decorator[False-True] 0.7318ms 0.6091ms 1.6417 KOps/s 1.6612 KOps/s $\color{#d91a1a}-1.17\%$
test_vmap_mlp_speed_decorator[False-False] 0.7264ms 0.6087ms 1.6429 KOps/s 1.6461 KOps/s $\color{#d91a1a}-0.19\%$
test_vmap_transformer_speed_decorator[True-True] 20.1675ms 19.5461ms 51.1610 Ops/s 51.7959 Ops/s $\color{#d91a1a}-1.23\%$
test_vmap_transformer_speed_decorator[True-False] 19.6324ms 19.5180ms 51.2348 Ops/s 51.7528 Ops/s $\color{#d91a1a}-1.00\%$
test_vmap_transformer_speed_decorator[False-True] 19.4745ms 19.3335ms 51.7238 Ops/s 52.2647 Ops/s $\color{#d91a1a}-1.03\%$
test_vmap_transformer_speed_decorator[False-False] 19.4394ms 19.3715ms 51.6221 Ops/s 52.1214 Ops/s $\color{#d91a1a}-0.96\%$
test_to_module_speed[True] 1.4485ms 0.9691ms 1.0319 KOps/s 1.0325 KOps/s $\color{#d91a1a}-0.06\%$
test_to_module_speed[False] 1.0361ms 0.9562ms 1.0458 KOps/s 1.0456 KOps/s $\color{#35bf28}+0.02\%$
test_tc_init 67.9010μs 38.6690μs 25.8605 KOps/s 26.4893 KOps/s $\color{#d91a1a}-2.37\%$
test_tc_init_nested 0.1192ms 79.1525μs 12.6338 KOps/s 13.1101 KOps/s $\color{#d91a1a}-3.63\%$
test_tc_first_layer_tensor 23.9700μs 0.9584μs 1.0434 MOps/s 1.1396 MOps/s $\textbf{\color{#d91a1a}-8.44\%}$
test_tc_first_layer_nontensor 28.1800μs 2.4493μs 408.2762 KOps/s 402.4074 KOps/s $\color{#35bf28}+1.46\%$
test_tc_second_layer_tensor 13.7570μs 1.7559μs 569.4960 KOps/s 540.5025 KOps/s $\textbf{\color{#35bf28}+5.36\%}$
test_tc_second_layer_nontensor 27.4300μs 3.3577μs 297.8238 KOps/s 293.8103 KOps/s $\color{#35bf28}+1.37\%$
test_unbind 0.2152s 11.9934ms 83.3795 Ops/s 143.7281 Ops/s $\textbf{\color{#d91a1a}-41.99\%}$
test_full_like 9.2405ms 7.2872ms 137.2265 Ops/s 113.8462 Ops/s $\textbf{\color{#35bf28}+20.54\%}$
test_zeros_like 9.0693ms 7.1673ms 139.5219 Ops/s 231.4662 Ops/s $\textbf{\color{#d91a1a}-39.72\%}$
test_ones_like 4.9855ms 4.3288ms 231.0128 Ops/s 231.5418 Ops/s $\color{#d91a1a}-0.23\%$
test_clone 6.7900ms 6.3344ms 157.8672 Ops/s 157.1815 Ops/s $\color{#35bf28}+0.44\%$
test_squeeze 58.5300μs 10.0550μs 99.4534 KOps/s 104.2839 KOps/s $\color{#d91a1a}-4.63\%$
test_unsqueeze 0.1230ms 72.4242μs 13.8075 KOps/s 13.6013 KOps/s $\color{#35bf28}+1.52\%$
test_split 0.3748ms 0.1625ms 6.1535 KOps/s 6.5137 KOps/s $\textbf{\color{#d91a1a}-5.53\%}$
test_permute 0.2325ms 0.1880ms 5.3183 KOps/s 5.4250 KOps/s $\color{#d91a1a}-1.97\%$
test_stack 50.7436ms 50.3918ms 19.8445 Ops/s 20.0229 Ops/s $\color{#d91a1a}-0.89\%$
test_cat 50.5414ms 50.1643ms 19.9345 Ops/s 20.1400 Ops/s $\color{#d91a1a}-1.02\%$
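
The Change column appears to be the relative difference between the Ops column and the Ops on Repo HEAD column, after both are converted to the same unit. That formula is inferred from the rows above and is not stated by the benchmark bot, so the sketch below is an assumption. The helper names `parse_ops` and `percent_change` are hypothetical.

```python
# Assumed unit multipliers for the throughput strings used in the table.
_UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(text):
    """Convert a string such as '1.0648 KOps/s' to plain operations per second."""
    value, unit = text.split()
    return float(value) * _UNIT_SCALE[unit]


def percent_change(ops, ops_head):
    """Relative change of this PR's throughput against Repo HEAD, in percent.

    Assumption: change = (ops - ops_head) / ops_head * 100.
    """
    new, base = parse_ops(ops), parse_ops(ops_head)
    return (new - base) / base * 100.0


# test_distributed row: 4.1292 KOps/s against 8.6675 KOps/s on HEAD.
distributed = percent_change("4.1292 KOps/s", "8.6675 KOps/s")

# test_mod_wrap_and_backward[compile-overhead] row mixes KOps/s and Ops/s,
# so the units have to be normalized before comparing.
mixed_units = percent_change("1.0648 KOps/s", "923.7563 Ops/s")

print(f"{distributed:+.2f}%  {mixed_units:+.2f}%")
```

Under this assumption the two rows come out at roughly -52.36% and +15.27%, which matches the values reported in the table.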
