8000 [Doc] Fail-on-warning in sphinx by vmoens · Pull Request #1005 · pytorch/tensordict · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

[Doc] Fail-on-warning in sphinx #1005

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 7 commits into from
Sep 23, 2024
Merged

[Doc] Fail-on-warning in sphinx #1005

merged 7 commits into from
Sep 23, 2024

Conversation

vmoens
Copy link
Collaborator
@vmoens vmoens commented Sep 23, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Sep 23, 2024
ghstack-source-id: e4468f8
Pull Request resolved: #1005
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 23, 2024
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Sep 23, 2024
ghstack-source-id: cfc0f3e
Pull Request resolved: #1005
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Sep 23, 2024
ghstack-source-id: d831bc0
Pull Request resolved: #1005
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Sep 23, 2024
ghstack-source-id: d9dc3c5
Pull Request resolved: #1005
Copy link
github-actions bot commented Sep 23, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}35$. Worsened: $\large\color{#d91a1a}12$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 37.7310μs 19.3630μs 51.6448 KOps/s 48.4175 KOps/s $\textbf{\color{#35bf28}+6.67\%}$
test_plain_set_stack_nested 50.9750μs 19.1540μs 52.2083 KOps/s 48.7872 KOps/s $\textbf{\color{#35bf28}+7.01\%}$
test_plain_set_nested_inplace 54.1310μs 20.5210μs 48.7306 KOps/s 45.6790 KOps/s $\textbf{\color{#35bf28}+6.68\%}$
test_plain_set_stack_nested_inplace 54.4220μs 20.6403μs 48.4490 KOps/s 45.7726 KOps/s $\textbf{\color{#35bf28}+5.85\%}$
test_items 30.9070μs 4.2539μs 235.0794 KOps/s 233.0836 KOps/s $\color{#35bf28}+0.86\%$
test_items_nested 0.4316ms 0.3623ms 2.7603 KOps/s 2.7853 KOps/s $\color{#d91a1a}-0.90\%$
test_items_nested_locked 0.6078ms 0.3641ms 2.7462 KOps/s 2.7876 KOps/s $\color{#d91a1a}-1.48\%$
test_items_nested_leaf 0.1537ms 68.4753μs 14.6038 KOps/s 14.6707 KOps/s $\color{#d91a1a}-0.46\%$
test_items_stack_nested 0.7326ms 0.3660ms 2.7322 KOps/s 2.6349 KOps/s $\color{#35bf28}+3.70\%$
test_items_stack_nested_leaf 0.1523ms 72.4056μs 13.8111 KOps/s 14.3124 KOps/s $\color{#d91a1a}-3.50\%$
test_items_stack_nested_locked 0.6064ms 0.3633ms 2.7523 KOps/s 2.7253 KOps/s $\color{#35bf28}+0.99\%$
test_keys 33.6830μs 3.5658μs 280.4458 KOps/s 286.1951 KOps/s $\color{#d91a1a}-2.01\%$
test_keys_nested 0.1904ms 0.1009ms 9.9066 KOps/s 10.1400 KOps/s $\color{#d91a1a}-2.30\%$
test_keys_nested_locked 0.7275ms 0.1067ms 9.3692 KOps/s 9.5528 KOps/s $\color{#d91a1a}-1.92\%$
test_keys_nested_leaf 0.1823ms 82.3300μs 12.1462 KOps/s 12.2092 KOps/s $\color{#d91a1a}-0.52\%$
test_keys_stack_nested 0.1776ms 99.2001μs 10.0806 KOps/s 10.0050 KOps/s $\color{#35bf28}+0.76\%$
test_keys_stack_nested_leaf 0.1501ms 81.6818μs 12.2426 KOps/s 12.1014 KOps/s $\color{#35bf28}+1.17\%$
test_keys_stack_nested_locked 0.1880ms 0.1047ms 9.5502 KOps/s 9.6298 KOps/s $\color{#d91a1a}-0.83\%$
test_values 6.7708μs 1.0449μs 957.0111 KOps/s 944.1808 KOps/s $\color{#35bf28}+1.36\%$
test_values_nested 0.1373ms 75.0725μs 13.3205 KOps/s 13.1872 KOps/s $\color{#35bf28}+1.01\%$
test_values_nested_locked 0.1610ms 75.1237μs 13.3114 KOps/s 13.6825 KOps/s $\color{#d91a1a}-2.71\%$
test_values_nested_leaf 0.1150ms 61.7292μs 16.1998 KOps/s 16.1351 KOps/s $\color{#35bf28}+0.40\%$
test_values_stack_nested 0.1260ms 75.4684μs 13.2506 KOps/s 13.5821 KOps/s $\color{#d91a1a}-2.44\%$
test_values_stack_nested_leaf 0.1035ms 58.7517μs 17.0208 KOps/s 16.2745 KOps/s $\color{#35bf28}+4.59\%$
test_values_stack_nested_locked 0.1299ms 75.2101μs 13.2961 KOps/s 13.4422 KOps/s $\color{#d91a1a}-1.09\%$
test_membership 5.2470μs 0.7423μs 1.3472 MOps/s 1.1625 MOps/s $\textbf{\color{#35bf28}+15.89\%}$
test_membership_nested 35.1650μs 2.7149μs 368.3362 KOps/s 363.9042 KOps/s $\color{#35bf28}+1.22\%$
test_membership_nested_leaf 24.7460μs 2.7466μs 364.0894 KOps/s 364.8183 KOps/s $\color{#d91a1a}-0.20\%$
test_membership_stacked_nested 39.5040μs 2.7499μs 363.6433 KOps/s 372.9534 KOps/s $\color{#d91a1a}-2.50\%$
test_membership_stacked_nested_leaf 21.2190μs 2.7300μs 366.3023 KOps/s 347.3403 KOps/s $\textbf{\color{#35bf28}+5.46\%}$
test_membership_nested_last 33.0620μs 3.9386μs 253.8965 KOps/s 257.8892 KOps/s $\color{#d91a1a}-1.55\%$
test_membership_nested_leaf_last 31.1490μs 3.9470μs 253.3557 KOps/s 253.1708 KOps/s $\color{#35bf28}+0.07\%$
test_membership_stacked_nested_last 49.6020μs 12.8654μs 77.7279 KOps/s 254.8234 KOps/s $\textbf{\color{#d91a1a}-69.50\%}$
test_membership_stacked_nested_leaf_last 44.5440μs 12.7309μs 78.5493 KOps/s 253.3769 KOps/s $\textbf{\color{#d91a1a}-69.00\%}$
test_nested_getleaf 37.0290μs 10.6496μs 93.9004 KOps/s 92.6238 KOps/s $\color{#35bf28}+1.38\%$
test_nested_get 0.1333ms 10.3180μs 96.9182 KOps/s 96.0919 KOps/s $\color{#35bf28}+0.86\%$
test_stacked_getleaf 43.5610μs 10.5924μs 94.4071 KOps/s 92.6520 KOps/s $\color{#35bf28}+1.89\%$
test_stacked_get 35.7360μs 10.0847μs 99.1604 KOps/s 98.6996 KOps/s $\color{#35bf28}+0.47\%$
test_nested_getitemleaf 36.9790μs 10.9757μs 91.1101 KOps/s 90.2466 KOps/s $\color{#35bf28}+0.96\%$
test_nested_getitem 37.4000μs 10.2049μs 97.9919 KOps/s 96.3143 KOps/s $\color{#35bf28}+1.74\%$
test_stacked_getitemleaf 37.3600μs 11.0239μs 90.7118 KOps/s 89.8230 KOps/s $\color{#35bf28}+0.99\%$
test_stacked_getitem 0.3382ms 10.2587μs 97.4780 KOps/s 95.6537 KOps/s $\color{#35bf28}+1.91\%$
test_lock_nested 0.1014s 0.6001ms 1.6665 KOps/s 2.0772 KOps/s $\textbf{\color{#d91a1a}-19.77\%}$
test_lock_stack_nested 0.6657ms 0.4406ms 2.2698 KOps/s 2.1892 KOps/s $\color{#35bf28}+3.68\%$
test_unlock_nested 0.1015s 0.5135ms 1.9475 KOps/s 2.5171 KOps/s $\textbf{\color{#d91a1a}-22.63\%}$
test_unlock_stack_nested 0.5178ms 0.3570ms 2.8008 KOps/s 2.6429 KOps/s $\textbf{\color{#35bf28}+5.97\%}$
test_flatten_speed 0.1788ms 86.7382μs 11.5289 KOps/s 11.4524 KOps/s $\color{#35bf28}+0.67\%$
test_unflatten_speed 0.6861ms 0.4627ms 2.1612 KOps/s 2.1598 KOps/s $\color{#35bf28}+0.06\%$
test_common_ops 4.8339ms 1.0874ms 919.6670 Ops/s 903.6947 Ops/s $\color{#35bf28}+1.77\%$
test_creation 17.6330μs 2.0747μs 481.9949 KOps/s 488.1987 KOps/s $\color{#d91a1a}-1.27\%$
test_creation_empty 44.2120μs 16.0965μs 62.1253 KOps/s 56.6412 KOps/s $\textbf{\color{#35bf28}+9.68\%}$
test_creation_nested_1 97.7040μs 19.1350μs 52.2603 KOps/s 46.3759 KOps/s $\textbf{\color{#35bf28}+12.69\%}$
test_creation_nested_2 58.6900μs 23.3262μs 42.8702 KOps/s 40.4684 KOps/s $\textbf{\color{#35bf28}+5.94\%}$
test_clone 0.2345ms 17.1316μs 58.3717 KOps/s 57.0693 KOps/s $\color{#35bf28}+2.28\%$
test_getitem[int] 0.8773ms 16.3155μs 61.2914 KOps/s 58.9672 KOps/s $\color{#35bf28}+3.94\%$
test_getitem[slice_int] 0.1582ms 30.5061μs 32.7803 KOps/s 32.9767 KOps/s $\color{#d91a1a}-0.60\%$
test_getitem[range] 0.2081ms 57.0323μs 17.5339 KOps/s 17.6146 KOps/s $\color{#d91a1a}-0.46\%$
test_getitem[tuple] 0.1733ms 24.6451μs 40.5760 KOps/s 41.0634 KOps/s $\color{#d91a1a}-1.19\%$
test_getitem[list] 0.1864ms 52.9302μs 18.8928 KOps/s 19.2595 KOps/s $\color{#d91a1a}-1.90\%$
test_setitem_dim[int] 62.9180μs 31.7876μs 31.4588 KOps/s 31.7513 KOps/s $\color{#d91a1a}-0.92\%$
test_setitem_dim[slice_int] 0.1020ms 59.7202μs 16.7447 KOps/s 16.9998 KOps/s $\color{#d91a1a}-1.50\%$
test_setitem_dim[range] 0.1430ms 84.3103μs 11.8609 KOps/s 12.1801 KOps/s $\color{#d91a1a}-2.62\%$
test_setitem_dim[tuple] 80.4800μs 47.5048μs 21.0505 KOps/s 21.3117 KOps/s $\color{#d91a1a}-1.23\%$
test_setitem 0.2825ms 28.7866μs 34.7383 KOps/s 33.4507 KOps/s $\color{#35bf28}+3.85\%$
test_set 77.4250μs 27.5804μs 36.2577 KOps/s 33.7459 KOps/s $\textbf{\color{#35bf28}+7.44\%}$
test_set_shared 3.9177ms 0.2169ms 4.6095 KOps/s 4.6772 KOps/s $\color{#d91a1a}-1.45\%$
test_update 0.2445ms 33.6198μs 29.7444 KOps/s 27.5149 KOps/s $\textbf{\color{#35bf28}+8.10\%}$
test_update_nested 0.2416ms 44.7072μs 22.3678 KOps/s 21.9548 KOps/s $\color{#35bf28}+1.88\%$
test_update__nested 0.2139ms 34.9868μs 28.5822 KOps/s 28.8481 KOps/s $\color{#d91a1a}-0.92\%$
test_set_nested 0.2516ms 29.9622μs 33.3754 KOps/s 31.3945 KOps/s $\textbf{\color{#35bf28}+6.31\%}$
test_set_nested_new 0.1259ms 35.1703μs 28.4331 KOps/s 27.0317 KOps/s $\textbf{\color{#35bf28}+5.18\%}$
test_select 0.2836ms 52.2803μs 19.1277 KOps/s 18.5519 KOps/s $\color{#35bf28}+3.10\%$
test_select_nested 0.1675ms 58.8669μs 16.9875 KOps/s 17.1024 KOps/s $\color{#d91a1a}-0.67\%$
test_exclude_nested 0.1502ms 74.8214μs 13.3652 KOps/s 13.4594 KOps/s $\color{#d91a1a}-0.70\%$
test_empty[True] 0.5377ms 0.3185ms 3.1401 KOps/s 3.1593 KOps/s $\color{#d91a1a}-0.61\%$
test_empty[False] 7.7595μs 1.2218μs 818.4381 KOps/s 854.3075 KOps/s $\color{#d91a1a}-4.20\%$
test_unbind_speed 0.5157ms 0.3026ms 3.3050 KOps/s 3.3150 KOps/s $\color{#d91a1a}-0.30\%$
test_unbind_speed_stack0 0.4780ms 0.2860ms 3.4959 KOps/s 3.4360 KOps/s $\color{#35bf28}+1.75\%$
test_unbind_speed_stack1 0.1060s 0.8047ms 1.2428 KOps/s 1.3415 KOps/s $\textbf{\color{#d91a1a}-7.36\%}$
test_split 2.1394ms 1.9816ms 504.6516 Ops/s 457.6632 Ops/s $\textbf{\color{#35bf28}+10.27\%}$
test_chunk 98.3471ms 2.1836ms 457.9577 Ops/s 453.0205 Ops/s $\color{#35bf28}+1.09\%$
test_creation[device0] 0.2503ms 0.1174ms 8.5173 KOps/s 8.5654 KOps/s $\color{#d91a1a}-0.56\%$
test_creation_from_tensor 3.8820ms 0.1194ms 8.3759 KOps/s 8.5746 KOps/s $\color{#d91a1a}-2.32\%$
test_add_one[memmap_tensor0] 0.3702ms 7.0277μs 142.2944 KOps/s 131.6445 KOps/s $\textbf{\color{#35bf28}+8.09\%}$
test_contiguous[memmap_tensor0] 26.5200μs 1.9419μs 514.9484 KOps/s 522.2721 KOps/s $\color{#d91a1a}-1.40\%$
test_stack[memmap_tensor0] 70.1010μs 5.5819μs 179.1508 KOps/s 169.3074 KOps/s $\textbf{\color{#35bf28}+5.81\%}$
test_memmaptd_index 1.0912ms 0.4046ms 2.4714 KOps/s 2.4777 KOps/s $\color{#d91a1a}-0.25\%$
test_memmaptd_index_astensor 0.9730ms 0.4835ms 2.0681 KOps/s 2.0870 KOps/s $\color{#d91a1a}-0.90\%$
test_memmaptd_index_op 1.4923ms 0.9676ms 1.0335 KOps/s 980.7462 Ops/s $\textbf{\color{#35bf28}+5.37\%}$
test_serialize_model 0.2477s 0.1393s 7.1772 Ops/s 8.2543 Ops/s $\textbf{\color{#d91a1a}-13.05\%}$
test_serialize_model_pickle 0.4856s 0.4019s 2.4882 Ops/s 2.4473 Ops/s $\color{#35bf28}+1.67\%$
test_serialize_weights 0.1229s 0.1172s 8.5356 Ops/s 7.3293 Ops/s $\textbf{\color{#35bf28}+16.46\%}$
test_serialize_weights_returnearly 0.1923s 0.1631s 6.1315 Ops/s 6.3508 Ops/s $\color{#d91a1a}-3.45\%$
test_serialize_weights_pickle 1.1876s 0.7122s 1.4041 Ops/s 1.0854 Ops/s $\textbf{\color{#35bf28}+29.36\%}$
test_serialize_weights_filesystem 0.1494s 0.1409s 7.0950 Ops/s 6.9836 Ops/s $\color{#35bf28}+1.60\%$
test_serialize_model_filesystem 0.1523s 0.1453s 6.8804 Ops/s 6.2166 Ops/s $\textbf{\color{#35bf28}+10.68\%}$
test_reshape_pytree 76.7030μs 38.7988μs 25.7740 KOps/s 25.4089 KOps/s $\color{#35bf28}+1.44\%$
test_reshape_td 96.7310μs 45.7347μs 21.8652 KOps/s 21.3068 KOps/s $\color{#35bf28}+2.62\%$
test_view_pytree 0.1486ms 38.7465μs 25.8088 KOps/s 25.5586 KOps/s $\color{#35bf28}+0.98\%$
test_view_td 0.1118ms 53.0739μs 18.8417 KOps/s 19.3765 KOps/s $\color{#d91a1a}-2.76\%$
test_unbind_pytree 91.8220μs 35.9267μs 27.8344 KOps/s 27.7639 KOps/s $\color{#35bf28}+0.25\%$
test_unbind_td 0.3040ms 44.6485μs 22.3972 KOps/s 22.3757 KOps/s $\color{#35bf28}+0.10\%$
test_split_pytree 80.9920μs 37.8762μs 26.4018 KOps/s 25.9639 KOps/s $\color{#35bf28}+1.69\%$
test_split_td 0.4907ms 58.7591μs 17.0186 KOps/s 17.6150 KOps/s $\color{#d91a1a}-3.39\%$
test_add_pytree 0.1013ms 44.4922μs 22.4758 KOps/s 22.0370 KOps/s $\color{#35bf28}+1.99\%$
test_add_td 0.1773ms 77.2549μs 12.9442 KOps/s 12.0616 KOps/s $\textbf{\color{#35bf28}+7.32\%}$
test_compile_add_one_nested[tensordict-compile] 0.1322ms 59.7945μs 16.7239 KOps/s 17.0434 KOps/s $\color{#d91a1a}-1.87\%$
test_compile_add_one_nested[tensordict-eager] 0.3315ms 0.1796ms 5.5684 KOps/s 5.5963 KOps/s $\color{#d91a1a}-0.50\%$
test_compile_add_one_nested[pytree-compile] 0.1400ms 57.4290μs 17.4128 KOps/s 17.4651 KOps/s $\color{#d91a1a}-0.30\%$
test_compile_add_one_nested[pytree-eager] 0.3304ms 0.1412ms 7.0828 KOps/s 7.1936 KOps/s $\color{#d91a1a}-1.54\%$
test_compile_copy_nested[tensordict-compile] 90.6690μs 20.8230μs 48.0237 KOps/s 44.9735 KOps/s $\textbf{\color{#35bf28}+6.78\%}$
test_compile_copy_nested[tensordict-eager] 0.1539ms 67.4890μs 14.8172 KOps/s 15.0035 KOps/s $\color{#d91a1a}-1.24\%$
test_compile_copy_nested[pytree-compile] 0.1435ms 76.7952μs 13.0216 KOps/s 13.1918 KOps/s $\color{#d91a1a}-1.29\%$
test_compile_copy_nested[pytree-eager] 0.1547ms 68.5027μs 14.5980 KOps/s 14.6532 KOps/s $\color{#d91a1a}-0.38\%$
test_compile_add_one_flat[tensordict-compile] 0.2808ms 0.1762ms 5.6740 KOps/s 5.7538 KOps/s $\color{#d91a1a}-1.39\%$
test_compile_add_one_flat[tensordict-eager] 0.3486ms 0.1916ms 5.2194 KOps/s 5.3268 KOps/s $\color{#d91a1a}-2.02\%$
test_compile_add_one_flat[tensorclass-compile] 0.1285ms 47.2576μs 21.1606 KOps/s 21.7385 KOps/s $\color{#d91a1a}-2.66\%$
test_compile_add_one_flat[tensorclass-eager] 0.1617ms 69.4993μs 14.3886 KOps/s 14.5557 KOps/s $\color{#d91a1a}-1.15\%$
test_compile_add_one_flat[pytree-compile] 0.3257ms 0.1757ms 5.6902 KOps/s 5.7296 KOps/s $\color{#d91a1a}-0.69\%$
test_compile_add_one_flat[pytree-eager] 0.5968ms 0.2852ms 3.5065 KOps/s 3.4514 KOps/s $\color{#35bf28}+1.60\%$
test_compile_add_self_flat[tensordict-eager] 0.3620ms 0.2033ms 4.9179 KOps/s 4.9776 KOps/s $\color{#d91a1a}-1.20\%$
test_compile_add_self_flat[tensordict-compile] 0.3278ms 0.1741ms 5.7444 KOps/s 5.6457 KOps/s $\color{#35bf28}+1.75\%$
test_compile_add_self_flat[tensorclass-eager] 0.1284ms 62.0863μs 16.1066 KOps/s 16.0683 KOps/s $\color{#35bf28}+0.24\%$
test_compile_add_self_flat[tensorclass-compile] 0.1195ms 47.1289μs 21.2184 KOps/s 21.1926 KOps/s $\color{#35bf28}+0.12\%$
test_compile_add_self_flat[pytree-eager] 0.4252ms 0.2314ms 4.3206 KOps/s 4.2564 KOps/s $\color{#35bf28}+1.51\%$
test_compile_add_self_flat[pytree-compile] 0.2948ms 0.1737ms 5.7554 KOps/s 5.6994 KOps/s $\color{#35bf28}+0.98\%$
test_compile_copy_flat[tensordict-compile] 0.1977ms 0.1027ms 9.7416 KOps/s 9.3818 KOps/s $\color{#35bf28}+3.84\%$
test_compile_copy_flat[tensordict-eager] 0.1334ms 56.5986μs 17.6683 KOps/s 17.5862 KOps/s $\color{#35bf28}+0.47\%$
test_compile_copy_flat[pytree-compile] 0.1459ms 76.7830μs 13.0237 KOps/s 12.9335 KOps/s $\color{#35bf28}+0.70\%$
test_compile_copy_flat[pytree-eager] 0.1527ms 69.2007μs 14.4507 KOps/s 14.5637 KOps/s $\color{#d91a1a}-0.78\%$
test_compile_assign_and_add[tensordict-compile] 0.2883ms 0.1939ms 5.1586 KOps/s 5.1336 KOps/s $\color{#35bf28}+0.49\%$
test_compile_assign_and_add[tensordict-eager] 2.1136ms 1.6262ms 614.9124 Ops/s 601.6112 Ops/s $\color{#35bf28}+2.21\%$
test_compile_assign_and_add[pytree-compile] 0.4318ms 0.1971ms 5.0726 KOps/s 5.2650 KOps/s $\color{#d91a1a}-3.65\%$
test_compile_assign_and_add[pytree-eager] 1.3704ms 1.0888ms 918.4302 Ops/s 899.1028 Ops/s $\color{#35bf28}+2.15\%$
test_compile_assign_and_add_stack[compile] 0.5576ms 0.4195ms 2.3837 KOps/s 2.3960 KOps/s $\color{#d91a1a}-0.52\%$
test_compile_assign_and_add_stack[eager] 4.3941ms 3.5186ms 284.2026 Ops/s 268.5328 Ops/s $\textbf{\color{#35bf28}+5.84\%}$
test_compile_indexing[tensor-tensordict-compile] 0.1489ms 35.9120μs 27.8458 KOps/s 29.9748 KOps/s $\textbf{\color{#d91a1a}-7.10\%}$
test_compile_indexing[tensor-tensordict-eager] 1.4074ms 48.8517μs 20.4701 KOps/s 20.6879 KOps/s $\color{#d91a1a}-1.05\%$
test_compile_indexing[tensor-tensorclass-compile] 89.9580μs 30.2539μs 33.0536 KOps/s 33.8677 KOps/s $\color{#d91a1a}-2.40\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1126ms 28.7001μs 34.8430 KOps/s 34.2226 KOps/s $\color{#35bf28}+1.81\%$
test_compile_indexing[tensor-pytree-compile] 0.1028ms 30.4622μs 32.8276 KOps/s 34.1645 KOps/s $\color{#d91a1a}-3.91\%$
test_compile_indexing[tensor-pytree-eager] 0.1166ms 28.3481μs 35.2757 KOps/s 34.2554 KOps/s $\color{#35bf28}+2.98\%$
test_compile_indexing[slice-tensordict-compile] 0.1639ms 74.6215μs 13.4010 KOps/s 13.6225 KOps/s $\color{#d91a1a}-1.63\%$
test_compile_indexing[slice-tensordict-eager] 0.6299ms 27.5536μs 36.2929 KOps/s 36.0863 KOps/s $\color{#35bf28}+0.57\%$
test_compile_indexing[slice-tensorclass-compile] 0.1538ms 69.2081μs 14.4492 KOps/s 14.9175 KOps/s $\color{#d91a1a}-3.14\%$
test_compile_indexing[slice-tensorclass-eager] 97.5110μs 23.3445μs 42.8367 KOps/s 42.8353 KOps/s $+0.00\%$
test_compile_indexing[slice-pytree-compile] 0.1581ms 68.2295μs 14.6564 KOps/s 14.9245 KOps/s $\color{#d91a1a}-1.80\%$
test_compile_indexing[slice-pytree-eager] 80.9310μs 23.6028μs 42.3679 KOps/s 42.3906 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_indexing[int-tensordict-compile] 0.1465ms 73.8141μs 13.5476 KOps/s 13.7200 KOps/s $\color{#d91a1a}-1.26\%$
test_compile_indexing[int-tensordict-eager] 1.2763ms 27.3466μs 36.5676 KOps/s 37.2734 KOps/s $\color{#d91a1a}-1.89\%$
test_compile_indexing[int-tensorclass-compile] 0.1555ms 68.7383μs 14.5479 KOps/s 14.7869 KOps/s $\color{#d91a1a}-1.62\%$
test_compile_indexing[int-tensorclass-eager] 0.1021ms 23.2200μs 43.0663 KOps/s 42.7873 KOps/s $\color{#35bf28}+0.65\%$
test_compile_indexing[int-pytree-compile] 0.1507ms 68.1849μs 14.6660 KOps/s 14.9284 KOps/s $\color{#d91a1a}-1.76\%$
test_compile_indexing[int-pytree-eager] 79.0480μs 23.1102μs 43.2710 KOps/s 43.2992 KOps/s $\color{#d91a1a}-0.07\%$
test_mod_add[eager] 97.9320μs 24.6981μs 40.4889 KOps/s 38.8483 KOps/s $\color{#35bf28}+4.22\%$
test_mod_add[compile] 0.1078ms 41.1395μs 24.3075 KOps/s 26.0885 KOps/s $\textbf{\color{#d91a1a}-6.83\%}$
test_mod_add[compile-overhead] 0.1300ms 41.0407μs 24.3661 KOps/s 25.8408 KOps/s $\textbf{\color{#d91a1a}-5.71\%}$
test_mod_wrap[eager] 0.3623ms 0.2129ms 4.6965 KOps/s 4.9012 KOps/s $\color{#d91a1a}-4.18\%$
test_mod_wrap[compile] 0.4058ms 0.2407ms 4.1547 KOps/s 4.3022 KOps/s $\color{#d91a1a}-3.43\%$
test_mod_wrap[compile-overhead] 0.3569ms 0.2384ms 4.1947 KOps/s 4.3695 KOps/s $\color{#d91a1a}-4.00\%$
test_mod_wrap_and_backward[eager] 12.8955ms 11.2763ms 88.6815 Ops/s 79.9017 Ops/s $\textbf{\color{#35bf28}+10.99\%}$
test_mod_wrap_and_backward[compile] 15.0703ms 11.7139ms 85.3687 Ops/s 83.0481 Ops/s $\color{#35bf28}+2.79\%$
test_mod_wrap_and_backward[compile-overhead] 19.3649ms 11.9230ms 83.8716 Ops/s 71.7991 Ops/s $\textbf{\color{#35bf28}+16.81\%}$
test_seq_add[eager] 0.2093ms 92.9276μs 10.7611 KOps/s 11.1616 KOps/s $\color{#d91a1a}-3.59\%$
test_seq_add[compile] 0.1535ms 66.9232μs 14.9425 KOps/s 15.2441 KOps/s $\color{#d91a1a}-1.98\%$
test_seq_add[compile-overhead] 0.1792ms 65.1159μs 15.3572 KOps/s 15.4493 KOps/s $\color{#d91a1a}-0.60\%$
test_seq_wrap[eager] 0.6427ms 0.3837ms 2.6065 KOps/s 2.5735 KOps/s $\color{#35bf28}+1.28\%$
test_seq_wrap[compile] 1.2851ms 0.2763ms 3.6196 KOps/s 3.6585 KOps/s $\color{#d91a1a}-1.06\%$
test_seq_wrap[compile-overhead] 1.4295ms 0.2804ms 3.5666 KOps/s 3.6687 KOps/s $\color{#d91a1a}-2.78\%$
test_func_call_runtime[False-eager] 0.9793ms 0.5394ms 1.8541 KOps/s 1.9347 KOps/s $\color{#d91a1a}-4.17\%$
test_func_call_runtime[False-compile] 0.6623ms 0.5064ms 1.9748 KOps/s 1.9746 KOps/s $\color{#35bf28}+0.01\%$
test_func_call_runtime[False-compile-overhead] 0.6394ms 0.5035ms 1.9861 KOps/s 1.9745 KOps/s $\color{#35bf28}+0.59\%$
test_func_call_runtime[True-eager] 1.2309ms 0.7577ms 1.3198 KOps/s 1.3391 KOps/s $\color{#d91a1a}-1.44\%$
test_func_call_runtime[True-compile] 0.9528ms 0.5189ms 1.9271 KOps/s 1.9215 KOps/s $\color{#35bf28}+0.29\%$
test_func_call_runtime[True-compile-overhead] 0.8645ms 0.5224ms 1.9141 KOps/s 1.9416 KOps/s $\color{#d91a1a}-1.42\%$
test_func_call_cm_runtime[False-eager] 0.9148ms 0.5411ms 1.8481 KOps/s 1.9372 KOps/s $\color{#d91a1a}-4.60\%$
test_func_call_cm_runtime[False-compile] 0.6499ms 0.5095ms 1.9627 KOps/s 1.9935 KOps/s $\color{#d91a1a}-1.54\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6903ms 0.5103ms 1.9597 KOps/s 1.9946 KOps/s $\color{#d91a1a}-1.75\%$
test_func_call_cm_runtime[True-eager] 1.1461ms 0.8861ms 1.1285 KOps/s 1.1357 KOps/s $\color{#d91a1a}-0.64\%$
test_func_call 8000 _cm_runtime[True-compile] 0.9260ms 0.7552ms 1.3242 KOps/s 1.3399 KOps/s $\color{#d91a1a}-1.17\%$
test_func_call_cm_runtime[True-compile-overhead] 1.1976ms 0.7585ms 1.3185 KOps/s 1.3343 KOps/s $\color{#d91a1a}-1.19\%$
test_vmap_func_call_cm_runtime[eager] 2.6162ms 1.9079ms 524.1446 Ops/s 520.9132 Ops/s $\color{#35bf28}+0.62\%$
test_vmap_func_call_cm_runtime[compile] 3.0361ms 1.9644ms 509.0733 Ops/s 515.2612 Ops/s $\color{#d91a1a}-1.20\%$
test_vmap_func_call_cm_runtime[compile-overhead] 6.9322ms 1.9748ms 506.3846 Ops/s 514.4167 Ops/s $\color{#d91a1a}-1.56\%$
test_distributed 0.2503ms 0.1249ms 8.0084 KOps/s 7.7680 KOps/s $\color{#35bf28}+3.10\%$
test_tdmodule 46.2960μs 17.0260μs 58.7336 KOps/s 55.2037 KOps/s $\textbf{\color{#35bf28}+6.39\%}$
test_tdmodule_dispatch 70.6020μs 33.9323μs 29.4704 KOps/s 27.5465 KOps/s $\textbf{\color{#35bf28}+6.98\%}$
test_tdseq 42.5590μs 19.9462μs 50.1349 KOps/s 47.3193 KOps/s $\textbf{\color{#35bf28}+5.95\%}$
test_tdseq_dispatch 80.1400μs 38.7562μs 25.8023 KOps/s 23.4444 KOps/s $\textbf{\color{#35bf28}+10.06\%}$
test_instantiation_functorch 1.7063ms 1.5855ms 630.7291 Ops/s 637.0953 Ops/s $\color{#d91a1a}-1.00\%$
test_instantiation_td 2.6686ms 1.1936ms 837.8037 Ops/s 848.8903 Ops/s $\color{#d91a1a}-1.31\%$
test_exec_functorch 0.4517ms 0.1905ms 5.2480 KOps/s 5.5065 KOps/s $\color{#d91a1a}-4.70\%$
test_exec_functional_call 0.3301ms 0.1782ms 5.6127 KOps/s 5.8775 KOps/s $\color{#d91a1a}-4.50\%$
test_exec_td 0.4114ms 0.1782ms 5.6102 KOps/s 6.0076 KOps/s $\textbf{\color{#d91a1a}-6.61\%}$
test_exec_td_decorator 0.3793ms 0.2277ms 4.3917 KOps/s 4.5136 KOps/s $\color{#d91a1a}-2.70\%$
test_vmap_mlp_speed[True-True] 1.1121ms 0.6574ms 1.5212 KOps/s 1.5282 KOps/s $\color{#d91a1a}-0.46\%$
test_vmap_mlp_speed[True-False] 0.8354ms 0.6552ms 1.5263 KOps/s 1.5356 KOps/s $\color{#d91a1a}-0.60\%$
test_vmap_mlp_speed[False-True] 0.8285ms 0.5097ms 1.9618 KOps/s 1.9854 KOps/s $\color{#d91a1a}-1.19\%$
test_vmap_mlp_speed[False-False] 1.9622ms 0.5212ms 1.9188 KOps/s 1.9736 KOps/s $\color{#d91a1a}-2.78\%$
test_vmap_mlp_speed_decorator[True-True] 1.3497ms 0.6348ms 1.5753 KOps/s 1.5723 KOps/s $\color{#35bf28}+0.19\%$
test_vmap_mlp_speed_decorator[True-False] 1.0701ms 0.6360ms 1.5724 KOps/s 1.5753 KOps/s $\color{#d91a1a}-0.18\%$
test_vmap_mlp_speed_decorator[False-True] 0.8002ms 0.5254ms 1.9033 KOps/s 1.9296 KOps/s $\color{#d91a1a}-1.36\%$
test_vmap_mlp_speed_decorator[False-False] 1.1677ms 0.5286ms 1.8919 KOps/s 1.9250 KOps/s $\color{#d91a1a}-1.72\%$
test_to_module_speed[True] 2.0876ms 1.3089ms 763.9970 Ops/s 772.2036 Ops/s $\color{#d91a1a}-1.06\%$
test_to_module_speed[False] 2.0770ms 1.2897ms 775.3959 Ops/s 786.2237 Ops/s $\color{#d91a1a}-1.38\%$
test_tc_init 0.1111ms 41.5081μs 24.0917 KOps/s 22.8330 KOps/s $\textbf{\color{#35bf28}+5.51\%}$
test_tc_init_nested 0.1676ms 83.7522μs 11.9400 KOps/s 11.4349 KOps/s $\color{#35bf28}+4.42\%$
test_tc_first_layer_tensor 21.6300μs 1.5393μs 649.6576 KOps/s 658.9524 KOps/s $\color{#d91a1a}-1.41\%$
test_tc_first_layer_nontensor 26.2090μs 4.6925μs 213.1040 KOps/s 216.2316 KOps/s $\color{#d91a1a}-1.45\%$
test_tc_second_layer_tensor 0.1481ms 3.0179μs 331.3560 KOps/s 356.5383 KOps/s $\textbf{\color{#d91a1a}-7.06\%}$
test_tc_second_layer_nontensor 92.4930μs 6.0357μs 165.6798 KOps/s 168.7859 KOps/s $\color{#d91a1a}-1.84\%$
test_unbind 0.5148s 13.8986ms 71.9495 Ops/s 71.9781 Ops/s $\color{#d91a1a}-0.04\%$
test_full_like 9.9687ms 8.8616ms 112.8460 Ops/s 71.0170 Ops/s $\textbf{\color{#35bf28}+58.90\%}$
test_zeros_like 3.9354ms 3.3556ms 298.0072 Ops/s 135.3500 Ops/s $\textbf{\color{#35bf28}+120.18\%}$
test_ones_like 12.1390ms 6.6810ms 149.6793 Ops/s 128.5620 Ops/s $\textbf{\color{#35bf28}+16.43\%}$
test_clone 13.5520ms 8.5535ms 116.9114 Ops/s 102.4368 Ops/s $\textbf{\color{#35bf28}+14.13\%}$
test_squeeze 71.1730μs 12.6525μs 79.0355 KOps/s 80.6989 KOps/s $\color{#d91a1a}-2.06\%$
test_unsqueeze 0.3757ms 92.9416μs 10.7594 KOps/s 11.0986 KOps/s $\color{#d91a1a}-3.06\%$
test_split 0.3506ms 0.1951ms 5.1250 KOps/s 5.2025 KOps/s $\color{#d91a1a}-1.49\%$
test_permute 0.5534ms 0.2327ms 4.2976 KOps/s 4.6083 KOps/s $\textbf{\color{#d91a1a}-6.74\%}$
test_stack 33.9224ms 27.0381ms 36.9848 Ops/s 37.9305 Ops/s $\color{#d91a1a}-2.49\%$
test_cat 30.9209ms 26.7265ms 37.4160 Ops/s 37.5524 Ops/s $\color{#d91a1a}-0.36\%$

Copy link
github-actions bot commented Sep 23, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}26$. Worsened: $\large\color{#d91a1a}8$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1432ms 14.0830μs 71.0077 KOps/s 68.4562 KOps/s $\color{#35bf28}+3.73\%$
test_plain_set_stack_nested 41.2320μs 14.0611μs 71.1184 KOps/s 67.5806 KOps/s $\textbf{\color{#35bf28}+5.23\%}$
test_plain_set_nested_inplace 59.3130μs 15.1850μs 65.8546 KOps/s 62.5899 KOps/s $\textbf{\color{#35bf28}+5.22\%}$
test_plain_set_stack_nested_inplace 42.6420μs 15.1422μs 66.0405 KOps/s 63.2514 KOps/s $\color{#35bf28}+4.41\%$
test_items 34.8310μs 2.8834μs 346.8094 KOps/s 340.9742 KOps/s $\color{#35bf28}+1.71\%$
test_items_nested 0.3661ms 0.3282ms 3.0474 KOps/s 3.1097 KOps/s $\color{#d91a1a}-2.00\%$
test_items_nested_locked 0.4055ms 0.3270ms 3.0579 KOps/s 3.0692 KOps/s $\color{#d91a1a}-0.37\%$
test_items_nested_leaf 74.3040μs 55.3585μs 18.0641 KOps/s 18.0315 KOps/s $\color{#35bf28}+0.18\%$
test_items_stack_nested 0.4427ms 0.3253ms 3.0741 KOps/s 3.0883 KOps/s $\color{#d91a1a}-0.46\%$
test_items_stack_nested_leaf 89.5740μs 55.7232μs 17.9458 KOps/s 17.6316 KOps/s $\color{#35bf28}+1.78\%$
test_items_stack_nested_locked 0.4352ms 0.3293ms 3.0372 KOps/s 3.0557 KOps/s $\color{#d91a1a}-0.61\%$
test_keys 28.4210μs 3.6767μs 271.9815 KOps/s 291.8893 KOps/s $\textbf{\color{#d91a1a}-6.82\%}$
test_keys_nested 82.1740μs 56.4288μs 17.7214 KOps/s 17.9544 KOps/s $\color{#d91a1a}-1.30\%$
test_keys_nested_locked 0.7115ms 62.0100μs 16.1264 KOps/s 16.2372 KOps/s $\color{#d91a1a}-0.68\%$
test_keys_nested_leaf 93.7750μs 46.9158μs 21.3148 KOps/s 21.8966 KOps/s $\color{#d91a1a}-2.66\%$
test_keys_stack_nested 94.9250μs 57.0546μs 17.5271 KOps/s 17.9864 KOps/s $\color{#d91a1a}-2.55\%$
test_keys_stack_nested_leaf 89.8940μs 47.6435μs 20.9892 KOps/s 21.1100 KOps/s $\color{#d91a1a}-0.57\%$
test_keys_stack_nested_locked 0.1054ms 61.6978μs 16.2080 KOps/s 16.5113 KOps/s $\color{#d91a1a}-1.84\%$
test_values 5.4737μs 0.8601μs 1.1627 MOps/s 1.1729 MOps/s $\color{#d91a1a}-0.87\%$
test_values_nested 71.4240μs 40.6843μs 24.5795 KOps/s 24.6582 KOps/s $\color{#d91a1a}-0.32\%$
test_values_nested_locked 74.9140μs 42.5722μs 23.4895 KOps/s 23.3793 KOps/s $\color{#35bf28}+0.47\%$
test_values_nested_leaf 75.5140μs 35.3736μs 28.2697 KOps/s 28.3379 KOps/s $\color{#d91a1a}-0.24\%$
test_values_stack_nested 72.6230μs 41.0647μs 24.3518 KOps/s 24.1804 KOps/s $\color{#35bf28}+0.71\%$
test_values_stack_nested_leaf 78.7740μs 35.7892μs 27.9414 KOps/s 28.1166 KOps/s $\color{#d91a1a}-0.62\%$
test_values_stack_nested_locked 88.7240μs 42.7943μs 23.3676 KOps/s 23.1748 KOps/s $\color{#35bf28}+0.83\%$
test_membership 1.8026μs 0.5086μs 1.9662 MOps/s 1.9780 MOps/s $\color{#d91a1a}-0.60\%$
test_membership_nested 11.3055μs 1.8755μs 533.1777 KOps/s 546.9621 KOps/s $\color{#d91a1a}-2.52\%$
test_membership_nested_leaf 20.2460μs 1.8759μs 533.0646 KOps/s 548.0033 KOps/s $\color{#d91a1a}-2.73\%$
test_membership_stacked_nested 25.8020μs 1.9235μs 519.8935 KOps/s 526.6308 KOps/s $\color{#d91a1a}-1.28\%$
test_membership_stacked_nested_leaf 0.1072ms 1.9020μs 525.7599 KOps/s 520.5789 KOps/s $\color{#35bf28}+1.00\%$
test_membership_nested_last 26.3720μs 8000 2.7836μs 359.2426 KOps/s 359.2460 KOps/s $-0.00\%$
test_membership_nested_leaf_last 34.0320μs 2.7935μs 357.9715 KOps/s 360.4569 KOps/s $\color{#d91a1a}-0.69\%$
test_membership_stacked_nested_last 28.6420μs 2.7602μs 362.2882 KOps/s 128.2886 KOps/s $\textbf{\color{#35bf28}+182.40\%}$
test_membership_stacked_nested_leaf_last 27.6720μs 2.7351μs 365.6120 KOps/s 127.7125 KOps/s $\textbf{\color{#35bf28}+186.28\%}$
test_nested_getleaf 36.6920μs 6.0389μs 165.5922 KOps/s 163.0773 KOps/s $\color{#35bf28}+1.54\%$
test_nested_get 34.7110μs 5.6824μs 175.9818 KOps/s 172.4071 KOps/s $\color{#35bf28}+2.07\%$
test_stacked_getleaf 50.3430μs 5.9823μs 167.1591 KOps/s 167.0904 KOps/s $\color{#35bf28}+0.04\%$
test_stacked_get 34.7520μs 5.6153μs 178.0856 KOps/s 176.0323 KOps/s $\color{#35bf28}+1.17\%$
test_nested_getitemleaf 28.5020μs 6.1493μs 162.6203 KOps/s 163.5780 KOps/s $\color{#d91a1a}-0.59\%$
test_nested_getitem 34.7020μs 5.7400μs 174.2149 KOps/s 172.1475 KOps/s $\color{#35bf28}+1.20\%$
test_stacked_getitemleaf 30.7620μs 6.0807μs 164.4554 KOps/s 164.5559 KOps/s $\color{#d91a1a}-0.06\%$
test_stacked_getitem 36.2120μs 5.6541μs 176.8620 KOps/s 176.6777 KOps/s $\color{#35bf28}+0.10\%$
test_lock_nested 10.2877ms 0.4142ms 2.4144 KOps/s 2.3985 KOps/s $\color{#35bf28}+0.66\%$
test_lock_stack_nested 0.4312ms 0.3741ms 2.6732 KOps/s 2.7320 KOps/s $\color{#d91a1a}-2.15\%$
test_unlock_nested 0.7649ms 0.3482ms 2.8723 KOps/s 2.8259 KOps/s $\color{#35bf28}+1.64\%$
test_unlock_stack_nested 0.3863ms 0.3133ms 3.1915 KOps/s 3.2742 KOps/s $\color{#d91a1a}-2.53\%$
test_flatten_speed 0.1434ms 68.7059μs 14.5548 KOps/s 14.5253 KOps/s $\color{#35bf28}+0.20\%$
test_unflatten_speed 0.3275ms 0.2825ms 3.5395 KOps/s 3.5269 KOps/s $\color{#35bf28}+0.36\%$
test_common_ops 1.6216ms 1.2376ms 808.0121 Ops/s 773.6844 Ops/s $\color{#35bf28}+4.44\%$
test_creation 28.0310μs 1.4982μs 667.4722 KOps/s 665.1768 KOps/s $\color{#35bf28}+0.35\%$
test_creation_empty 43.7820μs 15.9209μs 62.8107 KOps/s 57.3229 KOps/s $\textbf{\color{#35bf28}+9.57\%}$
test_creation_nested_1 69.2930μs 17.6995μs 56.4988 KOps/s 51.2365 KOps/s $\textbf{\color{#35bf28}+10.27\%}$
test_creation_nested_2 51.4530μs 20.1672μs 49.5854 KOps/s 46.3741 KOps/s $\textbf{\color{#35bf28}+6.92\%}$
test_clone 63.8030μs 28.6773μs 34.8708 KOps/s 33.9117 KOps/s $\color{#35bf28}+2.83\%$
test_getitem[int] 1.2279ms 15.1992μs 65.7931 KOps/s 64.2361 KOps/s $\color{#35bf28}+2.42\%$
test_getitem[slice_int] 0.1196ms 26.9041μs 37.1690 KOps/s 36.2584 KOps/s $\color{#35bf28}+2.51\%$
test_getitem[range] 0.2281ms 0.1079ms 9.2683 KOps/s 9.3448 KOps/s $\color{#d91a1a}-0.82\%$
test_getitem[tuple] 0.1215ms 24.1373μs 41.4296 KOps/s 43.3546 KOps/s $\color{#d91a1a}-4.44\%$
test_getitem[list] 0.2105ms 0.1042ms 9.6009 KOps/s 10.2311 KOps/s $\textbf{\color{#d91a1a}-6.16\%}$
test_setitem_dim[int] 98.8450μs 47.4532μs 21.0734 KOps/s 22.4556 KOps/s $\textbf{\color{#d91a1a}-6.16\%}$
test_setitem_dim[slice_int] 90.1840μs 66.1341μs 15.1208 KOps/s 14.9350 KOps/s $\color{#35bf28}+1.24\%$
test_setitem_dim[range] 0.1748ms 0.1262ms 7.9256 KOps/s 7.8950 KOps/s $\color{#35bf28}+0.39\%$
test_setitem_dim[tuple] 92.3650μs 60.5638μs 16.5115 KOps/s 16.6472 KOps/s $\color{#d91a1a}-0.81\%$
test_setitem 66.4130μs 41.7871μs 23.9308 KOps/s 23.3323 KOps/s $\color{#35bf28}+2.57\%$
test_set 69.3430μs 40.4671μs 24.7114 KOps/s 23.5618 KOps/s $\color{#35bf28}+4.88\%$
test_set_shared 0.3526ms 50.2696μs 19.8928 KOps/s 19.6987 KOps/s $\color{#35bf28}+0.98\%$
test_update 92.0240μs 49.6441μs 20.1434 KOps/s 19.2190 KOps/s $\color{#35bf28}+4.81\%$
test_update_nested 95.5850μs 56.3216μs 17.7552 KOps/s 16.4592 KOps/s $\textbf{\color{#35bf28}+7.87\%}$
test_update__nested 95.7740μs 59.0185μs 16.9438 KOps/s 15.2531 KOps/s $\textbf{\color{#35bf28}+11.08\%}$
test_set_nested 86.1040μs 43.0625μs 23.2221 KOps/s 20.5377 KOps/s $\textbf{\color{#35bf28}+13.07\%}$
test_set_nested_new 84.8740μs 46.1469μs 21.6699 KOps/s 20.6552 KOps/s $\color{#35bf28}+4.91\%$
test_select 96.9250μs 60.1629μs 16.6215 KOps/s 15.6684 KOps/s $\textbf{\color{#35bf28}+6.08\%}$
test_select_nested 80.4740μs 41.4030μs 24.1528 KOps/s 24.0192 KOps/s $\color{#35bf28}+0.56\%$
test_exclude_nested 86.7240μs 58.5433μs 17.0814 KOps/s 17.0949 KOps/s $\color{#d91a1a}-0.08\%$
test_empty[True] 0.2807ms 0.2487ms 4.0210 KOps/s 4.0777 KOps/s $\color{#d91a1a}-1.39\%$
test_empty[False] 4.0362μs 0.7395μs 1.3522 MOps/s 1.3623 MOps/s $\color{#d91a1a}-0.74\%$
test_to 57.8730μs 25.6451μs 38.9938 KOps/s 41.4183 KOps/s $\textbf{\color{#d91a1a}-5.85\%}$
test_to_nonblocking 58.1730μs 24.2999μs 41.1525 KOps/s 42.9836 KOps/s $\color{#d91a1a}-4.26\%$
test_unbind_speed 1.6755ms 0.2683ms 3.7269 KOps/s 3.6795 KOps/s $\color{#35bf28}+1.29\%$
test_unbind_speed_stack0 0.3507ms 0.2699ms 3.7055 KOps/s 3.8208 KOps/s $\color{#d91a1a}-3.02\%$
test_unbind_speed_stack1 92.4087ms 0.6949ms 1.4391 KOps/s 1.4556 KOps/s $\color{#d91a1a}-1.13\%$
test_split 94.2596ms 2.0987ms 476.4858 Ops/s 466.5850 Ops/s $\color{#35bf28}+2.12\%$
test_chunk 95.5196ms 2.0870ms 479.1585 Ops/s 464.3543 Ops/s $\color{#35bf28}+3.19\%$
test_creation[device0] 0.3832ms 0.1257ms 7.9585 KOps/s 7.9657 KOps/s $\color{#d91a1a}-0.09\%$
test_creation_from_tensor 0.3845ms 0.1295ms 7.7231 KOps/s 7.8323 KOps/s $\color{#d91a1a}-1.39\%$
test_add_one[memmap_tensor0] 0.2296ms 8.3903μs 119.1857 KOps/s 112.4843 KOps/s $\textbf{\color{#35bf28}+5.96\%}$
test_contiguous[memmap_tensor0] 27.1520μs 2.1551μs 464.0147 KOps/s 472.2026 KOps/s $\color{#d91a1a}-1.73\%$
test_stack[memmap_tensor0] 35.4820μs 6.3398μs 157.7343 KOps/s 149.6916 KOps/s $\textbf{\color{#35bf28}+5.37\%}$
test_memmaptd_index 1.0640ms 0.4054ms 2.4666 KOps/s 2.4391 KOps/s $\color{#35bf28}+1.13\%$
test_memmaptd_index_astensor 0.7244ms 0.4678ms 2.1376 KOps/s 2.1325 KOps/s $\color{#35bf28}+0.24\%$
test_memmaptd_index_op 1.3734ms 0.9859ms 1.0143 KOps/s 964.0175 Ops/s $\textbf{\color{#35bf28}+5.22\%}$
test_serialize_model 0.1304s 0.1291s 7.7466 Ops/s 7.7533 Ops/s $\color{#d91a1a}-0.09\%$
test_serialize_model_pickle 1.3472s 1.2133s 0.8242 Ops/s 0.8246 Ops/s $\color{#d91a1a}-0.06\%$
test_serialize_weights 0.2211s 0.1419s 7.0475 Ops/s 7.7745 Ops/s $\textbf{\color{#d91a1a}-9.35\%}$
test_serialize_weights_returnearly 0.2240s 55.1423ms 18.1349 Ops/s 17.8056 Ops/s $\color{#35bf28}+1.85\%$
test_serialize_weights_pickle 1.3721s 1.2167s 0.8219 Ops/s 0.8217 Ops/s $\color{#35bf28}+0.03\%$
test_reshape_pytree 62.4430μs 35.1483μs 28.4509 KOps/s 28.4220 KOps/s $\color{#35bf28}+0.10\%$
test_reshape_td 89.0840μs 40.9175μs 24.4394 KOps/s 22.2399 KOps/s $\textbf{\color{#35bf28}+9.89\%}$
test_view_pytree 69.0830μs 35.0468μs 28.5333 KOps/s 29.1144 KOps/s $\color{#d91a1a}-2.00\%$
test_view_td 0.1058ms 45.7586μs 21.8538 KOps/s 21.4738 KOps/s $\color{#35bf28}+1.77\%$
test_unbind_pytree 73.5730μs 35.5580μs 28.1231 KOps/s 29.8605 KOps/s $\textbf{\color{#d91a1a}-5.82\%}$
test_unbind_td 0.5700ms 42.7221μs 23.4071 KOps/s 23.7980 KOps/s $\color{#d91a1a}-1.64\%$
test_split_pytree 0.5112ms 47.0863μs 21.2376 KOps/s 21.7620 KOps/s $\color{#d91a1a}-2.41\%$
test_split_td 0.1724ms 55.0569μs 18.1630 KOps/s 18.0360 KOps/s $\color{#35bf28}+0.70\%$
test_add_pytree 0.1092ms 56.1021μs 17.8246 KOps/s 17.4220 KOps/s $\color{#35bf28}+2.31\%$
test_add_td 0.1467ms 88.1778μs 11.3407 KOps/s 10.6703 KOps/s $\textbf{\color{#35bf28}+6.28\%}$
test_compile_add_one_nested[tensordict-compile] 0.4048ms 0.2112ms 4.7341 KOps/s 4.7089 KOps/s $\color{#35bf28}+0.54\%$
test_compile_add_one_nested[tensordict-eager] 0.1960ms 0.1468ms 6.8121 KOps/s 6.6594 KOps/s $\color{#35bf28}+2.29\%$
test_compile_add_one_nested[pytree-compile] 0.1897ms 0.1422ms 7.0347 KOps/s 6.7978 KOps/s $\color{#35bf28}+3.49\%$
test_compile_add_one_nested[pytree-eager] 0.2358ms 0.1807ms 5.5326 KOps/s 5.4535 KOps/s $\color{#35bf28}+1.45\%$
test_compile_copy_nested[tensordict-compile] 46.6820μs 20.9039μs 47.8380 KOps/s 45.8902 KOps/s $\color{#35bf28}+4.24\%$
test_compile_copy_nested[tensordict-eager] 84.9340μs 43.7489μs 22.8577 KOps/s 23.2965 KOps/s $\color{#d91a1a}-1.88\%$
test_compile_copy_nested[pytree-compile] 0.2448ms 64.2526μs 15.5636 KOps/s 15.6046 KOps/s $\color{#d91a1a}-0.26\%$
test_compile_copy_nested[pytree-eager] 91.2340μs 49.6059μs 20.1589 KOps/s 20.4353 KOps/s $\color{#d91a1a}-1.35\%$
test_compile_add_one_flat[tensordict-compile] 0.3617ms 0.3122ms 3.2033 KOps/s 3.1965 KOps/s $\color{#35bf28}+0.21\%$
test_compile_add_one_flat[tensordict-eager] 0.2521ms 0.2051ms 4.8759 KOps/s 4.8483 KOps/s $\color{#35bf28}+0.57\%$
test_compile_add_one_flat[tensorclass-compile] 0.2054ms 0.1263ms 7.9154 KOps/s 7.8540 KOps/s $\color{#35bf28}+0.78\%$
test_compile_add_one_flat[tensorclass-eager] 0.1177ms 60.4867μs 16.5326 KOps/s 16.5403 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_add_one_flat[pytree-compile] 0.3619ms 0.3123ms 3.2019 KOps/s 3.2197 KOps/s $\color{#d91a1a}-0.55\%$
test_compile_add_one_flat[pytree-eager] 0.7147ms 0.6466ms 1.5465 KOps/s 1.5907 KOps/s $\color{#d91a1a}-2.78\%$
test_compile_add_self_flat[tensordict-eager] 0.3860ms 0.2437ms 4.1028 KOps/s 4.0373 KOps/s $\color{#35bf28}+1.62\%$
test_compile_add_self_flat[tensordict-compile] 0.3609ms 0.3141ms 3.1839 KOps/s 3.1741 KOps/s $\color{#35bf28}+0.31\%$
test_compile_add_self_flat[tensorclass-eager] 0.1146ms 69.7178μs 14.3435 KOps/s 13.7882 KOps/s $\color{#35bf28}+4.03\%$
test_compile_add_self_flat[tensorclass-compile] 0.1818ms 0.1269ms 7.8830 KOps/s 7.8362 KOps/s $\color{#35bf28}+0.60\%$
test_compile_add_self_flat[pytree-eager] 0.6217ms 0.5258ms 1.9018 KOps/s 1.7956 KOps/s $\textbf{\color{#35bf28}+5.92\%}$
test_compile_add_self_flat[pytree-compile] 0.3634ms 0.3121ms 3.2039 KOps/s 3.2078 KOps/s $\color{#d91a1a}-0.12\%$
test_compile_copy_flat[tensordict-compile] 56.8720μs 19.0439μs 52.5102 KOps/s 54.5021 KOps/s $\color{#d91a1a}-3.65\%$
test_compile_copy_flat[tensordict-eager] 52.8320μs 26.6159μs 37.5715 KOps/s 37.4417 KOps/s $\color{#35bf28}+0.35\%$
test_compile_copy_flat[pytree-compile] 0.1019ms 68.6650μs 14.5635 KOps/s 14.5067 KOps/s $\color{#35bf28}+0.39\%$
test_compile_copy_flat[pytree-eager] 77.9930μs 51.3113μs 19.4889 KOps/s 19.5873 KOps/s $\color{#d91a1a}-0.50\%$
test_compile_assign_and_add[tensordict-compile] 2.2667ms 0.7918ms 1.2629 KOps/s 1.1682 KOps/s $\textbf{\color{#35bf28}+8.11\%}$
test_compile_assign_and_add[tensordict-eager] 3.3796ms 3.1672ms 315.7361 Ops/s 320.1249 Ops/s $\color{#d91a1a}-1.37\%$
test_compile_assign_and_add[pytree-compile] 2.2543ms 0.7841ms 1.2753 KOps/s 1.1797 KOps/s $\textbf{\color{#35bf28}+8.10\%}$
test_compile_assign_and_add[pytree-eager] 3.5667ms 3.1797ms 314.4940 Ops/s 310.5238 Ops/s $\color{#35bf28}+1.28\%$
test_compile_indexing[tensor-tensordict-compile] 0.5096ms 0.1102ms 9.0781 KOps/s 8.9195 KOps/s $\color{#35bf28}+1.78\%$
test_compile_indexing[tensor-tensordict-eager] 0.2030ms 61.9236μs 16.1489 KOps/s 15.4617 KOps/s $\color{#35bf28}+4.44\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1357ms 0.1010ms 9.8976 KOps/s 9.5665 KOps/s $\color{#35bf28}+3.46\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1548ms 41.6747μs 23.9953 KOps/s 21.8372 KOps/s $\textbf{\color{#35bf28}+9.88\%}$
test_compile_indexing[tensor-pytree-compile] 0.1890ms 0.1015ms 9.8569 KOps/s 9.5350 KOps/s $\color{#35bf28}+3.38\%$
test_compile_indexing[tensor-pytree-eager] 86.0940μs 41.7978μs 23.9247 KOps/s 23.4715 KOps/s< 8000 /td> $\color{#35bf28}+1.93\%$
test_compile_indexing[slice-tensordict-compile] 0.1959ms 0.1350ms 7.4088 KOps/s 7.2880 KOps/s $\color{#35bf28}+1.66\%$
test_compile_indexing[slice-tensordict-eager] 0.1585ms 24.0709μs 41.5440 KOps/s 39.6545 KOps/s $\color{#35bf28}+4.76\%$
test_compile_indexing[slice-tensorclass-compile] 0.1715ms 0.1286ms 7.7773 KOps/s 7.6830 KOps/s $\color{#35bf28}+1.23\%$
test_compile_indexing[slice-tensorclass-eager] 50.9020μs 20.3588μs 49.1189 KOps/s 49.0558 KOps/s $\color{#35bf28}+0.13\%$
test_compile_indexing[slice-pytree-compile] 0.1722ms 0.1299ms 7.6985 KOps/s 7.5526 KOps/s $\color{#35bf28}+1.93\%$
test_compile_indexing[slice-pytree-eager] 55.7220μs 19.9227μs 50.1939 KOps/s 48.8756 KOps/s $\color{#35bf28}+2.70\%$
test_compile_indexing[int-tensordict-compile] 0.1722ms 0.1363ms 7.3364 KOps/s 7.3890 KOps/s $\color{#d91a1a}-0.71\%$
test_compile_indexing[int-tensordict-eager] 0.3956ms 23.9601μs 41.7361 KOps/s 40.5168 KOps/s $\color{#35bf28}+3.01\%$
test_compile_indexing[int-tensorclass-compile] 0.1889ms 0.1296ms 7.7164 KOps/s 7.6940 KOps/s $\color{#35bf28}+0.29\%$
test_compile_indexing[int-tensorclass-eager] 62.4030μs 20.7503μs 48.1920 KOps/s 48.9097 KOps/s $\color{#d91a1a}-1.47\%$
test_compile_indexing[int-pytree-compile] 0.1824ms 0.1294ms 7.7284 KOps/s 7.6497 KOps/s $\color{#35bf28}+1.03\%$
test_compile_indexing[int-pytree-eager] 79.8430μs 20.1596μs 49.6043 KOps/s 48.9998 KOps/s $\color{#35bf28}+1.23\%$
test_mod_add[eager] 68.7330μs 30.7174μs 32.5548 KOps/s 30.3950 KOps/s $\textbf{\color{#35bf28}+7.11\%}$
test_mod_add[compile] 0.2789ms 68.2608μs 14.6497 KOps/s 14.1800 KOps/s $\color{#35bf28}+3.31\%$
test_mod_add[compile-overhead] 0.2627ms 0.1334ms 7.4961 KOps/s 7.1794 KOps/s $\color{#35bf28}+4.41\%$
test_mod_wrap[eager] 0.3407ms 0.2379ms 4.2027 KOps/s 4.1237 KOps/s $\color{#35bf28}+1.91\%$
test_mod_wrap[compile] 1.4723ms 0.3012ms 3.3203 KOps/s 3.3956 KOps/s $\color{#d91a1a}-2.22\%$
test_mod_wrap[compile-overhead] 7.6801ms 4.0724ms 245.5580 Ops/s 246.4649 Ops/s $\color{#d91a1a}-0.37\%$
test_mod_wrap_and_backward[eager] 1.4856ms 1.3666ms 731.7402 Ops/s 694.3453 Ops/s $\textbf{\color{#35bf28}+5.39\%}$
test_mod_wrap_and_backward[compile] 1.5397ms 1.3027ms 767.6654 Ops/s 753.8951 Ops/s $\color{#35bf28}+1.83\%$
test_mod_wrap_and_backward[compile-overhead] 1.3124ms 0.8877ms 1.1265 KOps/s 1.1083 KOps/s $\color{#35bf28}+1.64\%$
test_seq_add[eager] 0.2122ms 98.6858μs 10.1332 KOps/s 9.8765 KOps/s $\color{#35bf28}+2.60\%$
test_seq_add[compile] 0.2366ms 78.0452μs 12.8131 KOps/s 12.4134 KOps/s $\color{#35bf28}+3.22\%$
test_seq_add[compile-overhead] 0.1483ms 0.1119ms 8.9337 KOps/s 8.8325 KOps/s $\color{#35bf28}+1.15\%$
test_seq_wrap[eager] 0.4603ms 0.3773ms 2.6504 KOps/s 2.5338 KOps/s $\color{#35bf28}+4.60\%$
test_seq_wrap[compile] 0.3589ms 0.3065ms 3.2622 KOps/s 3.1611 KOps/s $\color{#35bf28}+3.20\%$
test_seq_wrap[compile-overhead] 0.2658ms 0.2162ms 4.6260 KOps/s 4.6006 KOps/s $\color{#35bf28}+0.55\%$
test_func_call_runtime[False-eager] 0.8199ms 0.7309ms 1.3681 KOps/s 1.3345 KOps/s $\color{#35bf28}+2.52\%$
test_func_call_runtime[False-compile] 0.9599ms 0.7655ms 1.3064 KOps/s 1.2770 KOps/s $\color{#35bf28}+2.30\%$
test_func_call_runtime[False-compile-overhead] 0.3954ms 0.3519ms 2.8418 KOps/s 2.8305 KOps/s $\color{#35bf28}+0.40\%$
test_func_call_runtime[True-eager] 1.0046ms 0.8999ms 1.1112 KOps/s 1.1038 KOps/s $\color{#35bf28}+0.68\%$
test_func_call_runtime[True-compile] 0.8602ms 0.8058ms 1.2410 KOps/s 1.2146 KOps/s $\color{#35bf28}+2.17\%$
test_func_call_runtime[True-compile-overhead] 0.4438ms 0.3871ms 2.5836 KOps/s 2.5866 KOps/s $\color{#d91a1a}-0.12\%$
test_func_call_cm_runtime[False-eager] 0.7799ms 0.7244ms 1.3804 KOps/s 1.3501 KOps/s $\color{#35bf28}+2.25\%$
test_func_call_cm_runtime[False-compile] 0.8915ms 0.8074ms 1.2386 KOps/s 1.2704 KOps/s $\color{#d91a1a}-2.51\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4071ms 0.3549ms 2.8180 KOps/s 2.8163 KOps/s $\color{#35bf28}+0.06\%$
test_func_call_cm_runtime[True-eager] 1.0978ms 0.9889ms 1.0113 KOps/s 997.3178 Ops/s $\color{#35bf28}+1.40\%$
test_func_call_cm_runtime[True-compile] 0.8734ms 0.8284ms 1.2072 KOps/s 1.1835 KOps/s $\color{#35bf28}+2.00\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4720ms 0.4122ms 2.4262 KOps/s 2.4067 KOps/s $\color{#35bf28}+0.81\%$
test_vmap_func_call_cm_runtime[eager] 2.5500ms 2.0580ms 485.9150 Ops/s 480.2642 Ops/s $\color{#35bf28}+1.18\%$
test_vmap_func_call_cm_runtime[compile] 0.9061ms 0.8423ms 1.1872 KOps/s 1.1577 KOps/s $\color{#35bf28}+2.55\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4985ms 0.4177ms 2.3941 KOps/s 2.3680 KOps/s $\color{#35bf28}+1.10\%$
test_distributed 2.3670ms 0.2413ms 4.1441 KOps/s 8.7982 KOps/s $\textbf{\color{#d91a1a}-52.90\%}$
test_tdmodule 0.1082ms 15.0316μs 66.5266 KOps/s 62.3680 KOps/s $\textbf{\color{#35bf28}+6.67\%}$
test_tdmodule_dispatch 67.8940μs 29.4517μs 33.9539 KOps/s 31.9363 KOps/s $\textbf{\color{#35bf28}+6.32\%}$
test_tdseq 35.0320μs 15.9283μs 62.7814 KOps/s 57.8128 KOps/s $\textbf{\color{#35bf28}+8.59\%}$
test_tdseq_dispatch 53.7730μs 32.3873μs 30.8763 KOps/s 28.2556 KOps/s $\textbf{\color{#35bf28}+9.28\%}$
test_instantiation_functorch 1.9684ms 1.8283ms 546.9537 Ops/s 530.1458 Ops/s $\color{#35bf28}+3.17\%$
test_instantiation_td 1.7719ms 1.1760ms 850.3347 Ops/s 825.9726 Ops/s $\color{#35bf28}+2.95\%$
test_exec_functorch 0.2513ms 0.2083ms 4.8013 KOps/s 4.7581 KOps/s $\color{#35bf28}+0.91\%$
test_exec_functional_call 0.2495ms 0.2058ms 4.8597 KOps/s 4.7835 KOps/s $\color{#35bf28}+1.59\%$
test_exec_td 0.2556ms 0.2128ms 4.7000 KOps/s 4.5010 KOps/s $\color{#35bf28}+4.42\%$
test_exec_td_decorator 0.7038ms 0.2531ms 3.9502 KOps/s 3.8431 KOps/s $\color{#35bf28}+2.79\%$
test_vmap_mlp_speed[True-True] 0.8006ms 0.6876ms 1.4544 KOps/s 1.4425 KOps/s $\color{#35bf28}+0.82\%$
test_vmap_mlp_speed[True-False] 0.7367ms 0.6894ms 1.4506 KOps/s 1.4477 KOps/s $\color{#35bf28}+0.20\%$
test_vmap_mlp_speed[False-True] 0.6545ms 0.5788ms 1.7276 KOps/s 1.7386 KOps/s $\color{#d91a1a}-0.63\%$
test_vmap_mlp_speed[False-False] 0.7361ms 0.5795ms 1.7256 KOps/s 1.7384 KOps/s $\color{#d91a1a}-0.74\%$
test_vmap_mlp_speed_decorator[True-True] 0.7638ms 0.6723ms 1.4874 KOps/s 1.4814 KOps/s $\color{#35bf28}+0.41\%$
test_vmap_mlp_speed_decorator[True-False] 0.7722ms 0.6751ms 1.4812 KOps/s 1.4744 KOps/s $\color{#35bf28}+0.46\%$
test_vmap_mlp_speed_decorator[False-True] 0.7161ms 0.5922ms 1.6886 KOps/s 1.6530 KOps/s $\color{#35bf28}+2.15\%$
test_vmap_mlp_speed_decorator[False-False] 0.7187ms 0.5946ms 1.6817 KOps/s 1.6431 KOps/s $\color{#35bf28}+2.35\%$
test_vmap_transformer_speed[True-True] 8.4434ms 8.3282ms 120.0743 Ops/s 118.6959 Ops/s $\color{#35bf28}+1.16\%$
test_vmap_transformer_speed[True-False] 8.3584ms 8.3015ms 120.4595 Ops/s 118.6021 Ops/s $\color{#35bf28}+1.57\%$
test_vmap_transformer_speed[False-True] 8.1568ms 8.0930ms 123.5630 Ops/s 121.5702 Ops/s $\color{#35bf28}+1.64\%$
test_vmap_transformer_speed[False-False] 8.1820ms 8.1239ms 123.0941 Ops/s 121.5268 Ops/s $\color{#35bf28}+1.29\%$
test_vmap_transformer_speed_decorator[True-True] 19.7252ms 19.5897ms 51.0473 Ops/s 50.9097 Ops/s $\color{#35bf28}+0.27\%$
test_vmap_transformer_speed_decorator[True-False] 19.6708ms 19.5849ms 51.0597 Ops/s 50.9107 Ops/s $\color{#35bf28}+0.29\%$
test_vmap_transformer_speed_decorator[False-True] 19.5890ms 19.4433ms 51.4315 Ops/s 51.3294 Ops/s $\color{#35bf28}+0.20\%$
test_vmap_transformer_speed_decorator[False-False] 19.5456ms 19.4262ms 51.4768 Ops/s 51.3051 Ops/s $\color{#35bf28}+0.33\%$
test_to_module_speed[True] 1.3937ms 0.9428ms 1.0607 KOps/s 1.0490 KOps/s $\color{#35bf28}+1.12\%$
test_to_module_speed[False] 1.3004ms 0.9158ms 1.0919 KOps/s 1.0756 KOps/s $\color{#35bf28}+1.52\%$
test_tc_init 69.7830μs 34.7074μs 28.8123 KOps/s 27.4887 KOps/s $\color{#35bf28}+4.81\%$
test_tc_init_nested 0.1005ms 69.8600μs 14.3143 KOps/s 13.6601 KOps/s $\color{#35bf28}+4.79\%$
test_tc_first_layer_tensor 7.6176μs 0.6750μs 1.4816 MOps/s 1.4903 MOps/s $\color{#d91a1a}-0.58\%$
test_tc_first_layer_nontensor 22.4810μs 2.2286μs 448.7070 KOps/s 442.0895 KOps/s $\color{#35bf28}+1.50\%$
test_tc_second_layer_tensor 7.3253μs 1.3860μs 721.4995 KOps/s 740.3153 KOps/s $\color{#d91a1a}-2.54\%$
test_tc_second_layer_nontensor 69.5730μs 2.9590μs 337.9552 KOps/s 337.3781 KOps/s $\color{#35bf28}+0.17\%$
test_unbind 0.1966s 12.0783ms 82.7932 Ops/s 92.0561 Ops/s $\textbf{\color{#d91a1a}-10.06\%}$
test_full_like 0.6569ms 0.5740ms 1.7421 KOps/s 1.7456 KOps/s $\color{#d91a1a}-0.20\%$
test_zeros_like 0.2703ms 0.1980ms 5.0512 KOps/s 5.0540 KOps/s $\color{#d91a1a}-0.06\%$
test_ones_like 0.2542ms 0.1978ms 5.0563 KOps/s 5.0597 KOps/s $\color{#d91a1a}-0.07\%$
test_clone 0.4501ms 0.4146ms 2.4117 KOps/s 2.4129 KOps/s $\color{#d91a1a}-0.05\%$
test_squeeze 41.1620μs 9.8640μs 101.3791 KOps/s 102.1961 KOps/s $\color{#d91a1a}-0.80\%$
test_unsqueeze 0.2296ms 75.7218μs 13.2062 KOps/s 13.3367 KOps/s $\color{#d91a1a}-0.98\%$
test_split 0.3901ms 0.1560ms 6.4085 KOps/s 6.2776 KOps/s $\color{#35bf28}+2.09\%$
test_permute 0.2651ms 0.1806ms 5.5356 KOps/s 5.5809 KOps/s $\color{#d91a1a}-0.81\%$
test_stack 1.2495ms 0.8640ms 1.1574 KOps/s 1.1428 KOps/s $\color{#35bf28}+1.27\%$
test_cat 1.2545ms 1.2320ms 811.6925 Ops/s 812.1278 Ops/s $\color{#d91a1a}-0.05\%$

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Sep 23, 2024
ghstack-source-id: ff2a6ad
Pull Request resolved: #1005
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Sep 23, 2024
ghstack-source-id: c028349
Pull Request resolved: #1005
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Sep 23, 2024
ghstack-source-id: 24bdc0a
Pull Request resolved: #1005
@vmoens vmoens merged commit d441c64 into gh/vmoens/19/base Sep 23, 2024
5 of 13 checks passed
@vmoens vmoens deleted the gh/vmoens/19/head branch September 23, 2024 14:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants
0