[Doc] Fail-on-warning in sphinx #1005
Merged
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 37.7310μs | 19.3630μs | 51.6448 KOps/s | 48.4175 KOps/s | |
test_plain_set_stack_nested | 50.9750μs | 19.1540μs | 52.2083 KOps/s | 48.7872 KOps/s | |
test_plain_set_nested_inplace | 54.1310μs | 20.5210μs | 48.7306 KOps/s | 45.6790 KOps/s | |
test_plain_set_stack_nested_inplace | 54.4220μs | 20.6403μs | 48.4490 KOps/s | 45.7726 KOps/s | |
test_items | 30.9070μs | 4.2539μs | 235.0794 KOps/s | 233.0836 KOps/s | |
test_items_nested | 0.4316ms | 0.3623ms | 2.7603 KOps/s | 2.7853 KOps/s | |
test_items_nested_locked | 0.6078ms | 0.3641ms | 2.7462 KOps/s | 2.7876 KOps/s | |
test_items_nested_leaf | 0.1537ms | 68.4753μs | 14.6038 KOps/s | 14.6707 KOps/s | |
test_items_stack_nested | 0.7326ms | 0.3660ms | 2.7322 KOps/s | 2.6349 KOps/s | |
test_items_stack_nested_leaf | 0.1523ms | 72.4056μs | 13.8111 KOps/s | 14.3124 KOps/s | |
test_items_stack_nested_locked | 0.6064ms | 0.3633ms | 2.7523 KOps/s | 2.7253 KOps/s | |
test_keys | 33.6830μs | 3.5658μs | 280.4458 KOps/s | 286.1951 KOps/s | |
test_keys_nested | 0.1904ms | 0.1009ms | 9.9066 KOps/s | 10.1400 KOps/s | |
test_keys_nested_locked | 0.7275ms | 0.1067ms | 9.3692 KOps/s | 9.5528 KOps/s | |
test_keys_nested_leaf | 0.1823ms | 82.3300μs | 12.1462 KOps/s | 12.2092 KOps/s | |
test_keys_stack_nested | 0.1776ms | 99.2001μs | 10.0806 KOps/s | 10.0050 KOps/s | |
test_keys_stack_nested_leaf | 0.1501ms | 81.6818μs | 12.2426 KOps/s | 12.1014 KOps/s | |
test_keys_stack_nested_locked | 0.1880ms | 0.1047ms | 9.5502 KOps/s | 9.6298 KOps/s | |
test_values | 6.7708μs | 1.0449μs | 957.0111 KOps/s | 944.1808 KOps/s | |
test_values_nested | 0.1373ms | 75.0725μs | 13.3205 KOps/s | 13.1872 KOps/s | |
test_values_nested_locked | 0.1610ms | 75.1237μs | 13.3114 KOps/s | 13.6825 KOps/s | |
test_values_nested_leaf | 0.1150ms | 61.7292μs | 16.1998 KOps/s | 16.1351 KOps/s | |
test_values_stack_nested | 0.1260ms | 75.4684μs | 13.2506 KOps/s | 13.5821 KOps/s | |
test_values_stack_nested_leaf | 0.1035ms | 58.7517μs | 17.0208 KOps/s | 16.2745 KOps/s | |
test_values_stack_nested_locked | 0.1299ms | 75.2101μs | 13.2961 KOps/s | 13.4422 KOps/s | |
test_membership | 5.2470μs | 0.7423μs | 1.3472 MOps/s | 1.1625 MOps/s | |
test_membership_nested | 35.1650μs | 2.7149μs | 368.3362 KOps/s | 363.9042 KOps/s | |
test_membership_nested_leaf | 24.7460μs | 2.7466μs | 364.0894 KOps/s | 364.8183 KOps/s | |
test_membership_stacked_nested | 39.5040μs | 2.7499μs | 363.6433 KOps/s | 372.9534 KOps/s | |
test_membership_stacked_nested_leaf | 21.2190μs | 2.7300μs | 366.3023 KOps/s | 347.3403 KOps/s | |
test_membership_nested_last | 33.0620μs | 3.9386μs | 253.8965 KOps/s | 257.8892 KOps/s | |
test_membership_nested_leaf_last | 31.1490μs | 3.9470μs | 253.3557 KOps/s | 253.1708 KOps/s | |
test_membership_stacked_nested_last | 49.6020μs | 12.8654μs | 77.7279 KOps/s | 254.8234 KOps/s | |
test_membership_stacked_nested_leaf_last | 44.5440μs | 12.7309μs | 78.5493 KOps/s | 253.3769 KOps/s | |
test_nested_getleaf | 37.0290μs | 10.6496μs | 93.9004 KOps/s | 92.6238 KOps/s | |
test_nested_get | 0.1333ms | 10.3180μs | 96.9182 KOps/s | 96.0919 KOps/s | |
test_stacked_getleaf | 43.5610μs | 10.5924μs | 94.4071 KOps/s | 92.6520 KOps/s | |
test_stacked_get | 35.7360μs | 10.0847μs | 99.1604 KOps/s | 98.6996 KOps/s | |
test_nested_getitemleaf | 36.9790μs | 10.9757μs | 91.1101 KOps/s | 90.2466 KOps/s | |
test_nested_getitem | 37.4000μs | 10.2049μs | 97.9919 KOps/s | 96.3143 KOps/s | |
test_stacked_getitemleaf | 37.3600μs | 11.0239μs | 90.7118 KOps/s | 89.8230 KOps/s | |
test_stacked_getitem | 0.3382ms | 10.2587μs | 97.4780 KOps/s | 95.6537 KOps/s | |
test_lock_nested | 0.1014s | 0.6001ms | 1.6665 KOps/s | 2.0772 KOps/s | |
test_lock_stack_nested | 0.6657ms | 0.4406ms | 2.2698 KOps/s | 2.1892 KOps/s | |
test_unlock_nested | 0.1015s | 0.5135ms | 1.9475 KOps/s | 2.5171 KOps/s | |
test_unlock_stack_nested | 0.5178ms | 0.3570ms | 2.8008 KOps/s | 2.6429 KOps/s | |
test_flatten_speed | 0.1788ms | 86.7382μs | 11.5289 KOps/s | 11.4524 KOps/s | |
test_unflatten_speed | 0.6861ms | 0.4627ms | 2.1612 KOps/s | 2.1598 KOps/s | |
test_common_ops | 4.8339ms | 1.0874ms | 919.6670 Ops/s | 903.6947 Ops/s | |
test_creation | 17.6330μs | 2.0747μs | 481.9949 KOps/s | 488.1987 KOps/s | |
test_creation_empty | 44.2120μs | 16.0965μs | 62.1253 KOps/s | 56.6412 KOps/s | |
test_creation_nested_1 | 97.7040μs | 19.1350μs | 52.2603 KOps/s | 46.3759 KOps/s | |
test_creation_nested_2 | 58.6900μs | 23.3262μs | 42.8702 KOps/s | 40.4684 KOps/s | |
test_clone | 0.2345ms | 17.1316μs | 58.3717 KOps/s | 57.0693 KOps/s | |
test_getitem[int] | 0.8773ms | 16.3155μs | 61.2914 KOps/s | 58.9672 KOps/s | |
test_getitem[slice_int] | 0.1582ms | 30.5061μs | 32.7803 KOps/s | 32.9767 KOps/s | |
test_getitem[range] | 0.2081ms | 57.0323μs | 17.5339 KOps/s | 17.6146 KOps/s | |
test_getitem[tuple] | 0.1733ms | 24.6451μs | 40.5760 KOps/s | 41.0634 KOps/s | |
test_getitem[list] | 0.1864ms | 52.9302μs | 18.8928 KOps/s | 19.2595 KOps/s | |
test_setitem_dim[int] | 62.9180μs | 31.7876μs | 31.4588 KOps/s | 31.7513 KOps/s | |
test_setitem_dim[slice_int] | 0.1020ms | 59.7202μs | 16.7447 KOps/s | 16.9998 KOps/s | |
test_setitem_dim[range] | 0.1430ms | 84.3103μs | 11.8609 KOps/s | 12.1801 KOps/s | |
test_setitem_dim[tuple] | 80.4800μs | 47.5048μs | 21.0505 KOps/s | 21.3117 KOps/s | |
test_setitem | 0.2825ms | 28.7866μs | 34.7383 KOps/s | 33.4507 KOps/s | |
test_set | 77.4250μs | 27.5804μs | 36.2577 KOps/s | 33.7459 KOps/s | |
test_set_shared | 3.9177ms | 0.2169ms | 4.6095 KOps/s | 4.6772 KOps/s | |
test_update | 0.2445ms | 33.6198μs | 29.7444 KOps/s | 27.5149 KOps/s | |
test_update_nested | 0.2416ms | 44.7072μs | 22.3678 KOps/s | 21.9548 KOps/s | |
test_update__nested | 0.2139ms | 34.9868μs | 28.5822 KOps/s | 28.8481 KOps/s | |
test_set_nested | 0.2516ms | 29.9622μs | 33.3754 KOps/s | 31.3945 KOps/s | |
test_set_nested_new | 0.1259ms | 35.1703μs | 28.4331 KOps/s | 27.0317 KOps/s | |
test_select | 0.2836ms | 52.2803μs | 19.1277 KOps/s | 18.5519 KOps/s | |
test_select_nested | 0.1675ms | 58.8669μs | 16.9875 KOps/s | 17.1024 KOps/s | |
test_exclude_nested | 0.1502ms | 74.8214μs | 13.3652 KOps/s | 13.4594 KOps/s | |
test_empty[True] | 0.5377ms | 0.3185ms | 3.1401 KOps/s | 3.1593 KOps/s | |
test_empty[False] | 7.7595μs | 1.2218μs | 818.4381 KOps/s | 854.3075 KOps/s | |
test_unbind_speed | 0.5157ms | 0.3026ms | 3.3050 KOps/s | 3.3150 KOps/s | |
test_unbind_speed_stack0 | 0.4780ms | 0.2860ms | 3.4959 KOps/s | 3.4360 KOps/s | |
test_unbind_speed_stack1 | 0.1060s | 0.8047ms | 1.2428 KOps/s | 1.3415 KOps/s | |
test_split | 2.1394ms | 1.9816ms | 504.6516 Ops/s | 457.6632 Ops/s | |
test_chunk | 98.3471ms | 2.1836ms | 457.9577 Ops/s | 453.0205 Ops/s | |
test_creation[device0] | 0.2503ms | 0.1174ms | 8.5173 KOps/s | 8.5654 KOps/s | |
test_creation_from_tensor | 3.8820ms | 0.1194ms | 8.3759 KOps/s | 8.5746 KOps/s | |
test_add_one[memmap_tensor0] | 0.3702ms | 7.0277μs | 142.2944 KOps/s | 131.6445 KOps/s | |
test_contiguous[memmap_tensor0] | 26.5200μs | 1.9419μs | 514.9484 KOps/s | 522.2721 KOps/s | |
test_stack[memmap_tensor0] | 70.1010μs | 5.5819μs | 179.1508 KOps/s | 169.3074 KOps/s | |
test_memmaptd_index | 1.0912ms | 0.4046ms | 2.4714 KOps/s | 2.4777 KOps/s | |
test_memmaptd_index_astensor | 0.9730ms | 0.4835ms | 2.0681 KOps/s | 2.0870 KOps/s | |
test_memmaptd_index_op | 1.4923ms | 0.9676ms | 1.0335 KOps/s | 980.7462 Ops/s | |
test_serialize_model | 0.2477s | 0.1393s | 7.1772 Ops/s | 8.2543 Ops/s | |
test_serialize_model_pickle | 0.4856s | 0.4019s | 2.4882 Ops/s | 2.4473 Ops/s | |
test_serialize_weights | 0.1229s | 0.1172s | 8.5356 Ops/s | 7.3293 Ops/s | |
test_serialize_weights_returnearly | 0.1923s | 0.1631s | 6.1315 Ops/s | 6.3508 Ops/s | |
test_serialize_weights_pickle | 1.1876s | 0.7122s | 1.4041 Ops/s | 1.0854 Ops/s | |
test_serialize_weights_filesystem | 0.1494s | 0.1409s | 7.0950 Ops/s | 6.9836 Ops/s | |
test_serialize_model_filesystem | 0.1523s | 0.1453s | 6.8804 Ops/s | 6.2166 Ops/s | |
test_reshape_pytree | 76.7030μs | 38.7988μs | 25.7740 KOps/s | 25.4089 KOps/s | |
test_reshape_td | 96.7310μs | 45.7347μs | 21.8652 KOps/s | 21.3068 KOps/s | |
test_view_pytree | 0.1486ms | 38.7465μs | 25.8088 KOps/s | 25.5586 KOps/s | |
test_view_td | 0.1118ms | 53.0739μs | 18.8417 KOps/s | 19.3765 KOps/s | |
test_unbind_pytree | 91.8220μs | 35.9267μs | 27.8344 KOps/s | 27.7639 KOps/s | |
test_unbind_td | 0.3040ms | 44.6485μs | 22.3972 KOps/s | 22.3757 KOps/s | |
test_split_pytree | 80.9920μs | 37.8762μs | 26.4018 KOps/s | 25.9639 KOps/s | |
test_split_td | 0.4907ms | 58.7591μs | 17.0186 KOps/s | 17.6150 KOps/s | |
test_add_pytree | 0.1013ms | 44.4922μs | 22.4758 KOps/s | 22.0370 KOps/s | |
test_add_td | 0.1773ms | 77.2549μs | 12.9442 KOps/s | 12.0616 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1322ms | 59.7945μs | 16.7239 KOps/s | 17.0434 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3315ms | 0.1796ms | 5.5684 KOps/s | 5.5963 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1400ms | 57.4290μs | 17.4128 KOps/s | 17.4651 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3304ms | 0.1412ms | 7.0828 KOps/s | 7.1936 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 90.6690μs | 20.8230μs | 48.0237 KOps/s | 44.9735 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1539ms | 67.4890μs | 14.8172 KOps/s | 15.0035 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1435ms | 76.7952μs | 13.0216 KOps/s | 13.1918 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1547ms | 68.5027μs | 14.5980 KOps/s | 14.6532 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2808ms | 0.1762ms | 5.6740 KOps/s | 5.7538 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3486ms | 0.1916ms | 5.2194 KOps/s | 5.3268 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1285ms | 47.2576μs | 21.1606 KOps/s | 21.7385 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1617ms | 69.4993μs | 14.3886 KOps/s | 14.5557 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3257ms | 0.1757ms | 5.6902 KOps/s | 5.7296 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5968ms | 0.2852ms | 3.5065 KOps/s | 3.4514 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3620ms | 0.2033ms | 4.9179 KOps/s | 4.9776 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3278ms | 0.1741ms | 5.7444 KOps/s | 5.6457 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1284ms | 62.0863μs | 16.1066 KOps/s | 16.0683 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1195ms | 47.1289μs | 21.2184 KOps/s | 21.1926 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4252ms | 0.2314ms | 4.3206 KOps/s | 4.2564 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2948ms | 0.1737ms | 5.7554 KOps/s | 5.6994 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1977ms | 0.1027ms | 9.7416 KOps/s | 9.3818 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1334ms | 56.5986μs | 17.6683 KOps/s | 17.5862 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1459ms | 76.7830μs | 13.0237 KOps/s | 12.9335 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1527ms | 69.2007μs | 14.4507 KOps/s | 14.5637 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2883ms | 0.1939ms | 5.1586 KOps/s | 5.1336 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.1136ms | 1.6262ms | 614.9124 Ops/s | 601.6112 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.4318ms | 0.1971ms | 5.0726 KOps/s | 5.2650 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3704ms | 1.0888ms | 918.4302 Ops/s | 899.1028 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.5576ms | 0.4195ms | 2.3837 KOps/s | 2.3960 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.3941ms | 3.5186ms | 284.2026 Ops/s | 268.5328 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1489ms | 35.9120μs | 27.8458 KOps/s | 29.9748 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 1.4074ms | 48.8517μs | 20.4701 KOps/s | 20.6879 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 89.9580μs | 30.2539μs | 33.0536 KOps/s | 33.8677 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1126ms | 28.7001μs | 34.8430 KOps/s | 34.2226 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1028ms | 30.4622μs | 32.8276 KOps/s | 34.1645 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1166ms | 28.3481μs | 35.2757 KOps/s | 34.2554 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1639ms | 74.6215μs | 13.4010 KOps/s | 13.6225 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.6299ms | 27.5536μs | 36.2929 KOps/s | 36.0863 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1538ms | 69.2081μs | 14.4492 KOps/s | 14.9175 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 97.5110μs | 23.3445μs | 42.8367 KOps/s | 42.8353 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1581ms | 68.2295μs | 14.6564 KOps/s | 14.9245 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 80.9310μs | 23.6028μs | 42.3679 KOps/s | 42.3906 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1465ms | 73.8141μs | 13.5476 KOps/s | 13.7200 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.2763ms | 27.3466μs | 36.5676 KOps/s | 37.2734 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1555ms | 68.7383μs | 14.5479 KOps/s | 14.7869 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1021ms | 23.2200μs | 43.0663 KOps/s | 42.7873 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1507ms | 68.1849μs | 14.6660 KOps/s | 14.9284 KOps/s | |
test_compile_indexing[int-pytree-eager] | 79.0480μs | 23.1102μs | 43.2710 KOps/s | 43.2992 KOps/s | |
test_mod_add[eager] | 97.9320μs | 24.6981μs | 40.4889 KOps/s | 38.8483 KOps/s | |
test_mod_add[compile] | 0.1078ms | 41.1395μs | 24.3075 KOps/s | 26.0885 KOps/s | |
test_mod_add[compile-overhead] | 0.1300ms | 41.0407μs | 24.3661 KOps/s | 25.8408 KOps/s | |
test_mod_wrap[eager] | 0.3623ms | 0.2129ms | 4.6965 KOps/s | 4.9012 KOps/s | |
test_mod_wrap[compile] | 0.4058ms | 0.2407ms | 4.1547 KOps/s | 4.3022 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3569ms | 0.2384ms | 4.1947 KOps/s | 4.3695 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.8955ms | 11.2763ms | 88.6815 Ops/s | 79.9017 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.0703ms | 11.7139ms | 85.3687 Ops/s | 83.0481 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 19.3649ms | 11.9230ms | 83.8716 Ops/s | 71.7991 Ops/s | |
test_seq_add[eager] | 0.2093ms | 92.9276μs | 10.7611 KOps/s | 11.1616 KOps/s | |
test_seq_add[compile] | 0.1535ms | 66.9232μs | 14.9425 KOps/s | 15.2441 KOps/s | |
test_seq_add[compile-overhead] | 0.1792ms | 65.1159μs | 15.3572 KOps/s | 15.4493 KOps/s | |
test_seq_wrap[eager] | 0.6427ms | 0.3837ms | 2.6065 KOps/s | 2.5735 KOps/s | |
test_seq_wrap[compile] | 1.2851ms | 0.2763ms | 3.6196 KOps/s | 3.6585 KOps/s | |
test_seq_wrap[compile-overhead] | 1.4295ms | 0.2804ms | 3.5666 KOps/s | 3.6687 KOps/s | |
test_func_call_runtime[False-eager] | 0.9793ms | 0.5394ms | 1.8541 KOps/s | 1.9347 KOps/s | |
test_func_call_runtime[False-compile] | 0.6623ms | 0.5064ms | 1.9748 KOps/s | 1.9746 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.6394ms | 0.5035ms | 1.9861 KOps/s | 1.9745 KOps/s | |
test_func_call_runtime[True-eager] | 1.2309ms | 0.7577ms | 1.3198 KOps/s | 1.3391 KOps/s | |
test_func_call_runtime[True-compile] | 0.9528ms | 0.5189ms | 1.9271 KOps/s | 1.9215 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8645ms | 0.5224ms | 1.9141 KOps/s | 1.9416 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9148ms | 0.5411ms | 1.8481 KOps/s | 1.9372 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6499ms | 0.5095ms | 1.9627 KOps/s | 1.9935 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6903ms | 0.5103ms | 1.9597 KOps/s | 1.9946 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1461ms | 0.8861ms | 1.1285 KOps/s | 1.1357 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.9260ms | 0.7552ms | 1.3242 KOps/s | 1.3399 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1976ms | 0.7585ms | 1.3185 KOps/s | 1.3343 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6162ms | 1.9079ms | 524.1446 Ops/s | 520.9132 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 3.0361ms | 1.9644ms | 509.0733 Ops/s | 515.2612 Ops/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 6.9322ms | 1.9748ms | 506.3846 Ops/s | 514.4167 Ops/s | |
test_distributed | 0.2503ms | 0.1249ms | 8.0084 KOps/s | 7.7680 KOps/s | |
test_tdmodule | 46.2960μs | 17.0260μs | 58.7336 KOps/s | 55.2037 KOps/s | |
test_tdmodule_dispatch | 70.6020μs | 33.9323μs | 29.4704 KOps/s | 27.5465 KOps/s | |
test_tdseq | 42.5590μs | 19.9462μs | 50.1349 KOps/s | 47.3193 KOps/s | |
test_tdseq_dispatch | 80.1400μs | 38.7562μs | 25.8023 KOps/s | 23.4444 KOps/s | |
test_instantiation_functorch | 1.7063ms | 1.5855ms | 630.7291 Ops/s | 637.0953 Ops/s | |
test_instantiation_td | 2.6686ms | 1.1936ms | 837.8037 Ops/s | 848.8903 Ops/s | |
test_exec_functorch | 0.4517ms | 0.1905ms | 5.2480 KOps/s | 5.5065 KOps/s | |
test_exec_functional_call | 0.3301ms | 0.1782ms | 5.6127 KOps/s | 5.8775 KOps/s | |
test_exec_td | 0.4114ms | 0.1782ms | 5.6102 KOps/s | 6.0076 KOps/s | |
test_exec_td_decorator | 0.3793ms | 0.2277ms | 4.3917 KOps/s | 4.5136 KOps/s | |
test_vmap_mlp_speed[True-True] | 1.1121ms | 0.6574ms | 1.5212 KOps/s | 1.5282 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.8354ms | 0.6552ms | 1.5263 KOps/s | 1.5356 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.8285ms | 0.5097ms | 1.9618 KOps/s | 1.9854 KOps/s | |
test_vmap_mlp_speed[False-False] | 1.9622ms | 0.5212ms | 1.9188 KOps/s | 1.9736 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.3497ms | 0.6348ms | 1.5753 KOps/s | 1.5723 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0701ms | 0.6360ms | 1.5724 KOps/s | 1.5753 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8002ms | 0.5254ms | 1.9033 KOps/s | 1.9296 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 1.1677ms | 0.5286ms | 1.8919 KOps/s | 1.9250 KOps/s | |
test_to_module_speed[True] | 2.0876ms | 1.3089ms | 763.9970 Ops/s | 772.2036 Ops/s | |
test_to_module_speed[False] | 2.0770ms | 1.2897ms | 775.3959 Ops/s | 786.2237 Ops/s | |
test_tc_init | 0.1111ms | 41.5081μs | 24.0917 KOps/s | 22.8330 KOps/s | |
test_tc_init_nested | 0.1676ms | 83.7522μs | 11.9400 KOps/s | 11.4349 KOps/s | |
test_tc_first_layer_tensor | 21.6300μs | 1.5393μs | 649.6576 KOps/s | 658.9524 KOps/s | |
test_tc_first_layer_nontensor | 26.2090μs | 4.6925μs | 213.1040 KOps/s | 216.2316 KOps/s | |
test_tc_second_layer_tensor | 0.1481ms | 3.0179μs | 331.3560 KOps/s | 356.5383 KOps/s | |
test_tc_second_layer_nontensor | 92.4930μs | 6.0357μs | 165.6798 KOps/s | 168.7859 KOps/s | |
test_unbind | 0.5148s | 13.8986ms | 71.9495 Ops/s | 71.9781 Ops/s | |
test_full_like | 9.9687ms | 8.8616ms | 112.8460 Ops/s | 71.0170 Ops/s | |
test_zeros_like | 3.9354ms | 3.3556ms | 298.0072 Ops/s | 135.3500 Ops/s | |
test_ones_like | 12.1390ms | 6.6810ms | 149.6793 Ops/s | 128.5620 Ops/s | |
test_clone | 13.5520ms | 8.5535ms | 116.9114 Ops/s | 102.4368 Ops/s | |
test_squeeze | 71.1730μs | 12.6525μs | 79.0355 KOps/s | 80.6989 KOps/s | |
test_unsqueeze | 0.3757ms | 92.9416μs | 10.7594 KOps/s | 11.0986 KOps/s | |
test_split | 0.3506ms | 0.1951ms | 5.1250 KOps/s | 5.2025 KOps/s | |
test_permute | 0.5534ms | 0.2327ms | 4.2976 KOps/s | 4.6083 KOps/s | |
test_stack | 33.9224ms | 27.0381ms | 36.9848 Ops/s | 37.9305 Ops/s | |
test_cat | 30.9209ms | 26.7265ms | 37.4160 Ops/s | 37.5524 Ops/s |

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.1432ms | 14.0830μs | 71.0077 KOps/s | 68.4562 KOps/s | |
test_plain_set_stack_nested | 41.2320μs | 14.0611μs | 71.1184 KOps/s | 67.5806 KOps/s | |
test_plain_set_nested_inplace | 59.3130μs | 15.1850μs | 65.8546 KOps/s | 62.5899 KOps/s | |
test_plain_set_stack_nested_inplace | 42.6420μs | 15.1422μs | 66.0405 KOps/s | 63.2514 KOps/s | |
test_items | 34.8310μs | 2.8834μs | 346.8094 KOps/s | 340.9742 KOps/s | |
test_items_nested | 0.3661ms | 0.3282ms | 3.0474 KOps/s | 3.1097 KOps/s | |
test_items_nested_locked | 0.4055ms | 0.3270ms | 3.0579 KOps/s | 3.0692 KOps/s | |
test_items_nested_leaf | 74.3040μs | 55.3585μs | 18.0641 KOps/s | 18.0315 KOps/s | |
test_items_stack_nested | 0.4427ms | 0.3253ms | 3.0741 KOps/s | 3.0883 KOps/s | |
test_items_stack_nested_leaf | 89.5740μs | 55.7232μs | 17.9458 KOps/s | 17.6316 KOps/s | |
test_items_stack_nested_locked | 0.4352ms | 0.3293ms | 3.0372 KOps/s | 3.0557 KOps/s | |
test_keys | 28.4210μs | 3.6767μs | 271.9815 KOps/s | 291.8893 KOps/s | |
test_keys_nested | 82.1740μs | 56.4288μs | 17.7214 KOps/s | 17.9544 KOps/s | |
test_keys_nested_locked | 0.7115ms | 62.0100μs | 16.1264 KOps/s | 16.2372 KOps/s | |
test_keys_nested_leaf | 93.7750μs | 46.9158μs | 21.3148 KOps/s | 21.8966 KOps/s | |
test_keys_stack_nested | 94.9250μs | 57.0546μs | 17.5271 KOps/s | 17.9864 KOps/s | |
test_keys_stack_nested_leaf | 89.8940μs | 47.6435μs | 20.9892 KOps/s | 21.1100 KOps/s | |
test_keys_stack_nested_locked | 0.1054ms | 61.6978μs | 16.2080 KOps/s | 16.5113 KOps/s | |
test_values | 5.4737μs | 0.8601μs | 1.1627 MOps/s | 1.1729 MOps/s | |
test_values_nested | 71.4240μs | 40.6843μs | 24.5795 KOps/s | 24.6582 KOps/s | |
test_values_nested_locked | 74.9140μs | 42.5722μs | 23.4895 KOps/s | 23.3793 KOps/s | |
test_values_nested_leaf | 75.5140μs | 35.3736μs | 28.2697 KOps/s | 28.3379 KOps/s | |
test_values_stack_nested | 72.6230μs | 41.0647μs | 24.3518 KOps/s | 24.1804 KOps/s | |
test_values_stack_nested_leaf | 78.7740μs | 35.7892μs | 27.9414 KOps/s | 28.1166 KOps/s | |
test_values_stack_nested_locked | 88.7240μs | 42.7943μs | 23.3676 KOps/s | 23.1748 KOps/s | |
test_membership | 1.8026μs | 0.5086μs | 1.9662 MOps/s | 1.9780 MOps/s | |
test_membership_nested | 11.3055μs | 1.8755μs | 533.1777 KOps/s | 546.9621 KOps/s | |
test_membership_nested_leaf | 20.2460μs | 1.8759μs | 533.0646 KOps/s | 548.0033 KOps/s | |
test_membership_stacked_nested | 25.8020μs | 1.9235μs | 519.8935 KOps/s | 526.6308 KOps/s | |
test_membership_stacked_nested_leaf | 0.1072ms | 1.9020μs | 525.7599 KOps/s | 520.5789 KOps/s | |
test_membership_nested_last | 26.3720μs | 2.7836μs | 359.2426 KOps/s | 359.2460 KOps/s | |
test_membership_nested_leaf_last | 34.0320μs | 2.7935μs | 357.9715 KOps/s | 360.4569 KOps/s | |
test_membership_stacked_nested_last | 28.6420μs | 2.7602μs | 362.2882 KOps/s | 128.2886 KOps/s | |
test_membership_stacked_nested_leaf_last | 27.6720μs | 2.7351μs | 365.6120 KOps/s | 127.7125 KOps/s | |
test_nested_getleaf | 36.6920μs | 6.0389μs | 165.5922 KOps/s | 163.0773 KOps/s | |
test_nested_get | 34.7110μs | 5.6824μs | 175.9818 KOps/s | 172.4071 KOps/s | |
test_stacked_getleaf | 50.3430μs | 5.9823μs | 167.1591 KOps/s | 167.0904 KOps/s | |
test_stacked_get | 34.7520μs | 5.6153μs | 178.0856 KOps/s | 176.0323 KOps/s | |
test_nested_getitemleaf | 28.5020μs | 6.1493μs | 162.6203 KOps/s | 163.5780 KOps/s | |
test_nested_getitem | 34.7020μs | 5.7400μs | 174.2149 KOps/s | 172.1475 KOps/s | |
test_stacked_getitemleaf | 30.7620μs | 6.0807μs | 164.4554 KOps/s | 164.5559 KOps/s | |
test_stacked_getitem | 36.2120μs | 5.6541μs | 176.8620 KOps/s | 176.6777 KOps/s | |
test_lock_nested | 10.2877ms | 0.4142ms | 2.4144 KOps/s | 2.3985 KOps/s | |
test_lock_stack_nested | 0.4312ms | 0.3741ms | 2.6732 KOps/s | 2.7320 KOps/s | |
test_unlock_nested | 0.7649ms | 0.3482ms | 2.8723 KOps/s | 2.8259 KOps/s | |
test_unlock_stack_nested | 0.3863ms | 0.3133ms | 3.1915 KOps/s | 3.2742 KOps/s | |
test_flatten_speed | 0.1434ms | 68.7059μs | 14.5548 KOps/s | 14.5253 KOps/s | |
test_unflatten_speed | 0.3275ms | 0.2825ms | 3.5395 KOps/s | 3.5269 KOps/s | |
test_common_ops | 1.6216ms | 1.2376ms | 808.0121 Ops/s | 773.6844 Ops/s | |
test_creation | 28.0310μs | 1.4982μs | 667.4722 KOps/s | 665.1768 KOps/s | |
test_creation_empty | 43.7820μs | 15.9209μs | 62.8107 KOps/s | 57.3229 KOps/s | |
test_creation_nested_1 | 69.2930μs | 17.6995μs | 56.4988 KOps/s | 51.2365 KOps/s | |
test_creation_nested_2 | 51.4530μs | 20.1672μs | 49.5854 KOps/s | 46.3741 KOps/s | |
test_clone | 63.8030μs | 28.6773μs | 34.8708 KOps/s | 33.9117 KOps/s | |
test_getitem[int] | 1.2279ms | 15.1992μs | 65.7931 KOps/s | 64.2361 KOps/s | |
test_getitem[slice_int] | 0.1196ms | 26.9041μs | 37.1690 KOps/s | 36.2584 KOps/s | |
test_getitem[range] | 0.2281ms | 0.1079ms | 9.2683 KOps/s | 9.3448 KOps/s | |
test_getitem[tuple] | 0.1215ms | 24.1373μs | 41.4296 KOps/s | 43.3546 KOps/s | |
test_getitem[list] | 0.2105ms | 0.1042ms | 9.6009 KOps/s | 10.2311 KOps/s | |
test_setitem_dim[int] | 98.8450μs | 47.4532μs | 21.0734 KOps/s | 22.4556 KOps/s | |
test_setitem_dim[slice_int] | 90.1840μs | 66.1341μs | 15.1208 KOps/s | 14.9350 KOps/s | |
test_setitem_dim[range] | 0.1748ms | 0.1262ms | 7.9256 KOps/s | 7.8950 KOps/s | |
test_setitem_dim[tuple] | 92.3650μs | 60.5638μs | 16.5115 KOps/s | 16.6472 KOps/s | |
test_setitem | 66.4130μs | 41.7871μs | 23.9308 KOps/s | 23.3323 KOps/s | |
test_set | 69.3430μs | 40.4671μs | 24.7114 KOps/s | 23.5618 KOps/s | |
test_set_shared | 0.3526ms | 50.2696μs | 19.8928 KOps/s | 19.6987 KOps/s | |
test_update | 92.0240μs | 49.6441μs | 20.1434 KOps/s | 19.2190 KOps/s | |
test_update_nested | 95.5850μs | 56.3216μs | 17.7552 KOps/s | 16.4592 KOps/s | |
test_update__nested | 95.7740μs | 59.0185μs | 16.9438 KOps/s | 15.2531 KOps/s | |
test_set_nested | 86.1040μs | 43.0625μs | 23.2221 KOps/s | 20.5377 KOps/s | |
test_set_nested_new | 84.8740μs | 46.1469μs | 21.6699 KOps/s | 20.6552 KOps/s | |
test_select | 96.9250μs | 60.1629μs | 16.6215 KOps/s | 15.6684 KOps/s | |
test_select_nested | 80.4740μs | 41.4030μs | 24.1528 KOps/s | 24.0192 KOps/s | |
test_exclude_nested | 86.7240μs | 58.5433μs | 17.0814 KOps/s | 17.0949 KOps/s | |
test_empty[True] | 0.2807ms | 0.2487ms | 4.0210 KOps/s | 4.0777 KOps/s | |
test_empty[False] | 4.0362μs | 0.7395μs | 1.3522 MOps/s | 1.3623 MOps/s | |
test_to | 57.8730μs | 25.6451μs | 38.9938 KOps/s | 41.4183 KOps/s | |
test_to_nonblocking | 58.1730μs | 24.2999μs | 41.1525 KOps/s | 42.9836 KOps/s | |
test_unbind_speed | 1.6755ms | 0.2683ms | 3.7269 KOps/s | 3.6795 KOps/s | |
test_unbind_speed_stack0 | 0.3507ms | 0.2699ms | 3.7055 KOps/s | 3.8208 KOps/s | |
test_unbind_speed_stack1 | 92.4087ms | 0.6949ms | 1.4391 KOps/s | 1.4556 KOps/s | |
test_split | 94.2596ms | 2.0987ms | 476.4858 Ops/s | 466.5850 Ops/s | |
test_chunk | 95.5196ms | 2.0870ms | 479.1585 Ops/s | 464.3543 Ops/s | |
test_creation[device0] | 0.3832ms | 0.1257ms | 7.9585 KOps/s | 7.9657 KOps/s | |
test_creation_from_tensor | 0.3845ms | 0.1295ms | 7.7231 KOps/s | 7.8323 KOps/s | |
test_add_one[memmap_tensor0] | 0.2296ms | 8.3903μs | 119.1857 KOps/s | 112.4843 KOps/s | |
test_contiguous[memmap_tensor0] | 27.1520μs | 2.1551μs | 464.0147 KOps/s | 472.2026 KOps/s | |
test_stack[memmap_tensor0] | 35.4820μs | 6.3398μs | 157.7343 KOps/s | 149.6916 KOps/s | |
test_memmaptd_index | 1.0640ms | 0.4054ms | 2.4666 KOps/s | 2.4391 KOps/s | |
test_memmaptd_index_astensor | 0.7244ms | 0.4678ms | 2.1376 KOps/s | 2.1325 KOps/s | |
test_memmaptd_index_op | 1.3734ms | 0.9859ms | 1.0143 KOps/s | 964.0175 Ops/s | |
test_serialize_model | 0.1304s | 0.1291s | 7.7466 Ops/s | 7.7533 Ops/s | |
test_serialize_model_pickle | 1.3472s | 1.2133s | 0.8242 Ops/s | 0.8246 Ops/s | |
test_serialize_weights | 0.2211s | 0.1419s | 7.0475 Ops/s | 7.7745 Ops/s | |
test_serialize_weights_returnearly | 0.2240s | 55.1423ms | 18.1349 Ops/s | 17.8056 Ops/s | |
test_serialize_weights_pickle | 1.3721s | 1.2167s | 0.8219 Ops/s | 0.8217 Ops/s | |
test_reshape_pytree | 62.4430μs | 35.1483μs | 28.4509 KOps/s | 28.4220 KOps/s | |
test_reshape_td | 89.0840μs | 40.9175μs | 24.4394 KOps/s | 22.2399 KOps/s | |
test_view_pytree | 69.0830μs | 35.0468μs | 28.5333 KOps/s | 29.1144 KOps/s | |
test_view_td | 0.1058ms | 45.7586μs | 21.8538 KOps/s | 21.4738 KOps/s | |
test_unbind_pytree | 73.5730μs | 35.5580μs | 28.1231 KOps/s | 29.8605 KOps/s | |
test_unbind_td | 0.5700ms | 42.7221μs | 23.4071 KOps/s | 23.7980 KOps/s | |
test_split_pytree | 0.5112ms | 47.0863μs | 21.2376 KOps/s | 21.7620 KOps/s | |
test_split_td | 0.1724ms | 55.0569μs | 18.1630 KOps/s | 18.0360 KOps/s | |
test_add_pytree | 0.1092ms | 56.1021μs | 17.8246 KOps/s | 17.4220 KOps/s | |
test_add_td | 0.1467ms | 88.1778μs | 11.3407 KOps/s | 10.6703 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4048ms | 0.2112ms | 4.7341 KOps/s | 4.7089 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.1960ms | 0.1468ms | 6.8121 KOps/s | 6.6594 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1897ms | 0.1422ms | 7.0347 KOps/s | 6.7978 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2358ms | 0.1807ms | 5.5326 KOps/s | 5.4535 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 46.6820μs | 20.9039μs | 47.8380 KOps/s | 45.8902 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 84.9340μs | 43.7489μs | 22.8577 KOps/s | 23.2965 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2448ms | 64.2526μs | 15.5636 KOps/s | 15.6046 KOps/s | |
test_compile_copy_nested[pytree-eager] | 91.2340μs | 49.6059μs | 20.1589 KOps/s | 20.4353 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3617ms | 0.3122ms | 3.2033 KOps/s | 3.1965 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2521ms | 0.2051ms | 4.8759 KOps/s | 4.8483 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2054ms | 0.1263ms | 7.9154 KOps/s | 7.8540 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1177ms | 60.4867μs | 16.5326 KOps/s | 16.5403 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3619ms | 0.3123ms | 3.2019 KOps/s | 3.2197 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7147ms | 0.6466ms | 1.5465 KOps/s | 1.5907 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3860ms | 0.2437ms | 4.1028 KOps/s | 4.0373 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3609ms | 0.3141ms | 3.1839 KOps/s | 3.1741 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1146ms | 69.7178μs | 14.3435 KOps/s | 13.7882 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1818ms | 0.1269ms | 7.8830 KOps/s | 7.8362 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6217ms | 0.5258ms | 1.9018 KOps/s | 1.7956 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3634ms | 0.3121ms | 3.2039 KOps/s | 3.2078 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 56.8720μs | 19.0439μs | 52.5102 KOps/s | 54.5021 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 52.8320μs | 26.6159μs | 37.5715 KOps/s | 37.4417 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1019ms | 68.6650μs | 14.5635 KOps/s | 14.5067 KOps/s | |
test_compile_copy_flat[pytree-eager] | 77.9930μs | 51.3113μs | 19.4889 KOps/s | 19.5873 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.2667ms | 0.7918ms | 1.2629 KOps/s | 1.1682 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.3796ms | 3.1672ms | 315.7361 Ops/s | 320.1249 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.2543ms | 0.7841ms | 1.2753 KOps/s | 1.1797 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.5667ms | 3.1797ms | 314.4940 Ops/s | 310.5238 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.5096ms | 0.1102ms | 9.0781 KOps/s | 8.9195 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2030ms | 61.9236μs | 16.1489 KOps/s | 15.4617 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1357ms | 0.1010ms | 9.8976 KOps/s | 9.5665 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1548ms | 41.6747μs | 23.9953 KOps/s | 21.8372 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1890ms | 0.1015ms | 9.8569 KOps/s | 9.5350 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 86.0940μs | 41.7978μs | 23.9247 KOps/s | 23.4715 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1959ms | 0.1350ms | 7.4088 KOps/s | 7.2880 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1585ms | 24.0709μs | 41.5440 KOps/s | 39.6545 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1715ms | 0.1286ms | 7.7773 KOps/s | 7.6830 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 50.9020μs | 20.3588μs | 49.1189 KOps/s | 49.0558 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1722ms | 0.1299ms | 7.6985 KOps/s | 7.5526 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 55.7220μs | 19.9227μs | 50.1939 KOps/s | 48.8756 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1722ms | 0.1363ms | 7.3364 KOps/s | 7.3890 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.3956ms | 23.9601μs | 41.7361 KOps/s | 40.5168 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1889ms | 0.1296ms | 7.7164 KOps/s | 7.6940 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 62.4030μs | 20.7503μs | 48.1920 KOps/s | 48.9097 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1824ms | 0.1294ms | 7.7284 KOps/s | 7.6497 KOps/s | |
test_compile_indexing[int-pytree-eager] | 79.8430μs | 20.1596μs | 49.6043 KOps/s | 48.9998 KOps/s | |
test_mod_add[eager] | 68.7330μs | 30.7174μs | 32.5548 KOps/s | 30.3950 KOps/s | |
test_mod_add[compile] | 0.2789ms | 68.2608μs | 14.6497 KOps/s | 14.1800 KOps/s | |
test_mod_add[compile-overhead] | 0.2627ms | 0.1334ms | 7.4961 KOps/s | 7.1794 KOps/s | |
test_mod_wrap[eager] | 0.3407ms | 0.2379ms | 4.2027 KOps/s | 4.1237 KOps/s | |
test_mod_wrap[compile] | 1.4723ms | 0.3012ms | 3.3203 KOps/s | 3.3956 KOps/s | |
test_mod_wrap[compile-overhead] | 7.6801ms | 4.0724ms | 245.5580 Ops/s | 246.4649 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4856ms | 1.3666ms | 731.7402 Ops/s | 694.3453 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5397ms | 1.3027ms | 767.6654 Ops/s | 753.8951 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3124ms | 0.8877ms | 1.1265 KOps/s | 1.1083 KOps/s | |
test_seq_add[eager] | 0.2122ms | 98.6858μs | 10.1332 KOps/s | 9.8765 KOps/s | |
test_seq_add[compile] | 0.2366ms | 78.0452μs | 12.8131 KOps/s | 12.4134 KOps/s | |
test_seq_add[compile-overhead] | 0.1483ms | 0.1119ms | 8.9337 KOps/s | 8.8325 KOps/s | |
test_seq_wrap[eager] | 0.4603ms | 0.3773ms | 2.6504 KOps/s | 2.5338 KOps/s | |
test_seq_wrap[compile] | 0.3589ms | 0.3065ms | 3.2622 KOps/s | 3.1611 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2658ms | 0.2162ms | 4.6260 KOps/s | 4.6006 KOps/s | |
test_func_call_runtime[False-eager] | 0.8199ms | 0.7309ms | 1.3681 KOps/s | 1.3345 KOps/s | |
test_func_call_runtime[False-compile] | 0.9599ms | 0.7655ms | 1.3064 KOps/s | 1.2770 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.3954ms | 0.3519ms | 2.8418 KOps/s | 2.8305 KOps/s | |
test_func_call_runtime[True-eager] | 1.0046ms | 0.8999ms | 1.1112 KOps/s | 1.1038 KOps/s | |
test_func_call_runtime[True-compile] | 0.8602ms | 0.8058ms | 1.2410 KOps/s | 1.2146 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4438ms | 0.3871ms | 2.5836 KOps/s | 2.5866 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7799ms | 0.7244ms | 1.3804 KOps/s | 1.3501 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8915ms | 0.8074ms | 1.2386 KOps/s | 1.2704 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4071ms | 0.3549ms | 2.8180 KOps/s | 2.8163 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0978ms | 0.9889ms | 1.0113 KOps/s | 997.3178 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8734ms | 0.8284ms | 1.2072 KOps/s | 1.1835 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4720ms | 0.4122ms | 2.4262 KOps/s | 2.4067 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5500ms | 2.0580ms | 485.9150 Ops/s | 480.2642 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9061ms | 0.8423ms | 1.1872 KOps/s | 1.1577 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4985ms | 0.4177ms | 2.3941 KOps/s | 2.3680 KOps/s | |
test_distributed | 2.3670ms | 0.2413ms | 4.1441 KOps/s | 8.7982 KOps/s | |
test_tdmodule | 0.1082ms | 15.0316μs | 66.5266 KOps/s | 62.3680 KOps/s | |
test_tdmodule_dispatch | 67.8940μs | 29.4517μs | 33.9539 KOps/s | 31.9363 KOps/s | |
test_tdseq | 35.0320μs | 15.9283μs | 62.7814 KOps/s | 57.8128 KOps/s | |
test_tdseq_dispatch | 53.7730μs | 32.3873μs | 30.8763 KOps/s | 28.2556 KOps/s | |
test_instantiation_functorch | 1.9684ms | 1.8283ms | 546.9537 Ops/s | 530.1458 Ops/s | |
test_instantiation_td | 1.7719ms | 1.1760ms | 850.3347 Ops/s | 825.9726 Ops/s | |
test_exec_functorch | 0.2513ms | 0.2083ms | 4.8013 KOps/s | 4.7581 KOps/s | |
test_exec_functional_call | 0.2495ms | 0.2058ms | 4.8597 KOps/s | 4.7835 KOps/s | |
test_exec_td | 0.2556ms | 0.2128ms | 4.7000 KOps/s | 4.5010 KOps/s | |
test_exec_td_decorator | 0.7038ms | 0.2531ms | 3.9502 KOps/s | 3.8431 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8006ms | 0.6876ms | 1.4544 KOps/s | 1.4425 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7367ms | 0.6894ms | 1.4506 KOps/s | 1.4477 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6545ms | 0.5788ms | 1.7276 KOps/s | 1.7386 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.7361ms | 0.5795ms | 1.7256 KOps/s | 1.7384 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7638ms | 0.6723ms | 1.4874 KOps/s | 1.4814 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7722ms | 0.6751ms | 1.4812 KOps/s | 1.4744 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7161ms | 0.5922ms | 1.6886 KOps/s | 1.6530 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7187ms | 0.5946ms | 1.6817 KOps/s | 1.6431 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.4434ms | 8.3282ms | 120.0743 Ops/s | 118.6959 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.3584ms | 8.3015ms | 120.4595 Ops/s | 118.6021 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.1568ms | 8.0930ms | 123.5630 Ops/s | 121.5702 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.1820ms | 8.1239ms | 123.0941 Ops/s | 121.5268 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.7252ms | 19.5897ms | 51.0473 Ops/s | 50.9097 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.6708ms | 19.5849ms | 51.0597 Ops/s | 50.9107 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.5890ms | 19.4433ms | 51.4315 Ops/s | 51.3294 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.5456ms | 19.4262ms | 51.4768 Ops/s | 51.3051 Ops/s | |
test_to_module_speed[True] | 1.3937ms | 0.9428ms | 1.0607 KOps/s | 1.0490 KOps/s | |
test_to_module_speed[False] | 1.3004ms | 0.9158ms | 1.0919 KOps/s | 1.0756 KOps/s | |
test_tc_init | 69.7830μs | 34.7074μs | 28.8123 KOps/s | 27.4887 KOps/s | |
test_tc_init_nested | 0.1005ms | 69.8600μs | 14.3143 KOps/s | 13.6601 KOps/s | |
test_tc_first_layer_tensor | 7.6176μs | 0.6750μs | 1.4816 MOps/s | 1.4903 MOps/s | |
test_tc_first_layer_nontensor | 22.4810μs | 2.2286μs | 448.7070 KOps/s | 442.0895 KOps/s | |
test_tc_second_layer_tensor | 7.3253μs | 1.3860μs | 721.4995 KOps/s | 740.3153 KOps/s | |
test_tc_second_layer_nontensor | 69.5730μs | 2.9590μs | 337.9552 KOps/s | 337.3781 KOps/s | |
test_unbind | 0.1966s | 12.0783ms | 82.7932 Ops/s | 92.0561 Ops/s | |
test_full_like | 0.6569ms | 0.5740ms | 1.7421 KOps/s | 1.7456 KOps/s | |
test_zeros_like | 0.2703ms | 0.1980ms | 5.0512 KOps/s | 5.0540 KOps/s | |
test_ones_like | 0.2542ms | 0.1978ms | 5.0563 KOps/s | 5.0597 KOps/s | |
test_clone | 0.4501ms | 0.4146ms | 2.4117 KOps/s | 2.4129 KOps/s | |
test_squeeze | 41.1620μs | 9.8640μs | 101.3791 KOps/s | 102.1961 KOps/s | |
test_unsqueeze | 0.2296ms | 75.7218μs | 13.2062 KOps/s | 13.3367 KOps/s | |
test_split | 0.3901ms | 0.1560ms | 6.4085 KOps/s | 6.2776 KOps/s | |
test_permute | 0.2651ms | 0.1806ms | 5.5356 KOps/s | 5.5809 KOps/s | |
test_stack | 1.2495ms | 0.8640ms | 1.1574 KOps/s | 1.1428 KOps/s | |
test_cat | 1.2545ms | 1.2320ms | 811.6925 Ops/s | 812.1278 Ops/s |
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Stack from ghstack (oldest at bottom):