[Feature] Make ProbabilisticTensorDictSequential account for more than one distribution #1114
Merged
This was referenced Nov 26, 2024
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 37.4690μs | 17.9191μs | 55.8063 KOps/s | 60.4530 KOps/s | |
test_plain_set_stack_nested | 51.7260μs | 18.1616μs | 55.0613 KOps/s | 60.1200 KOps/s | |
test_plain_set_nested_inplace | 53.0290μs | 19.3849μs | 51.5865 KOps/s | 53.7500 KOps/s | |
test_plain_set_stack_nested_inplace | 62.5760μs | 19.7311μs | 50.6815 KOps/s | 53.4434 KOps/s | |
test_items | 20.8680μs | 4.0726μs | 245.5462 KOps/s | 240.5885 KOps/s | |
test_items_nested | 0.6102ms | 0.4003ms | 2.4981 KOps/s | 2.5100 KOps/s | |
test_items_nested_locked | 0.5583ms | 0.4015ms | 2.4908 KOps/s | 2.5010 KOps/s | |
test_items_nested_leaf | 0.1519ms | 71.3814μs | 14.0092 KOps/s | 14.0945 KOps/s | |
test_items_stack_nested | 0.5691ms | 0.4032ms | 2.4803 KOps/s | 2.5054 KOps/s | |
test_items_stack_nested_leaf | 0.1291ms | 71.3056μs | 14.0241 KOps/s | 13.7833 KOps/s | |
test_items_stack_nested_locked | 0.7453ms | 0.4052ms | 2.4680 KOps/s | 2.4803 KOps/s | |
test_keys | 47.2080μs | 3.5072μs | 285.1304 KOps/s | 211.1447 KOps/s | |
test_keys_nested | 0.2287ms | 0.1444ms | 6.9254 KOps/s | 7.0646 KOps/s | |
test_keys_nested_locked | 1.7091ms | 0.1500ms | 6.6678 KOps/s | 6.7340 KOps/s | |
test_keys_nested_leaf | 0.2117ms | 0.1256ms | 7.9624 KOps/s | 8.1380 KOps/s | |
test_keys_stack_nested | 0.2280ms | 0.1440ms | 6.9454 KOps/s | 7.0696 KOps/s | |
test_keys_stack_nested_leaf | 0.1813ms | 0.1254ms | 7.9721 KOps/s | 8.1168 KOps/s | |
test_keys_stack_nested_locked | 0.2425ms | 0.1495ms | 6.6876 KOps/s | 6.6593 KOps/s | |
test_values | 8.6060μs | 1.0260μs | 974.6721 KOps/s | 951.3335 KOps/s | |
test_values_nested | 0.1115ms | 55.5164μs | 18.0127 KOps/s | 18.0875 KOps/s | |
test_values_nested_locked | 0.1140ms | 55.3308μs | 18.0731 KOps/s | 16.6068 KOps/s | |
test_values_nested_leaf | 0.1335ms | 60.5095μs | 16.5263 KOps/s | 16.5714 KOps/s | |
test_values_stack_nested | 0.1068ms | 55.3541μs | 18.0655 KOps/s | 17.6547 KOps/s | |
test_values_stack_nested_leaf | 0.1183ms | 61.2435μs | 16.3283 KOps/s | 16.6711 KOps/s | |
test_values_stack_nested_locked | 0.1093ms | 55.4125μs | 18.0465 KOps/s | 18.1941 KOps/s | |
test_membership | 2.9946μs | 0.7117μs | 1.4050 MOps/s | 1.1005 MOps/s | |
test_membership_nested | 21.1600μs | 2.9090μs | 343.7630 KOps/s | 333.4704 KOps/s | |
test_membership_nested_leaf | 44.6630μs | 2.9593μs | 337.9130 KOps/s | 331.7004 KOps/s | |
test_membership_stacked_nested | 25.7670μs | 2.8884μs | 346.2090 KOps/s | 320.9830 KOps/s | |
test_membership_stacked_nested_leaf | 23.7450μs | 2.9103μs | 343.6106 KOps/s | 336.6981 KOps/s | |
test_membership_nested_last | 50.4440μs | 4.2449μs | 235.5763 KOps/s | 231.8551 KOps/s | |
test_membership_nested_leaf_last | 24.3260μs | 4.2107μs | 237.4928 KOps/s | 231.1480 KOps/s | |
test_membership_stacked_nested_last | 57.6870μs | 4.2149μs | 237.2518 KOps/s | 230.7567 KOps/s | |
test_membership_stacked_nested_leaf_last | 38.0410μs | 4.2444μs | 235.6058 KOps/s | 233.3149 KOps/s | |
test_nested_getleaf | 55.4230μs | 10.9300μs | 91.4909 KOps/s | 91.9200 KOps/s | |
test_nested_get | 54.2610μs | 10.3542μs | 96.5796 KOps/s | 97.8202 KOps/s | |
test_stacked_getleaf | 34.7640μs | 10.9472μs | 91.3479 KOps/s | 91.1710 KOps/s | |
test_stacked_get | 50.1530μs | 10.2752μs | 97.3220 KOps/s | 97.7904 KOps/s | |
test_nested_getitemleaf | 62.7660μs | 11.3285μs | 88.2732 KOps/s | 89.4957 KOps/s | |
test_nested_getitem | 43.3600μs | 10.5330μs | 94.9393 KOps/s | 95.8081 KOps/s | |
test_stacked_getitemleaf | 53.3290μs | 11.2019μs | 89.2710 KOps/s | 88.8412 KOps/s | |
test_stacked_getitem | 38.3410μs | 10.5866μs | 94.4588 KOps/s | 95.4084 KOps/s | |
test_lock_nested | 3.2856ms | 0.4532ms | 2.2068 KOps/s | 2.2137 KOps/s | |
test_lock_stack_nested | 0.6806ms | 0.4229ms | 2.3649 KOps/s | 2.3736 KOps/s | |
test_unlock_nested | 0.7302ms | 0.3640ms | 2.7469 KOps/s | 2.6865 KOps/s | |
test_unlock_stack_nested | 0.8728ms | 0.3399ms | 2.9424 KOps/s | 2.9372 KOps/s | |
test_flatten_speed | 0.1816ms | 95.2715μs | 10.4963 KOps/s | 10.3133 KOps/s | |
test_unflatten_speed | 1.0873ms | 0.4989ms | 2.0045 KOps/s | 2.0409 KOps/s | |
test_common_ops | 1.6208ms | 0.8079ms | 1.2378 KOps/s | 1.3304 KOps/s | |
test_creation | 94.5160μs | 2.2171μs | 451.0463 KOps/s | 480.4217 KOps/s | |
test_creation_empty | 45.8250μs | 11.3807μs | 87.8678 KOps/s | 111.4736 KOps/s | |
test_creation_nested_1 | 49.6020μs | 14.1437μs | 70.7027 KOps/s | 85.2093 KOps/s | |
test_creation_nested_2 | 54.7220μs | 19.0936μs | 52.3735 KOps/s | 62.2832 KOps/s | |
test_clone | 0.1133ms | 13.5095μs | 74.0222 KOps/s | 74.7172 KOps/s | |
test_getitem[int] | 1.0623ms | 12.8391μs | 77.8869 KOps/s | 78.4085 KOps/s | |
test_getitem[slice_int] | 0.1424ms | 25.3815μs | 39.3987 KOps/s | 39.9834 KOps/s | |
test_getitem[range] | 0.2399ms | 50.4549μs | 19.8197 KOps/s | 20.6314 KOps/s | |
test_getitem[tuple] | 0.1345ms | 20.6050μs | 48.5318 KOps/s | 49.2735 KOps/s | |
test_getitem[list] | 0.3321ms | 45.0413μs | 22.2019 KOps/s | 22.9820 KOps/s | |
test_setitem_dim[int] | 79.9590μs | 26.0750μs | 38.3510 KOps/s | 39.7523 KOps/s | |
test_setitem_dim[slice_int] | 84.1760μs | 53.6924μs | 18.6246 KOps/s | 18.9652 KOps/s | |
test_setitem_dim[range] | 0.1192ms | 75.4500μs | 13.2538 KOps/s | 13.8048 KOps/s | |
test_setitem_dim[tuple] | 80.1890μs | 42.3284μs | 23.6248 KOps/s | 24.3521 KOps/s | |
test_setitem | 0.1598ms | 21.1730μs | 47.2300 KOps/s | 52.5976 KOps/s | |
test_set | 0.1074ms | 20.9195μs | 47.8023 KOps/s | 52.7196 KOps/s | |
test_set_shared | 3.3452ms | 0.1753ms | 5.7061 KOps/s | 5.8410 KOps/s | |
test_update | 0.1537ms | 23.6858μs | 42.2193 KOps/s | 48.6828 KOps/s | |
test_update_nested | 0.1098ms | 34.8717μs | 28.6766 KOps/s | 32.2953 KOps/s | |
test_update__nested | 0.4205ms | 33.4172μs | 29.9247 KOps/s | 30.2455 KOps/s | |
test_set_nested | 0.1010ms | 22.9762μs | 43.5233 KOps/s | 48.2586 KOps/s | |
test_set_nested_new | 0.1013ms | 27.6990μs | 36.1024 KOps/s | 39.5912 KOps/s | |
test_select | 0.2152ms | 45.0125μs | 22.2161 KOps/s | 23.8411 KOps/s | |
test_select_nested | 0.1303ms | 61.7460μs | 16.1954 KOps/s | 16.0302 KOps/s | |
test_exclude_nested | 0.1571ms | 80.6010μs | 12.4068 KOps/s | 12.2325 KOps/s | |
test_empty[True] | 0.7208ms | 0.3922ms | 2.5498 KOps/s | 2.5277 KOps/s | |
test_empty[False] | 10.6673μs | 1.2690μs | 788.0236 KOps/s | 807.6065 KOps/s | |
test_unbind_speed | 0.3512ms | 0.2673ms | 3.7417 KOps/s | 3.6851 KOps/s | |
test_unbind_speed_stack0 | 0.4500ms | 0.2656ms | 3.7653 KOps/s | 3.7173 KOps/s | |
test_unbind_speed_stack1 | 0.1063s | 0.7900ms | 1.2659 KOps/s | 1.3902 KOps/s | |
test_split | 2.7049ms | 1.5904ms | 628.7725 Ops/s | 568.8757 Ops/s | |
test_chunk | 0.1025s | 1.9119ms | 523.0427 Ops/s | 566.5834 Ops/s | |
test_consolidate_njt[False-None] | 9.0165ms | 8.1565ms | 122.6023 Ops/s | 123.0934 Ops/s | |
test_creation[device0] | 3.3677ms | 95.3944μs | 10.4828 KOps/s | 10.6955 KOps/s | |
test_creation_from_tensor | 0.2781ms | 96.1474μs | 10.4007 KOps/s | 10.6202 KOps/s | |
test_add_one[memmap_tensor0] | 0.1604ms | 5.1284μs | 194.9927 KOps/s | 210.5966 KOps/s | |
test_contiguous[memmap_tensor0] | 18.9160μs | 0.5452μs | 1.8342 MOps/s | 1.9373 MOps/s | |
test_stack[memmap_tensor0] | 40.0950μs | 3.5217μs | 283.9555 KOps/s | 301.3547 KOps/s | |
test_memmaptd_index | 1.0284ms | 0.2386ms | 4.1910 KOps/s | 4.3142 KOps/s | |
test_memmaptd_index_astensor | 0.6552ms | 0.3166ms | 3.1586 KOps/s | 3.2211 KOps/s | |
test_memmaptd_index_op | 0.9691ms | 0.5936ms | 1.6846 KOps/s | 1.8925 KOps/s | |
test_serialize_model | 0.1243s | 0.1183s | 8.4565 Ops/s | 7.2892 Ops/s | |
test_serialize_model_pickle | 0.4449s | 0.3867s | 2.5863 Ops/s | 2.5222 Ops/s | |
test_serialize_weights | 0.2163s | 0.1285s | 7.7837 Ops/s | 8.6907 Ops/s | |
test_serialize_weights_returnearly | 0.1727s | 0.1611s | 6.2055 Ops/s | 6.4318 Ops/s | |
test_serialize_weights_pickle | 0.5395s | 0.4133s | 2.4196 Ops/s | 2.3853 Ops/s | |
test_serialize_weights_filesystem | 0.1526s | 0.1455s | 6.8741 Ops/s | 7.1097 Ops/s | |
test_serialize_model_filesystem | 0.2464s | 0.1648s | 6.0681 Ops/s | 6.5717 Ops/s | |
test_reshape_pytree | 57.5870μs | 27.2593μs | 36.6848 KOps/s | 37.2877 KOps/s | |
test_reshape_td | 65.0310μs | 34.1799μs | 29.2569 KOps/s | 29.3257 KOps/s | |
test_view_pytree | 59.6410μs | 26.9751μs | 37.0712 KOps/s | 37.4536 KOps/s | |
test_view_td | 75.4100μs | 38.3446μs | 26.0793 KOps/s | 26.2070 KOps/s | |
test_unbind_pytree | 64.3290μs | 29.8653μs | 33.4837 KOps/s | 33.9353 KOps/s | |
test_unbind_td | 0.3039ms | 39.6677μs | 25.2095 KOps/s | 25.7181 KOps/s | |
test_split_pytree | 63.9890μs | 29.6985μs | 33.6718 KOps/s | 33.8661 KOps/s | |
test_split_td | 0.2067ms | 45.6171μs | 21.9216 KOps/s | 21.9715 KOps/s | |
test_add_pytree | 81.2400μs | 37.2355μs | 26.8561 KOps/s | 28.4590 KOps/s | |
test_add_td | 0.1148ms | 58.7693μs | 17.0157 KOps/s | 18.7441 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1607ms | 64.2053μs | 15.5750 KOps/s | 16.3743 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3950ms | 0.1654ms | 6.0459 KOps/s | 6.2645 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1328ms | 46.5382μs | 21.4877 KOps/s | 21.7196 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2602ms | 0.1221ms | 8.1919 KOps/s | 8.3725 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 69.8100μs | 25.8043μs | 38.7532 KOps/s | 39.3605 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1026ms | 54.8479μs | 18.2322 KOps/s | 18.0373 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1453ms | 79.5278μs | 12.5742 KOps/s | 12.4505 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.3023ms | 68.7026μs | 14.5555 KOps/s | 14.6383 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2008ms | 0.1053ms | 9.5005 KOps/s | 9.6469 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3019ms | 0.2037ms | 4.9102 KOps/s | 5.0775 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1079ms | 46.5336μs | 21.4899 KOps/s | 22.0547 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.5176ms | 64.3257μs | 15.5459 KOps/s | 16.3319 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2122ms | 0.1031ms | 9.6947 KOps/s | 9.8141 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3756ms | 0.2021ms | 4.9478 KOps/s | 4.9565 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3480ms | 0.2145ms | 4.6620 KOps/s | 4.7702 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2143ms | 0.1053ms | 9.4946 KOps/s | 9.5689 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1324ms | 56.9702μs | 17.5530 KOps/s | 18.3253 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1108ms | 47.3693μs | 21.1107 KOps/s | 21.6197 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.2751ms | 0.1585ms | 6.3109 KOps/s | 6.2572 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1853ms | 0.1041ms | 9.6049 KOps/s | 9.7750 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 65.5010μs | 22.0818μs | 45.2861 KOps/s | 47.6318 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1177ms | 58.8536μs | 16.9913 KOps/s | 17.2759 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1576ms | 80.8499μs | 12.3686 KOps/s | 12.2851 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1576ms | 67.9178μs | 14.7237 KOps/s | 14.3709 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.9466ms | 0.2077ms | 4.8142 KOps/s | 4.9155 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.6174ms | 1.2937ms | 772.9775 Ops/s | 788.9275 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2696ms | 0.2051ms | 4.8762 KOps/s | 4.9441 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.9825ms | 0.7780ms | 1.2854 KOps/s | 1.2973 KOps/s | |
test_compile_assign_and_add_stack[compile] | 1.0712ms | 0.4550ms | 2.1980 KOps/s | 2.1916 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.0371ms | 2.7312ms | 366.1328 Ops/s | 405.8608 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1047ms | 37.2276μs | 26.8618 KOps/s | 27.5552 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.4895ms | 34.2475μs | 29.1992 KOps/s | 29.1843 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 79.2170μs | 29.8932μs | 33.4524 KOps/s | 32.8383 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 58.0980μs | 23.7158μs | 42.1659 KOps/s | 41.8450 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1016ms | 30.7656μs | 32.5039 KOps/s | 32.6025 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 65.7120μs | 23.7495μs | 42.1062 KOps/s | 41.7753 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1160ms | 52.9688μs | 18.8790 KOps/s | 18.8510 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5569ms | 21.1010μs | 47.3911 KOps/s | 47.9884 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1115ms | 45.0081μs | 22.2182 KOps/s | 21.6887 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 71.4530μs | 19.2256μs | 52.0141 KOps/s | 52.1396 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1023ms | 45.5771μs | 21.9409 KOps/s | 21.4696 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.6586ms | 19.2578μs | 51.9270 KOps/s | 52.2102 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1195ms | 53.5007μs | 18.6913 KOps/s | 18.4950 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9009ms | 21.2542μs | 47.0496 KOps/s | 48.1536 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 91.1390μs | 45.0088μs | 22.2179 KOps/s | 21.3494 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.3044ms | 19.1141μs | 52.3175 KOps/s | 52.6697 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1078ms | 45.6273μs | 21.9167 KOps/s | 21.2999 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1423ms | 19.1349μs | 52.2604 KOps/s | 52.3012 KOps/s | |
test_mod_add[eager] | 81.9520μs | 35.5461μs | 28.1325 KOps/s | 29.8662 KOps/s | |
test_mod_add[compile] | 0.1266ms | 47.9328μs | 20.8625 KOps/s | 20.3429 KOps/s | |
test_mod_add[compile-overhead] | 90.4780μs | 48.3607μs | 20.6780 KOps/s | 20.6633 KOps/s | |
test_mod_wrap[eager] | 0.4395ms | 0.2326ms | 4.2996 KOps/s | 4.4299 KOps/s | |
test_mod_wrap[compile] | 0.9888ms | 0.2166ms | 4.6170 KOps/s | 4.7645 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3200ms | 0.2119ms | 4.7195 KOps/s | 4.7534 KOps/s | |
test_mod_wrap_and_backward[eager] | 15.5471ms | 11.5539ms | 86.5511 Ops/s | 81.4619 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.5645ms | 13.0100ms | 76.8637 Ops/s | 77.8761 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 18.3744ms | 12.9206ms | 77.3958 Ops/s | 75.3301 Ops/s | |
test_seq_add[eager] | 0.2474ms | 0.1166ms | 8.5781 KOps/s | 9.0712 KOps/s | |
test_seq_add[compile] | 0.1371ms | 63.8506μs | 15.6616 KOps/s | 15.9303 KOps/s | |
test_seq_add[compile-overhead] | 0.7999ms | 61.5305μs | 16.2521 KOps/s | 16.1486 KOps/s | |
test_seq_wrap[eager] | 0.6457ms | 0.4570ms | 2.1884 KOps/s | 2.2621 KOps/s | |
test_seq_wrap[compile] | 0.3538ms | 0.2351ms | 4.2535 KOps/s | 4.2589 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3743ms | 0.2334ms | 4.2839 KOps/s | 4.2548 KOps/s | |
test_func_call_runtime[False-eager] | 0.8231ms | 0.5752ms | 1.7385 KOps/s | 1.7999 KOps/s | |
test_func_call_runtime[False-compile] | 0.7075ms | 0.4421ms | 2.2618 KOps/s | 2.3475 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5591ms | 0.4375ms | 2.2856 KOps/s | 2.3269 KOps/s | |
test_func_call_runtime[True-eager] | 1.8402ms | 0.7956ms | 1.2569 KOps/s | 1.2730 KOps/s | |
test_func_call_runtime[True-compile] | 0.5680ms | 0.4746ms | 2.1068 KOps/s | 2.1369 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.7264ms | 0.4752ms | 2.1046 KOps/s | 2.1434 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9879ms | 0.5738ms | 1.7427 KOps/s | 1.8251 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.7971ms | 0.4393ms | 2.2763 KOps/s | 2.3312 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 2.2860ms | 0.4406ms | 2.2697 KOps/s | 2.3335 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1084ms | 0.9179ms | 1.0895 KOps/s | 1.1020 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6628ms | 0.5011ms | 1.9955 KOps/s | 2.0128 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.7484ms | 0.5054ms | 1.9788 KOps/s | 2.0271 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.7492ms | 1.9546ms | 511.6053 Ops/s | 524.8065 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9051ms | 0.5228ms | 1.9129 KOps/s | 1.8919 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8841ms | 0.5311ms | 1.8828 KOps/s | 1.8975 KOps/s | |
test_distributed | 0.2417ms | 0.1265ms | 7.9033 KOps/s | 7.8155 KOps/s | |
test_tdmodule | 0.1016ms | 27.5056μs | 36.3562 KOps/s | 38.5866 KOps/s | |
test_tdmodule_dispatch | 72.1640μs | 49.4203μs | 20.2346 KOps/s | 20.8685 KOps/s | |
test_tdseq | 54.9720μs | 26.7326μs | 37.4075 KOps/s | 38.6496 KOps/s | |
test_tdseq_dispatch | 80.0680μs | 51.4797μs | 19.4251 KOps/s | 20.5199 KOps/s | |
test_instantiation_functorch | 2.3563ms | 1.5799ms | 632.9348 Ops/s | 641.8520 Ops/s | |
test_exec_functorch | 0.4171ms | 0.1867ms | 5.3548 KOps/s | 5.5244 KOps/s | |
test_exec_functional_call | 0.3267ms | 0.1800ms | 5.5552 KOps/s | 5.7343 KOps/s | |
test_exec_td_decorator | 0.4548ms | 0.2378ms | 4.2053 KOps/s | 4.3891 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 3.2809ms | 0.6689ms | 1.4949 KOps/s | 1.5055 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9024ms | 0.6635ms | 1.5071 KOps/s | 1.5232 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8558ms | 0.5343ms | 1.8716 KOps/s | 1.8463 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 1.0046ms | 0.5361ms | 1.8654 KOps/s | 1.8635 KOps/s | |
test_to_module_speed[True] | 1.4578ms | 1.2974ms | 770.7530 Ops/s | 749.0675 Ops/s | |
test_to_module_speed[False] | 1.5621ms | 1.2688ms | 788.1686 Ops/s | 768.2497 Ops/s | |
test_tc_init | 0.1178ms | 48.3218μs | 20.6946 KOps/s | 22.5460 KOps/s | |
test_tc_init_nested | 0.2050ms | 96.7554μs | 10.3353 KOps/s | 11.0816 KOps/s | |
test_tc_first_layer_tensor | 26.6390μs | 1.5468μs | 646.4843 KOps/s | 675.5937 KOps/s | |
test_tc_first_layer_nontensor | 24.7950μs | 4.7372μs | 211.0929 KOps/s | 214.6859 KOps/s | |
test_tc_second_layer_tensor | 42.2780μs | 2.8878μs | 346.2817 KOps/s | 350.9760 KOps/s | |
test_tc_second_layer_nontensor | 30.2660μs | 6.1318μs | 163.0846 KOps/s | 168.8981 KOps/s | |
test_unbind | 0.2145s | 13.8215ms | 72.3511 Ops/s | 77.9448 Ops/s | |
test_full_like | 8.2832ms | 7.2621ms | 137.7022 Ops/s | 82.5529 Ops/s | |
test_zeros_like | 3.9830ms | 2.7601ms | 362.3106 Ops/s | 138.7297 Ops/s | |
test_ones_like | 3.8962ms | 3.3449ms | 298.9628 Ops/s | 125.4464 Ops/s | |
test_clone | 5.5158ms | 4.9805ms | 200.7829 Ops/s | 105.2850 Ops/s | |
test_squeeze | 89.0680μs | 12.0025μs | 83.3158 KOps/s | 83.2009 KOps/s | |
test_unsqueeze | 0.1491ms | 90.3162μs | 11.0722 KOps/s | 11.0553 KOps/s | |
test_split | 0.4592ms | 0.1963ms | 5.0946 KOps/s | 5.0541 KOps/s | |
test_permute | 0.3627ms | 0.2236ms | 4.4726 KOps/s | 4.5035 KOps/s | |
test_stack | 27.8286ms | 25.4220ms | 39.3360 Ops/s | 39.7345 Ops/s | |
test_cat | 27.8287ms | 25.3544ms | 39.4409 Ops/s | 39.8760 Ops/s |
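
The Change column is empty in the table as captured here. As a minimal sketch, a relative change could be derived from the two throughput columns. This assumes the change is defined as `Ops / Ops on Repo HEAD - 1`. That definition and the helper names `to_ops_per_sec` and `relative_change` are illustrative assumptions, not taken from the benchmark tooling.

```python
# Unit multipliers for the throughput columns, as the units appear in the table.
_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def to_ops_per_sec(cell: str) -> float:
    """Convert a cell such as '285.1304 KOps/s' to plain operations per second."""
    value, unit = cell.split()
    return float(value) * _SCALE[unit]


def relative_change(row: str) -> tuple:
    """Return (test name, Ops / Ops-on-HEAD - 1) for one pipe-delimited row.

    Assumed column order: Name | Max | Mean | Ops | Ops on Repo HEAD | Change.
    """
    cells = [c.strip() for c in row.split("|")]
    name, ops, head = cells[0], cells[3], cells[4]
    return name, to_ops_per_sec(ops) / to_ops_per_sec(head) - 1.0


# Example with one row copied from the table above.
name, change = relative_change(
    "test_keys | 47.2080μs | 3.5072μs | 285.1304 KOps/s | 211.1447 KOps/s | |"
)
print(f"{name}: {change:+.1%}")
```

A positive value means the PR branch ran more operations per second than repo HEAD. Small differences in single runs like these may well be noise rather than a real regression or speed-up.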

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 91.9210μs | 10.2942μs | 97.1426 KOps/s | 99.6123 KOps/s | |
test_plain_set_stack_nested | 33.2700μs | 10.3596μs | 96.5288 KOps/s | 98.7714 KOps/s | |
test_plain_set_nested_inplace | 50.1100μs | 11.2708μs | 88.7252 KOps/s | 91.5348 KOps/s | |
test_plain_set_stack_nested_inplace | 0.1065ms | 11.2190μs | 89.1341 KOps/s | 90.8845 KOps/s | |
test_items | 0.1085ms | 2.8417μs | 351.9050 KOps/s | 345.8692 KOps/s | |
test_items_nested | 0.4143ms | 0.3525ms | 2.8369 KOps/s | 2.8150 KOps/s | |
test_items_nested_locked | 0.4574ms | 0.3573ms | 2.7990 KOps/s | 2.8125 KOps/s | |
test_items_nested_leaf | 0.1431ms | 58.0829μs | 17.2168 KOps/s | 17.2641 KOps/s | |
test_items_stack_nested | 0.4456ms | 0.3488ms | 2.8668 KOps/s | 2.7667 KOps/s | |
test_items_stack_nested_leaf | 0.1552ms | 59.0609μs | 16.9317 KOps/s | 17.1014 KOps/s | |
test_items_stack_nested_locked | 0.4469ms | 0.3578ms | 2.7946 KOps/s | 2.7953 KOps/s | |
test_keys | 0.1063ms | 3.4352μs | 291.1023 KOps/s | 290.7129 KOps/s | |
test_keys_nested | 0.2267ms | 69.2634μs | 14.4376 KOps/s | 14.2908 KOps/s | |
test_keys_nested_locked | 0.8067ms | 74.6648μs | 13.3932 KOps/s | 13.1439 KOps/s | |
test_keys_nested_leaf | 0.1444ms | 61.6102μs | 16.2311 KOps/s | 16.2369 KOps/s | |
test_keys_stack_nested | 0.1601ms | 69.5913μs | 14.3696 KOps/s | 14.0669 KOps/s | |
test_keys_stack_nested_leaf | 92.0720μs | 60.8444μs | 16.4354 KOps/s | 16.1259 KOps/s | |
test_keys_stack_nested_locked | 0.1183ms | 74.9778μs | 13.3373 KOps/s | 13.0987 KOps/s | |
test_values | 13.0518μs | 0.8439μs | 1.1849 MOps/s | 1.1867 MOps/s | |
test_values_nested | 0.1602ms | 31.0793μs | 32.1757 KOps/s | 32.3867 KOps/s | |
test_values_nested_locked | 0.1115ms | 32.5137μs | 30.7563 KOps/s | 30.7476 KOps/s | |
test_values_nested_leaf | 0.1173ms | 33.6533μs | 29.7148 KOps/s | 29.8059 KOps/s | |
test_values_stack_nested | 0.1175ms | 31.6454μs | 31.6002 KOps/s | 31.8482 KOps/s | |
test_values_stack_nested_leaf | 75.1410μs | 34.0900μs | 29.3341 KOps/s | 29.5559 KOps/s | |
test_values_stack_nested_locked | 0.1247ms | 32.9365μs | 30.3614 KOps/s | 30.1136 KOps/s | |
test_membership | 3.9861μs | 0.5052μs | 1.9795 MOps/s | 1.9706 MOps/s | |
test_membership_nested | 62.6410μs | 1.9500μs | 512.8123 KOps/s | 489.3173 KOps/s | |
test_membership_nested_leaf | 16.2800μs | 1.9885μs | 502.9000 KOps/s | 515.2183 KOps/s | |
test_membership_stacked_nested | 41.2200μs | 2.0068μs | 498.2973 KOps/s | 486.0713 KOps/s | |
test_membership_stacked_nested_leaf | 25.5200μs | 2.0541μs | 486.8276 KOps/s | 493.3299 KOps/s | |
test_membership_nested_last | 98.5220μs | 2.8655μs | 348.9735 KOps/s | 343.8629 KOps/s | |
test_membership_nested_leaf_last | 61.1200μs | 2.8755μs | 347.7689 KOps/s | 344.7163 KOps/s | |
test_membership_stacked_nested_last | 0.1652ms | 2.8806μs | 347.1469 KOps/s | 341.5226 KOps/s | |
test_membership_stacked_nested_leaf_last | 57.6800μs | 2.8582μs | 349.8678 KOps/s | 338.4986 KOps/s | |
test_nested_getleaf | 0.1643ms | 6.1313μs | 163.0987 KOps/s | 163.3020 KOps/s | |
test_nested_get | 95.5920μs | 5.8518μs | 170.8871 KOps/s | 172.1732 KOps/s | |
test_stacked_getleaf | 31.9700μs | 6.1065μs | 163.7607 KOps/s | 162.6953 KOps/s | |
test_stacked_get | 55.7410μs | 5.7964μs | 172.5197 KOps/s | 171.7724 KOps/s | |
test_nested_getitemleaf | 0.1875ms | 6.1619μs | 162.2873 KOps/s | 160.2632 KOps/s | |
test_nested_getitem | 37.1700μs | 5.8737μs | 170.2501 KOps/s | 170.1246 KOps/s | |
test_stacked_getitemleaf | 0.1967ms | 6.1937μs | 161.4556 KOps/s | 160.0581 KOps/s | |
test_stacked_getitem | 0.1825ms | 5.8695μs | 170.3710 KOps/s | 169.5711 KOps/s | |
test_lock_nested | 10.6577ms | 0.3708ms | 2.6970 KOps/s | 2.6808 KOps/s | |
test_lock_stack_nested | 0.4715ms | 0.3322ms | 3.0105 KOps/s | 2.9815 KOps/s | |
test_unlock_nested | 0.6062ms | 0.3025ms | 3.3053 KOps/s | 3.2686 KOps/s | |
test_unlock_stack_nested | 0.3943ms | 0.2732ms | 3.6609 KOps/s | 3.6462 KOps/s | |
test_flatten_speed | 0.1599ms | 74.4430μs | 13.4331 KOps/s | 13.3899 KOps/s | |
test_unflatten_speed | 0.4971ms | 0.3077ms | 3.2500 KOps/s | 3.2968 KOps/s | |
test_common_ops | 1.7063ms | 0.5729ms | 1.7455 KOps/s | 1.7288 KOps/s | |
test_creation | 95.6620μs | 1.4823μs | 674.6211 KOps/s | 677.0954 KOps/s | |
test_creation_empty | 0.1044ms | 6.7522μs | 148.1005 KOps/s | 156.9377 KOps/s | |
test_creation_nested_1 | 37.0800μs | 8.2727μs | 120.8789 KOps/s | 126.8228 KOps/s | |
test_creation_nested_2 | 94.5320μs | 10.7629μs | 92.9115 KOps/s | 96.5289 KOps/s | |
test_clone | 57.8510μs | 10.2702μs | 97.3689 KOps/s | 90.7283 KOps/s | |
test_getitem[int] | 75.2110μs | 10.7227μs | 93.2605 KOps/s | 89.5625 KOps/s | |
test_getitem[slice_int] | 1.6166ms | 20.6730μs | 48.3723 KOps/s | 46.8509 KOps/s | |
test_getitem[range] | 0.1020s | 55.3242μs | 18.0753 KOps/s | 25.6763 KOps/s | |
test_getitem[tuple] | 0.1509ms | 17.8924μs | 55.8897 KOps/s | 53.1055 KOps/s | |
test_getitem[list] | 0.2934ms | 33.1596μs | 30.1572 KOps/s | 29.7870 KOps/s | |
test_setitem_dim[int] | 31.1610μs | 17.8957μs | 55.8793 KOps/s | 51.7385 KOps/s | |
test_setitem_dim[slice_int] | 76.9810μs | 37.1428μs | 26.9231 KOps/s | 25.5573 KOps/s | |
test_setitem_dim[range] | 0.1197ms | 54.3843μs | 18.3877 KOps/s | 18.2899 KOps/s | |
test_setitem_dim[tuple] | 55.5310μs | 31.8223μs | 31.4245 KOps/s | 30.2340 KOps/s | |
test_setitem | 89.0910μs | 13.8922μs | 71.9827 KOps/s | 67.9487 KOps/s | |
test_set | 0.1326ms | 13.5710μs | 73.6864 KOps/s | 71.6399 KOps/s | |
test_set_shared | 1.5709ms | 0.1469ms | 6.8058 KOps/s | 6.7795 KOps/s | |
test_update | 0.4654ms | 15.6305μs | 63.9773 KOps/s | 60.8393 KOps/s | |
test_update_nested | 80.3910μs | 20.2966μs | 49.2694 KOps/s | 47.6662 KOps/s | |
test_update__nested | 1.1741ms | 23.7120μs | 42.1728 KOps/s | 40.5717 KOps/s | |
test_set_nested | 0.1555ms | 14.6068μs | 68.4611 KOps/s | 65.9406 KOps/s | |
test_set_nested_new | 81.2110μs | 16.6643μs | 60.0085 KOps/s | 56.6373 KOps/s | |
test_select | 0.1184ms | 28.2903μs | 35.3478 KOps/s | 35.0060 KOps/s | |
test_select_nested | 0.1255ms | 41.8979μs | 23.8675 KOps/s | 24.2540 KOps/s | |
test_exclude_nested | 0.3774ms | 61.7814μs | 16.1861 KOps/s | 16.5142 KOps/s | |
test_empty[True] | 0.3421ms | 0.2727ms | 3.6664 KOps/s | 3.6336 KOps/s | |
test_empty[False] | 7.8541μs | 0.7474μs | 1.3380 MOps/s | 1.3506 MOps/s | |
test_to | 0.1031ms | 58.0094μs | 17.2386 KOps/s | 18.2536 KOps/s | |
test_to_nonblocking | 0.1954ms | 45.5245μs | 21.9662 KOps/s | 22.8986 KOps/s | |
test_unbind_speed | 0.3308ms | 0.2272ms | 4.4021 KOps/s | 4.2438 KOps/s | |
test_unbind_speed_stack0 | 0.2816ms | 0.2301ms | 4.3458 KOps/s | 4.3208 KOps/s | |
test_unbind_speed_stack1 | 0.1033s | 0.6530ms | 1.5314 KOps/s | 1.5227 KOps/s | |
test_split | 0.1020s | 1.5796ms | 633.0806 Ops/s | 559.7721 Ops/s | |
test_chunk | 0.1027s | 1.5868ms | 630.1870 Ops/s | 672.1917 Ops/s | |
test_consolidate[False-None] | 0.1069s | 2.8798ms | 347.2485 Ops/s | 347.0213 Ops/s | |
test_consolidate[default-None] | 1.9048ms | 1.7488ms | 571.8077 Ops/s | 595.7732 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.9183ms | 1.7641ms | 566.8467 Ops/s | 585.2423 Ops/s | |
test_consolidate_njt[False-None] | 6.8048ms | 6.4463ms | 155.1281 Ops/s | 154.6559 Ops/s | |
test_to[False-False-None] | 1.8520ms | 1.6502ms | 605.9733 Ops/s | 613.2738 Ops/s | |
test_to[True-False-None] | 1.5144ms | 1.2851ms | 778.1248 Ops/s | 771.7080 Ops/s | |
test_to[within-False-None] | 4.2455ms | 4.0076ms | 249.5243 Ops/s | 247.5674 Ops/s | |
test_to[True-default-None] | 5.3140ms | 5.1398ms | 194.5615 Ops/s | 193.2542 Ops/s | |
test_to_njt[False-False-None] | 7.0914ms | 6.8934ms | 145.0668 Ops/s | 142.3275 Ops/s | |
test_to_njt[True-False-None] | 5.6321ms | 5.3924ms | 185.4470 Ops/s | 180.0562 Ops/s | |
test_to_njt[within-False-None] | 12.0175ms | 11.8541ms | 84.3592 Ops/s | 82.0285 Ops/s | |
test_creation[device0] | 0.4569ms | 78.2061μs | 12.7867 KOps/s | 12.3053 KOps/s | |
test_creation_from_tensor | 0.5383ms | 82.2023μs | 12.1651 KOps/s | 11.7025 KOps/s | |
test_add_one[memmap_tensor0] | 0.2342ms | 6.8410μs | 146.1779 KOps/s | 139.6865 KOps/s | |
test_contiguous[memmap_tensor0] | 1.8920μs | 0.4075μs | 2.4543 MOps/s | 2.4104 MOps/s | |
test_stack[memmap_tensor0] | 37.9100μs | 4.4640μs | 224.0145 KOps/s | 214.2781 KOps/s | |
test_memmaptd_index | 1.9011ms | 0.2489ms | 4.0180 KOps/s | 3.9181 KOps/s | |
test_memmaptd_index_astensor | 0.9419ms | 0.3069ms | 3.2586 KOps/s | 3.1977 KOps/s | |
test_memmaptd_index_op | 1.0245ms | 0.5656ms | 1.7681 KOps/s | 1.7351 KOps/s | |
test_serialize_model | 0.1317s | 0.1309s | 7.6407 Ops/s | 7.6262 Ops/s | |
test_serialize_model_pickle | 1.3512s | 1.2161s | 0.8223 Ops/s | 0.8231 Ops/s | |
test_serialize_weights | 0.4763s | 0.1803s | 5.5477 Ops/s | 7.6685 Ops/s | |
test_serialize_weights_returnearly | 0.3737s | 54.5890ms | 18.3187 Ops/s | 14.7180 Ops/s | |
test_serialize_weights_pickle | 1.3482s | 1.1922s | 0.8388 Ops/s | 0.8197 Ops/s | |
test_reshape_pytree | 0.1646ms | 22.1476μs | 45.1516 KOps/s | 45.2872 KOps/s | |
test_reshape_td | 56.5710μs | 26.0790μs | 38.3451 KOps/s | 37.9635 KOps/s | |
test_view_pytree | 0.1797ms | 22.1195μs | 45.2089 KOps/s | 45.4138 KOps/s | |
test_view_td | 0.1216ms | 29.4059μs | 34.0068 KOps/s | 32.5049 KOps/s | |
test_unbind_pytree | 0.1720ms | 28.0115μs | 35.6996 KOps/s | 35.4487 KOps/s | |
test_unbind_td | 0.7567ms | 34.8015μs | 28.7344 KOps/s | 28.5988 KOps/s | |
test_split_pytree | 0.1256ms | 29.7320μs | 33.6338 KOps/s | 33.2255 KOps/s | |
test_split_td | 0.9590ms | 36.5718μs | 27.3435 KOps/s | 25.2400 KOps/s | |
test_add_pytree | 0.2013ms | 33.9915μs | 29.4191 KOps/s | 28.2866 KOps/s | |
test_add_td | 0.4292ms | 44.7624μs | 22.3402 KOps/s | 23.2265 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2828ms | 0.1189ms | 8.4117 KOps/s | 8.1757 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.5369ms | 0.1225ms | 8.1625 KOps/s | 8.1379 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2762ms | 94.5466μs | 10.5768 KOps/s | 10.0710 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.5406ms | 0.1454ms | 6.8795 KOps/s | 6.8601 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.4240ms | 20.9259μs | 47.7876 KOps/s | 43.9026 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.5498ms | 26.3049μs | 38.0158 KOps/s | 37.1509 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3046ms | 64.3308μs | 15.5446 KOps/s | 15.5084 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.4373ms | 49.3944μs | 20.2452 KOps/s | 20.1167 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3154ms | 0.1415ms | 7.0677 KOps/s | 6.9038 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3557ms | 0.2064ms | 4.8446 KOps/s | 4.9256 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2499ms | 97.7640μs | 10.2287 KOps/s | 10.2056 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1976ms | 49.8090μs | 20.0767 KOps/s | 19.1280 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2866ms | 0.1356ms | 7.3734 KOps/s | 7.3033 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6318ms | 0.4751ms | 2.1048 KOps/s | 2.1506 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3887ms | 0.2468ms | 4.0522 KOps/s | 4.1121 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2936ms | 0.1431ms | 6.9900 KOps/s | 6.8498 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2093ms | 60.1453μs | 16.6264 KOps/s | 16.6229 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2352ms | 98.7413μs | 10.1275 KOps/s | 10.0560 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5740ms | 0.4025ms | 2.4842 KOps/s | 2.5543 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3164ms | 0.1347ms | 7.4227 KOps/s | 7.3082 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1465ms | 17.5968μs | 56.8285 KOps/s | 56.5239 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.4286ms | 26.5270μs | 37.6975 KOps/s | 37.9917 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.4721ms | 69.6209μs | 14.3635 KOps/s | 14.2789 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.4232ms | 50.9887μs | 19.6122 KOps/s | 19.4665 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6592ms | 0.4493ms | 2.2255 KOps/s | 2.0993 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7664ms | 2.5337ms | 394.6820 Ops/s | 397.7666 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6462ms | 0.4428ms | 2.2581 KOps/s | 2.1938 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7573ms | 2.5439ms | 393.0910 Ops/s | 396.8322 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.5556ms | 0.1144ms | 8.7447 KOps/s | 8.6937 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5897ms | 80.9450μs | 12.3541 KOps/s | 12.2669 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.5127ms | 0.1083ms | 9.2373 KOps/s | 9.1442 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2553ms | 66.4236μs | 15.0549 KOps/s | 14.4048 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.3175ms | 0.1091ms | 9.1642 KOps/s | 9.4592 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2715ms | 70.0009μs | 14.2855 KOps/s | 15.1991 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2822ms | 0.1007ms | 9.9282 KOps/s | 9.8064 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.2008ms | 16.7946μs | 59.5429 KOps/s | 54.6702 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2430ms | 96.6380μs | 10.3479 KOps/s | 10.2595 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1458ms | 15.6098μs | 64.0623 KOps/s | 62.2046 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2445ms | 97.7196μs | 10.2334 KOps/s | 10.0712 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1482ms | 15.6611μs | 63.8524 KOps/s | 62.3706 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2813ms | 0.1022ms | 9.7857 KOps/s | 9.7572 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6956ms | 16.7912μs | 59.5552 KOps/s | 53.6087 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2596ms | 0.1008ms | 9.9193 KOps/s | 10.1917 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1984ms | 15.5670μs | 64.2383 KOps/s | 62.7277 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2816ms | 0.1006ms | 9.9414 KOps/s | 10.1929 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1636ms | 15.7196μs | 63.6148 KOps/s | 62.9656 KOps/s | |
test_mod_add[eager] | 0.1894ms | 35.7958μs | 27.9363 KOps/s | 27.3368 KOps/s | |
test_mod_add[compile] | 0.2411ms | 79.6635μs | 12.5528 KOps/s | 12.1751 KOps/s | |
test_mod_add[compile-overhead] | 0.3321ms | 0.1674ms | 5.9728 KOps/s | 5.6646 KOps/s | |
test_mod_wrap[eager] | 0.3978ms | 0.2461ms | 4.0642 KOps/s | 4.0801 KOps/s | |
test_mod_wrap[compile] | 0.6838ms | 0.2917ms | 3.4288 KOps/s | 3.5032 KOps/s | |
test_mod_wrap[compile-overhead] | 7.6153ms | 3.8294ms | 261.1350 Ops/s | 264.9230 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.7141ms | 1.3422ms | 745.0584 Ops/s | 698.3217 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4819ms | 1.2541ms | 797.3875 Ops/s | 733.5779 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4624ms | 0.9459ms | 1.0572 KOps/s | 860.8678 Ops/s | |
test_seq_add[eager] | 0.2657ms | 0.1113ms | 8.9856 KOps/s | 9.2229 KOps/s | |
test_seq_add[compile] | 0.2723ms | 89.9617μs | 11.1158 KOps/s | 11.3720 KOps/s | |
test_seq_add[compile-overhead] | 0.2861ms | 0.1285ms | 7.7813 KOps/s | 7.7566 KOps/s | |
test_seq_wrap[eager] | 0.5500ms | 0.4071ms | 2.4563 KOps/s | 2.3818 KOps/s | |
test_seq_wrap[compile] | 0.4819ms | 0.2968ms | 3.3691 KOps/s | 3.2574 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3960ms | 0.2244ms | 4.4565 KOps/s | 4.3582 KOps/s | |
test_func_call_runtime[False-eager] | 0.8918ms | 0.7352ms | 1.3602 KOps/s | 1.3581 KOps/s | |
test_func_call_runtime[False-compile] | 0.9253ms | 0.7323ms | 1.3656 KOps/s | 1.3417 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5078ms | 0.3632ms | 2.7531 KOps/s | 2.6952 KOps/s | |
test_func_call_runtime[True-eager] | 1.0871ms | 0.8960ms | 1.1161 KOps/s | 1.0972 KOps/s | |
test_func_call_runtime[True-compile] | 0.9233ms | 0.7512ms | 1.3313 KOps/s | 1.3063 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5702ms | 0.3925ms | 2.5478 KOps/s | 2.5578 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9058ms | 0.7432ms | 1.3455 KOps/s | 1.3190 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.2448ms | 0.7456ms | 1.3412 KOps/s | 1.3383 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5064ms | 0.3697ms | 2.7048 KOps/s | 2.6839 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1673ms | 1.0087ms | 991.3948 Ops/s | 997.1968 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9681ms | 0.7973ms | 1.2542 KOps/s | 1.2460 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5459ms | 0.4102ms | 2.4380 KOps/s | 2.3796 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5378ms | 2.0723ms | 482.5621 Ops/s | 477.8999 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9701ms | 0.7992ms | 1.2512 KOps/s | 1.2457 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5381ms | 0.4160ms | 2.4037 KOps/s | 2.3987 KOps/s | |
test_distributed | 2.7125ms | 0.3557ms | 2.8117 KOps/s | 8.7843 KOps/s | |
test_tdmodule | 59.0910μs | 17.9513μs | 55.7063 KOps/s | 54.6504 KOps/s | |
test_tdmodule_dispatch | 75.4610μs | 32.2175μs | 31.0390 KOps/s | 30.8092 KOps/s | |
test_tdseq | 37.1000μs | 17.7671μs | 56.2837 KOps/s | 55.1826 KOps/s | |
test_tdseq_dispatch | 58.2810μs | 34.6220μs | 28.8833 KOps/s | 29.0791 KOps/s | |
test_instantiation_functorch | 1.6660ms | 1.5266ms | 655.0591 Ops/s | 627.2658 Ops/s | |
test_exec_functorch | 0.2818ms | 0.1418ms | 7.0540 KOps/s | 6.7572 KOps/s | |
test_exec_functional_call | 0.2757ms | 0.1377ms | 7.2634 KOps/s | 7.2086 KOps/s | |
test_exec_td_decorator | 0.3723ms | 0.1838ms | 5.4404 KOps/s | 5.3894 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8594ms | 0.6762ms | 1.4790 KOps/s | 1.4918 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8823ms | 0.6835ms | 1.4630 KOps/s | 1.4921 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7807ms | 0.6180ms | 1.6180 KOps/s | 1.7152 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7643ms | 0.6179ms | 1.6185 KOps/s | 1.7118 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.9149ms | 19.0325ms | 52.5418 Ops/s | 53.4063 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.8581ms | 19.1663ms | 52.1750 Ops/s | 53.2579 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.8237ms | 18.9609ms | 52.7402 Ops/s | 53.7624 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.7092ms | 19.0160ms | 52.5874 Ops/s | 53.5694 Ops/s | |
test_to_module_speed[True] | 1.1079ms | 0.9308ms | 1.0744 KOps/s | 1.0691 KOps/s | |
test_to_module_speed[False] | 1.2979ms | 0.8941ms | 1.1184 KOps/s | 1.0988 KOps/s | |
test_tc_init | 0.1825ms | 32.4692μs | 30.7984 KOps/s | 29.5718 KOps/s | |
test_tc_init_nested | 0.1492ms | 65.0693μs | 15.3682 KOps/s | 14.6023 KOps/s | |
test_tc_first_layer_tensor | 13.5601μs | 0.7049μs | 1.4187 MOps/s | 1.4326 MOps/s | |
test_tc_first_layer_nontensor | 43.4300μs | 2.3208μs | 430.8829 KOps/s | 436.1841 KOps/s | |
test_tc_second_layer_tensor | 24.6002μs | 1.4126μs | 707.9330 KOps/s | 711.2164 KOps/s | |
test_tc_second_layer_nontensor | 97.4710μs | 3.0347μs | 329.5231 KOps/s | 333.9562 KOps/s | |
test_unbind | 0.2633s | 10.5487ms | 94.7988 Ops/s | 146.2318 Ops/s | |
test_full_like | 11.5988ms | 10.2838ms | 97.2407 Ops/s | 96.7579 Ops/s | |
test_zeros_like | 5.1602ms | 4.5531ms | 219.6329 Ops/s | 225.4986 Ops/s | |
test_ones_like | 5.2115ms | 4.5657ms | 219.0248 Ops/s | 220.5542 Ops/s | |
test_clone | 13.1463ms | 9.9688ms | 100.3130 Ops/s | 99.8339 Ops/s | |
test_squeeze | 60.3210μs | 9.0531μs | 110.4593 KOps/s | 110.7367 KOps/s | |
test_unsqueeze | 0.1545ms | 68.2429μs | 14.6535 KOps/s | 14.0153 KOps/s | |
test_split | 0.3923ms | 0.1539ms | 6.4957 KOps/s | 6.2506 KOps/s | |
test_permute | 0.3451ms | 0.1732ms | 5.7744 KOps/s | 5.4542 KOps/s | |
test_stack | 53.4670ms | 52.3686ms | 19.0954 Ops/s | 19.1109 Ops/s | |
test_cat | 53.6656ms | 52.2434ms | 19.1412 Ops/s | 19.0762 Ops/s |
This was referenced Nov 29, 2024
Labels: CLA Signed, enhancement
Stack from ghstack (oldest at bottom):