[BugFix] Make min/max tensorclasses be interchangeable with PT equivalent #1180
Merged
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 45.7850μs | 21.3639μs | 46.8079 KOps/s | 46.0753 KOps/s | |
test_plain_set_stack_nested | 49.8730μs | 21.2407μs | 47.0795 KOps/s | 46.4171 KOps/s | |
test_plain_set_nested_inplace | 65.5220μs | 22.9899μs | 43.4974 KOps/s | 42.8984 KOps/s | |
test_plain_set_stack_nested_inplace | 69.3490μs | 23.1430μs | 43.2096 KOps/s | 42.8774 KOps/s | |
test_items | 29.4750μs | 4.1770μs | 239.4089 KOps/s | 245.4476 KOps/s | |
test_items_nested | 0.5973ms | 0.4018ms | 2.4887 KOps/s | 2.5113 KOps/s | |
test_items_nested_locked | 0.7242ms | 0.4037ms | 2.4770 KOps/s | 2.5029 KOps/s | |
test_items_nested_leaf | 0.1335ms | 76.4656μs | 13.0778 KOps/s | 12.9204 KOps/s | |
test_items_stack_nested | 0.6312ms | 0.4069ms | 2.4577 KOps/s | 2.5160 KOps/s | |
test_items_stack_nested_leaf | 0.1553ms | 79.1188μs | 12.6392 KOps/s | 12.4795 KOps/s | |
test_items_stack_nested_locked | 0.5781ms | 0.4088ms | 2.4465 KOps/s | 2.4632 KOps/s | |
test_keys | 28.5540μs | 3.5530μs | 281.4515 KOps/s | 276.9913 KOps/s | |
test_keys_nested | 0.2654ms | 0.1647ms | 6.0718 KOps/s | 5.9931 KOps/s | |
test_keys_nested_locked | 0.6780ms | 0.1717ms | 5.8233 KOps/s | 5.7867 KOps/s | |
test_keys_nested_leaf | 0.2806ms | 0.1454ms | 6.8777 KOps/s | 6.8841 KOps/s | |
test_keys_stack_nested | 0.2771ms | 0.1650ms | 6.0599 KOps/s | 5.9925 KOps/s | |
test_keys_stack_nested_leaf | 0.2289ms | 0.1434ms | 6.9741 KOps/s | 7.0661 KOps/s | |
test_keys_stack_nested_locked | 0.3139ms | 0.1718ms | 5.8213 KOps/s | 5.8704 KOps/s | |
test_values | 5.8936μs | 1.0460μs | 956.0039 KOps/s | 973.5178 KOps/s | |
test_values_nested | 0.1046ms | 63.2333μs | 15.8145 KOps/s | 15.4965 KOps/s | |
test_values_nested_locked | 0.1073ms | 63.4736μs | 15.7546 KOps/s | 16.3150 KOps/s | |
test_values_nested_leaf | 0.1295ms | 71.6078μs | 13.9650 KOps/s | 14.1737 KOps/s | |
test_values_stack_nested | 0.1089ms | 64.3981μs | 15.5284 KOps/s | 15.6484 KOps/s | |
test_values_stack_nested_leaf | 0.1291ms | 72.1273μs | 13.8644 KOps/s | 14.1175 KOps/s | |
test_values_stack_nested_locked | 0.1445ms | 64.1465μs | 15.5893 KOps/s | 15.7138 KOps/s | |
test_membership | 12.7740μs | 0.8645μs | 1.1568 MOps/s | 1.1570 MOps/s | |
test_membership_nested | 26.8800μs | 2.9115μs | 343.4639 KOps/s | 342.8967 KOps/s | |
test_membership_nested_leaf | 22.3110μs | 2.9302μs | 341.2759 KOps/s | 336.9527 KOps/s | |
test_membership_stacked_nested | 27.7220μs | 2.8594μs | 349.7202 KOps/s | 336.8424 KOps/s | |
test_membership_stacked_nested_leaf | 21.6510μs | 2.8966μs | 345.2304 KOps/s | 337.2191 KOps/s | |
test_membership_nested_last | 26.1090μs | 4.3122μs | 231.8976 KOps/s | 227.6000 KOps/s | |
test_membership_nested_leaf_last | 29.9760μs | 4.3159μs | 231.7040 KOps/s | 225.3721 KOps/s | |
test_membership_stacked_nested_last | 30.6170μs | 4.3005μs | 232.5297 KOps/s | 131.7643 KOps/s | |
test_membership_stacked_nested_leaf_last | 27.0610μs | 4.3477μs | 230.0064 KOps/s | 131.8631 KOps/s | |
test_nested_getleaf | 30.5370μs | 10.7700μs | 92.8503 KOps/s | 91.3908 KOps/s | |
test_nested_get | 42.0090μs | 10.5718μs | 94.5911 KOps/s | 96.6096 KOps/s | |
test_stacked_getleaf | 47.5990μs | 10.7553μs | 92.9775 KOps/s | 93.2439 KOps/s | |
test_stacked_get | 29.5550μs | 10.1890μs | 98.1446 KOps/s | 98.1546 KOps/s | |
test_nested_getitemleaf | 40.5960μs | 11.4343μs | 87.4559 KOps/s | 87.0644 KOps/s | |
test_nested_getitem | 45.7150μs | 10.8083μs | 92.5219 KOps/s | 93.3283 KOps/s | |
test_stacked_getitemleaf | 29.1650μs | 11.3172μs | 88.3612 KOps/s | 87.9659 KOps/s | |
test_stacked_getitem | 35.6160μs | 10.6545μs | 93.8571 KOps/s | 93.6904 KOps/s | |
test_lock_nested | 1.0972ms | 0.4499ms | 2.2226 KOps/s | 1.8068 KOps/s | |
test_lock_stack_nested | 0.8225ms | 0.4271ms | 2.3412 KOps/s | 2.3836 KOps/s | |
test_unlock_nested | 0.7279ms | 0.3683ms | 2.7149 KOps/s | 2.6726 KOps/s | |
test_unlock_stack_nested | 0.5741ms | 0.3442ms | 2.9056 KOps/s | 3.0053 KOps/s | |
test_flatten_speed | 0.2116ms | 0.1009ms | 9.9076 KOps/s | 9.9908 KOps/s | |
test_unflatten_speed | 0.7669ms | 0.5285ms | 1.8921 KOps/s | 1.9225 KOps/s | |
test_common_ops | 4.5318ms | 0.8009ms | 1.2487 KOps/s | 1.2245 KOps/s | |
test_creation | 71.2630μs | 2.4768μs | 403.7413 KOps/s | 406.4321 KOps/s | |
test_creation_empty | 35.0550μs | 12.4908μs | 80.0586 KOps/s | 71.9738 KOps/s | |
test_creation_nested_1 | 70.4220μs | 15.6514μs | 63.8919 KOps/s | 58.6203 KOps/s | |
test_creation_nested_2 | 51.3750μs | 19.8601μs | 50.3522 KOps/s | 46.2586 KOps/s | |
test_clone | 43.0510μs | 13.6334μs | 73.3491 KOps/s | 75.5267 KOps/s | |
test_getitem[int] | 1.0804ms | 12.9286μs | 77.3480 KOps/s | 78.3318 KOps/s | |
test_getitem[slice_int] | 0.1351ms | 24.7017μs | 40.4831 KOps/s | 42.5545 KOps/s | |
test_getitem[range] | 0.1669ms | 49.9120μs | 20.0353 KOps/s | 21.4338 KOps/s | |
test_getitem[tuple] | 0.1281ms | 20.0519μs | 49.8707 KOps/s | 50.2430 KOps/s | |
test_getitem[list] | 0.1622ms | 44.2434μs | 22.6022 KOps/s | 23.6956 KOps/s | |
test_setitem_dim[int] | 47.2280μs | 26.0383μs | 38.4049 KOps/s | 39.6694 KOps/s | |
test_setitem_dim[slice_int] | 86.3510μs | 52.2978μs | 19.1212 KOps/s | 19.9555 KOps/s | |
test_setitem_dim[range] | 0.1121ms | 75.1388μs | 13.3087 KOps/s | 14.1439 KOps/s | |
test_setitem_dim[tuple] | 82.9750μs | 40.9353μs | 24.4288 KOps/s | 25.1049 KOps/s | |
test_setitem | 0.1887ms | 21.1944μs | 47.1823 KOps/s | 46.3976 KOps/s | |
test_set | 0.1240ms | 20.5565μs | 48.6464 KOps/s | 47.8659 KOps/s | |
test_set_shared | 8.2634ms | 0.1708ms | 5.8565 KOps/s | 5.8399 KOps/s | |
test_update | 0.2166ms | 23.7587μs | 42.0898 KOps/s | 39.9829 KOps/s | |
test_update_nested | 0.3291ms | 34.2988μs | 29.1555 KOps/s | 28.0732 KOps/s | |
test_update__nested | 0.3898ms | 34.1101μs | 29.3168 KOps/s | 29.7982 KOps/s | |
test_set_nested | 70.2010μs | 22.9977μs | 43.4826 KOps/s | 43.9309 KOps/s | |
test_set_nested_new | 97.0810μs | 27.8020μs | 35.9687 KOps/s | 36.2583 KOps/s | |
test_select | 0.2125ms | 44.3351μs | 22.5555 KOps/s | 22.7027 KOps/s | |
test_select_nested | 0.1233ms | 63.4581μs | 15.7584 KOps/s | 15.7941 KOps/s | |
test_exclude_nested | 0.1644ms | 82.4927μs | 12.1223 KOps/s | 12.2674 KOps/s | |
test_empty[True] | 0.5912ms | 0.4108ms | 2.4345 KOps/s | 2.4547 KOps/s | |
test_empty[False] | 6.6950μs | 1.3767μs | 726.3597 KOps/s | 734.8786 KOps/s | |
test_unbind_speed | 0.3957ms | 0.2692ms | 3.7145 KOps/s | 3.7020 KOps/s | |
test_unbind_speed_stack0 | 0.4583ms | 0.2663ms | 3.7556 KOps/s | 3.8650 KOps/s | |
test_unbind_speed_stack1 | 0.1188s | 0.8905ms | 1.1230 KOps/s | 1.4159 KOps/s | |
test_split | 0.1002s | 1.7798ms | 561.8526 Ops/s | 559.9210 Ops/s | |
test_chunk | 0.1005s | 1.7832ms | 560.7776 Ops/s | 560.4632 Ops/s | |
test_consolidate_njt[False-None] | 9.3788ms | 8.0620ms | 124.0383 Ops/s | 121.4276 Ops/s | |
test_creation[device0] | 0.2150ms | 90.5330μs | 11.0457 KOps/s | 10.8171 KOps/s | |
test_creation_from_tensor | 3.1909ms | 94.7533μs | 10.5537 KOps/s | 10.6076 KOps/s | |
test_add_one[memmap_tensor0] | 0.2672ms | 4.8548μs | 205.9800 KOps/s | 214.3878 KOps/s | |
test_contiguous[memmap_tensor0] | 21.3100μs | 0.5120μs | 1.9530 MOps/s | 1.9416 MOps/s | |
test_stack[memmap_tensor0] | 61.2740μs | 3.4266μs | 291.8367 KOps/s | 288.2386 KOps/s | |
test_memmaptd_index | 1.0071ms | 0.2392ms | 4.1811 KOps/s | 4.1878 KOps/s | |
test_memmaptd_index_astensor | 0.5661ms | 0.3216ms | 3.1093 KOps/s | 3.0974 KOps/s | |
test_memmaptd_index_op | 1.1463ms | 0.5990ms | 1.6694 KOps/s | 1.6050 KOps/s | |
test_serialize_model | 0.1237s | 0.1146s | 8.7271 Ops/s | 8.5426 Ops/s | |
test_serialize_model_pickle | 0.4995s | 0.3959s | 2.5261 Ops/s | 2.4875 Ops/s | |
test_serialize_weights | 0.2081s | 0.1249s | 8.0046 Ops/s | 8.8617 Ops/s | |
test_serialize_weights_returnearly | 0.1604s | 0.1549s | 6.4553 Ops/s | 6.2246 Ops/s | |
test_serialize_weights_pickle | 1.2503s | 0.7471s | 1.3385 Ops/s | 2.5151 Ops/s | |
test_serialize_weights_filesystem | 0.1431s | 0.1391s | 7.1873 Ops/s | 7.1488 Ops/s | |
test_serialize_model_filesystem | 0.1479s | 0.1399s | 7.1501 Ops/s | 5.9776 Ops/s | |
test_reshape_pytree | 55.4930μs | 26.0215μs | 38.4298 KOps/s | 37.7058 KOps/s | |
test_reshape_td | 70.4910μs | 32.7718μs | 30.5140 KOps/s | 30.0521 KOps/s | |
test_view_pytree | 77.7350μs | 26.3513μs | 37.9488 KOps/s | 38.4627 KOps/s | |
test_view_td | 93.2640μs | 39.8612μs | 25.0870 KOps/s | 25.4230 KOps/s | |
test_unbind_pytree | 69.8900μs | 29.1113μs | 34.3509 KOps/s | 34.4759 KOps/s | |
test_unbind_td | 98.7649ms | 47.4725μs | 21.0648 KOps/s | 25.1301 KOps/s | |
test_split_pytree | 61.6550μs | 28.7520μs | 34.7802 KOps/s | 34.6235 KOps/s | |
test_split_td | 0.2048ms | 44.8743μs | 22.2845 KOps/s | 22.1116 KOps/s | |
test_add_pytree | 92.5320μs | 34.5903μs | 28.9098 KOps/s | 28.6679 KOps/s | |
test_add_td | 0.2718ms | 58.8812μs | 16.9834 KOps/s | 16.3835 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1142ms | 62.2112μs | 16.0743 KOps/s | 15.9507 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 1.3545ms | 0.1714ms | 5.8331 KOps/s | 5.6824 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1088ms | 45.7437μs | 21.8609 KOps/s | 21.7113 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2137ms | 0.1182ms | 8.4634 KOps/s | 8.6017 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 59.6410μs | 26.1866μs | 38.1875 KOps/s | 39.1120 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1143ms | 58.9627μs | 16.9599 KOps/s | 17.0211 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2410ms | 78.6201μs | 12.7194 KOps/s | 12.9196 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1196ms | 66.6408μs | 15.0058 KOps/s | 15.0824 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3959ms | 0.1048ms | 9.5452 KOps/s | 9.6229 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.5560ms | 0.2140ms | 4.6732 KOps/s | 4.7369 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1029ms | 44.6563μs | 22.3932 KOps/s | 21.5593 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4862ms | 66.5947μs | 15.0162 KOps/s | 15.1269 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2222ms | 0.1034ms | 9.6710 KOps/s | 9.8940 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3159ms | 0.1995ms | 5.0118 KOps/s | 5.0115 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4834ms | 0.2321ms | 4.3078 KOps/s | 4.3051 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2098ms | 0.1068ms | 9.3597 KOps/s | 9.4971 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1811ms | 63.7368μs | 15.6895 KOps/s | 15.5979 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1055ms | 46.8521μs | 21.3438 KOps/s | 21.5927 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.2946ms | 0.1586ms | 6.3041 KOps/s | 6.3433 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1973ms | 0.1026ms | 9.7424 KOps/s | 9.7841 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 69.8500μs | 20.7649μs | 48.1582 KOps/s | 47.6660 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1304ms | 66.2595μs | 15.0922 KOps/s | 14.8328 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1485ms | 79.7977μs | 12.5317 KOps/s | 12.6787 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1321ms | 67.7628μs | 14.7573 KOps/s | 14.6666 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3504ms | 0.2098ms | 4.7675 KOps/s | 4.9148 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.4703ms | 1.3103ms | 763.1936 Ops/s | 736.5376 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3958ms | 0.1996ms | 5.0106 KOps/s | 4.8766 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.9804ms | 0.7647ms | 1.3078 KOps/s | 1.3041 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5178ms | 0.4373ms | 2.2866 KOps/s | 2.2394 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.9871ms | 2.7361ms | 365.4893 Ops/s | 360.2304 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 78.5660μs | 35.9511μs | 27.8156 KOps/s | 27.8321 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5331ms | 35.0893μs | 28.4987 KOps/s | 30.9173 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 86.1200μs | 29.4913μs | 33.9083 KOps/s | 34.6830 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 63.7490μs | 23.2226μs | 43.0615 KOps/s | 44.1295 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 94.7170μs | 30.7227μs | 32.5492 KOps/s | 33.6001 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 82.1530μs | 22.8781μs | 43.7099 KOps/s | 44.5734 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1225ms | 52.4362μs | 19.0708 KOps/s | 19.2288 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5726ms | 20.7276μs | 48.2447 KOps/s | 48.7753 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1083ms | 44.6234μs | 22.4098 KOps/s | 22.9586 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 57.0060μs | 18.6265μs | 53.6869 KOps/s | 53.7502 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1233ms | 45.0802μs | 22.1827 KOps/s | 22.2700 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 57.0370μs | 18.6225μs | 53.6986 KOps/s | 53.4201 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1440ms | 54.2499μs | 18.4332 KOps/s | 18.7795 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9374ms | 20.7098μs | 48.2863 KOps/s | 49.8212 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1184ms | 44.7384μs | 22.3522 KOps/s | 22.2553 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 48.9810μs | 18.6259μs | 53.6885 KOps/s | 54.0844 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1099ms | 44.8198μs | 22.3116 KOps/s | 22.2180 KOps/s | |
test_compile_indexing[int-pytree-eager] | 64.9310μs | 18.6523μs | 53.6127 KOps/s | 54.3562 KOps/s | |
test_mod_add[eager] | 82.3340μs | 35.9200μs | 27.8396 KOps/s | 26.8124 KOps/s | |
test_mod_add[compile] | 92.2520μs | 47.3071μs | 21.1385 KOps/s | 20.0013 KOps/s | |
test_mod_add[compile-overhead] | 0.1352ms | 47.5457μs | 21.0324 KOps/s | 20.1083 KOps/s | |
test_mod_wrap[eager] | 0.4129ms | 0.2203ms | 4.5400 KOps/s | 4.5428 KOps/s | |
test_mod_wrap[compile] | 0.3332ms | 0.2001ms | 4.9974 KOps/s | 4.8556 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3211ms | 0.1987ms | 5.0332 KOps/s | 4.8781 KOps/s | |
test_mod_wrap_and_backward[eager] | 13.8122ms | 11.3188ms | 88.3487 Ops/s | 87.0884 Ops/s | |
test_mod_wrap_and_backward[compile] | 13.7228ms | 12.0534ms | 82.9639 Ops/s | 81.7858 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 15.9730ms | 12.2430ms | 81.6794 Ops/s | 80.1956 Ops/s | |
test_seq_add[eager] | 0.2539ms | 0.1176ms | 8.5034 KOps/s | 8.1509 KOps/s | |
test_seq_add[compile] | 0.1807ms | 61.6303μs | 16.2258 KOps/s | 15.7312 KOps/s | |
test_seq_add[compile-overhead] | 0.1330ms | 58.5407μs | 17.0821 KOps/s | 16.2294 KOps/s | |
test_seq_wrap[eager] | 0.6954ms | 0.4398ms | 2.2735 KOps/s | 2.2165 KOps/s | |
test_seq_wrap[compile] | 0.4240ms | 0.2207ms | 4.5319 KOps/s | 4.4145 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3971ms | 0.2212ms | 4.5216 KOps/s | 4.4432 KOps/s | |
test_func_call_runtime[False-eager] | 0.8882ms | 0.5352ms | 1.8686 KOps/s | 1.9182 KOps/s | |
test_func_call_runtime[False-compile] | 0.5048ms | 0.4155ms | 2.4068 KOps/s | 2.3841 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.6319ms | 0.4144ms | 2.4134 KOps/s | 2.3914 KOps/s | |
test_func_call_runtime[True-eager] | 0.9426ms | 0.7477ms | 1.3374 KOps/s | 1.3440 KOps/s | |
test_func_call_runtime[True-compile] | 0.6060ms | 0.4545ms | 2.2003 KOps/s | 2.1854 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8300ms | 0.4573ms | 2.1869 KOps/s | 2.2027 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9469ms | 0.5369ms | 1.8626 KOps/s | 1.9065 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.7840ms | 0.4141ms | 2.4148 KOps/s | 2.3843 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5183ms | 0.4157ms | 2.4054 KOps/s | 2.3879 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4726ms | 0.8933ms | 1.1195 KOps/s | 1.1407 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6991ms | 0.4803ms | 2.0819 KOps/s | 2.0616 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.7705ms | 0.4819ms | 2.0753 KOps/s | 2.0683 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 3.0299ms | 1.8983ms | 526.7801 Ops/s | 535.1799 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8419ms | 0.5034ms | 1.9865 KOps/s | 1.9347 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8075ms | 0.5092ms | 1.9639 KOps/s | 1.9331 KOps/s | |
test_distributed | 0.2803ms | 0.1216ms | 8.2239 KOps/s | 7.8832 KOps/s | |
test_tdmodule | 45.5650μs | 27.5469μs | 36.3017 KOps/s | 36.1692 KOps/s | |
test_tdmodule_dispatch | 86.9220μs | 50.3345μs | 19.8671 KOps/s | 19.6384 KOps/s | |
test_tdseq | 53.4300μs | 29.5993μs | 33.7845 KOps/s | 33.4568 KOps/s | |
test_tdseq_dispatch | 0.1161ms | 57.5052μs | 17.3897 KOps/s | 17.8416 KOps/s | |
test_instantiation_functorch | 1.6895ms | 1.4948ms | 668.9695 Ops/s | 664.7355 Ops/s | |
test_exec_functorch | 0.3397ms | 0.1800ms | 5.5548 KOps/s | 5.6011 KOps/s | |
test_exec_functional_call | 0.3279ms | 0.1731ms | 5.7769 KOps/s | 5.8513 KOps/s | |
test_exec_td_decorator | 0.4757ms | 0.2339ms | 4.2751 KOps/s | 4.4178 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.0327ms | 0.6532ms | 1.5310 KOps/s | 1.5119 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0055ms | 0.6520ms | 1.5337 KOps/s | 1.5227 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6347ms | 0.5234ms | 1.9107 KOps/s | 1.8932 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7192ms | 0.5261ms | 1.9007 KOps/s | 1.9161 KOps/s | |
test_to_module_speed[True] | 2.4147ms | 1.3607ms | 734.9138 Ops/s | 748.7631 Ops/s | |
test_to_module_speed[False] | 1.9082ms | 1.3024ms | 767.8368 Ops/s | 771.5199 Ops/s | |
test_tc_init | 88.1440μs | 48.3606μs | 20.6780 KOps/s | 20.4630 KOps/s | |
test_tc_init_nested | 0.2041ms | 96.6036μs | 10.3516 KOps/s | 10.0034 KOps/s | |
test_tc_first_layer_tensor | 20.4180μs | 1.5400μs | 649.3356 KOps/s | 639.6566 KOps/s | |
test_tc_first_layer_nontensor | 29.7150μs | 4.7161μs | 212.0377 KOps/s | 208.4981 KOps/s | |
test_tc_second_layer_tensor | 20.8290μs | 2.8752μs | 347.8041 KOps/s | 339.1213 KOps/s | |
test_tc_second_layer_nontensor | 33.1720μs | 6.1436μs | 162.7708 KOps/s | 162.0784 KOps/s | |
test_unbind | 0.2132s | 12.9382ms | 77.2904 Ops/s | 81.2820 Ops/s | |
test_full_like | 7.5190ms | 6.7805ms | 147.4810 Ops/s | 84.3504 Ops/s | |
test_zeros_like | 2.9850ms | 2.5399ms | 393.7198 Ops/s | 141.5302 Ops/s | |
test_ones_like | 3.7535ms | 3.1233ms | 320.1721 Ops/s | 130.8286 Ops/s | |
test_clone | 5.1514ms | 4.6967ms | 212.9153 Ops/s | 108.4320 Ops/s | |
test_squeeze | 57.4870μs | 11.9715μs | 83.5318 KOps/s | 82.3664 KOps/s | |
test_unsqueeze | 0.1751ms | 90.4587μs | 11.0548 KOps/s | 10.9748 KOps/s | |
test_split | 0.5218ms | 0.1928ms | 5.1856 KOps/s | 5.2888 KOps/s | |
test_permute | 0.2876ms | 0.2026ms | 4.9357 KOps/s | 5.0370 KOps/s | |
test_stack | 29.9871ms | 24.2683ms | 41.2060 Ops/s | 41.4415 Ops/s | |
test_cat | 27.9788ms | 23.8683ms | 41.8966 Ops/s | 42.1395 Ops/s |
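
The "Change" column in the table above is empty, so differences between the PR and repo HEAD have to be read off the two "Ops" columns by hand. As a rough aid, here is a minimal sketch of how a relative change could be computed from those two cells. The helper names `parse_ops` and `relative_change` are hypothetical and are not part of tensordict or of the benchmark tooling that produced this table; the unit handling assumes only the three suffixes that appear in the table (`Ops/s`, `KOps/s`, `MOps/s`).

```python
# Multipliers for the throughput units that appear in the table.
# Assumption: no other unit suffix occurs in the "Ops" columns.
_UNIT = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '46.8079 KOps/s' to operations per second."""
    value, unit = cell.strip().split()
    return float(value) * _UNIT[unit]


def relative_change(ops_pr: str, ops_head: str) -> float:
    """Percent change of the PR throughput relative to repo HEAD."""
    pr, head = parse_ops(ops_pr), parse_ops(ops_head)
    return (pr - head) / head * 100.0


# Two rows copied from the table above:
# test_plain_set_nested and test_membership_stacked_nested_last.
rows = {
    "test_plain_set_nested": ("46.8079 KOps/s", "46.0753 KOps/s"),
    "test_membership_stacked_nested_last": ("232.5297 KOps/s", "131.7643 KOps/s"),
}
for name, (ops_pr, ops_head) in rows.items():
    print(f"{name}: {relative_change(ops_pr, ops_head):+.2f}%")
```

Single-run differences of a few percent are likely within measurement noise, so a sketch like this is mainly useful for spotting the larger swings between the two columns.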

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 33.1410μs | 13.1672μs | 75.9460 KOps/s | 76.9369 KOps/s | |
test_plain_set_stack_nested | 51.2710μs | 13.3996μs | 74.6293 KOps/s | 76.1106 KOps/s | |
test_plain_set_nested_inplace | 37.8310μs | 14.2065μs | 70.3904 KOps/s | 71.7231 KOps/s | |
test_plain_set_stack_nested_inplace | 47.2710μs | 14.2898μs | 69.9799 KOps/s | 71.0565 KOps/s | |
test_items | 26.3500μs | 2.8783μs | 347.4283 KOps/s | 346.2519 KOps/s | |
test_items_nested | 0.4378ms | 0.3633ms | 2.7523 KOps/s | 2.7695 KOps/s | |
test_items_nested_locked | 1.4931ms | 0.3667ms | 2.7272 KOps/s | 2.7918 KOps/s | |
test_items_nested_leaf | 83.7010μs | 57.8412μs | 17.2887 KOps/s | 17.1273 KOps/s | |
test_items_stack_nested | 0.4140ms | 0.3595ms | 2.7819 KOps/s | 2.7795 KOps/s | |
test_items_stack_nested_leaf | 89.2010μs | 59.2008μs | 16.8917 KOps/s | 16.7981 KOps/s | |
test_items_stack_nested_locked | 0.4194ms | 0.3642ms | 2.7458 KOps/s | 2.7542 KOps/s | |
test_keys | 36.6210μs | 3.5042μs | 285.3697 KOps/s | 285.2765 KOps/s | |
test_keys_nested | 0.1278ms | 86.8878μs | 11.5091 KOps/s | 11.5424 KOps/s | |
test_keys_nested_locked | 0.7625ms | 93.3157μs | 10.7163 KOps/s | 10.7825 KOps/s | |
test_keys_nested_leaf | 0.1153ms | 79.0760μs | 12.6461 KOps/s | 12.7890 KOps/s | |
test_keys_stack_nested | 0.1270ms | 89.3853μs | 11.1875 KOps/s | 11.3925 KOps/s | |
test_keys_stack_nested_leaf | 0.1256ms | 79.7774μs | 12.5349 KOps/s | 12.6610 KOps/s | |
test_keys_stack_nested_locked | 0.1253ms | 94.9889μs | 10.5275 KOps/s | 10.6881 KOps/s | |
test_values | 4.8600μs | 0.8459μs | 1.1822 MOps/s | 1.1736 MOps/s | |
test_values_nested | 71.6320μs | 37.9224μs | 26.3696 KOps/s | 26.4686 KOps/s | |
test_values_nested_locked | 71.6510μs | 40.0400μs | 24.9750 KOps/s | 25.0272 KOps/s | |
test_values_nested_leaf | 73.7810μs | 42.0965μs | 23.7549 KOps/s | 23.6796 KOps/s | |
test_values_stack_nested | 64.8010μs | 38.8212μs | 25.7592 KOps/s | 26.0273 KOps/s | |
test_values_stack_nested_leaf | 71.5310μs | 42.9355μs | 23.2908 KOps/s | 23.3644 KOps/s | |
test_values_stack_nested_locked | 72.3520μs | 40.9106μs | 24.4435 KOps/s | 24.8900 KOps/s | |
test_membership | 2.1710μs | 0.5530μs | 1.8082 MOps/s | 1.7899 MOps/s | |
test_membership_nested | 21.6155μs | 2.0308μs | 492.4119 KOps/s | 479.0345 KOps/s | |
test_membership_nested_leaf | 20.6555μs | 2.0534μs | 487.0088 KOps/s | 484.3623 KOps/s | |
test_membership_stacked_nested | 26.2100μs | 2.1084μs | 474.2938 KOps/s | 482.3634 KOps/s | |
test_membership_stacked_nested_leaf | 36.7210μs | 2.1254μs | 470.4959 KOps/s | 486.6219 KOps/s | |
test_membership_nested_last | 36.5300μs | 3.0966μs | 322.9370 KOps/s | 322.6812 KOps/s | |
test_membership_nested_leaf_last | 32.4810μs | 3.1207μs | 320.4440 KOps/s | 319.6491 KOps/s | |
test_membership_stacked_nested_last | 26.8400μs | 3.0990μs | 322.6814 KOps/s | 277.7125 KOps/s | |
test_membership_stacked_nested_leaf_last | 36.2410μs | 3.1028μs | 322.2869 KOps/s | 281.0187 KOps/s | |
test_nested_getleaf | 32.7710μs | 6.1742μs | 161.9651 KOps/s | 164.1860 KOps/s | |
test_nested_get | 24.3610μs | 5.7794μs | 173.0275 KOps/s | 172.9115 KOps/s | |
test_stacked_getleaf | 33.6110μs | 6.1107μs | 163.6481 KOps/s | 162.3732 KOps/s | |
test_stacked_get | 35.6910μs | 5.7955μs | 172.5488 KOps/s | 172.1676 KOps/s | |
test_nested_getitemleaf | 40.5810μs | 6.3943μs | 156.3886 KOps/s | 155.2775 KOps/s | |
test_nested_getitem | 29.2910μs | 6.1505μs | 162.5875 KOps/s | 160.8702 KOps/s | |
test_stacked_getitemleaf | 34.0710μs | 6.4002μs | 156.2460 KOps/s | 155.1966 KOps/s | |
test_stacked_getitem | 31.4700μs | 6.0872μs | 164.2792 KOps/s | 163.9538 KOps/s | |
test_lock_nested | 0.8327ms | 0.3748ms | 2.6679 KOps/s | 2.5523 KOps/s | |
test_lock_stack_nested | 0.4048ms | 0.3480ms | 2.8734 KOps/s | 2.8683 KOps/s | |
test_unlock_nested | 0.6748ms | 0.3140ms | 3.1850 KOps/s | 3.0858 KOps/s | |
test_unlock_stack_nested | 0.3569ms | 0.2865ms | 3.4900 KOps/s | 3.4939 KOps/s | |
test_flatten_speed | 0.1630ms | 73.1338μs | 13.6736 KOps/s | 13.2863 KOps/s | |
test_unflatten_speed | 0.3734ms | 0.3215ms | 3.1102 KOps/s | 3.0976 KOps/s | |
test_common_ops | 1.6673ms | 0.6403ms | 1.5617 KOps/s | 1.5261 KOps/s | |
test_creation | 99.1320μs | 1.8102μs | 552.4227 KOps/s | 550.3948 KOps/s | |
test_creation_empty | 32.1110μs | 9.9934μs | 100.0658 KOps/s | 104.5160 KOps/s | |
test_creation_nested_1 | 46.0710μs | 11.6253μs | 86.0192 KOps/s | 90.1190 KOps/s | |
test_creation_nested_2 | 44.0000μs | 14.4245μs | 69.3266 KOps/s | 71.7138 KOps/s | |
test_clone | 0.1096ms | 10.5363μs | 94.9104 KOps/s | 88.4457 KOps/s | |
test_getitem[int] | 1.4822ms | 10.7348μs | 93.1546 KOps/s | 88.6732 KOps/s | |
test_getitem[slice_int] | 0.1127ms | 20.9365μs | 47.7635 KOps/s | 45.5175 KOps/s | |
test_getitem[range] | 0.1359ms | 37.2819μs | 26.8226 KOps/s | 25.1222 KOps/s | |
test_getitem[tuple] | 0.1145ms | 18.4406μs | 54.2283 KOps/s | 51.9699 KOps/s | |
test_getitem[list] | 0.1341ms | 34.0078μs | 29.4051 KOps/s | 28.2716 KOps/s | |
test_setitem_dim[int] | 44.3300μs | 19.8405μs | 50.4020 KOps/s | 46.3549 KOps/s | |
test_setitem_dim[slice_int] | 63.7510μs | 39.8087μs | 25.1202 KOps/s | 24.6860 KOps/s | |
test_setitem_dim[range] | 78.3210μs | 54.3439μs | 18.4013 KOps/s | 16.9360 KOps/s | |
test_setitem_dim[tuple] | 62.9110μs | 33.7745μs | 29.6082 KOps/s | 28.3757 KOps/s | |
test_setitem | 50.3210μs | 15.9101μs | 62.8530 KOps/s | 59.5161 KOps/s | |
test_set | 52.6010μs | 15.6011μs | 64.0982 KOps/s | 60.8434 KOps/s | |
test_set_shared | 1.6117ms | 0.1532ms | 6.5262 KOps/s | 6.4133 KOps/s | |
test_update | 0.4284ms | 19.5149μs | 51.2430 KOps/s | 50.9373 KOps/s | |
test_update_nested | 0.1117ms | 24.9031μs | 40.1557 KOps/s | 39.8117 KOps/s | |
test_update__nested | 0.9809ms | 25.7858μs | 38.7811 KOps/s | 37.3463 KOps/s | |
test_set_nested | 0.1097ms | 16.6857μs | 59.9316 KOps/s | 56.2684 KOps/s | |
test_set_nested_new | 0.1136ms | 18.7787μs | 53.2518 KOps/s | 49.7760 KOps/s | |
test_select | 0.1251ms | 31.1292μs | 32.1242 KOps/s | 30.9550 KOps/s | |
test_select_nested | 78.1820μs | 44.6942μs | 22.3743 KOps/s | 21.7010 KOps/s | |
test_exclude_nested | 94.3810μs | 63.3709μs | 15.7801 KOps/s | 15.4461 KOps/s | |
test_empty[True] | 0.3528ms | 0.2899ms | 3.4490 KOps/s | 3.3483 KOps/s | |
test_empty[False] | 3.0741μs | 0.8264μs | 1.2100 MOps/s | 1.1917 MOps/s | |
test_to | 95.3010μs | 58.2210μs | 17.1759 KOps/s | 16.6799 KOps/s | |
test_to_nonblocking | 0.9331ms | 49.6532μs | 20.1397 KOps/s | 19.2121 KOps/s | |
test_unbind_speed | 0.2961ms | 0.2411ms | 4.1472 KOps/s | 4.0752 KOps/s | |
test_unbind_speed_stack0 | 0.3037ms | 0.2430ms | 4.1151 KOps/s | 4.0960 KOps/s | |
test_unbind_speed_stack1 | 0.7279ms | 0.6170ms | 1.6206 KOps/s | 1.4895 KOps/s | |
test_split | 94.8621ms | 1.5923ms | 628.0334 Ops/s | 617.0621 Ops/s | |
test_chunk | 96.3398ms | 1.5866ms | 630.2649 Ops/s | 614.0984 Ops/s | |
test_consolidate[False-None] | 96.9243ms | 2.9654ms | 337.2253 Ops/s | 331.0742 Ops/s | |
test_consolidate[default-None] | 1.7719ms | 1.7024ms | 587.4111 Ops/s | 581.2844 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8402ms | 1.7490ms | 571.7597 Ops/s | 568.4360 Ops/s | |
test_consolidate_njt[False-None] | 6.7631ms | 6.6679ms | 149.9721 Ops/s | 147.0207 Ops/s | |
test_to[False-False-None] | 1.8657ms | 1.7536ms | 570.2630 Ops/s | 566.6680 Ops/s | |
test_to[True-False-None] | 1.6126ms | 1.3475ms | 742.1368 Ops/s | 727.6365 Ops/s | |
test_to[within-False-None] | 4.4475ms | 4.2274ms | 236.5504 Ops/s | 236.8649 Ops/s | |
test_to[True-default-None] | 5.6253ms | 5.3793ms | 185.8963 Ops/s | 179.1620 Ops/s | |
test_to_njt[False-False-None] | 7.1182ms | 6.9478ms | 143.9297 Ops/s | 137.8951 Ops/s | |
test_to_njt[True-False-None] | 5.6984ms | 5.4697ms | 182.8243 Ops/s | 172.1907 Ops/s | |
test_to_njt[within-False-None] | 12.4158ms | 12.3379ms | 81.0512 Ops/s | 78.0716 Ops/s | |
test_creation[device0] | 0.4482ms | 81.4189μs | 12.2822 KOps/s | 12.2705 KOps/s | |
test_creation_from_tensor | 0.5202ms | 86.0660μs | 11.6190 KOps/s | 11.6831 KOps/s | |
test_add_one[memmap_tensor0] | 0.4046ms | 6.9545μs | 143.7924 KOps/s | 139.8314 KOps/s | |
test_contiguous[memmap_tensor0] | 2.0075μs | 0.4206μs | 2.3775 MOps/s | 2.3422 MOps/s | |
test_stack[memmap_tensor0] | 21.1010μs | 4.3498μs | 229.8970 KOps/s | 217.2023 KOps/s | |
test_memmaptd_index | 0.5947ms | 0.2555ms | 3.9135 KOps/s | 3.7571 KOps/s | |
test_memmaptd_index_astensor | 0.5790ms | 0.3206ms | 3.1193 KOps/s | 3.0504 KOps/s | |
test_memmaptd_index_op | 1.0237ms | 0.6276ms | 1.5933 KOps/s | 1.5815 KOps/s | |
test_serialize_model | 0.1324s | 0.1314s | 7.6082 Ops/s | 7.6217 Ops/s | |
test_serialize_model_pickle | 1.3658s | 1.2143s | 0.8235 Ops/s | 0.8408 Ops/s | |
test_serialize_weights | 0.1323s | 0.1312s | 7.6246 Ops/s | 7.6359 Ops/s | |
test_serialize_weights_returnearly | 0.5272s | 70.9942ms | 14.0857 Ops/s | 23.2237 Ops/s | |
test_serialize_weights_pickle | 1.3766s | 1.2181s | 0.8209 Ops/s | 0.8353 Ops/s | |
test_reshape_pytree | 51.8300μs | 22.3762μs | 44.6903 KOps/s | 43.7499 KOps/s | |
test_reshape_td | 61.7610μs | 26.4250μs | 37.8429 KOps/s | 35.0596 KOps/s | |
test_view_pytree | 60.6010μs | 22.5108μs | 44.4231 KOps/s | 44.5394 KOps/s | |
test_view_td | 82.4210μs | 34.4164μs | 29.0559 KOps/s | 30.3924 KOps/s | |
test_unbind_pytree | 59.7610μs | 28.3745μs | 35.2430 KOps/s | 35.2234 KOps/s | |
test_unbind_td | 0.8106ms | 37.1267μs | 26.9348 KOps/s | 26.9896 KOps/s | |
test_split_pytree | 69.8310μs | 30.7634μs | 32.5061 KOps/s | 32.7749 KOps/s | |
test_split_td | 0.9567ms | 40.6295μs | 24.6126 KOps/s | 15.0883 KOps/s | |
test_add_pytree | 0.2590ms | 35.6268μs | 28.0688 KOps/s | 28.0633 KOps/s | |
test_add_td | 0.1003ms | 53.9831μs | 18.5243 KOps/s | 18.9053 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1774ms | 0.1230ms | 8.1311 KOps/s | 7.8236 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2270ms | 0.1324ms | 7.5501 KOps/s | 7.4287 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2184ms | 97.7207μs | 10.2332 KOps/s | 10.0313 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.5792ms | 0.1504ms | 6.6505 KOps/s | 6.4394 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 53.9310μs | 22.3790μs | 44.6847 KOps/s | 44.9113 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 66.0110μs | 30.0107μs | 33.3215 KOps/s | 33.3736 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3080ms | 66.3352μs | 15.0749 KOps/s | 15.1146 KOps/s | |
test_compile_copy_nested[pytree-eager] | 91.2310μs | 50.0478μs | 19.9809 KOps/s | 19.5634 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1933ms | 0.1450ms | 6.8978 KOps/s | 6.8107 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3124ms | 0.2227ms | 4.4907 KOps/s | 4.5379 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1553ms | 0.1001ms | 9.9874 KOps/s | 9.8475 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1133ms | 56.2310μs | 17.7838 KOps/s | 17.0824 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2583ms | 0.1383ms | 7.2285 KOps/s | 7.2009 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5795ms | 0.4815ms | 2.0770 KOps/s | 1.9817 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3822ms | 0.2667ms | 3.7494 KOps/s | 3.7156 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2000ms | 0.1463ms | 6.8354 KOps/s | 6.8386 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1551ms | 70.5683μs | 14.1707 KOps/s | 14.2847 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1410ms | 0.1021ms | 9.7981 KOps/s | 9.8422 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4651ms | 0.4117ms | 2.4289 KOps/s | 2.4275 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1858ms | 0.1383ms | 7.2305 KOps/s | 7.2668 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1064ms | 18.8990μs | 52.9129 KOps/s | 47.3706 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 71.7010μs | 32.0694μs | 31.1824 KOps/s | 32.0372 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2017ms | 71.3074μs | 14.0238 KOps/s | 13.9760 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1247ms | 52.1009μs | 19.1935 KOps/s | 19.0819 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6518ms | 0.3968ms | 2.5200 KOps/s | 2.1989 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7935ms | 2.6642ms | 375.3425 Ops/s | 363.2949 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6084ms | 0.3833ms | 2.6092 KOps/s | 2.2473 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7611ms | 2.6519ms | 377.0878 Ops/s | 361.3618 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1627ms | 0.1180ms | 8.4711 KOps/s | 8.5513 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5806ms | 82.8968μs | 12.0632 KOps/s | 11.7558 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1673ms | 0.1106ms | 9.0394 KOps/s | 8.7708 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1209ms | 72.6218μs | 13.7700 KOps/s | 13.3919 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1612ms | 0.1129ms | 8.8566 KOps/s | 8.6791 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1643ms | 72.0711μs | 13.8752 KOps/s | 14.0318 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1865ms | 0.1023ms | 9.7735 KOps/s | 9.7914 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1479ms | 17.5185μs | 57.0825 KOps/s | 56.0986 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1471ms | 98.1396μs | 10.1896 KOps/s | 10.1936 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 63.2710μs | 15.9058μs | 62.8701 KOps/s | 61.6795 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1547ms | 0.1023ms | 9.7741 KOps/s | 10.1658 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 49.9210μs | 15.8695μs | 63.0140 KOps/s | 61.4844 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1579ms | 0.1039ms | 9.6215 KOps/s | 9.7904 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6130ms | 16.8658μs | 59.2915 KOps/s | 55.4049 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1771ms | 0.1001ms | 9.9942 KOps/s | 10.1697 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 59.8510μs | 15.8605μs | 63.0496 KOps/s | 61.6568 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1435ms | 98.1812μs | 10.1852 KOps/s | 10.1908 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1579ms | 16.3954μs | 60.9928 KOps/s | 61.4958 KOps/s | |
test_mod_add[eager] | 84.2610μs | 39.9101μs | 25.0563 KOps/s | 24.4837 KOps/s | |
test_mod_add[compile] | 0.1469ms | 81.4805μs | 12.2729 KOps/s | 12.0834 KOps/s | |
test_mod_add[compile-overhead] | 0.3276ms | 0.1696ms | 5.8947 KOps/s | 5.6464 KOps/s | |
test_mod_wrap[eager] | 0.3393ms | 0.2630ms | 3.8021 KOps/s | 3.8379 KOps/s | |
test_mod_wrap[compile] | 0.3844ms | 0.3009ms | 3.3229 KOps/s | 3.4163 KOps/s | |
test_mod_wrap[compile-overhead] | 7.5471ms | 3.9321ms | 254.3149 Ops/s | 253.1250 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4860ms | 1.3835ms | 722.8110 Ops/s | 666.5200 Ops/s | |
test_mod_wrap_and_backward[compile] | 2.0730ms | 1.3896ms | 719.6263 Ops/s | 767.2417 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.5399ms | 1.0443ms | 957.5525 Ops/s | 1.0450 KOps/s | |
test_seq_add[eager] | 0.1967ms | 0.1295ms | 7.7212 KOps/s | 8.1714 KOps/s | |
test_seq_add[compile] | 0.1687ms | 94.7735μs | 10.5515 KOps/s | 10.9323 KOps/s | |
test_seq_add[compile-overhead] | 0.1722ms | 0.1355ms | 7.3775 KOps/s | 7.5429 KOps/s | |
test_seq_wrap[eager] | 0.5395ms | 0.4446ms | 2.2494 KOps/s | 2.2690 KOps/s | |
test_seq_wrap[compile] | 0.3937ms | 0.3085ms | 3.2417 KOps/s | 3.2439 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2753ms | 0.2280ms | 4.3864 KOps/s | 4.2525 KOps/s | |
test_func_call_runtime[False-eager] | 1.1553ms | 0.7477ms | 1.3374 KOps/s | 1.2403 KOps/s | |
test_func_call_runtime[False-compile] | 1.1781ms | 0.7557ms | 1.3234 KOps/s | 1.2968 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7831ms | 0.3716ms | 2.6907 KOps/s | 2.6942 KOps/s | |
test_func_call_runtime[True-eager] | 1.3402ms | 0.9209ms | 1.0859 KOps/s | 1.0520 KOps/s | |
test_func_call_runtime[True-compile] | 1.1847ms | 0.7790ms | 1.2837 KOps/s | 1.2769 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4388ms | 0.3904ms | 2.5613 KOps/s | 2.5467 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.1404ms | 0.7446ms | 1.3431 KOps/s | 1.3125 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.1873ms | 0.7563ms | 1.3223 KOps/s | 1.3066 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.7860ms | 0.3745ms | 2.6699 KOps/s | 2.6727 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4226ms | 1.0173ms | 983.0230 Ops/s | 965.3397 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.2204ms | 0.8025ms | 1.2460 KOps/s | 1.2315 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.8361ms | 0.4199ms | 2.3815 KOps/s | 2.3810 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5842ms | 2.1214ms | 471.3965 Ops/s | 465.0074 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.2282ms | 0.8216ms | 1.2171 KOps/s | 1.1970 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4753ms | 0.4201ms | 2.3803 KOps/s | 2.3595 KOps/s | |
test_distributed | 3.0645ms | 0.1900ms | 5.2624 KOps/s | 8.5050 KOps/s | |
test_tdmodule | 32.9200μs | 21.5719μs | 46.3566 KOps/s | 44.7893 KOps/s | |
test_tdmodule_dispatch | 0.3170ms | 39.0432μs | 25.6126 KOps/s | 26.1912 KOps/s | |
test_tdseq | 44.5410μs | 22.3642μs | 44.7144 KOps/s | 44.8855 KOps/s | |
test_tdseq_dispatch | 64.6510μs | 42.0255μs | 23.7951 KOps/s | 24.1190 KOps/s | |
test_instantiation_functorch | 2.0054ms | 1.5903ms | 628.8045 Ops/s | 617.8167 Ops/s | |
test_exec_functorch | 0.2088ms | 0.1464ms | 6.8299 KOps/s | 6.6406 KOps/s | |
test_exec_functional_call | 0.5623ms | 0.1419ms | 7.0472 KOps/s | 6.8464 KOps/s | |
test_exec_td_decorator | 0.6023ms | 0.1899ms | 5.2657 KOps/s | 5.1062 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.1189ms | 0.6984ms | 1.4318 KOps/s | 1.4128 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.1222ms | 0.7038ms | 1.4208 KOps/s | 1.4188 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 1.0153ms | 0.6026ms | 1.6594 KOps/s | 1.6276 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 1.0295ms | 0.6046ms | 1.6539 KOps/s | 1.6238 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.9436ms | 19.5727ms | 51.0915 Ops/s | 51.0358 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.9455ms | 19.5600ms | 51.1247 Ops/s | 50.8979 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.8617ms | 19.4811ms | 51.3319 Ops/s | 51.3254 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.8034ms | 19.4444ms | 51.4287 Ops/s | 51.3091 Ops/s | |
test_to_module_speed[True] | 1.4213ms | 0.9948ms | 1.0053 KOps/s | 1.0068 KOps/s | |
test_to_module_speed[False] | 1.5789ms | 0.9752ms | 1.0255 KOps/s | 1.0168 KOps/s | |
test_tc_init | 84.2910μs | 39.1895μs | 25.5170 KOps/s | 26.1732 KOps/s | |
test_tc_init_nested | 0.4839ms | 78.9615μs | 12.6644 KOps/s | 12.8657 KOps/s | |
test_tc_first_layer_tensor | 27.0910μs | 0.8525μs | 1.1730 MOps/s | 1.1968 MOps/s | |
test_tc_first_layer_nontensor | 23.8000μs | 2.3665μs | 422.5627 KOps/s | 418.4437 KOps/s | |
test_tc_second_layer_tensor | 10.0233μs | 1.4948μs | 668.9949 KOps/s | 667.4538 KOps/s | |
test_tc_second_layer_nontensor | 0.4049ms | 3.1440μs | 318.0619 KOps/s | 312.8243 KOps/s | |
test_unbind | 0.2252s | 10.3118ms | 96.9764 Ops/s | 140.3213 Ops/s | |
test_full_like | 9.5844ms | 9.1338ms | 109.4834 Ops/s | 105.6893 Ops/s | |
test_zeros_like | 9.4861ms | 7.2149ms | 138.6019 Ops/s | 236.2978 Ops/s | |
test_ones_like | 4.9191ms | 4.1260ms | 242.3636 Ops/s | 231.3831 Ops/s | |
test_clone | 6.7628ms | 6.3624ms | 157.1734 Ops/s | 109.5405 Ops/s | |
test_squeeze | 58.8810μs | 9.7983μs | 102.0582 KOps/s | 105.3883 KOps/s | |
test_unsqueeze | 0.1237ms | 74.6836μs | 13.3898 KOps/s | 13.3095 KOps/s | |
test_split | 0.6043ms | 0.1677ms | 5.9620 KOps/s | 6.1494 KOps/s | |
test_permute | 0.5980ms | 0.1800ms | 5.5566 KOps/s | 5.4843 KOps/s | |
test_stack | 51.2792ms | 50.5913ms | 19.7662 Ops/s | 19.6976 Ops/s | |
test_cat | 51.0398ms | 50.7346ms | 19.7104 Ops/s | 19.7042 Ops/s |
Labels: BC-breaking; bug (Something isn't working); CLA Signed; Refactor (Refactoring code - not a new feature)
We want `torch.min(tensor, dim=dim)` and `torch.min(tensordict, dim=dim)` to be interchangeable.

Currently, `values` is named `vals` so as not to conflict with the method of the same name, and indexing works as it does with a tensordict. This means that code such as `torch.min(smth, ...)[idx]` actually indexes the resulting tensordict, which is surprising.

I therefore suggest we just use `torch.return_types.<type>` instead.
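For reference, here is a minimal sketch of the plain-tensor behaviour the description wants to match. It only uses `torch.min` on a regular tensor; the tensor `t` and its values are made up for illustration, and nothing here shows the tensordict side of the change.

```python
import torch

# Illustrative input (hypothetical values, not taken from the PR).
t = torch.tensor([[3.0, 1.0],
                  [2.0, 5.0]])

# With dim given, torch.min returns a named tuple (torch.return_types.min)
# holding the reduced values and the indices of the minima.
out = torch.min(t, dim=0)

# Field access and positional indexing refer to the same entries:
# out[0] is out.values, out[1] is out.indices.
values, indices = out
print(type(out))
print(out.values)   # minima along dim 0
print(out.indices)  # row index of each minimum
print(torch.equal(out[0], out.values))
```

Because the result is a tuple, `torch.min(smth, ...)[idx]` selects a field (values or indices) rather than indexing into the reduced data, which is the behaviour callers written against plain tensors would expect.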