-
Notifications
You must be signed in to change notification settings - Fork 93
[BugFix] Fix MISSING check in tensorclass #1275
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 97.9210μs | 20.6689μs | 48.3819 KOps/s | 45.2015 KOps/s | |
test_plain_set_stack_nested | 77.0270μs | 20.8560μs | 47.9478 KOps/s | 45.2010 KOps/s | |
test_plain_set_nested_inplace | 56.1950μs | 22.8107μs | 43.8390 KOps/s | 42.1346 KOps/s | |
test_plain_set_stack_nested_inplace | 90.5090μs | 22.5818μs | 44.2834 KOps/s | 41.8619 KOps/s | |
test_items | 30.1560μs | 4.2026μs | 237.9467 KOps/s | 237.8291 KOps/s | |
test_items_nested | 0.5446ms | 0.4038ms | 2.4763 KOps/s | 2.4514 KOps/s | |
test_items_nested_locked | 0.8548ms | 0.4038ms | 2.4766 KOps/s | 2.4390 KOps/s | |
test_items_nested_leaf | 0.1533ms | 76.5137μs | 13.0696 KOps/s | 13.0977 KOps/s | |
test_items_stack_nested | 0.5857ms | 0.4049ms | 2.4699 KOps/s | 2.4406 KOps/s | |
test_items_stack_nested_leaf | 0.1468ms | 77.1210μs | 12.9666 KOps/s | 12.8904 KOps/s | |
test_items_stack_nested_locked | 0.6375ms | 0.4068ms | 2.4583 KOps/s | 2.4377 KOps/s | |
test_keys | 29.9960μs | 3.6806μs | 271.6962 KOps/s | 289.5612 KOps/s | |
test_keys_nested | 0.2893ms | 0.1619ms | 6.1755 KOps/s | 6.0227 KOps/s | |
test_keys_nested_locked | 1.9789ms | 0.1689ms | 5.9200 KOps/s | 5.8398 KOps/s | |
test_keys_nested_leaf | 0.2776ms | 0.1427ms | 7.0090 KOps/s | 6.8907 KOps/s | |
test_keys_stack_nested | 0.2954ms | 0.1633ms | 6.1222 KOps/s | 6.0028 KOps/s | |
test_keys_stack_nested_leaf | 0.2632ms | 0.1442ms | 6.9348 KOps/s | 6.9372 KOps/s | |
test_keys_stack_nested_locked | 0.3016ms | 0.1700ms | 5.8816 KOps/s | 5.7996 KOps/s | |
test_values | 9.5740μs | 1.0376μs | 963.7441 KOps/s | 955.6522 KOps/s | |
test_values_nested | 0.1562ms | 66.5247μs | 15.0320 KOps/s | 15.0651 KOps/s | |
test_values_nested_locked | 0.1591ms | 65.9433μs | 15.1645 KOps/s | 14.2702 KOps/s | |
test_values_nested_leaf | 0.1752ms | 74.0118μs | 13.5114 KOps/s | 13.5440 KOps/s | |
test_values_stack_nested | 0.1748ms | 66.0375μs | 15.1429 KOps/s | 15.0205 KOps/s | |
test_values_stack_nested_leaf | 0.1546ms | 72.9649μs | 13.7052 KOps/s | 13.6280 KOps/s | |
test_values_stack_nested_locked | 0.1515ms | 65.7928μs | 15.1992 KOps/s | 15.1482 KOps/s | |
test_membership | 23.7140μs | 0.8670μs | 1.1534 MOps/s | 1.1246 MOps/s | |
test_membership_nested | 49.5730μs | 2.9248μs | 341.9052 KOps/s | 340.3454 KOps/s | |
test_membership_nested_leaf | 55.0730μs | 2.8876μs | 346.3121 KOps/s | 331.1532 KOps/s | |
test_membership_stacked_nested | 30.2170μs | 2.8598μs | 349.6702 KOps/s | 341.9282 KOps/s | |
test_membership_stacked_nested_leaf | 24.9570μs | 2.9011μs | 344.6933 KOps/s | 341.3192 KOps/s | |
test_membership_nested_last | 49.8740μs | 4.2745μs | 233.9449 KOps/s | 230.6183 KOps/s | |
test_membership_nested_leaf_last | 25.6080μs | 4.2943μs | 232.8651 KOps/s | 229.1624 KOps/s | |
test_membership_stacked_nested_last | 52.2870μs | 4.2911μs | 233.0425 KOps/s | 231.7001 KOps/s | |
test_membership_stacked_nested_leaf_last | 23.4940μs | 4.3339μs | 230.7390 KOps/s | 230.5065 KOps/s | |
test_nested_getleaf | 76.1920μs | 17.4796μs | 57.2096 KOps/s | 57.0861 KOps/s | |
test_nested_get | 79.2180μs | 16.5274μs | 60.5055 KOps/s | 59.9402 KOps/s | |
test_stacked_getleaf | 68.7990μs | 17.2291μs | 58.0414 KOps/s | 57.4393 KOps/s | |
test_stacked_get | 56.1950μs | 16.6292μs | 60.1353 KOps/s | 60.1737 KOps/s | |
test_nested_getitemleaf | 61.6960μs | 18.0604μs | 55.3698 KOps/s | 54.8623 KOps/s | |
test_nested_getitem | 70.1420μs | 17.1170μs | 58.4214 KOps/s | 58.4412 KOps/s | |
test_stacked_getitemleaf | 59.8020μs | 17.9501μs | 55.7099 KOps/s | 55.9705 KOps/s | |
test_stacked_getitem | 61.7460μs | 17.0351μs | 58.7023 KOps/s | 58.6838 KOps/s | |
test_lock_nested | 0.5729ms | 0.4177ms | 2.3943 KOps/s | 2.4084 KOps/s | |
test_lock_stack_nested | 0.7468ms | 0.4255ms | 2.3500 KOps/s | 2.3351 KOps/s | |
test_unlock_nested | 0.4793ms | 0.3429ms | 2.9166 KOps/s | 2.9455 KOps/s | |
test_unlock_stack_nested | 0.4855ms | 0.3452ms | 2.8969 KOps/s | 2.8394 KOps/s | |
test_flatten_speed | 0.2058ms | 99.5273μs | 10.0475 KOps/s | 9.9253 KOps/s | |
test_unflatten_speed | 0.7978ms | 0.5996ms | 1.6678 KOps/s | 1.6826 KOps/s | |
test_common_ops | 5.4891ms | 0.8045ms | 1.2431 KOps/s | 1.1831 KOps/s | |
test_creation | 30.2160μs | 2.5023μs | 399.6253 KOps/s | 394.9098 KOps/s | |
test_creation_empty | 62.3460μs | 11.6482μs | 85.8502 KOps/s | 75.7943 KOps/s | |
test_creation_nested_1 | 45.0050μs | 14.6006μs | 68.4904 KOps/s | 61.8141 KOps/s | |
test_creation_nested_2 | 50.1740μs | 19.2907μs | 51.8383 KOps/s | 47.7303 KOps/s | |
test_clone | 84.2070μs | 13.5935μs | 73.5644 KOps/s | 72.3864 KOps/s | |
test_getitem[int] | 0.8127ms | 12.9286μs | 77.3477 KOps/s | 76.0219 KOps/s | |
test_getitem[slice_int] | 0.1399ms | 25.4231μs | 39.3342 KOps/s | 39.3544 KOps/s | |
test_getitem[range] | 0.1841ms | 50.1779μs | 19.9291 KOps/s | 19.6291 KOps/s | |
test_getitem[tuple] | 0.1580ms | 20.6891μs | 48.3345 KOps/s | 48.3248 KOps/s | |
test_getitem[list] | 0.1693ms | 45.1809μs | 22.1332 KOps/s | 21.6361 KOps/s | |
test_setitem_dim[int] | 50.8350μs | 26.0784μs | 38.3459 KOps/s | 37.6854 KOps/s | |
test_setitem_dim[slice_int] | 81.2320μs | 53.2786μs | 18.7693 KOps/s | 19.3051 KOps/s | |
test_setitem_dim[range] | 0.1277ms | 76.5317μs | 13.0665 KOps/s | 12.9517 KOps/s | |
test_setitem_dim[tuple] | 65.1320μs | 41.3575μs | 24.1794 KOps/s | 24.4818 KOps/s | |
test_setitem | 93.8960μs | 20.8155μs | 48.0411 KOps/s | 45.9842 KOps/s | |
test_set | 79.9490μs | 20.1299μs | 49.6774 KOps/s | 46.8901 KOps/s | |
test_set_shared | 0.3432ms | 0.1775ms | 5.6328 KOps/s | 5.3188 KOps/s | |
test_update | 0.1252ms | 25.9632μs | 38.5161 KOps/s | 36.1651 KOps/s | |
test_update_nested | 0.1174ms | 41.5005μs | 24.0961 KOps/s | 22.9793 KOps/s | |
test_update__nested | 0.5207ms | 34.0532μs | 29.3658 KOps/s | 28.7460 KOps/s | |
test_set_nested | 73.1770μs | 22.0967μs | 45.2556 KOps/s | 43.0662 KOps/s | |
test_set_nested_new | 0.1223ms | 26.9995μs | 37.0377 KOps/s | 36.1658 KOps/s | |
test_select | 0.1204ms | 43.8680μs | 22.7957 KOps/s | 22.2767 KOps/s | |
test_select_nested | 0.3019ms | 64.3903μs | 15.5303 KOps/s | 15.5105 KOps/s | |
test_exclude_nested | 0.2046ms | 79.7106μs | 12.5454 KOps/s | 12.5574 KOps/s | |
test_empty[True] | 0.7673ms | 0.4054ms | 2.4667 KOps/s | 2.4939 KOps/s | |
test_empty[False] | 12.7538μs | 1.3731μs | 728.2835 KOps/s | 718.7341 KOps/s | |
test_unbind_speed | 0.3604ms | 0.2738ms | 3.6525 KOps/s | 3.6070 KOps/s | |
test_unbind_speed_stack0 | 0.4855ms | 0.2711ms | 3.6890 KOps/s | 3.6169 KOps/s | |
test_unbind_speed_stack1 | 0.1121s | 0.7340ms | 1.3625 KOps/s | 1.1772 KOps/s | |
test_split | 0.1123s | 1.7608ms | 567.9173 Ops/s | 620.2549 Ops/s | |
test_chunk | 0.1042s | 1.7455ms | 572.9002 Ops/s | 553.8195 Ops/s | |
test_consolidate_njt[False-None] | 8.5752ms | 8.1077ms | 123.3393 Ops/s | 108.2858 Ops/s | |
test_creation[device0] | 0.2179ms | 90.6296μs | 11.0339 KOps/s | 10.7297 KOps/s | |
test_creation_from_tensor | 4.3777ms | 93.5754μs | 10.6866 KOps/s | 10.2829 KOps/s | |
test_add_one[memmap_tensor0] | 83.1950μs | 4.7599μs | 210.0895 KOps/s | 182.4587 KOps/s | |
test_contiguous[memmap_tensor0] | 12.9440μs | 0.5045μs | 1.9823 MOps/s | 1.9877 MOps/s | |
test_stack[memmap_tensor0] | 26.7900μs | 3.3933μs | 294.6992 KOps/s | 275.2778 KOps/s | |
test_memmaptd_index | 0.3561ms | 0.2320ms | 4.3112 KOps/s | 4.0794 KOps/s | |
test_memmaptd_index_astensor | 0.4797ms | 0.3175ms | 3.1500 KOps/s | 3.0681 KOps/s | |
test_memmaptd_index_op | 1.0289ms | 0.5759ms | 1.7364 KOps/s | 1.5645 KOps/s | |
test_serialize_model | 0.2157s | 0.1328s | 7.5303 Ops/s | 8.4322 Ops/s | |
test_serialize_model_pickle | 0.4928s | 0.3955s | 2.5285 Ops/s | 2.5361 Ops/s | |
test_serialize_weights | 0.1222s | 0.1151s | 8.6888 Ops/s | 8.7370 Ops/s | |
test_serialize_weights_returnearly | 0.1720s | 0.1561s | 6.4050 Ops/s | 6.4282 Ops/s | |
test_serialize_weights_pickle | 0.4778s | 0.3972s | 2.5175 Ops/s | 2.4501 Ops/s | |
test_serialize_weights_filesystem | 0.2545s | 0.1627s | 6.1447 Ops/s | 6.9921 Ops/s | |
test_serialize_model_filesystem | 0.1605s | 0.1533s | 6.5237 Ops/s | 6.3609 Ops/s | |
test_reshape_pytree | 59.5510μs | 26.2557μs | 38.0870 KOps/s | 37.2608 KOps/s | |
test_reshape_td | 88.7360μs | 33.8405μs | 29.5504 KOps/s | 29.5286 KOps/s | |
test_view_pytree | 67.1460μs | 26.1507μs | 38.2399 KOps/s | 37.6274 KOps/s | |
test_view_td | 94.4570μs | 40.1744μs | 24.8915 KOps/s | 24.2463 KOps/s | |
test_unbind_pytree | 86.4020μs | 29.7470μs | 33.6168 KOps/s | 33.2450 KOps/s | |
test_unbind_td | 0.3218ms | 40.6885μs | 24.5770 KOps/s | 24.1218 KOps/s | |
test_split_pytree | 82.9850μs | 29.1811μs | 34.2688 KOps/s | 33.6880 KOps/s | |
test_split_td | 0.5480ms | 46.1117μs | 21.6865 KOps/s | 21.6494 KOps/s | |
test_add_pytree | 86.9530μs | 35.3886μs | 28.2577 KOps/s | 27.3457 KOps/s | |
test_add_td | 0.1599ms | 56.7064μs | 17.6347 KOps/s | 16.3801 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1446ms | 67.5865μs | 14.7959 KOps/s | 14.6266 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3610ms | 0.1710ms | 5.8493 KOps/s | 5.8150 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1046ms | 46.2189μs | 21.6362 KOps/s | 21.4261 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2306ms | 0.1165ms | 8.5854 KOps/s | 8.2174 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 82.9850μs | 28.2989μs | 35.3370 KOps/s | 34.8702 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1598ms | 58.2229μs | 17.1754 KOps/s | 16.6565 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1665ms | 78.7166μs | 12.7038 KOps/s | 12.4398 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1430ms | 65.8191μs | 15.1931 KOps/s | 14.9490 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1857ms | 0.1088ms | 9.1915 KOps/s | 9.2140 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3926ms | 0.2138ms | 4.6771 KOps/s | 4.6178 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1160ms | 47.4397μs | 21.0794 KOps/s | 20.5920 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1755ms | 67.2347μs | 14.8733 KOps/s | 14.7721 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1886ms | 0.1020ms | 9.8003 KOps/s | 9.7625 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4119ms | 0.2018ms | 4.9555 KOps/s | 4.7596 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 1.4928ms | 0.2332ms | 4.2879 KOps/s | 4.2704 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2684ms | 0.1110ms | 9.0099 KOps/s | 9.2464 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2749ms | 66.3591μs | 15.0695 KOps/s | 15.9055 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1388ms | 49.9981μs | 20.0007 KOps/s | 20.5442 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.2943ms | 0.1562ms | 6.4011 KOps/s | 6.1846 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2357ms | 0.1015ms | 9.8512 KOps/s | 9.8664 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 62.5880μs | 22.0259μs | 45.4011 KOps/s | 46.0800 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1605ms | 65.9599μs | 15.1607 KOps/s | 14.9562 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1756ms | 81.7727μs | 12.2290 KOps/s | 12.4319 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1523ms | 67.5639μs | 14.8008 KOps/s | 15.1225 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4181ms | 0.2180ms | 4.5876 KOps/s | 4.5514 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.6940ms | 1.3888ms | 720.0614 Ops/s | 699.3816 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3395ms | 0.2119ms | 4.7191 KOps/s | 4.7335 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.5652ms | 0.8117ms | 1.2320 KOps/s | 1.1661 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.8552ms | 0.4650ms | 2.1508 KOps/s | 2.1325 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.8719ms | 2.6416ms | 378.5515 Ops/s | 339.9561 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1108ms | 40.0221μs | 24.9862 KOps/s | 24.9540 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5872ms | 33.1187μs | 30.1944 KOps/s | 29.6036 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 97.7320μs | 31.6397μs | 31.6059 KOps/s | 31.0812 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 66.2840μs | 22.7357μs | 43.9836 KOps/s | 43.0067 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 96.0200μs | 32.2328μs | 31.0243 KOps/s | 30.4845 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 83.3960μs | 22.7991μs | 43.8613 KOps/s | 42.8164 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1323ms | 53.8801μs | 18.5597 KOps/s | 18.5932 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.2298s | 26.7440μs | 37.3916 KOps/s | 47.4727 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1224ms | 46.4588μs | 21.5244 KOps/s | 20.9125 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 68.7690μs | 18.7771μs | 53.2564 KOps/s | 52.5804 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1133ms | 46.5661μs | 21.4749 KOps/s | 20.5621 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 66.4240μs | 18.7320μs | 53.3847 KOps/s | 53.1525 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1363ms | 54.5068μs | 18.3463 KOps/s | 17.7758 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9478ms | 19.9964μs | 50.0089 KOps/s | 48.0265 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1330ms | 47.0744μs | 21.2430 KOps/s | 20.7340 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 52.7490μs | 18.6805μs | 53.5318 KOps/s | 52.7898 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1387ms | 47.1788μs | 21.1960 KOps/s | 20.5861 KOps/s | |
test_compile_indexing[int-pytree-eager] | 64.1400μs | 18.6790μs | 53.5360 KOps/s | 53.3975 KOps/s | |
test_mod_add[eager] | 90.7290μs | 37.1383μs | 26.9264 KOps/s | 26.5488 KOps/s | |
test_mod_add[compile] | 0.1423ms | 69.1480μs | 14.4617 KOps/s | 15.0000 KOps/s | |
test_mod_add[compile-overhead] | 0.1462ms | 67.1430μs | 14.8936 KOps/s | 14.7509 KOps/s | |
test_mod_wrap[eager] | 0.4068ms | 0.2228ms | 4.4880 KOps/s | 4.3028 KOps/s | |
test_mod_wrap[compile] | 2.4201ms | 0.2317ms | 4.3167 KOps/s | 4.2739 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4111ms | 0.2273ms | 4.4002 KOps/s | 4.3288 KOps/s | |
test_mod_wrap_and_backward[eager] | 18.7932ms | 14.0101ms | 71.3770 Ops/s | 87.4374 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.5789ms | 11.8319ms | 84.5175 Ops/s | 90.3069 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 13.8273ms | 11.4476ms | 87.3545 Ops/s | 89.6929 Ops/s | |
test_seq_add[eager] | 0.2231ms | 0.1196ms | 8.3607 KOps/s | 8.1578 KOps/s | |
test_seq_add[compile] | 0.1795ms | 82.3088μs | 12.1494 KOps/s | 12.4206 KOps/s | |
test_seq_add[compile-overhead] | 0.1601ms | 78.5065μs | 12.7378 KOps/s | 12.6530 KOps/s | |
test_seq_wrap[eager] | 0.6666ms | 0.4437ms | 2.2539 KOps/s | 2.1118 KOps/s | |
test_seq_wrap[compile] | 0.3696ms | 0.2469ms | 4.0507 KOps/s | 3.9813 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4819ms | 0.2481ms | 4.0298 KOps/s | 3.9918 KOps/s | |
test_func_call_runtime[False-eager] | 0.8402ms | 0.5416ms | 1.8463 KOps/s | 1.7961 KOps/s | |
test_func_call_runtime[False-compile] | 0.6097ms | 0.4465ms | 2.2395 KOps/s | 2.2223 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.6456ms | 0.4462ms | 2.2413 KOps/s | 2.2356 KOps/s | |
test_func_call_runtime[True-eager] | 1.5396ms | 0.7691ms | 1.3002 KOps/s | 1.3095 KOps/s | |
test_func_call_runtime[True-compile] | 0.8405ms | 0.4685ms | 2.1346 KOps/s | 2.1506 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.9853ms | 0.4702ms | 2.1269 KOps/s | 2.1062 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7741ms | 0.5413ms | 1.8474 KOps/s | 1.8380 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6457ms | 0.4453ms | 2.2456 KOps/s | 2.2374 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5442ms | 0.4408ms | 2.2684 KOps/s | 2.2113 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0970ms | 0.8994ms | 1.1118 KOps/s | 1.0931 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.0810ms | 0.7984ms | 1.2524 KOps/s | 1.2324 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1112ms | 0.8069ms | 1.2393 KOps/s | 1.2220 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6450ms | 1.9016ms | 525.8671 Ops/s | 509.7878 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.6706ms | 0.5422ms | 1.8442 KOps/s | 1.8366 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.9111ms | 0.5398ms | 1.8526 KOps/s | 1.8329 KOps/s | |
test_distributed | 0.2827ms | 0.1260ms | 7.9379 KOps/s | 7.7273 KOps/s | |
test_tdmodule | 46.8270μs | 27.6853μs | 36.1203 KOps/s | 34.1607 KOps/s | |
test_tdmodule_dispatch | 96.6710μs | 50.3792μs | 19.8495 KOps/s | 18.6375 KOps/s | |
test_tdseq | 65.0820μs | 29.7773μs | 33.5826 KOps/s | 31.9849 KOps/s | |
test_tdseq_dispatch | 88.2250μs | 55.5419μs | 18.0044 KOps/s | 17.0723 KOps/s | |
test_instantiation_functorch | 1.7750ms | 1.5300ms | 653.6147 Ops/s | 636.9904 Ops/s | |
test_exec_functorch | 0.3189ms | 0.1770ms | 5.6482 KOps/s | 5.2933 KOps/s | |
test_exec_functional_call | 0.3456ms | 0.1718ms | 5.8200 KOps/s | 5.7430 KOps/s | |
test_exec_td_decorator | 0.6370ms | 0.2394ms | 4.1768 KOps/s | 4.1884 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8593ms | 0.6571ms | 1.5217 KOps/s | 1.4876 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9323ms | 0.6557ms | 1.5252 KOps/s | 1.4933 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7630ms | 0.5318ms | 1.8806 KOps/s | 1.8539 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.9007ms | 0.5352ms | 1.8684 KOps/s | 1.8415 KOps/s | |
test_to_module_speed[True] | 1.7955ms | 1.3249ms | 754.7772 Ops/s | 740.9540 Ops/s | |
test_to_module_speed[False] | 1.8836ms | 1.3024ms | 767.8322 Ops/s | 752.6489 Ops/s | |
test_tc_init | 98.7750μs | 46.4103μs | 21.5469 KOps/s | 20.3866 KOps/s | |
test_tc_init_nested | 0.1842ms | 92.7862μs | 10.7775 KOps/s | 10.1745 KOps/s | |
test_tc_first_layer_tensor | 45.4340μs | 1.6833μs | 594.0802 KOps/s | 554.5528 KOps/s | |
test_tc_first_layer_nontensor | 24.2760μs | 4.9402μs | 202.4214 KOps/s | 195.6476 KOps/s | |
test_tc_second_layer_tensor | 39.6140μs | 3.1712μs | 315.3411 KOps/s | 294.1941 KOps/s | |
test_tc_second_layer_nontensor | 36.1480μs | 6.5752μs | 152.0860 KOps/s | 149.4568 KOps/s | |
test_unbind | 0.2601s | 14.1132ms | 70.8557 Ops/s | 60.2425 Ops/s | |
test_full_like | 4.0964ms | 3.4791ms | 287.4320 Ops/s | 250.3656 Ops/s | |
test_zeros_like | 11.2965ms | 4.9001ms | 204.0778 Ops/s | 290.7855 Ops/s | |
test_ones_like | 5.7759ms | 3.6461ms | 274.2694 Ops/s | 181.9080 Ops/s | |
test_clone | 9.8340ms | 7.4946ms | 133.4294 Ops/s | 167.2072 Ops/s | |
test_squeeze | 66.2640μs | 12.3745μs | 80.8112 KOps/s | 77.9249 KOps/s | |
test_unsqueeze | 0.2992ms | 95.7211μs | 10.4470 KOps/s | 10.5336 KOps/s | |
test_split | 0.3282ms | 0.1971ms | 5.0745 KOps/s | 5.0008 KOps/s | |
test_permute | 0.2929ms | 0.2068ms | 4.8355 KOps/s | 4.9320 KOps/s | |
test_stack | 30.8337ms | 25.9944ms | 38.4698 Ops/s | 35.3187 Ops/s | |
test_cat | 31.2377ms | 26.0626ms | 38.3692 Ops/s | 36.4974 Ops/s |
b0d5c82
to
9ac1806
Compare
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 44.7210μs | 13.7171μs | 72.9015 KOps/s | 73.3867 KOps/s | |
test_plain_set_stack_nested | 41.7100μs | 13.9607μs | 71.6296 KOps/s | 72.6313 KOps/s | |
test_plain_set_nested_inplace | 41.4910μs | 14.6956μs | 68.0475 KOps/s | 67.9308 KOps/s | |
test_plain_set_stack_nested_inplace | 40.9600μs | 14.9053μs | 67.0902 KOps/s | 67.6126 KOps/s | |
test_items | 32.3300μs | 2.8446μs | 351.5460 KOps/s | 345.7650 KOps/s | |
test_items_nested | 0.4075ms | 0.3632ms | 2.7537 KOps/s | 2.7258 KOps/s | |
test_items_nested_locked | 0.4470ms | 0.3650ms | 2.7400 KOps/s | 2.7185 KOps/s | |
test_items_nested_leaf | 87.2510μs | 60.3186μs | 16.5786 KOps/s | 16.5390 KOps/s | |
test_items_stack_nested | 0.5399ms | 0.3643ms | 2.7448 KOps/s | 2.7597 KOps/s | |
test_items_stack_nested_leaf | 97.0800μs | 60.2915μs | 16.5861 KOps/s | 16.6000 KOps/s | |
test_items_stack_nested_locked | 0.4110ms | 0.3670ms | 2.7251 KOps/s | 2.7549 KOps/s | |
test_keys | 29.2710μs | 3.4273μs | 291.7745 KOps/s | 290.2850 KOps/s | |
test_keys_nested | 0.1226ms | 88.4499μs | 11.3058 KOps/s | 11.4676 KOps/s | |
test_keys_nested_locked | 0.7746ms | 94.4610μs | 10.5864 KOps/s | 10.6701 KOps/s | |
test_keys_nested_leaf | 0.1092ms | 79.4933μs | 12.5797 KOps/s | 12.6118 KOps/s | |
test_keys_stack_nested | 0.1121ms | 88.9681μs | 11.2400 KOps/s | 11.4010 KOps/s | |
test_keys_stack_nested_leaf | 0.1205ms | 78.8354μs | 12.6847 KOps/s | 12.7230 KOps/s | |
test_keys_stack_nested_locked | 0.1551ms | 94.2524μs | 10.6098 KOps/s | 10.5999 KOps/s | |
test_values | 4.9902μs | 0.8520μs | 1.1737 MOps/s | 1.1773 MOps/s | |
test_values_nested | 59.3200μs | 38.6615μs | 25.8655 KOps/s | 26.1604 KOps/s | |
test_values_nested_locked | 72.1900μs | 40.6587μs | 24.5950 KOps/s | 24.2439 KOps/s | |
test_values_nested_leaf | 66.7200μs | 43.8754μs | 22.7918 KOps/s | 23.2910 KOps/s | |
test_values_stack_nested | 73.2910μs | 38.4886μs | 25.9817 KOps/s | 26.1026 KOps/s | |
test_values_stack_nested_leaf | 94.6810μs | 43.9047μs | 22.7766 KOps/s | 23.0700 KOps/s | |
test_values_stack_nested_locked | 76.6410μs | 40.8690μs | 24.4685 KOps/s | 24.1461 KOps/s | |
test_membership | 1.7040μs | 0.4982μs | 2.0073 MOps/s | 1.9691 MOps/s | |
test_membership_nested | 18.9300μs | 2.0203μs | 494.9713 KOps/s | 492.0500 KOps/s | |
test_membership_nested_leaf | 15.2750μs | 1.9951μs | 501.2351 KOps/s | 498.4896 KOps/s | |
test_membership_stacked_nested | 39.2800μs | 2.1207μs | 471.5323 KOps/s | 486.7427 KOps/s | |
test_membership_stacked_nested_leaf | 23.2110μs | 2.0986μs | 476.5116 KOps/s | 485.0429 KOps/s | |
test_membership_nested_last | 36.9700μs | 3.0948μs | 323.1250 KOps/s | 329.9771 KOps/s | |
test_membership_nested_leaf_last | 25.1210μs | 3.0841μs | 324.2481 KOps/s | 331.2656 KOps/s | |
test_membership_stacked_nested_last | 28.2600μs | 3.0946μs | 323.1475 KOps/s | 330.6410 KOps/s | |
test_membership_stacked_nested_leaf_last | 30.4500μs | 3.0942μs | 323.1804 KOps/s | 330.2620 KOps/s | |
test_nested_getleaf | 36.9700μs | 13.4640μs | 74.2721 KOps/s | 74.1773 KOps/s | |
test_nested_get | 41.5200μs | 12.8832μs | 77.6206 KOps/s | 78.0401 KOps/s | |
test_stacked_getleaf | 44.3210μs | 13.5509μs | 73.7961 KOps/s | 74.5629 KOps/s | |
test_stacked_get | 55.4110μs | 12.6903μs | 78.8001 KOps/s | 78.3036 KOps/s | |
test_nested_getitemleaf | 41.0700μs | 13.7579μs | 72.6853 KOps/s | 72.3654 KOps/s | |
test_nested_getitem | 36.1100μs | 12.9513μs | 77.2122 KOps/s | 76.4031 KOps/s | |
test_stacked_getitemleaf | 40.7900μs | 13.8386μs | 72.2615 KOps/s | 73.1707 KOps/s | |
test_stacked_getitem | 33.9100μs | 12.9789μs | 77.0484 KOps/s | 77.0897 KOps/s | |
test_lock_nested | 7.9925ms | 0.3545ms | 2.8206 KOps/s | 2.9929 KOps/s | |
test_lock_stack_nested | 0.4123ms | 0.3553ms | 2.8145 KOps/s | 2.9061 KOps/s | |
test_unlock_nested | 0.4913ms | 0.2904ms | 3.4436 KOps/s | 3.5895 KOps/s | |
test_unlock_stack_nested | 0.3343ms | 0.2936ms | 3.4063 KOps/s | 3.5503 KOps/s | |
test_flatten_speed | 0.1121ms | 77.6988μs | 12.8702 KOps/s | 12.7945 KOps/s | |
test_unflatten_speed | 0.4691ms | 0.4029ms | 2.4817 KOps/s | 2.4956 KOps/s | |
test_common_ops | 0.7908ms | 0.6642ms | 1.5055 KOps/s | 1.5387 KOps/s | |
test_creation | 0.1161ms | 1.7358μs | 576.0967 KOps/s | 566.7282 KOps/s | |
test_creation_empty | 33.9400μs | 9.9282μs | 100.7234 KOps/s | 102.7660 KOps/s | |
test_creation_nested_1 | 34.7700μs | 11.5612μs | 86.4960 KOps/s | 87.7593 KOps/s | |
test_creation_nested_2 | 38.6700μs | 14.1916μs | 70.4640 KOps/s | 70.8564 KOps/s | |
test_clone | 52.3810μs | 11.0635μs | 90.3870 KOps/s | 94.3221 KOps/s | |
test_getitem[int] | 1.2382ms | 11.0509μs | 90.4907 KOps/s | 95.9278 KOps/s | |
test_getitem[slice_int] | 0.1129ms | 21.6268μs | 46.2390 KOps/s | 48.3728 KOps/s | |
test_getitem[range] | 0.1338ms | 40.2159μs | 24.8658 KOps/s | 26.9638 KOps/s | |
test_getitem[tuple] | 0.1080ms | 18.6458μs | 53.6314 KOps/s | 55.4509 KOps/s | |
test_getitem[list] | 0.1520ms | 34.5280μs | 28.9620 KOps/s | 29.4208 KOps/s | |
test_setitem_dim[int] | 40.0300μs | 20.9840μs | 47.6552 KOps/s | 50.6816 KOps/s | |
test_setitem_dim[slice_int] | 60.9010μs | 40.0121μs | 24.9924 KOps/s | 25.6874 KOps/s | |
test_setitem_dim[range] | 79.6300μs | 55.4015μs | 18.0500 KOps/s | 19.1790 KOps/s | |
test_setitem_dim[tuple] | 58.2610μs | 33.9737μs | 29.4346 KOps/s | 30.3454 KOps/s | |
test_setitem | 64.3600μs | 16.5405μs | 60.4578 KOps/s | 61.9124 KOps/s | |
test_set | 56.6210μs | 16.0075μs | 62.4707 KOps/s | 64.8116 KOps/s | |
test_set_shared | 0.5894ms | 0.1594ms | 6.2751 KOps/s | 6.3264 KOps/s | |
test_update | 0.2498ms | 21.9030μs | 45.6558 KOps/s | 47.2965 KOps/s | |
test_update_nested | 78.2110μs | 31.2603μs | 31.9894 KOps/s | 33.5482 KOps/s | |
test_update__nested | 0.5360ms | 26.3268μs | 37.9841 KOps/s | 39.4823 KOps/s | |
test_set_nested | 63.9900μs | 17.6107μs | 56.7836 KOps/s | 59.6416 KOps/s | |
test_set_nested_new | 68.5610μs | 19.6026μs | 51.0137 KOps/s | 52.6213 KOps/s | |
test_select | 83.0510μs | 31.3971μs | 31.8501 KOps/s | 33.0928 KOps/s | |
test_select_nested | 68.4000μs | 43.6513μs | 22.9088 KOps/s | 22.7611 KOps/s | |
test_exclude_nested | 0.1007ms | 62.9658μs | 15.8816 KOps/s | 15.7101 KOps/s | |
test_empty[True] | 0.3544ms | 0.2976ms | 3.3597 KOps/s | 3.4043 KOps/s | |
test_empty[False] | 3.9620μs | 0.8301μs | 1.2046 MOps/s | 1.2045 MOps/s | |
test_to | 85.9510μs | 57.5649μs | 17.3717 KOps/s | 17.2727 KOps/s | |
test_to_nonblocking | 89.6810μs | 49.9126μs | 20.0350 KOps/s | 21.0000 KOps/s | |
test_unbind_speed | 0.2836ms | 0.2512ms | 3.9817 KOps/s | 4.2227 KOps/s | |
test_unbind_speed_stack0 | 0.2995ms | 0.2495ms | 4.0074 KOps/s | 4.1825 KOps/s | |
test_unbind_speed_stack1 | 92.4573ms | 0.8330ms | 1.2005 KOps/s | 1.2246 KOps/s | |
test_split | 95.0917ms | 1.6498ms | 606.1497 Ops/s | 629.6113 Ops/s | |
test_chunk | 94.1307ms | 1.6461ms | 607.5060 Ops/s | 619.5582 Ops/s | |
test_consolidate[False-None] | 3.1127ms | 2.7770ms | 360.0972 Ops/s | 360.3579 Ops/s | |
test_consolidate[default-None] | 2.2158ms | 1.7831ms | 560.8082 Ops/s | 586.4788 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8840ms | 1.8002ms | 555.5013 Ops/s | 570.7101 Ops/s | |
test_consolidate_njt[False-None] | 7.0865ms | 6.7453ms | 148.2511 Ops/s | 151.6740 Ops/s | |
test_to[False-False-None] | 1.8613ms | 1.7818ms | 561.2266 Ops/s | 573.6022 Ops/s | |
test_to[True-False-None] | 1.6677ms | 1.3967ms | 715.9933 Ops/s | 742.1242 Ops/s | |
test_to[within-False-None] | 4.5834ms | 4.3496ms | 229.9039 Ops/s | 236.5746 Ops/s | |
test_to[True-default-None] | 5.7829ms | 5.4750ms | 182.6485 Ops/s | 185.6718 Ops/s | |
test_to_njt[False-False-None] | 7.3358ms | 7.1166ms | 140.5159 Ops/s | 143.5810 Ops/s | |
test_to_njt[True-False-None] | 5.8735ms | 5.6770ms | 176.1500 Ops/s | 179.0850 Ops/s | |
test_to_njt[within-False-None] | 12.7438ms | 12.4195ms | 80.5185 Ops/s | 83.2038 Ops/s | |
test_creation[device0] | 0.5453ms | 81.3466μs | 12.2931 KOps/s | 12.4241 KOps/s | |
test_creation_from_tensor | 0.4452ms | 83.8238μs | 11.9298 KOps/s | 11.9412 KOps/s | |
test_add_one[memmap_tensor0] | 0.2526ms | 7.2153μs | 138.5944 KOps/s | 148.9788 KOps/s | |
test_contiguous[memmap_tensor0] | 1.9905μs | 0.4306μs | 2.3225 MOps/s | 2.2787 MOps/s | |
test_stack[memmap_tensor0] | 36.9110μs | 4.9320μs | 202.7581 KOps/s | 234.7144 KOps/s | |
test_memmaptd_index | 1.4861ms | 0.2643ms | 3.7838 KOps/s | 4.1236 KOps/s | |
test_memmaptd_index_astensor | 0.4765ms | 0.3244ms | 3.0824 KOps/s | 3.2565 KOps/s | |
test_memmaptd_index_op | 0.8935ms | 0.6509ms | 1.5363 KOps/s | 1.6455 KOps/s | |
test_serialize_model | 0.1331s | 0.1317s | 7.5929 Ops/s | 7.6127 Ops/s | |
test_serialize_model_pickle | 1.3509s | 1.2136s | 0.8240 Ops/s | 0.8410 Ops/s | |
test_serialize_weights | 0.1328s | 0.1311s | 7.6249 Ops/s | 7.6426 Ops/s | |
test_serialize_weights_returnearly | 0.5944s | 64.3229ms | 15.5466 Ops/s | 14.2744 Ops/s | |
test_serialize_weights_pickle | 1.4067s | 1.2309s | 0.8124 Ops/s | 0.8183 Ops/s | |
test_reshape_pytree | 0.4117ms | 22.1625μs | 45.1212 KOps/s | 44.6738 KOps/s | |
test_reshape_td | 60.4400μs | 26.4483μs | 37.8097 KOps/s | 36.7972 KOps/s | |
test_view_pytree | 61.9100μs | 22.1530μs | 45.1407 KOps/s | 45.7580 KOps/s | |
test_view_td | 0.4240ms | 34.3180μs | 29.1393 KOps/s | 30.5852 KOps/s | |
test_unbind_pytree | 64.2500μs | 28.6710μs | 34.8784 KOps/s | 35.5705 KOps/s | |
test_unbind_td | 0.8317ms | 37.7139μs | 26.5154 KOps/s | 26.5249 KOps/s | |
test_split_pytree | 72.0400μs | 30.3029μs | 33.0002 KOps/s | 33.4192 KOps/s | |
test_split_td | 0.9753ms | 39.6975μs | 25.1905 KOps/s | 25.1228 KOps/s | |
test_add_pytree | 87.6800μs | 36.1586μs | 27.6560 KOps/s | 28.4613 KOps/s | |
test_add_td | 0.4358ms | 52.3804μs | 19.0911 KOps/s | 19.0339 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1841ms | 0.1237ms | 8.0828 KOps/s | 7.8719 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.5389ms | 0.1336ms | 7.4852 KOps/s | 7.3773 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1322ms | 97.3024μs | 10.2772 KOps/s | 10.2642 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.3544ms | 0.1515ms | 6.6009 KOps/s | 6.6769 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.4245ms | 25.4290μs | 39.3252 KOps/s | 40.3297 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.4237ms | 29.2702μs | 34.1645 KOps/s | 33.7001 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.4730ms | 63.2524μs | 15.8097 KOps/s | 15.3771 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1035ms | 48.6625μs | 20.5497 KOps/s | 20.3829 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1853ms | 0.1433ms | 6.9766 KOps/s | 7.0052 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3164ms | 0.2182ms | 4.5832 KOps/s | 4.6656 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1339ms | 98.1874μs | 10.1846 KOps/s | 10.1881 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1382ms | 56.2155μs | 17.7887 KOps/s | 17.8961 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1896ms | 0.1372ms | 7.2903 KOps/s | 7.2666 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5831ms | 0.4848ms | 2.0629 KOps/s | 2.0633 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3900ms | 0.2632ms | 3.7999 KOps/s | 3.8137 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2400ms | 0.1442ms | 6.9324 KOps/s | 6.9138 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1830ms | 67.9707μs | 14.7122 KOps/s | 14.6859 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1396ms | 99.1207μs | 10.0887 KOps/s | 10.0182 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5337ms | 0.4141ms | 2.4150 KOps/s | 2.4702 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1867ms | 0.1367ms | 7.3129 KOps/s | 7.3444 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1309ms | 20.2473μs | 49.3893 KOps/s | 47.4887 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 57.0500μs | 31.1482μs | 32.1046 KOps/s | 31.9246 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1070ms | 70.0532μs | 14.2749 KOps/s | 14.1577 KOps/s | |
test_compile_copy_flat[pytree-eager] | 80.5210μs | 52.1312μs | 19.1824 KOps/s | 19.0997 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6450ms | 0.3949ms | 2.5325 KOps/s | 2.1937 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.9951ms | 2.8322ms | 353.0809 Ops/s | 370.7267 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6119ms | 0.3861ms | 2.5898 KOps/s | 2.2269 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8109ms | 2.7469ms | 364.0423 Ops/s | 374.0545 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.3187ms | 0.1158ms | 8.6378 KOps/s | 8.4794 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5888ms | 84.3167μs | 11.8600 KOps/s | 12.2026 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.3854ms | 0.1100ms | 9.0934 KOps/s | 8.8595 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1910ms | 70.8185μs | 14.1206 KOps/s | 13.9262 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1640ms | 0.1158ms | 8.6345 KOps/s | 8.7329 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1162ms | 73.7331μs | 13.5624 KOps/s | 13.7919 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1389ms | 0.1005ms | 9.9456 KOps/s | 9.9242 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1554ms | 18.3253μs | 54.5693 KOps/s | 57.1685 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1290ms | 95.7341μs | 10.4456 KOps/s | 10.3101 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 48.0100μs | 16.3828μs | 61.0398 KOps/s | 63.0813 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1448ms | 98.6699μs | 10.1348 KOps/s | 10.2652 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 43.3910μs | 16.1983μs | 61.7348 KOps/s | 64.4243 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1566ms | 0.1018ms | 9.8203 KOps/s | 9.8551 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6149ms | 18.0271μs | 55.4719 KOps/s | 56.9494 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2310ms | 97.7557μs | 10.2296 KOps/s | 9.9574 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 47.5600μs | 16.4281μs | 60.8712 KOps/s | 64.6617 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1391ms | 97.3428μs | 10.2730 KOps/s | 10.0718 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.4083ms | 16.3368μs | 61.2113 KOps/s | 64.5032 KOps/s | |
test_mod_add[eager] | 91.3900μs | 43.7671μs | 22.8482 KOps/s | 23.6676 KOps/s | |
test_mod_add[compile] | 0.1276ms | 85.7275μs | 11.6649 KOps/s | 12.1176 KOps/s | |
test_mod_add[compile-overhead] | 0.3329ms | 0.1731ms | 5.7778 KOps/s | 5.6085 KOps/s | |
test_mod_wrap[eager] | 0.3367ms | 0.2672ms | 3.7429 KOps/s | 3.8821 KOps/s | |
test_mod_wrap[compile] | 0.6393ms | 0.3001ms | 3.3322 KOps/s | 3.4602 KOps/s | |
test_mod_wrap[compile-overhead] | 7.0354ms | 3.7710ms | 265.1841 Ops/s | 268.1589 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6100ms | 1.4012ms | 713.6980 Ops/s | 681.9278 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4253ms | 1.3094ms | 763.7151 Ops/s | 720.3177 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3962ms | 0.9391ms | 1.0648 KOps/s | 923.7563 Ops/s | |
test_seq_add[eager] | 0.1915ms | 0.1250ms | 8.0012 KOps/s | 7.9674 KOps/s | |
test_seq_add[compile] | 0.1486ms | 91.2870μs | 10.9545 KOps/s | 10.5659 KOps/s | |
test_seq_add[compile-overhead] | 0.1762ms | 0.1325ms | 7.5470 KOps/s | 7.6217 KOps/s | |
test_seq_wrap[eager] | 0.5132ms | 0.4465ms | 2.2395 KOps/s | 2.1770 KOps/s | |
test_seq_wrap[compile] | 0.3853ms | 0.3125ms | 3.2001 KOps/s | 3.1595 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2809ms | 0.2378ms | 4.2050 KOps/s | 4.3436 KOps/s | |
test_func_call_runtime[False-eager] | 0.8280ms | 0.7678ms | 1.3024 KOps/s | 1.3486 KOps/s | |
test_func_call_runtime[False-compile] | 0.8767ms | 0.8021ms | 1.2467 KOps/s | 1.3383 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4343ms | 0.3696ms | 2.7056 KOps/s | 2.7181 KOps/s | |
test_func_call_runtime[True-eager] | 1.0134ms | 0.9280ms | 1.0776 KOps/s | 1.0955 KOps/s | |
test_func_call_runtime[True-compile] | 0.8962ms | 0.7926ms | 1.2616 KOps/s | 1.2920 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4467ms | 0.3924ms | 2.5483 KOps/s | 2.5097 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9432ms | 0.7497ms | 1.3338 KOps/s | 1.3447 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8358ms | 0.7743ms | 1.2914 KOps/s | 1.3062 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4259ms | 0.3745ms | 2.6704 KOps/s | 2.6393 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1193ms | 1.0311ms | 969.8755 Ops/s | 972.0787 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.1335ms | 1.0171ms | 983.1668 Ops/s | 963.3383 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1356ms | 1.0367ms | 964.5977 Ops/s | 962.8204 Ops/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5661ms | 2.1500ms | 465.1239 Ops/s | 470.9823 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9659ms | 0.8643ms | 1.1570 KOps/s | 1.1955 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4700ms | 0.4235ms | 2.3615 KOps/s | 2.3709 KOps/s | |
test_distributed | 2.7909ms | 0.2422ms | 4.1292 KOps/s | 8.6675 KOps/s | |
test_tdmodule | 44.6910μs | 22.4433μs | 44.5567 KOps/s | 45.1764 KOps/s | |
test_tdmodule_dispatch | 68.9110μs | 40.0476μs | 24.9703 KOps/s | 25.0068 KOps/s | |
test_tdseq | 44.6500μs | 22.7940μs | 43.8711 KOps/s | 42.9991 KOps/s | |
test_tdseq_dispatch | 75.3400μs | 42.8215μs | 23.3527 KOps/s | 22.5665 KOps/s | |
test_instantiation_functorch | 1.6446ms | 1.5662ms | 638.5039 Ops/s | 648.7028 Ops/s | |
test_exec_functorch | 0.2324ms | 0.1496ms | 6.6839 KOps/s | 6.9029 KOps/s | |
test_exec_functional_call | 0.1966ms | 0.1435ms | 6.9702 KOps/s | 7.3380 KOps/s | |
test_exec_td_decorator | 0.4021ms | 0.1945ms | 5.1418 KOps/s | 5.3005 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8790ms | 0.7015ms | 1.4255 KOps/s | 1.4350 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8429ms | 0.6993ms | 1.4301 KOps/s | 1.4284 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7318ms | 0.6091ms | 1.6417 KOps/s | 1.6612 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7264ms | 0.6087ms | 1.6429 KOps/s | 1.6461 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.1675ms | 19.5461ms | 51.1610 Ops/s | 51.7959 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.6324ms | 19.5180ms | 51.2348 Ops/s | 51.7528 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.4745ms | 19.3335ms | 51.7238 Ops/s | 52.2647 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.4394ms | 19.3715ms | 51.6221 Ops/s | 52.1214 Ops/s | |
test_to_module_speed[True] | 1.4485ms | 0.9691ms | 1.0319 KOps/s | 1.0325 KOps/s | |
test_to_module_speed[False] | 1.0361ms | 0.9562ms | 1.0458 KOps/s | 1.0456 KOps/s | |
test_tc_init | 67.9010μs | 38.6690μs | 25.8605 KOps/s | 26.4893 KOps/s | |
test_tc_init_nested | 0.1192ms | 79.1525μs | 12.6338 KOps/s | 13.1101 KOps/s | |
test_tc_first_layer_tensor | 23.9700μs | 0.9584μs | 1.0434 MOps/s | 1.1396 MOps/s | |
test_tc_first_layer_nontensor | 28.1800μs | 2.4493μs | 408.2762 KOps/s | 402.4074 KOps/s | |
test_tc_second_layer_tensor | 13.7570μs | 1.7559μs | 569.4960 KOps/s | 540.5025 KOps/s | |
test_tc_second_layer_nontensor | 27.4300μs | 3.3577μs | 297.8238 KOps/s | 293.8103 KOps/s | |
test_unbind | 0.2152s | 11.9934ms | 83.3795 Ops/s | 143.7281 Ops/s | |
test_full_like | 9.2405ms | 7.2872ms | 137.2265 Ops/s | 113.8462 Ops/s | |
test_zeros_like | 9.0693ms | 7.1673ms | 139.5219 Ops/s | 231.4662 Ops/s | |
test_ones_like | 4.9855ms | 4.3288ms | 231.0128 Ops/s | 231.5418 Ops/s | |
test_clone | 6.7900ms | 6.3344ms | 157.8672 Ops/s | 157.1815 Ops/s | |
test_squeeze | 58.5300μs | 10.0550μs | 99.4534 KOps/s | 104.2839 KOps/s | |
test_unsqueeze | 0.1230ms | 72.4242μs | 13.8075 KOps/s | 13.6013 KOps/s | |
test_split | 0.3748ms | 0.1625ms | 6.1535 KOps/s | 6.5137 KOps/s | |
test_permute | 0.2325ms | 0.1880ms | 5.3183 KOps/s | 5.4250 KOps/s | |
test_stack | 50.7436ms | 50.3918ms | 19.8445 Ops/s | 20.0229 Ops/s | |
test_cat | 50.5414ms | 50.1643ms | 19.9345 Ops/s | 20.1400 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
bug
Something isn't working
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.