Fix ACT temporal ensembling #319

alexander-soare · 2024-07-12T12:55:09Z

What this does

Fixes an issue with the weighting scheme for the temporal ensembling:

The fix is to directly use the exponential weighting scheme referred to in Algo2 of https://arxiv.org/abs/2304.13705

Here I implement it in an online update fashion so we don't have the ugliness of storing a cache of actions.

How it was tested

I added a test for CI.

I tried eval'ing https://huggingface.co/lerobot/act_aloha_sim_transfer_cube_human/tree/main for 500 episodes with temporal ensembling.

Edit: There was a bug in ACT so this table and subsequent commentary was edited on 17 July with the bug fixed.

Setup	Success rate
Without temporal ensembling	87.6%
Prior implementation α=0.99	72.6%
This implementation m=0.1	63.8%
This implementation m=0.01	73.8%
This implementation m=0	76.8%
This implementation m=-0.01	79.0%
This implementation m=-0.1	58.6%
n_action_steps=1 with no ensembling (50 episodes only)	2.0%

Here's an episode for m=0.01:

eval_episode_2.mp4

How to checkout & try? (for the reviewer)

Try it with python lerobot/scripts/eval.py -p lerobot/act_aloha_sim_transfer_cube_human eval.n_episodes=10 eval.batch_size=10 +policy.temporal_ensemble_coeff=0.01 policy.n_action_steps=1

alexander-soare · 2024-07-12T12:57:06Z

tests/test_policies.py

+        online_avg = ensembler.update(actions)
+        # Simple offline calculation: avg = Σ(aᵢ*wᵢ) / Σ(wᵢ).
+        # Note: The complicated bit here is the slicing. Think about the (episode_length, chunk_size) grid.
+        # What we want to do is take diagonal slices across it starting from the left.


FYI: I think this gets a little hairy for a "simple" test, but I really wanted to make sure it's properly checked. I hope the explanation is enough to make the reviewer feel comfortable that this test is doing what it's supposed to. Perhaps it's enough to know that we do the same thing with two approaches and get the same answer.

alexander-soare · 2024-07-12T12:57:37Z

@Alternmill for review please.

lerobot/common/policies/act/configuration_act.py

lerobot/common/policies/act/modeling_act.py

…sembling

Cadene

Thanks!

It seems that this change isn't backward compatible, no?

If this is the case, I am fine with that, but we should warn people on discord that they can't load a checkpoint that has temporal_ensemble_momentum in the config. And ideally provide a minimal procedure to update their checkpoint config.
Also, I am wondering why this backward compatibility breaking change is not captured in our unit tests when we load a model checkpoint.

alexander-soare · 2024-07-15T14:02:30Z

@Cadene it doesn't break unit tests because of this

lerobot/lerobot/common/policies/factory.py

Lines 25 to 44 in 5ffcb48

    
           def _policy_cfg_from_hydra_cfg(policy_cfg_class, hydra_cfg): 
        
               expected_kwargs = set(inspect.signature(policy_cfg_class).parameters) 
        
               if not set(hydra_cfg.policy).issuperset(expected_kwargs): 
        
                   logging.warning( 
        
                       f"Hydra config is missing arguments: {set(expected_kwargs).difference(hydra_cfg.policy)}" 
        
                   ) 
        
               # OmegaConf.to_container returns lists where sequences are found, but our dataclasses use tuples to avoid 
        
               # issues with mutable defaults. This filter changes all lists to tuples. 
        
               def list_to_tuple(item): 
        
                   return tuple(item) if isinstance(item, list) else item 
        
               policy_cfg = policy_cfg_class( 
        
                   **{ 
        
                       k: list_to_tuple(v) 
        
                       for k, v in OmegaConf.to_container(hydra_cfg.policy, resolve=True).items() 
        
                       if k in expected_kwargs 
        
                   } 
        
               ) 
        
               return policy_cfg

, which means temporal_ensemble_momentum will be ignored and temporal_ensemble_coeff will get a warning.
I think we should probably consider making the former case raise an exception, but there may be a good reasons I didn't do that in the first place.

Yes, I'll mention it on 8000 Discord.

Cadene · 2024-07-15T14:10:26Z

@alexander-soare Could be better to raise an exception to avoid people overriding an argument from command line and it is actually ignored, but they didnt see the warning.

When we go out of alpha into beta, we should try our best to be backward compatible instead of raising exceptions.

alexander-soare · 2024-07-15T14:16:10Z

@Cadene I'm not sure I understand. I suggested raising an exception if someone provides an unknown param. Are you saying something else?

Let's take this discussion off this PR though :)

…sembling

ready for review

681eb7b

alexander-soare added bug Something isn’t working correctly policies Items related to robot policies labels Jul 12, 2024

alexander-soare commented Jul 12, 2024

View reviewed changes

put the weights on device

94a9818

Alternmill reviewed Jul 12, 2024

View reviewed changes

lerobot/common/policies/act/configuration_act.py Show resolved Hide resolved

Alternmill reviewed Jul 12, 2024

View reviewed changes

lerobot/common/policies/act/modeling_act.py Show resolved Hide resolved

alexander-soare added 2 commits July 15, 2024 09:29

Merge remote-tracking branch 'upstream/main' into fix_act_temporal_en…

40e29d4

…sembling

revision - improve docs

7dc4765

alexander-soare force-pushed the fix_act_temporal_ensembling branch from 0bb6211 to 7dc4765 Compare July 15, 2024 08:38

alexander-soare self-assigned this Jul 15, 2024

Cadene approved these changes Jul 15, 2024

View reviewed changes

Alternmill approved these changes Jul 15, 2024

View reviewed changes

alexander-soare added 2 commits July 16, 2024 09:11

Merge remote-tracking branch 'upstream/main' into fix_act_temporal_en…

97daf1a

…sembling

alexander-soare merged commit c0101f0 into huggingface:main Jul 16, 2024
5 checks passed

alexander-soare deleted the fix_act_temporal_ensembling branch July 16, 2024 09:27

amandip7 pushed a commit to amandip7/lerobot that referenced this pull request Oct 10, 2024

Fix ACT temporal ensembling (huggingface#319)

f39d3c7

KasparSLT mentioned this pull request Oct 27, 2024

Rename deprecated argument (temporal_ensemble_momentum) #490

Merged

menhguin pushed a commit to menhguin/lerobot that referenced this pull request Feb 9, 2025

Fix ACT temporal ensembling (huggingface#319)

4512210

Kalcy-U referenced this pull request in Kalcy-U/lerobot May 13, 2025

Fix ACT temporal ensembling (#319)

e8212b8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix ACT temporal ensembling #319

Fix ACT temporal ensembling #319

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Fix ACT temporal ensembling #319

Fix ACT temporal ensembling #319

Uh oh!

Conversation

Uh oh!

What this does

How it was tested

How to checkout & try? (for the reviewer)

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!