8000 Add script to remove episodes from dataset by Cadene · Pull Request #378 · huggingface/lerobot · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Add script to remove episodes from dataset #378

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

Cadene
Copy link
Collaborator
@Cadene Cadene commented Aug 24, 2024

What this does

  • Add lerobot/scripts/edit_dataset.py script with mode "remove"

TODO:

  • Add tests

How it was tested

Ran

# save
cp -r data/cadene/koch_bimanual_folding_2 \
cadene/koch_bimanual_folding_2/save

python lerobot/scripts/edit_dataset.py remove \
    --root data \
    --repo-id cadene/koch_bimanual_folding_2 \
    --episodes 0 4 7 10 34 54 69

python lerobot/scripts/visualize_dataset_html.py \
  --root data \
  --repo-id cadene/koch_bimanual_folding_2

Visualization look good.

How to checkout & try? (for the reviewer)

@brysonjones
Copy link

@Cadene Is this still ongoing work?

I know there was recently a large refactor on the dataset format, so not sure if this code needs to change to address this.

At any rate, my team is looking to build out larger datasets, and are looking into editting and deleting episodes from datasets here, so we want this capability.

If this is not being worked on by anyone right now, we may be able to pick this up

@imstevenpmwork
Copy link
Collaborator

Related to: #831

@imstevenpmwork imstevenpmwork added enhancement Suggestions for new features or improvements dataset Issues regarding data inputs, processing, or datasets labels Apr 17, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
dataset Issues regarding data inputs, processing, or datasets enhancement Suggestions for new features or improvements
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants
0