Integrate SwanLab for offline/online experiment tracking for Accelerate #3605

ShaohonChen · 2025-06-03T13:20:33Z

What does this PR do?

This PR introduces SwanLab, a lightweight open-source experiment tracking tool, as a new logging option for the training framework. The integration provides both online and offline tracking capabilities, along with a local dashboard for visualizing results.

SwanLab has previously supported:

Tracking via Transformers' report_to parameter (documentation)
The Accelerate training framework through external callbacks (documentation)

We've received numerous requests from the community to add native Accelerate support (see here), and we're excited to officially integrate with this excellent project to provide a more seamless experience for developers. This integration is particularly valuable for users in regions with limited network connectivity (such as China), offering them robust experiment tracking capabilities.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case. (see here)
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests? (I don't see any tests for any of the callbacks but please let me know if I missed them somewhere. )

Who can review?

@SunMarc I noticed that you recently reviewed some related PRs—would you mind helping review my PR as well? Thank you! (I believe you also reviewed the Hugging Face Transformers integration previously—looking forward to collaborating again! 😄)

Additional information about this PR

Usage guidline

Step 0: Set Up Accelerate and example environment

Following the accelerate official cv example (pet image classification task):

# prepare code and environments
git clone https://github.com/huggingface/accelerate
cd accelerate
pip install -e .
pip install timm     # use in example

Step 1: Set Up SwanLab Online Tracking

Install:

pip install swanlab

To use SwanLab's online tracking, log in to the SwanLab website and obtain your API key from the Settings page. Then, authenticate using the following command:

swanlab login

If you prefer offline mode, skip this step and install local dashboard:

pip install swanlab[dashboard]

Step 2: download Oxford-IIT Pet Dataset used in example code

You can find download link here

Step 3: run offical example script in accelerate projects

python examples/complete_cv_example.py  --data_dir <DOWNLOAD DATA PATH> --with_tracking

visualization demo here

Since my server is offline, I changed the pretrain parameter to false in the create_model code to avoid downloading the model online, which led to very poor accuracy after just 3 epochs 😂.

test suite passes

Since my AI training server couldn't connect to Hugging Face, some tests failed during the automated testing process.😭

SunMarc

LGTM ! Thanks for adding this ! Can you add a couple of tests like the other tackers ?

HuggingFaceDocBuilderDev · 2025-06-03T15:46:15Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ShaohonChen · 2025-06-04T04:02:27Z

Hi, we're currently updating the local log storage format for SwanLab, and a new version will be released within 2 days. We'll also complete the corresponding test cases for the new version!

ShaohonChen · 2025-06-08T10:50:50Z

I’ve added some new test cases. Would you mind helping me review them when you're available? Thanks a lot for your time! @SunMarc

SunMarc

Thanks a lot !

ShaohonChen · 2025-06-18T14:30:00Z

Thanks for the review and merge! 🤗

ShaohonChen added 3 commits June 3, 2025 19:19

add support for SwanLabTracker and update related documentation

88c25e1

add emoji in FRAMWORK

a4f3102

apply the style corrections and quality control

63afb5c

This was referenced Jun 3, 2025

Seems not suitable for DEEPSEED SwanHubX/SwanLab#1014

Closed

update SwanLabTracker for accelerate SwanHubX/SwanLab#1016

Merged

SunMarc reviewed Jun 3, 2025

View reviewed changes

ShaohonChen marked this pull request as draft June 4, 2025 04:02

ShaohonChen added 3 commits June 8, 2025 18:25

add support for SwanLabTracker in tests

55e7f60

Merge branch 'huggingface:main' into main

50edeb0

Merge remote-tracking branch 'refs/remotes/origin/main'

f6d85af

ShaohonChen marked this pull request as ready for review June 8, 2025 10:49

fix bug in test_tracking

a404b29

ShaohonChen requested a review from SunMarc J 8000 une 10, 2025 11:53

SunMarc approved these changes Jun 10, 2025

View reviewed changes

SunMarc merged commit 6597dae into huggingface:main Jun 18, 2025
24 of 25 checks passed

SAKURA-CAT mentioned this pull request Jun 25, 2025

Feat/backup proto SwanHubX/SwanLab#1126

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Integrate SwanLab for offline/online experiment tracking for Accelerate #3605

Integrate SwanLab for offline/online experiment tracking for Accelerate #3605

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Integrate SwanLab for offline/online experiment tracking for Accelerate #3605

Integrate SwanLab for offline/online experiment tracking for Accelerate #3605

Uh oh!

Conversation

Uh oh!

What does this PR do?

Before submitting

Who can review?

Additional information about this PR

Usage guidline

test suite passes

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!