LID-5: Tri-stage learning rate scheduler #6159

Qingzheng-Wang · 2025-06-19T06:15:12Z

What did you change?

Introduced a tri-stage learning rate scheduler (TristageLR) in espnet2/schedulers/tristage_lr.py, inspired by fairseq’s scheduler.

The espnet2/tasks/abs_task.py is changed for the integration of tri-stage learning rate scheduler.

Why did you make this change?

Tri-stage schedulers help stabilize LID model training by supporting warm-up, hold, and decay phases.

Is your PR small enough?

Yes

Additional Context

Can be used by any downstream task.
Depends on:

for more information, see https://pre-commit.ci

sw005320 · 2025-06-19T11:34:04Z

This pull request adds a new tri-stage learning rate scheduler, TristageLR, to the espnet2/schedulers module. The scheduler introduces functionality for warmup, hold, and exponential decay phases, providing a flexible way to adjust learning rates during training.

Addition of the `TristageLR` scheduler:

New class implementation: The TristageLR class is added, inheriting from _LRScheduler and AbsBatchStepScheduler. It supports three phases: warmup, hold, and decay, with customizable ratios and scaling factors.
Method definitions:
- __init__: Initializes the scheduler with parameters like max_steps, warmup_ratio, hold_ratio, and decay_ratio. Computes internal variables such as warmup_steps, hold_steps, and decay_factor.
- __repr__: Provides a string representation of the scheduler, detailing its configuration.
- get_lr: Calculates the learning rate for the current step based on the scheduler's phase.

Copilot

Pull Request Overview

Introduces a new tri-stage learning rate scheduler (TristageLR) that supports configurable warmup, hold, and exponential decay phases for model training stability.

Implements TristageLR class mirroring fairseq’s scheduler behavior.
Adds parameters for warmup/hold/decay ratios and initial/final LR scaling.
Places the new scheduler in espnet2/schedulers for downstream tasks.

Comments suppressed due to low confidence (1)

espnet2/schedulers/tristage_lr.py:11

[nitpick] No unit tests are provided for critical phases (warmup, hold, decay). Add tests to verify LR values at boundary steps and default behavior.

class TristageLR(_LRScheduler, AbsBatchStepScheduler):

espnet2/schedulers/tristage_lr.py

codecov · 2025-06-19T17:00:01Z

Codecov Report

Attention: Patch coverage is 32.43243% with 25 lines in your changes missing coverage. Please review.

Project coverage is 57.09%. Comparing base (d3db636) to head (ded1519).
Report is 34 commits behind head on master.

Files with missing lines	Patch %	Lines
espnet2/schedulers/tristage_lr.py	30.55%	25 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #6159      +/-   ##
==========================================
+ Coverage   55.45%   57.09%   +1.63%     
==========================================
  Files         882      886       +4     
  Lines       82812    83725     +913     
==========================================
+ Hits        45927    47801    +1874     
+ Misses      36885    35924     -961

Flag	Coverage Δ
test_integration_espnet2	`46.62% <32.43%> (?)`
test_integration_espnetez	`?`
test_python_espnet2	`50.50% <32.43%> (-0.73%)`	⬇️
test_python_espnetez	`12.82% <32.43%> (?)`
test_utils	`20.63% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

for more information, see https://pre-commit.ci

Add tri-stage learning rate scheduler.

4de0e31

dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Jun 19, 2025

mergify bot added the ESPnet2 label Jun 19, 2025

[pre-commit.ci] auto fixes from pre-commit.com hooks

e8737ba

for more information, see https://pre-commit.ci

Qingzheng-Wang mentioned this pull request Jun 19, 2025

LID-6: LID recipe template #6160

Open

sw005320 requested a review from Copilot June 19, 2025 11:33

Copilot AI reviewed Jun 19, 2025

View reviewed changes

espnet2/schedulers/tristage_lr.py Outdated Show resolved Hide resolved

espnet2/schedulers/tristage_lr.py Show resolved Hide resolved

espnet2/schedulers/tristage_lr.py Show resolved Hide resolved

This was referenced Jun 19, 2025

LID-4: Category- and dataset-aware balanced sampler #6158

Open

LID-1: Training and task setup #6155

Merged

LID-2: Model, loss and pooling modules #6156

Open

LID-3: Inference, embedding extraction and t-SNE visualization #6157

Open

Add tristagelr integration.

b037e2b

Qingzheng-Wang and others added 3 commits June 19, 2025 18:16

Fix with copilot suggestions.

04f592d

[pre-commit.ci] auto fixes from pre-commit.com hooks

3759508

for more information, see https://pre-commit.ci

Fix comment.

ded1519

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

LID-5: Tri-stage learning rate scheduler #6159

LID-5: Tri-stage learning rate scheduler #6159

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

LID-5: Tri-stage learning rate scheduler #6159

Are you sure you want to change the base?

LID-5: Tri-stage learning rate scheduler #6159

Uh oh!

Conversation

Uh oh!

What did you change?

Why did you make this change?

Is your PR small enough?

Additional Context

Uh oh!

Addition of the TristageLR scheduler:

Uh oh!

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Addition of the `TristageLR` scheduler: