The memory footprint kept increasing during training and everything else was fine · Issue #1 · NJUNLP/njuqe · GitHub
The memory footprint kept increasing during training and everything else was fine #1


Open
Jack-Yang-S opened this issue Dec 5, 2023 · 1 comment

Comments

@Jack-Yang-S

Hello, your work is great, but during training my memory usage keeps increasing steadily, even though all the metrics update normally. I will soon run out of the 40 GB of memory and be forced to stop. What could be the problem?

@hy5468
Collaborator
hy5468 commented Dec 15, 2023

Hi! Thanks for your interest! You may not need to use --qe-meter during pre-training. To calculate the correct dataset-level metrics such as Pearson, MCC, and F1-MULT, we have to save the predictions in the "reduce_metrics" function of the QE loss. Fairseq may record all training states, including these predictions, which is why memory usage keeps increasing.
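
To make the mechanism concrete, here is a minimal, illustrative sketch of the accumulation pattern described above. It is not the actual njuqe or fairseq code; the class name, methods, and the use of NumPy are assumptions made purely for illustration. Dataset-level metrics such as Pearson need the predictions from every batch, so they have to be kept somewhere until the metric is computed, and if nothing ever clears that buffer during training, memory grows with every step.

```python
# Illustrative only: a toy "meter" that retains per-batch predictions so a
# dataset-level metric (Pearson) can be computed later. Names are hypothetical.
import numpy as np


class DatasetLevelMeter:
    """Collects predictions and targets across batches for corpus-level metrics."""

    def __init__(self):
        self.preds, self.targets = [], []

    def update(self, preds, targets):
        # Every call keeps a reference to the batch outputs. If the training
        # loop never calls reset(), these lists grow for the entire run,
        # which matches the steadily increasing memory seen in this issue.
        self.preds.append(np.asarray(preds, dtype=np.float64))
        self.targets.append(np.asarray(targets, dtype=np.float64))

    def pearson(self):
        p = np.concatenate(self.preds)
        t = np.concatenate(self.targets)
        return float(np.corrcoef(p, t)[0, 1])

    def reset(self):
        self.preds.clear()
        self.targets.clear()


if __name__ == "__main__":
    meter = DatasetLevelMeter()
    rng = np.random.default_rng(0)
    for _ in range(3):  # stands in for training steps
        preds = rng.normal(size=32)
        meter.update(preds, preds + rng.normal(scale=0.1, size=32))
    print(f"Pearson over retained predictions: {meter.pearson():.3f}")
    meter.reset()  # without a reset like this, the buffers keep growing
```

The point of the sketch is only the trade-off: corpus-level metrics cannot be computed from per-batch scalars alone, so the predictions must be retained somewhere, and during plain pre-training, where such metrics are not needed, it is cheaper to skip that bookkeeping entirely.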
