Fix nemo_llama_to_hf conversion by tdene · Pull Request #8000 · NVIDIA/NeMo · GitHub

Fix nemo_llama_to_hf conversion #8000


Merged
cuichenx merged 10 commits into NVIDIA:main from tdene:llama_conversion_script_amp_fix on Feb 29, 2024

Conversation

tdene
Contributor
@tdene tdene commented Dec 8, 2023

What does this PR do ?

Fix convert_nemo_llama_to_hf.py:

  • Correctly account for the megatron_amp_O2 flag
  • Save the HF model with the correct precision
  • Transfer over the NeMo tokenizer, instead of using the possibly-incompatible default tokenizer
  • Make sure to save the fast version of the tokenizer
  • Resize the model's embedding tensor to match the new tokenizer's vocabulary (see the sketch after this list)
  • Correct a typo in the how-to-use example
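As a rough illustration of the tokenizer, embedding-resize, and precision points above, the exported model and tokenizer end up being handled along these lines. This is a minimal sketch using the Hugging Face transformers API; the paths, dtype, and base model name are illustrative assumptions, not the exact code in convert_nemo_llama_to_hf.py.

```python
# Minimal sketch of the tokenizer/precision handling described above.
# Paths, dtype, and the base model name are illustrative assumptions,
# not the exact code in convert_nemo_llama_to_hf.py.
import torch
from transformers import AutoTokenizer, LlamaForCausalLM

nemo_tokenizer_path = "/path/to/tokenizer_extracted_from_nemo_ckpt"  # hypothetical
output_hf_path = "/path/to/hf_output"                                # hypothetical

# Build the HF model in the checkpoint's precision instead of silently up-casting to fp32.
model = LlamaForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf", torch_dtype=torch.bfloat16)
# ... converted NeMo weights would be loaded into `model` here ...

# Use the tokenizer shipped with the NeMo checkpoint (not the stock default),
# and request the fast (Rust-backed) implementation so the fast tokenizer files get saved.
tokenizer = AutoTokenizer.from_pretrained(nemo_tokenizer_path, use_fast=True)

# If the NeMo run added tokens, grow the embedding matrix to match the tokenizer's vocab.
if model.get_input_embeddings().weight.shape[0] != len(tokenizer):
    model.resize_token_embeddings(len(tokenizer))

# Save model and tokenizer side by side so the exported folder is self-consistent.
model.save_pretrained(output_hf_path)
tokenizer.save_pretrained(output_hf_path)
```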

PR Type:

  • New Feature
  • Bugfix
  • Documentation

Who can review?

@ericharper

Anyone in the community is free to review the PR once the checks have passed.
The Contributor guidelines contain specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

tdene added 5 commits December 8, 2023 01:44
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
@tdene tdene force-pushed the llama_conversion_script_amp_fix branch 3 times, most recently from d268d49 to 4cd8d33 Compare December 8, 2023 10:50
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
@tdene tdene force-pushed the llama_conversion_script_amp_fix branch from 8e4a675 to 8479afb Compare December 8, 2023 12:16
Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
@tdene tdene force-pushed the llama_conversion_script_amp_fix branch from 89e4620 to 5c117d7 Compare December 15, 2023 22:07
Contributor
github-actions bot commented Dec 30, 2023

This PR is stale because it has been open for 14 days with no activity. Remove the stale label, comment, or update the PR, or it will be closed in 7 days.

@github-actions github-actions bot added the stale label Dec 30, 2023
Contributor
github-actions bot commented Jan 6, 2024

This PR was closed because it has been inactive for 7 days since being marked as stale.

@github-actions github-actions bot closed this Jan 6, 2024
@cuichenx cuichenx reopened this Feb 9, 2024
@github-actions github-actions bot removed the stale label Feb 10, 2024
Contributor
github-actions bot commented Feb 25, 2024

This PR is stale because it has been open for 14 days with no activity. Remove the stale label, comment, or update the PR, or it will be closed in 7 days.

@github-actions github-actions bot added the stale label Feb 25, 2024
@cuichenx cuichenx removed the stale label Feb 25, 2024
Signed-off-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: Chen Cui <chcui@nvidia.com>
@github-actions github-actions bot added the NLP label Feb 29, 2024
@cuichenx
Collaborator

jenkins

@@ -17,8 +17,9 @@
from collections import OrderedDict

import torch
from omegaconf import open_dict

Check notice (Code scanning / CodeQL): Unused import

Import of 'open_dict' is not used.
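For context on the flagged import: open_dict is the OmegaConf helper that temporarily unlocks a struct config so missing keys can be added, which is the usual reason conversion scripts import it. The snippet below is a minimal, generic usage sketch; it does not show how this file ultimately uses or drops the import.

```python
from omegaconf import OmegaConf, open_dict

cfg = OmegaConf.create({"precision": "bf16"})
OmegaConf.set_struct(cfg, True)      # struct mode: assigning unknown keys raises an error

with open_dict(cfg):                 # temporarily allow new keys
    cfg.megatron_amp_O2 = False      # e.g. default a flag that older checkpoints may lack

print(cfg.megatron_amp_O2)           # False
```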
@cuichenx cuichenx merged commit 974b74c into NVIDIA:main Feb 29, 2024
@cuichenx cuichenx mentioned this pull request Feb 29, 2024
arendu pushed a commit that referenced this pull request Feb 29, 2024
Signed-off-by: arendu <adithya.r@gmail.com>
zpx01 pushed a commit to zpx01/NeMo that referenced this pull request Mar 8, 2024
* Account for amp_O2 in nemo_llama_to_hf conversion

Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>

* Package converted model with new tokenizer not old

Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>

* Account for variations in megatron_amp_O2 behavior

Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>

* Resize the embeddings matrix

Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>

* Correct precision when saving to HF folder

Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>

* Fix typo in sample script

Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>

* Fix typo in logging

Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix O2 issue properly in peft mixin

Signed-off-by: Chen Cui <chcui@nvidia.com>

---------

Signed-off-by: Teodor-Dumitru Ene <teodord.ene@gmail.com>
Signed-off-by: Chen Cui <chcui@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: Zeeshan Patel <zeeshanp@berkeley.edu>
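The megatron_amp_O2-related commits above revolve around the fact that O2 (mixed-precision) wrapping changes how parameter names appear in the checkpoint. The sketch below shows the kind of key normalization involved, assuming the common "model.module." vs "model." prefix difference; it is an illustration, not the exact PR code.

```python
import torch

def normalize_megatron_keys(state_dict: dict, megatron_amp_O2: bool) -> dict:
    """Strip the extra 'module.' level that O2 (mixed-precision) wrapping inserts,
    so downstream key mapping sees the same names in both cases. Illustrative only."""
    prefix = "model.module." if megatron_amp_O2 else "model."
    out = {}
    for key, tensor in state_dict.items():
        if key.startswith(prefix):
            out["model." + key[len(prefix):]] = tensor
        else:
            out[key] = tensor
    return out

# Example: an O2 checkpoint key becomes the same name a non-O2 checkpoint would use.
sd = {"model.module.embedding.word_embeddings.weight": torch.zeros(4, 2)}
print(list(normalize_megatron_keys(sd, megatron_amp_O2=True)))
# ['model.embedding.word_embeddings.weight']
```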
Agoniii pushed a commit to Agoniii/NeMo that referenced this pull request Mar 15, 2024
Signed-off-by: Agoniii <815244047@qq.com>
JRD971000 pushed a commit that referenced this pull request Mar 15, 2024
Signed-off-by: ataghibakhsh <ataghibakhsh@nvidia.com>
pablo-garay pushed a commit that referenced this pull request Mar 19, 2024
Signed-off-by: Pablo Garay <pagaray@nvidia.com>
@tdene tdene deleted the llama_conversion_script_amp_fix branch March 21, 2024 18:26
@tdene tdene mentioned this pull request Apr 24, 2024
rohitrango pushed a commit to rohitrango/NeMo that referenced this pull request Jun 25, 2024