fix: chat template improvements #148
Merged
29 commits
77b03d7 make chat template configurable from config, save chat template as at… (ashors1)
6116f77 Merge branch 'main' of github.com:NVIDIA/reinforcer into ashors/chat-… (ashors1)
4f17127 save hf tokenizer (ashors1)
525296a add sft example with json data (ashors1)
ce04b9b improved configurability (ashors1)
1597d0f fixes (ashors1)
70f11a3 update grpo and clean up (ashors1)
83552a6 fix unit tests (ashors1)
23f06d2 address comments (ashors1)
1816456 Merge branch 'main' of github.com:NVIDIA/reinforcer into ashors/chat-… (ashors1)
58fe349 add unit tests and documentation (ashors1)
956c7f0 copyright header (ashors1)
4913f25 address comments (ashors1)
c016902 small fixes (ashors1)
326b151 fix typo (ashors1)
5068487 fix tests (ashors1)
8e835ba update chat template documentation (ashors1)
0eeaa1e Merge branch 'main' of github.com:NVIDIA/reinforcer into ashors/chat-… (ashors1)
d95d88b fix unit tests (ashors1)
1958bb9 fix doctest (ashors1)
c536b55 Merge branch 'main' of github.com:NVIDIA/reinforcer into ashors/chat-… (ashors1)
2c7c5c5 fix checkpoint save when tokenizer not provided (ashors1)
1df9a9f feat: add a unique seed for each vllm llm engine (#171) (parthchadha)
f35ad95 fix: unit test script halts on first failure (#189) (terrykong)
844e470 fix new vllm test and doctest (ashors1)
7e50e8e Merge branch 'main' of github.com:NVIDIA/reinforcer into ashors/chat-… (ashors1)
ac4b6ea remove old comment (ashors1)
c5328f0 fix doctest (ashors1)
5c4f849 Merge branch 'main' into ashors/chat-template-improvements (SahilJain314)