Training options only allow for well known/tested HW · Issue #1007 · instructlab/instructlab · GitHub
Training options only allow for well known/tested HW #1007
Closed
@derekhiggins

Description

The instructlab user base will have hardware that requires various training configuration options.

E.g. to train on Colab I needed to disable bf16 and set
gradient_checkpointing=true
gradient_accumulation_steps=8

We should add a hidden (perhaps temporary) argument that lets users try various training options without
having to change the code.
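
A rough sketch of what such a hidden argument could look like, for discussion only. It uses the standard-library `argparse` rather than whatever the CLI actually uses, and the flag name `--training-option`, the `DEFAULTS` values, and the helper names are all made up for illustration. Overrides are passed as repeated `key=value` pairs and merged over the defaults:

```python
import argparse

# Placeholder defaults (assumption); the real values live in the training code.
DEFAULTS = {
    "bf16": True,
    "gradient_checkpointing": False,
    "gradient_accumulation_steps": 1,
}


def parse_value(raw: str):
    """Convert a raw string to bool, int, or float where possible."""
    lowered = raw.lower()
    if lowered in ("true", "false"):
        return lowered == "true"
    for cast in (int, float):
        try:
            return cast(raw)
        except ValueError:
            continue
    return raw  # leave anything else as a plain string


def parse_overrides(pairs):
    """Turn ["key=value", ...] into a dict of typed overrides."""
    overrides = {}
    for pair in pairs:
        key, sep, raw = pair.partition("=")
        if not sep or not key.strip():
            raise ValueError(f"expected key=value, got {pair!r}")
        overrides[key.strip()] = parse_value(raw.strip())
    return overrides


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="train-sketch")
    parser.add_argument(
        "--training-option",      # hypothetical flag name
        action="append",          # may be given several times
        default=[],
        metavar="KEY=VALUE",
        help=argparse.SUPPRESS,   # keeps the flag out of --help output
    )
    return parser


def resolve_options(argv) -> dict:
    """Merge command-line overrides on top of the placeholder defaults."""
    args = build_parser().parse_args(argv)
    return {**DEFAULTS, **parse_overrides(args.training_option)}
```

With this sketch, the Colab case above would be `--training-option bf16=false --training-option gradient_checkpointing=true --training-option gradient_accumulation_steps=8`, and nothing new shows up in `--help`.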

This could eventually be replaced with a happy set of arguments we know would cover all the cases,
or with code that auto-senses the HW and sets options based on what it finds.
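
For the auto-sensing variant, something along these lines might work. This is only a sketch: `HardwareInfo`, `choose_training_options`, and the 16 GiB threshold are hypothetical and not taken from the code base. The low-memory values mirror the settings that worked on Colab above. In practice the fields could probably be filled from PyTorch (for example `torch.cuda.is_bf16_supported()` and `torch.cuda.get_device_properties(0).total_memory`), but that part is left out so the sketch has no dependencies:

```python
from dataclasses import dataclass


@dataclass
class HardwareInfo:
    """Hypothetical summary of the device we are about to train on."""
    has_accelerator: bool
    bf16_supported: bool
    memory_gib: float


def choose_training_options(hw: HardwareInfo, low_memory_gib: float = 16.0) -> dict:
    """Pick training options from the detected hardware.

    The threshold is an assumed cut-off, not a measured one.
    """
    options = {
        # Only use bf16 when the device reports support for it.
        "bf16": hw.has_accelerator and hw.bf16_supported,
        "gradient_checkpointing": False,
        "gradient_accumulation_steps": 1,
    }
    if hw.memory_gib < low_memory_gib:
        # Trade compute for memory on small devices.
        options["gradient_checkpointing"] = True
        options["gradient_accumulation_steps"] = 8
    return options


# Example: a small GPU without bf16 support ends up with the settings
# that were needed on Colab in the description above.
small_gpu = HardwareInfo(has_accelerator=True, bf16_supported=False, memory_gib=15.0)
print(choose_training_options(small_gpu))
```

A hidden override argument could still sit on top of this, so users can correct the guess when detection gets it wrong.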
