
ltom01241010/C3PO

 
 


OLMoE Routing Weight Optimizer

This repository demonstrates how C3PO optimizes expert pathways, here the expert routing weights of OLMoE, to improve model performance.

Setup and Installation

1. Create Conda Environment

Create a new conda environment named C3PO and install the required packages:

# Create conda environment
conda create -n C3PO python=3.10 -y
conda activate C3PO

# Install PyTorch (for CUDA 12.3)
conda install pytorch torchvision torchaudio pytorch-cuda=12.3 -c pytorch -c nvidia -y

# Install the remaining packages (PyTorch is already installed by the conda step above)
pip install numpy transformers fvcore tqdm
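
Before moving on, it can help to confirm that the environment has everything the install commands above were meant to provide. The following is a minimal sketch, not part of the repository: the helper names `missing_packages` and `cuda_status` are hypothetical, and the package list is taken from the commands above.

```python
import importlib.util

# Packages named in the install commands above.
REQUIRED = ["torch", "numpy", "transformers", "fvcore", "tqdm"]


def missing_packages(names):
    """Return the names that cannot be located as importable top-level modules."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def cuda_status():
    """Return True/False for CUDA visibility, or None if torch is not installed."""
    if importlib.util.find_spec("torch") is None:
        return None
    import torch  # imported lazily so the check also works without torch

    return torch.cuda.is_available()


if __name__ == "__main__":
    print("missing packages:", missing_packages(REQUIRED) or "none")
    print("CUDA available:", cuda_status())
```

Run it inside the activated C3PO environment. If a package is reported missing or CUDA is reported unavailable, revisit the corresponding install step.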

2. Download Reference Cases

Download the reference cases from this anonymous link: Reference Cases

# Extract the downloaded reference.zip
unzip reference.zip -d reference_data
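
If `unzip` is not available, the same extraction can be done with Python's standard library. This is a sketch under the assumption that the download is saved as `reference.zip` in the current directory, as in the command above; the function name `extract_reference` is hypothetical.

```python
import zipfile
from pathlib import Path


def extract_reference(zip_path="reference.zip", dest="reference_data"):
    """Extract zip_path into dest and return the sorted names of the archived files."""
    dest_dir = Path(dest)
    dest_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest_dir)
        # Directory entries end with "/"; report files only.
        return sorted(n for n in archive.namelist() if not n.endswith("/"))


if __name__ == "__main__":
    if Path("reference.zip").exists():
        for name in extract_reference():
            print(name)
    else:
        print("reference.zip not found; download it first")
```

Listing the extracted names is a quick way to confirm that the archive unpacked into `reference_data` as expected.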

3. Download Datasets

Run the download.sh script to get the necessary datasets:

# Execute download script
bash download.sh

4. Run the Demo

# Run the main script
python olmoe_optimizer.py
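
A small preflight check can catch a skipped setup step before the demo starts. The sketch below is an illustration only: it assumes the demo is launched from the repository root and checks just the two paths named in this README (`olmoe_optimizer.py` and `reference_data`). The dataset location created by `download.sh` is not stated here, so it is not checked. The helper names are hypothetical.

```python
import subprocess
import sys
from pathlib import Path

# Paths mentioned in the steps above, relative to the repository root.
PREREQUISITES = ["olmoe_optimizer.py", "reference_data"]


def missing_prerequisites(root="."):
    """Return the prerequisite paths that do not exist under root."""
    root_dir = Path(root)
    return [name for name in PREREQUISITES if not (root_dir / name).exists()]


def run_demo(root="."):
    """Run the demo script with the current interpreter after a preflight check."""
    missing = missing_prerequisites(root)
    if missing:
        raise FileNotFoundError("missing before running the demo: " + ", ".join(missing))
    return subprocess.run([sys.executable, "olmoe_optimizer.py"], cwd=root, check=True)
```

Calling `run_demo()` from the repository root is then equivalent to the command above, except that it raises an error naming what is missing when a prerequisite is absent.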
