DistillKitPlus is an open-source toolkit for knowledge distillation (KD). The repo was inspired by arcee-ai/DistillKit. Its main motivation is to support offline distillation and parameter-efficient fine-tuning (PEFT) in low-compute settings.
- Logit Distillation: Supports same/cross tokenizer teacher and student models.
- Pre-Computed Logits: Enables memory-efficient training by generating teacher logits in advance (see the sketch after this list).
- LoRA Fine-Tuning Integration: Efficient low-rank adaptation fine-tuning support.
- Quantization Support: 4-bit model quantization for faster inference and reduced memory usage.
- Accelerate & DeepSpeed Integration: Support for distributed training with optimized memory usage.
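To make the pre-computed logits idea concrete, here is a minimal, hand-written sketch (not the toolkit's actual `scripts/local/generate_logits.py`; the model id and dataset below are placeholders): the teacher is run once over the data and its logits are cached to disk, so the teacher never has to be loaded during student training.

```python
# Illustrative only: cache teacher logits offline for later student training.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

teacher_id = "your-teacher-model"  # placeholder model id
device = "cuda" if torch.cuda.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained(teacher_id)
teacher = AutoModelForCausalLM.from_pretrained(
    teacher_id,
    torch_dtype=torch.float16 if device == "cuda" else torch.float32,
).to(device).eval()

texts = ["An example training prompt."]  # placeholder dataset
cached = []
with torch.no_grad():
    for text in texts:
        batch = tokenizer(text, return_tensors="pt").to(device)
        cached.append(teacher(**batch).logits.cpu())  # (1, seq_len, vocab_size)

torch.save(cached, "teacher_logits.pt")  # consumed later by the distillation step
```

The supported loss functions are listed below.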
| Loss Type | Best For | Special Requirements |
|---|---|---|
| KL Divergence (`fkl`, `kld`) | Same-tokenizer distillation | None |
| Universal Logit Distillation (`uld`) | Cross-tokenizer distillation | Requires `teacher_labels` |
| Multi-Level Optimal Transport (`multi-ot`) | Cross-tokenizer distillation | Requires `teacher_labels`, additional parameters |
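For the same-tokenizer case, the KL divergence losses above can be illustrated with a temperature-scaled forward KL blended with the usual cross-entropy term. This is a generic sketch, not the toolkit's exact implementation; the `temperature` and `alpha` arguments mirror the `distillation` config parameters described below.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, temperature=2.0, alpha=0.5):
    """Illustrative forward-KL distillation loss (same vocabulary for both models).

    student_logits, teacher_logits: (batch, seq_len, vocab_size)
    labels: (batch, seq_len) ground-truth token ids
    """
    log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
    p_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    # Forward KL(teacher || student); the T^2 factor keeps gradient magnitudes
    # comparable across temperatures.
    kd = F.kl_div(log_p_student, p_teacher, reduction="batchmean") * temperature ** 2
    # Standard next-token cross-entropy against the ground-truth labels.
    ce = F.cross_entropy(student_logits.view(-1, student_logits.size(-1)), labels.view(-1))
    return alpha * kd + (1 - alpha) * ce
```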
```bash
git clone https://github.com/agokrani/distillkitplus.git
cd distillkitplus
pip install -r requirements.txt
pip install .
```
- Configure your distillation settings in `config/default_config.json`.
- Generate teacher logits:
  ```bash
  python scripts/local/generate_logits.py --config config/default_config.json
  ```
- Run distillation:
  - Without Accelerate (default):
    ```bash
    python scripts/local/distill_logits.py --config config/default_config.json
    ```
  - With Accelerate & DeepSpeed:
    ```bash
    # Make sure to set "use_accelerate": true in your config file
    accelerate launch --config_file config/accelerate_configs/default_config.yaml scripts/local/distill_logits.py --config config/default_config.json
    ```
DistillKitPlus also supports running its scripts on Modal. Use the following commands to perform knowledge distillation with Modal:

- Generate teacher logits:
  ```bash
  modal run scripts/modal/generate_logits.py --config config/default_config.json
  ```
- Run distillation:
  ```bash
  modal run scripts/modal/distill_logits.py --config config/default_config.json
  ```
When using Modal, the Accelerate configuration is handled internally based on your config file settings: just set `"use_accelerate": true` and specify `"accelerate_config"` in the `"execution"` section of your config file.
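For reference, the relevant fragment of the config might look like this (illustrative; the exact schema is in `config/default_config.json`):

```json
"execution": {
    "use_accelerate": true,
    "accelerate_config": "config/accelerate_configs/default_config.yaml"
}
```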
The toolkit uses a JSON configuration file with the following main sections:

- `project_name`: Name of your distillation project
- `dataset`: Dataset configuration, including source and processing settings
- `models`: Teacher and student model specifications
- `tokenizer`: Tokenizer settings, including max length and padding
- `training`: Training hyperparameters
- `distillation`: Distillation-specific parameters (temperature, alpha)
- `lora`: LoRA configuration for efficient fine-tuning
- `quantization`: Model quantization settings
- `execution`: Settings for Accelerate and distributed training
See `config/default_config.json` for a complete example.
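For orientation, an abbreviated skeleton of the top-level structure is shown below. The nested values are illustrative placeholders except for the keys already named above (`temperature`, `alpha`, `use_accelerate`, `accelerate_config`), so always defer to `config/default_config.json` for the actual fields.

```json
{
    "project_name": "my-distillation-run",
    "dataset": {},
    "models": {},
    "tokenizer": {},
    "training": {},
    "distillation": { "temperature": 2.0, "alpha": 0.5 },
    "lora": {},
    "quantization": {},
    "execution": {
        "use_accelerate": true,
        "accelerate_config": "config/accelerate_configs/default_config.yaml"
    }
}
```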
We welcome contributions from the community! If you have ideas for improvements, new features, or bug fixes, please feel free to open an issue or submit a pull request.
For any technical questions or issues, please open an issue in this repository. We appreciate your feedback and support!