Popular repositories
- flash-attention-w-tree-attn (Public, forked from Dao-AILab/flash-attention): Fast and memory-efficient exact attention. Python, 4 stars.
- nano-patch-sequence-pack (Public): Just a few lines to combine 🤗 Transformers, Flash Attention 2, and torch.compile — simple, clean, fast ⚡. Python, 2 stars. (See the sketch after this list.)
- fast-hadamard-transform (Public, forked from Dao-AILab/fast-hadamard-transform): Fast Hadamard transform in CUDA, with a PyTorch interface. C, 1 star. (See the reference sketch after this list.)
- transformers (Public, forked from huggingface/transformers): 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. Python.
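
The nano-patch-sequence-pack entry describes combining 🤗 Transformers, Flash Attention 2, and torch.compile. The repository's own code is not reproduced here; the snippet below is only a minimal sketch of what that combination usually looks like with the stock Transformers API, and the checkpoint name is just a placeholder.

```python
# Minimal sketch (not the repository's code): load a Hugging Face causal LM
# with Flash Attention 2 enabled, then compile the model with torch.compile.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"  # placeholder checkpoint

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,               # FA2 needs fp16/bf16 weights
    attn_implementation="flash_attention_2",  # requires the flash-attn package and a CUDA GPU
).to("cuda")

model = torch.compile(model)  # compile the forward pass to cut Python overhead

inputs = tokenizer("Hello, world", return_tensors="pt").to("cuda")
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```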
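
For the fast-hadamard-transform fork, the sketch below is a plain-PyTorch reference of the Walsh–Hadamard transform that the CUDA kernel accelerates. It is an illustrative implementation written for this listing, not the repository's code, and the helper name hadamard_transform_ref is made up here.

```python
# Reference fast Walsh-Hadamard transform in pure PyTorch (illustration only).
import torch

def hadamard_transform_ref(x: torch.Tensor) -> torch.Tensor:
    """Unnormalized Walsh-Hadamard transform along the last dimension.

    The last dimension must be a power of two. The butterfly recursion runs in
    O(n log n) instead of the O(n^2) cost of multiplying by the Hadamard matrix.
    """
    n = x.shape[-1]
    assert n & (n - 1) == 0, "last dimension must be a power of two"
    out = x.clone()
    h = 1
    while h < n:
        # Split each block of size 2h into halves (a, b) and combine as (a + b, a - b).
        out = out.view(*x.shape[:-1], n // (2 * h), 2, h)
        a, b = out[..., 0, :], out[..., 1, :]
        out = torch.stack((a + b, a - b), dim=-2).reshape(*x.shape[:-1], n)
        h *= 2
    return out

# Sanity check against the explicit 8x8 Sylvester Hadamard matrix.
x = torch.randn(4, 8)
H = torch.tensor([[1.0]])
for _ in range(3):
    H = torch.cat([torch.cat([H, H], dim=1), torch.cat([H, -H], dim=1)], dim=0)
print(torch.allclose(hadamard_transform_ref(x), x @ H.T, atol=1e-5))
```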