Stars
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Ring attention implementation with flash attention
Hugging Face Deep Learning Containers (DLCs) for Google Cloud
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime