Stars
Using open source LLMs to build synthetic datasets for direct preference optimization
Execute arbitrary SQL queries on π€ Datasets
An open collection of implementation tips, tricks and resources for training large language models
polinaeterna / datasets
Forked from huggingface/datasetsπ€ The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Lightweight web API for visualizing and exploring all types of datasets - computer vision, speech, text, and tabular - stored on the Hugging Face Hub
Backend that powers the dataset viewer on Hugging Face dataset pages through a public API.
π€ The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
SoundFile is an audio library based on libsndfile, CFFI, and NumPy
π₯ Fast State-of-the-Art Tokenizers optimized for Research and Production
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A minimalistic example of preparing a model for (synchronous) inference in production.
A minimalistic example of preparing a model for (synchronous) inference in production.