Stars
Synthetic data curation for post-training and structured data extraction
Agentic AI framework for enterprise workflow automation.
bespokelabsai/verifiers
Forked from willccbb/verifiers
Verifiers for LLM Reinforcement Learning
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
Efficient and general syntactical decoding for Large Language Models
[ICLR 2025] Official Implementation: "Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data"
Systematic evaluation framework that automatically rates overthinking behavior in large language models.
OpenHealth, AI Health Assistant | Powered by Your Data
Fully open data curation for reasoning models
Minimal reproduction of DeepSeek R1-Zero
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A framework for few-shot evaluation of language models.
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Nodes for image juxtaposition for Flux in ComfyUI
Rectified Flow Inversion (RF-Inversion) - ICLR 2025
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.