Highlights
Lists (1)
Sort Name ascending (A-Z)
Stars
- All languages
- Assembly
- Astro
- C
- C#
- C++
- CSS
- Cuda
- Cython
- Dockerfile
- Go
- HCL
- HTML
- Java
- JavaScript
- Jinja
- Jupyter Notebook
- Lean
- Lua
- MATLAB
- MDX
- Markdown
- MoonScript
- Nim
- Nix
- Objective-C
- PHP
- Perl
- Python
- R
- Roff
- Ruby
- Rust
- SCSS
- SaltStack
- Scala
- Scheme
- Shell
- Standard ML
- Svelte
- Swift
- TeX
- TypeScript
- Verilog
- Vim Script
- Vim Snippet
- Vue
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in language modeling.
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)
The official implementation of "KL Penalty Control via Perturbation for Direct Preference Optimization"
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Virtual whiteboard for sketching hand-drawn like diagrams
ACI.dev is the open source platform for VibeOps and infrastructure that connects your AI agents to 600+ tool integrations with multi-tenant auth, granular permissions, and access through direct fun…
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
A PyTorch native platform for training generative AI models
Codes for the paper "A mathematical perspective on Transformers".
KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems
Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
Stanford NLP Python library for understanding and improving PyTorch models via interventions
This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.
Experimental nix expression to package all MacOS casks from homebrew automatically
Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples
A powerful tool for creating fine-tuning datasets for LLM
[TMI'20] Unpaired Multi-modal Segmentation via Knowledge Distillation
source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"
A resource repository for machine unlearning in large language models
Source code of our paper MIND, ACL 2024 Long Paper
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬