Stars
A powerful tool for creating fine-tuning datasets for LLMs
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
Efficient Triton Kernels for LLM Training
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
Machine Learning Journal for Intermediate to Advanced Topics.
A reading list on LLM based Synthetic Data Generation 🔥
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Official Implementation for the paper "SR-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World Models"
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
This is an official repository for "Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources" (NeurIPS 2023).
Generative Models by Stability AI
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ suppo…
Official repo for consistency models.
This is an official repository for PrivMon: A Stream-Based System for Real-Time Privacy Attack Detection for Machine Learning Models (RAID 2023)
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.
🔥 The complete, customizable software developer portfolio template, which lets you showcase your work and every detail about you as a software developer.
Third iteration of my personal website built with Jekyll
My Portfolio - Personal Website
This is an official repository for "Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study" (ICCV 2023).