Beihang University
Haidian
dirtycomputer.github.io
Starred repositories
PyTorch code and models for V-JEPA self-supervised learning from video.
Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
Train transformer language models with reinforcement learning.
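Updated May 2025: a roundup of Docker registry mirrors currently usable in mainland China, a list of DockerHub mirror accelerators for China, 🚀 DockerHub mirror accelerator
PyTorch implementation of Tree Preference Optimization (TPO) (Accepted by ICLR'25)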
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
Chinese translation of the LLMs-from-scratch project
Chinese translation of "Reinforcement Learning: An Introduction" (2nd edition)
Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
A PyTorch native platform for training generative AI models
A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,T…
Fully open reproduction of DeepSeek-R1
An invisible desktop application to help you pass your technical interviews.
🐍 The official Python client library for Google's discovery based APIs.
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)
verl: Volcano Engine Reinforcement Learning for LLMs
An Open-source RL System from ByteDance Seed and Tsinghua AIR
Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"
[ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety
DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion
Erasing Concepts from Diffusion Models
🤗 smolagents: a barebones library for agents that think in code.
REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective