8000 dirtycomputer (buaa42wxy) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View dirtycomputer's full-sized avatar

Block or report dirtycomputer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,998 300 Updated Feb 27, 2025
Python 3,519 346 Updated May 13, 2025

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Python 76 11 Updated Dec 2, 2024

[NAACL 2025] SIUO: Cross-Modality Safety Alignment

HTML 28 2 Updated Jan 31, 2025

Train transformer language models with reinforcement learning.

Python 13,870 1,902 Updated May 23, 2025

2025年5月更新,目前国内可用Docker镜像源汇总,DockerHub国内镜像加速列表,🚀DockerHub镜像加速器

3,744 186 Updated May 20, 2025
2 Updated Apr 16, 2025

Pytorch implementation of Tree Preference Optimization (TPO) (Accepyed by ICLR'25)

Python 17 1 Updated Apr 24, 2025

Publicly available data for Paperscape

45 5 Updated Mar 19, 2018

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 56,192 5,555 Updated May 16, 2025

LLMs-from-scratch项目中文翻译

Jupyter Notebook 999 173 Updated Apr 13, 2025

《Reinforcement Learning: An Introduction》(第二版)中文翻译

Python 540 103 Updated Apr 9, 2022
Python 3 Updated Apr 24, 2025

Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep

Python 122 9 Updated Apr 23, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,550 254 Updated May 20, 2025

A PyTorch native platform for training generative AI models

Python 3,829 375 Updated May 23, 2025

A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,T…

Go 7,422 560 Updated May 23, 2025

Python logging made (stupidly) simple

Python 21,704 737 Updated May 1, 2025

Fully open reproduction of DeepSeek-R1

Python 24,522 2,258 Updated May 23, 2025

An invisible desktop application to help you pass your technical interviews.

4,308 722 Updated Apr 13, 2025

🐍 The official Python client library for Google's discovery based APIs.

Python 8,218 2,464 Updated May 22, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)

Python 6,799 663 Updated May 23, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,380 1,031 Updated May 23, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,254 53 Updated May 11, 2025

Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"

Jupyter Notebook 97 9 Updated Feb 24, 2025

[ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety

Python 38 1 Updated May 17, 2025

DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion

7 Updated Mar 9, 2025

Erasing Concepts from Diffusion Models

Jupyter Notebook 606 38 Updated May 19, 2025

🤗 smolagents: a barebones library for agents that think in code.

Python 19,070 1,649 Updated May 23, 2025

REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective

Python 6 1 Updated Feb 28, 2025
Next
0