-
Postdoc at CASIA
- Beijing
-
07:59
(UTC +08:00) - https://scholar.google.com/citations?user=WYbijGsAAAAJ
Highlights
- Pro
Stars
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
A tool to configure, launch and manage your machine learning experiments.
Scalable toolkit for efficient model reinforcement
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
[NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.
A supervised fine-tuning method for controllable reasoning length in large language models (一种通过有监督微调实现大语言模型思考长度可控的方法)
Python tool for converting files and office documents to Markdown.
An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applications.
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
MCPSafetyScanner - Automated MCP safety auditing and remediation using Agents. More info: https://www.arxiv.org/abs/2504.03767
A lightweight, powerful framework for multi-agent workflows
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
Train your AI self, amplify you, bridge the world
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
CodeScientist: An automated scientific discovery system for code-based experiments
The official repository for the Scientific Paper Idea Proposer (SciPIP)
ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory
SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing
DeepLiterature: A fully open-source intelligent research assistant that integrates search, code execution, link resolution, and information expansion, with multiple tools working together to facili…
The official GitHub page for the survey paper "A Survey of Large Language Models".
zero-shot voice conversion & singing voice conversion, with real-time support
From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems
最好用的 sing-box 一键安装脚本 & 管理脚本,自动创建 REALITY 协议;支持 TUIC,Trojan,Hysteria2 等所有常见的协议
An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions.…
DeepEP: an efficient expert-parallel communication library
Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge