8000 zheyuye (Zheyu Ye) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View zheyuye's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@dmlc

Block or report zheyuye

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,073 686 Updated Jun 14, 2025

Awesome RL Reasoning Recipes ("Triple R")

673 39 Updated Jun 11, 2025

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 1,155 126 Updated Jun 13, 2025

This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning ca…

Python 598 13 Updated Jun 13, 2025

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 2,764 519 Updated Apr 15, 2024
Python 9 Updated Apr 20, 2025

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,751 263 Updated Jan 14, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,264 6,309 Updated Jun 12, 2025

大模型进阶面经

51 1 Updated May 6, 2025

Integrate the DeepSeek API into popular softwares

32,846 3,624 Updated May 13, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,448 1,246 Updated Jun 14, 2025

[ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification".

Python 41 1 Updated Dec 1, 2024

将微信读书划线和笔记同步到Readwise

Python 11 18 Updated Jun 1, 2023

🔥中文 prompt 精选🔥,ChatGPT 使用指南,提升 ChatGPT 可玩性和可用性!🚀

4,432 392 Updated Jan 10, 2025

Streamlit — A faster way to build and share data apps.

Python 39,895 3,501 Updated Jun 14, 2025

A curated list of resources for using LLMs to develop more competitive grant applications.

Python 3,560 459 Updated Mar 1, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,614 7,971 Updated Jun 15, 2025

Retrieval and Retrieval-augmented LLMs

Python 9,917 731 Updated Jun 4, 2025

A cross-platform framework using Vue.js

JavaScript 40,878 3,695 Updated Jun 14, 2025

中文情感分析库(Chinese Sentiment))可对文本进行情绪分析、正负情感分析。Text analysis, supporting multiple methods including word count, readability, document similarity, sentiment analysis, Word2Vec .

Python 557 86 Updated Dec 9, 2022

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,412 281 Updated Jun 14, 2025

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,329 5,256 Updated Nov 15, 2024

A generative speech model for daily dialogue.

Python 36,795 3,992 Updated May 23, 2025

A list of AI autonomous agents

18,579 1,424 Updated Feb 26, 2025

Datawhale成员整理的面经,内容包括机器学习,CV,NLP,推荐,开发等,欢迎大家star

HTML 2,977 459 Updated Apr 29, 2025

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 517 31 Updated May 16, 2025

Export Weread hightlight 2 Readwise

TypeScript 96 8 Updated Jan 1, 2024

V2rayU,基于v2ray核心的mac版客户端,用于科学上网,使用swift编写,支持trojan,vmess,shadowsocks,socks5等服务协议,支持订阅, 支持二维码,剪贴板导入,手动配置,二维码分享等

19,393 2,935 Updated Jun 12, 2025
Next
0