8000 jpthu17 (Peng Jin) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View jpthu17's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@PKU-YuanGroup

Block or report jpthu17

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 751 66 Updated May 13, 2025

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 356 24 Updated Apr 13, 2025

ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 845 58 Updated Apr 30, 2025

Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…

JavaScript 5,654 511 Updated May 13, 2025

📌 [Arxiv2025] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"

169 3 Updated Apr 1, 2025

GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities

Python 260 4 Updated May 3, 2025

Rich-Text-to-Image Generation

Python 789 67 Updated Oct 9, 2023

V1: Toward Multimodal Reasoning by Designing Auxiliary Task

Python 34 1 Updated Apr 14, 2025

[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant

Jupyter Notebook 11,376 1,638 Updated May 11, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,228 49 Updated May 11, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,239 157 Updated May 13, 2025

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

563 15 Updated May 9, 2025

Official implementation of UnifiedReward & UnifiedReward-Think

Python 346 8 Updated May 14, 2025

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 664 30 Updated Mar 19, 2025

qwen-nsa

Jupyter Notebook 60 5 Updated Apr 11, 2025

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

Python 85 1 Updated Apr 8, 2025

交易模块

Python 6,152 1,379 Updated May 13, 2024

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,655 77 Updated Apr 18, 2025

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

3,067 176 Updated May 7, 2025

Simple RL training for reasoning

Python 3,556 265 Updated Apr 10, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,748 1,483 Updated Apr 24, 2025

minimal-cost for training 0.5B R1-Zero

Python 719 89 Updated Apr 25, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 7,969 920 Updated May 14, 2025

[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Jupyter Notebook 73 1 Updated Mar 1, 2025

[ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization

Python 154 Updated Jun 12, 2024

A jounery to real multimodel R1 ! We are doing on large-scale experiment

Python 301 8 Updated Mar 8, 2025

A fork to add multimodal model training to open-r1

Python 1,255 61 Updated Feb 8, 2025

Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling

Python 31 1 Updated Feb 12, 2025

Fully open reproduction of DeepSeek-R1

Python 24,394 2,246 Updated May 13, 2025
Next
0