8000 1KE-JI (Ke Ji) / Starred · GitHub

More Web Proxy on the site http://driver.im/

1KE-JI

Follow

Ke Ji 1KE-JI

Follow

PhD student from The Chinese University of Hong Kong, Shenzhen.

10 followers · 19 following

The Chinese University of Hong Kong, Shenzhen
Shenzhen, China
01:42 (UTC -12:00)
https://1ke-ji.github.io/

Achievements

Achievements

8000

Stars

Simple-Efficient / RL-Factory

Train your Agent model via our easy and efficient framework

Python 766 66 Updated May 30, 2025

MiniMax-AI / SynLogic

The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Python 80 4 Updated May 30, 2025

crewAIInc / crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 32,244 4,329 Updated May 30, 2025

MiniMax-AI / MiniMax-01

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,725 205 Updated May 12, 2025

MiniMax-AI / MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

Python 515 49 Updated May 7, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 14,774 1,880 Updated May 30, 2025

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 10,768 1,841 Updated May 15, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 5,118 338 Updated May 30, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,278 52 Updated May 11, 2025

FreedomIntelligence / Awesome-LLM-Patient-Simulators

A Paper collection for LLM based Patient Simulators

29 3 Updated Apr 16, 2025

LightChen233 / Awesome-Long-Chain-of-Thought-Reasoning

Latest Advances on Long Chain-of-Thought Reasoning

341 20 Updated May 29, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 6,413 748 Updated May 19, 2025

1KE-JI / UPFT

Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models"

Python 9 2 Updated May 28, 2025

Qihoo360 / Light-R1

Python 706 47 Updated May 30, 2025

gaojingsheng / SmartRAG

Original implementation of SmartRAG: Jointly Learn RAG-Related Tasks From the Environment Feedback (ICLR 2025)

Python 4 1 Updated Feb 17, 2025

seal-rg / recurrent-pretraining

Pretraining code for a large-scale depth-recurrent language model

Python 770 65 Updated May 29, 2025

Xuchen-Li / llm-arxiv-daily

Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions.

Python 59 6 Updated May 30, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,764 1,094 Updated May 30, 2025

unslothai / unsloth

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 39,634 3,134 Updated May 30, 2025

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,597 267 Updated Apr 10, 2025

deepseek-ai / DeepSeek-R1

89,700 11,589 Updated Apr 9, 2025

camel-ai / camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 12,716 1,353 Updated May 30, 2025

deepseek-ai / DeepSeek-V3

Python 97,284 15,807 Updated Apr 9, 2025

FreedomIntelligence / HuatuoGPT-o1

Medical o1, Towards medical complex reasoning with LLMs

Python 1,116 112 Updated Jan 20, 2025

hendrycks / test

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,418 103 Updated May 28, 2023

shizhediao / R-Tuning

[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't Know'"

Python 111 11 Updated Jul 10, 2024

collin-burns / discovering_latent_knowledge

Python 269 39 Updated Mar 2, 2024

open-compass / GPassK

[ACL 2025] Are Your LLMs Capable of Stable Reasoning?

Python 25 2 Updated Mar 18, 2025

google-research / albert

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Python 3,271 570 Updated Apr 14, 2023

WooooDyy / MathCritique

Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".

Python 54 1 Updated Nov 29, 2024

0