8000 1KE-JI (Ke Ji) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View 1KE-JI's full-sized avatar

Block or report 1KE-JI

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
8000
Showing results

Train your Agent model via our easy and efficient framework

Python 766 66 Updated May 30, 2025

The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Python 80 4 Updated May 30, 2025

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 32,244 4,329 Updated May 30, 2025

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,725 205 Updated May 12, 2025

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

Python 515 49 Updated May 7, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 14,774 1,880 Updated May 30, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 10,768 1,841 Updated May 15, 2025

Efficient Triton Kernels for LLM Training

Python 5,118 338 Updated May 30, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,278 52 Updated May 11, 2025

A Paper collection for LLM based Patient Simulators

29 3 Updated Apr 16, 2025

Latest Advances on Long Chain-of-Thought Reasoning

341 20 Updated May 29, 2025

s1: Simple test-time scaling

Python 6,413 748 Updated May 19, 2025

Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models"

Python 9 2 Updated May 28, 2025
Python 706 47 Updated May 30, 2025

Original implementation of SmartRAG: Jointly Learn RAG-Related Tasks From the Environment Feedback (ICLR 2025)

Python 4 1 Updated Feb 17, 2025

Pretraining code for a large-scale depth-recurrent language model

Python 770 65 Updated May 29, 2025

Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions.

Python 59 6 Updated May 30, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,764 1,094 Updated May 30, 2025

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 39,634 3,134 Updated May 30, 2025

Simple RL training for reasoning

Python 3,597 267 Updated Apr 10, 2025

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 12,716 1,353 Updated May 30, 2025

Medical o1, Towards medical complex reasoning with LLMs

Python 1,116 112 Updated Jan 20, 2025

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,418 103 Updated May 28, 2023

[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't Know'"

Python 111 11 Updated Jul 10, 2024

[ACL 2025] Are Your LLMs Capable of Stable Reasoning?

Python 25 2 Updated Mar 18, 2025

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Python 3,271 570 Updated Apr 14, 2023

Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".

Python 54 1 Updated Nov 29, 2024
Next
0