Stars
Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"
🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability
Code&Data for the paper "Distilling Rule-based Knowledge into Large Language Models" [COLING 2025]
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型