Sshuoshuo (Shuoshuo Sun) / Starred · GitHub
  • Southeast University
  • Nanjing


Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)

Python 31 4 Updated May 12, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 1,361 76 Updated Jul 2, 2025

Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

TypeScript 74,439 4,306 Updated Jul 4, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 64,720 7,394 Updated Jul 5, 2025

🌐 WebAgent for Information Seeking built by Tongyi Lab: WebWalker & WebDancer & WebSailor https://arxiv.org/pdf/2507.02592

Python 1,319 100 Updated Jul 4, 2025

Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app

MDX 1,866 158 Updated Jul 1, 2025

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 419 30 Updated Jun 6, 2025

Train your Agent model via our easy and efficient framework

Python 1,240 110 Updated Jul 1, 2025

Official Repository of Absolute Zero Reasoner

Python 1,583 268 Updated Jul 1, 2025

A collective list of free APIs

Python 354,837 37,213 Updated May 20, 2025

Tech Enthusiast Weekly (科技爱好者周刊), published every Friday

70,351 3,489 Updated Jul 4, 2025

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Python 1,251 133 Updated Jul 4, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 9,880 860 Updated Jun 18, 2025

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 246 26 Updated Jun 3, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 58,169 8,080 Updated Jul 3, 2025

An open protocol enabling communication and interoperability between opaque agentic applications.

TypeScript 18,021 1,797 Updated Jul 3, 2025

A collection of MCP servers.

59,760 4,629 Updated Jul 3, 2025

Official Repo for Open-Reasoner-Zero

Python 1,983 107 Updated Jun 2, 2025

A curated list of 120+ LLM libraries, organized by category.

4,001 660 Updated Jun 13, 2025

A live-stream development of RL tuning for LLM agents

Python 3,117 435 Updated Jul 4, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 47,600 8,316 Updated Jun 30, 2025
Python 204 8 Updated Feb 20, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 17,331 2,022 Updated Jul 4, 2025

[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Python 643 60 Updated Jun 28, 2025

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,630 350 Updated Jul 2, 2025

Making large AI models cheaper, faster and more accessible

Python 41,009 4,520 Updated Jul 4, 2025

Fully open reproduction of DeepSeek-R1

Python 24,966 2,321 Updated Jul 3, 2025

Witness the aha moment of VLM with less than $3.

Python 3,821 290 Updated May 19, 2025

Interpretation, extension, and reproduction of the DeepSeek series of works.

Python 661 53 Updated Mar 29, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,371 160 Updated Mar 20, 2025