-
Shanghai Jiao Tong University
- Shanghai, China
- https://skyriver-2000.github.io/
- @skyriver_2000
Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
Fully open reproduction of DeepSeek-R1
Minimal reproduction of DeepSeek R1-Zero
AgentNetworkProtocol(ANP) is an open source protocol for agent communication. Our vision is to define how agents connect with each other, building an open, secure, and efficient collaboration netwo…
[FSE-2024] Towards AI-Assisted Synthesis of Verified Dafny Methods
verl: Volcano Engine Reinforcement Learning for LLMs
Awesome Reasoning LLM Tutorial/Survey/Guide
📖 A curated list of awesome time-series papers, benchmarks, datasets, tutorials. (WIP)
Codes and data for our paper - RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
AcadHomepage: A Modern and Responsive Academic Personal Homepage
[SIGIR 2024] TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision
Collection of advice for prospective and current PhD students
A collection of benchmarks and datasets for evaluating LLM.
RuLES: a benchmark for evaluating rule-following in language models
Controlled Text Generation via Language Model Arithmetic
Documents used for grad school application
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
[ICML'2024] Can AI Assistants Know What They Don't Know?
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
Muzic: Music Understanding and Generation with Artificial Intelligence
MiniWoB++: a web interaction benchmark for reinforcement learning
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents