8000 rguo12 (Ruocheng Guo) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View rguo12's full-sized avatar
:octocat:
:octocat:

Highlights

  • Pro

Block or report rguo12

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Data/Code Repository for https://api.semanticscholar.org/CorpusID:218470122

Python 135 32 Updated Jul 25, 2024

ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.

Jupyter Notebook 269 12 Updated Aug 19, 2023
Python 88 24 Updated Apr 30, 2025

"Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems" in SIGIR'21

Python 34 5 Updated May 7, 2023

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,380 58 Updated May 11, 2025

A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.

Python 156 14 Updated Apr 15, 2025

A collective list of free APIs

Python 353,457 37,096 Updated May 20, 2025

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,128 445 Updated May 21, 2025

An LLM-based autonomous agent controlling real-world applications via RESTful APIs

Python 1,371 104 Updated Jun 7, 2024

The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.

Python 39 Updated May 26, 2025
Jupyter Notebook 410 33 Updated Feb 13, 2024

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Python 467 36 Updated Jun 17, 2025

A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.

226 9 Updated Jun 9, 2025

Efficient LLM Inference over Long Sequences

Python 379 19 Updated Jun 25, 2025

Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"

Python 110 12 Updated Oct 19, 2024

Query your data using familiar SQL or intuitive Piped Processing Language (PPL)

Java 143 158 Updated Jun 26, 2025

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 13,089 1,193 Updated Jun 26, 2025

A coding agent framework, that works on its own codebase.

Python 37 10 Updated Apr 23, 2025

(NeurIPS 2024) AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning

Python 207 20 Updated Jun 10, 2025

Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.

Python 286 30 Updated Jun 1, 2025

Contextual Harnessing for Efficient SQL Synthesis

Python 215 65 Updated May 26, 2025

This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide practical guidance for researchers and practitioners. Official re…

Python 734 50 Updated Jun 26, 2025

[EMNLP 2023 Findings] ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought

Python 20 5 Updated Jan 11, 2024

structured outputs for llms

Python 10,821 806 Updated Jun 25, 2025

Dialog2Flow: convert your dialogs to flows. This repository accompanies the paper "Dialog2Flow: Pre-training Soft-Contrastive Sentence Embeddings for Automatic Dialog Flow Extraction", accepted to …

JavaScript 14 Updated May 8, 2025

Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs

Python 17 1 Updated Apr 24, 2025

Suna - Open Source Generalist AI Agent

TypeScript 16,018 2,446 Updated Jun 26, 2025

A collection of research and survey papers of real-time bidding (RTB) based display advertising techniques.

3,647 940 Updated Dec 20, 2024

Conformalized Quantile Regression

Jupyter Notebook 276 51 Updated Apr 6, 2022
Next
0