8000 GitHub - lmgame-org/ai-space-escape-engine: This repo is an artifact of the following paper — GameArena: Evaluating LLM Reasoning through Live Computer Games [ICLR 2025]
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

This repo is an artifact of the following paper — GameArena: Evaluating LLM Reasoning through Live Computer Games [ICLR 2025]

License

Notifications You must be signed in to change notification settings

lmgame-org/ai-space-escape-engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI-SPACE-ESCAPE

 AI Space Escape

| Paper | Blog | Discord | X | Roblox Game

License Maintenance Contributions welcome

This repo features the backend code for the Roblox game we built, AI Space Escape, offering an unique experience to reason with AI. We design evaluation techniques to rank state-of-the-art large language models (LLMs). Our mission is to enable engaging gameplay while evaluating a variety of large-scale AI models and systems.

AI Space Escape Engine (Interfaced with Roblox API)

Installation

  1. Clone this repository:
git clone https://github.com/lmgame-org/game-arena-engine.git
cd game-arena-engine
  1. Install dependency:
python -m venv ./venv
.\venv\Scripts\activate.bat
pip install -r requirements.txt

Set Up API Endpoints

  1. Navigate to game-arena-engine/src/config, fill in YOUR_API_KEY.

Experience the Games

  1. Launch your backend with
python ./src/serve.py
  1. Use ngrok to perform port forwarding to make this accessible on WAN.

Citation

If you find this repository helpful, Please kindly cite:

article{hu2024gamearena,
  title={GameArena: Evaluating LLM Reasoning through Live Computer Games},
  author={Hu, Lanxiang and Li, Qiyu and Xie, Anze and Jiang, Nan and Stoica, Ion and Jin, Haojian and Zhang, Hao},
  journal={arXiv preprint arXiv:2412.06394},
  year={2024}
}

About

This repo is an artifact of the following paper — GameArena: Evaluating LLM Reasoning through Live Computer Games [ICLR 2025]

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

0