More
Stars
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
OS-ATLAS: A Foundation Action Model For Generalist GUI Agents
A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP
An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
A curated list of awesome exploration RL resources (continually updated)
类似按键精灵的鼠标键盘录制和自动化操作 模拟点击和键入 | automate mouse clicks and keyboard input
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)
Multimodal computer agent data collection program
React95 / React95
AA6FA React components library with Win95 UI
A simple and elegant Jekyll theme for an academic personal homepage
A repo built for the purpose of benchmarking the performance of agents, regardless of how they are set up and how they work.
Add AI capabilities to any readline-enabled command-line program
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
Plan-Validate-Solve (PVS) Agent for accurate, reliable and reproducable workflow automation
Run macOS VM in a Docker! Run near native OSX-KVM in Docker! X11 Forwarding! CI/CD for OS X Security Research! Docker mac Containers.
AutoKey, a desktop automation utility for Linux and X11.
Staggeringly powerful macOS desktop automation with Lua
AutoHotkey - macro-creation and automation-oriented scripting utility for Windows.
🔵 Cerebro is an open-source launcher to improve your productivity and efficiency
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复