A Trusted Human-Multi-Agent Reinforcement Learning Interaction Framework
This repository implements a Multi-Agent System (MAS) framework for human-machine collaborative crisis response, combining vision-language models (VLMs) and reinforcement learning (RL) to enhance safety and reliability. The framework features:
- Real-Time Task Execution: Modular task chains with built-in safety rules and human oversight.
- Simulation Training: Experience replay library for risk prediction and optimization.
- Dynamic Trust Mechanism: Balances task utility and safety constraints through RL.
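The dynamic trust mechanism can be illustrated with a minimal sketch. This is not the repository's implementation; the class name, constants, and update rule below are all illustrative assumptions about how a trust score could weight task utility against a safety penalty in the RL reward:

```python
# Hypothetical sketch of a dynamic trust mechanism: the RL reward blends
# task utility with a safety penalty, weighted by a trust score that is
# updated from observed episode outcomes. All names/values are illustrative.

class DynamicTrust:
    def __init__(self, trust=0.5, lr=0.1):
        self.trust = trust  # agent trust score in [0, 1]
        self.lr = lr        # trust update rate

    def reward(self, utility, risk):
        # Low trust shifts weight from task utility toward the safety penalty.
        return self.trust * utility - (1.0 - self.trust) * risk

    def update(self, was_safe):
        # Nudge trust up after safe episodes, down after risky ones.
        target = 1.0 if was_safe else 0.0
        self.trust += self.lr * (target - self.trust)
        self.trust = min(1.0, max(0.0, self.trust))
```

Under this sketch, repeated unsafe episodes lower the trust score, which automatically makes the policy more conservative by amplifying the safety term in the reward.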
Key Contributions:
- Dual-mode architecture (online execution + offline simulation).
- The first safety fine-tuned LLM and training dataset for emergency scenarios.
- 15% improvement in helpfulness and 40% reduction in risk response rate compared to baseline.
```bash
git clone https://github.com/erwinmsmith/SOMAS.git
cd SOMAS
pip install -r requirements.txt
```
- Real-Time Task Execution:
  ```bash
  python main.py --online
  ```
- Simulation Training:
  ```bash
  python main.py --offline
  ```
Online Execution System
- Planning-Execution Pipeline: Modular task chains drive tool operations.
- Safety Guardrails: Predefined rules and GPT-4-based risk assessment.
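The guardrail step can be sketched as a screening function applied to each proposed tool call: predefined rules veto clearly dangerous calls, and anything ambiguous is deferred to a model-based risk assessor (GPT-4 in this framework) or a human overseer. The rule patterns and function names below are hypothetical, not the repository's API:

```python
# Illustrative guardrail sketch: hard rules run first; if no rule fires,
# an optional risk assessor (e.g. a GPT-4 judge or a human) decides whether
# to escalate. Patterns and names are hypothetical.

BLOCKED_PATTERNS = ("shutdown", "delete", "override_safety")

def screen_tool_call(tool_name, args, risk_assessor=None):
    """Return 'allow', 'block', or 'escalate' for a proposed tool call."""
    text = f"{tool_name} {args}".lower()
    # Hard rules: block anything matching a forbidden pattern outright.
    if any(p in text for p in BLOCKED_PATTERNS):
        return "block"
    # No rule fired: defer ambiguous calls to the risk assessor if present.
    if risk_assessor is not None:
        return "escalate" if risk_assessor(tool_name, args) else "allow"
    return "allow"
```

Layering cheap rule checks before an expensive model-based assessment keeps latency low on the common path while still catching cases the static rules miss.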
Offline Simulation System
- Task Generation: Synthetic tasks from manual records and prior knowledge.
- Experience Replay: Optimizes RL policies for dynamic environments.
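Experience replay follows the standard RL pattern: transitions gathered from simulated tasks are stored in a bounded buffer and sampled in random minibatches for policy updates. A minimal sketch (not the repository's exact implementation):

```python
import random
from collections import deque

# Minimal experience-replay buffer (standard RL pattern): simulated-task
# transitions are stored and sampled uniformly at random, which decorrelates
# consecutive steps during policy optimization.

class ReplayBuffer:
    def __init__(self, capacity=10_000):
        self.buffer = deque(maxlen=capacity)  # oldest entries evicted first

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform random minibatch, capped at the current buffer size.
        return random.sample(self.buffer, min(batch_size, len(self.buffer)))

    def __len__(self):
        return len(self.buffer)
```

The bounded `deque` keeps the replay library fresh in dynamic environments by silently discarding the oldest transitions once capacity is reached.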
| Domain | Model | Safety (↑) | Helpfulness (↑) | Risk Response Rate (↓) |
|---|---|---|---|---|
| Safety-CV | Qwen2-7B-VL | 4.5 | 4.7 | 40% |
- VL models reduced operational risks by 30% via image semantic parsing.
- Dynamic safety validation improved helpfulness by 15% over ToolEmu.
If you need the detailed safety data (train or test), contact duanzhenke@sscapewh.com.