Emergence in Multi-Agent Systems: A Safety Perspective

by Philipp Altmann, Julian Schönberger, Steffen Illium, Maximilian Zorn, Fabian Ritz, Tom Haider, Simon Burton, Thomas Gabor

About

This is the code for the experiments of our paper. The experiments are build on top of the EDYS environment , which we developed specifically for studying emergent behaviour in multi-agent systems. This environment is versatile and can be configured in various ways with different degrees of complexity. We refer to README-EDYS.md for a detailed overview of the functionalities of the environment and an explanation of the project context.

Setup

Set up a virtualenv with python 3.10 or higher. You can use pyvenv or conda for this.
Run pip install -r requirements.txt to get requirements.
In case there is no study_out/ folder in the root directory, create one.

Rerunning the Experiments

The respective experiments from our paper can be reenacted in main.py. Just select the method representing the part of our experiments you want to rerun and execute it via the __main__ function.

Further Remarks

We use config files located in the configs and the multi_agent_configs, single_agent_configs folders to configure the environments and the RL algorithm for our experiments, respectively. You don't need to change anything to rerun the experiments, but we provided some additional comments in the configs for an overall better understanding of the functionalities.
The results of the experiment runs are stored in study_out.
We reuse the coin-quadrant implementation of the RL agent for the two_rooms environment. The coin assets are masked with flags in the visualization. This masking does not affect the RL agents in any way.
The code for the cost contortion for preventing the emergent behavior of the TSP agents can be found in contortions.py.
The functionalities that drive the emergence prevention mechanisms for the RL agents is mainly located in the utility methods get_ordered_coin_piles (line 94) (for solving the emergence in the coin-quadrant environment) and distribute_indices (line 171) (mechanism for two_doors), that are part of utils.py
agent_models contains the parameters of the trained models for the RL agents. You can repeat the training by executing the training procedures in main.py. Alternatively, you can use your own trained agents, which you have obtained by modifying the training configurations in single_agent_configs , for the evaluation experiments by inserting the names of the run folders, e.g. “run9” and “run 12”, into the list in the methods coin_quadrant_multi_agent_rl_eval and two_rooms_multi_agent_rl_eval in RL_runner.py.

Requirements

Python 3.10

numpy==1.26.4
pygame>=2.0
numba>=0.56
gymnasium>=0.26
seaborn
pandas
PyYAML
networkx
torch
tqdm
packaging
pillow
scipy

3.3 KiB Raw Blame History