3.3 KiB
Emergence in Multi-Agent Systems: A Safety Perspective
by Philipp Altmann, Julian Schönberger, Steffen Illium, Maximilian Zorn, Fabian Ritz, Tom Haider, Simon Burton, Thomas Gabor
About
This is the code for the experiments of our paper. The experiments are build on top of the EDYS environment ,
which we developed specifically for studying emergent behaviour in multi-agent systems. This environment is versatile
and can be configured in various ways with different degrees of complexity. We refer to README-EDYS.md for a
detailed overview of the functionalities of the environment and an explanation of the project context.
Setup
- Set up a virtualenv with python 3.10 or higher. You can use pyvenv or conda for this.
- Run
pip install -r requirements.txtto get requirements. - In case there is no
study_out/folder in the root directory, create one.
Rerunning the Experiments
The respective experiments from our paper can be reenacted in main.py.
Just select the method representing the part of our experiments you want to rerun and
execute it via the __main__ function.
Further Remarks
- We use config files located in the configs and the multi_agent_configs, single_agent_configs folders to configure the environments and the RL algorithm for our experiments, respectively. You don't need to change anything to rerun the experiments, but we provided some additional comments in the configs for an overall better understanding of the functionalities.
- The results of the experiment runs are stored in study_out.
- We reuse the
coin-quadrantimplementation of the RL agent for thetwo_roomsenvironment. The coin assets are masked with flags in the visualization. This masking does not affect the RL agents in any way. - The code for the cost contortion for preventing the emergent behavior of the TSP agents can be found in contortions.py.
- The functionalities that drive the emergence prevention mechanisms for the RL agents is mainly
located in the utility methods
get_ordered_coin_piles (line 94)(for solving the emergence in the coin-quadrant environment) anddistribute_indices (line 171)(mechanism for two_doors), that are part of utils.py - agent_models contains the parameters of the trained models for the RL
agents. You can repeat the training by executing the training procedures in main.py. Alternatively, you can
use your own trained agents, which you have obtained by modifying the training configurations in single_agent_configs
, for the evaluation experiments by inserting the names of the run folders, e.g. “run9” and “run 12”, into the list in
the methods
coin_quadrant_multi_agent_rl_evalandtwo_rooms_multi_agent_rl_evalin RL_runner.py.
Requirements
Python 3.10
numpy==1.26.4
pygame>=2.0
numba>=0.56
gymnasium>=0.26
seaborn
pandas
PyYAML
networkx
torch
tqdm
packaging
pillow
scipy