added documentation to plotting methods and added info about monitoring, recording, plotting to usage.rst

2025-06-18 18:52:52 +02:00 · 2024-03-11 09:38:23 +01:00
parent 6a684dbc6d
commit b959bdd68e
4 changed files with 98 additions and 46 deletions
--- a/docs/source/usage.rst
+++ b/docs/source/usage.rst
@ -61,9 +61,11 @@ Evaluating the run
 ------------------

 If monitoring and recording are enabled, the environment states will be traced and recorded automatically.
-
-Plotting. At the moment a plot of the evaluation score across the different episodes is automatically generated.
-
+The EnvMonitor class acts as a wrapper for Gym environments, monitoring and logging key information during interactions,
+while the EnvRecorder class records state summaries during interactions in the environment.
+At the end of each run a plot displaying the step reward is generated. The step reward represents the cumulative sum of rewards obtained by all agents throughout the episode.
+Furthermore a comparative plot that shows the achieved score (step reward) over several runs with different seeds or different parameter settings can be generated using the methods provided in plotting/plot_compare_runs.py.
+For a more comprehensive evaluation, we recommend using the `Weights and Biases (W&B) <https://wandb.ai/site>`_ framework, with the dataframes generated by the monitor and recorder. These can be found in the run path specified in your script. W&B provides a powerful API for logging and visualizing model training metrics, enabling analysis using predefined or also custom metrics.

 Indices and tables
 ------------------