|
7706d1f704
|
Store code submission last state for paper
|
2024-10-23 11:00:14 +02:00 |
|
|
c56bb3f188
|
Reword TSP runner
|
2024-08-09 16:36:39 +02:00 |
|
|
0a8eaf8827
|
Increased maximum number of nodes for TSP agent to deal with all potential dirt placements in dirt_quadrant
|
2024-08-09 16:36:05 +02:00 |
|
|
973f3e9fc8
|
Added various plotting methods + Fixed action maps plotting
|
2024-08-09 16:35:08 +02:00 |
|
|
fcd2eaf773
|
Added option for random dirt positioning
|
2024-08-09 16:33:43 +02:00 |
|
|
50ef0c94e9
|
Updated RL_runner
|
2024-08-09 16:33:06 +02:00 |
|
|
4c81e4b865
|
Reworked configurations
|
2024-08-09 16:31:08 +02:00 |
|
|
8e8e925278
|
Reworked differentiation between train and eval execution + Renamed cfgs + Added algorithm seeding + Included early stopping functionality + Added weights&biases logging
|
2024-08-09 16:30:04 +02:00 |
|
|
81b12612ed
|
Updated required python version
|
2024-05-27 18:45:28 +02:00 |
|
|
a0852e805a
|
Updated pomdp_r comment + Added some additional comments + Restructured experiment calling + Added Readme and requirements.txt
|
2024-05-27 18:23:11 +02:00 |
|
|
41a1ec0a5b
|
Update utility plotting + Added alternative methods for TSP emergence prevention
|
2024-05-27 13:25:52 +02:00 |
|
|
a78e0dd8a3
|
Added commentary to configs
|
2024-05-25 18:31:18 +02:00 |
|
|
dca930636e
|
Add non-generic utility plots
|
2024-05-25 17:36:49 +02:00 |
|
|
ac35e46310
|
Refactored a2c_dirt file
|
2024-05-25 01:45:09 +02:00 |
|
|
81f0f6e209
|
Code cleaning part 2
|
2024-05-24 23:56:00 +02:00 |
|
|
6e6ce9dc5d
|
Equalize rendering for TSP and RL agents
|
2024-05-24 18:54:38 +02:00 |
|
|
c8336e8f78
|
Reset tsp route caching + renamed and moved configs + removed unnecessary files
|
2024-05-24 16:12:05 +02:00 |
|
|
98113ea849
|
Render all spawnpoints that are matched with a target dirt pile + Fixed arrow placement
|
2024-05-24 14:06:34 +02:00 |
|
|
defbaf6f93
|
Merge remote-tracking branch 'origin/marl_refactor' into marl_refactor
# Conflicts:
# marl_factory_grid/algorithms/static/TSP_base_agent.py
# marl_factory_grid/utils/plotting/plot_single_runs.py
# marl_factory_grid/utils/renderer.py
# test_run.py
|
2024-05-24 08:38:12 +02:00 |
|
|
37379d434e
|
centered the arrows and text for better readability in plt action maps and added colored arrow assets
|
2024-05-22 12:34:53 +02:00 |
|
|
83f0c70cfb
|
added plotting probability maps
|
2024-05-22 12:34:50 +02:00 |
|
|
3f88c4ee74
|
cleanup
|
2024-05-22 12:34:46 +02:00 |
|
|
0295af34b1
|
fixed coordinate mismatch in route plotting and added assets for actions
|
2024-05-22 12:34:32 +02:00 |
|
|
0d5b20a16f
|
Adapted commit: "started visualization of routes in plot single runs, assets missing."
|
2024-05-22 12:34:06 +02:00 |
|
|
c7f2bbfbac
|
Merge remote-tracking branch 'origin/marl_refactor' into marl_refactor
|
2024-05-21 11:40:22 +02:00 |
|
|
13ea9d25c9
|
Merge remote-tracking branch 'origin/rl_plotting' into marl_refactor
|
2024-05-21 11:39:51 +02:00 |
|
|
4571dc1cd1
|
Added shared piles option to dirt_quadrant eval + Changed dirt_quadrant layout and adapted configs
|
2024-05-17 17:52:52 +02:00 |
|
|
1a8ca9110b
|
added rendering of start and target pos. changed file save location to match current run in study out
|
2024-05-16 13:16:53 +02:00 |
|
|
cb990445ce
|
levelparser doesnt throw an error but instead prints a warning when no doors are found in the lvl file.
|
2024-05-13 11:52:21 +02:00 |
|
|
f8aa4e0c03
|
merged RL agent with action map plotting and added it to end of agents train loop
|
2024-05-12 11:48:05 +02:00 |
|
|
5e9e59c843
|
Added code for tsp_runs + Updated eval configs so that every episode only takes a maximum number of steps
|
2024-05-10 17:38:34 +02:00 |
|
|
d0d31b964c
|
Added 'shared' dirt piles option for eval + Fixed usage of renderer + Added recorder option
|
2024-05-10 14:54:00 +02:00 |
|
|
5c5a09be3f
|
Merge branch 'route_plotting' into rl_plotting
|
2024-05-10 12:48:06 +02:00 |
|
|
a25b04e092
|
Included method to tackle emergence in two_rooms_one_door_modified + Better access of different settings in marl_adapted + Added and modified a lot of config files
|
2024-05-10 11:57:26 +02:00 |
|
|
28094bf4ce
|
centered the arrows and text for better readability in plt action maps and added colored arrow assets
|
2024-05-10 10:25:35 +02:00 |
|
|
0e09094f97
|
added plotting probability maps
|
2024-05-08 14:27:08 +02:00 |
|
|
39b123221b
|
cleanup
|
2024-05-06 19:30:27 +02:00 |
|
|
865669055d
|
fixed coordinate mismatch in route plotting and added assets for actions
|
2024-05-06 13:55:47 +02:00 |
|
|
89ce723690
|
Fixed gitignore ignoring study_out base folder
|
2024-05-06 12:48:07 +02:00 |
|
|
3c54d04f9f
|
All relevant functional code for A2C Dirt Quadrant setting with small changes to the environment + Different configs for single agent and multiagent settings
|
2024-05-06 12:33:37 +02:00 |
|
|
9f2cb103f4
|
started visualization of routes in plot single runs, assets missing.
|
2024-05-02 17:07:33 +02:00 |
|
|
55026eda12
|
Add Independent A2C implementation
|
2024-05-02 11:10:23 +02:00 |
|
|
175a02f997
|
Add various RL adapted configs part 2
|
2024-05-02 11:01:20 +02:00 |
|
|
c7c2c4e5a3
|
Add various RL adapted configs
|
2024-05-02 11:00:35 +02:00 |
|
|
48d708bbcd
|
environment code changes for RL settings
|
2024-05-02 10:54:46 +02:00 |
|
|
5ee39eba8d
|
added fallback action attribute to agents and set standard fallback action to noop
|
2024-04-29 11:03:59 +02:00 |
|
|
0bbf0dafdb
|
Moved route caching to env level and removed print statements
|
2024-04-28 13:52:39 +02:00 |
|
|
50bcf5d995
|
Merge branch 'unit_testing' into marl_refactor
|
2024-04-19 09:46:46 +02:00 |
|
|
54d4e1ecb5
|
added simple route caching and fixed move 4 point graph in tspbaseagent
|
2024-04-17 15:28:10 +02:00 |
|
|
fe5a97a413
|
added allowed direction check for predict move
|
2024-04-04 12:48:14 +02:00 |
|