Adaptive Weighting in Online Ensemble Reinforcement Learning
This project investigates ensemble reinforcement learning methods where multiple reinforcement learning algorithms are combined within a single adaptive agent. The agent dynamically adjusts the contribution of each algorithm during decision-making based on the current state and environment conditions, with the goal of improving robustness, adaptability, and online learning performance across diverse environments. Date: September 2024 - May 2028 Persons participating in the project:
- PIs: Dr. Francisco Cruz, Dr. Eduardo Benitez Sandoval, Prof. Richard Dazeley, Prof. Peter Vamplew
- Associates: Charlie Stinson
- Corresponding contact: charles.stinson@unsw.edu.au
- Reinforcement Learning
- Ensemble Learning
- Policy Aggregation
- Online Learning
- Adaptive AI Systems
- Dynamic Weighting
- Non-Stationary Environments
- Sequential Decision Making
| Selected Publications | Web |
|---|---|
| Stinson, C., Vamplew, P., Dazeley, R., Sandoval, E., & Cruz, F. (2026, June). Trajectory-Guided Weight Adaptation for Ensemble Reinforcement Learning. In press The International Joint Conference on Neural Networks (IJCNN). |