Last released Apr 7, 2026
Train a Double DQN router that selects an optimal subset of agents for any multi-agent system.
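For readers unfamiliar with the technique, here is a minimal sketch of how a Double DQN target could be computed when each action is a subset of agents. It is not this package's API: the agent names, the `enumerate_subsets` and `double_dqn_target` helpers, and the Q-values are hypothetical. Treating every non-empty subset as one discrete action is also an assumption made only for illustration.

```python
from itertools import combinations


def enumerate_subsets(agents):
    """List every non-empty subset of agents; each subset is one discrete action.

    Assumption for illustration only: the real router may encode actions differently.
    """
    return [c for k in range(1, len(agents) + 1) for c in combinations(agents, k)]


def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Double DQN target: the online network picks the next action,
    and the target network supplies the value of that action."""
    if done:
        # Terminal transition: no bootstrapped future value.
        return reward
    # Action selection uses the online network's Q-values.
    best_action = max(range(len(next_q_online)), key=next_q_online.__getitem__)
    # Action evaluation uses the target network's Q-values.
    return reward + gamma * next_q_target[best_action]


# Hypothetical agents and made-up Q-values for the next state.
agents = ["planner", "coder", "reviewer"]
actions = enumerate_subsets(agents)
next_q_online = [0.1, 0.4, 0.2, 0.9, 0.3, 0.5, 0.6]
next_q_target = [0.2, 0.3, 0.1, 0.5, 0.4, 0.8, 0.7]

target = double_dqn_target(1.0, next_q_online, next_q_target, gamma=0.9)
```

Separating selection (online network) from evaluation (target network) is what distinguishes Double DQN from standard DQN, which takes the maximum over the target network's own values and tends to overestimate them.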