FoPra Beluga Challenge - Reinforcement Learning v1.0
Deep Reinforcement Learning solution for the Beluga Challenge shipping container optimization problem using PPO and MCTS
|
▼Nrl | |
▼Nagents | |
▼Nhigh_level | |
▼Nppo_agent | |
CActorNetwork | Actor network for PPO agent |
CCriticNetwork | Critic network for PPO agent |
CPPOAgent | Proximal Policy Optimization (PPO) agent implementation |
CPPOMemory | Memory buffer for storing PPO training experiences |
▼Nenv | |
▼Nenvironment | |
CEnv | Beluga Challenge environment for reinforcement learning |
▼Nstate | |
CBeluga | Container ship (Beluga) with current and outgoing jigs |
CJig | Individual jig instance with type and status |
CJigType | Represents a type of jig with size properties |
CProblemState | Complete state representation for the Beluga Challenge |
CProductionLine | Factory production line with scheduled jigs |
CRack | Storage rack with size constraints |
▼Nmcts | |
▼Nmcts | |
CMCTS | Monte Carlo Tree Search algorithm implementation |
▼Nmcts_node | |
CMCTSNode | Node class for Monte Carlo Tree Search |
▼Ntraining | |
▼Ntrainer | |
CTrainer | Main training orchestrator for the Beluga Challenge |