FoPra Beluga Challenge - Reinforcement Learning v1.0
Deep Reinforcement Learning solution for the Beluga Challenge shipping container optimization problem using PPO and MCTS
|
Crl.env.state.Beluga | Container ship (Beluga) with current and outgoing jigs |
Crl.env.environment.Env | Beluga Challenge environment for reinforcement learning |
Crl.env.state.Jig | Individual jig instance with type and status |
Crl.env.state.JigType | Represents a type of jig with size properties |
Crl.mcts.mcts.MCTS | Monte Carlo Tree Search algorithm implementation |
Crl.mcts.mcts_node.MCTSNode | Node class for Monte Carlo Tree Search |
▼Cnn.Module | |
Crl.agents.high_level.ppo_agent.ActorNetwork | Actor network for PPO agent |
Crl.agents.high_level.ppo_agent.CriticNetwork | Critic network for PPO agent |
Crl.agents.high_level.ppo_agent.PPOAgent | Proximal Policy Optimization (PPO) agent implementation |
Crl.agents.high_level.ppo_agent.PPOMemory | Memory buffer for storing PPO training experiences |
Crl.env.state.ProblemState | Complete state representation for the Beluga Challenge |
Crl.env.state.ProductionLine | Factory production line with scheduled jigs |
Crl.env.state.Rack | Storage rack with size constraints |
Crl.training.trainer.Trainer | Main training orchestrator for the Beluga Challenge |