Marcel's Website

Projects

Reference Models for deadlock-capable multi-agent pathfinding

Developer Aug 2024 - Present

A project focused on creating reference models for deadlock-capable multi-agent pathfinding. The goal is to provide a benchmark for evaluating MAPF and RL algorithms in this domain.

academic multi-agent-pathfinding deadlocks

Reinforcement learning with Plant Simulation

Developer Feb 2023 - Present

A project focused on applying reinforcement learning with RLlib to the simulation software Plant Simulation. The context is a simple AGV system for two processing stations.

academic reinforcement-learning plant-simulation

Development of a Bologna-based Master Curriculum in Resource Efficient Production Logistics (ProdLog)

Project manager Mar 2020 - Oct 2021

I managed the project and ensured that its goals were met. The goal was to develop a new Master curriculum in Resource Efficient Production Logistics (ProdLog) at six partner universities. The project was funded by the European Union’s Erasmus+ programme.

professional curriculum-development production-logistics

Real-time combination of material flow simulation, digital twins of manufacturing cells, an AGV and a mixed-reality application

Developer Mar 2020 - Jul 2020

I developed the simulation model containing the digital twin of a manufacturing cell and the AGV. The model communicated via an MQTT broker with a mixed-reality application and was used to visualize the real-time state of the system.

academic professional simulation digital-twin mixed-reality

Laundry Order Consolidation System (LOCsys)

Developer Jan 2018 - Feb 2020

My task was to develop a simulation model of a newly designed laundry order consolidation system. The model was used to evaluate the performance of the new system and to identify potential bottlenecks. The “LOCsys” project was funded as a joint project by the BMWi as part of the Central Innovation Program for SMEs (ZIM).

professional simulation plant-simulation

Publications

Multi-Agent Proximal Policy Optimization for a Deadlock Capable Transport System in a Simulation-Based Learning Environment

Proceedings of the 2023 Winter Simulation Conference 10 December 2023

Marcel Müller Lorena S. Reyes-Rubiano Tobias Reggelin Hartmut Zadek

In this paper, we explore the potential of multi-agent reinforcement learning (MARL) for managing the driving behavior of autonomous guided vehicles (AGVs) in production logistics environments with single-lane tracks, where deadlocks pose a significant challenge. We build upon previous work and adopt a MARL approach using the Proximal Policy Optimization (PPO) algorithm. We conduct a thorough hyperparameter search and investigate the impact of varying numbers of agents on the performance of the AGVs. Our results demonstrate the effectiveness of the MARL approach in addressing deadlocks and coordinating AGV behavior, as well as the scalability of the learned policy to different numbers of agents. The Bayesian optimization process and increased iteration count contribute to improved performance and more stable learning curves.

Multi-Agent Reinforcement Learning Proximal Policy Optimization Automated Guided Vehicles Deadlocks Simulation

A review on reinforcement learning algorithms and applications in supply chain management

International Journal of Production Research 3 November 2022

Benjamin Rolf Ilya Jackson Marcel Müller Sebastian Lang Tobias Reggelin Dmitry Ivanov

Decision-making in supply chains is challenged by high complexity, a combination of continuous and discrete processes, integrated and interdependent operations, dynamics, and adaptability. The rapidly increasing data availability, computing power and intelligent algorithms unveil new potentials in adaptive data-driven decision-making. Reinforcement Learning, a class of machine learning algorithms, is one of the data-driven methods. This semi-systematic literature review explores the current state of the art of reinforcement learning in supply chain management (SCM) and proposes a classification framework. The framework classifies academic papers based on supply chain drivers, algorithms, data sources, and industrial sectors. The conducted review revealed a few critical insights. First, the classic Q-learning algorithm is still the most popular one. Second, inventory management is the most common application of reinforcement learning in supply chains, as it is a pivotal element of supply chain synchronisation. Last, most reviewed papers address toy-like SCM problems driven by artificial data. Therefore, shifting to industry-scale problems will be a crucial challenge in the next years. If this shift is successful, the vision of data-driven decision-making in real-time could become a reality.

Reinforcement Learning Supply Chain Management

Comparison of Deadlock Handling Strategies for Different Warehouse Layouts with an AGVS

Proceedings of the 2020 Winter Simulation Conference 14 December 2020

Marcel Müller Jan Hendrik Ulrich Lorena S. Reyes-Rubiano Tobias Reggelin Sebastian Lang

Automated guided vehicles (AGVs) form a large and important part of logistic systems to improve productivity and reduce costs. When multiple AGVs are running in limited and uncertain environments, lots of issues can occur, such as collisions and deadlocks, which need to be addressed. This paper presents a flexible simulation model for a warehouse with various AGVs. We implemented all three typical strategies to handle deadlocks (prevention, avoidance and detection and resolution). The results show that there is no dominant strategy and that the results strongly depend on the individual case and the input parameters.

Deadlocks Simulation Logistics Automated Guided Vehicles

Analyze processes in goods receipt.
Plan a small parts warehouse.
Develop a concept for the shipping area.

Education

Otto von Guericke University Magdeburg, 2019-2025
Doktoringenieur (Dr.-Ing.)
PhD Thesis: Multi-Agent Reinforcement Learning for Deadlock Handling among Autonomous Mobile Robots
Supervisors: Prof. Dr.-Ing. Hartmut Zadek, Prof. Dr.-Ing. Ernesto William De Luca

Otto von Guericke University Magdeburg, 2015-2016
Master of Science in Industrial Engineering for Logistics
Thesis: Extension of a key performance indicator system for comparing freight transport system scenarios to include a dynamic representation of forecast values.
Supervisor: Univ.-Prof. Dr.-Ing. habil. Prof. E. h. Dr. h. c. mult. Michael Schenk

Otto von Guericke University Magdeburg, 2010-2014
Bachelor of Science in Industrial Engineering for Logistics
Thesis: Development of a simulation model for planning the washing order sequence in an industrial laundry.
Supervisor: Univ.-Prof. Dr.-Ing. habil. Prof. E. h. Dr. h. c. mult. Michael Schenk

Werner-von-Siemens Gymnasium, 2001-2009
Abitur

Hi, I am Marcel

Marcel Müller

Research Fellow at Otto von Guericke University Magdeburg

Experiences

Research Fellow (secondary employment)

Research Fellow

Technology Consultant

Online Editor & Process Analyst

Student Assistant

Achievements

Best PhD Student Paper Award

Melting Pot Challenge of NeurIPS 2023

2nd place in the Sustainable Supply Chain Deephack
