I am a passionate researcher and engineer with several years of experience in modeling, simulation, and logistics. I have built simulation models to address real-world problems and developed custom environments for training multi-agent reinforcement learning policies using RLlib. My research focuses on resolving coordination and deadlock issues among mobile robots in logistics systems. On the side, I also enjoy fun RL projects, such as connecting PySC2 with RLlib to train agents on StarCraft II or applying RL to browser games.
A project focused on creating reference models for deadlock-capable multi-agent pathfinding. The goal is to provide a benchmark for evaluating MAPF and RL algorithms in this domain.
A project focused on applying reinforcement learning with RLlib to the simulation software Plant Simulation. Context is a simple AGV system for two processing stations.
I was responsible for the management of the project and to make sure that the project goals were met. The goal was to develop a new Master curriculum in Resource Efficient Production Logistics (ProdLog) at six partner universities. The project was funded by the European Union’s Erasmus+ programme.
I developed the simulation model containing the digital twin of a manufacturing cell and the AGV. The model communicated via MQTT broker with a mixed-reality application and was used to visualize the real-time state of the system.
My task was to develop a simulation model of a new designed laundry order consolidation system. The model was used to evaluate the performance of the new system and to identify potential bottlenecks. The “LOCSys” project was funded as a joint project by the BMWi as part of the Central Innovation Program for SMEs (ZIM).
In this paper, we explore the potential of multi-agent reinforcement learning (MARL) for managing the driving behavior of autonomous guided vehicles (AGVs) in production logistics environments with single-lane tracks, where deadlocks pose a significant challenge. We build upon previous work and adopt a MARL approach using the Proximal Policy Optimization (PPO) algorithm. We conduct a thorough hyperparameter search and investigate the impact of varying numbers of agents on the performance of the AGVs. Our results demonstrate the effectiveness of the MARL approach in addressing deadlocks and coordinating AGV behavior, as well as the scalability of the learned policy to different numbers of agents. The Bayesian optimization process and increased iteration count contribute to improved performance and more stable learning curves.
Decision-making in supply chains is challenged by high complexity, a combination of continuous and discrete processes, integrated and interdependent operations, dynamics, and adaptability. The rapidly increasing data availability, computing power and intelligent algorithms unveil new potentials in adaptive data-driven decision-making. Reinforcement Learning, a class of machine learning algorithms, is one of the data-driven methods. This semi-systematic literature review explores the current state of the art of reinforcement learning in supply chain management (SCM) and proposes a classification framework. The framework classifies academic papers based on supply chain drivers, algorithms, data sources, and industrial sectors. The conducted review revealed a few critical insights. First, the classic Q-learning algorithm is still the most popular one. Second, inventory management is the most common application of reinforcement learning in supply chains, as it is a pivotal element of supply chain synchronisation. Last, most reviewed papers address toy-like SCM problems driven by artificial data. Therefore, shifting to industry-scale problems will be a crucial challenge in the next years. If this shift is successful, the vision of data-driven decision-making in real-time could become a reality.
Automated guided vehicles (AGVs) form a large and important part of logistic systems to improve productivity and reduce costs. When multiple AGVs are running in limited and uncertain environments, lots of issues can occur, such as collisions and deadlocks, which need to be addressed. This paper presents a flexible simulation model for a warehouse with various AGVs. We implemented all three typical strategies to handle deadlocks (prevention, avoidance and detection and resolution). The results show that there is no dominant strategy and that the results strongly depend on the individual case and the input parameters.
Jul 2025 - Present
Magdeburg, Germany
Fraunhofer Institute for Factory Operation and Automation IFF is a research institution located in Magdeburg, Germany. It focuses on applied research in the fields of production and logistics.
Jul 2025 - Present
Feb 2018 - Present
Magdeburg, Germany
Otto von Guericke University Magdeburg is a research university located in Magdeburg, Germany. It is known for its engineering and natural sciences programs.
Feb 2018 - Present
Aug 2017 - Dec 2017
Magdeburg, Germany
FASA is an association for the promotion of mechanical and plant engineering in Saxony-Anhalt.
Aug 2017 - Dec 2017
Jan 2012 - Jun 2017
Magdeburg, Germany
Müller Marketing GmbH is a B2B-focused marketing and advertising agency, offering services in brand communication, corporate design, and strategic marketing for medium-sized businesses.
Jan 2012 - Jun 2017
Oct 2013 - Dec 2013
Haldensleben, Germany
Brömse GmbH & Co. KG is a company specializing in the production of energy-efficient windows, doors, and roller shutter systems made from plastic and aluminum.
Oct 2013 - Dec 2013
![]() Master of Science in Industrial Engineering for LogisticsThesis:Extension of a key performance indicator system for comparing freight transport system scenarios to include a dynamic representation of forecast values. Supervisor:Univ.-Prof. Dr.-Ing. habil. Prof. E. h. Dr. h. c. mult. Michael Schenk | ||
![]() Bachelor of Science in Industrial Engineering for LogisticsThesis:Development of a simulation model for planning the washing order sequence in an industrial laundry. Supervisor:Univ.-Prof. Dr.-Ing. habil. Prof. E. h. Dr. h. c. mult. Michael Schenk | ||