arXiv:2406.02294 Abstract | arXiv Analytics

arXiv:2406.02294 [cs.LG]Abstract References Reviews Resources

Smaller Batches, Bigger Gains? Investigating the Impact of Batch Sizes on Reinforcement Learning Based Real-World Production Scheduling

Arthur Müller, Felix Grumbach, Matthia Sabatelli

Published 2024-06-04Version 1

Production scheduling is an essential task in manufacturing, with Reinforcement Learning (RL) emerging as a key solution. In a previous work, RL was utilized to solve an extended permutation flow shop scheduling problem (PFSSP) for a real-world production line with two stages, linked by a central buffer. The RL agent was trained to sequence equallysized product batches to minimize setup efforts and idle times. However, the substantial impact caused by varying the size of these product batches has not yet been explored. In this follow-up study, we investigate the effects of varying batch sizes, exploring both the quality of solutions and the training dynamics of the RL agent. The results demonstrate that it is possible to methodically identify reasonable boundaries for the batch size. These boundaries are determined on one side by the increasing sample complexity associated with smaller batch sizes, and on the other side by the decreasing flexibility of the agent when dealing with larger batch sizes. This provides the practitioner the ability to make an informed decision regarding the selection of an appropriate batch size. Moreover, we introduce and investigate two new curriculum learning strategies to enable the training with small batch sizes. The findings of this work offer the potential for application in several industrial use cases with comparable scheduling problems.

Comments: This paper was accepted at the ETFA 2024 conference

Categories: cs.LG

Keywords: batch size, smaller batch, real-world production scheduling, reinforcement learning, bigger gains

Tags: conference paper

Related articles: Most relevant | Search more

arXiv:2409.11933 [cs.LG] (Published 2024-09-18)

Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling

Arthur Müller, Lukas Vollenkemper

arXiv:1402.0560 [cs.LG] (Published 2014-02-04)

Safe Exploration of State and Action Spaces in Reinforcement Learning

Javier Garcia, Fernando Fernandez

arXiv:1809.01560 [cs.LG] (Published 2018-09-05)