Abstract: Designing safety-critical controllers for the blast furnace is crucial yet challenging due to its complex inherent dynamics. Recent advances in reinforcement learning (RL) have shown promise ...