题目:Optimal Abort Policy for Mission-Critical Systems under Imperfect Condition Monitoring
时间:2024年5月15日 9:00-10:30
地点:必赢线路检测中心 振华会议室
邀请人:潘尔顺 教授(工业工程与管理系)
Biography
Dr. Zhisheng Ye received a joint B.E. (2008) in Material Science & Engineering, and Economics from Tsinghua University. He received a Ph.D. degree from National University of Singapore. He is currently an Associate Professor and Dean's Chair in the Department of Industrial Systems Engineering & Management at National University of Singapore. His research areas include degradation analysis, lifetime and recurrence data analysis, reliability modeling, and data-driven operations management.
Abstract
While most on-demand mission-critical systems are engineered to be reliable to support critical tasks, occasional failures may still occur during missions. To increase system survivability, a common practice is to abort the mission before an imminent failure. We consider optimal mission abort for a system whose deterioration follows a general three-state (normal, defective, failed) semi-Markov chain. The failure is assumed self-revealed, while the healthy and defective states have to be predicted from imperfect condition monitoring data. Due to the non-Markovian process dynamics, optimal mission abort for this partially observable system is an intractable stopping problem. For a tractable solution, we introduce a novel tool of Erlang mixtures to approximate non-exponential sojourn times in the semi-Markov chain. This allows us to approximate the original process by a surrogate continuous-time Markov chain whose optimal control policy can be solved through a partially observable Markov decision process (POMDP). We show that the POMDP optimal policies converge almost surely to the optimal abort decision rules when the Erlang rate parameter diverges. This implies that the expected cost by adopting the POMDP solution converges to the optimal expected cost. Next, we provide comprehensive structural results on the optimal policy of the surrogate POMDP. Based on the results, we develop a modified point-based value iteration algorithm to numerically solve the surrogate POMDP. We further consider mission abort in a multi-task setting where a system executes several tasks consecutively before a thorough inspection. Through a case study on an unmanned aerial vehicle, we demonstrate the capability of real-time implementation of our model, even when the condition-monitoring signals are generated with high frequency.