Application of Deep Recurrent Q Network with Dueling Architecture for Optimal Sepsis Treatment Policy

한국스마트미디어학회
스마트미디어저널
Vol10, No.2
2021.06

48 - 54 (7 pages)
DOI : 10.30693/SMJ.2021.10.2.48

원문보기

원문저장

Sepsis is one of the leading causes of mortality globally, and it costs billions of dollars annually. However, treating septic patients is currently highly challenging, and more research is needed into a general treatment method for sepsis. Therefore, in this work, we propose a reinforcement learning method for learning the optimal treatment strategies for septic patients. We model the patient physiological time series data as the input for a deep recurrent Q-network that learns reliable treatment policies. We evaluate our model using an off-policy evaluation method, and the experimental results indicate that it outperforms the physicians’ policy, reducing patient mortality up to 3.04%. Thus, our model can be used as a tool to reduce patient mortality by supporting clinicians in making dynamic decisions.

I. INTRODUCTION

II. BACKGROUND AND RELATED WORK

III. SETTING UP ENVIRONMENT FOR REINFORCEMENT LEARNING

IV. PROPOSED METHOD

V. EXPERIMENTAL RESULTS

Application of Deep Recurrent Q Network with Dueling Architecture for Optimal Sepsis Treatment Policy

(0)

(0)

(0)

(0)

Application of Deep Recurrent Q Network with Dueling Architecture for Optimal Sepsis Treatment Policy

(0)

(0) 팝업 열기 팝업 닫기

(0)

(0)

(0)