상세검색
최근 검색어 전체 삭제
다국어입력
즐겨찾기0
학술저널

Application of Deep Recurrent Q Network with Dueling Architecture for Optimal Sepsis Treatment Policy

  • 13
158301.jpg

Sepsis is one of the leading causes of mortality globally, and it costs billions of dollars annually. However, treating septic patients is currently highly challenging, and more research is needed into a general treatment method for sepsis. Therefore, in this work, we propose a reinforcement learning method for learning the optimal treatment strategies for septic patients. We model the patient physiological time series data as the input for a deep recurrent Q-network that learns reliable treatment policies. We evaluate our model using an off-policy evaluation method, and the experimental results indicate that it outperforms the physicians’ policy, reducing patient mortality up to 3.04%. Thus, our model can be used as a tool to reduce patient mortality by supporting clinicians in making dynamic decisions.

I. INTRODUCTION

II. BACKGROUND AND RELATED WORK

III. SETTING UP ENVIRONMENT FOR REINFORCEMENT LEARNING

IV. PROPOSED METHOD

V. EXPERIMENTAL RESULTS

(0)

(0)

로딩중