WURISWebSearch

Engineering Applications of Artificial Intelligence, Volume 120 , 01/04/2023
Optimal drug-dosing of cancer dynamics with fuzzy reinforcement learning and discontinuous reward function

Chidentree Treesatayapun, Aldo Jonathan Muñoz-Vázquez

Abstract

In this paper, a reinforcement learning-based optimal control is developed for the drug administration of biological phenomena in chemotherapy cancer treatment. The treatment is considered as a class of unknown discrete-time systems when the input: drug administration and the output: tumor cells population are only utilized to design the proposed controller. Resulting, a full-state observer is completely neglected. The controller is established by the actor–critic architecture containing two fuzzy-rules emulated networks when IF-THEN rules are imposed by human knowledge according to pharmacokinetic and pharmacodynamic behavior. Furthermore, the discontinuous reward function is proposed to derive the online learning laws that guarantee the robustness and the convergence of adjustable parameters. The validation results are conducted by numerical systems according to the robustness of the group of patients and the closed-loop performance altogether with comparative results.

Document Type

Article

Source Type

Journal

Keywords

Chemotherapy drug administrationDiscontinuous reward functionFuzzy-rules networkOptimal controlReinforcement learning

ASJC Subject Area

Computer Science : Artificial IntelligenceEngineering : Control and Systems EngineeringEngineering : Electrical and Electronic Engineering

Access to Document

DOI : 10.1016/j.engappai.2023.105851
Link to scopus

Citations (Scopus)