Engineering Applications of Artificial Intelligence, Volume 120 , 01/04/2023
Optimal drug-dosing of cancer dynamics with fuzzy reinforcement learning and discontinuous reward function
Abstract
In this paper, a reinforcement learning-based optimal control is developed for the drug administration of biological phenomena in chemotherapy cancer treatment. The treatment is considered as a class of unknown discrete-time systems when the input: drug administration and the output: tumor cells population are only utilized to design the proposed controller. Resulting, a full-state observer is completely neglected. The controller is established by the actor–critic architecture containing two fuzzy-rules emulated networks when IF-THEN rules are imposed by human knowledge according to pharmacokinetic and pharmacodynamic behavior. Furthermore, the discontinuous reward function is proposed to derive the online learning laws that guarantee the robustness and the convergence of adjustable parameters. The validation results are conducted by numerical systems according to the robustness of the group of patients and the closed-loop performance altogether with comparative results.
Document Type
Article
Source Type
Journal
Keywords
Chemotherapy drug administrationDiscontinuous reward functionFuzzy-rules networkOptimal controlReinforcement learning
ASJC Subject Area
Engineering : Control and Systems EngineeringEngineering : Electrical and Electronic EngineeringComputer Science : Artificial Intelligence