Google
Past year
  • Any time
  • Past hour
  • Past 24 hours
  • Past week
  • Past month
  • Past year
All results
Apr 11, 2024Another example is Policy Gradient Methods. They offer an alternative approach within model-free learning, optimizing policy parameters directly to maximize ...
5 days agoVanilla policy gradient methods lack robustness, and implementing TRPO can be demanding, particularly in complex numerical simulations, where challenges like ...
Nov 21, 2023A policy can be optimized to maximize returns by directly estimating the policy gradient ... Sammut, Claude; Webb, Geoffrey I. (eds.), Encyclopedia of ...
Mar 28, 2024Policy gradient method: A reinforcement learning method that optimizes parameterized policies with respect to long-term cumulative reward.
Mar 26, 2024This paper proposes a path planning framework that combines the experience replay mechanism from deep reinforcement learning (DRL) and rapidly exploring random ...
Mar 30, 2024Machine learning models substantially outperform existing algorithms based on ICD codes in predicting low acuity ED visits.
Aug 15, 2024A survey of work in reinforcement learning for behavior generation in robots. We highlight both key challenges in robot reinforcement learning as well as ...
Jul 29, 2024This study presents a literature review of the use of DM and ML techniques in key areas of BEM, including building performance evaluation, energy usage ...
Aug 30, 2024This study introduces two novel Reinforcement Learning (RL) agents for the design and optimization of chemical process flowsheets (CPFs): a discrete masked ...
Missing: Methods. | Show results with:Methods.
Jul 31, 2024In this study, we develop systematic methods, applying statistical analysis and dynamical-systems theory, to reduce parameter count in a biophysical model.