Past year
All results
- All results
- Verbatim
scholar.google.com › citations
Apr 11, 2024 , Another example is Policy Gradient Methods. They offer an alternative approach within model-free learning, optimizing policy parameters directly to maximize ...
5 days ago , Vanilla policy gradient methods lack robustness, and implementing TRPO can be demanding, particularly in complex numerical simulations, where challenges like ...
Nov 21, 2023 , A policy can be optimized to maximize returns by directly estimating the policy gradient ... Sammut, Claude; Webb, Geoffrey I. (eds.), Encyclopedia of ...
Mar 28, 2024 , Policy gradient method: A reinforcement learning method that optimizes parameterized policies with respect to long-term cumulative reward.
Mar 26, 2024 , This paper proposes a path planning framework that combines the experience replay mechanism from deep reinforcement learning (DRL) and rapidly exploring random ...
Mar 30, 2024 , Machine learning models substantially outperform existing algorithms based on ICD codes in predicting low acuity ED visits.
Aug 15, 2024 , A survey of work in reinforcement learning for behavior generation in robots. We highlight both key challenges in robot reinforcement learning as well as ...
Jul 29, 2024 , This study presents a literature review of the use of DM and ML techniques in key areas of BEM, including building performance evaluation, energy usage ...
Aug 30, 2024 , This study introduces two novel Reinforcement Learning (RL) agents for the design and optimization of chemical process flowsheets (CPFs): a discrete masked ...
Missing: Methods. | Show results with:Methods.
Jul 31, 2024 , In this study, we develop systematic methods, applying statistical analysis and dynamical-systems theory, to reduce parameter count in a biophysical model.
People also search for