Cited By
View all- Zhang SLiu BWang ZZhao TOh ANaumann TGloberson ASaenko KHardt MLevine S(2023)Model-based reparameterization policy gradient methodsProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3669112(68391-68419)Online publication date: 10-Dec-2023
- Zhang SJin WWang ZKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Adaptive barrier smoothing for first-order policy gradient with contact dynamicsProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3620136(41219-41243)Online publication date: 23-Jul-2023
- Parmas PSeno TAoki YKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Model-based reinforcement learning with scalable composite policy gradient estimatorsProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619545(27346-27377)Online publication date: 23-Jul-2023
- Show More Cited By
