Cited By
View all- Mao YZhang HChen CXu YJi XOh ANaumann TGloberson ASaenko KHardt MLevine S(2023)Supported value regularization for offline reinforcement learningProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3667888(40587-40609)Online publication date: 10-Dec-2023
- Rothfuss JSukhija BBirchler TKassraie PKrause AEvans RShpitser I(2023)Hallucinated adversarial control for conservative offline policy evaluationProceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence10.5555/3625834.3626000(1774-1784)Online publication date: 31-Jul-2023
- Chen XMa XLi YYang GYang SGao YEvans RShpitser I(2023)Modified retrace for off-policy temporal difference learningProceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence10.5555/3625834.3625863(303-312)Online publication date: 31-Jul-2023
- Show More Cited By
