Speaker Nonconvex Landscape of Policy Optimization for A Class of Finite Horizon Markov Decision Processes and Applications in Operations Models WC328 — Advances in Multi-stage Stochastic Programming and Reinforcement learning 24 Jul 2024 16:20 — Parallel Session