Speaker Variance-Dependent Regret Bounds for Non-stationary Linear Bandits ThA246 — Statistical Learning, Optimization and Stochastic Programming for Reinforcement Learning 25 Jul 2024 08:30 — Parallel Session