Talks
- Refined Regret for Adversarial MDPs with Linear Function Approximation
Yan Dai, Haipeng Luo, Chen-Yu Wei, and Julian Zimmert.
@ International Conference on Machine Learning (ICML), Hawaii Convention Center, Jul 2023. - Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning
Jiatai Huang*, Yan Dai*, and Longbo Huang.
@ International Conference on Machine Learning (ICML), Hawaii Convention Center, Jul 2023. - Variance-Aware Sparse Linear Bandits
Yan Dai, Ruosong Wang, and Simon S. Du.
@ International Conference on Learning Representations (ICLR), Kigali Convention Centre, May 2023.
@ FAI-Seminar, Online, Apr 2023.
@ Yao Class Seminar, Tsinghua University, Mar 2023.
[slides] - Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback
Yan Dai, Haipeng Luo, and Liyu Chen.
@ Conference on Neural Information Processing Systems (NeurIPS), Online, Dec 2022.
[slides] - Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits
Jiatai Huang*, Yan Dai*, and Longbo Huang.
@ SparkDay (in Chinese), Tsinghua University, May 2022.
[slides (in Chinese)]