Hello! I am a PhD student in RL Theory, pursuing a double PhD at CMAP, École Polytechnique, and LMO, Université Paris-Saclay, under the supervision of Éric Moulines and Gilles Stoltz.
I also did research at the HDI Lab at HSE University. I completed my Master’s degree in Applied Mathematics and Computer Science in the “Math of Machine Learning” program at HSE University.
My interests include (but are not limited to):
- Theory of reinforcement learning;
- Connection of RL to other ML problems (e.g. sampling);
- Stochastic optimization.
Email: daniil.tiapkin@polytechnique.edu | Google Scholar | ORCID | HSE webpage
News
- 🔥New🔥 January 2024. The paper on RL/RLHF learning from demonstrations, “Demonstration-Regularized RL”, was accepted at ICLR-2024, and the GFlowNet-RL paper “Generative Flow Networks as Entropy-Regularized RL” was selected for an oral presentation at AISTATS-2024!
- September 2023. I moved to École Polytechnique, France, to pursue a PhD degree.
- September 2023. The paper “Model-free Posterior Sampling via Learning Rate Randomization” was accepted at NeurIPS-2023!
- July 2023. The paper “Orthogonal Directions Constrained Gradient Method: from non-linear equality constraints to Stiefel manifold” was presented at COLT-2023!
- April 2023. The paper “Fast Rates for Maximum Entropy Exploration” was accepted at ICML-2023!