publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2026

  1. ICLR
    Beyond Softmax and Entropy: Convergence Rates of Policy Gradients with f-SoftArgmax Parameterization & Coupled Regularization
    S. Labbi, D. Tiapkin, P. Mangold, and 1 more author
    In International Conference on Learning Representations (ICLR), 2026
  2. AISTATS
    On Global Convergence Rates for Federated Softmax Policy Gradient under Heterogeneous Environments
    S. Labbi, P. Mangold, D. Tiapkin, and 1 more author
    In International Conference on Artificial Intelligence and Statistics (AISTATS), 2026
  3. Preprint
    Learning Shortest Paths with Generative Flow Networks
    N. Morozov, I. Maksimov, D. Tiapkin, and 1 more author
    2026
    arXiv:2603.01786

2025

  1. Software
    gfnx: Fast and Scalable Library for Generative Flow Networks in JAX
    D. Tiapkin, A. Agarkov, N. Morozov, and 4 more authors
    2025
  2. Preprint
    Adaptive Destruction Processes for Diffusion Samplers
    T. Gritsaev, N. Morozov, K. Tamogashev, and 5 more authors
    2025
    arXiv:2506.01541
  3. Preprint
    Proximal Point Nash Learning from Human Feedback
    D. Tiapkin, D. Calandriello, D. Belomestny, and 5 more authors
    2025
    arXiv:2505.19731
  4. ICML
    On Teacher Hacking in Language Model Distillation
    D. Tiapkin, D. Calandriello, J. Ferret, and 4 more authors
    In International Conference on Machine Learning (ICML), 2025
  5. ICML
    Revisiting Non-Acyclic GFlowNets in Discrete Environments
    N. Morozov, I. Maksimov, D. Tiapkin, and 1 more author
    In International Conference on Machine Learning (ICML), 2025
  6. ICML
    Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games
    A. Ocello, D. Tiapkin, L. Mancini, and 2 more authors
    In International Conference on Machine Learning (ICML), 2025
  7. AISTATS
    Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization
    D. Tiapkin, E. Chzhen, and G. Stoltz
    In International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
  8. AISTATS
    Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents
    S. Labbi, D. Tiapkin, L. Mancini, and 2 more authors
    In International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
  9. ICLR
    Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization
    T. Gritsaev, N. Morozov, S. Samsonov, and 1 more author
    In International Conference on Learning Representations (ICLR), 2025

2024

  1. Preprint
    A New Bound on the Cumulant Generating Function of Dirichlet Processes
    P. Perrault, D. Belomestny, P. Ménard, and 4 more authors
    2024
    arXiv:2409.18621
  2. Workshop
    Improving GFlowNets with Monte Carlo Tree Search
    N. Morozov, D. Tiapkin, S. Samsonov, and 2 more authors
    In ICML 2024 SPIGM Workshop, 2024
  3. ICML
    Incentivized Learning in Principal-Agent Bandit Games
    A. Scheid, D. Tiapkin, E. Boursier, and 5 more authors
    In International Conference on Machine Learning (ICML), 2024
  4. COLT
    Improved High-Probability Bounds for the Temporal Difference Learning Algorithm via Exponential Stability
    S. Samsonov, D. Tiapkin, A. Naumov, and 1 more author
    In Conference on Learning Theory (COLT), 2024
  5. AISTATS
    Generative Flow Networks as Entropy-Regularized RL
    D. Tiapkin, N. Morozov, A. Naumov, and 1 more author
    In International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
    Oral presentation
  6. ICLR
    Demonstration-Regularized RL
    D. Tiapkin, D. Belomestny, D. Calandriello, and 5 more authors
    In International Conference on Learning Representations (ICLR), 2024

2023

  1. Preprint
    Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms
    D. Belomestny, P. Ménard, A. Naumov, and 2 more authors
    2023
    arXiv:2304.03056. Authors are listed in alphabetical order.
  2. NeurIPS
    Model-free Posterior Sampling via Learning Rate Randomization
    D. Tiapkin, D. Belomestny, D. Calandriello, and 6 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), 2023
  3. ICML
    Fast Rates for Maximum Entropy Exploration
    D. Tiapkin, D. Belomestny, D. Calandriello, and 7 more authors
    In International Conference on Machine Learning (ICML), 2023
  4. COLT
    Orthogonal Directions Constrained Gradient Method: from non-linear equality constraints to Stiefel manifold
    S. Schechtman, D. Tiapkin, M. Muehlebach, and 1 more author
    In Conference on Learning Theory (COLT), 2023
  5. Doklady
    On the Structure of the Set of Panchromatic Colorings of a Random Hypergraph
    D. Tiapkin and D. Shabanov
    Doklady Mathematics, 2023

2022

  1. NeurIPS
    Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
    D. Tiapkin, D. Belomestny, D. Calandriello, and 6 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), 2022
  2. ICML
    From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
    D. Tiapkin, D. Belomestny, É. Moulines, and 5 more authors
    In International Conference on Machine Learning (ICML), 2022
    Oral presentation
  3. IFAC
    First-order Constrained Optimization: Non-smooth Dynamical System Viewpoint
    S. Schechtman, D. Tiapkin, É. Moulines, and 2 more authors
    In IFAC Workshop on Control Applications of Optimization, 2022
  4. AISTATS
    Primal-Dual Stochastic Mirror Descent for MDPs
    D. Tiapkin and A. Gasnikov
    In International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
  5. OptLett
    Stochastic saddle-point optimization for the Wasserstein barycenter problem
    D. Tiapkin, A. Gasnikov, and P. Dvurechensky
    Optimization Letters, 2022

2021

  1. AISTATS
    Improved Complexity Bounds in the Wasserstein Barycenter Problem
    D. Dvinskikh and D. Tiapkin
    In International Conference on Artificial Intelligence and Statistics (AISTATS), 2021