Mitigating Information Loss in Tree-Based Reinforcement Learning via Direct Optimization
Published in International Conference on Learning Representations (Spotlight), 2025
We propose a novel method for symbolic RL that enables end-to-end gradient-based learning of interpretable, axis-aligned decision trees, combining policy gradient optimization with symbolic decision-making.
Recommended citation: Marton, Sascha, et al. (2025). "Mitigating Information Loss in Tree-Based Reinforcement Learning via Direct Optimization." The Thirteenth International Conference on Learning Representations. 1(1).
Download Paper | Download Slides