Publications
Publications by categories in reversed chronological order. generated by jekyll-scholar.
2026
- arXivThe Load Management Paradox: Correcting the Healthy-Worker Survivor Effect in NBA Injury ModelingarXiv preprint arXiv:2603.26935, 2026
2025
- arXivOn the Limits of Test-Time Compute: Sequential Reward Filtering for Better InferencearXiv preprint arXiv:2512.04558, 2025
- arXivInstance-dependent continuous-time reinforcement learning via maximum likelihood estimationarXiv preprint arXiv:2508.02103, 2025
- UAISample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation2025