Publications
Publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
- UAISample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation2025
- arXivInstance-dependent continuous-time reinforcement learning via maximum likelihood estimationarXiv preprint arXiv:2508.02103, 2025
- arXivOn the Limits of Test-Time Compute: Sequential Reward Filtering for Better InferencearXiv preprint arXiv:2512.04558, 2025