What Makes a Reward Model a Good Teacher? An Optimization Perspective
Published in arXiv:2503.15477, 2025
Recommended citation: Razin, N., Wang, Z., Strauss, H., Wei, S., Lee, J. D., & Arora, S. (2025). What Makes a Reward Model a Good Teacher? An Optimization Perspective. arXiv preprint arXiv:2503.15477. https://arxiv.org/abs/2503.15477