The Power of Power Law: Asymmetry Enables Compositional Reasoning

Published in ICML 2026, Spotlight, 2026

Recommended citation: Wang, Z., Dang, X., Lee, J. D., & Lyu, K. (2026). The Power of Power Law: Asymmetry Enables Compositional Reasoning. arXiv preprint arXiv:2604.22951. https://arxiv.org/abs/2604.22951

Zixuan Wang, Xingyu Dang, Jason D. Lee, Kaifeng Lyu

Abstract: Natural language data follows power-law frequency patterns, where many skills and facts appear rarely. This work studies why such skewed data can help compositional reasoning: across state-tracking and multi-step arithmetic tasks, power-law training distributions outperform uniform ones. We introduce a minimalist skill-composition model and show that power-law sampling can reduce the data needed for learning by creating a useful asymmetry in the loss landscape, helping models first master frequent compositions and then transfer that progress to rare long-tail skills.