Nature · 2026
Humanity's Last Exam: A Benchmark of Expert-Level Academic Questions to Assess AI Capabilities
Long Phan, Alice Gatti, Nathaniel Li, et al. (including Wei Hao)
Long Phan, Alice Gatti, Nathaniel Li, et al. (including Wei Hao). "Humanity's Last Exam: A Benchmark of Expert-Level Academic Questions to Assess AI Capabilities." Nature, vol. 649 (2026). https://doi.org/10.1038/s41586-025-09962-4