PrometheusRoot
Percy Liang
builder
Researcher, Educator
Tags: stanford, helm, benchmarks, evaluation, foundation-models


Stanford associate professor, HELM benchmarks, foundation models

Percy Liang

Associate Professor of Computer Science and CRFM Director, Stanford University
Senior Fellow, Stanford Institute for Human-Centered AI (HAI)

Profile

Percy Liang is the closest thing the AI field has to a standards body of one. A Stanford computer science professor and director of the Center for Research on Foundation Models (CRFM), he led the creation of HELM (Holistic Evaluation of Language Models), which became a default place to check whether a new model’s claimed capabilities hold up across dozens of scenarios and metrics. When a lab drops a model with a splashy benchmark win, HELM is where you go to see how it actually behaves on reasoning, knowledge, bias, toxicity, calibration, and robustness, scored side by side with other major open and closed models.

Before HELM, Liang was already a heavyweight in NLP. He co-created SQuAD, the Stanford Question Answering Dataset that defined reading comprehension evaluation for years, and he has advised a generation of students who now populate OpenAI, Anthropic, and Google DeepMind. In 2021 he co-authored “On the Opportunities and Risks of Foundation Models,” the paper that coined the term “foundation model” and gave the field a shared vocabulary for what GPT-style systems actually are.

He also co-founded Together AI, a company building open-source infrastructure for training and running foundation models — putting his money where his benchmarks are on the idea that closed labs shouldn’t be the only game in town. His lab continues to release open models, open evaluations, and open datasets at a pace that embarrasses most companies.

For developers learning AI, Liang matters because he’s the person keeping the field honest. Every time a CEO tweets “state of the art,” someone at CRFM is quietly running the numbers. If you want to understand what a model can actually do — not what a marketing page says — start with HELM and work backwards.

Key Articles & Papers

On the Opportunities and Risks of Foundation Models (2021): The paper that coined “foundation model” and framed how the field thinks about large pretrained systems.
Holistic Evaluation of Language Models (HELM) (2022): The original HELM paper, a 162-page argument for evaluating LLMs across many scenarios and metrics at once, not cherry-picked benchmarks.
SQuAD: 100,000+ Questions for Machine Comprehension of Text (2016): The reading comprehension dataset that defined NLP evaluation for half a decade.
HELM Leaderboard (2022): Live leaderboard comparing open and closed LLMs across reasoning, safety, knowledge, and more. The reference implementation of honest evaluation.
The Stanford AI Index Report (contributor) (2024): Stanford HAI's annual snapshot of the state of AI, a useful high-level reference, with CRFM's evaluation work feeding into it.
Percy Liang's Stanford Homepage: His full publication list and research agenda, the canonical source.


Spotify Podcasts

Ep 44: Co-Founder of Together.AI Percy Liang on What’s Next in Research, Reaction to o1 and How AI will Change Simulation (Unsupervised Learning with Jacob Effron, 2024)
Percy Liang, Stanford: The paradigm shift and societal effects of foundation models (Generally Intelligent, 2024)
Shaping AI Benchmarks with Together AI Co-Founder Percy Liang (Gradient Dissent: Conversations on AI, 2024)
AI HQ: Director of Stanford's CRFM Percy Liang (AI HQ, 2023)
How to Use AI and Music to Boost Creativity with Percy Liang, Computer Scientist and Classical Pianist (The Ampersand Manifesto: Make Your Mark in Multiple Fields, 2023)
Adept CEO David Luan and Stanford's Percy Liang | Words into Action (Greymatter, 2023)
What is the role of academia in modern AI research? With Stanford Professor Dr. Percy Liang (No Priors: Artificial Intelligence | Technology | Startups, 2023)
Percy Liang on the Center for Research on Foundation Models (CS224U, 2022)
Percy Liang on Machine Learning Robustness, Foundation Models, and Reproducibility (The Gradient: Perspectives on AI, 2022)
Percy Liang: Stanford University Professor, technologist, and researcher in AI (Behind The Tech with Kevin Scott, 2020)

Related People

legend Fei-Fei Li
© 2026 PrometheusRoot