PrometheusRoot
Blog Links Prometheans 100+ Why are you here?
← Prometheans 100+
×
Percy Liang
builder
ResearcherEducator
Website GitHub Wikipedia
stanfordhelmbenchmarksevaluationfoundation-models

Related

legend Fei-Fei Li
← Prometheans 100+ Percy Liang

Stanford HELM benchmarks, foundation model evaluator

Percy Liang

Professor, CRFM Director — Stanford

Profile

Percy Liang is the closest thing the AI field has to a standards body of one. A Stanford computer science professor and director of the Center for Research on Foundation Models (CRFM), he built HELM — Holistic Evaluation of Language Models — which became the default place to check whether a new model’s claimed capabilities hold up across dozens of scenarios and metrics. When a lab drops a model with a splashy benchmark win, HELM is where you go to see how it actually behaves on reasoning, knowledge, bias, toxicity, calibration, and robustness, all scored side by side with every major open and closed model.

Before HELM, Liang was already a heavyweight in NLP. He co-created SQuAD, the Stanford Question Answering Dataset that defined reading comprehension evaluation for years, and he’s advised a generation of students who now populate OpenAI, Anthropic, and Google DeepMind. In 2021 he co-authored On the Opportunities and Risks of Foundation Models — the paper that coined the term “foundation model” and gave the field a shared vocabulary for what GPT-style systems actually are.

He also co-founded Together AI, a company building open-source infrastructure for training and running foundation models — putting his money where his benchmarks are on the idea that closed labs shouldn’t be the only game in town. His lab continues to release open models, open evaluations, and open datasets at a pace that embarrasses most companies.

For developers learning AI, Liang matters because he’s the person keeping the field honest. Every time a CEO tweets “state of the art,” someone at CRFM is quietly running the numbers. If you want to understand what a model can actually do — not what a marketing page says — start with HELM and work backwards.

Key Articles & Papers

On the Opportunities and Risks of Foundation Models 2021 — The paper that coined 'foundation model' and framed how the field thinks about large pretrained systems. Holistic Evaluation of Language Models (HELM) 2022 — The original HELM paper — a 162-page argument for evaluating LLMs across many scenarios and metrics at once, not cherry-picked benchmarks. SQuAD: 100,000+ Questions for Machine Comprehension of Text 2016 — The reading comprehension dataset that defined NLP evaluation for half a decade. HELM Leaderboard 2022 — Live leaderboard comparing open and closed LLMs across reasoning, safety, knowledge, and more. The reference implementation of honest evaluation. The Stanford AI Index Report (contributor) 2024 — Stanford HAI's annual snapshot of the state of AI — a useful high-level reference, with CRFM's evaluation work feeding into it. Percy Liang's Stanford Homepage — His full publication list and research agenda — the canonical source.

Spotify Podcasts

Percy Liang, Stanford: The paradigm shift and societal effects of foundation models
Percy Liang, Stanford: The paradigm shift and societal effects of foundation models
AI HQ: Director of Stanford's CRFM Percy Liang
AI HQ: Director of Stanford's CRFM Percy Liang
Ep 44: Co-Founder of Together.AI Percy Liang on What’s Next in Research, Reaction to o1 and How AI will Change Simulation
Ep 44: Co-Founder of Together.AI Percy Liang on What’s Next in Research, Reaction to o1 and How AI will Change Simulation
What is the role of academia in modern AI research? With Stanford Professor Dr. Percy Liang
What is the role of academia in modern AI research? With Stanford Professor Dr. Percy Liang
Percy jackson and The Battle of the Labyrinth
Percy jackson and The Battle of the Labyrinth
Percy jackson and the last olympian
Percy jackson and the last olympian
Percy Jackson and the Olympians: The Lightning Thief // Chapter 2
Percy Jackson and the Olympians: The Lightning Thief // Chapter 2
Heroes of Olympus- The son of Neptune
Heroes of Olympus- The son of Neptune
S2 E9: Behind the Percy Jackson Universe with Rick Riordan
S2 E9: Behind the Percy Jackson Universe with Rick Riordan
Making the Most of Open Source in AI
Making the Most of Open Source in AI

Related People

legend Fei-Fei Li
© 2026 PrometheusRoot