PrometheusRoot
Blog Links Prometheans 100+ AI Books AI Companies Why are you here?
← Prometheans 100+
×
Sholto Douglas
rising
Researcher
X / Twitter
googledeepmindgeminiscaling

Related

pioneer Demis Hassabis
← Prometheans 100+ Sholto Douglas

Anthropic member of technical staff, scaling reinforcement learning

Sholto Douglas

Member of Technical Staff, Scaling RL — Anthropic Researcher, Gemini team — Google DeepMind
Listen — profile
0:00 / 1:59

Profile

Sholto Douglas is a research scientist who spent three years as one of the most important engineers on Google DeepMind’s Gemini effort, and in 2025 moved to Anthropic to lead scaling of reinforcement learning. He’s one of those rare researchers who became an insider voice on frontier model training by force of self-study — the often-told story is that he was doing AI research from 10pm to 2am every night before getting noticed by Google engineers who couldn’t figure out who this person asking sharp questions online was.

At DeepMind he worked on PaLM and then became a lead architect on the Gemini family, contributing to training infrastructure, inference systems, and the research direction that closed Google’s gap with OpenAI. Noam Brown has publicly called him one of the most important people behind Gemini’s success. At Anthropic he’s now focused on pushing RL for agentic capabilities — making models that can chain long sequences of actions reliably, which he argues (rightly) is the actual bottleneck for agents, not context length.

What makes him matter for developers learning AI is the clarity with which he talks about what’s actually happening at the frontier. His X posts and long-form podcast appearances with Dwarkesh Patel are the closest thing the public has to ground-truth commentary on how frontier labs think about scaling, compute, data, and training. When he says something about reliability scaling with model size, or why RL will keep working, those aren’t hot takes — they’re hypotheses from someone who has actually been in the loop at two of the three frontier labs.

If you’re trying to build a mental model of where capabilities are going over the next few years, Douglas is a better signal than almost any analyst or journalist. He’s technical, calibrated, and willing to reason out loud.

Key Articles & Papers

Sholto's Blog 2024 — His personal site with writing and experiments — the trail he left before being recruited. Gemini: A Family of Highly Capable Multimodal Models 2023 — The technical report for Gemini 1.0. He's a contributor and was central to the training effort. PaLM: Scaling Language Modeling with Pathways 2022 — Google's 540B-parameter model. One of his early major contributions at DeepMind.

Videos

YouTube video
YouTube video

YouTube

YouTube video
2026
YouTube video
2025
YouTube video
2025
YouTube video
2025
YouTube video
2025
YouTube video
2025
YouTube video
2024
YouTube video
2024

Spotify Podcasts

Sam Altman on Codex 5.3 Launch, Anthropic's Sholto Douglas, Alphabet Beats Q4 Estimates | Sam Altman, Sholto Douglas, Daniel Barcelo, Mandy Fields, Ivan Burazin, Scott Rogowsky
Sam Altman on Codex 5.3 Launch, Anthropic's Sholto Douglas, Alphabet Beats Q4 Estimates | Sam Altman, Sholto Douglas, Daniel Barcelo, Mandy Fields, Ivan Burazin, Scott Rogowsky
TBPN
2026
Reviewing the Best AI Apps, Anthropic Unveils Claude 4.5 Opus, Doug DeMuro | Sholto Douglas, Quinn Slack, Alex Stauffer & Alex Shevchenko
Reviewing the Best AI Apps, Anthropic Unveils Claude 4.5 Opus, Doug DeMuro | Sholto Douglas, Quinn Slack, Alex Stauffer & Alex Shevchenko
TBPN
2025
Sonnet 4.5 & the AI Plateau Myth — Sholto Douglas (Anthropic)
Sonnet 4.5 & the AI Plateau Myth — Sholto Douglas (Anthropic)
The MAD Podcast with Matt Turck
2025
Elon Musk vs. Donald Trump, AI Day | Shaun Maguire, Mark Chen, Sholto Douglas, Jack Whitaker, Aarush Selvan, Michael Mignano, Oliver Cameron, Delian Asparouhov
Elon Musk vs. Donald Trump, AI Day | Shaun Maguire, Mark Chen, Sholto Douglas, Jack Whitaker, Aarush Selvan, Michael Mignano, Oliver Cameron, Delian Asparouhov
TBPN
2025
Ep 66: Member of Technical Staff at Anthropic Sholto Douglas on Claude 4, Next Phase for AI Coding, and the Path to AI Coworkers
Ep 66: Member of Technical Staff at Anthropic Sholto Douglas on Claude 4, Next Phase for AI Coding, and the Path to AI Coworkers
Unsupervised Learning with Jacob Effron
2025
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken
Dwarkesh Podcast
2025
AMA: career advice given AGI, how I research ft. Sholto & Trenton
AMA: career advice given AGI, how I research ft. Sholto & Trenton
Dwarkesh Podcast
2025
076 - Sholto Douglas and Trenton Bricken on AI
076 - Sholto Douglas and Trenton Bricken on AI
AI: Unplugged
2024
LW - Notes on Dwarkesh Patel's Podcast with Sholto Douglas and Trenton Bricken by Zvi
LW - Notes on Dwarkesh Patel's Podcast with Sholto Douglas and Trenton Bricken by Zvi
The Nonlinear Library
2024
Sholto Douglas & Trenton Bricken — How LLMs actually think
Sholto Douglas & Trenton Bricken — How LLMs actually think
Dwarkesh Podcast
2024

Related People

pioneer Demis Hassabis
© 2026 PrometheusRoot