PrometheusRoot
Blog Links Prometheans 100+ AI Books AI Companies Why are you here?
← Prometheans 100+
×
Jonathan Ross
pioneer
FounderEngineer
X / Twitter LinkedIn
groqinferencechipslpu

Recognition

TIME 100 AI 2024
← Prometheans 100+ Jonathan Ross
TIME 100 AI 2024

Nvidia Chief Software Architect, Groq founder and LPU creator

Jonathan Ross

Chief Software Architect — Nvidia Founder — Groq TPU Lead Designer — Google
Listen — profile
0:00 / 2:31

Profile

Jonathan Ross has the rare distinction of having designed two of the most consequential AI chips ever shipped. While at Google, he started what became the Tensor Processing Unit as a 20% project, building the core of a chip that would eventually power more than half of Google’s internal compute. He left in 2016 to found Groq, a high school dropout who had already reshaped AI hardware once and wanted to do it again — this time for inference.

Groq’s bet was architectural heresy. Where Jensen Huang’s Nvidia GPUs are general-purpose massively parallel machines optimized for training, Ross built the Language Processing Unit (LPU) — a deterministic, single-threaded streaming processor with the memory baked onto the die. The result is inference latency that GPUs cannot match. If you’ve ever hit Groq’s API and watched a Llama model fire out hundreds of tokens per second with no perceptible delay, that’s the LPU. For developers, it changed what “real-time LLM” actually means — voice agents, live copilots, and agentic loops that were too slow on GPUs suddenly became viable.

The commercial trajectory followed the tech. A $1.5B commitment from Saudi Arabia in early 2025 funded a Dammam data center. A $750M round in September 2025 valued Groq at $6.9B. Then in December 2025, Nvidia did something it had never done before: it paid roughly $20 billion — its largest deal on record — to license Groq’s inference tech and hire Ross along with most of his leadership team. The “non-exclusive licensing” framing preserved the fiction of competition, but the signal was unmistakable. The company whose GPUs defined modern AI quietly admitted that when it comes to inference, the LPU was right.

Ross is now Chief Software Architect at Nvidia, working on what he’s called a mission to double the world’s AI compute. For builders, the lesson is practical: inference is becoming a first-class engineering problem separate from training, and the people who treat it that way — architecturally, not just as a smaller version of the training stack — are the ones reshaping the economics of running models.

Key Articles & Papers

Groq and Nvidia Enter Non-Exclusive Inference Technology Licensing Agreement 2025 — The official announcement of the ~$20B deal that brought Ross and Groq's leadership to Nvidia. Every. Word. Matters. 2024 — Ross on why inference quality and latency are not separate concerns — a piece of the LPU design philosophy. The Future of AI Compute: A Conversation With Jonathan Ross 2024 — Chamath Palihapitiya's long-form interview covering the LPU thesis, energy constraints, and the inference market.

Videos

YouTube video
YouTube video
YouTube video

YouTube

YouTube video
2025
YouTube video
2025
YouTube video
2025
YouTube video
2025
YouTube video
2025
YouTube video
2025
YouTube video
2025
YouTube video
2024
YouTube video
2023
YouTube video
2023

Spotify Podcasts

Groq Star: Who is Jonathan Ross?
Groq Star: Who is Jonathan Ross?
Generative AI 101
2026
20VC: OpenAI and Anthropic Will Build Their Own Chips | NVIDIA Will Be Worth $10TRN | How to Solve the Energy Required for AI... Nuclear | Why China is Behind the US in the Race for AGI with Jonathan Ross, Groq Founder
20VC: OpenAI and Anthropic Will Build Their Own Chips | NVIDIA Will Be Worth $10TRN | How to Solve the Energy Required for AI... Nuclear | Why China is Behind the US in the Race for AGI with Jonathan Ross, Groq Founder
The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch
2025
Groq’s founder on why AI’s next big shift isn’t about Nvidia
Groq’s founder on why AI’s next big shift isn’t about Nvidia
Power Players with Brian Sozzi
2025
With Groq, Jonathan Ross is taking AI inference to new speeds
With Groq, Jonathan Ross is taking AI inference to new speeds
Pioneers of AI
2025
20VC: NVIDIA vs Groq: The Future of Training vs Inference | Meta, Google, and Microsoft's Data Center Investments: Who Wins | Data, Compute, Models: The Core Bottlenecks in AI & Where Value Will Distribute with Jonathan Ross, Founder @ Groq
20VC: NVIDIA vs Groq: The Future of Training vs Inference | Meta, Google, and Microsoft's Data Center Investments: Who Wins | Data, Compute, Models: The Core Bottlenecks in AI & Where Value Will Distribute with Jonathan Ross, Founder @ Groq
The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch
2025
20VC: Deepseek Special: Is Deepseek a Weapon of the CCP | How Should OpenAI and the US Government Respond | Why $500BN for Stargate is Not Enough | The Future of Inference, NVIDIA and Foundation Models with Jonathan Ross @ Groq
20VC: Deepseek Special: Is Deepseek a Weapon of the CCP | How Should OpenAI and the US Government Respond | Why $500BN for Stargate is Not Enough | The Future of Inference, NVIDIA and Foundation Models with Jonathan Ross @ Groq
The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch
2025
Risks, Rewards, and Building the Unicorn Chip Company Taking on Nvidia | Inside Groq with Jonathan Ross
Risks, Rewards, and Building the Unicorn Chip Company Taking on Nvidia | Inside Groq with Jonathan Ross
The Eric Ries Show
2024
Groq CEO Jonathan Ross - Next Gen AI Hardware
Groq CEO Jonathan Ross - Next Gen AI Hardware
Summation with Auren Hoffman
2024
Groq CEO Jonathan Ross - Next Gen AI Hardware
Groq CEO Jonathan Ross - Next Gen AI Hardware
Summation with Auren Hoffman
2024
NCI: AI Chip Wars: LPUs, TPUs & GPUs w/ Jonathan Ross, Founder Groq
NCI: AI Chip Wars: LPUs, TPUs & GPUs w/ Jonathan Ross, Founder Groq
Lumida Wealth : Non-Consensus Invest Beyond the Ordinary
2024
© 2026 PrometheusRoot