The AI educator who builds from scratch
Andrej Karpathy
Profile
Andrej Karpathy is the rare researcher who can build a transformer from scratch on a livestream and make you feel like you could, too. A Stanford PhD under Fei-Fei Li, founding member of OpenAI, former Director of AI at Tesla where he led Autopilot vision for five years — his resume would be intimidating if he weren’t so generous with what he knows.
He left OpenAI a second time in February 2024 to focus on education, and in July 2024 founded Eureka Labs, an “AI-native” school aimed at teaching people how AI actually works. Before that, his YouTube channel quietly became one of the most important AI education resources on the internet. The “Neural Networks: Zero to Hero” series walks you from a scalar autograd engine (micrograd) up through a working GPT, one keystroke at a time. nanoGPT, llm.c, and makemore embody the same philosophy in code: small, readable, hackable.
What sets Karpathy apart is his refusal to let abstraction hide the math. He’ll explain backpropagation by literally typing out the chain rule in a Jupyter notebook. He coined “Software 2.0” back in 2017 — the idea that neural network weights are a new kind of code — and more recently “vibe coding,” the now-ubiquitous term for letting an LLM drive your IDE while you ride shotgun. Both landed because he names the thing developers are already doing.
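That chain-rule-by-hand style can be sketched in a few lines of Python. This is an illustrative toy in the spirit of micrograd, not Karpathy's actual API: each Value remembers its inputs and the local derivatives, and backward() walks the graph in reverse topological order, accumulating gradients exactly as the chain rule dictates.

```python
class Value:
    """A scalar that tracks how it was computed, for backpropagation."""

    def __init__(self, data, children=(), local_grads=()):
        self.data = data
        self.grad = 0.0
        self._children = children        # Values this one was computed from
        self._local_grads = local_grads  # d(self)/d(child) for each child

    def __add__(self, other):
        # d(a+b)/da = 1, d(a+b)/db = 1
        return Value(self.data + other.data, (self, other), (1.0, 1.0))

    def __mul__(self, other):
        # d(a*b)/da = b, d(a*b)/db = a
        return Value(self.data * other.data, (self, other),
                     (other.data, self.data))

    def backward(self):
        # Order the graph topologically, then apply the chain rule in reverse:
        # child.grad += d(output)/d(node) * d(node)/d(child)
        topo, seen = [], set()

        def build(v):
            if v not in seen:
                seen.add(v)
                for child in v._children:
                    build(child)
                topo.append(v)

        build(self)
        self.grad = 1.0  # seed: d(output)/d(output) = 1
        for node in reversed(topo):
            for child, local in zip(node._children, node._local_grads):
                child.grad += node.grad * local


a, b = Value(2.0), Value(3.0)
c = a * b + a          # c = a*b + a, so dc/da = b + 1, dc/db = a
c.backward()
print(a.grad, b.grad)  # -> 4.0 2.0
```

Note that `a` feeds into `c` along two paths (through the product and directly), and the gradients along both paths sum, which is why the topological ordering matters.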
For anyone bridging theory and practice — a student who knows the math but hasn’t shipped, or a veteran engineer who ships daily but wants to finally understand attention — Karpathy is the bridge. He’s not selling a course or a framework. He’s just showing his work.
Key Articles & Papers
The Unreasonable Effectiveness of Recurrent Neural Networks
Software 2.0
A Recipe for Training Neural Networks
Deep Reinforcement Learning: Pong from Pixels
Deep Visual-Semantic Alignments for Generating Image Descriptions
What I learned from looking at 200 machine learning tools
Yes you should understand backprop
Videos
Podcasts
Spotify