PrometheusRoot
Blog Links Prometheans 100+ AI Books AI Companies Why are you here?
← Prometheans 100+
J
rising
EducatorResearcher
X / Twitter YouTube Website Google Scholar LinkedIn
← Prometheans 100+

Co-founder of Storia AI, former Google Research engineer

Julia Turc

Co-founder — Storia AI Research Engineer — Google
Listen — profile
0:00 / 2:32

Profile

Julia Turc is one of those rare technologists who can build the thing, ship the thing, and then explain the thing — clearly — to people who weren’t in the room. She spent roughly eight years inside Google Research working on NLP, transfer learning, and Transformer efficiency, contributing to the kind of plumbing that quietly ends up in products billions of people touch: contextual retrieval work that fed into precursors of Gemini, on-device language tech for Google Keyboard and Pixel, and knowledge distillation methods for compressing big models into small ones. Her paper Well-Read Students Learn Better is still one of the standard references for how to distill BERT-class models without losing the plot.

In 2023 she left Google to co-found Storia AI, which went through Y Combinator’s Summer 2024 batch. Storia’s main product is Sage — an open-source AI copilot that ingests a codebase plus its surrounding documentation and tries to answer questions the way a senior engineer who actually knows the repo would. The framing matters: rather than another autocomplete wrapper, Storia is betting that the missing layer for developer AI is context grounding — knowing what your team’s code does, why it does it, and where the answers live across docs, issues, and Slack threads.

The other half of her work is education. Her YouTube channel describes itself as “anti-hype” AI explanation — Transformers, diffusion, distillation, quantization, walked through carefully enough that a curious generalist can follow without being insulted by hand-waving. The tone owes a clear debt to Andrej Karpathy’s “let me actually show you the code” style, but Turc brings a researcher’s instinct for what the field is overstating at any given moment, paired with a dry Eastern European skepticism toward Bay Area enthusiasm.

For developers learning AI today, she’s worth following because she sits at an unusually useful intersection: she’s done the production research work, she’s now building developer tooling on top of LLMs, and she can teach. That combination — researcher, founder, educator — is rarer than it should be, and it makes her explanations of things like RAG, distillation, or where Llama 4 actually fits in the stack land with more weight than the average AI YouTuber.

Key Articles & Papers

Well-Read Students Learn Better: On the Importance of Pre-training Compact Models 2019 — Her most-cited work; established that pre-training small students before distillation matters more than fancy distillation losses. Standard reference for BERT compression. Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding 2022 — Pre-trains a model by reconstructing masked screenshots of webpages — an elegant pretraining objective that turned out to transfer broadly to UI, document, and chart understanding. CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation 2021 — Skip the tokenizer entirely and operate on Unicode codepoints. A clean argument for why tokenization is a leaky abstraction, especially for multilingual NLP. Measuring Attribution in Natural Language Generation Models 2021 — Defines what it actually means for a generated statement to be 'attributable' to a source — foundational thinking for the modern RAG and citation-grounded generation era. Transformers, Explained: Understand the Model Behind GPT, BERT, and T5 2021 — An accessible walk-through of why Transformers won, written for engineers rather than ML researchers. From von Neumann to Memory-Augmented Neural Networks 2018 — A clear lineage piece on how classical computer architecture ideas reappear inside neural networks with external memory. Fine-tuning DALL·E Mini (Craiyon) to Generate Blogpost Images 2022 — A practical write-up from when image models were still a hobbyist's playground — useful as a record of how fast the diffusion stack matured. Launch YC: Storia — A Contextual AI Pair Programmer 2024 — The Storia AI launch post — clearest statement of the bet that context, not raw model capability, is what's missing from developer AI tools.

Videos

YouTube video
YouTube video

YouTube

YouTube video
2025
YouTube video
2025
© 2026 PrometheusRoot