Latent Space: The AI Engineer Podcast Podcast Republic

Latent Space: The AI Engineer Podcast

By swyx + Alessio

Listen to a podcast, please open Podcast Republic app. Available on Google Play Store and Apple App Store.

Image by swyx + Alessio

Open Website

Rate for this podcast

Subscribers: 213
Reviews: 0
Episodes: 121

Description

The podcast by and for AI Engineers! In 2024, over 2 million readers and listeners came to Latent Space to hear about news, papers and interviews in Software 3.0. We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you both the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, Anthropic, Gemini, Meta (Soumith Chintala), Sierra (Bret Taylor), tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al. Full show notes always on https://latent.space

Episode	Date
Building Snipd: The AI Podcast App for Learning Read the full episode description	Mar 14, 2025
⚡️The new OpenAI Agents Platform Read the full episode description	Mar 11, 2025
⚡️How Claude 3.7 Plays Pokémon Read the full episode description	Mar 04, 2025
Open Operator, Serverless Browsers and the Future of Computer-Using Agents Read the full episode description	Feb 28, 2025
The Inventors of Deep Research Read the full episode description	Feb 18, 2025
Bee AI: The Wearable Ambient Agent Read the full episode description	Feb 13, 2025
The AI Architect — Bret Taylor Read the full episode description	Feb 11, 2025
Agent Engineering with Pydantic + Graphs — with Samuel Colvin Read the full episode description	Feb 06, 2025
The Agent Reasoning Interface: o1/o3, Claude 3, ChatGPT Canvas, Tasks, and Operator — with Karina Nguyen of OpenAI Read the full episode description	Feb 01, 2025
Outlasting Noam Shazeer, crowdsourcing Chat + AI with >1.4m DAU, and becoming the "Western DeepSeek" — with William Beauchamp, Chai Research Read the full episode description	Jan 26, 2025
Everything you need to run Mission Critical Inference (ft. DeepSeek v3 + SGLang) Read the full episode description	Jan 19, 2025
[Ride Home] Simon Willison: Things we learned about LLMs in 2024 Read the full episode description	Jan 12, 2025
Beating Google at Search with Neural PageRank and $5M of H200s — with Will Bryk of Exa.ai Read the full episode description	Jan 10, 2025
AI Engineering for Art — with comfyanonymous, of ComfyUI Read the full episode description	Jan 04, 2025
Latent.Space 2024 Year in Review Read the full episode description	Dec 31, 2024
2024 in Agents [LS Live! @ NeurIPS 2024] Read the full episode description	Dec 25, 2024
2024 in Synthetic Data and Smol Models [LS Live @ NeurIPS] Read the full episode description	Dec 24, 2024
2024 in Post-Transformers Architectures (State Space Models, RWKV) [LS Live @ NeurIPS] Read the full episode description	Dec 24, 2024
2024 in Open Models [LS Live @ NeurIPS] Read the full episode description	Dec 23, 2024
2024 in Vision [LS Live @ NeurIPS] Read the full episode description	Dec 22, 2024
2024 in AI Startups [LS Live @ NeurIPS] Read the full episode description	Dec 21, 2024
Windsurf: The Enterprise AI IDE - with Varun and Anshul of Codeium AI Read the full episode description	Dec 13, 2024
Generative Video WorldSim, Diffusion, Vision, Reinforcement Learning and Robotics — ICML 2024 Part 1 Read the full episode description	Dec 10, 2024
Bolt.new, Flow Engineering for Code Agents, and >$8m ARR in 2 months as a Claude Wrapper Read the full episode description	Dec 02, 2024
The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic Read the full episode description	Nov 28, 2024
Why Compound AI + Open Source will beat Closed AI Read the full episode description	Nov 25, 2024
Agents @ Work: Lindy.ai Read the full episode description	Nov 15, 2024
Agents @ Work: Dust.tt Read the full episode description	Nov 11, 2024
In the Arena: How LMSys changed LLM Benchmarking Forever Read the full episode description	Nov 01, 2024
How NotebookLM Was Made Read the full episode description	Oct 25, 2024
Building the AI Engineer Nation — with Josephine Teo, Minister of Digital Development and Information, Singapore Read the full episode description	Oct 19, 2024
Building the Silicon Brain - with Drew Houston of Dropbox Read the full episode description	Oct 18, 2024
Production AI Engineering starts with Evals — with Ankur Goyal of Braintrust Read the full episode description	Oct 11, 2024
Building AGI in Real Time (OpenAI Dev Day 2024) Read the full episode description	Oct 03, 2024
Language Agents: From Reasoning to Acting Read the full episode description	Sep 27, 2024
The Ultimate Guide to Prompting Read the full episode description	Sep 20, 2024
From API to AGI: Structured Outputs, OpenAI API platform and O1 Q&A — with Michelle Pokrass & OpenAI Devrel + Strawberry team Read the full episode description	Sep 13, 2024
Efficiency is Coming: 3000x Faster, Cheaper, Better AI Inference from Hardware Improvements, Quantization, and Synthetic Data Distillation Read the full episode description	Sep 03, 2024
Why you should write your own LLM benchmarks — with Nicholas Carlini, Google DeepMind Read the full episode description	Aug 29, 2024
Is finetuning GPT4o worth it? — with Alistair Pullen, Cosine (Genie) Read the full episode description	Aug 22, 2024
AI Magic: Shipping 1000s of successful products with no managers and a team of 12 — Jeremy Howard of Answer.ai Read the full episode description	Aug 16, 2024
Segment Anything 2: Demo-first Model Development Read the full episode description	Aug 07, 2024
The Winds of AI Winter (Q2 Four Wars Recap) + ChatGPT Voice Mode Preview Read the full episode description	Aug 02, 2024
Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI Read the full episode description	Jul 23, 2024
Benchmarks 201: Why Leaderboards > Arenas >> LLM-as-Judge Read the full episode description	Jul 12, 2024
The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka Read the full episode description	Jul 05, 2024
State of the Art: Training >70B LLMs on 10,000 H100 clusters Read the full episode description	Jun 25, 2024
[High Agency] AI Engineer World's Fair Preview Read the full episode description	Jun 25, 2024
How To Hire AI Engineers — with James Brady & Adam Wiggins of Elicit Read the full episode description	Jun 21, 2024
How AI is eating Finance — with Mike Conover of Brightwave Read the full episode description	Jun 11, 2024
ICLR 2024 — Best Papers & Talks (Benchmarks, Reasoning & Agents) — ft. Graham Neubig, Aman Sanger, Moritz Hardt) Read the full episode description	Jun 10, 2024
How to train a Million Context LLM — with Mark Huang of Gradient.ai Read the full episode description	May 30, 2024
ICLR 2024 — Best Papers & Talks (ImageGen, Vision, Transformers, State Space Models) ft. Durk Kingma, Christian Szegedy, Ilya Sutskever Read the full episode description	May 27, 2024
Emulating Humans with NSFW Chatbots - with Jesse Silver Read the full episode description	May 16, 2024
WebSim, WorldSim, and The Summer of Simulative AI — with Joscha Bach of Liquid AI, Karan Malhotra of Nous Research, Rob Haisfield of WebSim.ai Read the full episode description	Apr 27, 2024
High Agency Pydantic > VC Backed Frameworks — with Jason Liu of Instructor Read the full episode description	Apr 19, 2024
Supervise the Process of AI Research — with Jungwon Byun and Andreas Stuhlmüller of Elicit Read the full episode description	Apr 11, 2024
Latent Space Chats: NLW (Four Wars, GPT5), Josh Albrecht/Ali Rohde (TNAI), Dylan Patel/Semianalysis (Groq), Milind Naphade (Nvidia GTC), Personal AI (ft. Harrison Chase — LangFriend/LangMem) Read the full episode description	Apr 06, 2024
Presenting the AI Engineer World's Fair — with Sam Schillace, Deputy CTO of Microsoft Read the full episode description	Mar 29, 2024
Why Google failed to make GPT-3 + why Multimodal Agents are the path to AGI — with David Luan of Adept Read the full episode description	Mar 22, 2024
Making Transformers Sing - with Mikey Shulman of Suno Read the full episode description	Mar 14, 2024
Top 5 Research Trends + OpenAI Sora, Google Gemini, Groq Math (Jan-Feb 2024 Audio Recap) + Latent Space Anniversary with Lindy.ai, RWKV, Pixee, Julius.ai, Listener Q&A! Read the full episode description	Mar 09, 2024
Open Source AI is AI we can Trust — with Soumith Chintala of Meta AI Read the full episode description	Mar 06, 2024
A Brief History of the Open Source AI Hacker - with Ben Firshman of Replicate Read the full episode description	Feb 28, 2024
Truly Serverless Infra for AI Engineers - with Erik Bernhardsson of Modal Read the full episode description	Feb 16, 2024
Cloud Intelligence at the speed of 5000 tok/s - with Ce Zhang and Vipul Ved Prakash of Together AI Read the full episode description	Feb 08, 2024
Why StackOverflow usage is down 50% — with David Hsu of Retool Read the full episode description	Feb 01, 2024
The Four Wars of the AI Stack (Dec 2023 Audio Recap) Read the full episode description	Jan 25, 2024
How to train your own Large Multimodal Model — with Hugo Laurençon & Leo Tronchon of HuggingFace M4 Read the full episode description	Jan 19, 2024
RLHF 201 - with Nathan Lambert of AI2 and Interconnects Read the full episode description	Jan 11, 2024
The Accidental AI Canvas - with Steve Ruiz of tldraw Read the full episode description	Jan 05, 2024
NeurIPS 2023 Recap — Top Startups Read the full episode description	Dec 30, 2023
NeurIPS 2023 Recap — Best Papers Read the full episode description	Dec 23, 2023
The AI-First Graphics Editor - with Suhail Doshi of Playground AI Read the full episode description	Dec 20, 2023
The "Normsky" architecture for AI coding agents — with Beyang Liu + Steve Yegge of SourceGraph Read the full episode description	Dec 14, 2023
The Busy Person's Intro to Finetuning & Open Source AI - Wing Lian, Axolotl Read the full episode description	Dec 08, 2023
Notebooks = Chat++ and RAG = RecSys! — with Bryan Bischof of Hex Magic Read the full episode description	Nov 29, 2023
The State of Silicon and the GPU Poors - with Dylan Patel of SemiAnalysis Read the full episode description	Nov 17, 2023
AGI is Being Achieved Incrementally (DevDay Recap - cleaned audio) Read the full episode description	Nov 08, 2023
AGI is Being Achieved Incrementally (OpenAI DevDay w/ Simon Willison, Alex Volkov, Jim Fan, Raza Habib, Shreya Rajpal, Rahul Ligma, et al) Read the full episode description	Nov 08, 2023
Beating GPT-4 with Open Source LLMs — with Michael Royzen of Phind Read the full episode description	Nov 03, 2023
Powering your Copilot for Data – with Artem Keydunov of Cube.dev Read the full episode description	Oct 26, 2023
The End of Finetuning — with Jeremy Howard of Fast.ai Read the full episode description	Oct 19, 2023
Why AI Agents Don't Work (yet) - with Kanjun Qiu of Imbue Read the full episode description	Oct 14, 2023
[AIE Summit Preview #2] The AI Horcrux — Swyx on Cognitive Revolution Read the full episode description	Oct 08, 2023
[AIE Summit Preview #1] Swyx on Software 3.0 and the Rise of the AI Engineer Read the full episode description	Oct 07, 2023
RAG Is A Hack - with Jerry Liu from LlamaIndex Read the full episode description	Oct 05, 2023
Building the Foundation Model Ops Platform — with Raza Habib of Humanloop Read the full episode description	Sep 29, 2023
Heralds of the AI Content Flippening — with Youssef Rizk of Wondercraft.ai Read the full episode description	Sep 20, 2023
Doing it the Hard Way: Making the AI engine and language 🔥 of the future — with Chris Lattner of Modular Read the full episode description	Sep 14, 2023
The Point of LangChain — with Harrison Chase of LangChain Read the full episode description	Sep 06, 2023
RWKV: Reinventing RNNs for the Transformer Era — with Eugene Cheah of UIlicious Read the full episode description	Aug 30, 2023
Cursor.so: The AI-first Code Editor — with Aman Sanger of Anysphere Read the full episode description	Aug 22, 2023
The Mathematics of Training LLMs — with Quentin Anthony of Eleuther AI Read the full episode description	Aug 16, 2023
LLMs Everywhere: Running 70B models in browsers and iPhones using MLC — with Tianqi Chen of CMU / OctoML Read the full episode description	Aug 10, 2023
[AI Breakdown] Summer AI Technical Roundup: a Latent Space x AI Breakdown crossover pod! Read the full episode description	Aug 04, 2023
FlashAttention 2: making Transformers 800% faster w/o approximation - with Tri Dao of Together AI Read the full episode description	Jul 26, 2023
Llama 2: The New Open LLM SOTA (ft. Nathan Lambert, Matt Bornstein, Anton Troynikov, Russell Kaplan, Whole Mars Catalog et al.) Read the full episode description	Jul 19, 2023
AI Fundamentals: Datasets 101 Read the full episode description	Jul 17, 2023
Code Interpreter == GPT 4.5 (w/ Simon Willison, Alex Volkov, Aravind Srinivas, Alex Graveley, et al.) Read the full episode description	Jul 10, 2023
[Practical AI] AI Trends: a Latent Space x Practical AI crossover pod! Read the full episode description	Jul 02, 2023
[Cognitive Revolution] The Tiny Model Revolution with Ronen Eldan and Yuanzhi Li of Microsoft Research Read the full episode description	Jul 01, 2023
Commoditizing the Petaflop — with George Hotz of the tiny corp Read the full episode description	Jun 20, 2023
Emergency Pod: OpenAI's new Functions API, 75% Price Drop, 4x Context Length (w/ Alex Volkov, Simon Willison, Riley Goodside, Joshua Lochner, Stefania Druga, Eric Elliott, Mayo Oshin et al) Read the full episode description	Jun 14, 2023
From RLHF to RLHB: The Case for Learning from Human Behavior - with Jeffrey Wang and Joe Reeve of Amplitude Read the full episode description	Jun 08, 2023
Building the AI × UX Scenius — with Linus Lee of Notion AI Read the full episode description	Jun 01, 2023
Debugging the Internet with AI agents – with Itamar Friedman of Codium AI and AutoGPT Read the full episode description	May 25, 2023
MPT-7B and The Beginning of Context=Infinity — with Jonathan Frankle and Abhinav Venigalla of MosaicML Read the full episode description	May 20, 2023
Guaranteed quality and structure in LLM outputs - with Shreya Rajpal of Guardrails AI Read the full episode description	May 16, 2023
The AI Founder Gene: Being Early, Building Fast, and Believing in Greatness — with Sharif Shameem of Lexica Read the full episode description	May 08, 2023
No Moat: Closed AI gets its Open Source wakeup call — ft. Simon Willison Read the full episode description	May 05, 2023
Training a SOTA Code LLM in 1 week and Quantifying the Vibes — with Reza Shabani of Replit Read the full episode description	May 03, 2023
Mapping the future of truly Open Models and Training Dolly for $30 — with Mike Conover of Databricks Read the full episode description	Apr 29, 2023
AI-powered Search for the Enterprise — with Deedy Das of Glean Read the full episode description	Apr 22, 2023
Segment Anything Model and the Hard Problems of Computer Vision — with Joseph Nelson of Roboflow Read the full episode description	Apr 13, 2023
AI Fundamentals: Benchmarks 101 Read the full episode description	Apr 07, 2023
Grounded Research: From Google Brain to MLOps to LLMOps — with Shreya Shankar of UC Berkeley Read the full episode description	Mar 29, 2023
Emergency Pod: ChatGPT's App Store Moment (w/ OpenAI's Logan Kilpatrick, LindyAI's Florent Crivello and Nader Dabit) Read the full episode description	Mar 24, 2023
From Astrophysics to AI: Building the future AI Data Stack — with Sarah Nagy of Seek.ai Read the full episode description	Mar 10, 2023
97% Cheaper, Faster, Better, Correct AI — with Varun Mohan of Codeium Read the full episode description	Mar 02, 2023
ChatGPT, GPT4 hype, and Building LLM-native products — with Logan Kilpatrick of OpenAI Read the full episode description	Feb 23, 2023