Latent Space: The AI Engineer Podcast — Practitioners talking LLMs, CodeGen, Agents, Multimodality, AI UX, GPU Infra and al

By Alessio + swyx

Listen to a podcast, please open Podcast Republic app. Available on Google Play Store and Apple App Store.

Image by Alessio + swyx

Category: Technology

Open in Apple Podcasts


Open RSS feed


Open Website


Rate for this podcast

Subscribers: 151
Reviews: 0
Episodes: 106

Description

The podcast by and for AI Engineers! In 2023, over 1 million visitors came to Latent Space to hear about news, papers and interviews in Software 3.0. We cover Foundation Models changing every domain in Code Generation, Multimodality, AI Agents, GPU Infra and more, directly from the founders, builders, and thinkers involved in pushing the cutting edge. Striving to give you both the definitive take on the Current Thing down to the first introduction to the tech you'll be using in the next 3 months! We break news and exclusive interviews from OpenAI, tiny (George Hotz), Databricks/MosaicML (Jon Frankle), Modular (Chris Lattner), Answer.ai (Jeremy Howard), et al. Full show notes always on https://latent.space

www.latent.space

Episode Date
2024 in Agents [LS Live! @ NeurIPS 2024]
Dec 25, 2024
2024 in Synthetic Data and Smol Models [LS Live @ NeurIPS]
Dec 24, 2024
2024 in Post-Transformers Architectures (State Space Models, RWKV) [LS Live @ NeurIPS]
Dec 24, 2024
2024 in Open Models [LS Live @ NeurIPS]
Dec 23, 2024
2024 in Vision [LS Live @ NeurIPS]
Dec 22, 2024
2024 in AI Startups [LS Live @ NeurIPS]
Dec 21, 2024
Windsurf: The Enterprise AI IDE - with Varun and Anshul of Codeium AI
Dec 13, 2024
Generative Video WorldSim, Diffusion, Vision, Reinforcement Learning and Robotics — ICML 2024 Part 1
Dec 10, 2024
Bolt.new, Flow Engineering for Code Agents, and >$8m ARR in 2 months as a Claude Wrapper
Dec 02, 2024
The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic
Nov 28, 2024
Why Compound AI + Open Source will beat Closed AI
Nov 25, 2024
Agents @ Work: Lindy.ai
Nov 15, 2024
Agents @ Work: Dust.tt
Nov 11, 2024
In the Arena: How LMSys changed LLM Benchmarking Forever
Nov 01, 2024
How NotebookLM Was Made
Oct 25, 2024
Building the AI Engineer Nation — with Josephine Teo, Minister of Digital Development and Information, Singapore
Oct 19, 2024
Building the Silicon Brain - with Drew Houston of Dropbox
Oct 18, 2024
Production AI Engineering starts with Evals — with Ankur Goyal of Braintrust
Oct 11, 2024
Building AGI in Real Time (OpenAI Dev Day 2024)
Oct 03, 2024
Language Agents: From Reasoning to Acting
Sep 27, 2024
The Ultimate Guide to Prompting
Sep 20, 2024
From API to AGI: Structured Outputs, OpenAI API platform and O1 Q&A — with Michelle Pokrass & OpenAI Devrel + Strawberry team
Sep 13, 2024
Efficiency is Coming: 3000x Faster, Cheaper, Better AI Inference from Hardware Improvements, Quantization, and Synthetic Data Distillation
Sep 03, 2024
Why you should write your own LLM benchmarks — with Nicholas Carlini, Google DeepMind
Aug 29, 2024
Is finetuning GPT4o worth it? — with Alistair Pullen, Cosine (Genie)
Aug 22, 2024
AI Magic: Shipping 1000s of successful products with no managers and a team of 12 — Jeremy Howard of Answer.ai
Aug 16, 2024
Segment Anything 2: Demo-first Model Development
Aug 07, 2024
The Winds of AI Winter (Q2 Four Wars Recap) + ChatGPT Voice Mode Preview
Aug 02, 2024
Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI
Jul 23, 2024
Benchmarks 201: Why Leaderboards > Arenas >> LLM-as-Judge
Jul 12, 2024
The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka
Jul 05, 2024
State of the Art: Training >70B LLMs on 10,000 H100 clusters
Jun 25, 2024
[High Agency] AI Engineer World's Fair Preview
Jun 25, 2024
How To Hire AI Engineers — with James Brady & Adam Wiggins of Elicit
Jun 21, 2024
How AI is eating Finance — with Mike Conover of Brightwave
Jun 11, 2024
ICLR 2024 — Best Papers & Talks (Benchmarks, Reasoning & Agents) — ft. Graham Neubig, Aman Sanger, Moritz Hardt)
Jun 10, 2024
How to train a Million Context LLM — with Mark Huang of Gradient.ai
May 30, 2024
ICLR 2024 — Best Papers & Talks (ImageGen, Vision, Transformers, State Space Models) ft. Durk Kingma, Christian Szegedy, Ilya Sutskever
May 27, 2024
Emulating Humans with NSFW Chatbots - with Jesse Silver
May 16, 2024
WebSim, WorldSim, and The Summer of Simulative AI — with Joscha Bach of Liquid AI, Karan Malhotra of Nous Research, Rob Haisfield of WebSim.ai
Apr 27, 2024
High Agency Pydantic > VC Backed Frameworks — with Jason Liu of Instructor
Apr 19, 2024
Supervise the Process of AI Research — with Jungwon Byun and Andreas Stuhlmüller of Elicit
Apr 11, 2024
Latent Space Chats: NLW (Four Wars, GPT5), Josh Albrecht/Ali Rohde (TNAI), Dylan Patel/Semianalysis (Groq), Milind Naphade (Nvidia GTC), Personal AI (ft. Harrison Chase — LangFriend/LangMem)
Apr 06, 2024
Presenting the AI Engineer World's Fair — with Sam Schillace, Deputy CTO of Microsoft
Mar 29, 2024
Why Google failed to make GPT-3 + why Multimodal Agents are the path to AGI — with David Luan of Adept
Mar 22, 2024
Making Transformers Sing - with Mikey Shulman of Suno
Mar 14, 2024
Top 5 Research Trends + OpenAI Sora, Google Gemini, Groq Math (Jan-Feb 2024 Audio Recap) + Latent Space Anniversary with Lindy.ai, RWKV, Pixee, Julius.ai, Listener Q&A!
Mar 09, 2024
Open Source AI is AI we can Trust — with Soumith Chintala of Meta AI
Mar 06, 2024
A Brief History of the Open Source AI Hacker - with Ben Firshman of Replicate
Feb 28, 2024
Truly Serverless Infra for AI Engineers - with Erik Bernhardsson of Modal
Feb 16, 2024
Cloud Intelligence at the speed of 5000 tok/s - with Ce Zhang and Vipul Ved Prakash of Together AI
Feb 08, 2024
Why StackOverflow usage is down 50% — with David Hsu of Retool
Feb 01, 2024
The Four Wars of the AI Stack (Dec 2023 Audio Recap)
Jan 25, 2024
How to train your own Large Multimodal Model — with Hugo Laurençon & Leo Tronchon of HuggingFace M4
Jan 19, 2024
RLHF 201 - with Nathan Lambert of AI2 and Interconnects
Jan 11, 2024
The Accidental AI Canvas - with Steve Ruiz of tldraw
Jan 05, 2024
NeurIPS 2023 Recap — Top Startups
Dec 30, 2023
NeurIPS 2023 Recap — Best Papers
Dec 23, 2023
The AI-First Graphics Editor - with Suhail Doshi of Playground AI
Dec 20, 2023
The "Normsky" architecture for AI coding agents — with Beyang Liu + Steve Yegge of SourceGraph
Dec 14, 2023
The Busy Person's Intro to Finetuning & Open Source AI - Wing Lian, Axolotl
Dec 08, 2023
Notebooks = Chat++ and RAG = RecSys! — with Bryan Bischof of Hex Magic
Nov 29, 2023
The State of Silicon and the GPU Poors - with Dylan Patel of SemiAnalysis
Nov 17, 2023
AGI is Being Achieved Incrementally (DevDay Recap - cleaned audio)
Nov 08, 2023
AGI is Being Achieved Incrementally (OpenAI DevDay w/ Simon Willison, Alex Volkov, Jim Fan, Raza Habib, Shreya Rajpal, Rahul Ligma, et al)
Nov 08, 2023
Beating GPT-4 with Open Source LLMs — with Michael Royzen of Phind
Nov 03, 2023
Powering your Copilot for Data – with Artem Keydunov of Cube.dev
Oct 26, 2023
The End of Finetuning — with Jeremy Howard of Fast.ai
Oct 19, 2023
Why AI Agents Don't Work (yet) - with Kanjun Qiu of Imbue
Oct 14, 2023
[AIE Summit Preview #2] The AI Horcrux — Swyx on Cognitive Revolution
Oct 08, 2023
[AIE Summit Preview #1] Swyx on Software 3.0 and the Rise of the AI Engineer
Oct 07, 2023
RAG Is A Hack - with Jerry Liu from LlamaIndex
Oct 05, 2023
Building the Foundation Model Ops Platform — with Raza Habib of Humanloop
Sep 29, 2023
Heralds of the AI Content Flippening — with Youssef Rizk of Wondercraft.ai
Sep 20, 2023
Doing it the Hard Way: Making the AI engine and language 🔥 of the future — with Chris Lattner of Modular
Sep 14, 2023
The Point of LangChain — with Harrison Chase of LangChain
Sep 06, 2023
RWKV: Reinventing RNNs for the Transformer Era — with Eugene Cheah of UIlicious
Aug 30, 2023
Cursor.so: The AI-first Code Editor — with Aman Sanger of Anysphere
Aug 22, 2023
The Mathematics of Training LLMs — with Quentin Anthony of Eleuther AI
Aug 16, 2023
LLMs Everywhere: Running 70B models in browsers and iPhones using MLC — with Tianqi Chen of CMU / OctoML
Aug 10, 2023
[AI Breakdown] Summer AI Technical Roundup: a Latent Space x AI Breakdown crossover pod!
Aug 04, 2023
FlashAttention 2: making Transformers 800% faster w/o approximation - with Tri Dao of Together AI
Jul 26, 2023
Llama 2: The New Open LLM SOTA (ft. Nathan Lambert, Matt Bornstein, Anton Troynikov, Russell Kaplan, Whole Mars Catalog et al.)
Jul 19, 2023
AI Fundamentals: Datasets 101
Jul 17, 2023
Code Interpreter == GPT 4.5 (w/ Simon Willison, Alex Volkov, Aravind Srinivas, Alex Graveley, et al.)
Jul 10, 2023
[Practical AI] AI Trends: a Latent Space x Practical AI crossover pod!
Jul 02, 2023
[Cognitive Revolution] The Tiny Model Revolution with Ronen Eldan and Yuanzhi Li of Microsoft Research
Jul 01, 2023
Commoditizing the Petaflop — with George Hotz of the tiny corp
Jun 20, 2023
Emergency Pod: OpenAI's new Functions API, 75% Price Drop, 4x Context Length (w/ Alex Volkov, Simon Willison, Riley Goodside, Joshua Lochner, Stefania Druga, Eric Elliott, Mayo Oshin et al)
Jun 14, 2023
From RLHF to RLHB: The Case for Learning from Human Behavior - with Jeffrey Wang and Joe Reeve of Amplitude
Jun 08, 2023
Building the AI × UX Scenius — with Linus Lee of Notion AI
Jun 01, 2023
Debugging the Internet with AI agents – with Itamar Friedman of Codium AI and AutoGPT
May 25, 2023
MPT-7B and The Beginning of Context=Infinity — with Jonathan Frankle and Abhinav Venigalla of MosaicML
May 20, 2023
Guaranteed quality and structure in LLM outputs - with Shreya Rajpal of Guardrails AI
May 16, 2023
The AI Founder Gene: Being Early, Building Fast, and Believing in Greatness — with Sharif Shameem of Lexica
May 08, 2023
No Moat: Closed AI gets its Open Source wakeup call — ft. Simon Willison
May 05, 2023
Training a SOTA Code LLM in 1 week and Quantifying the Vibes — with Reza Shabani of Replit
May 03, 2023
Mapping the future of *truly* Open Models and Training Dolly for $30 — with Mike Conover of Databricks
Apr 29, 2023
AI-powered Search for the Enterprise — with Deedy Das of Glean
Apr 22, 2023
Segment Anything Model and the Hard Problems of Computer Vision — with Joseph Nelson of Roboflow
Apr 13, 2023
AI Fundamentals: Benchmarks 101
Apr 07, 2023
Grounded Research: From Google Brain to MLOps to LLMOps — with Shreya Shankar of UC Berkeley
Mar 29, 2023
Emergency Pod: ChatGPT's App Store Moment (w/ OpenAI's Logan Kilpatrick, LindyAI's Florent Crivello and Nader Dabit)
Mar 24, 2023
From Astrophysics to AI: Building the future AI Data Stack — with Sarah Nagy of Seek.ai
Mar 10, 2023
97% Cheaper, Faster, Better, Correct AI — with Varun Mohan of Codeium
Mar 02, 2023
ChatGPT, GPT4 hype, and Building LLM-native products — with Logan Kilpatrick of OpenAI
Feb 23, 2023