How AI Is Built

By Nicolay Gerold

Category: Technology

Subscribers: 2
Reviews: 0
Episodes: 63

Description

Real engineers. Real deployments. Zero hype. We interview the top engineers who actually put AI in production. Learn what the best engineers have figured out through years of experience. Hosted by Nicolay Gerold, CEO of Aisbach and CTO at Proxdeal and Multiply Content.

Episodes (title, then release date)
#056 Building Solo: How One Engineer Uses AI Agents to Ship Production Code
Sep 11, 2025
#055 Embedding Intelligence: AI's Move to the Edge
Aug 13, 2025
#054 Building Frankenstein Models with Model Merging and the Future of AI
Jul 29, 2025
#053 AI in the Terminal: Enhancing Coding with Warp
Jul 23, 2025
#052 Don't Build Models, Build Systems That Build Models
Jul 01, 2025
#051 Build systems that can be debugged at 4am by tired humans with no context
Jun 17, 2025
#050 Bringing LLMs to Production: Delete Frameworks, Avoid Finetuning, Ship Faster
May 27, 2025
#050 TAKEAWAYS Bringing LLMs to Production: Delete Frameworks, Avoid Finetuning, Ship Faster
May 27, 2025
#049 BAML: The Programming Language That Turns LLMs into Predictable Functions
May 20, 2025
#049 TAKEAWAYS BAML: The Programming Language That Turns LLMs into Predictable Functions
May 20, 2025
#048 TAKEAWAYS Why Your AI Agents Need Permission to Act, Not Just Read
May 13, 2025
#048 Why Your AI Agents Need Permission to Act, Not Just Read
May 11, 2025
#047 Architecting Information for Search, Humans, and Artificial Intelligence
Mar 27, 2025
#046 Building a Search Database From First Principles
Mar 13, 2025
#045 RAG As Two Things - Prompt Engineering and Search
Mar 06, 2025
#044 Graphs Aren't Just For Specialists Anymore
Feb 28, 2025
#043 Knowledge Graphs Won't Fix Bad Data
Feb 20, 2025
#042 Temporal RAG, Embracing Time for Smarter, Reliable Knowledge Graphs
Feb 13, 2025
#041 Context Engineering, How Knowledge Graphs Help LLMs Reason
Feb 06, 2025
#040 Vector Database Quantization, Product, Binary, and Scalar
Jan 31, 2025
#039 Local-First Search, How to Push Search To End-Devices
Jan 23, 2025
#038 AI-Powered Search, Context Is King, But Your RAG System Ignores Two-Thirds of It
Jan 09, 2025
#037 Chunking for RAG: Stop Breaking Your Documents Into Meaningless Pieces
Jan 03, 2025
#036 How AI Can Start Teaching Itself - Synthetic Data Deep Dive
Dec 19, 2024
#035 A Search System That Learns As You Use It (Agentic RAG)
Dec 13, 2024
#034 Rethinking Search Inside Postgres, From Lexemes to BM25
Dec 05, 2024
#033 RAG's Biggest Problems & How to Fix It (ft. Synthetic Data)
Nov 28, 2024
#032 Improving Documentation Quality for RAG Systems
Nov 21, 2024
#031 BM25 As The Workhorse Of Search; Vectors Are Its Visionary Cousin
Nov 15, 2024
#030 Vector Search at Scale, Why One Size Doesn't Fit All
Nov 07, 2024
#029 Search Systems at Scale, Avoiding Local Maxima and Other Engineering Lessons
Oct 31, 2024
#028 Training Multi-Modal AI, Inside the Jina CLIP Embedding Model
Oct 25, 2024
#027 Building the database for AI, Multi-modal AI, Multi-modal Storage
Oct 23, 2024
#026 Embedding Numbers, Categories, Locations, Images, Text, and The World
Oct 10, 2024
#025 Data Models to Remove Ambiguity from AI and Search
Oct 04, 2024
#024 How ColPali is Changing Information Retrieval
Sep 27, 2024
#023 The Power of Rerankers in Modern Search
Sep 26, 2024
#022 The Limits of Embeddings, Out-of-Domain Data, Long Context, Finetuning (and How We're Fixing It)
Sep 19, 2024
#021 The Problems You Will Encounter With RAG At Scale And How To Prevent (or fix) Them
Sep 12, 2024
#020 The Evolution of Search, Finding Search Signals, GenAI Augmented Retrieval
Sep 05, 2024
#019 Data-driven Search Optimization, Analysing Relevance
Aug 30, 2024
#018 Query Understanding: Doing The Work Before The Query Hits The Database
Aug 15, 2024
Season 2 Trailer: Mastering Search
Aug 08, 2024
#017 Unlocking Value from Unstructured Data, Real-World Applications of Generative AI
Jul 16, 2024
#016 Data Processing for AI, Integrating AI into Data Pipelines, Spark
Jul 12, 2024
#015 Building AI Agents for the Enterprise, Agent Cost Controls, Seamless UX
Jul 04, 2024
#014 Building Predictable Agents through Prompting, Compression, and Memory Strategies
Jun 27, 2024
Data Integration and Ingestion for AI & LLMs, Architecting Data Flows | changelog 3
Jun 25, 2024
#013 ETL for LLMs, Integrating and Normalizing Unstructured Data
Jun 19, 2024
#012 Serverless Data Orchestration, AI in the Data Stack, AI Pipelines
Jun 14, 2024
#011 Mastering Vector Databases, Product & Binary Quantization, Multi-Vector Search
Jun 07, 2024
#010 Building Robust AI and Data Systems, Data Architecture, Data Quality, Data Storage
May 31, 2024
#009 Modern Data Infrastructure for Analytics and AI, Lakehouses, Open Source Data Stack
May 24, 2024
#008 Knowledge Graphs for Better RAG, Virtual Entities, Hybrid Data Models
May 20, 2024
#007 Navigating the Modern Data Stack, Choosing the Right OSS Tools, From Problem to Requirements to Architecture
May 17, 2024
#006 Data Orchestration Tools, Choosing the right one for your needs
May 10, 2024
#005 Building Reliable LLM Applications, Production-Ready RAG, Data-Driven Evals
May 03, 2024
Lance v2: Rethinking Columnar Storage for Faster Lookups, Nulls, and Flexible Encodings | changelog 2
Apr 29, 2024
#004 AI with Supabase, Postgres Configuration, Real-Time Processing, and more
Apr 26, 2024
#003 AI Inside Your Database, Real-Time AI, Declarative ML/AI
Apr 19, 2024
Supabase acquires OrioleDB, A New Database Engine for PostgreSQL | changelog 1
Apr 17, 2024
#002 AI Powered Data Transformation, Combining gen & trad AI, Semantic Validation
Apr 12, 2024
#001 Multimodal AI, Storing 1 Billion Vectors, Building Data Infrastructure at LanceDB
Apr 05, 2024