| Episode | Date |
|---|---|
| Vision Banana: Rethinking How AI Models See and Generalize | Apr 23, 2026 |
| Position Encoding: How Transformers Understand Order in Data | Apr 22, 2026 |
| V-JEPA 2.1: Learning Video Understanding Without Labels | Apr 21, 2026 |
| Agentic AI Cost: The Hidden Economics of Autonomous Systems | Apr 20, 2026 |
| ChopGrad: Making Training More Efficient by Cutting Gradient Complexity | Apr 17, 2026 |
| Qwen Image Edit: Bringing Precision and Control to AI-Powered Image Editing | Apr 16, 2026 |
| Ouro: Building Self-Improving AI Through Iterative Learning Loops | Apr 15, 2026 |
| Mythos: Teaching AI to Understand Stories, Not Just Text | Apr 14, 2026 |
| DRCT: Rethinking Image Restoration With Diffusion-Based Reconstruction | Apr 13, 2026 |
| LongCat: Scaling Image Editing With Long-Context Understanding | Apr 11, 2026 |
| BLIP-2: Bridging Vision and Language Without Full Retraining | Apr 10, 2026 |
| Ultralytics Platform: Simplifying End-to-End Computer Vision Development | Apr 09, 2026 |
| OpenSeeker: Rethinking Search With AI-Native Reasoning | Apr 06, 2026 |
| Apple MPS: Unlocking GPU Acceleration for AI on Apple Devices | Apr 06, 2026 |
| LeWorldModel: Teaching AI to Simulate and Understand the World | Apr 03, 2026 |
| V-JEPA 2.1: Learning to Understand Video Without Labels | Apr 02, 2026 |
| NeRFify: Turning Images Into Immersive 3D Worlds With AI | Apr 01, 2026 |
| Molmo Point: Teaching AI to Ground Language in Precise Visual Locations | Mar 31, 2026 |
| Think, Then Lie: When AI Reasoning Doesn't Guarantee Truth | Mar 30, 2026 |
| ReCoSplat: Reconstructing 3D Worlds From Sparse Visual Data | Mar 27, 2026 |
| Video Understanding: Teaching AI to Make Sense of Motion and Time | Mar 26, 2026 |
| Penguin-VL: Advancing Vision–Language Models With Stronger Reasoning | Mar 25, 2026 |
| cuVSLAM: Accelerating Real-Time Visual SLAM With GPU Power | Mar 24, 2026 |
| MM-Zero: Learning Multimodal Intelligence From Scratch | Mar 23, 2026 |
| Helios: Rethinking How AI Models Scale Across Compute and Data | Mar 20, 2026 |
| BitNet: Rethinking Neural Networks With 1-Bit Precision | Mar 19, 2026 |
| Agents of Chaos: When Multiple AI Systems Interact in Unpredictable Ways | Mar 18, 2026 |
| OC-SORT: Improving Object Tracking by Fixing Motion, Not Just Detection | Mar 17, 2026 |
| Attention Residuals: Understanding the Hidden Signals Inside Transformer Models | Mar 16, 2026 |
| SORT: A Simple and Efficient Approach to Real-Time Object Tracking | Mar 16, 2026 |
| SigLIP 2: Advancing Vision-Language Understanding Without Contrastive Bottlenecks | Mar 13, 2026 |
| Nemotron-3 Super: Pushing the Limits of Reasoning in Large Language Models | Mar 12, 2026 |
| AI Hallucinations: Why Language Models Sometimes Make Things Up | Mar 11, 2026 |
| ByteTrack: A Smarter Way for AI to Track Objects in Real Time | Mar 10, 2026 |
| AI and Copyright: Who Owns Content Created by Machines? | Mar 04, 2026 |
| Qwen 3.5 - Advancing Open Multilingual Intelligence at Scale | Feb 27, 2026 |
| Unified Latents: Bringing Images, Video, and Language Into One Shared AI Space | Feb 25, 2026 |
| DeepSeek-V3: Scaling Open Reasoning Models With Efficiency and Precision | Feb 23, 2026 |
| Repeat-Repeat: Why Simply Repeating a Prompt Can Make LLMs Smarter | Feb 19, 2026 |
| Seedance 2.0: Moving From AI Video Generation to Cinematic Intelligence | Feb 18, 2026 |
| Molmo: Building Open Multimodal AI That Can Truly See and Understand | Feb 17, 2026 |
| Seedance 1.0: The Next Leap in AI Video Generation | Feb 16, 2026 |
| LoRA: Teaching Massive AI Models New Skills Without Retraining Everything | Feb 13, 2026 |
| Wembley Goal: How Computer Vision Settled Football's Most Controversial Moment | Feb 12, 2026 |
| I-JEPA: Teaching AI to Understand Images Without Labels | Feb 11, 2026 |
| EchoJEPA: Teaching AI to Truly Understand the Beating Heart | Feb 10, 2026 |
| PaperBanana: From Raw Text to Publication-Ready Diagrams | Feb 09, 2026 |
| SleepFM: Predicting Future Disease from a Single Night of Sleep | Feb 06, 2026 |
| RF-DETR: Neural Architecture Search for Real-Time Detection Transformers | Feb 04, 2026 |
| YOLO26: Rethinking Real-Time Vision for the Edge | Feb 03, 2026 |
| DeepSeek mHC | Jan 05, 2026 |
| Chinchilla Scaling Law | Dec 18, 2025 |
| Gradient-Based Planning | Dec 13, 2025 |
| SAM3D: The Next Leap in 3D Understanding | Dec 10, 2025 |
| DINOv3: A new Self-Supervised Learning (SSL) Vision Language Model (VLM) | Oct 29, 2025 |
| dots.ocr SOTA Document Parsing in a Compact VLM | Oct 28, 2025 |
| DeepSeek-OCR: A Revolutionary Idea | Oct 23, 2025 |
| nanochat by Karpathy - How to build your own ChatGPT for $100 | Oct 21, 2025 |
| SmolVLM: Small Yet Mighty Vision Language Model | Oct 01, 2025 |
| Common Pitfalls in Computer Vision & AI Projects (and How to Avoid Them) | Oct 01, 2025 |