Yannic Kilcher Videos (Audio Only)

By Yannic Kilcher

Category: Technology

Subscribers: 9
Reviews: 0
Episodes: 177

Description

I make videos about machine learning research papers, programming, and issues of the AI community, and the broader impact of AI in society.

Twitter: https://twitter.com/ykilcher
Discord: https://discord.gg/4H8xxDF

If you want to support me, the best thing to do is to share out the content :)

If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar (preferred to Patreon): https://www.subscribestar.com/yannickilcher
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq

Episodes (each title is followed by its release date)
Efficient Streaming Language Models with Attention Sinks (Paper Explained)
Oct 17, 2023
Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained)
Oct 17, 2023
Retentive Network: A Successor to Transformer for Large Language Models (Paper Explained)
Oct 05, 2023
Reinforced Self-Training (ReST) for Language Modeling (Paper Explained)
Oct 05, 2023
[ML News] LLaMA2 Released | LLMs for Robots | Multimodality on the Rise
Aug 28, 2023
How Cyber Criminals Are Using ChatGPT (w/ Sergey Shykevich)
Aug 28, 2023
Recipe AI suggests FATAL CHLORINE GAS Recipe
Aug 28, 2023
DeepFloyd IF - Pixel-Based Text-to-Image Diffusion (w/ Authors)
Aug 28, 2023
[ML News] GPT-4 solves MIT Exam with 100% ACCURACY | OpenLLaMA 13B released
Aug 28, 2023
Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust (Explained)
Aug 28, 2023
RWKV: Reinventing RNNs for the Transformer Era (Paper Explained)
Aug 28, 2023
Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Full Paper Review)
Aug 28, 2023
OpenAI suggests AI licenses (US Senate hearing on AI regulation w/ Sam Altman)
Aug 28, 2023
[ML News] Geoff Hinton leaves Google | Google has NO MOAT | OpenAI down half a billion
Aug 28, 2023
Scaling Transformer to 1M tokens and beyond with RMT (Paper Explained)
Aug 28, 2023
OpenAssistant RELEASED! The world's best open-source Chat AI!
Aug 28, 2023
OpenAssistant First Models are here! (Open-Source ChatGPT)
Aug 28, 2023
The biggest week in AI (GPT-4, Office Copilot, Google PaLM, Anthropic Claude & more)
Aug 28, 2023
GPT-4 is here! What we know so far (Full Analysis)
Aug 28, 2023
This ChatGPT Skill will earn you $10B (also, AI reads your mind!)
Aug 28, 2023
LLaMA: Open and Efficient Foundation Language Models (Paper Explained)
Aug 28, 2023
Open Assistant Inference Backend Development (Hands-On Coding)
Aug 28, 2023
OpenAssistant - ChatGPT's Open Alternative (We need your help!)
Aug 28, 2023
ChatGPT: This AI has a JAILBREAK?! (Unbelievable AI Progress)
Jan 02, 2023
[ML News] GPT-4 Rumors | AI Mind Reading | Neuron Interaction Solved | AI Theorem Proving
Nov 30, 2022
CICERO: An AI agent that negotiates, persuades, and cooperates with people
Nov 30, 2022
[ML News] Multiplayer Stable Diffusion | OpenAI needs more funding | Text-to-Video models incoming
Nov 23, 2022
The New AI Model Licenses have a Legal Loophole (OpenRAIL-M of BLOOM, Stable Diffusion, etc.)
Nov 23, 2022
ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)
Nov 23, 2022
Neural Networks are Decision Trees (w/ Alexander Mattick)
Oct 23, 2022
This is a game changer! (AlphaTensor by DeepMind explained)
Oct 23, 2022
[ML News] Stable Diffusion Takes Over! (Open Source AI Art)
Oct 23, 2022
How to make your CPU as fast as a GPU - Advances in Sparsity w/ Nir Shavit
Oct 23, 2022
More Is Different for AI - Scaling Up, Emergence, and Paperclip Maximizers (w/ Jacob Steinhardt)
Sep 15, 2022
The hidden dangers of loading open-source AI models (ARBITRARY CODE EXPLOIT!)
Sep 07, 2022
The Future of AI is Self-Organizing and Self-Assembling (w/ Prof. Sebastian Risi)
Aug 29, 2022
The Man behind Stable Diffusion
Aug 29, 2022
[ML News] BLOOM: 176B Open-Source | Chinese Brain-Scale Computer | Meta AI: No Language Left Behind
Aug 03, 2022
JEPA - A Path Towards Autonomous Machine Intelligence (Paper Explained)
Jul 10, 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos (Paper Explained)
Jun 28, 2022
Parti - Scaling Autoregressive Models for Content-Rich Text-to-Image Generation (Paper Explained)
Jun 28, 2022
Did Google's LaMDA chatbot just become sentient?
Jun 20, 2022
[ML News] DeepMind's Flamingo Image-Text model | Locked-Image Tuning | Jurassic X & MRKL
May 16, 2022
[ML News] Meta's OPT 175B language model | DALL-E Mega is training | TorToiSe TTS fakes my voice
May 12, 2022
This A.I. creates infinite NFTs
May 12, 2022
Author Interview: SayCan - Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
May 12, 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances (SayCan - Paper Explained)
May 02, 2022
Author Interview - ACCEL: Evolving Curricula with Regret-Based Environment Design
May 02, 2022
ACCEL: Evolving Curricula with Regret-Based Environment Design (Paper Review)
May 02, 2022
LAION-5B: 5 billion image-text-pairs dataset (with the authors)
Apr 25, 2022
Sparse Expert Models (Switch Transformers, GLAM, and more... w/ the Authors)
Apr 25, 2022
Author Interview - Transformer Memory as a Differentiable Search Index
Apr 21, 2022
Transformer Memory as a Differentiable Search Index (Machine Learning Research Paper Explained)
Apr 21, 2022
[ML News] Google's 540B PaLM Language Model & OpenAI's DALL-E 2 Text-to-Image Revolution
Apr 12, 2022
The Weird and Wonderful World of AI Art (w/ Author Jack Morris)
Apr 06, 2022
Author Interview - Improving Intrinsic Exploration with Language Abstractions
Apr 06, 2022
Improving Intrinsic Exploration with Language Abstractions (Machine Learning Paper Explained)
Apr 06, 2022
[ML News] GPT-3 learns to edit | Google Pathways | Make-A-Scene | CLIP meets GamePhysics | DouBlind
Apr 06, 2022
Author Interview - Memory-assisted prompt editing to improve GPT-3 after deployment
Mar 30, 2022
Memory-assisted prompt editing to improve GPT-3 after deployment (Machine Learning Paper Explained)
Mar 30, 2022
Author Interview - Typical Decoding for Natural Language Generation
Mar 28, 2022
Typical Decoding for Natural Language Generation (Get more human-like outputs from language models!)
Mar 28, 2022
One Model For All The Tasks - BLIP (Author Interview)
Mar 25, 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding&Generation
Mar 25, 2022
[ML News] AI Threatens Biological Arms Race
Mar 22, 2022
Active Dendrites avoid catastrophic forgetting - Interview with the Authors
Mar 21, 2022
Avoiding Catastrophe: Active Dendrites Enable Multi-Task Learning in Dynamic Environments (Review)
Mar 21, 2022
Author Interview - VOS: Learning What You Don't Know by Virtual Outlier Synthesis
Mar 17, 2022
VOS: Learning What You Don't Know by Virtual Outlier Synthesis (Paper Explained)
Mar 14, 2022
Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents
Mar 10, 2022
First Author Interview: AI & formal math (Formal Mathematics Statement Curriculum Learning)
Mar 08, 2022
OpenAI tackles Math - Formal Mathematics Statement Curriculum Learning (Paper Explained)
Mar 08, 2022
[ML News] DeepMind controls fusion | Yann LeCun's JEPA architecture | US: AI can't copyright its art
Mar 08, 2022
AlphaCode - with the authors!
Mar 08, 2022
Competition-Level Code Generation with AlphaCode (Paper Review)
Mar 02, 2022
Can Wikipedia Help Offline Reinforcement Learning? (Author Interview)
Mar 02, 2022
Can Wikipedia Help Offline Reinforcement Learning? (Paper Explained)
Mar 02, 2022
[ML Olds] Meta Research Supercluster | OpenAI GPT-Instruct | Google LaMDA | Drones fight Pigeons
Mar 02, 2022
[ML News] Uber: Deep Learning for ETA | MuZero Video Compression | Block-NeRF | EfficientNet-X
Feb 24, 2022
Listening to You! - Channel Update (Author Interviews)
Feb 22, 2022
All about AI Accelerators: GPU, TPU, Dataflow, Near-Memory, Optical, Neuromorphic & more (w/ Author)
Feb 21, 2022
CM3: A Causal Masked Multimodal Model of the Internet (Paper Explained w/ Author Interview)
Feb 21, 2022
AI against Censorship: Genetic Algorithms, The Geneva Project, ML in Security, and more!
Feb 17, 2022
HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning (w/ Author)
Feb 16, 2022
[ML News] DeepMind AlphaCode | OpenAI math prover | Meta battles harmful content with AI
Feb 16, 2022
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents (+Author)
Feb 16, 2022
OpenAI Embeddings (and Controversy?!)
Feb 16, 2022
Unsupervised Brain Models - How does Deep Learning inform Neuroscience? (w/ Patrick Mineault)
Feb 16, 2022
GPT-NeoX-20B - Open-Source huge language model by EleutherAI (Interview w/ co-founder Connor Leahy)
Feb 16, 2022
Predicting the rules behind - Deep Symbolic Regression for Recurrent Sequences (w/ author interview)
Feb 02, 2022
IT ARRIVED! YouTube sent me a package. (also: Limited Time Merch Deal)
Jan 28, 2022
[ML News] ConvNeXt: Convolutions return | China regulates algorithms | Saliency cropping examined
Jan 28, 2022
Dynamic Inference with Neural Interpreters (w/ author interview)
Jan 24, 2022
Noether Networks: Meta-Learning Useful Conserved Quantities (w/ the authors)
Jan 21, 2022
This Team won the Minecraft RL BASALT Challenge! (Paper Explanation & Interview with the authors)
Jan 20, 2022
This Team won the Minecraft RL BASALT Challenge! (Paper Explanation & Interview with the authors)
Jan 16, 2022
Full Self-Driving is HARD! Analyzing Elon Musk re: Tesla Autopilot on Lex Fridman's Podcast
Jan 07, 2022
Player of Games: All the games, one algorithm! (w/ author Martin Schmid)
Jan 05, 2022
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Jan 05, 2022
[ML News] DeepMind builds Gopher | Google builds GLaM | Suicide capsule uses AI to check access
Jan 05, 2022
Resolution-robust Large Mask Inpainting with Fourier Convolutions (w/ Author Interview)
Jan 05, 2022
[ML News] DeepMind tackles Math | Microsoft does more with less | Timnit Gebru launches DAIR
Dec 14, 2021
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion (ML Research Paper Explained)
Dec 10, 2021
[ML News] OpenAI removes GPT-3 waitlist | GauGAN2 is amazing | NYC regulates AI hiring tools
Dec 03, 2021
Sparse is Enough in Scaling Transformers (aka Terraformer) | ML Research Paper Explained
Dec 02, 2021
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning (Paper Explained)
Dec 01, 2021
Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions (Paper Explained)
Dec 01, 2021
Peer Review is still BROKEN! The NeurIPS 2021 Review Experiment (results are in)
Nov 26, 2021
Parameter Prediction for Unseen Deep Architectures (w/ First Author Boris Knyazev)
Nov 25, 2021
Learning Rate Grafting: Transferability of Optimizer Tuning (Machine Learning Research Paper Review)
Nov 22, 2021
[ML News] Cedille French Language Model | YOU Search Engine | AI Finds Profitable MEME TOKENS
Nov 22, 2021
Gradients are Not All You Need (Machine Learning Research Paper Explained)
Nov 22, 2021
[ML News] Microsoft combines Images & Text | Meta makes artificial skin | Russians replicate DALL-E
Nov 22, 2021
Autoregressive Diffusion Models (Machine Learning Research Paper Explained)
Nov 11, 2021
[ML News] Google introduces Pathways | OpenAI solves Math Problems | Meta goes First Person
Nov 11, 2021
EfficientZero: Mastering Atari Games with Limited Data (Machine Learning Research Paper Explained)
Nov 05, 2021
[YTalks] Siraj Raval - Stories about YouTube, Plagiarism, and the Dangers of Fame (Interview)
Nov 01, 2021
[ML News GERMAN] NVIDIA GTC'21 | DeepMind kauft MuJoCo | Google Lernt Spreadsheet Formeln
Nov 01, 2021
[ML News] NVIDIA GTC'21 | DeepMind buys MuJoCo | Google predicts spreadsheet formulas
Nov 01, 2021
I went to an AI Art Festival in Geneva (AiiA Festival Trip Report)
Oct 29, 2021
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models (Explained)
Oct 25, 2021
I took a Swiss train and it was awesome! Train Seat Review - SBB InterCity 1 - Geneva to St. Gallen
Oct 25, 2021
[ML News] Microsoft trains 530B model | ConvMixer model fits into single tweet | DeepMind profitable
Oct 21, 2021
[ML News] DeepMind does Nowcasting | The Guardian's shady reporting | AI finishes Beethoven's 10th
Oct 11, 2021
Grokking: Generalization beyond Overfitting on small algorithmic datasets (Paper Explained)
Oct 11, 2021
How far can we scale up? Deep Learning's Diminishing Returns (Article Review)
Oct 04, 2021
[ML News] Plagiarism Case w/ Plot Twist | CLIP for video surveillance | OpenAI summarizes books
Sep 30, 2021
Inconsistency in Conference Peer Review: Revisiting the 2014 NeurIPS Experiment (Paper Explained)
Sep 30, 2021
[ML News] New ImageNet SOTA | Uber's H3 hexagonal coordinate system | New text-image-pair dataset
Sep 28, 2021
Does GPT-3 lie? - Misinformation and fear-mongering around the TruthfulQA dataset
Sep 24, 2021
Topographic VAEs learn Equivariant Capsules (Machine Learning Research Paper Explained)
Sep 21, 2021
[ML News] Roomba Avoids Poop | Textless NLP | TikTok Algorithm Secrets | New Schmidhuber Blog
Sep 16, 2021
Celebrating 100k Subscribers! (w/ Channel Statistics)
Sep 16, 2021
[ML News] AI predicts race from X-Ray | Google kills HealthStreams | Boosting Search with MuZero
Sep 13, 2021
∞-former: Infinite Memory Transformer (aka Infty-Former / Infinity-Former, Research Paper Explained)
Sep 06, 2021
[ML News] Blind Chess AI Competition | Graph NNs for traffic | AI gift suggestions
Sep 05, 2021
ALiBi - Train Short, Test Long: Attention with linear biases enables input length extrapolation
Sep 05, 2021
[ML News] Stanford HAI coins Foundation Models & High-profile case of plagiarism uncovered
Aug 30, 2021
Fastformer: Additive Attention Can Be All You Need (Machine Learning Research Paper Explained)
Aug 27, 2021
PonderNet: Learning to Ponder (Machine Learning Research Paper Explained)
Aug 23, 2021
NeuralHash is BROKEN | How to evade Apple's detection and forge hash collisions (w/ Code)
Aug 19, 2021
[ML News] Nvidia renders CEO | Jurassic-1 larger than GPT-3 | Tortured Phrases reveal Plagiarism
Aug 19, 2021
How Apple scans your phone (and how to evade it) - NeuralHash CSAM Detection Algorithm Explained
Aug 16, 2021
[ML NEWS] Apple scans your phone | Master Faces beat face recognition | WALL-E is real
Aug 16, 2021
[ML News] AI-generated patent approved | Germany gets an analog to OpenAI | ML cheats video games
Aug 09, 2021
[ML News] MMO Game destroys GPUs | OpenAI quits Robotics | Today w/ guest host Sanyam Bhutani
Aug 09, 2021
[ML News] Facebook AI adapting robots | Baidu autonomous excavators | Happy Birthday EleutherAI
Jul 18, 2021
[ML News] GitHub Copilot - Copyright, GPL, Patents & more | Brickit LEGO app | Distill goes on break
Jul 13, 2021
Self-driving from VISION ONLY - Tesla's self-driving progress by Andrej Karpathy (Talk Analysis)
Jul 05, 2021
[ML News] CVPR bans social media paper promotion | AI restores Rembrandt | GPU prices down
Jul 05, 2021
The Dimpled Manifold Model of Adversarial Examples in Machine Learning (Research Paper Explained)
Jun 28, 2021
[ML News] Hugging Face course | GAN Theft Auto | AI Programming Puzzles | PyTorch 1.9 Released
Jun 25, 2021
XCiT: Cross-Covariance Image Transformers (Facebook AI Machine Learning Research Paper Explained)
Jun 25, 2021
AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control (Paper Explained)
Jun 22, 2021
[ML News] De-Biasing GPT-3 | RL cracks chip design | NetHack challenge | Open-Source GPT-J
Jun 22, 2021
Efficient and Modular Implicit Differentiation (Machine Learning Research Paper Explained)
Jun 15, 2021
[ML News] EU regulates AI, China trains 1.75T model, Google's oopsie, Everybody cheers for fraud.
Jun 10, 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling (Research Paper Explained)
Jun 07, 2021
[ML News] Anthropic raises $124M, ML execs clueless, collusion rings, ELIZA source discovered & more
Jun 07, 2021
Reward Is Enough (Machine Learning Research Paper Explained)
Jun 02, 2021
Expire-Span: Not All Memories are Created Equal: Learning to Forget by Expiring (Paper Explained)
May 26, 2021
FNet: Mixing Tokens with Fourier Transforms (Machine Learning Research Paper Explained)
May 24, 2021
AI made this music video | What happens when OpenAI's CLIP meets BigGAN?
May 21, 2021
DDPM - Diffusion Models Beat GANs on Image Synthesis (Machine Learning Research Paper Explained)
May 15, 2021
Involution: Inverting the Inherence of Convolution for Visual Recognition (Research Paper Explained)
May 10, 2021
MLP-Mixer: An all-MLP Architecture for Vision (Machine Learning Research Paper Explained)
May 10, 2021
Is Google Translate Sexist? Gender Stereotypes in Statistical Machine Translation
May 03, 2021
Perceiver: General Perception with Iterative Attention (Google DeepMind Research Paper Explained)
May 03, 2021
Pretrained Transformers as Universal Computation Engines (Machine Learning Research Paper Explained)
May 03, 2021
Yann LeCun - Self-Supervised Learning: The Dark Matter of Intelligence (FAIR Blog Post Explained)
May 03, 2021
Multimodal Neurons in Artificial Neural Networks (w/ OpenAI Microscope, Research Paper Explained)
May 03, 2021
Machine Learning PhD Survival Guide 2021 | Advice on Topic Selection, Papers, Conferences & more!
May 03, 2021
PAIR AI Explorables | Is the problem in the data? Examples on Fairness, Diversity, and Bias.
May 03, 2021
DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning
May 03, 2021
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis (ML Research Paper Explained)
May 02, 2021
DINO: Emerging Properties in Self-Supervised Vision Transformers (Facebook AI Research Explained)
May 02, 2021
Why AI is Harder Than We Think (Machine Learning Research Paper Explained)
May 02, 2021